[Mesa-dev] gpu hung with mesa 9.1.2~git (was: Re: Thanks for mesa 9.1.2! Vibrant colors in Planeshift)

Martin Steigerwald Martin at lichtvoll.de
Sun May 19 03:59:21 PDT 2013


Hi Kenneth, Brian, everyone,

On Saturday, 11 May 2013, 21:07:55, Martin Steigerwald wrote:
> It is a complete difference. I always had the feeling that colors were
> too dark since I switched to GLSL. But I had no comparison, so I didn't
> report it, I think.
> 
> Looks marvellous now.
> 
> 
> I got some hangs, though:
> 
> [151564.311483] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> [151564.311488] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state
> [151664.323926] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> 
> [152099.395463] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> [152121.399082] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> 
> This could be quite easy to bisect, because the previous state of mesa 9.1
> git that I compiled didn't have these hangs. Right now I am using the
> mesa 9.1 branch as of 1e043ebe034b6e33b4c5228770db397b22fd26e0.
> 
> Kernel is 3.9.

OK, I have bisected this in the meantime:

martin at merkaba:~/Spielen/Mesa/mesa.git> git bisect good
9f66038b5bac7f3574a00beda850be31712f9610 is the first bad commit
commit 9f66038b5bac7f3574a00beda850be31712f9610
Author: Kenneth Graunke <kenneth at whitecape.org>
Date:   Mon Apr 22 23:13:47 2013 -0700

    mesa: Add an unpack function for ARGB2101010_UINT.
    
    v2: Remove extra parenthesis (suggested by Brian).
    
    NOTE: This is a candidate for stable branches.
    
    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=63569
    Signed-off-by: Kenneth Graunke <kenneth at whitecape.org>
    Reviewed-by: Brian Paul <brianp at vmware.com>
    (cherry picked from commit 531be501de7530ede3c0ea7a1d94516df37e4ae5)

:040000 040000 bd2449c5ebebc6f425afcd7c7f6f6de29ade8741 9693c32b87a859bab740ea88c7882206027837c8 M      src
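
In case someone wants to repeat a bisection like this, the mechanics look
roughly like the sketch below. It runs on a throwaway toy repository, not on
mesa.git: the file "state", the commit messages and the grep-based check are
all made up for illustration. A hang like this one cannot be detected by a
script that easily, so each step has to be judged by hand with
"git bisect good" or "git bisect bad", as in the session above.

```shell
#!/bin/sh
# Toy demonstration of an automated bisection with `git bisect run`.
# Everything here (repo, file names, the "hang" marker) is hypothetical.
set -eu

repo=$(mktemp -d)
cd "$repo"
git init -q .
git config user.email "bisect-demo@example.invalid"
git config user.name "bisect demo"
git config commit.gpgsign false

# Five commits; the fourth one introduces the simulated regression.
for i in 1 2 3 4 5; do
    if [ "$i" -ge 4 ]; then echo "hang" > state; else echo "ok" > state; fi
    echo "$i" > counter
    git add state counter
    git commit -q -m "commit $i"
done

# Bad revision first, then the known-good one.
git bisect start HEAD HEAD~4 > /dev/null

# The test command exits 0 for "good" and 1 for "bad".
git bisect run sh -c '! grep -q hang state' > /dev/null

# After a finished bisection, refs/bisect/bad names the first bad commit.
first_bad=$(git rev-parse refs/bisect/bad)
first_bad_subject=$(git log -1 --format=%s "$first_bad")
echo "first bad commit: $first_bad_subject"

git bisect reset > /dev/null 2>&1
```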



Do you want an fdo bug report for it? I can also add a note to the referenced
bug report.


Fortunately, I now know some versions with vibrant colors but without GPU hangs :)


merkaba:~> zgrep -A1 -i "gpu" /var/log/kern.log*  | grep drm
/var/log/kern.log.1:May 16 23:07:27 merkaba kernel: [139847.486558] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.1:May 16 23:07:27 merkaba kernel: [139847.486564] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state
/var/log/kern.log.1:May 16 23:41:07 merkaba kernel: [141867.817854] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.1:May 16 23:41:15 merkaba kernel: [141875.819167] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.2.gz:May 11 20:49:00 merkaba kernel: [151564.311483] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.2.gz:May 11 20:49:00 merkaba kernel: [151564.311488] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state
/var/log/kern.log.2.gz:May 11 20:50:40 merkaba kernel: [151664.323926] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.2.gz:May 11 20:57:55 merkaba kernel: [152099.395463] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.2.gz:May 11 20:58:17 merkaba kernel: [152121.399082] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.3.gz:May  4 23:57:14 merkaba kernel: [74694.866813] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.3.gz:May  4 23:57:14 merkaba kernel: [74694.866818] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state
/var/log/kern.log.4.gz:Apr 23 22:24:42 merkaba kernel: [28435.584298] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
/var/log/kern.log.4.gz:Apr 23 22:24:42 merkaba kernel: [28435.584303] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state
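
To get a quick per-day overview out of output like the above, something along
these lines might help. It is only a sketch: the function name
count_hangs_per_day is made up, and it assumes the classic syslog timestamp
layout ("May 16 23:07:27 ...") with an optional "file:" prefix as printed by
zgrep when it searches several files.

```shell
# Count "GPU hung" messages per calendar day from kern.log-style lines
# read on stdin. The function name is hypothetical.
count_hangs_per_day() {
    grep 'GPU hung' | awk '
        {
            sub(/^.*:/, "", $1)   # drop a leading "file:" prefix, if any
            count[$1 " " $2]++    # key is month plus day of month
        }
        END { for (day in count) print day, count[day] }
    ' | sort
}
```

It can be fed from the same search as above, for example
zgrep -i "gpu" /var/log/kern.log* | count_hangs_per_day
The result is sorted as plain text, not chronologically.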


Kernel is still:

merkaba:~> cat /proc/version
Linux version 3.9.0-tp520 (martin at merkaba) (gcc version 4.7.2 (Debian 4.7.2-5) ) #7 SMP PREEMPT Mon Apr 29 15:01:05 CEST 2013


Will try with 3.10 when rc2 is available :)

Thanks,
Martin



> 
> 
> I don't know how to report the "/sys/kernel/debug/dri/0/i915_error_state"
> contents, since I got an out-of-memory error while trying to cat it:
> 
> [151736.238380] cat: page allocation failure: order:6, mode:0x1040d0
> [151736.238387] Pid: 20726, comm: cat Tainted: G           O 3.9.0-tp520 #7
> [151736.238390] Call Trace:
> [151736.238403]  [<ffffffff810d56e0>] warn_alloc_failed+0x10a/0x11e
> [151736.238411]  [<ffffffff81433670>] ? _cond_resched+0x9/0x1d
> [151736.238415]  [<ffffffff8142e05c>] ? __alloc_pages_direct_compact+0x1ae/0x1c0
> [151736.238419]  [<ffffffff810d84c2>] __alloc_pages_nodemask+0x666/0x709
> [151736.238426]  [<ffffffff81107a2f>] alloc_pages_current+0xc0/0xdd
> [151736.238431]  [<ffffffff810d49de>] __get_free_pages+0x9/0x36
> [151736.238435]  [<ffffffff8110dba2>] kmalloc_order_trace+0x26/0x58
> [151736.238438]  [<ffffffff8110f648>] __kmalloc+0x30/0xe5
> [151736.238442]  [<ffffffff8113534a>] seq_read+0x1d7/0x338
> [151736.238448]  [<ffffffff8111b0e4>] vfs_read+0x98/0xfa
> [151736.238453]  [<ffffffff814345b5>] ? _raw_spin_lock+0x1b/0x1f
> [151736.238456]  [<ffffffff8111b25d>] sys_read+0x4d/0x7a
> [151736.238461]  [<ffffffff81439169>] system_call_fastpath+0x16/0x1b
> [151736.238463] Mem-Info:
> [151736.238465] Node 0 DMA per-cpu:
> [151736.238468] CPU    0: hi:    0, btch:   1 usd:   0
> [151736.238471] CPU    1: hi:    0, btch:   1 usd:   0
> [151736.238473] CPU    2: hi:    0, btch:   1 usd:   0
> [151736.238475] CPU    3: hi:    0, btch:   1 usd:   0
> [151736.238476] Node 0 DMA32 per-cpu:
> [151736.238480] CPU    0: hi:  186, btch:  31 usd:   0
> [151736.238482] CPU    1: hi:  186, btch:  31 usd:   0
> [151736.238484] CPU    2: hi:  186, btch:  31 usd:   0
> [151736.238486] CPU    3: hi:  186, btch:  31 usd:   0
> [151736.238488] Node 0 Normal per-cpu:
> [151736.238491] CPU    0: hi:  186, btch:  31 usd:   0
> [151736.238493] CPU    1: hi:  186, btch:  31 usd: 146
> [151736.238495] CPU    2: hi:  186, btch:  31 usd:   0
> [151736.238497] CPU    3: hi:  186, btch:  31 usd:   0
> [151736.238502] active_anon:1000677 inactive_anon:278666 isolated_anon:0
> [151736.238502]  active_file:208660 inactive_file:178845 isolated_file:0
> [151736.238502]  unevictable:2281 dirty:147 writeback:0 unstable:0
> [151736.238502]  free:112943 slab_reclaimable:161313 slab_unreclaimable:16170
> [151736.238502]  mapped:69566 shmem:264437 pagetables:13564 bounce:0
> [151736.238502]  free_cma:0
> [151736.238508] Node 0 DMA free:15344kB min:384kB low:480kB high:576kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15984kB managed:15360kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB slab_unreclaimable:16kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes
> [151736.238516] lowmem_reserve[]: 0 3284 7738 7738
> [151736.238521] Node 0 DMA32 free:257484kB min:84712kB low:105888kB high:127068kB active_anon:1306392kB inactive_anon:380744kB active_file:546496kB inactive_file:546256kB unevictable:4kB isolated(anon):0kB isolated(file):0kB present:3450496kB managed:3363080kB mlocked:4kB dirty:92kB writeback:0kB mapped:121976kB shmem:266380kB slab_reclaimable:315548kB slab_unreclaimable:9084kB kernel_stack:328kB pagetables:5852kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
> [151736.238529] lowmem_reserve[]: 0 0 4454 4454
> [151736.238532] Node 0 Normal free:178944kB min:114896kB low:143620kB high:172344kB active_anon:2696316kB inactive_anon:733920kB active_file:288144kB inactive_file:169124kB unevictable:9120kB isolated(anon):0kB isolated(file):0kB present:4691968kB managed:4561268kB mlocked:9120kB dirty:496kB writeback:0kB mapped:156288kB shmem:791368kB slab_reclaimable:329704kB slab_unreclaimable:55580kB kernel_stack:4208kB pagetables:48404kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
> [151736.238540] lowmem_reserve[]: 0 0 0 0
> [151736.238544] Node 0 DMA: 0*4kB 0*8kB 1*16kB (U) 1*32kB (U) 1*64kB (U) 1*128kB (U) 1*256kB (U) 1*512kB (U) 0*1024kB 1*2048kB (R) 3*4096kB (M) = 15344kB
> [151736.238562] Node 0 DMA32: 11494*4kB (UEM) 4568*8kB (UEM) 2031*16kB (UEM) 2712*32kB (UEM) 864*64kB (UEM) 6*128kB (EM) 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 257864kB
> [151736.238577] Node 0 Normal: 613*4kB (UEM) 6804*8kB (UEM) 4952*16kB (UEM) 948*32kB (UEM) 119*64kB (UEM) 31*128kB (UEM) 4*256kB (E) 0*512kB 0*1024kB 0*2048kB 0*4096kB = 179060kB
> [151736.238592] 688704 total pagecache pages
> [151736.238595] 35955 pages in swap cache
> [151736.238597] Swap cache stats: add 1327562, delete 1291607, find 615851/684662
> [151736.238599] Free swap  = 11972088kB
> [151736.238601] Total swap = 12582908kB
> [151736.272554] 2057727 pages RAM
> [151736.272558] 69389 pages reserved
> [151736.272560] 1690491 pages shared
> [151736.272561] 1414582 pages non-shared
> 
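
For reading the first line of that trace: "order:6" means the kernel asked for
2^6 physically contiguous pages in one piece. A small sketch of the arithmetic,
assuming 4 kB pages (which matches the smallest block size in the per-zone
lists above); the helper name order_to_kb is made up:

```shell
# Size in kB of a page allocation of the given order.
# The second argument is the page size in kB; 4 is an assumption
# that fits the "4kB" blocks in the zone lists of the trace.
order_to_kb() {
    order=$1
    page_kb=${2:-4}
    echo $(( (1 << order) * page_kb ))
}

order_to_kb 6    # prints 256
```

So the failed request was for a single contiguous 256 kB block, which is much
harder to satisfy than the same amount of memory in scattered 4 kB pages.
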
> Thanks,
> 
-- 
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA  B82F 991B EAAC A599 84C7

