View Single Post
Old 08-18-12, 04:57 AM   #26
rockob
Registered User
 
Join Date: Nov 2008
Posts: 95
Default Re: [BUG] nvidia crashes kernel with 'Xid 13' and attempted to yield the CPU while at

I tried again with kenel 3.6-rc2, and nvidia 304.37 crashed about 30 seconds in to the game. Again there were no Xid errors or "GPU has fallen off the bus" messages, but the kernel reported hung processes within the nvidia module. Eventually I had to hard reset the PC because it became completely unresponsive. Below is the kernel log for the hung processes:

Code:
Aug 18 16:45:29 sierra kernel: [69562.835272] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  304.37
Aug 18 16:46:48 sierra kernel: [69641.144875] NVRM: GPU at 0000:01:00: GPU-1b1589e9-15df-5ca5-919b-2f748fae640f
...
Aug 18 16:51:45 sierra kernel: [69938.074719] INFO: task kworker/0:3:6594 blocked for more than 120 seconds.
Aug 18 16:51:45 sierra kernel: [69938.074723] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 18 16:51:45 sierra kernel: [69938.074725] kworker/0:3     D ffff88023e613dc0     0  6594      2 0x00000000
Aug 18 16:51:45 sierra kernel: [69938.074730]  ffff8802215d7b00 0000000000000046 ffff88023117db40 ffff8802215d7fd8
Aug 18 16:51:45 sierra kernel: [69938.074735]  ffff8802215d7fd8 ffff8802215d7fd8 ffff8800a7b596d0 ffff88023117db40
Aug 18 16:51:45 sierra kernel: [69938.074739]  0000000000000000 7fffffffffffffff ffff88023117db40 ffff880219675388
Aug 18 16:51:45 sierra kernel: [69938.074743] Call Trace:
Aug 18 16:51:45 sierra kernel: [69938.074753]  [<ffffffff81680979>] schedule+0x29/0x70
Aug 18 16:51:45 sierra kernel: [69938.074757]  [<ffffffff8167ee1c>] schedule_timeout+0x1bc/0x280
Aug 18 16:51:45 sierra kernel: [69938.074761]  [<ffffffff8167fc0b>] __down_common+0xa0/0xf7
Aug 18 16:51:45 sierra kernel: [69938.074765]  [<ffffffff8167fcd5>] __down+0x1d/0x1f
Aug 18 16:51:45 sierra kernel: [69938.074770]  [<ffffffff8107fc01>] down+0x41/0x50
Aug 18 16:51:45 sierra kernel: [69938.074866]  [<ffffffffa0f4f9f2>] os_acquire_mutex+0x42/0x50 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.074947]  [<ffffffffa0f1fe25>] _nv014757rm+0x1c/0x21 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075020]  [<ffffffffa095c343>] ? _nv016374rm+0x6c/0x100 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075132]  [<ffffffffa0e1afb1>] ? _nv015315rm+0x211/0x358 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075215]  [<ffffffffa0f251a7>] ? _nv001080rm+0x298/0x97d [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075301]  [<ffffffffa0f28550>] ? rm_execute_work_item+0x4c/0xc2 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075384]  [<ffffffffa0f5043f>] ? os_execute_work_item+0x4f/0x90 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075390]  [<ffffffff810730f3>] ? process_one_work+0x143/0x500
Aug 18 16:51:45 sierra kernel: [69938.075474]  [<ffffffffa0f503f0>] ? nv_printf+0x80/0x80 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075480]  [<ffffffff810748be>] ? worker_thread+0x16e/0x480
Aug 18 16:51:45 sierra kernel: [69938.075484]  [<ffffffff81074750>] ? manage_workers.isra.21+0x2b0/0x2b0
Aug 18 16:51:45 sierra kernel: [69938.075488]  [<ffffffff81079793>] ? kthread+0x93/0xa0
Aug 18 16:51:45 sierra kernel: [69938.075493]  [<ffffffff8168ac04>] ? kernel_thread_helper+0x4/0x10
Aug 18 16:51:45 sierra kernel: [69938.075497]  [<ffffffff81079700>] ? kthread_freezable_should_stop+0x70/0x70
Aug 18 16:51:45 sierra kernel: [69938.075501]  [<ffffffff8168ac00>] ? gs_change+0x13/0x13
Aug 18 16:51:45 sierra kernel: [69938.075511] INFO: task Crysis2.exe:8829 blocked for more than 120 seconds.
Aug 18 16:51:45 sierra kernel: [69938.075513] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 18 16:51:45 sierra kernel: [69938.075515] Crysis2.exe     D ffff88023e793dc0     0  8829   8694 0x20020000
Aug 18 16:51:45 sierra kernel: [69938.075518]  ffff88022152db28 0000000000200082 ffff880234df2da0 ffff88022152dfd8
Aug 18 16:51:45 sierra kernel: [69938.075522]  ffff88022152dfd8 ffff88022152dfd8 ffff880234dc5b40 ffff880234df2da0
Aug 18 16:51:45 sierra kernel: [69938.075526]  ffff88022152db18 7fffffffffffffff ffff880234df2da0 ffff880219675388
Aug 18 16:51:45 sierra kernel: [69938.075530] Call Trace:
Aug 18 16:51:45 sierra kernel: [69938.075536]  [<ffffffff81680979>] schedule+0x29/0x70
Aug 18 16:51:45 sierra kernel: [69938.075539]  [<ffffffff8167ee1c>] schedule_timeout+0x1bc/0x280
Aug 18 16:51:45 sierra kernel: [69938.075544]  [<ffffffff8167fc0b>] __down_common+0xa0/0xf7
Aug 18 16:51:45 sierra kernel: [69938.075548]  [<ffffffff8167fcd5>] __down+0x1d/0x1f
Aug 18 16:51:45 sierra kernel: [69938.075552]  [<ffffffff8107fc01>] down+0x41/0x50
Aug 18 16:51:45 sierra kernel: [69938.075639]  [<ffffffffa0f4f9f2>] os_acquire_mutex+0x42/0x50 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075724]  [<ffffffffa0f1fe25>] _nv014757rm+0x1c/0x21 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075798]  [<ffffffffa095c343>] ? _nv016374rm+0x6c/0x100 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075868]  [<ffffffffa0952bfb>] ? _nv014649rm+0x9/0x21 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.075938]  [<ffffffffa0940ecd>] ? _nv001039rm+0xc5c/0xd59 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076005]  [<ffffffffa09410be>] ? _nv001073rm+0x73/0x2d09 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076071]  [<ffffffffa09395b8>] ? _nv000947rm+0x26/0x147 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076156]  [<ffffffffa0f1b0ed>] ? _nv001106rm+0x34d/0xaaf [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076237]  [<ffffffffa0f26ce6>] ? rm_ioctl+0x76/0x100 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076318]  [<ffffffffa0f4566d>] ? nv_kern_ioctl+0x14d/0x480 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076398]  [<ffffffffa0f459c1>] ? nv_kern_compat_ioctl+0x21/0x30 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076403]  [<ffffffff811d0051>] ? compat_sys_ioctl+0xd1/0x1330
Aug 18 16:51:45 sierra kernel: [69938.076407]  [<ffffffff8101a2f9>] ? read_tsc+0x9/0x20
Aug 18 16:51:45 sierra kernel: [69938.076412]  [<ffffffff810a57bc>] ? getnstimeofday+0x4c/0xe0
Aug 18 16:51:45 sierra kernel: [69938.076415]  [<ffffffff810a58ba>] ? do_gettimeofday+0x1a/0x50
Aug 18 16:51:45 sierra kernel: [69938.076419]  [<ffffffff810bf3f5>] ? compat_sys_time+0x25/0x70
Aug 18 16:51:45 sierra kernel: [69938.076424]  [<ffffffff8168af26>] ? sysenter_dispatch+0x7/0x21
Aug 18 16:51:45 sierra kernel: [69938.076434] INFO: task kworker/0:0:9270 blocked for more than 120 seconds.
Aug 18 16:51:45 sierra kernel: [69938.076436] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 18 16:51:45 sierra kernel: [69938.076437] kworker/0:0     D ffff88023e613dc0     0  9270      2 0x00000000
Aug 18 16:51:45 sierra kernel: [69938.076441]  ffff8801fdca7b00 0000000000000046 ffff8801f69016d0 ffff8801fdca7fd8
Aug 18 16:51:45 sierra kernel: [69938.076445]  ffff8801fdca7fd8 ffff8801fdca7fd8 ffff88023117db40 ffff8801f69016d0
Aug 18 16:51:45 sierra kernel: [69938.076449]  00000000ffffffff 7fffffffffffffff ffff8801f69016d0 ffff880219675388
Aug 18 16:51:45 sierra kernel: [69938.076452] Call Trace:
Aug 18 16:51:45 sierra kernel: [69938.076458]  [<ffffffff81680979>] schedule+0x29/0x70
Aug 18 16:51:45 sierra kernel: [69938.076462]  [<ffffffff8167ee1c>] schedule_timeout+0x1bc/0x280
Aug 18 16:51:45 sierra kernel: [69938.076466]  [<ffffffff8109015c>] ? update_curr+0xfc/0x190
Aug 18 16:51:45 sierra kernel: [69938.076469]  [<ffffffff8167fc0b>] __down_common+0xa0/0xf7
Aug 18 16:51:45 sierra kernel: [69938.076474]  [<ffffffff8167fcd5>] __down+0x1d/0x1f
Aug 18 16:51:45 sierra kernel: [69938.076478]  [<ffffffff8107fc01>] down+0x41/0x50
Aug 18 16:51:45 sierra kernel: [69938.076561]  [<ffffffffa0f4f9f2>] os_acquire_mutex+0x42/0x50 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076645]  [<ffffffffa0f1fe25>] _nv014757rm+0x1c/0x21 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076718]  [<ffffffffa095c343>] ? _nv016374rm+0x6c/0x100 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076833]  [<ffffffffa0e1afb1>] ? _nv015315rm+0x211/0x358 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.076917]  [<ffffffffa0f251a7>] ? _nv001080rm+0x298/0x97d [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077000]  [<ffffffffa0f28550>] ? rm_execute_work_item+0x4c/0xc2 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077083]  [<ffffffffa0f5043f>] ? os_execute_work_item+0x4f/0x90 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077087]  [<ffffffff810730f3>] ? process_one_work+0x143/0x500
Aug 18 16:51:45 sierra kernel: [69938.077167]  [<ffffffffa0f503f0>] ? nv_printf+0x80/0x80 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077172]  [<ffffffff810748be>] ? worker_thread+0x16e/0x480
Aug 18 16:51:45 sierra kernel: [69938.077175]  [<ffffffff81074750>] ? manage_workers.isra.21+0x2b0/0x2b0
Aug 18 16:51:45 sierra kernel: [69938.077179]  [<ffffffff81079793>] ? kthread+0x93/0xa0
Aug 18 16:51:45 sierra kernel: [69938.077184]  [<ffffffff8168ac04>] ? kernel_thread_helper+0x4/0x10
Aug 18 16:51:45 sierra kernel: [69938.077189]  [<ffffffff81079700>] ? kthread_freezable_should_stop+0x70/0x70
Aug 18 16:51:45 sierra kernel: [69938.077192]  [<ffffffff8168ac00>] ? gs_change+0x13/0x13
Aug 18 16:51:45 sierra kernel: [69938.077195] INFO: task kworker/0:4:9686 blocked for more than 120 seconds.
Aug 18 16:51:45 sierra kernel: [69938.077196] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 18 16:51:45 sierra kernel: [69938.077198] kworker/0:4     D ffff88023e613dc0     0  9686      2 0x00000000
Aug 18 16:51:45 sierra kernel: [69938.077201]  ffff880103959b00 0000000000000046 ffff8800865a16d0 ffff880103959fd8
Aug 18 16:51:45 sierra kernel: [69938.077205]  ffff880103959fd8 ffff880103959fd8 ffff8800a7b596d0 ffff8800865a16d0
Aug 18 16:51:45 sierra kernel: [69938.077208]  6db8926917d28c35 7fffffffffffffff ffff8800865a16d0 ffff880219675388
Aug 18 16:51:45 sierra kernel: [69938.077212] Call Trace:
Aug 18 16:51:45 sierra kernel: [69938.077217]  [<ffffffff81680979>] schedule+0x29/0x70
Aug 18 16:51:45 sierra kernel: [69938.077221]  [<ffffffff8167ee1c>] schedule_timeout+0x1bc/0x280
Aug 18 16:51:45 sierra kernel: [69938.077225]  [<ffffffff81327383>] ? cpumask_next_and+0x23/0x40
Aug 18 16:51:45 sierra kernel: [69938.077229]  [<ffffffff81091ee3>] ? update_sd_lb_stats+0x133/0x610
Aug 18 16:51:45 sierra kernel: [69938.077233]  [<ffffffff8167fc0b>] __down_common+0xa0/0xf7
Aug 18 16:51:45 sierra kernel: [69938.077237]  [<ffffffff8167fcd5>] __down+0x1d/0x1f
Aug 18 16:51:45 sierra kernel: [69938.077241]  [<ffffffff8107fc01>] down+0x41/0x50
Aug 18 16:51:45 sierra kernel: [69938.077320]  [<ffffffffa0f4f9f2>] os_acquire_mutex+0x42/0x50 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077401]  [<ffffffffa0f1fe25>] _nv014757rm+0x1c/0x21 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077473]  [<ffffffffa095c343>] ? _nv016374rm+0x6c/0x100 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077585]  [<ffffffffa0e1afb1>] ? _nv015315rm+0x211/0x358 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077668]  [<ffffffffa0f251a7>] ? _nv001080rm+0x298/0x97d [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077752]  [<ffffffffa0f28550>] ? rm_execute_work_item+0x4c/0xc2 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077834]  [<ffffffffa0f5043f>] ? os_execute_work_item+0x4f/0x90 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077838]  [<ffffffff810730f3>] ? process_one_work+0x143/0x500
Aug 18 16:51:45 sierra kernel: [69938.077845]  [<ffffffff8108e3af>] ? __dequeue_entity+0x2f/0x50
Aug 18 16:51:45 sierra kernel: [69938.077926]  [<ffffffffa0f503f0>] ? nv_printf+0x80/0x80 [nvidia]
Aug 18 16:51:45 sierra kernel: [69938.077936]  [<ffffffff810748be>] ? worker_thread+0x16e/0x480
Aug 18 16:51:45 sierra kernel: [69938.077939]  [<ffffffff81074750>] ? manage_workers.isra.21+0x2b0/0x2b0
Aug 18 16:51:45 sierra kernel: [69938.077944]  [<ffffffff81079793>] ? kthread+0x93/0xa0
Aug 18 16:51:45 sierra kernel: [69938.077948]  [<ffffffff8168ac04>] ? kernel_thread_helper+0x4/0x10
Aug 18 16:51:45 sierra kernel: [69938.077952]  [<ffffffff81079700>] ? kthread_freezable_should_stop+0x70/0x70
Aug 18 16:51:45 sierra kernel: [69938.077955]  [<ffffffff8168ac00>] ? gs_change+0x13/0x13
Aug 18 16:52:02 sierra kernel: [69955.149205] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
Aug 18 16:52:06 sierra kernel: [69959.163935] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
Aug 18 16:52:12 sierra kernel: [69963.693042] ------------[ cut here ]------------
rockob is offline   Reply With Quote