View Single Post
Old 06-21-12, 11:09 PM   #56
rockob
Registered User
 
Join Date: Nov 2008
Posts: 95
Default Re: Random crashes, NVRM Xid messages

Yes, I still get this crash on 302.17 as well:
Code:
Jun 20 16:03:59 sierra kernel: [ 2026.426817] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  302.17  Tue Jun 12 16:03:22 PDT 2012
Jun 20 16:08:14 sierra kernel: [ 2280.965226] NVRM: Xid (0000:01:00): 13, 0006 00000000 00009197 00002380 00004100 00000000
Jun 20 16:08:16 sierra kernel: [ 2282.968756] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Jun 20 16:08:18 sierra kernel: [ 2284.978743] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Jun 20 16:08:22 sierra kernel: [ 2288.977479] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Jun 20 16:08:41 sierra kernel: [ 2308.048400] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
In case it helps, it also logs a kernel warning for the stalled process, with stack references to the nvidia module:

Code:
Jun 20 16:09:49 sierra kernel: [ 2375.565364] INFO: rcu_sched self-detected stall on CPU { 5}  (t=60000 jiffies)
Jun 20 16:09:49 sierra kernel: [ 2375.565367] Pid: 7890, comm: Crysis2 Tainted: P           O 3.5.0-rc3-git-20120618.0713 #3
Jun 20 16:09:49 sierra kernel: [ 2375.565369] Call Trace:
Jun 20 16:09:49 sierra kernel: [ 2375.565375]  <IRQ>  [<ffffffff810df10a>] __rcu_pending+0x19a/0x4d0
Jun 20 16:09:49 sierra kernel: [ 2375.565377]  [<ffffffff810df46c>] rcu_pending+0x2c/0x60
Jun 20 16:09:49 sierra kernel: [ 2375.565380]  [<ffffffff810dfc51>] rcu_check_callbacks+0xa1/0x140
Jun 20 16:09:49 sierra kernel: [ 2375.565383]  [<ffffffff8105c7d8>] update_process_times+0x48/0x90
Jun 20 16:09:49 sierra kernel: [ 2375.565386]  [<ffffffff810a1286>] tick_sched_timer+0x66/0xc0
Jun 20 16:09:49 sierra kernel: [ 2375.565388]  [<ffffffff810725b9>] __run_hrtimer+0x79/0x1d0
Jun 20 16:09:49 sierra kernel: [ 2375.565390]  [<ffffffff810a1220>] ? tick_nohz_handler+0xf0/0xf0
Jun 20 16:09:49 sierra kernel: [ 2375.565392]  [<ffffffff81072e87>] hrtimer_interrupt+0xd7/0x200
Jun 20 16:09:49 sierra kernel: [ 2375.565395]  [<ffffffff8161efdc>] ? call_softirq+0x1c/0x30
Jun 20 16:09:49 sierra kernel: [ 2375.565397]  [<ffffffff8161f919>] smp_apic_timer_interrupt+0x69/0x99
Jun 20 16:09:49 sierra kernel: [ 2375.565399]  [<ffffffff8161e68a>] apic_timer_interrupt+0x6a/0x70
Jun 20 16:09:49 sierra kernel: [ 2375.565472]  <EOI>  [<ffffffffa18826db>] ? os_free_mem+0x1b/0x30 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.565512]  [<ffffffffa12b7633>] ? _nv014435rm+0x86/0x180 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.565551]  [<ffffffffa12b7633>] ? _nv014435rm+0x86/0x180 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.565655]  [<ffffffffa161565f>] ? _nv009674rm+0x14/0xa1 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.565757]  [<ffffffffa1636091>] ? _nv003990rm+0x4653/0xa790 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.565810]  [<ffffffffa1311ca1>] ? _nv002291rm+0x2d2/0x2e3 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.565861]  [<ffffffffa1311ea1>] ? _nv002004rm+0x1ef/0x205 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.565946]  [<ffffffffa1510de1>] ? _nv005836rm+0x6d0/0x701 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566029]  [<ffffffffa1513a5d>] ? _nv005952rm+0xba/0x60e [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566111]  [<ffffffffa1513a18>] ? _nv005952rm+0x75/0x60e [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566212]  [<ffffffffa15f5cc0>] ? _nv008579rm+0x108/0x179 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566311]  [<ffffffffa160ae5f>] ? _nv008740rm+0x11a/0x336 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566365]  [<ffffffffa131c4ea>] ? _nv002370rm+0x1b/0x20 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566418]  [<ffffffffa131f0d9>] ? _nv002345rm+0x265/0x292 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566471]  [<ffffffffa131ef02>] ? _nv002345rm+0x8e/0x292 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566571]  [<ffffffffa1608805>] ? _nv008014rm+0x2d1/0x3bc [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566670]  [<ffffffffa1608d41>] ? _nv008016rm+0x6d/0x90 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566769]  [<ffffffffa160005a>] ? _nv008075rm+0x12c/0x501 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566807]  [<ffffffffa12a7e5d>] ? _nv001063rm+0x1ccb/0x2afb [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566846]  [<ffffffffa12a60db>] ? _nv001029rm+0xc6a/0xca0 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566883]  [<ffffffffa12a617a>] ? _nv016149rm+0xe/0x26 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566921]  [<ffffffffa12a668c>] ? _nv001063rm+0x4fa/0x2afb [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566959]  [<ffffffffa12a60db>] ? _nv001029rm+0xc6a/0xca0 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.566996]  [<ffffffffa12a617a>] ? _nv016149rm+0xe/0x26 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.567034]  [<ffffffffa12a6411>] ? _nv001063rm+0x27f/0x2afb [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.567071]  [<ffffffffa12a60db>] ? _nv001029rm+0xc6a/0xca0 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.567109]  [<ffffffffa12a614e>] ? _nv016151rm+0x3d/0x5b [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.567157]  [<ffffffffa1858457>] ? _nv001072rm+0xdf/0x1c3 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.567205]  [<ffffffffa185a8e9>] ? rm_free_unused_clients+0x98/0x120 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.567207]  [<ffffffff810744e2>] ? up+0x32/0x50
Jun 20 16:09:49 sierra kernel: [ 2375.567253]  [<ffffffffa1879a4d>] ? nv_kern_ctl_close+0x7d/0x130 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.567300]  [<ffffffffa187a9a3>] ? nv_kern_close+0x3b3/0x440 [nvidia]
Jun 20 16:09:49 sierra kernel: [ 2375.567302]  [<ffffffff8118adf6>] ? d_kill+0xe6/0x140
Jun 20 16:09:49 sierra kernel: [ 2375.567304]  [<ffffffff81176b58>] ? fput+0x118/0x260
Jun 20 16:09:49 sierra kernel: [ 2375.567307]  [<ffffffff811729e6>] ? filp_close+0x66/0xa0
Jun 20 16:09:49 sierra kernel: [ 2375.567309]  [<ffffffff8104fa65>] ? put_files_struct+0xa5/0x100
Jun 20 16:09:49 sierra kernel: [ 2375.567311]  [<ffffffff8104fb85>] ? exit_files+0x55/0x70
Jun 20 16:09:49 sierra kernel: [ 2375.567312]  [<ffffffff81050063>] ? do_exit+0x183/0x8e0
Jun 20 16:09:49 sierra kernel: [ 2375.567314]  [<ffffffff81050b1f>] ? do_group_exit+0x3f/0xa0
Jun 20 16:09:49 sierra kernel: [ 2375.567316]  [<ffffffff81060549>] ? get_signal_to_deliver+0x1a9/0x5c0
Jun 20 16:09:49 sierra kernel: [ 2375.567319]  [<ffffffff810132bf>] ? do_signal+0x3f/0x610
Jun 20 16:09:49 sierra kernel: [ 2375.567322]  [<ffffffff81189b8d>] ? d_free+0x5d/0x70
Jun 20 16:09:49 sierra kernel: [ 2375.567324]  [<ffffffff81076a3a>] ? lg_local_unlock+0x1a/0x20
Jun 20 16:09:49 sierra kernel: [ 2375.567326]  [<ffffffff81192f16>] ? mntput_no_expire+0x46/0x160
Jun 20 16:09:49 sierra kernel: [ 2375.567328]  [<ffffffff8101391d>] ? do_notify_resume+0x6d/0xb0
Jun 20 16:09:49 sierra kernel: [ 2375.567330]  [<ffffffff8161dea2>] ? int_signal+0x12/0x17
rockob is offline   Reply With Quote