View Single Post
Old 04-23-12, 08:22 AM   #15
cdufour
Registered User
 
Join Date: Jan 2007
Posts: 8
Default Re: Random crashes, NVRM Xid messages

Having similar issue here.

100% reproducible on GT520 (GF108; VideoBIOS 70.08.5c.00.00) hardware with both drivers 295.20 and 295.40 (x86_64; kernel 2.6.32, Ubuntu 10.04):
- first launch of X: OK
- whenever X is re-started (e.g. logoff) => drivers locks up (but hosts still accessible via SSH)
- must reboot to solve the issue (rmmod-ing and modprobe-ing the 'nvidia' module is not enough)

Corresponding kernel messages:
Quote:
Apr 23 14:04:11 futurix13 kernel: [ 91.667580] NVRM: Xid (0000:01:00): 31, Ch 00000000, engmask 00000101, intr 10000000
Apr 23 14:04:13 futurix13 kernel: [ 93.666384] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Apr 23 14:04:15 futurix13 kernel: [ 95.665088] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Apr 23 14:04:15 futurix13 kernel: [ 95.707483] NVRM: Xid (0000:01:00): 56, CMDre 00000000 00000088 0100cb05 00000007 00000000
Apr 23 14:04:15 futurix13 kernel: [ 95.707494] NVRM: Xid (0000:01:00): 56, CMDre 00000000 0000008c 00000000 00000005 00000008
Apr 23 14:04:18 futurix13 kernel: [ 98.758866] NVRM: Xid (0000:01:00): 56, CMDre 00000000 00000088 0100cb0b 00000007 00000000
Apr 23 14:04:18 futurix13 kernel: [ 98.758876] NVRM: Xid (0000:01:00): 56, CMDre 00000000 0000008c 00000000 00000005 00000008
Apr 23 14:04:21 futurix13 kernel: [ 101.757547] NVRM: Xid (0000:01:00): 56, CMDre 00000000 00000080 00000000 00000005 00000008
Apr 23 14:04:24 futurix13 kernel: [ 104.758433] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Apr 23 14:04:24 futurix13 kernel: [ 104.761334] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Apr 23 14:04:24 futurix13 kernel: [ 104.764200] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Apr 23 14:04:24 futurix13 kernel: [ 104.767068] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Nothing relevant shows up in /var/log/Xorg.*.log

Replacing the card by a GT520 (GF119; VideoBIOS 75.19.1b.00.01) solves the issue.
Also note we use the same driver (295.20 on 100+ hosts with other nVidia chipsets) and the problem does not occur:
10 Device 0de4 (rev a1) - ERROR
1 Device 0e22 (rev a1) - OK
10 Device 1040 (rev a1) - OK
1 Device 1200 (rev a1) - OK
1 Device 1244 (rev a1) - OK
12 G86 [Quadro NVS 290] (rev a1) - OK
40 G98 [GeForce 8400 GS] (rev a1) - OK
9 GT218 [GeForce 210] (rev a2) - OK
6 NV44 [GeForce 6200 TurboCache(TM)] (rev a1) - OK
11 NV44 [Quadro NVS 285] (rev a1) - OK

Hope that problem can be solved soon (as ten of just-acquired workstations are currently just useless)

Best,

CÚdric
cdufour is offline   Reply With Quote