Old 04-16-12, 07:32 PM   #13
cheechr1
Registered User
 
Join Date: Nov 2010
Posts: 94
Default Re: Random crashes, NVRM Xid messages

Thank you, NVIDIA dev team.
Old 04-17-12, 02:46 PM   #14
muadib25
Registered User
 
Join Date: Apr 2012
Location: Sweden
Posts: 10
Default Re: Random crashes, NVRM Xid messages

I've been having the same issue for some months now, whenever I run 3D apps like Flightgear. I just installed Nvidia-driver 295.40 but the bug persists. This is what I get from dmesg:
Code:
[ 3517.677021] NVRM: Xid (0000:02:00): 8, Channel 00000003
[ 3519.674699] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
The system freezes for 15 seconds, runs for 10 seconds, then freezes again!

I saw the 'internal Bug ID: 970252'. Is there a place where we can post logs and details, or should I post them here?

Thanks in advance!
/Muadib25

Hardware: i7 920, 12 GB RAM, NVIDIA GTX 260, RME Hammerfall HDSP9652.
Software: Gentoo Linux, kernel 3.2.1-r2, nvidia-drivers 295.40.
Old 04-23-12, 08:22 AM   #15
cdufour
Registered User
 
Join Date: Jan 2007
Posts: 8
Default Re: Random crashes, NVRM Xid messages

Having a similar issue here.

100% reproducible on GT520 (GF108; VideoBIOS 70.08.5c.00.00) hardware with both drivers 295.20 and 295.40 (x86_64; kernel 2.6.32, Ubuntu 10.04):
- first launch of X: OK
- whenever X is restarted (e.g. on logoff) => the driver locks up (but the host is still accessible via SSH)
- must reboot to solve the issue (rmmod-ing and modprobe-ing the 'nvidia' module is not enough)

Corresponding kernel messages:
Quote:
Apr 23 14:04:11 futurix13 kernel: [ 91.667580] NVRM: Xid (0000:01:00): 31, Ch 00000000, engmask 00000101, intr 10000000
Apr 23 14:04:13 futurix13 kernel: [ 93.666384] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Apr 23 14:04:15 futurix13 kernel: [ 95.665088] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Apr 23 14:04:15 futurix13 kernel: [ 95.707483] NVRM: Xid (0000:01:00): 56, CMDre 00000000 00000088 0100cb05 00000007 00000000
Apr 23 14:04:15 futurix13 kernel: [ 95.707494] NVRM: Xid (0000:01:00): 56, CMDre 00000000 0000008c 00000000 00000005 00000008
Apr 23 14:04:18 futurix13 kernel: [ 98.758866] NVRM: Xid (0000:01:00): 56, CMDre 00000000 00000088 0100cb0b 00000007 00000000
Apr 23 14:04:18 futurix13 kernel: [ 98.758876] NVRM: Xid (0000:01:00): 56, CMDre 00000000 0000008c 00000000 00000005 00000008
Apr 23 14:04:21 futurix13 kernel: [ 101.757547] NVRM: Xid (0000:01:00): 56, CMDre 00000000 00000080 00000000 00000005 00000008
Apr 23 14:04:24 futurix13 kernel: [ 104.758433] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Apr 23 14:04:24 futurix13 kernel: [ 104.761334] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Apr 23 14:04:24 futurix13 kernel: [ 104.764200] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Apr 23 14:04:24 futurix13 kernel: [ 104.767068] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Nothing relevant shows up in /var/log/Xorg.*.log
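
For anyone comparing logs across several hosts, here is a minimal Python sketch that tallies Xid events like the ones quoted above by PCI address and Xid code. The regular expression and the function name tally_xids are assumptions based only on the log lines shown in this thread; other driver versions may format these messages differently.

```python
import re
from collections import Counter

# Matches the "NVRM: Xid (<PCI address>): <code>, <details>" part of a kernel
# log line. The pattern is an assumption derived from the lines in this thread.
XID_RE = re.compile(
    r"NVRM: Xid \((?P<pci>[0-9a-fA-F:.]+)\): (?P<code>\d+), (?P<detail>.*)"
)


def tally_xids(lines):
    """Count Xid events per (PCI address, Xid code) pair."""
    counts = Counter()
    for line in lines:
        match = XID_RE.search(line)
        if match:
            counts[(match.group("pci"), int(match.group("code")))] += 1
    return counts


# Sample lines copied from the posts in this thread.
sample = [
    "Apr 23 14:04:11 futurix13 kernel: [ 91.667580] NVRM: Xid (0000:01:00): 31, Ch 00000000, engmask 00000101, intr 10000000",
    "Apr 23 14:04:13 futurix13 kernel: [ 93.666384] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context",
    "Apr 23 14:04:15 futurix13 kernel: [ 95.707483] NVRM: Xid (0000:01:00): 56, CMDre 00000000 00000088 0100cb05 00000007 00000000",
    "[ 3517.677021] NVRM: Xid (0000:02:00): 8, Channel 00000003",
]

for (pci, code), count in sorted(tally_xids(sample).items()):
    print(f"{pci} Xid {code}: {count}")
```

The same function can be fed the lines of a saved dmesg or syslog file; lines without an Xid message (such as the os_schedule warnings) are simply skipped.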

Replacing the card with a GT520 (GF119; VideoBIOS 75.19.1b.00.01) solves the issue.
Also note that we use the same driver (295.20) on 100+ hosts with other nVidia chipsets, and the problem does not occur there:
10 Device 0de4 (rev a1) - ERROR
1 Device 0e22 (rev a1) - OK
10 Device 1040 (rev a1) - OK
1 Device 1200 (rev a1) - OK
1 Device 1244 (rev a1) - OK
12 G86 [Quadro NVS 290] (rev a1) - OK
40 G98 [GeForce 8400 GS] (rev a1) - OK
9 GT218 [GeForce 210] (rev a2) - OK
6 NV44 [GeForce 6200 TurboCache(TM)] (rev a1) - OK
11 NV44 [Quadro NVS 285] (rev a1) - OK

I hope the problem can be solved soon, as ten just-acquired workstations are currently useless.

Best,

Cédric
Old 04-23-12, 08:45 AM   #16
sandipt
NVIDIA Corporation
 
Join Date: Dec 2010
Posts: 260
Default Re: Random crashes, NVRM Xid messages

cdufour, please provide as much information as you can to help reproduce the issue:
- Attach an NVIDIA bug report, generated by running the nvidia-bug-report.sh script as root.
- Which desktop environment are you using (KDE, GNOME, Unity, etc.)?
- Which window manager are you using (Compiz, gnome-shell, KWin, Unity, etc.)?
- Is the issue specific to a GPU, system hardware, OS, software component, etc.?
- Are you using a display manager (gdm, kdm, etc.)?
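
Some of the session details asked for above can be read from environment variables. The following is only a rough Python sketch and does not replace nvidia-bug-report.sh; the helper name describe_session is made up for illustration, and which of these variables are actually set depends on the distribution, release, and display manager.

```python
import os

# Environment variables commonly set by display managers and desktop sessions.
# Assumption: not every system sets all of them, and older releases may set none.
SESSION_VARS = (
    "XDG_CURRENT_DESKTOP",
    "DESKTOP_SESSION",
    "GDMSESSION",
    "XDG_SESSION_TYPE",
)


def describe_session(env=None):
    """Return the session-related variables, marking missing ones explicitly."""
    env = os.environ if env is None else env
    return {name: env.get(name, "(not set)") for name in SESSION_VARS}


if __name__ == "__main__":
    for name, value in describe_session().items():
        print(f"{name}={value}")
```

Run it from a terminal inside the affected desktop session, since a shell opened over SSH usually does not inherit these variables.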
Old 04-24-12, 03:10 AM   #17
cdufour
Registered User
 
Join Date: Jan 2007
Posts: 8
Default Re: Random crashes, NVRM Xid messages

Hello,

Thanks for coming back to us.

Quote:
Originally Posted by sandipt View Post
cdufour, please provide as much information as you can to help reproduce the issue:
- Attach an NVIDIA bug report, generated by running the nvidia-bug-report.sh script as root.
See the attached bug report.
Quote:
Originally Posted by sandipt View Post
- Which desktop environment are you using (KDE, GNOME, Unity, etc.)?
GNOME
Quote:
Originally Posted by sandipt View Post
- Which window manager are you using (Compiz, gnome-shell, KWin, Unity, etc.)?
Metacity (compositing disabled in xorg.conf because of issues with some software we use, e.g. MATLAB)
Quote:
Originally Posted by sandipt View Post
- Is the issue specific to a GPU, system hardware, OS, software component, etc.?
So far, GPU-specific (GT520, GF108, VideoBIOS 70.08.5c.00.00). We have no problem on *exactly* the same platform (same MB, CPU, RAM, BIOS version, etc.) with the first version of the same GPU (GT520, GF119, VideoBIOS 75.19.1b.00.01), nor on 80+ other platforms (other MB, CPU, RAM *and* GPU) that have been running the same driver (295.20) for several weeks. PS: the problematic host has been upgraded to 295.40, as shown in the bug report, for the sake of thoroughness, but the problem was identical with 295.20.
Quote:
Originally Posted by sandipt View Post
- Are you using a display manager (gdm, kdm, etc.)?
GDM

(PS: we run Ubuntu 10.04 64-bit in an enterprise (academic, actually) environment. All hosts are configured identically, with minimal changes to stock Ubuntu settings. The workstations are also part of a computation grid, which makes any reboot a hassle because it kills running computation jobs.)

Best,

Cédric
Attached Files
File Type: gz nvidia-bug-report.log.gz (55.5 KB, 76 views)
Old 04-24-12, 03:18 AM   #18
cdufour
Registered User
 
Join Date: Jan 2007
Posts: 8
Default Re: Random crashes, NVRM Xid messages

Afterthought: the nVidia driver is installed using a backported/updated Ubuntu package (see the attached "howto"). I will perform a standard install using the nVidia installer and report back ASAP on whether it changes anything (though I doubt it).
Attached Files
File Type: gz nvidia-ubuntu-package-howto.txt.gz (817 Bytes, 82 views)
Old 04-24-12, 04:06 AM   #19
cdufour
Registered User
 
Join Date: Jan 2007
Posts: 8
Default Re: Random crashes, NVRM Xid messages

Quote:
Originally Posted by cdufour View Post
Afterthought: the nVidia driver is installed using a backported/updated Ubuntu package (see the attached "howto"). I will perform a standard install using the nVidia installer and report back ASAP on whether it changes anything (though I doubt it).
The problem remains after running the stock installer (./NVIDIA-Linux-x86_64-295.40.run).
Old 04-24-12, 12:45 PM   #20
cdufour
Registered User
 
Join Date: Jan 2007
Posts: 8
Default Re: Random crashes, NVRM Xid messages

Quote:
Originally Posted by sandipt View Post
NVIDIA internal Bug ID: 970252 to track this issue.
Is there a way to follow the progress made on this issue?
Or will you report here when a fix is released?
Best,
Cédric

Old 04-28-12, 05:34 AM   #21
Iesos
Registered User
 
Join Date: Apr 2012
Posts: 15
Default Re: Random crashes, NVRM Xid messages

Quote:
Originally Posted by sandipt View Post
NVIDIA internal Bug ID: 970252 to track this issue.
Thank you.

Please let me know if I can provide any additional information.

For now I can tell you the following:

I also tried running StarCraft II in Windows 7 on the laptop, and the computer crashed there in a similar way. I tried a few versions of the NVIDIA driver to see if I could escape the crash, and the beta from the 300 series seems to have a workaround. With the 200-series driver I tried, the computer freezes and the speakers emit a noise. With the 300 series, in about 4 hours of gameplay there was one incident where the game froze for less than a second and emitted the sound, as if it had crashed, but it recovered and was able to continue. (But since the crashes are sort of random, it is hard to tell whether it is actually fixed.)

Windows also showed me a "The graphics driver has stopped responding and been reloaded" message at one crash, and allowed me to submit a crash report of some kind, connected to some Windows troubleshooting tool.

So there might be a fix in the Win 7 driver. Hope this helps.
Old 04-29-12, 06:27 PM   #22
Iesos
Registered User
 
Join Date: Apr 2012
Posts: 15
Default Re: Random crashes, NVRM Xid messages

Strike that... apparently there is only a partial workaround in the 300 series. StarCraft does crash, but quite gracefully. The screen locks for a few seconds, then the mouse is released and only StarCraft stops responding. The computer then won't shut down properly, seemingly because of unresponsive processes.
Old 04-30-12, 03:12 PM   #23
muadib25
Registered User
 
Join Date: Apr 2012
Location: Sweden
Posts: 10
Default Re: Random crashes, NVRM Xid messages

Following advice I found on another forum, I changed the CMOS battery today and, to my surprise, things got better.
Now the screen freezes occur at longer intervals (every 3 minutes), and they last only 2-4 seconds.

"dmesg | tail" shows the same two lines every time this happens:
Quote:
[ 1072.401066] NVRM: Xid (0000:02:00): 8, Channel 00000003
[ 1074.398747] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
I hope this helps a bit...
Old 05-01-12, 11:18 AM   #24
alito
Registered User
 
Join Date: Jan 2005
Posts: 4
Default Re: Random crashes, NVRM Xid messages

I was having the same issues: screen corruption and random lockups (a lockup, release after 3 seconds, then a total lockup, then a black screen after another 20 seconds or so), happening every couple of hours. I couldn't figure out what combination of software was causing it (kernels 3.2.11 to 3.3.4; nvidia 295.20, 295.33 and 295.40). This is on a GTS 450.

Going back to glibc-2.14.1 (from glibc 2.15) and nvidia 295.20, while removing one of my two monitors and disabling TwinView, seems to have done the trick, although it's only been one day. Just one data point for something to try.

(For those who are not aware: downgrading glibc is very risky, so don't try it unless you know what the issues are.)