Go Back   nV News Forums > Linux Support Forums > NVIDIA Linux

Newegg Daily Deals

Reply
 
Thread Tools
Old 01-11-10, 02:20 PM   #1
ant
A GeForced Ant
 
ant's Avatar
 
Join Date: Sep 2002
Location: An Ant Farm
Posts: 348
Question High CPU usages and sometimes hard crashes.

Hello,

The last few weeks, I noticed my old Linux/Debian box (2.6.30) keeps getting random and rare high CPU due to Xorg and sometimes crashes. My box, even via SSH, felt slow. I checked the processes and saw:

$ w
11:53:37 up 6 days, 4:19, 3 users, load average: 6.26, 6.04, 6.19
USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT
ant tty1 Wed03 5days 9.79s 0.00s /bin/bash /usr/bin/start
ant pts/3 [deleted IP addy]10:37 0.00s 0.10s 0.00s w
ant pts/4 foobar:S.0 05Jan10 10:30 16.00s 16.00s BitchX Ant...

$ top
top - 11:55:08 up 6 days, 4:20, 3 users, load average: 6.13, 5.91, 6.12
Tasks: 132 total, 3 running, 129 sleeping, 0 stopped, 0 zombie
Cpu0 : 6.9%us, 2.4%sy, 1.3%ni, 88.1%id, 0.6%wa, 0.1%hi, 0.6%si, 0.0%st
Mem: 2594748k total, 2168336k used, 426412k free, 64348k buffers
Swap: 2361512k total, 6452k used, 2355060k free, 1847020k cached

PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
15529 root 20 0 101m 73m 2992 R 99.7 2.9 225:08.76 Xorg
20840 ant 20 0 2468 1180 892 R 0.2 0.0 0:00.01 top
1 root 20 0 2036 348 324 S 0.0 0.0 0:02.62 init
...

I tried to kill startx and Xorg processes, and my box froze (still pingable, remote SSH2 connection frozen but not connectable, and IRC connections lost). I have tried recompling the latest stable NVIDIA driver for GeForce FX 5200 (AGP), redoing my /etc/X11/xorg.conf with NVIDIA's script help, disabling Compiz, etc.

I checked logs. In /var/log/X11, I saw a bunch of:
(EE) NVIDIA(0): Error recovery failed.
(EE) NVIDIA(0): *** Aborting ***
(II) NVIDIA(0): Initialized AGP GART.

This sounds bad? What does that mean? End of dmesg showed these lines:
...
[72619.360521] NVRM: loading NVIDIA UNIX x86 Kernel Module 173.14.22 Sun Nov 8 20:26:31 PST 2009
...
[72833.815914] NVRM: loading NVIDIA UNIX x86 Kernel Module 173.14.22 Sun Nov 8 20:26:31 PST 2009
[72833.947202] agpgart-amd64 0000:00:00.0: AGP 3.5 bridge
[72833.947218] agpgart-amd64 0000:00:00.0: putting AGP V3 device into 8x mode
[72833.947284] nvidia 0000:01:00.0: putting AGP V3 device into 8x mode
...
[99432.775115] NVRM: Xid (0001:00): 6, PE0002 06bc 3f800000 0008fd14 00000000 3f800000
[99469.794940] NVRM: Xid (0001:00): 6, PE0002 06bc 3f800000 0008fd14 00000000 3f800000
[99469.836150] NVRM: Xid (0001:00): 7, Ch 00000002 M 00000a64 D 00000000 intr 00010000
[224756.205022] NVRM: Xid (0001:00): 6, PE0002 06bc 3f800000 0008fd14 00000000 3f800000
[224756.251066] NVRM: Xid (0001:00): 7, Ch 00000002 M 0000069c D 471229dd intr 00010000
[225085.201829] NVRM: Xid (0001:00): 6, PE0002 0000 40000000 0010a7bc c0000000 3f800000
[225085.246217] NVRM: Xid (0001:00): 7, Ch 00000002 M 00001d7c D ffff0000 intr 00010000
...
[526347.572029] NVRM: Xid (0001:00): 8, Channel 00000000

I posted more complete and other logs at, including sensors -f: http://pastie.org/774029 and http://pastebin.ca/1747442 (whole dmesg). My old Debian machine specifications can be found in http://alpha.zimage.com/~ant/antfarm.../computers.txt (Secondary/Backup Computer section).

Any ideas? I do keep my Debian updated daily with apt-get update and upgrade commands. I do not recall any recent X changes.

Thank you in advance.
__________________
Ant @ The Ant Farm (http://antfarm.ma.cx) and Ant's Quality Foraged Links (http://aqfl.net).
ant is offline   Reply With Quote
Old 01-18-10, 02:01 AM   #2
ant
A GeForced Ant
 
ant's Avatar
 
Join Date: Sep 2002
Location: An Ant Farm
Posts: 348
Post Crashed again!

I got another one again. I tried disabling AMD's Cool'n'Quiet in BIOS and powernow in Debian/Linux. They did not fix it. I noticed a pattern that I didn't mentioned before. If I am using the computer and the issue comes up, it shows a screen blink and then the CPU goes up and X stops responding. Some logs bits:

dmesg:
...
[526179.772020] NVRM: Xid (0001:00): 8, Channel 00000000
[526187.923984] Clocksource tsc unstable (delta = 4686847433 ns)
[526195.932026] NVRM: Xid (0001:00): 8, Channel 00000000
[526207.944026] NVRM: Xid (0001:00): 8, Channel 00000000
[526219.956030] NVRM: Xid (0001:00): 8, Channel 00000000
[526231.972025] NVRM: Xid (0001:00): 8, Channel 00000000
[526243.984030] NVRM: Xid (0001:00): 8, Channel 00000000
[526255.996025] NVRM: Xid (0001:00): 8, Channel 00000000
[526268.008028] NVRM: Xid (0001:00): 8, Channel 00000000

$ sensors -f
k8temp-pci-00c3
Adapter: PCI adapter
Core0 Temp: +134.6F

GKrellM showed frozen state with (had to write them down by hand):
Vcor1 = 1.50
+3.3V = 3.33
+12V = 11.3
-12V = 2.11
-5V = 5.10
V5SB = 5.54
VBat = 3.17

I was able to use Terminal very slowly via an existing SSH2 connection:
$ sensors -f
w83697hf-isa-0290
Adapter: ISA adapter
in0: +1.50 V (min = +0.19 V, max = +0.13 V) ALARM
in2: +3.33 V (min = +0.43 V, max = +0.02 V) ALARM
in3: +3.01 V (min = +0.02 V, max = +0.13 V) ALARM
in4: +2.96 V (min = +0.06 V, max = +2.86 V) ALARM
in5: +3.30 V (min = +0.06 V, max = +2.24 V) ALARM
in6: +4.08 V (min = +2.56 V, max = +0.00 V) ALARM
in7: +3.30 V (min = +0.08 V, max = +0.03 V) ALARM
in8: +3.17 V (min = +0.00 V, max = +1.28 V) ALARM
fan1: 0 RPM (min = 73 RPM, div = 128) ALARM
fan2: 2410 RPM (min = 2109 RPM, div = 4)
temp1: +91.4F (high = +172.4°F, hyst = +105.8°F) sensor = thermistor
temp2: +128.3F (high = +176.0°F, hyst = +167.0°F) sensor = thermistor
beep_enable:enabled

Do those power flows look correct? It is with a new Antec Basiq BP550 Plus 550W Continuous Power ATX12V V2.2 Modular Active PFC power supply too unless it is defected? Or maybe the GeForce FX is bad now?

$ top
top - 22:44:31 up 6 days, 2:14, 3 users, load average: 7.72, 4.93, 2.32
Tasks: 142 total, 3 running, 139 sleeping, 0 stopped, 0 zombie
Cpu0 : 4.8%us, 1.4%sy, 0.7%ni, 92.2%id, 0.8%wa, 0.1%hi, 0.1%si, 0.0%st
Mem: 2594748k total, 1474872k used, 1119876k free, 227992k buffers
Swap: 2361512k total, 5820k used, 2355692k free, 761668k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
565 root 20 0 101m 66m 5420 R 99.9 2.6 59:05.45 Xorg
14954 ant 20 0 2468 1180 892 R 0.3 0.0 0:00.02 top
1 root 20 0 2036 368 316 S 0.0 0.0 0:02.06 init
...

I looked at my ~/.xsession-errors file to the end:
Xsession: X session started for ant at Wed Jan 13 06:24:06 PST 2010
startkde: Starting up...
kbuildsycoca running...
/tmp/kde-ant/kcminitizrDCa.tmp:1:2: error: invalid preprocessing directive #http
Gtk-Message: Failed to load module "canberra-gtk-module": libcanberra-gtk-module.so: cannot open shared object file: No such file or directory
...
(seamonkey-bin:26879): Gdk-WARNING **: XID collision, trouble ahead
...
X Error: BadWindow (invalid Window parameter) 3
Major opcode: 19
Minor opcode: 0
Resource id: 0x1a6a36e
X Error: BadWindow (invalid Window parameter) 3
Major opcode: 19
Minor opcode: 0
Resource id: 0x3400008
X Error: BadWindow (invalid Window parameter) 3
Major opcode: 19
Minor opcode: 0
Resource id: 0x3017201
X Error: BadWindow (invalid Window parameter) 3
Major opcode: 19
Minor opcode: 0
Resource id: 0x3000024
kwin: X_SetInputFocus(0x282d216): BadMatch (invalid parameter attributes)
X Error: BadWindow (invalid Window parameter) 3
Major opcode: 19
Minor opcode: 0
Resource id: 0x3200008
I saw a bunch of "(seamonkey-bin:#): Gdk-WARNING **: XID collision, trouble ahead" lines. I did a quick search in Google and saw Firefox users having them too, so I assume this is unrelated to my crashes?

http://pastie.org/782807 for /var/log/X11/Xorg.0.log since the forum said my reply was too long. :P

I just tried another idea was to uninstall NVIDIA drivers with /usr/bin/nvidia-uninstall (never did that in the past), recompiled, reinstall, and restart X. I wonder if that will fix my issue. It will probably take about a week or so like previous crashes.
__________________
Ant @ The Ant Farm (http://antfarm.ma.cx) and Ant's Quality Foraged Links (http://aqfl.net).
ant is offline   Reply With Quote
Old 02-15-10, 11:47 AM   #3
mkkot
Registered User
 
Join Date: Dec 2007
Posts: 4
Default Re: High CPU usages and sometimes hard crashes.

Hi ant!

My friend had very simmilar problem on same system and he said me today there was new version of driver released NVIDIA-Linux-x86-173.14.25-pkg1 which eliminates the problem.
mkkot is offline   Reply With Quote
Old 02-15-10, 12:36 PM   #4
ant
A GeForced Ant
 
ant's Avatar
 
Join Date: Sep 2002
Location: An Ant Farm
Posts: 348
Default Re: High CPU usages and sometimes hard crashes.

Quote:
Originally Posted by mkkot View Post
Hi ant!

My friend had very simmilar problem on same system and he said me today there was new version of driver released NVIDIA-Linux-x86-173.14.25-pkg1 which eliminates the problem.
Same system and similiar problems? Are you using same exact hardwares too? Was this a known issue or something?
__________________
Ant @ The Ant Farm (http://antfarm.ma.cx) and Ant's Quality Foraged Links (http://aqfl.net).

Last edited by ant; 01-16-11 at 04:27 PM.
ant is offline   Reply With Quote
Old 01-16-11, 04:28 PM   #5
ant
A GeForced Ant
 
ant's Avatar
 
Join Date: Sep 2002
Location: An Ant Farm
Posts: 348
Default Re: High CPU usages and sometimes hard crashes.

I forgot to follow-up, but this issue seems to be fixed.
__________________
Ant @ The Ant Farm (http://antfarm.ma.cx) and Ant's Quality Foraged Links (http://aqfl.net).
ant is offline   Reply With Quote
Reply


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -5. The time now is 06:16 PM.


Powered by vBulletin® Version 3.7.1
Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Copyright 1998 - 2014, nV News.