Go Back   nV News Forums > Linux Support Forums > NVIDIA Linux

Newegg Daily Deals

Reply
 
Thread Tools
Old 11-07-07, 03:50 PM   #13
netllama
NVIDIA Corporation
 
Join Date: Dec 2004
Posts: 8,763
Default Re: Mysterious Message

You should start by following the instructions in all the forum sticky posts.
netllama is offline   Reply With Quote
Old 11-08-07, 10:27 AM   #14
jesmith
Registered User
 
Join Date: Mar 2007
Posts: 64
Default Re: Mysterious Message

Quote:
If you are seeing severe stability problems and you are using a Linux 2.6 SMP kernel on a system with multiple processors (or processor cores) in combination with more than one GPU, please search the output of `dmesg` for the presence of the message below after the system has just been started:
PCI: Using MMCONFIG

If this message is present, please boot the system with the pci=nommconf kernel parameter and check if the stability problems continue to reproduce.
I only have one GPU installed, so this probably does not apply. However, my dmesg says:
[ 0.069897] PCI: Not using MMCONFIG.
so I'm OK either way.

Quote:
If your system is equipped with a dual-core processor, booting with the idle=poll and/or maxcpus=1 kernel parameters may improve reliability with some Linux kernels.
dmesg says:
[ 21.976468] using mwait in idle threads.
and I have two cores, so I can try both of these.

However, poll=idle could make the system run very hot, and losing one of my cores will significantly hurt the performance of our application. So I'll try these to see if they have an effect, but they certainly cannot be considered a solution (or even a work-around).

These tests are now running on two systems. I'll get back to you when I have some results.

Quote:
If you are using an AGP graphics card, please test setting NvAGP to 0 in xorg.conf. If this eliminates the instability, then you are experiencing a problem outside of the NVIDIA X driver, either in the motherboard BIOS, kernel, kernel AGP driver, or possibly in the motherboard itself.
Nope, PCI-E.

Quote:
If you are using a Linux/x86-64 2.6 kernel and see the warning message below
Nope.

Quote:
If you are using the 1.0-7676 NVIDIA Linux/x86-64 graphics driver release and a Linux/x86-64 2.6 kernel < Linux 2.6.11,
Nope.

Quote:
If you are using a Linux/x86-64 kernel >= Linux 2.6.11 and < Linux 2.6.14,
Nope, 2.6.20.

Quote:
If you see warning messages similar to those below in the system log file(s) when starting the X server or OpenGL applications,
Nope.

Quote:
For any problem that involves instability, you should always verify that you are using the most recently released BIOS for the motherboard.
Intel says the latest is:
BIOS Update 1709 [MQ96510J.86A] (1094KB)
1709 10/11/2007
Which is exactly what I'm running.

Quote:
To make sure this log file includes as much relevant information as possible, please start the X server with `startx -- -logverbose 6` and run `nvidia-bug-report.sh` after the problem has occurred.
The bug report I posted before was captured before the problem occurred and I didn't have the logverbose 6 option turned on. There is no way for me to get the report after the problem occurs, since the machine is completely and utterly wedged. Would turning on that X option and rerunning the pre-death report be of any use?
jesmith is offline   Reply With Quote
Old 11-08-07, 10:30 AM   #15
netllama
NVIDIA Corporation
 
Join Date: Dec 2004
Posts: 8,763
Default Re: Mysterious Message

Can you setup a serial console?
netllama is offline   Reply With Quote
Old 11-08-07, 11:27 AM   #16
jesmith
Registered User
 
Join Date: Mar 2007
Posts: 64
Default Re: Mysterious Message

I have no idea! I haven't thought about serial TTYs in 20 years!

Let's wait to see the result of the idle=poll, and maxcpus=1 tests, and then I'll see what's involved in getting a serial console connected.

Is your thought that I might be able to interact with the system through that console when everything else is wedged?
jesmith is offline   Reply With Quote
Old 11-08-07, 11:39 AM   #17
netllama
NVIDIA Corporation
 
Join Date: Dec 2004
Posts: 8,763
Default Re: Mysterious Message

I'm hoping that you'll be able to capture some more useful kernel messages via serial console when the system hangs.
netllama is offline   Reply With Quote
Old 11-19-07, 03:53 PM   #18
jesmith
Registered User
 
Join Date: Mar 2007
Posts: 64
Default Re: Mysterious Message

Setting up a serial console would be a challenge because the motherboard has no serial port. Before trying to find out whether I can get Linux to treat a USB port as a serial console, I figured I'd get the machine on the network and wait for a hang to learn more.

The hang took several days to recur, but now it is hung, and it appears the machine is completely wedged as I suspected. It does not respond to ping, much less ssh. I believe, therefore, that hookup up a serial console is probably pointless, since no processes appear to be getting any CPU.

I'm thinking that perhaps my next step should be to try the new Beta 169.04 driver. Do you agree?
jesmith is offline   Reply With Quote
Old 11-20-07, 02:28 PM   #19
jesmith
Registered User
 
Join Date: Mar 2007
Posts: 64
Default Re: Mysterious Message

I'm running the 169.04 driver now. I'll let you know what happens...
jesmith is offline   Reply With Quote
Old 11-27-07, 10:09 AM   #20
jesmith
Registered User
 
Join Date: Mar 2007
Posts: 64
Default Re: Mysterious Message

It's hard to know when an intermittent problem like this is gone, but SO FAR SO GOOD! I'm running the 169.04 drivers, and there are no more of these XID messages in the log, and the system has been running for 6 days straight without a hang (the previous record was 4 days).
jesmith is offline   Reply With Quote

Reply


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


Similar Threads
Thread Thread Starter Forum Replies Last Post
The Creator of Linux has a message for NVIDIA wysiwyg NVIDIA Linux 0 06-23-12 03:23 AM
DIY Burglar Alarm Sends You a Text Message Whenever It Detects Motion [DIY] News Archived News Items 0 05-04-12 11:00 AM
Regarding the moderator's message in the OF. PCarr78 Feedback Forum 44 10-12-02 06:22 AM

All times are GMT -5. The time now is 07:55 AM.


Powered by vBulletin® Version 3.7.1
Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Copyright 1998 - 2014, nV News.