06-05-09, 05:37 PM   #1
paulbjr
Registered User
 
Join Date: Jun 2009
Location: Oregon
Posts: 3
Intermittent problems with Quadro NVS 290 cards

We are developing a new release of an established Solaris-based product that we have sold for years on Solaris SPARC. We have now ported it to Solaris 10 x86 using Nvidia Quadro NVS 290 cards. One configuration has one Quadro NVS 290 (two displays); the other has two Quadro NVS 290 cards and four displays.

Of the twenty-two development and test machines we are using, TWO have on a number of occasions exhibited graphics freezes and/or system crashes. At other times, NVRM messages show up in our /var/adm/messages with no *apparent* harm to the system.

No other systems appear to have any NVRM messages!

We are running the driver that was packaged with the version of Solaris 10 we are using:

- version 100.14.19 dated Sept 12, 2007.

We are updating our systems to the latest 180.51 driver, but I have kept this system at the old driver level in case the old driver is helping to unmask hardware issues.

The errors occur at unpredictable times (not right after booting). When related crashes occur they are often a half hour to several hours after NVRM messages appear in /var/adm/messages.

This week, after several weeks of very few NVRM messages, a swarm of NVRM messages showed up and two crashes occurred. I did NOT notice any freeze on this box.

The history looks like this:

Jun 4 10:40:14 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000502d 00000184 00000466 00000008
Jun 4 10:40:14 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000502d 00000100 00000000 00000100
Jun 4 15:43:23 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 00005000 00000000 00000478 00000000
Jun 4 15:43:23 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000502d 00000100 00000000 00000100

system crash with core dump occurred at 15:44:36

Jun 4 15:49:49 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 00005000 00000000 00000478 00000000
Jun 4 15:49:49 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000502d 00000100 00000000 00000100

system crash with core dump occurred at 19:01:43

Jun 5 04:05:34 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 6, PE0001
Jun 5 04:05:34 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000502d 00000100 00000000 00000100
Jun 5 04:05:34 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000502d 000005e0 007f06a2 00000100
Jun 5 08:15:41 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000502d 00000184 00000466 00000008
Jun 5 11:11:05 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000502d 00000184 00000466 00000008
Jun 5 12:17:41 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 00000000 0000002d 00000000 00000478 00000000
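
In case it helps anyone compare machines, here is a minimal Python sketch for pulling the NVRM Xid lines out of a messages file and counting them per day and Xid number. It assumes the line format shown above; the function names (parse_xid_lines, summarize, report) are my own and not part of any Nvidia or Solaris tool, and it makes no attempt to interpret what the Xid numbers or payload words mean.

```python
import re
from collections import Counter

# Assumed format, taken from the log excerpt above, e.g.:
# "Jun 4 10:40:14 pluto nvidia: [ID 702911 kern.notice] NVRM: Xid (0001:00): 13, 0001 ..."
XID_RE = re.compile(
    r"^(?P<month>\w{3})\s+(?P<day>\d{1,2})\s+(?P<time>\d{2}:\d{2}:\d{2})\s+"
    r"(?P<host>\S+)\s+nvidia:.*?NVRM: Xid \((?P<bus>[0-9a-fA-F:]+)\): "
    r"(?P<xid>\d+),\s*(?P<payload>.*)$"
)


def parse_xid_lines(lines):
    """Return one dict per NVRM Xid line; all other lines are skipped."""
    events = []
    for line in lines:
        match = XID_RE.match(line.strip())
        if match:
            event = match.groupdict()
            event["xid"] = int(event["xid"])
            events.append(event)
    return events


def summarize(events):
    """Count events per (day, Xid number)."""
    return Counter((f"{e['month']} {e['day']}", e["xid"]) for e in events)


def report(path):
    """Print a per-day, per-Xid count for the given messages file."""
    with open(path, errors="replace") as fh:
        counts = summarize(parse_xid_lines(fh))
    for (day, xid), n in sorted(counts.items()):
        print(f"{day}  Xid {xid:>3}  x{n}")
```

Calling report("/var/adm/messages") on each of the twenty-two machines would give a quick way to confirm which boxes log Xid messages at all and whether the counts cluster before the crash times.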


Since I could not find an nvidia-bug-report.sh script on our system, I am uploading bug-report.gz containing /var/adm/messages, Xorg.0.log, and xorg.conf.

Any insight will be much appreciated.
Attached Files
File Type: gz bug-report.gz (19.6 KB, 166 views)