Go Back   nV News Forums > Linux Support Forums > NVIDIA Linux

Newegg Daily Deals

Reply
 
Thread Tools
Old 04-16-12, 03:05 PM   #1
rburgess
Registered User
 
Join Date: Apr 2012
Posts: 4
Default Quadro 600 SLES11SP1 variety of problems, driver suspect

Hello,

I have a Q600 that will run X, but cudasdk examples run very inconsistently or not at all (nbody, bandwidthTest, etc.), driver loads slowly (15-30 seconds) and inconsistently.

290.10 driver almost never loads, gives error:
NVIDIA: could not open the device file /dev/nvidia0 (Input/output error).
Failed to initialize NVML: Unknown Error

295.33 intermittently loads, gives same error as above. Also noticed that bandwidthTest reports:
cudaGetDeviceProperties returned 10
-> invalid device ordinal

295.40 seems to fix the driver loading problem, but cudasdk samples do not always run; they often get stuck in a loop (30-80% of the time). I do not see anything different when comparing an strace -f from a good run and a bad run; the bad one just gets stuck looping this:
futex(0x64fd20, FUTEX_WAIT_PRIVATE, 2, NULL) = -1 EAGAIN (Resource temporarily unavailable)
futex(0x64fd20, FUTEX_WAKE_PRIVATE, 1) = 0

An FX1800 in the same system does not have the same problems. I have recompiled the samples on this system with no change. I see this card should have been supported for quite some time. I've also tried older drivers (275, 280) with no luck.

Does anyone have any suggestions for further troubleshooting?
Attached Files
File Type: gz nvidia-bug-report-295.33.log.gz (70.1 KB, 37 views)
File Type: gz nvidia-bug-report-295.40.log.gz (83.8 KB, 40 views)
rburgess is offline   Reply With Quote
Old 04-17-12, 10:45 AM   #2
rburgess
Registered User
 
Join Date: Apr 2012
Posts: 4
Default Re: Quadro 600 SLES11SP1 variety of problems, driver suspect

275.21 drivers load in only 2s, but they appear to be too old to run CUDA sdk examples:
cudaGetDeviceProperties returned 35
-> CUDA driver version is insufficient for CUDA runtime version
[bandwidthTest] test results...
FAILED
rburgess is offline   Reply With Quote
Old 04-18-12, 09:02 AM   #3
rburgess
Registered User
 
Join Date: Apr 2012
Posts: 4
Default Re: Quadro 600 SLES11SP1 variety of problems, driver suspect

It looks like there may be a 64GB system memory limit for this card with current drivers. I can pin the tests to any CPU in the system, as long as I only use memory from CPU0 (64GB of 256GB). Pulling memory from each CPU so the system total is 64GB (16GB/ea) allows tests to run in memory attached to all CPUs. Confirmed adding memory to CPU0 (32GB; system total 80GB) makes tests against CPU3 memory fail.

Is there a better contact than linux-bugs@nvidia.com for reporting these problems? I don't even get an automated response from that address.
rburgess is offline   Reply With Quote
Old 04-19-12, 08:35 AM   #4
sandipt
NVIDIA Corporation
 
sandipt's Avatar
 
Join Date: Dec 2010
Posts: 260
Default Re: Quadro 600 SLES11SP1 variety of problems, driver suspect

what versions of cuda sdk, cuda toolkit and gpu driver you are using?
sandipt is offline   Reply With Quote
Old 04-19-12, 12:48 PM   #5
rburgess
Registered User
 
Join Date: Apr 2012
Posts: 4
Default Re: Quadro 600 SLES11SP1 variety of problems, driver suspect

Quote:
Originally Posted by sandipt View Post
what versions of cuda sdk, cuda toolkit and gpu driver you are using?
sdk/toolkit 4.1.21 and 4.1.28 have been tried; 295.40 driver right now, but others have been tried, as listed above.
rburgess is offline   Reply With Quote
Reply


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -5. The time now is 02:22 AM.


Powered by vBulletin® Version 3.7.1
Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Copyright 1998 - 2014, nV News.