Go Back   nV News Forums > Linux Support Forums > NVIDIA Linux

Newegg Daily Deals

Reply
 
Thread Tools
Old 07-01-12, 04:37 PM   #1
ssnyder
Registered User
 
Join Date: Oct 2005
Posts: 13
Angry Fatal upgrade: from bad (8800 GTS) to worse 9GTX 560 Ti)

I've had a 8800 GTS 512 since shortly after it came out. It was stable and worked well with the NVidia binary drivers up until Fedora 16 and the v3.2+ Linux kernels. (All references to the NVidia drivers pertain to the latest released drivers from NVidia's website.) This latter combination has never been completely stable, but I got frequent Xorg crashes down to just occasional with the "nvidia-smi -pm 1" command.

Today I upgraded my video card, to a NVidia-branded GTX 560 Ti, purchased from Best Buy. Now I can't even make the transition from text mode to graphics mode.

Before shutting down my computer I uninstalled the x86_64-295.59 and reinstalled the Mesa packages. (You have to reinstall Mesa because the NVidia uninstall leaves dangling symlinks to the 3D libraries.) After swapping video cards and restarted my computer to run level 3 (multi-user text mode) like I always do. No problems seen. My text-mode screen worked fine and lspci correctly identified the GeForce 560 Ti on the PCI bus.

I backed up my xorg.conf file and reinstalled the x86_64-295.59 NVidia drivers, opting to have it modify my xorg.conf file. No problems seen in driver installation. A diff of the old and new xorg.conf files showed that only a comment referring to the file creation date was different between the two.

So now I should have been back to where I was with the 8800 GTS 512, except with faster hardware. From a user prompt I entered "xstart" (KDE desktop) as always. The screen was cleared except for a non-blinking cursor in the top-left corner of the screen. It stayed that way while I waited... and waited... and waited.

I SSH'd into the computer from another machine and what I see is

Code:
kernel: [  573.265300] vgaarb: device changed decodes: PCI:0000:02:00.0,olddecodes=none,decodes=none:owns=io+mem
kernel: [  573.265634] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  295.59  Wed Jun  6 21:19:40 PDT 2012
kernel: [  574.501110] NVRM: GPU at 0000:02:00.0 has fallen off the bus.
followed by an endless series of this:

Code:
kernel: [  600.160984] BUG: soft lockup - CPU#6 stuck for 22s! [X:2800]
kernel: [  600.160987] Modules linked in: nvidia(PO) des_generic md4 nls_utf8 cifs fscache lockd vmnet(O) fuse vsock(O) vmci(O) vmmon(O) reiserfs snd_hda_codec_hdmi raid10 snd_hda_codec_realtek uvcvideo videobuf2_core videodev snd_usb_audio media joydev videobuf2_vmalloc snd_usbmidi_lib igb snd_rawmidi videobuf2_memops ppdev snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device iTCO_wdt parport_pc snd_pcm parport iTCO_vendor_support i5400_edac coretemp uinput i2c_i801 ioatdma microcode edac_core serio_raw i5k_amb dca shpchp snd_timer snd soundcore i2c_core snd_page_alloc sunrpc firewire_ohci firewire_core crc_itu_t [last unloaded: nvidia]
kernel: [  600.161002] CPU 6 
kernel: [  600.161002] Modules linked in: nvidia(PO) des_generic md4 nls_utf8 cifs fscache lockd vmnet(O) fuse vsock(O) vmci(O) vmmon(O) reiserfs snd_hda_codec_hdmi raid10 snd_hda_codec_realtek uvcvideo videobuf2_core videodev snd_usb_audio media joydev videobuf2_vmalloc snd_usbmidi_lib igb snd_rawmidi videobuf2_memops ppdev snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device iTCO_wdt parport_pc snd_pcm parport iTCO_vendor_support i5400_edac coretemp uinput i2c_i801 ioatdma microcode edac_core serio_raw i5k_amb dca shpchp snd_timer snd soundcore i2c_core snd_page_alloc sunrpc firewire_ohci firewire_core crc_itu_t [last unloaded: nvidia]
kernel: [  600.161002] 
kernel: [  600.161002] Pid: 2800, comm: X Tainted: P           O 3.4.2-1.fc16.x86_64 #1 Supermicro X7DWA/X7DWA
kernel: [  600.161002] RIP: 0010:[<ffffffffa0fc2dcd>]  [<ffffffffa0fc2dcd>] _nv014768rm+0xac/0xc4 [nvidia]
kernel: [  600.161002] RSP: 0018:ffff880220cd7878  EFLAGS: 00000297
kernel: [  600.161002] RAX: 0000000000000005 RBX: ffffffff8101b3d9 RCX: 0000000000000001
kernel: [  600.161002] RDX: 0000000000005400 RSI: 00000000000054a1 RDI: ffff880222fbe034
kernel: [  600.161002] RBP: ffff880215395c70 R08: 0000000000070000 R09: ffff880215395ca8
kernel: [  600.161002] R10: ffff880215395d9c R11: ffff8802201ce0c8 R12: ffffffff8101b3d9
kernel: [  600.161002] R13: ffff880220cd77e8 R14: ffff880213da6470 R15: ffff880222fbe008
kernel: [  600.161002] FS:  00007fbc54707880(0000) GS:ffff88022fd80000(0000) knlGS:0000000000000000
kernel: [  600.161002] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kernel: [  600.161002] CR2: 00007fbc54743000 CR3: 000000022016a000 CR4: 00000000000007e0
kernel: [  600.161002] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
kernel: [  600.161002] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
kernel: [  600.161002] Process X (pid: 2800, threadinfo ffff880220cd6000, task ffff880222c74530)
kernel: [  600.161002] Stack:
kernel: [  600.161002]  0000000000005400 ffffffffa0fc215c ffffffffa13cb34d ffff880222fbe008
kernel: [  600.161002]  0000000000070000 ffffffffa13b7dd0 ffff880222fbe008 ffffffffa13e1148
kernel: [  600.161002]  ffff880222fbe008 0000000000000045 0000000000000000 0000000000000001
kernel: [  600.161002] Call Trace:
kernel: [  600.161002]  [<ffffffffa0fc215c>] ? _nv014372rm+0x225/0x42e [nvidia]
kernel: [  600.161002]  [<ffffffffa13cb34d>] ? _nv009539rm+0x2e8/0x2e8 [nvidia]
kernel: [  600.161002]  [<ffffffffa13b7dd0>] ? _nv009666rm+0x45/0xa1 [nvidia]
kernel: [  600.161002]  [<ffffffffa13e1148>] ? _nv004000rm+0x695b/0xd2d8 [nvidia]
kernel: [  600.161002]  [<ffffffffa1021359>] ? _nv002305rm+0x4c0/0x4d1 [nvidia]
kernel: [  600.161002]  [<ffffffffa1021561>] ? _nv002010rm+0x1f7/0x20d [nvidia]
kernel: [  600.161002]  [<ffffffffa1020694>] ? _nv002298rm+0x586/0x5ab [nvidia]
kernel: [  600.161002]  [<ffffffffa101afbb>] ? _nv002162rm+0xce8/0x10c9 [nvidia]
kernel: [  600.161002]  [<ffffffffa101b8ad>] ? _nv002190rm+0x511/0x7c3 [nvidia]
kernel: [  600.161002]  [<ffffffffa12850f6>] ? _nv005766rm+0x443/0x5d5 [nvidia]
kernel: [  600.161002]  [<ffffffffa128868b>] ? _nv005927rm+0xcd/0xff [nvidia]
kernel: [  600.161002]  [<ffffffffa1288651>] ? _nv005927rm+0x93/0xff [nvidia]
kernel: [  600.161002]  [<ffffffffa13e5fa5>] ? _nv004000rm+0xb7b8/0xd2d8 [nvidia]
kernel: [  600.161002]  [<ffffffffa13e41eb>] ? _nv004000rm+0x99fe/0xd2d8 [nvidia]
kernel: [  600.161002]  [<ffffffffa13e40c9>] ? _nv004000rm+0x98dc/0xd2d8 [nvidia]
kernel: [  600.161002]  [<ffffffffa0f7f621>] ? _nv009830rm+0x28/0x41 [nvidia]
kernel: [  600.161002]  [<ffffffffa1682d02>] ? _nv014657rm+0x819/0x942 [nvidia]
kernel: [  600.161002]  [<ffffffffa1683e01>] ? _nv001088rm+0x522/0x7a1 [nvidia]
kernel: [  600.161002]  [<ffffffffa167a7cc>] ? rm_init_adapter+0xae/0x1bb [nvidia]
kernel: [  600.161002]  [<ffffffffa169d8ac>] ? nv_kern_open+0x46c/0x820 [nvidia]
kernel: [  600.161002]  [<ffffffff81183d22>] ? chrdev_open+0xa2/0x180
kernel: [  600.161002]  [<ffffffff81183c80>] ? cdev_put+0x30/0x30
kernel: [  600.161002]  [<ffffffff8117d6de>] ? __dentry_open+0x24e/0x310
kernel: [  600.161002]  [<ffffffff8117ead1>] ? nameidata_to_filp+0x71/0x80
kernel: [  600.161002]  [<ffffffff8118e51c>] ? do_last+0x26c/0x8e0
kernel: [  600.161002]  [<ffffffff8118eca3>] ? path_openat+0xd3/0x3c0
kernel: [  600.161002]  [<ffffffff810626d0>] ? ns_capable+0x30/0x60
kernel: [  600.161002]  [<ffffffff8118f0b2>] ? do_filp_open+0x42/0xa0
kernel: [  600.161002]  [<ffffffff8119b01f>] ? alloc_fd+0x4f/0x130
kernel: [  600.161002]  [<ffffffff8117ebd8>] ? do_sys_open+0xf8/0x1d0
kernel: [  600.161002]  [<ffffffff810d06bc>] ? __audit_syscall_entry+0xcc/0x310
kernel: [  600.161002]  [<ffffffff8117ecd1>] ? sys_open+0x21/0x30
kernel: [  600.161002]  [<ffffffff81600329>] ? system_call_fastpath+0x16/0x1b
kernel: [  600.161002] Code: 8b 45 00 c7 40 28 01 00 00 00 f6 83 c0 00 00 00 02 74 11 48 c7 c7 10 2a 86 a1 e8 e4 0b 6b 00 e8 ea 55 6e 00 b8 00 00 00 00 eb 12 <89> c0 ba 01 00 00 00 d3 e2 85 14 87 0f 95 c0 0f b6 c0 48 83 c5 
kernel: [  600.161002] Call Trace:
kernel: [  600.161002]  [<ffffffffa0fc215c>] ? _nv014372rm+0x225/0x42e [nvidia]
kernel: [  600.161002]  [<ffffffffa13cb34d>] ? _nv009539rm+0x2e8/0x2e8 [nvidia]
kernel: [  600.161002]  [<ffffffffa13b7dd0>] ? _nv009666rm+0x45/0xa1 [nvidia]
kernel: [  600.161002]  [<ffffffffa13e1148>] ? _nv004000rm+0x695b/0xd2d8 [nvidia]
kernel: [  600.161002]  [<ffffffffa1021359>] ? _nv002305rm+0x4c0/0x4d1 [nvidia]
kernel: [  600.161002]  [<ffffffffa1021561>] ? _nv002010rm+0x1f7/0x20d [nvidia]
kernel: [  600.161002]  [<ffffffffa1020694>] ? _nv002298rm+0x586/0x5ab [nvidia]
kernel: [  600.161002]  [<ffffffffa101afbb>] ? _nv002162rm+0xce8/0x10c9 [nvidia]
kernel: [  600.161002]  [<ffffffffa101b8ad>] ? _nv002190rm+0x511/0x7c3 [nvidia]
kernel: [  600.161002]  [<ffffffffa12850f6>] ? _nv005766rm+0x443/0x5d5 [nvidia]
kernel: [  600.161002]  [<ffffffffa128868b>] ? _nv005927rm+0xcd/0xff [nvidia]
kernel: [  600.161002]  [<ffffffffa1288651>] ? _nv005927rm+0x93/0xff [nvidia]
kernel: [  600.161002]  [<ffffffffa13e5fa5>] ? _nv004000rm+0xb7b8/0xd2d8 [nvidia]
kernel: [  600.161002]  [<ffffffffa13e41eb>] ? _nv004000rm+0x99fe/0xd2d8 [nvidia]
kernel: [  600.161002]  [<ffffffffa13e40c9>] ? _nv004000rm+0x98dc/0xd2d8 [nvidia]
kernel: [  600.161002]  [<ffffffffa0f7f621>] ? _nv009830rm+0x28/0x41 [nvidia]
kernel: [  600.161002]  [<ffffffffa1682d02>] ? _nv014657rm+0x819/0x942 [nvidia]
kernel: [  600.161002]  [<ffffffffa1683e01>] ? _nv001088rm+0x522/0x7a1 [nvidia]
kernel: [  600.161002]  [<ffffffffa167a7cc>] ? rm_init_adapter+0xae/0x1bb [nvidia]
kernel: [  600.161002]  [<ffffffffa169d8ac>] ? nv_kern_open+0x46c/0x820 [nvidia]
kernel: [  600.161002]  [<ffffffff81183d22>] ? chrdev_open+0xa2/0x180
kernel: [  600.161002]  [<ffffffff81183c80>] ? cdev_put+0x30/0x30
kernel: [  600.161002]  [<ffffffff8117d6de>] ? __dentry_open+0x24e/0x310
kernel: [  600.161002]  [<ffffffff8117ead1>] ? nameidata_to_filp+0x71/0x80
kernel: [  600.161002]  [<ffffffff8118e51c>] ? do_last+0x26c/0x8e0
kernel: [  600.161002]  [<ffffffff8118eca3>] ? path_openat+0xd3/0x3c0
kernel: [  600.161002]  [<ffffffff810626d0>] ? ns_capable+0x30/0x60
kernel: [  600.161002]  [<ffffffff8118f0b2>] ? do_filp_open+0x42/0xa0
kernel: [  600.161002]  [<ffffffff8119b01f>] ? alloc_fd+0x4f/0x130
kernel: [  600.161002]  [<ffffffff8117ebd8>] ? do_sys_open+0xf8/0x1d0
kernel: [  600.161002]  [<ffffffff810d06bc>] ? __audit_syscall_entry+0xcc/0x310
kernel: [  600.161002]  [<ffffffff8117ecd1>] ? sys_open+0x21/0x30
kernel: [  600.161002]  [<ffffffff81600329>] ? system_call_fastpath+0x16/0x1b
Hmmmm.... What's going on? Maybe that "nvidia-smi -pm 1" is inappropriate for the newer hardware? I remove it from my rc.local and reboot. Again the screen shows only the unblinking cursor and again the series of kernel panics.

Maybe that .nvidia-settings-rc file contains settings that worked for the old hardware but are bad for the newer card? Renamed the config file and rebooted. Same behavior on "xstart"

I poked and prodded it a few more times but with no improvement. Every time my screen is locked and I get a series of kernel panics that vary from test to test only in which CPU is locked up.

Finally, to regain the use of my computer I uninstalled the NVidia drivers, removed every trace of them I could find and changed to using the nouveau driver. It started up Xorg/KDE on the first try and has been running without apparent problem for about an hour now. Of course the downside to this refreshing stability is that I do not have access to all the features of my shiny-new video card.

I'm not thrilled about this upgrade experience.
ssnyder is offline   Reply With Quote
Old 07-01-12, 08:14 PM   #2
johnc
Registered User
 
Join Date: May 2011
Posts: 92
Default Re: Fatal upgrade: from bad (8800 GTS) to worse 9GTX 560 Ti)

I would say to post a bug report log, but then you'd have to re-install your nvidia drivers. If you do, you could always try the 302.17 series to see if it works.
johnc is offline   Reply With Quote
Reply


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -5. The time now is 01:41 AM.


Powered by vBulletin® Version 3.7.1
Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Copyright 1998 - 2014, nV News.