Go Back   nV News Forums > Linux Support Forums > NVIDIA Linux

Newegg Daily Deals

Reply
 
Thread Tools
Old 11-29-06, 03:50 PM   #1
vflorins
Registered User
 
Join Date: Nov 2006
Posts: 10
Exclamation Kernel Oops with 9629 on x86_64

I would like to report what appears to be a serious bug in the kernel module. The system is running Fedora 6 on a 64 bit AMD CPU. The video cards are a 7900GT and a built in 6150. First, the relevant portion from the system log:

Nov 29 10:43:51 zeos kernel: Unable to handle kernel NULL pointer dereference at 00000000000000e8 RIP:
Nov 29 10:43:51 zeos kernel: [<ffffffff881cc7ca>] :nvidia:_nv008724rm+0x17c/0x512
Nov 29 10:43:51 zeos kernel: PGD 6fad3067 PUD 719d3067 PMD 0
Nov 29 10:43:51 zeos kernel: Oops: 0000 [1] SMP
Nov 29 10:43:51 zeos kernel: last sysfs file: /block/hda/size
Nov 29 10:43:51 zeos kernel: CPU 0
Nov 29 10:43:51 zeos kernel: Modules linked in: ipv6 w83627ehf hwmon eeprom i2c_isa ip_conntrack_netbios_ns ipt_REJECT xt_state ip_conntrack nfnetlink xt_tcp
udp iptable_filter ip_tables x_tables cpufreq_ondemand dm_mirror dm_mod video sbs i2c_ec button battery asus_acpi ac parport_pc lp parport snd_hda_intel snd_
hda_codec snd_seq_dummy snd_seq_oss tuner snd_bt87x snd_seq_midi_event tvaudio snd_seq floppy bttv video_buf ir_common compat_ioctl32 i2c_algo_bit btcx_risc
tveeprom sg snd_seq_device videodev ohci1394 snd_pcm_oss ieee1394 ide_cd cdrom snd_mixer_oss forcedeth k8_edac snd_pcm v4l1_compat snd_timer snd soundcore sn
d_page_alloc nvidia(U) serio_raw edac_mc v4l2_common i2c_nforce2 pcspkr shpchp i2c_core usblp sata_nv libata sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_
hcd
Nov 29 10:43:51 zeos kernel: Pid: 2549, comm: X Tainted: P 2.6.18-1.2849.fc6 #1
Nov 29 10:43:51 zeos kernel: RIP: 0010:[<ffffffff881cc7ca>] [<ffffffff881cc7ca>] :nvidia:_nv008724rm+0x17c/0x512
Nov 29 10:43:51 zeos kernel: RSP: 0018:ffff81007a2437e8 EFLAGS: 00010292
Nov 29 10:43:51 zeos kernel: RAX: 0000000000000000 RBX: ffff81006ee6d000 RCX: ffff81006ee6e000
Nov 29 10:43:51 zeos kernel: RDX: ffff81006ee6e000 RSI: 0000000000000003 RDI: 0000000000000000
Nov 29 10:43:51 zeos kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
Nov 29 10:43:51 zeos kernel: R10: ffff81006ee6c6d8 R11: ffffffff8845e73c R12: 0000000000000000
Nov 29 10:43:51 zeos kernel: R13: 0000000000000000 R14: ffff81006ee6e000 R15: 0000000000000002
Nov 29 10:43:51 zeos kernel: FS: 00002aaaaaacba80(0000) GS:ffffffff8060a000(0000) knlGS:0000000000000000
Nov 29 10:43:51 zeos kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Nov 29 10:43:51 zeos kernel: CR2: 00000000000000e8 CR3: 00000000709d8000 CR4: 00000000000006e0
Nov 29 10:43:51 zeos kernel: Process X (pid: 2549, threadinfo ffff81007a242000, task ffff81007a2c47d0)
Nov 29 10:43:51 zeos kernel: Stack: 0000000300000000 ffff81006e603000 000000017043e000 ffff81006ef58800
Nov 29 10:43:51 zeos kernel: ffff81007a2438c4 ffff810070d40000 ffff81006ee6e000 ffffffff00000001
Nov 29 10:43:51 zeos kernel: 0000000000000000 00000000ffffffff ffff810000000000 ffff81006ef58800
Nov 29 10:43:51 zeos kernel: Call Trace:
Nov 29 10:43:51 zeos kernel: [<ffffffff881ccd30>] :nvidia:_nv008037rm+0x1d0/0x81e
Nov 29 10:43:51 zeos kernel: [<ffffffff881cb99a>] :nvidia:_nv008010rm+0x1d8/0xe10
Nov 29 10:43:51 zeos kernel: [<ffffffff881cb77e>] :nvidia:_nv008645rm+0x20e/0x252
Nov 29 10:43:51 zeos kernel: [<ffffffff881cf493>] :nvidia:_nv008917rm+0x17f/0x1f2
Nov 29 10:43:51 zeos kernel: [<ffffffff881cf62d>] :nvidia:_nv008918rm+0x127/0x15e
Nov 29 10:43:51 zeos kernel: [<ffffffff8822c630>] :nvidia:_nv008456rm+0x158/0xb1c
Nov 29 10:43:51 zeos kernel: [<ffffffff881f085b>] :nvidia:_nv008922rm+0x5f/0x72
Nov 29 10:43:51 zeos kernel: [<ffffffff881ef9a1>] :nvidia:_nv004473rm+0xc2d/0x1232
Nov 29 10:43:51 zeos kernel: [<ffffffff881f867d>] :nvidia:_nv001684rm+0x251/0x290
Nov 29 10:43:51 zeos kernel: [<ffffffff882a287d>] :nvidia:_nv007440rm+0x73/0x86
Nov 29 10:43:51 zeos kernel: [<ffffffff8830b9cb>] :nvidia:_nv005932rm+0x1a9/0x208
Nov 29 10:43:51 zeos kernel: [<ffffffff8830b57d>] :nvidia:_nv005672rm+0x5b/0x74
Nov 29 10:43:51 zeos kernel: [<ffffffff88167176>] :nvidia:_nv005818rm+0xa/0x10
Nov 29 10:43:51 zeos kernel: [<ffffffff881909fb>] :nvidia:_nv002002rm+0xf3/0x198
Nov 29 10:43:51 zeos kernel: [<ffffffff8819147f>] :nvidia:_nv002008rm+0x245/0x35e
Nov 29 10:43:51 zeos kernel: [<ffffffff88195ad7>] :nvidia:rm_init_adapter+0x63/0x94
Nov 29 10:43:51 zeos kernel: [<ffffffff8840956d>] :nvidia:nv_kern_open+0x251/0x324
Nov 29 10:43:51 zeos kernel: [<ffffffff80247d62>] chrdev_open+0x149/0x198
Nov 29 10:43:51 zeos kernel: [<ffffffff8021e2bc>] __dentry_open+0xd9/0x1e2
Nov 29 10:43:51 zeos kernel: [<ffffffff8022746c>] do_filp_open+0x2a/0x38
Nov 29 10:43:51 zeos kernel: [<ffffffff802194c1>] do_sys_open+0x44/0xbe
Nov 29 10:43:51 zeos kernel: [<ffffffff8025bf0e>] system_call+0x7e/0x83
Nov 29 10:43:51 zeos kernel: DWARF2 unwinder stuck at system_call+0x7e/0x83
Nov 29 10:43:51 zeos kernel: Leftover inexact backtrace:
Nov 29 10:43:51 zeos kernel:
Nov 29 10:43:51 zeos kernel:
Nov 29 10:43:51 zeos kernel: Code: ff 90 e8 00 00 00 89 44 2c 30 eb 14 44 89 e0 48 c7 44 c4 40
Nov 29 10:43:51 zeos kernel: RIP [<ffffffff881cc7ca>] :nvidia:_nv008724rm+0x17c/0x512
Nov 29 10:43:51 zeos kernel: RSP <ffff81007a2437e8>
Nov 29 10:43:51 zeos kernel: CR2: 00000000000000e8

This occurs when I startx. The nvidia kernel module is already loaded by that time, without errors. Because /proc/nvidia entries become unreadable after the crash, I cannot execute the bug report script. Below, I will try to provide the relevant information obtained by hand.

# uname -a
Linux zeos 2.6.18-1.2849.fc6 #1 SMP Fri Nov 10 12:34:46 EST 2006 x86_64 x86_64 x86_64 GNU/Linux

# lspci
00:00.0 RAM memory: nVidia Corporation C51 Host Bridge (rev a2)
00:00.1 RAM memory: nVidia Corporation C51 Memory Controller 0 (rev a2)
00:00.2 RAM memory: nVidia Corporation C51 Memory Controller 1 (rev a2)
00:00.3 RAM memory: nVidia Corporation C51 Memory Controller 5 (rev a2)
00:00.4 RAM memory: nVidia Corporation C51 Memory Controller 4 (rev a2)
00:00.5 RAM memory: nVidia Corporation C51 Host Bridge (rev a2)
00:00.6 RAM memory: nVidia Corporation C51 Memory Controller 3 (rev a2)
00:00.7 RAM memory: nVidia Corporation C51 Memory Controller 2 (rev a2)
00:02.0 PCI bridge: nVidia Corporation C51 PCI Express Bridge (rev a1)
00:03.0 PCI bridge: nVidia Corporation C51 PCI Express Bridge (rev a1)
00:04.0 PCI bridge: nVidia Corporation C51 PCI Express Bridge (rev a1)
00:05.0 VGA compatible controller: nVidia Corporation C51PV [GeForce 6150] (rev a2)
00:09.0 RAM memory: nVidia Corporation MCP51 Host Bridge (rev a2)
00:0a.0 ISA bridge: nVidia Corporation MCP51 LPC Bridge (rev a2)
00:0a.1 SMBus: nVidia Corporation MCP51 SMBus (rev a2)
00:0b.0 USB Controller: nVidia Corporation MCP51 USB Controller (rev a2)
00:0b.1 USB Controller: nVidia Corporation MCP51 USB Controller (rev a2)
00:0d.0 IDE interface: nVidia Corporation MCP51 IDE (rev a1)
00:0e.0 IDE interface: nVidia Corporation MCP51 Serial ATA Controller (rev a1)
00:0f.0 IDE interface: nVidia Corporation MCP51 Serial ATA Controller (rev a1)
00:10.0 PCI bridge: nVidia Corporation MCP51 PCI Bridge (rev a2)
00:10.1 Audio device: nVidia Corporation MCP51 High Definition Audio (rev a2)
00:14.0 Bridge: nVidia Corporation MCP51 Ethernet Controller (rev a1)
00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
03:00.0 VGA compatible controller: nVidia Corporation GeForce 7900 GT (rev a1)
04:07.0 Multimedia video controller: Brooktree Corporation Bt878 Video Capture (rev 11)
04:07.1 Multimedia controller: Brooktree Corporation Bt878 Audio Capture (rev 11)
04:08.0 FireWire (IEEE 1394): VIA Technologies, Inc. IEEE 1394 Host Controller (rev 80)

# cat /proc/interrupts
CPU0
0: 7173270 IO-APIC-edge timer
1: 335 IO-APIC-edge i8042
6: 6 IO-APIC-edge floppy
7: 1 IO-APIC-edge parport0
8: 0 IO-APIC-edge rtc
9: 0 IO-APIC-level acpi
14: 21447 IO-APIC-edge ide0
50: 0 IO-APIC-level nvidia
58: 2 IO-APIC-level nvidia
66: 3 IO-APIC-level ohci1394
74: 736542 IO-APIC-level bttv0, Bt87x audio
209: 728917 IO-APIC-level ohci_hcd:usb1, eth0
217: 207 IO-APIC-level ehci_hcd:usb2, HDA Intel
225: 19838 IO-APIC-level libata
233: 0 IO-APIC-level libata
NMI: 121
LOC: 7172126
ERR: 0
MIS: 0

The Xorg log and configuration files are also attached.
Attached Files
File Type: log Xorg.0.log (24.5 KB, 100 views)
File Type: txt xorg.conf.txt (1.0 KB, 100 views)
vflorins is offline   Reply With Quote
Old 11-29-06, 04:23 PM   #2
netllama
NVIDIA Corporation
 
Join Date: Dec 2004
Posts: 8,763
Default Re: Kernel Oops with 9629 on x86_64

Please generate a bug report after rebooting (yet before starting X again).

Also, please verify whether you're using the latest BIOS, and whether this reproduces with 1.0-9742.

Thanks,
Lonni
netllama is offline   Reply With Quote
Old 11-29-06, 05:27 PM   #3
vflorins
Registered User
 
Join Date: Nov 2006
Posts: 10
Default Re: Kernel Oops with 9629 on x86_64

Attachment added from nvidia-bug-report run before X started.
Attached Files
File Type: log nvidia-bug-report.log (104.9 KB, 93 views)
vflorins is offline   Reply With Quote
Old 11-29-06, 06:06 PM   #4
netllama
NVIDIA Corporation
 
Join Date: Dec 2004
Posts: 8,763
Default Re: Kernel Oops with 9629 on x86_64

Did you verify whether you're using the latest BIOS, and whether this reproduces with 1.0-9742?

Does this only reproduce with both GPUs in the system (instead of just the integrated)?
netllama is offline   Reply With Quote
Old 11-30-06, 01:30 AM   #5
vflorins
Registered User
 
Join Date: Nov 2006
Posts: 10
Default Re: Kernel Oops with 9629 on x86_64

Updated the BIOS to the latest version - no change.

I decided not to test without the PCIe card. This is a small form factor system with water CPU and GPU cooling, and removing cards is difficult and time consuming. However, after I disabled the integrated GPU in the BIOS, the kernel module stopped crashing. So, the bug is clearly related to the dual GPU setup.

Will test with the new driver version, time permitting.
vflorins is offline   Reply With Quote
Reply


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


Similar Threads
Thread Thread Starter Forum Replies Last Post
Random crashes, NVRM Xid messages Iesos NVIDIA Linux 90 10-04-12 04:27 AM
Corrupted display - 302.17 - Dell Precision T3500 (G98 [Quadro NVS 295]) gbailey NVIDIA Linux 1 06-27-12 11:24 AM
UEFI+Nvidia - NVRM: Your system is not currently configured to drive a VGA console... interzoneuk NVIDIA Linux 0 06-26-12 05:51 AM
xorg locks-up with newest nvidia drivers w/ vdpau. theroot NVIDIA Linux 1 06-24-12 12:04 PM
Crash when logout from X TGL NVIDIA Linux 10 09-13-02 09:22 PM

All times are GMT -5. The time now is 09:56 PM.


Powered by vBulletin® Version 3.7.1
Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Copyright 1998 - 2014, nV News.