startx on SuSE 8.2 (2.4.20-64GB-SMP) with 1.0-4349 hangs system.


cko
04-17-03, 07:00 PM
Here is what I get:

# startx -- -logverbose 5


XFree86 Version 4.3.0
Release Date: 27 February 2003
X Protocol Version 11, Revision 0, Release 6.6
Build Operating System: SuSE Linux [ELF] SuSE
Build Date: 17 March 2003
Before reporting problems, check http://www.XFree86.Org/
to make sure that you have the latest version.
Module Loader present
Markers: (--) probed, (**) from config file, (==) default setting,
(++) from command line, (!!) notice, (II) informational,
(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
(==) Log file: "/var/log/XFree86.0.log", Time: Fri Apr 18 01:33:02 2003
(==) Using config file: "/etc/X11/XF86Config"



At this point, the screen turns black and the system hangs. Sometimes it still responds to pings, but I can never log in or continue an existing login session.

/var/log/messages reveals nothing:

Apr 18 01:32:35 hp su: (to root) cko on /dev/pts/1
Apr 18 01:32:35 hp su: pam_unix2: session started for user root, service su
Apr 18 01:39:41 hp syslogd 1.4.1: restart.


XFree86.0.log also contains only the above few lines I got in the console window after startx.

Loading the driver manually gives me:

# modprobe nvidia
Warning: loading /lib/modules/2.4.20-64GB-SMP/kernel/drivers/video/nvidia.o will taint the kernel: non-GPL license - NVIDIA
See http://www.tux.org/lkml/#export-tainted for information about tainted modules
Module nvidia loaded, with warnings

and does not crash the system until I enter startx.

I tried turning nvagp off. It did not help. See the attached XF86Config, which is vanilla sax2-generated.
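
For anyone repeating this step, here is a minimal sketch for confirming which NvAGP value a config file actually sets. It assumes the usual `Option "NvAGP" "<n>"` syntax in the Device section (0 disables AGP in the NVIDIA driver). The function name `nvagp_setting` is made up for illustration and is not part of any tool.

```shell
# Print the value of every Option "NvAGP" line found in an XF86Config on stdin.
# Lines commented out with a leading '#' are ignored; nothing is printed if
# the option is not set. Assumes no trailing comment after the value.
nvagp_setting() {
    grep -i '^[[:space:]]*Option[[:space:]]*"NvAGP"' |
        sed 's/.*"\([^"]*\)"[[:space:]]*$/\1/'
}

# Usage: nvagp_setting < /etc/X11/XF86Config
```

If this prints nothing, the option is not present and the driver falls back to its default AGP behaviour.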

The machine is an HP xw6000 with 2 GB RAM, two 2.8 GHz Xeon processors and a Quadro4 980 XGL, which works fine under Windows XP in the same machine.
I also tried a Quadro FX 2000, with the same result.

Here is an lspci -vv output:


00:00.0 Host bridge: Intel Corp.: Unknown device 2550 (rev 03)
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Region 0: Memory at f4000000 (32-bit, prefetchable) [size=64M]
Capabilities: [40] #09 [0104]
Capabilities: [a0] AGP version 3.0
Status: RQ=32 Iso- ArqSz=2 Cal=0 SBA+ ITACoh- GART64- HTrans- 64bit- FW+ AGP3+ Rate=x4,x8
Command: RQ=1 ArqSz=0 Cal=0 SBA- AGP- GART64- 64bit- FW- Rate=<none>

00:01.0 PCI bridge: Intel Corp.: Unknown device 2552 (rev 03) (prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64
Region 0: Memory at f8000000 (32-bit, prefetchable) [size=64M]
Bus: primary=00, secondary=01, subordinate=01, sec-latency=32
Memory behind bridge: f1000000-f21fffff
Prefetchable memory behind bridge: e8000000-f01fffff
BridgeCtl: Parity- SERR+ NoISA- VGA+ MAbort- >Reset- FastB2B-
Capabilities: [60] #0e [0035]


00:1d.0 USB Controller: Intel Corp. 82801DB USB (Hub #1) (rev 01) (prog-if 00 [UHCI])
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Interrupt: pin A routed to IRQ 16
Region 4: I/O ports at 2440 [size=32]

00:1d.1 USB Controller: Intel Corp. 82801DB USB (Hub #2) (rev 01) (prog-if 00 [UHCI])
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Interrupt: pin B routed to IRQ 19
Region 4: I/O ports at 2460 [size=32]

00:1d.2 USB Controller: Intel Corp. 82801DB USB (Hub #3) (rev 01) (prog-if 00 [UHCI])
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Interrupt: pin C routed to IRQ 18
Region 4: I/O ports at 2480 [size=32]

00:1d.7 USB Controller: Intel Corp. 82801DB USB EHCI Controller (rev 01) (prog-if 20 [EHCI])
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Interrupt: pin D routed to IRQ 23
Region 0: Memory at f0500000 (32-bit, non-prefetchable) [size=1K]
Capabilities: [50] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [58] #0a [2080]


00:1e.0 PCI bridge: Intel Corp. 82801BA/CA/DB PCI Bridge (rev 81) (prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Bus: primary=00, secondary=05, subordinate=05, sec-latency=64
I/O behind bridge: 00001000-00001fff
Memory behind bridge: f0200000-f04fffff
BridgeCtl: Parity- SERR+ NoISA+ VGA- MAbort- >Reset- FastB2B-

00:1f.0 ISA bridge: Intel Corp. 82801DB ISA Bridge (LPC) (rev 01)
Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0

00:1f.1 IDE interface: Intel Corp. 82801DB ICH4 IDE (rev 01) (prog-if 8a [Master SecP PriP])
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Interrupt: pin A routed to IRQ 18
Region 0: I/O ports at <ignored>
Region 1: I/O ports at <ignored>
Region 2: I/O ports at <ignored>
Region 3: I/O ports at <ignored>
Region 4: I/O ports at 24c0 [size=16]
Region 5: Memory at 80000000 (32-bit, non-prefetchable) [size=1K]

00:1f.5 Multimedia audio controller: Intel Corp. 82801DB AC'97 Audio (rev 01)
Subsystem: Hewlett-Packard Company: Unknown device 00c3
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0
Interrupt: pin B routed to IRQ 17
Region 0: I/O ports at 2000 [size=256]
Region 1: I/O ports at 2400 [size=64]
Region 2: Memory at f0500400 (32-bit, non-prefetchable) [size=512]
Region 3: Memory at f0500600 (32-bit, non-prefetchable) [size=256]
Capabilities: [50] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-


01:00.0 VGA compatible controller: nVidia Corporation NV28GL [Quadro4 980 XGL] (rev a1) (prog-if 00 [VGA])
Subsystem: nVidia Corporation: Unknown device 0174
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap+ 66Mhz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64 (1250ns min, 250ns max)
Interrupt: pin A routed to IRQ 16
Region 0: Memory at f1000000 (32-bit, non-prefetchable) [size=16M]
Region 1: Memory at e8000000 (32-bit, prefetchable) [size=128M]
Expansion ROM at <unassigned> [disabled] [size=128K]
Capabilities: [60] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [44] AGP version 3.0
Status: RQ=32 Iso- ArqSz=0 Cal=3 SBA+ ITACoh- GART64- HTrans- 64bit- FW+ AGP3+ Rate=x4,x8
Command: RQ=1 ArqSz=0 Cal=0 SBA- AGP- GART64- 64bit- FW- Rate=<none>

05:02.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5702 Gigabit Ethernet (rev 02)
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O- Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64 (16000ns min), cache line size 10
Interrupt: pin A routed to IRQ 21
Region 0: Memory at f0400000 (64-bit, non-prefetchable) [disabled] [size=64K]
Expansion ROM at <unassigned> [disabled] [size=64K]
Capabilities: [40] PCI-X non-bridge device.
Command: DPERE- ERO- RBC=0 OST=0
Status: Bus=0 Dev=0 Func=0 64bit- 133MHz- SCD- USC-, DC=simple, DMMRBC=0, DMOST=0, DMCRS=0, RSCEM-
Capabilities: [48] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot+,D3cold+)
Status: D0 PME-Enable- DSel=0 DScale=1 PME-
Capabilities: [50] Vital Product Data
Capabilities: [58] Message Signalled Interrupts: 64bit+ Queue=0/3 Enable-
Address: 4d00215576c75200 Data: 0909

05:0c.0 SCSI storage controller: Adaptec AIC-7902 U320 (rev 03)
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64 (10000ns min, 6250ns max), cache line size 10
Interrupt: pin A routed to IRQ 19
Region 0: I/O ports at 1000 [disabled] [size=256]
Region 1: Memory at f0410000 (64-bit, non-prefetchable) [size=8K]
Region 3: I/O ports at 1400 [disabled] [size=256]
Expansion ROM at <unassigned> [disabled] [size=512K]
Capabilities: [dc] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [a0] Message Signalled Interrupts: 64bit+ Queue=0/1 Enable-
Address: 0000000000000000 Data: 0000
Capabilities: [94]

05:0c.1 SCSI storage controller: Adaptec AIC-7902 U320 (rev 03)
Subsystem: Hewlett-Packard Company: Unknown device 00cc
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64 (10000ns min, 6250ns max), cache line size 10
Interrupt: pin B routed to IRQ 22
Region 0: I/O ports at 1800 [disabled] [size=256]
Region 1: Memory at f0412000 (64-bit, non-prefetchable) [size=8K]
Region 3: I/O ports at 1c00 [disabled] [size=256]
Expansion ROM at <unassigned> [disabled] [size=512K]
Capabilities: [dc] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [a0] Message Signalled Interrupts: 64bit+ Queue=0/1 Enable-
Address: 0000000000000000 Data: 0000
Capabilities: [94]

# ls -l /lib/modules/2.4.20-64GB-SMP/kernel/drivers/video/nvidia.o
-rw-r--r-- 1 root root 1806649 Apr 18 00:20 /lib/modules/2.4.20-64GB-SMP/kernel/drivers/video/nvidia.o
# sum /lib/modules/2.4.20-64GB-SMP/kernel/drivers/video/nvidia.o
03195 1765
# strings /lib/modules/2.4.20-64GB-SMP/kernel/drivers/video/nvidia.o|fgrep -i version
version
NVRM: client does not support versioning!!
NVRM: client supports wrong rm api version!!
NVRM version: %s
GCC version: %s
gcc version 3.2
DisableTVClockInversion
kernel_version=2.4.20-64GB-SMP
kernel_version=2.4.20-64GB-SMP
kernel_version=2.4.20-64GB-SMP
kernel_version=2.4.20-64GB-SMP
# strings /lib/modules/2.4.20-64GB-SMP/kernel/drivers/video/nvidia.o|fgrep -i 1.0
nvidia id: NVIDIA Linux x86 nvidia.o Kernel Module 1.0-4349 Thu Mar 27 19:00:02 PST 2003
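
The `strings | fgrep` check above can be narrowed to print only the driver version. This is a rough sketch that assumes the `nvidia id:` line always has the layout shown in the output above. The function name `nvidia_module_version` is hypothetical.

```shell
# Read "strings" output on stdin and print the driver version taken from the
# "nvidia id:" line, e.g. the token that follows "Kernel Module".
nvidia_module_version() {
    sed -n 's/^nvidia id: .* Kernel Module \([0-9][0-9.]*-[0-9][0-9]*\) .*/\1/p'
}

# Usage (module path as in the commands above):
# strings /lib/modules/2.4.20-64GB-SMP/kernel/drivers/video/nvidia.o | nvidia_module_version
```

Nothing is printed if no matching `nvidia id:` line is found, so an empty result means the layout assumption did not hold for that module.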

cko
04-17-03, 08:24 PM
I now built the 4191 drivers from source.

When I tried to install them, I got a strange error message, but I forced the install (see below) and it worked. :)

hp:/usr/src/packages/RPMS/i386 # rpm -ivh *
file /usr/X11R6/lib/modules/drivers/nvidia_drv.o from install of NVIDIA_GLX-1.0-4191 conflicts with file from package XFree86-server-4.3.0-15
hp:/usr/src/packages/RPMS/i386 # mv /usr/X11R6/lib/modules/drivers/nvidia_drv.o /usr/X11R6/lib/modules/drivers/nvidia_drv.o-orig
hp:/usr/src/packages/RPMS/i386 # rpm --force -ivh *
--- WARNING!!
libGL.a
--- The above file(s) possibly belong to a conflicting MESA rpm,
--- or to a previous stale NVIDIA distribution.
--- They have been renamed to xxx.<originalFile>.RPMSAVE to
--- avoid conflicting with the files contained within this
--- package.
--- Please see the FREQUENTLY ASKED QUESTIONS section of
--- /usr/share/doc/NVIDIA_GLX-1.0/README for more details.
NVIDIA_GLX ##################################################
NVIDIA_kernel ##################################################
Warning: loading /lib/modules/2.4.20-64GB-SMP/kernel/drivers/video/nvidia.o will taint the kernel: non-GPL license - NVIDIA
See http://www.tux.org/lkml/#export-tainted for information about tainted modules
Module nvidia loaded, with warnings

nvidia.o installed successfully

glxgears reports 8803 FPS,
so I guess it's up and running.

Carsten.

Andy Mecham
04-17-03, 11:03 PM
Are you using a flat panel monitor?

--andy

cko
04-18-03, 06:32 AM
Originally posted by Andy Mecham
Are you using a flat panel monitor?

--andy

Yes. The card is connected to two hp 2025 flat panel monitors via DVI.

Carsten.

Andy Mecham
04-18-03, 03:37 PM
It's a known bug in 1.0-4349 with SMP machines and DVI connections. The workarounds are to connect to your DFP with an analog cable, fall back to a UP kernel, or fall back to an earlier driver.

--andy
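
Two of the three conditions named above (SMP kernel, driver 1.0-4349) can be checked from a shell. The sketch below rests on two assumptions: that the kernel release string contains "SMP", as it does on the SuSE kernel in this thread, and that the driver version has already been read from the module. The DVI connection still has to be checked by hand. The function name `matches_known_hang` is made up for illustration.

```shell
# Return 0 when both arguments match the reported conditions:
#   $1  kernel release string (e.g. from `uname -r`), must contain "SMP"
#   $2  NVIDIA driver version, must be exactly 1.0-4349
# The DVI connection is NOT checked here.
matches_known_hang() {
    kernel_release=$1
    driver_version=$2
    case $kernel_release in
        *SMP*|*smp*) ;;      # looks like an SMP kernel, keep checking
        *) return 1 ;;       # no SMP marker in the release string
    esac
    [ "$driver_version" = "1.0-4349" ]
}

# Usage: matches_known_hang "$(uname -r)" "1.0-4349" && echo "possibly affected"
```

On kernels whose release string does not carry an SMP marker, `uname -v` may be a better input, since it normally includes "SMP" for SMP builds.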

cko
04-18-03, 04:10 PM
Thank you, Andy. That explains it.

Connecting the flat panel monitors with analog cables is not possible, because the analog inputs are connected to another machine.

Using a uniprocessor kernel on a dual processor machine does not seem perfect either. ;-)

So I guess using the 4191 driver is my only choice until the bug gets fixed. Which is fine with me. Any idea when the next driver might get released?

Thanks!

Carsten.