Old 12-28-02, 12:47 PM   #1
elaine
Registered User
 
Join Date: Dec 2002
Posts: 4
GEF 4 on kernel 2.4.19 / smp system

Ok, about ready to give up on this :-(

I have one BIOS/config diagnostic left to run down.
Unless that helps, or someone spots a known problem in
how I've installed things, this card goes back to the vendor :-(.

/proc/driver/nvid0... says I have a gef4 mx/420
which is correct. The two things that look potentially wrong are that /proc reports:
Video bios = ??.??.??.??.??
and IRQ=18 dec

but the system bios thinks auto-assigned
IRQ= 0xf (15)
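
The two IRQ numbers above are written in different bases, which is easy to misread. A minimal Python sketch (the helper name `parse_irq` is made up for illustration) that normalises both spellings to plain integers so they can be compared:

```python
def parse_irq(text: str) -> int:
    """Parse an IRQ written as decimal ("18") or prefixed hex ("0xf")."""
    # Base 0 makes int() honour a 0x prefix and otherwise read decimal.
    return int(text.strip(), 0)


proc_irq = parse_irq("18")    # value reported under /proc (decimal)
bios_irq = parse_irq("0xf")   # value shown by the system BIOS

print(proc_irq, bios_irq, proc_irq == bios_irq)
```

The mismatch is not necessarily a fault in itself: as far as I know, IRQ numbers above 15 are only reachable through an IO-APIC, so a BIOS screen showing a legacy assignment and a kernel showing an IO-APIC one can legitimately disagree.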

config:

Base installation is rh 7.1

kernel 2.4.19 on a netfinity smp box w/ hardware
raid, using NV driver/kernel module 1.0-4191

gcc 3.2, kernel, and the NV glx and nvidia.o
kernel driver have all been installed with this
build environment.

XF 4.2 has been recompiled w/ gcc 3.2

Behavior on 'startx':

hung system, network stops, no kernel Oops,
no Alt-SysRq diagnostics.

Issues already resolved include a conflict with
my 2nd SCSI bus, which I need to rectify
long term; for the moment that SCSI controller is
disabled.

I can try rebuilding *everything* with gcc 2.95.3
(tho I'd rather not) if anyone can tell me there are
known issues with gcc 3.2?
Old 12-29-02, 06:40 AM   #2
Wolfman [TWP]
Geforce 8800 GTS 512
 
Join Date: Nov 2002
Location: Australia
Posts: 396

Does that system use an APIC controller?? (I'm sure that it most probably would)

I'm running a dual-processor system and my GF4's IRQ is 18 (hex). Check your dmesg output to see if the APIC is initialized correctly.

Otherwise there may be a motherboard or BIOS problem (or maybe a setting in the BIOS that has been overlooked), as I've encountered similar problems with other dual-CPU mobos (MSI). That was mainly with Athlons; I've had no hands-on experience with dual Intel systems.

GCC 3.2 runs ok on my system (Tyan MPX Mobo)
Old 12-29-02, 03:56 PM   #3
crimsun
Registered User
 
Join Date: Aug 2002
Posts: 43

Maybe you need to boot with the "noapic" LILO/GRUB append?
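
After adding the option it is worth confirming that the kernel actually booted with it. A minimal Python sketch, assuming the command line is available as a string (on a live system it can be read from `/proc/cmdline`); the helper name and the example command line are hypothetical:

```python
def has_boot_option(cmdline: str, option: str) -> bool:
    """Return True if `option` appears as a whole word on the kernel command line."""
    return option in cmdline.split()


# Hypothetical command line, for illustration only.
example = "ro root=/dev/sda1 noapic"
print(has_boot_option(example, "noapic"))
```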
Old 01-02-03, 02:13 PM   #4
elaine
Registered User
 
Join Date: Dec 2002
Posts: 4

Quote:
Originally posted by crimsun
Maybe you need to boot with the "noapic" LILO/GRUB append?

Tried, same (crash) result.

I've also tried compiling the kernel, the nvidia & glx libs,
and X11 entirely with gcc 2.9x and entirely with gcc 3.2,
all with the same result.

I was hopeful that either that or

echo "00000001" > /proc/irq/18/smp_affinity

would solve the issue but it looks like no-go
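
For reference, the value written to `smp_affinity` is a hexadecimal bitmask with one bit per CPU. A minimal Python sketch (the helper name `cpus_in_mask` is made up for illustration) that decodes such a mask into CPU numbers:

```python
from typing import List


def cpus_in_mask(mask_hex: str) -> List[int]:
    """Decode an smp_affinity hex bitmask into a list of CPU numbers."""
    # Some kernels print comma-separated 32-bit groups; drop the commas.
    mask = int(mask_hex.replace(",", ""), 16)
    return [cpu for cpu in range(mask.bit_length()) if mask & (1 << cpu)]


print(cpus_in_mask("00000001"))  # the mask written above: CPU 0 only
print(cpus_in_mask("00000003"))  # both CPUs of a dual-processor box
```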

So I think it's time to send this card back, far
more effort than it's worth already with no
usable results :-(
Old 01-02-03, 02:27 PM   #5
elaine
Registered User
 
Join Date: Dec 2002
Posts: 4

Quote:
posted by Wolfman [TWP]
Does that system use an APIC controller?? (I'm sure that it most probably would)


Yes it does. As I said to crimsun, I tried noapic
as well as specifically directing that IRQ (18, same as
yours) to just one CPU; no go / no luck.

Pertinent parts of dmesg; as near as I can see, the APIC is working:


ENABLING IO-APIC IRQs
Setting 14 in the phys_id_present_map
...changing IO-APIC physical APIC ID to 14 ... ok.
BIOS bug, IO-APIC#1 ID is 15 in the MPC table!...
... fixing up to 15. (tell your hw vendor)
Setting 15 in the phys_id_present_map
...changing IO-APIC physical APIC ID to 15 ... ok.
init IO_APIC IRQs
IO-APIC (apicid-pin) 14-0, 14-5, 15-1, 15-3, 15-12, 15-13, 15-14, 15-15 not connected.
..TIMER: vector=0x31 pin1=2 pin2=0
..MP-BIOS bug: 8254 timer not connected to IO-APIC
...trying to set up timer (IRQ0) through the 8259A ...
..... (found pin 0) ...works.
number of MP IRQ sources: 31.
number of IO-APIC #14 registers: 16.
number of IO-APIC #15 registers: 16.
testing the IO APIC.......................

IO APIC #14......
.... register #00: 0E000000
....... : physical APIC id: 0E
.... register #01: 000F0011
....... : max redirection entries: 000F
....... : PRQ implemented: 0
....... : IO APIC version: 0011
.... register #02: 00000000
....... : arbitration: 00
.... IRQ redirection table:
NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:
00 003 03 0 0 0 0 0 1 1 31
01 003 03 0 0 0 0 0 1 1 39
02 000 00 1 0 0 0 0 0 0 00
03 003 03 0 0 0 0 0 1 1 41
04 003 03 0 0 0 0 0 1 1 49
05 000 00 1 0 0 0 0 0 0 00
06 003 03 0 0 0 0 0 1 1 51
07 003 03 0 0 0 0 0 1 1 59
08 003 03 0 0 0 0 0 1 1 61
09 003 03 1 1 0 1 0 1 1 69
0a 003 03 0 0 0 0 0 1 1 71
0b 003 03 0 0 0 0 0 1 1 79
0c 003 03 0 0 0 0 0 1 1 81
0d 003 03 0 0 0 0 0 1 1 89
0e 003 03 0 0 0 0 0 1 1 91
0f 003 03 0 0 0 0 0 1 1 9

IO APIC #15......
.... register #00: 0F000000
....... : physical APIC id: 0F
.... register #01: 000F0011
....... : max redirection entries: 000F
....... : PRQ implemented: 0
....... : IO APIC version: 0011
.... register #02: 0A000000
....... : arbitration: 0A
.... IRQ redirection table:
NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:
00 003 03 1 1 0 1 0 1 1 A1
01 000 00 1 0 0 0 0 0 0 00
02 003 03 1 1 0 1 0 1 1 A9
03 000 00 1 0 0 0 0 0 0 00
04 003 03 1 1 0 1 0 1 1 B1
05 003 03 1 1 0 1 0 1 1 B9
06 003 03 1 1 0 1 0 1 1 C1
07 003 03 1 1 0 1 0 1 1 C9
08 003 03 1 1 0 1 0 1 1 D1
09 003 03 1 1 0 1 0 1 1 D9
0a 003 03 1 1 0 1 0 1 1 E1
0b 003 03 1 1 0 1 0 1 1 E9
0c 000 00 1 0 0 0 0 0 0 00
0d 000 00 1 0 0 0 0 0 0 00
0e 000 00 1 0 0 0 0 0 0 00
0f 000 00 1 0 0 0 0 0 0 00
...


PCI->APIC IRQ transform: (B0,I2,P0) -> 27
PCI->APIC IRQ transform: (B0,I9,P0) -> 16
PCI->APIC IRQ transform: (B0,I10,P0) -> 18
PCI->APIC IRQ transform: (B0,I15,P0) -> 9
PCI->APIC IRQ transform: (B1,I5,P0) -> 20
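
To see which device ended up on which IRQ, the transform lines can be parsed mechanically. A minimal Python sketch using only the five lines quoted above; reading `B`, `I` and `P` as bus, device (slot) and interrupt pin is my interpretation of the kernel message, and the helper name `irq_map` is made up for illustration:

```python
import re

DMESG = """\
PCI->APIC IRQ transform: (B0,I2,P0) -> 27
PCI->APIC IRQ transform: (B0,I9,P0) -> 16
PCI->APIC IRQ transform: (B0,I10,P0) -> 18
PCI->APIC IRQ transform: (B0,I15,P0) -> 9
PCI->APIC IRQ transform: (B1,I5,P0) -> 20
"""

PATTERN = re.compile(
    r"PCI->APIC IRQ transform: \(B(\d+),I(\d+),P(\d+)\) -> (\d+)"
)


def irq_map(text):
    """Map each IRQ number to the list of (bus, device, pin) tuples using it."""
    result = {}
    for match in PATTERN.finditer(text):
        bus, device, pin, irq = map(int, match.groups())
        result.setdefault(irq, []).append((bus, device, pin))
    return result


mapping = irq_map(DMESG)
print(mapping.get(18))  # the device(s) routed to IRQ 18
shared = {irq: devs for irq, devs in mapping.items() if len(devs) > 1}
print(shared)           # IRQs used by more than one listed device
```

In this excerpt only B0,I10,P0 maps to IRQ 18 and no IRQ is shared, but the excerpt is trimmed, so that says nothing about devices that were snipped out.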




snip


Else there maybe a mother board or BIOS problem. (or maybe a setting in the BIOS that has been overlooked) As I've encountered similar problems

GCC 3.2 runs ok on my system (Tyan MPX Mobo)
As near as I can tell the BIOS is all OK; I tried virtually
every relevant variation on the PCI settings with no
visible effect on behavior.
Old 01-04-03, 11:03 PM   #6
Wolfman [TWP]
Geforce 8800 GTS 512
 
Join Date: Nov 2002
Location: Australia
Posts: 396

Does the BIOS have any way of disabling the APIC?? If so, try that. From looking at your dmesg output, it looks like either a hardware or a BIOS problem with the APIC. I don't get the 'BIOS bug, IO-APIC#1 ID is 15 in the MPC table!...
... fixing up to 15. (tell your hw vendor)
Setting 15 in the phys_id_present_map' messages with my board, so there may be a BIOS bug somewhere. And since it involves IRQs, incorrect settings can cause system lockups and/or hangs.

Oh, and does the problem only happen with the SMP kernel??? Have you tried the standard single-processor kernel?? I found that my dual-CPU MSI boards don't hang at all when using the single-CPU kernel, so there may be code that ignores the dual-CPU stuff... It's worth a shot...
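
When comparing boards like this, it helps to pull just the bug reports out of a long dmesg dump. A minimal Python sketch; the sample lines are copied from the dmesg excerpt posted earlier in the thread, and the helper name `bios_bug_lines` is made up for illustration:

```python
def bios_bug_lines(dmesg_text):
    """Return the dmesg lines that mention a BIOS bug."""
    return [
        line.strip()
        for line in dmesg_text.splitlines()
        if "BIOS bug" in line
    ]


SAMPLE = """\
ENABLING IO-APIC IRQs
BIOS bug, IO-APIC#1 ID is 15 in the MPC table!...
... fixing up to 15. (tell your hw vendor)
..MP-BIOS bug: 8254 timer not connected to IO-APIC
...trying to set up timer (IRQ0) through the 8259A ...
"""

for line in bios_bug_lines(SAMPLE):
    print(line)
```

On a live system the text would come from the output of `dmesg` rather than a pasted string.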
Old 01-05-03, 11:43 AM   #7
elaine
Registered User
 
Join Date: Dec 2002
Posts: 4

>> Does the bios have any way of disabling the APIC??

No, it doesn't; however, disabling it in the kernel worked
OK to all appearances (everything matched the POST messages).

I know it reports a 'bug', but all the APIC mappings are
correct, and in any case with the APIC disabled everything
else ran OK but the failure continued.

>> Setting 15 in the phys_id_present_map

I don't think that's an error, I think it's just the
second (hotswap) pci bus.

>> Oh, and does that problem only happen with the SMP kernel???

No, same fault with a non-smp kernel.

Anyhow the card has been returned, I installed
an ATI radeon 7500 which ran X fine with only
minor hacking.

The nvidia wasn't showing any signs of being
willing to play nice with my SCSI; there was some
conflict there, so it looked like a no-go in the long
run anyhow. Perhaps a different nvidia vendor would
do better; this was eVGA, which happened to be the only
PCI-card nvidia locally available.

Thanks for the help/feedback. Maybe if I someday
get an AGP-based station I'll have better luck with
nvidia.