Go Back   nV News Forums > Linux Support Forums > NVIDIA Linux

Newegg Daily Deals

Reply
 
Thread Tools
Old 01-25-06, 11:45 PM   #1
gnychis
Registered User
 
Join Date: Jan 2006
Posts: 26
Default X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

Hi,

Myself and about 10 others on the Gentoo forums are encountering a problem with kernel 2.6.15-r1 and nvidia-kernel-1.0.8178-r3.

Whenever we start X, we get a blank screen and the computer becomes unresponsive. A fellow forum member narrowed the problem down to the nvidia driver because when we use the nvidia driver, we get the lockup, however when we use agpgart, we do not get a lockup.

More information can be found in our post:
http://forums.gentoo.org/viewtopic-t...c-start-0.html

Thank you for your help and development,
George
gnychis is offline   Reply With Quote
Old 02-16-06, 01:21 PM   #2
ewiget
Registered User
 
Join Date: Feb 2006
Location: maysville, ky
Posts: 21
Send a message via ICQ to ewiget Send a message via AIM to ewiget Send a message via MSN to ewiget Send a message via Yahoo to ewiget
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

more threads, same/similar problem regarding kernel-2.6.15 and also 2.6.14-r5 now on my system

http://forums.gentoo.org/viewtopic-p...2.html#3116582
http://forums.gentoo.org//viewtopic-...ageattr+c.html
http://forums.gentoo.org//viewtopic-...ageattr+c.html
http://forums.gentoo.org//viewtopic-...ageattr+c.html

Something I noticed, on my system when it locks up, you get "Kernel Bug at arch/386/mm/pageattr.c:137!" error - many others are reporting same.

A little info about my system:
Code:
Portage 2.0.54 (default-linux/x86/2005.0, gcc-3.4.4, glibc-2.3.5-r2, 2.6.14-gentoo-r5 i686)
=================================================================
System uname: 2.6.14-gentoo-r5 i686 Intel(R) Pentium(R) 4 Mobile CPU 1.70GHz
Gentoo Base System version 1.6.14
dev-lang/python:     2.3.5-r2, 2.4.2
sys-apps/sandbox:    1.2.12
sys-devel/autoconf:  2.13, 2.59-r6
sys-devel/automake:  1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1
sys-devel/binutils:  2.16.1
sys-devel/libtool:   1.5.22
virtual/os-headers:  2.6.11-r3
ACCEPT_KEYWORDS="x86"
AUTOCLEAN="yes"
CBUILD="i686-pc-linux-gnu"
CFLAGS="-O2 -march=pentium4 -fomit-frame-pointer"
CHOST="i686-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3.5/env /usr/kde/3.5/share/config /usr/kde/3.5/shutdown /usr/kde/3/share/config /usr/lib/X11/xkb /usr/lib/mozilla/defaults/pref /usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/splash /etc/terminfo /etc/env.d"
CXXFLAGS="-O2 -march=pentium4 -fomit-frame-pointer"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoconfig distlocks sandbox sfperms strict"
GENTOO_MIRRORS="http://gentoo.chem.wisc.edu/gentoo/ http://open-systems.ufl.edu/mirrors/gentoo http://modzer0.cs.uaf.edu/public/gentoo/ http://gentoo.arcticnetwork.ca/ http://gentoo.cs.lewisu.edu/gentoo/"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY="/usr/local/portage"
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="x86 X a52 aac acpi aim alsa apache2 apm arts audiofile avi bash-completion bashlogger berkdb bigger-fonts bitmap-fonts bonobo bzip2 cardbus cdda cddb cdio cdparanoia cdr clamd crypt css cups curl dbus divx4linux doc dv dvd dvdr dvdread eds emboss encode esd ethereal evo exif expat fam fame fbcon ffmpeg flac flash foomaticdb fortran freetype ftp gb gd gdbm ggi gif gimp gimpprint gkrellm glut gmp gnokii gnome gnome-print gphoto2 gpm gstreamer gtk gtk2 gtkhtml guile icq id3 idn ieee1394 imagemagick imlib ipv6 irda irmc jack java jpeg junit kcal kde kdeenablefinal kdepim kdgraphics lcms libclamav libg++ libwww lm_sensors logitech-mouse logrotate logwatch lua lzo mad mdb mhash mikmod ming mjpeg mng motif mozilla mp3 mpeg mplayer msn mysql ncurses nls ntlm offensive ogg oggvorbis on-the-fly-crypt openal opengl oscar oss pam pcmcia pcre pda pdflib perl php pic png pnp postgres python qt quicktime rar rdesktop readline real recode rss samba scanner screen sdk sdl sftplogging silverxp slang speedo spell sql sse ssl stencil-buffer subject-rewrite subtitles svg svga tcltk tcpd tidy tiff transcode truetype truetype-fonts type1 type1-fonts udev unicode usb uudeview v4l v4l2 vcd vcdimager vim-with-x virus-scan vorbis wifi win32codecs wmf xine xml xml2 xmms xscreensaver xv xvid xvmc yahoo zlib userland_GNU kernel_linux elibc_glibc"
Unset:  ASFLAGS, CTARGET, LANG, LC_ALL, LDFLAGS, LINGUAS, MAKEOPTS
I have tried all of these nvidia:
Gentoo emerge - nvidia-kernel-1.0.6629
Gentoo emerge - nvidia-glx for same version as above
Gentoo emerge - nvidia-kernel-1.0-8178-r3
Gentoo emerge - nvidia-glx for same version as above
Downloaded NVIDIA-Linux-x86-1.0-8178-pkg1.run from Nvidia web site and removing the emerged versions above, no difference

lspci:
Code:
00:00.0 Host bridge: Intel Corporation 82845 845 (Brookdale) Chipset Host Bridge (rev 04)
00:01.0 PCI bridge: Intel Corporation 82845 845 (Brookdale) Chipset AGP Bridge (rev 04)
00:1d.0 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #1) (rev 02)
00:1d.1 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #2) (rev 02)
00:1d.2 USB Controller: Intel Corporation 82801CA/CAM USB (Hub #3) (rev 02)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev 42)
00:1f.0 ISA bridge: Intel Corporation 82801CAM ISA Bridge (LPC) (rev 02)
00:1f.1 IDE interface: Intel Corporation 82801CAM IDE U100 (rev 02)
00:1f.5 Multimedia audio controller: Intel Corporation 82801CA/CAM AC'97 Audio Controller (rev 02)
00:1f.6 Modem: Intel Corporation 82801CA/CAM AC'97 Modem Controller (rev 02)
01:00.0 VGA compatible controller: nVidia Corporation NV17 [GeForce4 420 Go] (rev a3)
02:08.0 Ethernet controller: Intel Corporation 82801CAM (ICH3) PRO/100 VE (LOM) Ethernet Controller (rev 42)
02:0b.0 CardBus bridge: Toshiba America Info Systems ToPIC100 PCI to Cardbus Bridge with ZV Support (rev 32)
02:0b.1 CardBus bridge: Toshiba America Info Systems ToPIC100 PCI to Cardbus Bridge with ZV Support (rev 32)
02:0d.0 System peripheral: Toshiba America Info Systems SD TypA Controller (rev 03)
On a side note, nvidia-kernel and nvidia-glx 1.0-6629 was working fine with gentoo-sources-2.6.14-r5 compiled under gcc-3.3 but after I upgraded to gcc-3.4 and reinstalled nvidia-kernel-1.0.6629 and the matching glx, it also hard locks now too - the gcc upgrade and the kernel upgrade to 2.6.15-gentoo-r1 were in tandom - first the gcc upgrade and emerge -e world twice, then recompiled and upgraded the kernel to match the gcc version of the system.

Maybe a problem related to the gcc version? Would take me 4+ days to go back to gcc-3.3 to test this theory......
ewiget is offline   Reply With Quote
Old 02-16-06, 01:47 PM   #3
JaXXoN
Registered User
 
Join Date: Jul 2005
Location: Munich
Posts: 910
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

Hi!

I recently had also quite some problems after a kernel update!

I also suspected the nvidia driver to be the root cause, but
after some experimentation, i came to the conclusion that there
has been some regressions in the kernel ACPI code - or maybe the
kernel is now acting correct on broken ACPI BIOS structures.

Anyway, the kernel boot option "acpi=off" fixed it for me, but please
note that you might put your machine at risk if applying this option:
http://www.nvnews.net/vbulletin/show...06&postcount=7
(demage is highly unlikely, but you have been warned).

regards

Bernhard
JaXXoN is offline   Reply With Quote
Old 02-16-06, 02:19 PM   #4
ewiget
Registered User
 
Join Date: Feb 2006
Location: maysville, ky
Posts: 21
Send a message via ICQ to ewiget Send a message via AIM to ewiget Send a message via MSN to ewiget Send a message via Yahoo to ewiget
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

Quote:
Originally Posted by JaXXoN
Hi!

I recently had also quite some problems after a kernel update!

I also suspected the nvidia driver to be the root cause, but
after some experimentation, i came to the conclusion that there
has been some regressions in the kernel ACPI code - or maybe the
kernel is now acting correct on broken ACPI BIOS structures.

Anyway, the kernel boot option "acpi=off" fixed it for me, but please
note that you might put your machine at risk if applying this option:
http://www.nvnews.net/vbulletin/show...06&postcount=7
(demage is highly unlikely, but you have been warned).

regards

Bernhard
I will try this in just in a few minutes, in the meantime...Iam also following the nvidia bug report guide and adding the necessary reports for both kernel versions. I will update if the acpi-off works
ewiget is offline   Reply With Quote
Old 02-16-06, 02:27 PM   #5
ewiget
Registered User
 
Join Date: Feb 2006
Location: maysville, ky
Posts: 21
Send a message via ICQ to ewiget Send a message via AIM to ewiget Send a message via MSN to ewiget Send a message via Yahoo to ewiget
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

Because nobody else had included the nvidia-bug-reports, here are with both kernels and also an strace on 2.6.15

I had to zip the one from kernel-2.6.14-gentoo-r1 because it was 106kb (over the max size for uploads)

I am also adding a strace of startx. One thing to note, after the system hard locked, I attempted ctrl + alt + F1 - F4 so the last few lines of strace may show those inputs. The strace file is from kernel 2.6.15-gentoo-r1 with nvidia-kernel from the nvidia web site downloads - NVIDIA-Linux-x86-1.0-8178-pkg1.run
Attached Files
File Type: log nvidia-bug-report-2.6.15-gentoo-r1.log (86.7 KB, 279 views)
File Type: txt nvidia-strace.txt (19.8 KB, 402 views)
File Type: zip nvidia-bug-report-2.6.14-gentoo-r5.log.ZIP (22.9 KB, 125 views)
ewiget is offline   Reply With Quote
Old 02-16-06, 03:00 PM   #6
ewiget
Registered User
 
Join Date: Feb 2006
Location: maysville, ky
Posts: 21
Send a message via ICQ to ewiget Send a message via AIM to ewiget Send a message via MSN to ewiget Send a message via Yahoo to ewiget
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

Quote:
Originally Posted by JaXXoN
Hi!

I recently had also quite some problems after a kernel update!

I also suspected the nvidia driver to be the root cause, but
after some experimentation, i came to the conclusion that there
has been some regressions in the kernel ACPI code - or maybe the
kernel is now acting correct on broken ACPI BIOS structures.

Anyway, the kernel boot option "acpi=off" fixed it for me, but please
note that you might put your machine at risk if applying this option:
http://www.nvnews.net/vbulletin/show...06&postcount=7
(demage is highly unlikely, but you have been warned).

regards

Bernhard

I tried acpi=off for both kernel versions without any changes after reinstalling version 8178 from the nvidia web site and resetting driver to nvidia in xorg.conf (I am currently using nv until this is resolved)
ewiget is offline   Reply With Quote
Old 02-16-06, 07:05 PM   #7
netllama
NVIDIA Corporation
 
Join Date: Dec 2004
Posts: 8,763
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

Ed,
I have a few questions:
0) Does problem persist without a PREEMPT enabled kernel?
1) Have you applied these patches?
http://www.nvnews.net/vbulletin/showthread.php?t=62021
2) Does this problem persist without a vesafb ?
3) Does this problem persist without RenderAccel ?
4) Are you using the latest BIOS ?

Thanks,
Lonni
netllama is offline   Reply With Quote
Old 02-17-06, 01:13 AM   #8
ewiget
Registered User
 
Join Date: Feb 2006
Location: maysville, ky
Posts: 21
Send a message via ICQ to ewiget Send a message via AIM to ewiget Send a message via MSN to ewiget Send a message via Yahoo to ewiget
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

Quote:
Originally Posted by netllama
Ed,
I have a few questions:
0) Does problem persist without a PREEMPT enabled kernel?
1) Have you applied these patches?
http://www.nvnews.net/vbulletin/showthread.php?t=62021
2) Does this problem persist without a vesafb ?
3) Does this problem persist without RenderAccel ?
4) Are you using the latest BIOS ?

Thanks,
Lonni
I have stripped a lot out of the kernel since my last post - pretty much anything not related to this specific hardware and a lot of modules that I never used anyways, probably recompiled about 6 different times now.

I enabled agpgart per someone elses post, and now on init 3 reboot I have the nvidia module listed in lsmod - but still hard locks whenever I attempt to startx.

My next course of action prior to reading this post was to start removing one item at a time from the kernel and see what happens....but I will try your suggestions first - maybe not in that order. Probably the simplest to try from your suggestions is to disable renderaccel which I have in my xorg.conf, and then I will try the rest and update more.
ewiget is offline   Reply With Quote

Old 02-17-06, 10:03 AM   #9
ewiget
Registered User
 
Join Date: Feb 2006
Location: maysville, ky
Posts: 21
Send a message via ICQ to ewiget Send a message via AIM to ewiget Send a message via MSN to ewiget Send a message via Yahoo to ewiget
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

Quote:
Originally Posted by netllama
Ed,
I have a few questions:
0) Does problem persist without a PREEMPT enabled kernel?
1) Have you applied these patches?
http://www.nvnews.net/vbulletin/showthread.php?t=62021
2) Does this problem persist without a vesafb ?
3) Does this problem persist without RenderAccel ?
4) Are you using the latest BIOS ?

Thanks,
Lonni
The problem does still exist without a preempt enabled kernel (I posted a link to the kernel config files below).

I am currently using the gentoo nvidia-kernel and nvidia-glx which has these patches applied (but I have also previous tried the nvidia modules straight from the web site without any change, although without the patches you mentioned):
NVIDIA_kernel-1.0-8178-1444349
NVIDIA_kernel-1.0-8178-U011106
NVIDIA_kernel-1.0-8178-U012206
NVIDIA_kernel-1.0-8178-U122205

The problem also existed without vesafb (which was simply enabled for bootsplash), and it existed with RenderAccel commented out of the xorg.conf

Once I tried your suggestions, and after stripping as much out of my kernel configuration as possible, I even tried emerging gentoo-sources-2.6.15-r5 and used my old kernel config from 2.6.14-gentoo-r5 that used to work with nvidia-1.0-6629 and all previous kernels (Im an emerge junkie - always trying new things because I teach linux) after make oldconfig and recompiling - same deal - hard locks.

For those interested, here is everything about this particular system, including kernel .config files at my personal web site - http://www.edwiget.name/content/category/4/16/26/ (indexed) or my grub.conf file http://www.edwiget.name/content/view/43/26/1/1/ or my kernel .config file for 2.6.14-r5 http://www.edwiget.name/files/kernel...-gentoo-r5.txt

Just for comparison, to show you what I removed, here is the one from kernel-2.6.15-r1 http://www.edwiget.name/files/kernel...est-config.txt

As far as bios upgrades go, toshiba hasn't released a bios for this laptop since 2003/2004 and I believe it is current (1.90 listed on their web site but I keep forgetting to check)

What is really strange is that nvidia modules 1.0.6629 with patches applied worked with the old kernel 2.6.14-gentoo-r5 and the system compiled under gcc-3.3.6 but after upgrading to gcc-3.4.4 and recompiling the entire system twice to fix incompatibilities between the gcc versions, and also recompiling the old kernel 2.6.14-gentoo-r5 to match the system...the older nvidia module no longer works

Are there any known issues with gcc-3.4.4 and nvidia module?? That is really the only common thing I can think of that is left....as the problem started after upgrading to gcc-3.4.4 Only problem is, it would take me 4 days to recompile everything back to gcc-3.3.6 to verify this.

Now I have tried 3 different kernels (2.6.14-gentoo-r5, 2.6.15-gentoo-r1, 2.6.15-gentoo-r5), with .config files that always worked previously (for 4 years I have ran linux on this laptop and use it daily) and 3 different versions of nvidia(6629, 7676, and 8178), and even tried stripping everything out of the kernel not needed......

There is one thing I may be able to try later today - I will be at a clients location and they have multiple linux servers....I will attempt to plug my laptop into their network, find its ip address, get it to hard lock by starting x, and then hopefully be able to ssh into it and get a new bug report. I will update later today.
ewiget is offline   Reply With Quote
Old 02-17-06, 11:34 AM   #10
ewiget
Registered User
 
Join Date: Feb 2006
Location: maysville, ky
Posts: 21
Send a message via ICQ to ewiget Send a message via AIM to ewiget Send a message via MSN to ewiget Send a message via Yahoo to ewiget
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

trying to ssh from another machine wont work. I was already logged in via ssh, su - to root, already had the nvidia-bug-report.sh command typed, but the minute I started x on the laptop, the ssh session was unresponsive
ewiget is offline   Reply With Quote
Old 02-17-06, 12:06 PM   #11
120
Registered User
 
Join Date: Dec 2005
Posts: 14
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

I do also have this lock up and did notice that for some reason X seems to crash when claiming the resource ranges or just after maybe (the log stops after ressource range 14). I do join the two different logs in case. There're those things with the adress ranges which are different but I'm not qualified enough to know if it's harmful or not.
Attached Files
File Type: log Xorg.2.6.14.log (28.2 KB, 119 views)
File Type: log Xorg.2.6.15.log (14.3 KB, 122 views)
120 is offline   Reply With Quote
Old 02-17-06, 02:03 PM   #12
netllama
NVIDIA Corporation
 
Join Date: Dec 2004
Posts: 8,763
Default Re: X locks up with nvidia-kernel-1.0.8178-r3 and 2.6.15-r1 kernel

ewiget,
If this is a Toshiba laptop, others have run into this problem back in December. You'll need to set the NVreg_Mobile nvidia kernel module perameter to a specific value. You should search for the old threads that have discussed this for the potential workarounds.

Thanks,
Lonni
netllama is offline   Reply With Quote
Reply


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -5. The time now is 06:51 PM.


Powered by vBulletin® Version 3.7.1
Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Copyright 1998 - 2014, nV News.