PDA

View Full Version : [solved] nvidia drivers crash system



doctordruidphd
Mar 30th 2009, 03:43 PM
This is a 64-bit system with GFORCE-9800GT-SLI.
When I installed the system from the liveCD, I get video. The fans on the cards run full time.

I installed EnvyNG, and set up the 180 series driver it recommends, following the instructions found in dibl's FAQ, #7. All appears to go well, until I get to the 'startx' part. For some reason, it starts the Gnome display (default is kdm), which freezes (the fans do shut up). On reboot, the system beeper goes wild, and it drops to the console login (which I can do). If I try to startx again, it prints out a bunch of stuff, including a message, "No screens found."
I went through all of the options in Envyng; the 177 driver gives the same result, and the others just plain crash.
Removing the nvidia drivers restores me to a graphical session, including fans at full speed again.
Well, any ideas what to try now?
Thanks.

Followup:
1. I tried the NVIDIA-Linux-x86_64-180.44-pkg2.run available on the nvidia website. Big mistake -- I wound up having to wipe & reinstall to get x back at all.

2. Here is the output of /var/log/Xorg.0.log:





X.Org X Server 1.6.0
Release Date: 2009-2-25
X Protocol Version 11, Revision 0
Build Operating System: Linux 2.6.24-15-server x86_64 Ubuntu
Current Operating System: Linux wolfenstein 2.6.28-11-generic #38-Ubuntu SMP Fri Mar 27 10:01:17 UTC 2009 x86_64
Build Date: 28 March 2009 03:12:17AM
xorg-server 2:1.6.0-0ubuntu6 (buildd@crested.buildd)
Before reporting problems, check http://wiki.x.org
to make sure that you have the latest version.
Markers: (--) probed, (**) from config file, (==) default setting,
(++) from command line, (!!) notice, (II) informational,
(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
(==) Log file: "/var/log/Xorg.0.log", Time: Mon Mar 30 09:21:28 2009
(==) Using config file: "/etc/X11/xorg.conf"
(==) ServerLayout "Layout0"
(**) |-->Screen "Screen0" (0)
(**) | |-->Monitor "Monitor0"
(**) | |-->Device "Device0"
(**) |-->Input Device "Keyboard0"
(**) |-->Input Device "Mouse0"
(==) Automatically adding devices
(==) Automatically enabling devices
(WW) The directory "/usr/share/fonts/X11/cyrillic" does not exist.
Entry deleted from font path.
(==) FontPath set to:
/usr/share/fonts/X11/misc,
/usr/share/fonts/X11/100dpi/:unscaled,
/usr/share/fonts/X11/75dpi/:unscaled,
/usr/share/fonts/X11/Type1,
/usr/share/fonts/X11/100dpi,
/usr/share/fonts/X11/75dpi,
/var/lib/defoma/x-ttcidfont-conf.d/dirs/TrueType,
built-ins
(==) ModulePath set to "/usr/lib/xorg/modules"
(WW) AllowEmptyInput is on, devices using drivers 'kbd', 'mouse' or 'vmmouse' will be disabled.
(WW) Disabling Keyboard0
(WW) Disabling Mouse0
(II) Loader magic: 0xb40
(II) Module ABI versions:
X.Org ANSI C Emulation: 0.4
X.Org Video Driver: 5.0
X.Org XInput driver : 4.0
X.Org Server Extension : 2.0
(II) Loader running on linux
(--) using VT number 7

(!!) More than one possible primary device found
(--) PCI: (0@1:0:0) nVidia Corporation GeForce 9800 GT rev 162, Mem @ 0xe8000000/16777216, 0xc0000000/268435456, 0xe6000000/33554432, I/O @ 0x0000ef00/128, BIOS @ 0x????????/131072
(--) PCI: (0@2:0:0) nVidia Corporation GeForce 9800 GT rev 162, Mem @ 0xec000000/16777216, 0xb0000000/268435456, 0xea000000/33554432, I/O @ 0x0000cf00/128, BIOS @ 0x????????/131072
(II) Open ACPI successful (/var/run/acpid.socket)
(II) System resource ranges:
[0] -1 0 0xffffffff - 0xffffffff (0x1) MX[B]
[1] -1 0 0x000f0000 - 0x000fffff (0x10000) MX[B]
[2] -1 0 0x000c0000 - 0x000effff (0x30000) MX[B]
[3] -1 0 0x00000000 - 0x0009ffff (0xa0000) MX[B]
[4] -1 0 0x0000ffff - 0x0000ffff (0x1) IX[B]
[5] -1 0 0x00000000 - 0x00000000 (0x1) IX[B]
(II) "extmod" will be loaded. This was enabled by default and also specified in the config file.
(II) "dbe" will be loaded. This was enabled by default and also specified in the config file.
(II) "glx" will be loaded. This was enabled by default and also specified in the config file.
(II) "record" will be loaded by default.
(II) "dri" will be loaded by default.
(II) "dri2" will be loaded by default.
(II) LoadModule: "dbe"
(II) Loading /usr/lib/xorg/modules/extensions//libdbe.so
(II) Module dbe: vendor="X.Org Foundation"
compiled for 1.6.0, module version = 1.0.0
Module class: X.Org Server Extension
ABI class: X.Org Server Extension, version 2.0
(II) Loading extension DOUBLE-BUFFER
(II) LoadModule: "extmod"
(II) Loading /usr/lib/xorg/modules/extensions//libextmod.so
(II) Module extmod: vendor="X.Org Foundation"
compiled for 1.6.0, module version = 1.0.0
Module class: X.Org Server Extension
ABI class: X.Org Server Extension, version 2.0
(II) Loading extension MIT-SCREEN-SAVER
(II) Loading extension XFree86-VidModeExtension
(II) Loading extension XFree86-DGA
(II) Loading extension DPMS
(II) Loading extension XVideo
(II) Loading extension XVideo-MotionCompensation
(II) Loading extension X-Resource
(II) LoadModule: "type1"
(WW) Warning, couldn't open module type1
(II) UnloadModule: "type1"
(EE) Failed to load module "type1" (module does not exist, 0)
(II) LoadModule: "freetype"
(WW) Warning, couldn't open module freetype
(II) UnloadModule: "freetype"
(EE) Failed to load module "freetype" (module does not exist, 0)
(II) LoadModule: "glx"
(II) Loading /usr/lib/xorg/modules/extensions//libglx.so
(II) Module glx: vendor="NVIDIA Corporation"
compiled for 4.0.2, module version = 1.0.0
Module class: X.Org Server Extension
(II) NVIDIA GLX Module 180.44 Tue Mar 24 06:11:47 PST 2009
(II) Loading extension GLX
(II) LoadModule: "record"
(II) Loading /usr/lib/xorg/modules/extensions//librecord.so
(II) Module record: vendor="X.Org Foundation"
compiled for 1.6.0, module version = 1.13.0
Module class: X.Org Server Extension
ABI class: X.Org Server Extension, version 2.0
(II) Loading extension RECORD
(II) LoadModule: "dri"
(II) Loading /usr/lib/xorg/modules/extensions//libdri.so
(II) Module dri: vendor="X.Org Foundation"
compiled for 1.6.0, module version = 1.0.0
ABI class: X.Org Server Extension, version 2.0
(II) Loading extension XFree86-DRI
(II) LoadModule: "dri2"
(II) Loading /usr/lib/xorg/modules/extensions//libdri2.so
(II) Module dri2: vendor="X.Org Foundation"
compiled for 1.6.0, module version = 1.0.0
ABI class: X.Org Server Extension, version 2.0
(II) Loading extension DRI2
(II) LoadModule: "nvidia"
(II) Loading /usr/lib/xorg/modules/drivers//nvidia_drv.so
(II) Module nvidia: vendor="NVIDIA Corporation"
compiled for 4.0.2, module version = 1.0.0
Module class: X.Org Video Driver
(II) NVIDIA dlloader X Driver 180.44 Tue Mar 24 05:51:43 PST 2009
(II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
(II) Primary Device is:
(EE) No devices detected.

Fatal server error:
no screens found

Please consult the The X.Org Foundation support
at http://wiki.x.org
for help.
Please also check the log file at "/var/log/Xorg.0.log" for additional information.

ddxSigGiveUp: Closing log




A real mess. Looks like I can't get the nvidia drivers to run at all, at this point.
Well, they did run for a second -- after following the steps in dibl's faq, at the first attempt to "startx", the gnome desktop (why?) came up and then crashed, but for a second, there, it sure was nice...

doctordruidphd
Mar 31st 2009, 10:45 PM
Further information:
I loaded a kubuntu-8.10 system and tried the same process.
Same problem, so it's not primarily a JJ issue.
Still, if anyone has gotten it to work, I'd like to know how.
Thanks.

flarson
Mar 31st 2009, 10:57 PM
The problem is you xorg.conf file. I am surprised that the system does not seem to have done this all by itself. After the install you should be using the xorg nv driver. If you then install the proprietary driver and run nvidia-xconfig it will alter your xorg.conf to load nvidia. At that point you should be set.

I would first try to give Kubuntu a clean shot at installing as it wants to. Maybe purge the nvidia drivers, run the xfix, and reinstall nvidia or something like that.

Otherwise look at this.
The no screens found is telling you that you need some entries like shown here
http://kubuntuforums.net/forums/index.php?topic=3102708.0

Not exactly the same but ones but those that work with your system. Newer Xservers can do without these lines by querying the hardware. Somehow your system is not doing this, but I think it would if you give it a clean shot at it.

dibl
Mar 31st 2009, 11:23 PM
Since EnvyNG didn't work, I recommend you download and install the 185.13 Beta driver -- it works great on my GTX260. Here's the detailed procedure:

http://kubuntuforums.net/forums/index.php?topic=3100807.0

If you have two cards in SLI, there is an issue with the PCI bus ID -- search Ubuntu Forum -- you need two different "Device" stanzas in xorg.conf, each with a different PCI bus ID for the cards.

doctordruidphd
Apr 1st 2009, 05:06 AM
Thanks to those who replied; it now works (I think).
A combination of experiments finally produced results.
Following dibl's instructions in his link, plus checking out the Ubuntu forums for info on dual cards,
and using the 185.13 driver. Finally those #$%^&* fans have shut up!
The screen effects work now; I'm not sure how to test if SLI is working.
I sure hope updating isn't going to be this much of a hassle.

For the record, here is the xorg.conf file I wound up with:



# nvidia-xconfig: X configuration file generated by nvidia-xconfig
# nvidia-xconfig: version 1.0 (buildmeister@builder62) Fri Mar 13 03:18:18 PST 2009

Section "ServerLayout"
Identifier "Layout0"
Screen 0 "Screen0"
InputDevice "Keyboard0" "CoreKeyboard"
InputDevice "Mouse0" "CorePointer"
EndSection

Section "Files"
EndSection

Section "Module"
Load "dbe"
Load "extmod"
Load "type1"
Load "freetype"
Load "glx"
EndSection

Section "InputDevice"
# generated from default
Identifier "Mouse0"
Driver "mouse"
Option "Protocol" "auto"
Option "Device" "/dev/psaux"
Option "Emulate3Buttons" "no"
Option "ZAxisMapping" "4 5"
EndSection

Section "InputDevice"
# generated from default
Identifier "Keyboard0"
Driver "kbd"
EndSection

Section "Monitor"
Identifier "Monitor0"
VendorName "Unknown"
ModelName "Unknown"
HorizSync 28.0 - 33.0
VertRefresh 43.0 - 72.0
Option "DPMS"
EndSection

Section "Device"
Identifier "Device0"
Driver "nvidia"
VendorName "NVIDIA Corporation"
Busid "PCI:01:00:00"
Option "sli" "auto"
EndSection

Section "Device"
Identifier "Device0"
Driver "nvidia"
VendorName "NVIDIA Corporation"
Busid "PCI:02:00:00"
Option "sli" "auto"
EndSection

Section "Screen"
Identifier "Screen0"
Device "Device0"
Monitor "Monitor0"
DefaultDepth 24
SubSection "Display"
Depth 24
EndSubSection
EndSection




N.B. the Device sections and the Busid entries. The PCI numbers are found by doing:


sudo lspci | grep VGA

dibl
Apr 1st 2009, 10:37 AM
Your xorg.conf looks good. Two more things to note:

1. Do you know how to use the nvidia-settings utility? Just Alt-F2 and enter "nvidia-settings" with no quote marks. Use it to set your resolution, and also use the "Detect Displays" function -- maybe it will find an EDID for your monitor.

2. If you are interested in overclocking the cards a little bit, you can manually edit xorg.conf and add the Coolbits option, so the line in the "Screen" stanza looks like the red line:


Section "Screen"
Identifier "Screen0"
Device "Device0"
Monitor "Monitor0"
DefaultDepth 24
Option "TwinView" "0"
Option "TwinViewXineramaInfoOrder" "CRT-0"
Option "metamodes" "1600x1200 +0+0"
Option "Coolbits" "1"
SubSection "Display"
Depth 24
EndSubSection
EndSection

Once you restart X with this option, a new "Clock Frequencies" item will appear in nvidia-settings. When you click it, you will have to accept the nVidia "No Warranty on This Stuff" legalism, and then there is the capability to run an auto-detecting routine to learn how much you can "safely" increase the clock and memory timings.

Also, if you run "nvidia-settings" in Super User mode, i.e. Alt-F2 "kdesudo nvidia-settings" with no quote marks, then you will have the option to "save to X configuration file" which means it will write your resolution settings into the xorg.conf file (which is why mine has the "metamodes" line in it).

doctordruidphd
Apr 1st 2009, 04:06 PM
Will give those a try. Thanks again for the help.
Right now, I'm just enjoying the peace and quiet, as well as the cube.

enkrypt3d
May 18th 2009, 08:55 PM
I'm having the same problem but it happens when I'm trying to install Kubuntu 9.04 on my Dell Precision T5400 with Dual Nvidia Quadro NVS 290's... Should I edit the xorg.conf on the actual ISO before I burn it so that X can start? that would be lame but I can do it if needed... thx

dibl
May 18th 2009, 09:22 PM
Should I edit the xorg.conf on the actual ISO before I burn it so that X can start? that would be lame but I can do it if needed... thx


No, don't try it. First try booting the Live CD and see how it looks running that way. If it won't display correctly, that's a concern -- possibly you'll need to push F4 "Safe Graphics" to get it to display.

If you choose to install, I would say just do the straight installation, and then plan to see a black screen upon booting. Alt-F1 and login to the CLI, and then issue lspci to learn the PCI bus addresses for your dual chips.

Try the "System>Hardware Drivers" routine and see if 9.04 will download and install the correct driver. According to nVidia's site here:

http://www.nvidia.com/Download/index.aspx?lang=en-us

when you navigate to the NVS 290, the driver version is 180.51.

So, if "Hardware Drivers" won't install it for you, you'll have to download the driver from the nVidia site and install it manually. This instruction, although a little dated in version numbers, is still appropos to your task:

http://kubuntuforums.net/forums/index.php?topic=3100807.msg164892#msg164892

:)