I have a Quadro P2000 GPU that I’m using on Rocky 9.3 with KDE. After install the console boots and I login but then I get a black screen and it goes back to login. Googling seems to indicate this is common for other distros but I found no good fix for Rocky. I used the nvidia RHEL9 repository, maybe this won’t work? I’m still able to ssh into the system. I’ve been doing headless sysadmin on linux since RH3 but never once setup a workstation with a GPU and drivers so I’m out of my area.
Here is how I installed the nvidia drivers from nvidia’s RHEL9 repository:
dnf install epel-release
dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel9/x86_64/cuda-rhel9.repo
dnf install kernel-devel-$(uname -r) kernel-headers-$(uname -r)
dnf install nvidia-driver nvidia-settings
dnf install cuda-driver
These all installed fine. Driver is nvidia-driver-3:545.23.08-1.el9.x86_64
uname -a:
Linux Zoomer 5.14.0-362.13.1.el9_3.x86_64 #1 SMP PREEMPT_DYNAMIC Wed Dec 13 14:07:45 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
dmesg after logging in:
[ 16.740802] nvidia-nvlink: Nvlink Core is being initialized, major device number 237
[ 16.742966] nvidia 0000:03:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
[ 16.756067] intel_rapl_common: Found RAPL domain package
[ 16.756075] intel_rapl_common: Found RAPL domain dram
[ 16.756079] intel_rapl_common: DRAM domain energy unit 15300pj
[ 16.757070] intel_rapl_common: Found RAPL domain package
[ 16.757080] intel_rapl_common: Found RAPL domain dram
[ 16.757086] intel_rapl_common: DRAM domain energy unit 15300pj
[ 16.945814] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 545.23.08 Mon Nov 6 23:49:37 UTC 2023
[ 17.013038] nvidia_uvm: module uses symbols from proprietary module nvidia, inheriting taint.
[ 17.413341] nvidia-uvm: Loaded the UVM driver, major device number 235.
[ 17.443597] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 545.23.08 Mon Nov 6 23:23:07 UTC 2023
[ 17.458450] [drm] [nvidia-drm] [GPU ID 0x00000300] Loading driver
[ 17.458453] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:03:00.0 on minor 0
[ 17.479476] XFS (dm-2): Mounting V5 Filesystem
[ 17.494614] XFS (sdb2): Mounting V5 Filesystem
[ 17.661618] XFS (dm-2): Ending clean mount
[ 17.676396] XFS (sdb2): Ending clean mount
[ 18.150966] RPC: Registered named UNIX socket transport module.
[ 18.150969] RPC: Registered udp transport module.
[ 18.150970] RPC: Registered tcp transport module.
[ 18.150970] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 18.762823] msr: Write to unrecognized MSR 0x17f by mcelog (pid: 1956).
[ 18.762827] msr: See https://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git/about for details.
[ 19.845186] NET: Registered PF_QIPCRTR protocol family
[ 21.541893] ixgbe 0000:86:00.0: registered PHC device on ens5f0
[ 21.858924] ixgbe 0000:86:00.1: registered PHC device on ens5f1
[ 24.928876] e1000e 0000:00:19.0 eno1: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx
[ 24.928937] IPv6: ADDRCONF(NETDEV_CHANGE): eno1: link becomes ready
[ 28.826016] block dm-0: the capability attribute has been deprecated.
[ 28.950196] mpt3sas 0000:01:00.0: invalid VPD tag 0x00 (size 0) at offset 0; assume missing optional EEPROM
[ 1074.172111] logitech-hidpp-device 0003:046D:4051.0007: HID++ 4.5 device connected.
journalctl -xe after logging in:
Dec 28 10:32:26 Zoomer sddm-helper[5496]: Starting X11 session: "" "/usr/bin/sddm-greeter --socket /tmp/sddm-:0-qHVPRL --theme /usr/share/sddm/themes/breeze"
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: Loading file:///usr/share/sddm/themes/breeze/Main.qml...
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: QObject: Cannot create children for a parent that is in a different thread.
(Parent is QGuiApplication(0x7fff933d82e0), parent's thread is QThread(0x56175e3653e0), current thread is QThread(0x56175e45d2d0)
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: QObject: Cannot create children for a parent that is in a different thread.
(Parent is QGuiApplication(0x7fff933d82e0), parent's thread is QThread(0x56175e3653e0), current thread is QThread(0x56175e45d2d0)
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: QObject: Cannot create children for a parent that is in a different thread.
(Parent is QGuiApplication(0x7fff933d82e0), parent's thread is QThread(0x56175e3653e0), current thread is QThread(0x56175e45d2d0)
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: QObject: Cannot create children for a parent that is in a different thread.
(Parent is QGuiApplication(0x7fff933d82e0), parent's thread is QThread(0x56175e3653e0), current thread is QThread(0x56175e45d2d0)
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: QObject: Cannot create children for a parent that is in a different thread.
(Parent is QGuiApplication(0x7fff933d82e0), parent's thread is QThread(0x56175e3653e0), current thread is QThread(0x56175e45d2d0)
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: QObject: Cannot create children for a parent that is in a different thread.
(Parent is QGuiApplication(0x7fff933d82e0), parent's thread is QThread(0x56175e3653e0), current thread is QThread(0x56175e45d2d0)
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: QObject: Cannot create children for a parent that is in a different thread.
(Parent is QGuiApplication(0x7fff933d82e0), parent's thread is QThread(0x56175e3653e0), current thread is QThread(0x56175e45d2d0)
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: QObject::installEventFilter(): Cannot filter events for objects in a different thread.
Dec 28 10:32:26 Zoomer sddm-greeter[5497]: Failed to find a Kirigami platform plugin
Dec 28 10:32:27 Zoomer sddm-greeter[5497]: Qt Quick Layouts: Polish loop detected. Aborting after two iterations.
Dec 28 10:32:27 Zoomer sddm-greeter[5497]: Hunspell dictionary is missing for "en_US" . Search paths ("/usr/share/qt5/qtvirtualkeyboard/hunspell", "/usr/share/hunspell", "/usr/share/myspell/dicts")
xorg.conf.d/10-nvidia:
Section "OutputClass"
Identifier "nvidia"
MatchDriver "nvidia-drm"
Driver "nvidia"
Option "AllowEmptyInitialConfiguration"
Option "PrimaryGPU" "yes"
Option "SLI" "Auto"
Option "BaseMosaic" "on"
EndSection
Section "OutputClass"
Identifier "intel"
MatchDriver "i915"
Driver "modesetting"
EndSection
output of nvidia-smi:
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 545.23.08 Driver Version: 545.23.08 CUDA Version: 12.3 |
|-----------------------------------------+----------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 Quadro P2000 Off | 00000000:03:00.0 On | N/A |
| 47% 34C P8 5W / 75W | 152MiB / 5120MiB | 0% Default |
| | | N/A |
+-----------------------------------------+----------------------+----------------------+
+---------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=======================================================================================|
| 0 N/A N/A 7724 G /usr/libexec/Xorg 40MiB |
| 0 N/A N/A 7909 G /usr/bin/sddm-greeter 109MiB |
+---------------------------------------------------------------------------------------+
lshw -c video
*-display
description: VGA compatible controller
product: GP106GL [Quadro P2000]
vendor: NVIDIA Corporation
physical id: 0
bus info: pci@0000:03:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller bus_master cap_list rom
configuration: driver=nvidia latency=0
resources: irq:239 memory:f2000000-f2ffffff memory:e0000000-efffffff memory:f0000000-f1ffffff ioport:3000(size=128) memory:c0000-dffff
*-graphics
product: EFI VGA
physical id: 1
logical name: /dev/fb0
capabilities: fb
configuration: depth=32 resolution=1024,768
Secureboot is turned off in the BIOS.
I can do a Ctrl-Alt-F3 on the console and login with the same user and run startx and everything works fine.
Any idea’s would be appreciated!