I encountered a problem while trying to install my GPU drivers.
I installed the 495 NVIDIA driver. Which was the recommended driver for Ubuntu.
Somehow, nvidia-smi doesn’t find the driver I installed. Yet, the driver is present when running DKMS status:
user@server:~$ nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
user@server:~$ dkms status
nvidia, 495.29.05, 5.11.0-41-generic, x86_64: installed
user@server:~$ nvidia-debugdump -l
Error: nvmlInit(): Driver Not Loaded
user@server:~$ lspci -v | grep VGA
01:00.0 VGA compatible controller: NVIDIA Corporation TU106 [GeForce RTX 2060 Rev. A] (rev a1) (prog-if 00 [VGA controller])
I also get this:
user@server:~$ systemctl status nvidia-persistenced.service
● nvidia-persistenced.service - NVIDIA Persistence Daemon
Loaded: loaded (/lib/systemd/system/nvidia-persistenced.service; enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Mon 2021-12-06 14:50:17 EST; 38min ago
Process: 1109 ExecStart=/usr/bin/nvidia-persistenced --verbose (code=exited, status=1/FAILURE)
Process: 1116 ExecStopPost=/bin/rm -rf /var/run/nvidia-persistenced/* (code=exited, status=0/SUCCESS)
Dec 06 14:50:17 server systemd[1]: nvidia-persistenced.service: Scheduled restart job, restart counter is at 5.
Dec 06 14:50:17 server systemd[1]: Stopped NVIDIA Persistence Daemon.
Dec 06 14:50:17 server systemd[1]: nvidia-persistenced.service: Start request repeated too quickly.
Dec 06 14:50:17 server systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
Dec 06 14:50:17 server systemd[1]: Failed to start NVIDIA Persistence Daemon.
My GPU is detected. Am I missing a step required to link nvidia-smi to the Ubuntu NVIDIA driver?
I have a xorg.conf
file, with which I am able to set a resolution size, but not connect it to nvidia-smi.
Let me know if you need more info on this issue.
Thank you in advance.
(Edit):
Here is sudo lshw -c video
user@server:~$ sudo lshw -c video
*-display UNCLAIMED
description: VGA compatible controller
product: TU106 [GeForce RTX 2060 Rev. A]
vendor: NVIDIA Corporation
physical id: 0
bus info: pci@0000:01:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller bus_master cap_list
configuration: latency=0
resources: memory:f6000000-f6ffffff memory:e0000000-efffffff memory:f0000000-f1ffffff ioport:e000(size=128) memory:c0000-dffff