Linux NVIDIA
The following troubleshooting techniques may help resolve problems with NVIDIA on Linux.
- If you are using a container, is the container runtime working? Try `docker run --gpus all ubuntu nvidia-smi`. If this doesn't work, no programs will be able to see your NVIDIA GPU.
- Check if the uvm driver is loaded: `sudo nvidia-modprobe -u`
- Try reloading the nvidia_uvm driver: `sudo rmmod nvidia_uvm` then `sudo modprobe nvidia_uvm`
- Try rebooting
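Before reloading the module, it can help to confirm whether nvidia_uvm is currently loaded at all. The following is a minimal sketch, not an official tool: `uvm_listed` is a hypothetical helper name, and it assumes `lsmod`-style output where the module name is the first column.

```shell
#!/bin/sh
# Sketch only: uvm_listed is a hypothetical helper, not part of any NVIDIA tooling.

# Read lsmod-style output on stdin and succeed (exit 0) only if a module
# named exactly nvidia_uvm appears in the first column.
uvm_listed() {
    awk '$1 == "nvidia_uvm" { found = 1 } END { exit !found }'
}

# Example (run manually on the affected host):
#   if lsmod | uvm_listed; then
#       echo "nvidia_uvm is loaded"
#   else
#       echo "nvidia_uvm is not loaded; try: sudo nvidia-modprobe -u"
#   fi
```

Matching on the first column avoids a false positive from lines where nvidia_uvm only appears in another module's "used by" list.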
If none of those resolve the problem, gather additional information:
- Set `CUDA_ERROR_LEVEL=50` to get more diagnostic logs
- Check dmesg for any errors: `sudo dmesg | grep -i nvrm` and `sudo dmesg | grep -i nvidia`
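The two dmesg searches above can be combined into a single filter so the matching lines are easy to save and share. This is a rough sketch: `nvidia_log_lines` is a hypothetical helper name, and the output filename in the usage comment is only an example.

```shell
#!/bin/sh
# Sketch only: nvidia_log_lines is a hypothetical helper name.

# Read kernel log text on stdin and print only the lines that mention
# nvrm or nvidia, case-insensitively (same matches as the two greps above).
nvidia_log_lines() {
    grep -i -e 'nvrm' -e 'nvidia'
}

# Example (run manually on the affected host; the filename is arbitrary):
#   sudo dmesg | nvidia_log_lines > nvidia-dmesg.txt
```

Note that `grep` exits with a non-zero status when nothing matches, which here simply means the kernel log contains no NVIDIA-related lines.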
You may get more details for initialization failures by enabling debug prints in the uvm driver. You should only use this temporarily while troubleshooting: `sudo rmmod nvidia_uvm` then `sudo modprobe nvidia_uvm uvm_debug_prints=1`
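Because the debug prints are meant to be temporary, one way to keep the on and off steps paired is a small pair of shell functions. This is a sketch under stated assumptions: `uvm_debug_on`, `uvm_debug_off`, and the `RUN` variable are hypothetical names introduced here, and turning the prints off is assumed to mean reloading the module without the parameter.

```shell
#!/bin/sh
# Sketch only: RUN, uvm_debug_on and uvm_debug_off are hypothetical names.
# RUN is the command prefix; set RUN=echo to print the commands instead of
# running them.
RUN="${RUN:-sudo}"

# Reload nvidia_uvm with debug prints enabled.
uvm_debug_on() {
    $RUN rmmod nvidia_uvm && $RUN modprobe nvidia_uvm uvm_debug_prints=1
}

# Reload nvidia_uvm without the parameter once troubleshooting is done.
uvm_debug_off() {
    $RUN rmmod nvidia_uvm && $RUN modprobe nvidia_uvm
}

# Example (run manually on the affected host):
#   uvm_debug_on
#   ... reproduce the failure, then inspect: sudo dmesg | grep -i nvidia ...
#   uvm_debug_off
```

Unloading the module fails while any process is still using the GPU, so stop GPU workloads before calling either function.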