Steam games freeze after NVIDIA drivers update

My issue is similar to this one, but the details are different: Nvidia 560.35.03 + optimus-manager-git hybrid mode: game freeze - #19 by Mindar

Recently some Steam games have started freezing when run through Proton. Some freeze on startup, some while playing, and some not at all. I’ve reinstalled steam and proton-ge-custom-bin, and I’ve also tried different Proton versions.

When a game freezes, only the video stops; the sound keeps playing. So I suspect the problem is in the NVIDIA drivers, which were recently updated from 555.58.02 to 560.35.03.

I tried downgrading them with

downgrade lib32-nvidia-utils nvidia-dkms nvidia-utils opencl-nvidia

but after the install, nvidia-ctk-cdi.hook failed with this error:

failed to generate CDI spec: failed to create device CDI specs: failed to initialize NVML: Driver/library version mismatch

Since I’m not sure what it means, I upgraded them back to 560 without rebooting.

It seems I also need to downgrade libnvidia-container and nvidia-container-toolkit, but I don’t know which versions to choose and I’m afraid to mess with them.
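
For context, a "Driver/library version mismatch" from NVML usually means the NVIDIA kernel module that is currently loaded and the userspace libraries on disk are at different versions, which is what you would expect right after a downgrade and before a reboot. Below is a minimal sketch for comparing the two. It assumes the module exposes its version at /sys/module/nvidia/version and that the userspace version comes from the nvidia-utils package; the function names are made up for illustration.

```shell
#!/usr/bin/env bash
# Sketch: compare the loaded NVIDIA kernel module version with the installed
# nvidia-utils package version. The sysfs path and the package name are
# assumptions based on a typical Arch/EndeavourOS setup.

# Strip the pacman pkgrel suffix, e.g. "560.35.03-2" -> "560.35.03".
strip_pkgrel() {
    printf '%s\n' "${1%-*}"
}

# Compare two version strings; return 0 on match, 1 on mismatch.
compare_versions() {
    if [ "$1" = "$2" ]; then
        echo "match: $1"
    else
        echo "mismatch: module $1, userspace $2"
        return 1
    fi
}

# Read both versions from the running system and compare them.
check_nvidia_versions() {
    local loaded installed
    if [ ! -r /sys/module/nvidia/version ]; then
        echo "nvidia kernel module is not loaded"
        return 2
    fi
    loaded="$(cat /sys/module/nvidia/version)"
    installed="$(strip_pkgrel "$(pacman -Q nvidia-utils | awk '{print $2}')")"
    compare_versions "$loaded" "$installed"
}
```

Sourcing this file and calling check_nvidia_versions straight after a downgrade would be expected to report a mismatch until the machine is rebooted and the older module is loaded.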

NVIDIA-related packages:

$ yay -Qs nvidia
local/cuda 12.6.1-1
    NVIDIA's GPU programming toolkit
local/egl-wayland 4:1.1.16-1
    EGLStream-based Wayland external platform
local/lib32-libvdpau 1.5-2
    Nvidia VDPAU library
local/lib32-nvidia-utils 560.35.03-1
    NVIDIA drivers utilities (32-bit)
local/libnvidia-container 1.16.1-1
    NVIDIA container runtime library
local/libvdpau 1.5-3
    Nvidia VDPAU library
local/nvidia-container-toolkit 1.16.1-3
    NVIDIA container toolkit
local/nvidia-dkms 560.35.03-2
    NVIDIA drivers - module sources
local/nvidia-hook 1.5.1-1
    pacman hook for nvidia
local/nvidia-inst 24-1
    Script to setup nvidia drivers (dkms version) in EndeavourOS
local/nvidia-utils 560.35.03-2
    NVIDIA drivers utilities
local/opencl-nvidia 560.35.03-2
    OpenCL implemention for NVIDIA

nvidia-smi:

$ nvidia-smi
Thu Sep  5 14:09:24 2024       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 560.35.03              Driver Version: 560.35.03      CUDA Version: 12.6     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 2060        Off |   00000000:01:00.0 Off |                  N/A |
| N/A   49C    P8              6W /   90W |       7MiB /   6144MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
                                                                                         
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A      1235      G   /usr/lib/Xorg                                   4MiB |
+-----------------------------------------------------------------------------------------+

inxi -G:

$ inxi -G
Graphics:
  Device-1: Intel CoffeeLake-H GT2 [UHD Graphics 630] driver: i915 v: kernel
  Device-2: NVIDIA TU106M [GeForce RTX 2060 Mobile] driver: nvidia
    v: 560.35.03
  Device-3: Bison HD Webcam driver: uvcvideo type: USB
  Display: x11 server: X.Org v: 21.1.13 driver: X:
    loaded: modesetting,nvidia dri: iris gpu: i915 resolution: 1920x1080~144Hz
  API: EGL v: 1.5 drivers: iris,kms_swrast,nvidia,swrast
    platforms: gbm,x11,surfaceless,device
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: intel mesa v: 24.2.1-arch1.1
    renderer: Mesa Intel UHD Graphics 630 (CFL GT2)

When a freeze occurs, restart and then immediately run this command to retrieve the last 50 entries from your previous session’s log. Hopefully that will point to a possible cause. The command generates a link you can share here.

journalctl -b -1 | tail -n 50 | eos-sendlog
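
If the last 50 lines are mostly unrelated noise, it may help to narrow the previous boot’s kernel log down to GPU-related messages first. Here is a rough sketch; the pattern list is only a guess at what is relevant (the NVIDIA kernel driver prefixes its messages with NVRM and reports GPU faults as Xid errors), and filter_gpu_lines is a made-up helper name.

```shell
#!/usr/bin/env bash
# Sketch: keep only log lines that mention the GPU drivers.
# The pattern list is an assumption; extend it as needed.
filter_gpu_lines() {
    grep -iE 'nvrm|xid|i915|nvidia'
}

# Usage (previous boot, kernel messages only):
#   journalctl -k -b -1 --no-pager | filter_gpu_lines
```

The filtered output can be piped into eos-sendlog in the same way as the command above.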

Sorry, I didn’t explain it quite right. The game itself freezes, but the OS continues to work, so I can close the game from Steam.

There is literally nothing in journalctl at the time of the freeze (I use journalctl -f to check). One time I got kernel: i915 0000:00:02.0: [drm] *ERROR* Atomic update failure on pipe A (start=417228 end=417229) time 1121 us, min 1063, max 1079, scanline start 1040, end 52, but this error seems to be unrelated, since it was already showing up before the freezes began.

Might this be worth trying?
https://wiki.archlinux.org/title/Intel_graphics#Crash/freeze_on_low_power_Intel_CPUs

I managed to fix the issue by downgrading the NVIDIA drivers and related libraries:

sudo downgrade lib32-nvidia-utils nvidia-dkms nvidia-utils opencl-nvidia nvidia-hook libnvidia-container nvidia-container-toolkit

I picked nvidia-hook 1.5-2, libnvidia-container 1.15.0-1 and nvidia-container-toolkit 1.15.0-2, and 555.58.02-1 for everything else.
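
In case a later system update quietly pulls some of these back up, here is a small sketch for checking that the installed versions still match the ones listed above. It reads "name version" lines, which is the format pacman -Q prints; expected_versions and report_drift are hypothetical helper names, not part of any tool.

```shell
#!/usr/bin/env bash
# Sketch: report packages whose installed version differs from the
# downgraded versions mentioned in this post.

# Expected versions, copied from the post above.
expected_versions() {
    cat <<'EOF'
lib32-nvidia-utils 555.58.02-1
nvidia-dkms 555.58.02-1
nvidia-utils 555.58.02-1
opencl-nvidia 555.58.02-1
nvidia-hook 1.5-2
libnvidia-container 1.15.0-1
nvidia-container-toolkit 1.15.0-2
EOF
}

# Read "name version" lines on stdin and print one line for every
# expected package that is missing or at a different version.
report_drift() {
    local installed
    installed="$(cat)"
    expected_versions | while read -r name want; do
        have="$(printf '%s\n' "$installed" | awk -v n="$name" '$1 == n {print $2}')"
        [ "$have" = "$want" ] || echo "$name: expected $want, found ${have:-nothing}"
    done
}

# Usage:
#   pacman -Q | report_drift
```

No output means every listed package is still at the downgraded version.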

I’m tempted to attach that famous video of Linus Torvalds. no-vidia indeed.

