Amdgpu [drm] REG_WAIT error causes applications to crash

After a while, some running applications crash with the following messages in dmesg. It usually happens after the lock screen kicks in or after the computer has been idle for a while. The system is up to date.

Any idea what to do?

[157621.551076] amdgpu 0000:14:00.0: [drm] REG_WAIT timeout 1us * 100000 tries - optc1_wait_for_state line:839
[157621.704777] amdgpu 0000:14:00.0: [drm] REG_WAIT timeout 1us * 100000 tries - optc1_wait_for_state line:839
[157621.857677] amdgpu 0000:14:00.0: [drm] REG_WAIT timeout 1us * 100000 tries - optc1_wait_for_state line:839
[157621.939089] QSGRenderThread[81058]: segfault at 0 ip 0000702bdcba543b sp 0000702b895ff770 error 4 in libQt6WaylandClient.so.6.7.2[702bdcb71000+5f000] likely on CPU 3 (core 3, socket 0)
[157621.939098] Code: 0f 1e fa 55 48 89 e5 41 55 4c 8d 2d 5f df fc ff 41 54 4c 8d 67 10 53 48 89 fb 4c 89 e7 48 83 ec 08 ff 15 90 87 05 00 48 89 c7 <48> 8b 00 48 8b 40 10 4c 39 e8 75 59 4c 89 e7 ff 15 78 87 05 00 49
[157622.129857] recoll[110941]: segfault at 8 ip 00007108a2faaee4 sp 00007ffd6887f4f8 error 4 in libQt6Gui.so.6.7.2[7108a2ed9000+618000] likely on CPU 20 (core 4, socket 0)
[157622.129870] Code: 84 00 00 00 00 00 f0 83 28 01 75 9a 48 8b 7d b0 ff 15 90 87 68 00 eb 8e ff 15 80 8c 68 00 0f 1f 84 00 00 00 00 00 f3 0f 1e fa <48> 8b 57 08 48 8b 02 48 85 c0 74 10 8b 40 04 85 c0 74 09 48 8b 42
[157622.322703] kscreenlocker_g[112526]: segfault at 8 ip 000075daac1aaee4 sp 00007ffe89a0d018 error 4 in libQt6Gui.so.6.7.2[75daac0d9000+618000] likely on CPU 8 (core 8, socket 0)
[157622.322714] Code: 84 00 00 00 00 00 f0 83 28 01 75 9a 48 8b 7d b0 ff 15 90 87 68 00 eb 8e ff 15 80 8c 68 00 0f 1f 84 00 00 00 00 00 f3 0f 1e fa <48> 8b 57 08 48 8b 02 48 85 c0 74 10 8b 40 04 85 c0 74 09 48 8b 42
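To see how closely the REG_WAIT timeouts and the segfaults line up in time, it can help to strip everything else out of the kernel log. The helper below is only a sketch: the function name filter_gpu_crash_lines is made up for illustration, and the two patterns are simply the strings visible in the log above.

```shell
# Keep only amdgpu REG_WAIT timeouts and userspace segfault lines
# from a kernel log read on standard input.
# (Hypothetical helper name; the patterns are taken from the log above.)
filter_gpu_crash_lines() {
    # "|| true" keeps the function from reporting failure when nothing matches.
    grep -E 'REG_WAIT timeout|segfault at' || true
}
```

Typical use would be something like `sudo dmesg | filter_gpu_crash_lines`, or `journalctl -k -b | filter_gpu_crash_lines` for the current boot.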

I wonder whether the following kernel parameter would help. If you are on systemd-boot, you need to add it to /etc/kernel/cmdline and then run sudo reinstall-kernels.

idle=nomwait
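For reference, here is a rough sketch of how the parameter could be appended without duplicating it on a second run. The function name add_kernel_param is made up for illustration, and it assumes /etc/kernel/cmdline is a single line of space-separated parameters, as is usual with systemd-boot setups.

```shell
# Append a kernel parameter to a one-line cmdline file unless it is already there.
# Usage: add_kernel_param PARAM [FILE]   (FILE defaults to /etc/kernel/cmdline)
# (Hypothetical helper; assumes the file holds one line of space-separated words.)
add_kernel_param() {
    param="$1"
    file="${2:-/etc/kernel/cmdline}"

    # Read the existing line (stays empty if the file does not exist yet).
    current=""
    [ -f "$file" ] && current="$(cat "$file")"

    # Do nothing if the exact parameter is already one of the words.
    case " $current " in
        *" $param "*) return 0 ;;
    esac

    # Rewrite the file as a single line with the parameter appended.
    if [ -n "$current" ]; then
        printf '%s %s\n' "$current" "$param" > "$file"
    else
        printf '%s\n' "$param" > "$file"
    fi
}
```

Since /etc/kernel/cmdline is owned by root, this has to run in a root shell. After `add_kernel_param idle=nomwait`, run `sudo reinstall-kernels` as described above and reboot for the change to take effect.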

Sadly, it did not help. A new kernel update did not help either.

Is your UEFI BIOS up to date?

There are beta versions available, and I am reluctant to update, especially to a beta, as there is usually something broken in a new BIOS.
I should mention that only some applications segfault: VSCode, Recoll, kscreenlocker. I have not noticed any other applications crashing.

I guess keep an eye out for when the beta becomes a new version number.

I have encountered a lot of different issues since I started using EOS a few years ago, but the advice to update the BIOS has not helped even once, and I have updated it at least five times.
Is there maybe something else to try?

On the other hand, I have found this: https://gitlab.freedesktop.org/drm/amd/-/issues/3368 so the error might not be related to the crashes I am experiencing.

Are you having this issue on X11 or Wayland? If it is the latter, try switching to an X11 session to see what happens. Which GPU do you have? Maybe other people are having issues with the same GPU and KDE Plasma.
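If you are not sure which one you are on, the XDG_SESSION_TYPE environment variable normally tells you. A tiny sketch (the function name session_type is just for illustration):

```shell
# Print the session type of the current login ("wayland", "x11", "tty", ...),
# or "unknown" when XDG_SESSION_TYPE is not set in this environment.
# (Hypothetical helper name.)
session_type() {
    printf '%s\n' "${XDG_SESSION_TYPE:-unknown}"
}
```

Run `session_type` in a terminal inside the desktop session itself; over SSH or on a plain console the variable may be missing or say tty.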

The issue is on Wayland. I cannot use X11, as I have been having black screen and glitch issues on it, which was my reason for using Wayland.

AMD iGPU (x7950), KDE Plasma.

I came across this issue.

They advise adding amdgpu.dcdebugmask=0x10 to your kernel boot parameters. It might not be exactly the same issue, but it might be worth trying. Have you checked your GPU temperatures while the system is running? Have you tried switching to the linux-lts kernel to see whether the issue occurs there too?
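After rebooting, it is worth confirming that the parameter actually reached the kernel, since a test that "did not help" means little if the option was never applied. A minimal sketch, with a made-up function name has_kernel_param, that looks for the parameter as a whole word in /proc/cmdline:

```shell
# Succeed if PARAM appears as a whole word in the kernel command line.
# Usage: has_kernel_param PARAM [FILE]   (FILE defaults to /proc/cmdline)
# (Hypothetical helper; the FILE argument exists only to make it easy to try out.)
has_kernel_param() {
    param="$1"
    file="${2:-/proc/cmdline}"
    case " $(cat "$file") " in
        *" $param "*) return 0 ;;
        *) return 1 ;;
    esac
}
```

For example: `has_kernel_param amdgpu.dcdebugmask=0x10 && echo active || echo "not active"`.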

Sounds like a different issue but I will try it when I get to reboot.
Temperatures seem to be fine.
Now that I think about it, the issues started with the 6.9.* kernel versions.

Are you strictly using the integrated graphics? You don’t have any other GPU?

I don’t find this to be true. :thinking:

@ricklinux I also have an NVIDIA card, but the DE runs on the AMD iGPU, as I am using render offload.
There are a lot of issues with the BIOS and my motherboard for some reason; the one I am using now is the most stable. Considering there is an AGESA update in the new BIOS, maybe I should, but…

@Cphusion Boot parameter did not help.

Updating to kernel 6.10.* fixed these crashes.
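For anyone landing here with the same symptoms, a quick way to tell which kernel series is running is to cut the release string down to its first two numbers. This is only a sketch: the function name kernel_series is made up, and the release string in the usage note is an illustrative example, not taken from this system.

```shell
# Print the MAJOR.MINOR series of a kernel release string.
# Usage: kernel_series [RELEASE]   (RELEASE defaults to the output of uname -r)
# (Hypothetical helper name.)
kernel_series() {
    release="${1:-$(uname -r)}"
    # Keep only the first two dot-separated fields, e.g. "6.9" or "6.10".
    printf '%s\n' "$release" | cut -d. -f1,2
}
```

Calling `kernel_series` with no argument reports the running kernel. Passing a string such as 6.9.7-arch1-1 yields 6.9, which in this thread is the series where the crashes started.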
