Crash of whole system while playing Yuzu

I don’t know what caused it. Here is the log:

jul 07 20:46:08 ruben kernel: WARNING: CPU: 6 PID: 0 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:656 amdgpu_irq_put+0x46/0x70 [amdgpu]
jul 07 20:46:08 ruben kernel: Modules linked in: uinput squashfs xpad ff_memless udp_diag tcp_diag inet_diag rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device mousedev joydev cmac algif_hash algif_skcipher af_alg bnep nct6775 nct6775_core hwmon_vid vfat fat usbhid amdgpu snd_hda_codec_realtek mt7921e snd_hda_codec_generic mt7921_common mt76_connac_lib mt76 snd_hda_codec_hdmi intel_rapl_msr snd_hda_intel intel_rapl_common snd_intel_dspcfg btusb snd_intel_sdw_acpi btrtl edac_mce_amd btbcm snd_hda_codec btintel btmtk gpu_sched mac80211 snd_hda_core kvm drm_buddy bluetooth snd_hwdep eeepc_wmi uas drm_ttm_helper snd_pcm irqbypass libarc4 ttm asus_wmi ecdh_generic usb_storage r8169 ledtrig_audio rapl i8042 drm_display_helper snd_timer sparse_keymap cfg80211 realtek platform_profile serio snd mdio_devres cec video wmi_bmof pcspkr soundcore libphy rfkill k10temp i2c_piix4 gpio_amdpt acpi_cpufreq gpio_generic mac_hid fuse loop dm_mod ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 crct10dif_pclmul
jul 07 20:46:08 ruben kernel:  crc32_pclmul crc32c_intel polyval_clmulni polyval_generic gf128mul ghash_clmulni_intel sha512_ssse3 nvme aesni_intel crypto_simd nvme_core cryptd ccp sp5100_tco nvme_common xhci_pci xhci_pci_renesas wmi vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) pkcs8_key_parser crypto_user
jul 07 20:46:08 ruben kernel: CPU: 6 PID: 0 Comm: swapper/6 Tainted: G           OE      6.1.38-1-lts #1 5b635add7cb08376448ebab72e56a17f1b0ae5e4
jul 07 20:46:08 ruben kernel: Hardware name: ASUS System Product Name/TUF GAMING B550M-PLUS WIFI II, BIOS 3002 02/23/2023
jul 07 20:46:08 ruben kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
jul 07 20:46:08 ruben kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 c3 cc cc cc cc e9 8a fd ff ff <0f> 0b b8 ea ff ff ff c3 cc cc cc cc b8 ea ff ff ff c3 cc cc cc cc
jul 07 20:46:08 ruben kernel: RSP: 0018:ffffa77c403c0e20 EFLAGS: 00010046
jul 07 20:46:08 ruben kernel: RAX: ffff9ad3475c3660 RBX: ffff9ad3492c1000 RCX: 0000000000000000
jul 07 20:46:08 ruben kernel: RDX: 0000000000000000 RSI: ffff9ad35cac6560 RDI: ffff9ad35cac0000
jul 07 20:46:08 ruben kernel: RBP: 0000000000000000 R08: ffffffffc20c5a1e R09: 0000000000000000
jul 07 20:46:08 ruben kernel: R10: ffffa77c403c0d10 R11: ffffa77c403c0d14 R12: ffff9ad35cac0010
jul 07 20:46:08 ruben kernel: R13: ffff9ad35cac0000 R14: ffff9ad501b7f000 R15: ffff9ad35df1f048
jul 07 20:46:08 ruben kernel: FS:  0000000000000000(0000) GS:ffff9ada4eb80000(0000) knlGS:0000000000000000
jul 07 20:46:08 ruben kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
jul 07 20:46:08 ruben kernel: CR2: 000055ae04b540e8 CR3: 0000000109d8a000 CR4: 0000000000750ee0
jul 07 20:46:08 ruben kernel: PKRU: 55555554
jul 07 20:46:08 ruben kernel: Call Trace:
jul 07 20:46:08 ruben kernel:  <IRQ>
jul 07 20:46:08 ruben kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu cd86bf80776bceb23aec1a9397da4ef546f7feb5]
jul 07 20:46:08 ruben kernel:  ? __warn+0x7d/0xd0
jul 07 20:46:08 ruben kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu cd86bf80776bceb23aec1a9397da4ef546f7feb5]
jul 07 20:46:08 ruben kernel:  ? report_bug+0xe6/0x150
jul 07 20:46:08 ruben kernel:  ? handle_bug+0x3c/0x80
jul 07 20:46:08 ruben kernel:  ? exc_invalid_op+0x17/0x70
jul 07 20:46:08 ruben kernel:  ? asm_exc_invalid_op+0x1a/0x20
jul 07 20:46:08 ruben kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu cd86bf80776bceb23aec1a9397da4ef546f7feb5]
jul 07 20:46:08 ruben kernel:  dm_set_vblank+0x187/0x1b0 [amdgpu cd86bf80776bceb23aec1a9397da4ef546f7feb5]
jul 07 20:46:08 ruben kernel:  drm_vblank_disable_and_save+0xba/0xf0
jul 07 20:46:08 ruben kernel:  vblank_disable_fn+0x67/0x80
jul 07 20:46:08 ruben kernel:  ? drm_vblank_disable_and_save+0xf0/0xf0
jul 07 20:46:08 ruben kernel:  ? drm_vblank_disable_and_save+0xf0/0xf0
jul 07 20:46:08 ruben kernel:  call_timer_fn+0x27/0x130
jul 07 20:46:08 ruben kernel:  __run_timers+0x21c/0x2a0
jul 07 20:46:08 ruben kernel:  run_timer_softirq+0x1d/0x40
jul 07 20:46:08 ruben kernel:  __do_softirq+0xd4/0x2c9
jul 07 20:46:08 ruben kernel:  ? sched_clock_cpu+0xd/0xb0
jul 07 20:46:08 ruben kernel:  __irq_exit_rcu+0xb7/0xe0
jul 07 20:46:08 ruben kernel:  sysvec_apic_timer_interrupt+0x72/0x90
jul 07 20:46:08 ruben kernel:  </IRQ>
jul 07 20:46:08 ruben kernel:  <TASK>
jul 07 20:46:08 ruben kernel:  asm_sysvec_apic_timer_interrupt+0x1a/0x20
jul 07 20:46:08 ruben kernel: RIP: 0010:cpuidle_enter_state+0xe2/0x420
jul 07 20:46:08 ruben kernel: Code: 00 00 31 ff e8 2f bd 7b ff 45 84 ff 74 16 9c 58 0f 1f 40 00 f6 c4 02 0f 85 25 03 00 00 31 ff e8 04 78 82 ff fb 0f 1f 44 00 00 <45> 85 f6 0f 88 85 01 00 00 49 63 d6 48 8d 04 52 48 8d 04 82 49 8d
jul 07 20:46:08 ruben kernel: RSP: 0018:ffffa77c401afe90 EFLAGS: 00000246
jul 07 20:46:08 ruben kernel: RAX: ffff9ada4ebb2e80 RBX: ffff9ad343500800 RCX: 0000000000000000
jul 07 20:46:08 ruben kernel: RDX: 0000000000000006 RSI: fffffffc71f7f646 RDI: 0000000000000000
jul 07 20:46:08 ruben kernel: RBP: 0000000000000002 R08: 0000000000000002 R09: 0000000025a5a65f
jul 07 20:46:08 ruben kernel: R10: ffff9ada4ebb1984 R11: 0000000000000407 R12: ffffffff99952100
jul 07 20:46:08 ruben kernel: R13: 00000ac88f1d2f62 R14: 0000000000000002 R15: 0000000000000000
jul 07 20:46:08 ruben kernel:  cpuidle_enter+0x2d/0x40
jul 07 20:46:08 ruben kernel:  do_idle+0x1ed/0x270
jul 07 20:46:08 ruben kernel:  cpu_startup_entry+0x1d/0x20
jul 07 20:46:08 ruben kernel:  start_secondary+0x12b/0x150
jul 07 20:46:08 ruben kernel:  secondary_startup_64_no_verify+0xe5/0xeb
jul 07 20:46:08 ruben kernel:  </TASK>
jul 07 20:46:08 ruben kernel: ---[ end trace 0000000000000000 ]---
jul 07 20:46:18 ruben plasmashell[9540]: terminate called after throwing an instance of 'Vulkan::vk::Exception'
jul 07 20:46:18 ruben plasmashell[9540]:   what():  VK_ERROR_DEVICE_LOST
jul 07 20:46:18 ruben systemd[1]: Started Process Core Dump (PID 581001/UID 0).
jul 07 20:46:29 ruben gamemoded[9613]: ERROR: Could not call UnInhibit on org.freedesktop.ScreenSaver: Connection timed out
jul 07 20:46:29 ruben gamemoded[9613]:         org.freedesktop.DBus.Error.Timeout
jul 07 20:46:29 ruben gamemoded[9613]:         Connection timed out
jul 07 20:46:29 ruben pkexec[581020]: pam_unix(polkit-1:session): session opened for user root(uid=0) by ruben(uid=1000)
jul 07 20:46:29 ruben pkexec[581020]: ruben: Executing command [USER=root] [TTY=unknown] [CWD=/home/ruben] [COMMAND=/usr/lib/gamemode/cpugovctl set schedutil]
jul 07 20:46:59 ruben systemd-logind[772]: Power key pressed short.

What gpu do you have?

AMD Sapphire RX 6600

Did this happen before or is this the first time since you upgraded to the 6.4 kernel and does it happen regularly now , if you aren’t running another kernel?

I just built this PC today, didn’t happen before while I was testing this morning. I did stress tests on CPU but not on GPU and was testing since Yuzu puts the GPU to 100%.
The temperatures are not greater than 80ºC and the power supply should be working fine, maybe this is a bug of Yuzu? or vulkan?. I’m scared of this being a problem of hardware :S

How long did it take for the crash to take happen? Try playing again and see if it happens again that way you know if it’s a reoccurring problem and if it is you can then try to install the lts kernel to see if the problem doesn’t occur then?

I tried before and again happened. When I die and the game has to reload, the CPU takes a bit more of effort and then it crashes (still the temperatures are just a bit hotter and should not be a problem).
I’ll keep trying until it happens again.

Also when GPU is at 100% the desktop feels laggy, idk if that’s normal.

edit: (I’m using lts kernel btw)

That doesn’t sound normal. Install the linux-lts kernel and then boot into the linux-lts kernel and see what happens then?

I’m using the lts kernel

What happens when you use the vanilla kernel or the zen kernel?

Should I try them? I was using lts all the time because I though it would be more stable.

It will be useful to see if the problem occurs in newer kernel or not, as a comparison since it looks like your amdgpu driver is crashing because if it was a yuzu application crash or vulkan crash it probably wouldn’t crash your whole system.

I didn’t have REISUB enabled to test, but I think the system was still responsive somehow, it froze the image but the audio was still working for a moment.
Also, I just tried the default kernel: Linux ruben 6.4.1-arch2-1 #1 SMP PREEMPT_DYNAMIC Tue, 04 Jul 2023 08:39:40 +0000 x86_64 GNU/Linux

It’s still laggy.

When it crashes are you able to switch to a tty, restart your display manager and then continue as usual without rebooting your system.

I wasn’t able to TTY, I tried.
Also, the KDE FPS effects shows that my whole system runs at ~50fps when the GPU is at 100%… this is weird…

Also maybe try installing ryujinx to see if you get the same crash then, to rule out whether it’s a yuzu problem or not?

Sure, I’ll try that too.

I must say that the lags disappears when I put the resolution a bit lower in Yuzu. Maybe 2X is too much for this GPU, but it’s still not normal that this causes the whole system to be laggy, right?, it should drop FPS ingame but not in the desktop…

At what resolution do you normally play games with your RX 6600?