Freeze with kernel 5.12 and acpi_enforce_resources=lax

Hi,

my amdgu freezes resp. the PC freezes when I use kernel command line parameter acpi_enforce_resources=lax together with kernel 5.12. The freeze happens when I log out and let my PC sit idle for 10 min.

journal
Jun 05 18:11:25 rakete kernel: [drm] free PSP TMR buffer
Jun 05 18:12:02 rakete kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000000000).
Jun 05 18:12:02 rakete kernel: [drm] PSP is resuming...
Jun 05 18:12:02 rakete kernel: [drm] reserve 0x900000 from 0x800f400000 for PSP TMR
Jun 05 18:12:02 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jun 05 18:12:02 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jun 05 18:12:02 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Jun 05 18:12:02 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: SMU is resuming...
Jun 05 18:12:02 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: smu driver if version = 0x00000036, smu fw if version = 0x00000037, smu fw version = 0x002a3f00 (42.63.0)
Jun 05 18:12:02 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: SMU driver if version not matched
Jun 05 18:12:04 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: message: EnableAllSmuFeatures (6)         param: 0x00000000 is timeout (no response)
Jun 05 18:12:04 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: Failed to enable requested dpm features!
Jun 05 18:12:04 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: Failed to setup smc hw!
Jun 05 18:12:04 rakete kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <smu> failed -62
Jun 05 18:12:04 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: amdgpu_device_ip_resume failed (-62).
Jun 05 18:12:04 rakete kernel: snd_hda_intel 0000:0e:00.1: refused to change power state from D3hot to D0
Jun 05 18:12:04 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:12:04 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:12:04 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:12:04 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:12:04 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:12:04 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:12:05 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:12:05 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:12:05 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:12:05 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:12:05 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:12:05 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:12:05 rakete kernel: snd_hda_intel 0000:0e:00.1: CORB reset timeout#2, CORBRP = 65535
Jun 05 18:12:15 rakete kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=37233, emitted seq=37235
Jun 05 18:12:15 rakete kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Jun 05 18:12:15 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: GPU reset begin!
Jun 05 18:12:15 rakete kernel: ------------[ cut here ]------------
Jun 05 18:12:15 rakete kernel: kernel BUG at mm/slub.c:314!
Jun 05 18:12:15 rakete kernel: invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
Jun 05 18:12:15 rakete kernel: CPU: 5 PID: 313159 Comm: kworker/5:0 Tainted: P           OE     5.12.9-zen1-1-zen #1
Jun 05 18:12:15 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F33 05/21/2021
Jun 05 18:12:15 rakete kernel: Workqueue: events drm_sched_job_timedout [gpu_sched]
Jun 05 18:12:15 rakete kernel: RIP: 0010:__slab_free+0x292/0x5b0
Jun 05 18:12:15 rakete kernel: Code: c9 0f 84 95 00 00 00 48 8b 44 24 78 65 48 2b 04 25 28 00 00 00 0f 85 80 02 00 00 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d c3 <0f> 0b 65 ff 0d b5 a5 8d 50 0f 84 e6 00 00 00 f3 90 49 8b 04 24 a8
Jun 05 18:12:15 rakete kernel: RSP: 0018:ffffb2e54874bc80 EFLAGS: 00010246
Jun 05 18:12:15 rakete kernel: RAX: ffffa08cdfafcd00 RBX: ffffa08cdfafcc00 RCX: ffffa08cdfafcc00
Jun 05 18:12:15 rakete kernel: RDX: 000000008020001f RSI: ffffe9dd447ebf00 RDI: ffffa08cc0042a00
Jun 05 18:12:15 rakete kernel: RBP: ffffb2e54874bd30 R08: 0000000000000001 R09: ffffffffc187c9e1
Jun 05 18:12:15 rakete kernel: R10: ffffa09bff242000 R11: ffffb2e54874ba40 R12: ffffe9dd447ebf00
Jun 05 18:12:15 rakete kernel: R13: ffffa08cdfafcc00 R14: ffffa08cc0042a00 R15: ffffa08cdfafcc00
Jun 05 18:12:15 rakete kernel: FS:  0000000000000000(0000) GS:ffffa09bbeb40000(0000) knlGS:0000000000000000
Jun 05 18:12:15 rakete kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 05 18:12:15 rakete kernel: CR2: 00007f6bdfd7dfa8 CR3: 0000000301584000 CR4: 0000000000350ee0
Jun 05 18:12:15 rakete kernel: Call Trace:
Jun 05 18:12:15 rakete kernel:  ? __flush_work.isra.0+0x18e/0x210
Jun 05 18:12:15 rakete kernel:  ? kfd_gtt_sa_free+0x56/0x80 [amdgpu]
Jun 05 18:12:15 rakete kernel:  ? kernel_queue_uninit+0x81/0xe0 [amdgpu]
Jun 05 18:12:15 rakete kernel:  kernel_queue_uninit+0x81/0xe0 [amdgpu]
Jun 05 18:12:15 rakete kernel:  stop_cpsch+0xa0/0xc0 [amdgpu]
Jun 05 18:12:15 rakete kernel:  kgd2kfd_pre_reset+0x56/0x80 [amdgpu]
Jun 05 18:12:15 rakete kernel:  amdgpu_device_gpu_recover.cold+0x2a8/0xa58 [amdgpu]
Jun 05 18:12:15 rakete kernel:  amdgpu_job_timedout+0x128/0x150 [amdgpu]
Jun 05 18:12:15 rakete kernel:  drm_sched_job_timedout+0x64/0xe0 [gpu_sched]
Jun 05 18:12:15 rakete kernel:  process_one_work+0x214/0x3e0
Jun 05 18:12:15 rakete kernel:  worker_thread+0x4d/0x470
Jun 05 18:12:15 rakete kernel:  ? process_one_work+0x3e0/0x3e0
Jun 05 18:12:15 rakete kernel:  kthread+0x181/0x1b0
Jun 05 18:12:15 rakete kernel:  ? __kthread_init_worker+0x50/0x50
Jun 05 18:12:15 rakete kernel:  ret_from_fork+0x22/0x30
Jun 05 18:12:15 rakete kernel: Modules linked in: uas usb_storage cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87(OE) hwmon_vid amdgpu rc_tt_1500 stb6100 isl6423 stb0899 dvb_usb_pctv452e(OE) dvb_usb(OE) snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi ttpci_eeprom dvb_core intel_rapl_msr snd_hda_intel snd_intel_dspcfg videobuf2_vmalloc intel_rapl_common snd_intel_sdw_acpi videobuf2_memops amd64_edac btusb edac_mce_amd videobuf2_common btrtl btbcm snd_hda_codec gpu_sched btintel drm_ttm_helper videodev lzo_rle ttm bluetooth kvm_amd snd_hda_core snd_hwdep mc snd_pcm drm_kms_helper ecdh_generic kvm rfkill snd_timer cec ecc snd crc16 syscopyarea irqbypass sysfillrect igb rapl sysimgblt wmi_bmof mxm_wmi soundcore fb_sys_fops k10temp i2c_piix4 i2c_algo_bit dca acpi_cpufreq vfat fat drm fuse agpgart zram ip_tables x_tables usbhid zfs(POE) xfs zunicode(POE) zzstd(OE) zlua(OE) libcrc32c zavl(POE) crc32c_generic icp(POE) zcommon(POE)
Jun 05 18:12:15 rakete kernel:  znvpair(POE) spl(OE) crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd ccp cryptd rng_core sr_mod cdrom xhci_pci xhci_pci_renesas wmi pinctrl_amd vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) pkcs8_key_parser sg crypto_user
Jun 05 18:12:15 rakete kernel: ---[ end trace 4f4690ebc468e2b1 ]---
Jun 05 18:12:15 rakete kernel: RIP: 0010:__slab_free+0x292/0x5b0
Jun 05 18:12:15 rakete kernel: Code: c9 0f 84 95 00 00 00 48 8b 44 24 78 65 48 2b 04 25 28 00 00 00 0f 85 80 02 00 00 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d c3 <0f> 0b 65 ff 0d b5 a5 8d 50 0f 84 e6 00 00 00 f3 90 49 8b 04 24 a8
Jun 05 18:12:15 rakete kernel: RSP: 0018:ffffb2e54874bc80 EFLAGS: 00010246
Jun 05 18:12:15 rakete kernel: RAX: ffffa08cdfafcd00 RBX: ffffa08cdfafcc00 RCX: ffffa08cdfafcc00
Jun 05 18:12:15 rakete kernel: RDX: 000000008020001f RSI: ffffe9dd447ebf00 RDI: ffffa08cc0042a00
Jun 05 18:12:15 rakete kernel: RBP: ffffb2e54874bd30 R08: 0000000000000001 R09: ffffffffc187c9e1
Jun 05 18:12:15 rakete kernel: R10: ffffa09bff242000 R11: ffffb2e54874ba40 R12: ffffe9dd447ebf00
Jun 05 18:12:15 rakete kernel: R13: ffffa08cdfafcc00 R14: ffffa08cc0042a00 R15: ffffa08cdfafcc00
Jun 05 18:12:15 rakete kernel: FS:  0000000000000000(0000) GS:ffffa09bbeb40000(0000) knlGS:0000000000000000
Jun 05 18:12:15 rakete kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 05 18:12:15 rakete kernel: CR2: 00007f6bdfd7dfa8 CR3: 0000000301584000 CR4: 0000000000350ee0
Jun 05 18:13:00 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:13:00 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:13:00 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:13:00 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:13:00 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:13:00 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:14:00 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:14:00 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:14:00 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:14:00 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:14:00 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:14:00 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!
Jun 05 18:14:00 rakete kernel: amdgpu: Move buffer fallback to memcpy unavailable
Jun 05 18:14:00 rakete kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -19!

This is my hardware:

Machine:   Type: Desktop System: Gigabyte product: X570 AORUS ULTRA v: -CF serial: <filter> 
           Mobo: Gigabyte model: X570 AORUS ULTRA serial: <filter> UEFI: American Megatrends LLC. v: F33 date: 05/21/2021 
CPU:       Info: 8-Core model: AMD Ryzen 7 3700X bits: 64 type: MT MCP cache: L2: 4 MiB 
           Speed: 2728 MHz min/max: 2200/3600 MHz Core speeds (MHz): 1: 2728 2: 2053 3: 2050 4: 2051 5: 2194 6: 2195 7: 2193 
           8: 2198 9: 2053 10: 2053 11: 2193 12: 2191 13: 2196 14: 2192 15: 3591 16: 2053 
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Navi 10 [Radeon RX 5600 OEM/5600 XT / 5700/5700 XT] driver: amdgpu 
           v: kernel 
           Display: server: X.Org 1.20.11 driver: loaded: amdgpu unloaded: fbdev,modesetting,vesa resolution: 2560x1440~60Hz 
           OpenGL: renderer: llvmpipe (LLVM 11.1.0 256 bits) v: 4.5 Mesa 21.1.1

With this mainboard I need acpi_enforce_resources=lax to see the fan speed the lm_sensors.

Does anybody have the same issue? Any ida how to see the fan speed even without this kernel parameter?

1 Like

I just realized it happens also without acpi_enforce_resources=lax

journal
Jun 05 22:11:22 rakete kernel: [drm] free PSP TMR buffer
Jun 05 22:11:49 rakete kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000000000).
Jun 05 22:11:49 rakete kernel: [drm] PSP is resuming...
Jun 05 22:11:49 rakete kernel: [drm] reserve 0x900000 from 0x800f400000 for PSP TMR
Jun 05 22:11:49 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jun 05 22:11:49 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jun 05 22:11:49 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Jun 05 22:11:49 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: SMU is resuming...
Jun 05 22:11:49 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: smu driver if version = 0x00000036, smu fw if version = 0x00000037, smu fw version = 0x002a3f00 (42.63.0)
Jun 05 22:11:49 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: SMU driver if version not matched
Jun 05 22:11:51 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: message: EnableAllSmuFeatures (6)         param: 0x00000000 is timeout (no response)
Jun 05 22:11:51 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: Failed to enable requested dpm features!
Jun 05 22:11:51 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: Failed to setup smc hw!
Jun 05 22:11:51 rakete kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <smu> failed -62
Jun 05 22:11:51 rakete kernel: amdgpu 0000:0e:00.0: amdgpu: amdgpu_device_ip_resume failed (-62).
Jun 05 22:11:51 rakete kernel: snd_hda_intel 0000:0e:00.1: refused to change power state from D3hot to D0
Jun 05 22:11:51 rakete kernel: snd_hda_intel 0000:0e:00.1: CORB reset timeout#2, CORBRP = 65535
Jun 05 22:11:58 rakete kernel: ------------[ cut here ]------------
Jun 05 22:11:58 rakete kernel: WARNING: CPU: 5 PID: 176 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:1859 dm_suspend+0x1b5/0x1d0 [amdgpu]
Jun 05 22:11:58 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg hwmon_vid amdgpu rc_tt_1500 stb6100 isl6423 stb0899 dvb_usb_pctv452e(OE) lzo_rle dvb_usb(OE) intel_rapl_msr intel_rapl_common snd_hda_codec_realtek amd64_edac edac_mce_amd snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi btusb ttpci_eeprom kvm_amd btrtl snd_hda_intel gpu_sched btbcm snd_intel_dspcfg drm_ttm_helper ttm dvb_core btintel snd_intel_sdw_acpi snd_hda_codec drm_kms_helper kvm videobuf2_vmalloc bluetooth videobuf2_memops snd_hda_core videobuf2_common snd_hwdep videodev snd_pcm cec ecdh_generic rfkill snd_timer syscopyarea ecc sysfillrect irqbypass mc snd crc16 sysimgblt igb wmi_bmof mxm_wmi k10temp i2c_piix4 rapl soundcore fb_sys_fops i2c_algo_bit dca acpi_cpufreq vfat fat drm fuse agpgart zram ip_tables x_tables usbhid zfs(POE) xfs zunicode(POE) zzstd(OE) zlua(OE) libcrc32c crc32c_generic zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE)
Jun 05 22:11:58 rakete kernel:  crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd ccp cryptd rng_core sr_mod xhci_pci cdrom xhci_pci_renesas wmi pinctrl_amd vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) pkcs8_key_parser sg crypto_user
Jun 05 22:11:58 rakete kernel: CPU: 5 PID: 176 Comm: kworker/5:1 Tainted: P           OE     5.12.9-zen1-1-zen #1
Jun 05 22:11:58 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F33 05/21/2021
Jun 05 22:11:58 rakete kernel: Workqueue: pm pm_runtime_work
Jun 05 22:11:58 rakete kernel: RIP: 0010:dm_suspend+0x1b5/0x1d0 [amdgpu]
Jun 05 22:11:58 rakete kernel: Code: ff 31 d2 4c 89 e6 4c 89 ff e8 a7 8f 17 00 83 f8 01 74 1e 89 c2 48 c7 c6 20 31 55 c2 48 c7 c7 90 db 5f c2 e8 2d f3 32 fe eb c2 <0f> 0b e9 80 fe ff ff 4c 89 e6 4c 89 ff e8 19 7c 16 00 eb ae e8 62
Jun 05 22:11:58 rakete kernel: RSP: 0018:ffffa047407cfca0 EFLAGS: 00010286
Jun 05 22:11:58 rakete kernel: RAX: 0000000000000000 RBX: ffff8ffbf25f5450 RCX: 0000000000000000
Jun 05 22:11:58 rakete kernel: RDX: 000000000000000a RSI: 000000000005a38c RDI: ffff8ffbf25e0000
Jun 05 22:11:58 rakete kernel: RBP: ffff8ffbf25e0000 R08: 0000000001c0ca00 R09: ffff8ffbc86534ac
Jun 05 22:11:58 rakete kernel: R10: 0000000000000003 R11: 0000000000000000 R12: ffff8ffbf25e0000
Jun 05 22:11:58 rakete kernel: R13: ffff8ffbc18470c8 R14: 0000000000000000 R15: ffff8ffbc18470c8
Jun 05 22:11:58 rakete kernel: FS:  0000000000000000(0000) GS:ffff900abeb40000(0000) knlGS:0000000000000000
Jun 05 22:11:58 rakete kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 05 22:11:58 rakete kernel: CR2: 000055e457002f50 CR3: 0000000143308000 CR4: 0000000000350ee0
Jun 05 22:11:58 rakete kernel: Call Trace:
Jun 05 22:11:58 rakete kernel:  ? smuio_v11_0_update_rom_clock_gating+0x2c/0x70 [amdgpu]
Jun 05 22:11:58 rakete kernel:  amdgpu_device_ip_suspend_phase1+0x6b/0xc0 [amdgpu]
Jun 05 22:11:58 rakete kernel:  amdgpu_device_suspend+0x4d/0xa0 [amdgpu]
Jun 05 22:11:58 rakete kernel:  amdgpu_pmops_runtime_suspend+0x95/0x140 [amdgpu]
Jun 05 22:11:58 rakete kernel:  pci_pm_runtime_suspend+0x5e/0x170
Jun 05 22:11:58 rakete kernel:  ? pci_dev_put+0x20/0x20
Jun 05 22:11:58 rakete kernel:  __rpm_callback+0x7b/0x130
Jun 05 22:11:58 rakete kernel:  ? pci_dev_put+0x20/0x20
Jun 05 22:11:58 rakete kernel:  rpm_suspend+0x45a/0x9c0
Jun 05 22:11:58 rakete kernel:  pm_runtime_work+0x94/0xa0
Jun 05 22:11:58 rakete kernel:  process_one_work+0x214/0x3e0
Jun 05 22:11:58 rakete kernel:  worker_thread+0x4d/0x470
Jun 05 22:11:58 rakete kernel:  ? process_one_work+0x3e0/0x3e0
Jun 05 22:11:58 rakete kernel:  kthread+0x181/0x1b0
Jun 05 22:11:58 rakete kernel:  ? __kthread_init_worker+0x50/0x50
Jun 05 22:11:58 rakete kernel:  ret_from_fork+0x22/0x30
Jun 05 22:11:58 rakete kernel: ---[ end trace 1b9b07cea82aa9a4 ]---
Jun 05 22:11:58 rakete kernel: general protection fault, probably for non-canonical address 0xa8c0a40e084dc018: 0000 [#1] PREEMPT SMP NOPTI
Jun 05 22:11:58 rakete kernel: CPU: 5 PID: 176 Comm: kworker/5:1 Tainted: P        W  OE     5.12.9-zen1-1-zen #1
Jun 05 22:11:58 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F33 05/21/2021
Jun 05 22:11:58 rakete kernel: Workqueue: pm pm_runtime_work
Jun 05 22:11:58 rakete kernel: RIP: 0010:free_mqd_hiq_sdma+0x5/0x20 [amdgpu]
Jun 05 22:11:58 rakete kernel: Code: 00 48 01 d1 48 89 48 18 49 8b 88 08 02 00 00 48 01 d1 48 89 48 08 49 03 90 10 02 00 00 48 89 50 10 5b 5d c3 90 0f 1f 44 00 00 <48> 83 7a 18 00 48 89 d7 74 05 e9 1c 68 8e cf 0f 0b e9 15 68 8e cf
Jun 05 22:11:58 rakete kernel: RSP: 0018:ffffa047407cfcf8 EFLAGS: 00010216
Jun 05 22:11:58 rakete kernel: RAX: ffffffffc22570b0 RBX: ffff8ffbf5b8e400 RCX: 0000000080800079
Jun 05 22:11:58 rakete kernel: RDX: a8c0a40e084dc018 RSI: 0200040600080100 RDI: ffff8ffbf2944c80
Jun 05 22:11:58 rakete kernel: RBP: ffff8ffbf2b829c0 R08: 0000000000000001 R09: 0000000000000001
Jun 05 22:11:58 rakete kernel: R10: 000000000019c700 R11: 000000000019c7e0 R12: ffff8ffbf5b8e4d0
Jun 05 22:11:58 rakete kernel: R13: ffff8ffbc18470c8 R14: 0000000000000000 R15: ffff8ffbc18470c8
Jun 05 22:11:58 rakete kernel: FS:  0000000000000000(0000) GS:ffff900abeb40000(0000) knlGS:0000000000000000
Jun 05 22:11:58 rakete kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 05 22:11:58 rakete kernel: CR2: 000055e457002f50 CR3: 0000000143308000 CR4: 0000000000350ee0
Jun 05 22:11:58 rakete kernel: Call Trace:
Jun 05 22:11:58 rakete kernel:  kernel_queue_uninit+0x33/0xe0 [amdgpu]
Jun 05 22:11:58 rakete kernel:  stop_cpsch+0xa0/0xc0 [amdgpu]
Jun 05 22:11:58 rakete kernel:  kgd2kfd_suspend+0x38/0x50 [amdgpu]
Jun 05 22:11:58 rakete kernel:  amdgpu_device_suspend+0x8c/0xa0 [amdgpu]
Jun 05 22:11:58 rakete kernel:  amdgpu_pmops_runtime_suspend+0x95/0x140 [amdgpu]
Jun 05 22:11:58 rakete kernel:  pci_pm_runtime_suspend+0x5e/0x170
Jun 05 22:11:58 rakete kernel:  ? pci_dev_put+0x20/0x20
Jun 05 22:11:58 rakete kernel:  __rpm_callback+0x7b/0x130
Jun 05 22:11:58 rakete kernel:  ? pci_dev_put+0x20/0x20
Jun 05 22:11:58 rakete kernel:  rpm_suspend+0x45a/0x9c0
Jun 05 22:11:58 rakete kernel:  pm_runtime_work+0x94/0xa0
Jun 05 22:11:58 rakete kernel:  process_one_work+0x214/0x3e0
Jun 05 22:11:58 rakete kernel:  worker_thread+0x4d/0x470
Jun 05 22:11:58 rakete kernel:  ? process_one_work+0x3e0/0x3e0
Jun 05 22:11:58 rakete kernel:  kthread+0x181/0x1b0
Jun 05 22:11:58 rakete kernel:  ? __kthread_init_worker+0x50/0x50
Jun 05 22:11:58 rakete kernel:  ret_from_fork+0x22/0x30
Jun 05 22:11:58 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg hwmon_vid amdgpu rc_tt_1500 stb6100 isl6423 stb0899 dvb_usb_pctv452e(OE) lzo_rle dvb_usb(OE) intel_rapl_msr intel_rapl_common snd_hda_codec_realtek amd64_edac edac_mce_amd snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi btusb ttpci_eeprom kvm_amd btrtl snd_hda_intel gpu_sched btbcm snd_intel_dspcfg drm_ttm_helper ttm dvb_core btintel snd_intel_sdw_acpi snd_hda_codec drm_kms_helper kvm videobuf2_vmalloc bluetooth videobuf2_memops snd_hda_core videobuf2_common snd_hwdep videodev snd_pcm cec ecdh_generic rfkill snd_timer syscopyarea ecc sysfillrect irqbypass mc snd crc16 sysimgblt igb wmi_bmof mxm_wmi k10temp i2c_piix4 rapl soundcore fb_sys_fops i2c_algo_bit dca acpi_cpufreq vfat fat drm fuse agpgart zram ip_tables x_tables usbhid zfs(POE) xfs zunicode(POE) zzstd(OE) zlua(OE) libcrc32c crc32c_generic zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE)
Jun 05 22:11:58 rakete kernel:  crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd ccp cryptd rng_core sr_mod xhci_pci cdrom xhci_pci_renesas wmi pinctrl_amd vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) pkcs8_key_parser sg crypto_user
Jun 05 22:11:58 rakete kernel: ---[ end trace 1b9b07cea82aa9a5 ]---
Jun 05 22:11:58 rakete kernel: RIP: 0010:free_mqd_hiq_sdma+0x5/0x20 [amdgpu]
Jun 05 22:11:58 rakete kernel: Code: 00 48 01 d1 48 89 48 18 49 8b 88 08 02 00 00 48 01 d1 48 89 48 08 49 03 90 10 02 00 00 48 89 50 10 5b 5d c3 90 0f 1f 44 00 00 <48> 83 7a 18 00 48 89 d7 74 05 e9 1c 68 8e cf 0f 0b e9 15 68 8e cf
Jun 05 22:11:58 rakete kernel: RSP: 0018:ffffa047407cfcf8 EFLAGS: 00010216
Jun 05 22:11:58 rakete kernel: RAX: ffffffffc22570b0 RBX: ffff8ffbf5b8e400 RCX: 0000000080800079
Jun 05 22:11:58 rakete kernel: RDX: a8c0a40e084dc018 RSI: 0200040600080100 RDI: ffff8ffbf2944c80
Jun 05 22:11:58 rakete kernel: RBP: ffff8ffbf2b829c0 R08: 0000000000000001 R09: 0000000000000001
Jun 05 22:11:58 rakete kernel: R10: 000000000019c700 R11: 000000000019c7e0 R12: ffff8ffbf5b8e4d0
Jun 05 22:11:58 rakete kernel: R13: ffff8ffbc18470c8 R14: 0000000000000000 R15: ffff8ffbc18470c8
Jun 05 22:11:58 rakete kernel: FS:  0000000000000000(0000) GS:ffff900abeb40000(0000) knlGS:0000000000000000
Jun 05 22:11:58 rakete kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 05 22:11:58 rakete kernel: CR2: 000055e457002f50 CR3: 0000000143308000 CR4: 0000000000350ee0

I’ve been having some very strange lagging issues where a [gfx] (kernel?) process appears in top and takes up some CPU time.

I haven’t seen traces like above (yet?) though I have seen things like:

Jun 06 01:14:30 strix kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!