Game Crashes - RX 6800 XT - amdgpu: [gfxhub] page fault

I’ve been experiencing crashes with select titles, currently Sons of the Forest and Monster Hunter Wilds. I do note that Monster Hunter is currently unstable.

Running stress tests, the system remains stable.

System information: https://0x0.st/8MOv.txt

Boot: https://0x0.st/8MOx.txt

The journal output:

Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:5 pasid:32801)
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process MonsterHunterWi pid 5819 thread vkd3d_queue pid 5970
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x00008000e4400000 from client 0x1b (UTCL2)
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00501430
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:53:49 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:00 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: ring gfx_0.0.0 timeout, signaled seq=1046646, emitted seq=1046648
Mar 03 14:54:00 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: Process information: process MonsterHunterWi pid 5819 thread vkd3d_queue pid 5873
Mar 03 14:54:00 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: Starting gfx_0.0.0 ring reset
Mar 03 14:54:00 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: Ring gfx_0.0.0 reset failure
Mar 03 14:54:01 McTherodin kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Mar 03 14:54:01 McTherodin kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Mar 03 14:54:01 McTherodin systemd-coredump[6239]: [🡕] Process 1631 (Xwayland) of user 1000 dumped core.

                                                   Stack trace of thread 1638:
                                                   #0  0x000073ea8bf055e7 n/a (n/a + 0x0)
                                                   #1  0x000073ea893e20e3 n/a (n/a + 0x0)
                                                   #2  0x000073ea893e55b3 n/a (n/a + 0x0)
                                                   #3  0x000073ea88edd8a4 n/a (n/a + 0x0)
                                                   #4  0x000073ea88f1271d n/a (n/a + 0x0)
                                                   #5  0x000073ea8bf7570a n/a (n/a + 0x0)
                                                   #6  0x000073ea8bff9aac n/a (n/a + 0x0)
                                                   ELF object binary architecture: AMD x86-64
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301430
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:01 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:02 McTherodin systemd-coredump[6237]: [🡕] Process 2091 (corectrl) of user 1000 dumped core.

                                                   Stack trace of thread 2508:
                                                   #0  0x000074a335a335e7 n/a (n/a + 0x0)
                                                   #1  0x000074a319fe20e3 n/a (n/a + 0x0)
                                                   #2  0x000074a319fe55b3 n/a (n/a + 0x0)
                                                   #3  0x000074a319add8a4 n/a (n/a + 0x0)
                                                   #4  0x000074a319b1271d n/a (n/a + 0x0)
                                                   #5  0x000074a335aa370a n/a (n/a + 0x0)
                                                   #6  0x000074a335b27aac n/a (n/a + 0x0)
                                                   ELF object binary architecture: AMD x86-64
Mar 03 14:54:04 McTherodin systemd-coredump[6240]: [🡕] Process 1778 (plasmashell) of user 1000 dumped core.
Mar 03 14:54:04 McTherodin systemd-coredump[6272]: [🡕] Process 3994 (Discord) of user 1000 dumped core.

                                                   Module discord_zstd.node without build-id.
                                                   Module notificationstate.node without build-id.
                                                   Stack trace of thread 171:
                                                   #0  0x00007d36152fedb4 n/a (n/a + 0x0)
                                                   #1  0x00007d36152a608e n/a (n/a + 0x0)
                                                   #2  0x00007d361528d882 n/a (n/a + 0x0)
                                                   #3  0x00007d358bfc7ca3 _ZSt11__terminatePFvvE (discord_utils.node + 0xbdca3)
                                                   #4  0x00007d358bfc7696 _ZN10__cxxabiv1L12failed_throwEPNS_15__cxa_exceptionE (discord_utils.node + 0xbd696)
                                                   #5  0x00007d358bfc762f __cxa_throw (discord_utils.node + 0xbd62f)
                                                   #6  0x00007d358bfc50eb _ZNSt4__Cr20__throw_system_errorEiPKc (discord_utils.node + 0xbb0eb)
                                                   #7  0x00007d358bfc51a3 _ZNSt4__Cr6thread4joinEv (discord_utils.node + 0xbb1a3)
                                                   #8  0x00007d358bf892b8 _ZN7discord2uv17ThreadedEventLoop8ShutdownEv (discord_utils.node + 0x7f2b8)
                                                   #9  0x00007d358bf20db7 _ZNSt4__Cr14__shared_count16__release_sharedB7v160000Ev (discord_utils.node + 0x16db>
                                                   #10 0x00007d36152a87e1 n/a (n/a + 0x0)
                                                   #11 0x00007d36152a88ae n/a (n/a + 0x0)
                                                   #12 0x00007d36157c3448 n/a (n/a + 0x0)
                                                   #13 0x00007d36157c378c n/a (n/a + 0x0)
                                                   #14 0x00007d36157c0aad n/a (n/a + 0x0)
                                                   #15 0x00007d36157b0e41 n/a (n/a + 0x0)
                                                   #16 0x00007d358bf84256 _ZN7discord5inputL15OnDisplayFDReadEP9uv_poll_sii (discord_utils.node + 0x7a256)
                                                   #17 0x000058b19fd16177 n/a (n/a + 0x0)
                                                   #18 0x000058b19fd05c15 n/a (n/a + 0x0)
                                                   #19 0x00007d358bf891d9 _ZN7discord2uv17ThreadedEventLoop10ThreadMainEPKcNSt4__Cr7promiseIvEE (discord_utils>
                                                   #20 0x00007d358bf89404 _ZNSt4__Cr14__thread_proxyB7v160000INS_5tupleIJNS_10unique_ptrINS_15__thread_structE>
                                                   #21 0x00007d36152fce0e n/a (n/a + 0x0)
                                                   #22 0x00007d36153817d4 n/a (n/a + 0x0)
                                                   ELF object binary architecture: AMD x86-64
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301431
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x1
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301431
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x1
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301430
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301430
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301430
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301430
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:12 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: ring gfx_0.1.0 timeout, but soft recovered
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: ring gfx_0.1.0 timeout, but soft recovered
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301430
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:22 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: ring gfx_0.0.0 timeout, but soft recovered
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: ring gfx_0.1.0 timeout, but soft recovered
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32772)
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:  in process kwin_wayland pid 1525 thread kwin_wayla:cs0 pid 1575
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:   in page starting at address 0x0000800010000000 from client 0x1b (UTCL2)
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301430
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          WALKER_ERROR: 0x0
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          MAPPING_ERROR: 0x0
Mar 03 14:54:32 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu:          RW: 0x0
Mar 03 14:54:43 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: ring gfx_0.0.0 timeout, signaled seq=1046657, emitted seq=1046660
Mar 03 14:54:43 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: Process information: process plasmashell pid 6328 thread plasmashel:cs0 pid 6364
Mar 03 14:54:43 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: Starting gfx_0.0.0 ring reset
Mar 03 14:54:43 McTherodin kernel: amdgpu 0000:0b:00.0: amdgpu: Ring gfx_0.0.0 reset failure
Mar 03 14:54:44 McTherodin kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Mar 03 14:54:46 McTherodin systemd-coredump[6523]: [🡕] Process 6328 (plasmashell) of user 1000 dumped core.

                                                   Stack trace of thread 6364:
                                                   #0  0x000074f1a2c335e7 n/a (n/a + 0x0)
                                                   #1  0x000074f198de20e3 n/a (n/a + 0x0)
                                                   #2  0x000074f198de55b3 n/a (n/a + 0x0)
                                                   #3  0x000074f1988dd8a4 n/a (n/a + 0x0)
                                                   #4  0x000074f19891271d n/a (n/a + 0x0)
                                                   #5  0x000074f1a2ca370a n/a (n/a + 0x0)
                                                   #6  0x000074f1a2d27aac n/a (n/a + 0x0)
                                                   ELF object binary architecture: AMD x86-64

Any assistance would be greatly appreciated.

Probably has to do with some game platform settings. ex… vram, frame gen, motion blur, AMD FSR

Running the game at its lowest setting without frame generation still results in the above mentioned crash.
Vram usage is around 30-40% at most.

The crashes do appear to be more frequent during multiplayer.

@scannerdarkly

According to ProtonDB - Sons Of The Forest it looks like it works fine on Proton 9.0-4 with nVidea but for AMD you may have to change the game to Proton Experimental for AMD, not sure if you have tried that yet?

With regards to ProtonDB - Monster Hunter Wilds it just looks incredibly unstable and isn’t really working well for anyone, but again you could try switching to Proton Experimental if you haven’t already.

My first stop is usually ProtonDB. I’ve gone through the Proton versions and also tried GloriousEggroll’s releases. I remove the old prefix before trying a new version.

I’ve been looking at similar reports:

Similar issues

Power and boost clocks

The above threads concerning power and boost clocks seem to suggest a possible cause.

I am experiencing this same issue with Doom: The Dark Ages, and Assassin’s Creed: Valhalla. But I can use my PC regularly all day no problem. Its only been the past 2 months or so I have noticed it. As a side note I played For the King 2 for almost 8 hours one day and it was perfectly stable. Very frustrating.

What has helped my setup with certain titles (Baldur’s Gate 3) is to set my GPU’s power profile to Power Saving with Corectrl. I go from having crashes consistently every 10-15min, to no crashes at all.

I’ve recently switched to GE-Proton 10-1 for Monster Hunter Wilds and enabled Wine-Wayland, and the results were pretty promising. Still need more testing, but the crashes might have been solved.

I’ve also added gpu_recovery=1 and lockup_timeout=1000 to my kernel parameters, although I don’t believe I’ve run into a situation where they’ve been triggered just yet. No crashes so far.

But yeah, it’s pretty difficult to troubleshoot since it can appear quite random and present as either a hardware failure or power supply issue.
I’ve undervolted my card and run stress tests without experiencing any issues. Also used frame generation technologies with certain titles and everything worked fine.

1 Like