Crashes since 6.14.11 Asus ROG Strix G513 laptop amd 6800h, NV-3060

I’ve been having system instability for a long time now (since 6.14.11 I believe) and I’m very tired of it. I’m currently using the LTS kernel to use my EOS install but other kernels crash randomly. The crashes leave little to no journalctl logs or anything for me to go on. Kdumpst doesn’t seem to work correctly even though I believe I’ve configured it correctly. I’ll get random restarts as well as full kernel panics. any help would be appreciated.

CPU: 8-core AMD Ryzen 7 6800H with Radeon Graphics (-MT MCP-)
speed/min/max: 1096/404/4787 MHz
Mem: 2.84/30.6 GiB (9.3%) Storage: 3.64 TiB (9.1% used) Procs: 378

Nvidia 3060 Laptop gpu (Max-Q)

can you report

inxi -Fza 
cpupower frequency-info
sudo lscpu -ae 
1 Like

System:
Kernel: 6.16.4-arch1-1 arch: x86_64 bits: 64 compiler: gcc v: 15.2.1
clocksource: tsc avail: acpi_pm parameters: BOOT_IMAGE=/boot/vmlinuz-linux
root=UUID=937412a2-99da-4e21-866a-fc1587c19f4e rw nowatchdog
nvme_load=YES resume=UUID=3467e73f-9093-45dc-8ce3-62e11d17e366
nvidia_drm.modeset=1 loglevel=3 crashkernel=auto iommu=pt
Desktop: GNOME v: 48.4 tk: GTK v: 3.24.50 wm: gnome-shell
tools: gsd-screensaver-proxy dm: GDM v: 48.0 Distro: EndeavourOS
base: Arch Linux
Machine:
Type: Laptop System: ASUSTeK product: ROG Strix G513RM_G513RM v: 1.0
serial:
Mobo: ASUSTeK model: G513RM v: 1.0 serial:
uuid: UEFI: American Megatrends LLC. v: G513RM.327
date: 02/16/2023
Battery:
ID-1: BAT0 charge: 68.4 Wh (99.0%) condition: 69.1/90.0 Wh (76.8%)
power: 10.6 W volts: 16.8 min: 15.9 model: AS3GWAF3KC GA50358 type: Unknown
serial: status: charging
Device-1: hidpp_battery_0 model: Logitech Wireless Mouse MX Master 3
serial: charge: 100% (should be ignored) rechargeable: yes
status: discharging
CPU:
Info: model: AMD Ryzen 7 6800H with Radeon Graphics bits: 64 type: MT MCP
arch: Zen 3+ gen: 3 level: v3 note: check built: 2022 process: TSMC n6 (7nm)
family: 0x19 (25) model-id: 0x44 (68) stepping: 1 microcode: 0xA404108
Topology: cpus: 1x dies: 1 clusters: 1 cores: 8 threads: 16 tpc: 2
smt: enabled cache: L1: 512 KiB desc: d-8x32 KiB; i-8x32 KiB L2: 4 MiB
desc: 8x512 KiB L3: 16 MiB desc: 1x16 MiB
Speed (MHz): avg: 4479 min/max: 404/4787 boost: enabled scaling:
driver: amd-pstate-epp governor: performance cores: 1: 4479 2: 4479 3: 4479
4: 4479 5: 4479 6: 4479 7: 4479 8: 4479 9: 4479 10: 4479 11: 4479 12: 4479
13: 4479 14: 4479 15: 4479 16: 4479 bogomips: 102204
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3
Vulnerabilities:
Type: gather_data_sampling status: Not affected
Type: ghostwrite status: Not affected
Type: indirect_target_selection status: Not affected
Type: itlb_multihit status: Not affected
Type: l1tf status: Not affected
Type: mds status: Not affected
Type: meltdown status: Not affected
Type: mmio_stale_data status: Not affected
Type: old_microcode status: Not affected
Type: reg_file_data_sampling status: Not affected
Type: retbleed status: Not affected
Type: spec_rstack_overflow mitigation: Safe RET
Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
prctl
Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
sanitization
Type: spectre_v2 mitigation: Retpolines; IBPB: conditional; IBRS_FW;
STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not
affected
Type: srbds status: Not affected
Type: tsa mitigation: Clear CPU buffers
Type: tsx_async_abort status: Not affected
Graphics:
Device-1: NVIDIA GA106M [GeForce RTX 3060 Mobile / Max-Q] vendor: ASUSTeK
driver: nvidia v: 580.76.05 alternate: nouveau,nvidia_drm
non-free: 550-570.xx+ status: current (as of 2025-04; EOL~2026-12-xx)
arch: Ampere code: GAxxx process: TSMC n7 (7nm) built: 2020-2023 pcie:
gen: 1 speed: 2.5 GT/s lanes: 8 link-max: gen: 4 speed: 16 GT/s lanes: 16
ports: active: none empty: DP-1,HDMI-A-1,eDP-1 bus-ID: 01:00.0
chip-ID: 10de:2520 class-ID: 0300
Device-2: Advanced Micro Devices [AMD/ATI] Rembrandt [Radeon 680M]
vendor: ASUSTeK driver: amdgpu v: kernel arch: RDNA-2 code: Navi-2x
process: TSMC n7 (7nm) built: 2020-22 pcie: gen: 4 speed: 16 GT/s
lanes: 16 ports: active: eDP-2 empty: DP-2, DP-3, DP-4, DP-5, DP-6,
Writeback-1 bus-ID: 06:00.0 chip-ID: 1002:1681 class-ID: 0300 temp: 47.0 C
Display: wayland server: X.org v: 1.21.1.18 with: Xwayland v: 24.1.8
compositor: gnome-shell driver: gpu: amdgpu display-ID: 0
Monitor-1: eDP-2 model: ChiMei InnoLux 0x1540 built: 2020 res: 2560x1440
dpi: 189 gamma: 1.2 size: 344x193mm (13.54x7.6") diag: 394mm (15.5")
ratio: 16:9 modes: max: 2560x1440 min: 640x480
API: EGL v: 1.5 hw: drv: nvidia drv: amd radeonsi platforms: device: 0
drv: nvidia device: 1 drv: radeonsi device: 3 drv: swrast gbm:
drv: kms_swrast surfaceless: drv: nvidia wayland: drv: radeonsi x11:
drv: radeonsi inactive: device-2
API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: amd mesa v: 25.2.1-arch1.4
glx-v: 1.4 direct-render: yes renderer: AMD Radeon 680M (radeonsi rembrandt
LLVM 20.1.8 DRM 3.64 6.16.4-arch1-1) device-ID: 1002:1681 memory: 500 MiB
unified: no display-ID: :0.0
Info: Tools: api: eglinfo,glxinfo gpu: nvidia-smi
x11: xdpyinfo, xprop, xrandr
Audio:
Device-1: NVIDIA GA106 High Definition Audio vendor: ASUSTeK
driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 8
link-max: lanes: 16 bus-ID: 01:00.1 chip-ID: 10de:228e class-ID: 0403
Device-2: Advanced Micro Devices [AMD] Audio Coprocessor vendor: ASUSTeK
driver: snd_pci_acp6x v: kernel alternate: snd_pci_acp3x, snd_rn_pci_acp3x,
snd_pci_acp5x, snd_acp_pci, snd_rpl_pci_acp6x, snd_pci_ps,
snd_sof_amd_renoir, snd_sof_amd_rembrandt, snd_sof_amd_vangogh,
snd_sof_amd_acp63, snd_sof_amd_acp70 pcie: gen: 4 speed: 16 GT/s lanes: 16
bus-ID: 06:00.5 chip-ID: 1022:15e2 class-ID: 0480
Device-3: Advanced Micro Devices [AMD] Family 17h/19h/1ah HD Audio
vendor: ASUSTeK driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s
lanes: 16 bus-ID: 06:00.6 chip-ID: 1022:15e3 class-ID: 0403
API: ALSA v: k6.16.4-arch1-1 status: kernel-api
tools: alsactl,alsamixer,amixer
Server-1: PipeWire v: 1.4.7 status: active with: 1: pipewire-pulse
status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
4: pw-jack type: plugin tools: pactl,pw-cat,pw-cli,wpctl
Network:
Device-1: Intel Wi-Fi 6E AX210/AX1675 2x2 [Typhoon Peak] driver: iwlwifi
v: kernel pcie: gen: 2 speed: 5 GT/s lanes: 1 bus-ID: 03:00.0
chip-ID: 8086:2725 class-ID: 0280
IF: wlan0 state: up mac:
Device-2: Realtek RTL8125 2.5GbE vendor: ASUSTeK driver: r8169 v: kernel
pcie: gen: 2 speed: 5 GT/s lanes: 1 port: e000 bus-ID: 04:00.0
chip-ID: 10ec:8125 class-ID: 0200
IF: enp4s0 state: down mac:
Info: services: NetworkManager, systemd-timesyncd, wpa_supplicant
Bluetooth:
Device-1: Intel AX210 Bluetooth driver: btusb v: 0.8 type: USB rev: 2.0
speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 1-4:3 chip-ID: 8087:0032
class-ID: e001
Report: btmgmt ID: hci0 rfk-id: 0 state: up address: N/A
Drives:
Local Storage: total: 3.64 TiB used: 339.14 GiB (9.1%)
SMART Message: Unable to run smartctl. Root privileges required.
ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Samsung model: SSD 980 PRO 2TB
size: 1.82 TiB block-size: physical: 512 B logical: 512 B speed: 63.2 Gb/s
lanes: 4 tech: SSD serial: fw-rev: 5B2QGXA7 temp: 34.9 C
scheme: GPT
ID-2: /dev/nvme1n1 maj-min: 259:4 vendor: Crucial model: CT2000T500SSD8
size: 1.82 TiB block-size: physical: 512 B logical: 512 B speed: 63.2 Gb/s
lanes: 4 tech: SSD serial: fw-rev: P8CR002 temp: 29.9 C
scheme: GPT
Partition:
ID-1: / raw-size: 1.78 TiB size: 1.76 TiB (98.37%) used: 339.14 GiB (18.9%)
fs: ext4 dev: /dev/nvme0n1p2 maj-min: 259:2
ID-2: /boot/efi raw-size: 2 GiB size: 2 GiB (99.80%) used: 320 KiB (0.0%)
fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:1
Swap:
Kernel: swappiness: 60 (default) cache-pressure: 100 (default) zswap: yes
compressor: zstd max-pool: 20%
ID-1: swap-1 type: partition size: 33.67 GiB used: 0 KiB (0.0%)
priority: -2 dev: /dev/nvme0n1p3 maj-min: 259:3
Sensors:
System Temperatures: cpu: 54.9 C mobo: 41.0 C gpu: amdgpu temp: 46.0 C
Fan Speeds (rpm): cpu: 0
Info:
Memory: total: 32 GiB note: est. available: 30.61 GiB used: 2.81 GiB (9.2%)
Processes: 397 Power: uptime: 8m states: freeze,mem,disk suspend: s2idle
wakeups: 0 hibernate: platform avail: shutdown, reboot, suspend, test_resume
image: 12.19 GiB services: gsd-power, power-profiles-daemon, upowerd
Init: systemd v: 257 default: graphical tool: systemctl
Packages: 1264 pm: pacman pkgs: 1242 libs: 406
tools: gnome-software,pacseek,yay pm: flatpak pkgs: 22 Compilers:
clang: 20.1.8 gcc: 15.2.1 Shell: fish v: 4.0.2 running-in: gnome-terminal
inxi: 3.3.38

cpuower output:

analyzing CPU 7:
driver: amd-pstate-epp
CPUs which run at the same hardware frequency: 7
CPUs which need to have their frequency coordinated by software: 7
energy performance preference: performance
hardware limits: 404 MHz - 4.79 GHz
available cpufreq governors: performance powersave
current policy: frequency should be within 1.10 GHz and 4.79 GHz.
The governor “performance” may decide which speed to use
within this range.
current CPU frequency: 2.04 GHz (asserted by call to kernel)
boost state support:
Supported: yes
Active: yes
amd-pstate limits:
Highest Performance: 166. Maximum Frequency: 4.79 GHz.
Nominal Performance: 111. Nominal Frequency: 3.20 GHz.
Lowest Non-linear Performance: 38. Lowest Non-linear Frequency: 1.10 GHz.
Lowest Performance: 14. Lowest Frequency: 400 MHz.
Preferred Core Support: 1. Preferred Core Ranking: 196.

sudo lscpu -ae
CPU NODE SOCKET CORE L1d:L1i:L2:L3 ONLINE MAXMHZ MINMHZ MHZ
0 0 0 0 0:0:0:0 yes 4787.0820 403.7300 2301.3669
1 0 0 0 0:0:0:0 yes 4787.0820 403.7300 2270.5710
2 0 0 1 1:1:1:0 yes 4787.0820 403.7300 2017.6190
3 0 0 1 1:1:1:0 yes 4787.0820 403.7300 2017.9210
4 0 0 2 2:2:2:0 yes 4787.0820 403.7300 2270.3420
5 0 0 2 2:2:2:0 yes 4787.0820 403.7300 3017.9351
6 0 0 3 3:3:3:0 yes 4787.0820 403.7300 2016.4200
7 0 0 3 3:3:3:0 yes 4787.0820 403.7300 2015.8680
8 0 0 4 4:4:4:0 yes 4787.0820 403.7300 4541.2949
9 0 0 4 4:4:4:0 yes 4787.0820 403.7300 1095.8380
10 0 0 5 5:5:5:0 yes 4787.0820 403.7300 4367.2192
11 0 0 5 5:5:5:0 yes 4787.0820 403.7300 1095.8380
12 0 0 6 6:6:6:0 yes 4787.0820 403.7300 2929.2319
13 0 0 6 6:6:6:0 yes 4787.0820 403.7300 2764.2480
14 0 0 7 7:7:7:0 yes 4787.0820 403.7300 2325.4541
15 0 0 7 7:7:7:0 yes 4787.0820 403.7300 1095.8380

*edit for missing entries requested*

1 Like

Please show the full output of command

nvidia-inst --test

Install nvidia-inst if it is missing.

1 Like

pcilib: Error reading /sys/bus/pci/devices/0000:00:08.3/label: Operation not permitted
2025-08-30 23:19:24: Note: 01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GA106M [GeForce RTX 3060 Mobile / Max-Q] [10de:2520] (rev a1) (prog-if 00 [VGA controller])
2025-08-30 23:19:24: Note: Currently installed packages related to Nvidia:
2025-08-30 23:19:24: egl-gbm 1.1.2.1-1
2025-08-30 23:19:24: egl-wayland 4:1.1.20-1
2025-08-30 23:19:24: egl-x11 1.0.3-1
2025-08-30 23:19:24: lib32-nvidia-utils 580.76.05-1
2025-08-30 23:19:24: libvdpau 1.5-3
2025-08-30 23:19:24: linux-firmware-nvidia 20250808-1
2025-08-30 23:19:24: nvidia-hook 1.5.2-1
2025-08-30 23:19:24: nvidia-inst 25.7.2-1
2025-08-30 23:19:24: nvidia-open-dkms 580.76.05-4
2025-08-30 23:19:24: nvidia-prime 1.0-5
2025-08-30 23:19:24: nvidia-utils 580.76.05-4
2025-08-30 23:19:24: nvtop 3.2.0-1
2025-08-30 23:19:24: supergfxctl 5.2.7-2
2025-08-30 23:19:25: Info: nvidia-inst version 25.7.2-1
2025-08-30 23:19:25: Info: Command line: nvidia-inst --test
2025-08-30 23:19:25: Info: Selected mode: nvidia (Nvidia’s open source)
2025-08-30 23:19:25: Info: Installing packages: nvidia-settings
2025-08-30 23:19:25: Info: Removing packages: nvidia-prime

COMMANDS TO RUN:
    pacman -Rs --noconfirm --noprogressbar --nodeps nvidia-prime
    pacman -Syuq --noconfirm --noprogressbar --needed nvidia-settings
1 Like

Thanks for the info.

This line indicates there’s something strange happening.

  • Is your system fully updated?
  • Are there any packages added to the IgnorePkg list in /etc/pacman.conf?
  • Have you installed AUR packages? If so, are they replacing any native packages?

Are there any motherboard firmware/BIOS updates available?

2 Likes

Everything is updated to the latest via yay, latest nvidia-open-dkms, latest mesa and latest linux kernels. I have linux-linux, linux-zen and linux-lts. The only kernel that gives me any remotely stable use is the lts kernel which occasionally will force reboot at GDM login and the run fine, the other kernels reboot at random or panic at random (random gpu/cpu loads both heavy and light). As far as the Bios go for this model laptop the latest is 327 which i have installed. The restarts/crash/panics began before I had any aur packages installed. No packages set to be ignored.

You have 22 flatpaks, are any of them causing problems? And how about AUR packages?

If not, the problem seems hard to figure out. Some potential reasons could be also hardware related.

Maybe eliminating potential reasons one by one might reveal the culprit. For example, blacklisting one GPU driver and checking if system then works any better, etc.

I’ve since done a complete system wipe. Fresh install. Install method for latest nvidia graphics. Still getting forced restarts and panics. The only thing that has changed is I may have got my first helpful journal entry. It seems my problems might be related to amdgpu

Sep 01 14:17:52 ROGStrix-EOS kernel: Oops: general protection fault, probably for non-canonical address 0x1000000000038: 0000 [#1] SMP NOPTI
Sep 01 14:17:52 ROGStrix-EOS kernel: CPU: 10 UID: 0 PID: 121 Comm: kworker/u64:2 Tainted: P           OE       6.16.4-arch1-1 #1 PREEMPT(full)  b08114929c1df8b1436365416f9c912a9cf0a0>
Sep 01 14:17:52 ROGStrix-EOS kernel: Tainted: [P]=PROPRIETARY_MODULE, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
Sep 01 14:17:52 ROGStrix-EOS kernel: Hardware name: ASUSTeK COMPUTER INC. ROG Strix G513RM_G513RM/G513RM, BIOS G513RM.327 02/16/2023
Sep 01 14:17:52 ROGStrix-EOS kernel: Workqueue: gfx_0.1.0 drm_sched_run_job_work [gpu_sched]
Sep 01 14:17:52 ROGStrix-EOS kernel: RIP: 0010:amdgpu_fence_emit+0x239/0x3a0 [amdgpu]
Sep 01 14:17:52 ROGStrix-EOS kernel: Code: c5 4c 89 04 24 e8 77 09 6c d2 4c 8b 04 24 44 8b 4c 24 08 e9 a7 fe ff ff e8 74 32 c0 d1 48 8b 5d 00 48 85 db 0f 84 04 01 00 00 <8b> 53 38 4c>
Sep 01 14:17:52 ROGStrix-EOS kernel: RSP: 0018:ffffd34f4059bd10 EFLAGS: 00010206
Sep 01 14:17:52 ROGStrix-EOS kernel: RAX: 0000000000000001 RBX: 0001000000000000 RCX: 000000000022f120
Sep 01 14:17:52 ROGStrix-EOS kernel: RDX: ffff8f0d012d1b40 RSI: 000000000022f11f RDI: ffff8f0d34898e10
Sep 01 14:17:52 ROGStrix-EOS kernel: RBP: ffff8f0d05e7cba8 R08: 000000000022f11d R09: 0000000000000000
Sep 01 14:17:52 ROGStrix-EOS kernel: R10: ffffd34f41259000 R11: 000000000022f11b R12: ffff8f0d24fbfd20
Sep 01 14:17:52 ROGStrix-EOS kernel: R13: ffff8f0d34880000 R14: ffffd34f4059bdd8 R15: ffff8f0d24fbfd20
Sep 01 14:17:52 ROGStrix-EOS kernel: FS:  0000000000000000(0000) GS:ffff8f1481b9b000(0000) knlGS:0000000000000000
Sep 01 14:17:52 ROGStrix-EOS kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 01 14:17:52 ROGStrix-EOS kernel: CR2: 00007f6cc2fcd228 CR3: 00000001a3424000 CR4: 0000000000f50ef0
Sep 01 14:17:52 ROGStrix-EOS kernel: PKRU: 55555554
Sep 01 14:17:52 ROGStrix-EOS kernel: Call Trace:
Sep 01 14:17:52 ROGStrix-EOS kernel:  <TASK>
Sep 01 14:17:52 ROGStrix-EOS kernel:  amdgpu_ib_schedule+0x3e2/0x690 [amdgpu 83df8000fe08f433647fc1b74500b9ae7a233e63]
Sep 01 14:17:52 ROGStrix-EOS kernel:  amdgpu_job_run+0x8d/0x1f0 [amdgpu 83df8000fe08f433647fc1b74500b9ae7a233e63]
Sep 01 14:17:52 ROGStrix-EOS kernel:  drm_sched_run_job_work+0x1cb/0x3e0 [gpu_sched e08081b5f7c336639423bc710429a32cbeaf547e]
Sep 01 14:17:52 ROGStrix-EOS kernel:  process_one_work+0x193/0x350
Sep 01 14:17:52 ROGStrix-EOS kernel:  worker_thread+0x2d7/0x410
Sep 01 14:17:52 ROGStrix-EOS kernel:  ? __pfx_worker_thread+0x10/0x10
Sep 01 14:17:52 ROGStrix-EOS kernel:  kthread+0xfc/0x240
Sep 01 14:17:52 ROGStrix-EOS kernel:  ? __pfx_kthread+0x10/0x10
Sep 01 14:17:52 ROGStrix-EOS kernel:  ? __pfx_kthread+0x10/0x10
Sep 01 14:17:52 ROGStrix-EOS kernel:  ret_from_fork+0x19a/0x1d0
Sep 01 14:17:52 ROGStrix-EOS kernel:  ? __pfx_kthread+0x10/0x10
Sep 01 14:17:52 ROGStrix-EOS kernel:  ret_from_fork_asm+0x1a/0x30
Sep 01 14:17:52 ROGStrix-EOS kernel:  </TASK>
Sep 01 14:17:52 ROGStrix-EOS kernel: Modules linked in: ccm snd_seq_dummy snd_hrtimer rfcomm snd_seq snd_seq_device nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_>
Sep 01 14:17:52 ROGStrix-EOS kernel:  snd_pci_acp6x btrtl drm_panel_backlight_quirks snd_hwdep intel_rapl_common pps_core drm_buddy snd_pci_acp5x btintel r8169 spd5118 drm_exec snd_r>
Sep 01 14:17:52 ROGStrix-EOS kernel: ---[ end trace 0000000000000000 ]---
Sep 01 14:17:52 ROGStrix-EOS kernel: RIP: 0010:amdgpu_fence_emit+0x239/0x3a0 [amdgpu]
Sep 01 14:17:52 ROGStrix-EOS kernel: Code: c5 4c 89 04 24 e8 77 09 6c d2 4c 8b 04 24 44 8b 4c 24 08 e9 a7 fe ff ff e8 74 32 c0 d1 48 8b 5d 00 48 85 db 0f 84 04 01 00 00 <8b> 53 38 4c>
Sep 01 14:17:52 ROGStrix-EOS kernel: RSP: 0018:ffffd34f4059bd10 EFLAGS: 00010206
Sep 01 14:17:52 ROGStrix-EOS kernel: RAX: 0000000000000001 RBX: 0001000000000000 RCX: 000000000022f120
Sep 01 14:17:52 ROGStrix-EOS kernel: RDX: ffff8f0d012d1b40 RSI: 000000000022f11f RDI: ffff8f0d34898e10
Sep 01 14:17:52 ROGStrix-EOS kernel: RBP: ffff8f0d05e7cba8 R08: 000000000022f11d R09: 0000000000000000
Sep 01 14:17:52 ROGStrix-EOS kernel: R10: ffffd34f41259000 R11: 000000000022f11b R12: ffff8f0d24fbfd20
Sep 01 14:17:52 ROGStrix-EOS kernel: R13: ffff8f0d34880000 R14: ffffd34f4059bdd8 R15: ffff8f0d24fbfd20
Sep 01 14:17:52 ROGStrix-EOS kernel: FS:  0000000000000000(0000) GS:ffff8f1481b9b000(0000) knlGS:0000000000000000
Sep 01 14:17:52 ROGStrix-EOS kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 01 14:17:52 ROGStrix-EOS kernel: CR2: 00007f6cc2fcd228 CR3: 00000001a3424000 CR4: 0000000000f50ef0
Sep 01 14:17:52 ROGStrix-EOS kernel: PKRU: 55555554
Sep 01 14:18:03 ROGStrix-EOS kernel: amdgpu 0000:06:00.0: amdgpu: Dumping IP State
Sep 01 14:18:03 ROGStrix-EOS kernel: amdgpu 0000:06:00.0: amdgpu: Dumping IP State Completed
Sep 01 14:18:03 ROGStrix-EOS kernel: amdgpu 0000:06:00.0: amdgpu: [drm] AMDGPU device coredump file has been created
Sep 01 14:18:03 ROGStrix-EOS kernel: amdgpu 0000:06:00.0: amdgpu: [drm] Check your /sys/class/drm/card2/device/devcoredump/data
Sep 01 14:18:03 ROGStrix-EOS kernel: amdgpu 0000:06:00.0: amdgpu: ring gfx_0.1.0 timeout, signaled seq=9128, emitted seq=9129
Sep 01 14:18:03 ROGStrix-EOS kernel: amdgpu 0000:06:00.0: amdgpu: Process information: process gnome-shell pid 1707 thread gnome-shel:cs0 pid 1741
Sep 01 14:18:03 ROGStrix-EOS kernel: amdgpu 0000:06:00.0: amdgpu: Starting gfx_0.1.0 ring reset

im going to try a few boot parameters I’ve stumbled across and see if it stops the ring timeout. I’m getting very close to ripping this machine apart…

may be downgrade version firmware amdgpu

Maybe adding a kernel parameter to blacklist amdgpu is a way to check that nvidia works as expected.

Alternatively, try using optimus-manager or optimus-manager-git to switch between the gpus. They might work better than other switching apps.