Hi,
I’m looking for some advice on troubleshooting a GPU that seems to be hard resetting my system at random intervals. When the system resets, the VGA light on my motherboard lights up. That is pretty much the only lead I have to work with.
Resets happen whether the system is idle or under load. They were more frequent in certain games, and the common factor is that those games were made with the Unity engine.
Otherwise it doesn’t really seem to matter how demanding the titles are. Temperatures are within normal operating limits, peaking in the 60s (°C).
I’ve been looking over the journal, but I don’t see anything relevant at the time the system resets.
It might be that the GPU is just failing, but I’d like to rule out any other possibilities first.
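For reference, here is a minimal sketch of the kind of journal check described above. It assumes journald keeps persistent logs, so the previous boot is still available, and it only reads kernel messages. The keyword list and the function names (`gpu_related`, `previous_boot_kernel_log`) are illustrative assumptions, not a definitive list of what a GPU fault would log. A hard reset can also cut the log off before anything is written, so an empty result does not rule the GPU out.

```python
import subprocess

# Assumed keywords; a real GPU or PCIe fault may be logged under other terms.
GPU_KEYWORDS = ("amdgpu", "drm", "gpu", "pcie", "mce")


def gpu_related(lines, keywords=GPU_KEYWORDS):
    """Return the lines that contain any keyword, case-insensitively."""
    hits = []
    for line in lines:
        lowered = line.lower()
        if any(keyword in lowered for keyword in keywords):
            hits.append(line)
    return hits


def previous_boot_kernel_log():
    """Read kernel messages from the previous boot via journalctl."""
    result = subprocess.run(
        ["journalctl", "-k", "-b", "-1", "--no-pager"],
        capture_output=True,
        text=True,
        check=False,  # do not raise if no previous boot is recorded
    )
    return result.stdout.splitlines()


if __name__ == "__main__":
    # Show only the last matching lines, closest to the reset.
    for line in gpu_related(previous_boot_kernel_log())[-40:]:
        print(line)
```

Run right after a reset, this prints the last GPU-related kernel messages from the boot that ended in the reset, if any were flushed to disk.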
System:
Host: McTherodin Kernel: 6.12.10-arch1-1 arch: x86_64 bits: 64
Desktop: KDE Plasma v: 6.2.5 Distro: EndeavourOS
Machine:
Type: Desktop Mobo: ASUSTeK model: TUF GAMING X570-PLUS (WI-FI) v: Rev X.0x
serial: <superuser required> UEFI: American Megatrends v: 5013
date: 03/22/2024
CPU:
Info: 6-core model: AMD Ryzen 5 5600X bits: 64 type: MT MCP cache: L2: 3 MiB
Speed (MHz): avg: 1734 min/max: 550/4651 cores: 1: 1734 2: 1734 3: 1734
4: 1734 5: 1734 6: 1734 7: 1734 8: 1734 9: 1734 10: 1734 11: 1734 12: 1734
Graphics:
Device-1: Advanced Micro Devices [AMD/ATI] Navi 21 [Radeon RX 6800/6800 XT
/ 6900 XT] driver: amdgpu v: kernel
Display: wayland server: X.org v: 1.21.1.15 with: Xwayland v: 24.1.4
compositor: kwin_wayland driver: X: loaded: amdgpu
unloaded: modesetting,radeon dri: radeonsi gpu: amdgpu
resolution: 1920x1080~75Hz
API: EGL v: 1.5 drivers: kms_swrast,radeonsi,swrast
platforms: gbm,wayland,x11,surfaceless,device
API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.3.3-arch1.2
renderer: AMD Radeon RX 6800 XT (radeonsi navi21 LLVM 19.1.6 DRM 3.59
6.12.10-arch1-1)
API: Vulkan v: 1.4.303 drivers: N/A surfaces: xcb,xlib,wayland
Info: Tools: api: clinfo, eglinfo, glxinfo, vulkaninfo
de: kscreen-console,kscreen-doctor gpu: corectrl wl: wayland-info
x11: xdpyinfo, xprop, xrandr
Audio:
Device-1: Advanced Micro Devices [AMD/ATI] Navi 21/23 HDMI/DP Audio
driver: snd_hda_intel
Device-2: Advanced Micro Devices [AMD] Starship/Matisse HD Audio
driver: snd_hda_intel
API: ALSA v: k6.12.10-arch1-1 status: kernel-api
Server-1: PipeWire v: 1.2.7 status: active
Network:
Device-1: Intel Wi-Fi 5 Wireless-AC 9x6x [Thunder Peak] driver: iwlwifi
IF: wlan0 state: down mac: 02:c1:31:dc:86:9d
Device-2: Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet
driver: r8169
IF: enp5s0 state: up speed: 100 Mbps duplex: full mac: 24:4b:fe:96:3b:96
Bluetooth:
Device-1: Intel Wireless-AC 9260 Bluetooth Adapter driver: btusb type: USB
Report: btmgmt ID: hci0 rfk-id: 0 state: down bt-service: disabled
rfk-block: hardware: no software: no address: N/A
Drives:
Local Storage: total: 1.82 TiB used: 918.04 GiB (49.3%)
ID-1: /dev/nvme0n1 vendor: Crucial model: CT1000P3SSD8 size: 931.51 GiB
ID-2: /dev/sda vendor: Western Digital model: WD10EZEX-00WN4A0
size: 931.51 GiB
Partition:
ID-1: / size: 914.81 GiB used: 754.81 GiB (82.5%) fs: ext4 dev: /dev/dm-0
Swap:
ID-1: swap-1 type: file size: 512 MiB used: 511.8 MiB (100.0%)
file: /swapfile
Sensors:
System Temperatures: cpu: 41.9 C mobo: N/A gpu: amdgpu temp: 47.0 C
Fan Speeds (rpm): N/A gpu: amdgpu fan: 0
Info:
Memory: total: 32 GiB available: 31.25 GiB used: 7.78 GiB (24.9%)
Processes: 401 Uptime: 15h 10m Shell: Zsh inxi: 3.3.37