RX 6800 XT random resets

Hi,

I’m looking for some advice on troubleshooting a GPU that seems to be hard resetting my system at random intervals. When the system resets, the VGA light on my motherboard lights up. That is pretty much the only lead I have to work with.

The system may be idle or under load when a reset occurs. Resets did become more frequent with certain games, the common factor being that they were made with the Unity engine.

Otherwise it doesn’t really seem to matter how demanding the titles are. Temperatures are within normal operating limits, never rising above the 60s (°C).

I’ve been looking over the journal, but I don’t see anything relevant at the time the system resets.
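
For anyone retracing this step, here is a minimal sketch of how the previous boot's log could be pulled. It assumes a persistent systemd journal and `journalctl` on the PATH; the helper names `previous_boot_cmd` and `previous_boot_tail` are made up for illustration. A hard reset can cut power before anything is written, so an empty result does not rule out a hardware cause.

```python
import subprocess


def previous_boot_cmd(priority="warning", kernel_only=True):
    """Build a journalctl command for the boot before the current one."""
    # -b -1 selects the previous boot, -p filters by priority and above.
    cmd = ["journalctl", "--no-pager", "-b", "-1", "-p", priority]
    if kernel_only:
        cmd.append("-k")  # kernel messages only, which is where amdgpu logs
    return cmd


def previous_boot_tail(lines=50, **kwargs):
    """Run the command and return the last `lines` lines before the reset."""
    result = subprocess.run(
        previous_boot_cmd(**kwargs), capture_output=True, text=True
    )
    return "\n".join(result.stdout.splitlines()[-lines:])
```

Calling `print(previous_boot_tail())` right after a reset shows whatever the kernel managed to log last; passing `kernel_only=False` widens the search to userspace messages.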

Might be that the GPU is just failing, but I’d like to rule out any other possibilities first.

System:
  Host: McTherodin Kernel: 6.12.10-arch1-1 arch: x86_64 bits: 64
  Desktop: KDE Plasma v: 6.2.5 Distro: EndeavourOS
Machine:
  Type: Desktop Mobo: ASUSTeK model: TUF GAMING X570-PLUS (WI-FI) v: Rev X.0x
    serial: <superuser required> UEFI: American Megatrends v: 5013
    date: 03/22/2024
CPU:
  Info: 6-core model: AMD Ryzen 5 5600X bits: 64 type: MT MCP cache: L2: 3 MiB
  Speed (MHz): avg: 1734 min/max: 550/4651 cores: 1: 1734 2: 1734 3: 1734
    4: 1734 5: 1734 6: 1734 7: 1734 8: 1734 9: 1734 10: 1734 11: 1734 12: 1734
Graphics:
  Device-1: Advanced Micro Devices [AMD/ATI] Navi 21 [Radeon RX 6800/6800 XT
    / 6900 XT] driver: amdgpu v: kernel
  Display: wayland server: X.org v: 1.21.1.15 with: Xwayland v: 24.1.4
    compositor: kwin_wayland driver: X: loaded: amdgpu
    unloaded: modesetting,radeon dri: radeonsi gpu: amdgpu
    resolution: 1920x1080~75Hz
  API: EGL v: 1.5 drivers: kms_swrast,radeonsi,swrast
    platforms: gbm,wayland,x11,surfaceless,device
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.3.3-arch1.2
    renderer: AMD Radeon RX 6800 XT (radeonsi navi21 LLVM 19.1.6 DRM 3.59
    6.12.10-arch1-1)
  API: Vulkan v: 1.4.303 drivers: N/A surfaces: xcb,xlib,wayland
  Info: Tools: api: clinfo, eglinfo, glxinfo, vulkaninfo
    de: kscreen-console,kscreen-doctor gpu: corectrl wl: wayland-info
    x11: xdpyinfo, xprop, xrandr
Audio:
  Device-1: Advanced Micro Devices [AMD/ATI] Navi 21/23 HDMI/DP Audio
    driver: snd_hda_intel
  Device-2: Advanced Micro Devices [AMD] Starship/Matisse HD Audio
    driver: snd_hda_intel
  API: ALSA v: k6.12.10-arch1-1 status: kernel-api
  Server-1: PipeWire v: 1.2.7 status: active
Network:
  Device-1: Intel Wi-Fi 5 Wireless-AC 9x6x [Thunder Peak] driver: iwlwifi
  IF: wlan0 state: down mac: 02:c1:31:dc:86:9d
  Device-2: Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet
    driver: r8169
  IF: enp5s0 state: up speed: 100 Mbps duplex: full mac: 24:4b:fe:96:3b:96
Bluetooth:
  Device-1: Intel Wireless-AC 9260 Bluetooth Adapter driver: btusb type: USB
  Report: btmgmt ID: hci0 rfk-id: 0 state: down bt-service: disabled
    rfk-block: hardware: no software: no address: N/A
Drives:
  Local Storage: total: 1.82 TiB used: 918.04 GiB (49.3%)
  ID-1: /dev/nvme0n1 vendor: Crucial model: CT1000P3SSD8 size: 931.51 GiB
  ID-2: /dev/sda vendor: Western Digital model: WD10EZEX-00WN4A0
    size: 931.51 GiB
Partition:
  ID-1: / size: 914.81 GiB used: 754.81 GiB (82.5%) fs: ext4 dev: /dev/dm-0
Swap:
  ID-1: swap-1 type: file size: 512 MiB used: 511.8 MiB (100.0%)
    file: /swapfile
Sensors:
  System Temperatures: cpu: 41.9 C mobo: N/A gpu: amdgpu temp: 47.0 C
  Fan Speeds (rpm): N/A gpu: amdgpu fan: 0
Info:
  Memory: total: 32 GiB available: 31.25 GiB used: 7.78 GiB (24.9%)
  Processes: 401 Uptime: 15h 10m Shell: Zsh inxi: 3.3.37

Do you have vulkan-radeon installed? Do you have all the 32-bit lib files installed for gaming? Did you set it up as per the Arch Wiki?

Also what is your power supply?

https://wiki.archlinux.org/title/AMDGPU

Edit: There is also another BIOS update for this board.

Yeah, an underpowered power supply is coming to my mind too.
I assume you connected all the power cables to the GPU?
And possibly to the motherboard itself (some models have a power socket to power the PCI Express slots).

Thanks for the response.

I have the following drivers and libraries installed:

local/lib32-vulkan-icd-loader 1.4.303-1
local/lib32-vulkan-radeon 1:24.3.3-2
local/spirv-tools 2024.4.rc2-1 (vulkan-devel)
local/vulkan-headers 1:1.4.303-1 (vulkan-devel)
local/vulkan-icd-loader 1.4.303-1
local/vulkan-radeon 1:24.3.3-2
local/vulkan-tools 1.4.303-2 (vulkan-devel)

local/lib32-mesa 1:24.3.3-2
local/mesa 1:24.3.3-2
local/mesa-utils 9.0.0-5

local/amd-ucode 20250109.7673dffd-1
local/opencl-amd 1:6.3.1-1
local/xf86-video-amdgpu 23.0.0-2 (xorg-drivers)
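
The check against a listing like the one above can be sketched as a short script. This is only an illustration: the `REQUIRED` set is an assumption built from the packages discussed in this thread, not an official list, and the function names are hypothetical. It expects lines shaped like `local/name version`, as in the listing above.

```python
# Assumed set of packages, taken from the ones mentioned in this thread.
REQUIRED = {
    "mesa", "lib32-mesa",
    "vulkan-radeon", "lib32-vulkan-radeon",
    "vulkan-icd-loader", "lib32-vulkan-icd-loader",
}


def installed_names(listing):
    """Extract package names from lines shaped like 'local/name version ...'."""
    names = set()
    for line in listing.splitlines():
        line = line.strip()
        if line.startswith("local/"):
            # First field is 'local/name'; drop the repository prefix.
            names.add(line.split()[0][len("local/"):])
    return names


def missing_packages(listing, required=REQUIRED):
    """Return the required packages that do not appear in the listing."""
    return sorted(set(required) - installed_names(listing))
```

Feeding it the listing above as a string should return an empty list, since every package in the assumed set is present.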

I have a Cooler Master V750 Semi-modular power supply, which does meet the minimum requirement.
The GPU has two 8 pin power cables installed, but the motherboard only has a 4 pin cable installed. That might actually be the issue, since there’s an 8 pin still open.

I have yet to install the most recent BIOS update, but it’s been updated within the last three months or so.

Thanks, will report back when I have the other 8 pin cable installed in the motherboard.

Did you set it up as per the wiki? Does vulkaninfo give you output?

Edit: A 750W power supply is possibly okay, but I would probably go with an 850W myself.

vulkaninfo does provide an output starting with Vulkan Instance Version: 1.4.303.

Yeah, will have to see about getting an 850W if the crashes continue.

This board has one 4 pin 12V connector and one 8 pin 12V connector. Do you have the 8 pin 12V connector filled or just the 4 pin 12V connector? Some power supplies only come with one 4 pin 12V connector.

Edit: You said the GPU has two power connectors connected to it?

The EATX12V_1 8 pin power plug is now connected. The GPU’s two 8 pin plugs have been connected since it was first installed.

Seems like only the EATX12V_2 4 pin plug was connected, which I unplugged in order to connect the 8 pin as mentioned above. Hopefully that helps!

So far I’ve had no crashes :tada:
I’ll keep on testing for a day or two before I mark this as solved.
