Been having a problem in the last couple of days, where the system will completely lock up and freeze during gaming every hour or two, requiring a hard reboot. Been using this system for months now without issue, doing (roughly) weekly updates, so this is definitely something new since updating on Thursday.
Mostly wondering what steps I can take to try to diagnose this. Checked the journal and there is nothing related in there. Nothing out of the ordinary before the system freezes.
Swapped to a Win11 for the day just to see if the hardware itself is broken, but it seems to be fine here. Issue is only happening when running EOS.
I’ve been having a similar problem while playing for a few days now. All my peripheral devices (display, keyboard, mouse and headphones) switch off at once, but the computer continues to run. I then have to perform a hard reset.
I have the latest packages and play on an AMD iGPU.
I may have found the problem. The day before this started happening I installed xone from the AUR to connect an XBox controller. After uninstalling that, it seems to be fine… for now.
Also realized that yay installed xone-dkms-git (and xone-dongle-firmware), which seems to be severely outdated compared to xone-dkms. (Two years difference)
@dalto that’s a good idea, thanks! I’ll try that next if it keeps happening.
You can’t really use the date on *-git packages. The git versions usually just pull the latest code from git so the packages themselves don’t need to be updated unless something changes.
To reinforce the idea, the -git repos will pull the latest cutting edge code. So if you have an issue, it’s not because it’s not up-to-date, it’ll be because it’s so new it may have bugs still.
That said, I use the xone-dkms-git package because it provides support for some controllers of mine that are too new for the release build.
An important consideration too, is that -git packages will rarely be updated with system updates. You’ll need to manually update these yourself periodically, by installing them again, which will pull the latest committed code from their repository.
You’re currently running BIOS v2.02. You might want to consider updating that as there’s been five releases since, including multiple AGESA updates (updates from AMD) and security updates. Please confirm the link is for your exact motherboard before pursuing any BIOS update.
Sensors:
System Temperatures: cpu: 64.0 C mobo: N/A
Fan Speeds (rpm): N/A
GPU: device: amdgpu temp: 93.0 C mem: 74.0 C fan: 2269 device: amdgpu
temp: 51.0 C
What’s the cooling like in your system? What case do you have? Your Radeon RX 6700 XT looks to be pretty toasty there.
Oh yea, thanks. That’s a good callout. - I’ve kind of been afraid to touch it, since an updated a while back had some regression and got me stuck in memory training hell. Had to revert to fix it. - It’s probably time to look into that again tho, looks like there have been several updates.
The temps should be OK as well. I’ve spent a lot of time tuning my fans and such to be quiet while not overheating anything. As long as the hotspot is reporting <100C I’m not stressing too much about it.
I can report tho that the LTS kernel appears to have stabilized things. It’s been running for over 24h now without any issue, so I’m optimistic about that.