changed BIOS from “low current idle” to “typical idle”
deactivate C-States in BIOS
Issue:
The System only crashes when in idle or under low load.
When it crashes the gui hangs and sound repeate’s.
System is no more accessible over SSH from another System.
I looked at the logs (like in the wiki), but there are no entries releated to this issue.
Is there any possibility to activate extra logs?
Like crash dumps? Or crash messages?
Try to exclude PSU by using other one temporary borrowed from a friend…
It can be a lot of things, but…
32GB DDR4-3200 HyperX
Because KIngston is absolute garbage - try anything other than kingston in your system, one RAM at a time, to exclude it…But if i were you those would be in my trash-bin regardless
This tells not much unfortunately, even memtest devs saying that it’s not reliable to exclude all RAM problems, especially ones that is coming from idle / low states, so i suggest you try to borrow 1 non-Kingston RAM from a friend to be absolutely sure…
Usually RAM is tested by combination of memtest and prolonged ~24 hours stress-tests, but since you’re certain it’s idle / low - in my experience it’s most likely:
CPU
Motherboard / BIOS etc
PSU
RAM
Oh and how can i forget:
SSD (easy to exclude, just unplug it from board and run only from live system to see if it will crash.)
I had similar issue with two of my ryzen machines.
Switching Power supply idle control in bios to “typical current idle” from “auto” fixed idle crash on one of them.
Another machine’s CPU was using an incorrect voltage at idle. The problem can be alleviated by setting the Vcore offset by 0.05V or higher. That machine is still working good to this day with 0.05v offset.
the strange thing is, that I had this issue never unter Linux Mint 20.3 (and older versions).
When the system is under load, like gaming or compiling it never crashes.
I can run heavy loads all day without any issue.
I have only “low current idle” and “typical current idle” in the BIOS options.
I know there was a bug, a long time ago, that the kernel couldn’t wake up sleeping cores and locked up. That in mind, I deactivated the C States (in BIOS) to prevent the sleep of cpu cores.
But this doesn’t solved the issue also.
Could it be an issue with the current kernels or any other software on the system?
Is there a possibility to get more detailed logs then that described in the wiki?
Absolutely, install LTS and boot from it, to check.
Personally i’d use only LTS kernel, unless i really strongly actually need use latest for compatibility reasons (too new hardware)