The full text of the error is as follows:
mce: [Hardware Error]: CPU 22: Machine Check: 0 Bank 2: 8c00004040400151
mce: [Hardware Error]: TSC 0 ADDR 99c43d80 MISC 40
mce: [Hardware Error]: PROCESSOR 0:b0671 TIME 1729871239 SOCKET 0 APIC 4c microcode 129
I first saw it during the startup list of flowing checks that you get before you reach the screen to login. However, because it goes by so quick and it is in gray I couldn’t see it properly. I was traveling for work, so I didn’t pay too much attention for the first week or so.
Now that I am back, I did
journalctl --since “1 minutes ago”
and realized that it appears every second!
I did some searching online and I saw a few similar errors, but not with the same code. Usually Bank 2, 5 6, which for some people are “nothing to worry about”, whereas for others maybe led to hardware issues. One very technical discussion I found talks about changing the voltage on the CPU in BIOS and so on, but this is way above my understanding.
Dell told me to update my BIOS to the latest version which I did. Nothing changed. Nothing from them since. I have always used the default settings and have never overclocked my CPU in any way. I dual boot Windows and Endeavouros. I updated BIOS through Windows (which is more or less the only thing I use it for).
I also ran the 20 min BIOS diagnostic and that did not show any errors.
I have been using my current setup for over a year with no problems and have been super happy.
I am using the linux-zen kernels. Tried a session with the linux-lts ones and the same error was constantly appearing in the logs.
Here is the result from
journalctl -k -b -0 | eos-sendlog
and here the result of
inxi -Fxxc0z | eos-sendlog
I am not sure what else to provide as information. Any help will be greatly appreciated, or steps into deciphering the technical language on other similar errors.