Bit of a "Interesting" problem...RTX 3060 Ti crashing

Well…I have a bit of a real head-scratching problem. I’ve posted on Gigabyte-Aorus reddit, Gridcoin reddit & Nvidia Linux developers forum… (WARNING!!! LONG POST)

So, here it goes: First Post-- AORUS GeForce RTX 3060 Ti ELITE-------FAILED

OK–a bummer of a post…got this card at the one time recently that NewEgg had them at MSRP (3/5/22). Got the card a couple of days later…worked perfectly for about 1 week & then started blank screening everytime I started anything that would clock-up memory (games except strangely my old copy of UT2004) — anything that took the card off base settings. 3DMark is out of the question–crashes as soon as the test starts. It’s seemingly normal without a load…I’m guessing that it’s a memory problem–started during a game.

I contacted NewEgg & have an RMA going–I’m crossing my fingers that they have a NEW replacement for me (the card I got “looked” like new–all the packaging was intact).

Second Post (after more information): I really think I got a bad card—it looks like memory corruption that survives with reboots & needs a complete power off/on cycle to clear. I’m a member of Nvidia Linux forums, so I have posted there to see if an Nvidia Dev might have an idea. I don’t know why the card won’t clear the memory stack after it crashes—must be left in an undetermined state.

Gridcoin reddit: Problem with GPU work & new LHR 3060ti

OK—maybe I brought it on myself…finally bit the bullet & bought an RTX 3060 Ti (https://www.gigabyte.com/Graphics-Card/GV-N306TAORUS-E-8GD-rev-20#kf). Worked very well for about a week then I came into no screen on wakeup. Using Endeavour OS Linux & the current Nvidia driver…so, no surprises there.

What I’ve found is this: I can start a GPU unit…will complete several (currently numberfields@home) & about 10 ~15 minutes after the screen goes to sleep–it won’t wake up again–requiring a hard reboot. After that, I can’t play any games without a crash to a blank screen. I can shut down/power off the system & then games will work normally until I try to run GPU jobs again—at which time I need to redo the above over again to get back to “normal”.

It is starting to look like a progressive memory corruption? is my problem…grasping at straws here…I’m RMAing this card to see if the problems go away, but I am also looking for ideas/thoughts/suggestions to change settings to cure my problem.

Nvidia driver: 510.54 Checked the card BIOS & it is the newest…no O/C on the card.

JUST A FYI at this point—Gridcoin is “pay” for BOINC workunits–you do the regular science work BOINC has & credit all of your work to Gridcoin & then you are “paid” in Gridcoins for it–you can then exchange Gridcoins for whatever Coin(s) you want through a regular Exchange. So you are not Cryptomining in the usual sense – you are doing science for “pay”.

After that post I got the first glimmer of what is going on—as long as the screen is awake = no crash screen asleep = crash. This all started about a week ago. I’m running Gnome–no “funny” addons – So, I’m thinking that something changed in power control through an update in the recent past. I had prior used a pair of GTX 1070 G1’s (about the exact same power requirements at the 3060 Ti they were replaced with). The rest of the system is unchanged from before the RTX 3060 Ti was installed.

Nvidia forums: Same info as above with this & Log info: I was just using a pair of GTX 1070 G1’s with all of the rest of my system exactly the same. The main reason I upgraded was to increase the number of CUDA cores…no other reason…the system would sleep normally with the 1070’s & for a week the 3060 Ti would also sleep normally…

Nvidia driver: 510.54 Checked the card BIOS & it is the newest…no O/C on the card. Suggestions please on capturing this problem, log shows this at logs end:

1:51:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0450, 0x000353d8, 0x00032564)
1:51:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0450, 0x000353d8, 0x00032564)
1:51:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044c, 0x000353d8, 0x00032530)
1:51:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044c, 0x000353d8, 0x00032530)
1:51:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044f, 0x000353d8, 0x000324fc)
1:51:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044f, 0x000353d8, 0x000324fc)
1:51:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044b, 0x000353d8, 0x000324c8)
1:51:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044b, 0x000353d8, 0x000324c8)
1:51:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044e, 0x000353d8, 0x00032494)
1:51:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044e, 0x000353d8, 0x00032494)
1:51:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044a, 0x000353d8, 0x00032460)
1:51:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044a, 0x000353d8, 0x00032460)
1:51:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044d, 0x000353d8, 0x0003242c)
1:50:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044d, 0x000353d8, 0x0003242c)
1:50:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0449, 0x000353d8, 0x000323f8)
1:50:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0449, 0x000353d8, 0x000323f8)
1:50:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044c, 0x000353d8, 0x000323c4)
1:50:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044c, 0x000353d8, 0x000323c4)
1:50:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0448, 0x000353d8, 0x00032390)
1:50:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0448, 0x000353d8, 0x00032390)
1:50:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044b, 0x000353d8, 0x0003235c)
1:50:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044b, 0x000353d8, 0x0003235c)
1:50:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0447, 0x000353d8, 0x00032328)
1:50:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0447, 0x000353d8, 0x00032328)
1:50:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x044a, 0x000353d8, 0x000322f4)
1:49:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x044a, 0x000353d8, 0x000322f4)
1:49:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0446, 0x000353d8, 0x000322c0)
1:49:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0446, 0x000353d8, 0x000322c0)
1:49:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0449, 0x000353d8, 0x0003228c)
1:49:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0449, 0x000353d8, 0x0003228c)
1:49:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0445, 0x000353d8, 0x00032258)
1:49:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0445, 0x000353d8, 0x00032258)
1:49:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0448, 0x000353d8, 0x00032224)
1:49:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0448, 0x000353d8, 0x00032224)
1:49:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0444, 0x000353d8, 0x000321f0)
1:49:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0444, 0x000353d8, 0x000321f0)
1:49:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0447, 0x000353d8, 0x000321bc)
1:48:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0447, 0x000353d8, 0x000321bc)
1:48:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0443, 0x000353d8, 0x00032188)
1:48:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0443, 0x000353d8, 0x00032188)
1:48:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0446, 0x000353d8, 0x00032154)
1:48:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0446, 0x000353d8, 0x00032154)
1:48:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0442, 0x000353d8, 0x00032120)
1:48:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0442, 0x000353d8, 0x00032120)
1:48:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0445, 0x000353d8, 0x000320ec)
1:48:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0445, 0x000353d8, 0x000320ec)
1:48:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0441, 0x000353d8, 0x000320b8)
1:48:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0441, 0x000353d8, 0x000320b8)
1:48:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0444, 0x000353d8, 0x00032084)
1:47:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0444, 0x000353d8, 0x00032084)
1:47:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0440, 0x000353d8, 0x00032050)
1:47:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0440, 0x000353d8, 0x00032050)
1:47:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0443, 0x000353d8, 0x0003201c)
1:47:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0443, 0x000353d8, 0x0003201c)
1:47:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043f, 0x000353d8, 0x00031fe8)
1:47:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043f, 0x000353d8, 0x00031fe8)
1:47:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0442, 0x000353d8, 0x00031fb4)
1:47:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0442, 0x000353d8, 0x00031fb4)
1:47:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043e, 0x000353d8, 0x00031f80)
1:47:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043e, 0x000353d8, 0x00031f80)
1:47:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0441, 0x000353d8, 0x00031f4c)
1:46:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0441, 0x000353d8, 0x00031f4c)
1:46:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043d, 0x000353d8, 0x00031f18)
1:46:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043d, 0x000353d8, 0x00031f18)
1:46:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0440, 0x000353d8, 0x00031ee4)
1:46:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0440, 0x000353d8, 0x00031ee4)
1:46:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043c, 0x000353d8, 0x00031eb0)
1:46:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043c, 0x000353d8, 0x00031eb0)
1:46:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043f, 0x000353d8, 0x00031e7c)
1:46:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043f, 0x000353d8, 0x00031e7c)
1:46:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043b, 0x000353d8, 0x00031e48)
1:46:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043b, 0x000353d8, 0x00031e48)
1:46:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043e, 0x000353d8, 0x00031e14)
1:45:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043e, 0x000353d8, 0x00031e14)
1:45:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043a, 0x000353d8, 0x00031de0)
1:45:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043a, 0x000353d8, 0x00031de0)
1:45:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043d, 0x000353d8, 0x00031dac)
1:45:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043d, 0x000353d8, 0x00031dac)
1:45:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0439, 0x000353d8, 0x00031d78)
1:45:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0439, 0x000353d8, 0x00031d78)
1:45:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043c, 0x000353d8, 0x00031d44)
1:45:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043c, 0x000353d8, 0x00031d44)
1:45:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0438, 0x000353d8, 0x00031d10)
1:45:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0438, 0x000353d8, 0x00031d10)
1:45:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043b, 0x000353d8, 0x00031cdc)
1:44:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043b, 0x000353d8, 0x00031cdc)
1:44:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0437, 0x000353d8, 0x00031ca8)
1:44:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0437, 0x000353d8, 0x00031ca8)
1:44:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x043a, 0x000353d8, 0x00031c74)
1:44:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x043a, 0x000353d8, 0x00031c74)
1:44:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0436, 0x000353d8, 0x00031c40)
1:44:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0436, 0x000353d8, 0x00031c40)
1:44:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0439, 0x000353d8, 0x00031c0c)
1:44:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0439, 0x000353d8, 0x00031c0c)
1:44:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0435, 0x000353d8, 0x00031bd8)
1:44:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0435, 0x000353d8, 0x00031bd8)
1:44:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0438, 0x000353d8, 0x00031ba4)
1:43:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0438, 0x000353d8, 0x00031ba4)
1:43:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0434, 0x000353d8, 0x00031b70)
1:43:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0434, 0x000353d8, 0x00031b70)
1:43:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0437, 0x000353d8, 0x00031b3c)
1:43:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0437, 0x000353d8, 0x00031b3c)
1:43:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0433, 0x000353d8, 0x00031b08)
1:43:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0433, 0x000353d8, 0x00031b08)
1:43:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0436, 0x000353d8, 0x00031ad4)
1:43:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0436, 0x000353d8, 0x00031ad4)
1:43:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0432, 0x000353d8, 0x00031aa0)
1:43:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0432, 0x000353d8, 0x00031aa0)
1:43:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0435, 0x000353d8, 0x00031a6c)
1:42:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0435, 0x000353d8, 0x00031a6c)
1:42:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0431, 0x000353d8, 0x00031a38)
1:42:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0431, 0x000353d8, 0x00031a38)
1:42:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0434, 0x000353d8, 0x00031a04)
1:42:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0434, 0x000353d8, 0x00031a04)
1:42:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0430, 0x000353d8, 0x000319d0)
1:42:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0430, 0x000353d8, 0x000319d0)
1:42:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0433, 0x000353d8, 0x0003199c)
1:42:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0433, 0x000353d8, 0x0003199c)
1:42:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x042f, 0x000353d8, 0x00031968)
1:42:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x042f, 0x000353d8, 0x00031968)
1:42:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0432, 0x000353d8, 0x00031934)
1:41:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0432, 0x000353d8, 0x00031934)
1:41:53 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x042e, 0x000353d8, 0x00031900)
1:41:46 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x042e, 0x000353d8, 0x00031900)
1:41:43 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0431, 0x000353d8, 0x000318cc)
1:41:36 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0431, 0x000353d8, 0x000318cc)
1:41:33 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x042d, 0x000353d8, 0x00031898)
1:41:26 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x042d, 0x000353d8, 0x00031898)
1:41:23 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x0430, 0x000353d8, 0x00031864)
1:41:16 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x0430, 0x000353d8, 0x00031864)
1:41:13 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x042c, 0x000353d8, 0x00031830)
1:41:06 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x042c, 0x000353d8, 0x00031830)
1:41:03 PM Xorg: (WW) NVIDIA(0): WAIT (1-S, 17, 0x042f, 0x000353d8, 0x000317fc)
1:40:56 PM Xorg: (WW) NVIDIA(0): WAIT (2-S, 17, 0x042f, 0x000353d8, 0x000317fc)
1:40:53 PM conky: conky: get_nvidia_value: Something went wrong running nvidia query (arg: memfreq, tid: 1, aid: 4)
1:38:09 PM boinc: 20-Mar-2022 13:38:09 [NumberFields@home] Not requesting tasks: don’t need (CPU: not highest priority project; NVIDIA GPU: not highest priority project)
12:05:37 PM Xorg: (–) NVIDIA(GPU-0):
12:05:25 PM boinc: 20-Mar-2022 12:05:25 [Universe@Home] Not requesting tasks: don’t need (CPU: not highest priority project; NVIDIA GPU: )
12:05:22 PM Xorg: (–) NVIDIA(GPU-0):
12:05:16 PM boinc: 20-Mar-2022 12:05:16 [GPUGRID] Not requesting tasks: don’t need (CPU: not highest priority project; NVIDIA GPU: not highest priority project)
11:23:54 AM Xorg: (II) NVIDIA(GPU-0): Deleting GPU-0
11:23:51 AM boinc: 20-Mar-2022 11:23:51 [World Community Grid] Requesting new tasks for CPU and NVIDIA GPU
11:23:51 AM Xorg: (–) NVIDIA(GPU-0):
11:23:51 AM boinc: 20-Mar-2022 11:23:51 [—] CUDA: NVIDIA GPU 0: NVIDIA GeForce RTX 3060 Ti (driver version 510.54, CUDA version 11.6, compute capability 8.6, 4096MB, 3962MB available, 8682 GFLOPS peak)

So, there is all the info I have at this time–I can reboot into any OS on my system & create a crash after the initial one just by starting a game—unless the screen was not asleep & no crash before reboot…

Screen shot of system running normally under load.
Screenshot-20220321213520-262x642

It looks like the clue to the problem is in the log just before the memory corruption–Conky complains that memfreq has a problem…

can you return option boot kernel used ? ( from inxi -Fza )

inxi -Fza 
System:
  Kernel: 5.16.15-zen1-1-zen x86_64 bits: 64 compiler: gcc v: 11.2.0
    parameters: BOOT_IMAGE=/boot/vmlinuz-linux-zen
    root=UUID=defa66bf-7526-45d2-8331-4ab969be8b5b rw loglevel=3
    intel_pstate=active pcie_aspm=off random.trust_cpu=on
  Desktop: GNOME 41.5 tk: GTK 3.24.33 wm: gnome-shell dm: GDM 41.3
    Distro: EndeavourOS base: Arch Linux
Machine:
  Type: Desktop System: Gigabyte product: X299 DESIGNARE EX v: N/A
    serial: <superuser required>
  Mobo: Gigabyte model: X299 DESIGNARE EX-CF v: x.x
    serial: <superuser required> UEFI: American Megatrends v: F7g
    date: 11/11/2021
CPU:
  Info: model: Intel Core i7-9800X bits: 64 type: MT MCP arch: Skylake
    family: 6 model-id: 0x55 (85) stepping: 4 microcode: 0x2006C0A
  Topology: cpus: 1x cores: 8 tpc: 2 threads: 16 smt: enabled cache:
    L1: 512 KiB desc: d-8x32 KiB; i-8x32 KiB L2: 8 MiB desc: 8x1024 KiB
    L3: 16.5 MiB desc: 1x16.5 MiB
  Speed (MHz): avg: 3898 high: 4158 min/max: 1200/4400:4500 scaling:
    driver: intel_pstate governor: powersave cores: 1: 3518 2: 3836 3: 3957
    4: 3919 5: 4158 6: 3809 7: 3753 8: 3875 9: 3783 10: 4037 11: 3847
    12: 3841 13: 4093 14: 4042 15: 4040 16: 3872 bogomips: 121596
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
  Vulnerabilities:
  Type: itlb_multihit status: KVM: VMX disabled
  Type: l1tf
    mitigation: PTE Inversion; VMX: conditional cache flushes, SMT vulnerable
  Type: mds mitigation: Clear CPU buffers; SMT vulnerable
  Type: meltdown mitigation: PTI
  Type: spec_store_bypass
    mitigation: Speculative Store Bypass disabled via prctl
  Type: spectre_v1
    mitigation: usercopy/swapgs barriers and __user pointer sanitization
  Type: spectre_v2 mitigation: Retpolines, IBPB: conditional, IBRS_FW,
    STIBP: conditional, RSB filling
  Type: srbds status: Not affected
  Type: tsx_async_abort mitigation: Clear CPU buffers; SMT vulnerable
Graphics:
  Device-1: NVIDIA GA104 [GeForce RTX 3060 Ti Lite Hash Rate]
    vendor: Gigabyte driver: nvidia v: 510.54 alternate: nouveau,nvidia_drm
    pcie: gen: 3 speed: 8 GT/s lanes: 8 link-max: gen: 4 speed: 16 GT/s
    lanes: 16 bus-ID: c2:00.0 chip-ID: 10de:2489 class-ID: 0300
  Device-2: Microdia Dual Mode Camera (8006 VGA) type: USB
    driver: hid-generic,usbhid bus-ID: 1-7:3 chip-ID: 0c45:8006 class-ID: 0301
  Display: x11 server: X.org v: 1.21.1.3 compositor: gnome-shell driver: X:
    loaded: nvidia unloaded: fbdev,modesetting,vesa alternate: nouveau,nv
    gpu: nvidia display-ID: :1 screens: 1
  Screen-1: 0 s-res: 3440x1440 s-size: <missing: xdpyinfo>
  Monitor-1: DP-0 res: 3440x1440 dpi: 109 size: 800x330mm (31.5x13.0")
    diag: 865mm (34.1")
  Message: Unable to show GL data. Required tool glxinfo missing.
Audio:
  Device-1: Intel 200 Series PCH HD Audio vendor: Gigabyte
    driver: snd_hda_intel v: kernel bus-ID: 00:1f.3 chip-ID: 8086:a2f0
    class-ID: 0403
  Device-2: NVIDIA GA104 High Definition Audio vendor: Gigabyte
    driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 8
    link-max: gen: 4 speed: 16 GT/s lanes: 16 bus-ID: c2:00.1
    chip-ID: 10de:228b class-ID: 0403
  Sound Server-1: ALSA v: k5.16.15-zen1-1-zen running: yes
  Sound Server-2: JACK v: 1.9.20 running: no
  Sound Server-3: PulseAudio v: 15.0 running: yes
  Sound Server-4: PipeWire v: 0.3.48 running: no
Network:
  Device-1: Intel Ethernet I219-V vendor: Gigabyte driver: e1000e v: kernel
    port: N/A bus-ID: 00:1f.6 chip-ID: 8086:15b8 class-ID: 0200
  IF: enp0s31f6 state: up speed: 1000 Mbps duplex: full mac: <filter>
  Device-2: Intel Wireless 8265 / 8275 driver: iwlwifi v: kernel pcie:
    gen: 1 speed: 2.5 GT/s lanes: 1 bus-ID: 02:00.0 chip-ID: 8086:24fd
    class-ID: 0280
  IF: wlan0 state: down mac: <filter>
  Device-3: Intel I211 Gigabit Network vendor: Gigabyte driver: igb
    v: kernel pcie: gen: 1 speed: 2.5 GT/s lanes: 1 port: 2000 bus-ID: 04:00.0
    chip-ID: 8086:1539 class-ID: 0200
  IF: enp4s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  IF-ID-1: bond0 state: up speed: 2000 Mbps duplex: full mac: <filter>
  IF-ID-2: bonding_masters state: N/A speed: N/A duplex: N/A mac: N/A
Bluetooth:
  Device-1: Intel Bluetooth wireless interface type: USB driver: btusb v: 0.8
    bus-ID: 1-13:7 chip-ID: 8087:0a2b class-ID: e001
  Report: rfkill ID: hci0 rfk-id: 0 state: up address: see --recommends
Drives:
  Local Storage: total: 4.32 TiB used: 1.19 TiB (27.5%)
  SMART Message: Unable to run smartctl. Root privileges required.
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Western Digital
    model: WD BLACK SN850 1TB size: 931.51 GiB block-size: physical: 512 B
    logical: 512 B speed: 63.2 Gb/s lanes: 4 type: SSD serial: <filter>
    rev: 613200WD temp: 44.9 C scheme: GPT
  ID-2: /dev/nvme1n1 maj-min: 259:7 vendor: Samsung
    model: SSD 970 EVO 500GB size: 465.76 GiB block-size: physical: 512 B
    logical: 512 B speed: 31.6 Gb/s lanes: 4 type: SSD serial: <filter>
    rev: 2B2QEXE7 temp: 37.9 C scheme: MBR
  ID-3: /dev/sda maj-min: 8:0 vendor: Samsung model: SSD 840 EVO 250GB
    size: 232.89 GiB block-size: physical: 512 B logical: 512 B speed: 6.0 Gb/s
    type: SSD serial: <filter> rev: DB6Q scheme: GPT
  ID-4: /dev/sdb maj-min: 8:16 vendor: Seagate model: ST1000LX015-1U7172
    size: 931.51 GiB block-size: physical: 4096 B logical: 512 B
    speed: 6.0 Gb/s type: HDD rpm: 5400 serial: <filter> rev: SDM1
    scheme: MBR
  ID-5: /dev/sdc maj-min: 8:32 vendor: Seagate model: ST2000DM008-2FR102
    size: 1.82 TiB block-size: physical: 4096 B logical: 512 B speed: 6.0 Gb/s
    type: HDD rpm: 7200 serial: <filter> rev: 0001 scheme: GPT
Partition:
  ID-1: / raw-size: 76.3 GiB size: 74.1 GiB (97.12%) used: 42.08 GiB (56.8%)
    fs: ext4 dev: /dev/nvme0n1p2 maj-min: 259:2
  ID-2: /boot/efi raw-size: 512 MiB size: 511 MiB (99.80%)
    used: 16 MiB (3.1%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:1
  ID-3: /home raw-size: 152.47 GiB size: 149.08 GiB (97.77%)
    used: 78.2 GiB (52.5%) fs: ext4 dev: /dev/nvme0n1p3 maj-min: 259:3
Swap:
  Alert: No swap data was found.
Sensors:
  System Temperatures: cpu: 37.0 C mobo: N/A gpu: nvidia temp: 44 C
  Fan Speeds (RPM): cpu: 865 fan-2: 1182 fan-3: 1231 gpu: nvidia fan: 80%
  Power: 12v: N/A 5v: N/A 3.3v: 1.68 vbat: 1.64
Info:
  Processes: 431 Uptime: 10h 7m wakeups: 0 Memory: 31.06 GiB
  used: 12.26 GiB (39.5%) Init: systemd v: 250 tool: systemctl Compilers:
  gcc: 11.2.0 clang: 13.0.1 Packages: pacman: 1217 lib: 342 Shell: Bash
  v: 5.1.16 running-in: gnome-terminal inxi: 3.3.13

The system is working very well as long as the screen is not asleep… :slight_smile:

then try by adding this
“systemd.unified_cgroup_hierarchy=true scsi_mod.use_blk_mq=1”

in /etc/default/grub
line GRUB_CMDLINE_LINUX_DEFAULT=
or line GRUB_CMDLINE_LINUX=

OK–I’ve set it up & will give it a try this evening…

So far it looks like this “fixed” it…will monitor it for a few days to make sure. Thank You.