Continuous system freezes

inxi -Fxxxza --no-host
System:    Kernel: 5.15.5-zen1-1-zen x86_64 bits: 64 compiler: gcc v: 11.1.0
           parameters: BOOT_IMAGE=/boot/vmlinuz-linux-zen root=UUID=d7eeafb0-0792-490d-99ea-ba2964d99527 rw quiet loglevel=3
           nowatchdog
           Desktop: KDE Plasma 5.23.4 tk: Qt 5.15.2 wm: kwin_x11 vt: 7 dm: LightDM 1.30.0 Distro: EndeavourOS base: Arch Linux
Machine:   Type: Desktop System: Gigabyte product: A320M-S2H v: N/A serial: <superuser required>
           Mobo: Gigabyte model: A320M-S2H-CF serial: <superuser required> UEFI: American Megatrends LLC. v: T54d
           date: 11/24/2021
Battery:   Device-1: hidpp_battery_0 model: Logitech G305 Lightspeed Wireless Gaming Mouse serial: <filter>
           charge: 100% (should be ignored) rechargeable: yes status: Discharging
CPU:       Info: 6-Core model: AMD Ryzen 5 3600 bits: 64 type: MT MCP arch: Zen 2 family: 17 (23) model-id: 71 (113)
           stepping: 0 microcode: 8701021 cache: L1: 384 KiB L2: 3 MiB L3: 32 MiB
           flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 86242
           Speed: 2803 MHz min/max: 2200/3600 MHz boost: enabled Core speeds (MHz): 1: 2803 2: 2877 3: 2579 4: 2339 5: 3939
           6: 2476 7: 2938 8: 3062 9: 2370 10: 2071 11: 4141 12: 2253
           Vulnerabilities: Type: itlb_multihit status: Not affected
           Type: l1tf status: Not affected
           Type: mds status: Not affected
           Type: meltdown status: Not affected
           Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via prctl
           Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer sanitization
           Type: spectre_v2 mitigation: Full AMD retpoline, IBPB: conditional, STIBP: conditional, RSB filling
           Type: srbds status: Not affected
           Type: tsx_async_abort status: Not affected
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Navi 10 [Radeon RX 5600 OEM/5600 XT / 5700/5700 XT] vendor: XFX Pine
           driver: amdgpu v: kernel bus-ID: 0a:00.0 chip-ID: 1002:731f class-ID: 0300
           Display: x11 server: X.Org 1.21.1.1 compositor: kwin_x11 driver: loaded: amdgpu display-ID: :0 screens: 1
           Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 507x285mm (20.0x11.2") s-diag: 582mm (22.9")
           Monitor-1: DisplayPort-1 res: 1920x1080 dpi: 92 size: 532x304mm (20.9x12.0") diag: 613mm (24.1")
           OpenGL: renderer: AMD Radeon RX 5600 XT (NAVI10 DRM 3.42.0 5.15.5-zen1-1-zen LLVM 13.0.0) v: 4.6 Mesa 21.2.5
           direct render: Yes
Audio:     Device-1: Advanced Micro Devices [AMD/ATI] Navi 10 HDMI Audio driver: snd_hda_intel v: kernel bus-ID: 0a:00.1
           chip-ID: 1002:ab38 class-ID: 0403
           Device-2: Advanced Micro Devices [AMD] Starship/Matisse HD Audio vendor: Gigabyte driver: snd_hda_intel v: kernel
           bus-ID: 0c:00.4 chip-ID: 1022:1487 class-ID: 0403
           Device-3: C-Media USB Advanced Audio Device type: USB driver: hid-generic,snd-usb-audio,usbhid bus-ID: 3-3:2
           chip-ID: 0d8c:0024 class-ID: 0300
           Device-4: JMTek LLC. USB PnP Audio Device type: USB driver: hid-generic,snd-usb-audio,usbhid bus-ID: 3-4:3
           chip-ID: 0c76:161e class-ID: 0300
           Sound Server-1: ALSA v: k5.15.5-zen1-1-zen running: yes
           Sound Server-2: JACK v: 1.9.19 running: no
           Sound Server-3: PulseAudio v: 15.0 running: yes
           Sound Server-4: PipeWire v: 0.3.40 running: yes
Network:   Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: Gigabyte driver: r8169 v: kernel
           port: f000 bus-ID: 07:00.0 chip-ID: 10ec:8168 class-ID: 0200
           IF: enp7s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
Drives:    Local Storage: total: 1.36 TiB used: 212.44 GiB (15.2%)
           SMART Message: Unable to run smartctl. Root privileges required.
           ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Crucial model: CT500P1SSD8 size: 465.76 GiB block-size: physical: 512 B
           logical: 512 B speed: 31.6 Gb/s lanes: 4 type: SSD serial: <filter> rev: P3CR021 temp: 27.9 C scheme: GPT
           ID-2: /dev/sda maj-min: 8:0 vendor: Toshiba model: HDWD110 size: 931.51 GiB block-size: physical: 4096 B
           logical: 512 B speed: 6.0 Gb/s type: HDD rpm: 7200 serial: <filter> rev: A8R0 scheme: GPT
Partition: ID-1: / raw-size: 465.26 GiB size: 456.89 GiB (98.20%) used: 52.31 GiB (11.4%) fs: ext4 dev: /dev/nvme0n1p2
           maj-min: 259:2
           ID-2: /boot/efi raw-size: 512 MiB size: 511 MiB (99.80%) used: 296 KiB (0.1%) fs: vfat dev: /dev/nvme0n1p1
           maj-min: 259:1
Swap:      Alert: No swap data was found.
Sensors:   System Temperatures: cpu: 16.8 C mobo: 16.8 C gpu: amdgpu temp: 34.0 C mem: 32.0 C
           Fan Speeds (RPM): N/A gpu: amdgpu fan: 0
Info:      Processes: 343 Uptime: 5m wakeups: 3 Memory: 15.61 GiB used: 3.11 GiB (19.9%) Init: systemd v: 249 tool: systemctl
           Compilers: gcc: 11.1.0 clang: 13.0.0 Packages: pacman: 1322 lib: 415 flatpak: 0 Shell: Zsh v: 5.8 running-in: kitty
           inxi: 3.3.09

cant i just reboot after the crash and run the command from the actual os?

Yes. Just show your log from the previous boot.

Show

journalctl -b -1 | eos-sendlog

https://clbin.com/Nh6fr

just crashed. Thats the log.
My pc was idling… it has crashed many times while idling so im suspecting it has something to do with power management. Maybe it shutsdown the nvme?!? Idkk

What is on this computer i see wine-drive and a lot of stuff i don’t recognize.

List of all my applications installed:
https://clbin.com/GhfwY
List of all packages installed:
https://clbin.com/Eof8I

What does this show

sudo dmesg | eos-sendlog

https://clbin.com/HLM9x

Have you checked the power supply?

Edit: Tested it with a proper tester that puts load on it to see what power outputs and draw are?

Like how?
Like this? Corsair Video FAQ: How to test a Corsair power supply - YouTube

hahaha crashed again while writing this… uh yeah nope i havnt tested like stress test. What program could i use to achieve it?

No that’s just a basic test for functionality. I mean a proper load test so when the power supply has demand on it and also when idle.

Have you tried any kernel parameters such as iommu=soft? I know you have the latest Bios. If it’s crashing just idling or even if it was crashing using it is instability. So you have to start from a base line and work you way through elimination of where the issue could stem from. Maybe use the journalctl -f command that traces so you can see exactly where it crashes.

If it’s this bad you may have to start with a fresh install and make sure you wipe the drive clean.Then start trying to eliminate memory by doing long tests, powersupply, load test, etc. Make sure you start with Bios settings are all default and then set obviously secure boot off etc. the normal settings that are needed. There is no easy answer. It could boil down to the motherboard, power supply, memory, Bios, drive, or installation? When i say Bios i mean it may require additional parameters because it’s just not working with the existing Bios. :man_shrugging:

:frowning: sigh fresh install for the 6th time :sob: . How do i specify additional parameters?? And ill do a new long memory test, test psu (any good programs for it) and gpu!! And cpu too!!

You add the kernel parameters to the default grub command line in /etc/default/grub and then update grub with sudo grub-mkconfig -o /boot/grub/grub.cfg.

To test the power supply i have a tester. I guess you either take the power supply to a computer store and have it tested or purchase the proper tester or process of elimination? Put in another power supply that is known good. I know this is a pain but if you’ve installed it this many times and tried other things and it hasn’t worked. You have to start from a base line try to eliminate probable causes and narrow it down.

Another power supply?!?! Yeah i guess i could take it to a computer store. Ill try it with the kernel parameter! But does it seem like its the psu?? Like some logs that a pc component suddenly lost power or something??

It’s not about losing power. It’s about stability. Ram timings, cpu power draw & input are extremely critical to stability so if the power supply is off then you get instability and crashes.

Yeah… So i should start from the psu? They are not too cheap. I mean i could just get a new one. But first ill try the parameter and take it to a pc store to be tested! So the psu seems like the most possible problem? Could it cause blue screens on windows too?

Soo just in /etc/defaut/grub in GRUB_CMDLINE_LINUX_DEFAULT="" add the iommu=soft?

What power supply do you have?

Yes.