Random restarts on my computer

I’ve had an issue with random restarts for a while, but lately they’ve been happening every few days.

inxi -Fxxc0z | eos-sendlog: https://0x0.st/8j7V.txt

journalctl -k -b -2 | eos-sendlog: https://0x0.st/8j74.txt – no logs about the crash here

lsblk -o name,type,size,PTTYPE,FSTYPE:

NAME        TYPE   SIZE PTTYPE FSTYPE
nvme0n1     disk 931.5G gpt    
├─nvme0n1p1 part   300M gpt    vfat
├─nvme0n1p2 part 922.4G gpt    ext4
└─nvme0n1p3 part   8.8G gpt    swap

I did manage to capture a kdump for this crash using kdumpst: https://0x0.st/8j7y.txt

I’ve run memtest86+ several times but have yet to find any errors.

What are your temperatures like?

sudo sensors
1 Like

Right now, it shows the following:

iwlwifi_1-virtual-0
Adapter: Virtual device
temp1:        +35.0°C  

nvme-pci-0400
Adapter: PCI adapter
Composite:    +31.9°C  (low  =  -5.2°C, high = +79.8°C)
                       (crit = +84.8°C)

amdgpu-pci-0500
Adapter: PCI adapter
vddgfx:        1.32 V  
vddnb:       768.00 mV 
edge:         +45.0°C  
PPT:           6.00 W  

k10temp-pci-00c3
Adapter: PCI adapter
Tctl:         +52.6°C  

BAT0-acpi-0
Adapter: ACPI interface
in0:          16.25 V  
curr1:         0.00 A  

This just happened again while the temperatures were hardly higher, so it doesn’t seem to be a temperature issue. Unfortunately, I didn’t get a kdump for this one.

Yeh, temps look fine.

Here are some earlier logs captured by kdumpst from the same system, if that happens to be helpful:

Have you installed/enabled all the System76 services?

I have system76.service enabled, but:

  • system76-firmware-daemon.service is loaded but disabled:
    $ systemctl status system76-firmware-daemon.service
    ○ system76-firmware-daemon.service - System76 Firmware Daemon
       Loaded: loaded (/usr/lib/systemd/system/system76-firmware-daemon.service; disabled; preset: disabled)
       Active: inactive (dead)
    
    So I enabled and started it now.
  • system76-power.service is not found:
    $ systemctl status system76-power.service 
    Unit system76-power.service could not be found.
    
1 aur/system76-power 1.2.3-1 (+13 0.06)
    System76 Power Management

TBH do not know much about System76 :frowning:

Strange; I have the system76-power package from AUR installed, but system76-power.service doesn’t appear on the list when I run systemctl, and systemctl status system76-power.service complains about not finding it as I mentioned.

Output for querying for system76-power
$ pacman -Qi system76-power
Name            : system76-power
Version         : 1.2.3-1
Description     : System76 Power Management
Architecture    : any
URL             : https://github.com/pop-os/system76-power
Licenses        : GPL
Groups          : None
Provides        : power-profiles-daemon
Depends On      : dbus  libusb  polkit
Optional Deps   : system76-acpi-dkms: some systems might need this driver [installed]
                  system76-dkms: some systems might need this driver [installed]
                  system76-io-dkms: some systems might need this driver
Required By     : gnome-shell-extension-system76-power-git
Optional For    : gnome-shell  system76-driver
Conflicts With  : None
Replaces        : None
Installed Size  : 5.55 MiB
Packager        : Unknown Packager
Build Date      : Wed 26 Feb 2025 02:32:19 PM EST
Install Date    : Wed 26 Feb 2025 02:33:57 PM EST
Install Reason  : Explicitly installed
Install Script  : Yes
Validated By    : None

This is still happening to me. Latest successful kdumpst capture

I’ll try running memtest86+ again.

Have you tried the LTS kernel if you’re running the latest or visa versa?

I’m using latest right now, but this problem has persisted for a few years, so I don’t think switching to LTS is likely to help.

While running memtest86+, I did find a memory error at address 0x9d4d48, so I added memmap=4K$0x0000009d4000 to the kernel command line to reserve the memory around that. Now I’ll have to see if that improves the situation.

Update (since I can’t edit my last post anymore): Doesn’t seem that this solved the problem, as I got another random reboot. I tried running memtest86+ again, but after 12 hours of testing, I didn’t find any errors. I’ll try the LTS kernel if this problem occurs again.

I had a similar problem. For me, eventually replacing the motherboard seems to have been the fix. Not saying you should jump to that. But search my post in this subforum in case it gives any insights. A frustrating problem for sure.

Sounds like dying power supply or motherboard. If those are good I would check RAM sticks.

Yes it is pain to debug if you don’t have spare ones to test :frowning: