System freezing during shutdown

This problem occurs for some times now and it appears randomly, I don’t know how to recreate it. So… sometimes when I shutdown system, the system completely freezes. Usually its empty black screen with a pointer on it, nothing is work anymore, I can’t enter console or whatever, and the light indicator on Caps Lock starting blinking all the time. And that’s basically it, I have to hold power button to turn it off.

But today there was smth else, as usual during shutdown system shows console output, but it ended up freezing with some error output. I not very knowledgeable about this stuff. I don’t think there’s much information out there, but still.

Also it may not be related, but I have some issue with Bluetooth. The daemon is enabled, but Bluetooth is not working. Very rarely it may start working until next reboot.

I’ll provide any necessary info. For now I attach photo of error output during shutdown and hardware info. Also it worth to mention that this laptop is pretty old, but hardware not so bad. A long time ago I have issues with Wi-Fi adapter, two years ago I replace keyboard (shift and alt didn’t work) and several months ago there was power elements issue which was repaired in service. I don’t have any problems during these months besides some minor software issues. The sytstem was fully reinstalled when Plasma 6.0 released. And yes, I am still using Xorg, it difficult to me use Wayland without some basic features like windows remember their placement and sizes, etc.

Well, there is a last resort by reinstalling the system, but I really tired to reinstall every time big issue appears. I hope the problem is not about hardware and it solvable. Thank you.

Photo of error output during shutdown:

System info:

Operating System: EndeavourOS
KDE Plasma Version: 6.1.1
KDE Frameworks Version: 6.3.0
Qt Version: 6.7.2
Kernel Version: 6.9.7-arch1-1 (64-bit)
Graphics Platform: X11
Processors: 8 × Intel® Core™ i7-6700HQ CPU @ 2.60GHz
Memory: 15,5 GiB of RAM
Graphics Processor: Mesa Intel® HD Graphics 530
Manufacturer: LENOVO
Product Name: 80NV
System Version: Lenovo ideapad Y700-15ISK

Hardware info (inxi -Fz):

System:
  Kernel: 6.9.7-arch1-1 arch: x86_64 bits: 64
  Desktop: KDE Plasma v: 6.1.1 Distro: EndeavourOS
Machine:
  Type: Laptop System: LENOVO product: 80NV v: Lenovo ideapad Y700-15ISK
    serial: <superuser required>
  Mobo: LENOVO model: Allsparks 5A v: NO DPK serial: <superuser required>
    UEFI: LENOVO v: CDCN54WW date: 06/13/2017
Battery:
  ID-1: BAT0 charge: 25.5 Wh (59.0%) condition: 43.2/57.0 Wh (75.8%)
CPU:
  Info: quad core model: Intel Core i7-6700HQ bits: 64 type: MT MCP cache:
    L2: 1024 KiB
  Speed (MHz): avg: 800 min/max: 800/3500 cores: 1: 800 2: 800 3: 800 4: 800
    5: 800 6: 800 7: 800 8: 800
Graphics:
  Device-1: Intel HD Graphics 530 driver: i915 v: kernel
  Device-2: NVIDIA GM107M [GeForce GTX 960M] driver: nvidia v: 555.58.02
  Device-3: Intel RealSense 3D Camera (Front F200) driver: uvcvideo
    type: USB
  Display: x11 server: X.Org v: 21.1.13 with: Xwayland v: 24.1.0 driver: X:
    loaded: modesetting,nvidia dri: iris gpu: i915 resolution: 1920x1080~60Hz
  API: EGL v: 1.5 drivers: iris,nvidia,swrast
    platforms: gbm,x11,surfaceless,device
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: intel mesa v: 24.1.2-arch1.1
    renderer: Mesa Intel HD Graphics 530 (SKL GT2)
  API: Vulkan v: 1.3.279 drivers: nvidia surfaces: xcb,xlib
Audio:
  Device-1: Intel 100 Series/C230 Series Family HD Audio driver: snd_hda_intel
  API: ALSA v: k6.9.7-arch1-1 status: kernel-api
  Server-1: PipeWire v: 1.2.0 status: active
Network:
  Device-1: Intel Wireless 8260 driver: iwlwifi
  IF: wlan0 state: up mac: <filter>
  Device-2: Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet
    driver: r8169
  IF: enp9s0 state: down mac: <filter>
  IF-ID-1: docker0 state: down mac: <filter>
Bluetooth:
  Device-1: Intel Bluetooth wireless interface driver: btusb type: USB
  Report: btmgmt ID: hci0 state: up address: N/A
Drives:
  Local Storage: total: 456.46 GiB used: 202.71 GiB (44.4%)
  ID-1: /dev/nvme0n1 vendor: Samsung model: SSD 960 EVO 250GB
    size: 232.89 GiB
  ID-2: /dev/sda vendor: Kingston model: SA400S37240G size: 223.57 GiB
Partition:
  ID-1: / size: 210.47 GiB used: 96.06 GiB (45.6%) fs: ext4
    dev: /dev/nvme0n1p2
  ID-2: /boot/efi size: 998 MiB used: 296 KiB (0.0%) fs: vfat
    dev: /dev/nvme0n1p1
Swap:
  ID-1: swap-1 type: partition size: 17 GiB used: 0 KiB (0.0%)
    dev: /dev/nvme0n1p3
Sensors:
  System Temperatures: cpu: 38.0 C mobo: N/A
  Fan Speeds (rpm): N/A
Info:
  Memory: total: 16 GiB available: 15.45 GiB used: 1.71 GiB (11.1%)
  Processes: 251 Uptime: 0m Shell: Zsh inxi: 3.3.35

1 Like

If you boot from Live USB and shutdown, will it occur? This way you can exclude hardware problem.

Also post journalctl -b -1 | eos-sendlog so we can see full logs when you shutdown.

Ok, I posted logs (but it seems too much info).
I didn’t try Live USB, but I will when I have time. The problem is it happens randomly. Only thing I noticed that this freeze occurs a lot when I’ve been using laptop for a while.

But those logs are not from time, when it crashed, right? Also I do not see shutdown in them, something like

Jul 07 21:09:44  systemd[1]: Stopped Monitoring of LVM2 mirrors, snapshots etc. using dmeventd or progress polling.
Jul 07 21:09:44  systemd[1]: Reached target System Shutdown.
Jul 07 21:09:44  systemd[1]: Reached target Late Shutdown Services.
Jul 07 21:09:44  systemd[1]: systemd-poweroff.service: Deactivated successfully.
Jul 07 21:09:44  systemd[1]: Finished System Power Off.
Jul 07 21:09:44  systemd[1]: Reached target System Power Off.
Jul 07 21:09:44  systemd[1]: Shutting down.
Jul 07 21:09:44  systemd-shutdown[1]: Syncing filesystems and block devices.
Jul 07 21:09:44  systemd-shutdown[1]: Sending SIGTERM to remaining processes...
Jul 07 21:09:44  systemd-journald[296]: Received SIGTERM from PID 1 (systemd-shutdow).
Jul 07 21:09:44  systemd-journald[296]: Journal stopped

I see it is not happening on every shutdown. The blinking CapsLock is a sign of a kernel panic and could be caused by the Nvidia drivers. /I uninstalled them on my laptop and shut the gpu off using supergfxctl as I need my battery more than my 1650​:rofl:/. If you need to use your gpu you have the option to install an earlier version of the nvidia drivers, the nvidia-dkms or try and see if noveau will stop the kernel panicking. https://wiki.archlinux.org/title/NVIDIA/Troubleshooting look at the bottom here for a bit of troubleshooting advice.
You can also read a bit in here https://bbs.archlinux.org/viewtopic.php?id=293692
Edit: spelling

Well, yesterday I got different case. I finished my work and check for updates. There was update for kernel and (I think) nvidia software. The plasma interface froze during post-process operation on nvidia package. I couldn’t run any app, it was very glitchy and slow. I couldn’t even access konsole or system terminal. So I used hard power off again. I don’t know is it related or not…
It seems to me that trying to reinstall system would be more effective than trying to find what causes this issue.

There a lot of logs there, and I made them right after one of such cases (I mean next boot was after such crash). Obviously, I used power off button to shutdown it. So maybe it didn’t log anything because it froze and I turn off it in such way.

Read in the links I provided, kernel panics happen during reboots, shutdowns and upgrades. Not saying this is 100% the case on your systemy but likely.

1 Like

Oh my… I just tried to shutdown it again. Waited for like 15 minutes. It didn’t froze, but a lot of operations stuck during the process. I make some photos of it. And posted logs again.

It started with “A stop job running for User Manager for UID 1000…”


Then some time were several “A stop job is running for /dev/disk/{every single partition}”

And finally some time with systemd-shutdown messages.

Nice, we are moving somewhere. Please post output of:

sudo inxi -bDGjz

1 Like

Just noticed the ram, what’s up with that? NVM, misread that one big time.

System:
  Kernel: 6.9.8-arch1-1 arch: x86_64 bits: 64
  Desktop: KDE Plasma v: 6.1.2 Distro: EndeavourOS
Machine:
  Type: Laptop System: LENOVO product: 80NV v: Lenovo ideapad Y700-15ISK
    serial: <filter>
  Mobo: LENOVO model: Allsparks 5A v: NO DPK serial: <filter> UEFI: LENOVO
    v: CDCN54WW date: 06/13/2017
Battery:
  ID-1: BAT0 charge: 25.1 Wh (58.1%) condition: 43.2/57.0 Wh (75.8%)
    volts: 15.4 min: 15.0
CPU:
  Info: quad core Intel Core i7-6700HQ [MT MCP] speed (MHz): avg: 2600
    min/max: 800/3500
Graphics:
  Device-1: Intel HD Graphics 530 driver: i915 v: kernel
  Device-2: NVIDIA GM107M [GeForce GTX 960M] driver: nvidia v: 555.58.02
  Device-3: Intel RealSense 3D Camera (Front F200) driver: uvcvideo
    type: USB
  Display: unspecified server: X.Org v: 21.1.13 with: Xwayland v: 24.1.0
    driver: X: loaded: modesetting,nvidia dri: iris gpu: i915
    resolution: 1920x1080~60Hz
  API: EGL v: 1.5 drivers: iris,nvidia,swrast
    platforms: gbm,x11,surfaceless,device
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: intel mesa v: 24.1.3-arch1.1
    renderer: Mesa Intel HD Graphics 530 (SKL GT2)
  API: Vulkan v: 1.3.279 drivers: nvidia surfaces: xcb,xlib
Network:
  Device-1: Intel Wireless 8260 driver: iwlwifi
  Device-2: Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet
    driver: r8169
Drives:
  Local Storage: total: 456.46 GiB used: 231.26 GiB (50.7%)
  ID-1: /dev/nvme0n1 vendor: Samsung model: SSD 960 EVO 250GB
    size: 232.89 GiB
  ID-2: /dev/sda vendor: Kingston model: SA400S37240G size: 223.57 GiB
Swap:
  ID-1: swap-1 type: partition size: 17 GiB used: 385.2 MiB (2.2%)
    dev: /dev/nvme0n1p3
Info:
  Memory: total: 16 GiB available: 15.45 GiB used: 12.6 GiB (81.5%)
    igpu: 64 MiB
  Processes: 316 Uptime: 3h 30m Shell: Sudo inxi: 3.3.35

We need to go deeper :confused:

You can post output of those:
cat /etc/fstab
lsblk -f

And also check SMART data for disks:

Maybe disk failling, maybe race condition when deactivating partitions.

cat /etc/fstab
#
# Use 'blkid' to print the universally unique identifier for a device; this may
# be used with UUID= as a more robust way to name devices that works even if
# disks are added and removed. See fstab(5).
#
# <file system>             <mount point>  <type>  <options>  <dump>  <pass>
UUID=4B80-E0E6                              /boot/efi                vfat    fmask=0137,dmask=0027   0 2 
UUID=2595fc27-c059-4844-b363-bc8878317fc8   /                        ext4    noatime                 0 1 
UUID=bb3def0c-50c9-4279-8478-fc6d99546603   swap                     swap    defaults                0 0 
tmpfs                                       /tmp                     tmpfs   noatime,mode=1777       0 0 
/dev/sda1                                   /home/makzef/Documents   ext4    nofail                  0 0 

lsblk -f
NAME        FSTYPE FSVER LABEL       UUID                                 FSAVAIL FSUSE% MOUNTPOINTS
sda                                                                                      
└─sda1      ext4   1.0               4d958411-fd20-4f8f-a8b6-99f3bfd7fa20   66,3G    65% /home/makzef/Documents
nvme0n1                                                                                  
├─nvme0n1p1 vfat   FAT32             4B80-E0E6                             997,7M     0% /boot/efi
├─nvme0n1p2 ext4   1.0   endeavouros 2595fc27-c059-4844-b363-bc8878317fc8  109,6G    43% /
└─nvme0n1p3 swap   1     swap        bb3def0c-50c9-4279-8478-fc6d99546603                [SWAP]

SMART

sudo smartctl -a /dev/nvme0
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.9.8-arch1-1] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       Samsung SSD 960 EVO 250GB
Serial Number:                      S3ESNX0JA09839K
Firmware Version:                   2B7QCXE7
PCI Vendor/Subsystem ID:            0x144d
IEEE OUI Identifier:                0x002538
Total NVM Capacity:                 250 059 350 016 [250 GB]
Unallocated NVM Capacity:           0
Controller ID:                      2
NVMe Version:                       1.2
Number of Namespaces:               1
Namespace 1 Size/Capacity:          250 059 350 016 [250 GB]
Namespace 1 Utilization:            122 105 012 224 [122 GB]
Namespace 1 Formatted LBA Size:     512
Namespace 1 IEEE EUI-64:            002538 5a71b04bda
Local Time is:                      Wed Jul 10 10:41:08 2024 EEST
Firmware Updates (0x16):            3 Slots, no Reset required
Optional Admin Commands (0x0007):   Security Format Frmw_DL
Optional NVM Commands (0x001f):     Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat
Log Page Attributes (0x03):         S/H_per_NS Cmd_Eff_Lg
Maximum Data Transfer Size:         512 Pages
Warning  Comp. Temp. Threshold:     77 Celsius
Critical Comp. Temp. Threshold:     79 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     6.04W       -        -    0  0  0  0        0       0
 1 +     5.09W       -        -    1  1  1  1        0       0
 2 +     4.08W       -        -    2  2  2  2        0       0
 3 -   0.0400W       -        -    3  3  3  3      210    1500
 4 -   0.0050W       -        -    4  4  4  4     2200    6000

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 +     512       0         0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning:                   0x00
Temperature:                        31 Celsius
Available Spare:                    100%
Available Spare Threshold:          10%
Percentage Used:                    4%
Data Units Read:                    29 228 001 [14,9 TB]
Data Units Written:                 29 981 311 [15,3 TB]
Host Read Commands:                 444 572 501
Host Write Commands:                381 972 763
Controller Busy Time:               1 536
Power Cycles:                       2 501
Power On Hours:                     4 115
Unsafe Shutdowns:                   215
Media and Data Integrity Errors:    0
Error Information Log Entries:      6 354
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0
Temperature Sensor 1:               31 Celsius
Temperature Sensor 2:               39 Celsius

Error Information (NVMe Log 0x01, 16 of 64 entries)
Num   ErrCount  SQId   CmdId  Status  PELoc          LBA  NSID    VS  Message
  0       6354     0  0x0014  0x4004      -            0     0     -  Invalid Field in Command
  1       6353     0  0x0018  0x4004      -            0     0     -  Invalid Field in Command
  2       6352     0  0x0019  0x4004      -            0     0     -  Invalid Field in Command
  3       6351     0  0x0018  0x4004      -            0     0     -  Invalid Field in Command
  4       6350     0  0x000d  0x4004      -            0     0     -  Invalid Field in Command
  5       6349     0  0x4017  0x4004      -            0     0     -  Invalid Field in Command
  6       6348     0  0x400a  0x4004      -            0     0     -  Invalid Field in Command
  7       6347     0  0x1004  0x4004      -            0     0     -  Invalid Field in Command
  8       6346     0  0x000c  0x4004      -            0     0     -  Invalid Field in Command
  9       6345     0  0x000e  0x4004      -            0     0     -  Invalid Field in Command
 10       6344     0  0x0004  0x4004      -            0     0     -  Invalid Field in Command
 11       6343     0  0x0008  0x4004      -            0     0     -  Invalid Field in Command
 12       6342     0  0x001c  0x4004      -            0     0     -  Invalid Field in Command
 13       6341     0  0x0004  0x4004      -            0     0     -  Invalid Field in Command
 14       6340     0  0x0004  0x4004      -            0     0     -  Invalid Field in Command
 15       6339     0  0x0014  0x4004      -            0     0     -  Invalid Field in Command
... (48 entries not read)

Self-tests not supported
sudo smartctl -a /dev/sda  
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.9.8-arch1-1] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Phison Driven SSDs
Device Model:     KINGSTON SA400S37240G
Serial Number:    50026B728256B561
LU WWN Device Id: 5 02b2a2 01d1c1b1a
Add. Product Id:  mavlsata
Firmware Version: 03090004
User Capacity:    240 057 409 536 bytes [240 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
TRIM Command:     Available
Device is:        In smartctl database 7.3/5528
ATA Version is:   ACS-3 T13/2161-D revision 4
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Jul 10 10:43:17 2024 EEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x02) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                (    0) seconds.
Offline data collection
capabilities:                    (0x35) SMART execute Offline immediate.
                                        No Auto Offline data collection support.
                                        Abort Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        No Selective Self-test supported.
SMART capabilities:            (0x0002) Does not save SMART data before
                                        entering power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x00) Error logging NOT supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (   1) minutes.
Conveyance self-test routine
recommended polling time:        (   1) minutes.

SMART Attributes Data Structure revision number: 5
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x0032   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       5039
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       770
148 Unknown_Attribute       0x0000   100   100   000    Old_age   Offline      -       0
149 Unknown_Attribute       0x0000   100   100   000    Old_age   Offline      -       0
167 Write_Protect_Mode      0x0000   100   100   000    Old_age   Offline      -       0
168 SATA_Phy_Error_Count    0x0012   100   100   000    Old_age   Always       -       0
169 Bad_Block_Rate          0x0000   100   100   000    Old_age   Offline      -       168
170 Bad_Blk_Ct_Lat/Erl      0x0000   100   100   010    Old_age   Offline      -       0/136
172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
173 MaxAvgErase_Ct          0x0000   100   100   000    Old_age   Offline      -       0
181 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
182 Erase_Fail_Count        0x0000   100   100   000    Old_age   Offline      -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
192 Unsafe_Shutdown_Count   0x0012   100   100   000    Old_age   Always       -       100
194 Temperature_Celsius     0x0022   023   025   000    Old_age   Always       -       23 (Min/Max 22/25)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
199 SATA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       0
218 CRC_Error_Count         0x0032   100   100   000    Old_age   Always       -       0
231 SSD_Life_Left           0x0000   097   097   000    Old_age   Offline      -       97
233 Flash_Writes_GiB        0x0032   100   100   000    Old_age   Always       -       9413
241 Lifetime_Writes_GiB     0x0032   100   100   000    Old_age   Always       -       8228
242 Lifetime_Reads_GiB      0x0032   100   100   000    Old_age   Always       -       2182
244 Average_Erase_Count     0x0000   100   100   000    Old_age   Offline      -       38
245 Max_Erase_Count         0x0000   100   100   000    Old_age   Offline      -       46
246 Total_Erase_Count       0x0000   100   100   000    Old_age   Offline      -       16297

SMART Error Log not supported

SMART Self-test Log not supported

Selective Self-tests/Logging not supported

The above only provides legacy SMART information - try 'smartctl -x' for more

Thank you all for responses. So I will described what I try and what did I find out.

As I mentioned, it got worse: updates could cause system freeze or plasma could freeze. So I decided to reinstall the system hoping it fix all my problems. This time I use usual installation (without nvidia dkms) and install the system. Also add LTS kernel and switched to wayland. Installed many apps and finally decided to install nvidia drivers, I need them after all. So I use nvidia-inst. For one day it was fine, there was no issue until… On another day the issues appears again. First plasma glitched and froze again. Then shutdown issues again.

So I decided to look more info about what @Steelmountain wrote and find a thread on nvidia forum which exactly described my issues… And it turned out the problem persist for a long time.
I changed back to nouveau drivers and according to some user posts on nvidia forum installed nvidia-open drivers. As I see, it probably has some other issues, but I need some performance and nouveau not so good at it. For now I will test these nvidia-open drivers and hope it would be better.

Yeah, it did sound like nvidia driver induced kernel panics. I hope you succeed in getting the performance you need by using the nvidia-open drivers. Hopefully the problem gets solved on the nvidia side soon.

I did not succeed. The driver is not loading. Because it seems do not support older graphical cards like mine.

I looked into nvidia site and there is also 550.100 driver (stable branch), but I don’t see how I can installed it on my system. So either I try nouveau or use 555 driver with all its issues(( Well, also there is 535 driver in AUR (that the latest I saw there). Nvidia gets on my nerve(