Smart block erase count

I ran smartctl --test=long on my SSD and checked the results afterwards. The SMART block erase count seems ridiculously high. This is running on btrfs and it’s a fairly fresh install. I have btrfs-assistant-git with snapper installed.

fstab

[ricklinux@rick-ms7c37 ~]$ cat /etc/fstab
# /etc/fstab: static file system information.
#
# Use 'blkid' to print the universally unique identifier for a device; this may
# be used with UUID= as a more robust way to name devices that works even if
# disks are added and removed. See fstab(5).
#
# <file system>             <mount point>  <type>  <options>  <dump>  <pass>
UUID=160D-667B                            /boot/efi      vfat    defaults,noatime 0 2
UUID=5763454c-2b31-4858-aeaa-b717a836c98d /              btrfs   subvol=/@,defaults,noatime,compress=zstd 0 0
UUID=5763454c-2b31-4858-aeaa-b717a836c98d /home          btrfs   subvol=/@home,defaults,noatime,compress=zstd 0 0
UUID=5763454c-2b31-4858-aeaa-b717a836c98d /var/cache     btrfs   subvol=/@cache,defaults,noatime,compress=zstd 0 0
UUID=5763454c-2b31-4858-aeaa-b717a836c98d /var/log       btrfs   subvol=/@log,defaults,noatime,compress=zstd 0 0
UUID=5763454c-2b31-4858-aeaa-b717a836c98d /swap          btrfs   subvol=/@swap,defaults,noatime 0 0
/swap/swapfile                            swap           swap    defaults   0 0
tmpfs                                     /tmp           tmpfs   defaults,noatime,mode=1777 0 0
[ricklinux@rick-ms7c37 ~]$ 

smart test

[ricklinux@rick-ms7c37 ~]$ sudo smartctl -a /dev/sda
[sudo] password for ricklinux: 
smartctl 7.3 2022-02-28 r5338 [x86_64-linux-5.17.1-arch1-1] (local build)
Copyright (C) 2002-22, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     WD Blue / Red / Green SSDs
Device Model:     WDC  WDS500G2B0A-00SM50
Serial Number:    19432C804100
LU WWN Device Id: 5 001b44 8b19db68c
Firmware Version: 411030WD
User Capacity:    500,107,862,016 bytes [500 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
TRIM Command:     Available, deterministic, zeroed
Device is:        In smartctl database 7.3/5319
ATA Version is:   ACS-4 T13/BSR INCITS 529 revision 5
SATA Version is:  SATA 3.3, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Apr  1 21:08:45 2022 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                (    0) seconds.
Offline data collection
capabilities:                    (0x11) SMART execute Offline immediate.
                                        No Auto Offline data collection support.
                                        Suspend Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        No Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  10) minutes.

SMART Attributes Data Structure revision number: 4
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   ---    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   ---    Old_age   Always       -       6478
 12 Power_Cycle_Count       0x0032   100   100   ---    Old_age   Always       -       2094
165 Block_Erase_Count       0x0032   100   100   ---    Old_age   Always       -       799149199355
166 Minimum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       23
167 Max_Bad_Blocks_per_Die  0x0032   100   100   ---    Old_age   Always       -       38
168 Maximum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       64
169 Total_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       154
170 Grown_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       0
171 Program_Fail_Count      0x0032   100   100   ---    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   ---    Old_age   Always       -       0
173 Average_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       42
174 Unexpected_Power_Loss   0x0032   100   100   ---    Old_age   Always       -       88
184 End-to-End_Error        0x0032   100   100   ---    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   ---    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   ---    Old_age   Always       -       9
194 Temperature_Celsius     0x0022   073   042   ---    Old_age   Always       -       27 (Min/Max 17/42)
199 UDMA_CRC_Error_Count    0x0032   100   100   ---    Old_age   Always       -       2
230 Media_Wearout_Indicator 0x0032   008   008   ---    Old_age   Always       -       0x081204140812
232 Available_Reservd_Space 0x0033   100   100   004    Pre-fail  Always       -       100
233 NAND_GB_Written_TLC     0x0032   100   100   ---    Old_age   Always       -       20513
234 NAND_GB_Written_SLC     0x0032   100   100   ---    Old_age   Always       -       42192
241 Host_Writes_GiB         0x0030   253   253   ---    Old_age   Offline      -       36236
242 Host_Reads_GiB          0x0030   253   253   ---    Old_age   Offline      -       10002
244 Temp_Throttle_Status    0x0032   000   100   ---    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%      6470         -
# 2  Short offline       Completed without error       00%      5905         -

Selective Self-tests/Logging not supported

[ricklinux@rick-ms7c37 ~]$ 

Is my SSD getting destroyed? I don’t know how to read this.

If you’re referring to the Block_Erase_Count raw value, then keep in mind that the raw values don’t mean much. Each one is an internal value that the disk understands but that has little meaning to anyone else looking at it. The VALUE/WORST/THRESH values are generally a more useful guide to disk health (and these all indicate your disk is pretty darned new).
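As a rough illustration of reading the normalized columns instead of the raw ones, here is a minimal Python sketch. The sample rows are copied from the smartctl output above. The function names `parse_attributes` and `at_or_below_threshold` are made up for this example, and the parsing assumes the ten-column layout that smartctl printed in this thread. A threshold shown as `---` is treated as "no threshold published".

```python
# Sample rows copied from the smartctl output earlier in the thread.
SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0032   100   100   ---    Old_age   Always       -       0
165 Block_Erase_Count       0x0032   100   100   ---    Old_age   Always       -       799149199355
232 Available_Reservd_Space 0x0033   100   100   004    Pre-fail  Always       -       100
"""

def parse_attributes(text):
    """Parse smartctl attribute rows into dicts (assumes the layout shown above)."""
    rows = []
    for line in text.splitlines():
        # RAW_VALUE may contain spaces, so stop splitting after nine columns.
        parts = line.split(None, 9)
        if len(parts) < 10 or not parts[0].isdigit():
            continue  # skip headers and blank lines
        rows.append({
            "id": int(parts[0]),
            "name": parts[1],
            "value": int(parts[3]),
            "worst": int(parts[4]),
            # "---" means no threshold is shown for this attribute
            "thresh": None if parts[5] == "---" else int(parts[5]),
            "raw": parts[9].strip(),
        })
    return rows

def at_or_below_threshold(rows):
    """Return names of attributes whose normalized VALUE has reached THRESH."""
    return [r["name"] for r in rows
            if r["thresh"] is not None and r["value"] <= r["thresh"]]

attrs = parse_attributes(SAMPLE)
for a in attrs:
    print(a["name"], a["value"], a["worst"], a["thresh"])
print("At or below threshold:", at_or_below_threshold(attrs))
```

On these sample rows nothing is at or below its threshold, and the huge Block_Erase_Count raw number plays no part in that check.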

Keep in mind this is my testing desktop and I’m putting it through hell. It gets erased and reloaded at the flip of a switch. I use it with VMs installed, deleting, erasing, formatting, etc. Many times I have had to hard restart it or shut it down. I’m not that concerned if I destroy the disk. They aren’t expensive anyway. I just reloaded it this week, so it’s fairly fresh. This is the first time I have run a test on it. This drive has been wiped many times, including secure erase. I just wondered, as I’m not that familiar with the readout.

Edit: I was wondering more if it had something to do with btrfs.

Assuming the Host_Writes_GiB raw value (36236) is actually in GB, then over 6478 power-on hours (about 270 days) you have written around 134 GB per day to the disk, which for a 500 GB drive is 0.27 “drive writes per day”. You’d need to check the drive specifications to find out what its life expectancy is based on those values.
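The arithmetic behind those figures can be reproduced with a few lines of Python. This is only a sketch: it takes the Host_Writes_GiB and Power_On_Hours raw values from the smartctl output above, treats the write total as GB (the same assumption as in the post), and counts power-on time, not calendar days.

```python
# Figures taken from the smartctl output earlier in the thread.
host_writes_gb = 36236   # attribute 241 raw value, treated as GB (an assumption)
power_on_hours = 6478    # attribute 9 raw value
capacity_gb = 500        # nominal drive capacity

days_powered_on = power_on_hours / 24
gb_per_day = host_writes_gb / days_powered_on
dwpd = gb_per_day / capacity_gb  # "drive writes per day"

print(f"{days_powered_on:.0f} days powered on")
print(f"{gb_per_day:.0f} GB written per day")
print(f"{dwpd:.2f} drive writes per day")
```

To judge life expectancy you would still have to compare the result with the endurance rating in the drive's specifications, which is not quoted in this thread.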

I think the key here though, if you’re using a drive heavily for testing etc., is to consider the drive as a consumable and assume you’ll get maybe five years’ use out of it, and budget to replace it when it fails.

Or, build in tmpfs. :grin:

Ya… I’m not concerned about life expectancy. It works fantastic on KDE and I have no issues. I just wondered because I don’t really know that much about the btrfs setup other than how to install it, set it up and make it work. Understanding all the ins and outs of how it works and doing it the manual way is quite daunting because I don’t have a wealth of knowledge about it. Just the basics. I was more curious about the readout. It’s just a Western Digital Blue SSD 500 GB. Nothing special really. Decent drives. :man_shrugging:

@ricklinux

I received the following output from the smartctl command:

deweshk@HomeDesktop:~
➤ sudo smartctl -a /dev/sda
[sudo] password for deweshk: 
smartctl 7.3 2022-02-28 r5338 [x86_64-linux-5.17.1-arch1-1] (local build)
Copyright (C) 2002-22, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Green
Device Model:     WDC WD5000AZRX-00L4HB0
Serial Number:    WD-WCC4JJUHUN78
LU WWN Device Id: 5 0014ee 20addbcc7
Firmware Version: 01.01A01
User Capacity:    500,107,862,016 bytes [500 GB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database 7.3/5319
ATA Version is:   ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Sat Apr  2 09:24:09 2022 IST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                ( 7260) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  85) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x3035) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       2
  3 Spin_Up_Time            0x0027   138   132   021    Pre-fail  Always       -       4066
  4 Start_Stop_Count        0x0032   096   096   000    Old_age   Always       -       4825
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       1
  7 Seek_Error_Rate         0x002e   200   178   000    Old_age   Always       -       9
  9 Power_On_Hours          0x0032   095   095   000    Old_age   Always       -       4098
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   096   096   000    Old_age   Always       -       4408
192 Power-Off_Retract_Count 0x0032   199   199   000    Old_age   Always       -       943
193 Load_Cycle_Count        0x0032   165   165   000    Old_age   Always       -       106996
194 Temperature_Celsius     0x0022   115   096   000    Old_age   Always       -       28
196 Reallocated_Event_Count 0x0032   199   199   000    Old_age   Always       -       1
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       2
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
ATA Error Count: 2
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 2 occurred at disk power-on lifetime: 3738 hours (155 days + 18 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 61 00 00 00 00 a0  Device Fault; Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c6 00 00 00 00 00 a0 00      00:04:42.716  SET MULTIPLE MODE
  ef 03 0c 00 00 00 a0 00      00:04:42.706  SET FEATURES [Set transfer mode]

Error 1 occurred at disk power-on lifetime: 3738 hours (155 days + 18 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 61 0c 00 00 00 a0  Device Fault; Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  ef 03 0c 00 00 00 a0 00      00:04:42.706  SET FEATURES [Set transfer mode]

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%      4096         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

What should I make of the errors visible in the output?
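For reference, the ER and ST columns in the error log above are bit fields, and the text "Device Fault; Error: ABRT" is those bits spelled out. Below is a small Python sketch that decodes them using bit names from the ATA command set. The helper name `decode_bits` is made up for this example, obsolete and command-specific bits are left out of the tables, and the decoding says nothing about why the drive aborted the command.

```python
# Bit names for the ATA Status and Error registers (bit number -> name).
# Obsolete or command-specific bits are intentionally omitted.
STATUS_BITS = {7: "BSY", 6: "DRDY", 5: "DF", 3: "DRQ", 0: "ERR"}
ERROR_BITS = {7: "ICRC", 6: "UNC", 4: "IDNF", 2: "ABRT"}

def decode_bits(value, names):
    """Return the names of the bits set in value, highest bit first."""
    return [name for bit, name in sorted(names.items(), reverse=True)
            if value & (1 << bit)]

# ER = 04 and ST = 61 (hex), as logged for both errors above.
print("Error register: ", decode_bits(0x04, ERROR_BITS))
print("Status register:", decode_bits(0x61, STATUS_BITS))
```

Here DF is the device-fault bit, ERR says the error register is valid, and ABRT means the command was aborted. Both entries were logged at 3738 power-on hours, about 4 minutes 42 seconds after power-up, following the SET FEATURES [Set transfer mode] command.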

Your mount options look good.
I was left wondering, does snapper need a mountpoint in fstab?
I’m not a snapper user, but I was under the impression that it does.
Gentoo wiki - Arch Wiki - Reddit

This looks like a hard drive? I do see the errors reported, but I’m no expert at evaluating what they really mean. They look like errors at the start of a power-on? I would assume those errors come from a sudden event such as a power loss or hard start. I would run the long test first, using the command below. It takes 10 minutes or more.

sudo smartctl --test=long /dev/sda

The test runs offline in the background, and smartctl tells you at the start how long it will take. After that time has passed, run smartctl -a again to show what it reported.
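The start-then-check sequence can also be scripted. The Python sketch below is only an illustration: the function names are made up, it assumes smartmontools is installed and the script runs as root, and the parser assumes the self-test log layout shown in this thread. The sample rows come from the first smartctl output above.

```python
import re
import subprocess

def start_long_test(device):
    """Ask the drive to start an extended self-test (needs root and smartctl)."""
    # smartctl's exit status is a bitmask, so it is not treated as pass/fail here.
    subprocess.run(["smartctl", "--test=long", device])

def read_selftest_log(device):
    """Return the text of the drive's self-test log."""
    result = subprocess.run(["smartctl", "-l", "selftest", device],
                            capture_output=True, text=True)
    return result.stdout

# Matches rows such as:
# "# 1  Extended offline    Completed without error       00%      6470         -"
ROW = re.compile(r"^#\s*(\d+)\s{2,}(.+?)\s{2,}(.+?)\s{2,}(\d+)%\s+(\d+)\s+(\S+)\s*$")

def parse_selftest_log(text):
    """Parse self-test log rows into dicts (assumes the layout shown above)."""
    tests = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            num, desc, status, remaining, hours, lba = m.groups()
            tests.append({
                "num": int(num),
                "description": desc,
                "status": status,
                "remaining_pct": int(remaining),
                "lifetime_hours": int(hours),
                "first_error_lba": lba,
            })
    return tests

SAMPLE = """\
# 1  Extended offline    Completed without error       00%      6470         -
# 2  Short offline       Completed without error       00%      5905         -
"""
for t in parse_selftest_log(SAMPLE):
    print(t["description"], "->", t["status"], "at", t["lifetime_hours"], "hours")
```

On a real system you would call `start_long_test("/dev/sda")`, wait for the polling time smartctl reports, and then pass `read_selftest_log("/dev/sda")` to `parse_selftest_log`.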

The time required for me is huge; it will take 85 minutes to complete.

Yes, this one is an HDD. It is not the root drive. Surprisingly for me, the root partition is on /dev/sdb, which is an SSD.

Well, let it run, set a timer for 90+ minutes or so, and then check the result afterwards.

Sir, the test has completed. Here is the output:

deweshk@HomeDesktop:~
➤ sudo smartctl -l selftest /dev/sda
[sudo] password for deweshk: 
smartctl 7.3 2022-02-28 r5338 [x86_64-linux-5.17.1-arch1-1] (local build)
Copyright (C) 2002-22, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%      4101         -
# 2  Short offline       Completed without error       00%      4096         -

Does that satisfy you?

So should I conclude that this HDD is still in healthy condition?

Also, I want to show this:
[image]

Was that message from before?

Yes

The test result is good, so that message is probably related to the earlier report showing the startup errors.

Is it related to a troublesome experience in Windows?

Are you able to reset those messages?

How should I proceed?

What do you mean? I just asked whether the report on that SMART access page can be reset.