Jump to content

Recommended Posts

abarbarian
Posted (edited)

You can find out a lot of information on your ssd/nvme drives with smartmontools. In the past smartmontools did not support nvme drives and you had to use the nvme-cli nvme.

 

Smartmontools Documentation

 

Here is a quick look at the smartmontool smartctl. I have one ssd and two nvme drives in my rig and here are the results of running,

 

smartctl -i   basic information 

smartctl -a   more information

smartctl -x   even more information

for nvme drives "smartctl -a" and "smartctl -x" give the same informarion only

 

 

The ssd drive.

 

Brain engaged-->16:40:10-->Fri Oct 18-->bloodaxe
-->smartctl -i /dev/sda
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.11.3-arch1-1] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Samsung based SSDs
Device Model:     Samsung SSD 860 EVO 500GB
Serial Number:    S3Z2NB0K369918E
LU WWN Device Id: 5 002538 e40246241
Firmware Version: RVT01B6Q
User Capacity:    500,107,862,016 bytes [500 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
TRIM Command:     Available, deterministic, zeroed
Device is:        In smartctl database 7.3/5528
ATA Version is:   ACS-4 T13/BSR INCITS 529 revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Oct 18 16:40:14 2024 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

 

Brain engaged-->16:40:14-->Fri Oct 18-->bloodaxe
-->smartctl -a /dev/sda
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.11.3-arch1-1] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Samsung based SSDs
Device Model:     Samsung SSD 860 EVO 500GB
Serial Number:    S3Z2NB0K369918E
LU WWN Device Id: 5 002538 e40246241
Firmware Version: RVT01B6Q
User Capacity:    500,107,862,016 bytes [500 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
TRIM Command:     Available, deterministic, zeroed
Device is:        In smartctl database 7.3/5528
ATA Version is:   ACS-4 T13/BSR INCITS 529 revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Oct 18 16:44:34 2024 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                (    0) seconds.
Offline data collection
capabilities:                    (0x53) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  85) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   094   094   000    Old_age   Always       -       26271
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       1317
177 Wear_Leveling_Count     0x0013   099   099   000    Pre-fail  Always       -       12
179 Used_Rsvd_Blk_Cnt_Tot   0x0013   100   100   010    Pre-fail  Always       -       0
181 Program_Fail_Cnt_Total  0x0032   100   100   010    Old_age   Always       -       0
182 Erase_Fail_Count_Total  0x0032   100   100   010    Old_age   Always       -       0
183 Runtime_Bad_Block       0x0013   100   100   010    Pre-fail  Always       -       0
187 Uncorrectable_Error_Cnt 0x0032   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0032   075   053   000    Old_age   Always       -       25
195 ECC_Error_Rate          0x001a   200   200   000    Old_age   Always       -       0
199 CRC_Error_Count         0x003e   100   100   000    Old_age   Always       -       0
235 POR_Recovery_Count      0x0012   099   099   000    Old_age   Always       -       317
241 Total_LBAs_Written      0x0032   099   099   000    Old_age   Always       -       9525864481

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

The above only provides legacy SMART information - try 'smartctl -x' for more

 

This is the extra information that along with all the information from "smartctl -a" comes with "smartctl -x"



SCT Status Version:                  3
SCT Version (vendor specific):       256 (0x0100)
Device State:                        Active (0)
Current Temperature:                    25 Celsius
Power Cycle Min/Max Temperature:     25/40 Celsius
Lifetime    Min/Max Temperature:     19/47 Celsius
Specified Max Operating Temperature:    55 Celsius
Under/Over Temperature Limit Count:   0/0
SMART Status:                        0xc24f (PASSED)

SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        10 minutes
Min/Max recommended Temperature:      0/70 Celsius
Min/Max Temperature Limit:            0/70 Celsius
Temperature History Size (Index):    128 (103)

Index    Estimated Time   Temperature Celsius
 104    2024-10-17 19:30    22  ***
 ...    ..(  7 skipped).    ..  ***
 112    2024-10-17 20:50    22  ***
 113    2024-10-17 21:00     ?  -
 114    2024-10-17 21:10    22  ***
 115    2024-10-17 21:20    24  *****
 116    2024-10-17 21:30    25  ******
 117    2024-10-17 21:40    25  ******
 118    2024-10-17 21:50    25  ******
 119    2024-10-17 22:00    26  *******
 120    2024-10-17 22:10    25  ******
 121    2024-10-17 22:20    25  ******
 122    2024-10-17 22:30    25  ******
 123    2024-10-17 22:40    26  *******
 124    2024-10-17 22:50    26  *******
 125    2024-10-17 23:00    26  *******
 126    2024-10-17 23:10    25  ******
 ...    ..(  4 skipped).    ..  ******
   3    2024-10-18 00:00    25  ******
   4    2024-10-18 00:10    26  *******
   5    2024-10-18 00:20    25  ******
 ...    ..(  3 skipped).    ..  ******
   9    2024-10-18 01:00    25  ******
  10    2024-10-18 01:10    26  *******
  11    2024-10-18 01:20    26  *******
  12    2024-10-18 01:30    25  ******
  13    2024-10-18 01:40    26  *******
 ...    ..(  3 skipped).    ..  *******
  17    2024-10-18 02:20    26  *******
  18    2024-10-18 02:30    25  ******
  19    2024-10-18 02:40     ?  -
  20    2024-10-18 02:50    22  ***
  21    2024-10-18 03:00    22  ***
  22    2024-10-18 03:10    23  ****
  23    2024-10-18 03:20    23  ****
  24    2024-10-18 03:30    23  ****
  25    2024-10-18 03:40    24  *****
  26    2024-10-18 03:50    24  *****
  27    2024-10-18 04:00    24  *****
  28    2024-10-18 04:10    26  *******
  29    2024-10-18 04:20    25  ******
  30    2024-10-18 04:30    25  ******
  31    2024-10-18 04:40    26  *******
  32    2024-10-18 04:50    25  ******
 ...    ..(  2 skipped).    ..  ******
  35    2024-10-18 05:20    25  ******
  36    2024-10-18 05:30    24  *****
  37    2024-10-18 05:40    25  ******
 ...    ..(  2 skipped).    ..  ******
  40    2024-10-18 06:10    25  ******
  41    2024-10-18 06:20    26  *******
  42    2024-10-18 06:30    25  ******
 ...    ..( 11 skipped).    ..  ******
  54    2024-10-18 08:30    25  ******
  55    2024-10-18 08:40    24  *****
  56    2024-10-18 08:50    25  ******
 ...    ..(  4 skipped).    ..  ******
  61    2024-10-18 09:40    25  ******
  62    2024-10-18 09:50     ?  -
  63    2024-10-18 10:00    22  ***
  64    2024-10-18 10:10    23  ****
  65    2024-10-18 10:20    23  ****
  66    2024-10-18 10:30    24  *****
 ...    ..( 12 skipped).    ..  *****
  79    2024-10-18 12:40    24  *****
  80    2024-10-18 12:50    25  ******
 ...    ..( 15 skipped).    ..  ******
  96    2024-10-18 15:30    25  ******
  97    2024-10-18 15:40    24  *****
 ...    ..(  2 skipped).    ..  *****
 100    2024-10-18 16:10    24  *****
 101    2024-10-18 16:20     ?  -
 102    2024-10-18 16:30    25  ******
 103    2024-10-18 16:40    25  ******

SCT Error Recovery Control:
           Read: Disabled
          Write: Disabled

Device Statistics (GP Log 0x04)
Page  Offset Size        Value Flags Description
0x01  =====  =               =  ===  == General Statistics (rev 1) ==
0x01  0x008  4            1317  ---  Lifetime Power-On Resets
0x01  0x010  4           26271  ---  Power-on Hours
0x01  0x018  6      9525864481  ---  Logical Sectors Written
0x01  0x020  6        11266366  ---  Number of Write Commands
0x01  0x028  6     10610873701  ---  Logical Sectors Read
0x01  0x030  6        29424397  ---  Number of Read Commands
0x01  0x038  6          699000  ---  Date and Time TimeStamp
0x04  =====  =               =  ===  == General Errors Statistics (rev 1) ==
0x04  0x008  4               0  ---  Number of Reported Uncorrectable Errors
0x04  0x010  4              11  ---  Resets Between Cmd Acceptance and Completion
0x05  =====  =               =  ===  == Temperature Statistics (rev 1) ==
0x05  0x008  1              25  ---  Current Temperature
0x05  0x020  1              47  ---  Highest Temperature
0x05  0x028  1              19  ---  Lowest Temperature
0x05  0x058  1              55  ---  Specified Maximum Operating Temperature
0x06  =====  =               =  ===  == Transport Statistics (rev 1) ==
0x06  0x008  4            5211  ---  Number of Hardware Resets
0x06  0x010  4               0  ---  Number of ASR Events
0x06  0x018  4               0  ---  Number of Interface CRC Errors
0x07  =====  =               =  ===  == Solid State Device Statistics (rev 1) ==
0x07  0x008  1               0  N--  Percentage Used Endurance Indicator
                                |||_ C monitored condition met
                                ||__ D supports DSN
                                |___ N normalized value

Pending Defects log (GP Log 0x0c) not supported

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0008  2            0  Device-to-host non-data FIS retries
0x0009  2            2  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2            2  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000d  2            0  Non-CRC errors within host-to-device FIS
0x000f  2            0  R_ERR response for host-to-device data FIS, CRC
0x0010  2            0  R_ERR response for host-to-device data FIS, non-CRC
0x0012  2            0  R_ERR response for host-to-device non-data FIS, CRC
0x0013  2            0  R_ERR response for host-to-device non-data FIS, non-CRC

 

The older 2020 nvme drive.

 

Brain engaged-->17:20:39-->Fri Oct 18-->bloodaxe
-->smartctl -x /dev/nvme1
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.11.3-arch1-1] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       ADATA SX8200PNP
Serial Number:                      2K4529A71GKL
Firmware Version:                   32B3T8EA
PCI Vendor/Subsystem ID:            0x1cc1
IEEE OUI Identifier:                0x000000
Controller ID:                      1
NVMe Version:                       1.3
Number of Namespaces:               1
Namespace 1 Size/Capacity:          1,024,209,543,168 [1.02 TB]
Namespace 1 Utilization:            694,033,825,792 [694 GB]
Namespace 1 Formatted LBA Size:     512
Local Time is:                      Fri Oct 18 17:47:28 2024 BST
Firmware Updates (0x14):            2 Slots, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL Self_Test
Optional NVM Commands (0x005f):     Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp
Log Page Attributes (0x0f):         S/H_per_NS Cmd_Eff_Lg Ext_Get_Lg Telmtry_Lg
Maximum Data Transfer Size:         64 Pages
Warning  Comp. Temp. Threshold:     75 Celsius
Critical Comp. Temp. Threshold:     80 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     9.00W       -        -    0  0  0  0        0       0
 1 +     4.60W       -        -    1  1  1  1        0       0
 2 +     3.80W       -        -    2  2  2  2        0       0
 3 -   0.0450W       -        -    3  3  3  3     2000    2000
 4 -   0.0040W       -        -    4  4  4  4    15000   15000

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 +     512       0         0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning:                   0x00
Temperature:                        40 Celsius
Available Spare:                    100%
Available Spare Threshold:          10%
Percentage Used:                    2%
Data Units Read:                    46,388,015 [23.7 TB]
Data Units Written:                 25,128,979 [12.8 TB]
Host Read Commands:                 289,434,670
Host Write Commands:                294,010,441
Controller Busy Time:               5,737
Power Cycles:                       2,290
Power On Hours:                     9,451
Unsafe Shutdowns:                   260
Media and Data Integrity Errors:    25
Error Information Log Entries:      25
Warning  Comp. Temperature Time:    14
Critical Comp. Temperature Time:    0
Thermal Temp. 1 Transition Count:   158
Thermal Temp. 2 Transition Count:   1
Thermal Temp. 1 Total Time:         742
Thermal Temp. 2 Total Time:         9

Error Information (NVMe Log 0x01, 16 of 256 entries)
No Errors Logged

Self-test Log (NVMe Log 0x06)
Self-test status: No self-test in progress
No Self-tests Logged

 

The newer 2022 nvme drive.

 

Brain engaged-->17:18:42-->Fri Oct 18-->bloodaxe
-->smartctl -x /dev/nvme0
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.11.3-arch1-1] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       SHPP41-1000GM
Serial Number:                      SJB8N828712009333
Firmware Version:                   51060A20
PCI Vendor/Subsystem ID:            0x1c5c
IEEE OUI Identifier:                0xace42e
Controller ID:                      0
NVMe Version:                       1.4
Number of Namespaces:               1
Namespace 1 Size/Capacity:          1,000,204,886,016 [1.00 TB]
Namespace 1 Formatted LBA Size:     512
Namespace 1 IEEE EUI-64:            ace42e 002649fc00
Local Time is:                      Fri Oct 18 17:20:39 2024 BST
Firmware Updates (0x16):            3 Slots, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL Self_Test
Optional NVM Commands (0x00df):     Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp Verify
Log Page Attributes (0x1e):         Cmd_Eff_Lg Ext_Get_Lg Telmtry_Lg Pers_Ev_Lg
Maximum Data Transfer Size:         64 Pages
Warning  Comp. Temp. Threshold:     86 Celsius
Critical Comp. Temp. Threshold:     87 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     7.50W       -        -    0  0  0  0        5     305
 1 +   3.9000W       -        -    1  1  1  1       30     330
 2 +   1.5000W       -        -    2  2  2  2      100     400
 3 -   0.0500W       -        -    3  3  3  3      500    1500
 4 -   0.0050W       -        -    4  4  4  4     1000    9000

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 +     512       0         0
 1 -    4096       0         0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning:                   0x00
Temperature:                        39 Celsius
Available Spare:                    100%
Available Spare Threshold:          10%
Percentage Used:                    0%
Data Units Read:                    14,335,000 [7.33 TB]
Data Units Written:                 13,460,530 [6.89 TB]
Host Read Commands:                 239,613,128
Host Write Commands:                209,145,051
Controller Busy Time:               16,249
Power Cycles:                       826
Power On Hours:                     3,938
Unsafe Shutdowns:                   196
Media and Data Integrity Errors:    0
Error Information Log Entries:      0
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0
Temperature Sensor 1:               33 Celsius
Temperature Sensor 2:               41 Celsius

Error Information (NVMe Log 0x01, 16 of 256 entries)
No Errors Logged

Self-test Log (NVMe Log 0x06)
Self-test status: No self-test in progress
No Self-tests Logged

 

Just for comparison here is information from the nvme-cli tool "nvme" run on the older 2020 nvme drive.

 

Brain engaged-->16:35:34-->Fri Oct 18-->bloodaxe
-->nvme smart-log /dev/nvme1
Smart Log for NVME device:nvme1 namespace-id:ffffffff
critical_warning                        : 0
temperature                             : 35 °C (308 K)
available_spare                         : 100%
available_spare_threshold               : 10%
percentage_used                         : 2%
endurance group critical warning summary: 0
Data Units Read                         : 46,388,015 (23.75 TB)
Data Units Written                      : 25,128,979 (12.87 TB)
host_read_commands                      : 289,434,670
host_write_commands                     : 294,010,441
controller_busy_time                    : 5,737
power_cycles                            : 2,290
power_on_hours                          : 9,450
unsafe_shutdowns                        : 260
media_errors                            : 25
num_err_log_entries                     : 25
Warning Temperature Time                : 14
Critical Composite Temperature Time     : 0
Thermal Management T1 Trans Count       : 158
Thermal Management T2 Trans Count       : 1
Thermal Management T1 Total Time        : 742
Thermal Management T2 Total Time        : 9

 

😎

Edited by abarbarian
  • Like 1

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...