TrueNAS ZFS DEGRADED pool...now what

DozerD42 · October 11, 2022, 12:40am

Hi there! I am a super ultra noob to ZFS and TrueNAS. I am storing photos and old documents and things in my NAS, and I have been wanting to upgrade. After getting a new Fractal Define 7 case and a larger power supply, I moved the six 4TB Seagate Iron Wolf disks over to the new case. My pool is in raid Z2.

After running a scrub, my pool was marked DEGRADED, and one of the drives is marked as faulted. Here is the data:

root@truenas[~]# zpool status -v
  pool: nargothrond
 state: DEGRADED
status: One or more devices are faulted in response to persistent errors.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
        repaired.
  scan: scrub repaired 1.72M in 01:00:02 with 0 errors on Sun Oct  9 07:31:24 2022
config:

        NAME                                      STATE     READ WRITE CKSUM
        nargothrond                               DEGRADED     0     0     0
          raidz2-0                                DEGRADED     0     0     0
            948e0e11-b19a-467c-a273-7c8f77943134  ONLINE       0     0     0
            a15c7c1a-3a40-4229-aa87-a711bc65c738  FAULTED     59     9     0  too many errors
            0ade8e27-a01b-48bf-8dab-744624cae90e  ONLINE       0     0     0
            5edd8b59-3128-463f-83db-f443fd0cbc31  ONLINE       0     0     0
            5984f284-37f4-4693-9911-d0c5c0b122a0  ONLINE       0     0     0
            157cd5f5-8e26-4982-a861-cec077fccbc1  ONLINE       0     0     0

errors: No known data errors

Now, I am a noob, so I did not know what to do with this information except run an extended SMART test through the web GUI. I did that, and I could have sworn a got a failure result after many hours. But today, after I have started learning about smartctl, the results show passing, and I don’t know why SMART would show failing one day and passing the next.

I am now re-running an extended test on the drive to get the CLI data, so you will see that here:

root@truenas[~]# smartctl -a /dev/sdf        
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.10.142+truenas] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Seagate IronWolf
Device Model:     ST4000VN008-2DR166
Serial Number:    WDH222B8
LU WWN Device Id: 5 000c50 0a84cb267
Firmware Version: SC60
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5980 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 T13/2161-D revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Mon Oct 10 19:26:17 2022 CDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      ( 249) Self-test routine in progress...
                                        90% of test remaining.
Total time to complete Offline 
data collection:                (  591) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 640) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x50bd) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   083   064   044    Pre-fail  Always       -       180718406
  3 Spin_Up_Time            0x0003   093   093   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       69
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   081   060   045    Pre-fail  Always       -       126477149
  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       1893 (122 189 0)
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       69
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       17180131332
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   070   054   040    Old_age   Always       -       30 (Min/Max 27/30)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       10
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       99
194 Temperature_Celsius     0x0022   030   046   000    Old_age   Always       -       30 (0 16 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       9
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       1891h+18m+26.042s
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       34633819266
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       33357568627

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Self-test routine in progress 90%      1893         -
# 2  Extended offline    Completed without error       00%      1865         -
# 3  Extended offline    Interrupted (host reset)      00%      1851         -
# 4  Extended offline    Completed without error       00%      1695         -
# 5  Extended offline    Completed without error       00%      1528         -
# 6  Extended offline    Completed without error       00%      1359         -
# 7  Extended offline    Completed without error       00%      1191         -
# 8  Extended offline    Completed without error       00%      1023         -
# 9  Extended offline    Completed without error       00%       946         -
#10  Extended offline    Completed without error       00%       819         -
#11  Extended offline    Completed without error       00%       650         -
#12  Extended offline    Completed without error       00%       525         -
#13  Extended offline    Interrupted (host reset)      00%       483         -
#14  Extended offline    Interrupted (host reset)      00%       481         -
#15  Extended offline    Completed without error       00%       415         -
#16  Extended offline    Interrupted (host reset)      80%       406         -
#17  Extended offline    Completed without error       00%       306         -
#18  Extended offline    Interrupted (host reset)      60%       218         -
#19  Extended offline    Completed without error       00%       177         -
#20  Extended offline    Completed without error       00%        79         -
#21  Conveyance offline  Completed without error       00%        13         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

As you can see, I have a lot of “Interrupted (host reset)” failures. I checked the other drives, and I saw similar SMART failures, except for the most recent one at the top of this faulted one.

There was a time when I was configuring NUT and testing that TrueNAS would power down correctly, but I don’t remember ever shutting down the NAS in an ungraceful manner. I am not sure what these errors mean.

So now I am waiting for another extended to test to complete around 5am tomorrow. I don’t have any spare drives at the moment, but I just bought two more 4TB Iron Wolf drives on eBay and they are coming within the next few days along with some additional drive caddies.

Should I consider the drive faulted? What would you do in this situation? Do you have any advice for me?

redocbew · October 11, 2022, 12:51am

Yeah you might as well swap in one of your replacements and check the pool status after that. It’s pretty easy, and there’s lots of guides online about it. It’s not a bad idea to do a backup of anything important you’ve got on the pool just in case, but the good thing is it’s a RAIDZ-2, so you’ve still got some redundancy even with one drive down.

Trooper_ish · October 11, 2022, 12:58am

Okay, the array is a raidz2, so no need to panic and do drastic actions.
I suggest taking it easy on the array for a bit till you sorted it, but it should work regardless, with one drive dead/dying

There are only errors showing with one drive at the moment, and you have radz2, And spare/new replacement drives.
So you are in a great position to test.

There could be a hardware errors, for example error with the drive, the connector, the cable, the power supply, the ram and all sorts. Cables can be wiggeld to ensure not come loose, perhaps switch the cable from the failing drive with one from a good drive, ZFS should not care.

Personally, if I have no reason to actually suspect a drive is bad, I would first do a zpool clear, to make it pretend all is well, and then initiate a scrub.

This could make a hardware problem worse, so put a little time aside to watch it, and be prepared to stop the scrub if errors crop back up.

If no errors, you have a spare drive, and it does not hurt to pro-actively replace the suspect drive regardless, while keeping the bad drive in place, and do a zpool replace of the bad for a good drive, letting the system smartly copy across.

This could take Hours, and would not help if the issue is not a disk, but if the issue Is a disk, and another disk goes down then the situation is a bit more tense.

If error crop up on other drive, it could be a deeper issue than a single drive, so just keep an eye out is what I would say to begin with.

redocbew · October 11, 2022, 1:04am

I have had single drives show up bad like that because of a bad backplane when the drive its self was fine.

Wouldn’t hurt to add the second new drive as a hot spare either if there’s room in the case for it.

DozerD42 · October 11, 2022, 1:14am

I switched from a smaller case with a backplane, and I was using a couple SFF-8087 to four SATA connector cables to connect the backplane to the motherboard. In the Define 7 case, there is no backplane, so I am using individual SATA to SATA cables. I am using the same motherboard I was using in the old case.

I have a motherboard with four SATA connectors, and I have an M.2 to five SATA port adapter. The M.2 to SATA adapter is in a PCIE version 4 capable slot. Could the adapter be suspect?

I am only using two SATA ports on the M.2 to SATA adapter at the moment. The boot SSD is connected to a USB 3 header.

I am currently using all the SATA cables I own in this build, so I don’t have any more to cables to swap in or out. I just ordered five more, so maybe I should try swapping the SATA cable out?

Trooper_ish · October 11, 2022, 1:23am

I would try lowest common denominator.
To test for a bad cable, switching two drives’ cables around, and a zpool clear, will show errors on the other drive. If error re-appear on the same drive, then that’s more evidence of the drive.

If the actual adaptor was faulty, then both drives attached should show errors.

It’s a simple thing one can try without buying anything new.

ZFS does not really care which drive is attached to which adapter, so powering down and simply switching one of the cables from the adapter, with one from the motherboard would further help on that, just in case.

I gotta hit the hay, so gonna bail, but I’d say give the cable a shot, and ease up on writing to the pool for now, until you confirmed the drive giving errors, then do a zpool replace.
for peace of mind, would say wait till the other cables come in, so you can have both the suspect bad drive, and the new replacement both plugged in at the same time, if you got other cables on order anyway, and spare drives.
Sorry to bail, but it’s 02:30. I should have let others step in like cookie

wayland · October 11, 2022, 11:11am

Drive is probably OK. Probably a cabling problem. Either the drive is not getting good power or the data cable is not on properly.

Check cables, clear the error and do a scrub.

Drives should be 45C or less. If they get hot they create errors whilst hot. They’re OK again when cooled down.

jode · October 11, 2022, 1:06pm

This just means that the smart tests were interrupted by the system. You can safely ignore these.

Are you sure you’re looking at the correct drive? Look for a drive that has values different from 0 in the following lines:

187 Reported_Uncorrect
197 Current_Pending_Sector
198 Offline_Uncorrectable

If you find one that’s the one you want to replace.

DozerD42 · October 17, 2022, 8:41pm

Hi there,

I just want everyone to know that I disconnected and reconnected all the SATA cables, both at the motherboard and at the disk.

I must have swapped the cables too, because all the errors were suddenly cleared. I never actually ran a “zpool status clear”.

I appreciate all the help. I am running some checks on the new drives and I will probably keep them around as cold spares.

DozerD42 · October 17, 2022, 8:53pm

Thank you for this information! Should I be looking at the RAW_VALUE column, the VALUE column, or some other column?

I did an lsblk and ran a smartctl -a on each disk from /dev/sda to /dev/sdh. There are two boot drives in that list because I have a mirrored boot pool on SSDs. For the HDDs, the VALUE field for all of them was 100, and the RAW_VALUE for all of them was 0. That is for each of the three rows you named, 187, 197, and 198.

jode · October 18, 2022, 12:24pm

The RAW_VALUE column is of interest (yes, I find the column names confusing as well).

The results of you tests mean that the disks don’t report sector level errors.

I compared to some of my hard drives. You have an excessively high “1 Raw_Read_Error_Rate” in the above. My drives show a 0 for raw data after 10x Power_On_Hours.

Check the physical connections to this drive, and check the smartctl output for the other drives as well.

DozerD42 · October 18, 2022, 12:50pm

So I ran smartctl -a /dev/sdx where x is in {a,h} and I recorded the Raw_Read_Error_Rate below.

229224073
229574160
229461904
169108377 ← This is the original drive that was faulted
0 (boot pool SSD)
0 (boot pool SSD)
229456890
229587426

These read error rates do seem to be ridiculously large, but I don’t know what I should be expecting.

jode · October 18, 2022, 1:39pm

I did a quick run through my garage of HDs and found that only the Seagate drives have high values.
My WD drives have single or double digit values, most have a raw value of 0.
The Seagate drives I am referring to have had other documented issues and are being phased out of my garage…

Well, considering our small sample size it seems high values seem to be the norm for Seagate HDs. The noticeable item from your data is that the degraded drive had a significantly lower value than the other drives. Coincidence? Or indicator for some issue? I cannot say.

I’d say do a quick check across the other data points and look for deviations. But mostly out of curiosity.

At this point we pretty much exhausted our access to factual data as indicator for the errors observed.
I’d call it a day and replace the drive.

Trooper_ish · October 18, 2022, 1:47pm

Or not following industry standards when the control chip reports a value.

I would say it is more likely they are reporting the wrong info on that column, set by their firmware …

Seagate uses the general SMART Status, pass or fail. The individual attributes and threshold values are proprietary and we do not offer a utility that will read out the values.

gysi · October 18, 2022, 2:03pm

I found some info how to extract the correct value:

DozerD42 · October 18, 2022, 4:44pm

From that post, it looks like I need to use a special command to read each of the fields on Seagate IronWolf drives.

Something like this?

smartctl -a -v <field>,raw48:54 <drive>

gysi · October 18, 2022, 5:37pm

yes

smartctl -a -v 1,raw48:54 -v 7,raw48:54

1: Raw_Read_Error_Rate
7: Seek_Error_Rate

Looks like seagate is encoding the values differently

DozerD42 · October 18, 2022, 11:14pm

Well, look at this! The read errors are zero for the drive that was flagged as faulted:

root@truenas[~]# smartctl -a -v 1,raw48:54 -v 7,raw48:54
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.10.142+truenas] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

ERROR: smartctl requires a device name as the final command-line argument.


Use smartctl -h to get a usage summary

root@truenas[~]# smartctl -a -v 1,raw48:54 -v 7,raw48:54 /dev/sdd
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.10.142+truenas] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Seagate IronWolf
Device Model:     ST4000VN008-2DR166
Serial Number:    WDH222B8
LU WWN Device Id: 5 000c50 0a84cb267
Firmware Version: SC60
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5980 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 T13/2161-D revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Oct 18 17:59:44 2022 CDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                (  591) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 640) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x50bd) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   082   064   044    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0003   093   093   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       77
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   081   060   045    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       2082 (218 123 0)
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       77
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       17180131332
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   075   054   040    Old_age   Always       -       25 (Min/Max 24/28)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       10
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       107
194 Temperature_Celsius     0x0022   025   046   000    Old_age   Always       -       25 (0 16 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       9
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       2079h+53m+28.879s
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       34704581861
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       34096617048

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Conveyance offline  Completed without error       00%      2038         -
# 2  Extended offline    Completed without error       00%      2024         -
# 3  Extended offline    Completed without error       00%      1949         -
# 4  Extended offline    Completed without error       00%      1900         -
# 5  Extended offline    Completed without error       00%      1865         -
# 6  Extended offline    Interrupted (host reset)      00%      1851         -
# 7  Extended offline    Completed without error       00%      1695         -
# 8  Extended offline    Completed without error       00%      1528         -
# 9  Extended offline    Completed without error       00%      1359         -
#10  Extended offline    Completed without error       00%      1191         -
#11  Extended offline    Completed without error       00%      1023         -
#12  Extended offline    Completed without error       00%       946         -
#13  Extended offline    Completed without error       00%       819         -
#14  Extended offline    Completed without error       00%       650         -
#15  Extended offline    Completed without error       00%       525         -
#16  Extended offline    Interrupted (host reset)      00%       483         -
#17  Extended offline    Interrupted (host reset)      00%       481         -
#18  Extended offline    Completed without error       00%       415         -
#19  Extended offline    Interrupted (host reset)      80%       406         -
#20  Extended offline    Completed without error       00%       306         -
#21  Extended offline    Interrupted (host reset)      60%       218         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

I went ahead and checked all the rest of the drives. For all the following values, all of the drives reported zero:
1: Raw_Read_Error_Rate
187 Reported_Uncorrect
197 Current_Pending_Sector
198 Offline_Uncorrectable

For all the drives except for /dev/sdc, the “7: Seek_Error_Rate” reported zero. The /dev/sdc drive reported 1 for seek error rate:

root@truenas[~]# smartctl -a -v 1,raw48:54 -v 7,raw48:54 /dev/sdc
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.10.142+truenas] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Seagate IronWolf
Device Model:     ST4000VN008-2DR166
Serial Number:    ZDH2Z2C9
LU WWN Device Id: 5 000c50 0a5ff5ed1
Firmware Version: SC60
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5980 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 T13/2161-D revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Oct 18 18:09:28 2022 CDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                (  581) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 613) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x50bd) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   084   064   044    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0003   093   093   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       77
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   084   060   045    Pre-fail  Always       -       1
  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       2082 (110 207 0)
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       77
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   074   053   040    Old_age   Always       -       26 (Min/Max 25/29)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       9
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       107
194 Temperature_Celsius     0x0022   026   047   000    Old_age   Always       -       26 (0 17 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       2080h+12m+30.277s
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       34673086826
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       34841698185

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%      2024         -
# 2  Extended offline    Completed without error       00%      1859         -
# 3  Extended offline    Completed without error       00%      1695         -
# 4  Extended offline    Completed without error       00%      1527         -
# 5  Extended offline    Completed without error       00%      1359         -
# 6  Extended offline    Completed without error       00%      1191         -
# 7  Extended offline    Completed without error       00%      1023         -
# 8  Extended offline    Completed without error       00%       946         -
# 9  Extended offline    Completed without error       00%       818         -
#10  Extended offline    Completed without error       00%       650         -
#11  Extended offline    Completed without error       00%       525         -
#12  Extended offline    Interrupted (host reset)      00%       483         -
#13  Extended offline    Interrupted (host reset)      00%       481         -
#14  Extended offline    Completed without error       00%       415         -
#15  Extended offline    Interrupted (host reset)      80%       406         -
#16  Extended offline    Completed without error       00%       306         -
#17  Extended offline    Interrupted (host reset)      60%       218         -
#18  Extended offline    Completed without error       00%       176         -
#19  Extended offline    Completed without error       00%        78         -
#20  Conveyance offline  Completed without error       00%        13         -
#21  Short offline       Completed without error       00%        13         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Should I be concerned about 1 for seek error rate? I am not sure what this means exactly. Does this mean that the magnetic head moved to the wrong bit once?

system · July 19, 2023, 5:15pm

This topic was automatically closed 273 days after the last reply. New replies are no longer allowed.