On Wednesday 20 October 2021 19:45:52 Thomas Anderson wrote: > Here are the results, of my smartctl test: > This would be much easier to decode if you turned off wordwrap, and repasted from that terminal screen.
> I am trying to parse them myself, to see if I can learn anything. But, > immediate glance > > and queries did not reveal anything that could help me determine if > the drive is good or not. > > > smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.19.0-18-amd64] (local > build) Copyright (C) 2002-17, Bruce Allen, Christian Franke, > www.smartmontools.org > > === START OF INFORMATION SECTION === > Device Model: ST8000DM004-2CX188 > Serial Number: ZCT1706V > LU WWN Device Id: 5 000c50 0c2b1cd83 > Firmware Version: 0001 > User Capacity: 8,001,563,222,016 bytes [8.00 TB] > Sector Sizes: 512 bytes logical, 4096 bytes physical > Rotation Rate: 5425 rpm > Form Factor: 3.5 inches > Device is: Not in smartctl database [for details use: -P > showall] ATA Version is: ACS-3 T13/2161-D revision 5 > SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s) > Local Time is: Wed Oct 20 14:36:49 2021 CEST > SMART support is: Available - device has SMART capability. > SMART support is: Enabled > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > General SMART Values: > Offline data collection status: (0x00) Offline data collection > activity was never started. > Auto Offline Data Collection: Disabled. > Self-test execution status: ( 0) The previous self-test > routine completed > without error or no self-test has ever > been run. > Total time to complete Offline > data collection: ( 0) seconds. > Offline data collection > capabilities: (0x73) SMART execute Offline immediate. > Auto Offline data collection on/off support. > Suspend Offline collection upon new > command. > No Offline surface scan supported. > Self-test supported. > Conveyance Self-test supported. > Selective Self-test supported. > SMART capabilities: (0x0003) Saves SMART data before > entering power-saving mode. > Supports SMART auto save timer. > Error logging capability: (0x01) Error logging supported. > General Purpose Logging supported. > Short self-test routine > recommended polling time: ( 1) minutes. > Extended self-test routine > recommended polling time: ( 987) minutes. > Conveyance self-test routine > recommended polling time: ( 2) minutes. > SCT capabilities: (0x30a5) SCT Status supported. > SCT Data Table supported. > > SMART Attributes Data Structure revision number: 10 > Vendor Specific SMART Attributes with Thresholds: > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED > WHEN_FAILED RAW_VALUE > 1 Raw_Read_Error_Rate 0x000f 064 051 006 Pre-fail > Always - 7226480 > 3 Spin_Up_Time 0x0003 093 091 000 Pre-fail > Always - 0 > 4 Start_Stop_Count 0x0032 100 100 020 Old_age > Always - 361 > 5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail > Always - 0 none is good > 7 Seek_Error_Rate 0x000f 080 060 045 Pre-fail > Always - 104115990 > 9 Power_On_Hours 0x0032 084 084 000 Old_age > Always - 14558 (99 41 0) relatively young > 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail > Always - 0 > 12 Power_Cycle_Count 0x0032 100 100 020 Old_age > Always - 77 this is the single most dangerous time for a spinning rust drive > 183 Runtime_Bad_Block 0x0032 095 095 000 Old_age > Always - 5 > 184 End-to-End_Error 0x0032 100 100 099 Old_age > Always - 0 > 187 Reported_Uncorrect 0x0032 001 001 000 Old_age > Always - 1334 > 188 Command_Timeout 0x0032 100 100 000 Old_age > Always - 0 > 189 High_Fly_Writes 0x003a 100 100 000 Old_age > Always - 0 > 190 Airflow_Temperature_Cel 0x0022 070 052 040 Old_age > Always - 30 (Min/Max 28/33) operating temps good > 191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age > Always - 0 > 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age > Always - 25 should normally match power cycle? > 193 Load_Cycle_Count 0x0032 099 099 000 Old_age > Always - 3362 > 194 Temperature_Celsius 0x0022 030 048 000 Old_age > Always - 30 (0 16 0 0 0) > 195 Hardware_ECC_Recovered 0x001a 069 064 000 Old_age > Always - 7226480 > 197 Current_Pending_Sector 0x0012 100 100 000 Old_age > Always - 8 > 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age > Offline - 8 > 199 UDMA_CRC_Error_Count 0x003e 200 198 000 Old_age > Always - 15 > 240 Head_Flying_Hours 0x0000 100 253 000 Old_age > Offline - 2368 (67 0 0) > 241 Total_LBAs_Written 0x0000 100 253 000 Old_age > Offline - 54984038154 > 242 Total_LBAs_Read 0x0000 100 253 000 Old_age > Offline - 185155335586 > > SMART Error Log Version: 1 > ATA Error Count: 1334 (device log contains only the most recent five > errors) CR = Command Register [HEX] > FR = Features Register [HEX] > SC = Sector Count Register [HEX] > SN = Sector Number Register [HEX] > CL = Cylinder Low Register [HEX] > CH = Cylinder High Register [HEX] > DH = Device/Head Register [HEX] > DC = Device Command Register [HEX] > ER = Error register [HEX] > ST = Status register [HEX] > Powered_Up_Time is measured from power on, and printed as > DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, > SS=sec, and sss=millisec. It "wraps" after 49.710 days. > > Error 1334 occurred at disk power-on lifetime: 10525 hours (438 days + > 13 hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 53 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > 60 00 08 ff ff ff 4f 00 15d+00:06:46.256 READ FPDMA QUEUED > ef 10 02 00 00 00 a0 00 15d+00:06:46.247 SET FEATURES [Enable SATA > feature] > 27 00 00 00 00 00 e0 00 15d+00:06:46.220 READ NATIVE MAX ADDRESS > EXT [OBS-ACS-3] > ec 00 00 00 00 00 a0 00 15d+00:06:46.217 IDENTIFY DEVICE > ef 03 46 00 00 00 a0 00 15d+00:06:46.205 SET FEATURES [Set > transfer mode] > > Error 1333 occurred at disk power-on lifetime: 10525 hours (438 days + > 13 hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 53 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > 60 00 08 ff ff ff 4f 00 15d+00:06:46.090 READ FPDMA QUEUED > ef 10 02 00 00 00 a0 00 15d+00:06:46.081 SET FEATURES [Enable SATA > feature] > 27 00 00 00 00 00 e0 00 15d+00:06:46.054 READ NATIVE MAX ADDRESS > EXT [OBS-ACS-3] > ec 00 00 00 00 00 a0 00 15d+00:06:46.051 IDENTIFY DEVICE > ef 03 46 00 00 00 a0 00 15d+00:06:46.039 SET FEATURES [Set > transfer mode] > > Error 1332 occurred at disk power-on lifetime: 10525 hours (438 days + > 13 hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 53 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > 60 00 08 ff ff ff 4f 00 15d+00:06:45.922 READ FPDMA QUEUED > ef 10 02 00 00 00 a0 00 15d+00:06:45.912 SET FEATURES [Enable SATA > feature] > 27 00 00 00 00 00 e0 00 15d+00:06:45.886 READ NATIVE MAX ADDRESS > EXT [OBS-ACS-3] > ec 00 00 00 00 00 a0 00 15d+00:06:45.883 IDENTIFY DEVICE > ef 03 46 00 00 00 a0 00 15d+00:06:45.871 SET FEATURES [Set > transfer mode] > > Error 1331 occurred at disk power-on lifetime: 10525 hours (438 days + > 13 hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 53 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > 60 00 08 ff ff ff 4f 00 15d+00:06:45.692 READ FPDMA QUEUED > ef 10 02 00 00 00 a0 00 15d+00:06:45.683 SET FEATURES [Enable SATA > feature] > 27 00 00 00 00 00 e0 00 15d+00:06:45.656 READ NATIVE MAX ADDRESS > EXT [OBS-ACS-3] > ec 00 00 00 00 00 a0 00 15d+00:06:45.654 IDENTIFY DEVICE > ef 03 46 00 00 00 a0 00 15d+00:06:45.641 SET FEATURES [Set > transfer mode] > > Error 1330 occurred at disk power-on lifetime: 10525 hours (438 days + > 13 hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 53 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > 60 00 08 ff ff ff 4f 00 15d+00:06:45.526 READ FPDMA QUEUED > ef 10 02 00 00 00 a0 00 15d+00:06:45.517 SET FEATURES [Enable SATA > feature] > 27 00 00 00 00 00 e0 00 15d+00:06:45.491 READ NATIVE MAX ADDRESS > EXT [OBS-ACS-3] > ec 00 00 00 00 00 a0 00 15d+00:06:45.488 IDENTIFY DEVICE > ef 03 46 00 00 00 a0 00 15d+00:06:45.475 SET FEATURES [Set > transfer mode] > > SMART Self-test log structure revision number 1 > Num Test_Description Status Remaining > LifeTime(hours) LBA_of_first_error > # 1 Extended offline Completed without error 00% > 14551 - # 2 Extended offline Completed without error > 00% 14530 - # 3 Conveyance offline Completed without > error 00% 10165 - # 4 Conveyance offline Completed > without error 00% 10165 - # 5 Short offline > Completed without error 00% 10163 - # 6 Extended > offline Completed without error 00% 9997 - # 7 Short > offline Completed without error 00% 0 - > > SMART Selective self-test log data structure revision number 1 > SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS > 1 0 0 Not_testing > 2 0 0 Not_testing > 3 0 0 Not_testing > 4 0 0 Not_testing > 5 0 0 Not_testing > Selective self-test flags (0x0): > After scanning selected spans, do NOT read-scan remainder of disk. > If Selective self-test is pending on power-up, resume after 0 minute > delay. What color is the sata data cable? if "hot red" aka magenta, bump it with a pencil and note if the syslog log blows up with reset errors. If it does, replace the data cable with any other color but that hot red. That particular plastic dye leads to cable failures and has been a known part of the electronics jungle since the 1970's. > > On 10/18/21 6:52 PM, Reco wrote: > > Hi. > > > > On Mon, Oct 18, 2021 at 06:25:19PM +0200, Thomas Anderson wrote: > >> I have been having problems with a drive (non-SSD) for a while now, > >> but I would like to "identify" the problem specifically, so that I > >> may perhaps be able to get the drive replaced. > > > > Assuming it's SATA/IDE drive, all you need to do is: > > > > apt install smartmontools > > smartctl -t long <ur_drive_here> > > # wait for the test to finish > > smartctl -a <ur_drive_here> > > > > Please post the output of the last command. > > > > Reco Cheers, Gene Heskett. -- "There are four boxes to be used in defense of liberty: soap, ballot, jury, and ammo. Please use in that order." -Ed Howdershelt (Author, 1940) If we desire respect for the law, we must first make the law respectable. - Louis D. Brandeis Genes Web page <http://geneslinuxbox.net:6309/gene>