Hi,
I have debian Squeeze installed on my machine. Computer is deployed with
hard drive MAXTOR STM3320613AS. Recently, smartd have printed to logs
the following messages:
Feb 4 20:53:18 zsm-debian smartd[1868]: Device: /dev/sda [SAT], 3
Currently unreadable (pending) sectors
Feb 4 20:53:18 zsm-debian smartd[1868]: Device: /dev/sda [SAT], 3
Offline uncorrectable sectors
Feb 4 20:53:18 zsm-debian smartd[1868]: Device: /dev/sda [SAT], SMART
Usage Attribute: 190 Airflow_Temperature_Cel changed from 68 to 69
Feb 4 20:53:18 zsm-debian smartd[1868]: Device: /dev/sda [SAT], SMART
Usage Attribute: 194 Temperature_Celsius changed from 32 to 31
After that I ran short selftests on given device, but they haven't found
any error:
# smartctl -l selftest /dev/sda
smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
=== START OF READ SMART DATA SECTION ===
Raw_Read_Error_Rate, Seek_Error_Rate and Reallocated_Sector_Ct - See
more at:
http://www.dlaube.com/2010/04/how-to-determine-if-a-sata-drive-is-failing/#sthash.MbF88UnI.dpuf
Raw_Read_Error_Rate, Seek_Error_Rate and Reallocated_Sector_Ct - See
more at:
http://www.dlaube.com/2010/04/how-to-determine-if-a-sata-drive-is-failing/#sthash.MbF88UnI.dpuf
SMART Self-test log structure revision number 1
Raw_Read_Error_Rate, Seek_Error_Rate and Reallocated_Sector_Ct - See
more at:
http://www.dlaube.com/2010/04/how-to-determine-if-a-sata-drive-is-failing/#sthash.MbF88UnI.dpuf
Num Test_Description Status Remaining
LifeTime(hours) LBA_of_first_error
# 2 Short offline Completed without error 00% 6285 -
# 3 Short offline Completed without error 00% 6285 -
Parameters of given disk are as follows:
# smartctl -A /dev/sda
smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED
WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 117 099 006 Pre-fail
Always - 145320438
3 Spin_Up_Time 0x0003 100 100 000 Pre-fail
Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age
Always - 505
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail
Always - 0
7 Seek_Error_Rate 0x000f 077 060 030 Pre-fail
Always - 55435033
9 Power_On_Hours 0x0032 093 093 000 Old_age
Always - 6286
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail
Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age
Always - 507
184 End-to-End_Error 0x0032 100 100 099 Old_age
Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age
Always - 0
188 Command_Timeout 0x0032 100 098 000 Old_age
Always - 44
189 High_Fly_Writes 0x003a 080 080 000 Old_age
Always - 20
190 Airflow_Temperature_Cel 0x0022 069 056 045 Old_age
Always - 31 (Lifetime Min/Max 31/33)
194 Temperature_Celsius 0x0022 031 044 000 Old_age
Always - 31 (0 15 0 0)
195 Hardware_ECC_Recovered 0x001a 039 032 000 Old_age
Always - 145320438
197 Current_Pending_Sector 0x0012 100 100 000 Old_age
Always - 3
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age
Offline - 3
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age
Always - 0
240 Head_Flying_Hours 0x0000 100 253 000 Old_age
Offline - 146926536235149
241 Total_LBAs_Written 0x0000 100 253 000 Old_age
Offline - 1178970182
242 Total_LBAs_Read 0x0000 100 253 000 Old_age
Offline - 2406483650
where Seek_Error_Rate grows slowly (aprox. by 3 every 2 seconds).
I have read that important pre-failing parameters are
Raw_Read_Error_Rate, Seek_Error_Rate and Reallocated_Sector_Ct but still
I can't decide whether disk is failing or not. Can somebody help,
whether it is needed to replace disk or not? Any help is appreciated...
Regards
Matus Valo
Raw_Read_Error_Rate, Seek_Error_Rate and Reallocated_Sector_Ct - See
more at:
http://www.dlaube.com/2010/04/how-to-determine-if-a-sata-drive-is-failing/#sthash.MbF88UnI.dpuf