Hi All,
I have all my files stored on a central Ubuntu based server with 3 drives
- the OS
- all my data
- local backup
It has been fine for a few years but annoyingly recently when accessing the data through an NFS mount it times out when reading the directory. Remotely logging on to the server if I try to "ls" that directory it takes say 30 mins to do it. Once done, the subsequent "ls" works immediately and also the NFS works correctly again.
I initially thought it was because drive 2 is starting to fail but looking at smartctrl (run long and short tests) and then reading each block with "dd" it seems like there are 2 dodgy blocks but besides that I think it is ok?
smartctl -a gives ========================================== ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 199 199 051 Pre-fail Always - 92225 3 Spin_Up_Time 0x0027 186 171 021 Pre-fail Always - 5683 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 427 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0 9 Power_On_Hours 0x0032 010 010 000 Old_age Always - 66060 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 424
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 226 193 Load_Cycle_Count 0x0032 192 192 000 Old_age Always - 25655 194 Temperature_Celsius 0x0022 119 102 000 Old_age Always - 31 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 1 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0SMART Error Log Version: 1 No Errors Logged
SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 90% 489 1326848392 # 2 Short offline Completed: read failure 90% 489 1326848392 # 3 Conveyance offline Completed without error 00% 0 - # 4 Short offline Completed without error 00% 0 - =============================================
sudo dd if=/dev/sdb1 of=/dev/null bs=64k conv=noerror ==================================================== dd: error reading '/dev/sdb1': Input/output error
43920419+1 records in 43920419+1 records out 2878368583680 bytes (2.9 TB, 2.6 TiB) copied, 24480.1 s, 118 MB/s 45785391+1 records in 45785391+1 records out 3000591388672 bytes (3.0 TB, 2.7 TiB) copied, 26169.8 s, 115 MB/s =================running smartctl on disk 1(the OS) seems clear although having run the "short" test overnight it is stuck at 90%
So I am thinking the drives are not the cause of this issue. Anyone have any ideas?
Thanks
Lee.