SMART is not smart, it thinks a dying drive is still good

I had a dying drive that SMART reported as good right up until it disappeared from the system entirely. The overall health check passed and the parameters all looked fine, yet this drive was causing my system to lock up and showing other bad behavior:

 

=== START OF INFORMATION SECTION ===
Device Model:     WDC WD20EARS-00MVWB0
Serial Number:    WD-WMAZ20139
Firmware Version: 50.0AB50
User Capacity:    2,000,398,934,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Fri Jun  3 09:53:36 2011 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84)    Offline data collection activity
                    was suspended by an interrupting command from host.
                    Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever
                    been run.
Total time to complete Offline
data collection:          (35400) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (   2) minutes.
Extended self-test routine
recommended polling time:      ( 255) minutes.
Conveyance self-test routine
recommended polling time:      (   5) minutes.
SCT capabilities:            (0x3035)    SCT Status supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   191   170   021    Pre-fail  Always       -       5416
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       12
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   091   091   000    Old_age   Always       -       6833
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       11
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       6
193 Load_Cycle_Count        0x0032   042   042   000    Old_age   Always       -       475936
194 Temperature_Celsius     0x0022   117   099   000    Old_age   Always       -       33
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       1
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       1
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       2

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
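
Since the overall result above says PASSED, it can help to look at the raw values yourself instead of trusting that one line. Below is a minimal sketch, assuming you have the attribute table as text (for example from smartctl -A). The list of watched attribute IDs is my own assumption for illustration, not something smartctl defines, and the function names are hypothetical.

```python
import re

# IDs watched in this sketch (an assumption, not a smartctl rule):
# 5 = Reallocated_Sector_Ct, 196 = Reallocated_Event_Count,
# 197 = Current_Pending_Sector, 198 = Offline_Uncorrectable.
WATCHED_IDS = {5, 196, 197, 198}

# A few rows copied from the smartctl output above.
SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
193 Load_Cycle_Count        0x0032   042   042   000    Old_age   Always       -       475936
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       1
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       1
"""

# ID, NAME, FLAG, VALUE, WORST, THRESH, TYPE, UPDATED, WHEN_FAILED, RAW_VALUE
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]+\s+(\d+)\s+(\d+)\s+(\d+)"
    r"\s+\S+\s+\S+\s+\S+\s+(\d+)"
)


def parse_attributes(text):
    """Return one dict per attribute row found in the text."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            attr_id, name, value, worst, thresh, raw = m.groups()
            rows.append({
                "id": int(attr_id),
                "name": name,
                "value": int(value),
                "worst": int(worst),
                "thresh": int(thresh),
                "raw": int(raw),
            })
    return rows


def suspicious(rows):
    """Watched attributes whose raw count is non-zero."""
    return [r for r in rows if r["id"] in WATCHED_IDS and r["raw"] > 0]


if __name__ == "__main__":
    for row in suspicious(parse_attributes(SAMPLE)):
        print(f'{row["id"]:>3} {row["name"]} raw={row["raw"]}')
```

On the rows above this flags Current_Pending_Sector and Offline_Uncorrectable, both with a raw value of 1, even though the normalized VALUE column is still 200 and far above THRESH. A non-zero count here does not prove a drive is about to die, it is only a hint to look closer.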

In the end SMART did catch it, but only just before the drive died:

 

Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, opened
Jun  1 17:47:52 box10 smartd[8529]: Device /dev/sda: using '-d sat' for ATA disk behind SAT layer.
Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, opened
Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, not found in smartd database.
Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, is SMART capable. Adding to "monitor" list.
Jun  1 17:47:53 box10 smartd[8529]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 17:47:53 box10 smartd[8529]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 18:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 18:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 18:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 18:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 19:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 19:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 19:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 19:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 20:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 20:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 20:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 20:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 21:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 21:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 21:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 21:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 22:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 22:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 22:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 22:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 23:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 23:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 23:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
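
The log above repeats the same two warnings every 30 minutes, which makes them easy to skim past. Here is a rough sketch that collapses those smartd lines into one summary per device and warning type. It assumes the syslog format shown above, and the summarize function and its field names are made up for this example.

```python
import re

# A handful of lines copied from the smartd log above.
LOG = """\
Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, opened
Jun  1 17:47:53 box10 smartd[8529]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 17:47:53 box10 smartd[8529]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 18:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 18:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 23:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
"""

# timestamp, host, smartd[pid], device, sector count, warning type
LINE = re.compile(
    r"^(\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+\S+\s+smartd\[\d+\]:\s+"
    r"Device:\s+(\S+),\s+(\d+)\s+"
    r"(Currently unreadable \(pending\)|Offline uncorrectable) sectors"
)


def summarize(text):
    """Group sector warnings by (device, warning type)."""
    summary = {}
    for line in text.splitlines():
        m = LINE.match(line)
        if not m:
            continue  # skip lines that are not sector warnings
        stamp, device, count, kind = m.groups()
        entry = summary.setdefault(
            (device, kind),
            {"first": stamp, "last": stamp, "reports": 0, "max_sectors": 0},
        )
        entry["last"] = stamp
        entry["reports"] += 1
        entry["max_sectors"] = max(entry["max_sectors"], int(count))
    return summary


if __name__ == "__main__":
    for (device, kind), e in summarize(LOG).items():
        print(f'{device} {kind}: {e["reports"]} reports, '
              f'max {e["max_sectors"]} sectors, {e["first"]} to {e["last"]}')
```

A summary like this makes it obvious how long a warning has been repeating and whether the sector count is growing, which is harder to see in the raw log.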
 


