SMART is not smart, it thinks a dead drive is still good

I had a dying drive that SMART reported as good right up until it disappeared completely. The parameters did actually look fine, but this drive was causing my system to lock up and misbehave in other ways:

 

=== START OF INFORMATION SECTION ===
Device Model:     WDC WD20EARS-00MVWB0
Serial Number:    WD-WMAZ20139
Firmware Version: 50.0AB50
User Capacity:    2,000,398,934,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Fri Jun  3 09:53:36 2011 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84)    Offline data collection activity
                    was suspended by an interrupting command from host.
                    Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever
                    been run.
Total time to complete Offline
data collection:          (35400) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (   2) minutes.
Extended self-test routine
recommended polling time:      ( 255) minutes.
Conveyance self-test routine
recommended polling time:      (   5) minutes.
SCT capabilities:            (0x3035)    SCT Status supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   191   170   021    Pre-fail  Always       -       5416
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       12
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   091   091   000    Old_age   Always       -       6833
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       11
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       6
193 Load_Cycle_Count        0x0032   042   042   000    Old_age   Always       -       475936
194 Temperature_Celsius     0x0022   117   099   000    Old_age   Always       -       33
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       1
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       1
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       2

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
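
Looking back at the attribute table above, the overall result says PASSED, yet Current_Pending_Sector and Offline_Uncorrectable both have a raw value of 1. A rough Python sketch of a check that ignores the overall verdict and looks at those raw values directly might look like this. The watched attribute IDs, the function name and the regular expression are my own assumptions for illustration, not anything smartmontools provides:

```python
import re

# Attribute IDs whose raw value is expected to stay at zero on a healthy
# drive. This list is an assumption made for this sketch.
WATCHED_IDS = {5, 196, 197, 198}

# Matches one row of the "Vendor Specific SMART Attributes" table:
# ID, NAME, FLAG, VALUE, WORST, THRESH, TYPE, UPDATED, WHEN_FAILED, RAW_VALUE
ATTR_LINE = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]+\s+(\d+)\s+(\d+)\s+(\d+)\s+"
    r"(\S+)\s+(\S+)\s+(\S+)\s+(\d+)"
)


def find_suspect_attributes(smartctl_output):
    """Return (id, name, raw_value) for watched attributes with a nonzero raw value."""
    suspects = []
    for line in smartctl_output.splitlines():
        match = ATTR_LINE.match(line)
        if not match:
            continue  # not an attribute row
        attr_id = int(match.group(1))
        name = match.group(2)
        raw_value = int(match.group(9))
        if attr_id in WATCHED_IDS and raw_value > 0:
            suspects.append((attr_id, name, raw_value))
    return suspects


# A few rows copied from the output above.
SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
193 Load_Cycle_Count        0x0032   042   042   000    Old_age   Always       -       475936
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       1
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       1
"""

if __name__ == "__main__":
    for attr_id, name, raw_value in find_suspect_attributes(SAMPLE):
        print(f"{attr_id} {name}: raw value {raw_value}")
```

In practice the text would come from saved smartctl output rather than a pasted sample. A nonzero count here does not prove the drive is failing, it only flags something the PASSED line hides.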

In the end smartd did catch it, but only just before the drive died:

 

Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, opened
Jun  1 17:47:52 box10 smartd[8529]: Device /dev/sda: using '-d sat' for ATA disk behind SAT layer.
Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, opened
Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, not found in smartd database.
Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, is SMART capable. Adding to "monitor" list.
Jun  1 17:47:53 box10 smartd[8529]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 17:47:53 box10 smartd[8529]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 18:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 18:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 18:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 18:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 19:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 19:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 19:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 19:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 20:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 20:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 20:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 20:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 21:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 21:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 21:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 21:47:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 22:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 22:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 22:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 22:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 23:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 23:17:54 box10 smartd[8546]: Device: /dev/sda, 1 Offline uncorrectable sectors
Jun  1 23:47:53 box10 smartd[8546]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
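
Warnings like the ones above are easy to miss when they sit in syslog repeating every half hour. As a minimal sketch, assuming the log lines have exactly the format shown here, they could be pulled out with a few lines of Python. The function name and the regular expression are hypothetical and not part of smartd:

```python
import re

# Matches the two kinds of smartd sector warnings seen in the log above.
SMARTD_LINE = re.compile(
    r"^(\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+\S+\s+smartd\[\d+\]:\s+"
    r"Device:\s+(\S+),\s+(\d+)\s+"
    r"(Currently unreadable \(pending\)|Offline uncorrectable)\s+sectors"
)


def parse_sector_warnings(log_text):
    """Return (timestamp, device, kind, count) for each smartd sector warning."""
    warnings = []
    for line in log_text.splitlines():
        match = SMARTD_LINE.match(line)
        if match:
            timestamp, device, count, kind = match.groups()
            warnings.append((timestamp, device, kind, int(count)))
    return warnings


# Three lines copied from the log above; the first is not a warning.
SAMPLE_LOG = """\
Jun  1 17:47:52 box10 smartd[8529]: Device: /dev/sda, opened
Jun  1 17:47:53 box10 smartd[8529]: Device: /dev/sda, 1 Currently unreadable (pending) sectors
Jun  1 17:47:53 box10 smartd[8529]: Device: /dev/sda, 1 Offline uncorrectable sectors
"""

if __name__ == "__main__":
    for timestamp, device, kind, count in parse_sector_warnings(SAMPLE_LOG):
        print(f"{timestamp} {device}: {count} {kind}")
```

The result could be fed into whatever alerting is already in place, so the first warning gets noticed instead of the last one.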
 

