Technicus stultissimus: SMART

Showing posts with label SMART. Show all posts

Friday, February 1, 2013

smartctl

Analyse einer fehlerhaften Festplatte mit smartctl
Mit smartctl können Sie unter Linux die SMART Werte von Festplatten auslesen. In diesem Beispiel zeigen wir die Analyse einer defekten Festplatte. Die Festplatte in diesem Beispiel kann mehrere Sektoren nicht mehr lesen und ist somit defekt. Sie muss damit ausgetauscht werden.

===========================
S.M.A.R.T. (Self-Monitoring, Analysis and Reporting Technology; often written as SMART) is a monitoring system for computer hard disk drives to detect and report on various indicators of reliability, in the hope of anticipating failures.
When a failure is anticipated by S.M.A.R.T., the user may choose to replace the drive to avoid unexpected outage and data loss. The manufacturer may be able to use the S.M.A.R.T. data to discover where faults lie and prevent them from recurring in future drive designs.

Background

The purpose of S.M.A.R.T. is to warn a user of impending drive failure while there is still time to take action, such as copying the data to a replacement device.
Hard disk failures fall into one of two basic classes:

Predictable failures: These failures result from slow processes such as mechanical wear and gradual degradation of storage surfaces. Monitoring can determine when such failures are becoming more likely.
Unpredictable failures: These failures happen suddenly and without warning. They range from electronic components becoming defective to a sudden mechanical failure (perhaps due to improper handling).

Mechanical failures account for about 60% of all drive failures.^[1] While the eventual failure may be catastrophic, most mechanical failures result from gradual wear and there are usually certain indications that failure is imminent. These may include increased heat output, increased noise level, problems with reading and writing of data, or an increase in the number of damaged disk sectors.
Work at Google on over 100,000 drives found correlations between certain S.M.A.R.T. information and actual failure rates. In the 60 days following the first off-line scan uncorrectable error on a drive (SMART attribute 0xC6 or 198), the drive was, on average, 39 times more likely to fail than it would have been if no such error occurred. First errors in reallocations, offline reallocations (SMART attributes 0xC4 and 0x05 or 196 and 5) and probational counts (SMART attribute 0xC5 or 197) were also strongly correlated to higher probabilities of failure. Conversely, little correlation was found for increased temperature and no correlation for usage level. However, a large proportion (56%) of the failed drives failed without giving any S.M.A.R.T. warnings at all, meaning that S.M.A.R.T. data alone was of limited usefulness in anticipating failures.^[2]
PCTechGuide's page on S.M.A.R.T. (2003)^[3] comments that the technology has gone through three phases:

"In its original incarnation SMART provided failure prediction by monitoring certain online hard drive activities. A subsequent version improved failure prediction by adding an automatic off-line read scan to monitor additional operations. The latest "SMART" technology not only monitors hard drive activities but adds failure prevention by attempting to detect and repair sector errors. Also, while earlier versions of the technology only monitored hard drive activity for data that was retrieved by the operating system, this latest SMART tests all data and all sectors of a drive by using "off-line data collection" to confirm the drive's health during periods of inactivity."

Saturday, June 23, 2012

Acronis Drive Monitor

Source
Determine the health of your hard disks
There are three unavoidable certainties in life: Death, Taxes and Hard Disk Drive Failures.
Acronis does not have a solution for the first, we leave that to a higher authority; however, we can make sure that your financial records, photos, videos and other items of sentimental or monetary value are protected so that you can cherish the memories and pay your taxes on time!
Computers have become a necessary part of our daily lives and we have become dependent on instant access to the data stored on the hard drives inside our PCs. Just about everyone has either experienced a disk failure or knows someone that did. It's unavoidable as all disks eventually fail. This is why Acronis developed Acronis Drive Monitor as a free tool for the community at large.
Early warning to save your data
Acronis Drive Monitor can help predict when a hard drive is about to fail, giving you the chance to backup your data before disaster strikes. When Acronis Drive Monitor identifies a problem, it generates an email or onscreen alert describing the specific finding. It offers a simple and easy to understand explanation of the alert guiding you to the steps you need to take to remedy the issue. A color coded summary provides an overview of the disk's health at a glance and weekly reports summarize the findings.
Integrates with Acronis Home products for greater data safety
Users of the latest versions of Acronis True Image Home 2012 and Acronis Backup and Security 2011 PC backup and recovery software can take advantage of Acronis Drive Monitor's ability to trigger an immediate, automated backup if any disk shows signs of imminent failure.

* Click here to learn more about how Acronis Drive Monitor integrates with the latest Acronis Home and Business products.

Friday, August 5, 2011

SMART and RAID

What is S.M.A.R.T. and how can we use it to avoid data disaster?

Comparison of S.M.A.R.T. tools
SMART errors are not always a SURE sign of hard disk failure. A hard disk can sometimes run for some time with a SMART error, but replacement is still VERY highly recommended.
You can choose to turn SMART off in most BIOS's today, although it's not typically a good idea.
All the big HDD makers have a utility to look at hard drive S.M.A.R.T. status. Many SMART errors are warranted. The drive doesn't actually have to stop working completely in order to get a warranty replacement, but check the warranty to make sure your SMART error is covered by the warranty.
Western Digital has Datalife tools Hitachi Drive Fitness Test:
www.hitachigst.com/hdd/technolo/dft/dftnew.htm
www.cpuid.com/
ROG CPU-Z 1.57.1 at this address.(ASUS)
http://www.bios-info.de/4p92x846/befs.htm
Ashampoo HDD Control 2

SpeedFan
SpeedFan 4.44 in Windows 7
Original author(s)	Alfredo Milani Comparetti^[1]
Developer(s)	Alfredo Milani Comparetti
Initial release	?
Stable release	4.44 (March 17, 2011; 4 months ago) _[+/−]
Preview release	_[+/−]
Written in	Delphi, C++, C
Operating system	Windows 95 and later^[1]
Available in	Multilanguage
Type	System hardware monitor
License	Freeware^[1]
Website	almico.com/speedfan.php

Version 4.38 added full support for AMCC/3ware SATA and RAID controllers.

Wednesday, July 27, 2011

SMART technology

Self-Monitoring, Analysis, and Reporting Technology, or S.M.A.R.T., is a monitoring system for computer hard disks to detect and report on various indicators of reliability, and assist in the anticipation of failures.

S.M.A.R.T Background

PC techguide's page on S.M.A.R.T. (2003) comments that the technology has gone through three phases: "In its original incarnation SMART provided failure prediction by monitoring certain online hard drive activities. A subsequent version improved failure prediction by adding an automatic off-line read scan to monitor additional operations. The latest SMART III technology not only monitors hard drive activities but adds failure prevention by attempting to detect and repair sector errors. Also, whilst earlier versions of the technology only monitored hard drive activity for data that was retrieved by the operating system, SMART III tests all data and all sectors of a drive by using off-line data collection to confirm the drive's health during periods of inactivity."

S.M.A.R.T History and predecessors

S.M.A.R.T Information

The inability to read some sectors is not always an indication that the drive is about to fail; one way that unreadable sectors can be created even when the drive is functioning within specification is if the power fails while the drive is writing. Even if the physical disk is damaged in one location so that a sector is unreadable, the disk may be able to use spare space to replace the bad area so that the sector can be overwritten.

A drive supporting SMART may optionally support a number of self-test or maintenance routines, and the results of the tests are kept in the self-test log. The self-test routines can be efficiently used to detect any unreadable sectors on the disk so that they may be restored from backup (for example, from other disks in a RAID). This helps to reduce the risk of a situation where one sector on a disk becomes unreadable, then the backup is damaged, and the data is lost forever.

S.M.A.R.T Standards and Implementation

Some S.M.A.R.T.-enabled motherboards and related software may not communicate with certain S.M.A.R.T.-capable drives, depending on the type of interface. Few external drives connected via USB and Firewire correctly send S.M.A.R.T. data over those interfaces. With so many ways to connect a hard drive (e.g. SCSI, Fibre Channel, ATA, SATA, SAS, SSA and SSD) it's difficult to predict whether S.M.A.R.T. reports will function correctly.

Even on hard drives and interfaces that support it, S.M.A.R.T. data may not be reported correctly to the computer's operating system. Some disk controllers can duplicate all write operations on a secondary "backup" drive in real-time. This feature is known as "RAID mirroring". However, many programs which are designed to analyze changes in drive behavior and relay S.M.A.R.T. alerts to the operator do not function when a computer system is configured for RAID support, usually because under normal RAID array operational conditions, the computer may not be permitted to 'see' (or directly access) individual physical drives, but only logical volumes, by the RAID array subsystem.

S.M.A.R.T Attributes
Spinup retry count
Count of retry of spin start attempts. This attribute stores a total count of the spin start attempts to reach the fully operational speed (under the condition that the first attempt was unsuccessful). An increase of this attribute value is a sign of problems in the hard disk mechanical subsystem. (Better: RAW Value LESS)

Note that the attribute values are always mapped to the range of 1 to 253 in a way that means higher values are better. For example, the "Reallocated Sectors Count" attribute value decreases as the number of reallocated sectors increases. In this case, the attribute's raw value will often indicate the actual number of sectors that were reallocated.

Known S.M.A.R.T. attributes

Legend
	Higher value is better			Lower value is better
Critical				Potential indicators of imminent electromechanical failure
ID	Hex	Attribute name	Better	Description
01	01	Read Error Rate		Indicates the rate of hardware read errors that occurred when reading data from a disk surface. Any number indicates a problem with either disk surface or read/write heads.
02	02	Throughput Performance		Overall (general) throughput performance of a hard disk drive. If the value of this attribute is decreasing there is a high probability that there is a problem with the disk.
03	03	Spin-Up Time		Average time of spindle spin up (from zero RPM to fully operational).
04	04	Start/Stop Count		A tally of spindle start/stop cycles.
05	05	Reallocated Sectors Count		Count of reallocated sectors. When the hard drive finds a read/write/verification error, it marks this sector as "reallocated" and transfers data to a special reserved area (spare area). This process is also known as remapping and "reallocated" sectors are called remaps. This is why, on modern hard disks, "bad blocks" cannot be found while testing the surface — all bad blocks are hidden in reallocated sectors. However, the more sectors that are reallocated, the more read/write speed will decrease.
06	06	Read Channel Margin		Margin of a channel while reading data. The function of this attribute is not specified.
07	07	Seek Error Rate		Rate of seek errors of the magnetic heads. If there is a failure in the mechanical positioning system, a servo damage or a thermal widening of the hard disk, seek errors arise. More seek errors indicates a worsening condition of a disk surface and the mechanical subsystem.
08	08	Seek Time Performance		Average performance of seek operations of the magnetic heads. If this attribute is decreasing, it is a sign of problems in the mechanical subsystem.
09	09	Power-On Hours (POH)		Count of hours in power-on state. The raw value of this attribute shows total count of hours (or minutes, or seconds, depending on manufacturer) in power-on state.
10	0A	Spin Retry Count		Count of retry of spin start attempts. This attribute stores a total count of the spin start attempts to reach the fully operational speed under the condition that the first attempt was unsuccessful). An increase of this attribute value is a sign of problems in the hard disk mechanical subsystem.
11	0B	Recalibration Retries		This attribute indicates the number of times recalibration was requested (under the condition that the first attempt was unsuccessful). A decrease of this attribute value is a sign of problems in the hard disk mechanical subsystem.
12	0C	Device Power Cycle Count		This attribute indicates the count of full hard disk power on/off cycles.
13	0D	Soft Read Error Rate		Uncorrected read errors reported to the operating system. If the value is non-zero, you should back up your data.
190	BE	Airflow Temperature (WDC)		Airflow temperature on Western Digital HDs (Same as temp. (C2), but current value is 50 less.)
190	BE	Temperature Difference from 100		Value is equal to (100 -temp °C), allowing manufacturer to set a minimum threshold which corresponds to a maximum temperature. Seagate ST910021AS: Verified Present Seagate ST3802110A: Verified Present 2007-02-13 Seagate ST980825AS: Verified Present 2007-04-05 Seagate ST3320620AS: Verified Present 2007-04-23 Seagate ST3500641AS: Verified Present 2007-06-12 Seagate ST3250824AS: Verified Present 2007-08-07
191	BF	G-sense error rate		Frequency of mistakes as a result of impact loads
192	C0	Power-off Retract Count		Number of times the heads are loaded off the media. Heads can be unloaded without actually powering off. (or Emergency Retract Cycle count -Fujitsu)
193	C1	Load/Unload Cycle		Count of load/unload cycles into head landing zone position.
194	C2	Temperature		Current internal temperature.
195	C3	Hardware ECC Recovered		Time between ECC-corrected errors.
196	C4	Reallocation Event Count		Count of remap operations. The raw value of this attribute shows the total number of attempts to transfer data from reallocated sectors to a spare area. Both successful & unsuccessful attempts are counted.
197	C5	Current Pending Sector Count		Number of "unstable" sectors (waiting to be remapped). If the unstable sector is subsequently written or read successfully, this value is decreased and the sector is not remapped. Read errors on the sector will not remap the sector, it will only be remapped on a failed write attempt. This can be problematic to test because cached writes will not remap the sector, only direct I/O writes to the disk.
198	C6	Uncorrectable Sector Count		The total number of uncorrectable errors when reading/writing a sector. A rise in the value of this attribute indicates defects of the disk surface and/or problems in the mechanical subsystem.
199	C7	UltraDMA CRC Error Count		The number of errors in data transfer via the interface cable as determined by ICRC (Interface Cyclic Redundancy Check).
200	C8	Write Error Rate / Multi-Zone Error Rate		The total number of errors when writing a sector.
201	C9	Soft Read Error Rate		Number of off-track errors. If non-zero, make a backup.
202	CA	Data Address Mark errors		Number of Data Address Mark errors (or vendorspecific).
203	CB	Run Out Cancel		Number of ECC errors
204	CC	Soft ECC Correction		Number of errors corrected by software ECC
205	CD	Thermal Asperity Rate (TAR)		Number of thermal asperity errors.
206	CE	Flying Height
207	CF	Spin High Current		Amount of high current used to spin up the drive.
208	D0	Spin Buzz		Number of buzz routines to spin up the drive
209	D1	Offline Seek Performance		Drive’s seek performance during offline operations
220	DC	Disk Shift		Distance the disk has shifted relative to the spindle (usually due to shock). Unit of measure is unknown.
221	DD	G-Sense Error Rate		The number of errors resulting from externally-induced shock & vibration.
222	DE	Loaded Hours		Time spent operating under data load (movement of magnetic head armature)
223	DF	Load/Unload Retry Count		Number of times head changes position.
224	E0	Load Friction		Resistance caused by friction in mechanical parts while operating.
225	E1	Load/Unload Cycle Count		Total number of load cycles
226	E2	Load 'In'-time		Total time of loading on the magnetic heads actuator (time not spent in parking area).
227	E3	Torque Amplification Count		Number of attempts to compensate for platter speed variations
228	E4	Power- Off Retract Cycle		The number of times the magnetic armature was retracted automatically as a result of cutting power.
230	E6	GMR Head Amplitude		Amplitude of "thrashing" (distance of repetitive forward/reverse head motion)
231	E7	Temperature		Drive Temperature
240	F0	Head Flying Hours		Time while head is positioning
250	FA	Read Error Retry Rate		Number of errors while reading from a disk

Apple	Atari	Commodore	Data General	DEC	Honeywell
Hewlett Packard	IBM	NCR	Olivetti	Sinclair	Sun Microsystem
Silicon Graphics	Unisys	Mattel	Amstrad	Altre marche	Hardware
Calcolatrici	Fuori categoria	Pubblicita	Documentazione	Software	Lista

Bienvenido! - Willkommen! - Welcome!

Friday, February 1, 2013

Background

Saturday, June 23, 2012

Friday, August 5, 2011

Wednesday, July 27, 2011

Labels

Pages

Tux & Cía. -Ventas

Search This Blog

Tux & Cía.

Amenazas Informáticas

Browsers&Security Tests

Diseño Gráfico

Basic Data Management

Windows Updates

Network tools

Forums - Foren - Foros

Which Security SW do you use?

[Your PC]

Piador

Me and Linus Torvalds' work

Free&Open Source SW

Blog Archive

Labels

Comics

About Me

Nerd or Dren?

Live Traffic Map

useful LX commands

Recommended

Networking Technique - Computer Technologies - Commputer Archeology

The Power of Knowledge-El Poder del Conocimiento-Die Macht des Wissens

Visitantes-Besucher-Visitors