ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Reliability and security of RAID storage systems and D2D archives using SATA disk drives
Full text PdfPdf (95 KB)
Source ACM Transactions on Storage (TOS) archive
Volume 1 ,  Issue 1  (February 2005) table of contents
Pages: 95 - 107  
Year of Publication: 2005
ISSN:1553-3077
Authors
Gordon F. Hughes  University of California San Diego, San Diego, CA
Joseph F. Murray  University of California San Diego, San Diego, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 168,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1044956.1044961
What is a DOI?

ABSTRACT

Information storage reliability and security is addressed by using personal computer disk drives in enterprise-class nearline and archival storage systems. The low cost of these serial ATA (SATA) PC drives is a tradeoff against drive reliability design and demonstration test levels, which are higher in the more expensive SCSI and Fibre Channel drives. This article discusses the tradeoff between SATA which has the advantage that fewer higher capacity drives are needed for a given system storage capacity, which further reduces cost and allows higher drive failure rates, and the use of additional storage system redundancy and drive failure prediction to maintain system data integrity using less reliable drives. RAID stripe failure probability is calculated using typical ATA and SCSI drive failure rates, for single and double parity data reconstruction failure, and failure due to drive unrecoverable block errors. Reliability improvement from drive failure prediction is also calculated, and can be significant. Today's SATA drive specifications for unrecoverable block errors appear to allow stripe reconstruction failure, and additional in-drive parity blocks are suggested as a solution. The possibility of using low cost disks data for backup and archiving is discussed, replacing higher cost magnetic tape. This requires significantly better RAID stripe failure probability, and suitable drive technology alternatives are discussed. The failure rate of nonoperating drives is estimated using failure analysis results from ≈4000 drives. Nonoperating RAID stripe failure rates are thereby estimated. User data security needs to be assured in addition to reliability, and to extend past the point where physical control of drives is lost, such as when drives are removed from systems for data vaulting, repair, sale, or discard. Today, over a third of resold drives contain unerased user data. Security is proposed via the existing SATA drive secure-erase command, or via the existing SATA drive password commands, or by data encryption. Finally, backup and archival disc storage is compared to magnetic tape, a technology with a proven reliability record over the full half-century of digital data storage. In contrast, tape archives are not vulnerable to tape transport failure modes. Only failure modes in the archived tapes and reels will make data unrecoverable.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Daniel, E., Mee, C. D., and Clark, M. C. 1999. Magnetic recording, the first 100 years. IEEE Press. 20.
 
3
Colarelli, D., Grunwald, D., and Neufeld, M. 2002. The case for massive arrays of idle disks. 2002 Conference on File and Storage Technologies.
 
4
 
5
 
6
Hughes, G. F. 2002. Improved disk drive failure warnings. IEEE Trans. Reliab. 51, (Sept.), 350--357.
 
7
Hughes, G. F. 2002. Wise Drives, IEEE Spectrum (Aug.).
 
8
 
9
Lueth, C. 2004. NetApp data double parity RAID for enhanced data protection with RAID DP. Network Appliance Report TR3298 (Jan.).
 
10
Moser, A., Takano, K., Margulies, D., Albrecht, M., Sonobe, Y., Ikeda, Y., Sun, S., and Fullerton, E. 2002. Magnetic recording: advancing into the future. J. Phys. D: Appl. Phys. 35. R157--67.
 
11
Murray, J. F. and Hughes, G. F. 2003. Hard drive failure prediction using non-parametric statistical methods. International Conference on Artificial Neural Networks. Istanbul.
 
12
Joseph F. Murray , Gordon F. Hughes , Kenneth Kreutz-Delgado, Machine Learning Methods for Predicting Failures in Hard Drives: A Multiple-Instance Application, The Journal of Machine Learning Research, 6, p.783-816, 9/1/2005
 
13
Schwarz, T. J. E. and Burkhard, W. A. 1995. Reliability and performance of RAIDs. IEEE Trans. Mag. 31, 2 (March), 1161--1166.
 
14
Storage Networking Industry Association OSD Technical Work Group. www.snia.org.


Collaborative Colleagues:
Gordon F. Hughes: colleagues
Joseph F. Murray: colleagues