ABSTRACT
This article addresses information storage reliability and security when personal computer disk drives are used in enterprise-class nearline and archival storage systems. The low cost of these serial ATA (SATA) PC drives comes at the expense of drive reliability design and demonstration test levels, which are higher in the more expensive SCSI and Fibre Channel drives. SATA has the advantage that fewer, higher-capacity drives are needed for a given system storage capacity, which further reduces cost and allows higher drive failure rates. The article discusses the tradeoff between this advantage and the additional storage system redundancy and drive failure prediction needed to maintain system data integrity with less reliable drives. RAID stripe failure probability is calculated from typical ATA and SCSI drive failure rates for three cases: single-parity data reconstruction failure, double-parity data reconstruction failure, and failure due to drive unrecoverable block errors. The reliability improvement from drive failure prediction is also calculated, and can be significant. Today's SATA drive specifications for unrecoverable block errors appear to allow stripe reconstruction failure, and additional in-drive parity blocks are suggested as a solution. The possibility of using low-cost disk storage for backup and archiving, replacing higher-cost magnetic tape, is discussed. This requires a significantly better RAID stripe failure probability, and suitable drive technology alternatives are discussed. The failure rate of nonoperating drives is estimated using failure analysis results from ≈4000 drives, and nonoperating RAID stripe failure rates are estimated from it. User data security must be assured in addition to reliability, and must extend past the point where physical control of drives is lost, such as when drives are removed from systems for data vaulting, repair, sale, or discard. Today, over a third of resold drives contain unerased user data.
Three ways of providing security are proposed: the existing SATA drive secure-erase command, the existing SATA drive password commands, or data encryption. Finally, backup and archival disk storage is compared to magnetic tape, a technology with a proven reliability record over the full half-century of digital data storage. Unlike archived disk drives, tape archives are not vulnerable to transport failure modes: only failure modes in the archived tapes and reels themselves will make data unrecoverable.
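The kind of stripe failure calculation summarized above can be illustrated with a minimal sketch. It is not the article's own model: it assumes independent bit errors at a fixed unrecoverable bit error rate, independent exponentially distributed drive lifetimes, and a rebuild that reads every surviving drive in full. The function names and all parameters are hypothetical.

```python
import math


def p_unrecoverable_read(n_drives, drive_bytes, uber):
    """Probability of at least one unrecoverable bit error while reading
    all surviving drives in full during a single-parity rebuild.

    Assumes independent bit errors at rate `uber` per bit read.
    """
    bits_read = (n_drives - 1) * drive_bytes * 8
    # 1 - (1 - uber)**bits_read, evaluated stably for very small uber.
    return -math.expm1(bits_read * math.log1p(-uber))


def p_second_drive_failure(n_drives, rebuild_hours, mttf_hours):
    """Probability that any surviving drive fails during the rebuild window.

    Assumes independent, exponentially distributed drive lifetimes
    with mean `mttf_hours`.
    """
    return -math.expm1(-(n_drives - 1) * rebuild_hours / mttf_hours)


if __name__ == "__main__":
    # Illustrative inputs only; none of these values come from the article.
    print(p_unrecoverable_read(n_drives=8, drive_bytes=500e9, uber=1e-14))
    print(p_second_drive_failure(n_drives=8, rebuild_hours=24, mttf_hours=1e6))
```

Under these assumptions, the read error term grows with both drive count and drive capacity, which is one way to see why unrecoverable block error specifications matter for large-capacity drives.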