ACM Home Page
Please provide us with feedback. Feedback
Exploring data reliability tradeoffs in replicated storage systems
Full text PdfPdf (982 KB)
Source
High Performance Distributed Computing archive
Proceedings of the 18th ACM international symposium on High performance distributed computing table of contents
Garching, Germany
SESSION: Data nabagenebt table of contents
Pages 217-226  
Year of Publication: 2009
ISBN:978-1-60558-587-1
Authors
Abdullah Gharaibeh  The University of British Columbia, Vancouver, Canada
Matei Ripeanu  The University of British Columbia, Vancouver, Canada
Sponsors
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 25,   Downloads (12 Months): 75,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1551609.1551643
What is a DOI?

ABSTRACT

This paper explores the feasibility of a cost-efficient storage architecture that offers the reliability and access performance characteristics of a high-end system. This architecture exploits two opportunities: First, scavenging idle storage from LAN-connected desktops not only offers a low-cost storage space, but also high I/O throughput by aggregating the I/O channels of the participating nodes. Second, the two components of data reliability - durability and availability - can be decoupled to control overall system cost. To capitalize on these opportunities, we integrate two types of components: volatile, scavenged storage and dedicated, yet low-bandwidth durable storage. On the one hand, the durable storage forms a low-cost back-end that enables the system to restore the data the volatile nodes may lose. On the other hand, the volatile nodes provide a high-throughput front-end.

While integrating these components has the potential to offer a unique combination of high throughput, low cost, and durability, a number of concerns need to be addressed to architect and correctly provision the system. To this end, we develop analytical- and simulation based tools to evaluate the impact of system characteristics (e.g., bandwidth limitations on the durable and the volatile nodes) and design choices (e.g., replica placement scheme) on data availability and the associated system costs (e.g., maintenance traffic). Further, we implement and evaluate a prototype of the proposed architecture: namely a GridFTP server that aggregates volatile resources. Our evaluation demonstrates an impressive, up to 800MBps transfer throughput for the new GridFTP service.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
Allcock, W., et al. GridFTP: Protocol Extensions to FTP for the Grid. Global Grid ForumGFD-RP, 20(2003).
 
4
 
5
Amazon Web Services. http:/s3.amazonaws.com.
 
6
 
7
8
 
9
 
10
 
11
 
12
13
 
14
 
15
Fuhrmann, P. dCache, the commodity cache. In proceedings of the Twelfth NASA Goddard and Twenty First IEEE Conference on Mass Storage Systems and Technologies, Washington DC., 2004.
16
17
 
18
Heath, T., et al. The Shape of Failure. In Proceedings of the First Workshop on Evaluating and Architecting System dependabilitY (EASY). 2001.
 
19
20
21
22
 
23
Microsystems, S. and others. NFS: Network file system protocol specification. Request for Comments, 1094(1988).
24
 
25
Nurmi, D., et al. Modeling Machine Availability in Enterprise and Wide-Area Distributed Computing Environments. LECTURE NOTES IN COMPUTER SCIENCE, 3648(2005), 432.
26
 
27
 
28
Ramabhadran, S. and Pasquale, J. Analysis of Long-Running Replicated Systems. In INFOCOM, 2006.
 
29
Rausand, M. and Hoyland, A. System Reliability Theory: Models, Statistical Methods, and Applications. Wiley-Interscience, 2004.
 
30
Simpy homepage. http://simpy.sourceforge.net/ (2009).
 
31
 
32
 
33

Collaborative Colleagues:
Abdullah Gharaibeh: colleagues
Matei Ripeanu: colleagues