| Giggle: a framework for constructing scalable replica location services |
| Full text |
Pdf
(277 KB)
|
| Source
|
Conference on High Performance Networking and Computing
archive
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
table of contents
Baltimore, Maryland
Pages: 1 - 17
Year of Publication: 2002
|
|
Authors
|
|
Ann Chervenak
|
University of Southern California, Marina del Rey, CA
|
|
Ewa Deelman
|
University of Southern California, Marina del Rey, CA
|
|
Ian Foster
|
University of Chicago, Chicago, IL 60637 and Argonne National Laboratory, Argonne, IL
|
|
Leanne Guy
|
CERN, European Organization for Nuclear Research, Geneva, Switzerland
|
|
Wolfgang Hoschek
|
CERN, European Organization for Nuclear Research, Geneva, Switzerland
|
|
Adriana Iamnitchi
|
University of Chicago, Chicago, IL
|
|
Carl Kesselman
|
University of Southern California, Marina del Rey, CA
|
|
Peter Kunszt
|
CERN, European Organization for Nuclear Research, Geneva, Switzerland
|
|
Matei Ripeanu
|
University of Chicago, Chicago, IL
|
|
Bob Schwartzkopf
|
University of Southern California, Marina del Rey, CA
|
|
Heinz Stockinger
|
CERN, European Organization for Nuclear Research, Geneva, Switzerland
|
|
Kurt Stockinger
|
CERN, European Organization for Nuclear Research, Geneva, Switzerland
|
|
Brian Tierney
|
Lawrence Berkeley National Laboratory
|
|
| Sponsors |
|
| Publisher |
IEEE Computer Society Press
Los Alamitos, CA, USA
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 57, Citation Count: 48
|
|
|
ABSTRACT
In wide area computing systems, it is often desirable to create remote read-only copies (replicas) of files. Replication can be used to reduce access latency, improve data locality, and/or increase robustness, scalability and performance for distributed applications. We define a replica location service (RLS) as a system that maintains and provides access to information about the physical locations of copies. An RLS typically functions as one component of a data grid architecture. This paper makes the following contributions. First, we characterize RLS requirements. Next, we describe a parameterized architectural framework, which we name Giggle (for GIGa-scale Global Location Engine), within which a wide range of RLSs can be defined. We define several concrete instantiations of this framework with different performance characteristics. Finally, we present initial performance results for an RLS prototype, demonstrating that RLS systems can be constructed that meet performance goals.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
The DataGrid Architecture. 2001, EU DataGrid Project.
|
| |
2
|
Chervenak, A., et al., The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Data Sets. J. Network and Computer Applications, 2001(23): p. 187--200.
|
| |
3
|
Foster, I. and C. Kesselman, A Data Grid Reference Architecture. 2001.
|
| |
4
|
Bill Allcock , Joe Bester , John Bresnahan , Ann L. Chervenak , Ian Foster , Carl Kesselman , Sam Meder , Veronika Nefedova , Darcy Quesnel , Steven Tuecke, Data management and transfer in high-performance computational grid environments, Parallel Computing, v.28 n.5, p.749-771, May 2002
[doi> 10.1016/S0167-8191(02)00094-7]
|
| |
5
|
|
| |
6
|
Chandy, K.M., A. Rifkin, and E. Schooler, Using Announce-Listen with Global Events to Develop Distributed Control Systems. Concurrency: Practice and Experience, 1998. 10. (11--13): p. 1021--1027.
|
 |
7
|
|
 |
8
|
|
| |
9
|
Zhang, L., et al. RSVP: A new Resource ReSerVation Protocol. in IEEE Network. 1993.
|
 |
10
|
|
| |
11
|
|
| |
12
|
Foster, I., et al., The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration. 2002, Globus Project.
|
| |
13
|
Hoschek, W., A Unified Peer-to-Peer Database Framework and its Application for Scalable Service Discovery, in 2002. 2002, CERN.
|
| |
14
|
|
 |
15
|
David Karger , Eric Lehman , Tom Leighton , Rina Panigrahy , Matthew Levine , Daniel Lewin, Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web, Proceedings of the twenty-ninth annual ACM symposium on Theory of computing, p.654-663, May 04-06, 1997, El Paso, Texas, United States
[doi> 10.1145/258533.258660]
|
| |
16
|
David Karger , Alex Sherman , Andy Berkheimer , Bill Bogstad , Rizwan Dhanidina , Ken Iwamoto , Brian Kim , Luke Matkins , Yoav Yerushalmi, Web caching with consistent hashing, Computer Networks: The International Journal of Computer and Telecommunications Networking, v.31 n.11-16, p.1203-1213, May 17, 1999
|
| |
17
|
|
| |
18
|
|
 |
19
|
|
| |
20
|
|
 |
21
|
Jim Gray , Pat Helland , Patrick O'Neil , Dennis Shasha, The dangers of replication and a solution, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.173-182, June 04-06, 1996, Montreal, Quebec, Canada
|
 |
22
|
Karin Petersen , Mike J. Spreitzer , Douglas B. Terry , Marvin M. Theimer , Alan J. Demers, Flexible update propagation for weakly consistent replication, Proceedings of the sixteenth ACM symposium on Operating systems principles, p.288-301, October 05-08, 1997, Saint Malo, France
|
| |
23
|
Popek, G.J., et al. Replication in Ficus distributed file systems. in Workshop on Management of Replicated Data,. 1990: IEEE.
|
| |
24
|
Jeff Sidell , Paul M. Aoki , Adam Sah , Carl Staelin , Michael Stonebraker , Andrew Yu, Data Replication in Mariposa, Proceedings of the Twelfth International Conference on Data Engineering, p.485-494, February 26-March 01, 1996
|
| |
25
|
|
| |
26
|
Michael Stonebraker , Paul M. Aoki , Witold Litwin , Avi Pfeffer , Adam Sah , Jeff Sidell , Carl Staelin , Andrew Yu, Mariposa: a wide-area distributed database system, The VLDB Journal — The International Journal on Very Large Data Bases, v.5 n.1, p.048-063, January 1996
[doi> 10.1007/s007780050015]
|
| |
27
|
Terry, D.B., et al. The Case for Non-transparent Replication: Examples from Bayou. in IEEE Data Engineering. 1998.
|
| |
28
|
Thomas W. Page, J., et al. Management of replicated volume location data in the Ficus replicated file system. in USENIX Conference Proceedings. 1991.
|
| |
29
|
|
 |
30
|
Alec Wolman , M. Voelker , Nitin Sharma , Neal Cardwell , Anna Karlin , Henry M. Levy, On the scale and performance of cooperative Web proxy caching, Proceedings of the seventeenth ACM symposium on Operating systems principles, p.16-31, December 12-15, 1999, Charleston, South Carolina, United States
|
| |
31
|
Wang, J. A Survey of Web Caching Schemes for the Internet. in Proceedings of ACM SIGCOMM '99 Conference. 1999.
|
| |
32
|
|
| |
33
|
Ross, K.W., Hash routing for collections of shared Web caches. IEEE Network, 1997: p. 37--44.
|
| |
34
|
Valloppillil, V. and K.W. Ross. Cache array routing protocol v1.0. in Internet Draft. 1988.
|
 |
35
|
Sylvia Ratnasamy , Paul Francis , Mark Handley , Richard Karp , Scott Schenker, A scalable content-addressable network, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.161-172, August 2001, San Diego, California, United States
|
 |
36
|
Ion Stoica , Robert Morris , David Karger , M. Frans Kaashoek , Hari Balakrishnan, Chord: A scalable peer-to-peer lookup service for internet applications, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.149-160, August 2001, San Diego, California, United States
|
| |
37
|
|
| |
38
|
Ripeanu, M., I. Foster, and A. Iamnitchi, Mapping the Gnutella Network: Properties of Large-Scale Peer-to-Peer Systems and Implications for System Design. 2001, University of Chicago.
|
| |
39
|
Clarke, I., et al. Freenet: A Distributed Anonymous Information Storage and Retrieval System. in ICSI Workshop on Design Issues in Anonymity and Unobservability. 1999.
|
 |
40
|
Steven E. Czerwinski , Ben Y. Zhao , Todd D. Hodes , Anthony D. Joseph , Randy H. Katz, An architecture for a secure service discovery service, Proceedings of the 5th annual ACM/IEEE international conference on Mobile computing and networking, p.24-35, August 15-19, 1999, Seattle, Washington, United States
[doi> 10.1145/313451.313462]
|
| |
41
|
|
| |
42
|
|
| |
43
|
Hoschek, W. A Database for Dynamic Distributed Content and its Application for Service and Resource Discovery. in International IEEE Symposium on Parallel and Distributed Computing. 2002.
|
| |
44
|
Heinz Stockinger , Asad Samar , Koen Holtman , Bill Allcock , Ian Foster , Brian Tierney, File and Object Replication in Data Grids, Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing (HPDC-10'01), p.76, August 07-09, 2001
|
| |
45
|
Chaitanya Baru , Reagan Moore , Arcot Rajasekar , Michael Wan, The SDSC storage resource broker, Proceedings of the 1998 conference of the Centre for Advanced Studies on Collaborative research, p.5, November 30-December 03, 1998, Toronto, Ontario, Canada
|
CITED BY 49
|
|
Ann Chervenak , Ewa Deelman , Carl Kesselman , Bill Allcock , Ian Foster , Veronika Nefedova , Jason Lee , Alex Sim , Arie Shoshani , Bob Drach , Dean Williams , Don Middleton, High-performance remote access to climate simulation data: a challenge problem for data grid technologies, Parallel Computing, v.29 n.10, p.1335-1356, October 2003
|
|
|
|
|
|
Jason Lee , Dan Gunter , Martin Stoufer , Brian Tierney, Monitoring data archives for grid environments, Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p.1-10, November 16, 2002, Baltimore, Maryland
|
|
|
|
|
|
|
|
|
|
|
|
O. T. Anderson , L. Luan , C. Everhart , M. Pereira , R. Sarkar , J. Xu, Global namespace for files, IBM Systems Journal, v.43 n.4, p.702-722, January 2004
|
|
|
|
|
|
Gurmeet Singh , Ewa Deelman , Gaurang Mehta , Karan Vahi , Mei-Hui Su , G. Bruce Berriman , John Good , Joseph C. Jacob , Daniel S. Katz , Albert Lazzarini , Kent Blackburn , Scott Koranda, The Pegasus portal: web based grid computing, Proceedings of the 2005 ACM symposium on Applied computing, March 13-17, 2005, Santa Fe, New Mexico
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
R. S. Bhuvaneswaran , Yoshiaki Katayama , Naohisa Takahashi, Redundant parallel data transfer schemes for the grid environment, Proceedings of the 2006 Australasian workshops on Grid computing and e-research, p.71-78, January 16-19, 2006, Hobart, Tasmania, Australia
|
|
|
|
|
|
Nandita Mandal , Ewa Deelman , Gaurang Mehta , Mei-Hui Su , Karan Vahi, Integrating existing scientific workflow systems: the Kepler/Pegasus example, Proceedings of the 2nd workshop on Workflows in support of large-scale science, June 25-25, 2007, Monterey, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
Brandon Szeliga , John Cavicchio , Weisong Shi, DIMM: a distributed metadata management for data-intensive HPC environments, Proceedings of the 2008 international workshop on Data-aware distributed computing, p.19-28, June 24-24, 2008, Boston, MA, USA
|
|
|
|
|
|
|
|
|
Ewa Deelman , Gurmeet Singh , Mei-Hui Su , James Blythe , Yolanda Gil , Carl Kesselman , Gaurang Mehta , Karan Vahi , G. Bruce Berriman , John Good , Anastasia Laity , Joseph C. Jacob , Daniel S. Katz, Pegasus: A framework for mapping complex scientific workflows onto distributed systems, Scientific Programming, v.13 n.3, p.219-237, July 2005
|
|
|
|
|
|
|
|
|
Gurmeet Singh , Shishir Bharathi , Ann Chervenak , Ewa Deelman , Carl Kesselman , Mary Manohar , Sonal Patil , Laura Pearlman, A Metadata Catalog Service for Data Intensive Applications, Proceedings of the 2003 ACM/IEEE conference on Supercomputing, p.33, November 15-21, 2003
|
|
|
Ewa Deelman , Raymond Plante , Carl Kesselman , Gurmeet Singh , Mei-Hui Su , Gretchen Greene , Robert Hanisch , Niall Gaffney , Antonio Volpicelli , James Annis , Vijay Sekhri , Tamas Budavari , Maria Nieto-Santisteban , William O'Mullane , David Bohlender , Tom McGlynn , Arnold Rots , Olga Pevunova, Grid-Based Galaxy Morphology Analysis for the National Virtual Observatory, Proceedings of the 2003 ACM/IEEE conference on Supercomputing, p.47, November 15-21, 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Athanasia Asiki , Katerina Doka , Ioannis Konstantinou , Antonis Zissimos , Dimitrios Tsoumakos , Nectarios Koziris , Panayiotis Tsanakas, A grid middleware for data management exploiting peer-to-peer techniques, Future Generation Computer Systems, v.25 n.4, p.426-435, April, 2009
|
|
|
|
|
|
Emrah Ceyhan , Gabrielle Allen , Christopher White , Tevfik Kosar, A grid-enabled workflow system for reservoir uncertainty analysis, Proceedings of the 6th international workshop on Challenges of large applications in distributed environments, June 23-23, 2008, Boston, MA, USA
|
|
|
|
|
|
|
|
|
Steven Lynden , Arijit Mukherjee , Alastair C. Hume , Alvaro A. A. Fernandes , Norman W. Paton , Rizos Sakellariou , Paul Watson, The design and implementation of OGSA-DQP: A service-based distributed query processor, Future Generation Computer Systems, v.25 n.3, p.224-236, March, 2009
|
|
|
|
|
|
|
|
|
Ann Chervenak , Ewa Deelman , Miron Livny , Mei-Hui Su , Rob Schuler , Shishir Bharathi , Gaurang Mehta , Karan Vahi, Data placement for scientific applications in distributed environments, Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, p.267-274, September 19-21, 2007
|
|
|
Chao-Tung Yang , I-Hsien Yang , Shih-Yu Wang , Ching-Hsien Hsu , Kuan-Ching Li, A Recursively-Adjusting Co-allocation scheme with a Cyber-Transformer in Data Grids, Future Generation Computer Systems, v.25 n.7, p.695-703, July, 2009
|
|
|
Antony Chazapis , Georgios Tsoukalas , Georgios Verigakis , Kornilios Kourtis , Aristidis Sotiropoulos , Nectarios Koziris, Global-scale peer-to-peer file services with DFS, Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, p.251-258, September 19-21, 2007
|
|
|
|
|
|
|
|
|
Joseph C. Jacob , Daniel S. Katz , G. Bruce Berriman , John C. Good , Anastasia C. Laity , Ewa Deelman , Carl Kesselman , Gurmeet Singh , Mei-Hui Su , Thomas A. Prince , Roy Williams, Montage: a grid portal and software toolkit for science-grade astronomical image mosaicking, International Journal of Computational Science and Engineering, v.4 n.2, p.73-87, July 2009
|
|
|
|
|
|
John Bent , Timothy E. Denehy , Miron Livny , Andrea C. Arpaci-Dusseau , Remzi H. Arpaci-Dusseau, Data-driven batch scheduling, Proceedings of the second international workshop on Data-aware distributed computing, p.1-10, June 09-10, 2009, Garching, Germany
|
|
|
|
|
|
|
|