ACM Home Page
Please provide us with feedback. Feedback
HTTP redirection for replica catalogue lookups in data grids
Full text PdfPdf (838 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2002 ACM symposium on Applied computing table of contents
Madrid, Spain
SESSION: Parallel and distributed systems and networking table of contents
Pages: 882 - 889  
Year of Publication: 2002
ISBN:1-58113-445-2
Authors
Heinz Stockinger  CERN, European Organization for Nuclear Research, Geneva, Switzerland and University of Vienna, Austria
Andrew Hanushevsky  Stanford University, Menlo Park, California
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 14,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/508791.508962
What is a DOI?

ABSTRACT

Data distribution and replication in distributed systems require special purpose middleware tools for accessing replicated data. Data Grids, special forms of systems distributed over wide-area networks, need to handle data management issues like distribution and replication of large amounts of data in the Tera- and Petabyte scale. Replica catalogues are used for cataloguing and locating replicated files in distributed sites all around the globe. We present a novel and administratively scalable approach for distributing a replica catalogue and resolving file location information by using HTTP redirection. HTTP redirection servers managing local file catalogues allow for greater flexibility and local file management autonomy whereas a global replica catalogue provides the necessary mapping of logical files to individual sites. By distributing the catalogues a site can autonomously move files for load balancing within a site without notifying a global replica catalogue. Our approach scales well in terms of catalogue administration to a large number of sites and file entries and thus establishes a powerful middleware service. We present the design and implementation of our catalogue redirection servers and report on promising experimental results.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Jim Basney, Miron Livny, and Todd Tannenbaum, "High Throughput Computing with Condor", HPCU news, Volume 1(2), June 1997.
 
3
 
4
Jacek Becla, Andrew Hanushevsky. Creating Highly Performant Servers For Massively Large Databases. 9th IEEE International Symposium on High Performance Distributed Computing. Pittsburgh, Pennsylvania, August 1-4, 2000.
 
5
Ann Chervenak, Ian Foster, Carl Kesselman, Chuck Salisbury, Steve Tuecke. The Data Grid: Towards an Architecture for the Distributed Management and Analysis is of Large Scientific DataSets. Network Storage Symposium, Seattle 1999.
 
6
 
7
Distributed Computing Environment (DCE), http://www.opengroup.org/dce/
 
8
European DataGrid Project, http://www.eu-datagrid.org
 
9
Roy Fielding, James Gettys, Jeffrey Mogul, Henrik Nielson, Larry Masinter, Paul Leach, Tim Berners-Lee. Hypertext Transfer Protocol --- HTTP/1.1, RFC 2616, June 1999.
 
10
11
12
 
13
Andrew Hanushevsky, Marcin Nowak. Pursuit of a Scalable High Performance Multi-Petabyte Database. IEEE Symposium on Mass Storage Systems, March 15-18, 1999.
 
14
 
15
High Performance Storage System (HPSS): http://www.sdsc.edu/hpss/
 
16
 
17
Newsletter at National University of Singapore, Novel Framework Enhances Web Performance and Scalability, Volume 5 Number 2, Apr./Jun. 2000.
 
18
 
19
 
20
Wengyik Yeong, Tim Howes, Steve Kille. Lightweight Directory Access Protocol, RFC 1777, March 1995.


Collaborative Colleagues:
Heinz Stockinger: colleagues
Andrew Hanushevsky: colleagues