| A Metadata Catalog Service for Data Intensive Applications |
| Full text |
Pdf
(178 KB)
|
| Source
|
Conference on High Performance Networking and Computing
archive
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
table of contents
Page: 33
Year of Publication: 2003
ISBN:1-58113-695-1
|
|
Authors
|
|
Gurmeet Singh
|
Information Sciences Institute, University of Southern California, Marina Del Rey, CA
|
|
Shishir Bharathi
|
Information Sciences Institute, University of Southern California, Marina Del Rey, CA
|
|
Ann Chervenak
|
Information Sciences Institute, University of Southern California, Marina Del Rey, CA
|
|
Ewa Deelman
|
Information Sciences Institute, University of Southern California, Marina Del Rey, CA
|
|
Carl Kesselman
|
Information Sciences Institute, University of Southern California, Marina Del Rey, CA
|
|
Mary Manohar
|
Information Sciences Institute, University of Southern California, Marina Del Rey, CA
|
|
Sonal Patil
|
Information Sciences Institute, University of Southern California, Marina Del Rey, CA
|
|
Laura Pearlman
|
Information Sciences Institute, University of Southern California, Marina Del Rey, CA
|
|
| Sponsor |
|
| Publisher |
IEEE Computer Society
Washington, DC, USA
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 85, Citation Count: 22
|
|
|
ABSTRACT
Advances in computational, storage and network technologies as well as middle ware such as the Globus Toolkit allow scientists to expand the sophistication and scope of data-intensive applications. These applications produce and analyze terabytes and petabytes of data that are distributed in millions of files or objects. To manage these large data sets efficiently, metadata or descriptive information about the data needs to be managed. There are various types of metadata, and it is likely that a range of metadata services will exist in Grid environments that are specialized for particular types of metadata cataloguing and discovery. In this paper, we present the design of a Metadata Catalog Service (MCS) that provides a mechanism for storing and accessing descriptive metadata and allows users to query for data items based on desired attributes. We describe our experience in using the MCS with several applications and present a scalability study of the service.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
[2] I. Foster, "Grid Computing," presented at Advanced Computing and Analysis Techniques in Physics Research (ACAT), 2000.
|
| |
3
|
|
| |
4
|
Ann Chervenak , Ewa Deelman , Ian Foster , Leanne Guy , Wolfgang Hoschek , Adriana Iamnitchi , Carl Kesselman , Peter Kunszt , Matei Ripeanu , Bob Schwartzkopf , Heinz Stockinger , Kurt Stockinger , Brian Tierney, Giggle: a framework for constructing scalable replica location services, Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p.1-17, November 16, 2002, Baltimore, Maryland
|
| |
5
|
[5] ESG, "The Earth Systems Grid." http://www.earthsystemgrid.org
|
| |
6
|
[6] E. Deelman, J. Blythe, Y. Gil, C. Kesselman, G. Mehta, K. Vahi, A. Arbree, R. Cavanaugh, K. Blackburn, A. Lazzarini, and S. Koranda, "Mapping Abstract Complex Workflows onto Grid Environments," Journal of Grid Computing, vol. 1, pp. 25-39, 2003.
|
 |
7
|
Bill Allcock , Ian Foster , Veronika Nefedova , Ann Chervenak , Ewa Deelman , Carl Kesselman , Jason Lee , Alex Sim , Arie Shoshani , Bob Drach , Dean Williams, High-performance remote access to climate simulation data: a challenge problem for data grid technologies, Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM), p.46-46, November 10-16, 2001, Denver, Colorado
[doi> 10.1145/582034.582080]
|
| |
8
|
|
| |
9
|
[9] A. Chervenak, E. Deelman, C. Kesselman, L. Pearlman, and G. Singh, "A Metadata Catalog Service for Data Intensive Applications," GriPhyN technical report, 2002-11 2002.
|
| |
10
|
[10] E. Deelman, J. Blythe, Y. Gil, and C. Kesselman, "Pegasus: Planning for Execution in Grids," GriPhyN 2002-20, 2002.
|
| |
11
|
[11] A. Abramovici, W. E. Althouse, and e. al., "LIGO: The Laser Interferometer Gravitational-Wave Observatory (in Large Scale Measurements)," Science, vol. 256, pp. 325-333, 1992.
|
| |
12
|
Ewa Deelman , Carl Kesselman , Gaurang Mehta , Leila Meshkat , Laura Pearlman , Kent Blackburn , Phil Ehrens , Albert Lazzarini , Roy Williams , Scott Koranda, GriPhyN and LIGO, Building a Virtual Data Grid for Gravitational Wave Scientists, Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing, p.225, July 24-26, 2002
|
| |
13
|
[13] MCAT, "MCAT - A Meta Information Catalog (Version 1.1)."
|
| |
14
|
Chaitanya Baru , Reagan Moore , Arcot Rajasekar , Michael Wan, The SDSC storage resource broker, Proceedings of the 1998 conference of the Centre for Advanced Studies on Collaborative research, p.5, November 30-December 03, 1998, Toronto, Ontario, Canada
|
| |
15
|
[15] Guy, L., P. Kunszt, E. Laure, H. Stockinger, K. Stockinger (2002). Replica Management in Data Grids. Global Grid Forum 5.
|
| |
16
|
|
CITED BY 22
|
|
Peisheng Zhao , Aijun Chen , Yang Liu , Liping Di , Wenli Yang , Peichuan Li, Grid metadata catalog service-based OGC web registry service, Proceedings of the 12th annual ACM international workshop on Geographic information systems, November 12-13, 2004, Washington DC, USA
|
|
|
|
|
|
|
|
|
Mobin Uddin Ahmed , Raja Asad Zaheer , M. Abdul Qadir, Intelligent cache management for data grid, Proceedings of the 2005 Australasian workshop on Grid computing and e-research, p.5-12, January 01, 2005, Newcastle, New South Wales, Australia
|
|
|
Gurmeet Singh , Ewa Deelman , Gaurang Mehta , Karan Vahi , Mei-Hui Su , G. Bruce Berriman , John Good , Joseph C. Jacob , Daniel S. Katz , Albert Lazzarini , Kent Blackburn , Scott Koranda, The Pegasus portal: web based grid computing, Proceedings of the 2005 ACM symposium on Applied computing, March 13-17, 2005, Santa Fe, New Mexico
|
|
|
Beth Plale , Dennis Gannon , Yi Huang , Gopi Kandaswamy , Sangmi Lee Pallickara , Aleksander Slominski, Cooperating Services for Data-Driven Computational Experimentation, Computing in Science and Engineering, v.7 n.5, p.34-43, September 2005
|
|
|
Beth Plale , Jay Alameda , Bob Wilhelmson , Dennis Gannon , Shawn Hampton , Al Rossi , Kelvin Droegemeier, Active Management of Scientific Data, IEEE Internet Computing, v.9 n.1, p.27-34, January 2005
|
|
|
|
|
|
|
|
|
|
|
|
Nandita Mandal , Ewa Deelman , Gaurang Mehta , Mei-Hui Su , Karan Vahi, Integrating existing scientific workflow systems: the Kepler/Pegasus example, Proceedings of the 2nd workshop on Workflows in support of large-scale science, June 25-25, 2007, Monterey, California, USA
|
|
|
Michael R. Head , Madhusudhan Govindaraju, Approaching a parallelized XML parser optimized for multi-coreprocessors, Proceedings of the 2007 workshop on Service-oriented computing performance: aspects, issues, and approaches, p.17-22, June 25-25, 2007, Monterey, California, USA
|
|
|
Ewa Deelman , Gurmeet Singh , Mei-Hui Su , James Blythe , Yolanda Gil , Carl Kesselman , Gaurang Mehta , Karan Vahi , G. Bruce Berriman , John Good , Anastasia Laity , Joseph C. Jacob , Daniel S. Katz, Pegasus: A framework for mapping complex scientific workflows onto distributed systems, Scientific Programming, v.13 n.3, p.219-237, July 2005
|
|
|
Michael R. Head , Madhusudhan Govindaraju , Aleksander Slominski , Pu Liu , Nayef Abu-Ghazaleh , Robert van Engelen , Kenneth Chiu , Michael J. Lewis, A Benchmark Suite for SOAP-based Communication in Grid Web Services, Proceedings of the 2005 ACM/IEEE conference on Supercomputing, p.19, November 12-18, 2005
|
|
|
|
|
|
Elizeu Santos-Neto , Samer Al-Kiswany , Nazareno Andrade , Sathish Gopalakrishnan , Matei Ripeanu, enabling cross-layer optimizations in storage systems with custom metadata, Proceedings of the 17th international symposium on High performance distributed computing, June 23-27, 2008, Boston, MA, USA
|
|
|
|
|
|
P. Brenner , J. M. Wozniak , D. Thain , A. Striegel , J. W. Peng , J. A. Izaguirre, Biomolecular committor probability calculation enabled by processing in network storage, Parallel Computing, v.34 n.11, p.652-660, November, 2008
|
|
|
|
|
|
Rattapoom Tuchinda , Snehal Thakkar , Yolanda Gil , Ewa Deelman, Artemis: integrating scientific data on the grid, Proceedings of the 16th conference on Innovative applications of artifical intelligence, p.892-899, July 25-29, 2004, San Jose, California
|
|
|
|
|
|
|
|