ACM Home Page
Please provide us with feedback. Feedback
Active Proxy-G: optimizing the query execution process in the grid
Full text PdfPdf (248 KB)
Source Conference on High Performance Networking and Computing archive
Proceedings of the 2002 ACM/IEEE conference on Supercomputing table of contents
Baltimore, Maryland
Pages: 1 - 15  
Year of Publication: 2002
Authors
Henrique Andrade  University of Maryland, College Park, MD
Tahsin Kurc  The Ohio State University, Columbus, OH
Alan Sussman  University of Maryland, College Park, MD
Joel Saltz  University of Maryland, College Park, MD and The Ohio State University, Columbus, OH
Sponsors
IEEE-CS\DATC : IEEE Computer Society
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
IEEE Computer Society Press  Los Alamitos, CA, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 16,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

The Grid environment facilitates collaborative work and allows many users to query and process data over geographically dispersed data repositories. Over the past several years, there has been a growing interest in developing applications that interactively analyze datasets, potentially in a collaborative setting. We describe the Active Proxy-G service that is able to cache query results, use those results for answering new incoming queries, generate subqueries for the parts of a query that cannot be produced from the cache, and submit the subqueries for final processing at application servers that store the raw datasets. We present an experimental evaluation to illustrate the effects of various design tradeoffs. We also show the benefits that two real applications gain from using the middleware.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
M. Aeschlimann, P. Dinda, L. Kallivokas, J. López, B. Lowekamp, and D. O'Hallaron. Preliminary report on the design of a framework for distributed visualization. In Proceedings of the Parallel and Distributed Processing Techniques and Applications (PDPTA99), Las Vegas, NV, 1999.
 
2
A. Afework, M. D. Beynon, F. Bustamante, A. Demarzo, R. Ferreira, R. Miller, M. Silberman, J. Saltz, A. Sussman, and H. Tsang. Digital dynamic telepathology - the Virtual Microscope. In AMIA98. American Medical Informatics Association, November 1998. Also available as University of Maryland Technical Report CS-TR-3892 and UMIACS-TR-98-23.
3
 
4
K. Amiri, D. Petrou, G. R. Ganger, and G. A. Gibson. Dynamic function placement for data-intensive cluster computing. In Proceedings of the USENIX Annual Technical Conference, San Diego, CA, 2000.
 
5
H. Andrade, T. Kurc, U. Catalyurek, A. Sussman, and J. Saltz. Persistent caching in a multiple query optimization framework. In Proceedings of the 6th Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, Washington, DC, March 2002.
 
6
H. Andrade, T. Kurc, A. Sussman, E. Borovikov, and J. Saltz. On cache replacement policies for servicing mixed data intensive query workloads. In Proceedings of the 2nd Workshop on Caching, Coherence, and Consistency, held in conjunction with the 16th ACM International Conference on Supercomputing, New York, NY, June 2002.
7
 
8
 
9
 
10
11
 
12
 
13
 
14
15
 
16
E. Borovikov, A. Sussman, and L. Davis. An efficient system for multi-perspective imaging and volumetric shape analysis. In Proceedings of the 2001 Workshop on Parallel and Distributed Computing in Imaging Processing, Video Processing, and Multimedia, San Francisco, CA, 2001.
 
17
 
18
Common Component Architecture Forum. http://www.cca-forum.org.
 
19
 
20
 
21
I. Foster, C. Kesselman, J. Nick, and S. Tuecke. The physiology of the grid - an open grid services architecture for distributed systems integration, 2002. Draft document available at http://www.globus.org/research/papers/ogsa.pdf.
22
 
23
Global Grid Forum. http://www.gridforum.org.
 
24
 
25
W. Johnston, J. Guojun, G. Hoo, C. Larsen, J. Lee, B. Tierney, and M. Thompson. Distributed environments for large data-objects: Broadband networks and a new view of high performance, large scale storage-based applications. In Proceedings of Internetworking'96, Nara, Japan, September 1996.
 
26
 
27
 
28
B. Plale, P. Dinda, M. Helm, G. von Laszewski, and J. McGee. Key concepts and services of a grid information service, February 2002. Draft document available at http://www.cs.indiana.edu/plale/GISggf4.pdf.
 
29
30
 
31
SRB: The Storage Resource Broker. http://www.npaci.edu/DICE/SRB/index.html.
 
32
 
33
B. Tierney, W. Johnston, J. Lee, G. Hoo, and M. Thompson. An overview of the distributed parallel storage server (DPSS). Available at http://www-didc.lbl.gov/DPSS/Overview/DPSS.handout.fm.html.
 
34
D. Wessels and K. C. Claffy. ICP and the Squid web cache. IEEE Journal on Selected Areas in Communications, 16(3):345--357, April 1998.
35
 
36


Collaborative Colleagues:
Henrique Andrade: colleagues
Tahsin Kurc: colleagues
Alan Sussman: colleagues
Joel Saltz: colleagues