ACM Home Page
Please provide us with feedback. Feedback
Feedback-controlled resource sharing for predictable eScience
Full text PdfPdf (430 KB)
Source Conference on High Performance Networking and Computing archive
Proceedings of the 2008 ACM/IEEE conference on Supercomputing table of contents
Austin, Texas
SECTION: Papers table of contents
Article No. 13  
Year of Publication: 2008
ISBN:978-1-4244-2835-9
Authors
Sang-Min Park  University of Virginia, Charlottesville, VA
Marty Humphrey  University of Virginia, Charlottesville, VA
Publisher
IEEE Press  Piscataway, NJ, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 92,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1413370.1413384
What is a DOI?

ABSTRACT

The emerging class of adaptive, real-time, data-driven applications are a significant problem for today's HPC systems. In general, it is extremely difficult for queuing-system-controlled HPC resources to make and guarantee a tightly-bounded prediction regarding the time at which a newly-submitted application will execute. While a reservation-based approach partially addresses the problem, it can create severe resource under-utilization (unused reservations, necessary scheduled idle slots, underutilized reservations, etc.) that resource providers are eager to avoid. In contrast, this paper presents a fundamentally different approach to guarantee predictable execution. By creating a virtualized application layer called the performance container, and opportunistically multiplexing concurrent performance containers through the application of formal feedback control theory, we regulate the job's progress such that the job meets its deadline without requiring exclusive access to resources even in the presence of a wide class of unexpected disturbances. Our evaluation using two widely-used applications, WRF and BLAST, on an 8-core server show our approach is predictable and meets deadlines with 3.4 % of errors on average while achieving high overall utilization.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
J. Michalakes, S. Chen, J. Dudhia, L. Hart, J. Klemp, J. Middleco, W. Skamarock. Development of a Next-generation Regional Weather Research and Forecast Model. ECMWF Workshop on the use of Parallel Processors in Meteorology, Reading, U.K., November 2000.
 
3
BLAST: Basic Local Alignment and Search Tool (http://www.ncbi.nlm.nih.gov/blast/)
 
4
I. Foster, C. Kesselman, C. Lee, R. Lindell, K. Nahrstedt, A. Roy. A Distributed Resource Management Architecture that Supports Advance Reservations and Co-Allocation. International Workshop on Quality of Service, 1999.
 
5
R. J. Al-ali, K. Amin, G. V. Laszewski, O. F. Lana, D. W. Walker, M. Hategan, N. Zaluzec. Analysis and Provision of QoS for Distributed Grid Applications. Journal of Grid Computing, 2004.
 
6
 
7
 
8
G. Singh, C. Kesselman, E. Deelman. Adaptive Pricing for Resource Reservations in Shared Environments. IEEE/ACM International Conference on Grid Computing, 2007.
 
9
P. Beckman, S. Nadella, N. Trebon, I. Beschastnikh. SPRUCE: A System for Supporting Urgent High-Performance Computing. Pg 295--316 in Grid-Based Problem Solving Environments by Springer Press, 2007.
 
10
 
11
Windows Server 2008 Hyper-V. http://www.microsoft.com/windowsserver2008/en/us/virtualizati on-consolidation.aspx
12
 
13
 
14
T. Freeman, K. Keahey, I. Foster, A. Rana, B. Sotomayor, F. Wuerthwein. Division of Labor: Tools for Growth and Scalability of Grids. International Conference on Service Oriented Computing, Chicago, IL. December 2006.
 
15
 
16
L. Youseff, R. Wolski, B. Gorda, C. Krintz. Paravirtualization for HPC Systems. Workshop on XEN in HPC Cluster and Grid Computing Environments (XHPC), held in conjunction with ISPA, December 2006.
17
 
18
 
19
 
20
 
21
 
22
23
 
24
B. Plale, D. Gannon, D. Reed, S. Graves, K. Droegemeier, B. Wilhelmson, M. Ramamurthy. Towards Dynamically Adaptive Weather Analysis and Forecasting in LEAD. ICCS workshop on Dynamic Data Driven Applications, Atlanta, Georgia, May 2005.
 
25
SURA Coastal Ocean Observing and Prediction (SCOOP): http://www.scoop.lsu.edu/gridsphere/gridsphere
 
26
Grid Enabled Neurosurgical Imaging Using Simulation: http://wiki.realitygrid.org/wiki/GENIUS
 
27
ShakeMovie: Caltech's Near Real Time Simulation of Southern California Seismic Events Portal. http://shakemovie.caltech.edu/
 
28
29
 
30
 
31
 
32
S-M. Park and M. Humphrey. Data Throttling for Data-Intensive Workflows. IEEE International Parallel and Distributed Processing Symposium (IPDPS 2008). April 14--18, 2008.

Collaborative Colleagues:
Sang-Min Park: colleagues
Marty Humphrey: colleagues