ACM Home Page
Please provide us with feedback. Feedback
The impact of paravirtualized memory hierarchy on linear algebra computational kernels and software
Full text PdfPdf (680 KB)
Source
High Performance Distributed Computing archive
Proceedings of the 17th international symposium on High performance distributed computing table of contents
Boston, MA, USA
SESSION: Virtual machines table of contents
Pages 141-152  
Year of Publication: 2008
ISBN:978-1-59593-997-5
Authors
Lamia Youseff  University of California, Santa Barbara, Santa Barbara, CA, USA
Keith Seymour  University of Tennessee, Knoxville, TN, USA
Haihang You  University of Tennessee, Knoxville, TN, USA
Jack Dongarra  University of Tennessee, Knoxville, TN, USA
Rich Wolski  University of California, Santa Barbara, Santa Barbara, CA, USA
Sponsors
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 167,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1383422.1383440
What is a DOI?

ABSTRACT

Previous studies have revealed that paravirtualization imposes minimal performance overhead on High Performance Computing (HPC) workloads, while exposing numerous benefits for this field. In this study, we are investigating the memory hierarchy characteristics of paravirtualized systems and their impact on automatically-tuned software systems. We are presenting an accurate characterization of memory attributes using hardware counters and user-process accounting. For that, we examine the proficiency of ATLAS, a quintessential example of an autotuning software system, in tuning the BLAS library routines for paravirtualized systems. In addition, we examine the effects of paravirtualization on the performance boundary. Our results show that the combination of ATLAS and Xen paravirtualization delivers native execution performance and nearly identical memory hierarchy performance profiles. Our research thus exposes new benefits to memory-intensive applications arising from the ability to slim down the guest OS without influencing the system performance. In addition, our findings support a novel and very attractive deployment scenario for computational science and engineering codes on virtual clusters and computational clouds.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick. The Landscape of Parallel Computing Research: A View from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley, Dec 2006.
 
5
G. Back and D. S. Nikolopoulos. Application-Specific Customization on Many-Core Platforms: The VT-ASOS Framework. In Proceedings of the Second Workshop on Software and Tools for Multi-Core Systems, March 2007.
6
7
 
8
 
9
J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, C. Whaley, and K. Yelick. Self Adapting Linear Algebra Algorithms and Software. Proceedings of the IEEE, 93(2), 2005. Special Issue on "Program Generation, Optimization, and Adaptation".
10
 
11
 
12
M. Frigo and S. G. Johnson. FFTW: An Adaptive Software Architecture for the FFT. In Proc. 1998 IEEE Intl. Conf. Acoustics Speech and Signal Processing, volume 3, pages 1381--1384. IEEE, 1998.
 
13
IBM. IBM Blue Cloud. http://www-03.ibm.com/press/us/en/pressrelease/22613.wss, Nov. 2007.
 
14
15
 
16
J. Levon. Oprofile - a system profiler for linux. http://oprofile.sourceforge.net/, 2004.
17
18
 
19
T. Naughton, G. Vallee, and S. Scott. Dynamic Adaptation using Xen. In First Workshop on System-level Virtualization for High Performance Computing (HPCVirt 2007), Mar 2007.
20
21
 
22
P. Ruth, J. Rhee, D. Xu, R. Kennell, and S. Goasguen. Autonomic Live Adaptation of Virtual Computational Environments in a Multi-Domain Infrastructure. In Autonomic Computing, 2006. ICAC '06. IEEE International Conference on, pages 5--14, 2006.
 
23
R. Vuduc, J. Demmel, and K. Yelick. OSKI: A Library of Automatically Tuned Sparse Matrix Kernels. In Proc. SciDAC 2005, Journal of Physics: Conference Series, volume 16, San Francisco, CA, June 2005.
 
24
R. C. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Software and the ATLAS Project. Parallel Computing, 27(1-2):3--35, January 2001.
25
 
26
 
27
L. Youseff, R. Wolski, B. Gorda, and C. Krintz. Paravirtualization for HPC Systems. In G. Min, B. D. Martino, L. T. Yang, M. Guo, and G. Rünger, editors, ISPA Workshops, volume 4331 of Lecture Notes in Computer Science, pages 474--486. Springer, 2006.
 
28
L. Youseff, R. Wolski, and C. Krintz. Linux Kernel Specialization for Scientific Application Performance. Technical Report UCSB Technical Report 2005-29, Univ. of California, Santa Barbara, Nov 2005.


Collaborative Colleagues:
Lamia Youseff: colleagues
Keith Seymour: colleagues
Haihang You: colleagues
Jack Dongarra: colleagues
Rich Wolski: colleagues