| The impact of paravirtualized memory hierarchy on linear algebra computational kernels and software |
| Full text |
Pdf
(680 KB)
|
Source
|
High Performance Distributed Computing
archive
Proceedings of the 17th international symposium on High performance distributed computing
table of contents
Boston, MA, USA
SESSION: Virtual machines
table of contents
Pages 141-152
Year of Publication: 2008
ISBN:978-1-59593-997-5
|
|
Authors
|
|
Lamia Youseff
|
University of California, Santa Barbara, Santa Barbara, CA, USA
|
|
Keith Seymour
|
University of Tennessee, Knoxville, TN, USA
|
|
Haihang You
|
University of Tennessee, Knoxville, TN, USA
|
|
Jack Dongarra
|
University of Tennessee, Knoxville, TN, USA
|
|
Rich Wolski
|
University of California, Santa Barbara, Santa Barbara, CA, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 17, Downloads (12 Months): 167, Citation Count: 1
|
|
|
ABSTRACT
Previous studies have revealed that paravirtualization imposes minimal performance overhead on High Performance Computing (HPC) workloads, while exposing numerous benefits for this field. In this study, we are investigating the memory hierarchy characteristics of paravirtualized systems and their impact on automatically-tuned software systems. We are presenting an accurate characterization of memory attributes using hardware counters and user-process accounting. For that, we examine the proficiency of ATLAS, a quintessential example of an autotuning software system, in tuning the BLAS library routines for paravirtualized systems. In addition, we examine the effects of paravirtualization on the performance boundary. Our results show that the combination of ATLAS and Xen paravirtualization delivers native execution performance and nearly identical memory hierarchy performance profiles. Our research thus exposes new benefits to memory-intensive applications arising from the ability to slim down the guest OS without influencing the system performance. In addition, our findings support a novel and very attractive deployment scenario for computational science and engineering codes on virtual clusters and computational clouds.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
E. Anderson , Z. Bai , C. Bischof , L. S. Blackford , J. Demmel , Jack J. Dongarra , J. Du Croz , S. Hammarling , A. Greenbaum , A. McKenney , D. Sorensen, LAPACK Users' guide (third ed.), Society for Industrial and Applied Mathematics, Philadelphia, PA, 1999
|
| |
4
|
K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick. The Landscape of Parallel Computing Research: A View from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley, Dec 2006.
|
| |
5
|
G. Back and D. S. Nikolopoulos. Application-Specific Customization on Many-Core Platforms: The VT-ASOS Framework. In Proceedings of the Second Workshop on Software and Tools for Multi-Core Systems, March 2007.
|
 |
6
|
Jeff Bilmes , Krste Asanovic , Chee-Whye Chin , Jim Demmel, Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology, Proceedings of the 11th international conference on Supercomputing, p.340-347, July 07-11, 1997, Vienna, Austria
[doi> 10.1145/263580.263662]
|
 |
7
|
|
| |
8
|
Christopher Clark , Keir Fraser , Steven Hand , Jacob Gorm Hansen , Eric Jul , Christian Limpach , Ian Pratt , Andrew Warfield, Live migration of virtual machines, Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation, p.273-286, May 02-04, 2005
|
| |
9
|
J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, C. Whaley, and K. Yelick. Self Adapting Linear Algebra Algorithms and Software. Proceedings of the IEEE, 93(2), 2005. Special Issue on "Program Generation, Optimization, and Adaptation".
|
 |
10
|
|
| |
11
|
I. Foster , T. Freeman , K. Keahy , D. Scheftner , B. Sotomayer , X. Zhang, Virtual Clusters for Grid Communities, Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid, p.513-520, May 16-19, 2006
[doi> 10.1109/CCGRID.2006.108]
|
| |
12
|
M. Frigo and S. G. Johnson. FFTW: An Adaptive Software Architecture for the FFT. In Proc. 1998 IEEE Intl. Conf. Acoustics Speech and Signal Processing, volume 3, pages 1381--1384. IEEE, 1998.
|
| |
13
|
IBM. IBM Blue Cloud. http://www-03.ibm.com/press/us/en/pressrelease/22613.wss, Nov. 2007.
|
| |
14
|
|
 |
15
|
|
| |
16
|
J. Levon. Oprofile - a system profiler for linux. http://oprofile.sourceforge.net/, 2004.
|
 |
17
|
|
 |
18
|
|
| |
19
|
T. Naughton, G. Vallee, and S. Scott. Dynamic Adaptation using Xen. In First Workshop on System-level Virtualization for High Performance Computing (HPCVirt 2007), Mar 2007.
|
 |
20
|
Paul Barham , Boris Dragovic , Keir Fraser , Steven Hand , Tim Harris , Alex Ho , Rolf Neugebauer , Ian Pratt , Andrew Warfield, Xen and the art of virtualization, Proceedings of the nineteenth ACM symposium on Operating systems principles, October 19-22, 2003, Bolton Landing, NY, USA
|
 |
21
|
|
| |
22
|
P. Ruth, J. Rhee, D. Xu, R. Kennell, and S. Goasguen. Autonomic Live Adaptation of Virtual Computational Environments in a Multi-Domain Infrastructure. In Autonomic Computing, 2006. ICAC '06. IEEE International Conference on, pages 5--14, 2006.
|
| |
23
|
R. Vuduc, J. Demmel, and K. Yelick. OSKI: A Library of Automatically Tuned Sparse Matrix Kernels. In Proc. SciDAC 2005, Journal of Physics: Conference Series, volume 16, San Francisco, CA, June 2005.
|
| |
24
|
R. C. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Software and the ATLAS Project. Parallel Computing, 27(1-2):3--35, January 2001.
|
 |
25
|
|
| |
26
|
|
| |
27
|
L. Youseff, R. Wolski, B. Gorda, and C. Krintz. Paravirtualization for HPC Systems. In G. Min, B. D. Martino, L. T. Yang, M. Guo, and G. Rünger, editors, ISPA Workshops, volume 4331 of Lecture Notes in Computer Science, pages 474--486. Springer, 2006.
|
| |
28
|
L. Youseff, R. Wolski, and C. Krintz. Linux Kernel Specialization for Scientific Application Performance. Technical Report UCSB Technical Report 2005-29, Univ. of California, Santa Barbara, Nov 2005.
|
CITED BY
|
|
Daniel Nurmi , Rich Wolski , Chris Grzegorczyk , Graziano Obertelli , Sunil Soman , Lamia Youseff , Dmitrii Zagorodnov, The Eucalyptus Open-Source Cloud-Computing System, Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, p.124-131, May 18-21, 2009
|
|