| A regression-based approach to scalability prediction |
| Full text |
Pdf
(202 KB)
|
Source
|
International Conference on Supercomputing
archive
Proceedings of the 22nd annual international conference on Supercomputing
table of contents
Island of Kos, Greece
SESSION: Performance evaluation 2
table of contents
Pages 368-377
Year of Publication: 2008
ISBN:978-1-60558-158-3
|
|
Authors
|
|
Bradley J. Barnes
|
University of Georgia, Athens, GA, USA
|
|
Barry Rountree
|
University of Georgia, Athens, GA, USA
|
|
David K. Lowenthal
|
University of Georgia, Athens, GA, USA
|
|
Jaxk Reeves
|
University of Georgia, Athens, GA, USA
|
|
Bronis de Supinski
|
Lawrence Livermore National Laboratory, Livermore, CA, USA
|
|
Martin Schulz
|
Lawrence Livermore National Laboratory, Livermore, CA, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 18, Downloads (12 Months): 112, Citation Count: 2
|
|
|
ABSTRACT
Many applied scientific domains are increasingly relying on large-scale parallel computation. Consequently, many large clusters now have thousands of processors. However, the ideal number of processors to use for these scientific applications varies with both the input variables and the machine under consideration, and predicting this processor count is rarely straightforward. Accurate prediction mechanisms would provide many benefits, including improving cluster efficiency and identifying system configuration or hardware issues that impede performance. We explore novel regression-based approaches to predict parallel program scalability. We use several program executions on a small subset of the processors to predict execution time on larger numbers of processors. We compare three different regression-based techniques: one based on execution time only; another that uses per-processor information only; and a third one based on the global critical path. These techniques provide accurate scaling predictions, with median prediction errors between 6.2% and 17.3% for seven applications.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
S. R. Alam and J. S. Vetter. Hierarchical model validation of symbolic performance models of scientific applications. In Euro-Par, Aug. 2006.
|
 |
2
|
Sadaf R. Alam , Jeffrey S. Vetter , Pratul K. Agarwal , Al Geist, Performance characterization of molecular dynamics techniques for biomolecular simulations, Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming, March 29-31, 2006, New York, New York, USA
[doi> 10.1145/1122971.1122983]
|
| |
3
|
D. Bailey, J. Barton, T. Lasinski, and H. Simon. The NAS parallel benchmarks. RNR-91-002, NASA Ames Research Center, Aug. 1991.
|
 |
4
|
Vasanth Balasundaram , Geoffrey Fox , Ken Kennedy , Ulrich Kremer, A static performance estimator to guide data partitioning decisions, Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming, p.213-223, April 21-24, 1991, Williamsburg, Virginia, United States
|
| |
5
|
R. Bell, A. Malony, and S. Shende. ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis. In Proceedings of the International Conference on Parallel and Distributed Computing (Euro-Par 2003), pages 17--26, Aug. 2003.
|
| |
6
|
J. Brehm, P. H. Worley, and M. Madhukar. Performance modeling for SPMD message-passing programs. Concurrency: Practice and Experience, 10(5):333--357, Apr. 1998.
|
| |
7
|
S. Browne , J. Dongarra , N. Garner , K. London , P. Mucci, A scalable cross-platform infrastructure for application performance tuning using hardware counters, Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), p.42-es, November 04-10, 2000, Dallas, Texas, United States
|
| |
8
|
|
 |
9
|
Graham Carey , Joe Schmidt , Vineet Singh , Dennis Yelton, A scalable, object-oriented finite element solver for partial differential equations on multicomputers, Proceedings of the 6th international conference on Supercomputing, p.387-396, July 19-24, 1992, Washington, D. C., United States
[doi> 10.1145/143369.143438]
|
 |
10
|
David E. Culler , Richard M. Karp , David Patterson , Abhijit Sahay , Eunice E. Santos , Klaus Erik Schauser , Ramesh Subramonian , Thorsten von Eicken, LogP: a practical model of parallel computation, Communications of the ACM, v.39 n.11, p.78-85, Nov. 1996
[doi> 10.1145/240455.240477]
|
| |
11
|
|
| |
12
|
T. R. P. for Statistical Computing. http://www.r-project.org/.
|
 |
13
|
|
| |
14
|
|
| |
15
|
E. Ipek, B. R. de Supinski, M. Schulz, and S. A. McKee. An approach to performance prediction for parallel applications. In Euro-Par, pages 196--205, Aug 2005.
|
 |
16
|
D. J. Kerbyson , H. J. Alme , A. Hoisie , F. Petrini , H. J. Wasserman , M. Gittings, Predictive performance and scalability modeling of a large-scale application, Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM), p.37-37, November 10-16, 2001, Denver, Colorado
[doi> 10.1145/582034.582071]
|
| |
17
|
|
 |
18
|
Benjamin C. Lee , David M. Brooks , Bronis R. de Supinski , Martin Schulz , Karan Singh , Sally A. McKee, Methods of inference and learning for performance modeling of parallel applications, Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, March 14-17, 2007, San Jose, California, USA
[doi> 10.1145/1229428.1229479]
|
| |
19
|
|
 |
20
|
|
| |
21
|
G. Marin and J. Mellor-Crummey. Application insight through performance modeling. In IEEE International Performance Computing and Communications Conference, Apr 2007.
|
| |
22
|
M. Müller, H. Brunst, M. Jurenz, A. Knüpfer, M. Lieber, H. Mix, and W. Nagel. Developing Scalable Applications with Vampir, VampirServer and VampirTrace. In Proceedings of the Minisymposium on Scalability and Usability of HPC Programming Tools at PARCO 2007, to appear, Sept. 2007.
|
 |
23
|
|
| |
24
|
|
| |
25
|
V. Pillet, J. Labarta, T. Cortes, and S. Girona. PARAVER: A tool to visualise and analyze parallel code. In Proceedings of WoTUG-18: Transputer and Occam Developments, volume 44 of Transputer and Occam Engineering, pages 17--31, Apr. 1995.
|
 |
26
|
|
| |
27
|
M. Schulz. Extracting critical path graphs from MPI applications. In IEEE Cluster, Sep 2005.
|
| |
28
|
|
| |
29
|
Allan Snavely , Laura Carrington , Nicole Wolter , Jesus Labarta , Rosa Badia , Avi Purkayastha, A framework for performance modeling and prediction, Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p.1-17, November 16, 2002, Baltimore, Maryland
|
 |
30
|
|
 |
31
|
|
 |
32
|
Frederick C. Wong , Richard P. Martin , Remzi H. Arpaci-Dusseau , David E. Culler, Architectural requirements and scalability of the NAS parallel benchmarks, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.41-es, November 14-19, 1999, Portland, Oregon, United States
[doi> 10.1145/331532.331573]
|
| |
33
|
|
| |
34
|
|
CITED BY 2
|
|
|
|
|
Rubing Duan , Farrukh Nadeem , Jie Wang , Yun Zhang , Radu Prodan , Thomas Fahringer, A Hybrid Intelligent Method for Performance Modeling and Prediction of Workflow Activities in Grids, Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, p.339-347, May 18-21, 2009
|
|