|
ABSTRACT
Accurate simulation of large parallel applications can be facilitated with the use of direct execution and parallel discrete event simulation. This paper describes the use of COMPASS, a direct execution-driven, parallel simulator for performance prediction of programs that include both communication and I/O intensive applications. The simulator has been used to predict the performance of such applications on both distributed memory machines like the IBM SP and shared-memory machines like the SGI Origin 2000. The paper illustrates the usefulness of COMPASS as a versatile performance prediction tool. We use both real-world applications and synthetic benchmarks to study application scalability, sensitivity to communication latency, and the interplay between factors like communication pattern and parallel file system caching on application performance. We also show that the simulator is accurate in its predictions and that it is also efficient in its ability to use parallel simulation to reduce its own execution time which, in some cases, has yielded a nearlinear speedup.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
D. Bailey, T. Harris, W. Shaphir, R. van der Wijngaart, A. Woo, and M. Yarrow. "The NAS Parallel Benchmarks 2.0," Report NAS-95-090, NASA Ames Research Center, 1995.
|
| |
3
|
S.J. Baylor, C. Benveniste and L.J. Beolhouwer. "A Methodology for Evaluating Parallel I/O Performance for Massively Parallel Processors." In Proceedings of the 27th Annual Simulation Symposium, 1994, pp.31-40.
|
| |
4
|
|
| |
5
|
M. Chandy and J. Misra. "Distributed Simulation: A Case Study in Design And Verification Of Distributed Programs," IEEE Trans. on Software Engineering, Sept. 1979, pp.440- 452.
|
 |
6
|
|
| |
7
|
|
 |
8
|
R. C. Covington , S. Madala , V. Mehta , J. R. Jump , J. B. Sinclair, The rice parallel processing testbed, Proceedings of the 1988 ACM SIGMETRICS conference on Measurement and modeling of computer systems, p.4-11, May 24-27, 1988, Santa Fe, New Mexico, United States
|
| |
9
|
|
| |
10
|
M.H. Dahlin, R.Y. Wang, T.E. Anderson and D.A. Patterson. "Remote Client Memory to Improve File System Performance." In Proceedings of the 1994 Symposium on Operating Systems.
|
| |
11
|
H.Davis, S.R. Goldschmidt and Hennessey. "Multiprocessor Simulation and Tracing Using Tango." In Proceedings of ICPP'91, pp. 99-107, August 1991.
|
 |
12
|
Ewa Deelman , Aditya Dube , Adolfy Hoisie , Yong Luo , Richard L. Oliver , David Sundaram-Stukel , Harvey Wasserman , Vikram S. Adve , Rajive Bagrodia , James C. Browne , Elias Houstis , Olaf Lubeck , John Rice , Patricia J. Teller , Mary K. Vernon, Poems: end-to-end performance design of large parallel adaptive computational systems, Proceedings of the 1st international workshop on Software and performance, p.18-30, October 12-16, 1998, Santa Fe, New Mexico, United States
[doi> 10.1145/287318.292468]
|
| |
13
|
|
| |
14
|
J2c (Fortran to C converter), http://www.netlib.org/f2c/
|
 |
15
|
James V. Huber, Jr. , Andrew A. Chien , Christopher L. Elford , David S. Blumenthal , Daniel A. Reed, PPFS: a high performance portable parallel file system, Proceedings of the 9th international conference on Supercomputing, p.385-394, July 03-07, 1995, Barcelona, Spain
[doi> 10.1145/224538.224638]
|
 |
16
|
|
| |
17
|
|
| |
18
|
|
 |
19
|
|
| |
20
|
"ASCI Blue-Pacific IBM RS/6000 TR System at Lawrence Livermore National Laboratory," http://www.llnl.gov/asci/platforms/bluepac/tr.hwtable.html.
|
 |
21
|
|
| |
22
|
Y. Luo. "MPI Performance Study on the SGI Origin 2000," Pacific Rim Conference on Communications, Computers and Signal Processing, 1997, pp.269-272.
|
| |
23
|
S.S. Mukherjee, S.K. Reinhardt, B. Falsafi, M. Litzkow, S. Huss-Lederman, M.D. Hill, J.R. Larus, and D.A. Wood. "Wisconsin Wind Tunnel II: A Fast and Portable Parallel Architecture Simulator," Workshop on Performance Analysis and Its Impact on Design (PAID), 1997.
|
 |
24
|
|
| |
25
|
|
| |
26
|
|
| |
27
|
|
 |
28
|
Steven K. Reinhardt , Mark D. Hill , James R. Larus , Alvin R. Lebeck , James C. Lewis , David A. Wood, The Wisconsin Wind Tunnel: virtual prototyping of parallel computers, Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems, p.48-60, May 10-14, 1993, Santa Clara, California, United States
|
| |
29
|
J.M. clel Rosario, R. Bordawekar and A. Choudhary. "Improved Parallel I/O via a Two-Phase Runtime Access Strategy." In Proceedings of the IPP '93 Workshop on I/0 in Parallel Computer Systems, 1993, pp. 56-70.
|
 |
30
|
|
| |
31
|
|
| |
32
|
"The ASCI sweep3d Benchmark Code," http://www.llnl.gov/asci_benchmarks/.
|
CITED BY 14
|
|
Vikram S. Adve , Rajive Bagrodia , Ewa Deelman , Thomas Phan , Rizos Sakellariou, Compiler-supported simulation of highly scalable parallel applications, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.1-es, November 14-19, 1999, Portland, Oregon, United States
|
|
|
Ewa Deelman , Rajive Bagrodia , Rizos Sakellariou , Vikram Adve, Improving lookahead in parallel discrete event simulations of large-scale applications using compiler analysis, Proceedings of the fifteenth workshop on Parallel and distributed simulation, p.5-13, May 15-18, 2001, Lake Arrowhead, California, United States
|
|
|
|
|
|
|
|
|
Vikram S. Adve , Rajive Bagrodia , James C. Browne , Ewa Deelman , Aditya Dube , Elias N. Houstis , John R. Rice , Rizos Sakellariou , David J. Sundaram-Stukel , Patricia J. Teller , Mary K. Vernon, POEMS: End-to-End Performance Design of Large Parallel Adaptive Computational Systems, IEEE Transactions on Software Engineering, v.26 n.11, p.1027-1048, November 2000
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Célia L. O. Kawabata , Regina H. C. Santana , Marcos J. Santana , Sarita M. Bruschi , Kalinka R. L. J. Castelo Branco, Performance evaluation of a CMB protocol, Proceedings of the 37th conference on Winter simulation, December 03-06, 2006, Monterey, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|