ABSTRACT
Efficient I/O implementations can have a significant impact on the performance of parallel applications. This paper describes the design and implementation of PIOSIM, a parallel simulation library for MPI-IO programs. The simulator can be used to predict the performance of existing MPI-IO programs as a function of architectural characteristics, caching algorithms, and alternative implementations of collective I/O operations. The paper also presents the results of a number of performance studies that evaluate the impact of these factors on a set of MPI-IO benchmarks, including programs from the NAS benchmark suite.