A framework for end-to-end simulation of high-performance computing systems
Source
International Conference on Simulation Tools and Techniques for Communications, Networks and Systems & Workshops
Proceedings of the 1st international conference on Simulation tools and techniques for communications, networks and systems & workshops
Marseille, France
SESSION: Technical program
Article No. 21
Year of Publication: 2008
ISBN: 978-963-9799-20-2
Authors
Wolfgang E. Denzel, IBM Zurich Research Laboratory, Rüschlikon, Switzerland
Jian Li, IBM Austin Research Laboratory, Austin, TX
Peter Walker, Open Grid Computing, Inc., Austin, TX
Yuho Jin, Texas A&M University, College Station, TX
Bibliometrics
Downloads (6 Weeks): 37, Downloads (12 Months): 92, Citation Count: 4
ABSTRACT
We present an end-to-end simulation framework that is capable of simulating High-Performance Computing (HPC) systems with hundreds of thousands of interconnected processors. The tool applies discrete event simulation and is driven by real-world application traces. We refer to it as MARS (MPI Application Replay network Simulator). It maintains reasonable simulation details of both the processors in general and specifically the interconnection network. Among other things, it features several network topologies, flexible routing schemes, arbitrary application task placement, point-to-point statistics collection, and data visualization. With a few case studies, we demonstrate the usefulness of this tool for assisting high-level system design as well as for performance projection and application tuning of future HPC systems.
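To make the idea of trace-driven discrete event simulation concrete, the following is a minimal sketch and not the MARS implementation. It assumes a hypothetical trace format of `("compute", seconds)`, `("send", dst, nbytes)` and `("recv", src)` records per rank, non-blocking sends, blocking receives, and a simple latency-plus-bandwidth network cost with no topology, routing, or contention. The function name `replay` and its parameters are illustrative only.

```python
import heapq
from collections import defaultdict, deque
from itertools import count


def replay(traces, latency=1e-6, bandwidth=1e9):
    """Replay per-rank traces and return the finish time of each rank.

    Assumed (hypothetical) record formats:
      ("compute", seconds), ("send", dst, nbytes), ("recv", src)
    Network cost is modeled as latency + nbytes / bandwidth.
    """
    n = len(traces)
    pc = [0] * n                  # index of the next record for each rank
    finish = [0.0] * n            # completion time of each rank
    arrived = defaultdict(deque)  # (src, dst) -> arrival times not yet received
    waiting = {}                  # dst rank -> src rank it is blocked on
    seq = count()                 # tie-breaker so heap entries stay comparable
    events = [(0.0, next(seq), "resume", r) for r in range(n)]
    heapq.heapify(events)

    def schedule(t, kind, payload):
        heapq.heappush(events, (t, next(seq), kind, payload))

    while events:
        now, _, kind, payload = heapq.heappop(events)
        if kind == "arrive":
            src, dst = payload
            if waiting.get(dst) == src:
                # The receiver was blocked on this message: wake it up.
                del waiting[dst]
                pc[dst] += 1
                schedule(now, "resume", dst)
            else:
                arrived[(src, dst)].append(now)
            continue

        rank = payload
        while pc[rank] < len(traces[rank]):
            rec = traces[rank][pc[rank]]
            if rec[0] == "compute":
                pc[rank] += 1
                schedule(now + rec[1], "resume", rank)
                break
            elif rec[0] == "send":
                _, dst, nbytes = rec
                schedule(now + latency + nbytes / bandwidth, "arrive", (rank, dst))
                pc[rank] += 1
            elif rec[0] == "recv":
                src = rec[1]
                if arrived[(src, rank)]:
                    arrived[(src, rank)].popleft()
                    pc[rank] += 1
                else:
                    waiting[rank] = src  # block until the message arrives
                    break
            else:
                raise ValueError(f"unknown trace record: {rec!r}")
        else:
            finish[rank] = now  # trace exhausted without blocking
    return finish
```

For example, `replay([[("compute", 1.0), ("send", 1, 1000)], [("recv", 0), ("compute", 0.5)]], latency=0.5, bandwidth=1000.0)` replays a two-rank exchange in which rank 1 cannot start computing until rank 0's message has crossed the modeled network. A simulator at the scale the abstract describes would replace the single cost formula with a detailed interconnect model.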
CITED BY 4
Jian Li, Lixin Zhang, Charles Lefurgy, Richard Treumann, Wolfgang E. Denzel, Thrifty interconnection network for HPC systems, Proceedings of the 23rd international conference on Supercomputing, June 08-12, 2009, Yorktown Heights, NY, USA
S. D. Hammond, G. R. Mudalige, J. A. Smith, S. A. Jarvis, J. A. Herdman, A. Vadgama, WARPP: a toolkit for simulating high-performance parallel scientific codes, Proceedings of the 2nd International Conference on Simulation Tools and Techniques, March 02-06, 2009, Rome, Italy