ACM Home Page
Please provide us with feedback. Feedback
A framework for end-to-end simulation of high-performance computing systems
Full text PdfPdf (252 KB)
Source International Conference on Simulation Tools and Techniques for Commuications, Networks and Systems & Workshops archive
Proceedings of the 1st international conference on Simulation tools and techniques for communications, networks and systems & workshops table of contents
Marseille, France
SESSION: Technical program table of contents
Article No. 21  
Year of Publication: 2008
ISBN:978-963-9799-20-2
Authors
Wolfgang E. Denzel  IBM Zurich Research Laboratory, Rüschlikon, Switzerland
Jian Li  IBM Austin Research Laboratory, Austin, TX
Peter Walker  Open Grid Computing, Inc., Austin, TX
Yuho Jin  Texas A&M University, College Station, TX
Sponsors
: ICST
: INRIA
Publisher
Bibliometrics
Downloads (6 Weeks): 37,   Downloads (12 Months): 92,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

We present an end-to-end simulation framework that is capable of simulating High-Performance Computing (HPC) systems with hundreds of thousands of interconnected processors. The tool applies discrete event simulation and is driven by real-world application traces. We refer to it as MARS (MPI Application Replay network Simulator). It maintains reasonable simulation details of both the processors in general and specifically the interconnection network. Among other things, it features several network topologies, flexible routing schemes, arbitrary application task placement, point-to-point statistics collection, and data visualization. With a few case studies, we demonstrate the usefulness of this tool for assisting high-level system design as well as for performance projection and application tuning of future HPC systems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Varga, A. The OMNeT++ discrete event simulation system. In Proceedings of the European Simulation Multiconference (ESM' 01), Prague, Czech Republic, June 2001.
4
 
5
Vetter, J. S., Bhatia, N., Grobelny, E. M., Roth, P. C. Capturing petascale application characteristics with the Sequoia toolkit. In Proceedings of the International Parallel Computing Conference (ParCo '05), Malaga, Spain, 2005.
 
6
Sekercioglu, Y. A., Varga, A., Egan, G. K. Parallel simulation made easy with OMNeT++. In Proceedings of the European Simulation Symposium (ESS '03), Delft, The Netherlands, Oct. 2003.
 
7
Varga, A., Sekercioglu, Y. A., Egan, G. K. A practical efficiency criterion for the null message algorithm. In Proceedings of the European Simulation Symposium (ESS '03), Delft, The Netherlands, Oct. 2003.
 
8
Adiga, N. R., Blumrich, M. A., Chen, D., Coteus, P., Gara, A., Giampapa, M. E., Heidelberger, P., Singh, S., Steinmacher-Burow, B. D., Takken, T., Tsao, M., Vranas, P. Blue Gene/L torus interconnection network. IBM Journal of Research and Development, Vol. 49, No. 2/3, March/May 2005, pp. 265--276.
 
9
 
10
Badia, R. M., Labarta, J., Gimenez, J., Escale, F. Dimemas: Predicting MPI applications behavior in grid environments. In Proceedings of the Workshop on Grid Applications and Programming Tools (GGF '03), 2003.
11


Collaborative Colleagues:
Wolfgang E. Denzel: colleagues
Jian Li: colleagues
Peter Walker: colleagues
Yuho Jin: colleagues