ACM Home Page
Please provide us with feedback. Feedback
Automated, scalable debugging of MPI programs with Intel® Message Checker
Full text PdfPdf (240 KB)
Source International Conference on Software Engineering archive
Proceedings of the second international workshop on Software engineering for high performance computing system applications table of contents
St. Louis, Missouri
WORKSHOP SESSION: Verification table of contents
Pages: 78 - 82  
Year of Publication: 2005
ISBN:1-59593-117-1
Authors
Jayant DeSouza  Intel Corporation, Champaign, IL
Bob Kuhn  Intel Corporation, Champaign, IL
Bronis R. de Supinski  Lawrence Livermore National Lab, Livermore, CA
Victor Samofalov  Intel Russia
Sergey Zheltov  Intel Russia
Stanislav Bratanov  Intel Russia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 55,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1145319.1145342
What is a DOI?

ABSTRACT

The trend towards many-core multi-processor systems and clusters will make systems with tens and hundreds of processors more widely available. Current manual debugging techniques do not scale well to such large systems. Advanced automated debugging tools are needed for standard programming models based on commodity computing, such as threads and MPI. We surveyed MPI users to identify the kinds of MPI errors that they encounter, and classify the errors into several types. We describe how automated tools can detect such errors and present the Intel® Message Checker (IMC) technology being developed at the Intel Advanced Computing Center. IMC's unique technology automatically detects several kinds of MPI errors such as various types of mismatches, race conditions, deadlocks and potential deadlocks, and resource misuse. Finally, we review the usability and uniqueness of IMC and discuss our future plans.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Bettina Krammer, Matthias S. Müller, Michael M. Resch. MPI I/O Analysis and Error Detection with MARMOT. Proceedings of EuroPVM/MPI 2004, Budapest, Hungary, September 19-22, 2004. Published in Lecture Notes in Computer Science Vol. 3241, pp. 242--250, Springer, 2004.
 
4
Luecke, G., Chen, H., Coyle, J., Hoekstra, J., Kraeva, M., and Zou, Y. MPI-CHECK: a Tool for Checking Fortran 90 MPI Programs. Concurrency and Computation: Practice and Experience. 2003, vol. 15, pp 93--100.

CITED BY  6

Collaborative Colleagues:
Jayant DeSouza: colleagues
Bob Kuhn: colleagues
Bronis R. de Supinski: colleagues
Victor Samofalov: colleagues
Sergey Zheltov: colleagues
Stanislav Bratanov: colleagues