Using MPI file caching to improve parallel write performance for large-scale scientific applications
Source
Conference on High Performance Networking and Computing
Proceedings of the 2007 ACM/IEEE conference on Supercomputing - Volume 00
Reno, Nevada
SESSION: System performance
Article No. 8  
Year of Publication: 2007
ISBN: 978-1-59593-764-3
Authors
Wei-keng Liao  Northwestern University, Evanston, Illinois
Avery Ching  Northwestern University, Evanston, Illinois
Kenin Coloma  Northwestern University, Evanston, Illinois
Arifa Nisar  Northwestern University, Evanston, Illinois
Alok Choudhary  Northwestern University, Evanston, Illinois
Jacqueline Chen  Sandia National Laboratories, Livermore, California
Ramanan Sankaran  Oak Ridge National Laboratory, Oak Ridge, Tennessee
Scott Klasky  Oak Ridge National Laboratory, Oak Ridge, Tennessee
Sponsors
IEEE-CS\DATC : IEEE Computer Society
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1362622.1362634

ABSTRACT

Typical large-scale scientific applications periodically write checkpoint files to save the computational state throughout execution. Existing parallel file systems improve such write-only I/O patterns through the use of client-side file caching and write-behind strategies. In distributed environments where files are rarely accessed by more than one client concurrently, file caching has achieved significant success; however, in parallel applications where multiple clients manipulate a shared file, cache coherence control can serialize I/O. We have designed a thread-based caching layer for the MPI I/O library, which adds a portable caching system closer to user applications so that more information about the application's I/O patterns is available for better coherence control. We demonstrate the impact of our caching solution on parallel write performance with a comprehensive evaluation that includes a set of widely used I/O benchmarks and production application I/O kernels.


