ACM Home Page
Please provide us with feedback. Feedback
Dynamically adapting file domain partitioning methods for collective I/O based on underlying parallel file system locking protocols
Full text PdfPdf (219 KB)
Source Conference on High Performance Networking and Computing archive
Proceedings of the 2008 ACM/IEEE conference on Supercomputing - Volume 00 table of contents
Austin, Texas
SECTION: Papers table of contents
Article No. 3  
Year of Publication: 2008
ISBN:978-1-4244-2835-9
Authors
Wei-keng Liao  Northwestern University, Evanston, Illinois
Alok Choudhary  Northwestern University, Evanston, Illinois
Publisher
IEEE Press  Piscataway, NJ, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 184,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Collective I/O, such as that provided in MPI-IO, enables process collaboration among a group of processes for greater I/O parallelism. Its implementation involves file domain partitioning, and having the right partitioning is a key to achieving high-performance I/O. As modern parallel file systems maintain data consistency by adopting a distributed file locking mechanism to avoid centralized lock management, different locking protocols can have significant impact to the degree of parallelism of a given file domain partitioning method. In this paper, we propose dynamic file partitioning methods that adapt according to the underlying locking protocols in the parallel file systems and evaluate the performance of four partitioning methods under two locking protocols. By running multiple I/O benchmarks, our experiments demonstrate that no single partitioning guarantees the best performance. Using MPI-IO as an implementation platform, we provide guidelines to select the most appropriate partitioning methods for various I/O patterns and file systems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
H. Shan and J. Shalf, "Using IOR to Analyze the I/O Performance of XT3," in the Cray User Group Conference, May 2007.
 
2
Message Passing Interface Forum, MPI-2: Extensions to the Message Passing Interface, Jul. 1997, http://www.mpi-forum.org/docs/docs.html.
 
3
J. del Rosario, R. Brodawekar, and A. Choudhary, "Improved Parallel I/O via a Two-Phase Run-time Access Strategy," in the Workshop on I/O in Parallel Computer Systems at IPPS '93, Apr. 1993, pp. 56--70.
4
 
5
R. Thakur, W. Gropp, and E. Lusk, Users Guide for ROMIO: A High-Performance, Portable MPI-IO Implementation, Technical Report ANL/MCS-TM-234, Mathematics and Computer Science Division, Argonne National Laboratory, Oct. 1997.
 
6
J. Squyres, A. Lumsdaine, W. George, J. Hagedorn, and J. Devaney, "The interoperable message passing interface (IMPI) extensions to LAM/MPI," in Proceedings, MPI Developers Conference (MPIDC), March 2000.
 
7
 
8
 
9
 
10
 
11
W. Liao, A. Ching, K. Coloma, A. Choudhary, and L. Ward, "An Implementation and Evaluation of Client-Side File Caching for MPI-IO," in the International Parallel and Distributed Processing Symposium, Mar. 2007.
 
12
IEEE/ANSI Std. 1003.1, Portable Operating System Interface (POSIX)-Part 1: System Application Program Interface (API) {C Language}, 1996.
13
 
14
 
15
Lustre: A Scalable, High-Performance File System, Whitepaper, Cluster File Systems, Inc., 2003.
 
16
17
 
18
K. Coloma, A. Choudhary, W. Liao, W. Lee, E. Russell, and N. Pundit, "Scalable High-level Caching for Parallel I/O," in the International Parallel and Distributed Processing Symposium, Apr. 2004.
 
19
K. Coloma, A. Ching, A. Choudhary, W. Liao, R. Ross, R. Thakur, and L. Ward, "A new flexible MPI collective I/O implementation," in the IEEE Conference on Cluster Computing, Sep. 2006.
 
20
 
21
K. Coloma, A. Choudhary, W. Liao, W. Lee, and S. Tideman, "DAChe: Direct Access Cache System for Parallel I/O," in the 20th International Supercomputer Conference, Jun. 2005.
 
22
H. Yu, R. Sahoo, C. Howson, G. Almasi, J. Castanos, M. Gupta, J. Moreira, J. Parker, T. Engelsiepen, R. Ross, R. Thakur, R. Latham, and W. D. Gropp, "High Performance File I/O for the BlueGene/L Supercomputer," in the 12th International Symposium on High-Performance Computer Architecture (HPCA-12), Feb. 2006.
 
23
P. Wong and R. der Wijngaart, "NAS Parallel Benchmarks I/O Version 2.4," NASA Ames Research Center, Moffet Field, CA, Tech. Rep. NAS-03-002, Jan. 2003.
 
24
M. Zingale, "FLASH I/O Benchmark Routine - Parallel HDF 5," Mar. 2001, http://flash.uchicago.edu/~zingale/flash_benchmark_io.
 
25
B. Fryxell, K. Olson, P. Ricker, F. X. Timmes, M. Zingale, D. Q. Lamb, P. MacNeice, R. Rosner, and H. Tufo, "FLASH: An Adaptive Mesh Hydrodynamics Code for Modelling Astrophysical Thermonuclear Flashes," Astrophysical Journal Suppliment, pp. 131--273, 2000.
 
26
HDF Group, Hierarchical Data Format, Version 5, The National Center for Supercomputing Applications, http://hdf.ncsa.uiuc.edu/HDF5.
 
27
R. Sankaran, E. Hawkes, J. Chen, T. Lu, and C. Law, "Direct Numerical Simulations of Turbulent Lean Premixed Combustion," Journal of Physics: conference series, vol. 46, pp. 38--42, 2006.

Collaborative Colleagues:
Wei-keng Liao: colleagues
Alok Choudhary: colleagues