|
ABSTRACT
Collective I/O, such as that provided in MPI-IO, enables process collaboration among a group of processes for greater I/O parallelism. Its implementation involves file domain partitioning, and having the right partitioning is a key to achieving high-performance I/O. As modern parallel file systems maintain data consistency by adopting a distributed file locking mechanism to avoid centralized lock management, different locking protocols can have significant impact to the degree of parallelism of a given file domain partitioning method. In this paper, we propose dynamic file partitioning methods that adapt according to the underlying locking protocols in the parallel file systems and evaluate the performance of four partitioning methods under two locking protocols. By running multiple I/O benchmarks, our experiments demonstrate that no single partitioning guarantees the best performance. Using MPI-IO as an implementation platform, we provide guidelines to select the most appropriate partitioning methods for various I/O patterns and file systems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
H. Shan and J. Shalf, "Using IOR to Analyze the I/O Performance of XT3," in the Cray User Group Conference, May 2007.
|
| |
2
|
Message Passing Interface Forum, MPI-2: Extensions to the Message Passing Interface, Jul. 1997, http://www.mpi-forum.org/docs/docs.html.
|
| |
3
|
J. del Rosario, R. Brodawekar, and A. Choudhary, "Improved Parallel I/O via a Two-Phase Run-time Access Strategy," in the Workshop on I/O in Parallel Computer Systems at IPPS '93, Apr. 1993, pp. 56--70.
|
 |
4
|
|
| |
5
|
R. Thakur, W. Gropp, and E. Lusk, Users Guide for ROMIO: A High-Performance, Portable MPI-IO Implementation, Technical Report ANL/MCS-TM-234, Mathematics and Computer Science Division, Argonne National Laboratory, Oct. 1997.
|
| |
6
|
J. Squyres, A. Lumsdaine, W. George, J. Hagedorn, and J. Devaney, "The interoperable message passing interface (IMPI) extensions to LAM/MPI," in Proceedings, MPI Developers Conference (MPIDC), March 2000.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
W. Liao, A. Ching, K. Coloma, A. Choudhary, and L. Ward, "An Implementation and Evaluation of Client-Side File Caching for MPI-IO," in the International Parallel and Distributed Processing Symposium, Mar. 2007.
|
| |
12
|
IEEE/ANSI Std. 1003.1, Portable Operating System Interface (POSIX)-Part 1: System Application Program Interface (API) {C Language}, 1996.
|
 |
13
|
Jean-Pierre Prost , Richard Treumann , Richard Hedges , Bin Jia , Alice Koniges, MPI-IO/GPFS, an optimized implementation of MPI-IO on top of GPFS, Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM), p.17-17, November 10-16, 2001, Denver, Colorado
[doi> 10.1145/582034.582051]
|
| |
14
|
|
| |
15
|
Lustre: A Scalable, High-Performance File System, Whitepaper, Cluster File Systems, Inc., 2003.
|
| |
16
|
|
 |
17
|
K. E. Seamons , Y. Chen , P. Jones , J. Jozwiak , M. Winslett, Server-directed collective I/O in Panda, Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM), p.57-es, December 04-08, 1995, San Diego, California, United States
[doi> 10.1145/224170.224371]
|
| |
18
|
K. Coloma, A. Choudhary, W. Liao, W. Lee, E. Russell, and N. Pundit, "Scalable High-level Caching for Parallel I/O," in the International Parallel and Distributed Processing Symposium, Apr. 2004.
|
| |
19
|
K. Coloma, A. Ching, A. Choudhary, W. Liao, R. Ross, R. Thakur, and L. Ward, "A new flexible MPI collective I/O implementation," in the IEEE Conference on Cluster Computing, Sep. 2006.
|
| |
20
|
|
| |
21
|
K. Coloma, A. Choudhary, W. Liao, W. Lee, and S. Tideman, "DAChe: Direct Access Cache System for Parallel I/O," in the 20th International Supercomputer Conference, Jun. 2005.
|
| |
22
|
H. Yu, R. Sahoo, C. Howson, G. Almasi, J. Castanos, M. Gupta, J. Moreira, J. Parker, T. Engelsiepen, R. Ross, R. Thakur, R. Latham, and W. D. Gropp, "High Performance File I/O for the BlueGene/L Supercomputer," in the 12th International Symposium on High-Performance Computer Architecture (HPCA-12), Feb. 2006.
|
| |
23
|
P. Wong and R. der Wijngaart, "NAS Parallel Benchmarks I/O Version 2.4," NASA Ames Research Center, Moffet Field, CA, Tech. Rep. NAS-03-002, Jan. 2003.
|
| |
24
|
M. Zingale, "FLASH I/O Benchmark Routine - Parallel HDF 5," Mar. 2001, http://flash.uchicago.edu/~zingale/flash_benchmark_io.
|
| |
25
|
B. Fryxell, K. Olson, P. Ricker, F. X. Timmes, M. Zingale, D. Q. Lamb, P. MacNeice, R. Rosner, and H. Tufo, "FLASH: An Adaptive Mesh Hydrodynamics Code for Modelling Astrophysical Thermonuclear Flashes," Astrophysical Journal Suppliment, pp. 131--273, 2000.
|
| |
26
|
HDF Group, Hierarchical Data Format, Version 5, The National Center for Supercomputing Applications, http://hdf.ncsa.uiuc.edu/HDF5.
|
| |
27
|
R. Sankaran, E. Hawkes, J. Chen, T. Lu, and C. Law, "Direct Numerical Simulations of Turbulent Lean Premixed Combustion," Journal of Physics: conference series, vol. 46, pp. 38--42, 2006.
|
|