ACM Home Page
Please provide us with feedback. Feedback
An Experimental Evaluation of I/O Optimizations on Different Applications
Full text Publisher SitePublisher Site
Source IEEE Transactions on Parallel and Distributed Systems archive
Volume 13 ,  Issue 7  (July 2002) table of contents
Pages: 728 - 744  
Year of Publication: 2002
ISSN:1045-9219
Authors
Publisher
IEEE Press  Piscataway, NJ, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   review   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.1109/TPDS.2002.1019861

ABSTRACT

Many large scale applications have significant I/O requirements as well as computational and memory requirements. Unfortunately, the limited number of I/O nodes provided in a typical configuration of the modern message-passing distributed-memory architectures such as Intel Paragon and IBM SP-2 limits the I/O performance of these applications severely. In this paper, we examine some software optimization techniques and evaluate their effects in five different I/O-intensive codes from both small and large application domains. Our goals in this study are twofold. First, we want to understand the behavior of large-scale data-intensive applications and the impact of I/O subsystems on their performance and vice versa. Second, and more importantly, we strive to determine the solutions for improving the applications' performance by a mix of software techniques. Our results reveal that different applications can benefit from different optimizations. For example, we found that some applications benefit from file layout optimizations whereas others take advantage of collective I/O. A combination of architectural and software solutions is normally needed to obtain good I/O performance. For example, we show that with a limited number of I/O resources, it is possible to obtain good performance by using appropriate software optimizations. We also show that beyond a certain level, imbalance in the architecture results in performance degradation even when using optimized software, thereby indicating the necessity of an increase in I/O resources.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
R. Bennett, K. Bryant, A. Sussman, R. Das, and J. Saltz, "Jovian: A Framework for Optimizing Parallel I/O," Proc. 1994 Scalable Parallel Libraries Conf. 1994.
3
4
 
5
 
6
P.H. Carns, W.B. Ligon Ill, R.B. Ross, and R. Thakur, "PVFS: A Parallel File System for Linux Clusters," Preprint /INL/MCS-P804-0400, submitted to the 2000 Extreme Linux Workshop April 2000.
7
 
8
A. Choudhary, R. Bordawekar, S. More, K. Sivaram, and R. Thakur, "The PASSION Runtime Library for the Intel Paragon," Proc. Intel Supercomputer User's Group Conf., June 1995.
 
9
P. Corbett, D. Feitelson, S. Fineherg, Y. Hsu, B. Nitzherg, J, Prost, M. Snir, B. Traversal, and P. Wong, "Overview of the MPI-10 Parallel I/O Interface," Pro. Third Workshop I/O in Parallel unit Distributed Systems, Apr. 1995.
10
 
11
 
12
J. Del Rosario, R. Bordawekar, and A. Choudhary, "Improved Parallel I/O via A Two-Phase Run-Time Access Strategy," Proc. 1993 IPPS Workshop Input/Output in Parallel Computer Sys tems, Apr. 1993.
 
13
14
15
 
16
17
 
18
 
19
20
 
21
 
22
 
23
24
 
25
"NWChem, A Computational Chemistry Package for Parallel Computers, Version 1. 1,"High Performance Computational Chemistry Group, Pacific Northwest Laboratory (PNL, 1995.
 
26
M, Paleczny, K. Kennedy, and C. Koelbel, "Compiler Support for Out-of-Core Arrays on Parallel Machines,," CRI'C Technical Report 94509-S. Rice Univ., Houston l ex,, Dec. 1994,
 
27
D. Reed, R. Aydt, R. Noe, P. Ruth, K. Shields, B. Schwartz, and L. Tavera, "Scalable Performance Analysis'. the Pablo Performance Analysis Environment," Proc. scalable Parallel Libraries Cool., pp. 104-113, 1993.
 
28
B. Rullman Paragon Parallel File System, External Product Specification, Intel Supercomputer Systems Division. 1996.
29
 
30
F. Smirni, C. Elford, A, Laevry, D. Reed, and A. Chien, "Algorithmic Influences on I/O Access Patterns and Parallel File System Performance," Technical Report, Table Group, Univ. of Illinois at Urbana-Champaign, 1996.
 
31
 
32
 
33
R. Thakur, W. Cropp, and E. Lusk, "A Case for Using MPI's Derived Data Types to Improve I/O Performance, Preprint, ANL/ MCS-P717-0598, Math. and Computer Science Division, Argonne Nat'l Laboratory, May 1998,
34


REVIEW

"John A. Fulcher : Reviewer"

The authors’ stated aim is twofold: first, to understand the behavior of large-scale, data intensive applications and the impact of I/O subsystems on their performance, and second, to improve the performance of such applications.

The f  more...

Collaborative Colleagues:
Meenakshi A. Kandaswamy: colleagues
Mahmut Kandemir: colleagues
Alok Choudhary: colleagues
David Bernholdt: colleagues