| Server-directed collective I/O in Panda |
| Source
|
Conference on High Performance Networking and Computing
Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM)
San Diego, California, United States
Article No. 57
Year of Publication: 1995
ISBN:0-89791-816-9
|
|
Authors
|
|
K. E. Seamons
|
Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois
|
|
Y. Chen
|
Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois
|
|
P. Jones
|
Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois
|
|
J. Jozwiak
|
Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois
|
|
M. Winslett
|
Center for Advanced Database Research, Computer Science Department, University of Illinois, Urbana, Illinois
|
|
ABSTRACT
We present the architecture and implementation results for Panda 2.0, a library for input and output of multidimensional arrays on parallel and sequential platforms. Panda achieves remarkable performance levels on the IBM SP2, showing excellent scalability as data size increases and as the number of nodes increases, and provides throughputs close to the full capacity of the AIX file system on the SP2 we used. We argue that this good performance can be traced to Panda's use of server-directed i/o (a logical-level version of disk-directed i/o [Kotz94b]) to perform array i/o using sequential disk reads and writes, a very high level interface for collective i/o requests, and built-in facilities for arbitrary rearrangements of arrays during i/o. Other advantages of Panda's approach are ease of use, easy application portability, and a reliance on commodity system software.
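The server-directed idea in the abstract can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not Panda's actual interface: the function names (`make_clients`, `server_write`, `collective_write`) are hypothetical, the in-memory layout is assumed to be a BLOCK distribution over a grid of compute nodes, the on-disk layout is assumed to be row-major with contiguous row ranges per I/O server, and all dimensions are assumed to divide evenly. Each server walks its part of the file in on-disk order and pulls the pieces it needs from the compute nodes, so every write it performs is sequential and the array is rearranged as part of the transfer.

```python
"""Toy simulation of server-directed collective I/O (illustrative only)."""


def make_clients(array, grid_rows, grid_cols):
    """Split a 2-D list into BLOCK-distributed chunks, one per compute node.

    Assumes the array dimensions divide evenly by the grid dimensions.
    """
    n_rows, n_cols = len(array), len(array[0])
    block_h, block_w = n_rows // grid_rows, n_cols // grid_cols
    clients = {}
    for gr in range(grid_rows):
        for gc in range(grid_cols):
            rows = array[gr * block_h:(gr + 1) * block_h]
            clients[(gr, gc)] = [row[gc * block_w:(gc + 1) * block_w] for row in rows]
    return clients, block_h


def server_write(server_id, n_servers, n_rows, clients, block_h, grid_cols):
    """One I/O server builds its file by requesting data in on-disk order."""
    rows_per_server = n_rows // n_servers  # assumes an even split of rows
    first = server_id * rows_per_server
    disk = []  # stands in for a file that is only ever appended to
    for r in range(first, first + rows_per_server):
        gr, local_r = divmod(r, block_h)  # which node row owns global row r
        for gc in range(grid_cols):
            # The server, not the client, decides which piece moves next.
            disk.extend(clients[(gr, gc)][local_r])
    return disk


def collective_write(array, grid_rows, grid_cols, n_servers):
    """Simulate one collective write; returns one 'file' per I/O server."""
    clients, block_h = make_clients(array, grid_rows, grid_cols)
    return [
        server_write(s, n_servers, len(array), clients, block_h, grid_cols)
        for s in range(n_servers)
    ]


if __name__ == "__main__":
    data = [[r * 4 + c for c in range(4)] for r in range(4)]
    for server_file in collective_write(data, grid_rows=2, grid_cols=2, n_servers=2):
        print(server_file)
```

In this sketch the compute nodes never choose a file offset; they only answer requests, which is what lets each server keep its output strictly sequential.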
REFERENCES
| |
Bennett94
|
R. Bennett, K. Bryant, A. Sussman, R. Das, and J. Saltz, Jovian: A framework for optimizing parallel I/O, Proceedings of the 1994 Scalable Parallel Libraries Conference, pages 10-20. IEEE Computer Society Press, October 1994.
|
 |
Bordawekar93
|
|
| |
Brezany95
|
|
| |
Corbett94b
|
P. F. Corbett and D. G. Feitelson, Vesta file system programmer's reference, Research Report RC 19898 (88058), IBM T.J. Watson Research Center, Yorktown Heights, NY 10598, October 1994. Version 1.01.
|
| |
Corbett95
|
P. Corbett, D. Feitelson, Y. Hsu, J. Prost, M. Snir, S. Fineberg, B. Nitzberg, B. Traversat, and P. Wong. MPI-IO: A Parallel File I/O Interface for MPI, Technical Report NAS-95-002, NASA Ames Research Center, January 1995.
|
| |
del Rosario94
|
J. M. del Rosario, M. Harry, and A. Choudhary, The Design of VIP-FS: A Virtual, Parallel File System for High Performance Parallel and Distributed Computing, Technical Report SCCS-628, NPAC, Syracuse, NY, May 1994.
|
 |
Galbreath93
|
|
 |
Karpovich94
|
John F. Karpovich, Andrew S. Grimshaw, James C. French, Extensible file system (ELFS): an object-oriented approach to high performance file I/O, Proceedings of the ninth annual conference on Object-oriented programming systems, language, and applications, p.191-204, October 23-28, 1994, Portland, Oregon, United States
|
| |
Kotz93b
|
|
| |
Kotz94a
|
|
| |
Kotz94b
|
D. Kotz, Disk-Directed I/O for MIMD Multiprocessors, First Symposium on Operating Systems Design and Implementation, November 1994.
|
| |
Kotz95a
|
|
| |
Kotz95b
|
|
| |
Pasquale94
|
B. Pasquale and G. Polyzos, Dynamic I/O Characterization of I/O Intensive Scientific Applications, Technical Report No. CS94-364, University of California, San Diego, April 1994.
|
| |
Pierce93
|
P. Pierce, A Concurrent File System for a Highly Parallel Mass Storage Subsystem, Proceedings of the 4th Conference on Hypercube Computers and Applications, Monterey, March 1989, pp. 155-160.
|
| |
Purakayastha94
|
|
| |
Seamons94a
|
|
| |
Seamons94b
|
|
| |
Seligman94
|
|
CITED BY 56
|
|
Jens Mache, Virginia Lo, Marilynn Livingston, Sharad Garg, The impact of spatial layout of jobs on parallel I/O performance, Proceedings of the sixth workshop on I/O in parallel and distributed systems, p.45-56, May 5, 1999, Atlanta, Georgia, United States
|
|
|
Len Wisniewski, Brad Smisloff, Nils Nieuwejaar, Sun MPI I/O: efficient I/O for parallel applications, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.14-es, November 14-19, 1999, Portland, Oregon, United States
|
|
|
Ian Foster, David Kohr, Jr., Rakesh Krishnaiyer, Jace Mogill, Remote I/O: fast access to distant storage, Proceedings of the fifth workshop on I/O in parallel and distributed systems, p.14-25, November 17, 1997, San Jose, California, United States
|
|
|
Jaechun No, Rajeev Thakur, Alok Choudhary, Integrating parallel file I/O and database support for high-performance scientific data management, Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), p.57-es, November 04-10, 2000, Dallas, Texas, United States
|
|
|
Tahsin Kurc, Chialin Chang, Renato Ferreira, Alan Sussman, Joel Saltz, Querying very large multi-dimensional datasets in ADR, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.12-es, November 14-19, 1999, Portland, Oregon, United States
|
|
|
Y. Chen, M. Winslett, Y. Cho, S. Kuo, Automatic parallel I/O performance optimization in Panda, Proceedings of the tenth annual ACM symposium on Parallel algorithms and architectures, p.108-118, June 28-July 02, 1998, Puerto Vallarta, Mexico
|
|
|
Y. Chen, M. Winslett, K. E. Seamons, S. Kuo, Y. Cho, M. Subramaniam, Scalable message passing in Panda, Proceedings of the fourth workshop on I/O in parallel and distributed systems: part of the federated computing research conference, p.109-121, May 27, 1996, Philadelphia, Pennsylvania, United States
|
|
|
Judy Sturtevant, Mark Christon, Philip D. Heermann, Pang-Chieh Chen, PDS/PIO: lightweight libraries for collective parallel I/O, Proceedings of the 1998 ACM/IEEE conference on Supercomputing (CDROM), p.1-11, November 07-13, 1998, Orlando, Florida, United States
|
|
|
S. Kuo, M. Winslett, Y. Cho, J. Lee, Y. Chen, Efficient input and output for scientific simulations, Proceedings of the sixth workshop on I/O in parallel and distributed systems, p.33-44, May 5, 1999, Atlanta, Georgia, United States
|
|
|
Y. Cho, M. Winslett, M. Subramaniam, Y. Chen, S. Kuo, K. E. Seamons, Exploiting local data in parallel array I/O on a practical network of workstations, Proceedings of the fifth workshop on I/O in parallel and distributed systems, p.1-13, November 17, 1997, San Jose, California, United States
|
|
|
Rajeev Thakur, William Gropp, Ewing Lusk, On implementing MPI-IO portably and with high performance, Proceedings of the sixth workshop on I/O in parallel and distributed systems, p.23-32, May 5, 1999, Atlanta, Georgia, United States
|
|
|
Ying Chen, Jarek Nieplocha, Ian Foster, Marianne Winslett, Optimizing collective I/O performance on parallel computers: a multisystem study, Proceedings of the 11th international conference on Supercomputing, p.28-35, July 07-11, 1997, Vienna, Austria
|
|
|
Jack Dongarra, Ian Foster, Geoffrey Fox, William Gropp, Ken Kennedy, Linda Torczon, Andy White, References, Sourcebook of parallel computing, Morgan Kaufmann Publishers Inc., San Francisco, CA, 2003
|
|
|
Tara M. Madhyastha, Garth A. Gibson, Christos Faloutsos, Informed prefetching of collective input/output requests, Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), p.13-es, November 14-19, 1999, Portland, Oregon, United States
|
|
|
Florin Isaila, Guido Malpohl, Vlad Olaru, Gabor Szeder, Walter Tichy, Integrating collective I/O and cooperative caching into the "clusterfile" parallel file system, Proceedings of the 18th annual international conference on Supercomputing, June 26-July 01, 2004, Saint-Malo, France
|
|
|
Ying Chen, Marianne Winslett, Szu-wen Kuo, Yong Cho, Mahesh Subramaniam, Kent Seamons, Performance modeling for the Panda array I/O library, Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM), p.45-es, 1996, Pittsburgh, Pennsylvania, United States
|
|
|
Jonghyun Lee, Xiaosong Ma, Marianne Winslett, Shengke Yu, Active buffering plus compressed migration: an integrated solution to parallel simulations' data transport needs, Proceedings of the 16th international conference on Supercomputing, June 22-26, 2002, New York, New York, USA
|
|
|
H. Wang, S. Parthasarathy, A. Ghoting, S. Tatikonda, G. Buehrer, T. Kurc, J. Saltz, Design of a next generation sampling service for large scale data analysis applications, Proceedings of the 19th annual international conference on Supercomputing, June 20-22, 2005, Cambridge, Massachusetts
|
|
|
Xiaosong Ma, Vincent W. Freeh, Tao Yang, Sudharshan S. Vazhkudai, Tyler A. Simon, Stephen L. Scott, Coupling prefix caching and collective downloads for remote dataset access, Proceedings of the 20th annual international conference on Supercomputing, June 28-July 01, 2006, Cairns, Queensland, Australia
|
|
|
Wei-keng Liao, Kenin Coloma, Alok Choudhary, Lee Ward, Eric Russell, Neil Pundit, Scalable Design and Implementations for MPI Parallel Overlapping I/O, IEEE Transactions on Parallel and Distributed Systems, v.17 n.11, p.1264-1276, November 2006
|
|
|
Hasan Abbasi, Matthew Wolf, Greg Eisenhauer, Scott Klasky, Karsten Schwan, Fang Zheng, DataStager: scalable data staging services for petascale applications, Proceedings of the 18th ACM international symposium on High performance distributed computing, June 11-13, 2009, Garching, Germany
|
|