| Asynchronous parallel disk sorting |
| Full text |
Pdf
(270 KB)
|
| Source
|
ACM Symposium on Parallel Algorithms and Architectures
archive
Proceedings of the fifteenth annual ACM symposium on Parallel algorithms and architectures
table of contents
San Diego, California, USA
SESSION: Algorithms II
table of contents
Pages: 138 - 148
Year of Publication: 2003
ISBN:1-58113-661-7
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 50, Citation Count: 5
|
|
|
ABSTRACT
We develop an algorithm for parallel disk sorting, whose I/O cost approaches the lower bound and that guarantees almost perfect overlap between I/O and computation. Previous algorithms have either suboptimal I/O volume or cannot guarantee that I/O and computations can always be overlapped. We give an efficient implementation that can (at least) compete with the best practical implementations but gives additional performance guarantees. For the experiments we have configured a state of the art machine that can sustain full bandwidth I/O with eight disks and is very cost effective.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
A. Aggarwal and J. S. Vitter. The input/output complexity of sorting and related problems.
|
 |
3
|
Susanne Albers , Naveen Garg , Stefano Leonardi, Minimizing stall time in single and parallel disk systems, Proceedings of the thirtieth annual ACM symposium on Theory of computing, p.454-462, May 24-26, 1998, Dallas, Texas, United States
[doi> 10.1145/276698.276858]
|
| |
4
|
|
| |
5
|
|
 |
6
|
|
 |
7
|
Pei Cao , Edward W. Felten , Anna R. Karlin , Kai Li, Implementation and performance of integrated application-controlled file caching, prefetching, and disk scheduling, ACM Transactions on Computer Systems (TOCS), v.14 n.4, p.311-343, Nov. 1996
[doi> 10.1145/235543.235544]
|
| |
8
|
|
 |
9
|
|
| |
10
|
A. Crauser and K. Mehlhorn. LEDA-SM a platform for secondary memory computations. Technical report, MPII, 2000. draft.
|
| |
11
|
R. Dementiev and P. Sanders. Asynchronous parallel disk sorting. Technical Report MPI-I-2003-1-001, MPI Informatik, Germany, 2003.
|
| |
12
|
|
 |
13
|
|
| |
14
|
|
| |
15
|
|
 |
16
|
|
 |
17
|
|
 |
18
|
|
 |
19
|
Chris Nyberg , Tom Barclay , Zarka Cvetanovic , Jim Gray , Dave Lomet, AlphaSort: a RISC machine sort, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.233-242, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
20
|
C. Nyberg, C. Koester, and J. Gray. Nsort: A parallel sorting program for NUMA and SMP machines, 2000. http://www.ordinal.com/lit.html.
|
| |
21
|
|
 |
22
|
|
 |
23
|
|
| |
24
|
Peter Sanders , Sebastian Egner , Jan Korst, Fast concurrent access to parallel disks, Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms, p.849-858, January 09-11, 2000, San Francisco, California, United States
|
| |
25
|
Jochen Van den Bercken , Björn Blohsfeld , Jens-Peter Dittrich , Jürgen Krämer , Tobias Schäfer , Martin Schneider , Bernhard Seeger, XXL - A Library Approach to Supporting Efficient Implementations of Advanced Database Queries, Proceedings of the 27th International Conference on Very Large Data Bases, p.39-48, September 11-14, 2001
|
 |
26
|
Jochen van den Bercken , Jens-Peter Dittrich , Bernhard Seeger, javax.XXL: a prototype for a library of query processing algorithms, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.588, May 15-18, 2000, Dallas, Texas, United States
|
| |
27
|
D. E. Vengroff. TPIE User Manual and Reference. Duke University, 1995. http://www.cs.duke.edu/ dev/tpie_home_page.html.
|
| |
28
|
|
| |
29
|
J. S. Vitter and E. A. M. Shriver. Algorithms for parallel memory, I: Two level memories. Algorithmica, 12(2/3):110--147, 1994.
|
| |
30
|
J. Wyllie. SPsort: How to sort a terabyte quickly.
|
INDEX TERMS
Primary Classification:
D.
Software
D.4
OPERATING SYSTEMS
D.4.2
Storage Management
Subjects:
Secondary storage
Additional Classification:
F.
Theory of Computation
F.2
ANALYSIS OF ALGORITHMS AND PROBLEM COMPLEXITY
F.2.2
Nonnumerical Algorithms and Problems
Subjects:
Sorting and searching
General Terms:
Algorithms,
Performance,
Theory
Keywords:
algorithm engineering,
algorithm library,
external memory sorting,
large data sets,
overlapping I/O and computation,
parallel disks,
prefetching,
randomized algorithm,
secondary memory
|