|
ABSTRACT
In order to obtain very fast parallel algorithms, it is almost always necessary to have some sort of load balancing procedure, so that processors which have finished their required tasks can help processors which have not. If the overloaded processors are not helped, then the expected time of the entire algorithm suffers. In general, we would like to distribute the remaining work as evenly as possible among the processors, or more formally, given at most n independent tasks distributed in an arbitrary way among n processors, we would like to redistribute the tasks so that each processor contains O(1) tasks. We show here that even on the strongest randomized CRCW PRAM model, for a simple random distribution tasks load balancing requires &OHgr;(log* n) expected time. Gil, Matias, and Vishkin [9] give an O(log* n) expected time randomized algorithm which solves the load balancing problem in the worst case, so the lower bound is tight.
By reduction we show that both Padded Sort [12], and Linear Approximate Compaction [13] require &OHgr;(log* n) expected time. We note that our basic technique is one of the few parallel lower bound techniques known which only require 0/1 inputs. We also note that the bounds given in this paper do not place any restriction on the instruction set of the machine, the amount of information which can be stored in a memory cell, or on the number of memory cells.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
D. Angluin and L. G. Valiant, "Fast probabilistic algorithms for hamiltonian circuits and matchings," }. o/Comp, and System Sci., pp. 155-193, 1979.
|
 |
3
|
|
| |
4
|
|
 |
5
|
|
 |
6
|
F E Fich , F Meyer auf der Heide , P Ragde , A Wigderson, One, two, three . . . infinity: lower bounds for parallel computation, Proceedings of the seventeenth annual ACM symposium on Theory of computing, p.48-58, May 06-08, 1985, Providence, Rhode Island, United States
[doi> 10.1145/22145.22151]
|
| |
7
|
J. Gil, "Fast load balancing on pram,' Manuscript, 1990.
|
| |
8
|
|
| |
9
|
J. Gil, Y. Matias, and U. Vishkin, "A fast parallel dictionary," in Proc. 3~nd Syrup. on Found. of Comp. Sci., 1991. to appear.
|
| |
10
|
T. Hagerup, "Fast parallel space allocation, estimation and integer sorting," Technical Report no. pending, Max-Planck-Institut ffir Informatik, 1991.
|
 |
11
|
|
| |
12
|
|
 |
13
|
|
 |
14
|
|
| |
15
|
A. Yao, "Probabilistic computations: Towards a unified measure of complexity," in Proc. 18th Syrup. on Found. o/Comp. Sci., pp. 222-227, 1977.
|
CITED BY 6
|
|
|
|
|
Michael T. Goodrich , Yossi Matias , Uzi Vishkin, Optimal parallel approximation for prefix sums and integer sorting, Proceedings of the fifth annual ACM-SIAM symposium on Discrete algorithms, p.241-250, January 23-25, 1994, Arlington, Virginia, United States
|
|
|
|
|
|
Leslie Ann Goldberg , Mark Jerrum , Philip D. MacKenzie, An Ω(√ log log n) lower bound for routing in optical networks, Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures, p.147-156, June 27-29, 1994, Cape May, New Jersey, United States
|
|
|
|
|
|
|
|