|
ABSTRACT
We consider two hashing models for storing a set S ⊂ {0, 1, 2, …, m - 1} in a table T of size n.
The first model uses universal hashing for a partially loaded table. A set of hash functions is universal if, for any the input set, a randomly selected function has an efficient expected performance. Universal hash functions originate in [CW79], where they were used for open hashing using chaining. [CW79] poses as an open question whether comparable results can be achieved for any closed hashing schemes.
The second model is perfect hashing for a full table. In preprocessing the input set is used to determine a hash function that achieves some desired performance criteria. This model was used among others in [ME82] and [FKS84].
In both models a key problem is to construct a “small” set of functions, which will permit a short description (program) for each function in the set.
- We show, for the first time, that universal hashing can be successfully used for closed hashing and in particular for double hashing. Specifically, the set of congruential polynomials of &Ogr;(log n) degree is universal for double hashing if the table load is below .75; the program size (or number of random bits generated by the algorithm) is &Ogr;(log log m + log2 n).
For perfect hashing, we obtain nearly tight results on the size of oblivious &Ogr;(1)-probe hash functions:
- Oblivious k-probe hash functions require &OHgr;(n/k2e-k + log log m) bits of description.
- A probabilistic construction is presented, which shows that oblivious k-probe hash functions, can be specified in &Ogr;(ne-k + log log m) bits, which nearly matches the above lower bound.
- We give a variation of an &Ogr;(1) time 1-probe (perfect) hash function that can be specified in &Ogr;(n + log log m) bits, which is tight to within a constant factor of the lower bound.
In view of the adaptive schemes presented in [FNSS88], these bounds establish a significant gap between oblivious and non-oblivious &Ogr;(1)-probe search.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
A85
|
M. Ajtai. "A Lower Bound for Finding Predecessors in Yao's Cell Probe Model," IBM RJ 4867 (51297) 10/3/85, Computer Science.
|
| |
BBDOP86
|
|
| |
CW79
|
J. L. Carter and M. N. Wegman "Universal Classes of Hash Functions," Journal of Computer and System Sciences 18, pp. 143-154 (1979).
|
| |
DKMMRT88
|
M. Dietzfelbinger, A. Karlin, K. Mehlhorn, F. Meyer auf der Heide, H. Rohnert, and R.E. Tarjan. "Dynamic Perfect Hashing: Upper and Lower Bounds," 29th Annual Symposium on the Foundations of Computing, 1988, pp. 524-531.
|
| |
FN
|
A. Fiat and M. Naor. Personal communication, 1988.
|
 |
FN88
|
|
 |
FNSS88
|
Amos Fiat , Moni Naor , Jeanette Schmidt , Alan Siegel, Non-oblivious hashing, Proceedings of the twentieth annual ACM symposium on Theory of computing, p.367-376, May 02-04, 1988, Chicago, Illinois, United States
[doi> 10.1145/62212.62248]
|
| |
F
|
M.L. Fredman. Personal communication, 1988.
|
| |
FK84
|
M.L. Fredman and J. Koml6s. "On The Size Of Separating Systems And Families Of Perfect Hash Functions," SIAM Journal of Algebraic and Discrete Methods, Vo} 5, No. 1, March 1984, pp. 61-68.
|
 |
FKS84
|
|
| |
HK73
|
J.E. Hopcroft and R.M. Karp. "An n5/2 Algorithm for Maximum Match~ngs in Bipartite Graphs," SIAM Journal on Computing, Vol 2, No. 4, (1973), pp. 225-231.
|
| |
JvE86
|
|
 |
LM88
|
|
| |
Ma83
|
H. G. Mairson. "The Program Complexity of Searching a Table," 24th Symposium on Foundation of Computer Science, November, 1983.
|
| |
Ma84
|
|
| |
Me82
|
K. Mehlhorn. "On the Program size of Perfect and Universal Hash functions," Proc. 23rd Ann. Symp. on Foundations of Computer Science, 1982, pp. 170-175.
|
| |
Me84
|
K. Mehlhorn. Data Structures and Algorithms 1: Sorting and Searching, Springer-Verlag, Berlin Heidelberg, 1984.
|
 |
SvE84
|
|
 |
TY79
|
|
| |
WC79
|
M. N. Wegman and J. L. Carter. "New Classes and Applications of Hash Functions," 20th Annual Symposium on Foundations of Computer Science, October 1979, pp. 175-182.
|
 |
Y81
|
|
CITED BY 7
|
|
|
|
|
Jeanette P. Schmidt , Alan Siegel , Aravind Srinivasan, Chernoff-Hoeffding bounds for applications with limited independence, Proceedings of the fourth annual ACM-SIAM Symposium on Discrete algorithms, p.331-340, January 25-27, 1993, Austin, Texas, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|