|
ABSTRACT
The Patricia trie is a simple modification of a regular trie. By eliminating unary branching nodes, the Patricia achieves better performance than regular tries. However, the question is: how much on the average is the Patricia better? This paper offers a thorough answer to this question by considering some statistics of the number of nodes examined in a successful search and an unsuccessful search in the Patricia tries. It is shown that for the Patricia containing n records the average of the successful search length Sn asymptotically becomes 1/h1 · ln n + O(1), and the variance of Sn is either var Sn = c · ln n + 0(1) for an asymmetric Patricia or var Sn = 0(1) for a symmetric Patricia, where h1 is the entropy of the alphabet over which the Patricia is built and c is an explicit constant. Higher moments of Sn are also assessed. The number of nodes examined in an unsuccessful search Un is studied only for binary symmetric Patricia tries. We prove that the mth moment of the unsuccessful search length EUmn satisfies limn→∞ EUmn/logm2n = 1, and the variance of Un is var Un = 0.87907. These results suggest that Patricia tries are very well balanced trees in the sense that a random shape of Patriciatries resembles the shape of complete trees that are ultimately balanced trees.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
ABRAMOWITZ, M., AND STEGUN, I. Handbook of Mathematical Functions. Wiley, New York, 1972.
|
| |
2
|
Alfred V. Aho , John E. Hopcroft , Jeffrey Ullman , J. D. Ullman , J. E. Hopcroft, Data Structures and Algorithms, Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 1983
|
| |
3
|
BERNDT, B. Ramanujan's Notebooks. Springer-Verlag, New York, 1985.
|
| |
4
|
ERDELYI, A. Higher Transcendental Functions. McGraw-Hill, New York, 1953.
|
 |
5
|
|
| |
6
|
FLAJOLET, PH. On the performance evaluation of extendible hashing and trie searching. Acta Inf. 20 (1983), 345-369.
|
| |
7
|
|
| |
8
|
|
| |
9
|
GALLAGER, R. Conflict resolution in random access broadcast networks. In Proceedings of the AFOSR Workshop in Communication Theory and Applications. Provincetown, Mass., 1978, pp. 74-76.
|
| |
10
|
|
| |
11
|
|
| |
12
|
JACQUET, PH., AND REGNIER, M. Limiting distributions for trie parameters. In Lecture Notes in Computer Science, vol. 214. Springer-Verlag, New York, 1986, pp. 196-210.
|
| |
13
|
|
| |
14
|
|
| |
15
|
NORLUND, N.E. Memoire sur les polynomes de Bernoulli. Acta Math. 43 (1923), 124-196.
|
| |
16
|
PAIGE, R., AND TARJAN, R. Three efficient algorithms based on partition refinements, preprint.
|
| |
17
|
PITTEL, B. Asymptotical growth of a class of random trees. Ann. Prob. 13 (1985), 414-427.
|
| |
18
|
RIORDAN, J. Combinatorial Identities. Wiley, New York, 1968.
|
| |
19
|
|
| |
20
|
SZPANKOWSHI, W. Patricia tries again revisited. Tech. Rep. CSD-TR 625. Dept. Comput. Sci., Purdue Univ., West Lafayette, Ind., 1986.
|
| |
21
|
SZPANKOWSKI, W. On the recurrence equation arising in the analysis of conflict resolution algorithms. Stochastic Models 3 (1987), 89- l 14.
|
| |
22
|
|
| |
23
|
|
| |
24
|
|
| |
25
|
WHITTAKER, E., AND WATSON, G. A Course of Modern Analysis. Cambridge Press, London, 1935.
|
CITED BY 5
|
|
|
|
|
|
|
|
|
|
|
|
|
|
K. Selçuk Candan , Mehmet E. Dönderler , J. Ramamoorthy , Jong W. Kim, Clustering and indexing of experience sequences for popularity-driven recommendations, Proceedings of the 3rd ACM workshop on Continuous archival and retrival of personal experences, October 28-28, 2006, Santa Barbara, California, USA
|
|