ACM Home Page
Please provide us with feedback. Feedback
Towards a complete characterization of tries
Full text PdfPdf (742 KB)
Source Symposium on Discrete Algorithms archive
Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms table of contents
Vancouver, British Columbia
SESSION: Session 1A table of contents
Pages: 33 - 42  
Year of Publication: 2005
ISBN:0-89871-585-7
Authors
Gahyun Park  Purdue University, West Lafayette, IN
Wojciech Szpankowski  Purdue University, West Lafayette, IN
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
: SIAM Activity Group on Discrete Mathematics
Publisher
Society for Industrial and Applied Mathematics  Philadelphia, PA, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 42,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Tries and suffix trees are the most popular data structures on words. Tries were introduced in 1960 by Fredkin as an efficient method for searching and sorting digital data. Since then myriad of novel trie applications were found such as dynamic hashing, conflict resolution algorithms, leader election algorithms, IP addresses lookup, coding, polynomial factorization, Lempel-Ziv compression schemes, and so on. Furthermore, various analyses of tries reveal new fundamental properties of strings appearing in those applications. Parameters of interest are the (partial) fillup level (the largest full level of the trie), shortest path, height (longest path), typical depth, and path length (sum of depths). All of these parameters are analyzed here in a unifying manner by studying the external and internal profiles. A profile of a tree at level k is the number of nodes (internal or external) at level k. We derive recurrences for both profiles and solve them asymptotically for various ranges of k when the strings stored in the trie are generated by a memoryless source (extension to a Markov source is possible). In particular, we present asymptotic results for the average profile, the variance and the limiting distribution. As consequences we find the height, shortest path, fillup level, and the depth. These results are derived here by methods of analytic algorithmics such as generating functions, Mellin transform, poissonization and depoissonization, and the saddle point method.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
L. Devroye, A Study of Trie-Like Structures Under the Density Model, Annals of Applied Probability, 2, 402--434, 1992.
 
2
L. Devroye, A Note on the Probabilistic Analysis of Patricia Tries, Random Structures & Algorithms, 3, 203--214, 1992.
 
3
 
4
 
5
 
6
P. Jacquet, and W. Szpankowski, Analysis of Digital Tries with Markovian Dependency, IEEE Trans. Information Theory, 37, 1470--1475, 1991.
 
7
 
8
 
9
S. Janson and W. Szpankowski, Analysis of an Asymmetric Leader Election Algorithm, The Electronic Journal of Combinatorics64(1), R17, 1--16, 1997.
 
10
 
11
 
12
D. E. Knuth, Fundamental Algorithms, 3rd ed. (Reading, Massachusetts: Addison Wesley, 1997).
 
13
 
14
H. Mahmoud, Evolution of Random Search Trees, John Wiley & Sons, New York, 1992.
 
15
S. Nilsson, Radix Sorting & Searching, PhD Thesis, Lund University, 1996.
 
16
B. Pittel, Asymptotic Growth of a Class of Random Trees, Annals of Probability, 18, 414--427, 1985.
 
17
B. Pittel, Paths in a Random Digital Tree: Limiting Distributions, Adv. in Applied Probability, 18, 139--155, 1986.
 
18
 
19
20
 
21
W. Szpankowski, On the Height of Digital Trees and Related Problems, Algorithmica, 6, 256--277, 1991.
 
22

Collaborative Colleagues:
Gahyun Park: colleagues
Wojciech Szpankowski: colleagues