ACM Home Page
Please provide us with feedback. Feedback
On a model of indexability and its bounds for range queries
Full text PdfPdf (191 KB)
Source Journal of the ACM (JACM) archive
Volume 49 ,  Issue 1  (January 2002) table of contents
Pages: 35 - 55  
Year of Publication: 2002
ISSN:0004-5411
Authors
Joseph M. Hellerstein  University of California, Berkeley, CA
Elias Koutsoupias  University of California, Los Angeles, CA
Daniel P. Miranker  University of Texas, Austin, TX
Christos H. Papadimitriou  University of California, Berkeley, CA
Vasilis Samoladas  University of Texas, Austin, TX
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 72,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/505241.505244
What is a DOI?

ABSTRACT

We develop a theoretical framework to characterize the hardness of indexing data sets on block-access memory devices like hard disks. We define an indexing workload by a data set and a set of potential queries. For a workload, we can construct an indexing scheme, which is a collection of fixed-sized subsets of the data. We identify two measures of efficiency for an indexing scheme on a workload: storage redundancy, r (how many times each item in the data set is stored), and access overhead, A (how many times more blocks than necessary does a query retrieve).For many interesting families of workloads, there exists a trade-off between storage redundancy and access overhead. Given a desired access overhead A, there is a minimum redundancy that any indexing scheme must exhibit. We prove a lower-bound theorem for deriving the minimum redundancy. By applying this theorem, we show interesting upper and lower bounds and trade-offs between A and r in the case of multidimensional range queries and set queries.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
4
 
5
BAYER, R., AND MCCREIGHT, E. 1972. Organization and maintenance of large ordered indexes. Acta Inf. 1, 173-189.
 
6
7
8
9
10
11
 
12
FIAT, A., AND SHAMIR, A. 1989. How to find a battleship. Networks 19, 361-371.
 
13
FREDMAN, M. L. 1980. The inherent complexity of dynamic data structures which accomodate range queries. In Proceedings of the IEEE Symposium on Foundations of Computer Science. IEEE Computer Society Press, Los Alamitos, Calif., pp. 191-199.
 
14
FREDMAN, M. L. 1981. Lower bounds on the complexity of some optimal data structures. SIAM J. Comput. 10, 1-10.
15
16
 
17
 
18
JOHNSON, S. M. 1962. A new upper bound for error-correcting codes. IEEE Trans. Inf. Theory 8, 203- 207.
 
19
20
21
 
22
23
24
 
25
 
26
MATOUSEK, J. 1999. Geometric Discrepancy. Springer-Verlag, New York.
 
27
28
29
30
31
32
33
34
 
35
 
36
 
37
 
38
 
39
TURAN, P. 1941. An extrenal problem in graph theory (in Hungarian). Mat. Fiz. Lapok. 48, 435-452.
 
40
 
41
VAN LINT, J. H., AND WILSON, R. M. 1992. A Course in Combinatorics. Cambridge University Press, Cambridge, Mass.
42
 
43
44
45



REVIEW

"Fazli Can : Reviewer"

In this paper, the authors introduce a new framework for the modeling of indexing on block-access external devices such as hard disks. The authors define an indexing workload as a data set and the associated set of potential queries. For each work  more...

Collaborative Colleagues:
Joseph M. Hellerstein: colleagues
Elias Koutsoupias: colleagues
Daniel P. Miranker: colleagues
Christos H. Papadimitriou: colleagues
Vasilis Samoladas: colleagues