ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Summarizing level-two topological relations in large spatial datasets
Full text PdfPdf (1.33 MB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 31 ,  Issue 2  (June 2006) table of contents
Pages: 584 - 630  
Year of Publication: 2006
ISSN:0362-5915
Authors
Xuemin Lin  NICTA & University of New South Wales, Sydney, Australia
Qing Liu  NICTA & University of New South Wales, Sydney, Australia
Yidong Yuan  NICTA & University of New South Wales, Sydney, Australia
Xiaofang Zhou  University of Queensland, Australia
Hongjun Lu  Hong Kong University of Science and Technology, Hong Kong
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 65,   Citation Count: 0
Additional Information:

appendices and supplements   abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1138394.1138398
What is a DOI?

APPENDICES and SUPPLEMENTS
Online appendix to designing mediation for context-aware applications. The appendix supports the information on page 584.


ABSTRACT

Summarizing topological relations is fundamental to many spatial applications including spatial query optimization. In this article, we present several novel techniques to effectively construct cell density based spatial histograms for range (window) summarizations restricted to the four most important level-two topological relations: contains, contained, overlap, and disjoint. We first present a novel framework to construct a multiscale Euler histogram in 2D space with the guarantee of the exact summarization results for aligned windows in constant time. To minimize the storage space in such a multiscale Euler histogram, an approximate algorithm with the approximate ratio 19/12 is presented, while the problem is shown NP-hard generally. To conform to a limited storage space where a multiscale histogram may be allowed to have only k Euler histograms, an effective algorithm is presented to construct multiscale histograms to achieve high accuracy in approximately summarizing aligned windows. Then, we present a new approximate algorithm to query an Euler histogram that cannot guarantee the exact answers; it runs in constant time. We also investigate the problem of nonaligned windows and the problem of effectively partitioning the data space to support nonaligned window queries. Finally, we extend our techniques to 3D space. Our extensive experiments against both synthetic and real world datasets demonstrate that the approximate multiscale histogram techniques may improve the accuracy of the existing techniques by several orders of magnitude while retaining the cost efficiency, and the exact multiscale histogram technique requires only a storage space linearly proportional to the number of cells for many popular real datasets.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
4
 
5
 
6
Egenhofer, M. J. and Herring, J. R. 1994. Categorizing binary topological relations between regions, lines, and points in geographic databases. In The 9-Intersection: Formalism and Its Use for Natural-Language Spatial Predicates. National Center for Geographic Information and Analysis, Report 94-1, M. J. Egenhofer, D. M. Mark, and J. R. Herring, Eds.
7
 
8
Fleischer, R., Golin, M. J., and Zhang, Y. 2004. Online maintenance of k-medians and k-covers on a line. In Proceedings of SWAT'04 (Humlebaek, Denmark, July 8--10). 102--113.
9
 
10
Garofalakis, M. N. and Gehrke, J. 2002. Querying and mining data streams: You only get one look. In Proceedings of the Symposium on Very Large Data Bases (VLDB'02) (Hong Kong, Aug. 20--23).
 
11
Gilbert, A. C., Kotidis, Y., Muthukrishnan, S., and Strauss, M. 2002. How to summarize the universe: Dynamic maintenance of quantiles. In Proceedings of the Symposium on Very Large Data Bases (VLDB'02). Hong Kong, August 20--23, 454--465.
 
12
Greene, S., Tanin, E., Plaisant, C., Shneiderman, B., Olsen, L., Major, G., and Johns, S. 1999. The end of zero-hit queries: Query previews for NASA's global change master directory. Int. J. Dig. Lib. 2, 2-3, 79--90.
 
13
Grigni, M., Papadias, D., and Papadimitriou, C. H. 1995. Topological inference. In Proceedings of the IJCAI (1). (Montréal, Qué. Aug. 20--25). 901--907.
 
14
 
15
Harary, F. 1969. Graph Theory. Addison-Wesley, Reading, MA.
 
16
Hassin, R. and Tamir, A. 1991. Improved complexity bounds for location problems on the real line. Oper. Res. Lett. 10, 395--402.
17
 
18
 
19
 
20
Lin, X., Liu, Q., Yuan, Y., and Zhou, X. 2003. Multiscale histograms: Summarizing topological relations in large spatial datasets. In Proceedings of the Symposium on Very Large Data Bases (VLDB'03) (Berlin, Germany, Sept. 9--12). 814--825.
21
22
23
24
 
25
Micali, S. and Vazirani, V. V. 1980. An o(√|V||E|) algorithm for finding maximum matching in general graphs. In Proceedings of the Symposium on Foundations of Computer Science (FOCS'80) (Syracuse, NY, Oct. 17--27). IEEE Computer Society Press, Los Alamitos, CA.
26
 
27
 
28
 
29
30
 
31
 
32
Tao, Y., Sun, J., and Papadias, D. 2003. Selectivity estimation for predictive spatio-temporal queries. In Proceedings of the International Conference on Data Engineering (ICDE'03) (Bangalore, India, Mar. 5--8). 417--428.
 
33
TIGER. 2000. Tiger/line files. Technical report, U.S. Census Bureau, Washington, DC.
 
34
Zipf, G. K. 1949. Human Behaviour and the Principle of Least Effort: an Introduction to Human Ecology. Addison-Wesley, Reading, MA.

Collaborative Colleagues:
Xuemin Lin: colleagues
Qing Liu: colleagues
Yidong Yuan: colleagues
Xiaofang Zhou: colleagues
Hongjun Lu: colleagues