| A space-partitioning-based indexing method for multidimensional non-ordered discrete data spaces |
| Full text |
Pdf
(504 KB)
|
| Source
|
ACM Transactions on Information Systems (TOIS)
archive
Volume 24 , Issue 1 (January 2006)
table of contents
Pages: 79 - 110
Year of Publication: 2006
ISSN:1046-8188
|
|
Authors
|
|
Gang Qian
|
University of Central Oklahoma, Edmond, OK
|
|
Qiang Zhu
|
The University of Michigan---Dearborn, Dearborn, MI
|
|
Qiang Xue
|
Michigan State University, East Lansing, MI
|
|
Sakti Pramanik
|
Michigan State University, East Lansing, MI
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 70, Citation Count: 3
|
|
|
ABSTRACT
There is an increasing demand for similarity searches in a multidimensional non-ordered discrete data space (NDDS) from application areas such as bioinformatics and data mining. The non-ordered and discrete nature of an NDDS raises new challenges for developing efficient indexing methods for similarity searches. In this article, we propose a new indexing technique, called the NSP-tree, to support efficient similarity searches in an NDDS. As we know, overlap causes a performance degradation for indexing methods (e.g., the R-tree) for a continuous data space. In an NDDS, this problem is even worse due to the limited number of elements available on each dimension of an NDDS. The key idea of the NSP-tree is to use a novel discrete space-partitioning (SP) scheme to ensure no overlap at each level in the tree. A number of heuristics and strategies are incorporated into the tree construction algorithms to deal with the challenges for developing an SP-based index tree for an NDDS. Our experiments demonstrate that the NSP-tree is quite promising in supporting efficient similarity searches in NDDSs. We have compared the NSP-tree with the ND-tree, a data-partitioning-based indexing technique for NDDSs that was proposed recently, and the linear scan using different NDDSs. It was found that the search performance of the NSP-tree was better than those of both methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Bruno Becker , Paolo Giulio Franciosa , Stephan Gschwind , Thomas Ohler , Gerald Thiemt , Peter Widmayer, An Optimal Algorithm for Approximating a Set of Rectangles by Two Minimum Area Rectangles, Proceedings of the International Workshop on Computational Geometry - Methods, Algorithms and Applications, p.13-25, March 21-22, 1991
|
 |
3
|
Norbert Beckmann , Hans-Peter Kriegel , Ralf Schneider , Bernhard Seeger, The R*-tree: an efficient and robust access method for points and rectangles, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.322-331, May 23-26, 1990, Atlantic City, New Jersey, United States
|
| |
4
|
|
 |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
Clement, J., Flajolet, P., and Vallee, B. 2001. Dynamic sources in information theory: A general analysis of trie structures. Algorithm 29, 1/2, 307--369.
|
 |
9
|
|
 |
10
|
|
| |
11
|
|
| |
12
|
|
 |
13
|
|
| |
14
|
Kent, W. J. 2002. BLAT---the BLAST-like aligment tool. Genome Res. 12, 656--664.
|
| |
15
|
Knuth, D. E. 1973. The Art of Computer Programming, Vol. 3. Addison-Wesley, Reading, MA.
|
 |
16
|
|
| |
17
|
|
| |
18
|
Qian, G., Zhu, Q., Xue, Q., and Pramanik, S. 2003. The ND-tree: A dynamic indexing technique for multidimensional non-ordered discrete data spaces. In Proceedings of VLDB. 620--631.
|
| |
19
|
Qian, G., Zhu, Q., Xue, Q., and Pramanik, S. 2006. Dynamic indexing for multidimensional non-ordered discrete data spaces using a data-partitioning approach. ACM Trans. Datab. Syst. 31, 2. To appear.
|
 |
20
|
|
| |
21
|
Skopal, T., Pokorny, J., and Snasel, V. 2004. PM-tree: Pivoting metric tree for similarity search in multimedia databases. In Proceedings of Dateso 2004 Annual International Workshop on DAtabases, TExts, Specifications and Objects (DATESO'04). 27--37.
|
| |
22
|
|
| |
23
|
Uhlmann, J. K. 1991. Satisfying general proximity/similarity queries with metric trees. Inf. Proc. Lett. 40, 4, 175--179.
|
| |
24
|
|
| |
25
|
|
| |
26
|
Xiangmin Zhou , Guoren Wang , Jeffrey Xu Yu , Ge Yu, M+-tree: a new dynamical multidimensional index for metric spaces, Proceedings of the fourteenth Australasian database conference, p.161-168, February 01, 2003, Adelaide, Australia
|
| |
27
|
Zipf, G. K. 1949. Human Behavior and the Principle of Least Effort. Addison-Wesley, Reading, MA.
|
CITED BY 3
|
|
|
|
|
|
|
|
Changqing Chen , Sakti Pramanik , Qiang Zhu , Watve Alok , Gang Qian, The C-ND tree: a multidimensional index for hybrid continuous and non-ordered discrete data spaces, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, March 24-26, 2009, Saint Petersburg, Russia
|
|