ACM Home Page
Please provide us with feedback. Feedback
A space-partitioning-based indexing method for multidimensional non-ordered discrete data spaces
Full text PdfPdf (504 KB)
Source ACM Transactions on Information Systems (TOIS) archive
Volume 24 ,  Issue 1  (January 2006) table of contents
Pages: 79 - 110  
Year of Publication: 2006
ISSN:1046-8188
Authors
Gang Qian  University of Central Oklahoma, Edmond, OK
Qiang Zhu  The University of Michigan---Dearborn, Dearborn, MI
Qiang Xue  Michigan State University, East Lansing, MI
Sakti Pramanik  Michigan State University, East Lansing, MI
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 70,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1125857.1125860
What is a DOI?

ABSTRACT

There is an increasing demand for similarity searches in a multidimensional non-ordered discrete data space (NDDS) from application areas such as bioinformatics and data mining. The non-ordered and discrete nature of an NDDS raises new challenges for developing efficient indexing methods for similarity searches. In this article, we propose a new indexing technique, called the NSP-tree, to support efficient similarity searches in an NDDS. As we know, overlap causes a performance degradation for indexing methods (e.g., the R-tree) for a continuous data space. In an NDDS, this problem is even worse due to the limited number of elements available on each dimension of an NDDS. The key idea of the NSP-tree is to use a novel discrete space-partitioning (SP) scheme to ensure no overlap at each level in the tree. A number of heuristics and strategies are incorporated into the tree construction algorithms to deal with the challenges for developing an SP-based index tree for an NDDS. Our experiments demonstrate that the NSP-tree is quite promising in supporting efficient similarity searches in NDDSs. We have compared the NSP-tree with the ND-tree, a data-partitioning-based indexing technique for NDDSs that was proposed recently, and the linear scan using different NDDSs. It was found that the search performance of the NSP-tree was better than those of both methods.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
5
 
6
 
7
 
8
Clement, J., Flajolet, P., and Vallee, B. 2001. Dynamic sources in information theory: A general analysis of trie structures. Algorithm 29, 1/2, 307--369.
9
10
 
11
 
12
13
 
14
Kent, W. J. 2002. BLAT---the BLAST-like aligment tool. Genome Res. 12, 656--664.
 
15
Knuth, D. E. 1973. The Art of Computer Programming, Vol. 3. Addison-Wesley, Reading, MA.
16
 
17
 
18
Qian, G., Zhu, Q., Xue, Q., and Pramanik, S. 2003. The ND-tree: A dynamic indexing technique for multidimensional non-ordered discrete data spaces. In Proceedings of VLDB. 620--631.
 
19
Qian, G., Zhu, Q., Xue, Q., and Pramanik, S. 2006. Dynamic indexing for multidimensional non-ordered discrete data spaces using a data-partitioning approach. ACM Trans. Datab. Syst. 31, 2. To appear.
20
 
21
Skopal, T., Pokorny, J., and Snasel, V. 2004. PM-tree: Pivoting metric tree for similarity search in multimedia databases. In Proceedings of Dateso 2004 Annual International Workshop on DAtabases, TExts, Specifications and Objects (DATESO'04). 27--37.
 
22
 
23
Uhlmann, J. K. 1991. Satisfying general proximity/similarity queries with metric trees. Inf. Proc. Lett. 40, 4, 175--179.
 
24
 
25
 
26
 
27
Zipf, G. K. 1949. Human Behavior and the Principle of Least Effort. Addison-Wesley, Reading, MA.


Collaborative Colleagues:
Gang Qian: colleagues
Qiang Zhu: colleagues
Qiang Xue: colleagues
Sakti Pramanik: colleagues