ACM Home Page
Please provide us with feedback. Feedback
K-tree: large scale document clustering
Full text PdfPdf (95 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
POSTER SESSION: Posters table of contents
Pages 718-719  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Christopher M. De Vries  Queensland University of Technology, Brisbane, Australia
Shlomo Geva  Queensland University of Technology, Brisbane, Australia
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 34,   Downloads (12 Months): 109,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1572094
What is a DOI?

ABSTRACT

We introduce K-tree in an information retrieval context. It is an efficient approximation of the k-means clustering algorithm. Unlike k-means it forms a hierarchy of clusters. It has been extended to address issues with sparse representations. We compare performance and quality to CLUTO using document collections. The K-tree has a low time complexity that is suitable for large document collections. This tree structure allows for efficient disk based implementations where space requirements exceed that of main memory.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
K-tree project page, http://ktree.sourceforge.net. 2009.
2
 
3
S. Geva. K-tree: a height balanced tree structured vector quantizer. Neural Networks for Signal Processing X, 2000. Proceedings of the 2000 IEEE Signal Processing Society Workshop, 1:271--280 vol.1, 2000.
 
4
G. Karypis. CLUTO-A Clustering Toolkit. 2002.
 
5
L. Kaufman, P. Rousseeuw, Delft(Netherlands). Dept. of Mathematics Technische Hogeschool, and Informatics. Clustering by means of medoids. Technical report, Technische Hogeschool, Delft(Netherlands). Dept. of Mathematics and Informatics., 1987.
 
6
Y. Song, W.Y. Chen, H. Bai, C.J. Lin, and E.Y. Chang. Parallel Spectral Clustering. 2008.
 
7

Collaborative Colleagues:
Christopher M. De Vries: colleagues
Shlomo Geva: colleagues