ACM Home Page
Please provide us with feedback. Feedback
Semantic mapping and K-means applied to hybrid SOM-based document organization system construction
Full text PdfPdf (65 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2008 ACM symposium on Applied computing table of contents
Fortaleza, Ceara, Brazil
SESSION: Information access and retrieval table of contents
Pages 1112-1116  
Year of Publication: 2008
ISBN:978-1-59593-753-7
Authors
Renato Fernandes Corrêa  Cidade Universitária Recife -- PE, Brazil
Teresa Bernarda Ludermir  Cidade Universitária Recife -- PE, Brazil
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 77,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1363686.1363945
What is a DOI?

ABSTRACT

In this paper, we present and evaluate a hybrid document organization system based on Self-Organizing Maps. The proposed system uses Semantic Mapping to dimensionality reduction and K-means to volume reduction of document vectors of a medium text collection. The vectors obtained after dimensionality and volume reduction steps are used to train the document maps with the SOM algorithm, thus the training time is reduced without compromising the quality of the generated map. We compare experimentally the hybrid system with the correspondent SOM system in organization of documents of Reuters-21758 v1.0 collection. The performances of the systems were measured in terms of classification error in text categorization and training time. The experimental results show that the proposed system generates pretty good document maps with smallest training time.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
T. Kohonen, S. Kaski, K. Lagus, J. Salojärvi, J. Honkela, V. Paatero and A. Saarela, Self Organization of a Massive Document Collection, IEEE Transaction on Neural Networks, v. 11, n. 3, 2000, pp. 574--585.
 
2
S. Kaski, Dimensionality Reduction by Random Mapping: Fast Similarity Computation for Clustering, In Proc. IJCNN'98, v. 1, 1998, 413--418.
 
3
 
4
R. F. Corrêa, T. B. Ludermir, Dimensionality Reduction by Semantic Mapping, In Proc. VIII Brazilian Neural Networks Symposium. Los Alamitos: IEEE Computer Society, 2004.
 
5
R. F. Corrêa, T. B. Ludermir, Improving Self-Organization of Document Collections by Semantic Mapping, Neurocomputing (Amsterdam), v.70, 2006, 62--69.
 
6
7
 
8
R. F. Corrêa, T. B. Ludermir, Dimensionality Reduction of very large document collections by Semantic Mapping. In Proc. 6th Workshop on Self-Organizing Maps WSOM 2007, Bielefeld, Germany, in press, 2007.
9
 
10
A. Strehl, J. Ghosh, R. Mooney, Impact of similarity measures on web-page clustering, In Proc. AAAI Workshop on AI for Web Search, 2000, 58--64.
 
11
R. R. Wilcox, Fundamentals of Modern Statistical Methods: Substantially Improving Power and Accuracy, Springer-Verlag, New York, 2001.
 
12
D. D. Lewis, Reuters-21578 Text Categorization Test Colletion, AT&T Labs Research, 1997. Available: http://www.research.att.com/~lewis.
 
13

Collaborative Colleagues:
Renato Fernandes Corrêa: colleagues
Teresa Bernarda Ludermir: colleagues