|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ABSTRACT
In this paper, we present and evaluate a hybrid document organization system based on Self-Organizing Maps. The proposed system uses Semantic Mapping to dimensionality reduction and K-means to volume reduction of document vectors of a medium text collection. The vectors obtained after dimensionality and volume reduction steps are used to train the document maps with the SOM algorithm, thus the training time is reduced without compromising the quality of the generated map. We compare experimentally the hybrid system with the correspondent SOM system in organization of documents of Reuters-21758 v1.0 collection. The performances of the systems were measured in terms of classification error in text categorization and training time. The experimental results show that the proposed system generates pretty good document maps with smallest training time. REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
INDEX TERMS
Primary Classification:
General Terms:
Keywords:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||