| A class of multistep sparse matrix strategies for concept decomposition matrix approximation |
| Full text |
Pdf
(3.70 MB)
|
Source
|
Symposium on Applied Computing
archive
Proceedings of the 2009 ACM symposium on Applied Computing
table of contents
Honolulu, Hawaii
SESSION: Information access and retrieval track
table of contents
Pages 1714-1718
Year of Publication: 2009
ISBN:978-1-60558-166-8
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 26, Citation Count: 0
|
|
|
ABSTRACT
In information retrieval, text documents are usually modeled as a term-document matrix which has high dimensional and space vectors. To reduce the high dimensions, one of the various dimensionality reduction methods, concept decomposition, has been developed by [3]. This method is based on document clustering techniques and least-square matrix approximation to approximate the matrix of vectors. Gao and Zhang [4] have indicated that the retrieval accuracy from the concept decomposition can be comparable to that from Latent Semantic Indexing. However the numerical computation is expensive. In this paper we presented a class of multistep spare matrix strategies for concept decomposition matrix approximation. In this approach, a series of simple sparse matrices are used to approximate the decompositions. Our numerical experiments show the advantage of such an approach in terms of storage costs and query time compared with other approaches while maintaining comparable retrieval quality.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Chung-Min Chen , Ned Stoffel , Mike Post , Chumki Basu , Devasis Bassu , Clifford Behrens, Telcordia LSI Engine: Implementation and Scalability Issues, Proceedings of the 11th International Workshop on research Issues in Data Engineering, p.51, April 01-02, 2001
|
| |
2
|
|
| |
3
|
|
| |
4
|
J. Gao and J. Zhang. Text retrieval using sparsified concept decomposition matrix. Book Chapter of Computational and Information Science, vol. 3314, pp. 523--529, 2005
|
| |
5
|
A. Kontostathis, W. M. Pottenger, and B. D. Davison. Identifcation of critical values in Latent Semantic Indexing (LSI). Book chapter of Foundations of Data Mining and Knowledge Discovery, pp.333--346, 2005.
|
| |
6
|
|
| |
7
|
C. Shen and D. Williams. Approximate Matrix Decomposition Technniques in Information Retrieval, In Proceedings of World Congress on Engineering and Computer Science 2007, pp. 347--352, 2007.
|
|