| Latent semantic indexing is an optimal special case of multidimensional scaling |
| Full text |
Pdf
(728 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Copenhagen, Denmark
Pages: 161 - 167
Year of Publication: 1992
ISBN:0-89791-523-2
|
|
Authors
|
|
Brian T. Bartell
|
Department of Computer Science & Engineering-0114, University of California, San Diego, La Jolla, California
|
|
Garrison W. Cottrell
|
Department of Computer Science & Engineering-0114, University of California, San Diego, La Jolla, California
|
|
Richard K. Belew
|
Department of Computer Science & Engineering-0114, University of California, San Diego, La Jolla, California
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 6, Downloads (12 Months): 60, Citation Count: 13
|
|
|
ABSTRACT
Latent Semantic Indexing (LSI) is a technique for representing documents, queries, and terms as vectors in a multidimensional real-valued space. The representtions are approximations to the original term space encoding, and are found using the matrix technique of Singular Value Decomposition. In comparison Multidimensional Scaling (MDS) is a class of data analysis techniques for representing data points as points in a multidimensional real-valued space. The objects are represented so that inter-point similarities in the space match inter-object similarity information provided by the researcher. We illustrate how the document representations given by LSI are equivalent to the optimal representations found when solving a particular MDS problem in which the given inter-object similarity information is provided by the inner product similarities between the documents themselves. We further analyze a more general MDS problem in which the interdocument similarity information, although still in inner product form is arbitrary with respect to the vector space encoding of the documents.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman. Indexing by latent semantic analysis. Journal of the A merzcau Society for inform alion Sczence, 41(6):391-407, 1990.
|
| |
3
|
Susan T. Dumais. Enhancing performance in latent semantic indexing (LSI) retrieval. Technical Report Technical Memorandum, Bellcore, September 1990.
|
| |
4
|
james E. Everett and Antony Pecotich. A combined loglinear/MDS model for mapping journals by citation analysis. Journal of the American Soczely for Information Sczence, 42(6):405-413, 1991.
|
 |
5
|
|
| |
6
|
Michael J. Greenacre and Leslie G. Underhill. Scaling a data matrix in a low-dimensional euclidean space. In Douglas M. Hawkins, editor, Top~cs zn Apphed Mullzvarzate Analyszs, pages 183-268. Cambridge University Press, 1982.
|
| |
7
|
J. B. Kruskal. Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrtka, 29(1):i-27, March 1964.
|
| |
8
|
|
| |
9
|
Gilbert W. Stewart. Introduction to Matrix Computations. Academic Press, 1973.
|
| |
10
|
W. S. Torgerson. Theory and Methods of Scahng. New York: John Wiley, 1958.
|
CITED BY 13
|
|
Inien Syu , S. D. Lang , Narsingh Deo, Incorporating latent semantic indexing into a neural network model for information retrieval, Proceedings of the fifth international conference on Information and knowledge management, p.145-153, November 12-16, 1996, Rockville, Maryland, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xuanhui Wang , Jian-Tao Sun , Zheng Chen , ChengXiang Zhai, Latent semantic analysis for multiple-type interrelated data objects, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|