ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Latent semantic indexing is an optimal special case of multidimensional scaling
Full text PdfPdf (728 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Copenhagen, Denmark
Pages: 161 - 167  
Year of Publication: 1992
ISBN:0-89791-523-2
Authors
Brian T. Bartell  Department of Computer Science & Engineering-0114, University of California, San Diego, La Jolla, California
Garrison W. Cottrell  Department of Computer Science & Engineering-0114, University of California, San Diego, La Jolla, California
Richard K. Belew  Department of Computer Science & Engineering-0114, University of California, San Diego, La Jolla, California
Sponsors
Royal School of Lib. : Royal School of Lib.
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 60,   Citation Count: 13
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/133160.133191
What is a DOI?

ABSTRACT

Latent Semantic Indexing (LSI) is a technique for representing documents, queries, and terms as vectors in a multidimensional real-valued space. The representtions are approximations to the original term space encoding, and are found using the matrix technique of Singular Value Decomposition. In comparison Multidimensional Scaling (MDS) is a class of data analysis techniques for representing data points as points in a multidimensional real-valued space. The objects are represented so that inter-point similarities in the space match inter-object similarity information provided by the researcher. We illustrate how the document representations given by LSI are equivalent to the optimal representations found when solving a particular MDS problem in which the given inter-object similarity information is provided by the inner product similarities between the documents themselves. We further analyze a more general MDS problem in which the interdocument similarity information, although still in inner product form is arbitrary with respect to the vector space encoding of the documents.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman. Indexing by latent semantic analysis. Journal of the A merzcau Society for inform alion Sczence, 41(6):391-407, 1990.
 
3
Susan T. Dumais. Enhancing performance in latent semantic indexing (LSI) retrieval. Technical Report Technical Memorandum, Bellcore, September 1990.
 
4
james E. Everett and Antony Pecotich. A combined loglinear/MDS model for mapping journals by citation analysis. Journal of the American Soczely for Information Sczence, 42(6):405-413, 1991.
5
 
6
Michael J. Greenacre and Leslie G. Underhill. Scaling a data matrix in a low-dimensional euclidean space. In Douglas M. Hawkins, editor, Top~cs zn Apphed Mullzvarzate Analyszs, pages 183-268. Cambridge University Press, 1982.
 
7
J. B. Kruskal. Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrtka, 29(1):i-27, March 1964.
 
8
 
9
Gilbert W. Stewart. Introduction to Matrix Computations. Academic Press, 1973.
 
10
W. S. Torgerson. Theory and Methods of Scahng. New York: John Wiley, 1958.

CITED BY  13

Collaborative Colleagues:
Brian T. Bartell: colleagues
Garrison W. Cottrell: colleagues
Richard K. Belew: colleagues