ACM Home Page
Please provide us with feedback. Feedback
Information retrieval using a singular value decomposition model of latent semantic structure
Full text PdfPdf (1.35 MB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Grenoble, France
Pages: 465 - 480  
Year of Publication: 1988
ISBN:2-7061-0309-4
Authors
G. W. Furnas  Bellcore
S. Deerwester  University of Chicago
S. T. Dumais  Bellcore
T. K. Landauer  Bellcore
R. A. Harshman  University of Western Ontario
L. A. Streeter  Bellcore
K. E. Lochbaum  Bellcore
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 30,   Downloads (12 Months): 179,   Citation Count: 44
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/62437.62487
What is a DOI?

ABSTRACT

In a new method for automatic indexing and retrieval, implicit higher-order structure in the association of terms with documents is modeled to improve estimates of term-document association, and therefore the detection of relevant documents on the basis of terms found in queries. Singular-value decomposition is used to decompose a large term by document matrix into 50 to 150 orthogonal factors from which the original matrix can be approximated by linear combination; both documents and terms are represented as vectors in a 50- to 150- dimensional space. Queries are represented as pseudo-documents vectors formed from weighted combinations of terms, and documents are ordered by their similarity to the query. Initial tests find this automatic method very promising.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
ATHERTON65
Atherton, P. and Borko, H. A test of factor-analytically derived automated classification methods. AIP rept AIP-DRP 65-1, Jan. 1965.
BAKER62
 
BATES86
Bates, M.J. Subject access in online catalogs: A design model. JASIS, 1986, 37 (6), 357-376.
BORKO63
 
CARROLL70
Carroll, J.D. and Chang, J.J. Analysis of individual differences in multidimensional scaling via an N-way generalization of "Eckart-Young" decomposition. Psychometrika, 1970, 35, 283-319.
 
CARROLL80
Carroll, J.D. and Arabie, P. Multidimensional scaling. In M.R. Rosenzweig & L.W. Porter (Eds.). Annual Review of Psychology, 1980, 31,607-649.
 
COOMBS64
Coombs, C.H. A Theory of Data. New York: Wiley, 1964.
 
DESARBO85
Desarbo, W.S., and Carroll, J.D. Three-way metric unfolding via alternating weighted least squares. Psychometrika, 1985, 50(3), 275-300.
 
DUMAIS89
Dumais, S.T., Deerwester, S., Furnas, G.W., Landauer T.K., and Harshman, R., Indexing by Latent Structure Analysis. Journal of the American Society for Information Science. 1989, in press.
 
FIDEL85
Fidel, R. Individual variability in online searching behavior. In C.A. Parkhurst (Ed.). ASIS'85: Proceedings of the ASIS 48th Annual Meeting, Vol. 22, October 20-24, 1985, Las Vegas, 69-72.
 
FORSYTHE77
 
FURNAS80
Furnas, G.W. Objects and their features: The metric representation of two-class data. Ph.D. Dissertation. Stanford University, 1980.
 
FURNAS83
Furnas, G.W., Landauer, T.K., Dumais, S.T., and Gomez, L.M. Statistical semantics: Analysis of the potential performance of key-word information systems. Bell System Technical Journal, 1983, 62(6), 1753-1806.
 
HARSHMAN70
Harshman, R.A. Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-modaJ factor analysis. UCLA Work: Papers Phonetics, 1970, 16, 86pp.
 
HARSHMAN84
Harshman, R.A. and {.undy, M.E. Data preprocessing and the extended PARA.FAC model. In H.G. Law, C.W. Snyder, Jr., .I.A. Hattie, and R.P. McDonald (Eds.). Resea#'ch Methods for Multimode Data Anatysia, Praeger, 1984b.
 
HARSHMAN84
Harshman, R.A. and Lundy, M.E. The PARAFAC model for three-way factor analysis and multi-dimensional scaling. In H.G. Law, C.W. Snyder, Jr., J.A. Hattie, and R.P. McDonald (Eds.). Research Methods for Multimode Data Analysis, Praeger, 1984a.
 
HEISER81
Heiser, W.J. Unfolding Analysis of Proximity Data. Leiden, The Netherlands: Reprodienst Psychologic RUL, 1981.
 
JARDIN71
Jardin, N. and van Rijsbergen, C.J. The use of hierarchic clustering in infomlation retrieval. Information Storage and Retrieval, 1971, 7, 217-240.
KOLL79
 
KRUSKAL78
Kruskal, J.B. Factor analysis and principal components: Bilinear methods. In H. Ka#askai, J.M. Tanur (Eds.). lnternatiG, nal Encyclopedia of Statistics, New York: Free Press, 1978.
 
LILEY54
Liley, O. Evaluation of the subject catalog. American Documentation, 1954, 5(2), 41-60,
 
OSSORIO66
Ossorio, P.G. Classification space: A multivariate procedure for automatic document indexing and retrieval. Multivariate Behavioral Research, October 1966, 479-524.
 
SALTON68
 
SALTON83
 
SPARCK71
Sparck Jones, K. Automatic Keyword Classification for Information Retrieval, Buttersworth, London, 1971.
 
SPARCK72
Sparck Jones, K. A stati.stical interpretation of term specificity and its applications in retrieval. Journal of Documentation, March 1972, 28(1), 11-21.
 
STREETER87
Streeter, L. A, and l_x#hb#um, K. E. An expert/expert-locating system based on aultomatic representation of semantic suucture. Proceedings of the Fourth Conference on Artificial Intelligence Applications. March 14-18, 1987, San Deigo, CA., pp. 345-350.
 
TARR74
Tart, D. and Borko, H. Factors influencing inter-indexer consistency. In Proceedings of the ASIS 37th Annual Meeting, Vol. 11, 1974, 50-55.

CITED BY  44

Collaborative Colleagues:
G. W. Furnas: colleagues
S. Deerwester: colleagues
S. T. Dumais: colleagues
T. K. Landauer: colleagues
R. A. Harshman: colleagues
L. A. Streeter: colleagues
K. E. Lochbaum: colleagues