ACM Home Page
Please provide us with feedback. Feedback
A vector space model for automatic indexing
Full text PdfPdf (687 KB)
Source
Communications of the ACM archive
Volume 18 ,  Issue 11  (November 1975) table of contents
Pages: 613 - 620  
Year of Publication: 1975
ISSN:0001-0782
Authors
G. Salton  Cornell Univ., Ithaca, NY
A. Wong  Cornell Univ., Ithaca, NY
C. S. Yang  Cornell Univ., Ithaca, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 173,   Downloads (12 Months): 1070,   Citation Count: 292
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/361219.361220
What is a DOI?

ABSTRACT

In a document retrieval, or other pattern matching environment where stored entities (documents) are compared with each other or with incoming patterns (search requests), it appears that the best indexing (property) space is one where each entity lies as far away from the others as possible; in these circumstances the value of an indexing system may be expressible as a function of the density of the object space; in particular, retrieval performance may correlate inversely with space density. An approach based on space density computations is used to choose an optimum indexing vocabulary for a collection of documents. Typical evaluation results are shown, demonstating the usefulness of the model.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Salton, G., and Yang, C.S. On the specification of term values in automatic indexing. J. Documen. 29, 4 (Dec. 1973), 351-372.
 
3
Sparck Jones, K. A statistical interpretation of term specificity and its application to retrieval. J. Documen. 28, 1 (March 1972), 11-20.
 
4
Williamson, R.E. Real-time document retrieval. Ph.D. Th., Computer Sci. Dep., Cornell U., June 1974.
 
5
Wong, A. An investigation of the effects of different indexing methods on the document space configuration. Sci. Rep. ISR-22, Computer Sci. Dep., Cornell U., Section II, Nov. 1974.
 
6
 
7
Salton, G., Yang, C.S., and Yu, C.T. Contribution to the theory of indexing. Proc. IFIP Congress 74, Stockholm, August 1974. American Elsevier, New York, 1974.

CITED BY  294

Collaborative Colleagues:
G. Salton: colleagues
A. Wong: colleagues
C. S. Yang: colleagues