A cluster-based approach to thesaurus construction
Source: Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
Grenoble, France
Pages: 309 - 320  
Year of Publication: 1988
ISBN: 2-7061-0309-4
Author
C. J. Crouch  Department of Computer Science, Tulane University, New Orleans, LA
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 60,   Citation Count: 17
DOI: http://doi.acm.org/10.1145/62437.62467

ABSTRACT

The importance of a thesaurus in the successful operation of an information retrieval system is well recognized. Yet techniques which support the automatic generation of thesauri remain largely undiscovered. This paper describes one approach to the automatic generation of global thesauri, based on the discrimination value model of Salton, Yang, and Yu and on an appropriate clustering algorithm. This method has been implemented and applied to two document collections. Preliminary results indicate that this method, which produces improvements in retrieval performance in excess of 10 and 15 percent in the test collections, is viable and worthy of continued investigation.
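
The discrimination value model named in the abstract can be illustrated with a short sketch. This is not the paper's implementation. It assumes the common formulation in which a term's discrimination value is the change in collection density (average similarity of each document to the collection centroid) when that term is removed, and it assumes cosine similarity and a toy term-weight matrix. The function names are hypothetical, and the clustering step of the method is not shown.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length term-weight vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def density(docs):
    """Average similarity of each document to the collection centroid."""
    n = len(docs)
    centroid = [sum(column) / n for column in zip(*docs)]
    return sum(cosine(d, centroid) for d in docs) / n


def discrimination_values(docs):
    """Density without term k minus density with it, for every term k.

    A positive value means removing the term packs documents closer
    together, so the term helps tell documents apart. A negative value
    marks a poor discriminator.
    """
    base = density(docs)
    values = []
    for k in range(len(docs[0])):
        reduced = [d[:k] + d[k + 1:] for d in docs]  # drop term k
        values.append(density(reduced) - base)
    return values


# Toy collection (assumed data): rows are documents, columns are terms.
# Term 0 occurs in every document; terms 1 and 2 each occur in one.
docs = [[1, 1, 0],
        [1, 0, 1]]
dv = discrimination_values(docs)
```

In this toy collection the term shared by both documents receives a negative value, while the two terms that separate the documents receive positive ones.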


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

ATTAR77
 
BUCKLEY85
 
CRAWFORD75
Crawford, R., The Computation of Discrimination Values, Information Processing and Management, 11:249-253, 1975.
 
CROUCH88
 
FOX83a
 
FOX83b
Fox, E., Characteristics of Two New Experimental Collections in Computer and Information Science Containing Textual and Bibliographic Concepts, Tech. Report 83-561, Dept. of Computer Science, Cornell University, Sept. 1983.
 
SALTON75a
Salton, G., C.S. Yang, and C.T. Yu, A Theory of Term Importance in Automatic Text Analysis, Journal of the ASIS, 26(1):33-44, 1975.
SALTON75b
 
SALTON76
Salton, G., and A. Wong, On the Role of Words and Phrases in Automatic Text Analysis, Computers and Humanities, 10:69-87, 1976.
 
SALTON87
 
VANRIJSB79
 
VOORHEES85
 
VOORHEES86
 
WILLET85
