| A cluster-based approach to thesaurus construction |
| Full text |
Pdf
(958 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Grenoble, France
Pages: 309 - 320
Year of Publication: 1988
ISBN:2-7061-0309-4
|
|
Author
|
|
C. J. Crouch
|
Department of Computer Science, Tulane University, New Orleans, LA
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 60, Citation Count: 17
|
|
|
ABSTRACT
The importance of a thesaurus in the successful operation of an information retrieval system is well recognized. Yet techniques which support the automatic generation of thesauri remain largely undiscovered. This paper describes one approach to the automatic generation of global thesauri, based on the discrimination value model of Salton, Yang, and Yu and on an appropriate clustering algorithm. This method has been implemented and applied to two document collections. Preliminary results indicate that this method, which produces improvements in retrieval performance in excess of 10 and 15 percent in the test collections, is viable and worthy of continued investigation.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
ATTAR77
|
|
| |
BUCKLEY85
|
|
| |
CRAWFORD75
|
Crawford, R., The Computation of Discrimination Values, Information mroces,.#ing and Management, 11 "249-253, 1975.
|
| |
CROUCH88
|
|
| |
FOX83a
|
|
| |
FOX83b
|
Fox, E., Characteri,#tics of Two New Experimental Collections in Computer and Informatio# Science Containing Textual and Bibliographic Concepts, Tech. Report 83-561, Dept. of Computer Science, Cornelt University, Sept. 1983.
|
| |
SALTON75a
|
Salton, G., C.S. Yang, and C.T. Yu, A Theory of Term Importance in Automatic Text Analysis, Journal of the ASIS , 26(1):33-44, 1975.
|
 |
SALTON75b
|
|
| |
SALTON76
|
Salton, G., and A. Wong, On the Role of Words and Phrases in Automatic Text Analysis, Computers and Humanities, 10:69-87, 1976.
|
| |
SALTON87
|
|
| |
VANRIJSB79
|
|
| |
VOORHEES85
|
|
| |
VOORHEES86
|
|
| |
WILLET85
|
|
CITED BY 17
|
|
|
|
|
|
|
|
Tomek Strzalkowski , Jose Perez-Carballo , Mihnea Marinescu, Natural language information retrieval in digital libraries, Proceedings of the first ACM international conference on Digital libraries, p.117-125, March 20-23, 1996, Bethesda, Maryland, United States
|
|
|
P. Efraimidis , C. Glymidakis , B. Mamalis , P. Spirakis , B. Tampakas, Parallel text retrieval on a high performance supercomputer using the Vector Space Model, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, p.58-66, July 09-13, 1995, Seattle, Washington, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Masahiro Ito , Kotaro Nakayama , Takahiro Hara , Shojiro Nishio, Association thesaurus construction methods based on link co-occurrence analysis for wikipedia, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|