| An evaluation of type-10 homograph discrimination at the semi-colon level in Roget's International Thesaurus |
| Full text |
Pdf
(410 KB)
|
| Source
|
Symposium on Small Systems
archive
Proceedings of the 1990 ACM SIGSMALL/PC symposium on Small systems
table of contents
Crystal City, Virginia, United States
Pages: 156 - 159
Year of Publication: 1990
ISBN:0-89791-347-7
|
|
Authors
|
|
John Talburt
|
Department of Computer and Information Science, University of Arkansas at Little Rock, 2801 S. University Ave., Little Rock, AR
|
|
Donna Mooney
|
Department of Electronics and Instrumentation, University of Arkansas at Little Rock, 2801 S. University Ave., Little Rock, AR
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 14, Citation Count: 0
|
|
|
ABSTRACT
This paper reports the results of evaluating a large sample of the 23,858 type-10 homographs found in Roget's International Thesaurus (3rd Ed.) as defined by the Bryan Model of abstract thesauri, of which Roget's is an instantiation. According to the Bryan model, two different entries in a thesaurus that have the same spelling are homographs (semantically unrelated) if and only if they cannot be the endpoints of a sequence of entries called a type-10 chain. The Bryan definition of a type-10 homograph has not been tested thoroughly until recently because of the combinatorial complexity associated with a direct application of the definition to a large instantiation such as Roget's. However, in 1989, the authors were able to decompose Roget's in into its type-10 components, and as a result, generate all 23,858 type-10 homographs at the semi-colon category level.
The principal result is that Bryan's definition of homographs by type-10 semantic disjunction does not appear to work uniformly over a broad range of entries in Roget's when the selected semantic category is the semi-colon group. Although there are many cases where type-10 homographs agree with conventional classifications, in general type-10 discrimination at the semi-colon level “over discriminates” in that it generates many more homographs than are found in standard English language dictionaries.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Robert M. Bryan, "Abstract Thesauri and Graph Theory Applications to Thesaurus Research", in Sally Yeates Sedelow, ed., Automated Laneuaee A~ (University of Kansas, Lawrence, 1973; also Defense Documentation Center, #AD 774- 692).
|
| |
2
|
Archibald B. Patrick, An Exploration of an Abstract Thesaurus Instantia ili09~(M.S. Thesis, Computer Science Department, University of Kansas, Lawrence, 1985).
|
| |
3
|
Roger's International Thesauri, Third Edition (New York, 1963).
|
| |
4
|
Sally Yeates Sedelow and Donna M. Mooney, "Knowledge Retrieval from Domain Transcendent Expert Systems: Research Results", Proceedings of the 5!st AS!S Annual Meeting, October, 1988.
|
| |
5
|
Sally Yeates Sedelow and Walter A. Sedelow, Jr., "Thesamal Knowledge Representation", Proceedings. Advances in Lexicology, Second Annual Conference of the UW Centre for the New Oxford English Dictionary (University of Waterloo, Canada, 1986) 29-43.
|
| |
6
|
|
| |
7
|
John R. Talburt and Donna M. Mooney, "Determination of Strongly Connected Components in Abstract Thesauri by the Method of Quartets", Proceeding: ACM Workshop on Applied Computing, March, 1989, Stillwater, Oklahoma, in Press.
|
| |
8
|
John R. Talburt and Donna M. Mooney, "The Decomposition of Roget's International Thesaurus into Type-10 Semantically Strong Components", Proceedings~ ACM South Central Regional Conference, November, 1989, Tulsa, Oklahoma, In Press.
|
| |
9
|
John R. Talburt and Donna M. Mooney, "The Determination of All Type-10 Homographs in Roger's International Thesaurus", Proceedings: Third Oklahoma Symposium on Artificial Intelligence, November, 1989, Tulsa, Oklahoma, In Press.
|
| |
10
|
Webster's N e..w Collegiate Dictionary, (G. & C. Merriam Co., 1980).
|
|