ACM Home Page
Please provide us with feedback. Feedback
An evaluation of type-10 homograph discrimination at the semi-colon level in Roget's International Thesaurus
Full text PdfPdf (410 KB)
Source Symposium on Small Systems archive
Proceedings of the 1990 ACM SIGSMALL/PC symposium on Small systems table of contents
Crystal City, Virginia, United States
Pages: 156 - 159  
Year of Publication: 1990
ISBN:0-89791-347-7
Authors
John Talburt  Department of Computer and Information Science, University of Arkansas at Little Rock, 2801 S. University Ave., Little Rock, AR
Donna Mooney  Department of Electronics and Instrumentation, University of Arkansas at Little Rock, 2801 S. University Ave., Little Rock, AR
Sponsor
SIGICE: ACM Special Interest Group on Individual Computing Environment
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 14,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/99412.99453
What is a DOI?

ABSTRACT

This paper reports the results of evaluating a large sample of the 23,858 type-10 homographs found in Roget's International Thesaurus (3rd Ed.) as defined by the Bryan Model of abstract thesauri, of which Roget's is an instantiation. According to the Bryan model, two different entries in a thesaurus that have the same spelling are homographs (semantically unrelated) if and only if they cannot be the endpoints of a sequence of entries called a type-10 chain. The Bryan definition of a type-10 homograph has not been tested thoroughly until recently because of the combinatorial complexity associated with a direct application of the definition to a large instantiation such as Roget's. However, in 1989, the authors were able to decompose Roget's in into its type-10 components, and as a result, generate all 23,858 type-10 homographs at the semi-colon category level. The principal result is that Bryan's definition of homographs by type-10 semantic disjunction does not appear to work uniformly over a broad range of entries in Roget's when the selected semantic category is the semi-colon group. Although there are many cases where type-10 homographs agree with conventional classifications, in general type-10 discrimination at the semi-colon level “over discriminates” in that it generates many more homographs than are found in standard English language dictionaries.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Robert M. Bryan, "Abstract Thesauri and Graph Theory Applications to Thesaurus Research", in Sally Yeates Sedelow, ed., Automated Laneuaee A~ (University of Kansas, Lawrence, 1973; also Defense Documentation Center, #AD 774- 692).
 
2
Archibald B. Patrick, An Exploration of an Abstract Thesaurus Instantia ili09~(M.S. Thesis, Computer Science Department, University of Kansas, Lawrence, 1985).
 
3
Roger's International Thesauri, Third Edition (New York, 1963).
 
4
Sally Yeates Sedelow and Donna M. Mooney, "Knowledge Retrieval from Domain Transcendent Expert Systems: Research Results", Proceedings of the 5!st AS!S Annual Meeting, October, 1988.
 
5
Sally Yeates Sedelow and Walter A. Sedelow, Jr., "Thesamal Knowledge Representation", Proceedings. Advances in Lexicology, Second Annual Conference of the UW Centre for the New Oxford English Dictionary (University of Waterloo, Canada, 1986) 29-43.
 
6
 
7
John R. Talburt and Donna M. Mooney, "Determination of Strongly Connected Components in Abstract Thesauri by the Method of Quartets", Proceeding: ACM Workshop on Applied Computing, March, 1989, Stillwater, Oklahoma, in Press.
 
8
John R. Talburt and Donna M. Mooney, "The Decomposition of Roget's International Thesaurus into Type-10 Semantically Strong Components", Proceedings~ ACM South Central Regional Conference, November, 1989, Tulsa, Oklahoma, In Press.
 
9
John R. Talburt and Donna M. Mooney, "The Determination of All Type-10 Homographs in Roger's International Thesaurus", Proceedings: Third Oklahoma Symposium on Artificial Intelligence, November, 1989, Tulsa, Oklahoma, In Press.
 
10
Webster's N e..w Collegiate Dictionary, (G. & C. Merriam Co., 1980).

Collaborative Colleagues:
John Talburt: colleagues
Donna Mooney: colleagues