| Evaluation of DEFINDER: a system to mine definitions from consumer-oriented medical text |
| Full text |
Pdf
(125 KB)
|
| Source
|
International Conference on Digital Libraries
archive
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
table of contents
Roanoke, Virginia, United States
Pages: 201 - 202
Year of Publication: 2001
ISBN:1-58113-345-6
|
|
Authors
|
|
Judith L. Klavans
|
Center for Research on Information Access, Columbia University, New York, NY
|
|
Smaranda Muresan
|
Department of Computer Science, Columbia University, New York, NY
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 23, Citation Count: 6
|
|
|
ABSTRACT
In this paper we present DEFINDER, a rule-based system that mines cons umer-oriented full text articles in order to extract definitions and the terms they define. This research is part of Digital Library Project at Columbia University, entitled PERSIVAL (PErsonalized Retrieval and Summarization of Image, Video and Language resources) [5]. One goal of the project is to present information to patients in language they can understand. A key component of this stage is to provide accurate and readable lay definitions for technical terms, which may be present in articles of intermediate complexity. The focus of this short paper is on quantitative and qualitative evaluation of the DEFINDER system [3]. Our basis for comparison was definitions from Unified Medical Language System (UMLS), On-line Medical Dictionary (OMD) and Glossary of Popular and Technical Medical Terms (GPTMT). Quantitative evaluations show that DEFINDER obtained 87% precision and 75% recall and reveal the incompleteness of existing resources and the ability of DEFINDER to address gaps. Qualitative evaluation shows that the definitions extracted by our system are ranked higher in terms of user-based criteria of usability and readability than definitions from on-line specialized dictionaries. Thus the output of DEFINDER can be used to enhance existing specialized dictionaries, and also as a key feature in summarizing technical articles for non-specialist users.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Justeson, J. and Katz, S. Technical Terminology: Some Linguistic Properties and an Algorithm for Identification in Text. Natural Language Engineering. Vol 1(1). 1995. pp. 9- 27.
|
| |
3
|
Klavans J.L., Muresan S. DEFINDER: Rule-Based Methods for the Extraction of Medical Terminology and their Associated Definitions from On-line Text. Proc of AMIA 2000; pp. 1906.
|
| |
4
|
McCord M.C. The Slot Grammar system. IBM Report; 1991.
|
 |
5
|
Kathleen R. McKeown , Shih-Fu Chang , James Cimino , Steven Feiner , Carol Friedman , Luis Gravano , Vasileios Hatzivassiloglou , Steven Johnson , Desmond A. Jordan , Judith L. Klavans , André Kushniruk , Vimla Patel , Simone Teufel, PERSIVAL, a system for personalized search and summarization over multimedia healthcare information, Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries, p.331-340, January 2001, Roanoke, Virginia, United States
[doi> 10.1145/379437.379722]
|
| |
6
|
Siegal, S. and Castellan, N.J. (1988). Non-parametric statistics for the behavioural sciences (2nd Edition). New York: McGraw Hill.
|
| |
7
|
Zweigenbaum P, Bouaud J, Bachimont B, Charlet J, Seroussi B, Boisvieux JF. From Text to Knowledge: a Unifying Document-Oriented View of Analyzed Medical Language. Proceedings of IMIA WG6. 1997.
|
CITED BY 6
|
|
|
|
|
Smaranda Muresan , Samuel D. Popper , Peter T. Davis , Judith L. Klavans, Building a terminological database from heterogeneous definitional sources, Proceedings of the 2003 annual national conference on Digital government research, p.1-4, May 18-21, 2003, Boston, MA
|
|
|
|
|
|
|
|
|
|
|
|
|
|