ACM Home Page
Please provide us with feedback. Feedback
Indexing and searching handwritten medical forms
Full text PdfPdf (304 KB)
Source dg.o; Vol. 151 archive
Proceedings of the 2006 international conference on Digital government research table of contents
San Diego, California
SESSION: Crisis management 1 table of contents
Pages: 73 - 74  
Year of Publication: 2006
Author
Venu Govindaraju  University at Buffalo, NY
Sponsor
NSF : National Science Foundation
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 28,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1146598.1146625
What is a DOI?

ABSTRACT

Extracting and reading handwritten data from medical forms is an important task in medical informatics as it paves the way for efficient archival, indexing, and retrieval. This paper addresses two important challenges: (i) extraction of handwritten text data from images of carbon copies, and (ii) intelligent use of context to reduce lexicons to make the task of handwriting recognition tractable. We have developed a smart binarization algorithm targeted to carbon copy images that outperforms methods reported in the literature. The lexicon reduction method is based on learning the medical concept, and hence the probable medical terms to be encountered in the narrative part that describes the chief complaint of the patient by training on OCR output. In our experiments, we have worked with about 600 medical forms, 20 medical concepts, and a lexicon size of 4,700. We have observed that if the concept is one of top 3 choices, the lexicon can be reduced by two-thirds on an unseen form.