ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
An optical character recognition approach to qualifying thresholding algorithms
Full text PdfPdf (295 KB)
Source
Document Engineering archive
Proceeding of the eighth ACM symposium on Document engineering table of contents
Sao Paulo, Brazil
SESSION: Recognizing characters table of contents
Pages: 263-266  
Year of Publication: 2008
ISBN:978-1-60558-081-4
Authors
Margaret Sturgill  Hewlett Packard Labs, Fort Collins, CO, USA
Steven J. Simske  Hewlett Packard Labs, Fort Collins, CO, USA
Sponsors
SIGDOC : ACM Special Interest Group on Systems Documentation
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 83,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1410140.1410197
What is a DOI?

ABSTRACT

Pre-processing for raster image based document segmentation begins with image thresholding, which is a binarization process separating foreground from background. In this paper, we compare an existing (Otsu), modified existing (Kittler-Illingworth) and simple peak-based thresholding approach on a set of 982 documents for which existing ground truth (full text) is available. We use the output of an open source OCR engine which incorporates an adaptive/dynamic thresholder that can be bypassed by one of the three global thresholds we tested. This allowed comparison of these three approaches in the aggregate. We then used an independently-generated dictionary as a means of characterizing thresholder efficacy. Such an approach, if successful, will provide the means for selecting an optimal thresholder in the absence of a large set of ground truthed documents. Our preliminary findings here indicate that this approach may provide a reliable means for thresholder comparison and eventually preclude the need for time-intensive human ground truthing.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Otsu, N. 1979 A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. Vol. SMC-9, No. 1, 62--66.
 
2
 
3
Mitchell, B.T. and Gillies, A.M. 1989 A model-based computer vision system for recognizing handwritten ZIP codes. Machine Vision and Applications Vol. 2, 231--243.
 
4
 
5
Rice S., Jenkins F., and Nartker T. 1995 The Fourth Annual Test of OCR Accuracy. Technical Report 95-04,, Information Science Research Institute, University of Nevada, Las Vegas
6
 
7
 
8


Collaborative Colleagues:
Margaret Sturgill: colleagues
Steven J. Simske: colleagues