|
ABSTRACT
We present in this paper a method for document layout analysis based on identifying the function of document elements (what they do). This approach is orthogonal and complementary to the traditional view based on the form of document elements (how they are constructed). One key advantage of such functional knowledge is that the functions of some document elements are very stable from document to document and over time. Relying on the stability of such functions, the method is not impacted by layout variability, a key issue in logical document analysis and is thus very robust and versatile. The method starts the recognition process by using functional knowledge and uses in a second step formal knowledge as a source of feedback in order to correct some errors. This allows the method to adapt to specific documents by using formal specificities.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Henry S. Baird , Daniel Lopresti , Brian D. Davison , William M. Pottenger, Robust document image understanding technologies, Proceedings of the 1st ACM workshop on Hardcopy document processing, p.9-14, November 12-12, 2004, Washington, DC, USA
[doi> 10.1145/1031442.1031444]
|
| |
3
|
|
| |
4
|
R. Cattoni, T. Coianiz, Geometric Layout Analysis Techniques for Document Image Understanding: a Review, TC-IRST Technical Report #9703-09, 1998.
|
| |
5
|
G. Cavallo, R. Chartier, A history of reading in the west, Cambridge: Polity Press, 1999.
|
| |
6
|
K. Collins-Thompson and R. Novkolov, A Clustering-Based Algorithm for Automatic Document Separation, in proceedings of the SIGIR 2002 Workshop on Information Retrieval and OCR: From Converting Content to Grasping Meaning, Tampere, Finland, 2002.
|
 |
7
|
|
| |
8
|
R. Haralick, Document Image Understanding: Geometric and Logical Layout, EEE Computer Society Conference on Computer Vision and Pattern Recognition, 1994.
|
| |
9
|
Seventh International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006, Proceedings Series: Lecture Notes in Computer Science, Vol. 3872.
|
| |
10
|
S. Hauser, T. Sabir, G. Thoma, OCR Correction Using Historical Relationship from Verified Text in Biomedical Citations, Proceedings of 2003 Symposium on Document Image Understanding Technology, Greenbelt, Maryland, 2003.
|
| |
11
|
Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), August 29- September 1, 2005, Seoul, Korea. IEEE Computer Society.
|
| |
12
|
A. Kawtrakul, C. Yingsaeree, A Unified Framework for Automatic Metadata Extraction from Electronic Document, in proceedings of the International Advanced Digital Library Conference (IADLC), Nagoya, Japan, 2005.
|
| |
13
|
|
| |
14
|
X. Lin, Y. Xiong. Detection and Analysis of Table of Contents Based on Content Association, IJDAR 8(2-3): 132--143, 2006.
|
| |
15
|
|
| |
16
|
S. Mao, A. Rosenfeld, T. Kanungo. Document structure analysis algorithms: a literature survey, Proc. SPIE Electronic Imaging, January 2003 SPIE Vol. 5010:197--207.
|
| |
17
|
S. Mao, J. Woo Kim, G. R. Thoma, Style-Independent Document Labeling: Design and Performance Evaluation, SPIE 2004.
|
| |
18
|
G. Mühlberger, Automated Digitisation of Printed Material for Everyone: The METADATA ENGINE Project, RLG DigiNews, Volume 6, Number 3, 2002.
|
| |
19
|
A. Rosenblueth, N. Wiener, J. Bigelow, "Behavior, Purpose and Teleology", in Philosophy of Science, 10, 1943.
|
| |
20
|
P. Sarkar, E. Saund, Perceptual Organization in Semantic Role Labeling, Symposium on Document Image Understanding Technology, Maryland, US, 2005.
|
| |
21
|
F. Shafait, Daniel Keysers, T. M. Breuel, Performance Comparison of Six Algorithms for Page Segmentation, Workshop in Document Analysis System (DAS'06), Nelson, New-Zealand, 2006.
|
| |
22
|
|
| |
23
|
D. Slocombe, R. Boyd, "There is no unstructured documents," XML Europe 2002, Paris, France 2002.
|
| |
24
|
K. Summers, Using White Space for Automated Document Structuring, Cornell University Computer Science Technical Report TR 94-1452.
|
| |
25
|
L. Todoran, M. Woring, M. Aiello, C. Monz, Document Understanding for a broad class of documents, ISIS technical report series, Vol. 2001-15, 2001.
|
 |
26
|
Sherif Yacoub , John Burns , Paolo Faraboschi , Daniel Ortega , Jose Abad Peiro , Vinay Saxena, Document digitization lifecycle for complex magazine collection, Proceedings of the 2005 ACM symposium on Document engineering, November 02-04, 2005, Bristol, United Kingdom
[doi> 10.1145/1096601.1096650]
|
|