ACM Home Page
Please provide us with feedback. Feedback
Logical document conversion: combining functional and formal knowledge
Full text PdfPdf (298 KB)
Source
Document Engineering archive
Proceedings of the 2007 ACM symposium on Document engineering table of contents
Winnipeg, Manitoba, Canada
SESSION: Layout and aesthetics table of contents
Pages: 135 - 143  
Year of Publication: 2007
ISBN:978-1-59593-776-6
Authors
Hervé Déjean  Xerox Research Centre Europe
Jean-Luc Meunier  Xerox Research Centre Europe
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 42,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1284420.1284456
What is a DOI?

ABSTRACT

We present in this paper a method for document layout analysis based on identifying the function of document elements (what they do). This approach is orthogonal and complementary to the traditional view based on the form of document elements (how they are constructed). One key advantage of such functional knowledge is that the functions of some document elements are very stable from document to document and over time. Relying on the stability of such functions, the method is not impacted by layout variability, a key issue in logical document analysis and is thus very robust and versatile. The method starts the recognition process by using functional knowledge and uses in a second step formal knowledge as a source of feedback in order to correct some errors. This allows the method to adapt to specific documents by using formal specificities.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
 
4
R. Cattoni, T. Coianiz, Geometric Layout Analysis Techniques for Document Image Understanding: a Review, TC-IRST Technical Report #9703-09, 1998.
 
5
G. Cavallo, R. Chartier, A history of reading in the west, Cambridge: Polity Press, 1999.
 
6
K. Collins-Thompson and R. Novkolov, A Clustering-Based Algorithm for Automatic Document Separation, in proceedings of the SIGIR 2002 Workshop on Information Retrieval and OCR: From Converting Content to Grasping Meaning, Tampere, Finland, 2002.
7
 
8
R. Haralick, Document Image Understanding: Geometric and Logical Layout, EEE Computer Society Conference on Computer Vision and Pattern Recognition, 1994.
 
9
Seventh International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006, Proceedings Series: Lecture Notes in Computer Science, Vol. 3872.
 
10
S. Hauser, T. Sabir, G. Thoma, OCR Correction Using Historical Relationship from Verified Text in Biomedical Citations, Proceedings of 2003 Symposium on Document Image Understanding Technology, Greenbelt, Maryland, 2003.
 
11
Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), August 29- September 1, 2005, Seoul, Korea. IEEE Computer Society.
 
12
A. Kawtrakul, C. Yingsaeree, A Unified Framework for Automatic Metadata Extraction from Electronic Document, in proceedings of the International Advanced Digital Library Conference (IADLC), Nagoya, Japan, 2005.
 
13
 
14
X. Lin, Y. Xiong. Detection and Analysis of Table of Contents Based on Content Association, IJDAR 8(2-3): 132--143, 2006.
 
15
 
16
S. Mao, A. Rosenfeld, T. Kanungo. Document structure analysis algorithms: a literature survey, Proc. SPIE Electronic Imaging, January 2003 SPIE Vol. 5010:197--207.
 
17
S. Mao, J. Woo Kim, G. R. Thoma, Style-Independent Document Labeling: Design and Performance Evaluation, SPIE 2004.
 
18
G. Mühlberger, Automated Digitisation of Printed Material for Everyone: The METADATA ENGINE Project, RLG DigiNews, Volume 6, Number 3, 2002.
 
19
A. Rosenblueth, N. Wiener, J. Bigelow, "Behavior, Purpose and Teleology", in Philosophy of Science, 10, 1943.
 
20
P. Sarkar, E. Saund, Perceptual Organization in Semantic Role Labeling, Symposium on Document Image Understanding Technology, Maryland, US, 2005.
 
21
F. Shafait, Daniel Keysers, T. M. Breuel, Performance Comparison of Six Algorithms for Page Segmentation, Workshop in Document Analysis System (DAS'06), Nelson, New-Zealand, 2006.
 
22
 
23
D. Slocombe, R. Boyd, "There is no unstructured documents," XML Europe 2002, Paris, France 2002.
 
24
K. Summers, Using White Space for Automated Document Structuring, Cornell University Computer Science Technical Report TR 94-1452.
 
25
L. Todoran, M. Woring, M. Aiello, C. Monz, Document Understanding for a broad class of documents, ISIS technical report series, Vol. 2001-15, 2001.
26

Collaborative Colleagues:
Hervé Déjean: colleagues
Jean-Luc Meunier: colleagues