ACM Home Page
Please provide us with feedback. Feedback
Towards smarter documents
Full text PdfPdf (225 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the thirteenth ACM international conference on Information and knowledge management table of contents
Washington, D.C., USA
SESSION: KM-4 (knowledge management): distributed knowledge management table of contents
Pages: 634 - 641  
Year of Publication: 2004
ISBN:1-58113-874-1
Authors
Vikas Krishna  IBM Almaden Research Center, San Jose, CA
Prasad M. Deshpande  IBM Almaden Research Center, San Jose, CA
Savitha Srinivasan  IBM Almaden Research Center, San Jose, CA
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 19,   Downloads (12 Months): 80,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1031171.1031287
What is a DOI?

ABSTRACT

Document analysis research typically focuses on document image understanding or classic problems in text classification, clustering, summarization and discovery. While that is an important aspect of document management, in practice, documents lifecycles are often determined by the context of the business process that they are relevant to. It therefore becomes necessary for the document analysis techniques to recognize and leverage the contextual information provided by a supporting schema and business process. This paper presents an intelligent document management framework with relevant document analysis, metadata extraction, and business process association algorithms and methodology. The architecture supporting this framework seamlessly integrates a runtime environment with an authoring environment by combining relational data modeling tools with document classification techniques. The runtime environment accepts incoming documents, classifies the document, extracts metadata and executes customized business logic. The authoring environment supports the association of a class of documents with a relational document schema, identification of attribute values that must be extracted automatically, generation of relevant business logic, and deployment of authoring artifacts into the runtime architecture. We demonstrate the use of this framework with representative real-world document transformative applications.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Andries M. and Engels, G., A hybrid query language for the extended entity relationship model. In Journal of Visual Languages and Computing, 8(1), 1997, Special Issue on Visual Query Systems.
 
2
Angelaccio, M., Catarci, T. & Santucci, G., QBD*: A Fully Visual Query System. Journal on Visual Languages and Computing, 1(2), 255--273, 1990.
 
3
Bagdanov, A.D., Worring, M. Fine-Grained Document Genre Classification Using First Order Random Graphs. In Proceedings of ICDAR 01
 
4
Catarci, T., Costabile, M.F., Levialdi, S. and Batini, C. Visual Query Systems for Databases: A Survey. Technical Report SI/RR-95/17, Dipartimento di Scienze dell'Informazione, Universita' di Roma "La Sapienza", 1995.
5
 
6
See http://www-306.ibm.com/software/data/cm/
7
 
8
See: http://www.eclipse.org/emf/
9
 
10
 
11
 
12
 
13
Lyman, Peter and Hal R. Varian, How Much Information, 2000. Retrieved from <http://www.sims.berkeley.edu/how-much-info>
14
15
16
 
17
Gornik, D. UML Data Modeling Profile. IBM Rational Software Whitepaper TP 162 05/02, 2003.
 
18
Gornik, D. Data Modeling for Data Warehouses. IBM Rational Software Whitepaper TP 161 05/02, 2002.
19
 
20
See <http://www.rational.com/eda/ras/preview/index.htm>
21


Collaborative Colleagues:
Vikas Krishna: colleagues
Prasad M. Deshpande: colleagues
Savitha Srinivasan: colleagues