From rhetorical structures to document structure: shallow pragmatic analysis for document engineering
Source
Document Engineering archive
Proceedings of the 9th ACM symposium on Document engineering
Munich, Germany
SESSION: Document and linguistics (I)
Pages: 185-192
Year of Publication: 2009
ISBN: 978-1-60558-575-8
Authors
Gersende Georg  Haute Autorité de Santé, Saint-Denis La Plaine Cedex, France
Hugo Hernault  The University of Tokyo, Tokyo, Japan
Marc Cavazza  University of Teesside, Middlesbrough, United Kingdom
Helmut Prendinger  National Institute of Informatics, Tokyo, Japan
Mitsuru Ishizuka  The University of Tokyo, Tokyo, Japan
Sponsors
SIGDOC: ACM Special Interest Group for Design of Communications
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1600193.1600235

ABSTRACT

In this paper, we extend previous work on the automatic structuring of medical documents using content analysis. Our long-term objective is to take advantage of specific rhetorical markers encountered in specialized medical documents (clinical guidelines) to automatically structure free text according to its role in the document. This should make it possible to generate multiple views of the same document depending on the target audience, to generate document summaries, and to facilitate knowledge extraction from text. We have established in previous work that the structure of clinical guidelines can be refined through the identification of a limited set of deontic operators. We now propose to extend this approach by analyzing the text delimited by these operators using Rhetorical Structure Theory (RST). The emphasis on causality and time in RST proves a powerful complement to the recognition of deontic structures while retaining the same philosophy of high-level recognition of sentence structure, which can be converted into application-specific mark-ups. Throughout the paper, we illustrate our findings with results produced by the automatic processing of English guidelines for the management of hypertension and Alzheimer disease.
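
The idea summarized in the abstract (find a deontic operator, then look for causal or temporal relations in the text it delimits, then emit mark-up) can be illustrated with a very rough sketch. This is not the authors' system: the marker list, the cue-phrase table, the tag names (`recommendation`, `operator`, `scope`, `satellite`) and the example sentence are all assumptions made up for illustration, and a simple cue-phrase lookup stands in for real RST parsing.

```python
import re
from xml.sax.saxutils import escape

# Hypothetical deontic markers; longer phrases come first so that
# "should not" is preferred over "should" at the same position.
DEONTIC_MARKERS = ["should not", "must not", "should", "must",
                   "is recommended", "are recommended"]

# Hypothetical cue phrases mapped to coarse relation labels.
CUE_PHRASES = {"because": "cause", "therefore": "result",
               "after": "temporal", "before": "temporal", "until": "temporal"}

DEONTIC_RE = re.compile(
    r"\b(" + "|".join(re.escape(m) for m in DEONTIC_MARKERS) + r")\b",
    re.IGNORECASE)
CUE_RE = re.compile(
    r"\b(" + "|".join(re.escape(c) for c in CUE_PHRASES) + r")\b",
    re.IGNORECASE)


def mark_up(sentence: str) -> str:
    """Wrap the first deontic operator and its scope in illustrative tags."""
    match = DEONTIC_RE.search(sentence)
    if match is None:
        # No deontic operator found: leave the sentence as plain text.
        return f"<text>{escape(sentence)}</text>"

    before = escape(sentence[:match.start()])
    operator = escape(match.group(0))
    scope = sentence[match.end():]

    # Within the scope, the first cue phrase opens a "satellite" span
    # that runs to the end of the sentence.
    cue = CUE_RE.search(scope)
    if cue is None:
        scope_xml = escape(scope)
    else:
        relation = CUE_PHRASES[cue.group(0).lower()]
        scope_xml = (escape(scope[:cue.start()])
                     + f'<satellite relation="{relation}">'
                     + escape(scope[cue.start():])
                     + "</satellite>")

    return (f"<recommendation>{before}<operator>{operator}</operator>"
            f"<scope>{scope_xml}</scope></recommendation>")


if __name__ == "__main__":
    # Invented example sentence, not taken from any guideline.
    print(mark_up("Treatment should be started after diagnosis is confirmed."))
```

Keeping the operator, its scope, and the relation span as separate elements is one way the same sentence could later be rendered differently for different audiences; a real pipeline would need proper discourse segmentation rather than keyword matching.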

