When printed hypertexts go digital: information extraction from the parsing of indices
Source
Conference on Hypertext and Hypermedia
Proceedings of the 20th ACM Conference on Hypertext and Hypermedia
Torino, Italy
POSTER SESSION: Posters
Pages 357-358
Year of Publication: 2009
ISBN: 978-1-60558-486-7
Authors
Matteo Romanello  The Perseus Project - Tufts University, Medford, MA, USA
Monica Berti  The Perseus Project - Tufts University, Medford, MA, USA
Alison Babeu  The Perseus Project - Tufts University, Medford, MA, USA
Gregory Crane  The Perseus Project - Tufts University, Medford, MA, USA
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1557914.1557987

ABSTRACT

Modern critical editions of ancient works generally include manually created indices of the other sources quoted in the text. Since such indices can be regarded as a form of domain-specific language, this paper presents a parsing-based approach to extracting information from them, in support of creating a collection of fragmentary texts. The paper first considers the characteristics and structure of quotation indices and their importance when dealing with fragmentary texts. It then presents the results of applying a fuzzy parser to the OCR transcription of an index of quotations, in order to extract information from potentially noisy input.
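
As a rough illustration of the idea of tolerant parsing over noisy index lines, here is a minimal sketch in Python. It is not the parser described in the paper: the entry layout ("Author, Work locus ... pages"), the `IndexEntry` type, the `parse_line` and `parse_index` functions, and the OCR confusion table are all assumptions made for this example. Lines that do not fit the assumed layout are skipped instead of aborting the parse, and digit-like OCR confusions are normalized only inside numeric fields.

```python
import re
from dataclasses import dataclass
from typing import List, Optional

# Assumed OCR confusions inside numeric fields (hypothetical table).
OCR_DIGIT_FIXES = str.maketrans({"l": "1", "I": "1", "O": "0", "o": "0"})

# Assumed entry layout: "Author, Work. locus ..... page, page"
ENTRY = re.compile(
    r"^\s*(?P<author>[A-Za-z]+)\s*,\s*"        # author name
    r"(?P<work>[A-Za-z]+\.?)\s+"               # abbreviated work title
    r"(?P<locus>[0-9lIOo]+(?:\.[0-9lIOo]+)*)"  # locus such as 2.53
    r"[\s.]*"                                  # dot leaders and spaces
    r"(?P<pages>[0-9lIOo][0-9lIOo,\s]*)$"      # comma-separated pages
)


@dataclass
class IndexEntry:
    author: str
    work: str
    locus: str
    pages: List[int]


def parse_line(line: str) -> Optional[IndexEntry]:
    """Parse one index line; return None if it does not fit the layout."""
    m = ENTRY.match(line.rstrip())
    if m is None:
        return None
    # Normalize digit look-alikes only in the numeric fields.
    locus = m.group("locus").translate(OCR_DIGIT_FIXES)
    pages_text = m.group("pages").translate(OCR_DIGIT_FIXES)
    pages = [int(p) for p in re.findall(r"[0-9]+", pages_text)]
    return IndexEntry(m.group("author"), m.group("work"), locus, pages)


def parse_index(text: str) -> List[IndexEntry]:
    """Keep the lines that parse and silently skip the rest."""
    parsed = (parse_line(line) for line in text.splitlines())
    return [entry for entry in parsed if entry is not None]


if __name__ == "__main__":
    sample = (
        "Homerus, Il. 1.1 ........ 23, 45\n"
        "Herodotus, Hist. 2.53 .... l2\n"   # "l2" is an OCR slip for "12"
        "~~ page 412 ~~\n"                  # running header, not an entry
    )
    for entry in parse_index(sample):
        print(entry)
```

Skipping unparseable lines trades recall for robustness: a real index would likely need a richer grammar (multi-word author names, nested sub-entries, Greek text), which this sketch does not attempt.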


