ACM Home Page
Please provide us with feedback. Feedback
Corpora and data preparation
Full text Publisher SitePublisher Site PdfPdf (409 KB)
Source Message Understanding Conference archive
Proceedings of the 5th conference on Message understanding table of contents
Baltimore, Maryland
SESSION: Information extraction task table of contents
Pages: 1 - 5  
Year of Publication: 1993
ISBN:1-55860-336-0
Authors
Lynn Carlson  Ft. Meade, MD
Boyan Onyshkevych  Ft. Meade, MD
Mary Ellen Okurowski  Ft. Meade, MD
Publisher
Association for Computational Linguistics  Morristown, NJ, USA
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 11,   Citation Count: 1
Additional Information:

abstract   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.3115/1072017.1072019

ABSTRACT

The data selection and data preparation efforts which led to the TIPSTER and Fifth Message Understanding Conference (MUC-5) evaluation corpora involved substantial effort, time and resources. The Government commitment to these selection and preparation efforts stems from four TIPSTER Program objectives: (1) to provide training data that would promote the development of information extraction technology, (2) to provide accurate test data to evaluate and baseline system performance in an objective manner, (3) to provide a baseline for human performance to understand and interpret machine performance, and (4) to support the larger Natural Language Processing community by making available a unique set of texts and templates in multiple domains and languages under ARPA support. This commitment was demonstrated through the managerial, technical, and administrative support to these efforts from various Government agencies, as well as through the contractual efforts with the Institute for Defense Analyses for data preparation and New Mexico State University for software tool development.


Collaborative Colleagues:
Lynn Carlson: colleagues
Boyan Onyshkevych: colleagues
Mary Ellen Okurowski: colleagues