ACM Home Page
Please provide us with feedback. Feedback
Unsupervised learning of soft patterns for generating definitions from online news
Full text PdfPdf (166 KB)
Source International World Wide Web Conference archive
Proceedings of the 13th international conference on World Wide Web table of contents
New York, NY, USA
SESSION: Information extraction table of contents
Pages: 90 - 99  
Year of Publication: 2004
ISBN:1-58113-844-X
Authors
Hang Cui  National University of Singapore, Singapore
Min-Yen Kan  National University of Singapore, Singapore
Tat-Seng Chua  National University of Singapore, Singapore
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 82,   Citation Count: 16
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/988672.988686
What is a DOI?

ABSTRACT

Breaking news often contains timely definitions and descriptions of current terms, organizations and personalities. We utilize such web sources to construct definitions for such terms. Previous work has identified definitions using hand-crafted rules or supervised learning that constructs rigid, hard text patterns. In contrast, we demonstrate a new approach that uses flexible, soft matching patterns to characterize definition sentences. Our soft patterns are able to effectively accommodate the diversity of definition sentence structure exhibited in news. We use pseudo-relevance feedback to automatically label sentences for use in soft pattern generation. The application of our unsupervised method significantly improves baseline systems on both the standardized TREC corpus as well as crawled online news articles by 27% and 30%, respectively, in terms of F measure. When applied to a state-of-art definition generation system recently fielded in the TREC 2003 definitional question answering task, it improves the performance by 14%.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
S. Blair-Goldensohn, K. R. McKeown and A. H. Schlaikjer. A Hybrid Approach for Answering Definitional Questions. Technical Report CUCS-006-03. Columbia University, 2003.
 
3
C. Buckley, G. Salton, J. Allan, and A. Singhal, Automatic query expansion using SMART, NIST Special Publication 500-225: The Third Text Retrieval conference (TREC 3), 1995, pp. 69--80.
4
 
5
S. Harabagiu, D. Moldovan, R. Mihalcea M. Pasca, R. Bunescu, M. Surdeanu, R. Girju, V. Rus, and P. Morarescu, Falcon: Boosting knowledge for answer engines, Proc. of Ninth Text Retrieval Conference (TREC 9), pp. 479--488, 2000.
 
6
J. L. Klavans, S. Popper and R. Passonneau, Tackling the Internet Glossary Glut: Automatic extraction and Evaluation of Genus Phrases, In Proceedings of Semantic Web Workshop, SIGIR 2003, July 28 - Aug. 1, 2003, Toronto, Canada.
 
7
J. Lannon, 1991, Technical Writing, Ch 5, HarperCollins Publishers Inc., 1991.
 
8
9
 
10
I. Muslea. Extraction patterns for information extraction tasks: A survey. In AAAI-99 Workshop on Machine Learning for Information Extraction, 1999, pp. 1--6.
 
11
U. Y. Nahm and R. J. Mooney, 2001, Mining softmatching rules from textual data. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-2001), pp. 979--984.
 
12
 
13
 
14
 
15
E. Riloff. Automatically generating extraction patterns from untagged text. In Proceedings of the 13th National Conference on Artificial Intelligence (AAAI-96), AAAI Press, 1996, pp. 1044--1049.
 
16
 
17
 
18
 
19
 
20
 
21
 
22
H. Yang, H. Cui, M.-Y. Kan, M. Maslennikov, L. Qiu, T.-S. Chua, QUALIFIER in TREC-12 QA Main Task, In Proceedings of the Twelfth Annual Text Retrieval Conference (TREC- 12), NIST, November, 2003.
 
23

CITED BY  17

Collaborative Colleagues:
Hang Cui: colleagues
Min-Yen Kan: colleagues
Tat-Seng Chua: colleagues