|
ABSTRACT
Many artificial intelligence tasks, such as automated question answering, reasoning or heterogeneous database integration, involve verification of a semantic category (e.g. "coffee" is a drink, "red" is a color, while "steak" is not a drink and "big" is not a color). We present a novel algorithm to automatically validate a semantic category. Contrary to the methods suggested earlier, our approach does not rely on any manually codified knowledge but instead capitalizes on the diversity of topics and word usage on the World Wide Web. We have tested our approach within our online fact-seeking (question answering) environment. When tested on the TREC questions that expect the answer to belong to a specific semantic category, our approach has improved the accuracy by up to 14% depending on the model and metrics used.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Boris Katz, Jimmy Lin, Daniel Loreto, Wesley Hildebrandt, Matthew Bilotti, Sue Felshin, Aaron Fernandes, Gregory Marton, and Federico Mora. Integrating Web-based and Corpus-based Techniques for Question Answering. (2003). Proceedings of the Twelfth Text REtrieval Conference (TREC 2003), November 2003, Gaithersburg, Maryland.
|
| |
3
|
|
| |
4
|
Brill, E., Dumais, S. and Banko, M., An Analysis of the AskMSR Question-Answering System, Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, Philadelphia, PA, USA, July 6-7.
|
| |
5
|
Downey, Oren Etzioni, and Stephen Soderland (2005). A Probabilistic Model of Redundancy in Information Extraction. IJCAI-05.
|
 |
6
|
Susan Dumais , Michele Banko , Eric Brill , Jimmy Lin , Andrew Ng, Web question answering: is more always better?, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, August 11-15, 2002, Tampere, Finland
[doi> 10.1145/564376.564428]
|
| |
7
|
E. Agirre, O. Ansa, E. Hovy, and D. Martinez. Enriching very large ontologies using the WWW. In Proceedings of the ECAI Ontology Learning Workshop, 2000.
|
| |
8
|
|
| |
9
|
Oren Etzioni , Michael Cafarella , Doug Downey , Ana-Maria Popescu , Tal Shaked , Stephen Soderland , Daniel S. Weld , Alexander Yates, Unsupervised named-entity extraction from the web: an experimental study, Artificial Intelligence, v.165 n.1, p.91-134, June 2005
[doi> 10.1016/j.artint.2005.03.001]
|
| |
10
|
Harabagiu, S., Moldovan, D., Pasca, M., Mihalcea, R., Surdeanu, M., Bunescu, R., Girju, R., Rus, V., and Morarescu, P. (2000). Falcon: Boosting knowledge for answer engines. In NIST Special Publication 500-249: The Ninth Text REtrieval Conference (TREC 9), pages 479--488, Gaithersburg, Maryland, November 13-16.
|
| |
11
|
K. Ahmad, M. Tariq, B. Vrusias, and C. Handy. Corpus-based thesaurus construction for image retrieval in specialist domains. In Proceedings of the 25th European Conference on Advances in Information Retrieval (ECIR), pages 502--510, 2003.
|
| |
12
|
K. Markert, N. Modjeska, and M. Nissim. Using the web for nominal anaphora resolution. In EACL Workshop on the Computational Treatment of Anaphora, 2003.
|
| |
13
|
Katz, B. (1997). From Sentence Processing to Information Access on the World Wide Web. In Natural Language Processing for the World Wide Web: Papers from the 1997 AAAI Spring Symposium, pages 77--94, 1997.
|
 |
14
|
|
 |
15
|
|
| |
16
|
M. Poesio, T. Ishikawa, S. Schulte im Walde, and R. Viera. Acquiring lexical knowledge for anaphora resolution. In Proceedings of the 3rd Conference on Language Resources and Evaluation (LREC), 2002.
|
| |
17
|
|
 |
18
|
|
 |
19
|
|
| |
20
|
|
 |
21
|
Dragomir R. Radev , Hong Qi , Zhiping Zheng , Sasha Blair-Goldensohn , Zhu Zhang , Weiguo Fan , John Prager, Mining the web for answers to natural language questions, Proceedings of the tenth international conference on Information and knowledge management, October 05-10, 2001, Atlanta, Georgia, USA
[doi> 10.1145/502585.502610]
|
| |
22
|
|
| |
23
|
Ravichandran, D., and Hovy, E. (2002). Learning surface text patterns for a question answering system. In Proceedings of ACL, 2002.
|
| |
24
|
Roussinov, D., Chau., M., Filatova, E., Robles, J., Building on Redundancy: Factoid Question Answering, Robust Retrieval and the "Other". In proceedings of TREC 2005, Nov. 15-18, 2005.
|
| |
25
|
Schlobach, S., Olsthoorn, M., and de Rijke, M. (2004). Type Checking in Open-Domain Question Answering (Extended Abstract), In: R. Verbrugge, N. Taatgen, and L. Schomaker, editors, Proceedings BNAIC 2004, pages 367--368, 2004.
|
| |
26
|
Soubbotin, M. and Soubbotin, S. (2002). Use of patterns for detection of likely answer strings: A systematic approach. Proceedings of the Eleventh Text Retrieval Conference TREC 2002. Gaithersburg, Maryland, November 19-22.
|
| |
27
|
Voorhees, E. and Buckland, L. P., Eds. (2004). Proceedings of the Eleventh Text Retrieval Conference TREC 2004. Gaithersburg, Maryland, November 16-19.
|
|