|
ABSTRACT
Our goal is to automatically answer brief factual questions of the form ``When was the Battle of Hastings?'' or ``Who wrote The Wind in the Willows?''. Since the answer to nearly any such question can now be found somewhere on the Web, the problem reduces to finding potential answers in large volumes of data and validating their accuracy. We apply a method for arbitrary passage retrieval to the first half of the problem and demonstrate that answer redundancy can be used to address the second half. The success of our approach depends on the idea that the volume of available Web data is large enough to supply the answer to most factual questions multiple times and in multiple contexts. A query is generated from a question and this query is used to select short passages that may contain the answer from a large collection of Web data. These passages are analyzed to identify candidate answers. The frequency of these candidates within the passages is used to ``vote'' for the most likely answer. The approach is experimentally tested on questions taken from the TREC-9 question-answering test collection. As an additional demonstration, the approach is extended to answer multiple choice trivia questions of the form typically asked in trivia quizzes and television game shows.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Eric Breck, John Burger, David House, Marc Light, and Inderjeet Mani. Question answering from large document collections. In 1999 AAAI Fall Symposium on Question Answering Systems, North Falmouth, MA, 1999.
|
| |
2
|
|
| |
3
|
Claire Cardie. Empirical methods in information extraction. AI Magazine, 18(4):65-79, Winter 1997.
|
| |
4
|
Claire Cardie , Vincent Ng , David Pierce , Chris Buckley, Examining the role of statistical and linguistic knowledge sources in a general-knowledge question-answering system, Proceedings of the sixth conference on Applied natural language processing, p.180-187, April 29-May 04, 2000, Seattle, Washington
[doi> 10.3115/974147.974172]
|
| |
5
|
C. L. A. Clarke, G. V. Cormack, D. I. E. Kisman, and T. R. Lynam. Question answering by passage selection. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.
|
| |
6
|
|
| |
7
|
G. V. Cormack, C. L. A. Clarke, C. R. Palmer, and D. I. E. Kisman. Fast automatic passage ranking. In 8th Text REtrieval Conference, Gaithersburg, MD, November 1999.
|
| |
8
|
Anne Diekema, Xiaoyong Liu, Jiangping Chen, Hudong Wang, Nancy McCracken, Ozgur Yilmazel, and Elizabeth D. Liddy. Question answering: CNLP at the TREC-9 question answering track. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.
|
| |
9
|
|
| |
10
|
Sanda M. Harabagiu and Steven J. Maiorano. Finding answers in large collections of texts: Paragraph indexing + abductive inference. In 1999 AAAI Fall Symposium on Question Answering Systems, pages 63-71, North Falmouth, MA, 1999.
|
| |
11
|
Sanda M. Harabagiu, Dan Moldovan, Marius Pacsca, Rada Mihalcea, Mihai Surdeanu, Razvan Bunescu, Roxana Girju, Vasile Rus, and Paul Morarescu. FALCON: Boosting knowledge for answer engines. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.
|
| |
12
|
|
| |
13
|
Eduard Hovy, Ulf Hermjakob, Chin-Yew Lin, Mike Junk, and Laurie Gerber. The Webclopedia. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.
|
| |
14
|
Abraham Ittycheriah, Martin Franz, Wei-Jing Zhu, and Adwait Ratnaparkhi. IBM's statistical question answering system. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.
|
 |
15
|
|
 |
16
|
John Prager , Eric Brown , Anni Coden , Dragomir Radev, Question-answering by predictive annotation, Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, p.184-191, July 24-28, 2000, Athens, Greece
[doi> 10.1145/345508.345574]
|
| |
17
|
John Prager, Dragomir R Radev, Eric Brown, Amni Coden, and Valerie Samn. The use of predictive annotation for question answering in TREC-8. In 8th Text REtrieval Conference, Gaithersburg, MD, 1999.
|
| |
18
|
|
| |
19
|
|
| |
20
|
Rohini Srihari and Wei Li. Information extraction supported question answering. In 8th Text REtrieval Conference, Gaithersburg, MD, 1999.
|
| |
21
|
Ellen M. Voorhees and Donna Harman, editors. Proceedings of the Ninth Text REtrieval Conference, Gaithersburg, MD, 2000. See trec.nist.goc.
|
 |
22
|
|
| |
23
|
W. A. Woods, Stephen Green, Paul Martin, and Ann Houston. Halfway to question answering. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.
|
 |
24
|
|
CITED BY 47
|
|
|
|
|
|
|
|
C. L. A. Clarke , G. V. Cormack , M. Laszlo , T. R. Lynam , E. L. Terra, The impact of corpus size on question answering performance, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, August 11-15, 2002, Tampere, Finland
|
|
|
Susan Dumais , Michele Banko , Eric Brill , Jimmy Lin , Andrew Ng, Web question answering: is more always better?, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, August 11-15, 2002, Tampere, Finland
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Thomas R. Lynam , Chris Buckley , Charles L. A. Clarke , Gordon V. Cormack, A multi-system analysis of document and term selection for blind feedback, Proceedings of the thirteenth ACM international conference on Information and knowledge management, November 08-13, 2004, Washington, D.C., USA
|
|
|
|
|
|
Ganesh Ramakrishnan , Soumen Chakrabarti , Deepa Paranjpe , Pushpak Bhattacharya, Is question answering an acquired skill?, Proceedings of the 13th international conference on World Wide Web, May 17-20, 2004, New York, NY, USA
|
|
|
David Azari , Eric Horvitz , Susan Dumais , Eric Brill, Actions, answers, and uncertainty: a decision-making perspective on Web-based question answering, Information Processing and Management: an International Journal, v.40 n.5, p.849-868, September 2004
|
|
|
|
|
|
|
|
|
Jennifer Chu-Carroll , Krzysztof Czuba , John Prager , Abraham Ittycheriah, In question answering, two heads are better than one, Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, p.24-31, May 27-June 01, 2003, Edmonton, Canada
|
|
|
|
|
|
Wei Li , Rohini K. Srihari , Xiaoge Li , M. Srikanth , Xiuhong Zhang , Cheng Niu, Extracting exact answers to questions based on structural links, proceeding of the 2002 conference on multilingual summarization and question answering, p.1-9, August 31, 2002
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Eduard Hovy , Ulf Hermjakob , Chin-Yew Lin , Deepak Ravichandran, Using knowledge to facilitate factoid answer pinpointing, Proceedings of the 19th international conference on Computational linguistics, p.1-7, August 24-September 01, 2002, Taipei, Taiwan
|
|
|
Yun Niu , Graeme Hirst , Gregory McArthur , Patricia Rodriguez-Gianolli, Answering clinical questions with role identification, Proceedings of the ACL 2003 workshop on Natural language processing in biomedicine, p.73-80, July 11-11, 2003, Sapporo, Japan
|
|
|
|
|
|
|
|
|
|
|
|
Yongping Du , Helen Meng , Xuanjing Huang , Lide Wu, The use of metadata, web-derived answer patterns and passage context to improve reading comprehension performance, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.604-611, October 06-08, 2005, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Cheng-Wei Lee , Min-Yuh Day , Cheng-Lung Sung , Yi-Hsun Lee , Tian-Jian Jiang , Chia-Wei Wu , Cheng-Wei Shih , Yu-Ren Chen , Wen-Lian Hsu, Boosting Chinese Question Answering with Two Lightweight Methods: ABSPs and SCO-QAT, ACM Transactions on Asian Language Information Processing (TALIP), v.7 n.4, p.1-29, November 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|