ABSTRACT
The TREC-8 Question Answering (QA) Track was the first large-scale evaluation of domain-independent question answering systems. In addition to fostering research on the QA task, the track was used to investigate whether the evaluation methodology used for document retrieval is appropriate for a different natural language processing task. As with document relevance judging, assessors had legitimate differences of opinion as to whether a response actually answers a question, but comparative evaluation of QA systems was stable despite these differences. Creating a reusable QA test collection is fundamentally more difficult than creating a document retrieval test collection, since the QA task has no equivalent to document identifiers.