ACM Home Page
Please provide us with feedback. Feedback
Bridging the lexical chasm: statistical approaches to answer-finding
Full text PdfPdf (849 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Athens, Greece
Pages: 192 - 199  
Year of Publication: 2000
ISBN:1-58113-226-3
Authors
Adam Berger  Just Research, 4616 Henry Street, Pittsburgh, PA and School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
Rich Caruana  Just Research, 4616 Henry Street, Pittsburgh, PA and School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
David Cohn  Just Research, 4616 Henry Street, Pittsburgh, PA and School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
Dayne Freitag  Just Research, 4616 Henry Street, Pittsburgh, PA and School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
Vibhu Mittal  Just Research, 4616 Henry Street, Pittsburgh, PA and School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
Sponsors
Athens U of Econ & Business : Athens University of Economics and Business
Greek Com Soc : Greek Computer Society
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 75,   Citation Count: 17
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/345508.345576
What is a DOI?

ABSTRACT

This paper investigates whether a machine can automatically learn the task of finding, within a large collection of candidate responses, the answers to questions. The learning process consists of inspecting a collection of answered questions and characterizing the relation between question and answer with a statistical model. For the purpose of learning this relation, we propose two sources of data: Usenet FAQ documents and customer service call-center dialogues from a large retail company. We will show that the task of “answer-finding” differs from both document retrieval and tradition question-answering, presenting challenges different from those found in these problems. The central aim of this work is to discover, through theoretical and empirical investigation, those statistical techniques best suited to the answer-finding problem.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
AAAI. Proceedings of the AAAl FSS on Question Answering Systems (Cape Cod, MA, November 1999).
2
 
3
 
4
 
5
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T., and Harshman, R. Indexing by latent semantic analysis. Journal of the American Society for Information Science 41, 6 (1990), 391-407.
 
6
Dempster, A., Laird, N., and Rubin, D. Maximum likefihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society 39 (1977), 1-38.
 
7
Efthimiadis, E., and Biron, P. UCLA-Okapi at TREC- 2: Query expansion experiments. In Proceedings of the Second Text Retrieval Conference (1994).
 
8
GARTNER GROUP. Gartner group report, 1998.
9
 
10
Lehnert, W. The process of question answering: A computer simulation of cognition. Lawrence Erlbaum Associates, 1978.
 
11
 
12
Weaver, W. Translation (1949). In Machine Translation of Languages. MIT Press, 1955.
13

CITED BY  17

Collaborative Colleagues:
Adam Berger: colleagues
Rich Caruana: colleagues
David Cohn: colleagues
Dayne Freitag: colleagues
Vibhu Mittal: colleagues