| Mining redundancy in candidate-bearing snippets to improve web question answering |
| Full text |
Pdf
(127 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
table of contents
Lisbon, Portugal
POSTER SESSION: Poster session
table of contents
Pages 999-1002
Year of Publication: 2007
ISBN:978-1-59593-803-9
|
|
Authors
|
|
Youzheng Wu
|
National Institute of Information and Communications Technology (NICT), Kyoto, Japan
|
|
Xinhui Hu
|
National Institute of Information and Communications Technology (NICT), Kyoto, Japan
|
|
Hideki Kashioka
|
National Institute of Information and Communications Technology (NICT), Kyoto, Japan
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 57, Citation Count: 0
|
|
|
ABSTRACT
Conventional question answering (QA) techniques independently process candidate-bearing snippets to select an exact answer to a question from candidate answers. This paper presents two novel ways of utilizing redundancy in candidate-bearing snippets to help select an exact answer to a question in our Web QA system, i.e., cluster-based language model (CLM-M) and unsupervised SVM classifier (U-SVM) techniques. The comparative experiments demonstrate that the proposed methods significantly outperform the language model-based (LM-M) and supervised SVM-based (S-SVM) techniques that do not utilize this redundancy in the candidate-bearing snippets. Using the CLM-M, the top_1 score is increased from 36.03% (LM-M) to 46.96%; and the top_1 improvement in the U-SVM over the S-SVM is about 23%. Moreover, a cross-model comparison shows that the performance ranking of these models is: U-SVM > CLM-LM > LM-M > S-SVM > R-M (the retrieval-based model).
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Abraham Ittycheriah, Salim Roukos. IBM's Statistical Question Answering System-TREC 11. In Proc. of TREC-11, Gaithersburg, Maryland, 2002.
|
| |
3
|
|
 |
4
|
|
| |
5
|
|
 |
6
|
|
 |
7
|
Susan Dumais , Michele Banko , Eric Brill , Jimmy Lin , Andrew Ng, Web question answering: is more always better?, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, August 11-15, 2002, Tampere, Finland
[doi> 10.1145/564376.564428]
|
| |
8
|
Youzheng Wu , Jun Zhao , Bo Xu , Hao Yu, Chinese named entity recognition based on multiple features, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.427-434, October 06-08, 2005, Vancouver, British Columbia, Canada
[doi> 10.3115/1220575.1220629]
|
| |
9
|
Youzheng Wu, Ruiqiang Zhang, Xinhui HU and Hideki Kashioka. Learning Unsupervised SVM Classifer for Answer Selection in Question Answering. In Proc. of EMNLP-2007, Prague, Czech, pp33--41, 2007.
|
|