ACM Home Page
Please provide us with feedback. Feedback
Mining redundancy in candidate-bearing snippets to improve web question answering
Full text PdfPdf (127 KB)
Source
Conference on Information and Knowledge Management archive
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management table of contents
Lisbon, Portugal
POSTER SESSION: Poster session table of contents
Pages 999-1002  
Year of Publication: 2007
ISBN:978-1-59593-803-9
Authors
Youzheng Wu  National Institute of Information and Communications Technology (NICT), Kyoto, Japan
Xinhui Hu  National Institute of Information and Communications Technology (NICT), Kyoto, Japan
Hideki Kashioka  National Institute of Information and Communications Technology (NICT), Kyoto, Japan
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 57,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1321440.1321595
What is a DOI?

ABSTRACT

Conventional question answering (QA) techniques independently process candidate-bearing snippets to select an exact answer to a question from candidate answers. This paper presents two novel ways of utilizing redundancy in candidate-bearing snippets to help select an exact answer to a question in our Web QA system, i.e., cluster-based language model (CLM-M) and unsupervised SVM classifier (U-SVM) techniques. The comparative experiments demonstrate that the proposed methods significantly outperform the language model-based (LM-M) and supervised SVM-based (S-SVM) techniques that do not utilize this redundancy in the candidate-bearing snippets. Using the CLM-M, the top_1 score is increased from 36.03% (LM-M) to 46.96%; and the top_1 improvement in the U-SVM over the S-SVM is about 23%. Moreover, a cross-model comparison shows that the performance ranking of these models is: U-SVM > CLM-LM > LM-M > S-SVM > R-M (the retrieval-based model).


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Abraham Ittycheriah, Salim Roukos. IBM's Statistical Question Answering System-TREC 11. In Proc. of TREC-11, Gaithersburg, Maryland, 2002.
 
3
4
 
5
6
7
 
8
 
9
Youzheng Wu, Ruiqiang Zhang, Xinhui HU and Hideki Kashioka. Learning Unsupervised SVM Classifer for Answer Selection in Question Answering. In Proc. of EMNLP-2007, Prague, Czech, pp33--41, 2007.

Collaborative Colleagues:
Youzheng Wu: colleagues
Xinhui Hu: colleagues
Hideki Kashioka: colleagues