| A syntactic tree matching approach to finding similar questions in community-based qa services |
| Full text |
Pdf
(662 KB)
|
Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
table of contents
Boston, MA, USA
SESSION: Question answering
table of contents
Pages 187-194
Year of Publication: 2009
ISBN:978-1-60558-483-6
|
|
Authors
|
|
Kai Wang
|
National University of Singapore, Singapore, Singapore
|
|
Zhaoyan Ming
|
National University of Singapore, Singapore, Singapore
|
|
Tat-Seng Chua
|
National University of Singapore, Singapore, Singapore
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 56, Downloads (12 Months): 186, Citation Count: 0
|
|
|
ABSTRACT
While traditional question answering (QA) systems tailored to the TREC QA task work relatively well for simple questions, they do not suffice to answer real world questions. The community-based QA systems offer this service well, as they contain large archives of such questions where manually crafted answers are directly available. However, finding similar questions in the QA archive is not trivial. In this paper, we propose a new retrieval framework based on syntactic tree structure to tackle the similar question matching problem. We build a ground-truth set from Yahoo! Answers, and experimental results show that our method outperforms traditional bag-of-word or tree kernel based methods by 8.3% in mean average precision. It further achieves up to 50% improvement by incorporating semantic features as well as matching of potential answers. Our model does not rely on training, and it is demonstrated to be robust against grammatical errors as well.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Trec proceedings. http://trec.nist.gov/proceedings/proceedings.html.
|
 |
2
|
|
 |
3
|
|
| |
4
|
R. D. Burke, K. J. Hammond, V. A. Kulyukin, S. L. Lytinen, N. Tomuro, and S. Schoenberg. Question answering from frequently asked question files: Experiences with the faq finder system. AI Magazine, 18(2):57--66, 1997.
|
| |
5
|
M. Collins and N. Duffy. Convolution kernels for natural language. In Advances in Neural Information Processing Systems 14, pages 625--632. MIT Press, 2001.
|
 |
6
|
Hang Cui , Renxu Sun , Keya Li , Min-Yen Kan , Tat-Seng Chua, Question answering passage retrieval using dependency relations, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, August 15-19, 2005, Salvador, Brazil
[doi> 10.1145/1076034.1076103]
|
| |
7
|
A. Diekema, X. Liu, J. Chen, H.Wang, N. Mccracken, O. Yilmazel, and E. D. Liddy. Question answering: Cnlp at the trec-9 question answering track. In Proceedings of the Ninth Text REtrieval Conference (TREC-9), pages 501--510. Department of Commerce, National Institute of Standards and Technology, 2000.
|
 |
8
|
|
 |
9
|
|
 |
10
|
|
| |
11
|
|
| |
12
|
C. Leacock and M. Chodrow. Combining local context and WordNet similarity for word sense identification. In WordNet: An Electronic Lexical Database. MIT Press, 1998.
|
| |
13
|
A. Moschitti. Efficient convolution kernels for dependency and constituent syntactic trees. In ECML, pages 318--329. Springer, 2006.
|
| |
14
|
A. Moschitti, S. Quarteroni, R. Basili, and S. Manandhar. Exploiting syntactic and shallow semantic kernels for question answer classification. In ACL. The Association for Computer Linguistics, 2007.
|
| |
15
|
S. Riezler, A. Vasserman, I. Tsochantaridis, V. O. Mittal, and Y. Liu. Statistical machine translation for query expansion in answer retrieval. In ACL. The Association for Computer Linguistics, 2007.
|
 |
16
|
|
| |
17
|
R. Soricut and E. Brill. Automatic question answering: Beyond the factoid. In HLT-NAACL, pages 57--64, 2004.
|
 |
18
|
|
| |
19
|
D. Zhang and W. S. Lee. Question classification using support vector machines. In SIGIR '03, pages 26--32. ACM, 2003.
|
|