ACM Home Page
Please provide us with feedback. Feedback
A syntactic tree matching approach to finding similar questions in community-based qa services
Full text PdfPdf (662 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
SESSION: Question answering table of contents
Pages 187-194  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Kai Wang  National University of Singapore, Singapore, Singapore
Zhaoyan Ming  National University of Singapore, Singapore, Singapore
Tat-Seng Chua  National University of Singapore, Singapore, Singapore
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 56,   Downloads (12 Months): 186,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1571975
What is a DOI?

ABSTRACT

While traditional question answering (QA) systems tailored to the TREC QA task work relatively well for simple questions, they do not suffice to answer real world questions. The community-based QA systems offer this service well, as they contain large archives of such questions where manually crafted answers are directly available. However, finding similar questions in the QA archive is not trivial. In this paper, we propose a new retrieval framework based on syntactic tree structure to tackle the similar question matching problem. We build a ground-truth set from Yahoo! Answers, and experimental results show that our method outperforms traditional bag-of-word or tree kernel based methods by 8.3% in mean average precision. It further achieves up to 50% improvement by incorporating semantic features as well as matching of potential answers. Our model does not rely on training, and it is demonstrated to be robust against grammatical errors as well.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Trec proceedings. http://trec.nist.gov/proceedings/proceedings.html.
2
3
 
4
R. D. Burke, K. J. Hammond, V. A. Kulyukin, S. L. Lytinen, N. Tomuro, and S. Schoenberg. Question answering from frequently asked question files: Experiences with the faq finder system. AI Magazine, 18(2):57--66, 1997.
 
5
M. Collins and N. Duffy. Convolution kernels for natural language. In Advances in Neural Information Processing Systems 14, pages 625--632. MIT Press, 2001.
6
 
7
A. Diekema, X. Liu, J. Chen, H.Wang, N. Mccracken, O. Yilmazel, and E. D. Liddy. Question answering: Cnlp at the trec-9 question answering track. In Proceedings of the Ninth Text REtrieval Conference (TREC-9), pages 501--510. Department of Commerce, National Institute of Standards and Technology, 2000.
8
9
10
 
11
 
12
C. Leacock and M. Chodrow. Combining local context and WordNet similarity for word sense identification. In WordNet: An Electronic Lexical Database. MIT Press, 1998.
 
13
A. Moschitti. Efficient convolution kernels for dependency and constituent syntactic trees. In ECML, pages 318--329. Springer, 2006.
 
14
A. Moschitti, S. Quarteroni, R. Basili, and S. Manandhar. Exploiting syntactic and shallow semantic kernels for question answer classification. In ACL. The Association for Computer Linguistics, 2007.
 
15
S. Riezler, A. Vasserman, I. Tsochantaridis, V. O. Mittal, and Y. Liu. Statistical machine translation for query expansion in answer retrieval. In ACL. The Association for Computer Linguistics, 2007.
16
 
17
R. Soricut and E. Brill. Automatic question answering: Beyond the factoid. In HLT-NAACL, pages 57--64, 2004.
18
 
19
D. Zhang and W. S. Lee. Question classification using support vector machines. In SIGIR '03, pages 26--32. ACM, 2003.

Collaborative Colleagues:
Kai Wang: colleagues
Zhaoyan Ming: colleagues
Tat-Seng Chua: colleagues