Ranking community answers by modeling question-answer relationships via analogical reasoning
Source
Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Boston, MA, USA
SESSION: Question answering
Pages 179-186
Year of Publication: 2009
ISBN: 978-1-60558-483-6
Authors
Xin-Jing Wang, Microsoft Research Asia, Beijing, China
Xudong Tu, Huazhong Science and Technology University, Wuhan, China
Dan Feng, Huazhong Science and Technology University, Wuhan, China
Lei Zhang, Microsoft Research Asia, Beijing, China
ABSTRACT
Finding high-quality answers has a significant impact on user satisfaction in community question answering systems. However, due to the lexical gap between questions and answers, as well as the spam that typically exists in user-generated content, filtering and ranking answers is very challenging. Previous solutions mainly focus on generating redundant features or finding textual clues using machine learning techniques; none of them treats questions and their answers as relational data, modeling them instead as independent information. Moreover, they consider only the answers to the current question and ignore any previous knowledge that could help bridge the lexical and semantic gap. We assume that answers are connected to their questions by various types of latent links, i.e., positive links indicating high-quality answers and negative links indicating incorrect answers or user-generated spam, and propose an analogical reasoning-based approach that measures the analogy between the new question-answer linkages and those of relevant knowledge containing only positive links; the candidate answer with the most analogous link is assumed to be the best answer. We conducted experiments on 29.8 million Yahoo! Answers question-answer threads and showed the effectiveness of our approach.
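The ranking idea in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the abstract does not specify the link features or the analogy measure, so the three hand-picked features in `link_features` and the use of cosine similarity as a stand-in for the analogy measure are hypothetical, as are the function names. The sketch only shows the general shape: describe each question-answer link as a vector, compare a candidate's link with the positive links of previously solved question-answer pairs, and rank candidates by how analogous their links are.

```python
import math


def link_features(question, answer):
    """Hypothetical descriptor of the link between a question and an answer.

    These three features are illustrative assumptions, not the paper's features.
    """
    q = question.lower().split()
    a = answer.lower().split()
    # Share of distinct question words that reappear in the answer.
    overlap = len(set(q) & set(a)) / max(len(set(q)), 1)
    # Answer length relative to question length, capped and scaled to [0, 1].
    length_ratio = min(len(a) / max(len(q), 1), 3.0) / 3.0
    # Share of answer tokens that look like URLs (a crude spam signal).
    url_share = sum(w.startswith("http") for w in a) / max(len(a), 1)
    return [overlap, length_ratio, url_share]


def cosine(u, v):
    """Cosine similarity, used here as a stand-in for the analogy measure."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_candidates(question, candidates, prior_pairs):
    """Rank candidate answers by mean similarity of their link to positive links.

    prior_pairs: (question, answer) pairs assumed to carry positive links,
    standing in for the "relevant knowledge" mentioned in the abstract.
    """
    if not prior_pairs:
        raise ValueError("at least one prior question-answer pair is required")
    positive = [link_features(q, a) for q, a in prior_pairs]
    scored = []
    for answer in candidates:
        link = link_features(question, answer)
        score = sum(cosine(link, p) for p in positive) / len(positive)
        scored.append((score, answer))
    # Highest score first; the top entry is taken as the best answer.
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

Calling `rank_candidates(question, candidates, prior_pairs)` returns `(score, answer)` tuples in descending order of score, so the first tuple holds the candidate whose link is most similar to the known positive links. A faithful implementation would replace both the features and the similarity function with the model described in the full paper.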