ABSTRACT
Previous work on Natural Language Processing for Information Retrieval has shown the inadequacy of semantic and syntactic structures for both document retrieval and categorization. The main reason is the high reliability and effectiveness of language models, which are sufficient to solve such retrieval tasks accurately. However, when these tasks involve computing relational semantics between text fragments, simple statistical models may prove ineffective. In this paper, we show that syntactic and semantic structures can be used to greatly improve complex categorization tasks such as determining whether an answer correctly responds to a question. Given the high complexity of representing semantic/syntactic structures in learning algorithms, we applied kernel methods along with Support Vector Machines to better exploit the needed relational information. Our experiments on answer classification on Web and TREC data show that our models greatly improve on bag-of-words.