ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Kernel methods, syntax and semantics for relational text categorization
Full text PdfPdf (369 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
SESSION: IR/KM: machine learning table of contents
Pages: 253-262  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Author
Alessandro Moschitti  University of Trento, Povo di Trento, Italy
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 135,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458118
What is a DOI?

ABSTRACT

Previous work on Natural Language Processing for Information Retrieval has shown the inadequateness of semantic and syntactic structures for both document retrieval and categorization. The main reason is the high reliability and effectiveness of language models, which are sufficient to accurately solve such retrieval tasks. However, when the latter involve the computation of relational semantics between text fragments simple statistical models may result ineffective. In this paper, we show that syntactic and semantic structures can be used to greatly improve complex categorization tasks such as determining if an answer correctly responds to a question. Given the high complexity of representing semantic/syntactic structures in learning algorithms, we applied kernel methods along with Support Vector Machines to better exploit the needed relational information. Our experiments on answer classification on Web and TREC data show that our models greatly improve on bag-of-words.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Allan. Natural language processing for information retrieval. In NAACL/ANLP (tutorial notes), 2000.
2
 
3
S. Blair-Goldensohn, K. R. McKeown, and A. H. Schlaikjer. Answering definitional questions: A hybrid approach. In M. Maybury, editor, New Directions In Question Answering. AAAI Press, 2004.
 
4
 
5
X. Carreras and L. Màrquez. Introduction to the CoNLL-2005 shared task: SRL. In CoNLL, 2005.
 
6
 
7
8
9
 
10
 
11
C. Cumby and D. Roth. Kernel Methods for Relational Learning. In Proceedings of ICML 2003, pages 107--114, Washington, DC, USA, 2003.
 
12
J. Furnkranz, T. Mitchell, and E. Rilof. A case study in using linguistic phrases for text categorization on the www. In Working Notes of the AAAI/ICML, Workshop on Learning for Text Categorization, 1998.
 
13
A. Hickl, J. Williams, J. Bensley, K. Roberts, Y. Shi, and B. Rink. Question answering with LCC's CHAUCER at TREC 2006. In Proceedings of TREC'06, 2006.
 
14
R. Jackendoff. Semantic Structures. MIT Press, 1990.
 
15
 
16
 
17
 
18
H. Kazawa, H. Isozaki, and E. Maeda. NTT Question Answering system in TREC 2001. In TREC'01, 2001.
 
19
P. Kingsbury and M. Palmer. From Treebank to PropBank. In LREC'02, 2002.
 
20
 
21
22
 
23
 
24
M. Surdeanu, M. Ciaramita and H. Zaragoza. Learning to rank answers on large online QA collections. In ACL, 2008.
 
25
A. Moschitti. Efficient convolution kernels for dependency and constituent syntactic trees. In ECML'06, 2006.
 
26
A. Moschitti and R. Basili. Complex linguistic features for text classification: A comprehensive study. In S. McDonald and J. Tait, editors, ECIR, Sunderland, UK, 2004.
 
27
A. Moschitti, B. Coppola, A. Giuglea, and R. Basili. Hierarchical semantic role labeling. In CoNLL 2005 shared task, 2005.
 
28
A. Moschitti, S. Quarteroni, R. Basili, and S. Manandhar. Exploiting syntactic and shallow semantic kernels for question/answer classification. In ACL'07, Prague, Czech Republic, 2007.
 
29
 
30
 
31
D. Shen and M. Lapata. Using semantic roles to improve Question Answering. In Proceedings of EMNLP-CoNLL, 2007.
 
32
 
33
 
34
A. F. Smeaton. Using NLP or NLP resources for information retrieval tasks. In T. Strzalkowski, editor, Natural language information retrieval, pages 99--111. Kluwer Academic Publishers, Dordrecht, NL, 1999.
 
35
 
36
T. Strzalkowski, G. C. Stein, G. B. Wise, J. P. Carballo, P. Tapanainen, T. Jarvinen, A. Voutilainen, and J. Karlgren. Natural language information retrieval: TREC-7 report. In Text REtrieval Conference, pages 164--173, 1998.
 
37
K. Toutanova, P. Markova, and C. Manning. The Leaf Path Projection View of Parse Trees: Exploring String Kernels for HPSG Parse Selection. In D. Lin and D. Wu, editors, Proceedings of EMNLP 2004, pages 166--173, Barcelona, Spain, July 2004.
 
38
39
 
40
 
41
E. M. Voorhees. Overview of the TREC 2001 Question Answering track. In Proceedings of the Thirteenth Text REtreival Conference (TREC 2004), 2004.
 
42
Y. Wu, R. Zhang, X. Hu, and H. Kashioka. Learning unsupervised SVM classifier for answer selection in Web Question Answering . In Proceedings of EMNLP-CoNLL, 2007.
43
 
44


Collaborative Colleagues:
Alessandro Moschitti: colleagues