| Support vector machine learning for interdependent and structured output spaces |
| Full text |
Pdf
(197 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 69
archive
Proceedings of the twenty-first international conference on Machine learning
table of contents
Banff, Alberta, Canada
Page: 104
Year of Publication: 2004
ISBN:1-58113-828-5
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 30, Downloads (12 Months): 278, Citation Count: 70
|
|
|
ABSTRACT
Learning general functional dependencies is one of the main goals in machine learning. Recent progress in kernel-based methods has focused on designing flexible and powerful input representations. This paper addresses the complementary issue of problems involving complex outputs such as multiple dependent output variables and structured output spaces. We propose to generalize multiclass Support Vector Machine learning in a formulation that involves features extracted jointly from inputs and outputs. The resulting optimization problem is solved efficiently by a cutting plane algorithm that exploits the sparseness and structural decomposition of the problem. We demonstrate the versatility and effectiveness of our method on problems ranging from supervised grammar learning and named-entity recognition, to taxonomic text classification and sequence alignment.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Altun, Y., Tsochantaridis, I., & Hofmann, T. (2003). Hidden markov support vector machines. ICML.
|
| |
2
|
|
| |
3
|
Collins, M. (2004). Parameter estimation for statistical parsing models: Theory and practice of distribution-free methods.
|
| |
4
|
|
| |
5
|
Hofmann, T., Tsochantaridis, I., & Altun, Y. (2002). Learning over structured output spaces via joint kernel functions. Sixth Kernel Workshop.
|
| |
6
|
Joachims, T. (2003). Learning to align sequences: A maximum-margin approach (Technical Report). Cornell University.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
Taskar, B., Guestrin, C., & Koller, D. (2004). Maxmargin markov networks. NIPS 16.
|
| |
11
|
|
| |
12
|
Weston, J., Chapelle, O., Elisseeff, A., Schölkopf, B., & Vapnik, V. (2003). Kernel dependency estimation. NIPS 15.
|
| |
13
|
Weston, J., & Watkins, C. (1998). Multi-class support vector machines (Technical Report CSD-TR-98-04). Department of Computer Science, Royal Holloway, University of London.
|
CITED BY 70
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Anton Leuski , Jarrell Pair , David Traum , Peter J. McNerney , Panayiotis Georgiou , Ronakkumar Patel, How to talk to a hologram, Proceedings of the 11th international conference on Intelligent user interfaces, January 29-February 01, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
Juho Rousu , Craig Saunders , Sandor Szedmak , John Shawe-Taylor, Learning hierarchical multi-category text classification models, Proceedings of the 22nd international conference on Machine learning, p.744-751, August 07-11, 2005, Bonn, Germany
|
|
|
Eugene Ie , Jason Weston , William Stafford Noble , Christina Leslie, Multi-class protein fold recognition using adaptive codes, Proceedings of the 22nd international conference on Machine learning, p.329-336, August 07-11, 2005, Bonn, Germany
|
|
|
Ben Taskar , Vassil Chatalbashev , Daphne Koller , Carlos Guestrin, Learning structured prediction models: a large margin approach, Proceedings of the 22nd international conference on Machine learning, p.896-903, August 07-11, 2005, Bonn, Germany
|
|
|
|
|
|
|
|
|
|
|
|
Linli Xu , Dana Wilkinson , Finnegan Southey , Dale Schuurmans, Discriminative unsupervised learning of structured predictors, Proceedings of the 23rd international conference on Machine learning, p.1057-1064, June 25-29, 2006, Pittsburgh, Pennsylvania
|
|
|
Michael Fink , Shai Shalev-Shwartz , Yoram Singer , Shimon Ullman, Online multiclass learning by interclass hypothesis sharing, Proceedings of the 23rd international conference on Machine learning, p.313-320, June 25-29, 2006, Pittsburgh, Pennsylvania
|
|
|
|
|
|
|
|
|
|
|
|
Jari Björne , Juho Heimonen , Filip Ginter , Antti Airola , Tapio Pahikkala , Tapio Salakoski, Extracting complex biological events with rich graph-based feature sets, Proceedings of the Workshop on BioNLP: Shared Task, June 05-05, 2009, Boulder, Colorado
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Guo-Jun Qi , Xian-Sheng Hua , Yong Rui , Jinhui Tang , Tao Mei , Hong-Jiang Zhang, Correlative multi-label video annotation, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
|
|
|
|
Zhen Guo , Zhongfei Zhang , Eric Xing , Christos Faloutsos, Enhanced max margin learning on multimodal data mining in a multimedia database, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|
|
|
|
|
Guo-Jun Qi , Xian-Sheng Hua , Yong Rui , Jinhui Tang , Tao Mei , Meng Wang , Hong-Jiang Zhang, Correlative multilabel video annotation with temporal kernels, ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), v.5 n.1, p.1-27, October 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Qun Ni , Jorge Lobo , Seraphin Calo , Pankaj Rohatgi , Elisa Bertino, Automating role-based provisioning by learning from examples, Proceedings of the 14th ACM symposium on Access control models and technologies, June 03-05, 2009, Stresa, Italy
|
|
|
|
|
|
|
|
|
André F. T. Martins , Noah A. Smith , Eric P. Xing, Polyhedral outer approximations with application to natural language parsing, Proceedings of the 26th Annual International Conference on Machine Learning, p.713-720, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|