ACM Home Page
Please provide us with feedback. Feedback
Laplace maximum margin Markov networks
Full text PdfPdf (752 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 1256-1263  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Authors
Jun Zhu  Carnegie Mellon University, Pittsburgh, PA and Tsinghua University, Beijing, China
Eric P. Xing  Carnegie Mellon University, Pittsburgh, PA
Bo Zhang  Tsinghua University, Beijing, China
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 64,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390314
What is a DOI?

ABSTRACT

We propose Laplace max-margin Markov networks (LapM3N), and a general class of Bayesian M3N (BM3N) of which the LapM3N is a special case with sparse structural bias, for robust structured prediction. BM3N generalizes extant structured prediction rules based on point estimator to a Bayes-predictor using a learnt distribution of rules. We present a novel Structured Maximum Entropy Discrimination (SMED) formalism for combining Bayesian and max-margin learning of Markov networks for structured prediction, and our approach subsumes the conventional M3N as a special case. An efficient learning algorithm based on variational inference and standard convex-optimization solvers for M3N, and a generalization bound are offered. Our method outperforms competing ones on both synthetic and real OCR data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Altun, Y., Tsochantaridis, I., & Hofmann, T. (2003). Hidden markov support vector machines. ICML.
2
 
3
Bartlett, P., Collins, M., Taskar, B., & McAllester, D. (2004). Exponentiated gradient algorithms for larg-margin structured classification. NIPS.
 
4
Bennett, K. P., & Mangasarian, O. L. (1992). Robust linear programming discrimination of two linearly inseparable sets. Optim. Methods Softw, 23--34.
5
 
6
 
7
8
 
9
Jaakkola, T., Meila, M., & Jebara, T. (1999). Maximum entropy discrimination. NIPS.
 
10
 
11
 
12
 
13
Lebanon, G., & Lafferty, J. (2001). Boosting and maximum likelihood for exponential models. NIPS.
 
14
Lee, S.-I., Ganapathi, V., & Koller, D. (2006). Efficient structure learning of markov networks using l1-regularization. NIPS.
 
15
Qi, Y. A., Szummer, M., & Minka, T. P. (2005). Bayesian conditional random fields. AISTATS.
 
16
Ratliff, N. D., Bagnell, J. A., & Zinkevich, M. A. (2007). (online) subgradient methods for structured prediction. AISTATS.
 
17
Taskar, B., Guestrin, C., & Koller, D. (2003). Max-margin markov networks. NIPS.
 
18
Taskar, B., Lacoste-Julien, S., & Jordan, M. I. (2006). Structured prediction via the extragradient method. NIPS.
 
19
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. Royal. Statist. Soc., B, 267--288.
20
 
21
Wainwright, M. J., Ravikumar, P., & Lafferty, J. (2006). High-dimensional graphical model selection using l1-regularized logistic regression. NIPS.


Collaborative Colleagues:
Jun Zhu: colleagues
Eric P. Xing: colleagues
Bo Zhang: colleagues