ACM Home Page
Please provide us with feedback. Feedback
Structure compilation: trading structure for features
Full text PdfPdf (343 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 592-599  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Authors
Percy Liang  University of California, Berkeley, CA
Hal Daumé, III  University of Utah, Salt Lake City, UT
Dan Klein  University of California, Berkeley, CA
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 32,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390231
What is a DOI?

ABSTRACT

Structured models often achieve excellent performance but can be slow at test time. We investigate structure compilation, where we replace structure with features, which are often computationally simpler but unfortunately statistically more complex. We analyze this tradeoff theoretically and empirically on three natural language processing tasks. We also introduce a simple method to transfer predictive power from structure to features via unlabeled data, while incurring a minimal statistical penalty.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Collins, M. (2001). Parameter estimation for statistical parsing models: Theory and practice of distribution-free methods. International Workshop on Parsing Technologies.
 
3
Crammar, K., Kearns, M., & Wortman, J. (2007). Learning from multiple sources. Advances in Neural Information Processing Systems (NIPS).
 
4
 
5
 
6
 
7
Petrov, S., & Klein, D. (2008). Discriminative log-linear grammars with latent variables. Advances in Neural Information Processing Systems (NIPS).
 
8
Pollard, D. (1984). Convergence of stochastic processes. Springer-Verlag.
 
9
Punyakanok, V., Roth, D., Yih, W., & Zimak, D. (2005). Learning and inference over constrained output. International Joint Conference on Artificial Intelligence (IJCAI).
 
10
Sutton, C., & McCallum, A. (2005). Piecewise training of undirected models. Uncertainty in Artificial Intelligence (UAI).
 
11

Collaborative Colleagues:
Percy Liang: colleagues
Hal Daumé, III: colleagues
Dan Klein: colleagues