| Trust region Newton methods for large-scale logistic regression |
| Full text |
Pdf
(251 KB)
|
| Source
|
ICML; Vol. 227
archive
Proceedings of the 24th international conference on Machine learning
table of contents
Corvalis, Oregon
Pages: 561 - 568
Year of Publication: 2007
ISBN:978-1-59593-793-3
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 6, Downloads (12 Months): 37, Citation Count: 6
|
|
|
ABSTRACT
Large-scale logistic regression arises in many applications such as document classification and natural language processing. In this paper, we apply a trust region Newton method to maximize the log-likelihood of the logistic regression model. The proposed method uses only approximate Newton steps in the beginning, but achieves fast convergence in the end. Experiments show that it is faster than the commonly used quasi Newton approach for logistic regression. We also compare it with linear SVM implementations.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Bernhard E. Boser , Isabelle M. Guyon , Vladimir N. Vapnik, A training algorithm for optimal margin classifiers, Proceedings of the fifth annual workshop on Computational learning theory, p.144-152, July 27-29, 1992, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/130385.130401]
|
 |
2
|
|
| |
3
|
|
| |
4
|
Komarek, P., & Moore, A. W. (2005). Making logistic regression a core data mining tool (Technical Report). Carnegie Mellon University.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
Mangasarian, O. L. (2002). A finite Newton method for classification. Optimization Methods and Software, 17, 913--929.
|
| |
10
|
|
| |
11
|
Nocedal, J., & Nash, S. G. (1991). A numerical study of the limited memory BFGS method and the truncated-newton method for large scale optimization. SIAM Journal on Optimization, 1, 358--372.
|
| |
12
|
|
| |
13
|
Steihaug, T. (1983). The conjugate gradient method and trust regions in large scale optimization. SIAM Journal on Numerical Analysis, 20, 626--637.
|
| |
14
|
Sutton, C., & McCallum, A. (2006). An introduction to conditional random fields for relational learning. In L. Getoor and B. Taskar (Eds.), Introduction to statistical relational learning. MIT Press.
|
CITED BY 6
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Gunawan Herman , Getian Ye , Jie Xu , Bang Zhang, Improving object detection by removing noisy samples from training sets, Proceeding of the 1st ACM international conference on Multimedia information retrieval, October 30-31, 2008, Vancouver, British Columbia, Canada
|
|
|
Nathan Rosenblum , Xiaojin Zhu , Barton Miller , Karen Hunt, Learning to analyze binary computer code, Proceedings of the 23rd national conference on Artificial intelligence, p.798-804, July 13-17, 2008, Chicago, Illinois
|
|