ACM Home Page
Please provide us with feedback. Feedback
Logistic regression with an auxiliary data source
Full text PdfPdf (932 KB)
Source ACM International Conference Proceeding Series; Vol. 119 archive
Proceedings of the 22nd international conference on Machine learning table of contents
Bonn, Germany
Pages: 505 - 512  
Year of Publication: 2005
ISBN:1-59593-180-5
Authors
Xuejun Liao  Duke University, Durham, NC
Ya Xue  Duke University, Durham, NC
Lawrence Carin  Duke University, Durham, NC
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 23,   Citation Count: 6
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1102351.1102415
What is a DOI?

ABSTRACT

To achieve good generalization in supervised learning, the training and testing examples are usually required to be drawn from the same source distribution. In this paper we propose a method to relax this requirement in the context of logistic regression. Assuming Dp and Da are two sets of examples drawn from two mismatched distributions, where Da are fully labeled and Dp partially labeled, our objective is to complete the labels of Dp. We introduce an auxiliary variable μ for each example in Da to reflect its mismatch with Dp. Under an appropriate constraint the μ's are estimated as a byproduct, along with the classifier. We also present an active learning approach for selecting the labeled examples in Dp. The proposed algorithm, called "Migratory-Logit" or M-Logit, is demonstrated successfully on simulated as well as real data sets.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Bertsekas, D. P. (1999). Nonlinear programming (2nd edition). Athena Scientific.
 
2
Cohn, D. A., Ghahramani, Z., & Jordan, M. I. (1995). Active learning with statistical models. Advances in Neural Information Processing Systems, 7, 705--712.
 
3
 
4
Fedorov, V. V. (1972). Theory of optimal experiments. Academic Press.
 
5
Heckman, J. (1979). Sample selection bias as a specification error. Econometrica, 47, 153--161.
 
6
Krogh, A., & Vedelsby, J. (1995). Neural network ensembles, cross validation, and active learning. Advances in Neural Information Processing Systems, 7, 231--238.
 
7
 
8
9
10
 
11

Collaborative Colleagues:
Xuejun Liao: colleagues
Ya Xue: colleagues
Lawrence Carin: colleagues