ACM Home Page
Please provide us with feedback. Feedback
Robust bounds for classification via selective sampling
Full text PdfPdf (628 KB)
Source ACM International Conference Proceeding Series; Vol. 382 archive
Proceedings of the 26th Annual International Conference on Machine Learning table of contents
Montreal, Quebec, Canada
Pages 121-128  
Year of Publication: 2009
ISBN:978-1-60558-516-1
Authors
Nicolò Cesa-Bianchi  Università degli Studi di Milano, Milano, Italy
Claudio Gentile  Università dell'Insubria, Varese, Italy
Francesco Orabona  Idiap, Martigny, Switzerland
Sponsors
: MITACS
: NSF
Microsoft Research : Microsoft Research
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 29,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1553374.1553390
What is a DOI?

ABSTRACT

We introduce a new algorithm for binary classification in the selective sampling protocol. Our algorithm uses Regularized Least Squares (RLS) as base classifier, and for this reason it can be efficiently run in any RKHS. Unlike previous margin-based semi-supervised algorithms, our sampling condition hinges on a simultaneous upper bound on bias and variance of the RLS estimate under a simple linear label noise model. This fact allows us to prove performance bounds that hold for an arbitrary sequence of instances. In particular, we show that our sampling strategy approximates the margin of the Bayes optimal classifier to any desired accuracy ε by asking Õ (d2) queries (in the RKHS case d is replaced by a suitable spectral quantity). While these are the standard rates in the fully supervised i.i.d. case, the best previously known result in our harder setting was Õ (d34). Preliminary experiments show that some of our algorithms also exhibit a good practical performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
Balcan, M., Broder, A., & Zhang, T. (2007). Margin-based active learning. Proceedings of the 20th Annual Conference on Learning Theory (pp. 35--50).
 
4
 
5
Cavallanti, G., Cesa-Bianchi, N., & Gentile, C. (2009). Linear classification and selective sampling under low noise conditions. In Advances in Neural Information Processing Systems 21 (pp. 249--256).
 
6
 
7
 
8
 
9
Dasgupta, S., Hsu, D., & Monteleoni, C. (2008). A general agnostic active learning algorithm. In Advances in Neural Information Processing Systems 21 (pp. 353--360).
 
10
Dasgupta, S., Kalai, A. T., & Monteleoni, C. (2005). Analysis of perceptron-based active learning. Proceedings of the 18th Annual Conference on Learning Theory (pp. 249--263).
 
11
 
12
13
14
 
15
Strehl, A., & Littman, M. (2008). Online linear regression and its application to model-based reinforcement learning. In Advances in Neural Information Processing Systems 20 (pp. 631--638).
 
16
Weston, J., Bordes, A., & Bottou, L. (2005). Online (and offline) on an even tighter budget. Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics (pp. 413--420).

Collaborative Colleagues:
Nicolò Cesa-Bianchi: colleagues
Claudio Gentile: colleagues
Francesco Orabona: colleagues