|
ABSTRACT
Graph transduction methods label input data by learning a classification function that is regularized to exhibit smoothness along a graph over labeled and unlabeled samples. In practice, these algorithms are sensitive to the initial set of labels provided by the user. For instance, classification accuracy drops if the training set contains weak labels, if imbalances exist across label classes or if the labeled portion of the data is not chosen at random. This paper introduces a propagation algorithm that more reliably minimizes a cost function over both a function on the graph and a binary label matrix. The cost function generalizes prior work in graph transduction and also introduces node normalization terms for resilience to label imbalances. We demonstrate that global minimization of the function is intractable but instead provide an alternating minimization scheme that incrementally adjusts the function and the labels towards a reliable local minimum. Unlike prior methods, the resulting propagation of labels does not prematurely commit to an erroneous labeling and obtains more consistent labels. Experiments are shown for synthetic and real classification tasks including digit and text recognition. A substantial improvement in accuracy compared to state of the art semi-supervised methods is achieved. The advantage are even more dramatic when labeled instances are limited.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Chapelle, O., Sindhwani, V., & Keerthi, S. (2007). Branch and Bound for Semi-Supervised Support Vector Machines. Proc. of NIPS.
|
| |
4
|
Chapelle, O., Weston, J., & Scholkopf, B. (2003). Cluster kernels for semi-supervised learning. Proc. NIPS, 15, 1.
|
 |
5
|
|
| |
6
|
Hein, M., & Maier, M. (2006). Manifold denoising. Proc. NIPS, 19.
|
| |
7
|
|
| |
8
|
Joachims, T. (2003). Transductive learning via spectral graph partitioning. Proc. of ICML, 290--297.
|
| |
9
|
Karp, R. (1972). Reducibility among combinatorial problems. Complexity of Computer Computations, 43, 85--103.
|
 |
10
|
Gideon S. Mann , Andrew McCallum, Simple, robust, scalable semi-supervised learning via expectation regularization, Proceedings of the 24th international conference on Machine learning, p.593-600, June 20-24, 2007, Corvalis, Oregon
[doi> 10.1145/1273496.1273571]
|
| |
11
|
Ng, A., Jordan, M., & Weiss, Y. (2001). On spectral clustering: Analysis and an algorithm. Proc. NIPS, 14, 849--856.
|
 |
12
|
|
| |
13
|
Wang, J., Chang, S.-F., Zhou, X., & Wong, T. C. S. (2008). Active microscopic cellular image annotation by superposable graph transduction with imbalanced labels. IEEE CVPR. Alaska, USA.
|
| |
14
|
Zhou, D., Bousquet, O., Lal, T., Weston, J., & Scholkopf, B. (2004). Learning with local and global consistency. Proc. NIPS (pp. 321--328).
|
| |
15
|
Zhu, X. (2005). Semi-supervised learning literature survey (Technical Report 1530). Computer Sciences, University of Wisconsin-Madison.
|
| |
16
|
Zhu, X., Ghahramani, Z., & Lafferty, J. (2003). Semi-supervised learning using gaussian fields and harmonic functions. Proc. 20th ICML.
|
CITED BY 2
|
|
Tony Jebara , Jun Wang , Shih-Fu Chang, Graph construction and b-matching for semi-supervised learning, Proceedings of the 26th Annual International Conference on Machine Learning, p.441-448, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
Samuel I. Daitch , Jonathan A. Kelner , Daniel A. Spielman, Fitting a graph to vector data, Proceedings of the 26th Annual International Conference on Machine Learning, p.201-208, June 14-18, 2009, Montreal, Quebec, Canada
|
|