ACM Home Page
Please provide us with feedback. Feedback
Locally linear metric adaptation for semi-supervised clustering
Full text PdfPdf (291 KB)
Source ACM International Conference Proceeding Series; Vol. 69 archive
Proceedings of the twenty-first international conference on Machine learning table of contents
Banff, Alberta, Canada
Page: 20  
Year of Publication: 2004
ISBN:1-58113-828-5
Authors
Hong Chang  Hong Kong University of Science and Technology, Kowloon, Hong Kong
Dit-Yan Yeung  Hong Kong University of Science and Technology, Kowloon, Hong Kong
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 59,   Citation Count: 10
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1015330.1015391
What is a DOI?

ABSTRACT

Many supervised and unsupervised learning algorithms are very sensitive to the choice of an appropriate distance metric. While classification tasks can make use of class label information for metric learning, such information is generally unavailable in conventional clustering tasks. Some recent research sought to address a variant of the conventional clustering problem called semi-supervised clustering, which performs clustering in the presence of some background knowledge or supervisory information expressed as pairwise similarity or dissimilarity constraints. However, existing metric learning methods for semi-supervised clustering mostly perform global metric learning through a linear transformation. In this paper, we propose a new metric learning method which performs nonlinear transformation globally but linear transformation locally. In particular, we formulate the learning problem as an optimization problem and present two methods for solving it. Through some toy data sets, we show empirically that our locally linear metric adaptation (LLMA) method can handle some difficult cases that cannot be handled satisfactorily by previous methods. We also demonstrate the effectiveness of our method on some real data sets.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Bar-Hillel, A., Hertz, T., Shental, N., & Weinshall, D. (2003). Learning distance functions using equivalence relations. Proceedings of the Twentieth International Conference on Machine Learning (pp. 11--18). Washington, DC, USA.
 
2
 
3
Cheung, K., Yeung, D., & Chin, R. (2002). On deformable models for visual pattern recognition. Pattern Recognition, 35, 1507--1526.
 
4
 
5
Friedman, J. (1994). Flexible metric nearest neighbor classification (Technical Report). Department of Statistics, Stanford University, Stanford, CA, USA.
 
6
Fukunaga, K., & Hostetler, L. (1973). Optimization of knearest neighbor density estimates. IEEE Transactions on Information Theory, 19, 320--326.
 
7
 
8
 
9
Kwok, J., & Tsang, I. (2003). Learning with idealized kernels. Proceedings of the Twentieth International Conference on Machine Learning (pp. 400--407). Washington, DC, USA.
 
10
 
11
 
12
Poggio, T., & Girosi, F. (1990). Networks for approximation and learning. Proceedings of the IEEE, 78, 1481--1497.
 
13
Rand, W. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66, 846--850.
 
14
Roweis, S., & Saul, L. (2000). Nonlinear dimensionality reduction by locally linear embedding. Science, 290, 2323--2326.
 
15
Shental, N., Bar-Hillel, A., Hertz, T., & Weinshall, D. (2004). Computing Gaussian mixture models with EM using equivalence constraints. In Advances in neural information processing systems 16. Cambridge, MA, USA: MIT Press. To appear.
 
16
 
17
 
18
 
19
Webb, A. (1995). Multidimensional scaling by iterative majorization using radial basis functions. Pattern Recognition, 28, 753--759.
 
20
Xing, E., Ng, A., Jordan, M., & Russell, S. (2003). Distance metric learning, with application to clustering with side-information. In S. Becker, S. Thrun and K. Obermayer (Eds.), Advances in neural information processing systems 15, 505--512. Cambridge, MA, USA: MIT Press.
 
21
Zhang, Z., Kwok, J., & Yeung, D. (2003). Parametric distance metric learning with label information. Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (pp. 1450--1452). Acapulco, Mexico.

CITED BY  10
Collaborative Colleagues:
Hong Chang: colleagues
Dit-Yan Yeung: colleagues