ACM Home Page
Please provide us with feedback. Feedback
Estimating local optimums in EM algorithm over Gaussian mixture model
Full text PdfPdf (249 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 1240-1247  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Authors
Zhenjie Zhang  National University of Singapore, Singapore
Bing Tian Dai  National University of Singapore, Singapore
Anthony K. H. Tung  National University of Singapore, Singapore
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 71,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390312
What is a DOI?

ABSTRACT

EM algorithm is a very popular iteration-based method to estimate the parameters of Gaussian Mixture Model from a large observation set. However, in most cases, EM algorithm is not guaranteed to converge to the global optimum. Instead, it stops at some local optimums, which can be much worse than the global optimum. Therefore, it is usually required to run multiple procedures of EM algorithm with different initial configurations and return the best solution. To improve the efficiency of this scheme, we propose a new method which can estimate an upper bound on the logarithm likelihood of the local optimum, based on the current configuration after the latest EM iteration. This is accomplished by first deriving some region bounding the possible locations of local optimum, followed by some upper bound estimation on the maximum likelihood. With this estimation, we can terminate an EM algorithm procedure if the estimated local optimum is definitely worse than the best solution seen so far. Extensive experiments show that our method can effectively and efficiently accelerate conventional multiple restart EM algorithm.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Dempster, A. P., Laird, N. M., & Robin, D. B. (1977). Maximum likelihood from incomplete data via the em algorithm (with discussion). Journal of Royal Statistical Society B, 39, 1--38.
 
2
Elkan, C. (2003). Using the triangle inequality to accelerate k-means. ICML (pp. 147--153).
 
3
 
4
Kanungo, T., Mount, D., Netanyahu, N., Piatko, C., Silverman, R., & Wu, A. (2002). An efficient k-means clustering algorithm: analysis and implementation.
 
5
Lutkepohl, H. (1996). Handbook of matrices. John Wiley & Sons Ltd.
 
6
 
7
McLachlan, G., & Krishnan, T. (1996). The em algorithm and extensions. Wiley-Interscience.
 
8
McLachlan, G., & Peel, D. (2000). Finite mixture models. Wiley-Interscience.
 
9
 
10
 
11
Zhang, Z., Dai, B. T., & Tung, A. K. H. (2008). Estimating local optimums in em algorithm over gaussian mixture model. http://www.comp.nus.edu.sg/~zhangzh2/papers/em.pdf.

Collaborative Colleagues:
Zhenjie Zhang: colleagues
Bing Tian Dai: colleagues
Anthony K. H. Tung: colleagues