ACM Home Page
Please provide us with feedback. Feedback
Evaluation methods for topic models
Full text PdfPdf (715 KB)
Source ACM International Conference Proceeding Series; Vol. 382 archive
Proceedings of the 26th Annual International Conference on Machine Learning table of contents
Montreal, Quebec, Canada
Pages 1105-1112  
Year of Publication: 2009
ISBN:978-1-60558-516-1
Authors
Hanna M. Wallach  University of Massachusetts, Amherst, MA
Iain Murray  University of Toronto, Toronto, Ontario, Canada
Ruslan Salakhutdinov  University of Toronto, Toronto, Ontario, Canada
David Mimno  University of Massachusetts, Amherst, MA
Sponsors
: MITACS
: NSF
Microsoft Research : Microsoft Research
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 39,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1553374.1553515
What is a DOI?

ABSTRACT

A natural evaluation metric for statistical topic models is the probability of held-out documents given a trained model. While exact computation of this probability is intractable, several estimators for this probability have been used in the topic modeling literature, including the harmonic mean method and empirical likelihood method. In this paper, we demonstrate experimentally that commonly-used methods are unlikely to accurately estimate the probability of held-out documents, and propose two alternative methods that are both accurate and efficient.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Chib, S. (1995). Marginal likelihood from the Gibbs output. J. American Stat. Assoc., 90, 1313--1321.
 
3
Del Moral, P., Doucet, A., & Jasra, A. (2006). Sequential Monte Carlo samplers. J. Royal Stat. Soc. B, 68, 1--26.
 
4
Gogate, V., Bidyuk, B., & Dechter, R. (2007). Studies in lower bounding probability of evidence using the Markov inequality. Proc. Conf. on Uncertainty in Artificial Intelligence (pp. 141--148).
 
5
Griffiths, T., & Steyvers, M. (2004). Finding scientific topics. Proc. Natl. Acad. Sci., 101, 5228--5235.
 
6
Griffiths, T. L., Steyvers, M., Blei, D. M., & Tenen-baum, J. B. (2005). Integrating topics and syntax. Proc. Neural Information Processing Systems (pp. 537--544).
7
 
8
McCallum, A. (2002). MALLET: A machine learning for language toolkit. http://mallet.cs.umass.edu.
 
9
Murray, I., & Salakhutdinov, R. (2009). Evaluating probabilities under high-dimensional latent variable models. Proc. Neural Information Processing Systems (pp. 1137--1144).
 
10
 
11
Newton, M. A., & Raftery, A. E. (1994). Approximate Bayesian inference with the weighted likelihood bootstrap. J. Royal Stat. Soc. B, 56, 3--48.
 
12
 
13
Teh, Y., Jordan, M., Beal, M., & Blei, D. (2006). Hierarchical Dirichlet processes. J. American Stat. Assoc., 101, 1566--1581.
14
 
15
Wallach, H. M. (2008). Structured topic models for language. PhD thesis, University of Cambridge.

Collaborative Colleagues:
Hanna M. Wallach: colleagues
Iain Murray: colleagues
Ruslan Salakhutdinov: colleagues
David Mimno: colleagues