| Evaluation methods for topic models |
| Full text |
Pdf
(715 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 382
archive
Proceedings of the 26th Annual International Conference on Machine Learning
table of contents
Montreal, Quebec, Canada
Pages 1105-1112
Year of Publication: 2009
ISBN:978-1-60558-516-1
|
|
Authors
|
|
Hanna M. Wallach
|
University of Massachusetts, Amherst, MA
|
|
Iain Murray
|
University of Toronto, Toronto, Ontario, Canada
|
|
Ruslan Salakhutdinov
|
University of Toronto, Toronto, Ontario, Canada
|
|
David Mimno
|
University of Massachusetts, Amherst, MA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 12, Downloads (12 Months): 39, Citation Count: 0
|
|
|
ABSTRACT
A natural evaluation metric for statistical topic models is the probability of held-out documents given a trained model. While exact computation of this probability is intractable, several estimators for this probability have been used in the topic modeling literature, including the harmonic mean method and empirical likelihood method. In this paper, we demonstrate experimentally that commonly-used methods are unlikely to accurately estimate the probability of held-out documents, and propose two alternative methods that are both accurate and efficient.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Chib, S. (1995). Marginal likelihood from the Gibbs output. J. American Stat. Assoc., 90, 1313--1321.
|
| |
3
|
Del Moral, P., Doucet, A., & Jasra, A. (2006). Sequential Monte Carlo samplers. J. Royal Stat. Soc. B, 68, 1--26.
|
| |
4
|
Gogate, V., Bidyuk, B., & Dechter, R. (2007). Studies in lower bounding probability of evidence using the Markov inequality. Proc. Conf. on Uncertainty in Artificial Intelligence (pp. 141--148).
|
| |
5
|
Griffiths, T., & Steyvers, M. (2004). Finding scientific topics. Proc. Natl. Acad. Sci., 101, 5228--5235.
|
| |
6
|
Griffiths, T. L., Steyvers, M., Blei, D. M., & Tenen-baum, J. B. (2005). Integrating topics and syntax. Proc. Neural Information Processing Systems (pp. 537--544).
|
 |
7
|
|
| |
8
|
McCallum, A. (2002). MALLET: A machine learning for language toolkit. http://mallet.cs.umass.edu.
|
| |
9
|
Murray, I., & Salakhutdinov, R. (2009). Evaluating probabilities under high-dimensional latent variable models. Proc. Neural Information Processing Systems (pp. 1137--1144).
|
| |
10
|
|
| |
11
|
Newton, M. A., & Raftery, A. E. (1994). Approximate Bayesian inference with the weighted likelihood bootstrap. J. Royal Stat. Soc. B, 56, 3--48.
|
| |
12
|
Michal Rosen-Zvi , Thomas Griffiths , Mark Steyvers , Padhraic Smyth, The author-topic model for authors and documents, Proceedings of the 20th conference on Uncertainty in artificial intelligence, p.487-494, July 07-11, 2004, Banff, Canada
|
| |
13
|
Teh, Y., Jordan, M., Beal, M., & Blei, D. (2006). Hierarchical Dirichlet processes. J. American Stat. Assoc., 101, 1566--1581.
|
 |
14
|
|
| |
15
|
Wallach, H. M. (2008). Structured topic models for language. PhD thesis, University of Cambridge.
|
|