ACM Home Page
Please provide us with feedback. Feedback
On the quantitative analysis of deep belief networks
Full text PdfPdf (374 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 872-879  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Authors
Ruslan Salakhutdinov  University of Toronto, Toronto, Ontario, Canada
Iain Murray  University of Toronto, Toronto, Ontario, Canada
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 59,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390266
What is a DOI?

ABSTRACT

Deep Belief Networks (DBN's) are generative models that contain many layers of hidden variables. Efficient greedy algorithms for learning and approximate inference have allowed these models to be applied successfully in many application domains. The main building block of a DBN is a bipartite undirected graphical model called a restricted Boltzmann machine (RBM). Due to the presence of the partition function, model selection, complexity control, and exact maximum likelihood learning in RBM's are intractable. We show that Annealed Importance Sampling (AIS) can be used to efficiently estimate the partition function of an RBM, and we present a novel AIS scheme for comparing RBM's with different architectures. We further show how an AIS estimator, along with approximate inference, can be used to estimate a lower bound on the log-probability that a DBN model with multiple hidden layers assigns to the test data. This is, to our knowledge, the first step towards obtaining quantitative results that would allow us to directly assess the performance of Deep Belief Networks as generative models of data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Bengio, Y., & LeCun, Y. (2007). Scaling learning algorithms towards AI. Large-Scale Kernel Machines. MIT Press.
 
2
Carreira-Perpinan, M., & Hinton, G. (2005). On contrastive divergence learning. 10th Int. Workshop on Artificial Intelligence and Statistics (AISTATS'2005).
3
 
4
Hinton, & Salakhutdinov (2006). Reducing the dimensionality of data with neural networks. Science, 313, 504--507.
 
5
 
6
 
7
 
8
Neal, R. M. (1993). Probabilistic inference using Markov chain Monte Carlo methods (Technical Report CRG-TR-93-1). Department of Computer Science, University of Toronto.
 
9
 
10
Neal, R. M. (2005). Estimating ratios of normalizing constants using linked importance sampling (Technical Report 0511). Department of Statistics, University of Toronto.
 
11
Osindero, S., & Hinton, G. (2008). Modeling image patches with a directed hierarchy of Markov random fields. NIPS 20. Cambridge, MA: MIT Press.
12
 
13
Skilling, J. (2004). Nested sampling. Bayesian inference and maximum entropy methods in science and engineering, AIP Conference Proceeedings, 735, 395--405.
 
14
Taylor, G. W., Hinton, G. E., & Roweis, S. T. (2006). Modeling human motion using binary latent variables. Advances in Neural Information Processing Systems. MIT Press.
 
15
Wainwright, M. J., Jaakkola, T., & Willsky, A. S. (2005). A new class of upper bounds on the log partition function. IEEE Transactions on Information Theory, 51, 2313--2335.
 
16
Yedidia, J. S., Freeman, W. T., & Weiss, Y. (2005). Constructing free-energy approximations and generalized belief propagation algorithms. IEEE Transactions on Information Theory, 51, 2282--2312.


Collaborative Colleagues:
Ruslan Salakhutdinov: colleagues
Iain Murray: colleagues