| Regression-based latent factor models |
| Full text |
Mov
(16:53),
Pdf
(655 KB)
|
Source
|
International Conference on Knowledge Discovery and Data Mining
archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
table of contents
Paris, France
SESSION: Research track papers
table of contents
Pages 19-28
Year of Publication: 2009
ISBN:978-1-60558-495-9
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 65, Downloads (12 Months): 255, Citation Count: 0
|
|
|
ABSTRACT
We propose a novel latent factor model to accurately predict response for large scale dyadic data in the presence of features. Our approach is based on a model that predicts response as a multiplicative function of row and column latent factors that are estimated through separate regressions on known row and column features. In fact, our model provides a single unified framework to address both cold and warm start scenarios that are commonplace in practical applications like recommender systems, online advertising, web search, etc. We provide scalable and accurate model fitting methods based on Iterated Conditional Mode and Monte Carlo EM algorithms. We show our model induces a stochastic process on the dyadic space with kernel (covariance) given by a polynomial function of features. Methods that generalize our procedure to estimate factors in an online fashion for dynamic applications are also considered. Our method is illustrated on benchmark datasets and a novel content recommendation application that arises in the context of Yahoo! Front Page. We report significant improvements over several commonly used methods on all datasets.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
KDD cup and workshop. 2007.
|
 |
2
|
|
| |
3
|
D. Agarwal and B.-C. Chen, et al. Online models for content optimization. In NIPS, 2008.
|
 |
4
|
|
| |
5
|
G. Allenby, P. Rossi, and R. McCulloch. Hierarchical bayes models: A practitioner's guide. http://ssrn.com/abstract=655541, 2005.
|
 |
6
|
|
| |
7
|
|
 |
8
|
|
| |
9
|
J. Booth and J. Hobert. Maximizing generalized linear mixed model likelihoods with an automated monte carlo EM algorithm. J.R.Statist. Soc. B, 1999.
|
| |
10
|
M. Claypool and A. Gokhale, et al. Combining content-based and collaborative filters in an online newspaper. In Recommender Systems Workshop, 1999.
|
| |
11
|
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. J. of the Royal Statistical Society, Series B, 1977.
|
| |
12
|
A. Gelman and J. Hill. Data Analysis using Regression and Multilevel/Hierarchical Models. Cambridge, 2006.
|
| |
13
|
A. Gelman and A. Jakulin, et al. A weakly informative default prior distribution for logistic and other regression models. Annals of Applied Statistics, 2008.
|
| |
14
|
Nathaniel Good , J. Ben Schafer , Joseph A. Konstan , Al Borchers , Badrul Sarwar , Jon Herlocker , John Riedl, Combining collaborative filtering with personal agents for better recommendations, Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence, p.439-446, July 18-22, 1999, Orlando, Florida, United States
|
 |
15
|
Jonathan L. Herlocker , Joseph A. Konstan , Al Borchers , John Riedl, An algorithmic framework for performing collaborative filtering, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.230-237, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312682]
|
 |
16
|
|
| |
17
|
D. L. Lee and S. Seung. Algorithms for non-negative matrix factorization. In NIPS, 2001.
|
| |
18
|
P. McCullagh and J. A. Nelder. Generalized Linear Models. Chapman&Hall/CRC, 1989.
|
| |
19
|
|
 |
20
|
Seung-Taek Park , David Pennock , Omid Madani , Nathan Good , Dennis DeCoste, Naïve filterbots for robust cold-start recommendations, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
[doi> 10.1145/1150402.1150490]
|
| |
21
|
|
 |
22
|
|
 |
23
|
|
| |
24
|
R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In NIPS, 2008.
|
 |
25
|
|
| |
26
|
A. I. Schein, L. K. Saul, and L. H. Ungar. A generalized linear model for principal component analysis of binary data. In AISTATS, 2003.
|
| |
27
|
R. Smith. Bayesian and Frequentist Approaches to Parametric Predictive Inference. Oxford University, 1999.
|
 |
28
|
|
|