|
ABSTRACT
A content-based personalized recommendation system learns user specific profiles from user feedback so that it can deliver information tailored to each individual user's interest. A system serving millions of users can learn a better user profile for a new user, or a user with little feedback, by borrowing information from other users through the use of a Bayesian hierarchical model. Learning the model parameters to optimize the joint data likelihood from millions of users is very computationally expensive. The commonly used EM algorithm converges very slowly due to the sparseness of the data in IR applications. This paper proposes a new fast learning technique to learn a large number of individual user profiles. The efficacy and efficiency of the proposed algorithm are justified by theory and demonstrated on actual user data from Netflix and MovieLens.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. Technical report, Microsoft Research, One Microsoft Way, Redmond, WA 98052, 1998.
|
 |
3
|
|
| |
4
|
N. Cancedda, N. Cesa-Bianchi, A. Conconi, C. Gentile, C. Goutte, T. Graepel, Y. Li, J. M. Renders, J. S. Taylor, and A. Vinokourov. Kernel method for document filtering. In The Eleventh Text REtrieval Conference (TREC11). National Institute of Standards and Technology, special publication 500--249, 2003.
|
| |
5
|
C. Chelba and A. Acero. Adaptation of maximum entropy capitalizer: Little data can help a lot. In D. Lin and D. Wu, editors, Proceedings of EMNLP 2004, pages 285--292, Barcelona, Spain, July 2004. Association for Computational Linguistics.
|
| |
6
|
|
 |
7
|
Aynur Dayanik , David D. Lewis , David Madigan , Vladimir Menkov , Alexander Genkin, Constructing informative prior distributions from domain knowledge in text classification, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
[doi> 10.1145/1148170.1148255]
|
| |
8
|
J. Delgado and N. Ishii. Memory-based weighted majority prediction for recommender systems. In ACM SIGIR'99 Workshop on Recommender Systems, 1999.
|
| |
9
|
GroupLens. Movielens. http://www.grouplens.org/taxonomy/term/14, 2006.
|
| |
10
|
|
 |
11
|
Jonathan L. Herlocker , Joseph A. Konstan , Al Borchers , John Riedl, An algorithmic framework for performing collaborative filtering, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.230-237, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312682]
|
| |
12
|
|
| |
13
|
I. M. D. (IMDB). Internet movie database. http://www.imdb.com/interfaces/, 2006.
|
 |
14
|
|
 |
15
|
Joseph A. Konstan , Bradley N. Miller , David Maltz , Jonathan L. Herlocker , Lee R. Gordon , John Riedl, GroupLens: applying collaborative filtering to Usenet news, Communications of the ACM, v.40 n.3, p.77-87, March 1997
[doi> 10.1145/245108.245126]
|
| |
16
|
D. Lewis. Applying support vector machines to the TREC-2001 batch filtering and routing tasks. In Proceedings of the Eleventh Text REtrieval Conference (TREC-11), 2002.
|
| |
17
|
B. Liu, X. Li, W. S. Lee, , and P. Yu. Text classification by labeling words. In Proceedings of The Nineteenth National Conference on Artificial Intelligence (AAAI-2004), July 25-29, 2004.
|
| |
18
|
Prem Melville , Raymod J. Mooney , Ramadass Nagarajan, Content-boosted collaborative filtering for improved recommendations, Eighteenth national conference on Artificial intelligence, p.187-192, July 28-August 01, 2002, Edmonton, Alberta, Canada
|
| |
19
|
Netflix. Netflix prize. http://www.netflixprize.com (visited on Nov. 30, 2006), 2006.
|
| |
20
|
S. Robertson and K. Sparck-Jones. Relevance weighting of search terms. In Journal of the American Society for Information Science, volume 27, pages 129--146, 1976.
|
 |
21
|
|
 |
22
|
|
 |
23
|
|
 |
24
|
|
 |
25
|
|
| |
26
|
X. Zhu. Semi-supervised learning literature survey. Technical report, University of Wisconsin - Madison, December 9, 2006.
|
 |
27
|
|
CITED BY 8
|
|
Jonathan Koren , Andrew Leung , Yi Zhang , Carlos Maltzahn , Sasha Ames , Ethan Miller, Searching and navigating petabyte-scale file systems based on facets, Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing '07, November 11-11, 2007, Reno, Nevada
|
|
|
|
|
|
|
|
|
Shengchao Ding , Shiwan Zhao , Quan Yuan , Xiatian Zhang , Rongyao Fu , Lawrence Bergman, Boosting collaborative filtering based on statistical prediction errors, Proceedings of the 2008 ACM conference on Recommender systems, October 23-25, 2008, Lausanne, Switzerland
|
|
|
|
|
|
Kai Yu , Shenghuo Zhu , John Lafferty , Yihong Gong, Fast nonparametric matrix factorization for large-scale collaborative filtering, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
|
|
|
|
|