ACM Home Page
Please provide us with feedback. Feedback
Matchbox: large scale online bayesian recommendations
Full text PdfPdf (1.45 MB)
Source
International World Wide Web Conference archive
Proceedings of the 18th international conference on World wide web table of contents
Madrid, Spain
SESSION: Data mining/session: statistical methods table of contents
Pages 111-120  
Year of Publication: 2009
ISBN:978-1-60558-487-4
Authors
David H. Stern  Microsoft Research Ltd, Cambridge, United Kingdom
Ralf Herbrich  Microsoft Research Ltd, Cambridge, United Kingdom
Thore Graepel  Microsoft Research Ltd, Cambridge, United Kingdom
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 35,   Downloads (12 Months): 171,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1526709.1526725
What is a DOI?

ABSTRACT

We present a probabilistic model for generating personalised recommendations of items to users of a web service. The Matchbox system makes use of content information in the form of user and item meta data in combination with collaborative filtering information from previous user behavior in order to predict the value of an item for a user. Users and items are represented by feature vectors which are mapped into a low-dimensional `trait space' in which similarity is measured in terms of inner products. The model can be trained from different types of feedback in order to learn user-item preferences. Here we present three alternatives: direct observation of an absolute rating each user gives to some items, observation of a binary preference (like/ don't like) and observation of a set of ordinal ratings on a user-specific scale. Efficient inference is achieved by approximate message passing involving a combination of Expectation Propagation (EP) and Variational Message Passing. We also include a dynamics model which allows an item's popularity, a user's taste or a user's personal rating scale to drift over time. By using Assumed-Density Filtering (ADF) for training, the model requires only a single pass through the training data. This is an on-line learning algorithm capable of incrementally taking account of new data so the system can immediately reflect the latest user preferences. We evaluate the performance of the algorithm on the MovieLens and Netflix data sets consisting of approximately 1,000,000 and 100,000,000 ratings respectively. This demonstrates that training the model using the on-line ADF approach yields state-of-the-art performance with the option of improving performance further if computational resources are available by performing multiple EP passes over the training data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Netflix Cinematch: http://www.netflix.com.
2
 
3
J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th ACM Conference on Uncertainty in Artificial Intelligence, pages 34--52, 1998.
 
4
 
5
P. Dangauthier, R. Herbrich, T. Minka, and T. Graepel. Trueskill through time: Revisiting the history of chess. In Advances in Neural Information Processing Systems 20, pages 337--344, 2008.
6
7
 
8
R. Herbrich, T. Minka, and T. Graepel. TrueSkill(TM): A Bayesian skill rating system. In Advances in Neural Information Processing Systems 20, pages 569--576, 2007.
 
9
 
10
F. R. Kschischang, B. Frey, and H.-A. Loeliger. Factor graphs and the sum--product algorithm. IEEE Trans. Inform. Theory, 47(2):498--519, 2001.
11
 
12
Y. J. Lim and Y. W. Teh. Variational Bayesian approach to movie rating prediction. In Proceedings of KDD Cup and Workshop, 2007.
 
13
Benjamin Marlin. Collaborative filtering: A machine learning perspective. Master's thesis, University of Toronto, 2004.
 
14
T. Minka. Divergence measures and message passing. Technical Report MSR-TR-2007-173, Microsoft Research Ltd., 2005.
 
15
T. Minka, J.M. Winn, J.P. Guiver, and A. Kannan. Infer.NET 2.2, 2009. Microsoft Research Cambridge. http://research.microsoft.com/infernet.
 
16
17
18
19
 
20
J. M. Winn. Variational message passing and its application. PhD thesis, Department of Physics, University of Cambridge, 2003.


Collaborative Colleagues:
David H. Stern: colleagues
Ralf Herbrich: colleagues
Thore Graepel: colleagues