ACM Home Page
Please provide us with feedback. Feedback
Boosting collaborative filtering based on statistical prediction errors
Full text PdfPdf (286 KB)
Source
ACM Conference On Recommender Systems archive
Proceedings of the 2008 ACM conference on Recommender systems table of contents
Lausanne, Switzerland
SESSION: Recommendation algorithms table of contents
Pages 3-10  
Year of Publication: 2008
ISBN:978-1-60558-093-7
Authors
Shengchao Ding  Chinese Academy of Sciences, Beijing, China
Shiwan Zhao  IBM China Research Laboratory, Beijing, China
Quan Yuan  IBM China Research Laboratory, Beijing, China
Xiatian Zhang  IBM China Research Laboratory, Beijing, China
Rongyao Fu  IBM China Research Laboratory, Beijing, China
Lawrence Bergman  IBM T.J. Watson Research Center, Hawthorne, NY, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 25,   Downloads (12 Months): 287,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1454008.1454011
What is a DOI?

ABSTRACT

User-based collaborative filtering methods typically predict a user's item ratings as a weighted average of the ratings given by similar users, where the weight is proportional to the user similarity. Therefore, the accuracy of user similarity is the key to the success of the recommendation, both for selecting neighborhoods and computing predictions. However, the computed similarities between users are somewhat inaccurate due to data sparsity.

For a given user, the set of neighbors selected for predicting ratings on different items typically exhibit overlap. Thus, error terms contributing to rating predictions will tend to be shared, leading to correlation of the prediction errors.

Through a set of case studies, we discovered that for a given user, the prediction errors on different items are correlated to the similarities of the corresponding items, and to the degree to which they share common neighbors.

We propose a framework to improve prediction accuracy based on these statistical prediction errors. Two different strategies to estimate the prediction error on a desired item are proposed. Our experiments show that these approaches improve the prediction accuracy of standard user based methods significantly, and they outperform other state-of-the-art methods.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
J. S. Breese, D. Heckerman, and C. M. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In G. F. Cooper and S. Moral, editors, Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence, pages 43--52, 1998.
 
3
M. Claypool, A. Gokhale, T. Miranda, P. Murnikov, D. Netes, and M. Sartin. Combining content-based and collaborative filters in an online newspaper. In Proceedings of the ACM SIGIR '99 Workshop on Recommender Systems: Algorithms and Evaluation, Berkeley, California, 1999. ACM.
4
5
 
6
7
8
 
9
J. Kiefer. Conditional confidence statements and confidence estimators. Journal of the American Statistical Association, 72(360):789--808, Dec. 1977.
10
11
 
12
 
13
14
15
16
17

Collaborative Colleagues:
Shengchao Ding: colleagues
Shiwan Zhao: colleagues
Quan Yuan: colleagues
Xiatian Zhang: colleagues
Rongyao Fu: colleagues
Lawrence Bergman: colleagues