ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Evaluating collaborative filtering recommender systems
Full text PdfPdf (254 KB)
Source ACM Transactions on Information Systems (TOIS) archive
Volume 22 ,  Issue 1  (January 2004) table of contents
Pages: 5 - 53  
Year of Publication: 2004
ISSN:1046-8188
Authors
Jonathan L. Herlocker  Oregon State University, Corvallis, OR
Joseph A. Konstan  University of Minnesota, Minneapolis, MN
Loren G. Terveen  University of Minnesota, Minneapolis, MN
John T. Riedl  University of Minnesota, Minneapolis, MN
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 272,   Downloads (12 Months): 2202,   Citation Count: 197
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/963770.963772
What is a DOI?

ABSTRACT

Recommender systems have been evaluated in many, often incomparable, ways. In this article, we review the key decisions in evaluating collaborative filtering recommender systems: the user tasks being evaluated, the types of analysis and datasets being used, the ways in which prediction quality is measured, the evaluation of prediction attributes other than quality, and the user-based evaluation of the system as a whole. In addition to reviewing the evaluation strategies used by prior researchers, we present empirical results from the analysis of various accuracy metrics on one content domain where all the tested metrics collapsed roughly into three equivalence classes. Metrics within each equivalency class were strongly correlated, while metrics from different equivalency classes were uncorrelated.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
 
5
Bailey, B. P., Gurak, L. J., and Konstan, J. A. 2001. An examination of trust production in computer-mediated exchange. In Proceedings of the 7th Conference on Human Factors and the Web (July).
6
 
7
 
8
 
9
Breese, J. S., Heckerman, D., and Kadie, C. 1998. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence (UAI-98). G. F. Cooper, and S. Moral, Eds. Morgan-Kaufmann, San Francisco, Calif., 43--52.
10
 
11
 
12
Cleverdon, C. and Kean, M. 1968. Factors Determining the Performance of Indexing Systems. Aslib Cranfield Research Project, Cranfield, England.
 
13
Cosley, D., Lam, S. K., Albert, I., Konstan, J. A., and Riedl, J. 2003. Is seeing believing? How recommender interfaces affect users' opinions. CHI Lett. 5.
 
14
Dahlen, B. J., Konstan, J. A., Herlocker, J. L., Good, N., Borchers, A., and Riedl, J. 1998. Jump-starting movielens: User benefits of starting a collaborative filtering system with "dead data". TR 98-017. University of Minnesota.
15
16
 
17
 
18
 
19
Hanley, J. A. and Mcneil, B. J. 1982. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143, 29--36.
 
20
Harman, D. 1995. The TREC conferences. Hypertext---Information Retrieval---Multimedia: Synergieeffekte Elektronisher Informationssysteme. In Proceedings of HIM '95.
 
21
 
22
 
23
24
25
 
26
 
27
28
 
29
Le, C. T., Lindren, B. R. 1995. Construction and comparison of two receiver operating characteristics curves derived from the same samples. Biom. J. 37, 869--877.
 
30
Linton, F., Charron, A., and Joy, D. 1998. OWL: A recommender system for organziation-wide learning. In Proceedings of the 1998 Workshop on Recommender Systems 65--69.
31
32
33
 
34
Miller, B. N., Riedl, J., and Konstan, J. A. 1997. Experiences with GroupLens: Making Usenet useful again. In Proceedings of the 1997 Usenix Technical Conference.
35
 
36
 
37
Mui, L., Ang, C., and Mohtashemi, M. 2001. A Probabilistic Model for Collaborative Sanctioning. Technical Memorandum 617. MIT LCS.
38
 
39
 
40
41
 
42
43
44
 
45
Rogers, S. C. 2001. Marketing Strategies, Tactics, and Techniques : A handbook for practitioners. Quorum Books, Westport, Conn.
46
 
47
Sarwar, B. M., Karypis, G., Konstan, J. A., and Riedl, J. 2000b. Application of dimensionality reduction in recommender system--A case study. In Proceedings of the ACM WebKDD 2000 Web Mining for E-Commerce Workshop.
48
49
50
51
52
 
53
54
 
55
Swearingen, K. and Sinha, R. 2001. Beyond algorithms: An HCI perspective on recommender systems. In Proceedings of the SIGIR 2001 Workshop on Recommender Systems.
 
56
Swets, J. A. 1963. Information retrieval systems. Science 141, 245--250.
 
57
Swets, J. A. 1969. Effectiveness of information retrieval methods. Amer. Doc. 20, 72--89.
58
 
59
Voorhees, E. M. and Harman, D. K. 1999. Overview of the seventh Text REtrieval Conference (TREC-7). In NIST Special Publication 500-242 (July), E. M. Voorhees, and D. K. Harman, Eds. NIST, 1--24.
60
 
61
Whittaker, S., Terveen, L. G., and Nardi, B. 2000. Let's stop pushing the envelope and start addressing it: A reference task agenda for HCI. Human-Computer Interact. 15, 2-3 (Sept.), 75--106.
 
62

CITED BY  197

Collaborative Colleagues:
Jonathan L. Herlocker: colleagues
Joseph A. Konstan: colleagues
Loren G. Terveen: colleagues
John T. Riedl: colleagues