| Learning consensus opinion: mining data from a labeling game |
| Full text |
Pdf
(1.12 MB)
|
Source
|
International World Wide Web Conference
archive
Proceedings of the 18th international conference on World wide web
table of contents
Madrid, Spain
SESSION: Data mining/session: opinions
table of contents
Pages 121-130
Year of Publication: 2009
ISBN:978-1-60558-487-4
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 34, Downloads (12 Months): 128, Citation Count: 0
|
|
|
ABSTRACT
We consider the problem of identifying the consensus ranking for the results of a query, given preferences among those results from a set of individual users. Once consensus rankings are identified for a set of queries, these rankings can serve for both evaluation and training of retrieval and learning systems. We present a novel approach to collecting the individual user preferences over image-search results: we use a collaborative game in which players are rewarded for agreeing on which image result is best for a query. Our approach is distinct from other labeling games because we are able to elicit directly the preferences of interest with respect to image queries extracted from query logs. As a source of relevance judgments, this data provides a useful complement to click data. Furthermore, the data is free of positional biases and is collected by the game without the risk of frustrating users with non-relevant results; this risk is prevalent in standard mechanisms for debiasing clicks. We describe data collected over 34 days from a deployed version of this game that amounts to about 18 million expressed preferences between pairs. Finally, we present several approaches to modeling this data in order to extract the consensus rankings from the preferences and better sort the search results for targeted queries.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
R. Bradley and M. Terry. Rank analysis of incomplete block designs, i. the method of paired comparisons. Biometrika, pages 324--345, 1952.
|
| |
2
|
B. Carterette, P. N. Bennett, D. M. Chickering, and S. T. Dumais. Here or there: Preference judgments for relevance. In ECIR 2008, 2008.
|
 |
3
|
|
 |
4
|
|
| |
5
|
W. Cohen, R. Schapire, and Y. Singer. Learning to order things. Journal of Artificial Intelligence Research, 10:243--270, 1999.
|
 |
6
|
Cynthia Dwork , Ravi Kumar , Moni Naor , D. Sivakumar, Rank aggregation methods for the Web, Proceedings of the 10th international conference on World Wide Web, p.613-622, May 01-05, 2001, Hong Kong, Hong Kong
[doi> 10.1145/371920.372165]
|
| |
7
|
J. Fürnkranz and E. Hüllermeier. Pairwise preference learning and ranking. In ECML 2003, 2003.
|
| |
8
|
R. Herbrich and T. Graepel. Trueskill: A bayesian skill rating system. Technical Report MSR-TR-2006-80, Microsoft Research, June 2006. ftp://ftp.research.microsoft.com/pub/tr/TR-2006-80.pdf.
|
 |
9
|
|
| |
10
|
E. Law, L. von Ahn, R. Dannenberg, and M. Crawford. Tagatune: A game for music and sound annotation. In Proceedings of the 8th International Conference on Music Information Retrieval, 2007.
|
| |
11
|
R. Luce. Individual Choice Behaviours: A Theoretical Analysis. J. Wiley, New York, 1959.
|
| |
12
|
F. Radlinski and T. Joachims. Minimally invasive randomization for collecting unbiased preferences from clickthrough logs. In AAAI 2005, 2005.
|
 |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
L. Thurstone. A law of comparative judgment. Psychological Review, pages 273--286, 1927.
|
| |
17
|
L. Thurstone. Essays in Philosophy by Seventeen Doctors of Philosophy of the University of Chicago, chapter The Measurement of Psychological Value. Open Court, Chicago, 1929.
|
| |
18
|
L. Thurstone. The Measurement of Values. The University of Chicago Press, Chicago, 1959.
|
 |
19
|
|
 |
20
|
|
|