Interactively optimizing information retrieval systems as a dueling bandits problem
Source
ACM International Conference Proceeding Series; Vol. 382
Proceedings of the 26th Annual International Conference on Machine Learning
Montreal, Quebec, Canada
Pages 1201-1208
Year of Publication: 2009
ISBN: 978-1-60558-516-1
ABSTRACT
We present an on-line learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, we require only pairwise comparisons, which have been shown to be reliably inferable from implicit feedback (Joachims et al., 2007; Radlinski et al., 2008b). We give an algorithm with theoretical guarantees and report simulation results.
REFERENCES
[2] Carterette, B., Bennett, P., Chickering, D. M., & Dumais, S. (2008). Here or There: Preference Judgments for Relevance. European Conference on Information Retrieval (ECIR) (pp. 16-27).
[3] Carterette, B., & Jones, R. (2007). Evaluating Search Engines by Modeling the Relationship Between Relevance and Clicks. Neural Information Processing Systems (NIPS) (pp. 217-224).
[4] Chakrabarti, S., Khanna, R., Sawant, U., & Bhattacharyya, C. (2008). Structured learning for non-smooth ranking losses. Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 24-27, 2008, Las Vegas, Nevada, USA. doi:10.1145/1401890.1401906
[5] Donmez, P., Svore, K., & Burges, C. (2009). On the Local Optimality of LambdaRank. ACM Conference on Information Retrieval (SIGIR).
[8] Joachims, T., Granka, L., Pan, B., Hembrooke, H., Radlinski, F., & Gay, G. (2007). Evaluating the accuracy of implicit feedback from clicks and query reformulations in Web search. ACM Transactions on Information Systems (TOIS), 25(2), Article 7. doi:10.1145/1229179.1229181
[9] Kleinberg, R. (2004). Nearly tight bounds for the continuum-armed bandit problem. Neural Information Processing Systems (NIPS) (pp. 697-704).
[10] Langford, J., & Zhang, T. (2007). The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits. Neural Information Processing Systems (NIPS) (pp. 817-824).
[11] Pandey, S., Agarwal, D., Chakrabarti, D., & Josifovski, V. (2007). Bandits for Taxonomies: A Model-based Approach. SIAM Conference on Data Mining (SDM) (pp. 216-227).
[14] Yue, Y., Broder, J., Kleinberg, R., & Joachims, T. (2009). The K-armed Dueling Bandits Problem. Conference on Learning Theory (COLT).
[16] Zinkevich, M. (2003). Online Convex Programming and Generalized Infinitesimal Gradient Ascent. International Conference on Machine Learning (ICML) (pp. 928-936).