ACM Home Page
Please provide us with feedback. Feedback
Interactively optimizing information retrieval systems as a dueling bandits problem
Full text PdfPdf (775 KB)
Source ACM International Conference Proceeding Series; Vol. 382 archive
Proceedings of the 26th Annual International Conference on Machine Learning table of contents
Montreal, Quebec, Canada
Pages 1201-1208  
Year of Publication: 2009
ISBN:978-1-60558-516-1
Authors
Yisong Yue  Cornell University, Ithaca, NY
Thorsten Joachims  Cornell University, Ithaca, NY
Sponsors
: MITACS
: NSF
Microsoft Research : Microsoft Research
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 29,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1553374.1553527
What is a DOI?

ABSTRACT

We present an on-line learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, we only require pairwise comparisons which were shown to be reliably inferred from implicit feedback (Joachims et al., 2007; Radlinski et al., 2008b). We will present an algorithm with theoretical guarantees as well as simulation results.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Carterette, B., Bennett, P., Chickering, D. M., & Dumais, S. (2008). Here or There: Preference Judgments for Relevance. European Conference on Information Retrieval (ECIR) (pp. 16--27).
 
3
Carterette, B., & Jones, R. (2007). Evaluating Search Engines by Modeling the Relationship Between Relevance and Clicks. Neural Information Processing Systems (NIPS) (pp. 217--224).
4
 
5
Donmez, P., Svore, K., & Burges, C. (2009). On the Local Optimality of LambdaRank. ACM Conference on Information Retrieval (SIGIR).
6
 
7
8
 
9
Kleinberg, R. (2004). Nearly tight bounds for the continuum-armed bandit problem. Neural Information Processing Systems (NIPS) (pp. 697--704).
 
10
Langford, J., & Zhang, T. (2007). The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits. Neural Information Processing Systems (NIPS) (pp. 817--824).
 
11
Pandey, S., Agarwal, D., Chakrabarti, D., & Josifovski, V. (2007). Bandits for Taxonomies: A Model-based Approach. SIAM Conference on Data Mining (SDM) (pp. 216--227).
12
13
 
14
Yue, Y., Broder, J., Kleinberg, R., & Joachims, T. (2009). The K-armed Dueling Bandits Problem. Conference on Learning Theory (COLT).
15
 
16
Zinkevich, M. (2003). Online Convex Programming and Generalized Infinitesimal Gradient Ascent. International Conference on Machine Learning (ICML) (pp. 928--936).

Collaborative Colleagues:
Yisong Yue: colleagues
Thorsten Joachims: colleagues