ACM Home Page
Please provide us with feedback. Feedback
Robust sparse rank learning for non-smooth ranking measures
Full text PdfPdf (478 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
SESSION: Learning to rank I table of contents
Pages 259-266  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Zhengya Sun  Institute of Automation Chinese Academy of Sciences, Beijing, China
Tao Qin  Microsoft Research Asia, Beijing, China
Qing Tao  Institute of Automation Chinese Academy of Sciences, Beijing, China
Jue Wang  Institute of Automation Chinese Academy of Sciences, Beijing, China
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 54,   Downloads (12 Months): 223,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1571987
What is a DOI?

ABSTRACT

Recently increasing attention has been focused on directly optimizing ranking measures and inducing sparsity in learning models. However, few attempts have been made to relate them together in approaching the problem of learning to rank. In this paper, we consider the sparse algorithms to directly optimize the Normalized Discounted Cumulative Gain (NDCG) which is a widely-used ranking measure. We begin by establishing a reduction framework under which we reduce ranking, as measured by NDCG, to the importance weighted pairwise classification. Furthermore, we provide a sound theoretical guarantee for this reduction, bounding the realized NDCG regret in terms of a properly weighted pairwise classification regret, which implies that good performance can be robustly transferred from pairwise classification to ranking. Based on the converted pairwise loss function, it is conceivable to take into account sparsity in ranking models and to come up with a gradient possessing certain performance guarantee. For the sake of achieving sparsity, a novel algorithm named RSRank has also been devised, which performs L1 regularization using truncated gradient descent. Finally, experimental results on benchmark collection confirm the significant advantage of RSRank in comparison with several baseline methods.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
M.R. Asia. Letor3.0: benchmark datasets for learning to rank. Microsoft Corporation, December 2008.
 
2
 
3
 
4
A. Beygelzimer, V. Dani, T. Hayes, and J. Langford. Reductions between classification tasks. Electronic Colloquium on Computational Complexity, 11, 2004.
5
 
6
C. Burges, R. Ragno, and Q. Le. Learning to rank with nonsmooth cost functions. In Advances in Neural Information Processing Systems 19, pages 193--200. MIT Press, 2007.
7
8
 
9
O. Chapelle, Q. Le, and A. Smola. Large margin optimization of ranking measures. In NIPS Workshop on Machine Learning for Web Search, 2007.
10
11
 
12
J. Langford and A. Beygelzimer. Sensitive error correcting output codes. Lecture Notes in Computer Science, 3559:158--172, June 2005.
 
13
J. Langford, L. Li and T. Zhang. Sparse online learning via truncated gradient. CoRR, abs/0806.4686, 2008.
 
14
Q. Le and A. Smola. Direct optimization of ranking measures. CoRR, abs/0704.3359, 2007.
 
15
T. Qin, T.-Y. Liu, and H. Li. A general approximation framework for direct optimization of information retrieval measures. MSR-TR-2008-164, Microsoft Research, 2008.
16
 
17
E. Voorhees. Overview of the TREC 2002 question answering track. In Proceedings of the Eleventh Text Retrieval Conference(TREC), pages 115--123, 2002.
18
 
19
J.-Y. Yeh, J.-Y. Lin, H.-R. Ke, and W.-P. Yang. Learning to rank for information retrieval using genetic programming. In LR4IR, 2007.
 
20
Y. Yue and C. Burges. On using simultaneous perturbation stochastic approximation for learning to rank, and the empirical optimality of lambdarank. MSR-TR-2007-115, Microsoft Research, 2007.
 
21
T. Zhang. Statistical behavior and consistency of classification methods based on convex risk minimization. The Annuals of Statistics, 32:56--85, 2004.
 
22
T. Zhang. Some sharp performance bounds for least squares regression with L<sub>1</sub> regularization. Technical Report TR-2007-005, Rutgers Statistics Department, 2007.
 
23

Collaborative Colleagues:
Zhengya Sun: colleagues
Tao Qin: colleagues
Qing Tao: colleagues
Jue Wang: colleagues