| Robust sparse rank learning for non-smooth ranking measures |
| Full text |
Pdf
(478 KB)
|
Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
table of contents
Boston, MA, USA
SESSION: Learning to rank I
table of contents
Pages 259-266
Year of Publication: 2009
ISBN:978-1-60558-483-6
|
|
Authors
|
|
Zhengya Sun
|
Institute of Automation Chinese Academy of Sciences, Beijing, China
|
|
Tao Qin
|
Microsoft Research Asia, Beijing, China
|
|
Qing Tao
|
Institute of Automation Chinese Academy of Sciences, Beijing, China
|
|
Jue Wang
|
Institute of Automation Chinese Academy of Sciences, Beijing, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 54, Downloads (12 Months): 223, Citation Count: 0
|
|
|
ABSTRACT
Recently increasing attention has been focused on directly optimizing ranking measures and inducing sparsity in learning models. However, few attempts have been made to relate them together in approaching the problem of learning to rank. In this paper, we consider the sparse algorithms to directly optimize the Normalized Discounted Cumulative Gain (NDCG) which is a widely-used ranking measure. We begin by establishing a reduction framework under which we reduce ranking, as measured by NDCG, to the importance weighted pairwise classification. Furthermore, we provide a sound theoretical guarantee for this reduction, bounding the realized NDCG regret in terms of a properly weighted pairwise classification regret, which implies that good performance can be robustly transferred from pairwise classification to ranking. Based on the converted pairwise loss function, it is conceivable to take into account sparsity in ranking models and to come up with a gradient possessing certain performance guarantee. For the sake of achieving sparsity, a novel algorithm named RSRank has also been devised, which performs L1 regularization using truncated gradient descent. Finally, experimental results on benchmark collection confirm the significant advantage of RSRank in comparison with several baseline methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M.R. Asia. Letor3.0: benchmark datasets for learning to rank. Microsoft Corporation, December 2008.
|
| |
2
|
|
| |
3
|
Maria-Florina Balcan , Nikhil Bansal , Alina Beygelzimer , Don Coppersmith , John Langford , Gregory B. Sorkin, Robust reductions from ranking to classification, Machine Learning, v.72 n.1-2, p.139-153, August 2008
[doi> 10.1007/s10994-008-5058-6]
|
| |
4
|
A. Beygelzimer, V. Dani, T. Hayes, and J. Langford. Reductions between classification tasks. Electronic Colloquium on Computational Complexity, 11, 2004.
|
 |
5
|
Alina Beygelzimer , Varsha Dani , Tom Hayes , John Langford , Bianca Zadrozny, Error limiting reductions between classification tasks, Proceedings of the 22nd international conference on Machine learning, p.49-56, August 07-11, 2005, Bonn, Germany
[doi> 10.1145/1102351.1102358]
|
| |
6
|
C. Burges, R. Ragno, and Q. Le. Learning to rank with nonsmooth cost functions. In Advances in Neural Information Processing Systems 19, pages 193--200. MIT Press, 2007.
|
 |
7
|
Zhe Cao , Tao Qin , Tie-Yan Liu , Ming-Feng Tsai , Hang Li, Learning to rank: from pairwise approach to listwise approach, Proceedings of the 24th international conference on Machine learning, p.129-136, June 20-24, 2007, Corvalis, Oregon
[doi> 10.1145/1273496.1273513]
|
 |
8
|
Soumen Chakrabarti , Rajiv Khanna , Uma Sawant , Chiru Bhattacharyya, Structured learning for non-smooth ranking losses, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
[doi> 10.1145/1401890.1401906]
|
| |
9
|
O. Chapelle, Q. Le, and A. Smola. Large margin optimization of ranking measures. In NIPS Workshop on Machine Learning for Web Search, 2007.
|
 |
10
|
|
 |
11
|
|
| |
12
|
J. Langford and A. Beygelzimer. Sensitive error correcting output codes. Lecture Notes in Computer Science, 3559:158--172, June 2005.
|
| |
13
|
J. Langford, L. Li and T. Zhang. Sparse online learning via truncated gradient. CoRR, abs/0806.4686, 2008.
|
| |
14
|
Q. Le and A. Smola. Direct optimization of ranking measures. CoRR, abs/0704.3359, 2007.
|
| |
15
|
T. Qin, T.-Y. Liu, and H. Li. A general approximation framework for direct optimization of information retrieval measures. MSR-TR-2008-164, Microsoft Research, 2008.
|
 |
16
|
Michael Taylor , John Guiver , Stephen Robertson , Tom Minka, SoftRank: optimizing non-smooth rank metrics, Proceedings of the international conference on Web search and web data mining, February 11-12, 2008, Palo Alto, California, USA
[doi> 10.1145/1341531.1341544]
|
| |
17
|
E. Voorhees. Overview of the TREC 2002 question answering track. In Proceedings of the Eleventh Text Retrieval Conference(TREC), pages 115--123, 2002.
|
 |
18
|
|
| |
19
|
J.-Y. Yeh, J.-Y. Lin, H.-R. Ke, and W.-P. Yang. Learning to rank for information retrieval using genetic programming. In LR4IR, 2007.
|
| |
20
|
Y. Yue and C. Burges. On using simultaneous perturbation stochastic approximation for learning to rank, and the empirical optimality of lambdarank. MSR-TR-2007-115, Microsoft Research, 2007.
|
| |
21
|
T. Zhang. Statistical behavior and consistency of classification methods based on convex risk minimization. The Annuals of Statistics, 32:56--85, 2004.
|
| |
22
|
T. Zhang. Some sharp performance bounds for least squares regression with L<sub>1</sub> regularization. Technical Report TR-2007-005, Rutgers Statistics Department, 2007.
|
| |
23
|
|
|