|
ABSTRACT
Ranking algorithms, whose goal is to appropriately order a set of objects/documents, are an important component of information retrieval systems. Previous work on ranking algorithms has focused on cases where only labeled data is available for training (i.e. supervised learning). In this paper, we consider the question whether unlabeled (test) data can be exploited to improve ranking performance. We present a framework for transductive learning of ranking functions and show that the answer is affirmative. Our framework is based on generating better features from the test data (via KernelPCA) and incorporating such features via Boosting, thus learning different ranking functions adapted to the individual test queries. We evaluate this method on the LETOR (TREC, OHSUMED) dataset and demonstrate significant improvements.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
S. Ben-David, J. Blitzer, K. Crammer, and F. Pereira. Analysis of representations for domain adaptation. In Neural Information Processing Systems, 2006.
|
| |
4
|
C. Burges, R. Ragno, and Q. Le. Learning to rank with nonsmooth cost functions. In NIPS, 2006.
|
 |
5
|
Chris Burges , Tal Shaked , Erin Renshaw , Ari Lazier , Matt Deeds , Nicole Hamilton , Greg Hullender, Learning to rank using gradient descent, Proceedings of the 22nd international conference on Machine learning, p.89-96, August 07-11, 2005, Bonn, Germany
[doi> 10.1145/1102351.1102363]
|
| |
6
|
W. Chu and Z. Ghahramani. Extensions of gaussian processes for ranking: semi-supervised and active learning. In NIPS Wksp on Learning to Rank, 2005.
|
| |
7
|
K. Crammer and Y. Singer. Pranking with ranking. In Neural Information Processing Systems (NIPS), 2001.
|
| |
8
|
|
 |
9
|
|
 |
10
|
Jingrui He , Mingjing Li , Hong-Jiang Zhang , Hanghang Tong , Changshui Zhang, Manifold-ranking based image retrieval, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027531]
|
| |
11
|
R. Herbrich, T. Graepel, and K. Obermayer. Advances in Large Margin Classifiers, chapter Large margin rank boundaries for ordinal regression. MIT Press, 2000.
|
 |
12
|
|
 |
13
|
|
 |
14
|
|
| |
15
|
|
 |
16
|
|
| |
17
|
P. Li, C. Burges, and Q. Wu. McRank: Learning to rank using classification and gradient boosting. In NIPS, 2007.
|
| |
18
|
T.-Y. Liu, T. Qin, J. Xu, W. Xiong, and H. Li. LETOR: Benchmark dataset for research on learning to rank for information retrieval. In SIGIR Workshop on Learning to Rank for IR (LR4IR), 2007.
|
| |
19
|
L. Mason, J. Baxter, P. Bartless, and M. Frean. Boosting as gradient descent. In NIPS, 2000.
|
| |
20
|
D. Metzler. Direct maximization of rank-based metrics. Technical report, University of Massachusetts, Amherst CIIR, 2005.
|
 |
21
|
Rajat Raina , Alexis Battle , Honglak Lee , Benjamin Packer , Andrew Y. Ng, Self-taught learning: transfer learning from unlabeled data, Proceedings of the 24th international conference on Machine learning, p.759-766, June 20-24, 2007, Corvalis, Oregon
[doi> 10.1145/1273496.1273592]
|
| |
22
|
S. Robertson. Overview of the Okapi projects. Journal of Documentation, 53(1), 1997.
|
| |
23
|
|
| |
24
|
|
| |
25
|
|
| |
26
|
A. Smola, O. Mangasarian, and B. Schölkopf. Sparse kernel feature analysis. Technical Report 99-03, University of Wisconsin, Data Mining Institute, 1999.
|
| |
27
|
T. Truong, M.-R. Amini, and P. Gallinari. Learning to rank with partially labeled training data. In Int'l Conf. on Multidisciplinary Info Sci/Tech, 2006.
|
 |
28
|
Ming-Feng Tsai , Tie-Yan Liu , Tao Qin , Hsin-Hsi Chen , Wei-Ying Ma, FRank: a ranking method with fidelity loss, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, July 23-27, 2007, Amsterdam, The Netherlands
[doi> 10.1145/1277741.1277808]
|
| |
29
|
J. Wang, M. Li, Z. Li, and W.-Y. Ma. Learning ranking function via relevance propagation. Technical report, Microsoft Research Asia, 2005.
|
| |
30
|
J. Weston, R. Kuang, C. Leslie, and W. S. Noble. Protein ranking by semi-supervised network propagation. BMC Bioinformatics, 7, 2006.
|
 |
31
|
|
 |
32
|
|
 |
33
|
|
 |
34
|
|
 |
35
|
|
| |
36
|
D. Zhou, J. Weston, A. Gretton, O. Bousquet, and B. Schökopf. Ranking on data manifolds. In NIPS, 2004.
|
| |
37
|
X. Zhu. Semi-supervised learning literature survey. Technical Report 1530, University of Wisconsin, Madison, Computer Science Dept., 2005.
|
|