ACM Home Page
Please provide us with feedback. Feedback
Query sampling for ranking learning in web search
Full text PdfPdf (469 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
POSTER SESSION: Posters table of contents
Pages 754-755  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Linjun Yang  Microsoft Research Asia, Beijing, China
Li Wang  University of Science and Technology of China, Hefei, China
Bo Geng  Peking University, Beijing, China
Xian-Sheng Hua  Microsoft Research Asia, Beijing, China
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 40,   Downloads (12 Months): 118,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1572112
What is a DOI?

ABSTRACT

Learning to rank has become a popular approach to build a ranking model for Web search recently. Based on our observation, the constitution of the training set will greatly influence the performance of the learned ranking model. Meanwhile, the number of queries in Web search is nearly infinite and the human labeling cost is expensive, hence a subset of queries need to be carefully selected for training. In this paper, we develop a greedy algorithm to sample the queries, by simultaneously taking the query density, difficulty and diversity into consideration. The experimental results on a collected Web search dataset comprising 2024 queries show that the proposed method can lead to a more informative training set for building an effective model.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
4
5
 
6
M. G. Kendall. A new measure of rank correlation. Biometrika, 30(1/2):81--93, June 1938.
 
7

Collaborative Colleagues:
Linjun Yang: colleagues
Li Wang: colleagues
Bo Geng: colleagues
Xian-Sheng Hua: colleagues