ACM Home Page
Please provide us with feedback. Feedback
Ranking robustness: a novel framework to predict query performance
Full text PdfPdf (302 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the 15th ACM international conference on Information and knowledge management table of contents
Arlington, Virginia, USA
SESSION: Ranking and estimation table of contents
Pages: 567 - 574  
Year of Publication: 2006
ISBN:1-59593-433-2
Authors
Yun Zhou  University of Massachusetts, Amherst
W. Bruce Croft  University of Massachusetts, Amherst
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 88,   Citation Count: 12
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1183614.1183696
What is a DOI?

ABSTRACT

In this paper, we introduce the notion of ranking robustness, which refers to a property of a ranked list of documents that indicates how stable the ranking is in the presence of uncertainty in the ranked documents. We propose a statistical measure called the robustness score to quantify this notion. We demonstrate that the robustness score significantly and consistently correlates with query performance in a variety of TREC test collections including the GOV2 collection. We compare the robustness score with the clarity score method which is the state-of-the-art technique for query performance prediction. Our experimental results show that the robustness score performs better than or at least as good as the clarity score. We find that the clarity score is barely correlated with query performance on the GOV2 collection while the correlation between the robustness score and query performance remains significant. We also notice that a combination of the two usually results in more prediction power.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E.M. Voorhees. Overview of the TREC 2004 Robust Track. In the Online Proceedings of 2004 Text REtrieval Conference (TREC 2004).
 
2
Robust Track http://trec.nist.gov/tracks.html.
 
3
Predicting Query Difficulty. SIGIR workshop 2005 http://www.haifa.ibm.com/sigir05-qp/index.html
4
 
5
S. Tomlinson. Robust, Web and Terabyte Retrieval with Hummingbird SearchServer at TREC 2004. In the Online Proceedings of 2004 Text REtrieval Conference (TREC 2004).
 
6
B. He and I. Ounis. Inferring query performance using pre-retrieval predictors. In proceedings of the SPIRE 2004. pp 43--54, 2004.
 
7
V. Plachouras, B. He, I. Ounis. University of Glasgow at TREC2004: Experiments in Web, Robust, and Terabyte Tracks with Terrier. In the Online Proceedings of 2004 Text REtrieval Conference (TREC 2004).
 
8
Giambattista Amati, Claudio Carpineto, and Giovanni Romano. Query difficulty, robustness and selective application of query expansion In ECIR 2004, pp 127--137.
9
 
10
K. L. Kwok, L. Grunfeld, N. Dinstl, P. Deng. TREC 2005 Robust Track Experiments Using PIRCS. In the Online Proceedings of 2005 Text REtrieval Conference (TREC 2005).
 
11
Y. Bernstein, B. Billerbeck, S. Garcia, N. Lester, F. Scholer, J. Zobe. RMIT University at TREC 2005: Terabyte and Robust Track. In the Online Proceedings of 2005 Text REtrieval Conference (TREC 2005).
12
 
13
K. L. Kwok, L. Grunfeld, H. L. Sun, P. Deng. TREC 2004 Robust Track Experiments Using PIRCS. In the Online Proceedings of 2004 Text REtrieval Conference (TREC 2004).
14
 
15
D. Lopresti and J. Zhou, Retrieval Strategy for Noisy Text, In symposium on document analysis and information retrieval, pp 1--16, 1996.
 
16
A. Singhal, G. Salton and C. Buckley, Length normalization in degraded text collections. In symposium on document analysis and information retrieval, pp 149--162, 1996.
 
17
Elke Mittendorf, Data corruption and information retrieval, PhD Thesis, Department of Computer Science, the Katholieke Universiteit Leuven.
 
18
J. D. Gibbons and S. Chakraborty, Nonparametric statistical inference, Marcel Dekker, New York, 1992.
 
19
Bookstein, A. and Swanson,D. Probabilistic models for automatic indexing. Journal of the American Society for Information Science 25, 5(1974), 312--319.
 
20
Harter, S. P. A probabilistic approach to automatic keyword indexing. Journal of the American Society for Information Science 26, 4 and 5(1975), Part I:197--206; Part II:280--289
 
21
22
 
23
E. Kreyszig, Advanced Enineering Mathematics, chapter 23.10, JOHN WILEY & SONS, INC. 1997.
24
 
25
T. Hastie, R. Tibshirani, J. H. Friedman. The Elements of Statistical Learning, Springer press, 2001.

CITED BY  12

Collaborative Colleagues:
Yun Zhou: colleagues
W. Bruce Croft: colleagues