ACM Home Page
Please provide us with feedback. Feedback
Portfolio theory of information retrieval
Full text PdfPdf (1.12 MB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
SESSION: Retrieval models I table of contents
Pages 115-122  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Jun Wang  University College London, London, United Kingdom
Jianhan Zhu  university college london, London, United Kingdom
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 96,   Downloads (12 Months): 279,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1571963
What is a DOI?

ABSTRACT

This paper studies document ranking under uncertainty. It is tackled in a general situation where the relevance predictions of individual documents have uncertainty, and are dependent between each other. Inspired by the Modern Portfolio Theory, an economic theory dealing with investment in financial markets, we argue that ranking under uncertainty is not just about picking individual relevant documents, but about choosing the right combination of relevant documents. This motivates us to quantify a ranked list of documents on the basis of its expected overall relevance (mean) and its variance; the latter serves as a measure of risk, which was rarely studied for document ranking in the past. Through the analysis of the mean and variance, we show that an optimal rank order is the one that balancing the overall relevance (mean) of the ranked list against its risk level (variance). Based on this principle, we then derive an efficient document ranking algorithm. It generalizes the well-known probability ranking principle (PRP) by considering both the uncertainty of relevance predictions and correlations between retrieved documents. Moreover, the benefit of diversification is mathematically quantified; we show that diversifying documents is an effective way to reduce the risk of document ranking. Experimental results in text retrieval confirm performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
4
5
 
6
W.S. Cooper. The inadequacy of probability of usefulness as a ranking criterion for retrieval system output. University of California, Berkeley, 1971.
 
7
E.J. Elton and M.J. Gruber. Modern portfolio theory and investment analysis. J. Wiley and Sons, 2006.
 
8
M.D. Gordon and P. Lenk. A utility theoretic examination of the probability ranking principle in information retrieval. JASIS, 42(10):703--714, 1991.
9
10
 
11
H. Markowitz. Portfolio selection. Journal of Finance, 1952.
 
12
D. Metzler, T. Strohman, Y. Zhou, and W.B. Croft. Indri at TREC 2005: Terabyte track. In TREC, 2005.
 
13
S.E. Robertson. The probability ranking principle in IR. Journal of Documentation, pages 294--304, 1977.
 
14
S.E. Robertson and N. Belkin. Ranking in principle. Journal of Documentation, pages 93--100, 1978.
 
15
S.E. Robertson and K. Spärck Jones. Relevance weighting of search terms. Journal of the American Society for Information Science, 27(3):129--46, 1976.
 
16
17
 
18
J.A. Thom and F. Scholer. A comparison of evaluation measures given how users perform on search tasks. In Twelfth Australasian Document Computing Symposium, 2007.
19
 
20
21
 
22
A. Zellner. Bayesian estimation and prediction using asymmetric loss functions. Journal of the American Statistical Association, 81(394):446--451, 1986.
23
 
24
25