ACM Home Page
Please provide us with feedback. Feedback
Evaluating different methods of estimating retrieval quality for resource selection
Full text PdfPdf (126 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval table of contents
Toronto, Canada
SESSION: Distributed information retrieval table of contents
Pages: 290 - 297  
Year of Publication: 2003
ISBN:1-58113-646-3
Authors
Henrik Nottelmann  University of Duisburg-Essen, Duisburg, Germany
Norbert Fuhr  University of Duisburg-Essen, Duisburg, Germany
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 61,   Citation Count: 21
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/860435.860489
What is a DOI?

ABSTRACT

In a federated digital library system, it is too expensive to query every accessible library. Resource selection is the task to decide to which libraries a query should be routed. Most existing resource selection algorithms compute a library ranking in a heuristic way. In contrast, the decision-theoretic framework (DTF) follows a different approach on a better theoretic foundation: It computes a selection which minimises the overall costs (e.g. retrieval quality, time, money) of the distributed retrieval. For estimating retrieval quality the recall-precision function is proposed. In this paper, we introduce two new methods: The first one computes the empirical distribution of the probabilities of relevance from a small library sample, and assumes it to be representative for the whole library. The second method assumes that the indexing weights follow a normal distribution, leading to a normal distribution for the document scores. Furthermore, we present the first evaluation of DTF by comparing this theoretical approach with the heuristical state-of-the-art system CORI; here we find that DTF outperforms CORI in most cases.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
J. Callan, W. Croft, and S. Harding. The INQUERY retrieval system. In Proceedings of DEXA-92, 3rd International Conference on Database and Expert Systems Applications, pages 78--83, Berlin et al., 1992. Springer.
3
4
5
6
 
7
 
8
9
 
10
H. Nottelmann and N. Fuhr. From uncertain inference to probability of relevance for advanced IR applications. In F. Sebastiani, editor, 25th European Conferenve on Information Retrieval Research (ECIR 2003), pages 235--250. Springer, 2003.
 
11
W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, editors. Nested Relations and Complex Objects in Databases. Cambridge University Press, 1992.
 
12
S. E. Robertson, S. Walker, M. Hancock-Beaulieu, A. Gull, and M. Lau. Okapi at TREC. In Text REtrieval Conference, pages 21--30, 1992.
13
 
14
L. Si, R. Jin, J. Callan, and P. Ogilvie. Proceedings of the 11th international conference on information and knowledge management. In D. Grossman, editor, Proceedings of the 11th International Conference on Information and Knowledge Management, New York, 2002. ACM.
 
15
C. J. van Rijsbergen. A non-classical logic for information retrieval. The Computer Journal, 29(6):481--485, 1986.
16
 
17
S. Wu and F. Crestani. Multi-objective resource selection in distributed information retrieval. In Proceedings of The 9th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU), 2002.

CITED BY  21

Collaborative Colleagues:
Henrik Nottelmann: colleagues
Norbert Fuhr: colleagues