|
ABSTRACT
In a federated digital library system, it is too expensive to query every accessible library. Resource selection is the task to decide to which libraries a query should be routed. Most existing resource selection algorithms compute a library ranking in a heuristic way. In contrast, the decision-theoretic framework (DTF) follows a different approach on a better theoretic foundation: It computes a selection which minimises the overall costs (e.g. retrieval quality, time, money) of the distributed retrieval. For estimating retrieval quality the recall-precision function is proposed. In this paper, we introduce two new methods: The first one computes the empirical distribution of the probabilities of relevance from a small library sample, and assumes it to be representative for the whole library. The second method assumes that the indexing weights follow a normal distribution, leading to a normal distribution for the document scores. Furthermore, we present the first evaluation of DTF by comparing this theoretical approach with the heuristical state-of-the-art system CORI; here we find that DTF outperforms CORI in most cases.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
J. Callan, W. Croft, and S. Harding. The INQUERY retrieval system. In Proceedings of DEXA-92, 3rd International Conference on Database and Expert Systems Applications, pages 78--83, Berlin et al., 1992. Springer.
|
 |
3
|
James P. Callan , Zhihong Lu , W. Bruce Croft, Searching distributed collections with inference networks, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, p.21-28, July 09-13, 1995, Seattle, Washington, United States
[doi> 10.1145/215206.215328]
|
 |
4
|
|
 |
5
|
James C. French , Allison L. Powell , Jamie Callan , Charles L. Viles , Travis Emmitt , Kevin J. Prey , Yun Mou, Comparing the performance of database selection algorithms, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.238-245, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312684]
|
 |
6
|
|
| |
7
|
|
| |
8
|
|
 |
9
|
|
| |
10
|
H. Nottelmann and N. Fuhr. From uncertain inference to probability of relevance for advanced IR applications. In F. Sebastiani, editor, 25th European Conferenve on Information Retrieval Research (ECIR 2003), pages 235--250. Springer, 2003.
|
| |
11
|
W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, editors. Nested Relations and Complex Objects in Databases. Cambridge University Press, 1992.
|
| |
12
|
S. E. Robertson, S. Walker, M. Hancock-Beaulieu, A. Gull, and M. Lau. Okapi at TREC. In Text REtrieval Conference, pages 21--30, 1992.
|
 |
13
|
|
| |
14
|
L. Si, R. Jin, J. Callan, and P. Ogilvie. Proceedings of the 11th international conference on information and knowledge management. In D. Grossman, editor, Proceedings of the 11th International Conference on Information and Knowledge Management, New York, 2002. ACM.
|
| |
15
|
C. J. van Rijsbergen. A non-classical logic for information retrieval. The Computer Journal, 29(6):481--485, 1986.
|
 |
16
|
|
| |
17
|
S. Wu and F. Crestani. Multi-objective resource selection in distributed information retrieval. In Proceedings of The 9th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU), 2002.
|
CITED BY 21
|
|
|
|
|
|
|
|
Matthias Bender , Sebastian Michel , Peter Triantafillou , Gerhard Weikum , Christian Zimmer, Improving collection selection with overlap awareness in P2P search engines, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, August 15-19, 2005, Salvador, Brazil
|
|
|
|
|
|
|
|
|
|
|
|
Klaus Berberich , Manolis Koubarakis , Christos Tryfonopoulos , Gerhard Weikum , Christian Zimmer, MAPS: approximate publish/subscribe functionality in peer-to-peer networks, Proceedings of the 1st international workshop on Advanced data processing in ubiquitous computing (ADPUC 2006), November 27-December 01, 2006, Melbourne, Australia
|
|
|
Milad Shokouhi , Justin Zobel , Yaniv Bernstein, Distributed text retrieval from overlapping collections, Proceedings of the eighteenth conference on Australasian database, p.141-150, January 30-February 02, 2007, Ballarat, Victoria, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|