ACM Home Page
Please provide us with feedback. Feedback
Efficiency trade-offs in two-tier web search systems
Full text PdfPdf (456 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
SESSION: Efficiency table of contents
Pages 163-170  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Ricardo Baeza-Yates  Yahoo!, Barcelona, Spain
Vanessa Murdock  Yahoo!, Barcelona, Spain
Claudia Hauff  University of Twente, Enschede, Netherlands
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 62,   Downloads (12 Months): 177,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1571971
What is a DOI?

ABSTRACT

Search engines rely on searching multiple partitioned corpora to return results to users in a reasonable amount of time. In this paper we analyze the standard two-tier architecture for Web search with the difference that the corpus to be searched for a given query is predicted in advance. We show that any predictor better than random yields time savings, but this decrease in the processing time yields an increase in the infrastructure cost. We provide an analysis and investigate this trade-off in the context of two different scenarios on real-world data. We demonstrate that in general the decrease in answer time is justified by a small increase in infrastructure cost.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
J. Callan, W. B. Croft, and S. M. Harding. The INQUERY retrieval system. In Proceedings of the 3rd International Conference on Database and Expert Systems Applications, pages 78--83, 1992.
4
5
 
6
7
 
8
 
9
 
10
 
11
 
12
B. He and I. Ounis. Inferring query performance using pre-retrieval predictors. In Proceedings of SPIRE, 2004.
 
13
14
 
15
16
17
 
18
K. M. Risvik. Scaling Internet Search Engines: Methods and Analysis. PhD thesis, NTNU, 2004.
 
19
20
21
22
 
23
A. Spink, J. Bateman, and B. J. Jansen. Searching heterogeneous collections on the web: Behavior of excite users. Information Research, 4(2), 1998. Available at http://informationr.net/ir/4-2/paper53.html.
 
24
E. Yom-Tov, D. Carmel, A. Darlow, D. Pelleg, S. Errera-Yaakov, and S. Fine. Juru at TREC 2005: Query prediction in the teraybyte and the robust tracks. In TREC 2005, 2005.
25
 
26
27
 
28
J. Zobel. Collection selection via lexicon inspection. In Proceedings of the Second Australian Document Computing Symposium, 1997.

Collaborative Colleagues:
Ricardo Baeza-Yates: colleagues
Vanessa Murdock: colleagues
Claudia Hauff: colleagues