ACM Home Page
Please provide us with feedback. Feedback
Optimizing result prefetching in web search engines with segmented indices
Full text PdfPdf (184 KB)
Source ACM Transactions on Internet Technology (TOIT) archive
Volume 4 ,  Issue 1  (February 2004) table of contents
Pages: 31 - 59  
Year of Publication: 2004
ISSN:1533-5399
Authors
Ronny Lempel  IBM Research Labs, Haifa, Israel
Shlomo Moran  Technion, Haifa, Israel
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 68,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/967030.967032
What is a DOI?

ABSTRACT

We study the process in which search engines with segmented indices serve queries. In particular, we investigate the number of result pages that search engines should prepare during the query processing phase.Search engine users have been observed to browse through very few pages of results for queries that they submit. This behavior of users suggests that prefetching many results upon processing an initial query is not efficient, since most of the prefetched results will not be requested by the user who initiated the search. However, a policy that abandons result prefetching in favor of retrieving just the first page of search results might not make optimal use of system resources either.We argue that for a certain behavior of users, engines should prefetch a constant number of result pages per query. We define a concrete query processing model for search engines with segmented indices, and analyze the cost of such prefetching policies. Based on these costs, we show how to determine the constant that optimizes the prefetching policy. Our results are mostly applicable to local index partitions of the inverted files, but are also applicable to processing short queries in global index architectures.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
 
5
Broder, A. 2000. Web search. Invited talk at the 9th Text Retrieval Conference (TREC-9), Gaithersburg, Maryland, November.
6
 
7
 
8
 
9
Feller, W. 1970. An Introduction to Probability Theory and Its Applications. John Wiley & Sons.
 
10
 
11
 
12
 
13
Johnson, N. L. and Kotz, S. I. 1977. Urn Models and their Application. John Wiley & Sons, Inc.
 
14
Johnson, N. L. and Young, D. H. 1960. Some applications of two approximations to the multinomial distribution. Biometrika 47, 463--469.
 
15
 
16
Kolchin, V. F., Sevast'yanov, B. A., and Chistyakov, V. P. 1978. Random Allocations. V. H. Winston & Sons.
 
17
Lawrence, S. and Giles, C. L. 1998. Searching the world wide web. Science 280, April.
18
 
19
Markatos, E. P. 2000. On caching search engine query results. In Proceedings of the 5th International Web Caching and Content Delivery Workshop, May.
20
 
21
 
22
23
 
24
 
25
Silverstein, C., Henzinger, M., Marais, H., and Moricz, M. 1998. Analysis of a very large altavista query log. Tech. Rep. 1998-014, Compaq Systems Research Center. October.
 
26
27


Collaborative Colleagues:
Ronny Lempel: colleagues
Shlomo Moran: colleagues