ACM Home Page
Please provide us with feedback. Feedback
Hourly analysis of a very large topically categorized web query log
Full text PdfPdf (286 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Sheffield, United Kingdom
SESSION: Efficiency and scaling table of contents
Pages: 321 - 328  
Year of Publication: 2004
ISBN:1-58113-881-4
Authors
Steven M. Beitzel  Illinois Institute of Technology, Chicago, IL
Eric C. Jensen  Illinois Institute of Technology, Chicago, IL
Abdur Chowdhury  Illinois Institute of Technology, Chicago, IL
David Grossman  Illinois Institute of Technology, Chicago, IL
Ophir Frieder  Illinois Institute of Technology, Chicago, IL
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 32,   Downloads (12 Months): 225,   Citation Count: 50
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1008992.1009048
What is a DOI?

ABSTRACT

We review a query log of hundreds of millions of queries that constitute the total query traffic for an entire week of a general-purpose commercial web search service. Previously, query logs have been studied from a single, cumulative view. In contrast, our analysis shows changes in popularity and uniqueness of topically categorized queries across the hours of the day. We examine query traffic on an hourly basis by matching it against lists of queries that have been topically pre-categorized by human editors. This represents 13% of the query traffic. We show that query traffic from particular topical categories differs both from the query stream as a whole and from other categories. This analysis provides valuable insight for improving retrieval effectiveness and efficiency. It is also relevant to the development of enhanced query disambiguation, routing, and caching algorithms.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
5
 
6
Hawking, D., Craswell, N., and Griffiths, K. Which Search Engine is Best at Finding Online Services? In Proceedings of WWW10 (Hong Kong, May 2001), Posters. Actual poster available as http://pigfish.vic.cmis.csiro.au/ nickc/pubs/www10actualposter.pdf
 
7
 
8
 
9
 
10
Lawrence, S. and Giles, C.L. Searching the World Wide Web. Science 280(5360), 98--100, 1998.
11
 
12
Markatos, E.P. On Caching Search Engine Query Results. In the Proceedings of the 5th International Web Caching and Content Delivery Workshop, May 2000.
13
 
14
15
16
17
 
18
 
19
 
20
 
21
Spink, A., Jansen, B.J., and Ozmultu, H.C. Use of query reformulation and relevance feedback by Excite users. Internet Research: Electronic Networking Applications and Policy 10 (4), 2000.
 
22
Sullivan, D. Searches Per Day. Search Engine Watch, February, 2003. http://searchenginewatch.com/reports/article.php/2156461
 
23
24
 
25
 
26
Xie, Y., O'Hallaron, D. Locality in Search Engine Queries and Its Implications for Caching. Infocom 2002.

CITED BY  50

Collaborative Colleagues:
Steven M. Beitzel: colleagues
Eric C. Jensen: colleagues
Abdur Chowdhury: colleagues
David Grossman: colleagues
Ophir Frieder: colleagues