| Hourly analysis of a very large topically categorized web query log |
| Full text |
Pdf
(286 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Sheffield, United Kingdom
SESSION: Efficiency and scaling
table of contents
Pages: 321 - 328
Year of Publication: 2004
ISBN:1-58113-881-4
|
|
Authors
|
|
Steven M. Beitzel
|
Illinois Institute of Technology, Chicago, IL
|
|
Eric C. Jensen
|
Illinois Institute of Technology, Chicago, IL
|
|
Abdur Chowdhury
|
Illinois Institute of Technology, Chicago, IL
|
|
David Grossman
|
Illinois Institute of Technology, Chicago, IL
|
|
Ophir Frieder
|
Illinois Institute of Technology, Chicago, IL
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 33, Downloads (12 Months): 240, Citation Count: 50
|
|
|
ABSTRACT
We review a query log of hundreds of millions of queries that constitute the total query traffic for an entire week of a general-purpose commercial web search service. Previously, query logs have been studied from a single, cumulative view. In contrast, our analysis shows changes in popularity and uniqueness of topically categorized queries across the hours of the day. We examine query traffic on an hourly basis by matching it against lists of queries that have been topically pre-categorized by human editors. This represents 13% of the query traffic. We show that query traffic from particular topical categories differs both from the query stream as a whole and from other categories. This analysis provides valuable insight for improving retrieval effectiveness and efficiency. It is also relevant to the development of enhanced query disambiguation, routing, and caching algorithms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Steven M. Beitzel , Eric C. Jensen , Abdur Chowdhury , David Grossman, Using titles and category names from editor-driven taxonomies for automatic evaluation, Proceedings of the twelfth international conference on Information and knowledge management, November 03-08, 2003, New Orleans, LA, USA
[doi> 10.1145/956863.956868]
|
 |
2
|
|
 |
3
|
|
 |
4
|
|
 |
5
|
|
| |
6
|
Hawking, D., Craswell, N., and Griffiths, K. Which Search Engine is Best at Finding Online Services? In Proceedings of WWW10 (Hong Kong, May 2001), Posters. Actual poster available as http://pigfish.vic.cmis.csiro.au/ nickc/pubs/www10actualposter.pdf
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
Lawrence, S. and Giles, C.L. Searching the World Wide Web. Science 280(5360), 98--100, 1998.
|
 |
11
|
|
| |
12
|
Markatos, E.P. On Caching Search Engine Query Results. In the Proceedings of the 5th International Web Caching and Content Delivery Workshop, May 2000.
|
 |
13
|
|
| |
14
|
|
 |
15
|
Paricia Correia Saraiva , Edleno Silva de Moura , Novio Ziviani , Wagner Meira , Rodrigo Fonseca , Berthier Riberio-Neto, Rank-preserving two-level caching for scalable search engines, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, p.51-58, September 2001, New Orleans, Louisiana, United States
[doi> 10.1145/383952.383959]
|
 |
16
|
|
 |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
Spink, A., Jansen, B.J., and Ozmultu, H.C. Use of query reformulation and relevance feedback by Excite users. Internet Research: Electronic Networking Applications and Policy 10 (4), 2000.
|
| |
22
|
Sullivan, D. Searches Per Day. Search Engine Watch, February, 2003. http://searchenginewatch.com/reports/article.php/2156461
|
| |
23
|
|
 |
24
|
|
| |
25
|
|
| |
26
|
Xie, Y., O'Hallaron, D. Locality in Search Engine Queries and Its Implications for Caching. Infocom 2002.
|
CITED BY 50
|
|
|
|
|
|
|
|
|
|
|
Ying Liu , Tao Qin , Tie-Yan Liu , Lei Zhang , Wei-Ying Ma, Similarity space projection for web image search and annotation, Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval, November 10-11, 2005, Hilton, Singapore
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Qiankun Zhao , Steven C. H. Hoi , Tie-Yan Liu , Sourav S. Bhowmick , Michael R. Lyu , Wei-Ying Ma, Time-dependent semantic similarity measure of queries using historical click-through data, Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
|
|
|
|
|
|
|
|
|
Qingqing Gan , Josh Attenberg , Alexander Markowetz , Torsten Suel, Analysis of geographic queries in a search engine log, Proceedings of the first international workshop on Location and the web, p.49-56, April 22-22, 2008, Beijing, China
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Honghua (Kathy) Dai , Lingzhi Zhao , Zaiqing Nie , Ji-Rong Wen , Lee Wang , Ying Li, Detecting online commercial intention (OCI), Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ricardo Baeza-Yates , Aristides Gionis , Flavio P. Junqueira , Vanessa Murdock , Vassilis Plachouras , Fabrizio Silvestri, Design trade-offs for search engine caching, ACM Transactions on the Web (TWEB), v.2 n.4, p.1-28, October 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Andrew W. Leung , Minglong Shao , Timothy Bisson , Shankar Pasupathy , Ethan L. Miller, Spyglass: fast, scalable metadata search for large-scale storage systems, Proccedings of the 7th conference on File and stroage technologies, p.153-166, February 24-27, 2009, San Francisco, California
|
|
|
|
|
|
|
|
|
|
|
|
Evgeniy Gabrilovich , Andrei Broder , Marcus Fontoura , Amruta Joshi , Vanja Josifovski , Lance Riedel , Tong Zhang, Classifying search queries using the Web as a source of knowledge, ACM Transactions on the Web (TWEB), v.3 n.2, p.1-28, April 2009
|
|
|
Maryam Kamvar , Melanie Kellar , Rajan Patel , Ya Xu, Computers and iphones and mobile phones, oh my!: a logs-based comparison of search users on different devices, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain
|
|
|
|
|
|
Huanhuan Cao , Derek Hao Hu , Dou Shen , Daxin Jiang , Jian-Tao Sun , Enhong Chen , Qiang Yang, Context-aware query classification, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
|
|
|
Doug Downey , Susan Dumais , Eric Horvitz, Models of searching and browsing: languages, studies, and applications, Proceedings of the 20th international joint conference on Artifical intelligence, p.2740-2747, January 06-12, 2007, Hyderabad, India
|
|
|
|
|
|
|
|