ACM Home Page
Please provide us with feedback. Feedback
Semantic similarity between search engine queries using temporal correlation
Full text PdfPdf (239 KB)
Source International World Wide Web Conference archive
Proceedings of the 14th international conference on World Wide Web table of contents
Chiba, Japan
SESSION: Usage analysis table of contents
Pages: 2 - 11  
Year of Publication: 2005
ISBN:1-59593-046-9
Authors
Steve Chien  Microsoft Research, Silicon Valley Campus, Mountain View, CA
Nicole Immorlica  Massachusetts Institute of Technology, Cambridge, MA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 106,   Citation Count: 22
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1060745.1060752
What is a DOI?

ABSTRACT

We investigate the idea of finding semantically related search engine queries based on their temporal correlation; in other words, we infer that two queries are related if their popularities behave similarly over time. To this end, we first define a new measure of the temporal correlation of two queries based on the correlation coefficient of their frequency functions. We then conduct extensive experiments using our measure on two massive query streams from the MSN search engine, revealing that this technique can discover a wide range of semantically similar queries. Finally, we develop a method of efficiently finding the highest correlated queries for a given input query using far less space and time than the naive approach, making real-time implementation possible.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
AOL Instant Messenger - AIM Express. http://www.aim.com/get_aim/express/.
 
2
R. Baratto, J. Nieh, and L. Kim. Thinc: A remote display architecture for thin-client computing. Technical Report CUCS-027-04, Department of Computer Science, Columbia University, July 2004.
3
 
4
GMail. https://gmail.google.com/.
 
5
Hotmail. http://www.hotmail.com.
 
6
P.-H. Kamp and R. N. M. Watson. Jails: Confining the omnipotent root. In 2nd International SANE Conference, MECC, Maastricht, The Netherlands, May 2000.
 
7
Linux VServer Project. http://www.linux-vserver.org/.
 
8
9
 
10
Portable Firefox. http://johnhaller.com/jh/mozilla/portable_firefox/.
 
11
M. Raghunath, C. Narayanaswami, C. Caster, and R. Caceres. Reincarnating pcs with portable soulpads. Technical Report RC23418 (W0411-057), IBM Research Division Thomas J. Watson Research Center, Nov. 2004.
 
12
Stealth Surfer. http://www.stealthsurfer.biz/.
 
13
Trek Thumbdrive TOUCH. http://www.thumbdrive.com/touch.htm/.
 
14
A. Tucker and D. Comay. Solaris zones: Operating system support for server consolidaiton, May 2004.
 
15
U3 Platform. http://www.u3.com.
 
16
VMware, Inc. http://www.vmware.com.

CITED BY  22

Collaborative Colleagues:
Steve Chien: colleagues
Nicole Immorlica: colleagues