Semantic similarity between search engine queries using temporal correlation
Source
International World Wide Web Conference
Proceedings of the 14th international conference on World Wide Web
Chiba, Japan
SESSION: Usage analysis
Pages: 2 - 11
Year of Publication: 2005
ISBN: 1-59593-046-9
Authors
Steve Chien, Microsoft Research, Silicon Valley Campus, Mountain View, CA
Nicole Immorlica, Massachusetts Institute of Technology, Cambridge, MA
Bibliometrics
Downloads (6 Weeks): 11, Downloads (12 Months): 106, Citation Count: 22
ABSTRACT
We investigate the idea of finding semantically related search engine queries based on their temporal correlation; in other words, we infer that two queries are related if their popularities behave similarly over time. To this end, we first define a new measure of the temporal correlation of two queries based on the correlation coefficient of their frequency functions. We then conduct extensive experiments using our measure on two massive query streams from the MSN search engine, revealing that this technique can discover a wide range of semantically similar queries. Finally, we develop a method for efficiently finding the most highly correlated queries for a given input query, using far less space and time than the naive approach and making real-time implementation possible.
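As a rough illustration of the idea in the abstract, the sketch below computes a correlation coefficient between two query frequency series and ranks candidate queries by a plain linear scan. The abstract does not spell out the exact measure or the efficient search method, so several things here are assumptions: the frequency function is taken to be a query's count in each time period divided by the total query volume in that period, the correlation is the ordinary Pearson coefficient, and the names `frequency_function`, `correlation`, and `most_correlated` as well as all the numbers are hypothetical. The scan corresponds only to a naive baseline, not to the efficient method the paper develops.

```python
from math import sqrt


def frequency_function(counts, totals):
    """Assumed normalization: per-period count divided by total query volume."""
    return [c / t if t else 0.0 for c, t in zip(counts, totals)]


def correlation(x, y):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(x)
    if n == 0 or n != len(y):
        raise ValueError("series must be non-empty and of equal length")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # A constant series has no defined correlation; treated as 0 here.
        return 0.0
    return cov / sqrt(var_x * var_y)


def most_correlated(target, series_by_query, k=3):
    """Naive baseline: score every other query against the target and sort."""
    target_series = series_by_query[target]
    scores = [
        (correlation(target_series, series), query)
        for query, series in series_by_query.items()
        if query != target
    ]
    scores.sort(reverse=True)
    return [(query, score) for score, query in scores[:k]]


# Made-up counts over six time periods, purely for illustration.
totals = [1000] * 6
raw_counts = {
    "tax forms": [5, 8, 40, 60, 10, 4],
    "irs": [10, 16, 80, 120, 20, 8],
    "halloween costumes": [60, 40, 8, 5, 4, 10],
}
series_by_query = {
    query: frequency_function(counts, totals)
    for query, counts in raw_counts.items()
}
ranking = most_correlated("tax forms", series_by_query, k=2)
```

In this toy data the two tax-related queries rise and fall together, so they score close to 1, while the seasonal query peaks in different periods and scores negative. The scan touches every query's full series, which is the kind of cost a more efficient method would aim to avoid.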
CITED BY 22
Qiankun Zhao, Steven C. H. Hoi, Tie-Yan Liu, Sourav S. Bhowmick, Michael R. Lyu, Wei-Ying Ma, Time-dependent semantic similarity measure of queries using historical click-through data, Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
Micah Dubinko, Ravi Kumar, Joseph Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins, Visualizing tags over time, Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
Eytan Adar, Daniel S. Weld, Brian N. Bershad, Steven S. Gribble, Why we search: visualizing and predicting user behavior, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
Wen Zhang, Jun Yan, Shuicheng Yan, Ning Liu, Zheng Chen, Temporal query substitution for ad search, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
Xuanhui Wang, ChengXiang Zhai, Xiao Hu, Richard Sproat, Mining correlated bursty topic patterns from coordinated text streams, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
Micah Dubinko, Ravi Kumar, Joseph Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins, Visualizing tags over time, ACM Transactions on the Web (TWEB), v.1 n.2, p.7-es, August 2007
Donald Metzler, Rosie Jones, Fuchun Peng, Ruiqiang Zhang, Improving search relevance for implicitly temporal queries, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA