|
ABSTRACT
There has been recent interests in studying the "goal" behind a user's Web query, so that this goal can be used to improve the quality of a search engine's results. Previous studies have mainly focused on using manual query-log investigation to identify Web query goals. In this paper we study whether and how we can automate this goal-identification process. We first present our results from a human subject study that strongly indicate the feasibility of automatic query-goal identification. We then propose two types of features for the goal-identification task: user-click behavior and anchor-link distribution. Our experimental evaluation shows that by combining these features we can correctly identify the goals for 90% of the queries studied.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
D. Hawking and N. Craswell. Overview of the TREC-2001 Web track. In Proceedings of the Tenth Text REtrieval Conference (TREC-10), 2001.
|
 |
2
|
|
| |
3
|
T. Westerveld, W. Kraaij, and D. Hiemstra. Retrieving web pages using content, links, URLs and anchors. In Proceedings of the Tenth Text REtrieval Conference (TREC-10), 2001.
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
 |
7
|
|
 |
8
|
|
 |
9
|
Hua-Jun Zeng , Qi-Cai He , Zheng Chen , Wei-Ying Ma , Jinwen Ma, Learning to cluster web search results, Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, July 25-29, 2004, Sheffield, United Kingdom
[doi> 10.1145/1008992.1009030]
|
| |
10
|
|
| |
11
|
Vivisimo search engine. http://vivisimo.com/.
|
 |
12
|
|
| |
13
|
M. Chen, M. Hearst, J. Hong, and J. Lin. Cha-Cha: A system for organizing intranet search results. In Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, 1999.
|
| |
14
|
U. Lee, Z. Liu, and J. Cho. Automatic identification of user goals in web search. Technical report, UCLA Computer Science, 2004.
|
| |
15
|
|
 |
16
|
|
| |
17
|
Danny Sullivan. Searches per day. http://searchenginewatch.com/reports/article.php/2156461, 2003.
|
| |
18
|
Z. Gyongyi, H. Garcia-Molina, and J. Pedersen. Combating web spam with trustrank. In Proceedings of VLDB '04, 2004.
|
 |
19
|
|
 |
20
|
Junghoo Cho , Narayanan Shivakumar , Hector Garcia-Molina, Finding replicated Web collections, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.355-366, May 15-18, 2000, Dallas, Texas, United States
|
| |
21
|
|
| |
22
|
J.L. Devore. Probability and Statistics for Engineering and the Sciences. Duxbury, 6th edition, 2004.
|
| |
23
|
|
| |
24
|
|
| |
25
|
D.D. Wackerly, W. Mendenhall III, and R.L. Scheaffer. Mathematical Statistics with Applications. Duxbury, 6th edition, 2002.
|
| |
26
|
C. Hoelscher. How Internet experts search for information on the Web. In Proceedings of WebNet '98, 1998.
|
| |
27
|
|
| |
28
|
|
| |
29
|
|
 |
30
|
|
| |
31
|
B.D. Davison, D.G. Deschenes, and D.B. Lewanda. Finding relevant Website queries. In Proceedings of the Twelfth Int'l World Wide Web Conf., 2003.
|
 |
32
|
|
 |
33
|
|
CITED BY 49
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Qingqing Gan , Josh Attenberg , Alexander Markowetz , Torsten Suel, Analysis of geographic queries in a search engine log, Proceedings of the first international workshop on Location and the web, p.49-56, April 22-22, 2008, Beijing, China
|
|
|
|
|
|
|
|
|
Carsten Stolz , Michael Barth , Maximilian Viermetz , Klaus D. Wilde, Searchstrings revealing user intent: a better understanding of user perception, Proceedings of the 6th international conference on Web engineering, July 11-14, 2006, Palo Alto, California, USA
|
|
|
Yunyao Li , Rajasekar Krishnamurthy , Shivakumar Vaithyanathan , H. V. Jagadish, Getting work done on the web: supporting transactional queries, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
|
|
|
Yumao Lu , Fuchun Peng , Xin Li , Nawaaz Ahmed, Coupling feature selection and machine learning methods for navigational query identification, Proceedings of the 15th ACM international conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA
|
|
|
Yiqun Liu , Yupeng Fu , Min Zhang , Shaoping Ma , Liyun Ru, Automatic search engine performance evaluation with click-through data analysis, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
|
|
|
|
|
|
|
|
|
Huaiyu Zhu , Sriram Raghavan , Shivakumar Vaithyanathan , Alexander Löser, Navigating the intranet with high precision, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
|
|
|
Xiubo Geng , Tie-Yan Liu , Tao Qin , Andrew Arnold , Hang Li , Heung-Yeung Shum, Query dependent ranking using K-nearest neighbor, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, July 20-24, 2008, Singapore, Singapore
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Dou Shen , Toby Walkery , Zijian Zhengy , Qiang Yangz , Ying Li, Personal name classification in web queries, Proceedings of the international conference on Web search and web data mining, February 11-12, 2008, Palo Alto, California, USA
|
|
|
|
|
|
Bernard J. Jansen , Danielle L. Booth , Amanda Spink, Determining the informational, navigational, and transactional intent of Web queries, Information Processing and Management: an International Journal, v.44 n.3, p.1251-1266, May, 2008
|
|
|
|
|
|
Doug Downey , Susan Dumais , Dan Liebling , Eric Horvitz, Understanding the relationship between searchers' queries and information goals, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|
|
|
|
|
Dou Shen , Min Qin , Weizhu Chen , Qiang Yang , Zheng Chen, Mining web query hierarchies from clickthrough data, Proceedings of the 22nd national conference on Artificial intelligence, p.341-346, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|
|
Fan Guo , Chao Liu , Anitha Kannan , Tom Minka , Michael Taylor , Yi-Min Wang , Christos Faloutsos, Click chain model in web search, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain
|
|
|
|
|
|
Ruihua Song , Zhenxiao Luo , Jian-Yun Nie , Yong Yu , Hsiao-Wuen Hon, Identification of ambiguous queries in web search, Information Processing and Management: an International Journal, v.45 n.2, p.216-229, March, 2009
|
|
|
Donald Metzler , Jasmine Novak , Hang Cui , Srihari Reddy, Building enriched document representations using aggregated anchor text, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
|
|
|
|
|
|
Jiafeng Guo , Gu Xu , Xueqi Cheng , Hang Li, Named entity recognition in query, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
Hila Becker , Andrei Broder , Evgeniy Gabrilovich , Vanja Josifovski , Bo Pang, Context transfer in search advertising, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
Zhicheng Dou , Ruihua Song , Jian-Yun Nie , Ji-Rong Wen, Using anchor texts with their hyperlink structure for web search, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
|
|
|
|
|
|
Huanhuan Cao , Derek Hao Hu , Dou Shen , Daxin Jiang , Jian-Tao Sun , Enhong Chen , Qiang Yang, Context-aware query classification, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
|
|
|
Doug Downey , Susan Dumais , Eric Horvitz, Models of searching and browsing: languages, studies, and applications, Proceedings of the 20th international joint conference on Artifical intelligence, p.2740-2747, January 06-12, 2007, Hyderabad, India
|
|