| Optimizing web search using web click-through data |
| Full text |
Pdf
(232 KB)
|
| Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the thirteenth ACM international conference on Information and knowledge management
table of contents
Washington, D.C., USA
SESSION: IR-2 (information retrieval): web information retrieval
table of contents
Pages: 118 - 126
Year of Publication: 2004
ISBN:1-58113-874-1
|
|
Authors
|
|
Gui-Rong Xue
|
Shanghai Jiao-Tong University, Shanghai, P.R.China
|
|
Hua-Jun Zeng
|
Microsoft Research Asia, Beijing, P.R.China
|
|
Zheng Chen
|
Microsoft Research Asia, Beijing, P.R.China
|
|
Yong Yu
|
Shanghai Jiao-Tong University, Shanghai, P.R.China
|
|
Wei-Ying Ma
|
Microsoft Research Asia, Beijing, P.R.China
|
|
WenSi Xi
|
Virginia Polytechnic Institute and State University, VA
|
|
WeiGuo Fan
|
Virginia Polytechnic Institute and State University, VA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 42, Downloads (12 Months): 248, Citation Count: 37
|
|
|
ABSTRACT
The performance of web search engines may often deteriorate due to the diversity and noisy information contained within web pages. User click-through data can be used to introduce more accurate description (metadata) for web pages, and to improve the search performance. However, noise and incompleteness, sparseness, and the volatility of web pages and queries are three major challenges for research work on user click-through log mining. In this paper, we propose a novel iterative reinforced algorithm to utilize the user click-through data to improve search performance. The algorithm fully explores the interrelations between queries and web pages, and effectively finds "virtual queries" for web pages and overcomes the challenges discussed above. Experiment results on a large set of MSN click-through log data show a significant improvement on search performance over the naive query log mining algorithm as well as the baseline search engine.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Brian D.D., David, G.D., and David B.L. Finding Relevant Website Queries, in Proceedings of the Twelfth International World Wide Web Conference, 2003.
|
| |
3
|
|
| |
4
|
Cui H., Wen J.R., Nie J.Y., and Ma W.Y., Query Expansion by Mining User Logs, IEEE Transaction on Knowledge and Data Engineering, Vol. 15, No. 4, July/August 2003.
|
 |
5
|
|
 |
6
|
|
 |
7
|
|
 |
8
|
|
| |
9
|
H. Small. Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for Information Science, 24:265--269, 1973.
|
 |
10
|
|
 |
11
|
|
| |
12
|
M. M. Kessler. Bibliographic coupling between scientific papers. American Documentation, 14:10--25, 1963.
|
| |
13
|
MSN Search Engine, http://www.msn.com.
|
 |
14
|
|
 |
15
|
|
| |
16
|
Porter, M. An algorithm for suffix stripping. Program, Vol. 14(3), pp. 130--137, 1980.
|
| |
17
|
|
| |
18
|
Robertson, S.E. et al. Okapi at TREC-3. In Overview of the Third Text REtrieval Conference(TREC-3), 109--126, 1995.
|
| |
19
|
R. R. Larson. Bibliometrics of the World-Wide Web: An exploratory analysis of the intellectual structure of cyberspace. In Proceedings of the Annual Meeting of the American Society for Information Science, Baltimore, Maryland, October 1996.
|
| |
20
|
|
| |
21
|
Soumen Chakrabarti , Byron Dom , Prabhakar Raghavan , Sridhar Rajagopalan , David Gibson , Jon Kleinberg, Automatic resource compilation by analyzing hyperlink structure and associated text, Proceedings of the seventh international conference on World Wide Web 7, p.65-74, April 1998, Brisbane, Australia
|
| |
22
|
Thijs W., Wessel K., and Djoerd H., Retrieving Web Pages using Content, Links, URLs and Anchors, TREC10, 2002.
|
 |
23
|
|
CITED BY 37
|
|
Qiankun Zhao , Tie-Yan Liu , Sourav S. Bhowmick , Wei-Ying Ma, Event detection from evolution of click-through data, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
|
|
|
|
|
|
Qiankun Zhao , Steven C. H. Hoi , Tie-Yan Liu , Sourav S. Bhowmick , Michael R. Lyu , Wei-Ying Ma, Time-dependent semantic similarity measure of queries using historical click-through data, Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
|
|
|
|
|
|
|
|
|
|
|
|
Shenghua Bao , Guirong Xue , Xiaoyuan Wu , Yong Yu , Ben Fei , Zhong Su, Optimizing web search using social annotations, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
Amir Hosein Keyhanipour , Behzad Moshiri , Majid Kazemian , Maryam Piroozmand , Caro Lucas, Aggregation of web search engines based on users' preferences in WebFusion, Knowledge-Based Systems, v.20 n.4, p.321-328, May, 2007
|
|
|
|
|
|
Hiranmay Ghosh , P. Poornachander , Anupama Mallik , Santanu Chaudhury, Learning ontology for personalized video retrieval, Workshop on multimedia information retrieval on The many faces of multimedia semantics, September 28-28, 2007, Augsburg, Bavaria, Germany
|
|
|
|
|
|
|
|
|
Beate Krause , Robert Jäschke , Andreas Hotho , Gerd Stumme, Logsonomy - social information retrieval with logdata, Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, June 19-21, 2008, Pittsburgh, PA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Carlos Castillo , Claudio Corsi , Debora Donato , Paolo Ferragina , Aristides Gionis, Query-log mining for detecting spam, Proceedings of the 4th international workshop on Adversarial information retrieval on the web, April 22-22, 2008, Beijing, China
|
|
|
|
|
|
Giorgos Giannopoulos , Theodore Dalamagas , Magdalini Eirinaki , Timos Sellis, Boosting the ranking function learning process using clustering, Proceeding of the 10th ACM workshop on Web information and data management, October 30-30, 2008, Napa Valley, California, USA
|
|
|
|
|
|
|
|
|
Maarten van der Heijden , Max Hinne , Wessel Kraaij , Suzan Verberne , Theo van der Weide, Using query logs and click data to create improved document descriptions, Proceedings of the 2009 workshop on Web Search Click Data, p.64-67, February 09-09, 2009, Barcelona, Spain
|
|
|
|
|
|
M. Srinivas , M. Tech (SE) , K. P. Supreethi , E. V. Prasad, Enhancing the web site search performance using LBI, Proceedings of the International Conference on Advances in Computing, Communication and Control, January 23-24, 2009, Mumbai, India
|
|
|
Masaya Murata , Hiroyuki Toda , Yumiko Matsuura , Ryoji Kataoka, Query-page intention matching using clicked titles and snippets to boost search rankings, Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, June 15-19, 2009, Austin, TX, USA
|
|
|
Fan Guo , Chao Liu , Anitha Kannan , Tom Minka , Michael Taylor , Yi-Min Wang , Christos Faloutsos, Click chain model in web search, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain
|
|
|
Jianfeng Gao , Wei Yuan , Xiao Li , Kefeng Deng , Jian-Yun Nie, Smoothing clickthrough data for web search ranking, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Shu Huang , Qiankun Zhao , Prasenjit Mitra , C. Lee Giles, Hierarchical location and topic based query expansion, Proceedings of the 23rd national conference on Artificial intelligence, p.1150-1155, July 13-17, 2008, Chicago, Illinois
|
|
|
Hila Becker , Christopher Meek , David Maxwell Chickering, Modeling contextual factors of click rates, Proceedings of the 22nd national conference on Artificial intelligence, p.1310-1315, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|
|
|
|