| Using the wisdom of the crowds for keyword generation |
| Full text |
Pdf
(334 KB)
|
Source
|
International World Wide Web Conference
archive
Proceeding of the 17th international conference on World Wide Web
table of contents
Beijing, China
SESSION: Data mining: log analysis
table of contents
Pages 61-70
Year of Publication: 2008
ISBN:978-1-60558-085-2
|
|
Authors
|
|
Ariel Fuxman
|
Microsoft Research, Mountain View, CA, USA
|
|
Panayiotis Tsaparas
|
Microsoft Research, Mountain View, CA, USA
|
|
Kannan Achan
|
Microsoft Research, Mountain View, CA, USA
|
|
Rakesh Agrawal
|
Microsoft Research, Mountain View, CA, USA
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 24, Downloads (12 Months): 333, Citation Count: 8
|
|
|
ABSTRACT
In the sponsored search model, search engines are paid by businesses that are interested in displaying ads for their site alongside the search results. Businesses bid for keywords, and their ad is displayed when the keyword is queried to the search engine. An important problem in this process is 'keyword generation': given a business that is interested in launching a campaign, suggest keywords that are related to that campaign. We address this problem by making use of the query logs of the search engine. We identify queries related to a campaign by exploiting the associations between queries and URLs as they are captured by the user's clicks. These queries form good keyword suggestions since they capture the "wisdom of the crowd" as to what is related to a site. We formulate the problem as a semi-supervised learning problem, and propose algorithms within the Markov Random Field model. We perform experiments with real query logs, and we demonstrate that our algorithms scale to large query logs and produce meaningful results.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
 |
3
|
|
| |
4
|
K. Bartz, V. Murthi, and S. Sebastian. Logistic regression and collaborative filtering for sponsored search term recommendation. Second Workshop on Sponsored Search Auctions, 2006.
|
 |
5
|
|
 |
6
|
Andrei Z. Broder , Marcus Fontoura , Evgeniy Gabrilovich , Amruta Joshi , Vanja Josifovski , Tong Zhang, Robust classification of rare queries using web knowledge, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, July 23-27, 2007, Amsterdam, The Netherlands
[doi> 10.1145/1277741.1277783]
|
| |
7
|
Olivier Chapelle, Bernhard Scholkopf, and Alexander Zien. Semi-Supervised Learning. The MIT Press, 2006.
|
 |
8
|
|
| |
9
|
P. Doyle and L. Snell. Random Walks and Electrical Networks. Mathematical Association of America, 1984.
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
D. Kelleher and S. Luz. Automatic hypertext keyphrase extraction. IJCAI, pages 1608--1609, 2005.
|
 |
16
|
|
| |
17
|
|
| |
18
|
J. Rocchio. The SMART Retrieval System: Experiments in Automatic Document Processing, chapter Relevance feedback in Information Retrieval, pages 313--323. Prentice Hall, 1971.
|
 |
19
|
Dou Shen , Rong Pan , Jian-Tao Sun , Jeffrey Junfeng Pan , Kangheng Wu , Jie Yin , Qiang Yang, Q2C@UST: our winning solution to query classification in KDDCUP 2005, ACM SIGKDD Explorations Newsletter, v.7 n.2, p.100-110, December 2005
[doi> 10.1145/1117454.1117467]
|
| |
20
|
|
| |
21
|
P. Turney. Coherent keyphrase extraction via web mining. IJCAI, pages 434--439, 2003.
|
 |
22
|
|
| |
23
|
Gui-Rong Xue , Yong Yu , Dou Shen , Qiang Yang , Hua-Jun Zeng , Zheng Chen, Reinforcing Web-object Categorization Through Interrelationships, Data Mining and Knowledge Discovery, v.12 n.2-3, p.229-248, May 2006
[doi> 10.1007/s10618-005-0015-5]
|
 |
24
|
Gui-Rong Xue , Hua-Jun Zeng , Zheng Chen , Yong Yu , Wei-Ying Ma , WenSi Xi , WeiGuo Fan, Optimizing web search using web click-through data, Proceedings of the thirteenth ACM international conference on Information and knowledge management, November 08-13, 2004, Washington, D.C., USA
[doi> 10.1145/1031171.1031192]
|
| |
25
|
|
 |
26
|
|
| |
27
|
X. Zhu, Z. Ghahramani, and J. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. ICML, pages 912--919, 2003.
|
CITED BY 8
|
|
|
|
|
|
|
|
Paolo Boldi , Francesco Bonchi , Carlos Castillo , Debora Donato , Sebastiano Vigna, Query suggestions using query-flow graphs, Proceedings of the 2009 workshop on Web Search Click Data, p.56-63, February 09-09, 2009, Barcelona, Spain
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|