ABSTRACT
The business of Web search, a $10 billion industry, relies heavily on sponsored search, in which a few carefully selected paid advertisements are displayed alongside algorithmic search results. A key technical challenge in sponsored search is to select ads that are relevant to the user's query. Identifying relevant ads is challenging because queries are usually very short, and because users, consciously or not, choose terms intended to lead to optimal Web search results and not to optimal ads. Furthermore, the ads themselves are short and usually formulated to capture the reader's attention rather than to facilitate query matching. Traditionally, ads have been matched to queries with standard information retrieval techniques based on the bag-of-words approach. Here we propose to go beyond the bag of words and augment both queries and ads with additional knowledge-rich features. We use the Web search results initially returned for the query to create a pool of relevant documents. Classifying these documents with respect to an external taxonomy and identifying salient named entities give rise to two new feature types. Empirical evaluation based on over 9,000 query-ad pairwise judgments confirms that using augmented queries produces highly relevant ads. Our methodology also relaxes the requirement for each ad to explicitly specify the exhaustive list of queries ("bid phrases") that can trigger it.
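The augmentation idea in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy keyword-overlap classifier, the two-node `TAXONOMY`, the `w:` and `class:` feature prefixes, and the use of cosine similarity for scoring. Named-entity features are omitted for brevity.

```python
from collections import Counter
import math

# Hypothetical two-node taxonomy: each class is described by a keyword set.
TAXONOMY = {
    "autos": {"car", "sedan", "dealer", "engine"},
    "animals": {"zoo", "cats", "wildlife"},
}

def bag_of_words(text):
    """Plain bag-of-words features: lowercase token counts."""
    return Counter(text.lower().split())

def classify(text, taxonomy):
    """Toy stand-in for a real classifier: the class with the largest keyword overlap."""
    words = set(text.lower().split())
    best = max(taxonomy, key=lambda c: len(words & taxonomy[c]))
    return best if words & taxonomy[best] else None

def augment_query(query, search_results, taxonomy):
    """Query words plus the classes of the documents retrieved for the query."""
    features = Counter({f"w:{w}": n for w, n in bag_of_words(query).items()})
    for doc in search_results:
        label = classify(doc, taxonomy)
        if label is not None:
            features[f"class:{label}"] += 1
    return features

def ad_features(ad_text, taxonomy):
    """Ad words plus the class assigned to the ad text itself."""
    features = Counter({f"w:{w}": n for w, n in bag_of_words(ad_text).items()})
    label = classify(ad_text, taxonomy)
    if label is not None:
        features[f"class:{label}"] += 1
    return features

def cosine(a, b):
    """Cosine similarity between two sparse feature vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

QUERY = "jaguar xk"
# Stand-in for the Web search results initially returned for the query.
RESULTS = [
    "jaguar xk coupe review engine and car specs",
    "used jaguar xk car dealer listings",
]
CAR_AD = "luxury sedan deals from your local car dealer"
ZOO_AD = "zoo tickets see big cats up close"

query_vec = augment_query(QUERY, RESULTS, TAXONOMY)
car_score = cosine(query_vec, ad_features(CAR_AD, TAXONOMY))
zoo_score = cosine(query_vec, ad_features(ZOO_AD, TAXONOMY))
```

In this toy setting the query shares no words with either ad, so a pure bag-of-words match scores both at zero. The class feature derived from the search results is what lets the car ad outscore the zoo ad.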
CITED BY 8
Xuerui Wang, Andrei Broder, Evgeniy Gabrilovich, Vanja Josifovski, Bo Pang, Cross-lingual query classification: a preliminary study, Proceeding of the 2nd ACM workshop on Improving non english web searching, October 30-30, 2008, Napa Valley, California, USA

Hila Becker, Andrei Broder, Evgeniy Gabrilovich, Vanja Josifovski, Bo Pang, Context transfer in search advertising, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA

Xuerui Wang, Andrei Broder, Evgeniy Gabrilovich, Vanja Josifovski, Bo Pang, Cross-language query classification using web search for exogenous knowledge, Proceedings of the Second ACM International Conference on Web Search and Data Mining, February 09-12, 2009, Barcelona, Spain

Andrei Broder, Peter Ciccolo, Evgeniy Gabrilovich, Vanja Josifovski, Donald Metzler, Lance Riedel, Jeffrey Yuan, Online expansion of rare queries for sponsored search, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain

Evgeniy Gabrilovich, Andrei Broder, Marcus Fontoura, Amruta Joshi, Vanja Josifovski, Lance Riedel, Tong Zhang, Classifying search queries using the Web as a source of knowledge, ACM Transactions on the Web (TWEB), v.3 n.2, p.1-28, April 2009

Yunzhang Zhu, Gang Wang, Junli Yang, Dakan Wang, Jun Yan, Zheng Chen, Revenue optimization with relevance constraint in sponsored search, Proceedings of the Third International Workshop on Data Mining and Audience Intelligence for Advertising, p.55-60, June 28-28, 2009, Paris, France