ACM Home Page
Please provide us with feedback. Feedback
Catching the drift: learning broad matches from clickthrough data
Full text MovMov (11:03),  PdfPdf (466 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Paris, France
SESSION: Industrial track papers table of contents
Pages 1165-1174  
Year of Publication: 2009
ISBN:978-1-60558-495-9
Authors
Sonal Gupta  University of Texas at Austin, Austin, TX, USA
Mikhail Bilenko  Microsoft Research, Redmond, WA, USA
Matthew Richardson  Microsoft Research, Redmond, WA, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 25,   Downloads (12 Months): 91,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557019.1557145
What is a DOI?

ABSTRACT

Identifying similar keywords, known as broad matches, is an important task in online advertising that has become a standard feature on all major keyword advertising platforms. Effective broad matching leads to improvements in both relevance and monetization, while increasing advertisers' reach and making campaign management easier. In this paper, we present a learning-based approach to broad matching that is based on exploiting implicit feedback in the form of advertisement clickthrough logs. Our method can utilize arbitrary similarity functions by incorporating them as features. We present an online learning algorithm, Amnesiac Averaged Perceptron, that is highly efficient yet able to quickly adjust to the rapidly-changing distributions of bidded keywords, advertisements and user behavior. Experimental results obtained from (1) historical logs and (2) live trials on a large-scale advertising platform demonstrate the effectiveness of the proposed algorithm and the overall success of our approach in identifying high-quality broad match mappings.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
IAB Internet Advertising Revenue Report conducted by PricewaterhouseCoopers (PWC), November 2008. {online} http://www.iab.net/insights_research/530422/1357.
2
3
 
4
 
5
R. C. Bunescu and M. Pasca. Using encyclopedic knowledge for named entity disambiguation. In Proceedings of EACL-06, 2006.
6
 
7
B. Carterette and R. Jones. Evaluating search engines by modeling the relationship between relevance and clicks. In Proceedings of NIPS-07, 2007.
8
 
9
10
 
11
W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In Proceedings of the IJCAI-03 Workshop on Information Integration on the Web, pages 73--78, 2003.
 
12
 
13
S. Cucerzan and E. Brill. Spelling correction as an iterative process that exploits the collective knowledge of web users. In Proceedings of EMNLP-04, pages 293--300, 2004.
 
14
 
15
 
16
17
18
 
19
20
 
21
22
23
 
24
25
 
26
D. Metzler, S. T. Dumais, and C. Meek. Similarity measures for short segments of text. In Proceedings of ECIR-07, 2007.
 
27
J. C. Platt. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In A. J. Smola, P. Bartlett, B. Schölkopf, and D. Schuurmans, editors, Advances in Large Margin Classifiers, pages 185--208. MIT Press, 1999.
28
 
29
M. Regelson and D. Fain. Predicting click-through rate using keyword clusters. In Proceedings of the Second Workshop on Sponsored Search Auctions, 2006.
30
 
31
32
33
34
 
35
C. Wang, P. Zhang, and R. Choi. Understanding consumers attitude toward advertising. In Eighth Americas Conference on Information Systems, pages 1143--1148, 2002.
 
36
 
37
W. T. Yih and C. Meek. Improving similarity measures for short segments of text. In Proceedings of AAAI-07, 2007.
38

Collaborative Colleagues:
Sonal Gupta: colleagues
Mikhail Bilenko: colleagues
Matthew Richardson: colleagues