ABSTRACT
Content-targeted advertising, the task of automatically associating ads with a Web page, is a key Web monetization strategy. It also introduces challenging technical problems and raises interesting questions. For instance, how can ranking functions be designed to satisfy conflicting goals, such as selecting advertisements (ads) that are relevant to users while remaining suitable and profitable for publishers and advertisers? In this paper we propose a new framework, based on Genetic Programming (GP), for associating ads with Web pages. Our GP method learns functions that select the most appropriate ads given the contents of a Web page. These ranking functions are designed to optimize overall precision and minimize the number of misplacements. Using a real ad collection and Web pages from a newspaper, we obtained a gain of 61.7% in average precision over a state-of-the-art baseline method. Further, by evolving individuals to provide good ranking estimations, GP was able to discover ranking functions that are very effective at placing ads in Web pages while avoiding irrelevant ones.
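The abstract reports its gain in average precision but does not define the measure or the GP fitness function. As a rough illustration only, the sketch below computes the standard non-interpolated average precision of one ranked ad list. It assumes binary relevance judgments per page, and the function name, parameters, and ad identifiers are hypothetical rather than taken from the paper.

```python
from typing import Sequence, Set


def average_precision(ranked_ads: Sequence[str], relevant: Set[str]) -> float:
    """Non-interpolated average precision of one ranked ad list.

    Assumption: relevance is binary and `relevant` holds every ad judged
    relevant to the page, whether or not it was retrieved.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, ad in enumerate(ranked_ads, start=1):
        if ad in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant ad appears.
            precision_sum += hits / rank
    # Dividing by all relevant ads penalizes relevant ads that were never ranked.
    return precision_sum / len(relevant)


# Hypothetical ranking for one page: relevant ads sit at ranks 1 and 3.
score = average_precision(["ad1", "ad2", "ad3", "ad4"], {"ad1", "ad3"})
```

Here `score` is (1/1 + 2/3) / 2. A GP system could plausibly average such scores over a set of training pages to rate each candidate ranking function, although the abstract does not confirm that this is the fitness function used.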
CITED BY 20
Avaré Stewart, Ling Chen, Raluca Paiu, Wolfgang Nejdl, Discovering information diffusion paths from blogosphere for online advertising, Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising, p.46-54, August 12, 2007, San Jose, California
Hua Li, Duo Zhang, Jian Hu, Hua-Jun Zeng, Zheng Chen, Finding keyword from online broadcasting content for targeted advertising, Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising, p.55-62, August 12, 2007, San Jose, California
Aris Anagnostopoulos, Andrei Z. Broder, Evgeniy Gabrilovich, Vanja Josifovski, Lance Riedel, Just-in-time contextual advertising, Proceedings of the sixteenth ACM conference on information and knowledge management, November 06-10, 2007, Lisbon, Portugal
Andrei Broder, Massimiliano Ciaramita, Marcus Fontoura, Evgeniy Gabrilovich, Vanja Josifovski, Donald Metzler, Vanessa Murdock, Vassilis Plachouras, To swing or not to swing: learning when (not) to advertise, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
Ricardo da S. Torres, Alexandre X. Falcão, Marcos A. Gonçalves, João P. Papa, Baoping Zhang, Weiguo Fan, Edward A. Fox, A genetic programming framework for content-based image retrieval, Pattern Recognition, v.42 n.2, p.283-292, February, 2009
Andrei Z. Broder, Peter Ciccolo, Marcus Fontoura, Evgeniy Gabrilovich, Vanja Josifovski, Lance Riedel, Search advertising using web relevance feedback, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
Cristiano D. Ferreira, Ricardo da S. Torres, Marcos André Gonçalves, Weiguo Fan, Image retrieval with relevance feedback based on genetic programming, Proceedings of the 23rd Brazilian symposium on Databases, October 13-17, 2008, Campinas, Sao Paulo, Brazil
Haofen Wang, Yan Liang, Linyun Fu, Gui-Rong Xue, Yong Yu, Efficient query expansion for advertisement search, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
Xin-Jing Wang, Mo Yu, Lei Zhang, Rui Cai, Wei-Ying Ma, Argo: intelligent advertising by mining a user's interest from his photo collections, Proceedings of the Third International Workshop on Data Mining and Audience Intelligence for Advertising, p.18-26, June 28, 2009, Paris, France
Yunzhang Zhu, Gang Wang, Junli Yang, Dakan Wang, Jun Yan, Zheng Chen, Revenue optimization with relevance constraint in sponsored search, Proceedings of the Third International Workshop on Data Mining and Audience Intelligence for Advertising, p.55-60, June 28, 2009, Paris, France
Yunzhang Zhu, Gang Wang, Junli Yang, Dakan Wang, Jun Yan, Jian Hu, Zheng Chen, Optimizing search engine revenue in sponsored search, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA