| Sensitive webpage classification for content advertising |
| Full text |
Pdf
(429 KB)
|
| Source
|
International Conference on Knowledge Discovery and Data Mining
archive
Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising
table of contents
San Jose, California
Pages 28-33
Year of Publication: 2007
ISBN:978-1-59593-833-6
|
|
Authors
|
|
Xin Jin
|
Microsoft Corporation, Redmond, WA
|
|
Ying Li
|
Microsoft Corporation, Redmond, WA
|
|
Teresa Mah
|
Microsoft Corporation, Redmond, WA
|
|
Jie Tong
|
Microsoft Corporation, Redmond, WA
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 16, Downloads (12 Months): 131, Citation Count: 5
|
|
|
ABSTRACT
Online advertising has been a popular topic in recent years. In this paper, we address one of the important problems in online advertising, i.e., how to detect whether a publisher webpage contains sensitive content and is appropriate for showing advertisement(s) on it. We take a webpage classification approach to solve this problem. First we design a unique sensitive content taxonomy. Then we adopt an iterative training data collection and classifier building approach, to build a hierarchical classifier which can classify webpages into one of the nodes in the sensitive content taxonomy. The experimental result show that using this approach, we are able to build a unique sensitive content classifier with decent accuracy while only requiring limited amount of human labeling effort.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
 |
5
|
|
| |
6
|
|
| |
7
|
C. W. Hsu and C. J. Lin, A Comparison of Methods for Multi-class Support Vector Machines. Technical report, Department of Computer Science and Information Engineering, National Taiwan University, 2001.
|
 |
8
|
|
 |
9
|
|
 |
10
|
Tie-Yan Liu , Yiming Yang , Hao Wan , Hua-Jun Zeng , Zheng Chen , Wei-Ying Ma, Support vector machines classification with a very large-scale taxonomy, ACM SIGKDD Explorations Newsletter, v.7 n.1, p.36-43, June 2005
[doi> 10.1145/1089815.1089821]
|
| |
11
|
|
| |
12
|
|
CITED BY 5
|
|
|
|
|
|
|
|
Andrei Broder , Massimiliano Ciaramita , Marcus Fontoura , Evgeniy Gabrilovich , Vanja Josifovski , Donald Metzler , Vanessa Murdock , Vassilis Plachouras, To swing or not to swing: learning when (not) to advertise, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
Andrei Z. Broder , Peter Ciccolo , Marcus Fontoura , Evgeniy Gabrilovich , Vanja Josifovski , Lance Riedel, Search advertising using web relevance feedback, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
Yi Zhang , Arun C. Surendran , John C. Platt , Mukund Narasimhan, Learning from multi-topic web documents for contextual advertisement, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|