|
ABSTRACT
Click fraud is jeopardizing the industry of Internet advertising. Internet advertising is crucial for the thriving of the entire Internet, since it allows producers to advertise their products, and hence contributes to the well being of e-commerce. Moreover, advertising supports the intellectual value of the Internet by covering the running expenses of publishing content. Some content publishers are dishonest, and use automation to generate traffic to defraud the advertisers. Similarly, some advertisers automate clicks on the advertisements of their competitors to deplete their competitors' advertising budgets. This paper describes the advertising network model, and focuses on the most sophisticated type of fraud, which involves coalitions among fraudsters. We build on several published theoretical results to devise the Similarity-Seeker algorithm that discovers coalitions made by pairs of fraudsters. We then generalize the solution to coalitions of arbitrary sizes. Before deploying our system on a real network, we conducted comprehensive experiments on data samples for proof of concept. The results were very accurate. We detected several coalitions, formed using various techniques, and spanning numerous sites. This reveals the generality of our model and approach.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
E. Akkoyunlu. The Enumeration of Maximal Cliques of Large Graphs. SIAM Journal on Computing, 2(1):1--6, 1973.
|
| |
3
|
Vinod Anupam , Alain Mayer , Kobbi Nissim , Benny Pinkas , Michael K. Reiter, On the security of pay-per-click and other Web advertising schemes, Proceeding of the eighth international conference on World Wide Web, p.1091-1100, May 1999, Toronto, Canada
|
 |
4
|
|
 |
5
|
|
| |
6
|
T. Bohman, C. Cooper, and A. Frieze. Min-Wise Independent Linear Permutations. Electronic Journal of Combinatorics, 7:R26, 2000.
|
| |
7
|
A. Bowker and G. Lieberman. Engineering Statistics, 2nd Edition. Prentice Hall, 1972.
|
| |
8
|
U. Brandes, M. Gaertler, and D. Wagner. Experiments on Graph Clustering Algorithms. In Proceedings of the 11th ESA European Symposium on Algorithms, pages 568--579, 2003.
|
| |
9
|
|
| |
10
|
|
 |
11
|
Andrei Z. Broder , Moses Charikar , Alan M. Frieze , Michael Mitzenmacher, Min-wise independent permutations (extended abstract), Proceedings of the thirtieth annual ACM symposium on Theory of computing, p.327-336, May 24-26, 1998, Dallas, Texas, United States
[doi> 10.1145/276698.276781]
|
| |
12
|
|
| |
13
|
Andrei Z. Broder , Steven C. Glassman , Mark S. Manasse , Geoffrey Zweig, Syntactic clustering of the Web, Selected papers from the sixth international conference on World Wide Web, p.1157-1166, September 1997, Santa Clara, California, United States
|
 |
14
|
|
| |
15
|
CERT Coordination Center. CERT Advisory CA-1996-21 TCP SYN Flooding and IP Spoofing Attacks. http://www.cert.org/advisories/CA-1996-21.html, September 19 1996.
|
 |
16
|
|
 |
17
|
Surajit Chaudhuri , Rajeev Motwani , Vivek Narasayya, On random sampling over joins, Proceedings of the 1999 ACM SIGMOD international conference on Management of data, p.263-274, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
 |
18
|
|
| |
19
|
|
| |
20
|
L. Gerhards and W. Lindenberg. Clique Detection for Nondirected Graphs: Two New Algorithms. Computing, Volume 21(4):295--322, 1979.
|
| |
21
|
|
 |
22
|
|
| |
23
|
K. Holzapfel, S. Kosub, M. Maaß, and H. Täubig. The Complexity of Detecting Fixed-Density Clusters. In Proceedings of the 5th CIAC Italian Conference on Algorithms and Complexity, pages 201--212, 2003.
|
| |
24
|
|
| |
25
|
|
| |
26
|
H. Johnston. Cliques of a Graph-Variations on the Bron-Kerbosch Algorithm. International Journal of Computer and Information Sciences, 5(3):209--238, 1976.
|
| |
27
|
|
| |
28
|
|
| |
29
|
M. Liedtke. Google to Pay $90M in `Click Fraud' Case. Washington Post Magazine, March 9 2006.
|
| |
30
|
M. Liedtke. Yahoo Settles `Click Fraud' Lawsuit. MSNBC News, June 28 2006.
|
| |
31
|
E. Loukakis. A New Backtracking Algorithm for Generating the Family of Maximal Independent Sets of a Graph. Computers & Mathematics with Applications, 9(4):583--589, 1983.
|
| |
32
|
C. Mann. How Click Fraud Could Swallow the Internet. Wired Magazine, January 2006.
|
| |
33
|
R. McGann. Study: Consumers Delete Cookies at Surprising Rate. ClickZ News, March 14 2005.
|
 |
34
|
|
| |
35
|
|
| |
36
|
A. Metwally, D. Agrawal, and A. El Abbadi. Hide and Seek: Detecting Hit Inflation Fraud in Streams of Web Advertising Networks. Technical Report 2006-06, University of California, Santa Barbara, Department of Computer Science, 2006.
|
| |
37
|
J. Moon and L. Moser. On cliques in graphs. Israel journal of Mathematics, 3:23--28, 1965.
|
| |
38
|
M. Naor and B. Pinkas. Secure and Efficient Metering. In Proceedings EUROCRYPT International Conference on the Theory and Application of Cryptographic Techniques, pages 576--590, 1998.
|
| |
39
|
S. Olsen. Click Fraud Roils Search Advertisers. CNET News, March 4 2005.
|
| |
40
|
|
| |
41
|
|
| |
42
|
J. Sima and S. Schaeffer. On the NP-Completeness of Some Graph Cluster Measures. In Proceedings of the 32nd SOFSEM Conference on Current Trends in Theory and Practice of Informatics, pages 530--537, 2006.
|
| |
43
|
E. Tomita, A. Tanaka, and H. Takahashi. The Worst-Case Time Complexity for Generating All Maximal Cliques. In Proceedings of the 10th COCOON Annual International Conference on Computing and Combinatorics, pages 161--170, 2004.
|
| |
44
|
S. Tsukiyama, M. Ide, H. Ariyoshi, and I. Shirakawa. A New Algorithm for Generating All the Maximal Independent Sets. SIAM Journal on Computing, 6(3):505--517, 1977.
|
| |
45
|
D. Vise. Clicking To Steal. Washington Post Magazine, page F01, April 17 2005.
|
 |
46
|
|
| |
47
|
T. Zeller Jr. With Each Technology Advance, a Scourge. The New York Times, October 18 2004.
|
|