ACM Home Page
Please provide us with feedback. Feedback
Spamalytics: an empirical analysis of spam marketing conversion
Full text Digital EditionDigital Edition HtmlHtml (55 KB),  PdfPdf (2.00 MB)
Source
Communications of the ACM archive
Volume 52 ,  Issue 9  (September 2009) table of contents
The Status of the P versus NP Problem
SECTION: Research highlights table of contents
Pages 99-107  
Year of Publication: 2009
ISSN:0001-0782
Authors
Chris Kanich  Department of Computer Science and Engineering University of California, San Diego
Christian Kreibich  International Computer Science Institute Berkeley
Kirill Levchenko  Department of Computer Science and Engineering University of California, San Diego
Brandon Enright  Department of Computer Science and Engineering University of California, San Diego
Geoffrey M. Voelker  Department of Computer Science and Engineering University of California, San Diego
Vern Paxson  International Computer Science Institute Berkeley
Stefan Savage  Department of Computer Science and Engineering University of California, San Diego
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 188,   Downloads (12 Months): 482,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1562164.1562190
What is a DOI?

ABSTRACT

Spam-based marketing is a curious beast. We all receive the advertisements---"Excellent hardness is easy!"---but few of us have encountered a person who admits to following through on this offer and making a purchase. And yet, the relentlessness by which such spam continually clogs Internet inboxes, despite years of energetic deployment of antispam technology, provides undeniable testament that spammers find their campaigns profitable. Someone is clearly buying. But how many, how often, and how much?

Unraveling such questions is essential for understanding the economic support for spam and hence where any structural weaknesses may lie. Unfortunately, spammers do not file quarterly financial reports, and the underground nature of their activities makes third-party data gathering a challenge at best. Absent an empirical foundation, defenders are often left to speculate as to how successful spam campaigns are and to what degree they are profitable. For example, IBM's Joshua Corman was widely quoted as claiming that spam sent by the Storm worm alone was generating "millions and millions of dollars every day."1 While this claim could in fact be true, we are unaware of any public data or methodology capable of confirming or refuting it.

The key problem is our limited visibility into the three basic parameters of the spam value proposition: the cost to send spam, offset by the "conversion rate" (probability that an email sent will ultimately yield a "sale"), and the marginal profit per sale. The first and last of these are self-contained and can at least be estimated based on the costs charged by third-party spam senders and through the pricing and gross margins offered by various Interne marketing "affiliate programs."a However, the conversion rate depends fundamentally on group actions---on what hundreds of millions of Internet users do when confronted with a new piece of spam---and is much harder to obtain. While a range of anecdotal numbers exist, we are unaware of any well-documented measurement of the spam conversion rate.b

In part, this problem is methodological. There are no apparent methods for indirectly measuring spam conversion. Thus, the only obvious way to extract this data is to build an e-commerce site, market it via spam, and then record the number of sales. Moreover, to capture the spammer's experience with full fidelity, such a study must also mimic their use of illicit botnets for distributing email and proxying user responses. In effect, the best way to measure spam is to be a spammer.

In this paper, we have effectively conducted this study, though sidestepping the obvious legal and ethical problems associated with sending spam.c Critically, our study makes use of an existing spamming botnet. By infiltrating the botnet parasitically, we convinced it to modify a subset of the spam it already sends, thereby directing any interested recipients to Web sites under our control, rather than those belonging to the spammer. In turn, our Web sites presented "defanged" versions of the spammer's own sites, with functionality removed that would compromise the victim's system or receive sensitive personal information such as name, address or credit card information.

Using this methodology, we have documented three spam campaigns comprising over 469 million emails. We identified how much of this spam is successfully delivered, how much is filtered by popular antispam solutions, and, most importantly, how many users "click-through" to the site being advertised (response rate) and how many of those progress to a "sale" or "infection" (conversion rate).

The remainder of this paper is structured as follows. Section 2 describes the economic basis for spam and reviews prior research in this area. Section 4 describes our experimental methodology for botnet infiltration. Section 5 describes our spam filtering and conversion results, Section 6 analyzes the effects of blacklisting on spam delivery, and Section 7 analyzes the possible influences on spam responses. We synthesize our findings in Section 8 and conclude.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Akass, C. Storm worm 'making millions a day.' http://www.pcw.co.uk/personal-computer-world/news/2209293/strom-worm-making-millions-day, February 2008.
 
2
Anderson, D.S., Fleizach, C., Savage, S., Voelker, G.M. Spamscatter: Characterizing internet scam hosting infrastructure. In Proceedings of the USENIX Security Symposium (Boston, MA, August 2007).
 
3
Angwin, J. Elusive Spammer Sends EarthLink on Long Chase. http://online.wsj.com/article_email/SB105225593382372600.html, May 2003.
 
4
D. M. Association. DMA Releases 5th Annual 'Response Rate Trends Report.' http://www.the-dma.org/cgi/disppressrelease?article=1008, october 2007.
 
5
Boehme, R., Ho, T. The effect of stock spam on financial markets. In Proceedings of the 5th Workshop on the Economics of Information Security (WEIS) (June 2006).
 
6
Composite Blocking List (CBL). http://cbl.abuseat.org/, March 2008.
 
7
Frieder, L., Zittrain, J. Spam works: evidence from stock touts and corresponding market activity. Berkman Center Research Publication, 2006.
 
8
Goodman, J., Rounthwaite, R. Stopping outgoing spam. Proceedings of the 5th ACM Conference on Electronic Commerce (2004), 30--39.
 
9
Hanke, M., Hauser, F. On the effects of stock spam emails. J. Financ. Mark. 11, 1 (2008), 57--83.
 
10
Kirk, J. Former spammer: 'I Know I'm Going to Hell.' http://www.macworld.com/article/58997/2007/07/spammer.html, July 2007.
 
11
Kreibich, C., Kanich, C., Levchenko, K., Enright, B., Voelker, G.M., Paxson, V., Savage, S. On the Spam Campaign Trail. In First USENIX Workshop on Large-Scale Exploits and Emergent Threats (LEET'08), April 2008.
 
12
Judge, W.Y.P., Alperovitch, D. Understanding and Reversing the Profit Model of Spam. In Workshop on Economics of Information Security 2005 (WEIS 2005) (Boston, MA, USA, June 2005).
 
13
Watson, D. All Spammers Go to Hell (posting to funsec list). http://www.mail-archive.com/funsec%40linuxbox.org/msg03346.html, July 2007.
 
14
Wilson, T. Competition May Be Driving surge in Botnets, spam. http://www.darkreading.com/document.asp?doc_id=142690, 200