ABSTRACT
In the span of only a few years, the Internet has experienced an astronomical increase in the use of specialized content delivery systems, such as content delivery networks and peer-to-peer file sharing systems. Therefore, an understanding of content delivery on the Internet now requires a detailed understanding of how these systems are used in practice. This paper examines content delivery from the point of view of four content delivery systems: HTTP web traffic, the Akamai content delivery network, and Kazaa and Gnutella peer-to-peer file sharing traffic. We collected a trace of all incoming and outgoing network traffic at the University of Washington, a large university with over 60,000 students, faculty, and staff. From this trace, we isolated and characterized traffic belonging to each of these four delivery classes. Our results (1) quantify the rapidly increasing importance of new content delivery systems, particularly peer-to-peer networks, (2) characterize the behavior of these systems from the perspectives of clients, objects, and servers, and (3) derive implications for caching in these systems.