|
ABSTRACT
One role for workload generation is as a means for understanding how servers and networks respond to variation in load. This enables management and capacity planning based on current and projected usage. This paper applies a number of observations of Web server usage to create a realistic Web workload generation tool which mimics a set of real users accessing a server. The tool, called Surge (Scalable URL Reference Generator) generates references matching empirical measurements of 1) server file size distribution; 2) request size distribution; 3) relative file popularity; 4) embedded file references; 5) temporal locality of reference; and 6) idle periods of individual users. This paper reviews the essential elements required in the generation of a representative Web workload. It also addresses the technical challenges to satisfying this large set of simultaneous constraints on the properties of the reference stream, the solutions we adopted, and their associated accuracy. Finally, we present evidence that Surge exercises servers in a manner significantly different from other Web server benchmarks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Virgílio Almeida , Azer Bestavros , Mark Crovella , Adriana de Oliveira, Characterizing reference locality in the WWW, Proceedings of the fourth international conference on on Parallel and distributed information systems, p.92-107, December 18-20, 1996, Miami Beach, Florida, United States
|
 |
2
|
|
| |
3
|
Henry Braun. A simple metllod for testing goodness of fit in tile presence of nuisance parameters. Journal of the Royal Statistical Society, }980.
|
| |
4
|
|
| |
5
|
The Standard Performance Evaluation Corporation. Specweb96, http://www.specbench.org/org/web96/.
|
 |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
S. Deng. Empirical model of WWW document arrivals at access link. In Proceedings of the 1996 IEEE International Conference on Communication, June 1996.
|
| |
10
|
|
| |
11
|
A. Feldmann. Modelling characteristics of tcp connections. Technical report, AT$cT Laboratories, 1996.
|
| |
12
|
|
| |
13
|
|
| |
14
|
R. Mattson, .I. Gecsei, D. Slutz, and I. Traiger. Evaluation techniques and storage hierarchies. IBM Systems Journal, 9:78-117, 1970.
|
| |
15
|
d.C. Mogul. Network behavior of a busy web server and its clients. Technical Report WRL 95/5, DEC Western Research Laboratory, Palo Alto, CA, 1995.
|
| |
16
|
University of Minnesota. {(}stone version 1. htt p://web 66. coled, umn .ed u / gsto ne/in fo.html.
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
Gene Trent and Mark Sake. Webstone: The first generation in http server benchmarking, February 1995. Silicon (#raphics White Paper.
|
| |
21
|
|
| |
22
|
G. K. Zipf. Human Behavior and the Principle of Least- Effort. Addison-Wesley, Cambridge, MA, 1949.
|
CITED BY 269
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Polly Huang , Anja Feldmann , Walter Willinger, A non-instrusive, wavelet-based approach to detecting network performance problems, Proceedings of the 1st ACM SIGCOMM Workshop on Internet Measurement, November 01-02, 2001, San Francisco, California, USA
|
|
|
Eveline Veloso , Virgílio Almeida , Wagner Meira , Azer Bestavros , Shudong Jin, A hierarchical characterization of a live streaming media workload, Proceedings of the 2nd ACM SIGCOMM Workshop on Internet measurment, November 06-08, 2002, Marseille, France
|
|
|
|
|
|
Milo M. K. Martin , Daniel J. Sorin , Anastassia Ailamaki , Alaa R. Alameldeen , Ross M. Dickson , Carl J. Mauer , Kevin E. Moore , Manoj Plakal , Mark D. Hill , David H. Wood, Timestamp snooping: an approach for extending SMPs, ACM SIGPLAN Notices, v.35 n.11, p.25-36, Nov. 2000
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Gregory R. Ganger , Dawson R. Engler , M. Frans Kaashoek , Hector M. Briceño , Russell Hunt , Thomas Pinckney, Fast and flexible application-level networking on exokernel systems, ACM Transactions on Computer Systems (TOCS), v.20 n.1, p.49-83, February 2002
|
|
|
|
|
|
|
|
|
Youngmi Joo , Vinay Ribeiro , Anja Feldmann , Anna C. Gilbert , Walter Willinger, TCP/IP traffic dynamics and network performance: a lesson in workload modeling, flow control, and trace-driven simulations, ACM SIGCOMM Computer Communication Review, v.31 n.2, April 2001
|
|
|
|
|
|
Niranjan Joshi , Srinivas R. Kadaba , Sarvar Patel , Ganapathy S. Sundaram, Downlink scheduling in CDMA data networks, Proceedings of the 6th annual international conference on Mobile computing and networking, p.179-190, August 06-11, 2000, Boston, Massachusetts, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Erich Plasser , Thomas Ziegler , Peter Reichl, On the non-linearity of the RED drop function, Proceedings of the 15th international conference on Computer communication, p.515-534, August 12-14, 2002, Mumbai, Maharashtra, India
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Krishna P. Gummadi , Richard J. Dunn , Stefan Saroiu , Steven D. Gribble , Henry M. Levy , John Zahorjan, Measurement, modeling, and analysis of a peer-to-peer file-sharing workload, Proceedings of the nineteenth ACM symposium on Operating systems principles, October 19-22, 2003, Bolton Landing, NY, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wenting Tang , Yun Fu , Ludmila Cherkasova , Amin Vahdat, MediSyn: a synthetic streaming media service workload generator, Proceedings of the 13th international workshop on Network and operating systems support for digital audio and video, June 01-03, 2003, Monterey, CA, USA
|
|
|
|
|
|
|
|
|
|
|
|
Hani Jamjoom , Kang G. Shin, Persistent dropping: an efficient control of traffic aggregates, Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications, August 25-29, 2003, Karlsruhe, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Alaa R. Alameldeen , Milo M. K. Martin , Carl J. Mauer , Kevin E. Moore , Min Xu , Mark D. Hill , David A. Wood , Daniel J. Sorin, Simulating a $2M Commercial Server on a $2K PC, Computer, v.36 n.2, p.50-57, February 2003
|
|
|
Daniel A. Menascé , Virgilio A. F. Almeida , Rodrigo Fonseca , Marco A. Mendes, A methodology for workload characterization of E-commerce sites, Proceedings of the 1st ACM conference on Electronic commerce, p.119-128, November 03-05, 1999, Denver, Colorado, United States
|
|
|
|
|
|
Milo M. K. Martin , Daniel J. Sorin , Anatassia Ailamaki , Alaa R. Alameldeen , Ross M. Dickson , Carl J. Mauer , Kevin E. Moore , Manoj Plakal , Mark D. Hill , David A. Wood, Timestamp snooping: an approach for extending SMPs, ACM SIGARCH Computer Architecture News, v.28 n.5, p.25-36, Dec. 2000
|
|
|
|
|
|
|
|
|
|
|
|
G. I. Papadimitriou , A. I. Vakali , G. Pallis , S. Petridou , A. S. Pomportsis, Simulation in Web data management, Applied system simulation: methodologies and applications, Kluwer Academic Publishers, Norwell, MA, 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Matthew Roughan , Subhabrata Sen , Oliver Spatscheck , Nick Duffield, Class-of-service mapping for QoS: a statistical signature-based approach to IP traffic classification, Proceedings of the 4th ACM SIGCOMM conference on Internet measurement, October 25-27, 2004, Taormina, Sicily, Italy
|
|
|
|
|
|
|
|
|
Antonio Magnaghi , Takeo Hamada , Tsuneo Katsuyama, A wavelet-based framework for proactive detection of network misconfigurations, Proceedings of the ACM SIGCOMM workshop on Network troubleshooting: research, theory and operations practice meet malfunctioning reality, September 03-03, 2004, Portland, Oregon, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Raúl Peña-Ortiz , Julio Sahuquillo , Ana Pont , José A. Gil, Modeling continuous changes of the user's dynamic behavior in the WWW, Proceedings of the 5th international workshop on Software and performance, p.175-180, July 12-14, 2005, Palma, Illes Balears, Spain
|
|
|
|
|
|
|
|
|
|
|
|
Sunil Nakrani , Craig Tovey, On Honey Bees and Dynamic Server Allocation in Internet Hosting Centers, Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems, v.12 n.3-4, p.223-240, September-December 2004
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wee Teck Ng , Hao Sun , Bruce Hillyer , Elizabeth Shriver , Eran Gabber , Banu Ozden, Obtaining High Performance for Storage Outsourcing, Proceedings of the 1st USENIX Conference on File and Storage Technologies, January 28-30, 2002, Monterey, CA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Eveline Veloso , Virgílio Almeida , Wagner Meira, Jr. , Azer Bestavros , Shudong Jin, A hierarchical characterization of a live streaming media workload, IEEE/ACM Transactions on Networking (TON), v.14 n.1, p.133-146, February 2006
|
|
|
|
|
|
|
|
|
|
|
|
Simon Künzli , Francesco Poletti , Luca Benini , Lothar Thiele, Combining simulation and formal methods for system-level performance analysis, Proceedings of the conference on Design, automation and test in Europe: Proceedings, March 06-10, 2006, Munich, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Sugato Bagchi , Eugene Hung , Arun Iyengar , Norbert Vogl , Noshir Wadia, Capacity planning tools for web and grid environments, Proceedings of the 1st international conference on Performance evaluation methodolgies and tools, October 11-13, 2006, Pisa, Italy
|
|
|
|
|
|
Taeho Kgil , Shaun D'Souza , Ali Saidi , Nathan Binkert , Ronald Dreslinski , Trevor Mudge , Steven Reinhardt , Krisztian Flautner, PicoServer: using 3D stacking technology to enable a compact energy efficient chip multiprocessor, ACM SIGPLAN Notices, v.41 n.11, November 2006
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Félix Hernández-Campos , Merkouris Karaliopoulos , Maria Papadopouli , Haipeng Shen, Spatio-temporal modeling of traffic workload in a campus WLAN, Proceedings of the 2nd annual international workshop on Wireless internet, p.1-es, August 02-05, 2006, Boston, Massachusetts
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ronald P. Doyle , Jeffrey S. Chase , Omer M. Asad , Wei Jin , Amin M. Vahdat, Model-based resource provisioning in a web service utility, Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems, p.5-5, March 26-28, 2003, Seattle, WA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Dennis Lee , Jean-Loup Baer , Brian Bershad , Tom Anderson, Reducing startup latency in web and desktop applications, Proceedings of the 3rd conference on USENIX Windows NT Symposium, p.17-17, July 12-15, 1999, Seattle, Washington
|
|
|
|
|
|
K. Sleurs , J. Potemans , J. Theunis , D. Li , E. Van Lil , A. Van de Capelle, Evaluation of network traffic workload scaling techniques, Computer Communications, v.30 n.16, p.3096-3106, November, 2007
|
|
|
|
|
|
|
|
|
Huajing Li , Wang-Chien Lee , Anand Sivasubramaniam , Lee Giles, SearchGen: a synthetic workload generator for scientific literature digital libraries and search engines, Proceedings of the 2007 conference on Digital libraries, June 18-23, 2007, Vancouver, BC, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Han Cai , Do Young Eun , Sangtae Ha , Injong Rhee , Lisong Xu, Stochastic convex ordering for multiplicative decrease internet congestion control, Computer Networks: The International Journal of Computer and Telecommunications Networking, v.53 n.3, p.365-381, February, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xabiel G. Paneda , R. Garcia , D. Melendi , M. Vilas , V. Garcia, Popularity analysis of a video-on-demand service in a digital newspaper: influence of the subject, video characteristics and new content publication policy, International Journal of Advanced Media and Communication, v.1 n.4, p.369-385, August 2007
|
|
|
|
|
|
Ana Bosque , Pablo Ibañez , Víctor Viñals , Per Stenström , Jose M. Llabería, Characterization of Apache web server with Specweb2005, Proceedings of the 2007 workshop on MEmory performance: DEaling with Applications, systems and architecture, p.65-72, September 16-16, 2007, Brasov, Romania
|
|
|
|
|
|
|
|
|
David G. Cameron , Rubén Carvajal-Schiaffino , A. Paul Millar , Caitriana Nicholson , Kurt Stockinger , Floriano Zini, Evaluating Scheduling and Replica Optimisation Strategies in OptorSim, Proceedings of the 4th International Workshop on Grid Computing, p.52, November 17-17, 2003
|
|
|
Dongkook Park , Soumya Eachempati , Reetuparna Das , Asit K. Mishra , Yuan Xie , N. Vijaykrishnan , Chita R. Das, MIRA: A Multi-layered On-Chip Interconnect Router Architecture, ACM SIGARCH Computer Architecture News, v.36 n.3, p.251-261, June 2008
|
|
|
|
|
|
Y. C. Tay , Dinh Nguyen Tran , Eric Yi Liu , Wei Tsang Ooi , Robert Morris, Equilibrium analysis through separation of user and network behavior, Computer Networks: The International Journal of Computer and Telecommunications Networking, v.52 n.18, p.3405-3420, December, 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Taeho Kgil , Ali Saidi , Nathan Binkert , Steve Reinhardt , Krisztian Flautner , Trevor Mudge, PicoServer: Using 3D stacking technology to build energy efficient servers, ACM Journal on Emerging Technologies in Computing Systems (JETC), v.4 n.4, p.1-34, October 2008
|
|
|
|
|
|
Ping Zhou , Bo Zhao , Yu Du , Yi Xu , Youtao Zhang , Jun Yang , Li Zhao, Frequent value compression in packet-based NoC architectures, Proceedings of the 2009 Conference on Asia and South Pacific Design Automation, January 19-22, 2009, Yokohama, Japan
|
|
|
|
|
|
|
|
|
Xinming He , Christos Papadopoulos , John Heidemann , Urbashi Mitra , Usman Riaz, Remote detection of bottleneck links using spectral and statistical methods, Computer Networks: The International Journal of Computer and Telecommunications Networking, v.53 n.3, p.279-298, February, 2009
|
|
|
|
|
|
Katja Gilly , Carlos Quesada-Granja , Salvador Alcaraz , Carlos Juiz , Ramon Puigjaner, A Statistically Customisable Web Benchmarking Tool, Electronic Notes in Theoretical Computer Science (ENTCS), 232, p.89-99, March, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Tongqing Qiu , Zihui Ge , Seungjoon Lee , Jia Wang , Qi Zhao , Jun Xu, Modeling channel popularity dynamics in a large IPTV system, Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems, June 15-19, 2009, Seattle, WA, USA
|
|
|
|
|
|
Andreas Petlund , Paul Beskow , Jon Pedersen , Espen Søgård Paaby , Carsten Griwodz , Pål Halvorsen, Improving SCTP retransmission delays for time-dependent thin streams, Multimedia Tools and Applications, v.45 n.1-3, p.33-60, October 2009
|
|