ACM Home Page
Please provide us with feedback. Feedback
Service-oriented data denormalization for scalable web applications
Full text PdfPdf (221 KB)
Source
International World Wide Web Conference archive
Proceeding of the 17th international conference on World Wide Web table of contents
Beijing, China
SESSION: Performance and scalability table of contents
Pages 267-276  
Year of Publication: 2008
ISBN:978-1-60558-085-2
Authors
Zhou Wei  Tsinghua University, Beijing, China
Jiang Dejun  Tsinghua University, Beijing, China
Guillaume Pierre  Vrije Universiteit, Amsterdam, Netherlands
Chi-Hung Chi  Tsinghua University, Beijing, China
Maarten van Steen  Vrije Universiteit, Amsterdam, Netherlands
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 192,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1367497.1367535
What is a DOI?

ABSTRACT

Many techniques have been proposed to scale web applications. However, the data interdependencies between the database queries and transactions issued by the applications limit their efficiency. We claim that major scalability improvements can be gained by restructuring the web application data into multiple independent data services with exclusive access to their private data store. While this restructuring does not provide performance gains by itself, the implied simplification of each database workload allows a much more efficient use of classical techniques. We illustrate the data denormalization process on three benchmark applications: TPC-W, RUBiS and RUBBoS. We deploy the resulting service-oriented implementation of TPC-W across an 85-node cluster and show that restructuring its data can provide at least an order of magnitude improvement in the maximum sustainable throughput compared to master-slave database replication, while preserving strong consistency and transactional properties.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
B. Abrahao, V. Almeida, J. Almeida, A. Zhang, D. Beyer, and F. Safai. Self-adaptive SLA-driven capacity management for internet services. In Proc. NOMS, Apr. 2006.
 
2
K. Amiri, S. Park, R. Tewari, and S. Padmanabhan. DBProxy: A dynamic data cache for Web applications. In Proc. ICDE, Mar. 2003.
 
3
C. Amza, E. Cecchet, A. Chanda, A. Cox, S. Elnikety, R. Gil, J. Marguerite, K. Rajamani, and W. Zwaenepoel. Specification and implementation of dynamic web site benchmarks. In Proc. Intl. Workshop on Workload Characterization, Nov. 2002.
 
4
C. Bornhövd, M. Altinel, C. Mohan, H. Pirahesh, and B. Reinwald. Adaptive database caching with DBCache. Data Engineering, 27(2):11--18, June 2004.
 
5
E. Cecchet. C-JDBC: a middleware framework for database clustering. Data Engineering, 27(2):19--26, June 2004.
 
6
 
7
I. Cunha, J. Almeida, V. Almeida, and M. dos Santos. Self-adaptive capacity management for multi-tier virtualized environments. In Proc. Intl. Symposium on Integrated Network Management, May 2007.
 
8
DAS3: The Distributed ASCI Supercomputer 3. http://www.cs.vu.nl/das3/.
9
10
11
12
 
13
14
 
15
Y. Huang and J. Chen. Fragment allocation in distributed database design. Information Science and Engineering, 17(3):491--506, May 2001.
 
16
Java TPC-W implementation distribution. http://www.ece.wisc.edu/pharm/tpcw.shtml.
 
17
L. Kazerouni and K. Karlapalem. Stepwise redesign of distributed relational databases. Technical Report HKUST-CS97-12, Hong Kong Univ. of Science and Technology, Dept. of Computer Science, Sept. 1997.
 
18
 
19
S. Navathe, K. Karlapalem, and M. Ra. A mixed fragmentation methodology for initial distributed database design. Computer and Software Engineering, 3(4), 1995.
20
 
21
C. Olston, A. Manjhi, C. Garrod, A. Ailamaki, B. Maggs, and T. Mowry. A scalability service for dynamic web applications. In Proc. Conf. on Innovative Data Systems Research, Jan. 2005.
 
22
 
23
 
24
M. Rabinovich, Z. Xiao, and A. Agarwal. Computing on the edge: A platform for replicating internet applications. In Proc. Intl. Workshop on Web Content Caching and Distribution, Sept. 2003.
 
25
M. Ronstrom and L. Thalmann. MySQL cluster architecture overview. MySQL Technical White Paper, Apr. 2004.
 
26
RUBBoS: Bulletin board system benchmark. http://jmob.objectweb.org/rubbos.html.
 
27
 
28
29
 
30
S. Sivasubramanian, G. Pierre, M. van Steen, and G. Alonso. GlobeCBC: Content-blind result caching for dynamic web applications. Technical Report IR-CS-022, Vrije Universiteit, Amsterdam, The Netherlands, June 2006.
 
31
 
32
W. D. Smith. TPC-W: Benchmarking an ecommerce solution. White paper, Transaction Processing Performance Council.
33
 
34
TPC-W frequently asked questions, question 2.10: "Why was the concept of atomic set of operations added and what are its requirements?", Aug. 1999.
 
35


Collaborative Colleagues:
Zhou Wei: colleagues
Jiang Dejun: colleagues
Guillaume Pierre: colleagues
Chi-Hung Chi: colleagues
Maarten van Steen: colleagues