ACM Home Page
Please provide us with feedback. Feedback
Affinity-based management of main memory database clusters
Full text PdfPdf (554 KB)
Source ACM Transactions on Internet Technology (TOIT) archive
Volume 2 ,  Issue 4  (November 2002) table of contents
Pages: 307 - 339  
Year of Publication: 2002
ISSN:1533-5399
Author
Minwen Ji  Hewlett Packard Laboratories, Palo Alto, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 71,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/604596.604599
What is a DOI?

ABSTRACT

We study management strategies for main memory database clusters that are interposed between Internet applications and back-end databases as content caches. The task of management is to allocate data across individual cache databases and to route queries to the appropriate databases for execution. The goal is to maximize effective cache capacity and to minimize synchronization cost. We propose an affinity-based management system for main memory database cLUsters (ALBUM). ALBUM executes each query in two stages in order to take advantage of the query affinity that is observed in a wide range of applications. We evaluate the data/query distribution strategy in ALBUM with a set of trace-based simulations. The results show that ALBUM reduces cache miss ratio by a factor of 1.7 to 9 over alternative strategies. We have implemented a prototype of ALBUM, and compare its performance to that of an existing infrastructure: a fully replicated database with large buffer cache. The results show that ALBUM outperforms the existing infrastructure with the same number of server machines by a factor of 2 to 7, and that ALBUM with only 1/3 to 1/2 of the server machines achieves the same throughput as the existing infrastructure.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Allen, N. 2001. Don't waste your storage dollars: What you need to know. Research Note, Gartner Group, 20 March, 2001.
 
2
3
 
4
Aron, M., Sanders, D., Druschel, P., and Zwaenepoel, W. 2000. Scalable content-aware request distribution in cluster-based network servers. In Proceedings of USENIX Annual Technical Conference, June 2000.
5
 
6
Cao, P., and Irani, S. 1997. Cost-aware www proxy caching algorithms. In Proceedings of the USENIX Symposium on Internet Technologies and Systems, Dec. 1997.
 
7
Carter, J. L. and Wegman, M. N. 1979. Universal classes of hash functions. J. Comput. Syst. Sci. 18 (1979).
8
9
 
10
Davis, J. R. 1999. DataLinks: Managing External Data with DB2 Universal Database. IBM, Feb. 1999.
11
12
13
14
15
 
16
17
 
18
Princeton University Campus Directory. http://www.princeton.edu/Siteware/puphf.shtml.
 
19
Iyengar, A. and Challenger, J. 1997. Improving web server performance by caching dynamic data. In Proceedings of the 1st USENIX Symposium on Internet Technologies and Systems, Dec. 1997.
 
20
Ji, M. 2000. A low-cost consistency protocol for replicated directory data in cluster-based storage systems. In Proceedings of 1st IEEE International Conference on Cluster Computing. Extended Abstract/Poster, Full Paper as Technical Report 620-00, Dept. of Computer Science, Princeton University, Nov. 2000.
21
22
23
24
 
25
Navathe, S. B., Karlapalem, K., and Ra, M. 1995. A mixed fragmentation methodology for initial distributed database design. J. Comput. Softw. Eng. 3, 4 (1995).
 
26
Open Market. 1996. FastCGI:A High-Performance Web Server Interface, April 1996.
 
27
28
29
30
31
 
32
33
 
34
Smith, B., Acharya, A., Yang, T., and Zhu, H. 1999. Exploiting result equivalence in caching dynamic web content. In Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, Oct. 1999.
 
35
TimesTen Performance Software. 2000. In-Memory Data Management in the Application Tier, 2000.
 
36
Transaction Processing Performance Council (TPC). 2001. TPC Benchmark C Standard Specification Rev. 5.0, 2001.
 
37
Wirzenius, L. and Oja, J. 1993. The Linux System Administrators' Guide Version 0.6.2. Ch. 5. Linux Documentation Project, 1993.
 
38
Yang, C., and Luo, M. 1999. Efficient support for content-based routing in web server clusters. In Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, Oct. 1999.
 
39
Zhang, X., Barientos, M., Chen, J. B., and Seltzer, M. 1999. Hacc: An architecture for cluster-based web servers. In Proceedings of the 3rd USENIX Windows NT Symposium, July 1999.