|
ABSTRACT
Enterprise-scale storage systems, which can contain hundreds of host computers and storage devices and up to tens of thousands of disks and logical volumes, are difficult to design. The volume of choices that need to be made is massive, and many choices have unforeseen interactions. Storage system design is tedious and complicated to do by hand, usually leading to solutions that are grossly over-provisioned, substantially under-performing or, in the worst case, both.To solve the configuration nightmare, we present minerva: a suite of tools for designing storage systems automatically. Minerva uses declarative specifications of application requirements and device capabilities; constraint-based formulations of the various sub-problems; and optimization techniques to explore the search space of possible solutions.This paper also explores and evaluates the design decisions that went into Minerva, using specialized micro- and macro-benchmarks. We show that Minerva can successfully handle a workload with substantial complexity (a decision-support database benchmark). Minerva created a 16-disk design in only a few minutes that achieved the same performance as a 30-disk system manually designed by human experts. Of equal importance, Minerva was able to predict the resulting system's performance before it was built.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
3COM CORP. 1996. Gigabit Ethernet comes of age. 3Com Corporation. Technology white paper.
|
| |
2
|
ANSI. 1996. Fibre Channel Arbitrated Loop. ANSI. Standard X3.272-1996.
|
 |
3
|
Baruch Awerbuch , Yair Bartal , Amos Fiat, Competitive distributed file allocation, Proceedings of the twenty-fifth annual ACM symposium on Theory of computing, p.164-173, May 16-18, 1993, San Diego, California, United States
[doi> 10.1145/167088.167142]
|
| |
4
|
BOROWSKY, E., GOLDING, R., MERCHANT, A., SCHREIER, L., SHRIVER, E., SPASOJEVIC, M., AND WILKES,J. 1997. Using attribute-managed storage to achieve QoS. In Proceedings of the 5th Intl. Workshop on Quality of Service (June 1997).
|
| |
5
|
|
| |
6
|
COFFMAN, E., GAREY, M., AND JOHNSON, D. 1984. Approximation algorithms for bin-packing: An updated survey. In G. AUSIELLO,M.LUCERTINI, AND P. SERAFINI Eds., Algorithm Design for Computer System Design, pp. 49-106. Springer-Verlag.
|
 |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
HEWLETT-PACKARD COMPANY. 1998. Model 30/FC High Availability Disk Array-User's Guide. Hewlett-Packard Company. Pub. No. A3661-90001.
|
| |
12
|
|
| |
13
|
MENON,J.AND MATTSON, D. 1992. Performance of disk arrays in transaction processing environments. In Proceedings of the 12th ICDCS (June 1992), pp. 302-309.
|
| |
14
|
MERCHANT,A.AND ALVAREZ, G. A. 2001. Disk array models in Minerva. Technical Report HPL-2001- 118 (April), Hewlett-Packard Laboratories. http://www.hpl.hp.com/techreports.
|
| |
15
|
|
 |
16
|
David A. Patterson , Garth Gibson , Randy H. Katz, A case for redundant arrays of inexpensive disks (RAID), Proceedings of the 1988 ACM SIGMOD international conference on Management of data, p.109-116, June 01-03, 1988, Chicago, Illinois, United States
|
| |
17
|
PATTIPATI,K.R.AND WOLF, J. L. 1990. A file assignment problem model for extended local area network environments. In Proceedings of the 10th ICDCS (May 1990), pp. 554-61.
|
| |
18
|
RUEMMLER,C.AND WILKES, J. 1993. Unix disk access patterns. In Proceedings of the Winter USENIX (Jan. 1993), pp. 405-420.
|
| |
19
|
|
| |
20
|
TOYODA, Y. 1975. A simplified algorithm for obtaining approximate solutions to zero-one programming problems. Management Science 21, 12 (Aug.), 1417-27.
|
| |
21
|
TRANSACTION PROCESSING PERFORMANCE COUNCIL. 1992. TPC benchmark C, standard specification, revision 1.0. Transaction Processing Performance Council.
|
| |
22
|
TRANSACTION PROCESSING PERFORMANCE COUNCIL. 1996. TPC benchmark D, standard specification, revision 1.2. Transaction Processing Performance Council.
|
| |
23
|
TRICK, M. 1992. A linear relaxation heuristic for the generalized assignment problem. Naval Research Logistics 39, 137-51.
|
 |
24
|
|
CITED BY 37
|
|
Sandeep Uttamchandani , Kaladhar Voruganti , Sudarshan Srinivasan , John Palmer , David Pease, Polus: Growing Storage QoS Management Beyond a "4-Year Old Kid", Proceedings of the 3rd USENIX Conference on File and Storage Technologies, March 31-31, 2004, San Francisco, CA
|
|
|
|
|
|
|
|
|
Kimberley Keeton , Cipriano Santos , Dirk Beyer , Jeffrey Chase , John Wilkes, Designing for Disasters, Proceedings of the 3rd USENIX Conference on File and Storage Technologies, March 31-31, 2004, San Francisco, CA
|
|
|
|
|
|
|
|
|
|
|
|
John D. Strunk , Eno Thereska , Christos Faloutsos , Gregory R. Ganger, Using utility to provision storage systems, Proceedings of the 6th USENIX Conference on File and Storage Technologies, p.1-16, February 26-29, 2008, San Jose, California
|
|
|
Kimberly Keeton , Terence Kelly , Arif Merchant , Cipriano Santos , Janet Wiener , Xiaoyun Zhu , Dirk Beyer, Don't settle for less than the best: use optimization to make decisions, Proceedings of the 11th USENIX workshop on Hot topics in operating systems, p.1-6, May 07-09, 2007, San Diego, CA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Christopher Stewart , Ming Zhong , Kai Shen , Thomas O'Neill, Comprehensive depiction of configuration-dependent performance anomalies in distributed server systems, Proceedings of the 2nd conference on Hot Topics in System Dependability, p.1-1, November 08, 2006, Seattle, WA
|
|
|
|
|
|
Ira Cohen , Moises Goldszmidt , Terence Kelly , Julie Symons , Jeffrey S. Chase, Correlating instrumentation data to system states: a building block for automated diagnosis and control, Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation, p.16-16, December 06-08, 2004, San Francisco, CA
|
|
|
|
|
|
Sandeep Uttamchandani , Li Yin , Guillermo A. Alvarez , John Palmer , Gul Agha, CHAMELEON: a self-evolving, fully-adaptive resource arbitrator for storage systems, Proceedings of the USENIX Annual Technical Conference 2005 on USENIX Annual Technical Conference, p.6-6, April 10-15, 2005, Anaheim, CA
|
|
|
|
|
|
Nava Aizikowitz , Alex Glikson , Ariel Landau , Bilha Mendelson , Tommy Sandbank, Component-based performance modeling of a storage area network, Proceedings of the 37th conference on Winter simulation, December 04-07, 2005, Orlando, Florida
|
|
|
|
|
|
|
|
|
|
|
|
Eric Anderson , Michael Hobbs , Kimberly Keeton , Susan Spence , Mustafa Uysal , Alistair Veitch, Hippodrome: Running Circles Around Storage Administration, Proceedings of the 1st USENIX Conference on File and Storage Technologies, January 28-30, 2002, Monterey, CA
|
|
|
Eric Anderson , Ram Swaminathan , Alistair Veitch , Guillermo A. Alvarez , John Wilkes, Selecting RAID Levels for Disk Arrays, Proceedings of the 1st USENIX Conference on File and Storage Technologies, January 28-30, 2002, Monterey, CA
|
|
|
|
|
|
|
|
|
|
|
|
Kirsten Hildrum , Fred Douglis , Joel L. Wolf , Philip S. Yu , Lisa Fleischer , Akshay Katta, Storage optimization for large-scale distributed stream-processing systems, ACM Transactions on Storage (TOS), v.3 n.4, p.1-28, February 2008
|
|
|
S. Gopisetty , S. Agarwala , E. Butler , D. Jadav , S. Jaquet , M. Korupolu , R. Routray , P. Sarkar , A. Singh , M. Sivan-Zimet , C.-H. Tan , S. Uttamchandani , D. Merbach , S. Padbidri , A. Dieberger , E. M. Haber , E. Kandogan , C. A. Kieliszewski , D. Agrawal , M. Devarakonda , K.-W. Lee , K. Magoutis , D. C. Verma , N. G. Vogl, Evolution of storage management: transforming raw data into information, IBM Journal of Research and Development, v.52 n.4, p.341-352, July 2008
|
|
|
S. Gopisetty , E. Butler , S. Jaquet , M. Korupolu , T. K. Nayak , R. Routray , M. Seaman , A. Singh , C.-H. Tan , S. Uttamchandani , A. Verma, Automated planners for storage provisioning and disaster recovery, IBM Journal of Research and Development, v.52 n.4, p.353-365, July 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|