| A scalable, commodity data center network architecture |
| Full text |
Pdf
(986 KB)
|
Source
|
Applications, Technologies, Architectures, and Protocols for Computer Communication
archive
Proceedings of the ACM SIGCOMM 2008 conference on Data communication
table of contents
Seattle, WA, USA
SESSION: Data Center networking
table of contents
Pages 63-74
Year of Publication: 2008
ISBN:978-1-60558-175-0
Also published in ...
|
|
Authors
|
|
Mohammad Al-Fares
|
University of California, San Diego, La Jolla, CA, USA
|
|
Alexander Loukissas
|
University of California, San Diego, La Jolla, CA, USA
|
|
Amin Vahdat
|
University of California, San Diego, La Jolla, CA, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 82, Downloads (12 Months): 457, Citation Count: 7
|
|
|
ABSTRACT
Today's data centers may contain tens of thousands of computers with significant aggregate bandwidth requirements. The network architecture typically consists of a tree of routing and switching elements with progressively more specialized and expensive equipment moving up the network hierarchy. Unfortunately, even when deploying the highest-end IP switches/routers, resulting topologies may only support 50% of the aggregate bandwidth available at the edge of the network, while still incurring tremendous cost. Non-uniform bandwidth among data center nodes complicates application design and limits overall system performance. In this paper, we show how to leverage largely commodity Ethernet switches to support the full aggregate bandwidth of clusters consisting of tens of thousands of elements. Similar to how clusters of commodity computers have largely replaced more specialized SMPs and MPPs, we argue that appropriately architected and interconnected commodity switches may deliver more performance at less cost than available from today's higher-end solutions. Our approach requires no modifications to the end host network interface, operating system, or applications; critically, it is fully backward compatible with Ethernet, IP, and TCP.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Cisco Data Center Infrastructure 2.5 Design Guide. http://www.cisco.com/univercd/cc/td/doc/solution/dcidg21.pdf.
|
| |
2
|
InfiniBand Architecture Specification Volume 1, Release 1.0. http://www.infinibandta.org/specs.
|
| |
3
|
Juniper J-Flow. http://www.juniper.net/techpubs/software/erx/junose61/swconfig-routing-vol1/html/ip-jflow-stats-config2.html.
|
| |
4
|
Sun Datacenter Switch 3456 Architecture White Paper. http://www.sun.com/products/networking/datacenter/ds3456/ds3456_wp.pdf.
|
| |
5
|
M. Blumrich, D. Chen, P. Coteus, A. Gara, M. Giampapa, P. Heidelberger, S. Singh, B. Steinmacher-Burow, T. Takken, and P. Vranas. Design and Analysis of the BlueGene/L Torus Interconnection Network. IBM Research Report RC23025 (W0312--022), 3, 2003.
|
| |
6
|
Nanette J. Boden , Danny Cohen , Robert E. Felderman , Alan E. Kulawik , Charles L. Seitz , Jakov N. Seizovic , Wen-King Su, Myrinet: A Gigabit-per-Second Local Area Network, IEEE Micro, v.15 n.1, p.29-36, February 1995
[doi> 10.1109/40.342015]
|
| |
7
|
|
 |
8
|
|
| |
9
|
|
| |
10
|
B. Claise. Cisco Systems NetFlow Services Export Version 9. RFC 3954, Internet Engineering Task Force, 2004.
|
| |
11
|
C. Clos. A Study of Non-blocking Switching Networks. Bell System Technical Journal, 32(2), 1953.
|
| |
12
|
|
 |
13
|
Giuseppe DeCandia , Deniz Hastorun , Madan Jampani , Gunavardhan Kakulapati , Avinash Lakshman , Alex Pilchin , Swaminathan Sivasubramanian , Peter Vosshall , Werner Vogels, Dynamo: amazon's highly available key-value store, Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles, October 14-17, 2007, Stevenson, Washington, USA
|
 |
14
|
|
 |
15
|
|
 |
16
|
S. Ben Fred , T. Bonald , A. Proutiere , G. Régnié , J. W. Roberts, Statistical bandwidth sharing: a study of congestion at flow level, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.111-122, August 2001, San Diego, California, United States
|
| |
17
|
|
 |
18
|
|
| |
19
|
|
| |
20
|
D. Katz, D. Ward. BFD for IPv4 and IPv6 (Single Hop) (Draft). Technical report, Internet Engineering Task Force, 2008.
|
 |
21
|
|
 |
22
|
Charles E. Leiserson , Zahi S. Abuhamdeh , David C. Douglas , Carl R. Feynman , Mahesh N. Ganmukhi , Jeffrey V. Hill , Daniel Hillis , Bradley C. Kuszmaul , Margaret A. St. Pierre , David S. Wells , Monica C. Wong , Shaw-Wen Yang , Robert Zak, The network architecture of the Connection Machine CM-5 (extended abstract), Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures, p.272-285, June 29-July 01, 1992, San Diego, California, United States
[doi> 10.1145/140901.141883]
|
| |
23
|
|
| |
24
|
John W. Lockwood , Nick McKeown , Greg Watson , Glen Gibb , Paul Hartke , Jad Naous , Ramanan Raghuraman , Jianying Luo, NetFPGA--An Open Platform for Gigabit-Rate Network Switching and Routing, Proceedings of the 2007 IEEE International Conference on Microelectronic Systems Education, p.160-161, June 03-04, 2007
[doi> 10.1109/MSE.2007.69]
|
| |
25
|
J. Moy. OSPF Version 2. RFC 2328, Internet Engineering Task Force, 1998.
|
| |
26
|
|
| |
27
|
|
| |
28
|
SGI Developer Central Open Source Linux XFS. XFS: A High-performance Journaling Filesystem. http://oss.sgi.com/projects/xfs/.
|
 |
29
|
|
| |
30
|
|
| |
31
|
|
| |
32
|
J. Vetter, S. Alam, J. Dunigan, T.H., M. Fahey, P. Roth, and P. Worley. Early Evaluation of the Cray XT3. In IEEE International Parallel and Distributed Processing Symposium, 2006.
|
| |
33
|
M. Woodacre, D. Robb, D. Roe, and K. Feind. The SGI Altix 3000 Global Shared-Memory Architecture. SGI White Paper, 2003.
|
CITED BY 7
|
|
|
|
|
Michael A. Kozuch , Michael P. Ryan , Richard Gass , Steven W. Schlosser , David O'Hallaron , James Cipar , Elie Krevat , Julio López , Michael Stroucken , Gregory R. Ganger, Tashi: location-aware cluster management, Proceedings of the 1st workshop on Automated control for datacenters and clouds, June 19-19, 2009, Barcelona, Spain
|
|
|
|
|
|
Radhika Niranjan Mysore , Andreas Pamboris , Nathan Farrington , Nelson Huang , Pardis Miri , Sivasankar Radhakrishnan , Vikram Subramanya , Amin Vahdat, PortLand: a scalable fault-tolerant layer 2 data center network fabric, ACM SIGCOMM Computer Communication Review, v.39 n.4, October 2009
|
|
|
|
|
|
Albert Greenberg , James R. Hamilton , Navendu Jain , Srikanth Kandula , Changhoon Kim , Parantap Lahiri , David A. Maltz , Parveen Patel , Sudipta Sengupta, VL2: a scalable and flexible data center network, ACM SIGCOMM Computer Communication Review, v.39 n.4, October 2009
|
|
|
Chuanxiong Guo , Guohan Lu , Dan Li , Haitao Wu , Xuan Zhang , Yunfeng Shi , Chen Tian , Yongguang Zhang , Songwu Lu, BCube: a high performance, server-centric network architecture for modular data centers, ACM SIGCOMM Computer Communication Review, v.39 n.4, October 2009
|
|