ACM Home Page
Please provide us with feedback. Feedback
High performance wide-area overlay using deadlock-free routing
Full text PdfPdf (692 KB)
Source
High Performance Distributed Computing archive
Proceedings of the 18th ACM international symposium on High performance distributed computing table of contents
Garching, Germany
SESSION: Grid middleware and distributed algorithms table of contents
Pages 81-90  
Year of Publication: 2009
ISBN:978-1-60558-587-1
Authors
Ken Hironaka  The University of Tokyo, Tokyo, Japan
Hideo Saito  The University of Tokyo, Tokyo, Japan
Kenjiro Taura  The University of Tokyo, Tokyo, Japan
Sponsors
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 52,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1551609.1551628
What is a DOI?

ABSTRACT

Overlay networks as the communication medium in parallel and distributed applications have gained prominence, especially in Grid environments. However, providing both throughput performance and reliable communication on overlays have been given little attention. The core of this problem is that intermediate nodes have limited buffer memory, while the forwarding throughput must yield Gbps. Yet, implementing a naive flow control can deadlock the overlay. Thus, high performance flow control on overlays is a critical concern in heterogeneous wide-area networks, where input/output link throughput can vary significantly. We propose an overlay scheme that couples TCP connections and fixed intermediate buffer memory while adapting deadlock-free routing for our overlay routing in heterogeneous wide-area networks. Our scheme eliminates memory overflows at forwarding nodes by fixed buffer memory and deadlocks via a deadlock-free routing algorithm that resolves adaptation challenges for heterogeneous wide-area networks. Our overlay construction and routing optimizations account for underlying network latency and bandwidth information. Simulation on 13 clusters (515 nodes) and evaluation on 7 clusters (170 nodes) show that our deadlock-free routing poses negligible overhead in comparison to deadlock-unaware routing, and comparably with direct communication. We further demonstrate that for certain collective communications, our overlay even out-performs direct communication by mitigating or completely avoiding network contention. We show this on systems ranging from a single-switch cluster with 36 nodes to a Grid environment with 4 clusters and 291 nodes.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
DAS-3. http://www.cs.vu.nl/das3.
 
2
Grid5000. https://www.grid5000.org.
 
3
InTrigger. http://www.intrigger.jp.
 
4
Planetlab. http://www.planet-lab.org.
 
5
Y. Amir, B. Awerbuch, C. Danilov, and J. Stanton. Global Flow Control for Wide Area Overlay Networks: a Cost-Benefit Approach. In OPENARCH 2002: Proceedings of the 5th IEEE Conference on Open Architectures and Network Programming, pages 155--166. IEEE Computer Society, 2002.
 
6
Y. Amir and C. Danilov. Spines. http://www.spines.org.
 
7
Y. Amir and C. Danilov. Reliable Communication in Overlay Networks. In DSN 2003: Proceedings of the 33rd Annual IEEE International Conference on Dependable Systems and Networks, pages 511--520. IEEE Computer Society, June 2003.
8
 
9
 
10
 
11
 
12
 
13
 
14
 
15
K. Kar, S. Sarkar, and L. Tassiulas. A Simple Rate Control Algorithm for Maximizing Total User Utility. INFOCOM 2001: Proceedings of the 20th Annual Joint Conference of the IEEE Computer and Communications Societies, 1:133--141, 2001.
16
 
17
 
18
G. Kola and M. Livny. DiskRouter: A Flexible Infrastructure for High Performance Large Scale Data Transfers. Technical Report Technical Report CS-TR-2003-1484, University of Wisconsin, 2003.
 
19
 
20
G.-I. Kwon and J. W. Byers. ROMA: Reliable Overlay Multicast with Loosely Coupled TCP Connections. In INFOCOM 2004: Proceedings of the 23rd Conference of the IEEE Communications Society, pages 385--395, March 2004.
 
21
S.-J. Lee, S. Banerjee, P. Sharma, P. Yalagandula, and S. Basu. Bandwidth-Aware Routing in Overlay Networks. In INFOCOM 2008. Proceedings of the 27th IEEE International Conference on Computer Communications, pages 1732--1740, April 2008.
22
 
23
S. Naganuma, K. Takahashi, H. Saito, T. Shibata, K. Taura, and T. Chikayama. Improving Effciency of Network Bandwdith Estimation Using Topology Information (in Japanese). In SACSIS '08: Proceedings of the Annual IPSJ Synposium on Advanced Computing Systems and Infrastructures, pages 359--366. IPSJ, 2008.
24
25
 
26
 
27
 
28
 
29
M. D. Schroeder, A. D. Birrell, M. Burrows, H. Murray, R. M. Needham, T. L. Rodeheffer, E. H. Satterthwaite, and C. P. Thacker. Autonet: a High-Speed, Self-Configuring Local Area Network Using Point-to-Point Links. IEEE Journal on Selected Areas in Communications, 9:1318 -- 1335, 1991.
30
 
31
G. Urvoy Keller and E. W. Biersack. A Congestion Control Model for Multicast Overlay Networks and its Performance. In NGC 2002: Proceedings of the 4th International Workshop on Network Group Communication, October 2002.
 
32
L. Xu, K. Harfoush, and I. Rhee. Binary Increase Congestion Control (BIC) for Fast Long-Distance Networks. INFOCOM 2004: Proceedings of the 23rd Conference of the IEEE Communications Society, 4:2514--2524, March 2004.
 
33
A. Young, J. Chen, Z. Ma, A. Krishnamurthy, L. Peterson, and R. Y. Wang. Overlay Mesh Construction Using Interleaved Spanning Trees. In INFOCOM 2004: Proceedings of the 23th Conference of the IEEE Communications Society, March 2004.

Collaborative Colleagues:
Ken Hironaka: colleagues
Hideo Saito: colleagues
Kenjiro Taura: colleagues