|
ABSTRACT
Adapting to the network is the key to achieving high performance for communication-intensive applications, including scientific computing,data intensive computing, and multicast, especially in Grid environments. This paper investigates an approach of representing network as a tree of participating hosts and switches matching or approximating their physical topology, and describes a fast, non-intrusive, and portable algorithm for inferring such a topology. This representation and the proposed inference algorithm serves as a key to building network-aware applications in a portable manner. The algorithm is based solely on RTTs of small packets between end hosts; it does not rely on popular but not universally available protocols such as trace route and SNMP. Another benefit is that it can handle all layers of network uniformly without any a priori knowledge of cluster configurations. The required number of measurements is O(Nd) in certain idealizing assumptions made for the purpose of analysis, where N is the number of participating processes and d the diameter of the network, which is usually small in real networks. In our experimental environment, the inference algorithm built a topology of 64 hosts in a single cluster in 4 seconds and and that of 256 hosts across 4 clusters in 15 seconds. It is able to not only identify clusters within a Grid, but also to partially identify the Layer 2 topology within a cluster. This is important for optimizing bandwidth-limited operations such as broadcast. We built several network-aware applications upon the inference system, including efficient bandwidth measurements and long message broadcasts. The topology is used to schedule as many measurements as possible in parallel without competing on shared links. We were able to build a bandwidth map of 256 hosts across 4 clusters in 27 seconds.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Yuri Breitbart , Minos Garofalakis , Ben Jai , Cliff Martin , Rajeev Rastogi , Avi Silberschatz, Topology discovery in heterogeneous IP networks: the NetInventory system, IEEE/ACM Transactions on Networking (TON), v.12 n.3, p.401-414, June 2004
[doi> 10.1109/TNET.2004.828963]
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
| |
7
|
|
| |
8
|
|
 |
9
|
|
| |
10
|
N. Duffield, J. Horowitz, F. L. Presti, and D. Towsley. Multicast topology inference from measured end-to-end loss. IEEE Transactions in Information Theory, 48: 26--45, 2002.
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
 |
14
|
Thilo Kielmann , Rutger F. H. Hofman , Henri E. Bal , Aske Plaat , Raoul A. F. Bhoedjang, MagPIe: MPI's collective communication operations for clustered wide area systems, Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming, p.131-140, May 04-06, 1999, Atlanta, Georgia, United States
|
 |
15
|
Bruce Lowekamp , David O'Hallaron , Thomas Gross, Topology discovery for large ethernet networks, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.237-248, August 2001, San Diego, California, United States
|
| |
16
|
Bruce B. Lowekamp , Brian Tierney , Les Cottrell , Richard Hughes-Jones , Thilo Kielmann , Martin Swany, Enabling Network Measurement Portability Through a Hierarchy of Characteristics, Proceedings of the Fourth International Workshop on Grid Computing, p.68, November 17-17, 2003
|
| |
17
|
G. Shao, F. Berman, and R. Wolski. Using Effective Network Views to Promote Distributed Application Performance. In Proceedings of the 1999 International Conference on Parallel and Distributed Processing Techniques and Applications, pages 2649--2656, 1999.
|
| |
18
|
Skitter. http://www.caida.org/tools/measurement/skitter.
|
| |
19
|
|
|