ACM Home Page
Please provide us with feedback. Feedback
Scalability of network-failure resilience: analysis using multi-layer probabilistic graphical models
Full text PdfPdf (654 KB)
Source IEEE/ACM Transactions on Networking (TON) archive
Volume 17 ,  Issue 1  (February 2009) table of contents
Pages 319-331  
Year of Publication: 2009
ISSN:1063-6692
Authors
Guanglei Liu  Department of Computer Science, Roane State Community College, Harriman, TN and School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA
Chuanyi Ji  School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA
Publisher
IEEE Press  Piscataway, NJ, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 147,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: 10.1109/TNET.2008.925944

ABSTRACT

In this work, we quantify scalability of network re-silience upon failures. We characterize resilience as the percentage of lost traffic upon failures and define scalability as the growth rate of the percentage of lost traffic with respect to network size, link failure probability, and network traffic for given failure protection schemes. We apply probabilistic graphical models to characterize statistical dependence between physical-layer failures and the net-work-layer traffic, and analyze the scalability for large networks of different topologies.

We first focus on the scalability of resilience for regular topolo-gies under uniform deterministic traffic with independent and de-pendent link failures, with and without protection. For large net-works with small probabilities of failures and without protection, we show that the scalability of network resilience grows linearly with the average route length and with the "effective" link failure probability. For large networks with 1 + 1 protection, we obtain lower and upper bound of the percentage of lost traffic. We de-rive approximations of the scalability for arbitrary topologies, and attain close-form analytical results for ring, star, and mesh-torus topologies. We then study network resilience under random traffic with Poisson arrivals. We find that when the network is under light load, the network resilience is reduced to that under uniform de-terministic traffic. When the network load is under heavy load, the percentage of lost traffic approaches the marginal probability of link failure. Our scalability analysis shows explicitly how network resilience varies with different factors and provides insights for re-silient network design.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Report of the National Science Foundation Workshop on Fundamental Research in Networking, Arlington, VA, Apr. 24-25, 2003.
 
2
Y. Liu and K. S. Trivedi, "A general framework for network surviv-ability quantification," presented at the 12th GI/ITG Conf. Measuring, Modeling, and Evaluation of Computer and Communication Systems, Dresden, Germany, Sep. 2004.
 
3
G. Weichenberg, V. Chan, and M. Médard, "High-reliability topo-logical architectures for networks under stress," IEEE J. Sel. Areas Commun., vol. 22, no. 9, pp. 1830-1845, Nov. 2004.
 
4
F. Harary, Graph Theory. Reading, MA: Addison Wesley, Jan. 1995, new education edition.
 
5
J. J. Shi and J. P. Fonseka, "Analysis and design of survivable telecom-munications networks," IEE Proc. Commun., vol. 144, no. 5, pp. 322-330, Oct. 1997.
 
6
T. P. Ng, "K-terminal reliability of hierarchical networks," IEEE Trans. Reliabil., vol. 40, no. 2, pp. 218-225, Jun. 1991.
 
7
J. Zhang and B. Mukherjee, "A review of fault management in WDM mesh networks: Basic concepts and research challenges," IEEE Net-work, vol. 18, no. 2, pp. 41-48, Mar.-Apr. 2004.
 
8
F. Kelly, "Loss networks," Ann. Appl. Probabil., vol. 1, no. 3, pp. 319-378, 1991.
 
9
A. Zolfaghari and F. J. Kaudel, "Framework for network survivability performance," IEEE J. Sel. Areas Commun., vol. 12, no. 1, pp. 46-51, Jan. 1994.
 
10
S. C. Liew and K. W. Lu, "A framework for characterizing disaster-based network survivability," IEEE J. Sel. Areas Commun., vol. 12, no. 1, pp. 52-58, Jan. 1994.
 
11
V. Tamilraj and S. Subramaniam, "A comparison of optical network topologies," in Proc. Allerton Conf. Communications, Control, and Computing, Oct. 2003, pp. 1337-1346.
 
12
 
13
 
14
W. Feller, An Introduction to Probability Theory and Its Applications, 3rd ed. New York: Wiley, 1968.
 
15
J. D. Spragins, "Dependent failures in data communication systems," IEEE Trans. Commun., vol. 25, no. 12, pp. 1494-1499, Dec. 1977.
 
16
K. V. Le and V. O. Li, "Modeling and analysis of systems with multi-mode components and dependent failures," IEEE Trans. Reliabil., vol. 38, no. 1, pp. 68-75, Apr. 1989.
 
17
D. Papadimitriou, F. Poppe, J. Jones, S. Venkatachalam, S. Dha-ranikota, R. Jain, R. Hartani, and D. Griffith, "Inference of shared risk link groups," in Optical Internetworking Forum (OIF), Fremont, CA, 2001, contribution oif2001-066.
18
 
19
 
20
P. Erdos and A. Renyi, "On the evolution of random graphs," Publ. Math. Inst. Hungarian Acad. Sci., vol. 5, pp. 17-61, 1960.
 
21
G. Liu and C. Ji, "Resilience of all-optical network architectures under in-band crosstalk attacks: A probabilistic graphical model approach," IEEE J. Sel. Areas Commun., vol. 25, no. 4, pp. 2-17, Apr. 2007.
 
22
23
 
24
S. A. Berezner, A. E. Krzesinski, and P. G. Taylor, "On the inverse of Erlang's function," J. Appl. Probabil., vol. 35, no. 1, pp. 246-252, Mar. 1998.


Collaborative Colleagues:
Guanglei Liu: colleagues
Chuanyi Ji: colleagues