ACM Home Page
Please provide us with feedback. Feedback
Reliability estimation for large distributed software systems
Full text PdfPdf (123 KB)
Source IBM Centre for Advanced Studies Conference archive
Proceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds table of contents
Ontario, Canada
SESSION: Systems I table of contents
Article No. 12  
Year of Publication: 2008
Authors
Alberto Avritzer  Siemens Corporate Research
Flávio P. Duarte  Siemens Corporate Research
Rosa Maria Meri Leão  Universidade Federal do Rio de Janeiro
Edmundo de Souza e Silva  Universidade Federal do Rio de Janeiro
Michal Cohen  Siemens Transportation Systems
David Costello  Siemens Transportation Systems
Sponsors
: IBM Toronto Software Lab
: IBM Centers for Advanced Studies (CAS)
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 91,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1463788.1463804
What is a DOI?

ABSTRACT

In this paper, we present our experience to estimate the reliability of a large distributed system composed of several hundred points of presence. The system's reliability metric is required by contract to be obtained. A simple approach is presented to accurately approximate the desired metrics.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
 
4
 
5
M. O. Ball. Computational Complexity of Network Reliability Analysis: An Overview. IEEE Trans. on Reliability, Vol. R-35, NO.3, August, 1986, pp. 230--238.
 
6
 
7
I. James and J. Marshal, M. Evans, B. Newman. Reliability Metrics and the REMM Model. IEEE RAMS 2004, pages 474--479.
 
8
J. Jones and J. Hayes. Estimation of System Reliability Using a 'Non-Constant Failure Rate' Model. IEEE Transactions on Reliability, Vol. 50, No. 3, Sept 2001.
 
9
 
10
 
11
 
12
J. H. Wensley. Fault-tolerant computers ensure reliable industrial controls. August Systems Incorporated, June 1981, OR 97302.
13
14
 
15
E. de Souza e Silva and H. R. Gail. Transient Solutions for Markov Chains. Computational Probability, W. Grassmann, editor, Kluwer Academic Publishers, 2000, pp. 43--81.

Collaborative Colleagues:
Alberto Avritzer: colleagues
Flávio P. Duarte: colleagues
Rosa Maria Meri Leão: colleagues
Edmundo de Souza e Silva: colleagues
Michal Cohen: colleagues
David Costello: colleagues