ACM Home Page
Please provide us with feedback. Feedback
An empirical study of operating systems errors
Full text PdfPdf (1.45 MB)
Source ACM Symposium on Operating Systems Principles archive
Proceedings of the eighteenth ACM symposium on Operating systems principles table of contents
Banff, Alberta, Canada
SESSION: Deconstructing the OS table of contents
Pages: 73 - 88  
Year of Publication: 2001
ISBN:1-58113-389-8
Also published in ...
Authors
Andy Chou  Stanford University, Stanford, CA
Junfeng Yang  Stanford University, Stanford, CA
Benjamin Chelf  Stanford University, Stanford, CA
Seth Hallem  Stanford University, Stanford, CA
Dawson Engler  Stanford University, Stanford, CA
Sponsor
SIGOPS: ACM Special Interest Group on Operating Systems
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 50,   Downloads (12 Months): 268,   Citation Count: 78
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/502034.502042
What is a DOI?

ABSTRACT

We present a study of operating system errors found by automatic, static, compiler analysis applied to the Linux and OpenBSD kernels. Our approach differs from previous studies that consider errors found by manual inspection of logs, testing, and surveys because static analysis is applied uniformly to the entire kernel source, though our approach necessarily considers a less comprehensive variety of errors than previous studies. In addition, automation allows us to track errors over multiple versions of the kernel source to estimate how long errors remain in the system before they are fixed.We found that device drivers have error rates up to three to seven times higher than the rest of the kernel. We found that the largest quartile of functions have error rates two to six times higher than the smallest quartile. We found that the newest quartile of files have error rates up to twice that of the oldest quartile, which provides evidence that code "hardens" over time. Finally, we found that bugs remain in the Linux kernel an average of 1.8 years before being fixed.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
 
4
M. Bishop and M. Dilger. Checking for Race Conditions in File Accesses. Computing systems, pages 131-152, Spring 1996.
5
 
6
R. Chillarege and N. Bowen. Understanding Large System Failures - A Fault Injection Experiment. In The lgth International Symposium on Fault Tolerant Computing, June 1989.
 
7
D.R. Cox and D. Oakes. Analysis of Survival Data. Chapman and Hall, London, UK, 1984.
 
8
D.R. Engler, B. Chelf, A. Chou, and S. Hallem. Checking System Rules Using System-Specific, Programmer-Written Compiler Extensions. In Proceedings of Operating Systems Design and Implementation (OSDI), September 2000.
 
9
 
10
J. Gray. A Census of Tandem System Availability Between 1985 and 1990. IEEE Transactions on Software Engineering, 39(4), October 1990.
 
11
Intrinsa. A Technical Introduction to PREfix/Enterprise. Technical report, Intrinsa Corporation, 1998.
 
12
N.L. Johnson and S. Kotz. Discrete Distributions. John Wiley & Sons, New York, NY, 1969.
 
13
 
14
 
15
I. Lee, R. Iyer, and F. Symptoms. Faults, Symptoms, and Software Fault Tolerance in the Tandem GUARDIAN Operating System. In International Symposium on Fault- Tolerant Computing (FTCS), 1993.
16
17
 
18
B.P. Miller, D. Koski, C.P. Lee, V. Maganty, R. Murthy, A. Natarajan, and J. Steidl. Fuzz Revisited: A Reexamination of the Reliability of UNIX Utilities and Services. Technical Report CS-TR-1995-1268, University of Wisconsin, 1995.
 
19
20
21
 
22
S.D. Silvey. Statistical Inference. Chapman and Hall, London, UK, 1975.
 
23
M. Sullivan and R. Chillarege. Software Defects and Their Impact on System 118 Availability - A Study of Field Failures in Operating Systems. In Plst International Symposium on Fault Tolerant Computing, June 1991.
 
24
M. Sullivan and R. Chillarege. A Comparison of Software Defects in Database Management Systems and Operating Systems. In 22nd International Symposium on Fault- Tolerant Computing, July 1992.
 
25
D. Wagner, J. Foster, E. Brewer, and A. Aiken. A First Step Towards Automated Detection of Buffer Overrun Vulnerabilities. In The 2000 Network and Distributed Systems Security Conference. San Diego, CA, February 2000.
 
26

CITED BY  81

Collaborative Colleagues:
Andy Chou: colleagues
Junfeng Yang: colleagues
Benjamin Chelf: colleagues
Seth Hallem: colleagues
Dawson Engler: colleagues