ABSTRACT
The empirical assessment of test techniques plays an important role in software testing research. One common practice is to seed faults, either manually or by using mutation operators. The latter allows the systematic, repeatable seeding of large numbers of faults; however, we do not know whether empirical results obtained this way lead to valid, representative conclusions. This paper investigates this question using a number of programs with comprehensive pools of test cases and known faults. We conclude that, based on the data available thus far, the use of mutation operators yields trustworthy results: generated mutants are similar to real faults. Mutants do, however, appear to differ from hand-seeded faults, which seem to be harder to detect than real faults.
CITED BY 38
Sara Sprenkle, Emily Gibson, Sreedevi Sampath, Lori Pollock, Automated replay and failure detection for web applications, Proceedings of the 20th IEEE/ACM international Conference on Automated software engineering, November 07-11, 2005, Long Beach, CA, USA
Kristen R. Walcott, Mary Lou Soffa, Gregory M. Kapfhammer, Robert S. Roos, TimeAware test suite prioritization, Proceedings of the 2006 international symposium on Software testing and analysis, July 17-20, 2006, Portland, Maine, USA
Shaukat Ali, Lionel C. Briand, Muhammad Jaffar-ur Rehman, Hajra Asghar, Muhammad Zohaib Z. Iqbal, Aamer Nadeem, A state-based approach to integration testing based on UML models, Information and Software Technology, v.49 n.11-12, p.1087-1106, November, 2007
Lu Zhang, Shan-Shan Hou, Chao Guo, Tao Xie, Hong Mei, Time-aware test-case prioritization using integer linear programming, Proceedings of the eighteenth international symposium on Software testing and analysis, July 19-23, 2009, Chicago, IL, USA
REVIEW
"Andrew Brooks : Reviewer"
The effectiveness of a program's test suite can be measured by how many mutated versions of the program, each containing an injected defect, the suite detects. Mutations (defects) are injected by applying simple rules, such as "negate decision" ...
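The measure described in the review can be illustrated with a minimal Python sketch. Everything in it is hypothetical and chosen only for illustration: the function under test (`is_non_negative`), the two hand-written mutants, and the helper names `is_killed` and `mutation_score` do not come from the paper. It also assumes no mutant is equivalent to the original program, which real mutation tools cannot take for granted.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical program under test (not from the paper).
def is_non_negative(x: int) -> bool:
    return x >= 0

# Hypothetical mutant produced by a "negate decision" rule.
def mutant_negated(x: int) -> bool:
    return not (x >= 0)

# Second hypothetical mutant: relational operator replaced (>= becomes >).
def mutant_boundary(x: int) -> bool:
    return x > 0

# A test case pairs an input with the output expected from the original.
TestCase = Tuple[int, bool]
Program = Callable[[int], bool]

def is_killed(mutant: Program, tests: Sequence[TestCase]) -> bool:
    """A mutant is detected ("killed") if at least one test case fails on it."""
    return any(mutant(x) != expected for x, expected in tests)

def mutation_score(mutants: Sequence[Program], tests: Sequence[TestCase]) -> float:
    """Fraction of mutants detected by the test suite."""
    killed = sum(1 for m in mutants if is_killed(m, tests))
    return killed / len(mutants)

mutants: List[Program] = [mutant_negated, mutant_boundary]

# A suite without the boundary value 0 misses the second mutant.
weak_suite: List[TestCase] = [(5, True), (-3, False)]
# Adding the boundary case detects both mutants.
strong_suite: List[TestCase] = weak_suite + [(0, True)]

print(mutation_score(mutants, weak_suite))    # 0.5
print(mutation_score(mutants, strong_suite))  # 1.0
```

In this sketch the weaker suite detects only the negated-decision mutant, while the suite that also exercises the boundary input detects both, so a higher score reflects a more thorough suite.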