| Is it a bug or an enhancement?: a text-based approach to classify change requests |
| Full text |
Pdf
(280 KB)
|
| Source
|
IBM Centre for Advanced Studies Conference
archive
Proceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds
table of contents
Ontario, Canada
SESSION: Software engineering III
table of contents
Article No. 23
Year of Publication: 2008
|
|
Authors
|
|
Giuliano Antoniol
|
SOCCER Lab. -- DGIGL, Québec, Canada
|
|
Kamel Ayari
|
SOCCER Lab. -- DGIGL, Québec, Canada
|
|
Massimiliano Di Penta
|
University of Sannio, Benevento, Italy
|
|
Foutse Khomh
|
Université de Montréal, Québec, Canada
|
|
Yann-Gaël Guéhéneuc
|
Université de Montréal, Québec, Canada
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 17, Downloads (12 Months): 154, Citation Count: 0
|
|
|
ABSTRACT
Bug tracking systems are valuable assets for managing maintenance activities. They are widely used in open-source projects as well as in the software industry. They collect many different kinds of issues: requests for defect fixing, enhancements, refactoring/restructuring activities and organizational issues. These different kinds of issues are simply labeled as "bug" for lack of a better classification support or of knowledge about the possible kinds. This paper investigates whether the text of the issues posted in bug tracking systems is enough to classify them into corrective maintenance and other kinds of activities. We show that alternating decision trees, naive Bayes classifiers, and logistic regression can be used to accurately distinguish bugs from other kinds of issues. Results from empirical studies performed on issues for Mozilla, Eclipse, and JBoss indicate that issues can be classified with between 77% and 82% of correct decisions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
K. Ayari , P. Meshkinfam , G. Antoniol , M. Di Penta, Threats on building models from CVS and Bugzilla repositories: the Mozilla case study, Proceedings of the 2007 conference of the center for advanced studies on Collaborative research, October 22-25, 2007, Richmond Hill, Ontario, Canada
[doi> 10.1145/1321211.1321234]
|
| |
3
|
V. Basili, G. Caldiera, and D. H. Rombach. The Goal Question Metric Paradigm Encyclopedia of Software Engineering. John Wiley and Sons, 1994.
|
| |
4
|
|
| |
5
|
S. E. Robertson C. J. van Rijsbergen and M. F. Porter. New models in probabilistic information retrieval. London: British Library, Research and Development Report, no. 5587, 1980.
|
| |
6
|
Rumelhart D. E., Hinton G. E., and Williams R. J. Learning representations by back-propagating errors. Nature, 323:533--536, 1986.
|
| |
7
|
|
| |
8
|
|
| |
9
|
W. B. Frakes and R. Baeza-Yates. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, Englewood Cliffs, NJ, 1992.
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
J. O. Rawlings, S. G. Pandula, and D. A. Dickey. Applied Regression Analysis a Research Tool. Springer Texts in Statistics. New York: Springer-Verlag, second edition edition, 1998.
|
 |
17
|
|
| |
18
|
M. Stone. Cross-validatory choice and assesment of statistical predictions (with discussion). Journal of the Royal Statistical Society B, 36:111--147, 1974.
|
| |
19
|
|
 |
20
|
Xiaoyin Wang , Lu Zhang , Tao Xie , John Anvik , Jiasu Sun, An approach to detecting duplicate bug reports using natural language and execution information, Proceedings of the 30th international conference on Software engineering, May 10-18, 2008, Leipzig, Germany
[doi> 10.1145/1368088.1368151]
|
 |
21
|
|
| |
22
|
|
| |
23
|
R. K. Yin. Case Study Research: Design and Methods - Third Edition. SAGE Publications, London, 2002.
|
| |
24
|
|
| |
25
|
|
|