ACM Home Page
Please provide us with feedback. Feedback
Is it a bug or an enhancement?: a text-based approach to classify change requests
Full text PdfPdf (280 KB)
Source IBM Centre for Advanced Studies Conference archive
Proceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds table of contents
Ontario, Canada
SESSION: Software engineering III table of contents
Article No. 23  
Year of Publication: 2008
Authors
Giuliano Antoniol  SOCCER Lab. -- DGIGL, Québec, Canada
Kamel Ayari  SOCCER Lab. -- DGIGL, Québec, Canada
Massimiliano Di Penta  University of Sannio, Benevento, Italy
Foutse Khomh  Université de Montréal, Québec, Canada
Yann-Gaël Guéhéneuc  Université de Montréal, Québec, Canada
Sponsors
: IBM Toronto Software Lab
: IBM Centers for Advanced Studies (CAS)
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 154,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1463788.1463819
What is a DOI?

ABSTRACT

Bug tracking systems are valuable assets for managing maintenance activities. They are widely used in open-source projects as well as in the software industry. They collect many different kinds of issues: requests for defect fixing, enhancements, refactoring/restructuring activities and organizational issues. These different kinds of issues are simply labeled as "bug" for lack of a better classification support or of knowledge about the possible kinds.

This paper investigates whether the text of the issues posted in bug tracking systems is enough to classify them into corrective maintenance and other kinds of activities.

We show that alternating decision trees, naive Bayes classifiers, and logistic regression can be used to accurately distinguish bugs from other kinds of issues. Results from empirical studies performed on issues for Mozilla, Eclipse, and JBoss indicate that issues can be classified with between 77% and 82% of correct decisions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
V. Basili, G. Caldiera, and D. H. Rombach. The Goal Question Metric Paradigm Encyclopedia of Software Engineering. John Wiley and Sons, 1994.
 
4
 
5
S. E. Robertson C. J. van Rijsbergen and M. F. Porter. New models in probabilistic information retrieval. London: British Library, Research and Development Report, no. 5587, 1980.
 
6
Rumelhart D. E., Hinton G. E., and Williams R. J. Learning representations by back-propagating errors. Nature, 323:533--536, 1986.
 
7
 
8
 
9
W. B. Frakes and R. Baeza-Yates. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, Englewood Cliffs, NJ, 1992.
 
10
 
11
 
12
 
13
 
14
 
15
 
16
J. O. Rawlings, S. G. Pandula, and D. A. Dickey. Applied Regression Analysis a Research Tool. Springer Texts in Statistics. New York: Springer-Verlag, second edition edition, 1998.
17
 
18
M. Stone. Cross-validatory choice and assesment of statistical predictions (with discussion). Journal of the Royal Statistical Society B, 36:111--147, 1974.
 
19
20
21
 
22
 
23
R. K. Yin. Case Study Research: Design and Methods - Third Edition. SAGE Publications, London, 2002.
 
24
 
25

Collaborative Colleagues:
Giuliano Antoniol: colleagues
Kamel Ayari: colleagues
Massimiliano Di Penta: colleagues
Foutse Khomh: colleagues
Yann-Gaël Guéhéneuc: colleagues