| An approach to detecting duplicate bug reports using natural language and execution information |
| Full text |
Pdf
(330 KB)
|
Source
|
International Conference on Software Engineering
archive
Proceedings of the 30th international conference on Software engineering
table of contents
Leipzig, Germany
SESSION: Evolution
table of contents
Pages 461-470
Year of Publication: 2008
ISBN:978-1-60558-079-1
|
|
Authors
|
|
Xiaoyin Wang
|
Peking University, Beijing, China
|
|
Lu Zhang
|
Peking University, Beijing, China
|
|
Tao Xie
|
North Carolina State University, Raleigh, NC, USA
|
|
John Anvik
|
University of Victoria, Victoria, BC, Canada
|
|
Jiasu Sun
|
Peking University, Beijing, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 31, Downloads (12 Months): 215, Citation Count: 4
|
|
|
ABSTRACT
An open source project typically maintains an open bug repository so that bug reports from all over the world can be gathered. When a new bug report is submitted to the repository, a person, called a triager, examines whether it is a duplicate of an existing bug report. If it is, the triager marks it as DUPLICATE and the bug report is removed from consideration for further work. In the literature, there are approaches exploiting only natural language information to detect duplicate bug reports. In this paper we present a new approach that further involves execution information. In our approach, when a new bug report arrives, its natural language information and execution information are compared with those of the existing bug reports. Then, a small number of existing bug reports are suggested to the triager as the most similar bug reports to the new bug report. Finally, the triager examines the suggested bug reports to determine whether the new bug report duplicates an existing bug report. We calibrated our approach on a subset of the Eclipse bug repository and evaluated our approach on a subset of the Firefox bug repository. The experimental results show that our approach can detect 67%-93% of duplicate bug reports in the Firefox bug repository, compared to 43%-72% using natural language information alone.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
| |
3
|
|
| |
4
|
Cubranic, D. and Murphy, G. Automatic Bug Triage Using Text Classification. In Proc. SEKE, 2004, 92--97.
|
| |
5
|
|
| |
6
|
|
| |
7
|
Greengrass, E. Information Retrieval: A Survey, University of Maryland, Baltimore County, 2000
|
| |
8
|
Hiew, L. Assisted Detection of Duplicate Bug Reports. Master's thesis, University of British Columbia, Canada, 2006.
|
 |
9
|
|
| |
10
|
|
 |
11
|
Ben Liblit , Alex Aiken , Alice X. Zheng , Michael I. Jordan, Bug isolation via remote program sampling, Proceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation, June 09-11, 2003, San Diego, California, USA
|
| |
12
|
|
| |
13
|
|
 |
14
|
|
| |
15
|
Andy Podgurski , David Leon , Patrick Francis , Wes Masri , Melinda Minch , Jiayang Sun , Bin Wang, Automated support for classifying software failure reports, Proceedings of the 25th International Conference on Software Engineering, May 03-10, 2003, Portland, Oregon
|
| |
16
|
Raghavan, V., Wong, M. A critical analysis of vector space model for information retrieval. Journal of the American Society for Information Science, 37, 5: p279--287, 1986.
|
| |
17
|
|
| |
18
|
|
| |
19
|
Sandusky, J., Gasser, L., and Ripoche, G. Bug Report Networks: Varieties, Strategies, and Impacts in an OSS Development Community, In Proc. MSR, 2004, 80--84.
|
CITED BY 4
|
|
Giuliano Antoniol , Kamel Ayari , Massimiliano Di Penta , Foutse Khomh , Yann-Gaël Guéhéneuc, Is it a bug or an enhancement?: a text-based approach to classify change requests, Proceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds, October 27-30, 2008, Ontario, Canada
|
|
|
Nicolas Bettenburg , Sascha Just , Adrian Schröter , Cathrin Weiss , Rahul Premraj , Thomas Zimmermann, What makes a good bug report?, Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering, November 09-14, 2008, Atlanta, Georgia
|
|
|
|
|
|
|
|