| TA-RE: an exchange language for mining software repositories |
| Full text |
Pdf
(287 KB)
|
| Source
|
International Conference on Software Engineering
archive
Proceedings of the 2006 international workshop on Mining software repositories
table of contents
Shanghai, China
SESSION: Repositories
table of contents
Pages: 22 - 25
Year of Publication: 2006
ISBN:1-59593-397-2
|
|
Authors
|
|
Sunghun Kim
|
University of California, Santa Cruz, CA
|
|
Thomas Zimmermann
|
Saarland University, Saarbrücken, Germany
|
|
Miryung Kim
|
University of Washington
|
|
Ahmed Hassan
|
University of Waterloo, Canada
|
|
Audris Mockus
|
Avaya labs
|
|
Tudor Girba
|
University of Berne, Switzerland
|
|
Martin Pinzger
|
University of Zurich, Switzerland
|
|
E. James Whitehead, Jr.
|
University of California, Santa Cruz, CA
|
|
Andreas Zeller
|
Saarland University, Saarbrücken, Germany
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 41, Citation Count: 1
|
|
|
ABSTRACT
Software repositories have been getting a lot of attention from researchers in recent years. In order to analyze software repositories, it is necessary to first extract raw data from the version control and problem tracking systems. This poses two challenges: (1) extraction requires a non-trivial effort, and (2) the results depend on the heuristics used during extraction. These challenges burden researchers that are new to the community and make it difficult to benchmark software repository mining since it is almost impossible to reproduce experiments done by another team. In this paper we present the TA-RE corpus. TA-RE collects extracted data from software repositories in order to build a collection of projects that will simplify extraction process. Additionally the collection can be used for benchmarking. As the first step we propose an exchange language capable of making sharing and reusing data as simple as possible.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Jennifer Bevan , E. James Whitehead, Jr. , Sunghun Kim , Michael Godfrey, Facilitating software evolution research with kenyon, Proceedings of the 10th European software engineering conference held jointly with 13th ACM SIGSOFT international symposium on Foundations of software engineering, September 05-09, 2005, Lisbon, Portugal
|
| |
3
|
|
| |
4
|
V. Dallmeier, P. Weißgerber, and T. Zimmermann, "APFEL: A Preprocessing Framework For Eclipse," 2005, http://www.st.cs.uni-sb.de/softevo/apfel/.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
|
 |
12
|
|
| |
13
|
|
 |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
|
 |
18
|
Audris Mockus , Ping Zhang , Paul Luo Li, Predictors of customer perceived software quality, Proceedings of the 27th international conference on Software engineering, p.225-233, May 15-21, 2005, St. Louis, MO, USA
[doi> 10.1145/1062455.1062506]
|
| |
19
|
D. J. Newman, S. Hettich, C. L. Blake, and C. J. Merz, "UCI Repository of machine learning databases," 1988, http://www.ics.uci.edu/~mlearn/MLRepository.html.
|
| |
20
|
J. Sayyad Shirabad and T. J. Menzies, "The PROMISE Repository of Software Engineering Databases," 2005, http://promise.site.uottawa.ca/SERepository.
|
 |
21
|
|
| |
22
|
T. Zimmermann and P. Weißgerber, "Preprocessing CVS Data for Fine-Grained Analysis," Proc. of Int'l Workshop on Mining Software Repositories (MSR 2004), Edinburgh, Scotland, pp. 2--6, 2004.
|
| |
23
|
|
|