|
ABSTRACT
The development of libre (free/open source) software is usually performed by geographically distributed teams. Participation in most cases is voluntary, sometimes sporadic, and often not framed by a pre-defined management structure. This means that anybody can contribute, and in principle no national origin has advantages over others, except for the differences in availability and quality of Internet connections and language. However, differences in participation across regions do exist, although there are little studies about them. In this paper we present some data which can be the basis for some of those studies. We have taken the database of users registered at SourceForge, the largest libre software development web-based platform, and have inferred their geographical locations. For this, we have applied several techniques and heuristics on the available data (mainly e-mail addresses and time zones), which are presented and discussed in detail. The results show a snapshot of the regional distribution of SourceForge users, which may be a good proxy of the actual distribution of libre software developers. In addition, the methodology may be of interest for similar studies in other domains, when the available data is similar (as is the case of mailing lists related to software projects).
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
P. A. David, A. Waterman, and S. Arora. FLOSS-US. The Free/Libre/Open Source Software Survey for 2003. Technical report, Stanford Institute for Economic and Policy Research, Stanford, USA, 2003.
|
| |
3
|
B. J. Dempsey, D. Weiss, P. Jones, and J. Greenberg. A quantitative profile of a community of Open Source Linux developers. Technical report, October 1999.
|
| |
4
|
R. A. Ghosh, R. Glott, B. Krieger, and G. Robles. Survey of developers (free/libre and open source software: Survey and study). Technical report, International Institute of Infonomics. University of Maastricht, The Netherlands, June 2002.
|
| |
5
|
K. Healy and A. Schussman. The ecology of open-source software development. Technical report, University of Arizona, USA, Jan. 2003.
|
| |
6
|
D. Lancashire. Code, culture and cash: The fading altruism of Open Source development. First Monday, 6(12), 2001.
|
| |
7
|
L. Lopez, J. M. Gonzalez-Barahona, and G. Robles. Applying social network analysis to the information in CVS repositories. In Proc Intl Workshop on Mining Software Repositories, pages 101--105, Edinburg, UK, 2004.
|
| |
8
|
G. Madey, V. Freeh, and R. Tynan. The open source development phenomenon: An analysis based on social network theory. In Americas Conf on Information Systems, pages 1806--1813, Dallas, TX, USA, 2002.
|
 |
9
|
Masao Ohira , Naoki Ohsugi , Tetsuya Ohoka , Ken-ichi Matsumoto, Accelerating cross-project knowledge collaboration using collaborative filtering and social networks, Proceedings of the 2005 international workshop on Mining software repositories, p.1-5, May 17-17, 2005, St. Louis, Missouri
|
 |
10
|
|
| |
11
|
G. Robles, S. Koch, and J. M. Gonzalez-Barahona. Remote analysis and measurement of libre software systems by means of the CVSAnalY tool. In Proc 2nd Workshop on Remote Analysis and Measurement of Software Systems, pages 51--56, Edinburg, UK, 2004.
|
| |
12
|
G. Robles, H. Scheider, I. Tretkowski, and N. Weber. Who is doing it? A research on libre software developers. Technical report, Technische Universitaet Berlin, Berlin, Germany, Aug. 2001.
|
| |
13
|
I. Tuomi. Evolution of the Linux Credits file: Methodological challenges and reference data for Open Source research. First Monday, 9(6), 2004.
|
|