A historical dataset for the Gnome software ecosystem

1,012 views

Published on

Presentation of co-authored MSR 2013 paper "A historical dataset for Gnome contributors", presented by Maelick Claes at MSR 2013, San Francisco.
Abstract: We present a dataset of the open source software ecosystem GNOME from a social point of view. We have collected historical data about the contributors to all GNOME projects stored on git.gnome.org, taking into account the problem of identity matching, and as- sociating different activity types to the contributors. This type of information is very useful to complement the traditional, source-code related information one can ob- tain by mining and analyzing the actual source code. The dataset can be obtained at https://bitbucket.org/ mgoeminne/sgl-flossmetric-dbmerge.

Published in: Education, Technology
1 Comment
1 Like
Statistics
Notes
  • Presentation by Maelick Claes, PhD student of Tom Mens at the Software Engineering Lab, during the International Conference on Mining Software Repositories (MSR 2013) in San Francisco, May 2013.
    Check the dataset on https://bitbucket.org/mgoeminne/sgl-flossmetric-dbmerge
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
No Downloads
Views
Total views
1,012
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
0
Comments
1
Likes
1
Embeds 0
No embeds

No notes for slide

A historical dataset for the Gnome software ecosystem

  1. 1. A Historical Dataset for the Gnome EcosystemM. Goeminne, M. Claes, T. MensSoftware Engineering Lab, Computer Science DepartmentFaculty of Science, University of MonsMSR 2013
  2. 2. GnomeGoalStudy evolution of social aspects of the Gnome ecosystem.About GnomeGit repositories at http://git.gnome.org16 years of history (1997 to 2012)1,418 projects (stored in git version repositories)1,315,997 commits12,285,518 file touches11,094 contributor accountsM. Goeminne, M. Claes, T. Mens (G´enie Logiciel)A Historical Dataset for the Gnome Ecosystem MSR 2013 2 / 5
  3. 3. GnomeAbout the datasetFLOSSMetrics MySQL database:https://bitbucket.org/mgoeminne/sgl-flossmetric-dbmergeM. Goeminne, M. Claes, T. Mens (G´enie Logiciel)A Historical Dataset for the Gnome Ecosystem MSR 2013 3 / 5
  4. 4. GnomeIdentity merging, Activity typesIdentity mergingCSVAnalY2 hackSemi-automatic identity merging based on name and e-mail.5,923 after merging.Activity typesRegular expressions on file extension, file name and path.Coding activity e.g. .c, .h, .cpp, .py, .java.Future work: CVSAnalY2 extension with refined activity types.M. Goeminne, M. Claes, T. Mens (G´enie Logiciel)A Historical Dataset for the Gnome Ecosystem MSR 2013 4 / 5
  5. 5. GnomeUse caseM. Goeminne, M. Claes, T. Mens (G´enie Logiciel)A Historical Dataset for the Gnome Ecosystem MSR 2013 5 / 5

×