1. About MSR
Types of analysis
Possible lessons to be learned
10+ years of software analytics
Lessons learned that may be useful for learning analytics
Gregorio Robles, Jes´us M. Gonz´alez Barahona
{grex,jgb}@gsyc.urjc.es
GSyC/LibreSoft, Universidad Rey Juan Carlos, Madrid, Spain
LASI, July 5th, 2013
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
2. About MSR
Types of analysis
Possible lessons to be learned
c 2013 Gregorio Robles, Jes´us M. Gonz´alez-Barahona
All figures are ours, except when the original source is specified.
Some rights reserved. This presentation is distributed under the
“Attribution-ShareAlike 3.0” license, by Creative Commons, available at
http://creativecommons.org/licenses/by-sa/3.0/
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
3. About MSR
Types of analysis
Possible lessons to be learned
What is this talk about?
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
4. About MSR
Types of analysis
Possible lessons to be learned
About us
What we do
Main goal: Understanding
free software
Main focus: empirical
software engineering
Main method: retrieval of
publicly available data
Part of the GSyC/LibreSoft
research team
Spin-off company: Bitergia
The other author
Jes´us M. Gonz´alez-Barahona
Member of eMadrid
Universidad Rey Juan Carlos
Madrid, Spain
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
5. About MSR
Types of analysis
Possible lessons to be learned
Table of Contents
1 About MSR
2 Types of analysis
3 Possible lessons to be learned
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
6. About MSR
Types of analysis
Possible lessons to be learned
Data in software development
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
7. About MSR
Types of analysis
Possible lessons to be learned
Mining Software Repositories
Figure: Mining Software Repositories: http://msrconf.org
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
8. About MSR
Types of analysis
Possible lessons to be learned
Public available data sources
Mainly from free software projects
Artifacts
Mainly source code
Repositories (with meta-data)
Versioning systems
Bug-tracking system
Mailing list, forums, etc.
Other: twitter, chats, etc.
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
9. About MSR
Types of analysis
Possible lessons to be learned
Artifacts vs. repositories
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
10. About MSR
Types of analysis
Possible lessons to be learned
Table of Contents
1 About MSR
2 Types of analysis
3 Possible lessons to be learned
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
11. About MSR
Types of analysis
Possible lessons to be learned
Important keywords
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
12. About MSR
Types of analysis
Possible lessons to be learned
Recommender systems
Figure: Ahmed Lamkanfi et al., MSR 2010
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
13. About MSR
Types of analysis
Possible lessons to be learned
Social Network Analysis
Figure: Interactions for Linux 1.0.
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
14. About MSR
Types of analysis
Possible lessons to be learned
Developer territoriality (the toothbrush effect)
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
15. About MSR
Types of analysis
Possible lessons to be learned
Entry patterns
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
16. About MSR
Types of analysis
Possible lessons to be learned
Table of Contents
1 About MSR
2 Types of analysis
3 Possible lessons to be learned
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
17. About MSR
Types of analysis
Possible lessons to be learned
Correlation and causation
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
18. About MSR
Types of analysis
Possible lessons to be learned
Not the right data
Figure: Sometimes we have lots of data... but not the one required to
answer the question! Picture: (c) StatusMind.com
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
19. About MSR
Types of analysis
Possible lessons to be learned
Noise!
Figure: Tools have been thought for learning, not for analytics! Pic. (c)
www.socialresearchmethods.net
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
20. About MSR
Types of analysis
Possible lessons to be learned
Diversity
Figure: There is not size-fits-all. Every project has its own processes,
peculiarities, history and culture.
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
21. About MSR
Types of analysis
Possible lessons to be learned
Challenges for the research community
Figure: Challenges not only on existing data sources (it is not only about
the method!). Allow having new data sources as a challenge per se.
Picture: (c) Redeem the commute
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
22. About MSR
Types of analysis
Possible lessons to be learned
Repositories
Figure: There is a need for the existence of public repositories where data
is shared. There is always a data source that you did not think about!
Links among data in the sources is difficult to gather.
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
23. About MSR
Types of analysis
Possible lessons to be learned
Replication
Figure: Have you thought about replicating your studies later? And
having it replicated by an independent resarch group? Figure: (c) 2009
Archangel
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
24. About MSR
Types of analysis
Possible lessons to be learned
To take away!
We’ve shown the experience of analytics in another field
There are plenty of ideas, concepts, models and algorithms
that can be used
We may have lots of data to analyze... but
Be aware that correlation is not causation!
We need good (and the right) data
We need diverse data
We need to think about replicability
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics
25. About MSR
Types of analysis
Possible lessons to be learned
10+ years of software analytics
Lessons learned that may be useful for learning analytics
Gregorio Robles, Jes´us M. Gonz´alez Barahona
{grex,jgb}@gsyc.urjc.es
GSyC/LibreSoft, Universidad Rey Juan Carlos, Madrid, Spain
LASI, July 5th, 2013
Gregorio Robles, Jes´us M. Gonz´alez Barahona 10+ years of software analytics