software evolution & architecture lab

Replication and Benchmarking 

in Software Analytics
Harald Gall
University of Zuri...
Panel @ ESEC/FSE 2013
The Screening Plant of a SW Miner
SQA Mashup Teaser
Roadmap for the talk
‣

Challenges of Software Mining Studies


‣

Mining Studies: Where are we now?

!

‣

‣

Software An...
I. Challenges of 

Software Mining Studies

software evolution & architecture lab
Which data sources?
‣

Evolution analysis data repositories à la PROMISE

‣
‣

‣

Flossmole, Sourcerer, Ultimate Debian DB...
What kind of studies?
‣

Source code

‣
‣

‣

Which entities co-evolve/co-change?

How to identify code smells or design d...
Example: Bug Prediction
Using Code Churn vs.
Fine-Grained Changes

Predicting the 

Types of Code Changes

Predicting the
...
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
Upcoming SlideShare
Loading in...5
×

Replication and Benchmarking in Software Analytics

457

Published on

Keynote at BENEVOL 2013, Mons, Belgium

Published in: Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
457
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
12
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Replication and Benchmarking in Software Analytics

  1. 1. software evolution & architecture lab Replication and Benchmarking in Software Analytics Harald Gall University of Zurich, Switzerland http:/ /seal.ifi.uzh.ch @ BENEVOL, Mons, 17 Dec 2013
  2. 2. Panel @ ESEC/FSE 2013
  3. 3. The Screening Plant of a SW Miner
  4. 4. SQA Mashup Teaser
  5. 5. Roadmap for the talk ‣ Challenges of Software Mining Studies ‣ Mining Studies: Where are we now? ! ‣ ‣ Software Analytics: Replication and Benchmarking An Infrastructure for Software Analytics
  6. 6. I. Challenges of Software Mining Studies software evolution & architecture lab
  7. 7. Which data sources? ‣ Evolution analysis data repositories à la PROMISE ‣ ‣ ‣ Flossmole, Sourcerer, Ultimate Debian DB Provide benchmark (raw) data Interactive online web platforms that provide various analyses ‣ ‣ Analyses offered by design ‣ ‣ Boa, FOSSology, Alitheia core, Ohloh Data produced is best used within the system Industrial project data (not widely accessible)
  8. 8. What kind of studies? ‣ Source code ‣ ‣ ‣ Which entities co-evolve/co-change? How to identify code smells or design disharmonies? Bugs and changes ‣ ‣ When do changes induce fixes? ‣ ‣ Who should / how long will it take to fix this bug? Predicting bugs and their components? Project and process ‣ Do code and comments co-evolve? ‣ Who are the experts of a piece of code?
  9. 9. Example: Bug Prediction Using Code Churn vs. Fine-Grained Changes Predicting the Types of Code Changes Predicting the Method Using the Gini Coefficient for Bug Prediction Using developer networks for Bug Prediction
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×