Evidence for the Pareto
principle in OSS Activity
         Mathieu Goeminne & Tom Mens
 Service de Génie Logiciel, Institut d’Informatique
    Faculté des Sciences, Université de Mons




Research topic

       • Study of open source software evolution.
       • Taking into account the community (social
           network) of persons surrounding the
           software project (developers, users).
       • Looking for recurrent behaviour in this
           community.


Université de Mons   Tuesday 2011-03-01, SQM workshop, Oldenburg, Germany   Mathieu Goeminne & Tom Mens
Goals
       • Long-term goal
          •    Understand how software community and software
               product/project co-evolve

          •    Provide guidelines and tools to support this co-evolution

       • Short-term goal
          •    Study of how development activity is distributed
               over the different stakeholders.

          •    Find evidence for the Pareto principle in evolving
               OSS.
Questions
       •   Is there a core group (of developers and/or
           users) being significantly more active than the
           others?
       •   How does the activity distribution evolve over
           time?
       •   Is there an overlap between the different
           activities?
       •   How does the activity distribution vary across
           different projects?

Methodology
       • Exploiting available data from source
           code repositories, mailing lists and
           bug trackers.
       • Use of econometric indices measuring
           distribution (in)equality.
       • Empirical study of 3 OSS projects: Brasero,
           Evince and Wine.


Econometrics
       • Gini, Hoover, Theil (normalised): express
           inequality in a distribution.
       • Values between 0 and 1
        • 0 reflects perfect equality
        • 1 reflects perfect inequality
       • The three indices behave similarly.
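
The three indices can be sketched in a few lines of Python. This is a minimal illustration using the textbook definitions, not the authors' tooling. In particular, the slides do not say how the Theil index is normalised; dividing by ln(n) is an assumption made here so that the value stays between 0 and 1. Note that for a finite group of n persons, the Gini and Hoover indices reach at most (n - 1)/n.

```python
import math

def gini(values):
    """Gini index: 0 for perfect equality, approaching 1 for perfect inequality."""
    xs = sorted(values)
    n, total = len(xs), sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted formula on the values sorted in ascending order.
    weighted = sum((i + 1) * x for i, x in enumerate(xs))
    return 2 * weighted / (n * total) - (n + 1) / n

def hoover(values):
    """Hoover index: share of the total that would have to be redistributed."""
    n, total = len(values), sum(values)
    if n == 0 or total == 0:
        return 0.0
    mean = total / n
    return sum(abs(x - mean) for x in values) / (2 * total)

def theil_normalised(values):
    """Theil index divided by ln(n) (assumed normalisation)."""
    n, total = len(values), sum(values)
    if n < 2 or total == 0:
        return 0.0
    mean = total / n
    # Zero values contribute nothing (limit of x * ln(x) as x tends to 0).
    t = sum((x / mean) * math.log(x / mean) for x in values if x > 0) / n
    return t / math.log(n)

# Hypothetical activity counts, one entry per person.
equal = [5, 5, 5, 5]
concentrated = [0, 0, 0, 20]
for name, index in (("Gini", gini), ("Hoover", hoover), ("Theil", theil_normalised)):
    print(name, round(index(equal), 3), round(index(concentrated), 3))
```

All three functions take a plain list of activity counts (for example commits per person), so the same input can be fed to each index to compare how they behave.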
The 3 notions of activity we used
       • # commits done
       • # mails sent
       • # bug status changes
All of them are related to typical developer activities.



Activity distributions (Gini index)
[Figure: evolution of the Gini index (0 to 1) over time, one panel each for Brasero, Evince and Wine, with three curves per panel.]
Pareto principle

       • Most of the activity is carried out by a small
           group of persons.
       • Typically: 20% of the persons do 80% of the work.
       • Doesn’t necessarily imply that the activity
           distribution follows a Pareto law.
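
A quick way to check the 20/80 rule of thumb on a list of per-person activity counts is to measure the share produced by the most active fifth. The sketch below is illustrative only: the function name `top_share` and the sample counts are hypothetical and are not taken from the study.

```python
import math

def top_share(activity_counts, fraction=0.2):
    """Share of total activity produced by the most active `fraction` of persons."""
    counts = sorted(activity_counts, reverse=True)
    total = sum(counts)
    if not counts or total == 0:
        return 0.0
    # Round up so that at least one person is always included.
    k = max(1, math.ceil(fraction * len(counts)))
    return sum(counts[:k]) / total

# Hypothetical commit counts for ten contributors (not data from the study).
commits = [120, 45, 10, 8, 5, 4, 3, 2, 2, 1]
share = top_share(commits)
print(f"Top 20% of contributors account for {share:.1%} of commits")
```

A high share is evidence for the Pareto principle only; it says nothing about whether the counts follow a Pareto distribution, which would require fitting the distribution itself.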



Pareto principle (cont.)
[Figure: plots with both axes ranging from 0 to 1 for Brasero, Evince and Wine; series: commits, mails, br changes.]
Core groups
       • Display Venn diagrams of most active (top
           20) persons, according to each definition of
           activity.
       • For each person, show the percentage of
           activity attributable to this person.
       • Use heuristics to take into account and
           merge multiple identities representing the
           same real person.
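
The three steps above can be sketched as follows. The slides do not describe the identity-merging heuristics, so `normalise` below is a hypothetical stand-in (lowercase the name, drop the e-mail part, treat dots and underscores as spaces); the function names and sample identities are likewise assumptions made for illustration.

```python
from collections import Counter

def normalise(identity):
    """Hypothetical merge heuristic: keep the name part, lowercase it, unify separators."""
    name = identity.split("<")[0]
    return " ".join(name.replace(".", " ").replace("_", " ").lower().split())

def top_persons(events, n=20):
    """Return {person: share of activity} for the n most active merged identities."""
    counts = Counter(normalise(e) for e in events)
    total = sum(counts.values())
    return {p: c / total for p, c in counts.most_common(n)}

def venn_regions(committers, mailers, bug_changers):
    """Split three sets of persons into the seven regions of a Venn diagram."""
    c, m, b = set(committers), set(mailers), set(bug_changers)
    return {
        "commits only": c - m - b,
        "mails only": m - c - b,
        "bugs only": b - c - m,
        "commits+mails": (c & m) - b,
        "commits+bugs": (c & b) - m,
        "mails+bugs": (m & b) - c,
        "all three": c & m & b,
    }

# Hypothetical identities: one entry per commit, mail or bug status change.
commit_events = ["Alice Martin <alice@example.org>", "alice.martin", "Bob Lee <bob@example.org>"]
mail_events = ["alice_martin", "Carol Diaz <carol@example.org>"]
bug_events = ["Alice Martin", "bob.lee"]

regions = venn_regions(top_persons(commit_events), top_persons(mail_events), top_persons(bug_events))
for region, persons in regions.items():
    print(region, sorted(persons))
```

Real merge heuristics usually need more than string normalisation (for example matching e-mail addresses across accounts), so the result of such a sketch should be reviewed by hand.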

Core groups (cont.)
[Figure: Venn diagrams of the most active persons for Brasero, Evince and Wine.]
Conclusions
       • Activity seems to become more and more
           unequally distributed over time.
       • The Pareto principle is clearly present in
           the studied projects.
       • For Brasero and Evince, the activity is led
           by a limited number of persons involved in
           2 or 3 of the defined activities.
          • For Wine, this does not seem to be the case.
Future work
       • Determine if the core group of a project
           evolves over time.
       • Use sliding windows to ignore inactive
           persons and discover new active persons.
       • Study “bus factor” and the persons involved.
       • Automatic generation of Venn diagrams
           including all persons involved in software
           evolution.
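
One possible reading of the sliding-window idea is sketched below: for each window, collect the persons with at least one event, then compare with the previous window to find newcomers and persons who became inactive. The window length, step size, function names and sample events are all assumptions chosen for illustration, since the slides do not fix them.

```python
from datetime import date, timedelta

def active_persons(events, window_end, window_days=180):
    """Persons with at least one event in the window ending at window_end."""
    start = window_end - timedelta(days=window_days)
    return {person for day, person in events if start < day <= window_end}

def sliding_windows(events, first_end, last_end, step_days=30, window_days=180):
    """Yield (window_end, active, newcomers, leavers) for successive windows."""
    previous = set()
    end = first_end
    while end <= last_end:
        current = active_persons(events, end, window_days)
        yield end, current, current - previous, previous - current
        previous = current
        end += timedelta(days=step_days)

# Hypothetical (date, person) events, e.g. one per commit.
events = [
    (date(2010, 1, 10), "alice"),
    (date(2010, 3, 1), "bob"),
]
for end, active, newcomers, leavers in sliding_windows(
    events, date(2010, 1, 31), date(2010, 3, 31), step_days=30, window_days=30
):
    print(end, sorted(active), sorted(newcomers), sorted(leavers))
```

The same per-window sets of active persons could then be fed to an inequality index to follow the core group over time.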

Future work (cont.)
       •   Study correlation between community structure
           (social network) and source code quality (as
           computed using software metrics).
       •   Automatic statistical analysis to determine the
           distributions fitting the data.
       •   Extend and refine types of activity. For instance:
           •   different types of commit activity (doc, source
               code, test, etc.); of mail activity (information,
               asking, answering, etc.); of bug repository activity
               (bug creation, modification and commenting)
Thank you




 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 

Evidence for the Pareto principle in open source software activity

  • 1. Evidence for the Pareto principle in OSS Activity. Mathieu Goeminne & Tom Mens, Service de Génie Logiciel, Institut d’Informatique, Faculté des Sciences, Université de Mons
  • 2. Research topic • Study of open source software evolution. • Taking into account the community (social network) of persons surrounding the software project (developers, users). • Looking for recurrent behaviour in this community. Université de Mons, Tuesday 2011-03-01, SQM workshop, Oldenburg, Germany, Mathieu Goeminne & Tom Mens
  • 3. Goals • Long-term goal • Understand how software community and software product/project co-evolve • Provide guidelines and tools to support this • Short-term goal • Study of how development activity is distributed over the different stakeholders. • Find evidence for the Pareto principle in evolving OSS.
  • 4. Questions • Is there a core group (of developers and/or users) being significantly more active than the others? • How does the activity distribution evolve over time? • Is there an overlap between the different activities? • How does the activity distribution vary across different projects?
  • 5. Methodology • Exploiting available data from source code repositories, mailing lists and bug trackers. • Use of economic metrics measuring distribution (in)equality. • Empirical study of 3 OSS projects: Brasero, Evince and Wine.
  • 6. Econometrics • Gini, Hoover and Theil (normalised) indices: express inequality in a distribution. • Values between 0 and 1 • 0 reflects perfect equality • 1 reflects perfect inequality • The three indices behave similarly.
  • 7. The 3 notions of activity we used • Number of commits made • Number of mails sent • Number of bug status changes. All of them are related to typical developer activities.
  • 8. Activity distributions (Gini index) [Figure: Gini index over time, one panel each for Brasero, Evince and Wine, with one curve per activity type (commits, mails, br changes); the vertical axis ranges from 0 to 1.]
  • 9. Pareto principle • Most of the activity is carried out by a small group of persons. • Typically: 20% of the persons do 80% of the work. • This does not necessarily imply that the activity distribution follows a Pareto law.
  • 10. Pareto principle (cont.) [Figure: one panel each for Brasero, Evince and Wine, with one curve per activity type (commits, mails, br changes); both axes range from 0 to 1.]
  • 11. Core groups • Display Venn diagrams of the most active (top 20) persons, according to each definition of activity. • For each person, show the percentage of activity attributable to this person. • Use heuristics to detect and merge multiple identities representing the same real person.
  • 12. Core groups (cont.) [Figure: Venn diagrams of the most active persons for Brasero, Evince and Wine.]
  • 13. Conclusions • Activity seems to become more and more unequally distributed over time. • The Pareto principle is clearly present in the studied projects. • For Brasero and Evince, the activity is led by a limited number of persons involved in 2 or 3 of the defined activities. • For Wine, this does not seem to be the case.
  • 14. Future work • Determine whether the core group of a project evolves over time. • Use sliding windows to ignore inactive persons and discover new active persons. • Study the “bus factor” and the persons involved. • Automatic generation of Venn diagrams including all persons involved in software evolution.
  • 15. Future work (cont.) • Study correlation between community structure (social network) and source code quality (as computed using software metrics). • Automatic statistical analysis to determine the distributions fitting the data. • Extend and refine types of activity. For instance: • different types of commit activity (doc, source code, test, etc.); of mail activity (information, asking, answering, etc.); of bug repository activity (bug creation, modification and commenting)
  • 16. Thank you
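
The inequality indices named on slide 6 and the 20/80 check described on slide 9 can be illustrated with a minimal Python sketch. It uses the standard textbook definitions of the Gini, Hoover and normalised Theil indices; the slides do not show the authors' own implementation, so the function names, the `top_share` helper and the sample commit counts below are all hypothetical and chosen only for illustration.

```python
import math


def gini(values):
    """Gini index of non-negative activity counts (0 = perfect equality)."""
    xs = sorted(values)
    n, total = len(xs), sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Standard formula on ascending data: sum_i (2i - n - 1) * x_i / (n * total)
    weighted = sum((2 * i - n - 1) * x for i, x in enumerate(xs, start=1))
    return weighted / (n * total)


def hoover(values):
    """Hoover index: share of the total that must move to reach equality."""
    n, total = len(values), sum(values)
    if n == 0 or total == 0:
        return 0.0
    mean = total / n
    return sum(abs(x - mean) for x in values) / (2 * total)


def theil_normalised(values):
    """Theil T index divided by ln(n), its maximum, so it lies in [0, 1]."""
    n, total = len(values), sum(values)
    if n < 2 or total == 0:
        return 0.0
    mean = total / n
    # Zero counts contribute nothing (limit of x * ln(x) as x -> 0).
    t = sum((x / mean) * math.log(x / mean) for x in values if x > 0) / n
    return t / math.log(n)


def top_share(values, fraction=0.2):
    """Share of total activity done by the most active `fraction` of persons."""
    xs = sorted(values, reverse=True)
    total = sum(xs)
    if total == 0:
        return 0.0
    k = max(1, round(fraction * len(xs)))  # at least one person
    return sum(xs[:k]) / total


# Hypothetical commit counts per person (not data from the studied projects).
commits = [120, 45, 10, 5, 5, 4, 3, 3, 3, 2]
print("Gini:  ", gini(commits))
print("Hoover:", hoover(commits))
print("Theil: ", theil_normalised(commits))
print("Share of commits by top 20%:", top_share(commits))
```

For a finite sample of n persons, the Gini and Hoover indices computed this way reach at most (n - 1)/n rather than exactly 1, while the normalised Theil index reaches 1 when a single person accounts for all the activity.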