SlideShare a Scribd company logo
1 of 12
Download to read offline
SLOC and defect prediction




/   department of mathematics and computer science
2


     By no means:
     A study on aggregating
     software metrics
     Bogdan Vasilescu
     Alexander Serebrenik
     Mark van den Brand



May 20, 2011
                              Where innovation starts
Methodology                                                                                  3/5


                              Issue tracker                                     Software system


                                                                                          0
                                                                                1
                    Version control system
     r3780 | kataka | 2003-04-12 00:43:24 +0200 (za, 12 apr 2003) | 2 lines
     Changed paths:
       M /argouml/model/uml/modelmanagement/ModelManagementHelper.java
       M /argouml/uml/ui/foundation/core/ActionSetParameterType.java                  2
      Fixed issue 1544
     ------------------------------------------------------------------------
     r3769 | alexb | 2003-04-11 11:27:55 +0200 (vr, 11 apr 2003) | 4 lines
     Changed paths:
       M /argouml/uml/ui/foundation/core/PropPanelClass.java
       M /argouml/uml/ui/foundation/core/PropPanelInterface.java                          1
      fix for
                                                                                1
      Issue number: 1736



/   department of mathematics and computer science
Correlation between SLOC and defects                                                  4/5




                                                      ArgoUML     Adempiere     Mogwai
                    #Java classes                        1230         4047        2310
                    #Packages                               94         152         365
                    #Bugs mapped                            39         163           38
                    mean                              0.023 (7)    0.392 (3)   0.197 (2)
                    median                           -0.142 (8)    0.311 (4)   0.129 (7)
                    sum                               0.313 (1)    0.510 (1)   0.151 (3)
                    IGini                             0.267 (3)    0.225 (5)   0.134 (6)
                    ITheil                            0.269 (2)    0.185 (6)   0.135 (5)
                    IAtkinson                         0.245 (4)    0.168 (7)   0.138 (4)
                    IHoover                           0.240 (5)    0.113 (8)   0.122 (8)
                    IKolm                             0.144 (6)    0.412 (2)   0.204 (1)



/   department of mathematics and computer science
5/5




       The aggregation technique
       influences the correlation.

       Mean, median are inconsistent.

/   department of mathematics and computer science
Emerging trend                                  6/5




/   department of mathematics and computer science
Inequality indices                                                  7/5




     Econometrics: measure/explain the inequality of income or wealth.




/   department of mathematics and computer science
Inequality indices                                                                                                                                       8/5




     Econometrics: measure/explain the inequality of income or wealth.

     Software metrics and econometric variables have distributions with
     similar shapes.

                                   Household income in Ilocos, the Phillippines (1998)                       hibernate−3.6.0−beta4: org.hibernate.criterion
                             600




                                                                                                        35
                                                                                                        25
                             400
                 Frequency




                                                                                            Frequency

                                                                                                        15
                             200




                                                                                                        5
                             0




                                                                                                        0
                                     0     500000   1000000   1500000   2000000   2500000                    0     50     100     150    200     250    300

                                                         Income                                                                  SLOC




/   department of mathematics and computer science
Inequality indices                                                                                                                                        8/5




     Econometrics: measure/explain the inequality of income or wealth.

     Software metrics and econometric variables have distributions with
     similar shapes.

                                   Household income in Ilocos, the Phillippines (1998)                        hibernate−3.6.0−beta4: org.hibernate.criterion
                             600




                                                                                                         35
                                                                                                         25
                             400
                 Frequency




                                                                                             Frequency

                                                                                                         15
                             200




                                                                                                         5
                             0




                                                                                                         0
                                     0     500000    1000000   1500000   2000000   2500000                    0     50     100     150    200     250    300

                                                          Income                                                                  SLOC




                                                    Inequality in quality = low quality !


/   department of mathematics and computer science
Inequality indices and software metrics         9/5




/   department of mathematics and computer science
Inequality indices and software metrics                                     9/5




     Decomposable indices (partition the population into MECE groups):
             which partition provides the best explanation for the inequality?




/   department of mathematics and computer science
Traceability via decomposability                                  10/5




     Which individuals (classes in package) contribute to 80% of the
     inequality (of SLOC)?

     Which class contributes the most to the inequality?




/   department of mathematics and computer science

More Related Content

More from Bogdan Vasilescu (7)

Benevol 2012
Benevol 2012Benevol 2012
Benevol 2012
 
SOS-Evol 2012
SOS-Evol 2012SOS-Evol 2012
SOS-Evol 2012
 
IPA Spring Days 2012
IPA Spring Days 2012IPA Spring Days 2012
IPA Spring Days 2012
 
Sattose 2011
Sattose 2011Sattose 2011
Sattose 2011
 
Benevol 2010
Benevol 2010Benevol 2010
Benevol 2010
 
Master Thesis presentation
Master Thesis presentationMaster Thesis presentation
Master Thesis presentation
 
Seeing the forest for the trees, UMons 2011
Seeing the forest for the trees, UMons 2011Seeing the forest for the trees, UMons 2011
Seeing the forest for the trees, UMons 2011
 

Recently uploaded

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Krashi Coaching
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Recently uploaded (20)

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 

WETSoM 2011

  • 1. SLOC and defect prediction / department of mathematics and computer science
  • 2. 2 By no means: A study on aggregating software metrics Bogdan Vasilescu Alexander Serebrenik Mark van den Brand May 20, 2011 Where innovation starts
  • 3. Methodology 3/5 Issue tracker Software system 0 1 Version control system r3780 | kataka | 2003-04-12 00:43:24 +0200 (za, 12 apr 2003) | 2 lines Changed paths: M /argouml/model/uml/modelmanagement/ModelManagementHelper.java M /argouml/uml/ui/foundation/core/ActionSetParameterType.java 2 Fixed issue 1544 ------------------------------------------------------------------------ r3769 | alexb | 2003-04-11 11:27:55 +0200 (vr, 11 apr 2003) | 4 lines Changed paths: M /argouml/uml/ui/foundation/core/PropPanelClass.java M /argouml/uml/ui/foundation/core/PropPanelInterface.java 1 fix for 1 Issue number: 1736 / department of mathematics and computer science
  • 4. Correlation between SLOC and defects 4/5 ArgoUML Adempiere Mogwai #Java classes 1230 4047 2310 #Packages 94 152 365 #Bugs mapped 39 163 38 mean 0.023 (7) 0.392 (3) 0.197 (2) median -0.142 (8) 0.311 (4) 0.129 (7) sum 0.313 (1) 0.510 (1) 0.151 (3) IGini 0.267 (3) 0.225 (5) 0.134 (6) ITheil 0.269 (2) 0.185 (6) 0.135 (5) IAtkinson 0.245 (4) 0.168 (7) 0.138 (4) IHoover 0.240 (5) 0.113 (8) 0.122 (8) IKolm 0.144 (6) 0.412 (2) 0.204 (1) / department of mathematics and computer science
  • 5. 5/5 The aggregation technique influences the correlation. Mean, median are inconsistent. / department of mathematics and computer science
  • 6. Emerging trend 6/5 / department of mathematics and computer science
  • 7. Inequality indices 7/5 Econometrics: measure/explain the inequality of income or wealth. / department of mathematics and computer science
  • 8. Inequality indices 8/5 Econometrics: measure/explain the inequality of income or wealth. Software metrics and econometric variables have distributions with similar shapes. Household income in Ilocos, the Phillippines (1998) hibernate−3.6.0−beta4: org.hibernate.criterion 600 35 25 400 Frequency Frequency 15 200 5 0 0 0 500000 1000000 1500000 2000000 2500000 0 50 100 150 200 250 300 Income SLOC / department of mathematics and computer science
  • 9. Inequality indices 8/5 Econometrics: measure/explain the inequality of income or wealth. Software metrics and econometric variables have distributions with similar shapes. Household income in Ilocos, the Phillippines (1998) hibernate−3.6.0−beta4: org.hibernate.criterion 600 35 25 400 Frequency Frequency 15 200 5 0 0 0 500000 1000000 1500000 2000000 2500000 0 50 100 150 200 250 300 Income SLOC Inequality in quality = low quality ! / department of mathematics and computer science
  • 10. Inequality indices and software metrics 9/5 / department of mathematics and computer science
  • 11. Inequality indices and software metrics 9/5 Decomposable indices (partition the population into MECE groups): which partition provides the best explanation for the inequality? / department of mathematics and computer science
  • 12. Traceability via decomposability 10/5 Which individuals (classes in package) contribute to 80% of the inequality (of SLOC)? Which class contributes the most to the inequality? / department of mathematics and computer science