SlideShare a Scribd company logo
1 of 22
Metrics validation criteria: How do we know when a metric is worthwhile? Ben Smith Andy Meneely Laurie Williams
Scenario ,[object Object]
Metric Uses Metrics Quality Assessment Process Certification Process Improvement Task Planning Research Prediction
Motivation Software System Component  m=.25 Component  m=.95 Component  m=. 05 Component  m=.21 Component  m=.15 Component  m=.01 Prediction M < .2
Well, the metric was predictive… ,[object Object],[object Object]
Metrics Validation Criteria ,[object Object],[object Object],[object Object]
Agenda ,[object Object],[object Object],[object Object],[object Object]
Objective ,[object Object],[object Object],[object Object],[object Object]
Foundation in the Literature
Systematic literature review Phase Size of Source List Literature Index 2,228 Title 536 Cross-confirmed Title 156 Abstract 44 Full-text 17 Follow-up 20
Results of the Review ,[object Object],[object Object],[object Object],[object Object]
Two Competing Philosophies ,[object Object],[object Object]
Agenda ,[object Object],[object Object],[object Object],[object Object]
Scenario ,[object Object]
Choosing the best criteria ,[object Object],[object Object],[object Object],[object Object]
Metrics Validation Criteria A priori validity Actionability Appropriate Continuity Appropriate Granularity Association Attribute validity Causal model validity Causal relationship validity Content validity Construct validity Constructiveness Definition validity  Discriminative power  Dimensional consistency  Economic productivity  Empirical validity External validity  Factor independence  Improvement validity  Instrument validity  Increasing growth validity  Interaction sensitivity  Internal consistency  Internal validity  Monotonicity  Metric Reliability  Non-collinearity  Non-exploitability  Non-uniformity  Notation validity  Permutation validity  Predictability  Prediction system validity  Process or Product Relevance  Protocol validity  Rank Consistency  Renaming insensitivity  Repeatability  Representation condition  Scale validity  Stability  Theoretical validity  Trackability  Transformation invariance  Underlying theory validity  Unit validity  Usability
Reduced Metrics Validation Criteria
Rejected (and why) ,[object Object],[object Object]
Accepted (and why) ,[object Object],[object Object]
Agenda ,[object Object],[object Object],[object Object],[object Object]
Measurement Theory ,[object Object],[object Object],[object Object]
Questions?

More Related Content

What's hot

Workshop healthy ingredients ppm[1]
Workshop healthy ingredients ppm[1]Workshop healthy ingredients ppm[1]
Workshop healthy ingredients ppm[1]JULIO GONZALEZ SANZ
 
Forecasting
ForecastingForecasting
Forecastingsumit235
 
A Survey of Software Reliability factor
A Survey of Software Reliability factorA Survey of Software Reliability factor
A Survey of Software Reliability factorIOSR Journals
 
Feasible
FeasibleFeasible
Feasiblelearnt
 
Managing in the Presence of Uncertanty
Managing in the Presence of UncertantyManaging in the Presence of Uncertanty
Managing in the Presence of UncertantyGlen Alleman
 
Risk Driven Approach to Test Device Software
Risk Driven Approach to Test Device SoftwareRisk Driven Approach to Test Device Software
Risk Driven Approach to Test Device Softwareijtsrd
 
Introducing a tool into an organization
Introducing a tool into an organizationIntroducing a tool into an organization
Introducing a tool into an organizationmuhammad afif
 
The Perils of Clinical Trial Budgeting
The Perils of Clinical Trial BudgetingThe Perils of Clinical Trial Budgeting
The Perils of Clinical Trial BudgetingPerficient
 
Simplifying Analytics - by Novoniel Deb
Simplifying Analytics - by Novoniel DebSimplifying Analytics - by Novoniel Deb
Simplifying Analytics - by Novoniel DebNovoniel Deb
 
Testability measurement model for object oriented design (tmmood)
Testability measurement model for object oriented design (tmmood)Testability measurement model for object oriented design (tmmood)
Testability measurement model for object oriented design (tmmood)ijcsit
 
How to get the most from your clinical outcome assessment (COA) measure - Tes...
How to get the most from your clinical outcome assessment (COA) measure - Tes...How to get the most from your clinical outcome assessment (COA) measure - Tes...
How to get the most from your clinical outcome assessment (COA) measure - Tes...Keith Meadows
 
Hardwiring Safety 7 Tips For Changing Culture
Hardwiring Safety 7 Tips For Changing CultureHardwiring Safety 7 Tips For Changing Culture
Hardwiring Safety 7 Tips For Changing CulturePhil La Duke
 
840 plenary elder_using his laptop
840 plenary elder_using his laptop840 plenary elder_using his laptop
840 plenary elder_using his laptopRising Media, Inc.
 
Software estimation
Software estimationSoftware estimation
Software estimationguest4454de
 
Strategic vendor evaluation
Strategic vendor evaluationStrategic vendor evaluation
Strategic vendor evaluationBradr998
 
Hpe program rating #5 ops performance feedback
Hpe program rating #5 ops performance feedbackHpe program rating #5 ops performance feedback
Hpe program rating #5 ops performance feedbackPMHaas
 
Hack2Secure Assists Organization in Secure Application Development Through BS...
Hack2Secure Assists Organization in Secure Application Development Through BS...Hack2Secure Assists Organization in Secure Application Development Through BS...
Hack2Secure Assists Organization in Secure Application Development Through BS...hack2s
 
Product Evaluation HIS
Product Evaluation HISProduct Evaluation HIS
Product Evaluation HISAnil Patil
 

What's hot (20)

Workshop healthy ingredients ppm[1]
Workshop healthy ingredients ppm[1]Workshop healthy ingredients ppm[1]
Workshop healthy ingredients ppm[1]
 
Forecasting
ForecastingForecasting
Forecasting
 
A Survey of Software Reliability factor
A Survey of Software Reliability factorA Survey of Software Reliability factor
A Survey of Software Reliability factor
 
Feasible
FeasibleFeasible
Feasible
 
Foresight 4 Cybersecurity
Foresight 4 CybersecurityForesight 4 Cybersecurity
Foresight 4 Cybersecurity
 
Managing in the Presence of Uncertanty
Managing in the Presence of UncertantyManaging in the Presence of Uncertanty
Managing in the Presence of Uncertanty
 
Risk Driven Approach to Test Device Software
Risk Driven Approach to Test Device SoftwareRisk Driven Approach to Test Device Software
Risk Driven Approach to Test Device Software
 
Introducing a tool into an organization
Introducing a tool into an organizationIntroducing a tool into an organization
Introducing a tool into an organization
 
The Perils of Clinical Trial Budgeting
The Perils of Clinical Trial BudgetingThe Perils of Clinical Trial Budgeting
The Perils of Clinical Trial Budgeting
 
Simplifying Analytics - by Novoniel Deb
Simplifying Analytics - by Novoniel DebSimplifying Analytics - by Novoniel Deb
Simplifying Analytics - by Novoniel Deb
 
Testability measurement model for object oriented design (tmmood)
Testability measurement model for object oriented design (tmmood)Testability measurement model for object oriented design (tmmood)
Testability measurement model for object oriented design (tmmood)
 
How to get the most from your clinical outcome assessment (COA) measure - Tes...
How to get the most from your clinical outcome assessment (COA) measure - Tes...How to get the most from your clinical outcome assessment (COA) measure - Tes...
How to get the most from your clinical outcome assessment (COA) measure - Tes...
 
Hardwiring Safety 7 Tips For Changing Culture
Hardwiring Safety 7 Tips For Changing CultureHardwiring Safety 7 Tips For Changing Culture
Hardwiring Safety 7 Tips For Changing Culture
 
840 plenary elder_using his laptop
840 plenary elder_using his laptop840 plenary elder_using his laptop
840 plenary elder_using his laptop
 
Software estimation
Software estimationSoftware estimation
Software estimation
 
Strategic vendor evaluation
Strategic vendor evaluationStrategic vendor evaluation
Strategic vendor evaluation
 
Hpe program rating #5 ops performance feedback
Hpe program rating #5 ops performance feedbackHpe program rating #5 ops performance feedback
Hpe program rating #5 ops performance feedback
 
Hack2Secure Assists Organization in Secure Application Development Through BS...
Hack2Secure Assists Organization in Secure Application Development Through BS...Hack2Secure Assists Organization in Secure Application Development Through BS...
Hack2Secure Assists Organization in Secure Application Development Through BS...
 
Product Evaluation HIS
Product Evaluation HISProduct Evaluation HIS
Product Evaluation HIS
 
Testing strategies
Testing strategiesTesting strategies
Testing strategies
 

Similar to Security Predictions

A Comprehensive Guide to Measuring Success with Test Automation KPIs.pdf
A Comprehensive Guide to Measuring Success with Test Automation KPIs.pdfA Comprehensive Guide to Measuring Success with Test Automation KPIs.pdf
A Comprehensive Guide to Measuring Success with Test Automation KPIs.pdfkalichargn70th171
 
Effectiveness of software product metrics for mobile application
Effectiveness of software product metrics for mobile application Effectiveness of software product metrics for mobile application
Effectiveness of software product metrics for mobile application tanveer ahmad
 
Old Presentation on Security Metrics 2005
Old Presentation on Security Metrics 2005Old Presentation on Security Metrics 2005
Old Presentation on Security Metrics 2005Anton Chuvakin
 
Presenting Metrics to the Executive Team
Presenting Metrics to the Executive TeamPresenting Metrics to the Executive Team
Presenting Metrics to the Executive TeamJohn D. Johnson
 
Importance of software quality metrics
Importance of software quality metricsImportance of software quality metrics
Importance of software quality metricsPiyush Sohaney
 
SELECTING PROJECTS FOR VENTURE CAPITAL FUNDING updated
SELECTING PROJECTS FOR VENTURE CAPITAL FUNDING updatedSELECTING PROJECTS FOR VENTURE CAPITAL FUNDING updated
SELECTING PROJECTS FOR VENTURE CAPITAL FUNDING updatedGina Beim, PE, M ASCE
 
Developing useful metrics
Developing useful metricsDeveloping useful metrics
Developing useful metricsPriyanka Aash
 
Testing Metrics: Project, Product, Process
Testing Metrics: Project, Product, ProcessTesting Metrics: Project, Product, Process
Testing Metrics: Project, Product, ProcessTechWell
 
Unique fundamentals of software
Unique fundamentals of softwareUnique fundamentals of software
Unique fundamentals of softwareijcsit
 
The Complete Guide to Building an Effective Enterprise Testing Strategy.pdf
The Complete Guide to Building an Effective Enterprise Testing Strategy.pdfThe Complete Guide to Building an Effective Enterprise Testing Strategy.pdf
The Complete Guide to Building an Effective Enterprise Testing Strategy.pdfkalichargn70th171
 
ISTQB Advanced Study Guide - 8
ISTQB Advanced Study Guide - 8ISTQB Advanced Study Guide - 8
ISTQB Advanced Study Guide - 8Yogindernath Gupta
 
Security Metrics Program
Security Metrics ProgramSecurity Metrics Program
Security Metrics ProgramCydney Davis
 
Practical Measures for Measuring Security
Practical Measures for Measuring SecurityPractical Measures for Measuring Security
Practical Measures for Measuring SecurityChris Mullins
 
Paper 9: Innovation Assessment and Improvement (D. Xu)
Paper 9: Innovation Assessment and Improvement (D. Xu)Paper 9: Innovation Assessment and Improvement (D. Xu)
Paper 9: Innovation Assessment and Improvement (D. Xu)Kent Business School
 
Value Summary 2.0 Overview
Value Summary 2.0 OverviewValue Summary 2.0 Overview
Value Summary 2.0 Overviewbpatterson888
 
Bca 5th sem seminar(software measurements)
Bca 5th sem seminar(software measurements)Bca 5th sem seminar(software measurements)
Bca 5th sem seminar(software measurements)MuskanSony
 
Performance Measurement
Performance MeasurementPerformance Measurement
Performance Measurementlleuciuc1
 
Tricentis-report_Forrester-Modernizing-Testing-to-Accelerate-Digital-Business...
Tricentis-report_Forrester-Modernizing-Testing-to-Accelerate-Digital-Business...Tricentis-report_Forrester-Modernizing-Testing-to-Accelerate-Digital-Business...
Tricentis-report_Forrester-Modernizing-Testing-to-Accelerate-Digital-Business...havoc2003
 

Similar to Security Predictions (20)

A Comprehensive Guide to Measuring Success with Test Automation KPIs.pdf
A Comprehensive Guide to Measuring Success with Test Automation KPIs.pdfA Comprehensive Guide to Measuring Success with Test Automation KPIs.pdf
A Comprehensive Guide to Measuring Success with Test Automation KPIs.pdf
 
Effectiveness of software product metrics for mobile application
Effectiveness of software product metrics for mobile application Effectiveness of software product metrics for mobile application
Effectiveness of software product metrics for mobile application
 
Old Presentation on Security Metrics 2005
Old Presentation on Security Metrics 2005Old Presentation on Security Metrics 2005
Old Presentation on Security Metrics 2005
 
Presenting Metrics to the Executive Team
Presenting Metrics to the Executive TeamPresenting Metrics to the Executive Team
Presenting Metrics to the Executive Team
 
Importance of software quality metrics
Importance of software quality metricsImportance of software quality metrics
Importance of software quality metrics
 
SELECTING PROJECTS FOR VENTURE CAPITAL FUNDING updated
SELECTING PROJECTS FOR VENTURE CAPITAL FUNDING updatedSELECTING PROJECTS FOR VENTURE CAPITAL FUNDING updated
SELECTING PROJECTS FOR VENTURE CAPITAL FUNDING updated
 
Developing useful metrics
Developing useful metricsDeveloping useful metrics
Developing useful metrics
 
Testing Metrics: Project, Product, Process
Testing Metrics: Project, Product, ProcessTesting Metrics: Project, Product, Process
Testing Metrics: Project, Product, Process
 
Unique fundamentals of software
Unique fundamentals of softwareUnique fundamentals of software
Unique fundamentals of software
 
The Complete Guide to Building an Effective Enterprise Testing Strategy.pdf
The Complete Guide to Building an Effective Enterprise Testing Strategy.pdfThe Complete Guide to Building an Effective Enterprise Testing Strategy.pdf
The Complete Guide to Building an Effective Enterprise Testing Strategy.pdf
 
Default Credit Loss
Default Credit LossDefault Credit Loss
Default Credit Loss
 
ISTQB Advanced Study Guide - 8
ISTQB Advanced Study Guide - 8ISTQB Advanced Study Guide - 8
ISTQB Advanced Study Guide - 8
 
Security Metrics Program
Security Metrics ProgramSecurity Metrics Program
Security Metrics Program
 
Practical Measures for Measuring Security
Practical Measures for Measuring SecurityPractical Measures for Measuring Security
Practical Measures for Measuring Security
 
Paper 9: Innovation Assessment and Improvement (D. Xu)
Paper 9: Innovation Assessment and Improvement (D. Xu)Paper 9: Innovation Assessment and Improvement (D. Xu)
Paper 9: Innovation Assessment and Improvement (D. Xu)
 
Value Summary 2.0 Overview
Value Summary 2.0 OverviewValue Summary 2.0 Overview
Value Summary 2.0 Overview
 
Ijcatr04051006
Ijcatr04051006Ijcatr04051006
Ijcatr04051006
 
Bca 5th sem seminar(software measurements)
Bca 5th sem seminar(software measurements)Bca 5th sem seminar(software measurements)
Bca 5th sem seminar(software measurements)
 
Performance Measurement
Performance MeasurementPerformance Measurement
Performance Measurement
 
Tricentis-report_Forrester-Modernizing-Testing-to-Accelerate-Digital-Business...
Tricentis-report_Forrester-Modernizing-Testing-to-Accelerate-Digital-Business...Tricentis-report_Forrester-Modernizing-Testing-to-Accelerate-Digital-Business...
Tricentis-report_Forrester-Modernizing-Testing-to-Accelerate-Digital-Business...
 

Security Predictions

  • 1. Metrics validation criteria: How do we know when a metric is worthwhile? Ben Smith Andy Meneely Laurie Williams
  • 2.
  • 3. Metric Uses Metrics Quality Assessment Process Certification Process Improvement Task Planning Research Prediction
  • 4. Motivation Software System Component m=.25 Component m=.95 Component m=. 05 Component m=.21 Component m=.15 Component m=.01 Prediction M < .2
  • 5.
  • 6.
  • 7.
  • 8.
  • 9. Foundation in the Literature
  • 10. Systematic literature review Phase Size of Source List Literature Index 2,228 Title 536 Cross-confirmed Title 156 Abstract 44 Full-text 17 Follow-up 20
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. Metrics Validation Criteria A priori validity Actionability Appropriate Continuity Appropriate Granularity Association Attribute validity Causal model validity Causal relationship validity Content validity Construct validity Constructiveness Definition validity Discriminative power Dimensional consistency Economic productivity Empirical validity External validity Factor independence Improvement validity Instrument validity Increasing growth validity Interaction sensitivity Internal consistency Internal validity Monotonicity Metric Reliability Non-collinearity Non-exploitability Non-uniformity Notation validity Permutation validity Predictability Prediction system validity Process or Product Relevance Protocol validity Rank Consistency Renaming insensitivity Repeatability Representation condition Scale validity Stability Theoretical validity Trackability Transformation invariance Underlying theory validity Unit validity Usability
  • 18.
  • 19.
  • 20.
  • 21.

Editor's Notes

  1. You have the burden of proof. Not just that these metrics point to something, but that they are meaningful.
  2. A metric is a &amp;quot;quantitative scale and method that can be used to determine the value a feature takes for a specific software product”.
  3. Model for less than a specific value, let’s say .20.
  4. Concrete evidence of proposed metrics which emanates upward into increasingly abstracted analysis of the information we discovered These sections are actually from the journal paper we are submitting to EmSE. Backwards informative sometimes we would learn something later down the line that would help us go back to do an earlier process better.
  5. Google, CiteSeerX, IEEExplore, ACM Portal
  6. So which one of these does prediction fall into?
  7. Again, which one of these does prediction fall into?
  8. You have the burden of proof. Not just that these metrics point to something, but that they are meaningful.
  9. 47 Total – 21 Removed = 26 Remaining / 47 = 55%
  10. If we have to redefine a given metric, so it’s just a rephrasing of a well-known one, so that it can be applied to the project at hand, then that’s OK as long as the newly defined metric is predictive. This isn’t a property we necessarily want out of a metric, for example, code coverage shouldn’t increase when concatenating two components together, it should be the average of the two…
  11. Imagine a metric that is always predictive and with 100% accuracy will tell you which files are vulnerable in a system but which takes half a year to calculate and extract. Such a metric is not usable because by the time you obtain your much-needed values, the software system has changed—not to mention that you might have had a release or a complete architectural revamping. Alternatively imagine a metric which costs twice the budget of the entire project to collect—such a metric, no matter how accurate, is not worth collecting. The instrument can be a collection method or something a concrete as the tool used to measure some part of the metric. For example, imagine a test coverage utility that doesn’t accurately calculate branch coverage. This version of branch coverage is invalid, even if it’s predictive, because the method to increase the value of the metric is unclear. Testing more branches may decrease the value of the measurement, or increase the value too much.