SlideShare a Scribd company logo
1 of 19
Supplemental Data To Linked In
Post On 6/29/2015 On Capture-
Recapture For Software
Inspections With A Single Defect
James K. Orr
6/30/2015
Motivation For This Presentation
• I have been reviewing information that was still available from my time with
the Space Shuttle Primary Avionics Software (PASS) system.
• In August / September, 2011, I emailed a number of files to colleagues
continuing to work at NASA. This material was on PASS quality data and
discrepancy reports/analysis, with copies sent to my home email address.
• Recently, I was reviewing what information I still had access to, and moving
data from email attachments to more permanent backup.
• In doing this, I realized that I did not have any of the data on the study that I
did on using the Capture-Recapture concepts applied to inspections with a
single defect.
• I created the post on 6/29/2015 by dumping what I remembers in a single
composition activity.
• In hindsight, the absence on any data probably means that most readers will
miss key insights that I gained from the original data.
• This presentation is an attempt to quantify the concepts, although by
definition I am limited to artificial / simulated data only.
Warning On Potential For Misuse
• This presentation attempts to show significant
advantages in use of data as collected on the
Space Shuttle Primary Avionics Software (PASS)
project for quality and process management.
• However, this data must never be used in
personnel decisions such as appraisals.
• In the hands of software process professionals, it
can provide accurate assessment of the state of
the project and information for team training.
Reference, Background Information
• Reference:
– http://www.sei.cmu.edu/library/assets/capture-recapture1.pdf
– Getting More Out of Your Inspection Data: Using Capture-Recapture
Models for the Reinspection Decision; Julie Barnard, Khaled El Emam, and
Dave Zubrow; European SEPG Conference – Amsterdam - April 12, 2002
• In the sample of data that I was using, there were 861 inspections, of which
553 had no major errors. Of the 308 with major errors, only 89 had meet the
criteria for the above referenced study. My memory is that there were 71
inspections that had one and only one major error present.
Source Of Data
• I have spent about two hours recreating a simulation of software inspections
for the following narrow scope:
– Inspection randomly (equally probably) have 3 to 6 inspectors
– Inspection material contains one and only one major error
– Probably that a single inspector will detect the one major error crudely
model on my memory of Space Shuttle Primary Avionics Software (PASS)
project consistent with 2001 study in collaboration with Software
Engineering Institute
• Probability of a single inspector detecting on major error was
between 0 % and 40 % (randomly distributed).
Comment On Inspector Effectiveness
• For the entire set of 308 inspections, I analyzed each inspector for their individual
effectiveness in finding major Errors
– The data used in the SEI collaboration was provided to me with all indication of
inspector’s name converted to aliases. The maintainer of the inspection data could
relate an alias to an individual inspector. I never desired nor never knew the identity of
the individual inspectors; I only worked with the aliases.
– I identified data for all aliases (all inspectors) in the 308 inspections with major errors.
For each alias, I computed the present of errors found by the individual alias (individual
inspector) in all inspections by the total number of errors and later defects found in all
inspections that the individual participated. This represented the probability that this
individual would discover an error present in an individual inspection.
• Summary
– A relatively large percentage of inspectors had never participated in a single inspection
with a major error present (553 of 861 inspections had no major errors).
– Several inspectors had zero major error detected when in inspections with one or more
major errors.
– For those inspectors who had been in inspections with a large number of major errors,
most had detected less than 40 % of the major errors present.
Results
• Data is show for three separate simulations
• Data presented is
– Probability that single major error is found by one or more inspectors
– Probability that single major error is found by two or more inspectors
• Statistical simulation for each inspection (number of inspectors,
fixed probability of finding error for each inspector) was done 1000
times
• “Actual” results was simulated by randomly selecting the outcome
(results) on only one of the 1000 simulations
• This was done for 80 different inspections
• 80 inspections were grouped into 8 groups of 10 each where the
lowest group contained the 10 inspections with the lowest mean
probability of finding the error and the highest group contained the
10 inspection with the highest mean probability of finding the error.
Data For Simulation # 1
• Data will be shown for three simulations
• Inputs for all three are identical
• The only differences are the random statistical
differences (e.g., a different random number
selected to simulate “actual”, different
inspection team randomly assigned
characteristics)
Probability Of Major Error Found
Probability Of Error Found By Two Or
More Inspectors
Correlation Of Probability Of Two Inspectors
Finding Error To Inspection Effectiveness
Data For Simulation # 2
• The only differences are the random statistical
differences (e.g., a different random number
selected to simulate “actual”, different
inspection team randomly assigned
characteristics)
Probability Of Major Error Found
Probability Of Error Found By Two Or
More Inspectors
Correlation Of Probability Of Two Inspectors
Finding Error To Inspection Effectiveness
Data For Simulation # 3
• The only differences are the random statistical
differences (e.g., a different random number
selected to simulate “actual”, different
inspection team randomly assigned
characteristics)
Probability Of Major Error Found
Probability Of Error Found By Two Or
More Inspectors
Correlation Of Probability Of Two Inspectors
Finding Error To Inspection Effectiveness

More Related Content

What's hot

Monitoring "unknown unknowns" - Guy Fighel - DevOpsDays Tel Aviv 2017
Monitoring "unknown unknowns" - Guy Fighel - DevOpsDays Tel Aviv 2017Monitoring "unknown unknowns" - Guy Fighel - DevOpsDays Tel Aviv 2017
Monitoring "unknown unknowns" - Guy Fighel - DevOpsDays Tel Aviv 2017DevOpsDays Tel Aviv
 
Panda Provenance
Panda ProvenancePanda Provenance
Panda ProvenanceVlad Vega
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Ola Spjuth
 
Practitioners’ Expectations on Automated Fault Localization
Practitioners’ Expectations on Automated Fault LocalizationPractitioners’ Expectations on Automated Fault Localization
Practitioners’ Expectations on Automated Fault LocalizationPavneet Singh Kochhar
 
Software Analytics: Towards Software Mining that Matters
Software Analytics: Towards Software Mining that MattersSoftware Analytics: Towards Software Mining that Matters
Software Analytics: Towards Software Mining that MattersTao Xie
 
Metric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in OracleMetric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in OracleSteve Karam
 
Analytics for large-scale time series and event data
Analytics for large-scale time series and event dataAnalytics for large-scale time series and event data
Analytics for large-scale time series and event dataAnodot
 
Poster Electronic vs handsearching of conference abstracts-18th Cochrane Coll...
Poster Electronic vs handsearching of conference abstracts-18th Cochrane Coll...Poster Electronic vs handsearching of conference abstracts-18th Cochrane Coll...
Poster Electronic vs handsearching of conference abstracts-18th Cochrane Coll...Edith Leclercq
 
DMAP: Data Aggregation and Presentation Framework
DMAP: Data Aggregation and Presentation FrameworkDMAP: Data Aggregation and Presentation Framework
DMAP: Data Aggregation and Presentation FrameworkParang Saraf
 
Knowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 ProjectsKnowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 ProjectsVijay Karan
 
Knowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 ProjectsKnowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 ProjectsVijay Karan
 
Foursquare data @ Strata NY 2011
Foursquare data @ Strata NY 2011Foursquare data @ Strata NY 2011
Foursquare data @ Strata NY 2011Justin Moore
 
Pick Any Three: Good, Fast, or Safe - Devops from Scratch
Pick Any Three: Good, Fast, or Safe - Devops from ScratchPick Any Three: Good, Fast, or Safe - Devops from Scratch
Pick Any Three: Good, Fast, or Safe - Devops from ScratchPete Cheslock
 
Cheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug DiscoveryCheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug DiscoverySean Ekins
 

What's hot (15)

Monitoring "unknown unknowns" - Guy Fighel - DevOpsDays Tel Aviv 2017
Monitoring "unknown unknowns" - Guy Fighel - DevOpsDays Tel Aviv 2017Monitoring "unknown unknowns" - Guy Fighel - DevOpsDays Tel Aviv 2017
Monitoring "unknown unknowns" - Guy Fighel - DevOpsDays Tel Aviv 2017
 
Panda Provenance
Panda ProvenancePanda Provenance
Panda Provenance
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...
 
Practitioners’ Expectations on Automated Fault Localization
Practitioners’ Expectations on Automated Fault LocalizationPractitioners’ Expectations on Automated Fault Localization
Practitioners’ Expectations on Automated Fault Localization
 
Software Analytics: Towards Software Mining that Matters
Software Analytics: Towards Software Mining that MattersSoftware Analytics: Towards Software Mining that Matters
Software Analytics: Towards Software Mining that Matters
 
Metric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in OracleMetric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in Oracle
 
Analytics for large-scale time series and event data
Analytics for large-scale time series and event dataAnalytics for large-scale time series and event data
Analytics for large-scale time series and event data
 
Poster Electronic vs handsearching of conference abstracts-18th Cochrane Coll...
Poster Electronic vs handsearching of conference abstracts-18th Cochrane Coll...Poster Electronic vs handsearching of conference abstracts-18th Cochrane Coll...
Poster Electronic vs handsearching of conference abstracts-18th Cochrane Coll...
 
DMAP: Data Aggregation and Presentation Framework
DMAP: Data Aggregation and Presentation FrameworkDMAP: Data Aggregation and Presentation Framework
DMAP: Data Aggregation and Presentation Framework
 
Knowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 ProjectsKnowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 Projects
 
Knowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 ProjectsKnowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 Projects
 
Foursquare data @ Strata NY 2011
Foursquare data @ Strata NY 2011Foursquare data @ Strata NY 2011
Foursquare data @ Strata NY 2011
 
Pick Any Three: Good, Fast, or Safe - Devops from Scratch
Pick Any Three: Good, Fast, or Safe - Devops from ScratchPick Any Three: Good, Fast, or Safe - Devops from Scratch
Pick Any Three: Good, Fast, or Safe - Devops from Scratch
 
Cheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug DiscoveryCheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug Discovery
 
Expert system
Expert systemExpert system
Expert system
 

Similar to capture-recapture Single Defect

How to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated TestingHow to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated TestingTechWell
 
sience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studysience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studywolf vanpaemel
 
Empirical research methods for software engineering
Empirical research methods for software engineeringEmpirical research methods for software engineering
Empirical research methods for software engineeringsarfraznawaz
 
Effective Test Cases & Introduction to Hexawise
Effective Test Cases & Introduction to HexawiseEffective Test Cases & Introduction to Hexawise
Effective Test Cases & Introduction to HexawiseNilenth Selvaraja
 
DoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolDoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolAmit Sharma
 
Test design techniques
Test design techniquesTest design techniques
Test design techniquesOksana
 
Scientific Revenue USF 2016 talk
Scientific Revenue USF 2016 talkScientific Revenue USF 2016 talk
Scientific Revenue USF 2016 talkScientificRevenue
 
Auditing sampling presentation
Auditing sampling  presentationAuditing sampling  presentation
Auditing sampling presentationDominic Korkoryi
 
Did Something Change? - Using Statistical Techniques to Interpret Service and...
Did Something Change? - Using Statistical Techniques to Interpret Service and...Did Something Change? - Using Statistical Techniques to Interpret Service and...
Did Something Change? - Using Statistical Techniques to Interpret Service and...Joao Galdino Mello de Souza
 
Combinatorial testing ppt
Combinatorial testing pptCombinatorial testing ppt
Combinatorial testing pptKedar Kumar
 
Analyst’s Nightmare or Laundering Massive Spreadsheets
Analyst’s Nightmare or Laundering Massive SpreadsheetsAnalyst’s Nightmare or Laundering Massive Spreadsheets
Analyst’s Nightmare or Laundering Massive SpreadsheetsPyData
 
Software testing part
Software testing partSoftware testing part
Software testing partPreeti Mishra
 
Finger pointing
Finger pointingFinger pointing
Finger pointingBoundary
 
A beginners guide to testing
A beginners guide to testingA beginners guide to testing
A beginners guide to testingPhilip Johnson
 
How Data Collection Shapes MI Performance
How Data Collection Shapes MI PerformanceHow Data Collection Shapes MI Performance
How Data Collection Shapes MI PerformanceNorthwest Analytics
 
Random-Sampling-w-RAT-Stats.pptx
Random-Sampling-w-RAT-Stats.pptxRandom-Sampling-w-RAT-Stats.pptx
Random-Sampling-w-RAT-Stats.pptxAkhtaruzzamanlimon1
 
Audit Sampling. Murodullo Turdiyev.pptx
Audit Sampling. Murodullo Turdiyev.pptxAudit Sampling. Murodullo Turdiyev.pptx
Audit Sampling. Murodullo Turdiyev.pptxJanobHechkim1
 
Data Data Everywhere: Not An Insight to Take Action Upon
Data Data Everywhere: Not An Insight to Take Action UponData Data Everywhere: Not An Insight to Take Action Upon
Data Data Everywhere: Not An Insight to Take Action UponArun Kejariwal
 

Similar to capture-recapture Single Defect (20)

How to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated TestingHow to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated Testing
 
Software testing overview subbu
Software testing overview subbuSoftware testing overview subbu
Software testing overview subbu
 
sience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studysience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real study
 
Empirical research methods for software engineering
Empirical research methods for software engineeringEmpirical research methods for software engineering
Empirical research methods for software engineering
 
Effective Test Cases & Introduction to Hexawise
Effective Test Cases & Introduction to HexawiseEffective Test Cases & Introduction to Hexawise
Effective Test Cases & Introduction to Hexawise
 
Software testing
Software testingSoftware testing
Software testing
 
DoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolDoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End tool
 
Test design techniques
Test design techniquesTest design techniques
Test design techniques
 
Scientific Revenue USF 2016 talk
Scientific Revenue USF 2016 talkScientific Revenue USF 2016 talk
Scientific Revenue USF 2016 talk
 
Auditing sampling presentation
Auditing sampling  presentationAuditing sampling  presentation
Auditing sampling presentation
 
Did Something Change? - Using Statistical Techniques to Interpret Service and...
Did Something Change? - Using Statistical Techniques to Interpret Service and...Did Something Change? - Using Statistical Techniques to Interpret Service and...
Did Something Change? - Using Statistical Techniques to Interpret Service and...
 
Combinatorial testing ppt
Combinatorial testing pptCombinatorial testing ppt
Combinatorial testing ppt
 
Analyst’s Nightmare or Laundering Massive Spreadsheets
Analyst’s Nightmare or Laundering Massive SpreadsheetsAnalyst’s Nightmare or Laundering Massive Spreadsheets
Analyst’s Nightmare or Laundering Massive Spreadsheets
 
Software testing part
Software testing partSoftware testing part
Software testing part
 
Finger pointing
Finger pointingFinger pointing
Finger pointing
 
A beginners guide to testing
A beginners guide to testingA beginners guide to testing
A beginners guide to testing
 
How Data Collection Shapes MI Performance
How Data Collection Shapes MI PerformanceHow Data Collection Shapes MI Performance
How Data Collection Shapes MI Performance
 
Random-Sampling-w-RAT-Stats.pptx
Random-Sampling-w-RAT-Stats.pptxRandom-Sampling-w-RAT-Stats.pptx
Random-Sampling-w-RAT-Stats.pptx
 
Audit Sampling. Murodullo Turdiyev.pptx
Audit Sampling. Murodullo Turdiyev.pptxAudit Sampling. Murodullo Turdiyev.pptx
Audit Sampling. Murodullo Turdiyev.pptx
 
Data Data Everywhere: Not An Insight to Take Action Upon
Data Data Everywhere: Not An Insight to Take Action UponData Data Everywhere: Not An Insight to Take Action Upon
Data Data Everywhere: Not An Insight to Take Action Upon
 

More from James Orr

Truth how texas and houston compare to new york
Truth how texas and houston compare to new yorkTruth how texas and houston compare to new york
Truth how texas and houston compare to new yorkJames Orr
 
Covid 19 cases per day per 100,000 popylation 5-15-2020
Covid 19 cases per day per 100,000 popylation 5-15-2020Covid 19 cases per day per 100,000 popylation 5-15-2020
Covid 19 cases per day per 100,000 popylation 5-15-2020James Orr
 
Coronavirus data focused on new york comparison and data per population
Coronavirus data focused on new york comparison and data per populationCoronavirus data focused on new york comparison and data per population
Coronavirus data focused on new york comparison and data per populationJames Orr
 
Us coronavirus data as of may 1, 2020 just data
Us coronavirus data as of may 1, 2020  just dataUs coronavirus data as of may 1, 2020  just data
Us coronavirus data as of may 1, 2020 just dataJames Orr
 
Predicting coronavirus cases and questions need answering
Predicting coronavirus cases and questions need answeringPredicting coronavirus cases and questions need answering
Predicting coronavirus cases and questions need answeringJames Orr
 
Criteria to begin relaxing social distancing revision A
Criteria to begin relaxing social distancing revision ACriteria to begin relaxing social distancing revision A
Criteria to begin relaxing social distancing revision AJames Orr
 
Orr's model ix three Week forward coronavirus case predictions 04 12-2020
Orr's model ix three Week forward coronavirus case predictions 04 12-2020Orr's model ix three Week forward coronavirus case predictions 04 12-2020
Orr's model ix three Week forward coronavirus case predictions 04 12-2020James Orr
 
Measuring the effect of social distancing On Coronavirus
Measuring the effect of social distancing On CoronavirusMeasuring the effect of social distancing On Coronavirus
Measuring the effect of social distancing On CoronavirusJames Orr
 
Look at mathematics of flatten the curve
Look at mathematics of flatten the curveLook at mathematics of flatten the curve
Look at mathematics of flatten the curveJames Orr
 
Orr's model vii projection of cases to 4 16-2020 created 04-03-2020
Orr's model vii projection of cases to 4 16-2020 created 04-03-2020Orr's model vii projection of cases to 4 16-2020 created 04-03-2020
Orr's model vii projection of cases to 4 16-2020 created 04-03-2020James Orr
 
Orr's model vi projection of cases to 4 16-2020 created 04-03-2020
Orr's model vi projection of cases to 4 16-2020 created 04-03-2020Orr's model vi projection of cases to 4 16-2020 created 04-03-2020
Orr's model vi projection of cases to 4 16-2020 created 04-03-2020James Orr
 
Orr' projection model v 2020 03-29
Orr' projection model v 2020 03-29Orr' projection model v 2020 03-29
Orr' projection model v 2020 03-29James Orr
 
Coronavirus cases by state - projections for growther from 03-21-2020 to 04-...
Coronavirus cases by state  - projections for growther from 03-21-2020 to 04-...Coronavirus cases by state  - projections for growther from 03-21-2020 to 04-...
Coronavirus cases by state - projections for growther from 03-21-2020 to 04-...James Orr
 
Coronavirus pandemic public health - lessons in mathematics
Coronavirus pandemic public health  - lessons in mathematicsCoronavirus pandemic public health  - lessons in mathematics
Coronavirus pandemic public health - lessons in mathematicsJames Orr
 
Coronavirus case growth by country j.k.orr 2020 03-07
Coronavirus case growth by country  j.k.orr 2020 03-07Coronavirus case growth by country  j.k.orr 2020 03-07
Coronavirus case growth by country j.k.orr 2020 03-07James Orr
 
2018 is year to consider conversions of tax deferred funds to roth
2018 is year to consider conversions of tax deferred funds to roth2018 is year to consider conversions of tax deferred funds to roth
2018 is year to consider conversions of tax deferred funds to rothJames Orr
 
Effect of tax cut and job act for couples over 65
Effect of tax cut and job act for couples over 65Effect of tax cut and job act for couples over 65
Effect of tax cut and job act for couples over 65James Orr
 
The mythical 100 flood plain Houston Texas 2017
The mythical 100 flood plain Houston Texas 2017The mythical 100 flood plain Houston Texas 2017
The mythical 100 flood plain Houston Texas 2017James Orr
 
Hurricane harvey Impact On Houston Rainfall and Water Depth
Hurricane harvey Impact On Houston Rainfall and Water DepthHurricane harvey Impact On Houston Rainfall and Water Depth
Hurricane harvey Impact On Houston Rainfall and Water DepthJames Orr
 
“Lessons learned” not being learned
“Lessons learned” not being learned“Lessons learned” not being learned
“Lessons learned” not being learnedJames Orr
 

More from James Orr (20)

Truth how texas and houston compare to new york
Truth how texas and houston compare to new yorkTruth how texas and houston compare to new york
Truth how texas and houston compare to new york
 
Covid 19 cases per day per 100,000 popylation 5-15-2020
Covid 19 cases per day per 100,000 popylation 5-15-2020Covid 19 cases per day per 100,000 popylation 5-15-2020
Covid 19 cases per day per 100,000 popylation 5-15-2020
 
Coronavirus data focused on new york comparison and data per population
Coronavirus data focused on new york comparison and data per populationCoronavirus data focused on new york comparison and data per population
Coronavirus data focused on new york comparison and data per population
 
Us coronavirus data as of may 1, 2020 just data
Us coronavirus data as of may 1, 2020  just dataUs coronavirus data as of may 1, 2020  just data
Us coronavirus data as of may 1, 2020 just data
 
Predicting coronavirus cases and questions need answering
Predicting coronavirus cases and questions need answeringPredicting coronavirus cases and questions need answering
Predicting coronavirus cases and questions need answering
 
Criteria to begin relaxing social distancing revision A
Criteria to begin relaxing social distancing revision ACriteria to begin relaxing social distancing revision A
Criteria to begin relaxing social distancing revision A
 
Orr's model ix three Week forward coronavirus case predictions 04 12-2020
Orr's model ix three Week forward coronavirus case predictions 04 12-2020Orr's model ix three Week forward coronavirus case predictions 04 12-2020
Orr's model ix three Week forward coronavirus case predictions 04 12-2020
 
Measuring the effect of social distancing On Coronavirus
Measuring the effect of social distancing On CoronavirusMeasuring the effect of social distancing On Coronavirus
Measuring the effect of social distancing On Coronavirus
 
Look at mathematics of flatten the curve
Look at mathematics of flatten the curveLook at mathematics of flatten the curve
Look at mathematics of flatten the curve
 
Orr's model vii projection of cases to 4 16-2020 created 04-03-2020
Orr's model vii projection of cases to 4 16-2020 created 04-03-2020Orr's model vii projection of cases to 4 16-2020 created 04-03-2020
Orr's model vii projection of cases to 4 16-2020 created 04-03-2020
 
Orr's model vi projection of cases to 4 16-2020 created 04-03-2020
Orr's model vi projection of cases to 4 16-2020 created 04-03-2020Orr's model vi projection of cases to 4 16-2020 created 04-03-2020
Orr's model vi projection of cases to 4 16-2020 created 04-03-2020
 
Orr' projection model v 2020 03-29
Orr' projection model v 2020 03-29Orr' projection model v 2020 03-29
Orr' projection model v 2020 03-29
 
Coronavirus cases by state - projections for growther from 03-21-2020 to 04-...
Coronavirus cases by state  - projections for growther from 03-21-2020 to 04-...Coronavirus cases by state  - projections for growther from 03-21-2020 to 04-...
Coronavirus cases by state - projections for growther from 03-21-2020 to 04-...
 
Coronavirus pandemic public health - lessons in mathematics
Coronavirus pandemic public health  - lessons in mathematicsCoronavirus pandemic public health  - lessons in mathematics
Coronavirus pandemic public health - lessons in mathematics
 
Coronavirus case growth by country j.k.orr 2020 03-07
Coronavirus case growth by country  j.k.orr 2020 03-07Coronavirus case growth by country  j.k.orr 2020 03-07
Coronavirus case growth by country j.k.orr 2020 03-07
 
2018 is year to consider conversions of tax deferred funds to roth
2018 is year to consider conversions of tax deferred funds to roth2018 is year to consider conversions of tax deferred funds to roth
2018 is year to consider conversions of tax deferred funds to roth
 
Effect of tax cut and job act for couples over 65
Effect of tax cut and job act for couples over 65Effect of tax cut and job act for couples over 65
Effect of tax cut and job act for couples over 65
 
The mythical 100 flood plain Houston Texas 2017
The mythical 100 flood plain Houston Texas 2017The mythical 100 flood plain Houston Texas 2017
The mythical 100 flood plain Houston Texas 2017
 
Hurricane harvey Impact On Houston Rainfall and Water Depth
Hurricane harvey Impact On Houston Rainfall and Water DepthHurricane harvey Impact On Houston Rainfall and Water Depth
Hurricane harvey Impact On Houston Rainfall and Water Depth
 
“Lessons learned” not being learned
“Lessons learned” not being learned“Lessons learned” not being learned
“Lessons learned” not being learned
 

capture-recapture Single Defect

  • 1. Supplemental Data To Linked In Post On 6/29/2015 On Capture- Recapture For Software Inspections With A Single Defect James K. Orr 6/30/2015
  • 2. Motivation For This Presentation • I have been reviewing information that was still available from my time with the Space Shuttle Primary Avionics Software (PASS) system. • In August / September, 2011, I emailed a number of files to colleagues continuing to work at NASA. This material was on PASS quality data and discrepancy reports/analysis, with copies sent to my home email address. • Recently, I was reviewing what information I still had access to, and moving data from email attachments to more permanent backup. • In doing this, I realized that I did not have any of the data on the study that I did on using the Capture-Recapture concepts applied to inspections with a single defect. • I created the post on 6/29/2015 by dumping what I remembers in a single composition activity. • In hindsight, the absence on any data probably means that most readers will miss key insights that I gained from the original data. • This presentation is an attempt to quantify the concepts, although by definition I am limited to artificial / simulated data only.
  • 3. Warning On Potential For Misuse • This presentation attempts to show significant advantages in use of data as collected on the Space Shuttle Primary Avionics Software (PASS) project for quality and process management. • However, this data must never be used in personnel decisions such as appraisals. • In the hands of software process professionals, it can provide accurate assessment of the state of the project and information for team training.
  • 4. Reference, Background Information • Reference: – http://www.sei.cmu.edu/library/assets/capture-recapture1.pdf – Getting More Out of Your Inspection Data: Using Capture-Recapture Models for the Reinspection Decision; Julie Barnard, Khaled El Emam, and Dave Zubrow; European SEPG Conference – Amsterdam - April 12, 2002 • In the sample of data that I was using, there were 861 inspections, of which 553 had no major errors. Of the 308 with major errors, only 89 had meet the criteria for the above referenced study. My memory is that there were 71 inspections that had one and only one major error present.
  • 5. Source Of Data • I have spent about two hours recreating a simulation of software inspections for the following narrow scope: – Inspection randomly (equally probably) have 3 to 6 inspectors – Inspection material contains one and only one major error – Probably that a single inspector will detect the one major error crudely model on my memory of Space Shuttle Primary Avionics Software (PASS) project consistent with 2001 study in collaboration with Software Engineering Institute • Probability of a single inspector detecting on major error was between 0 % and 40 % (randomly distributed).
  • 6. Comment On Inspector Effectiveness • For the entire set of 308 inspections, I analyzed each inspector for their individual effectiveness in finding major Errors – The data used in the SEI collaboration was provided to me with all indication of inspector’s name converted to aliases. The maintainer of the inspection data could relate an alias to an individual inspector. I never desired nor never knew the identity of the individual inspectors; I only worked with the aliases. – I identified data for all aliases (all inspectors) in the 308 inspections with major errors. For each alias, I computed the present of errors found by the individual alias (individual inspector) in all inspections by the total number of errors and later defects found in all inspections that the individual participated. This represented the probability that this individual would discover an error present in an individual inspection. • Summary – A relatively large percentage of inspectors had never participated in a single inspection with a major error present (553 of 861 inspections had no major errors). – Several inspectors had zero major error detected when in inspections with one or more major errors. – For those inspectors who had been in inspections with a large number of major errors, most had detected less than 40 % of the major errors present.
  • 7. Results • Data is show for three separate simulations • Data presented is – Probability that single major error is found by one or more inspectors – Probability that single major error is found by two or more inspectors • Statistical simulation for each inspection (number of inspectors, fixed probability of finding error for each inspector) was done 1000 times • “Actual” results was simulated by randomly selecting the outcome (results) on only one of the 1000 simulations • This was done for 80 different inspections • 80 inspections were grouped into 8 groups of 10 each where the lowest group contained the 10 inspections with the lowest mean probability of finding the error and the highest group contained the 10 inspection with the highest mean probability of finding the error.
  • 8. Data For Simulation # 1 • Data will be shown for three simulations • Inputs for all three are identical • The only differences are the random statistical differences (e.g., a different random number selected to simulate “actual”, different inspection team randomly assigned characteristics)
  • 9. Probability Of Major Error Found
  • 10. Probability Of Error Found By Two Or More Inspectors
  • 11. Correlation Of Probability Of Two Inspectors Finding Error To Inspection Effectiveness
  • 12. Data For Simulation # 2 • The only differences are the random statistical differences (e.g., a different random number selected to simulate “actual”, different inspection team randomly assigned characteristics)
  • 13. Probability Of Major Error Found
  • 14. Probability Of Error Found By Two Or More Inspectors
  • 15. Correlation Of Probability Of Two Inspectors Finding Error To Inspection Effectiveness
  • 16. Data For Simulation # 3 • The only differences are the random statistical differences (e.g., a different random number selected to simulate “actual”, different inspection team randomly assigned characteristics)
  • 17. Probability Of Major Error Found
  • 18. Probability Of Error Found By Two Or More Inspectors
  • 19. Correlation Of Probability Of Two Inspectors Finding Error To Inspection Effectiveness