SlideShare a Scribd company logo
1 of 41
Download to read offline
NOTICE: Proprietary and Confidential
This material is proprietary to Centric Consulting, LLC. It contains trade secrets and informationwhich is solely the property of Centric Consulting,LLC. This material is
solely for the Client’sinternaluse. This material shall not be used, reproduced, copied, disclosed, transmitted, in whole or in part, without the express consent of Centric
Consulting,LLC.
© 2013 Centric Consulting,LLC. All rights reserved
Bad Metric. Bad!
Teaching an old dog, nothing new
What are some typical metrics that you measure?
Other Examples of Software Testing Metrics
• Test Case Counts by Execution Status
• Test Case Percentages by Execution Status
• Test Case Execution Status Trend
• Test Case Status Planned vs Executed
• Test Case Coverage
• Test Case Status vs Coverage
• Test Case First Run Failure Counts
• Test Case Re– Run Counts
Test Cases
• Automation Index (Percent Automatable)
• Automation Progress
• Automation Test Coverage
Automation extras
More Examples of Software Testing Metrics
• Defect Counts by Status
• Defect Counts by Priority
• Defect Status Trend
• Defect Density
• Defect Remove Efficiency
• Defect Leakage
• Average Defect Response Time
Defects
• Requirements Volatility Index
• Testing Process Efficiency
Other
Common Themes
Counts
Metric (Counts/Counts)
Trends
Other Examples of Software Testing Metrics
• Test Case Counts by Execution Status – Count
• Test Case Percentages by Execution Status – Count
• Test Case Execution Status Trend – Trend
• Test Case Executed vs Planned – Metric and Trend
• Test Case Coverage – Metric
• Test Case Status vs Coverage – Metric
• Test Case First Run Failure Counts – Count
• Test Case Re– Run Counts – Count
Test Cases
• Automation Index (Percent Automatable) – Metric
• Automation Progress – Count
• Automation Test Coverage – Metric
Automation extras
More Examples of Software Testing Metrics
• Defect Counts by Status – Count
• Defect Counts by Priority – Count
• Defect Status Trend – Trend
• Defect Density – Metric
• Defect Remove Efficiency – Metric
• Defect Leakage – Metric
• Average Defect Response Time – Trend
Defects
• Requirements Volatility Index – Metric
• Testing Process Efficiency – Metric
Other
The Problem We Typically Face?
They Fail to Communicate
• Present data instead of information
• Offer no interpretation, allow user to draw own conclusion
They Are Often Inaccurate
• The act of measuring lacks of consistency
• The measures themselves have inherent variability
• No one reports margin of errors
They Do Not Measure a Control
• Can’t make decision based on number
• The measurement isn’t a lever to introduce change
They Are Not Tied to Organizational Objectives
• No threshold set for desired goal
• No action or consequence if not achieved
Counting
Counting
Exercise #1
1. Need 3 volunteers
2. Assume 1 scoop equals 1 days worth of testing effort
3. Hershey Kisses and Tootsie Rolls are tests, Starbursts are
bugs
4. Take a scoop
5. How many tests did you execute?
6. Based on how many tests you ran, how many more scoops
do you need to execute the rest (there are 180 total)?
Exercise #1 Questions
• Was the same scoop used? Were the results the
same?
• Was there variability in the number of tests run in
each scoop.
• Is that typical in testing?
• Was there variability in the estimate of the number
of tests left?
• Is this similar to guessing how much time is effort is left in
a test cycle?
• Are these numbers reliable?
• Are they repeatable?
Exercise #2
1. Need 3 volunteers
2. Assume 1 scoop equals 1 days worth of testing effort
3. Hershey Kisses and Tootsie Rolls are tests, Starbursts are bugs (Red are
severe)
4. Take a scoop
5. How many tests did you execute?
6. How many defects did you find?
7. Based on how many tests you ran, how many more scoops do you need
to execute the rest?
8. Based on how much effort you put in, how many more scoops do you
need to find the rest of the defects?
Exercise #2 Questions
• Was the same scoop used? Were the results the same?
• With an estimate of the number of tests remaining, is it reasonable to
estimate the number of defects will be found?
• Do people ask you to guess this type of information?
• If you know how many tests (Starbursts) are left and how many man-
hours you will use (scoop size), can you estimate how many scoops are
needed to execute all tests (find all Starbursts)?
• Is it accurate? Is it close enough?
• Are these numbers reliable?
• Are they repeatable?
• Does encountering defects (M&M’s) reveal anything about the overall
quality (how many M&M’s exist, or what it’ll take to find them)?
Challenges with Counting
Label does not equal content
Inherent variability
Not evenly spaced
Lacks reference for context
Lack of consistency
Metrics (Measure over Measure)
Sampling
Target Population
Matched Samples
Independent Samples
Random Sampling
Simple Random Sampling
Stratified Sampling
Cluster Sampling
Quota Sampling
Spatial Sampling
Sampling Variability
Standard Error
Bias
Precision
For each population there are many possible samples. A sample statistic
gives information about a corresponding population parameter
Sampling in Testing
Does testing use sampling?
Consider in most corporate environments:
• We never test the entire application
• It is not realistically possible to find
every defect
• So, does testing use sampling?
Ponder this as we discuss the next section…
Is Testing a Methodical Defect Searching
Activity?
Sampling
Remember, We can’t test everything – not enough time/people/budget
So, which sample approach better approximates an actual measure (e.g.
dots per sq. inch?)
5.25 dots/sq. in. 6.5 dots/sq. in.
Ponder this as we discuss the next section…
Is Testing a Methodical Defect Searching
Activity?
Sampling
Which sample approach better approximates an actual measure (e.g. dots
per sq. inch?)
• What is more accurate, random or methodical searching?
5.25 dots/sq. in. 6.5 dots/sq. in.
4.95 dots/sq. in. 6.3 dots/sq. in.
There are actually 6.6
dots/sq. in.
Exercise #3
Exercise #3
1. Need 3 volunteers
2. Assume 1 scoop equals 1 days worth of testing effort
3. Hershey Kisses and Tootsie Rolls are tests, Starbursts are bugs (Red are
severe)
4. Each volunteer grab 1 scoop of candy
5. How many (total) tests did you execute?
6. How many (total) defects did you find?
7. Log results
8. Repeat 2 more times
Exercise #3 Questions
• Does this graph represent anything useful?
• Does a trend line help or mean anything?
• Is it possible or reasonable to estimate the # of
defects you’ll see based on the number of
tests, from even 9 samples?
• Compare scoop 1 to scoop 9 – does any scoop
seem to be a reasonable estimate?
Challenges with Metrics (Measure over Measure)
Implied Derivations and Forecasting
Counts over Counts
Denominator Rules
Implies Velocity
Measure over Measure
Trends
Trend
Trend is a change in a measure (or metric) over time interval.
Has three components
Direction/Movement Speed/Size Cause (Implied)
Exercise #4
1. Need 3 volunteers
2. Assume 1 scoop equals 1 days worth of testing effort
3. Hershey Kisses and Tootsie Rolls are tests, Starbursts are bugs (Red are
severe)
4. Each volunteer grab 1 scoop of candy
5. How many of EACH type of tests did you execute?
6. How many of EACH type of defect did you find?
7. Log results
8. Repeat 2 more times
Exercise #4 Questions
• Does the graph line represent any information of value?
• Is there assurance (control) that simply taking a scoop (e.g.
executing tests in a given day) will result in defects being
found?
• Is the shape of the defect cumulative line representative of
anything?
• If we only look at scoops 1-3 or 7-9, does it tell us anything or
mislead us?
• What if we took 2 scoops per day (added a tester – but still
counted as 1 day), would that affect anything how things
look?
• Is M&M’s per scoop or M&M’s per skittles/starbursts mean
anything?
Challenges with Trends
Affected by challenges of counting
Affected by challenges of metrics
Time Based Series
Intervals and Activity Pause
Purpose of Metrics
Measure of
Performance
Conformance to
Best Practice
Deviation from Goal
Issues affecting purpose
Misaligned with strategy
Using metrics as outputs only
Too many metrics
Ease of measure does not equal importance
Lack of context
Limited dimensions
Lack behavioral aspects
Changing the World
How to Leverage Metrics
Explicitly link metrics to goals
Use trends over absolute numbers
Use shorter tracking periods
Change metrics when they stop
driving change
Account for error and confidence
Q&A
Joseph Ours
Email:
Joseph.ours@centricconsulting.com
Company Website:
https://centricconsulting.com/technol
ogy-solutions/software-quality-
assurance-and-testing/
Twitter:
@justjoehere
LinkedIN:
www.linkedin.com/josephours
Personal Blog:
http://josephours.blogspot.com

More Related Content

What's hot

Things Could Get Worse: Ideas About Regression Testing
Things Could Get Worse: Ideas About Regression TestingThings Could Get Worse: Ideas About Regression Testing
Things Could Get Worse: Ideas About Regression TestingTechWell
 
Will Robots Replace Testers?
Will Robots Replace Testers?Will Robots Replace Testers?
Will Robots Replace Testers?TEST Huddle
 
Ajay Balamnrugadas - Weekend Testing, Skilled Software Testing Unleashed - Eu...
Ajay Balamnrugadas - Weekend Testing, Skilled Software Testing Unleashed - Eu...Ajay Balamnrugadas - Weekend Testing, Skilled Software Testing Unleashed - Eu...
Ajay Balamnrugadas - Weekend Testing, Skilled Software Testing Unleashed - Eu...TEST Huddle
 
Julian Harty - Alternatives To Testing - EuroSTAR 2010
Julian Harty - Alternatives To Testing - EuroSTAR 2010Julian Harty - Alternatives To Testing - EuroSTAR 2010
Julian Harty - Alternatives To Testing - EuroSTAR 2010TEST Huddle
 
Testing for everyone agile yorkshire
Testing for everyone agile yorkshireTesting for everyone agile yorkshire
Testing for everyone agile yorkshireAdy Stokes
 
Rapid Software Testing: Reporting
Rapid Software Testing: ReportingRapid Software Testing: Reporting
Rapid Software Testing: ReportingTechWell
 
Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...
Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...
Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...Gaurav Singh Rajput
 
TDD Refresh - Austin RB - 2013-07-01 - public
TDD Refresh - Austin RB - 2013-07-01 - publicTDD Refresh - Austin RB - 2013-07-01 - public
TDD Refresh - Austin RB - 2013-07-01 - publicsbellware
 
Mixed Effects Models - Effect Size
Mixed Effects Models - Effect SizeMixed Effects Models - Effect Size
Mixed Effects Models - Effect SizeScott Fraundorf
 
Shrini Kulkarni - Software Metrics - So Simple, Yet So Dangerous
Shrini Kulkarni -  Software Metrics - So Simple, Yet So Dangerous Shrini Kulkarni -  Software Metrics - So Simple, Yet So Dangerous
Shrini Kulkarni - Software Metrics - So Simple, Yet So Dangerous TEST Huddle
 
Defining Test Competence
Defining Test CompetenceDefining Test Competence
Defining Test CompetenceJohan Hoberg
 
Challenging Your Project’s Testing Mindsets - Joe DeMeyer
Challenging Your Project’s Testing Mindsets - Joe DeMeyerChallenging Your Project’s Testing Mindsets - Joe DeMeyer
Challenging Your Project’s Testing Mindsets - Joe DeMeyerQA or the Highway
 
Ishikawa's Fish Bone
Ishikawa's Fish BoneIshikawa's Fish Bone
Ishikawa's Fish BonePrithvi Ghag
 
Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15
Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15
Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15MLconf
 

What's hot (20)

Things Could Get Worse: Ideas About Regression Testing
Things Could Get Worse: Ideas About Regression TestingThings Could Get Worse: Ideas About Regression Testing
Things Could Get Worse: Ideas About Regression Testing
 
Will Robots Replace Testers?
Will Robots Replace Testers?Will Robots Replace Testers?
Will Robots Replace Testers?
 
Ajay Balamnrugadas - Weekend Testing, Skilled Software Testing Unleashed - Eu...
Ajay Balamnrugadas - Weekend Testing, Skilled Software Testing Unleashed - Eu...Ajay Balamnrugadas - Weekend Testing, Skilled Software Testing Unleashed - Eu...
Ajay Balamnrugadas - Weekend Testing, Skilled Software Testing Unleashed - Eu...
 
Julian Harty - Alternatives To Testing - EuroSTAR 2010
Julian Harty - Alternatives To Testing - EuroSTAR 2010Julian Harty - Alternatives To Testing - EuroSTAR 2010
Julian Harty - Alternatives To Testing - EuroSTAR 2010
 
Testing for everyone agile yorkshire
Testing for everyone agile yorkshireTesting for everyone agile yorkshire
Testing for everyone agile yorkshire
 
Rapid Software Testing: Reporting
Rapid Software Testing: ReportingRapid Software Testing: Reporting
Rapid Software Testing: Reporting
 
Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...
Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...
Root Cause Analysis | 5 whys | Tools of accident investigation I Gaurav Singh...
 
TDD Refresh - Austin RB - 2013-07-01 - public
TDD Refresh - Austin RB - 2013-07-01 - publicTDD Refresh - Austin RB - 2013-07-01 - public
TDD Refresh - Austin RB - 2013-07-01 - public
 
SAM
SAMSAM
SAM
 
Mixed Effects Models - Effect Size
Mixed Effects Models - Effect SizeMixed Effects Models - Effect Size
Mixed Effects Models - Effect Size
 
Shrini Kulkarni - Software Metrics - So Simple, Yet So Dangerous
Shrini Kulkarni -  Software Metrics - So Simple, Yet So Dangerous Shrini Kulkarni -  Software Metrics - So Simple, Yet So Dangerous
Shrini Kulkarni - Software Metrics - So Simple, Yet So Dangerous
 
Testing for everyone
Testing for everyoneTesting for everyone
Testing for everyone
 
Defining Test Competence
Defining Test CompetenceDefining Test Competence
Defining Test Competence
 
Challenging Your Project’s Testing Mindsets - Joe DeMeyer
Challenging Your Project’s Testing Mindsets - Joe DeMeyerChallenging Your Project’s Testing Mindsets - Joe DeMeyer
Challenging Your Project’s Testing Mindsets - Joe DeMeyer
 
Problem solving
Problem solvingProblem solving
Problem solving
 
5 whys
5 whys5 whys
5 whys
 
Ishikawa's Fish Bone
Ishikawa's Fish BoneIshikawa's Fish Bone
Ishikawa's Fish Bone
 
Cause and effect analysis
Cause and effect analysisCause and effect analysis
Cause and effect analysis
 
Why why analysis
Why why analysisWhy why analysis
Why why analysis
 
Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15
Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15
Alessandro Magnani, Data Scientist, @WalmartLabs at MLconf SF - 11/13/15
 

Viewers also liked

Building Out Business Process Capabilities With Business Process Centers of E...
Building Out Business Process Capabilities With Business Process Centers of E...Building Out Business Process Capabilities With Business Process Centers of E...
Building Out Business Process Capabilities With Business Process Centers of E...Centric Consulting
 
The Art & Science of LifeCycle Marketing
The Art & Science of LifeCycle MarketingThe Art & Science of LifeCycle Marketing
The Art & Science of LifeCycle MarketingCentric Consulting
 
DevOps: Sprinkle Dev, Sprinkle Ops, Let's make Cake, not Mud Pies
DevOps: Sprinkle Dev, Sprinkle Ops, Let's make Cake, not Mud PiesDevOps: Sprinkle Dev, Sprinkle Ops, Let's make Cake, not Mud Pies
DevOps: Sprinkle Dev, Sprinkle Ops, Let's make Cake, not Mud PiesCentric Consulting
 
Metrics on the Money: The Art & Science of Change Measurement
Metrics on the Money: The Art & Science of Change MeasurementMetrics on the Money: The Art & Science of Change Measurement
Metrics on the Money: The Art & Science of Change MeasurementCentric Consulting
 
Finally, A Voice for the Enterprise!
Finally, A Voice for the Enterprise!Finally, A Voice for the Enterprise!
Finally, A Voice for the Enterprise!Centric Consulting
 
Marketing Automation Done Right 2017
Marketing Automation Done Right 2017Marketing Automation Done Right 2017
Marketing Automation Done Right 2017Centric Consulting
 
Business Process Excellence: Building Out Business Process Capabilities
Business Process Excellence: Building Out Business Process CapabilitiesBusiness Process Excellence: Building Out Business Process Capabilities
Business Process Excellence: Building Out Business Process CapabilitiesCentric Consulting
 
Mann india SAP Service Offerings- IS Retail
Mann india SAP Service Offerings- IS RetailMann india SAP Service Offerings- IS Retail
Mann india SAP Service Offerings- IS RetailMann-India
 
Microservices Application Simplicity Infrastructure Complexity
Microservices Application Simplicity Infrastructure ComplexityMicroservices Application Simplicity Infrastructure Complexity
Microservices Application Simplicity Infrastructure ComplexityCentric Consulting
 

Viewers also liked (15)

Building Out Business Process Capabilities With Business Process Centers of E...
Building Out Business Process Capabilities With Business Process Centers of E...Building Out Business Process Capabilities With Business Process Centers of E...
Building Out Business Process Capabilities With Business Process Centers of E...
 
Rise of the Wearables
Rise of the WearablesRise of the Wearables
Rise of the Wearables
 
Reclaiming Agile Development
Reclaiming Agile Development Reclaiming Agile Development
Reclaiming Agile Development
 
The Art & Science of LifeCycle Marketing
The Art & Science of LifeCycle MarketingThe Art & Science of LifeCycle Marketing
The Art & Science of LifeCycle Marketing
 
DevOps: Sprinkle Dev, Sprinkle Ops, Let's make Cake, not Mud Pies
DevOps: Sprinkle Dev, Sprinkle Ops, Let's make Cake, not Mud PiesDevOps: Sprinkle Dev, Sprinkle Ops, Let's make Cake, not Mud Pies
DevOps: Sprinkle Dev, Sprinkle Ops, Let's make Cake, not Mud Pies
 
Event-driven Architecture
Event-driven ArchitectureEvent-driven Architecture
Event-driven Architecture
 
Metrics on the Money: The Art & Science of Change Measurement
Metrics on the Money: The Art & Science of Change MeasurementMetrics on the Money: The Art & Science of Change Measurement
Metrics on the Money: The Art & Science of Change Measurement
 
Finally, A Voice for the Enterprise!
Finally, A Voice for the Enterprise!Finally, A Voice for the Enterprise!
Finally, A Voice for the Enterprise!
 
Marketing Automation Done Right 2017
Marketing Automation Done Right 2017Marketing Automation Done Right 2017
Marketing Automation Done Right 2017
 
Thinking Fast and Slow
Thinking Fast and SlowThinking Fast and Slow
Thinking Fast and Slow
 
Business Process Excellence: Building Out Business Process Capabilities
Business Process Excellence: Building Out Business Process CapabilitiesBusiness Process Excellence: Building Out Business Process Capabilities
Business Process Excellence: Building Out Business Process Capabilities
 
How to Run a Hackathon
How to Run a HackathonHow to Run a Hackathon
How to Run a Hackathon
 
Mann india SAP Service Offerings- IS Retail
Mann india SAP Service Offerings- IS RetailMann india SAP Service Offerings- IS Retail
Mann india SAP Service Offerings- IS Retail
 
Micro-Location with Beacons
Micro-Location with BeaconsMicro-Location with Beacons
Micro-Location with Beacons
 
Microservices Application Simplicity Infrastructure Complexity
Microservices Application Simplicity Infrastructure ComplexityMicroservices Application Simplicity Infrastructure Complexity
Microservices Application Simplicity Infrastructure Complexity
 

Similar to Proprietary Software Testing Metrics and Measures

How much testing is enough
How much testing is enoughHow much testing is enough
How much testing is enoughReti Yulvenia
 
Anton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQBAnton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQBIevgenii Katsan
 
Analytic emperical Mehods
Analytic emperical MehodsAnalytic emperical Mehods
Analytic emperical MehodsM Surendar
 
MLSEV Virtual. Automating Model Selection
MLSEV Virtual. Automating Model SelectionMLSEV Virtual. Automating Model Selection
MLSEV Virtual. Automating Model SelectionBigML, Inc
 
Evaluating tests
Evaluating testsEvaluating tests
Evaluating testscwhms
 
Software Quality Metrics Do's and Don'ts - QAI-Quest 1 Hour Presentation
Software Quality Metrics Do's and Don'ts - QAI-Quest 1 Hour PresentationSoftware Quality Metrics Do's and Don'ts - QAI-Quest 1 Hour Presentation
Software Quality Metrics Do's and Don'ts - QAI-Quest 1 Hour PresentationXBOSoft
 
Planning of experiment in industrial research
Planning of experiment in industrial researchPlanning of experiment in industrial research
Planning of experiment in industrial researchpbbharate
 
software testing metrics do's - don'ts-XBOSoft-QAI Webinar
software testing metrics do's - don'ts-XBOSoft-QAI Webinarsoftware testing metrics do's - don'ts-XBOSoft-QAI Webinar
software testing metrics do's - don'ts-XBOSoft-QAI WebinarXBOSoft
 
Software Quality Metrics Do's and Don'ts - XBOSoft-QAI Webinar
Software Quality Metrics Do's and Don'ts - XBOSoft-QAI WebinarSoftware Quality Metrics Do's and Don'ts - XBOSoft-QAI Webinar
Software Quality Metrics Do's and Don'ts - XBOSoft-QAI WebinarXBOSoft
 
Test case design techniques
Test case design techniquesTest case design techniques
Test case design techniquesAshutosh Garg
 
Test case design techniques
Test case design techniquesTest case design techniques
Test case design techniques2PiRTechnologies
 
5. testing differences
5. testing differences5. testing differences
5. testing differencesSteve Saffhill
 
Testing Metrics and Tools, Analyse de tests
Testing Metrics and Tools, Analyse de testsTesting Metrics and Tools, Analyse de tests
Testing Metrics and Tools, Analyse de testsHervKoya
 
validity and reliability ppt.ppt
validity and reliability ppt.pptvalidity and reliability ppt.ppt
validity and reliability ppt.pptrahulranjan215851
 
Ch. 7 finish and review
Ch. 7 finish and reviewCh. 7 finish and review
Ch. 7 finish and reviewjbnx
 

Similar to Proprietary Software Testing Metrics and Measures (20)

How much testing is enough
How much testing is enoughHow much testing is enough
How much testing is enough
 
Anton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQBAnton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQB
 
Model validation
Model validationModel validation
Model validation
 
Analytic emperical Mehods
Analytic emperical MehodsAnalytic emperical Mehods
Analytic emperical Mehods
 
MLSEV Virtual. Automating Model Selection
MLSEV Virtual. Automating Model SelectionMLSEV Virtual. Automating Model Selection
MLSEV Virtual. Automating Model Selection
 
Evaluating tests
Evaluating testsEvaluating tests
Evaluating tests
 
Software Quality Metrics Do's and Don'ts - QAI-Quest 1 Hour Presentation
Software Quality Metrics Do's and Don'ts - QAI-Quest 1 Hour PresentationSoftware Quality Metrics Do's and Don'ts - QAI-Quest 1 Hour Presentation
Software Quality Metrics Do's and Don'ts - QAI-Quest 1 Hour Presentation
 
Planning of experiment in industrial research
Planning of experiment in industrial researchPlanning of experiment in industrial research
Planning of experiment in industrial research
 
[Paul Holland] Bad Metrics and What You Can Do About It
[Paul Holland] Bad Metrics and What You Can Do About It[Paul Holland] Bad Metrics and What You Can Do About It
[Paul Holland] Bad Metrics and What You Can Do About It
 
Session 3 sample design
Session 3   sample designSession 3   sample design
Session 3 sample design
 
software testing metrics do's - don'ts-XBOSoft-QAI Webinar
software testing metrics do's - don'ts-XBOSoft-QAI Webinarsoftware testing metrics do's - don'ts-XBOSoft-QAI Webinar
software testing metrics do's - don'ts-XBOSoft-QAI Webinar
 
Software Quality Metrics Do's and Don'ts - XBOSoft-QAI Webinar
Software Quality Metrics Do's and Don'ts - XBOSoft-QAI WebinarSoftware Quality Metrics Do's and Don'ts - XBOSoft-QAI Webinar
Software Quality Metrics Do's and Don'ts - XBOSoft-QAI Webinar
 
Test case design techniques
Test case design techniquesTest case design techniques
Test case design techniques
 
Test case design techniques
Test case design techniquesTest case design techniques
Test case design techniques
 
5. testing differences
5. testing differences5. testing differences
5. testing differences
 
TCI in primary care - SEM (2006)
TCI in primary care - SEM (2006)TCI in primary care - SEM (2006)
TCI in primary care - SEM (2006)
 
Sampling brm chap-4
Sampling brm chap-4Sampling brm chap-4
Sampling brm chap-4
 
Testing Metrics and Tools, Analyse de tests
Testing Metrics and Tools, Analyse de testsTesting Metrics and Tools, Analyse de tests
Testing Metrics and Tools, Analyse de tests
 
validity and reliability ppt.ppt
validity and reliability ppt.pptvalidity and reliability ppt.ppt
validity and reliability ppt.ppt
 
Ch. 7 finish and review
Ch. 7 finish and reviewCh. 7 finish and review
Ch. 7 finish and review
 

Recently uploaded

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 

Recently uploaded (20)

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 

Proprietary Software Testing Metrics and Measures

  • 1. NOTICE: Proprietary and Confidential This material is proprietary to Centric Consulting, LLC. It contains trade secrets and informationwhich is solely the property of Centric Consulting,LLC. This material is solely for the Client’sinternaluse. This material shall not be used, reproduced, copied, disclosed, transmitted, in whole or in part, without the express consent of Centric Consulting,LLC. © 2013 Centric Consulting,LLC. All rights reserved Bad Metric. Bad! Teaching an old dog, nothing new
  • 2.
  • 3.
  • 4. What are some typical metrics that you measure?
  • 5. Other Examples of Software Testing Metrics • Test Case Counts by Execution Status • Test Case Percentages by Execution Status • Test Case Execution Status Trend • Test Case Status Planned vs Executed • Test Case Coverage • Test Case Status vs Coverage • Test Case First Run Failure Counts • Test Case Re– Run Counts Test Cases • Automation Index (Percent Automatable) • Automation Progress • Automation Test Coverage Automation extras
  • 6. More Examples of Software Testing Metrics • Defect Counts by Status • Defect Counts by Priority • Defect Status Trend • Defect Density • Defect Remove Efficiency • Defect Leakage • Average Defect Response Time Defects • Requirements Volatility Index • Testing Process Efficiency Other
  • 8. Other Examples of Software Testing Metrics • Test Case Counts by Execution Status – Count • Test Case Percentages by Execution Status – Count • Test Case Execution Status Trend – Trend • Test Case Executed vs Planned – Metric and Trend • Test Case Coverage – Metric • Test Case Status vs Coverage – Metric • Test Case First Run Failure Counts – Count • Test Case Re– Run Counts – Count Test Cases • Automation Index (Percent Automatable) – Metric • Automation Progress – Count • Automation Test Coverage – Metric Automation extras
  • 9. More Examples of Software Testing Metrics • Defect Counts by Status – Count • Defect Counts by Priority – Count • Defect Status Trend – Trend • Defect Density – Metric • Defect Remove Efficiency – Metric • Defect Leakage – Metric • Average Defect Response Time – Trend Defects • Requirements Volatility Index – Metric • Testing Process Efficiency – Metric Other
  • 10. The Problem We Typically Face? They Fail to Communicate • Present data instead of information • Offer no interpretation, allow user to draw own conclusion They Are Often Inaccurate • The act of measuring lacks of consistency • The measures themselves have inherent variability • No one reports margin of errors They Do Not Measure a Control • Can’t make decision based on number • The measurement isn’t a lever to introduce change They Are Not Tied to Organizational Objectives • No threshold set for desired goal • No action or consequence if not achieved
  • 13.
  • 14. Exercise #1 1. Need 3 volunteers 2. Assume 1 scoop equals 1 days worth of testing effort 3. Hershey Kisses and Tootsie Rolls are tests, Starbursts are bugs 4. Take a scoop 5. How many tests did you execute? 6. Based on how many tests you ran, how many more scoops do you need to execute the rest (there are 180 total)?
  • 15. Exercise #1 Questions • Was the same scoop used? Were the results the same? • Was there variability in the number of tests run in each scoop. • Is that typical in testing? • Was there variability in the estimate of the number of tests left? • Is this similar to guessing how much time is effort is left in a test cycle? • Are these numbers reliable? • Are they repeatable?
  • 16. Exercise #2 1. Need 3 volunteers 2. Assume 1 scoop equals 1 days worth of testing effort 3. Hershey Kisses and Tootsie Rolls are tests, Starbursts are bugs (Red are severe) 4. Take a scoop 5. How many tests did you execute? 6. How many defects did you find? 7. Based on how many tests you ran, how many more scoops do you need to execute the rest? 8. Based on how much effort you put in, how many more scoops do you need to find the rest of the defects?
  • 17. Exercise #2 Questions • Was the same scoop used? Were the results the same? • With an estimate of the number of tests remaining, is it reasonable to estimate the number of defects will be found? • Do people ask you to guess this type of information? • If you know how many tests (Starbursts) are left and how many man- hours you will use (scoop size), can you estimate how many scoops are needed to execute all tests (find all Starbursts)? • Is it accurate? Is it close enough? • Are these numbers reliable? • Are they repeatable? • Does encountering defects (M&M’s) reveal anything about the overall quality (how many M&M’s exist, or what it’ll take to find them)?
  • 18. Challenges with Counting Label does not equal content Inherent variability Not evenly spaced Lacks reference for context Lack of consistency
  • 20. Sampling Target Population Matched Samples Independent Samples Random Sampling Simple Random Sampling Stratified Sampling Cluster Sampling Quota Sampling Spatial Sampling Sampling Variability Standard Error Bias Precision For each population there are many possible samples. A sample statistic gives information about a corresponding population parameter
  • 21. Sampling in Testing Does testing use sampling? Consider in most corporate environments: • We never test the entire application • It is not realistically possible to find every defect • So, does testing use sampling?
  • 22. Ponder this as we discuss the next section… Is Testing a Methodical Defect Searching Activity?
  • 23. Sampling Remember, We can’t test everything – not enough time/people/budget So, which sample approach better approximates an actual measure (e.g. dots per sq. inch?) 5.25 dots/sq. in. 6.5 dots/sq. in.
  • 24. Ponder this as we discuss the next section… Is Testing a Methodical Defect Searching Activity?
  • 25. Sampling Which sample approach better approximates an actual measure (e.g. dots per sq. inch?) • What is more accurate, random or methodical searching? 5.25 dots/sq. in. 6.5 dots/sq. in. 4.95 dots/sq. in. 6.3 dots/sq. in. There are actually 6.6 dots/sq. in.
  • 27. Exercise #3 1. Need 3 volunteers 2. Assume 1 scoop equals 1 days worth of testing effort 3. Hershey Kisses and Tootsie Rolls are tests, Starbursts are bugs (Red are severe) 4. Each volunteer grab 1 scoop of candy 5. How many (total) tests did you execute? 6. How many (total) defects did you find? 7. Log results 8. Repeat 2 more times
  • 28. Exercise #3 Questions • Does this graph represent anything useful? • Does a trend line help or mean anything? • Is it possible or reasonable to estimate the # of defects you’ll see based on the number of tests, from even 9 samples? • Compare scoop 1 to scoop 9 – does any scoop seem to be a reasonable estimate?
  • 29. Challenges with Metrics (Measure over Measure) Implied Derivations and Forecasting Counts over Counts Denominator Rules Implies Velocity Measure over Measure
  • 31. Trend Trend is a change in a measure (or metric) over time interval. Has three components Direction/Movement Speed/Size Cause (Implied)
  • 32.
  • 33. Exercise #4 1. Need 3 volunteers 2. Assume 1 scoop equals 1 days worth of testing effort 3. Hershey Kisses and Tootsie Rolls are tests, Starbursts are bugs (Red are severe) 4. Each volunteer grab 1 scoop of candy 5. How many of EACH type of tests did you execute? 6. How many of EACH type of defect did you find? 7. Log results 8. Repeat 2 more times
  • 34. Exercise #4 Questions • Does the graph line represent any information of value? • Is there assurance (control) that simply taking a scoop (e.g. executing tests in a given day) will result in defects being found? • Is the shape of the defect cumulative line representative of anything? • If we only look at scoops 1-3 or 7-9, does it tell us anything or mislead us? • What if we took 2 scoops per day (added a tester – but still counted as 1 day), would that affect anything how things look? • Is M&M’s per scoop or M&M’s per skittles/starbursts mean anything?
  • 35. Challenges with Trends Affected by challenges of counting Affected by challenges of metrics Time Based Series Intervals and Activity Pause
  • 36.
  • 37. Purpose of Metrics Measure of Performance Conformance to Best Practice Deviation from Goal
  • 38. Issues affecting purpose Misaligned with strategy Using metrics as outputs only Too many metrics Ease of measure does not equal importance Lack of context Limited dimensions Lack behavioral aspects
  • 40. How to Leverage Metrics Explicitly link metrics to goals Use trends over absolute numbers Use shorter tracking periods Change metrics when they stop driving change Account for error and confidence