SlideShare a Scribd company logo
The Tester’s Dashboard:
Release Decision Support

         Robert V. Binder
System Verification Associates, LLC
        rbinder@ieee.org
          Peter B. Lakey
     Cognitive Concepts, Inc.
    peterlakey@sbcglobal.net
Overview
• Complementary metrics for release decision-
  support
  – Model-based testing
     • Operational profile
     • Model coverage metrics
  – Reliability Demonstration Chart
  – Relative Proximity
• Case Study
• Observations
Release Decision Support
Model-based Reliability Estimation
• Test suites must be
  – Proportional to operational profile
  – Sequentially feasible
  – Input feasible
• Approach
  – Markov model
  – Monte Carlo simulation
  – Post run analytics
Model Coverage Metrics
                                                         • % States
      Usage Profile


                                                           Reached
S                                                 T
                                                         • % State-
                                                           Transitions
                                              Observe
                                              System
                                                           Reached
                                              Failure

                        Trigger
                        Latent
                        Defect       Software System


                                                        Process Space
              Observe             fault
              System              activated
              Failure
                                                        Data Space
Reliability Demonstration Chart
• Sequential
  Sampling
• Risk-
  Adjusted
• Musa
  equations




      http://sourceforge.net/projects/rdc/
Relative Proximity
• Kullback-Lieber Distance
   – Information theoretic
   – Characterizes difference in variation of message population E
     (expected) and sample A (actual) as “relative entropy”



       KLD = ∑ 𝐴 𝑖 (𝑙𝑜𝑔2 (𝐴 𝑖 / E 𝑖 ))

• Relative Proximity
   – KLD math doesn’t work unless failures modeled (sum of the
     actuals must be 1.0)
   – Assume the target failure rate is aggregate
   – Allocate failure rate in proportion to each operation
Profile Explicit Failure Modes
• Assume maximum acceptable failure rate intensity of 1 in 10,000

  Operation   Mode     Standard     Explicit Failure Expected Number,
                        Profile         Profile         10000 Tests
      A       Pass              0.7           0.6993               6993
      B       Pass              0.2           0.1998               1998
      C       Pass              0.1           0.0999                999
Profile Explicit Failure Modes
      Mode    Expected Actual     KL Distance   Actual     KL Distance
  A   Pass        6993    7000           10.104    6990             -4.327
       Fail           7       0           0.000      10              5.146
  B   Pass        1998    1990          -11.518    2000              2.887
       Fail           2     10           23.219        0             0.000
  C   Pass          999    980          -27.149     994             -7.195
       Fail           1     20           86.439        6           15.510
                 10000   10000           81.094  10000             12.020

• Relative Proximity indicates the difference
  between actual and observed failure rates
• Many possible operation failure rates with better
  or worse fidelity
• RDC based on aggregate FIO, not sensitive to
  operation variance
Case Studies
•   Stochastic Models
•   Assumed Failure Rates
•   Word Processing Application
•   Ground-Based Midcourse Missile Defense
GBMD Test Run, 0-100
GBMD Test Run, 1K, 5K
GMBD Test Run, 10K
GBMD Relative Proximity Trend

  1600.00
                 1484.00
  1400.00

  1200.00

  1000.00

   800.00

   600.00
                             418.20
   400.00

   200.00                             67.90
                                               12.00 6.30
     0.00
            10             100          1000           10000
Observations
• Model coverage indicates minimal sufficiency
   – Wouldn’t release without all state-xtn pairs covered
   – Stochastic can take a long time to do this
   – Cover with N+ first
• RDC assumes “flat” profile
   – With sequential constraints, may be optimistic
   – Strength is explicit risk-adjustment
• Relative Proximity will indicate when operation-
  specific Failure Intensity is as expected (or not)
Q&A

More Related Content

Similar to The Tester’s Dashboard: Release Decision Support

Thesis PresentationvFinal3
Thesis PresentationvFinal3Thesis PresentationvFinal3
Thesis PresentationvFinal3
M Ghorbanzadeh
 
PhD_defense_presentation_Oct2013
PhD_defense_presentation_Oct2013PhD_defense_presentation_Oct2013
PhD_defense_presentation_Oct2013
Selvi Kadirvel
 
Rf network design
Rf network designRf network design
Rf network design
Nguyen Le
 
Overcoming challenges of_verifying complex mixed signal designs
Overcoming challenges of_verifying complex mixed signal designsOvercoming challenges of_verifying complex mixed signal designs
Overcoming challenges of_verifying complex mixed signal designs
Pankaj Singh
 
Using Six Sigma to Optimize Performance and Reliability
Using Six Sigma to Optimize Performance and ReliabilityUsing Six Sigma to Optimize Performance and Reliability
Using Six Sigma to Optimize Performance and Reliability
Timothy Williams
 
Deccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di Turi
Deccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di TuriDeccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di Turi
Deccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di Turi
pdituri
 

Similar to The Tester’s Dashboard: Release Decision Support (20)

(BDT207) Real-Time Analytics In Service Of Self-Healing Ecosystems
(BDT207) Real-Time Analytics In Service Of Self-Healing Ecosystems(BDT207) Real-Time Analytics In Service Of Self-Healing Ecosystems
(BDT207) Real-Time Analytics In Service Of Self-Healing Ecosystems
 
Enabling Carrier-Grade Availability Within a Cloud Infrastructure
Enabling Carrier-Grade Availability Within a Cloud InfrastructureEnabling Carrier-Grade Availability Within a Cloud Infrastructure
Enabling Carrier-Grade Availability Within a Cloud Infrastructure
 
Thesis PresentationvFinal3
Thesis PresentationvFinal3Thesis PresentationvFinal3
Thesis PresentationvFinal3
 
PhD_defense_presentation_Oct2013
PhD_defense_presentation_Oct2013PhD_defense_presentation_Oct2013
PhD_defense_presentation_Oct2013
 
Rf network design
Rf network designRf network design
Rf network design
 
Ithings2012 20nov
Ithings2012 20novIthings2012 20nov
Ithings2012 20nov
 
Multilin™ Intelligent Line Monitoring System
Multilin™ Intelligent Line Monitoring SystemMultilin™ Intelligent Line Monitoring System
Multilin™ Intelligent Line Monitoring System
 
EXTENT-2017: Climbing Out of the Stability Sinkhole - Survivor’s Guide
EXTENT-2017: Climbing Out of the Stability Sinkhole - Survivor’s GuideEXTENT-2017: Climbing Out of the Stability Sinkhole - Survivor’s Guide
EXTENT-2017: Climbing Out of the Stability Sinkhole - Survivor’s Guide
 
Virtual Power Plant - Becky Harrison, Progress Energy
Virtual Power Plant - Becky Harrison, Progress EnergyVirtual Power Plant - Becky Harrison, Progress Energy
Virtual Power Plant - Becky Harrison, Progress Energy
 
Overcoming challenges of_verifying complex mixed signal designs
Overcoming challenges of_verifying complex mixed signal designsOvercoming challenges of_verifying complex mixed signal designs
Overcoming challenges of_verifying complex mixed signal designs
 
Fuzzy Control meets Software Engineering
Fuzzy Control meets Software EngineeringFuzzy Control meets Software Engineering
Fuzzy Control meets Software Engineering
 
Nokia kpi and_core_optimization
Nokia kpi and_core_optimizationNokia kpi and_core_optimization
Nokia kpi and_core_optimization
 
(ISM301) Engineering Netflix Global Operations In The Cloud
(ISM301) Engineering Netflix Global Operations In The Cloud(ISM301) Engineering Netflix Global Operations In The Cloud
(ISM301) Engineering Netflix Global Operations In The Cloud
 
Fuzzy Self-Learning Controllers for Elasticity Management in Dynamic Cloud Ar...
Fuzzy Self-Learning Controllers for Elasticity Management in Dynamic Cloud Ar...Fuzzy Self-Learning Controllers for Elasticity Management in Dynamic Cloud Ar...
Fuzzy Self-Learning Controllers for Elasticity Management in Dynamic Cloud Ar...
 
Towards a Unified View of Cloud Elasticity
Towards a Unified View of Cloud ElasticityTowards a Unified View of Cloud Elasticity
Towards a Unified View of Cloud Elasticity
 
Overview of life testing in Minitab
Overview of life testing in MinitabOverview of life testing in Minitab
Overview of life testing in Minitab
 
Using Six Sigma to Optimize Performance and Reliability
Using Six Sigma to Optimize Performance and ReliabilityUsing Six Sigma to Optimize Performance and Reliability
Using Six Sigma to Optimize Performance and Reliability
 
FIELD TESTING of 2G 3G Devices.ppt
FIELD TESTING  of 2G 3G Devices.pptFIELD TESTING  of 2G 3G Devices.ppt
FIELD TESTING of 2G 3G Devices.ppt
 
Deccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di Turi
Deccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di TuriDeccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di Turi
Deccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di Turi
 
Wcdma planning
Wcdma planningWcdma planning
Wcdma planning
 

More from Bob Binder

More from Bob Binder (18)

How to Release Rock-solid RESTful APIs and Ice the Testing BackBlob
How to Release Rock-solid RESTful APIs and Ice the Testing BackBlobHow to Release Rock-solid RESTful APIs and Ice the Testing BackBlob
How to Release Rock-solid RESTful APIs and Ice the Testing BackBlob
 
Model-based Testing: Taking BDD/ATDD to the Next Level
Model-based Testing: Taking BDD/ATDD to the Next LevelModel-based Testing: Taking BDD/ATDD to the Next Level
Model-based Testing: Taking BDD/ATDD to the Next Level
 
Model-based Testing: Today And Tomorrow
Model-based Testing: Today And TomorrowModel-based Testing: Today And Tomorrow
Model-based Testing: Today And Tomorrow
 
Mobile App Assurance: Yesterday, Today, and Tomorrow.
Mobile App Assurance: Yesterday, Today, and Tomorrow.Mobile App Assurance: Yesterday, Today, and Tomorrow.
Mobile App Assurance: Yesterday, Today, and Tomorrow.
 
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?
Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?
 
Achieving Very High Reliability for Ubiquitous Information Technology
Achieving Very High Reliability for Ubiquitous Information Technology Achieving Very High Reliability for Ubiquitous Information Technology
Achieving Very High Reliability for Ubiquitous Information Technology
 
Testing Object-Oriented Systems: Lessons Learned
Testing Object-Oriented Systems: Lessons LearnedTesting Object-Oriented Systems: Lessons Learned
Testing Object-Oriented Systems: Lessons Learned
 
Model-Based Testing: Why, What, How
Model-Based Testing: Why, What, HowModel-Based Testing: Why, What, How
Model-Based Testing: Why, What, How
 
MDD and the Tautology Problem: Discussion Notes.
MDD and the Tautology Problem: Discussion Notes.MDD and the Tautology Problem: Discussion Notes.
MDD and the Tautology Problem: Discussion Notes.
 
Testability: Factors and Strategy
Testability: Factors and StrategyTestability: Factors and Strategy
Testability: Factors and Strategy
 
Test Objects -- They Just Work
Test Objects -- They Just WorkTest Objects -- They Just Work
Test Objects -- They Just Work
 
A Million Users in a Box: The WTS Story
A Million Users in a Box: The WTS StoryA Million Users in a Box: The WTS Story
A Million Users in a Box: The WTS Story
 
Software Test Patterns: Successes and Challenges
Software Test Patterns: Successes and ChallengesSoftware Test Patterns: Successes and Challenges
Software Test Patterns: Successes and Challenges
 
Assurance for Cloud Computing
Assurance for Cloud ComputingAssurance for Cloud Computing
Assurance for Cloud Computing
 
The Advanced Mobile Application Testing Environment: Project Report
The Advanced Mobile Application Testing Environment: Project ReportThe Advanced Mobile Application Testing Environment: Project Report
The Advanced Mobile Application Testing Environment: Project Report
 
Software Testing: Models, Patterns, Tools
Software Testing: Models, Patterns, ToolsSoftware Testing: Models, Patterns, Tools
Software Testing: Models, Patterns, Tools
 
The Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision SupportThe Tester’s Dashboard: Release Decision Support
The Tester’s Dashboard: Release Decision Support
 
Testability: Factors and Strategy
Testability: Factors and StrategyTestability: Factors and Strategy
Testability: Factors and Strategy
 

Recently uploaded

Recently uploaded (20)

НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 

The Tester’s Dashboard: Release Decision Support

  • 1. The Tester’s Dashboard: Release Decision Support Robert V. Binder System Verification Associates, LLC rbinder@ieee.org Peter B. Lakey Cognitive Concepts, Inc. peterlakey@sbcglobal.net
  • 2. Overview • Complementary metrics for release decision- support – Model-based testing • Operational profile • Model coverage metrics – Reliability Demonstration Chart – Relative Proximity • Case Study • Observations
  • 4.
  • 5. Model-based Reliability Estimation • Test suites must be – Proportional to operational profile – Sequentially feasible – Input feasible • Approach – Markov model – Monte Carlo simulation – Post run analytics
  • 6. Model Coverage Metrics • % States Usage Profile Reached S T • % State- Transitions Observe System Reached Failure Trigger Latent Defect Software System Process Space Observe fault System activated Failure Data Space
  • 7. Reliability Demonstration Chart • Sequential Sampling • Risk- Adjusted • Musa equations http://sourceforge.net/projects/rdc/
  • 8. Relative Proximity • Kullback-Lieber Distance – Information theoretic – Characterizes difference in variation of message population E (expected) and sample A (actual) as “relative entropy” KLD = ∑ 𝐴 𝑖 (𝑙𝑜𝑔2 (𝐴 𝑖 / E 𝑖 )) • Relative Proximity – KLD math doesn’t work unless failures modeled (sum of the actuals must be 1.0) – Assume the target failure rate is aggregate – Allocate failure rate in proportion to each operation
  • 9. Profile Explicit Failure Modes • Assume maximum acceptable failure rate intensity of 1 in 10,000 Operation Mode Standard Explicit Failure Expected Number, Profile Profile 10000 Tests A Pass 0.7 0.6993 6993 B Pass 0.2 0.1998 1998 C Pass 0.1 0.0999 999
  • 10. Profile Explicit Failure Modes Mode Expected Actual KL Distance Actual KL Distance A Pass 6993 7000 10.104 6990 -4.327 Fail 7 0 0.000 10 5.146 B Pass 1998 1990 -11.518 2000 2.887 Fail 2 10 23.219 0 0.000 C Pass 999 980 -27.149 994 -7.195 Fail 1 20 86.439 6 15.510 10000 10000 81.094 10000 12.020 • Relative Proximity indicates the difference between actual and observed failure rates • Many possible operation failure rates with better or worse fidelity • RDC based on aggregate FIO, not sensitive to operation variance
  • 11. Case Studies • Stochastic Models • Assumed Failure Rates • Word Processing Application • Ground-Based Midcourse Missile Defense
  • 12. GBMD Test Run, 0-100
  • 13. GBMD Test Run, 1K, 5K
  • 15. GBMD Relative Proximity Trend 1600.00 1484.00 1400.00 1200.00 1000.00 800.00 600.00 418.20 400.00 200.00 67.90 12.00 6.30 0.00 10 100 1000 10000
  • 16. Observations • Model coverage indicates minimal sufficiency – Wouldn’t release without all state-xtn pairs covered – Stochastic can take a long time to do this – Cover with N+ first • RDC assumes “flat” profile – With sequential constraints, may be optimistic – Strength is explicit risk-adjustment • Relative Proximity will indicate when operation- specific Failure Intensity is as expected (or not)
  • 17. Q&A