The Tester’s Dashboard: Release Decision Support

•

0 likes•498 views

The document discusses metrics for supporting release decisions based on model-based testing. It describes using an operational profile to generate test cases, calculating model coverage metrics, using a reliability demonstration chart to assess risk, and measuring relative proximity to compare expected and actual failure rates. A case study applies these methods to a word processing app and missile defense system. Key observations are that model coverage ensures sufficient testing, reliability demonstration charts assume flat profiles which may be optimistic, and relative proximity indicates when failure intensities match expectations.

Technology

The Tester’s Dashboard:
Release Decision Support

Robert V. Binder
System Verification Associates, LLC
rbinder@ieee.org
Peter B. Lakey
Cognitive Concepts, Inc.
peterlakey@sbcglobal.net

Reliability Demonstration Chart
• Sequential
Sampling
• Risk-
Adjusted
• Musa
equations

http://sourceforge.net/projects/rdc/

Relative Proximity
• Kullback-Lieber Distance
– Information theoretic
– Characterizes difference in variation of message population E
(expected) and sample A (actual) as “relative entropy”

KLD = ∑ 𝐴 𝑖 (𝑙𝑜𝑔2 (𝐴 𝑖 / E 𝑖 ))

• Relative Proximity
– KLD math doesn’t work unless failures modeled (sum of the
actuals must be 1.0)
– Assume the target failure rate is aggregate
– Allocate failure rate in proportion to each operation

Case Studies
• Stochastic Models
• Assumed Failure Rates
• Word Processing Application
• Ground-Based Midcourse Missile Defense

GBMD Relative Proximity Trend

1600.00
1484.00
1400.00

1200.00

1000.00

800.00

600.00
418.20
400.00

200.00 67.90
12.00 6.30
0.00
10 100 1000 10000

Similar to The Tester’s Dashboard: Release Decision Support

Netflix strives to provide an amazing experience to each member. To accomplish this, Netflix needs to maintain very high availability across our systems. However, at a certain scale, humans can no longer scale their ability to monitor the status of all systems, making it critical for Netflix to build tools and platforms that can automatically monitor their production environments and make intelligent real-time operational decisions to remedy the problems they identify. In this session, we discuss how Netflix uses data mining and machine learning techniques to automate decisions in real-time with the goal of supporting operational availability, reliability, and consistency. We review how we got to the current states, the lessons we learned, and the future of real-time analytics at Netflix. While Netflix's scale is larger than most other companies, we believe the approaches and technologies we discuss are highly relevant to other production environments, and audience members should come away with actionable ideas that are implementable in, and benefit, most other environments.

(BDT207) Real-Time Analytics In Service Of Self-Healing Ecosystems

Amazon Web Services

Aaron Smith, Red hat, Pasi Vaananen, Red Hat Carrier-Grade Cloud Infrastructure (Aaron Smith, Pasi Vaananen, Red Hat): The move from vertically integrated hardware and software to distributed execution in a cloud complicates the delivery of highly available services. Vertically integrated systems enabled all system layers required to communicate and participate in the support of availability of the service to be under control of single system vendor. With NFV, the cloud philosophy of infrastructure and application decoupling requires new open interfaces to support the necessary flow of information between layers and clear separation of the fault and availability management responsibilities between the infrastructure and application SW subsystems. Even in the cloud environment, traditional availability concepts such as fast detection, correlation, and fault notification still apply. A fast, low-latency fault management platform will be presented that allows cloud-based services to achieve 5NINES of availability and service continuity. Performance measurements from a prototype of the system will be presented along with a demo of the operation of a service requiring 50 ms fault remediation.

Enabling Carrier-Grade Availability Within a Cloud Infrastructure

OPNFV

Thesis PresentationvFinal3

M Ghorbanzadeh

PhD_defense_presentation_Oct2013

Selvi Kadirvel

Rf network design

Nguyen Le

Ithings2012 20nov

Panagiotis Garefalakis

Multilin™ Intelligent Line Monitoring System

Corporación Eléctrica del Ecuador, CELEC EP

EXTENT-2017: Software Testing & Trading Technology Trends Conference 29 June, 2017, 10 Paternoster Square, London Climbing Out of the Stability Sinkhole - Survivor’s Guide Sergei Poliakoff, CIO, Moscow Exchange Would like to know more? Visit our website: extentconf.com Follow us: https://www.linkedin.com/company/exactpro-systems-llc?trk=biz-companies-cym https://twitter.com/exactpro #extentconf #exactpro

EXTENT-2017: Climbing Out of the Stability Sinkhole - Survivor’s Guide

Iosif Itkin

Virtual Power Plant - Becky Harrison, Progress Energy

Energy Network marcus evans

Efficient and Innovative Digital Mixed-Signal (DMS) verification methodology is required to enable effective verification of RX path of SERDES. This presentation describes the usage of Real value models and Capture -Verify approach to verify complex high speed mixed signal design. Real value models are the backbone of DMS methodology. Real value models are created for all critical modules in Receive path like Equalizer and Sampler and its associated peripheral modules. It is critical to make sure created models are functionally equivalent to respective designs. This is achieved by verifying each created model with respective designs for all functional modes. While the Real Value models are effective in meeting overcoming the simulation performance bottleneck by achieving 10x faster simulation time; the Nonlinearity factors of the front-end design are not represented accurately in discrete domain real value models for next generation of SerDes Design at very high data rate. To overcome this problem, a novel approach called ‘capture and verify’ is used for verifying the jitter tolerance and eye parameters. In this approach, waveforms from spice level verification of Equalizer for different functional modes are captured and stored. These stored waveforms are used to generate run time table-based models to accurately represent the analog modules. These run time models are used in top-level simulations along with real value models thereby achieving required goal of simulation performance without compromising on accuracy of results. The complete Design Verification (DV) environment is developed using UVM-e Methodology. Verification environment contains model for transmitter with all de-emphasis settings along with protocol compliant channels with multiple attenuations. DV infrastructure has hooks to plug-in required channel models to verify SERDES. This verification environment is also capable of verifying the clock data recovery (CDR) path of the design using protocol compliant jitter and Spread-Spectrum Clocking (SSC) stimulus. The real value modelling bridges the gap between the performance requirements of the simulation and accuracy limitations of design. A significant speed-up in simulation performance is achieved (almost 10X in this case) by replacing with functionally equivalent real value models for mixed signal designs. Usage of Capture and Verify methodology with spice simulation waveforms for critical blocks ensures non-linearity of the next generation high speed SerDes design is well captured in simulations provide complete comprehensive solution for high speed mixed signal designs.

Overcoming challenges of_verifying complex mixed signal designs

Pankaj Singh

Fuzzy Control meets Software Engineering

Pooyan Jamshidi

Nokia kpi and_core_optimization

debasish goswami

Operating a massively scalable, constantly changing, distributed global service is a daunting task. We innovate at breakneck speed to attract new customers and stay ahead of the competition. This means more features, more experiments, more deployments, more engineers making changes in production environments, and ever-increasing complexity. Simultaneously improving service availability and accelerating rate of change seems impossible on the surface. At Netflix, operations engineering is both a technical and organizational construct designed to accomplish just that by integrating disciplines like continuous delivery, fault injection, regional traffic management, crisis response, best practice automation, and real-time analytics. In this talk, designed for technical leaders seeking a path to operational excellence, we'll explore these disciplines in depth and how they integrate and create competitive advantages.

(ISM301) Engineering Netflix Global Operations In The Cloud

Amazon Web Services

Fuzzy Self-Learning Controllers for Elasticity Management in Dynamic Cloud Ar...

Pooyan Jamshidi

Towards a Unified View of Cloud Elasticity

Srikumar Venugopal

After you have data from life testing, what do you do with it? Covering the basics, starting by showing "individual distribution identification" to ensure your data fits one of the reliability models, Showing how and when data can and should be analyzed with parametric distribution (Right censoring vs arbitrary censoring). Finally going through the Accelerated life testing function and how to interpret results.

Overview of life testing in Minitab

Rob Schubert

Using Six Sigma to Optimize Performance and Reliability

Timothy Williams

FIELD TESTING of 2G 3G Devices.ppt

patrickwang85

Deccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di Turi

pdituri

Wcdma planning

Shantanu Mukherjee

Similar to The Tester’s Dashboard: Release Decision Support (20)

(BDT207) Real-Time Analytics In Service Of Self-Healing Ecosystems

Enabling Carrier-Grade Availability Within a Cloud Infrastructure

Thesis PresentationvFinal3

PhD_defense_presentation_Oct2013

Rf network design

Ithings2012 20nov

Multilin™ Intelligent Line Monitoring System

EXTENT-2017: Climbing Out of the Stability Sinkhole - Survivor’s Guide

Virtual Power Plant - Becky Harrison, Progress Energy

Overcoming challenges of_verifying complex mixed signal designs

Fuzzy Control meets Software Engineering

Nokia kpi and_core_optimization

(ISM301) Engineering Netflix Global Operations In The Cloud

Fuzzy Self-Learning Controllers for Elasticity Management in Dynamic Cloud Ar...

Towards a Unified View of Cloud Elasticity

Overview of life testing in Minitab

Using Six Sigma to Optimize Performance and Reliability

FIELD TESTING of 2G 3G Devices.ppt

Deccan UGC 2011 Chiefs Forum - Seattle Fire Presentation - P Di Turi

Wcdma planning

More from Bob Binder

REST APIs are a key enabling technology for the cloud. Mobile applications, service-oriented architecture, and the Internet of Things depend on reliable and usable REST APIs. Unlike browser, native, and mobile apps, REST APIs can only be tested with software that drives the APIs. Unlike developer-centric hand-coded unit testing, adequate testing of REST APIs is truly well-suited to advanced automated testing. As most web service applications are developed following an Agile process, effective testing must also avoid the "testing backblob," in which work to maintain hand-coded BDD-style test suites exceeds available time after a few iterations. This talk presents a methodology for developing and testing REST APIs using model-based automation that has the beneficial side-effect of shrinking the testing backblob.

How to Release Rock-solid RESTful APIs and Ice the Testing BackBlob

Bob Binder

Slides from presentation at the Chicago Quality Assurance Association, February 25, 2014. Acceptance Test Driven Development (ATDD) and Behavior Driven Development (BDD) are well-established Agile practices that rely on the knowledge and intuition of testers, product owners, and developers to identify and then translate statements into test suites. But the resulting test suites often cover only a small slice of happy-path behavior. And, as a BDD specification and its associated test code base grows over time, work to maintain it either crowds out new development and testing or, typically, is simply ignored. Either is high-risk. That’s how Agile teams get eaten by the testing BackBlob.Model Based Testing is a tool-based approach to automate the creation of test cases. This presentation will outline the techniques and benefits of MBT, and show how model-based testing can address both problems. A detailed demo of Spec Explorer, a free model-based testing tool shows how a model is constructed and used to create and maintain a test suite.

Model-based Testing: Taking BDD/ATDD to the Next Level

Bob Binder

Keynote, ETSI Model-Based Testing User Conference. Tallinn, Estonia September 27, 2012. High-level discussion of model-based testing and trends driving software/system reliability. Explains how emergent behavior in complex systems ("dragon kings") causes catastrophic failures. My Multi-dimensional testing strategy can reveal this hard to find bugs/failure modes, but this requires a better approach to model-based testing. Overview: Is software eating the world? Bugs, Black Swans, Dragon Kings. Multi-dimensional Testing. Challenges.

Model-based Testing: Today And Tomorrow

Bob Binder

Mobile App Assurance: Yesterday, Today, and Tomorrow.

Bob Binder

Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?

Bob Binder

Achieving Very High Reliability for Ubiquitous Information Technology

Bob Binder

Testing Object-Oriented Systems: Lessons Learned

Bob Binder

Model-Based Testing: Why, What, How

Bob Binder

MDD and the Tautology Problem: Discussion Notes.

Bob Binder

Testability: Factors and Strategy

Bob Binder

Test Objects -- They Just Work

Bob Binder

A Million Users in a Box: The WTS Story

Bob Binder

Software Test Patterns: Successes and Challenges

Bob Binder

Assurance for Cloud Computing

Bob Binder

The Advanced Mobile Application Testing Environment: Project Report

Bob Binder

Software Testing: Models, Patterns, Tools

Bob Binder

The Tester’s Dashboard: Release Decision Support

Bob Binder

Testability: Factors and Strategy

Bob Binder

More from Bob Binder (18)

How to Release Rock-solid RESTful APIs and Ice the Testing BackBlob

Model-based Testing: Taking BDD/ATDD to the Next Level

Model-based Testing: Today And Tomorrow

Mobile App Assurance: Yesterday, Today, and Tomorrow.

Popular Delusions, Crowds, and the Coming Deluge: end of the Oracle?

Achieving Very High Reliability for Ubiquitous Information Technology

Testing Object-Oriented Systems: Lessons Learned

Model-Based Testing: Why, What, How

MDD and the Tautology Problem: Discussion Notes.

Testability: Factors and Strategy

Test Objects -- They Just Work

A Million Users in a Box: The WTS Story

Software Test Patterns: Successes and Challenges

Assurance for Cloud Computing

The Advanced Mobile Application Testing Environment: Project Report

Software Testing: Models, Patterns, Tools

The Tester’s Dashboard: Release Decision Support

Testability: Factors and Strategy

Recently uploaded

НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»

QADay

I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.

"Impact of front-end architecture on development cost", Viktor Turskyi

Fwdays

Speed Wins: From Kafka to APIs in Minutes

confluent

In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring. Learn about: • The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks. • Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective. • Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification. • Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process. Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.

Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality

Inflectra

IoT Analytics Company Presentation May 2024

IoTAnalytics

In this session, we will showcase how to revolutionize automated testing for your software, automation, and QA teams with UiPath Test Suite. In part 1 of UiPath test automation using UiPath Test Suite – developer series, we will cover, Software testing overview What is software testing Why software testing is required Typical test types and levels Continuous testing and challenges Introduction to UiPath Test Suite UiPath Test Suite family of products Speaker: Atul Trikha, Chief Technologist & Solutions Architect, Peraton and UiPath MVP Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP

UiPath Test Automation using UiPath Test Suite series, part 1

DianaGray10

Join us as we dive into the latest updates to the UiPath Orchestrator API, including new limits and features for 2024. Discover how these changes can enhance your automation projects and streamline your workflows. 📚 Overview of UiPath Orchestrator API 🔧 Recent changes to API limits 🛠️ How to adapt to new limits 📋 Best practices for using the Orchestrator API efficiently ❓ Q&A session

Exploring UiPath Orchestrator API: updates and limits in 2024 🚀

DianaGray10

FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf

FIDO Alliance

From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...

Product School

ODC, Data Fabric and Architecture User Group

CatarinaPereira64715

Discover the essentials of performance testing in the IT sector with our concise guide. Learn about various testing types such as load, stress, endurance, spike, scalability, and volume testing. Understand key performance metrics like response time, throughput, CPU and memory utilization, and error rate. Explore top tools like Apache JMeter, LoadRunner, Gatling, Neoload, and BlazeMeter. Gain insights into best practices for defining objectives, creating realistic scenarios, automating tests, and optimizing performance to ensure user satisfaction, reliability, scalability, and cost efficiency. Ideal for developers, QA engineers, and IT professionals. Visit Expeed Software for more information. https://expeed.com/

In-Depth Performance Testing Guide for IT Professionals

Expeed Software

Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”. All of this illustrated with link prediction over knowledge graphs, but the argument is general.

Neuro-symbolic is not enough, we need neuro-*semantic*

Frank van Harmelen

In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.

Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...

Ramesh Iyer

From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...

Product School

IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx

Abida Shariff

Knowledge engineering: from people to machines and back

Elena Simperl

💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™: See how to accelerate model training and optimize model performance with active learning Learn about the latest enhancements to out-of-the-box document processing – with little to no training required Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath. Speakers: 👨‍🏫 Andras Palfi, Senior Product Manager, UiPath 👩‍🏫 Lenka Dulovicova, Product Program Manager, UiPath

Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...

UiPathCommunity

Designing Great Products: The Power of Design and Leadership by Chief Designe...

Product School

The Future of Platform Engineering

Jemma Hussein Allen

Welcome to UiPath Test Automation using UiPath Test Suite series part 2. In this session, we will cover API test automation along with a web automation demo. Topics covered: Test Automation introduction API Example of API automation Web automation demonstration Speaker Pathrudu Chintakayala, Associate Technical Architect @Yash and UiPath MVP Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP

UiPath Test Automation using UiPath Test Suite series, part 2

DianaGray10

Recently uploaded (20)

НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»

"Impact of front-end architecture on development cost", Viktor Turskyi

Speed Wins: From Kafka to APIs in Minutes

Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality

IoT Analytics Company Presentation May 2024

UiPath Test Automation using UiPath Test Suite series, part 1

Exploring UiPath Orchestrator API: updates and limits in 2024 🚀

FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf

From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...

ODC, Data Fabric and Architecture User Group

In-Depth Performance Testing Guide for IT Professionals

Neuro-symbolic is not enough, we need neuro-*semantic*

Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...

From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...

IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx

Knowledge engineering: from people to machines and back

Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...

Designing Great Products: The Power of Design and Leadership by Chief Designe...

The Future of Platform Engineering

UiPath Test Automation using UiPath Test Suite series, part 2

The Tester’s Dashboard: Release Decision Support

1. The Tester’s Dashboard: Release Decision Support Robert V. Binder System Verification Associates, LLC rbinder@ieee.org Peter B. Lakey Cognitive Concepts, Inc. peterlakey@sbcglobal.net

2. Overview • Complementary metrics for release decision- support – Model-based testing • Operational profile • Model coverage metrics – Reliability Demonstration Chart – Relative Proximity • Case Study • Observations

3. Release Decision Support

5. Model-based Reliability Estimation • Test suites must be – Proportional to operational profile – Sequentially feasible – Input feasible • Approach – Markov model – Monte Carlo simulation – Post run analytics

6. Model Coverage Metrics • % States Usage Profile Reached S T • % State- Transitions Observe System Reached Failure Trigger Latent Defect Software System Process Space Observe fault System activated Failure Data Space

7. Reliability Demonstration Chart • Sequential Sampling • Risk- Adjusted • Musa equations http://sourceforge.net/projects/rdc/

8. Relative Proximity • Kullback-Lieber Distance – Information theoretic – Characterizes difference in variation of message population E (expected) and sample A (actual) as “relative entropy” KLD = ∑ 𝐴 𝑖 (𝑙𝑜𝑔2 (𝐴 𝑖 / E 𝑖 )) • Relative Proximity – KLD math doesn’t work unless failures modeled (sum of the actuals must be 1.0) – Assume the target failure rate is aggregate – Allocate failure rate in proportion to each operation

9. Profile Explicit Failure Modes • Assume maximum acceptable failure rate intensity of 1 in 10,000 Operation Mode Standard Explicit Failure Expected Number, Profile Profile 10000 Tests A Pass 0.7 0.6993 6993 B Pass 0.2 0.1998 1998 C Pass 0.1 0.0999 999

10. Profile Explicit Failure Modes Mode Expected Actual KL Distance Actual KL Distance A Pass 6993 7000 10.104 6990 -4.327 Fail 7 0 0.000 10 5.146 B Pass 1998 1990 -11.518 2000 2.887 Fail 2 10 23.219 0 0.000 C Pass 999 980 -27.149 994 -7.195 Fail 1 20 86.439 6 15.510 10000 10000 81.094 10000 12.020 • Relative Proximity indicates the difference between actual and observed failure rates • Many possible operation failure rates with better or worse fidelity • RDC based on aggregate FIO, not sensitive to operation variance

11. Case Studies • Stochastic Models • Assumed Failure Rates • Word Processing Application • Ground-Based Midcourse Missile Defense

12. GBMD Test Run, 0-100

13. GBMD Test Run, 1K, 5K

14. GMBD Test Run, 10K

15. GBMD Relative Proximity Trend 1600.00 1484.00 1400.00 1200.00 1000.00 800.00 600.00 418.20 400.00 200.00 67.90 12.00 6.30 0.00 10 100 1000 10000

16. Observations • Model coverage indicates minimal sufficiency – Wouldn’t release without all state-xtn pairs covered – Stochastic can take a long time to do this – Cover with N+ first • RDC assumes “flat” profile – With sequential constraints, may be optimistic – Strength is explicit risk-adjustment • Relative Proximity will indicate when operation- specific Failure Intensity is as expected (or not)

17. Q&A

The Tester’s Dashboard: Release Decision Support

Recommended

Recommended

More Related Content

Similar to The Tester’s Dashboard: Release Decision Support

Similar to The Tester’s Dashboard: Release Decision Support (20)

More from Bob Binder

More from Bob Binder (18)

Recently uploaded

Recently uploaded (20)

The Tester’s Dashboard: Release Decision Support