SlideShare a Scribd company logo
1 of 9
Download to read offline
#techtuesdays
Chaos Engineering to
Establish Software Reliability
#techtuesdays
Fallacies of Distributed Systems
The network is reliable
Latency is zero
Bandwidth is infinite
The network is secure
Topology doesn't change
There is one administrator
Transport cost is zero
The network is homogeneous
#techtuesdays
What is Chaos Engineering?
Chaos Engineering is the practice of
intentionally introducing controlled and
measurable failures into software systems
to build resilience and confidence in their
ability to withstand unexpected conditions.
#techtuesdays
Principles of Chaos Engineering
01 Build a hypothesis around a steady-state
03 Run experiments in production
05 Minimize blast radius
02 Simulate real-world events
04 Automate experiments and run them continuously
#techtuesdays
Types of Chaos Engineering
Experiment
Game Days Latency Injection
Infrastructure Failure
Volume Testing
Latency injection is the deliberate
introduction of a delay in system
response times to understand the
impact of degraded performance.
A game day is a simulation of a
disaster scenario, designed to test
the system's resiliency.
Volume testing is the process of
increasing the volume of traffic to
the system to assess its response
to high levels of traffic.
Simulating infrastructure failures,
such as server crashes or network
outages, can help to identify
weaknesses in the system's failover
processes.
#techtuesdays
Common Implementation Areas
for Chaos Engineering
Blind spots of
monitoring software
Hidden bugs
Performance
bottlenecks
#techtuesdays
Benefits of Using Chaos
Engineering for Software Testing
Improved System Resilience
Increase Customer Confidence
Improved Team Culture
Chaos Engineering can identify and resolve vulnerabilities in a system, making it
more resilient to unexpected conditions.
By improving system reliability and reducing downtime, Chaos Engineering can
increase customer confidence in your product or service.
Chaos Engineering requires engineers to work together and learn from their
mistakes, which can lead to an improved team culture and a stronger sense of
collaboration.
#techtuesdays
Common Tools of Chaos
Engineering
Chaos Kong
Chaos Monkey
Latency
Gremlin
Disables entire AWS availability zones.
Randomly disables production environment instances to cause a system failure.
Introduces latency to simulate network outages and degradation.
A chaos engineering program that works with AWS and Kubernetes
Thank you for
your time.
#techtuesdays www.Gleecus.com
hello@gleecus.com
Contact us:

More Related Content

Similar to Chaos Engineering to Establish Software Reliability

IOT Design: An Embedded System & its Applications
IOT Design: An Embedded System & its ApplicationsIOT Design: An Embedded System & its Applications
IOT Design: An Embedded System & its Applications
SruthiReddy112
 
IOT based embedded systems using arduino
IOT based embedded systems using arduinoIOT based embedded systems using arduino
IOT based embedded systems using arduino
Nagen87
 

Similar to Chaos Engineering to Establish Software Reliability (20)

Chaos engineering & Gameday on AWS
Chaos engineering & Gameday on AWSChaos engineering & Gameday on AWS
Chaos engineering & Gameday on AWS
 
Operations: Production Readiness Review – How to stop bad things from Happening
Operations: Production Readiness Review – How to stop bad things from HappeningOperations: Production Readiness Review – How to stop bad things from Happening
Operations: Production Readiness Review – How to stop bad things from Happening
 
Chaos Engineering: Injecting Failure for Building Resilience in Systems
Chaos Engineering: Injecting Failure for Building Resilience in SystemsChaos Engineering: Injecting Failure for Building Resilience in Systems
Chaos Engineering: Injecting Failure for Building Resilience in Systems
 
Resume SrEE
Resume SrEEResume SrEE
Resume SrEE
 
MTV15
MTV15MTV15
MTV15
 
Chaos Engineering with Gremlin Platform
Chaos Engineering with Gremlin PlatformChaos Engineering with Gremlin Platform
Chaos Engineering with Gremlin Platform
 
Operations: Production Readiness
Operations: Production ReadinessOperations: Production Readiness
Operations: Production Readiness
 
DOST 2016 Cloud Without Failures
DOST 2016 Cloud Without FailuresDOST 2016 Cloud Without Failures
DOST 2016 Cloud Without Failures
 
Unsustainable Regaining Control of Uncontrollable Apps
Unsustainable Regaining Control of Uncontrollable AppsUnsustainable Regaining Control of Uncontrollable Apps
Unsustainable Regaining Control of Uncontrollable Apps
 
Path to continuous delivery
Path to continuous deliveryPath to continuous delivery
Path to continuous delivery
 
From Monoliths to Microservices at Realestate.com.au
From Monoliths to Microservices at Realestate.com.auFrom Monoliths to Microservices at Realestate.com.au
From Monoliths to Microservices at Realestate.com.au
 
Automation test bed at offshore to optimize cost, effort and timing for a wor...
Automation test bed at offshore to optimize cost, effort and timing for a wor...Automation test bed at offshore to optimize cost, effort and timing for a wor...
Automation test bed at offshore to optimize cost, effort and timing for a wor...
 
Cloud Operations and Analytics: Improving Distributed Systems Reliability usi...
Cloud Operations and Analytics: Improving Distributed Systems Reliability usi...Cloud Operations and Analytics: Improving Distributed Systems Reliability usi...
Cloud Operations and Analytics: Improving Distributed Systems Reliability usi...
 
Scaling Enterprise DevOps with CloudBees
Scaling Enterprise DevOps with CloudBeesScaling Enterprise DevOps with CloudBees
Scaling Enterprise DevOps with CloudBees
 
Scaling Enterprise DevOps with CloudBees
Scaling Enterprise DevOps with CloudBeesScaling Enterprise DevOps with CloudBees
Scaling Enterprise DevOps with CloudBees
 
Scaling Enterprise DevOps with CloudBees
Scaling Enterprise DevOps with CloudBeesScaling Enterprise DevOps with CloudBees
Scaling Enterprise DevOps with CloudBees
 
CIO Technical Series - Solving Scan Gun Performance Issues
CIO Technical Series - Solving Scan Gun Performance IssuesCIO Technical Series - Solving Scan Gun Performance Issues
CIO Technical Series - Solving Scan Gun Performance Issues
 
IOT Design: An Embedded System & its Applications
IOT Design: An Embedded System & its ApplicationsIOT Design: An Embedded System & its Applications
IOT Design: An Embedded System & its Applications
 
IOT_Embedded_Systems_industrail applications.ppt
IOT_Embedded_Systems_industrail applications.pptIOT_Embedded_Systems_industrail applications.ppt
IOT_Embedded_Systems_industrail applications.ppt
 
IOT based embedded systems using arduino
IOT based embedded systems using arduinoIOT based embedded systems using arduino
IOT based embedded systems using arduino
 

More from GleecusTechlabs1

More from GleecusTechlabs1 (19)

Event Driven Microservices Architecture_ Supporting Growing Performance Need
Event Driven Microservices Architecture_ Supporting Growing Performance NeedEvent Driven Microservices Architecture_ Supporting Growing Performance Need
Event Driven Microservices Architecture_ Supporting Growing Performance Need
 
Enterprise Integration Patterns - Overview
Enterprise Integration Patterns - OverviewEnterprise Integration Patterns - Overview
Enterprise Integration Patterns - Overview
 
Domain Driven Design for Microservice Architecture
Domain Driven Design for Microservice ArchitectureDomain Driven Design for Microservice Architecture
Domain Driven Design for Microservice Architecture
 
Distributed Cloud Computing in a Nutshell
Distributed Cloud Computing in a NutshellDistributed Cloud Computing in a Nutshell
Distributed Cloud Computing in a Nutshell
 
An Overview on Data Enrichment and techniques
An Overview on Data Enrichment and techniquesAn Overview on Data Enrichment and techniques
An Overview on Data Enrichment and techniques
 
An Overview on Container Orchestration Services
An Overview on Container Orchestration ServicesAn Overview on Container Orchestration Services
An Overview on Container Orchestration Services
 
The Role of Cloud Computing in Healthcare
The Role of Cloud Computing in HealthcareThe Role of Cloud Computing in Healthcare
The Role of Cloud Computing in Healthcare
 
Cloud Based POS Streamlining Shopping Experience
Cloud Based POS Streamlining Shopping ExperienceCloud Based POS Streamlining Shopping Experience
Cloud Based POS Streamlining Shopping Experience
 
Bringing a Product Mindset into DevOps - Overview
Bringing a Product Mindset into DevOps - OverviewBringing a Product Mindset into DevOps - Overview
Bringing a Product Mindset into DevOps - Overview
 
What is Observability and how is it different from Monitoring?
What is Observability and how is it different from Monitoring?What is Observability and how is it different from Monitoring?
What is Observability and how is it different from Monitoring?
 
The Path to Application Modernization.pdf
The Path to Application Modernization.pdfThe Path to Application Modernization.pdf
The Path to Application Modernization.pdf
 
The Cloud Agnostic Architecture - Overview
The Cloud Agnostic Architecture - OverviewThe Cloud Agnostic Architecture - Overview
The Cloud Agnostic Architecture - Overview
 
7 Evolving Trends in Digital Transformation for Mid-Sized Enterprises.pdf
7  Evolving Trends in Digital Transformation for Mid-Sized Enterprises.pdf7  Evolving Trends in Digital Transformation for Mid-Sized Enterprises.pdf
7 Evolving Trends in Digital Transformation for Mid-Sized Enterprises.pdf
 
How to maximize your business ROI with custom software solution
How to maximize your business ROI with custom software solutionHow to maximize your business ROI with custom software solution
How to maximize your business ROI with custom software solution
 
Unlocking the Cloud's True Potential: Why Multitenancy Is The Key?
Unlocking the Cloud's True Potential: Why Multitenancy Is The Key?Unlocking the Cloud's True Potential: Why Multitenancy Is The Key?
Unlocking the Cloud's True Potential: Why Multitenancy Is The Key?
 
The BDD Process (Behavior-driven development)
The BDD Process (Behavior-driven development)The BDD Process (Behavior-driven development)
The BDD Process (Behavior-driven development)
 
SaaS Product Development Checklist.pdf
SaaS Product Development Checklist.pdfSaaS Product Development Checklist.pdf
SaaS Product Development Checklist.pdf
 
Cloud Computing in Healthcare
Cloud Computing in HealthcareCloud Computing in Healthcare
Cloud Computing in Healthcare
 
Qualities of an On-Demand App.pdf
Qualities of an On-Demand App.pdfQualities of an On-Demand App.pdf
Qualities of an On-Demand App.pdf
 

Recently uploaded

TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 

Recently uploaded (20)

Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 

Chaos Engineering to Establish Software Reliability

  • 2. #techtuesdays Fallacies of Distributed Systems The network is reliable Latency is zero Bandwidth is infinite The network is secure Topology doesn't change There is one administrator Transport cost is zero The network is homogeneous
  • 3. #techtuesdays What is Chaos Engineering? Chaos Engineering is the practice of intentionally introducing controlled and measurable failures into software systems to build resilience and confidence in their ability to withstand unexpected conditions.
  • 4. #techtuesdays Principles of Chaos Engineering 01 Build a hypothesis around a steady-state 03 Run experiments in production 05 Minimize blast radius 02 Simulate real-world events 04 Automate experiments and run them continuously
  • 5. #techtuesdays Types of Chaos Engineering Experiment Game Days Latency Injection Infrastructure Failure Volume Testing Latency injection is the deliberate introduction of a delay in system response times to understand the impact of degraded performance. A game day is a simulation of a disaster scenario, designed to test the system's resiliency. Volume testing is the process of increasing the volume of traffic to the system to assess its response to high levels of traffic. Simulating infrastructure failures, such as server crashes or network outages, can help to identify weaknesses in the system's failover processes.
  • 6. #techtuesdays Common Implementation Areas for Chaos Engineering Blind spots of monitoring software Hidden bugs Performance bottlenecks
  • 7. #techtuesdays Benefits of Using Chaos Engineering for Software Testing Improved System Resilience Increase Customer Confidence Improved Team Culture Chaos Engineering can identify and resolve vulnerabilities in a system, making it more resilient to unexpected conditions. By improving system reliability and reducing downtime, Chaos Engineering can increase customer confidence in your product or service. Chaos Engineering requires engineers to work together and learn from their mistakes, which can lead to an improved team culture and a stronger sense of collaboration.
  • 8. #techtuesdays Common Tools of Chaos Engineering Chaos Kong Chaos Monkey Latency Gremlin Disables entire AWS availability zones. Randomly disables production environment instances to cause a system failure. Introduces latency to simulate network outages and degradation. A chaos engineering program that works with AWS and Kubernetes
  • 9. Thank you for your time. #techtuesdays www.Gleecus.com hello@gleecus.com Contact us: