SlideShare a Scribd company logo
Ensuring Business
Continuity in AWS
Krzysztof Kąkol
Chief of Data Engineering and
Solutions Architect in Xebia Poland
AWS Community Builder & AWS Ambassador
https://www.linkedin.com/in/krzysztofkakol/
Other stuff:
Classical and jazz pianist
PhD in AI-driven sound processing
Business continuity can be defined as the processes, procedures, decisions and activities to ensure that an
organization can continue to function through an operational interruption.
https://www.continuitycentral.com/index.php/businesscontinuity
Business Continuity
Redundancy Resillience Rapid recovery
A business impact analysis (BIA) anticipates the repercussions of a business disruption and compiles
essential information required for formulating recovery strategies. The identification of potential loss
scenarios should occur as part of a comprehensive risk assessment.
Business impact analysis
Identify critical business
functions
Define potential
disruptions
Analyse business impact
Define recovery
strategies
Test and validate
Documentation
Business Continuity
BUSINESS CONTINUITY
AVAILABILITY DISASTER RECOVERY
What to do to avoid failures? What to do if failure occurs?
Availability
Availability in the cloud refers to the accessibility and reliability
of services and resources hosted on a cloud platform. It
measures the extent to which applications and data are reliably
accessible and operational for users, without significant
downtime or disruptions.
High availability is a key goal in cloud computing to ensure
continuous and reliable access to services.
Availability
1. Mean Time Between Failures (MTBF)
2. Mean Time to Recover (MTTR)
Availability - metrics
Difficult to
measure
• Workload in multiple AZs
• Self-healing infrastructure – CI/CD pipelines supporting service recovery, multi-AZ setups, auto-scaling
• All components highly-available: ALB, multi-AZ RDS, EKS nodes in ASG, replicated NATs
• Using managed services
Availability – methods in AWS
Availability – multi-AZ
High Availability in AWS
Disaster Recovery
Disaster recovery is a comprehensive strategy outlining procedures and protocols to recover and restore critical
business operations in the event of a disruptive incident or disaster.
Some important reasons:
• Minimize downtime: Rapid recovery protocols help minimize downtime, ensuring that essential business functions
can resume swiftly.
• Data protection: Safeguard critical data and information through backup and recovery measures, preventing loss
and ensuring data integrity.
• Business resilience: Enhances the organization's ability to withstand and bounce back from unforeseen disruptions,
maintaining operational resilience.
• Compliance requirements: Meets regulatory and compliance standards by implementing measures to protect
sensitive data and maintain business continuity.
• Customer confidence: Demonstrates a commitment to reliability, instilling confidence in customers, stakeholders,
and partners.
Disaster Recovery
1. Recovery Point Objective (RPO)
2. Recovery Time Objective (RTO)
Disaster Recovery - metrics
Disaster recovery strategies – RTO
Disaster recovery strategies – RPO
Pilot light example
Warm standby example
Developing a comprehensive disaster recovery plan:
• Identifying critical assets and processes
• Establishing RPO and RTO objectives
• Assigning responsibilities
Testing and validating disaster recovery plans:
• Regular simulation exercises
• Automation of testing processes
Planning and Testing Disaster Recovery
Backups & Resilience
Backup for database – automatic snapshots, PITR,
manual snapshots
Object replication in S3 (multi-region)
Backup retention planning
Using DDoS protection (CDN)
Resources firewall setup – using semantic Security Groups
Application firewalls – WAF
Workload in private subnets
Using AWS services like Security Hub, Inspector, Detective etc.
Backup strategy Resilience to attacks
Resilience
• Autoscaling solutions (auto-scaling groups, ECS, EKS)
• Redundancy solutions (RDS Multi-AZ, database read replicas, DynamoDB global tables etc.)
• AWS Backup:
• Centralized backup solution
• Automated backup policies
• AWS CloudFormation/Terraform:
• Infrastructure as Code (IaC) for consistent deployments
• AWS CloudWatch and AWS CloudTrail:
• Monitoring and logging for proactive management
Tools for Business Continuity in AWS
• Learning best practices from Well-Architected Review (especially Reliability Pillar)
• Doing business impact analysis and risk analysis
• Implementing scalable and redundant infrastructure (in multiple AZs)
• Planning and testing Disaster Recovery
Best practices summary
Questions?
https://www.linkedin.com/in/krzysztofkakol/

More Related Content

Similar to Ensuring Business Continuity in the cloud

Smaller Presentation on Enterprise Resiliency and Corporate Certification
Smaller Presentation on Enterprise Resiliency and Corporate CertificationSmaller Presentation on Enterprise Resiliency and Corporate Certification
Smaller Presentation on Enterprise Resiliency and Corporate Certification
Thomas Bronack
 
Disaster Biz Resumpt
Disaster Biz ResumptDisaster Biz Resumpt
Disaster Biz Resumpt
JimGroark
 
Rob kloots auditingforscyandbcm
Rob kloots auditingforscyandbcmRob kloots auditingforscyandbcm
Rob kloots auditingforscyandbcm
Robert Kloots
 
Recovery and Compliance Services provided by Tom Bronack
Recovery and Compliance Services provided by Tom BronackRecovery and Compliance Services provided by Tom Bronack
Recovery and Compliance Services provided by Tom Bronack
Thomas Bronack
 
The Science of Predictive Maintenance: IBM's Predictive Analytics Solution
The Science of Predictive Maintenance: IBM's Predictive Analytics SolutionThe Science of Predictive Maintenance: IBM's Predictive Analytics Solution
The Science of Predictive Maintenance: IBM's Predictive Analytics Solution
Senturus
 
Cyber Security and Business Continuity an Integrated Discipline
Cyber Security and Business Continuity an Integrated DisciplineCyber Security and Business Continuity an Integrated Discipline
Cyber Security and Business Continuity an Integrated Discipline
Graeme Parker
 
Optimizing the IT and Business Environment
Optimizing the IT and Business EnvironmentOptimizing the IT and Business Environment
Optimizing the IT and Business Environment
Thomas Bronack
 
Dcag service optimization offering01
Dcag service optimization offering01Dcag service optimization offering01
Dcag service optimization offering01
Thomas Bronack
 
CMGT 400 Effective Communication/tutorialrank.com
 CMGT 400 Effective Communication/tutorialrank.com CMGT 400 Effective Communication/tutorialrank.com
CMGT 400 Effective Communication/tutorialrank.com
jonhson268
 
Business continuity planning guide
Business continuity planning guideBusiness continuity planning guide
Business continuity planning guide
AstalapulosListestos
 
Business continuity planning guide
Business continuity planning guideBusiness continuity planning guide
Business continuity planning guide
CenapSerdarolu
 
The Cost of Downtime
The Cost of DowntimeThe Cost of Downtime
The Cost of Downtime
Miriam O'Brien
 
The Cost of Downtime
The Cost of DowntimeThe Cost of Downtime
The Cost of Downtime
Miriam O'Brien
 
Security & Risk Management
Security & Risk ManagementSecurity & Risk Management
Security & Risk Management
Ahmed Sayed-
 
Business Continuity Awareness Week 2009
Business Continuity Awareness Week 2009Business Continuity Awareness Week 2009
Business Continuity Awareness Week 2009
Brigitte Theuma
 
Qatar Proposal
Qatar ProposalQatar Proposal
Qatar Proposal
Absar Husain
 
Availability and Business Resiliency Strategies
Availability and Business Resiliency StrategiesAvailability and Business Resiliency Strategies
Availability and Business Resiliency Strategies
n|u - The Open Security Community
 
IT infra, emerging technology and governance
IT infra, emerging technology and governance IT infra, emerging technology and governance
IT infra, emerging technology and governance
Azra'ee Mamat
 
Bronack Skills - Risk Management and SRE v1.0 12-10-2023.pdf
Bronack Skills - Risk Management and SRE v1.0 12-10-2023.pdfBronack Skills - Risk Management and SRE v1.0 12-10-2023.pdf
Bronack Skills - Risk Management and SRE v1.0 12-10-2023.pdf
ThomasBronack
 
Does Anyone Remember Enterprise Security Architecture?
Does Anyone Remember Enterprise Security Architecture?Does Anyone Remember Enterprise Security Architecture?
Does Anyone Remember Enterprise Security Architecture?
rbrockway
 

Similar to Ensuring Business Continuity in the cloud (20)

Smaller Presentation on Enterprise Resiliency and Corporate Certification
Smaller Presentation on Enterprise Resiliency and Corporate CertificationSmaller Presentation on Enterprise Resiliency and Corporate Certification
Smaller Presentation on Enterprise Resiliency and Corporate Certification
 
Disaster Biz Resumpt
Disaster Biz ResumptDisaster Biz Resumpt
Disaster Biz Resumpt
 
Rob kloots auditingforscyandbcm
Rob kloots auditingforscyandbcmRob kloots auditingforscyandbcm
Rob kloots auditingforscyandbcm
 
Recovery and Compliance Services provided by Tom Bronack
Recovery and Compliance Services provided by Tom BronackRecovery and Compliance Services provided by Tom Bronack
Recovery and Compliance Services provided by Tom Bronack
 
The Science of Predictive Maintenance: IBM's Predictive Analytics Solution
The Science of Predictive Maintenance: IBM's Predictive Analytics SolutionThe Science of Predictive Maintenance: IBM's Predictive Analytics Solution
The Science of Predictive Maintenance: IBM's Predictive Analytics Solution
 
Cyber Security and Business Continuity an Integrated Discipline
Cyber Security and Business Continuity an Integrated DisciplineCyber Security and Business Continuity an Integrated Discipline
Cyber Security and Business Continuity an Integrated Discipline
 
Optimizing the IT and Business Environment
Optimizing the IT and Business EnvironmentOptimizing the IT and Business Environment
Optimizing the IT and Business Environment
 
Dcag service optimization offering01
Dcag service optimization offering01Dcag service optimization offering01
Dcag service optimization offering01
 
CMGT 400 Effective Communication/tutorialrank.com
 CMGT 400 Effective Communication/tutorialrank.com CMGT 400 Effective Communication/tutorialrank.com
CMGT 400 Effective Communication/tutorialrank.com
 
Business continuity planning guide
Business continuity planning guideBusiness continuity planning guide
Business continuity planning guide
 
Business continuity planning guide
Business continuity planning guideBusiness continuity planning guide
Business continuity planning guide
 
The Cost of Downtime
The Cost of DowntimeThe Cost of Downtime
The Cost of Downtime
 
The Cost of Downtime
The Cost of DowntimeThe Cost of Downtime
The Cost of Downtime
 
Security & Risk Management
Security & Risk ManagementSecurity & Risk Management
Security & Risk Management
 
Business Continuity Awareness Week 2009
Business Continuity Awareness Week 2009Business Continuity Awareness Week 2009
Business Continuity Awareness Week 2009
 
Qatar Proposal
Qatar ProposalQatar Proposal
Qatar Proposal
 
Availability and Business Resiliency Strategies
Availability and Business Resiliency StrategiesAvailability and Business Resiliency Strategies
Availability and Business Resiliency Strategies
 
IT infra, emerging technology and governance
IT infra, emerging technology and governance IT infra, emerging technology and governance
IT infra, emerging technology and governance
 
Bronack Skills - Risk Management and SRE v1.0 12-10-2023.pdf
Bronack Skills - Risk Management and SRE v1.0 12-10-2023.pdfBronack Skills - Risk Management and SRE v1.0 12-10-2023.pdf
Bronack Skills - Risk Management and SRE v1.0 12-10-2023.pdf
 
Does Anyone Remember Enterprise Security Architecture?
Does Anyone Remember Enterprise Security Architecture?Does Anyone Remember Enterprise Security Architecture?
Does Anyone Remember Enterprise Security Architecture?
 

Recently uploaded

Beginner's Guide to Observability@Devoxx PL 2024
Beginner's  Guide to Observability@Devoxx PL 2024Beginner's  Guide to Observability@Devoxx PL 2024
Beginner's Guide to Observability@Devoxx PL 2024
michniczscribd
 
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdfThe Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
kalichargn70th171
 
Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)
wonyong hwang
 
How GenAI Can Improve Supplier Performance Management.pdf
How GenAI Can Improve Supplier Performance Management.pdfHow GenAI Can Improve Supplier Performance Management.pdf
How GenAI Can Improve Supplier Performance Management.pdf
Zycus
 
14 th Edition of International conference on computer vision
14 th Edition of International conference on computer vision14 th Edition of International conference on computer vision
14 th Edition of International conference on computer vision
ShulagnaSarkar2
 
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdfSoftware Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
kalichargn70th171
 
Strengthening Web Development with CommandBox 6: Seamless Transition and Scal...
Strengthening Web Development with CommandBox 6: Seamless Transition and Scal...Strengthening Web Development with CommandBox 6: Seamless Transition and Scal...
Strengthening Web Development with CommandBox 6: Seamless Transition and Scal...
Ortus Solutions, Corp
 
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
kgyxske
 
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdfBaha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid
 
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
The Third Creative Media
 
🏎️Tech Transformation: DevOps Insights from the Experts 👩‍💻
🏎️Tech Transformation: DevOps Insights from the Experts 👩‍💻🏎️Tech Transformation: DevOps Insights from the Experts 👩‍💻
🏎️Tech Transformation: DevOps Insights from the Experts 👩‍💻
campbellclarkson
 
TheFutureIsDynamic-BoxLang-CFCamp2024.pdf
TheFutureIsDynamic-BoxLang-CFCamp2024.pdfTheFutureIsDynamic-BoxLang-CFCamp2024.pdf
TheFutureIsDynamic-BoxLang-CFCamp2024.pdf
Ortus Solutions, Corp
 
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptxOperational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
sandeepmenon62
 
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
gapen1
 
Orca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container OrchestrationOrca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container Orchestration
Pedro J. Molina
 
Ensuring Efficiency and Speed with Practical Solutions for Clinical Operations
Ensuring Efficiency and Speed with Practical Solutions for Clinical OperationsEnsuring Efficiency and Speed with Practical Solutions for Clinical Operations
Ensuring Efficiency and Speed with Practical Solutions for Clinical Operations
OnePlan Solutions
 
Boost Your Savings with These Money Management Apps
Boost Your Savings with These Money Management AppsBoost Your Savings with These Money Management Apps
Boost Your Savings with These Money Management Apps
Jhone kinadey
 
Going AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applicationsGoing AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applications
Alina Yurenko
 
Photoshop Tutorial for Beginners (2024 Edition)
Photoshop Tutorial for Beginners (2024 Edition)Photoshop Tutorial for Beginners (2024 Edition)
Photoshop Tutorial for Beginners (2024 Edition)
alowpalsadig
 
美洲杯赔率投注网【​网址​🎉3977·EE​🎉】
美洲杯赔率投注网【​网址​🎉3977·EE​🎉】美洲杯赔率投注网【​网址​🎉3977·EE​🎉】
美洲杯赔率投注网【​网址​🎉3977·EE​🎉】
widenerjobeyrl638
 

Recently uploaded (20)

Beginner's Guide to Observability@Devoxx PL 2024
Beginner's  Guide to Observability@Devoxx PL 2024Beginner's  Guide to Observability@Devoxx PL 2024
Beginner's Guide to Observability@Devoxx PL 2024
 
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdfThe Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
 
Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)
 
How GenAI Can Improve Supplier Performance Management.pdf
How GenAI Can Improve Supplier Performance Management.pdfHow GenAI Can Improve Supplier Performance Management.pdf
How GenAI Can Improve Supplier Performance Management.pdf
 
14 th Edition of International conference on computer vision
14 th Edition of International conference on computer vision14 th Edition of International conference on computer vision
14 th Edition of International conference on computer vision
 
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdfSoftware Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
 
Strengthening Web Development with CommandBox 6: Seamless Transition and Scal...
Strengthening Web Development with CommandBox 6: Seamless Transition and Scal...Strengthening Web Development with CommandBox 6: Seamless Transition and Scal...
Strengthening Web Development with CommandBox 6: Seamless Transition and Scal...
 
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
 
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdfBaha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
 
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
 
🏎️Tech Transformation: DevOps Insights from the Experts 👩‍💻
🏎️Tech Transformation: DevOps Insights from the Experts 👩‍💻🏎️Tech Transformation: DevOps Insights from the Experts 👩‍💻
🏎️Tech Transformation: DevOps Insights from the Experts 👩‍💻
 
TheFutureIsDynamic-BoxLang-CFCamp2024.pdf
TheFutureIsDynamic-BoxLang-CFCamp2024.pdfTheFutureIsDynamic-BoxLang-CFCamp2024.pdf
TheFutureIsDynamic-BoxLang-CFCamp2024.pdf
 
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptxOperational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
 
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
 
Orca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container OrchestrationOrca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container Orchestration
 
Ensuring Efficiency and Speed with Practical Solutions for Clinical Operations
Ensuring Efficiency and Speed with Practical Solutions for Clinical OperationsEnsuring Efficiency and Speed with Practical Solutions for Clinical Operations
Ensuring Efficiency and Speed with Practical Solutions for Clinical Operations
 
Boost Your Savings with These Money Management Apps
Boost Your Savings with These Money Management AppsBoost Your Savings with These Money Management Apps
Boost Your Savings with These Money Management Apps
 
Going AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applicationsGoing AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applications
 
Photoshop Tutorial for Beginners (2024 Edition)
Photoshop Tutorial for Beginners (2024 Edition)Photoshop Tutorial for Beginners (2024 Edition)
Photoshop Tutorial for Beginners (2024 Edition)
 
美洲杯赔率投注网【​网址​🎉3977·EE​🎉】
美洲杯赔率投注网【​网址​🎉3977·EE​🎉】美洲杯赔率投注网【​网址​🎉3977·EE​🎉】
美洲杯赔率投注网【​网址​🎉3977·EE​🎉】
 

Ensuring Business Continuity in the cloud

  • 2. Krzysztof Kąkol Chief of Data Engineering and Solutions Architect in Xebia Poland AWS Community Builder & AWS Ambassador https://www.linkedin.com/in/krzysztofkakol/ Other stuff: Classical and jazz pianist PhD in AI-driven sound processing
  • 3. Business continuity can be defined as the processes, procedures, decisions and activities to ensure that an organization can continue to function through an operational interruption. https://www.continuitycentral.com/index.php/businesscontinuity Business Continuity Redundancy Resillience Rapid recovery
  • 4. A business impact analysis (BIA) anticipates the repercussions of a business disruption and compiles essential information required for formulating recovery strategies. The identification of potential loss scenarios should occur as part of a comprehensive risk assessment. Business impact analysis Identify critical business functions Define potential disruptions Analyse business impact Define recovery strategies Test and validate Documentation
  • 5. Business Continuity BUSINESS CONTINUITY AVAILABILITY DISASTER RECOVERY What to do to avoid failures? What to do if failure occurs?
  • 7. Availability in the cloud refers to the accessibility and reliability of services and resources hosted on a cloud platform. It measures the extent to which applications and data are reliably accessible and operational for users, without significant downtime or disruptions. High availability is a key goal in cloud computing to ensure continuous and reliable access to services. Availability
  • 8. 1. Mean Time Between Failures (MTBF) 2. Mean Time to Recover (MTTR) Availability - metrics Difficult to measure
  • 9. • Workload in multiple AZs • Self-healing infrastructure – CI/CD pipelines supporting service recovery, multi-AZ setups, auto-scaling • All components highly-available: ALB, multi-AZ RDS, EKS nodes in ASG, replicated NATs • Using managed services Availability – methods in AWS
  • 13. Disaster recovery is a comprehensive strategy outlining procedures and protocols to recover and restore critical business operations in the event of a disruptive incident or disaster. Some important reasons: • Minimize downtime: Rapid recovery protocols help minimize downtime, ensuring that essential business functions can resume swiftly. • Data protection: Safeguard critical data and information through backup and recovery measures, preventing loss and ensuring data integrity. • Business resilience: Enhances the organization's ability to withstand and bounce back from unforeseen disruptions, maintaining operational resilience. • Compliance requirements: Meets regulatory and compliance standards by implementing measures to protect sensitive data and maintain business continuity. • Customer confidence: Demonstrates a commitment to reliability, instilling confidence in customers, stakeholders, and partners. Disaster Recovery
  • 14. 1. Recovery Point Objective (RPO) 2. Recovery Time Objective (RTO) Disaster Recovery - metrics
  • 19. Developing a comprehensive disaster recovery plan: • Identifying critical assets and processes • Establishing RPO and RTO objectives • Assigning responsibilities Testing and validating disaster recovery plans: • Regular simulation exercises • Automation of testing processes Planning and Testing Disaster Recovery
  • 21. Backup for database – automatic snapshots, PITR, manual snapshots Object replication in S3 (multi-region) Backup retention planning Using DDoS protection (CDN) Resources firewall setup – using semantic Security Groups Application firewalls – WAF Workload in private subnets Using AWS services like Security Hub, Inspector, Detective etc. Backup strategy Resilience to attacks Resilience
  • 22. • Autoscaling solutions (auto-scaling groups, ECS, EKS) • Redundancy solutions (RDS Multi-AZ, database read replicas, DynamoDB global tables etc.) • AWS Backup: • Centralized backup solution • Automated backup policies • AWS CloudFormation/Terraform: • Infrastructure as Code (IaC) for consistent deployments • AWS CloudWatch and AWS CloudTrail: • Monitoring and logging for proactive management Tools for Business Continuity in AWS
  • 23. • Learning best practices from Well-Architected Review (especially Reliability Pillar) • Doing business impact analysis and risk analysis • Implementing scalable and redundant infrastructure (in multiple AZs) • Planning and testing Disaster Recovery Best practices summary