SlideShare a Scribd company logo
1 of 19
DevOps for 24 X 7 SaaS and disaster
recovery
Shikha Srivastav
Distinguished Engineer, IBM
SaaS, Master Inventor
Amitabh Prasad
Software Architect, IBM
Agenda
• What is SaaS and the challenges
• Business Continuity and disaster recovery is critical
• Disaster Recovery Models
• What we learned
• Patterns based on our learning
Shikha
What is SaaS
IaaS
PaaS
SaaS
Virtual Machines,Network
Storage
Business Applications
Container Orchestration , CICD automation
Auto scaling ,Application development tools
• Stable - providing consistent
acceptable performance
• Available - providing 24X7
availability
• Reliable - for customers to
leverage SaaS in their mission-
critical business needs
• Resilient - allowing updates
(security, features, bugs), and
upgrades with zero downtime
• Secure and Compliant
What it takes
Collaboration and
Communication
Continuous Integration Continuous Delivery Observability
Change and incident
management
Security and Compliance Business Continuity Test, Quality and Scale
Plan Build Test Secure Release Operate
Ideas
Ideas
Feedbac
k
Feedbac
k
Shikha
What it takes
Collaboration and
Communication
Continuous Integration Continuous Delivery Observability
Change and incident
management
Security and Compliance Business Continuity Test, Quality and Scale
Plan Build Test Secure Release Operate
Ideas
Ideas
Feedbac
k
Feedbac
k
Shikha
Disaster Recovery
• Service needs to meet SLA
• Service Recovery most critical in
Cloud
• Recovery Point Objective (RPO): the
amount of data that will be lost
• Recovery Time Objective (RTO):
the amount of downtime a
business can tolerate.
Disaster Recovery
Key Considerations for Disaster Recovery
WHY backup
- Business continuity
WHAT to back up
- Do you need to back up all data ?
WHERE to back up
- Security and privacy for customer data consideration
HOW to back up
- Automation for backup
RECOVERY plans in case of disaster
- Algorithm for recovery with automated steps
Disaster recovery models
Typical SaaS characteristics
• Multi-tenant architecture
• Multi Zone deployment
• Rely on cloud services for data
backup and recovery
What we learned
Highly available architecture with robust backup & restore for business continue
• HA architecture ensures 99.999% per year availability
• Backup & restore strategy ensure we have < 12 hrs of RPO and <12 hrs
of RTO (for regional failures)
• Within same region it's < 10 min of RPO and < 6 hrs RTO
What we learned
What to protect
• Runtime Config
• Volume
• Cloud resources (databases)
• Backup agents
• Valero Operator to backup up configs
• Volume backup agent
• tag volumes
• Backup of volumes & Cloud services
What we learned
How to protect
Patterns we established based on our learning
• Backup Runtime Kubernetes Objects
• Configure Cloud Data store backups
• Enable Volume replication
• Handle non-cloud data store
• gitOps to redeploy static and runtime objects
Patterns
Backup Runtime Kubernetes Objects
Use native data stores as much as
possible, if in cloud make sure to use
cloud native datastores
RDS, ICD etc
Tag Cloud Data Resources instances
and use them to to identify resources
to backup using Cloud Provider
Backup or any provider specific
backup techniques
This backup technique identifies all
the resources to backup using tags,
calls snapshot and other required
api's to do backup
Patterns
Backup Cloud resources
• Runs as pods
• Watches pvc and tags related
volumes
• Backup of volumes managed by
Cloud Provider’s api’s
Patterns
Backup Volumes (Replication)
• Used for application deployment
• Manage drifts between git repo and
registered clusters
• Restore data and artifacts
• Argo CD is used to deploy rest of
the application during recovery flow
Patterns
Gitops
THANK YOU
References
Kubernetes volume backup for disaster recovery
Backup and Restore of Kubernetes Stateful Application Data with CSI Volume Snapshots
Speed and resiliency: two sides of the same coin
AWS backup strategies
Valero.io
Disaster Recovery with GitOps
Useful if leveraging Kubernetes as Container orchestrator:
Kubernetes & 12-factor apps
7 missing factors from 12 factor application

More Related Content

Similar to DevOpsEnterpriseSummit_SaaSAnd DisasterRecovery.pptx

AWS User Group Sydney - Meetup #60
AWS User Group Sydney - Meetup #60AWS User Group Sydney - Meetup #60
AWS User Group Sydney - Meetup #60PolarSeven Pty Ltd
 
AWS Innovate: Running SAP Solutions on AWS Cloud- Shailesh Albuquerque
AWS Innovate: Running SAP Solutions on AWS Cloud- Shailesh AlbuquerqueAWS Innovate: Running SAP Solutions on AWS Cloud- Shailesh Albuquerque
AWS Innovate: Running SAP Solutions on AWS Cloud- Shailesh AlbuquerqueAmazon Web Services Korea
 
MySQL in oracle_environments(Part 2): MySQL Enterprise Monitor & Oracle Enter...
MySQL in oracle_environments(Part 2): MySQL Enterprise Monitor & Oracle Enter...MySQL in oracle_environments(Part 2): MySQL Enterprise Monitor & Oracle Enter...
MySQL in oracle_environments(Part 2): MySQL Enterprise Monitor & Oracle Enter...OracleMySQL
 
DevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft AzureDevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft Azuregjuljo
 
Customer Sharing: HTC - What is in AWS Cloud for me?
Customer Sharing: HTC - What is in AWS Cloud for me?Customer Sharing: HTC - What is in AWS Cloud for me?
Customer Sharing: HTC - What is in AWS Cloud for me?Amazon Web Services
 
The Fastest Way to Redis on Pivotal Cloud Foundry
The Fastest Way to Redis on Pivotal Cloud FoundryThe Fastest Way to Redis on Pivotal Cloud Foundry
The Fastest Way to Redis on Pivotal Cloud FoundryVMware Tanzu
 
AWS Webcast - Introduction to Amazon Kinesis
AWS Webcast - Introduction to Amazon KinesisAWS Webcast - Introduction to Amazon Kinesis
AWS Webcast - Introduction to Amazon KinesisAmazon Web Services
 
AWS FSI Symposium 2017 NYC - Moving at the Speed of Serverless ft Broadridge
AWS FSI Symposium 2017 NYC - Moving at the Speed of Serverless ft BroadridgeAWS FSI Symposium 2017 NYC - Moving at the Speed of Serverless ft Broadridge
AWS FSI Symposium 2017 NYC - Moving at the Speed of Serverless ft BroadridgeAmazon Web Services
 
Enterprise Serverless Adoption. An Experience Report
Enterprise Serverless Adoption. An Experience ReportEnterprise Serverless Adoption. An Experience Report
Enterprise Serverless Adoption. An Experience ReportSheenBrisals
 
Build your Cloud Solution for Success - Tel Aviv Summit 2018
Build your Cloud Solution for Success - Tel Aviv Summit 2018Build your Cloud Solution for Success - Tel Aviv Summit 2018
Build your Cloud Solution for Success - Tel Aviv Summit 2018Amazon Web Services
 
Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...
Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...
Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...InfluxData
 
Running SAP All-in-One ERP production system deployment on the AWS cloud
Running SAP All-in-One ERP production system deployment on the AWS cloudRunning SAP All-in-One ERP production system deployment on the AWS cloud
Running SAP All-in-One ERP production system deployment on the AWS cloudAmazon Web Services
 
Spinnaker summit: Design Considerations for Enterprise-wide roll out of Spin...
Spinnaker summit:  Design Considerations for Enterprise-wide roll out of Spin...Spinnaker summit:  Design Considerations for Enterprise-wide roll out of Spin...
Spinnaker summit: Design Considerations for Enterprise-wide roll out of Spin...Balaji Sivasubramanian
 
SecureKloud_Corporate Deck.pdf
SecureKloud_Corporate Deck.pdfSecureKloud_Corporate Deck.pdf
SecureKloud_Corporate Deck.pdfSrinivasMahankali3
 
(BIZ301) Getting Started: Running SAP on AWS | AWS re:Invent 2014
(BIZ301) Getting Started: Running SAP on AWS | AWS re:Invent 2014(BIZ301) Getting Started: Running SAP on AWS | AWS re:Invent 2014
(BIZ301) Getting Started: Running SAP on AWS | AWS re:Invent 2014Amazon Web Services
 
AWS re:Invent 2016: Continuous Compliance in the AWS Cloud for Regulated Life...
AWS re:Invent 2016: Continuous Compliance in the AWS Cloud for Regulated Life...AWS re:Invent 2016: Continuous Compliance in the AWS Cloud for Regulated Life...
AWS re:Invent 2016: Continuous Compliance in the AWS Cloud for Regulated Life...Amazon Web Services
 
MySQL Enterprise Edition Overview
MySQL Enterprise Edition OverviewMySQL Enterprise Edition Overview
MySQL Enterprise Edition OverviewMario Beck
 
AWS re:Invent 2016: Workshop: Migrating Microsoft Applications to AWS (ENT216)
AWS re:Invent 2016: Workshop: Migrating Microsoft Applications to AWS (ENT216)AWS re:Invent 2016: Workshop: Migrating Microsoft Applications to AWS (ENT216)
AWS re:Invent 2016: Workshop: Migrating Microsoft Applications to AWS (ENT216)Amazon Web Services
 
Too Many Tools? How AWS Systems Manager Bridges Operational Models - AWS Summ...
Too Many Tools? How AWS Systems Manager Bridges Operational Models - AWS Summ...Too Many Tools? How AWS Systems Manager Bridges Operational Models - AWS Summ...
Too Many Tools? How AWS Systems Manager Bridges Operational Models - AWS Summ...Amazon Web Services
 

Similar to DevOpsEnterpriseSummit_SaaSAnd DisasterRecovery.pptx (20)

AWS User Group Sydney - Meetup #60
AWS User Group Sydney - Meetup #60AWS User Group Sydney - Meetup #60
AWS User Group Sydney - Meetup #60
 
AWS Innovate: Running SAP Solutions on AWS Cloud- Shailesh Albuquerque
AWS Innovate: Running SAP Solutions on AWS Cloud- Shailesh AlbuquerqueAWS Innovate: Running SAP Solutions on AWS Cloud- Shailesh Albuquerque
AWS Innovate: Running SAP Solutions on AWS Cloud- Shailesh Albuquerque
 
MySQL in oracle_environments(Part 2): MySQL Enterprise Monitor & Oracle Enter...
MySQL in oracle_environments(Part 2): MySQL Enterprise Monitor & Oracle Enter...MySQL in oracle_environments(Part 2): MySQL Enterprise Monitor & Oracle Enter...
MySQL in oracle_environments(Part 2): MySQL Enterprise Monitor & Oracle Enter...
 
DevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft AzureDevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft Azure
 
Customer Sharing: HTC - What is in AWS Cloud for me?
Customer Sharing: HTC - What is in AWS Cloud for me?Customer Sharing: HTC - What is in AWS Cloud for me?
Customer Sharing: HTC - What is in AWS Cloud for me?
 
The Fastest Way to Redis on Pivotal Cloud Foundry
The Fastest Way to Redis on Pivotal Cloud FoundryThe Fastest Way to Redis on Pivotal Cloud Foundry
The Fastest Way to Redis on Pivotal Cloud Foundry
 
AWS Webcast - Introduction to Amazon Kinesis
AWS Webcast - Introduction to Amazon KinesisAWS Webcast - Introduction to Amazon Kinesis
AWS Webcast - Introduction to Amazon Kinesis
 
AWS FSI Symposium 2017 NYC - Moving at the Speed of Serverless ft Broadridge
AWS FSI Symposium 2017 NYC - Moving at the Speed of Serverless ft BroadridgeAWS FSI Symposium 2017 NYC - Moving at the Speed of Serverless ft Broadridge
AWS FSI Symposium 2017 NYC - Moving at the Speed of Serverless ft Broadridge
 
Enterprise Serverless Adoption. An Experience Report
Enterprise Serverless Adoption. An Experience ReportEnterprise Serverless Adoption. An Experience Report
Enterprise Serverless Adoption. An Experience Report
 
Build your Cloud Solution for Success - Tel Aviv Summit 2018
Build your Cloud Solution for Success - Tel Aviv Summit 2018Build your Cloud Solution for Success - Tel Aviv Summit 2018
Build your Cloud Solution for Success - Tel Aviv Summit 2018
 
Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...
Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...
Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...
 
Running SAP All-in-One ERP production system deployment on the AWS cloud
Running SAP All-in-One ERP production system deployment on the AWS cloudRunning SAP All-in-One ERP production system deployment on the AWS cloud
Running SAP All-in-One ERP production system deployment on the AWS cloud
 
Spinnaker summit: Design Considerations for Enterprise-wide roll out of Spin...
Spinnaker summit:  Design Considerations for Enterprise-wide roll out of Spin...Spinnaker summit:  Design Considerations for Enterprise-wide roll out of Spin...
Spinnaker summit: Design Considerations for Enterprise-wide roll out of Spin...
 
SecureKloud_Corporate Deck.pdf
SecureKloud_Corporate Deck.pdfSecureKloud_Corporate Deck.pdf
SecureKloud_Corporate Deck.pdf
 
(BIZ301) Getting Started: Running SAP on AWS | AWS re:Invent 2014
(BIZ301) Getting Started: Running SAP on AWS | AWS re:Invent 2014(BIZ301) Getting Started: Running SAP on AWS | AWS re:Invent 2014
(BIZ301) Getting Started: Running SAP on AWS | AWS re:Invent 2014
 
Rushcode overview
Rushcode overviewRushcode overview
Rushcode overview
 
AWS re:Invent 2016: Continuous Compliance in the AWS Cloud for Regulated Life...
AWS re:Invent 2016: Continuous Compliance in the AWS Cloud for Regulated Life...AWS re:Invent 2016: Continuous Compliance in the AWS Cloud for Regulated Life...
AWS re:Invent 2016: Continuous Compliance in the AWS Cloud for Regulated Life...
 
MySQL Enterprise Edition Overview
MySQL Enterprise Edition OverviewMySQL Enterprise Edition Overview
MySQL Enterprise Edition Overview
 
AWS re:Invent 2016: Workshop: Migrating Microsoft Applications to AWS (ENT216)
AWS re:Invent 2016: Workshop: Migrating Microsoft Applications to AWS (ENT216)AWS re:Invent 2016: Workshop: Migrating Microsoft Applications to AWS (ENT216)
AWS re:Invent 2016: Workshop: Migrating Microsoft Applications to AWS (ENT216)
 
Too Many Tools? How AWS Systems Manager Bridges Operational Models - AWS Summ...
Too Many Tools? How AWS Systems Manager Bridges Operational Models - AWS Summ...Too Many Tools? How AWS Systems Manager Bridges Operational Models - AWS Summ...
Too Many Tools? How AWS Systems Manager Bridges Operational Models - AWS Summ...
 

More from Shikha Srivastava

ADDO_2022_SRE Architectural Patterns_Nov10.pptx
ADDO_2022_SRE Architectural Patterns_Nov10.pptxADDO_2022_SRE Architectural Patterns_Nov10.pptx
ADDO_2022_SRE Architectural Patterns_Nov10.pptxShikha Srivastava
 
WITS 2022_ModernizationAndInfrastructureAsCode.pptx
WITS 2022_ModernizationAndInfrastructureAsCode.pptxWITS 2022_ModernizationAndInfrastructureAsCode.pptx
WITS 2022_ModernizationAndInfrastructureAsCode.pptxShikha Srivastava
 
Using Cloud-Native and SRE Principles to Achieve Speed and Resiliency
Using Cloud-Native and SRE Principles to Achieve Speed and ResiliencyUsing Cloud-Native and SRE Principles to Achieve Speed and Resiliency
Using Cloud-Native and SRE Principles to Achieve Speed and ResiliencyShikha Srivastava
 
How kubernetes operators can rescue dev secops in midst of a pandemic updated
How kubernetes operators can rescue dev secops in midst of a pandemic updatedHow kubernetes operators can rescue dev secops in midst of a pandemic updated
How kubernetes operators can rescue dev secops in midst of a pandemic updatedShikha Srivastava
 
Managing integration in a multi cluster world
Managing integration in a multi cluster worldManaging integration in a multi cluster world
Managing integration in a multi cluster worldShikha Srivastava
 
Helm summit 2019_handling large number of charts_sept 10
Helm summit 2019_handling large number of charts_sept 10Helm summit 2019_handling large number of charts_sept 10
Helm summit 2019_handling large number of charts_sept 10Shikha Srivastava
 
Kube con china_2019_7 missing factors for your production-quality 12-factor apps
Kube con china_2019_7 missing factors for your production-quality 12-factor appsKube con china_2019_7 missing factors for your production-quality 12-factor apps
Kube con china_2019_7 missing factors for your production-quality 12-factor appsShikha Srivastava
 
Bluemix application monitoring
Bluemix application monitoring Bluemix application monitoring
Bluemix application monitoring Shikha Srivastava
 
Modernization: Moving workloads to cloud
Modernization: Moving workloads to cloud Modernization: Moving workloads to cloud
Modernization: Moving workloads to cloud Shikha Srivastava
 
Kibana globalization at the RTP meetup
Kibana globalization at the RTP meetupKibana globalization at the RTP meetup
Kibana globalization at the RTP meetupShikha Srivastava
 
Localizing kibana for the global language landscape
Localizing kibana for the global language landscapeLocalizing kibana for the global language landscape
Localizing kibana for the global language landscapeShikha Srivastava
 
From Containerized Application to Secure and Scaling With Kubernetes
From Containerized Application to Secure and Scaling With KubernetesFrom Containerized Application to Secure and Scaling With Kubernetes
From Containerized Application to Secure and Scaling With KubernetesShikha Srivastava
 
Developing and Deploying Microservices to IBM Cloud Private
Developing and Deploying Microservices to IBM Cloud PrivateDeveloping and Deploying Microservices to IBM Cloud Private
Developing and Deploying Microservices to IBM Cloud PrivateShikha Srivastava
 
4789 creating production-ready, secure and scalable applications in ibm cloud...
4789 creating production-ready, secure and scalable applications in ibm cloud...4789 creating production-ready, secure and scalable applications in ibm cloud...
4789 creating production-ready, secure and scalable applications in ibm cloud...Shikha Srivastava
 
Panelist at women breakfast discussing latest technology trends at Elasticon
Panelist at women breakfast discussing latest technology trends at Elasticon Panelist at women breakfast discussing latest technology trends at Elasticon
Panelist at women breakfast discussing latest technology trends at Elasticon Shikha Srivastava
 

More from Shikha Srivastava (16)

ADDO_2022_SRE Architectural Patterns_Nov10.pptx
ADDO_2022_SRE Architectural Patterns_Nov10.pptxADDO_2022_SRE Architectural Patterns_Nov10.pptx
ADDO_2022_SRE Architectural Patterns_Nov10.pptx
 
WITS 2022_ModernizationAndInfrastructureAsCode.pptx
WITS 2022_ModernizationAndInfrastructureAsCode.pptxWITS 2022_ModernizationAndInfrastructureAsCode.pptx
WITS 2022_ModernizationAndInfrastructureAsCode.pptx
 
Using Cloud-Native and SRE Principles to Achieve Speed and Resiliency
Using Cloud-Native and SRE Principles to Achieve Speed and ResiliencyUsing Cloud-Native and SRE Principles to Achieve Speed and Resiliency
Using Cloud-Native and SRE Principles to Achieve Speed and Resiliency
 
How kubernetes operators can rescue dev secops in midst of a pandemic updated
How kubernetes operators can rescue dev secops in midst of a pandemic updatedHow kubernetes operators can rescue dev secops in midst of a pandemic updated
How kubernetes operators can rescue dev secops in midst of a pandemic updated
 
Managing integration in a multi cluster world
Managing integration in a multi cluster worldManaging integration in a multi cluster world
Managing integration in a multi cluster world
 
Helm summit 2019_handling large number of charts_sept 10
Helm summit 2019_handling large number of charts_sept 10Helm summit 2019_handling large number of charts_sept 10
Helm summit 2019_handling large number of charts_sept 10
 
Kube con china_2019_7 missing factors for your production-quality 12-factor apps
Kube con china_2019_7 missing factors for your production-quality 12-factor appsKube con china_2019_7 missing factors for your production-quality 12-factor apps
Kube con china_2019_7 missing factors for your production-quality 12-factor apps
 
Why Ibm cloud private
Why Ibm cloud private Why Ibm cloud private
Why Ibm cloud private
 
Bluemix application monitoring
Bluemix application monitoring Bluemix application monitoring
Bluemix application monitoring
 
Modernization: Moving workloads to cloud
Modernization: Moving workloads to cloud Modernization: Moving workloads to cloud
Modernization: Moving workloads to cloud
 
Kibana globalization at the RTP meetup
Kibana globalization at the RTP meetupKibana globalization at the RTP meetup
Kibana globalization at the RTP meetup
 
Localizing kibana for the global language landscape
Localizing kibana for the global language landscapeLocalizing kibana for the global language landscape
Localizing kibana for the global language landscape
 
From Containerized Application to Secure and Scaling With Kubernetes
From Containerized Application to Secure and Scaling With KubernetesFrom Containerized Application to Secure and Scaling With Kubernetes
From Containerized Application to Secure and Scaling With Kubernetes
 
Developing and Deploying Microservices to IBM Cloud Private
Developing and Deploying Microservices to IBM Cloud PrivateDeveloping and Deploying Microservices to IBM Cloud Private
Developing and Deploying Microservices to IBM Cloud Private
 
4789 creating production-ready, secure and scalable applications in ibm cloud...
4789 creating production-ready, secure and scalable applications in ibm cloud...4789 creating production-ready, secure and scalable applications in ibm cloud...
4789 creating production-ready, secure and scalable applications in ibm cloud...
 
Panelist at women breakfast discussing latest technology trends at Elasticon
Panelist at women breakfast discussing latest technology trends at Elasticon Panelist at women breakfast discussing latest technology trends at Elasticon
Panelist at women breakfast discussing latest technology trends at Elasticon
 

Recently uploaded

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 

Recently uploaded (20)

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 

DevOpsEnterpriseSummit_SaaSAnd DisasterRecovery.pptx

  • 1. DevOps for 24 X 7 SaaS and disaster recovery Shikha Srivastav Distinguished Engineer, IBM SaaS, Master Inventor Amitabh Prasad Software Architect, IBM
  • 2. Agenda • What is SaaS and the challenges • Business Continuity and disaster recovery is critical • Disaster Recovery Models • What we learned • Patterns based on our learning Shikha
  • 3. What is SaaS IaaS PaaS SaaS Virtual Machines,Network Storage Business Applications Container Orchestration , CICD automation Auto scaling ,Application development tools • Stable - providing consistent acceptable performance • Available - providing 24X7 availability • Reliable - for customers to leverage SaaS in their mission- critical business needs • Resilient - allowing updates (security, features, bugs), and upgrades with zero downtime • Secure and Compliant
  • 4. What it takes Collaboration and Communication Continuous Integration Continuous Delivery Observability Change and incident management Security and Compliance Business Continuity Test, Quality and Scale Plan Build Test Secure Release Operate Ideas Ideas Feedbac k Feedbac k Shikha
  • 5. What it takes Collaboration and Communication Continuous Integration Continuous Delivery Observability Change and incident management Security and Compliance Business Continuity Test, Quality and Scale Plan Build Test Secure Release Operate Ideas Ideas Feedbac k Feedbac k Shikha Disaster Recovery
  • 6. • Service needs to meet SLA • Service Recovery most critical in Cloud • Recovery Point Objective (RPO): the amount of data that will be lost • Recovery Time Objective (RTO): the amount of downtime a business can tolerate. Disaster Recovery
  • 7. Key Considerations for Disaster Recovery WHY backup - Business continuity WHAT to back up - Do you need to back up all data ? WHERE to back up - Security and privacy for customer data consideration HOW to back up - Automation for backup RECOVERY plans in case of disaster - Algorithm for recovery with automated steps
  • 9. Typical SaaS characteristics • Multi-tenant architecture • Multi Zone deployment • Rely on cloud services for data backup and recovery
  • 10. What we learned Highly available architecture with robust backup & restore for business continue • HA architecture ensures 99.999% per year availability • Backup & restore strategy ensure we have < 12 hrs of RPO and <12 hrs of RTO (for regional failures) • Within same region it's < 10 min of RPO and < 6 hrs RTO
  • 11. What we learned What to protect • Runtime Config • Volume • Cloud resources (databases)
  • 12. • Backup agents • Valero Operator to backup up configs • Volume backup agent • tag volumes • Backup of volumes & Cloud services What we learned How to protect
  • 13. Patterns we established based on our learning • Backup Runtime Kubernetes Objects • Configure Cloud Data store backups • Enable Volume replication • Handle non-cloud data store • gitOps to redeploy static and runtime objects
  • 15. Use native data stores as much as possible, if in cloud make sure to use cloud native datastores RDS, ICD etc Tag Cloud Data Resources instances and use them to to identify resources to backup using Cloud Provider Backup or any provider specific backup techniques This backup technique identifies all the resources to backup using tags, calls snapshot and other required api's to do backup Patterns Backup Cloud resources
  • 16. • Runs as pods • Watches pvc and tags related volumes • Backup of volumes managed by Cloud Provider’s api’s Patterns Backup Volumes (Replication)
  • 17. • Used for application deployment • Manage drifts between git repo and registered clusters • Restore data and artifacts • Argo CD is used to deploy rest of the application during recovery flow Patterns Gitops
  • 19. References Kubernetes volume backup for disaster recovery Backup and Restore of Kubernetes Stateful Application Data with CSI Volume Snapshots Speed and resiliency: two sides of the same coin AWS backup strategies Valero.io Disaster Recovery with GitOps Useful if leveraging Kubernetes as Container orchestrator: Kubernetes & 12-factor apps 7 missing factors from 12 factor application

Editor's Notes

  1. RPO: Max targeted period for which data will be lost in the event of disaster, this is based on how frequent you want to take backup and what’s your backup architecture. What kind of underlying backup technology and application architecture etc. RTO: Time to restore system after disaster, again lot depends on how complex your system is and what automation is inplace to get a new system up and running. For example if we use gitops other then boot time and middleware/platform deployment, new application instance can be made up and running in no time
  2. "An entire AWS region?? That will never happen!” (US East 1 went down in Dec 2021) Ransomeware attack with Garmin, such risk are increasingly common and these can be easily mitigatAs part of enterprise SaaS solution, we have to ensure, there is a robust disaster  recovery plan in place to restore service in the unlikely event that the entire region goes down.
  3. Backup and restore: backing up of individual system in cloud native world this means backing up database files, volumes, configuration etc in traditional world this requires vm backup and so on (RTO in hrs) Pilot light: Core components of the system is always running in dr region, with automation in place to quickly provision rest of the system. Like DB may be in non active state with replication enabled and in case of disaster automation takes care of rest of the application deployment (RTO in min) Warm standby: scaled down version of fully functional env running in DR region, further reduces recovery time (RTO < 10 min) Multisite active-active: Specialize configuration both env are in use, traffic is distributed across these env, with autoscaling capabilities to handle increased load DR can be easily managed (almost 0 RTO)
  4. Emphasize on diff between HA deployment vs disaster recovery HA can take care of local failures, node going down, corrupt instance etc
  5. Runtime configuration Volumes This is where application may store some of the documents and in certain cases in-cluster db's Cloud resources (databases) - Rest of the application artifacts can be recreated as part of deployment
  6. Velero for k8s backup filter what we want to backup based on labels Volume backup using native snapshot DB backup using cloud provider techniques or native DB techniques
  7. Velero for Kubernetes objects Data stored in s3 bucket replicated to DR region
  8. Use cloud native or native DB Abstractly identify services to be backed up using tags Uses storage replication (or cloud provider techniques) to move data to DR location
  9. Any typical k8s applications does have stateful sets Use volume snapshot to take backups Replication depends on cloud provider or existing storage
  10. Gitops for application deployment ArgoCD: k8s controller that monitors git repo and ensures all the deployed application is in sync with the application artifacts