Accelerating Enterprise OpenStack
When Disaster Strikes the Cloud: Who, What, When, Where and How to Recover
Michael Factor
IBM Research - Haifa
factor@il.ibm.com
Ronen Kat
IBM Research - Haifa
ronenkat@il.ibm.com
Sean Cohen
Red Hat
scohen@redhat.com

Talk Outline
• What is disaster recovery?
• Concepts and basics
• Protecting data and applications from disasters
• OpenStack Cinder toolbox for disaster recovery
• Applications are more than just data
• The road ahead: Kilo and beyond

What is Disaster Recovery?
According to Wikipedia, Disaster Recovery (DR) is "the process, policies and
procedures ... for recovery ... of technology infrastructure ... after a natural or
human-induced disaster."
[Figure: servers, storage, network, software, configuration]
Surviving a disaster requires geographic dispersion
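
Recovery Point Objective and Recovery Time Objective
Recovery Point Objective (RPO): how far back in time a disaster takes one
Recovery Time Objective (RTO): how long until operational after a disaster
[Figure: timeline centered on the disaster (0). The RPO axis runs back from seconds through minutes, hours, and days to weeks; the RTO axis runs forward from seconds through minutes, hours, and days to weeks. Labeled techniques: replication, backup, restore, active site, hot site.]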
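As a rough illustration of the two objectives, the sketch below computes the recovery point and recovery time actually achieved in one incident. The function name `achieved_rpo_rto` and all timestamps are hypothetical and do not come from the talk.

```python
from datetime import datetime, timedelta


def achieved_rpo_rto(last_copy: datetime, disaster: datetime, restored: datetime):
    """Return (data-loss window, downtime) for one incident.

    last_copy: time of the most recent backup or replicated write that survived
    disaster:  time the primary site was lost
    restored:  time the service was operational again
    """
    if not (last_copy <= disaster <= restored):
        raise ValueError("expected last_copy <= disaster <= restored")
    return disaster - last_copy, restored - disaster


# Hypothetical incident: nightly backup at 02:00, disaster at 14:30,
# service restored at 20:30 the same day.
rpo, rto = achieved_rpo_rto(
    datetime(2014, 11, 3, 2, 0),
    datetime(2014, 11, 3, 14, 30),
    datetime(2014, 11, 3, 20, 30),
)
print("data lost:", rpo, "downtime:", rto)
```

With a nightly backup the data-loss window can approach a full day, which is why replication sits closer to zero on the RPO axis than backup does.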

Data and Metadata Consistency
Data consistency
• If a modified datum is available, all data it depends upon is also available
Metadata consistency
• Configuration updates are seen in the same order relative to one another and to data updates
[Figure: an application VM with DB and LOG volumes, and matching DB and LOG volumes at a remote site]
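The data consistency rule can be read as a closure property: the set of writes present at the remote site must contain every write that its members depend on. The sketch below checks that property on a toy model. The write names and the `depends_on` map are invented for illustration.

```python
def is_data_consistent(available: set, depends_on: dict) -> bool:
    """True if every available write has all of its dependencies available."""
    return all(
        dep in available
        for write in available
        for dep in depends_on.get(write, ())
    )


# Hypothetical database: each table update depends on its log record.
depends_on = {"db_update_1": ["log_record_1"], "db_update_2": ["log_record_2"]}

# Remote copy that received writes in order: consistent.
in_order = {"log_record_1", "db_update_1", "log_record_2"}
# Remote copy holding a DB update without its log record: inconsistent.
out_of_order = {"log_record_1", "db_update_1", "db_update_2"}

print(is_data_consistent(in_order, depends_on))
print(is_data_consistent(out_of_order, depends_on))
```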

OpenStack Cloud Metadata
Virtual networks between the cloud VMs
External network access
Attached volumes
Volume types
Virtual machine flavors
SSH keys for VM access
Virtual machine images
Identities of users

Protecting Data and Applications from Disasters

Data Protection: Cinder Backup and Restore
• Cinder backup
  • Back up a volume to backup storage
[Figure: the primary cloud runs backup-create, storing the backup in Swift]

Data Protection: Cinder Backup and Restore
• Can Cinder restore on the secondary cloud?
• Problem: Cinder on the secondary cloud is not aware of the backup
[Figure: primary cloud, secondary cloud, and Swift; backup-restore is attempted on the secondary cloud]

Data Protection: Cinder Backup and Restore
• Solution: "electronic tape shipping"
  • backup-export
  • backup-import
• Supported in Cinder since Icehouse
[Figure: backup-export on the primary cloud, a backup reference passed to backup-import on the secondary cloud, with the backup itself in Swift]

Data Protection: Cinder Backup and Restore
• After backup-import, Cinder can restore on the secondary cloud
  • backup-restore
[Figure: backup-restore on the secondary cloud from the backup in Swift]
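The four slides above describe one workflow: create a backup, find that the secondary cloud cannot see it, ship only the backup reference, then restore. The sketch below walks through that sequence with a toy in-memory model. `ToyCinder`, its method bodies, and the fields of the exported reference are assumptions made for illustration; this is not the Cinder API or its record format.

```python
class ToyCinder:
    """Toy stand-in for one cloud's Cinder backup service (not the real API)."""

    def __init__(self, name, object_store):
        self.name = name
        self.object_store = object_store  # shared dict standing in for Swift
        self.backups = {}  # backup_id -> metadata known to this cloud

    def backup_create(self, backup_id, volume_data):
        self.object_store[backup_id] = volume_data
        self.backups[backup_id] = {"container": "backups", "size": len(volume_data)}

    def backup_export(self, backup_id):
        # The exported reference carries metadata only, not the data itself.
        return {"backup_id": backup_id, **self.backups[backup_id]}

    def backup_import(self, reference):
        ref = dict(reference)
        self.backups[ref.pop("backup_id")] = ref

    def backup_restore(self, backup_id):
        if backup_id not in self.backups:
            raise KeyError(f"{self.name} is not aware of backup {backup_id}")
        return self.object_store[backup_id]


swift = {}
primary = ToyCinder("primary", swift)
secondary = ToyCinder("secondary", swift)

primary.backup_create("b1", b"volume contents")
try:
    secondary.backup_restore("b1")
    restore_before_import = "ok"
except KeyError:
    restore_before_import = "failed"

# "Electronic tape shipping": only the small reference moves between clouds.
secondary.backup_import(primary.backup_export("b1"))
restored = secondary.backup_restore("b1")
print(restore_before_import, restored)
```

The model assumes both clouds can reach the same object store, as the diagrams suggest; only the metadata that makes the backup visible to Cinder has to be transferred.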

Data Protection: Cinder Volume Replication
• Cinder has initial support for volume replication in the Juno release
• Cinder back ends can "advertise" support for replication
• A volume created with the replication extra spec is allocated on a back end that supports replication, and is replicated
• Supporting back ends:
  • IBM Storwize, more expected in Kilo
[Figure: two Cinder back ends, with the callout: volume-type extra specs "capabilities:replication <is> True"]

Data Protection: Cinder Volume Replication
• The secondary volume can become primary when promoted
  • replication-promote
• Replication can be reversed following a replication-promote
  • replication-reenable
[Figure: two Cinder back ends]
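One way to picture promote followed by re-enable is as a small state machine over a replicated pair. The class below is a toy model under the assumption that promotion swaps the roles and pauses replication until it is re-enabled; the exact behavior of the real commands depends on the back-end driver. The site names are hypothetical.

```python
class ReplicatedPair:
    """Toy model of a replicated volume pair (not Cinder's implementation)."""

    def __init__(self, primary, secondary):
        self.primary = primary
        self.secondary = secondary
        self.replicating = True

    def promote(self):
        # The secondary copy takes over as primary; in this model replication
        # stops until it is explicitly re-enabled.
        self.primary, self.secondary = self.secondary, self.primary
        self.replicating = False

    def reenable(self):
        # Resume replication from the current primary to the current
        # secondary, i.e. in the reverse of the original direction.
        self.replicating = True

    def direction(self):
        arrow = "->" if self.replicating else "-x-"
        return f"{self.primary} {arrow} {self.secondary}"


pair = ReplicatedPair("site-A", "site-B")
states = [pair.direction()]
pair.promote()
states.append(pair.direction())
pair.reenable()
states.append(pair.direction())
print(states)
```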

Consistency Groups
• New in Juno
• Support for volume grouping for consistency
  • Grouping of volumes is based on the volume type
• Currently supports
  • Consistency group snapshots
• Needs to be extended to support
  • Cinder backup
  • Cinder volume replication
[Figure: DB and LOG volumes grouped together]
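To see why the DB and LOG volumes belong in one group, the sketch below compares per-volume snapshots taken at different moments with a single group snapshot. `ToyVolume`, `snapshot_group`, and the write-ahead `commit` helper are illustrative assumptions; in this single-threaded toy, "the same point in time" simply means no write happens between the per-volume copies.

```python
import copy


class ToyVolume:
    def __init__(self, name):
        self.name = name
        self.blocks = []


def snapshot_group(volumes):
    """Capture every volume in the group at the same point in time."""
    return {v.name: copy.copy(v.blocks) for v in volumes}


log, db = ToyVolume("LOG"), ToyVolume("DB")


def commit(txn):
    log.blocks.append(txn)  # write-ahead: the log record first
    db.blocks.append(txn)   # then the data page


commit("t1")
# Per-volume snapshots taken at different times can straddle a transaction.
log_snap_early = copy.copy(log.blocks)
commit("t2")
db_snap_late = copy.copy(db.blocks)
# A group snapshot captures both volumes together.
group_snap = snapshot_group([log, db])

print(log_snap_early, db_snap_late, group_snap)
```

The mismatched pair holds a DB change (`t2`) whose log record is missing, which violates the data consistency rule; the group snapshot does not.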

Protecting Applications from Disasters
[Figure: disaster recovery orchestration across servers, storage, network, software, and configuration]

OpenStack Tools
• Applications are defined in OpenStack by
  • Heat Orchestration Templates
• However
  • Not all applications are template based
  • Deployments (including configuration) change over time
  • Some definitions are cloud specific, e.g., networks, types
  • Heat templates and stacks don't stay consistent
• Tools exist that can create a template from a deployment, e.g., Flame, ReHeat
  • But the template will only fit the current cloud

OpenStack Tools and Beyond
• Demo: a technology preview for disaster recovery with IBM Cloud Manager

THE ROAD AHEAD

Ceph Multi-Site & Disaster Recovery (Block) example
• Export snapshots to geographically dispersed data centers
  • Provides disaster recovery
• Export incremental snapshots
  • Minimize network bandwidth by sending only changes
• Kilo cycle focus is to extend the multi-site and disaster recovery options
  • RBD mirroring
  • Cinder volume replication
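The bandwidth saving from incremental snapshots comes from shipping only the blocks that differ between two snapshots. The sketch below shows the idea on dictionaries of block number to contents. It is a conceptual model with invented helper names (`snapshot_diff`, `apply_diff`), not how Ceph represents or transfers snapshot differences.

```python
def snapshot_diff(old: dict, new: dict) -> dict:
    """Blocks that changed between two snapshots; None marks a removed block."""
    diff = {i: data for i, data in new.items() if old.get(i) != data}
    diff.update({i: None for i in old if i not in new})
    return diff


def apply_diff(base: dict, diff: dict) -> dict:
    """Rebuild the newer snapshot from the older one plus the diff."""
    result = dict(base)
    for i, data in diff.items():
        if data is None:
            result.pop(i, None)
        else:
            result[i] = data
    return result


snap1 = {0: b"boot", 1: b"data-v1", 2: b"tmp"}
snap2 = {0: b"boot", 1: b"data-v2", 3: b"new"}

remote = dict(snap1)                 # full export of the first snapshot
delta = snapshot_diff(snap1, snap2)  # only this crosses the network afterwards
remote = apply_diff(remote, delta)
print(sorted(delta), remote == snap2)
```

Block 0 never changes, so it is absent from the delta; the remote site still ends up with an exact copy of the second snapshot.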

Ceph Multi-Site & Disaster Recovery (Object) example
• Zones and region support
  • Deploy topologies similar to S3 and others with a global namespace
• Data center synchronization
  • Back up full or partial sets of data between regions
• Read affinity
  • Serve local copies of data to local users

Disaster Recovery as a Service Catalog
• Pluggable disaster recovery policies
  • Replication targets can specify different RPO/RTO levels that can be offered based on the supported back-end capabilities
• Disaster recovery policies
  • Active - Cold standby
  • Active - Hot standby
  • Active - Active (requires application awareness and transaction integrity)
  • Backup to Cloud / From the Cloud

Extending Heat Orchestration for Disaster Recovery
• Heat can be used to automate disaster recovery
  • Add support for Cinder replication
  • Consistency groups need to span OpenStack projects
    • Nova, Cinder, Trove, …
• Stack snapshot backup / rollback
• Enable customization of workload components at the recovery site
  • Networks, VM configuration changes, guest agent, etc.

The Road Toward Application Consistency
First phase: File system consistency
• Integrate into OpenStack to allow consistent snapshots and backups
  • Nova needs to request the QEMU Guest Agent to freeze the file systems (and applications, if fsfreeze-hook is installed) during the snapshot
• Patches have been proposed for Nova and Cinder, targeting the Kilo release
Source: Hitachi
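The freeze, snapshot, thaw sequence described above has one hard requirement: the file systems must be thawed even when the snapshot fails. The sketch below shows that ordering with a fake agent that only records calls. `FakeGuestAgent`, `frozen_filesystems`, and `take_snapshot` are hypothetical names for this example and are not part of Nova, Cinder, or QEMU.

```python
from contextlib import contextmanager


class FakeGuestAgent:
    """Records calls; a real deployment would talk to the QEMU guest agent."""

    def __init__(self):
        self.calls = []

    def freeze(self):
        self.calls.append("freeze")

    def thaw(self):
        self.calls.append("thaw")


@contextmanager
def frozen_filesystems(agent):
    agent.freeze()
    try:
        yield
    finally:
        agent.thaw()  # always thaw, even if the snapshot fails


def take_snapshot(agent, snapshot_fn):
    with frozen_filesystems(agent):
        agent.calls.append("snapshot")
        return snapshot_fn()


agent = FakeGuestAgent()
take_snapshot(agent, lambda: "snap-1")

failing = FakeGuestAgent()


def broken_snapshot():
    raise RuntimeError("storage error")


try:
    take_snapshot(failing, broken_snapshot)
except RuntimeError:
    pass

print(agent.calls, failing.calls)
```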

The Road Toward Application Consistency
Next phase: Consistency at the application level
• Application-aware snapshots on Windows with VSS support in qemu-ga
  • Application notification via Microsoft Volume Shadow Copy Service (VSS)
• Application-aware snapshots on Linux using qemu-ga hooks
  • Application-consistent snapshots can be created with scripts interacting with the QEMU guest agent
  • The scripts can notify applications to flush their data

Disaster Recovery at Scale
• The holy grail of site evacuation is an automatic, planned migration of workloads and data from one cloud-scale data center to another
• New OpenStack HA approaches to help recovery from infrastructure failures:
  • Leveraging Pacemaker to provide automated detection of a failed hypervisor and recovery of the VMs that were running there
  • Evacuating an instance to a scheduled host was added in Juno
  • A simple tagging API for instances in Nova was accepted for the Kilo release
    • Can support a new automatic-recovery tag

OpenStack Documentation needs to catch up…
• Join the OpenStack Disaster Recovery Guide
• We have a basic OpenStack High Availability Guide
  • http://docs.openstack.org/high-availability-guide/content/
• A very outdated "Recover cloud after disaster" section in the Admin guide
  • http://docs.openstack.org/admin-guide-cloud/content/section_nova-disaster-recovery-process.html
Q&A
THANK YOU

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

When disaster strikes the cloud: Who, what, when, where and how to recover

  • 1. Accelerating Enterprise OpenStack. When Disaster Strikes the Cloud: Who, What, When, Where and How to Recover
      • Michael Factor, IBM Research - Haifa, factor@il.ibm.com
      • Ronen Kat, IBM Research - Haifa, ronenkat@il.ibm.com
      • Sean Cohen, Red Hat, scohen@redhat.com
  • 2. Talk Outline
      • What is disaster recovery?
      • Concepts and basics
      • Protecting data and applications from disasters
      • OpenStack Cinder toolbox for disaster recovery
      • Applications are more than just data
      • The road ahead: Kilo and beyond
  • 3. What is Disaster Recovery?
      • According to Wikipedia, Disaster Recovery (DR) is "the process, policies and procedures . . . for recovery . . . of technology infrastructure . . . after a natural or human-induced disaster."
      • What has to be recovered: servers, storage, network, software, configuration
      • Surviving a disaster requires geographic dispersion
  • 4. Recovery Point Objective and Recovery Time Objective
      • Recovery Point Objective (RPO): how far back in time a disaster takes one
      • Recovery Time Objective (RTO): how long until operational after a disaster
      • [Figure: two timelines meeting at the disaster (0), each scaled in seconds, minutes, hours, days and weeks. Labels on the figure: Replication, Backup, restore, Active site, Hot site.]
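To make the two objectives concrete, here is a minimal Python sketch that is not taken from the talk. It assumes data is protected by a periodic copy (for example a nightly backup) and that recovery steps run one after another. The function names, step names and all figures are illustrative assumptions.

```python
from datetime import timedelta


def worst_case_rpo(protection_interval: timedelta, transfer_time: timedelta) -> timedelta:
    """Worst-case data-loss window for periodic protection.

    If the disaster hits just before a copy finishes, the newest usable
    copy is the previous one, taken one interval plus one transfer earlier.
    """
    return protection_interval + transfer_time


def estimated_rto(steps: dict) -> timedelta:
    """Time until operational when the recovery steps run sequentially."""
    return sum(steps.values(), timedelta())


# Hypothetical figures for a nightly backup that takes two hours to copy.
nightly_rpo = worst_case_rpo(timedelta(hours=24), timedelta(hours=2))

# Hypothetical recovery steps at the secondary site.
restore_rto = estimated_rto({
    "import backup record": timedelta(minutes=5),
    "restore volume": timedelta(hours=3),
    "boot and verify VM": timedelta(minutes=20),
})

print("worst-case RPO:", nightly_rpo)
print("estimated RTO:", restore_rto)
```

Continuous replication shrinks both terms of the RPO toward seconds, which is why the figure places it closest to the disaster point.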
  • 5. Data and Metadata Consistency
      • Data consistency: if a modified datum is available, all data it depends upon is also available
      • Metadata consistency: configuration updates are seen in the same order relative to one another and to data updates
      • [Figure: an application VM with DB and LOG volumes, and DB and LOG volumes at a remote site]
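The data-consistency rule above can be checked mechanically. The following is a small sketch under the assumption that each write can name the writes it depends on; the write names and the `is_consistent` helper are hypothetical and only illustrate the definition.

```python
def is_consistent(replica: set, depends_on: dict) -> bool:
    """True if every write present at the remote site has all of the
    writes it depends on present there as well."""
    return all(depends_on.get(write, set()) <= replica for write in replica)


# Hypothetical example: a database writes its log record first, then the
# data page that depends on that log record.
depends_on = {"db_page_7": {"log_record_42"}}

log_only = is_consistent({"log_record_42"}, depends_on)
both = is_consistent({"log_record_42", "db_page_7"}, depends_on)
page_without_log = is_consistent({"db_page_7"}, depends_on)

print(log_only, both, page_without_log)
```

The third case is the one a disaster recovery design has to rule out: the DB volume reached the remote site ahead of the LOG volume it depends on.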
  • 6. OpenStack Cloud Metadata
      • Virtual networks between the cloud VMs
      • External network access
      • Attached volumes
      • Volume types
      • Virtual machine flavors
      • SSH keys for VM access
      • Virtual machine images
      • Identities of users
  • 7. Protecting Data and Applications from Disasters
  • 8. Data Protection: Cinder Backup and Restore
      • Cinder backup: back up a volume to backup storage
      • [Figure: backup-create from the primary cloud to Swift]
  • 9. Data Protection: Cinder Backup and Restore
      • Can Cinder restore on the secondary cloud?
      • Problem: Cinder on the secondary cloud is not aware of the backup
      • [Figure: backup-restore from Swift, with the primary cloud and the secondary cloud]
  • 10. Data Protection: Cinder Backup and Restore
      • Solution: "electronic tape shipping" with backup-export and backup-import
      • Supported in Cinder since Icehouse
      • [Figure: backup-export on the primary cloud produces a backup reference, which backup-import registers on the secondary cloud; the backup data stays in Swift]
  • 11. Data Protection: Cinder Backup and Restore
      • After backup-import, Cinder can restore on the secondary cloud with backup-restore
      • [Figure: backup-restore from Swift to the secondary cloud]
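The backup-create, backup-export, backup-import and backup-restore sequence from the four slides above can be sketched as a toy in-memory model. This is not the Cinder API: `ObjectStore` and `ToyCinder` are hypothetical stand-ins, and the only point is that the secondary cloud can read the shared backup storage but has no record of the backup until the exported reference is imported.

```python
import uuid


class ObjectStore:
    """Stand-in for the shared backup storage (Swift in the slides)."""

    def __init__(self):
        self.objects = {}


class ToyCinder:
    """In-memory stand-in for one cloud's Cinder service (not the real API)."""

    def __init__(self, name, store):
        self.name = name
        self.store = store
        self.backup_records = {}  # backup id -> object key in the store

    def backup_create(self, volume_data: bytes) -> str:
        backup_id = str(uuid.uuid4())
        key = "backups/" + backup_id
        self.store.objects[key] = volume_data
        self.backup_records[backup_id] = key
        return backup_id

    def backup_export(self, backup_id: str) -> dict:
        # The "backup reference": metadata only, no volume data.
        return {"backup_id": backup_id, "location": self.backup_records[backup_id]}

    def backup_import(self, reference: dict) -> None:
        self.backup_records[reference["backup_id"]] = reference["location"]

    def backup_restore(self, backup_id: str) -> bytes:
        if backup_id not in self.backup_records:
            raise LookupError(self.name + ": unknown backup " + backup_id)
        return self.store.objects[self.backup_records[backup_id]]


swift = ObjectStore()
primary = ToyCinder("primary", swift)
secondary = ToyCinder("secondary", swift)

backup_id = primary.backup_create(b"volume contents")

try:
    secondary.backup_restore(backup_id)
except LookupError as err:
    print("before import:", err)

secondary.backup_import(primary.backup_export(backup_id))
restored = secondary.backup_restore(backup_id)
print("after import:", restored)
```

Only the small reference travels between the clouds, which is what makes the "electronic tape shipping" analogy apt.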
  • 12. Data Protection: Cinder Volume Replication
      • Cinder has initial support for volume replication in the Juno release
      • Cinder back-ends can "advertise" support for replication
      • A volume created with the replication extra-spec is allocated on a back-end that supports replication, and is replicated
      • Supporting back-ends: IBM Storwize, with more expected in Kilo
      • Volume-type extra specs: "capabilities:replication <is> True"
      • [Figure: two Cinder back-ends]
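The way a volume-type extra spec steers a volume toward a replication-capable back-end can be illustrated with a short sketch. This is a simplified stand-in for scheduler-side capability matching, not Cinder's implementation; it only understands the `<is>` operator shown on the slide, and the back-end names are hypothetical.

```python
def parse_is(spec_value: str) -> bool:
    """Parse a boolean requirement written as '<is> True' or '<is> False'."""
    operator, _, operand = spec_value.partition(" ")
    if operator != "<is>":
        raise ValueError("unsupported operator in " + repr(spec_value))
    return operand.strip().lower() == "true"


def backends_for(volume_type_specs: dict, backends: dict) -> list:
    """Back-ends whose advertised capabilities satisfy every
    'capabilities:<name>' requirement of the volume type."""
    matches = []
    for name, capabilities in backends.items():
        satisfied = True
        for key, value in volume_type_specs.items():
            scope, _, capability = key.partition(":")
            if scope != "capabilities":
                continue  # other scopes are ignored in this sketch
            if bool(capabilities.get(capability, False)) != parse_is(value):
                satisfied = False
                break
        if satisfied:
            matches.append(name)
    return matches


# Hypothetical back-ends and the capabilities they advertise.
backends = {
    "storwize-a": {"replication": True},
    "lvm-b": {"replication": False},
    "generic-c": {},
}
replicated_type = {"capabilities:replication": "<is> True"}

eligible = backends_for(replicated_type, backends)
print(eligible)
```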
  • 13. Data Protection: Cinder Volume Replication
      • The secondary volume can become primary when promoted: replication-promote
      • Replication can be reversed following a replication-promote: replication-reenable
      • [Figure: two Cinder back-ends]
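The promote and re-enable transitions can be pictured as a tiny state model. This sketch follows the wording of the slide only; the `ReplicatedPair` class and the volume names are hypothetical and say nothing about how a particular back-end implements the operations.

```python
from dataclasses import dataclass


@dataclass
class ReplicatedPair:
    primary: str
    secondary: str
    replicating: bool = True

    def promote(self) -> None:
        """The secondary becomes primary; assume replication pauses here."""
        self.primary, self.secondary = self.secondary, self.primary
        self.replicating = False

    def reenable(self) -> None:
        """Resume replication, now from the new primary to the old one."""
        self.replicating = True


pair = ReplicatedPair(primary="site-a/volume", secondary="site-b/volume")

pair.promote()
print("after promote:", pair)

pair.reenable()
print("after reenable:", pair)
```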
  • 14. Consistency Groups
      • New in Juno
      • Support for volume grouping for consistency
      • Grouping of volumes is based on the volume-type
      • Currently supported: consistency group snapshots
      • Needs to be extended to support Cinder backup and Cinder volume replication
      • [Figure: DB and LOG volumes]
  • 15. Protecting Applications from Disasters
      • Servers, storage, network, software, configuration
      • Disaster Recovery Orchestration
  • 16. OpenStack Tools
      • Applications are defined in OpenStack by Heat Orchestration Templates
      • However:
          • Not all applications are template based
          • Deployments (including configuration) change over time
          • Some definitions are cloud specific, e.g., networks, types
          • Heat templates and stacks don't stay consistent
      • There are tools that can create a template from a deployment, e.g., Flame, ReHeat
      • But the resulting template will only fit the current cloud
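One way to picture the "cloud specific definitions" problem is to keep the cloud-specific values out of the template body and supply them per cloud. The sketch below uses a plain Python dictionary shaped like a Heat template and a small resolver for `get_param` references. The resource names, parameter names and per-cloud values are all hypothetical, and the resolver is a simplification, not Heat itself.

```python
# Resources section shaped like a Heat template; values that differ between
# clouds are referenced through get_param instead of being hard-coded.
resources = {
    "db_server": {
        "type": "OS::Nova::Server",
        "properties": {
            "image": {"get_param": "image"},
            "flavor": {"get_param": "flavor"},
            "networks": [{"network": {"get_param": "network"}}],
        },
    },
    "db_volume": {
        "type": "OS::Cinder::Volume",
        "properties": {"size": 10, "volume_type": {"get_param": "volume_type"}},
    },
}


def resolve(node, params):
    """Recursively replace {'get_param': name} with one cloud's value."""
    if isinstance(node, dict):
        if set(node) == {"get_param"}:
            return params[node["get_param"]]  # KeyError if the cloud lacks it
        return {key: resolve(value, params) for key, value in node.items()}
    if isinstance(node, list):
        return [resolve(item, params) for item in node]
    return node


# Hypothetical per-cloud values.
primary_params = {"image": "db-image", "flavor": "m1.large",
                  "network": "prod-net", "volume_type": "replicated"}
secondary_params = {"image": "db-image-dr", "flavor": "m1.large",
                    "network": "dr-net", "volume_type": "replicated-dr"}

on_primary = resolve(resources, primary_params)
on_secondary = resolve(resources, secondary_params)
print(on_secondary["db_server"]["properties"]["networks"])
```

A template generated from a running deployment typically has these values baked in, which is the limitation the last bullet of the slide points at.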
  • 17. OpenStack Tools and Beyond
      • Demo: a technology preview for disaster recovery with IBM Cloud Manager
  • 19. Ceph Multi-Site & Disaster Recovery (Block) example
      • Export snapshots to geographically dispersed data centers
          • Provides disaster recovery
      • Export incremental snapshots
          • Minimizes network bandwidth by sending only the changes
      • Kilo cycle focus: extending the multi-site and disaster recovery options
          • RBD Mirroring
          • Cinder Volume Replication
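The bandwidth saving from incremental snapshots comes from shipping only the blocks that changed between two snapshots. Ceph exposes this idea through `rbd export-diff` and `rbd import-diff`; the sketch below does not call them and is only a toy model in which a snapshot is a mapping from block number to block contents.

```python
from typing import Dict, Optional

Snapshot = Dict[int, bytes]  # block number -> block contents


def snapshot_diff(previous: Snapshot, current: Snapshot) -> Dict[int, Optional[bytes]]:
    """Blocks that changed between two snapshots (None marks a removed block)."""
    diff: Dict[int, Optional[bytes]] = {}
    for block, data in current.items():
        if previous.get(block) != data:
            diff[block] = data
    for block in previous.keys() - current.keys():
        diff[block] = None
    return diff


def apply_diff(base: Snapshot, diff: Dict[int, Optional[bytes]]) -> Snapshot:
    """Bring a remote copy of the older snapshot up to the newer one."""
    result = dict(base)
    for block, data in diff.items():
        if data is None:
            result.pop(block, None)
        else:
            result[block] = data
    return result


# Hypothetical volume contents on two consecutive days.
monday = {0: b"boot", 1: b"data-v1", 2: b"logs-v1"}
tuesday = {0: b"boot", 1: b"data-v2", 2: b"logs-v1", 3: b"new"}

diff = snapshot_diff(monday, tuesday)
remote = apply_diff(monday, diff)  # the remote site already holds Monday

print("blocks sent:", sorted(diff))
print("remote matches Tuesday:", remote == tuesday)
```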
  • 20. Ceph Multi-Site & Disaster Recovery (Object) example
      • Zones and region support
          • Deploy topologies similar to S3 and others with a global namespace
      • Data center synchronization
          • Back up full or partial sets of data between regions
      • Read affinity
          • Serve local copies of data to local users
  • 21. Disaster Recovery as a Service Catalog
      • Pluggable Disaster Recovery policies
      • Replication targets can specify different RPO/RTO levels, offered according to the capabilities the back-end supports
      • Disaster Recovery Policies
          • Active - Cold standby
          • Active - Hot standby
          • Active - Active (requires application awareness and transaction integrity)
          • Backup to Cloud / From the Cloud
  • 22. Extending Heat Orchestration for Disaster Recovery
      • Heat can be used to automate
      • Add support for Cinder replication
      • Consistency groups need to span OpenStack projects: Nova, Cinder, Trove, ...
      • Stack Snapshot Backup / Rollback
      • Enable customization of workload components at the recovery site: networks, VM configuration changes, guest agent, etc.
  • 23. The Road Toward Application Consistency
      • First phase: file system consistency
      • Integrate into OpenStack to allow consistent snapshots and backups
      • Nova needs to request the QEMU Guest Agent to freeze the file systems (and applications, if fsfreeze-hook is installed) during the snapshot
      • Patches have been proposed for Nova and Cinder, targeting the Kilo release
      • Source: Hitachi
  • 24. The Road Toward Application Consistency
      • Next phase: consistency at the application level
      • Application-aware on Windows with VSS support in qemu-ga
          • Application notification via Microsoft Volume Shadow Copy Service (VSS)
      • Application-aware on Linux using qemu-ga hooks
          • Application-consistent snapshots can be created with scripts interacting with the QEMU guest agent
          • The scripts can notify applications to flush their data
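A guest-side hook of the kind described above could look roughly like the following. This is a sketch, assuming the hook is invoked with a single argument, `freeze` before the file systems are frozen and `thaw` after they are released, as QEMU's sample fsfreeze-hook script is. The two application functions are hypothetical placeholders for whatever the application needs, such as asking a database to flush and hold writes.

```python
import sys


def flush_application() -> str:
    # Hypothetical placeholder: ask the application to flush its data and
    # hold new writes until the snapshot has been taken.
    return "application flushed and quiesced"


def resume_application() -> str:
    # Hypothetical placeholder: let the application accept writes again.
    return "application resumed"


def handle(action: str) -> str:
    """Dispatch the hook argument to the matching application step."""
    if action == "freeze":
        return flush_application()
    if action == "thaw":
        return resume_application()
    raise ValueError("unexpected hook action: " + repr(action))


# Run as a script only when called with exactly one recognised argument.
if __name__ == "__main__" and len(sys.argv) == 2 and sys.argv[1] in ("freeze", "thaw"):
    print(handle(sys.argv[1]))
```

Keeping the freeze step short matters, because the file systems stay frozen until the snapshot completes and the thaw step runs.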
  • 25. Disaster Recovery at Scale
      • The holy grail of site evacuation is an automatic, planned migration of workloads and data from one cloud-scale datacenter to another
      • New OpenStack HA approaches help recovery from infrastructure failures:
          • Pacemaker provides automated detection of a failed hypervisor and recovery of the VMs that were running there
          • Evacuating an instance to a scheduled host was added in Juno
          • A simple tagging API for instances in Nova was accepted for the Kilo release; it could support a new automatic-recovery tag
  • 26. OpenStack Documentation needs to catch up…
      • Join the OpenStack Disaster Recovery Guide
      • We have a basic OpenStack High Availability Guide: http://docs.openstack.org/high-availability-guide/content/
      • A very outdated "Recover cloud after disaster" section in the Admin guide: http://docs.openstack.org/admin-guide-cloud/content/section_nova-disaster-recovery-process.html
  • 27. Q&A. Thank you.
      • Michael Factor, IBM Research - Haifa, factor@il.ibm.com
      • Ronen Kat, IBM Research - Haifa, ronenkat@il.ibm.com
      • Sean Cohen, Red Hat, scohen@redhat.com