SlideShare a Scribd company logo
1 of 34
Download to read offline
Running backups with Ceph-to-Ceph
OSDC 2019 May 15th, 2019
Michel Raabe
Cloud Solution Architect
B1 Systems GmbH
raabe@b1-systems.de
Introducing B1 Systems
founded in 2004
operating both nationally and internationally
more than 100 employees
vendor-independent (hardware and software)
focus:
consulting
support
development
training
operations
solutions
branch offices in Rockolding, Berlin, Cologne & Dresden
B1 Systems GmbH Running backups with Ceph-to-Ceph 2 / 34
Areas of Expertise
B1 Systems GmbH Running backups with Ceph-to-Ceph 3 / 34
Running backups with Ceph-to-Ceph
B1 Systems GmbH Running backups with Ceph-to-Ceph 4 / 34
Requirements
“We want to backup our Ceph cluster”
independent from OpenStack
asynchronous
not always offsite
fuc**** file browser
B1 Systems GmbH Running backups with Ceph-to-Ceph 5 / 34
Methods
native
rbd-mirror
s3 multisite
3rd party
“scripts”
backy2
rbd2qcow
???
B1 Systems GmbH Running backups with Ceph-to-Ceph 6 / 34
Challenges
B1 Systems GmbH Running backups with Ceph-to-Ceph 7 / 34
Challenges
crash consistency
disaster recovery
bandwidth
costs
B1 Systems GmbH Running backups with Ceph-to-Ceph 8 / 34
Crash consistency
only access to the base layer
unknown workload
corrupt filesystem
lost transactions
B1 Systems GmbH Running backups with Ceph-to-Ceph 9 / 34
Disaster recovery
how to access the remote cluster?
bandwidth?
route?
switch the storage backend?
supported by the application?
B1 Systems GmbH Running backups with Ceph-to-Ceph 10 / 34
Bandwidth
bandwidth vs. backup volume
20TB in 24h over 800mbit - ?
network latency
B1 Systems GmbH Running backups with Ceph-to-Ceph 11 / 34
Costs
a second cluster
different type of disks (hdd/ssd/nvme)
similar amount of available disk space
uplink
1/10/100 Gbit
B1 Systems GmbH Running backups with Ceph-to-Ceph 12 / 34
What can we do?
B1 Systems GmbH Running backups with Ceph-to-Ceph 13 / 34
rbd-mirror – Overview
does mirror support:
single image
whole pool
available since jewel (10.2.x)
asynchronous
daemon: rbd-mirror
no “file browser”
B1 Systems GmbH Running backups with Ceph-to-Ceph 14 / 34
rbd-mirror – What’s needed?
rbd feature: journaling (+ exclusive lock):
rbd feature enable ... journaling
30s default trigger:
rbd_mirror_sync_point_update_age
cluster name
default is “ceph”
it’s possible but ...
... hard to track
B1 Systems GmbH Running backups with Ceph-to-Ceph 15 / 34
rbd-mirror – Keyring sample
key layout:
ceph.client.admin.keyring
<clustername>.<type>.<id>.keyring
example config files:
remote.client.mirror.keyring
remote.conf
local.client.mirror.keyring
local.conf
B1 Systems GmbH Running backups with Ceph-to-Ceph 16 / 34
rbd-mirror – Problems
rbd-daemons must be able to connect to both clusters
no two public networks
same network or
routing kung fu
krbd module
B1 Systems GmbH Running backups with Ceph-to-Ceph 17 / 34
s3 multisite – Overview
S3 - simple storage service
compatible with Amazon S3
Swift compatible
Keystone integration
encryption
no “file browser”
B1 Systems GmbH Running backups with Ceph-to-Ceph 18 / 34
s3 multisite – What’s needed?
read-write or read-only
cloudsync plugin (e.g. aws)
NFS export possible
S3 “client”
B1 Systems GmbH Running backups with Ceph-to-Ceph 19 / 34
s3 multisite – Problems
Masterzone “default”
Zonegroup(s) “default”
<zone>-<zonegroup1> <-> <zone>-<zonegroup2>
default-default <-> default-default
de-fra <-> de-muc
Zonegroups are synced
one connection ...
... radosgw to radosgw
B1 Systems GmbH Running backups with Ceph-to-Ceph 20 / 34
3rd party
custom scripts
backy2
...
all based on snapshots and diff exports
B1 Systems GmbH Running backups with Ceph-to-Ceph 21 / 34
3rd party – Scripts – Overview
should use snapshot and ‘diff’
rbd snap create ..
rbd export-diff .. | ssh rbd import-diff ..
rbd export-diff --from-snap .. | ssh rbd import-diff ..
someone has to track the snapshots
B1 Systems GmbH Running backups with Ceph-to-Ceph 22 / 34
3rd party – backy2 – Overview
internal db
snapshots
rbd and file
can do backups to s3
tricky with k8s
python (only deb)
nbd mount
B1 Systems GmbH Running backups with Ceph-to-Ceph 23 / 34
3rd party – Problems
active/old snapshots
k8s with ceph backend
pv cannot be deleted
tracking/database
still no “file browser”
B1 Systems GmbH Running backups with Ceph-to-Ceph 24 / 34
3rd party – Example workflow
1 Create initial snapshot.
2 Copy first snapshot to remote cluster.
3 ... next hour/day/week/month ...
4 Create a snapshot.
5 Copy diff between snap1 vs snap2 to remote cluster.
6 Delete old snapshots.
B1 Systems GmbH Running backups with Ceph-to-Ceph 25 / 34
What can’t we do
B1 Systems GmbH Running backups with Ceph-to-Ceph 26 / 34
rbd export
plain rbd export
for one-time syncs
only disk images
Figure: rbd export | rbd import - 21min (20G)
B1 Systems GmbH Running backups with Ceph-to-Ceph 27 / 34
rbd export-diff
plain rbd export-diff
depends on the (total) diff size
only disk images
fast(er)?
scheduled
Figure: rbd export-diff - 8min (20G)
B1 Systems GmbH Running backups with Ceph-to-Ceph 28 / 34
rbd mirror
rbd-mirror
slow?
runs in the background
no schedule
Figure: rbd mirror - 30min (20G)
B1 Systems GmbH Running backups with Ceph-to-Ceph 29 / 34
s3 multisite
s3 capable client
“only” http(s)
scalable (really, really well)
B1 Systems GmbH Running backups with Ceph-to-Ceph 30 / 34
Wrap-up
B1 Systems GmbH Running backups with Ceph-to-Ceph 31 / 34
What now?
1 rbd snapshots
simple, easy, fast
controllable
can be a backup
2 s3 multisite
simple, fast
built-in
nfs export (ganesha)
no backup at all
3 rbd-mirror
disk-based
built-in
no backup at all
B1 Systems GmbH Running backups with Ceph-to-Ceph 32 / 34
What’s with ...
cephfs
no built-in replication
hourly/daily rsync?
snapshots?
B1 Systems GmbH Running backups with Ceph-to-Ceph 33 / 34
Thank You!
For more information, refer to info@b1-systems.de
or +49 (0)8457 - 931096

More Related Content

Similar to OSDC 2019 | Running backups with Ceph-to-Ceph by Michael Raabe

Images for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBSImages for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBSB1 Systems GmbH
 
Images for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBSImages for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBSAnke Börnig
 
End of the Road - Facing Current Scaling Limits within OpenStack
End of the Road - Facing Current Scaling Limits within OpenStackEnd of the Road - Facing Current Scaling Limits within OpenStack
End of the Road - Facing Current Scaling Limits within OpenStackB1 Systems GmbH
 
C# Production Debugging Made Easy
 C# Production Debugging Made Easy C# Production Debugging Made Easy
C# Production Debugging Made EasyAlon Fliess
 
Jenkins to Gitlab - Intelligent Build-Pipelines
Jenkins to Gitlab - Intelligent Build-PipelinesJenkins to Gitlab - Intelligent Build-Pipelines
Jenkins to Gitlab - Intelligent Build-PipelinesChristian Münch
 
Kubernetes - Shifting the mindset from servers to containers - microxchg 201...
Kubernetes  - Shifting the mindset from servers to containers - microxchg 201...Kubernetes  - Shifting the mindset from servers to containers - microxchg 201...
Kubernetes - Shifting the mindset from servers to containers - microxchg 201...Schlomo Schapiro
 
Release webinar architecture
Release webinar   architectureRelease webinar   architecture
Release webinar architectureBigData_Europe
 
Scaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington MeetupScaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington MeetupChris Shenton
 
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in KubernetesDeep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in KubernetesAlluxio, Inc.
 
Testing Persistent Storage Performance in Kubernetes with Sherlock
Testing Persistent Storage Performance in Kubernetes with SherlockTesting Persistent Storage Performance in Kubernetes with Sherlock
Testing Persistent Storage Performance in Kubernetes with SherlockScyllaDB
 
Creating a dynamic software deployment solution using free/libre software
Creating a dynamic software deployment solution using free/libre softwareCreating a dynamic software deployment solution using free/libre software
Creating a dynamic software deployment solution using free/libre softwareB1 Systems GmbH
 
Automated CI with AEM Cloud service
Automated CI with AEM Cloud serviceAutomated CI with AEM Cloud service
Automated CI with AEM Cloud serviceJakub Wadolowski
 
DCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker ContainersDCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker ContainersDocker, Inc.
 
Scaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesScaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesRobert Lemke
 
Salt - A Scalable Systems Management Solution for Datacenters
Salt - A Scalable Systems Management Solution for DatacentersSalt - A Scalable Systems Management Solution for Datacenters
Salt - A Scalable Systems Management Solution for DatacentersB1 Systems GmbH
 
414: Build an agile CI/CD Pipeline for application integration
414: Build an agile CI/CD Pipeline for application integration414: Build an agile CI/CD Pipeline for application integration
414: Build an agile CI/CD Pipeline for application integrationTrevor Dolby
 
Using Docker For Development
Using Docker For DevelopmentUsing Docker For Development
Using Docker For DevelopmentLaura Frank Tacho
 
Some experiences for porting application to Intel Xeon Phi
Some experiences for porting application to Intel Xeon PhiSome experiences for porting application to Intel Xeon Phi
Some experiences for porting application to Intel Xeon PhiMaho Nakata
 

Similar to OSDC 2019 | Running backups with Ceph-to-Ceph by Michael Raabe (20)

Images for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBSImages for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBS
 
Images for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBSImages for the Clouds with KIWI & OBS
Images for the Clouds with KIWI & OBS
 
End of the Road - Facing Current Scaling Limits within OpenStack
End of the Road - Facing Current Scaling Limits within OpenStackEnd of the Road - Facing Current Scaling Limits within OpenStack
End of the Road - Facing Current Scaling Limits within OpenStack
 
C# Production Debugging Made Easy
 C# Production Debugging Made Easy C# Production Debugging Made Easy
C# Production Debugging Made Easy
 
Jenkins to Gitlab - Intelligent Build-Pipelines
Jenkins to Gitlab - Intelligent Build-PipelinesJenkins to Gitlab - Intelligent Build-Pipelines
Jenkins to Gitlab - Intelligent Build-Pipelines
 
Kubernetes - Shifting the mindset from servers to containers - microxchg 201...
Kubernetes  - Shifting the mindset from servers to containers - microxchg 201...Kubernetes  - Shifting the mindset from servers to containers - microxchg 201...
Kubernetes - Shifting the mindset from servers to containers - microxchg 201...
 
PowerAI Deep Dive ( key points )
PowerAI Deep Dive ( key points )PowerAI Deep Dive ( key points )
PowerAI Deep Dive ( key points )
 
Release webinar architecture
Release webinar   architectureRelease webinar   architecture
Release webinar architecture
 
Scaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington MeetupScaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington Meetup
 
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in KubernetesDeep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
 
Testing Persistent Storage Performance in Kubernetes with Sherlock
Testing Persistent Storage Performance in Kubernetes with SherlockTesting Persistent Storage Performance in Kubernetes with Sherlock
Testing Persistent Storage Performance in Kubernetes with Sherlock
 
Creating a dynamic software deployment solution using free/libre software
Creating a dynamic software deployment solution using free/libre softwareCreating a dynamic software deployment solution using free/libre software
Creating a dynamic software deployment solution using free/libre software
 
Building For Mer
Building For MerBuilding For Mer
Building For Mer
 
Automated CI with AEM Cloud service
Automated CI with AEM Cloud serviceAutomated CI with AEM Cloud service
Automated CI with AEM Cloud service
 
DCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker ContainersDCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker Containers
 
Scaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesScaleable PHP Applications in Kubernetes
Scaleable PHP Applications in Kubernetes
 
Salt - A Scalable Systems Management Solution for Datacenters
Salt - A Scalable Systems Management Solution for DatacentersSalt - A Scalable Systems Management Solution for Datacenters
Salt - A Scalable Systems Management Solution for Datacenters
 
414: Build an agile CI/CD Pipeline for application integration
414: Build an agile CI/CD Pipeline for application integration414: Build an agile CI/CD Pipeline for application integration
414: Build an agile CI/CD Pipeline for application integration
 
Using Docker For Development
Using Docker For DevelopmentUsing Docker For Development
Using Docker For Development
 
Some experiences for porting application to Intel Xeon Phi
Some experiences for porting application to Intel Xeon PhiSome experiences for porting application to Intel Xeon Phi
Some experiences for porting application to Intel Xeon Phi
 

Recently uploaded

Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?Watsoo Telematics
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutionsmonugehlot87
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 

Recently uploaded (20)

Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutions
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 

OSDC 2019 | Running backups with Ceph-to-Ceph by Michael Raabe

  • 1. Running backups with Ceph-to-Ceph OSDC 2019 May 15th, 2019 Michel Raabe Cloud Solution Architect B1 Systems GmbH raabe@b1-systems.de
  • 2. Introducing B1 Systems founded in 2004 operating both nationally and internationally more than 100 employees vendor-independent (hardware and software) focus: consulting support development training operations solutions branch offices in Rockolding, Berlin, Cologne & Dresden B1 Systems GmbH Running backups with Ceph-to-Ceph 2 / 34
  • 3. Areas of Expertise B1 Systems GmbH Running backups with Ceph-to-Ceph 3 / 34
  • 4. Running backups with Ceph-to-Ceph B1 Systems GmbH Running backups with Ceph-to-Ceph 4 / 34
  • 5. Requirements “We want to backup our Ceph cluster” independent from OpenStack asynchronous not always offsite fuc**** file browser B1 Systems GmbH Running backups with Ceph-to-Ceph 5 / 34
  • 7. Challenges B1 Systems GmbH Running backups with Ceph-to-Ceph 7 / 34
  • 8. Challenges crash consistency disaster recovery bandwidth costs B1 Systems GmbH Running backups with Ceph-to-Ceph 8 / 34
  • 9. Crash consistency only access to the base layer unknown workload corrupt filesystem lost transactions B1 Systems GmbH Running backups with Ceph-to-Ceph 9 / 34
  • 10. Disaster recovery how to access the remote cluster? bandwidth? route? switch the storage backend? supported by the application? B1 Systems GmbH Running backups with Ceph-to-Ceph 10 / 34
  • 11. Bandwidth bandwidth vs. backup volume 20TB in 24h over 800mbit - ? network latency B1 Systems GmbH Running backups with Ceph-to-Ceph 11 / 34
  • 12. Costs a second cluster different type of disks (hdd/ssd/nvme) similar amount of available disk space uplink 1/10/100 Gbit B1 Systems GmbH Running backups with Ceph-to-Ceph 12 / 34
  • 13. What can we do? B1 Systems GmbH Running backups with Ceph-to-Ceph 13 / 34
  • 14. rbd-mirror – Overview does mirror support: single image whole pool available since jewel (10.2.x) asynchronous daemon: rbd-mirror no “file browser” B1 Systems GmbH Running backups with Ceph-to-Ceph 14 / 34
  • 15. rbd-mirror – What’s needed? rbd feature: journaling (+ exclusive lock): rbd feature enable ... journaling 30s default trigger: rbd_mirror_sync_point_update_age cluster name default is “ceph” it’s possible but ... ... hard to track B1 Systems GmbH Running backups with Ceph-to-Ceph 15 / 34
  • 16. rbd-mirror – Keyring sample key layout: ceph.client.admin.keyring <clustername>.<type>.<id>.keyring example config files: remote.client.mirror.keyring remote.conf local.client.mirror.keyring local.conf B1 Systems GmbH Running backups with Ceph-to-Ceph 16 / 34
  • 17. rbd-mirror – Problems rbd-daemons must be able to connect to both clusters no two public networks same network or routing kung fu krbd module B1 Systems GmbH Running backups with Ceph-to-Ceph 17 / 34
  • 18. s3 multisite – Overview S3 - simple storage service compatible with Amazon S3 Swift compatible Keystone integration encryption no “file browser” B1 Systems GmbH Running backups with Ceph-to-Ceph 18 / 34
  • 19. s3 multisite – What’s needed? read-write or read-only cloudsync plugin (e.g. aws) NFS export possible S3 “client” B1 Systems GmbH Running backups with Ceph-to-Ceph 19 / 34
  • 20. s3 multisite – Problems Masterzone “default” Zonegroup(s) “default” <zone>-<zonegroup1> <-> <zone>-<zonegroup2> default-default <-> default-default de-fra <-> de-muc Zonegroups are synced one connection ... ... radosgw to radosgw B1 Systems GmbH Running backups with Ceph-to-Ceph 20 / 34
  • 21. 3rd party custom scripts backy2 ... all based on snapshots and diff exports B1 Systems GmbH Running backups with Ceph-to-Ceph 21 / 34
  • 22. 3rd party – Scripts – Overview should use snapshot and ‘diff’ rbd snap create .. rbd export-diff .. | ssh rbd import-diff .. rbd export-diff --from-snap .. | ssh rbd import-diff .. someone has to track the snapshots B1 Systems GmbH Running backups with Ceph-to-Ceph 22 / 34
  • 23. 3rd party – backy2 – Overview internal db snapshots rbd and file can do backups to s3 tricky with k8s python (only deb) nbd mount B1 Systems GmbH Running backups with Ceph-to-Ceph 23 / 34
  • 24. 3rd party – Problems active/old snapshots k8s with ceph backend pv cannot be deleted tracking/database still no “file browser” B1 Systems GmbH Running backups with Ceph-to-Ceph 24 / 34
  • 25. 3rd party – Example workflow 1 Create initial snapshot. 2 Copy first snapshot to remote cluster. 3 ... next hour/day/week/month ... 4 Create a snapshot. 5 Copy diff between snap1 vs snap2 to remote cluster. 6 Delete old snapshots. B1 Systems GmbH Running backups with Ceph-to-Ceph 25 / 34
  • 26. What can’t we do B1 Systems GmbH Running backups with Ceph-to-Ceph 26 / 34
  • 27. rbd export plain rbd export for one-time syncs only disk images Figure: rbd export | rbd import - 21min (20G) B1 Systems GmbH Running backups with Ceph-to-Ceph 27 / 34
  • 28. rbd export-diff plain rbd export-diff depends on the (total) diff size only disk images fast(er)? scheduled Figure: rbd export-diff - 8min (20G) B1 Systems GmbH Running backups with Ceph-to-Ceph 28 / 34
  • 29. rbd mirror rbd-mirror slow? runs in the background no schedule Figure: rbd mirror - 30min (20G) B1 Systems GmbH Running backups with Ceph-to-Ceph 29 / 34
  • 30. s3 multisite s3 capable client “only” http(s) scalable (really, really well) B1 Systems GmbH Running backups with Ceph-to-Ceph 30 / 34
  • 31. Wrap-up B1 Systems GmbH Running backups with Ceph-to-Ceph 31 / 34
  • 32. What now? 1 rbd snapshots simple, easy, fast controllable can be a backup 2 s3 multisite simple, fast built-in nfs export (ganesha) no backup at all 3 rbd-mirror disk-based built-in no backup at all B1 Systems GmbH Running backups with Ceph-to-Ceph 32 / 34
  • 33. What’s with ... cephfs no built-in replication hourly/daily rsync? snapshots? B1 Systems GmbH Running backups with Ceph-to-Ceph 33 / 34
  • 34. Thank You! For more information, refer to info@b1-systems.de or +49 (0)8457 - 931096