RED HAT CONFIDENTIAL | NDA ONLY
CACHE TIERING AND ERASURE CODING
#ceph-devel
shinobu
AGENDA
■ CEPH MOTIVATING PRINCIPLES
■ CEPH COMPONENTS
■ ARCHITECTURE COMPONENTS
■ RADOS
■ LIBRADOS
■ RADOS COMPONENTS
■ DATA PLACEMENT
■ CACHE TIERING
■ ERASURE CODING
CEPH MOTIVATING PRINCIPLES
■ All components must scale horizontally
■ There can be no single point of failure
■ The solution must be hardware agnostic
■ Should use commodity hardware
■ Self-manage whenever possible
■ Open source (LGPL)
■ Move beyond legacy approaches
  ■ Client / cluster instead of client / server
  ■ Ad hoc HA
CEPH COMPONENTS
■ RGW (used by APP): A web services gateway for object storage, compatible with S3 and Swift
■ RBD (used by HOST/VM): A reliable, fully-distributed block device with cloud platform integration
■ CephFS (used by CLIENT): A distributed file system with POSIX semantics and scale-out metadata management
■ LIBRADOS: A library allowing apps to directly access RADOS (C, C++, Java, Python, Ruby, PHP)
■ RADOS: A software-based, reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes and lightweight monitors
ARCHITECTURE COMPONENTS
THE RADOS GATEWAY
[Diagram: each APPLICATION talks to a RADOSGW instance, which uses LIBRADOS to access the RADOS CLUSTER (monitors M, M, M)]
STORING VIRTUAL DISK: LIBRBD
[Diagram: a VM runs on a HYPERVISOR, which uses LIBRBD to access the RADOS CLUSTER (monitors M, M)]
KERNEL MODULE: KRBD
[Diagram: a LINUX HOST uses the KRBD kernel module to access the RADOS CLUSTER (monitors M, M)]
RBD FEATURES
■ Stripe images across entire cluster (pool)
■ Read-only snapshots
■ Copy-on-write clones
■ Broad integration
  ■ QEMU
  ■ Linux kernel
  ■ iSCSI (STGT, LIO)
  ■ OpenStack, CloudStack, Nebula, Ganeti, Proxmox
■ Incremental backup (relative to snapshot)
RBD FEATURES
■ Image mirroring
  ■ Asynchronous replication to another cluster
  ■ Replica(s) crash consistent
  ■ Replication is per-image
  ■ Each image has a data journal
  ■ RBD mirror daemon does the work
[Diagram: in CLUSTER A, a HYPERVISOR uses LIBRBD and writes to the image Journal; rbd-mirror replays the Journal into CLUSTER B, which has its own HYPERVISOR with LIBRBD]
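The journal-based mirroring described above can be illustrated with a toy in-memory model. This is only a sketch, not the rbd-mirror implementation: the names `Image`, `MirrorDaemon`, and `replay` are hypothetical, and real journals live in RADOS rather than in a Python list.

```python
from dataclasses import dataclass, field


@dataclass
class Image:
    """Toy RBD image: a block map plus an append-only write journal."""
    blocks: dict = field(default_factory=dict)
    journal: list = field(default_factory=list)

    def write(self, offset: int, data: bytes) -> None:
        # Record the write in the journal, then apply it to the image.
        self.journal.append((offset, data))
        self.blocks[offset] = data


@dataclass
class MirrorDaemon:
    """Toy stand-in for the mirror daemon: replays a journal in order."""
    position: int = 0  # index of the next journal entry to replay

    def replay(self, primary: Image, replica: Image, max_entries: int) -> int:
        entries = primary.journal[self.position:self.position + max_entries]
        for offset, data in entries:
            replica.blocks[offset] = data
        self.position += len(entries)
        return len(entries)


primary, replica = Image(), Image()
daemon = MirrorDaemon()
primary.write(0, b"boot")
primary.write(4096, b"data-v1")
primary.write(4096, b"data-v2")

# Asynchronous: the replica lags, but always matches some past primary state.
daemon.replay(primary, replica, max_entries=2)
lagging = dict(replica.blocks)
daemon.replay(primary, replica, max_entries=10)
```

Because entries are replayed strictly in journal order, the replica at any moment equals the primary at an earlier point in time, which is what "crash consistent" means here.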
SEPARATE METADATA SERVER
[Diagram: a LINUX HOST with the KERNEL MODULE sends metadata and data over separate paths to the RADOS CLUSTER (monitors M, M, M)]
SCALABLE METADATA SERVERS
MDS
■ Manages metadata for a POSIX-compliant shared filesystem
  ■ Directory hierarchy
  ■ File metadata (owner, timestamps, mode, etc.)
  ■ Snapshots on any directory
■ Clients stripe file data in RADOS
  ■ MDS not in data path
■ MDS stores metadata in RADOS
■ Dynamic MDS cluster scales to 10s or 100s
■ Only required for shared file system
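LIBRADOS API
#include <rados/librados.hpp>

int main() {
  // Connect to the cluster as client.admin
  librados::Rados rados;
  rados.init("admin");
  rados.conf_read_file(nullptr);  // search the default ceph.conf locations
  rados.connect();

  // Create a pool and open an I/O context on it
  librados::IoCtx io_ctx;
  rados.pool_create("swimming_pool");
  rados.ioctx_create("swimming_pool", io_ctx);

  // Synchronously write (replace) the whole object
  librados::bufferlist bl;
  bl.append("water");
  io_ctx.write_full("octopus", bl);

  // Asynchronously read up to 4193404 bytes starting at offset 0
  librados::bufferlist rbl;
  librados::AioCompletion *read_completion1 = librados::Rados::aio_create_completion();
  io_ctx.aio_read("octopus", read_completion1, &rbl, 4193404, 0);
  read_completion1->wait_for_complete();
  int ret = read_completion1->get_return_value();  // bytes read, or a negative errno
  read_completion1->release();

  // Set an extended attribute through a compound write operation
  librados::ObjectWriteOperation write_op;
  librados::bufferlist xbl;
  xbl.append('2');
  write_op.setxattr("version", xbl);
  io_ctx.operate("octopus", &write_op);

  rados.shutdown();
  return ret < 0 ? 1 : 0;
}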
RADOS COMPONENTS
OSD:
■ 10s to 1000s in a cluster
■ One per disk (or one per SSD, RAID group…)
■ Serve stored objects to clients
■ Intelligently peer for replication & recovery
OBJECT STORAGE DAEMON
[Diagram: a RADOS cluster with three monitors (M) and four OSDs; each OSD runs on a file system (FS) on its own DISK]
RADOS COMPONENTS
MON:
■ Maintain cluster membership and state
■ Provide consensus for distributed decision making
■ Small, odd number (e.g., 5)
■ Not part of data path
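The "small, odd number" guidance follows from majority arithmetic: consensus needs a strict majority of monitors, so an even count adds a daemon without adding failure tolerance. A minimal sketch, with hypothetical helper names:

```python
def quorum_size(monitors: int) -> int:
    """Smallest strict majority of the monitor set."""
    return monitors // 2 + 1


def tolerated_failures(monitors: int) -> int:
    """Monitors that can fail while a majority still remains."""
    return monitors - quorum_size(monitors)


for n in (3, 4, 5):
    print(f"{n} monitors: quorum {quorum_size(n)}, "
          f"tolerates {tolerated_failures(n)} failure(s)")
```

Four monitors tolerate the same single failure as three, while five tolerate two.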
CRUSH
■ Pseudo-random placement algorithm
  ■ Fast calculation, no lookup
  ■ Repeatable, deterministic
■ Statistically uniform distribution
■ Stable mapping
  ■ Limited data migration on change
■ Rule-based configuration
  ■ Infrastructure topology aware
  ■ Adjustable replication
  ■ Weighting
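The properties listed above (calculated rather than looked up, deterministic, weighted, limited migration on change) can be demonstrated with a much simpler stand-in. The sketch below uses weighted rendezvous hashing, which is not the CRUSH algorithm and ignores topology rules; the OSD names and the `place` helper are hypothetical.

```python
import hashlib
import math


def _unit_hash(obj: str, osd: str) -> float:
    """Deterministic pseudo-random value strictly inside (0, 1)."""
    digest = hashlib.sha256(f"{obj}/{osd}".encode()).digest()
    value = int.from_bytes(digest[:8], "big") >> 12  # keep 52 bits
    return (value + 0.5) / 2**52


def place(obj: str, weights: dict, replicas: int) -> list:
    """Pick `replicas` distinct OSDs; higher weight wins more often."""
    scores = {osd: -w / math.log(_unit_hash(obj, osd))
              for osd, w in weights.items()}
    return sorted(scores, key=scores.get, reverse=True)[:replicas]


osds = {f"osd.{i}": 1.0 for i in range(6)}
objects = [f"obj-{i}" for i in range(1000)]
before = {o: place(o, osds, 3) for o in objects}

# Remove one OSD: only objects that were stored on it get a new mapping.
survivors = {name: w for name, w in osds.items() if name != "osd.5"}
after = {o: place(o, survivors, 3) for o in objects}
moved = [o for o in objects if before[o] != after[o]]
print(f"{len(moved)} of {len(objects)} objects remapped")
```

Any client that knows the OSD list and weights computes the same placement independently, which is the point of "no lookup".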
DATA PLACEMENT
[Figures: three diagrams of objects, drawn as bit patterns, placed across the RADOS cluster]
CACHE TIERING
TWO WAYS TO CACHE
■ Within each OSD
  ■ Combine SSD and HDD under each OSD
  ■ Make localized promote / demote decisions
  ■ Leverage existing tools
    ■ dm-cache, bcache, flashcache
    ■ Variety of caching controllers
  ■ We can help with hints
[Diagram: an OSD on a file system (FS) on a BLOCKDEV that combines two DISKs]
[Diagram: with dm-cache, the OSD's FS sits on a BLOCKDEV assembled from Data, Cache, and Metadata devices]
TWO WAYS TO CACHE
■ Cache on separate devices / nodes
  ■ Different hardware for devices / nodes
    ■ Slow nodes for cold data
    ■ High performance nodes for hot data
  ■ Add, remove, scale each tier independently
    ■ Unlikely to choose right ratios at procurement time
[Diagram: an OSD on a file system (FS) on a BLOCKDEV backed by a single DISK]
TIERED STORAGE
[Diagram: the APPLICATION talks to RADOS, where a CACHE POOL (REPLICATED) sits in front of a BACKING POOL (ERASURE CODED)]
RADOS TIERING PRINCIPLES
■ Each tier is a RADOS pool
  ■ Replicated or erasure coded
■ Tiers are durable
  ■ Replicate across OSDs in multiple hosts
■ Each tier has its own CRUSH policy
  ■ Map to SSD devices / hosts only
■ librados clients adapt to tiering topology
  ■ Transparently direct requests accordingly
  ■ No changes to RBD, RGW, CephFS, etc.
[Diagram: the Client's Objecter sends requests to the CACHE TIER, which holds the promotion logic and the tiering agent and sits in front of the BASE TIER in RADOS]
I/O PATTERN
CACHE TIERING
WRITE HIT
[Diagram: APPLICATION on RADOS, with a CACHE POOL (SSD) in WRITEBACK mode in front of a BACKING POOL (HDD)]
■ WRITE: the application writes into the cache pool
■ ACK: the cache pool acknowledges the write
WRITE MISS
■ WRITE: the application writes to the cache pool
■ PROMOTE: the object is promoted from the backing pool into the cache pool
■ ACK: the cache pool acknowledges the write
PROXY WRITE
■ WRITE: the application writes to the cache pool
■ PROXY WRITE: the cache pool forwards the write to the backing pool
■ ACK: the write is acknowledged to the application
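The three write outcomes (hit, miss with promotion, proxy write) can be told apart in a small in-memory model. This is only a sketch: `TieredPool`, `patch`, and the `promote` flag are hypothetical, and the decision between promoting and proxying is reduced to a boolean here rather than derived from access history.

```python
def patch(old: bytes, offset: int, data: bytes) -> bytes:
    """Return `old` with `data` written at `offset`, zero-filling any gap."""
    padded = old.ljust(offset, b"\0")
    return padded[:offset] + data + padded[offset + len(data):]


class TieredPool:
    """Toy writeback cache pool in front of a backing pool."""

    def __init__(self, base: dict):
        self.base = base    # backing pool: object name -> bytes
        self.cache = {}     # cache pool: object name -> bytes
        self.dirty = set()  # cached objects not yet flushed
        self.log = []

    def write(self, name: str, offset: int, data: bytes, promote: bool = True) -> str:
        if name in self.cache:
            self.log.append("write hit")
        elif promote:
            # Write miss: copy the object up before modifying it.
            self.cache[name] = self.base.get(name, b"")
            self.log.append("write miss: promote")
        else:
            # Proxy write: apply the write in the backing pool, cache nothing.
            self.base[name] = patch(self.base.get(name, b""), offset, data)
            self.log.append("proxy write")
            return "ack"
        self.cache[name] = patch(self.cache[name], offset, data)
        self.dirty.add(name)
        return "ack"


pool = TieredPool(base={"octopus": b"hello world"})
pool.write("octopus", 6, b"there")             # miss: promote, then modify
pool.write("octopus", 0, b"J")                 # hit: modify the cached copy
pool.write("squid", 0, b"ink", promote=False)  # proxied to the backing pool
print(pool.log)
```

Note that after the first two writes the backing pool still holds the old `octopus` contents; the cached copy is dirty until it is flushed.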
READ: CACHE HIT
[Diagram: APPLICATION on RADOS, with a CACHE POOL (SSD) in WRITEBACK mode in front of a BACKING POOL (HDD)]
■ READ: the application reads from the cache pool
■ READ REPLY: the cache pool returns the data
READ: CACHE MISS
■ READ: the application reads from the cache pool
■ PROMOTE: the object is promoted from the backing pool into the cache pool
■ READ REPLY: the cache pool returns the data
READFORWARD
[Diagram: APPLICATION on RADOS, with a CACHE POOL (SSD) in front of a BACKING POOL (HDD)]
■ READ: the application sends the read to the cache pool
■ REDIRECT: the cache pool redirects the application to the backing pool
■ READ: the application reads from the backing pool
■ READ REPLY: the backing pool returns the data
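The read outcomes (cache hit, miss with promotion, and redirect in readforward mode) can be sketched the same way. The `CacheTier` class and `client_read` helper are hypothetical names for illustration, and the model ignores dirty data and object sizes.

```python
class CacheTier:
    """Toy cache pool in front of a backing pool (a plain dict)."""

    def __init__(self, base: dict, mode: str = "writeback"):
        self.base = base
        self.cache = {}
        self.mode = mode

    def read(self, name: str):
        if name in self.cache:
            return self.cache[name], "cache hit"
        if self.mode == "readforward":
            # Do not promote; tell the client to ask the backing pool.
            return None, "redirect"
        # Cache miss in writeback mode: promote, then serve from the cache.
        self.cache[name] = self.base[name]
        return self.cache[name], "promote"


def client_read(tier: CacheTier, name: str):
    """Client side: follow a redirect by reading the backing pool directly."""
    data, outcome = tier.read(name)
    if outcome == "redirect":
        data = tier.base[name]
    return data, outcome


base = {"octopus": b"water"}
writeback = CacheTier(base)
first = client_read(writeback, "octopus")
second = client_read(writeback, "octopus")

readforward = CacheTier(base, mode="readforward")
forwarded = client_read(readforward, "octopus")
```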
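FLUSH AND EVICT
CACHE TIERING
[Diagram: APPLICATION on RADOS, with a CACHE POOL (SSD) in WRITEBACK mode in front of a BACKING POOL (HDD)]
FLUSH AND/OR EVICT COLD DATA
■ FLUSH: cold data is written from the cache pool to the backing pool
■ ACK: the backing pool acknowledges the flush
■ EVICT: cold data is removed from the cache pool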
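A minimal sketch of a flush-and-evict pass, assuming a simple least-recently-used ordering as the definition of "cold" and an object-count limit as the trigger. The `CachePool` class and `agent_pass` method are hypothetical; the real tiering agent uses richer statistics and thresholds.

```python
from collections import OrderedDict


class CachePool:
    """Toy cache pool that tracks dirty objects and recency of use."""

    def __init__(self, base: dict, target_max_objects: int):
        self.base = base
        self.target_max_objects = target_max_objects
        self.objects = OrderedDict()  # name -> data, coldest first
        self.dirty = set()

    def write(self, name: str, data: bytes) -> None:
        self.objects[name] = data
        self.objects.move_to_end(name)  # most recently used is hottest
        self.dirty.add(name)

    def flush(self, name: str) -> None:
        """Write a dirty object back to the backing pool; it stays cached."""
        self.base[name] = self.objects[name]
        self.dirty.discard(name)

    def evict(self, name: str) -> None:
        """Drop a clean object from the cache pool."""
        if name in self.dirty:
            raise ValueError("flush before evicting a dirty object")
        del self.objects[name]

    def agent_pass(self) -> list:
        """Flush and evict the coldest objects until within the target."""
        evicted = []
        while len(self.objects) > self.target_max_objects:
            coldest = next(iter(self.objects))
            if coldest in self.dirty:
                self.flush(coldest)
            self.evict(coldest)
            evicted.append(coldest)
        return evicted


base = {}
pool = CachePool(base, target_max_objects=2)
pool.write("a", b"1")
pool.write("b", b"2")
pool.write("c", b"3")
pool.write("a", b"1+")  # "a" becomes the hottest object again
evicted = pool.agent_pass()
```

Dirty objects must be flushed before eviction, otherwise the only copy of the newest data would be lost.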
ERASURE CODING
REPLICATED POOL (OBJECT stored as COPY, COPY, COPY)
■ Full copy of stored objects
■ Very high durability
■ 3x (200% overhead)
■ Quick recovery
ERASURE CODED POOL (OBJECT stored as chunks 1 2 3 4 5 6)
■ One copy plus parity
■ Cost-effective durability
■ 1.5x (50% overhead)
■ Expensive recovery
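The overhead figures above follow from simple ratios. The sketch below assumes the erasure coded example uses k=4 data chunks and m=2 coding chunks, which is consistent with six chunks and the 1.5x figure but is not stated in the text; the function names are hypothetical.

```python
def replication_overhead(copies: int) -> tuple:
    """Return (raw/usable ratio, overhead fraction) for N full copies."""
    return float(copies), float(copies - 1)


def erasure_overhead(k: int, m: int) -> tuple:
    """Return (raw/usable ratio, overhead fraction) for k data + m coding chunks."""
    return (k + m) / k, m / k


print(replication_overhead(3))  # 3 copies
print(erasure_overhead(4, 2))   # assumed k=4, m=2
```

Both layouts in this comparison survive the loss of any two OSDs; the erasure coded pool pays for the lower raw capacity with more work during recovery, since a lost chunk has to be recomputed from k surviving chunks.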
ERASURE CODING
[Diagram sequence: an ERASURE CODED POOL in RADOS spans six OSDs (1 to 6). An OBJECT is split into DATA CHUNKS, CODING CHUNKS are computed from them, and the chunks are stored on the OSDs.]
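Splitting an object into data chunks and deriving a coding chunk can be sketched with the simplest possible code: k data chunks plus a single XOR parity chunk (m=1). This is an illustration under that assumption, not the code Ceph's erasure code plugins use; with m greater than 1 a Reed-Solomon style code is needed instead. The `encode` helper is hypothetical.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length chunks."""
    return bytes(x ^ y for x, y in zip(a, b))


def encode(obj: bytes, k: int) -> list:
    """Split `obj` into k equal data chunks and append one XOR coding chunk."""
    chunk_len = -(-len(obj) // k)  # ceiling division
    padded = obj.ljust(chunk_len * k, b"\0")
    data = [padded[i * chunk_len:(i + 1) * chunk_len] for i in range(k)]
    return data + [reduce(xor_bytes, data)]


chunks = encode(b"cache tiering", k=3)
# Each chunk would be stored on a different OSD (names are placeholders).
placement = dict(zip(["osd.1", "osd.2", "osd.3", "osd.4"], chunks))
```

The original object is recovered by concatenating the data chunks and trimming to the stored object length.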
I/O PATTERN
ERASURE CODING
EC READ
[Diagram sequence: a CLIENT and an ERASURE CODED POOL in RADOS spanning OSDs 1 to 6]
■ READ: the CLIENT sends the read to one OSD in the pool
■ READS: that OSD reads the chunks from the other OSDs
■ READ REPLY: the OSD returns the data to the CLIENT
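On the read path, the OSD handling the request gathers chunks and reassembles the object; if one chunk is unavailable it can be rebuilt from the rest. The sketch below assumes a single XOR coding chunk (m=1) over k=3 data chunks, so it repairs at most one loss; `reconstruct` and `ec_read` are hypothetical names.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length chunks."""
    return bytes(x ^ y for x, y in zip(a, b))


def reconstruct(chunks: list) -> list:
    """Rebuild the single missing chunk (marked None) from the survivors."""
    missing = [i for i, c in enumerate(chunks) if c is None]
    if len(missing) > 1:
        raise ValueError("one XOR coding chunk can repair only one loss")
    repaired = list(chunks)
    if missing:
        survivors = [c for c in chunks if c is not None]
        repaired[missing[0]] = reduce(xor_bytes, survivors)
    return repaired


def ec_read(chunks: list, k: int, length: int) -> bytes:
    """Reassemble the object from its k data chunks, repairing if needed."""
    return b"".join(reconstruct(chunks)[:k])[:length]


data = [b"cache", b" tier", b"ing\0\0"]
stored = data + [reduce(xor_bytes, data)]  # three data chunks + one coding chunk

healthy = ec_read(stored, k=3, length=13)
degraded = ec_read([stored[0], None, stored[2], stored[3]], k=3, length=13)
```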
EC WRITE
ERASURE CODING
[Diagram sequence: a CLIENT and an ERASURE CODED POOL in RADOS spanning OSDs 1 to 6]
■ WRITE: the CLIENT sends the write to one OSD in the pool
■ WRITES: that OSD sends the chunk writes to the other OSDs
■ WRITE ACK: the write is acknowledged to the CLIENT
EC WRITE: DEGRADED
ERASURE CODING
[Diagram: a CLIENT and an ERASURE CODED POOL in RADOS spanning OSDs 1 to 6, with the pool in a degraded state]
■ WRITE: the CLIENT sends the write to one OSD in the pool
■ WRITES: that OSD sends the chunk writes to the other OSDs
EC WRITE: PARTIAL FAILURE
ERASURE CODING
[Diagram sequence: a CLIENT and an ERASURE CODED POOL in RADOS spanning OSDs 1 to 6]
■ WRITE: the CLIENT sends the write to one OSD in the pool
■ WRITES: the chunk writes reach only some of the OSDs
■ Resulting chunk versions across the six OSDs: B B B A A A
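The B B B A A A state above is a problem because neither version may have enough chunks left to be reconstructed on its own. One way to handle this, sketched here as an assumption rather than a description of Ceph's internals, is for each shard to keep the previous version until the write has been applied everywhere, and to roll back otherwise. `Shard` and `ec_write` are hypothetical names, and version labels stand in for chunk data.

```python
class Shard:
    """Toy EC shard that keeps the previous version until a write commits."""

    def __init__(self, value: str):
        self.value = value
        self.previous = None

    def apply(self, value: str) -> None:
        self.previous, self.value = self.value, value

    def commit(self) -> None:
        self.previous = None

    def rollback(self) -> None:
        if self.previous is not None:
            self.value, self.previous = self.previous, None


def ec_write(shards: list, new_value: str, reachable: set) -> str:
    """Apply a write to reachable shards; roll back unless all applied it."""
    applied = [s for i, s in enumerate(shards) if i in reachable]
    for shard in applied:
        shard.apply(new_value)
    if len(applied) == len(shards):
        for shard in applied:
            shard.commit()
        return "committed"
    for shard in applied:
        shard.rollback()
    return "rolled back"


shards = [Shard("A") for _ in range(6)]
partial = ec_write(shards, "B", reachable={0, 1, 2})  # B B B A A A, then undone
after_partial = "".join(s.value for s in shards)
full = ec_write(shards, "B", reachable=set(range(6)))
after_full = "".join(s.value for s in shards)
```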
CONFIGURATION EXAMPLE
# Create pools
sudo ceph osd erasure-code-profile set myecprofile ruleset-failure-domain=osd k=3 m=1
sudo ceph osd pool create myecpool 12 12 erasure myecprofile
sudo ceph osd pool create mycache 64 64
sudo ceph osd pool set mycache crush_ruleset 3
# Set up a read/write cache pool mycache for pool myecpool
sudo ceph osd tier add myecpool mycache
sudo ceph osd tier cache-mode mycache writeback
sudo ceph osd tier set-overlay myecpool mycache
# Configure hit sets and target sizes for the tiering agent
sudo ceph osd pool set mycache hit_set_type bloom
sudo ceph osd pool set mycache hit_set_count 1
sudo ceph osd pool set mycache hit_set_period 3600
sudo ceph osd pool set mycache target_max_objects 250
sudo ceph osd pool set mycache target_max_bytes 1000000000000 # 1 TB
sudo ceph osd pool set mycache min_read_recency_for_promote 1
sudo ceph osd pool set mycache min_write_recency_for_promote 1
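The `hit_set_type bloom` and `min_*_recency_for_promote` settings above work together: a hit set records which objects were accessed during a period, and the recency setting decides whether a missed object is worth promoting. A minimal sketch of the idea, assuming a plain Bloom filter and covering only recency values 0 and 1 (larger values consult older hit sets and are not modeled); `BloomHitSet` and `should_promote` are hypothetical names.

```python
import hashlib


class BloomHitSet:
    """Toy Bloom filter recording which object names were accessed."""

    def __init__(self, bits: int = 1024, hashes: int = 3):
        self.bits = bits  # must be a multiple of 8
        self.hashes = hashes
        self.array = bytearray(bits // 8)

    def _positions(self, name: str):
        for seed in range(self.hashes):
            digest = hashlib.sha256(f"{seed}:{name}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.bits

    def insert(self, name: str) -> None:
        for pos in self._positions(name):
            self.array[pos // 8] |= 1 << (pos % 8)

    def contains(self, name: str) -> bool:
        # May return a false positive, never a false negative.
        return all(self.array[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(name))


def should_promote(name: str, current: BloomHitSet, min_recency: int) -> bool:
    """Recency 0 always promotes; recency 1 requires a hit in the current set."""
    if min_recency == 0:
        return True
    return current.contains(name)


current = BloomHitSet()
current.insert("octopus")  # "octopus" was accessed during this period
```

A Bloom filter keeps the memory cost per hit set fixed regardless of object names, at the price of occasional false positives, which here only cause an unnecessary promotion.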
CONFIGURATION EXAMPLE
# CRUSH rule
root ssd {
    id -6
    # weight 8.000
    alg straw
    hash 0 # rjenkins1
    item octopus01-ssd weight 1.000
    item octopus02-ssd weight 1.000
    item octopus03-ssd weight 1.000
}
rule cacher {
    ruleset 3
    type replicated
    min_size 3
    max_size 10
    step take ssd
    step chooseleaf firstn 0 type host
    step emit
}
CONTRIBUTION
http://docs.ceph.com/docs/master/dev/
IRC AND MAILING LIST
http://ceph.com/resources/mailing-list-irc/
BUG REPORT
http://tracker.ceph.com/projects/ceph/issues/
BENCHMARKING
Cache Tiering
http://www.flashmemorysummit.com/English/Collaterals/Proceedings/2015/20150813_S303E_Zhang.pdf
Erasure Coding
http://www.flashmemorysummit.com/English/Collaterals/Proceedings/2015/20150813_S303E_Roy.pdf
THANK YOU!
Shinobu Kinjo
Red Hat
shinobu@redhat.com

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 

Recently uploaded (20)

How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 

Cache Tiering and Erasure Coding

  • 1. RED HAT CONFIDENTIAL | NDA ONLY CACHE TIERING AND ERASURE CODING #ceph-devel shinobu
  • 2. AGENDA ■ CEPH MOTIVATING PRINCIPLES ■ CEPH COMPONENTS ■ ARCHITECTURE COMPONENTS ■ RADOS ■ LIBRADOS ■ RADOS COMPONENTS ■ DATA PLACEMENT ■ CACHE TIERING ■ ERASURE CODING
  • 3. CEPH MOTIVATING PRINCIPLES ■ All components must scale horizontally ■ There can be no single point of failure ■ The solution must be hardware agnostic ■ Should use commodity hardware ■ Self-manage whenever possible ■ Open source (LGPL) ■ Move beyond legacy approaches ■ Client / cluster instead of client / server ■ Ad hoc HA
  • 4. CEPH COMPONENTS ■ RADOS: a software-based, reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes and lightweight monitors ■ LIBRADOS: a library allowing apps to directly access RADOS (C, C++, Java, Python, Ruby, PHP) ■ RGW: a web services gateway for object storage, compatible with S3 and Swift ■ RBD: a reliable, fully distributed block device with cloud platform integration ■ CephFS: a distributed file system with POSIX semantics and scale-out metadata management ■ APP, HOST/VM, CLIENT
  • 5. ARCHITECTURE COMPONENTS ■ RADOS: a software-based, reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes and lightweight monitors ■ LIBRADOS: a library allowing apps to directly access RADOS (C, C++, Java, Python, Ruby, PHP) ■ RGW: a web services gateway for object storage, compatible with S3 and Swift ■ RBD: a reliable, fully distributed block device with cloud platform integration ■ CephFS: a distributed file system with POSIX semantics and scale-out metadata management ■ APP, HOST/VM, CLIENT
  • 6. THE RADOS GATEWAY ■ APPLICATION, RADOSGW, LIBRADOS (×2) ■ RADOS CLUSTER (M M M)
  • 7. ARCHITECTURE COMPONENTS ■ RADOS: a software-based, reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes and lightweight monitors ■ LIBRADOS: a library allowing apps to directly access RADOS (C, C++, Java, Python, Ruby, PHP) ■ RGW: a web services gateway for object storage, compatible with S3 and Swift ■ RBD: a reliable, fully distributed block device with cloud platform integration ■ CephFS: a distributed file system with POSIX semantics and scale-out metadata management ■ APP, HOST/VM, CLIENT
  • 8. STORING VIRTUAL DISK: LIBRBD ■ VM ■ HYPERVISOR, LIBRBD ■ RADOS CLUSTER (M M)
  • 9. KERNEL MODULE: KRBD ■ LINUX HOST, KRBD ■ RADOS CLUSTER (M M)
  • 10. RBD FEATURES ■ Stripe images across entire cluster (pool) ■ Read-only snapshots ■ Copy-on-write clones ■ Broad integration ■ QEMU ■ Linux kernel ■ iSCSI (STGT, LIO) ■ OpenStack, CloudStack, Nebula, Ganeti, Proxmox ■ Incremental backup (relative to snapshot)
  • 11. RBD FEATURES ■ Image mirroring ■ Asynchronous replication to another cluster ■ Replica(s) crash consistent ■ Replication is per-image ■ Each image has a data journal ■ RBD mirror daemon does the work ■ CLUSTER A: HYPERVISOR, LIBRBD, Journal ■ CLUSTER B: HYPERVISOR, LIBRBD, rbd-mirror
  • 12. ARCHITECTURE COMPONENTS ■ RADOS: a software-based, reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes and lightweight monitors ■ LIBRADOS: a library allowing apps to directly access RADOS (C, C++, Java, Python, Ruby, PHP) ■ RGW: a web services gateway for object storage, compatible with S3 and Swift ■ RBD: a reliable, fully distributed block device with cloud platform integration ■ CephFS: a distributed file system with POSIX semantics and scale-out metadata management ■ APP, HOST/VM, CLIENT
  • 13. SEPARATE METADATA SERVER ■ LINUX HOST, KERNEL MODULE ■ metadata, data (01 10) ■ RADOS CLUSTER (M M M)
  • 14. SCALABLE METADATA SERVERS ■ MDS ■ Manages metadata for a POSIX-compliant shared filesystem ■ Directory hierarchy ■ File metadata (owner, timestamps, mode, etc.) ■ Snapshots on any directory ■ Clients stripe file data in RADOS ■ MDS not in data path ■ MDS stores metadata in RADOS ■ Dynamic MDS cluster scales to 10s or 100s ■ Only required for shared file system
  • 15. LIBRADOS ■ RADOS: a software-based, reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes and lightweight monitors ■ RGW: a web services gateway for object storage, compatible with S3 and Swift ■ RBD: a reliable, fully distributed block device with cloud platform integration ■ CephFS: a distributed file system with POSIX semantics and scale-out metadata management ■ APP, HOST/VM, CLIENT ■ LIBRADOS: a library allowing apps to directly access RADOS (C, C++, Java, Python, Ruby, PHP)
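  • 16. LIBRADOS API
      #include <rados/librados.hpp>

      // Connect to the cluster as client.admin
      librados::IoCtx io_ctx;
      librados::Rados rados;
      rados.init("admin");
      rados.conf_read_file(nullptr);  // not on the slide: load the default ceph.conf
      rados.connect();

      // Create a pool and open an I/O context on it
      rados.pool_create("swimming_pool");
      rados.ioctx_create("swimming_pool", io_ctx);

      // Synchronous full-object write
      librados::bufferlist bl;
      bl.append("water");
      io_ctx.write_full("octopus", bl);

      // Asynchronous read into rbl
      librados::bufferlist rbl;
      librados::AioCompletion *read_completion1 = librados::Rados::aio_create_completion();
      io_ctx.aio_read("octopus", read_completion1, &rbl, 4193404, 0);
      read_completion1->wait_for_complete();  // the slide used wait_for_safe()
      read_completion1->get_return_value();
      read_completion1->release();

      // Compound write operation that sets an extended attribute
      librados::ObjectWriteOperation write_op;
      librados::bufferlist xbl;
      xbl.append('2');
      write_op.setxattr("version", xbl);
      io_ctx.operate("octopus", &write_op);  // not on the slide: apply the operation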
  • 17. RADOS ■ LIBRADOS: a library allowing apps to directly access RADOS (C, C++, Java, Python, Ruby, PHP) ■ RGW: a web services gateway for object storage, compatible with S3 and Swift ■ RBD: a reliable, fully distributed block device with cloud platform integration ■ CephFS: a distributed file system with POSIX semantics and scale-out metadata management ■ APP, HOST/VM, CLIENT ■ RADOS: a software-based, reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes and lightweight monitors
  • 18. RADOS COMPONENTS ■ OSD: ■ 10s to 1000s in a cluster ■ One per disk (or one per SSD, RAID group…) ■ Serve stored objects to clients ■ Intelligently peer for replication & recovery
  • 19. OBJECT STORAGE DAEMON ■ RADOS (M M M) ■ OSD, FS, DISK (×4)
  • 20. RADOS COMPONENTS ■ MON: ■ Maintain cluster membership and state ■ Provide consensus for distributed decision-making ■ Small, odd number (e.g., 5) ■ Not part of data path
  • 21. CRUSH ■ Pseudo-random placement algorithm ■ Fast calculation, no lookup ■ Repeatable, deterministic ■ Statistically uniform distribution ■ Stable mapping ■ Limited data migration on change ■ Rule-based configuration ■ Infrastructure topology aware ■ Adjustable replication ■ Weighting
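
The placement properties listed on the CRUSH slide (computed rather than looked up, repeatable, limited data migration on change) can be illustrated with a much simpler stand-in. The sketch below uses highest-random-weight (rendezvous) hashing, which is not the CRUSH algorithm and ignores topology, rules and weights. The function names and the `osd.N` labels are assumptions made for this illustration.

```python
import hashlib


def score(obj_name, osd):
    """Deterministic pseudo-random score for an (object, OSD) pair."""
    digest = hashlib.sha256(f"{obj_name}:{osd}".encode()).digest()
    return int.from_bytes(digest[:8], "big")


def place(obj_name, osds, replicas=3):
    """Return the `replicas` OSDs with the highest score for this object.

    Any client holding the same OSD list computes the same answer,
    so no central lookup table is needed.
    """
    ranked = sorted(osds, key=lambda osd: score(obj_name, osd), reverse=True)
    return ranked[:replicas]


if __name__ == "__main__":
    osds = [f"osd.{i}" for i in range(6)]
    print(place("octopus", osds))
    # Remove one OSD: only objects that lived on it get a new location.
    survivors = [o for o in osds if o != "osd.3"]
    print(place("octopus", survivors))
```

When an OSD disappears, an object keeps its surviving replicas and gains one new location, which is the "limited data migration on change" behaviour in miniature.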
  • 22. DATA PLACEMENT
  • 23. DATA PLACEMENT (diagram: objects placed across RADOS)
  • 24. DATA PLACEMENT (diagram: objects placed across RADOS)
  • 25. DATA PLACEMENT (diagram: objects placed across RADOS)
  • 26. CACHE TIERING
  • 27. TWO WAYS TO CACHE
  • 28. TWO WAYS TO CACHE ■ Within each OSD ■ Combine SSD and HDD under each OSD ■ Make localized promote / demote decisions ■ Leverage existing tools ■ dm-cache, bcache, flashcache ■ Variety of caching controllers ■ We can help with hints ■ Diagram: OSD, FS, BLOCKDEV, DISK, DISK
  • 29. TWO WAYS TO CACHE ■ Diagram: OSD, FS, BLOCKDEV, dm-cache (Data, Cache, Metadata)
  • 30. TWO WAYS TO CACHE ■ Cache on separate devices / nodes ■ Different hardware for devices / nodes ■ Slow nodes for cold data ■ High performance nodes for hot data ■ Add, remove, scale each tier independently ■ Unlikely to choose right ratios at procurement time ■ Diagram: OSD, FS, BLOCKDEV, DISK
  • 31. TIERED STORAGE ■ APPLICATION ■ RADOS ■ CACHE POOL (REPLICATED) ■ BACKING POOL (ERASURE CODED)
  • 32. RADOS TIERING PRINCIPLES ■ Each tier is a RADOS pool ■ Replicated or erasure coded ■ Tiers are durable ■ Replicate across OSDs in multiple hosts ■ Each tier has its own CRUSH policy ■ Map to SSD devices / hosts only ■ librados clients adapt to tiering topology ■ Transparently direct requests accordingly ■ No changes to RBD, RGW, CephFS, etc. ■ Diagram: Client (Objecter), RADOS, CACHE TIER (Promotion logic, Tiering agent), BASE TIER
  • 33. CACHE TIERING: I/O PATTERN
  • 34. CACHE TIERING: WRITE HIT
  • 35. WRITE INTO CACHE POOL ■ APPLICATION ■ RADOS ■ CACHE POOL (SSD): WRITEBACK ■ BACKING POOL (HDD) ■ Messages: WRITE, ACK
  • 36. CACHE TIERING: WRITE MISS
  • 37. WRITE MISS ■ APPLICATION ■ RADOS ■ CACHE POOL (SSD): WRITEBACK ■ BACKING POOL (HDD) ■ Messages: WRITE, PROMOTE, ACK
  • 38. CACHE TIERING: PROXY WRITE
  • 39. PROXY WRITE ■ APPLICATION ■ RADOS ■ CACHE POOL (SSD): WRITEBACK ■ BACKING POOL (HDD) ■ Messages: WRITE, PROXY WRITE, ACK
  • 40. CACHE TIERING: READ: CACHE HIT
  • 41. READ: CACHE HIT ■ APPLICATION ■ RADOS ■ CACHE POOL (SSD): WRITEBACK ■ BACKING POOL (HDD) ■ Messages: READ, READ REPLY
  • 42. CACHE TIERING: READ: CACHE MISS
  • 43. READ: CACHE MISS ■ APPLICATION ■ RADOS ■ CACHE POOL (SSD): WRITEBACK ■ BACKING POOL (HDD) ■ Messages: READ, READ REPLY, PROMOTE
  • 44. CACHE TIERING: READFORWARD
  • 45. READFORWARD ■ APPLICATION ■ RADOS ■ CACHE POOL (SSD) ■ BACKING POOL (HDD) ■ Messages: READ, REDIRECT, READ, READ REPLY
  • 46. CACHE TIERING: FLUSH AND EVICT
  • 47. FLUSH AND/OR EVICT COLD DATA ■ APPLICATION ■ RADOS ■ CACHE POOL (SSD): WRITEBACK ■ BACKING POOL (HDD) ■ Messages: FLUSH, ACK, EVICT
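
The writeback I/O patterns above (write into the cache pool, promote on a read miss, flush dirty objects, evict cold ones) can be walked through with a toy model. This is a sketch under simplifying assumptions, not Ceph's tiering agent: objects are whole values in Python dicts, recency is plain insertion order, and the class and method names are invented for the illustration. Only `target_max_objects` borrows its name from the configuration slide.

```python
class TieredPool:
    """Toy writeback cache tier in front of a backing pool."""

    def __init__(self, target_max_objects):
        self.cache = {}     # cache pool: name -> data, oldest entry first
        self.dirty = set()  # names written in the cache but not yet flushed
        self.base = {}      # backing pool
        self.target_max_objects = target_max_objects

    def write(self, name, data):
        # Write hit or miss: a full-object write lands in the cache pool
        # and is acknowledged from there.
        self.cache.pop(name, None)
        self.cache[name] = data
        self.dirty.add(name)
        self._agent()
        return "ack"

    def read(self, name):
        if name in self.cache:          # read: cache hit
            data = self.cache.pop(name)
            self.cache[name] = data     # refresh recency
            return data
        data = self.base[name]          # read: cache miss
        self.cache[name] = data         # promote a clean copy
        self._agent()
        return data

    def _agent(self):
        # Flush and/or evict the coldest objects until under the target.
        while len(self.cache) > self.target_max_objects:
            coldest = next(iter(self.cache))
            if coldest in self.dirty:
                self.base[coldest] = self.cache[coldest]  # flush
                self.dirty.discard(coldest)
            del self.cache[coldest]                       # evict


if __name__ == "__main__":
    pool = TieredPool(target_max_objects=2)
    for name in ("a", "b", "c"):
        pool.write(name, name.encode())
    print(sorted(pool.cache), sorted(pool.base))
```

The proxy write and readforward modes would bypass the cache dict and go straight to `base`; they are left out to keep the sketch short.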
  • 48. ERASURE CODING
  • 49. ERASURE CODING ■ RADOS REPLICATED POOL (OBJECT stored as COPY, COPY, COPY) ■ Full copy of stored objects ■ Very high durability ■ 3x (200% overhead) ■ Quick recovery ■ RADOS ERASURE CODED POOL (OBJECT stored as chunks 1 to 6) ■ One copy plus parity ■ Cost-effective durability ■ 1.5x (50% overhead) ■ Expensive recovery
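
The 3x and 1.5x figures follow from simple arithmetic. A replicated pool with n copies stores n bytes per byte of data, while an erasure coded pool with k data chunks and m coding chunks stores (k + m) / k. The slide does not state k and m; the sketch assumes k=4 and m=2, which matches the six chunks drawn and the 1.5x figure. The function names are illustrative.

```python
def replication_factor(copies):
    """Raw bytes stored per byte of user data in a replicated pool."""
    return float(copies)


def erasure_factor(k, m):
    """Raw bytes stored per byte of user data with k data and m coding chunks."""
    return (k + m) / k


def overhead_percent(factor):
    """Extra raw capacity beyond the user data, as a percentage."""
    return (factor - 1) * 100


if __name__ == "__main__":
    print(replication_factor(3), overhead_percent(replication_factor(3)))
    print(erasure_factor(4, 2), overhead_percent(erasure_factor(4, 2)))  # assumed k=4, m=2
```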
  • 50. ERASURE CODING ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6
  • 51. ERASURE CODING: DATA CHUNKS ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6
  • 52. ERASURE CODING: CODING CHUNKS ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6
  • 53. ERASURE CODING ■ OBJECT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6
  • 54. ERASURE CODING: I/O PATTERN
  • 55. ERASURE CODING: EC READ
  • 56. EC READ ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: READ
  • 57. EC READ ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: READ, READS
  • 58. EC READ ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: READ REPLY
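
To make the data chunk and coding chunk idea concrete, here is a minimal sketch of the simplest possible erasure code: k data chunks plus a single XOR parity chunk (m=1). It is a stand-in chosen for brevity. Real erasure coded pools use stronger codes such as Reed-Solomon that tolerate more than one lost chunk, and the function names here are invented for the example.

```python
def xor_bytes(left, right):
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(a ^ b for a, b in zip(left, right))


def encode(data, k):
    """Split data into k equal data chunks and append one parity chunk."""
    chunk_len = -(-len(data) // k)                # ceiling division
    padded = data.ljust(chunk_len * k, b"\0")     # pad the last chunk
    chunks = [padded[i * chunk_len:(i + 1) * chunk_len] for i in range(k)]
    parity = bytes(chunk_len)
    for chunk in chunks:
        parity = xor_bytes(parity, chunk)
    return chunks + [parity]


def decode(chunks, k, length):
    """Rebuild the object; at most one chunk may be missing (None)."""
    missing = [i for i, chunk in enumerate(chunks) if chunk is None]
    if len(missing) > 1:
        raise ValueError("single parity can only repair one lost chunk")
    chunks = list(chunks)
    if missing:
        present = [chunk for chunk in chunks if chunk is not None]
        rebuilt = bytes(len(present[0]))
        for chunk in present:
            rebuilt = xor_bytes(rebuilt, chunk)   # XOR of the rest restores it
        chunks[missing[0]] = rebuilt
    return b"".join(chunks[:k])[:length]


if __name__ == "__main__":
    payload = b"hello erasure coding"
    shards = encode(payload, k=3)
    shards[1] = None                              # simulate a failed OSD
    print(decode(shards, k=3, length=len(payload)))
```

A healthy read only needs the k data chunks; the parity chunk is touched when a chunk is unavailable, which is one reason recovery is described as expensive compared with replication.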
  • 59. ERASURE CODING: EC WRITE
  • 60. EC WRITE ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: WRITE
  • 61. EC WRITE ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: WRITE, WRITES
  • 62. EC WRITE ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: WRITE ACK
  • 63. ERASURE CODING: EC WRITE: DEGRADED
  • 64. EC WRITE: DEGRADED ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: WRITE, WRITES
  • 65. ERASURE CODING: EC WRITE: PARTIAL FAILURE
  • 66. EC WRITE: PARTIAL FAILURE ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: WRITE, WRITES
  • 67. EC WRITE: PARTIAL FAILURE ■ CLIENT ■ RADOS ERASURE CODED POOL ■ OSD 1, OSD 2, OSD 3, OSD 4, OSD 5, OSD 6 ■ Messages: WRITES ■ Chunk versions: B B B A A A
  • 68. CONFIGURATION EXAMPLE
      # Create pools
      sudo ceph osd erasure-code-profile set myecprofile ruleset-failure-domain=osd k=3 m=1
      sudo ceph osd pool create myecpool 12 12 erasure myecprofile
      sudo ceph osd pool create mycache 64 64
      sudo ceph osd pool set mycache crush_ruleset 3

      # Set up a read/write cache pool mycache for pool myecpool
      sudo ceph osd tier add myecpool mycache
      sudo ceph osd tier cache-mode mycache writeback
      sudo ceph osd tier set-overlay myecpool mycache

      # Set the target size and enable the tiering agent
      # (the slide named the pool foo-hot in the last three commands; mycache is used throughout here)
      sudo ceph osd pool set mycache hit_set_type bloom
      sudo ceph osd pool set mycache hit_set_count 1
      sudo ceph osd pool set mycache hit_set_period 3600
      sudo ceph osd pool set mycache target_max_objects 250
      sudo ceph osd pool set mycache target_max_bytes 1000000000000  # 1 TB
      sudo ceph osd pool set mycache min_read_recency_for_promote 1
      sudo ceph osd pool set mycache min_write_recency_for_promote 1

      # CRUSH rule
      root ssd {
          id -6
          # weight 8.000
          alg straw
          hash 0  # rjenkins1
          item octopus01-ssd weight 1.000
          item octopus02-ssd weight 1.000
          item octopus03-ssd weight 1.000
      }
      rule cacher {
          ruleset 3
          type replicated
          min_size 3
          max_size 10
          step take ssd
          step choose firstn 0 type host
          step emit
      }
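
The `hit_set_type bloom` setting selects a Bloom filter for recording which objects were accessed during each `hit_set_period`, so that recency checks before promotion stay compact. The sketch below shows the general technique only. It is not Ceph's hit set implementation, and the class name, bit count and hash construction are assumptions made for the illustration.

```python
import hashlib


class BloomHitSet:
    """Compact, approximate record of object names seen in one period."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, name):
        # Derive num_hashes bit positions from seeded SHA-256 digests.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{name}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def insert(self, name):
        """Record an access to the named object."""
        for pos in self._positions(name):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def contains(self, name):
        """True if the object may have been accessed; False means never."""
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(name))


if __name__ == "__main__":
    hits = BloomHitSet()
    hits.insert("octopus")
    print(hits.contains("octopus"), hits.contains("squid"))
```

A Bloom filter can report an object as seen when it was not (a false positive) but never the reverse, so the worst case is an occasional unnecessary promotion rather than a missed hot object.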
  • 69. CONFIGURATION EXAMPLE ■ CONTRIBUTION: http://docs.ceph.com/docs/master/dev/ ■ IRC AND MAILING LIST: http://ceph.com/resources/mailing-list-irc/ ■ BUG REPORT: http://tracker.ceph.com/projects/ceph/issues/ ■ BENCHMARKING ■ Cache Tiering: http://www.flashmemorysummit.com/English/Collaterals/Proceedings/2015/20150813_S303E_Zhang.pdf ■ Erasure Coding: http://www.flashmemorysummit.com/English/Collaterals/Proceedings/2015/20150813_S303E_Roy.pdf
  • 70. THANK YOU! ■ Shinobu Kinjo, Red Hat, shinobu@redhat.com