Scaling and High Performance Storage System: LeoFS

Rakuten Group, Inc.
Rakuten Group, Inc.Rakuten Group, Inc.
Scaling and High Performance
Storage System
Yosuke Hara - @yosukehara
A Researcher of R.I.T. and Tech Lead LeoFS
with Hiroki Matsue, LeoFS Support and Rakuten Software Engineer
LeoFS is "Unstructured Big Data Storage for the Web"
and a highly available, distributed, eventually consistent
storage system.
!
Organizations can use LeoFS to store lots of data
efficently, safely and inexpensively.
!
LeoFS was published as OSS
on July of 2012
leo-project.net/leofs
Overview
Brief Benchmark Report
Multi Data Center Replication
LeoFS Administration at Rakuten
Future Plans
“NFS” Support and more
Overview
Tokyo, Japan
The Lion of Storage Systems
HIGH Availability
HIGH Cost
Performance Ratio
HIGH
Scalability
LeoFS Non Stop
Velocity: Low Latency
Minimum Resources
Volume: Petabyte / Exabyte
Variety: Photo, Movie, Unstructured-data
3 Vs in 3 HIGHs
Metadata Object Storage
Storage Engine/Router
Monitor
GUI Console
( Erlang RPC)
LeoFS Overview
Storage
Manager
( Erlang RPC)
Gateway
( TCP/IP,SNMP )
Request from
Web Applications / Browsers
w/HTTP over REST-API / S3-API
Load Balancer
Keeping High Availability
Keeping High Performance
Easy Administration
Metadata Object Storage
Storage Engine/Router
Metadata Object Storage
Storage Engine/Router
Gateway
LeoFS Overview - Gateway
Stateless Proxy + Object Cache
REST-API / S3-API
Use Consistent Hashing
for decision of a primary node
[ Memory Cache, Disc Cache ]
StorageClusterGateway(s)Clients
HTTP Request and Response
Built in Object Cache Mechanism
Storage Cluster
Fast HTTP Server - Cowboy
API Handler
Object Cache Mechanism
Storage
Storage(StorageCluster)GatewayLeoFS Overview - Storage
Use "Consistent Hashing"
for Data Operation
in the Storage Cluster
Choosing Replica Target Node(s)
RING
2 ^ 128 (MD5)
# of replicas = 3
KEY = “bucket/leofs.key”
Hash = md5(Filename)
Secondary-1
Secondary-2
Primary Node
"P2P"
WRITE: Auto Replication
READ : Auto Repair of an Inconsistent Object with Async
Request From Gateway
LeoFS Overview - Storage
...
LeoFS Storage
Replicator
Recoverer
...
Storage Engine
StorageEngine,Metadata+ObjectStorageGateway
Storage consists of Object Storage and Metadata Storage
Includes Replicator and Recoverer for the eventual consistency
Metadata
Storage Object
Storage
LeoFS Overview - Storage - Data StructureMetadata
Storage
ObjectStorage
Robust and
High Performance
Necessary for GC
Offset Version
Time-
stampKey
<Metadata>
Checksum
for Sync
KeySize
Custom
Meta Size File Size
for retrieving an object
Footer (8B)
Checksum KeySize DataSize Offset Version
Time-
stamp
Key User-Meta Footer
Header (Metadata - Fixed length) Body (Variable Length)
User-Meta
Size
Actual
File
<Needle>
Super-block
Needle-1
Needle-2
Needle-3
<Object Container>
Needle-4
Needle-5
To Equalize Disk Usage in Every Storage Node
To Realize High I/O efficiency and High Availability
LeoFS Overview - Storage - Large Object Support
chunk-0
chunk-1
chunk-2
chunk-3
An Original Object’s Metadata
Original Object Name
Original Object Size
# of Chunks
Storage ClusterGatewayClient(s)
[ WRITE Operation ]
Chunked Objects
Every chunked object and
metadata are replicated
in the cluster
Manager
Storage Cluster
LeoFS Overview - Manager
Monitor
Operate
RING, Node State
status, suspend,
resume, detach,
whereis, ...
Gateway(s)
StorageClusterGateway(s)
Manager(s)
Operate LeoFS - Gateway and Storage Cluster
"RING Monitor" and "NodeState Monitor"
Brief
Benchmark
Report
Hokkaido, Japan
LeoFS kept in a stable performance
through the benchmark
Brief Benchmark Report
Bottleneck is Disk I/O
The cache mechanism contributed to reduce
network traffic between Gateway and Storage
Summary of the benchmark results
Brief Benchmark Report
1st Case:
Group of Value Ranges (HDD)
Storage:5, Gateway:1, Manager:2
R:W = 9:1
2nd Case:
Group of Value Ranges (HDD)
Storage:5, Gateway:1, Manager:2
R:W = 8:2
source: https://github.com/leo-project/notes/tree/master/leofs/benchmark/leofs/20140605/tests/1m_r9w1_240min
source: https://github.com/leo-project/notes/tree/master/leofs/benchmark/leofs/20140605/tests/1m_r8w2_120min
Brief Benchmark Report
CPU Intel(R) Xeon(R) CPU X5650 @ 2.67GHz * 2 (12 cores / 24 threads)
Memory 96GB
Disk HDD - 240GB RAID0
Network 10G-Ether
Server Spec - Gateway:
CPU Intel(R) Xeon(R) CPU X5650 @ 2.67GHz * 2 (12 cores / 24 threads)
Memory 96GB
Disk
HDD - 240GB RAID0 (System)
HDD - 2TB RAID0 (Data)
Network 10G-Ether
Server Spec - Storage x5:
Network 10Gbps
OS CentOS release 6.5 (Final)
Erlang OTP R16B03-1
LeoFS v1.0.2
Environment:
System Consistency Level: [ N:3, W:2, R:1, D:2 ]
Duration 4.0h
R:W 9:1
# of Concurrent Processes 64
# of Keys 100,000
Value Size
!
!
!
!
!
Benchmark Configuration:
Range (byte) Percentage
1024 10240 24%
10241 102400 30%
10241 819200 30%
819201 1572864 16%
Brief Benchmark Report - 1st Case (HDD / R:W=9:1)
source: https://github.com/leo-project/notes/tree/master/leofs/benchmark/leofs/20140601/tests/1m_r9w1_240min
50ms
Brief Benchmark Report - 1st Case (HDD / R:W=9:1)
50ms
1,500ops
No Errors
0
50,000
100,000
150,000
200,000
250,000
300,000
350,000
400,000
450,000
500,000
550,000
600,000
650,000
700,000
750,000
800,000
850,000
900,000
950,000
1,000,000
1,050,000
1,100,000
1,150,000
1,200,000
1,250,000
1,300,000
1,350,000
1,400,000
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
7500s
8000s
8500s
9000s
9500s
10000s
10500s
11000s
11500s
12000s
12500s
13000s
13500s
14000s
gateway rxbyt/s gateway txbyt/s
storage-1 rxbyt/s storage-1 txbyt/s
storage-2 rxbyt/s storage-2 txbyt/s
storage-3 rxbyt/s storage-3 txbyt/s
storage-4 rxbyt/s storage-4 txbyt/s
storage-5 rxbyt/s storage-5 txbyt/s
Brief Benchmark Report - 1st Case / Network Traffic
10.0Gbps
7.0Gbps
5.0Gbps
6.0Gbps
StorageGateway
60%
0.0
0.1
0.3
0.4
0.6
0.7
0.9
1.0
1.1
1.3
1.4
1.6
1.7
1.9
2.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
7500s
8000s
8500s
9000s
9500s
10000s
10500s
11000s
11500s
12000s
12500s
13000s
13500s
14000s
Memory Usage
CPU Load 5min
Brief Benchmark Report - 1st Case / Memory and CPU
1.0
0
10
20
30
40
50
60
70
80
90
100
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
7500s
8000s
8500s
9000s
9500s
10000s
10500s
11000s
11500s
12000s
12500s
13000s
13500s
14000s
gateway storage-1 storage-2 storage-3 storage-4 storage-5
Network 10Gbps
OS CentOS release 6.5 (Final)
Erlang OTP R16B03-1
LeoFS v1.0.2
Environment:
System Consistency Level: [ N:3, W:2, R:1, D:2 ]
Duration 2.0h
R:W 8:2
# of Concurrent Processes 64
# of Keys 100,000
Value Size
!
!
!
!
!
Benchmark Configuration:
Brief Benchmark Report - 2nd Case (HDD / R:W=8:2)
Range (byte) Percentage
1024 10240 24%
10241 102400 30%
10241 819200 30%
819201 1572864 16%
Brief Benchmark Report - 2nd Case (HDD / R:W=8:2)
60-70ms 80-90ms
1,000ops
No Errors
Compare 1st case
with 2nd case
0
100,000
200,000
300,000
400,000
500,000
600,000
700,000
800,000
900,000
1,000,000
1,100,000
1,200,000
1,300,000
1,400,000
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
gateway rxbyt/s gateway txbyt/s
storage-1 rxbyt/s storage-1 txbyt/s
storage-2 rxbyt/s storage-2 txbyt/s
storage-3 rxbyt/s storage-3 txbyt/s
storage-4 rxbyt/s storage-4 txbyt/s
storage-5 rxbyt/s storage-5 txbyt/s
0
100,000
200,000
300,000
400,000
500,000
600,000
700,000
800,000
900,000
1,000,000
1,100,000
1,200,000
1,300,000
1,400,000
1,500,000
1,600,000
1,700,000
1,800,000
1,900,000
2,000,000
2,100,000
2,200,000
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
6.0Gbps
Brief Benchmark Report
7.0Gbps
6.0Gbps
7.0Gbps
minus 0.7Gbps
1st Case - Network Traffic
2nd Case - Network Traffic
0.0
50.0
100.0
150.0
200.0
250.0
300.0
350.0
400.0
450.0
500.0
550.0
600.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
0.0
50.0
100.0
150.0
200.0
250.0
300.0
350.0
400.0
450.0
500.0
550.0
600.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
storage-1
storage-2
storage-3
storage-4
storage-5
100
100
Brief Benchmark Report
2nd Case - Disk util%
200
200
1st Case - Disk util%
1.8x high
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
2.0
2.2
2.4
2.6
2.8
3.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
Brief Benchmark Report
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
2.0
2.2
2.4
2.6
2.8
3.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
1.00
1.00
1.6x high
2nd Case - CPU Load 5min
1st Case - CPU Load 5min
LeoFS kept in a stable performance
through the benchmark
Brief Benchmark Report
Bottleneck is Disk I/O
The cache mechanism contributed to reduce
network traffic between Gateway and Storage
Conclusion:
Multi
Data Center
Replication
Hokkaido, Japan
Tokyo
Europe
US
Multi Data Center Replication
HIGH-Scalability
HIGH-Availability
Easy Operation for Admins
+
NO SPOF
NO Performance Degradation
Singapore
1. Easy Operation to build multi clusters.
2. Asynchronous data replication between clusters
Stacked data is transferred to remote cluster(s)
3. Eventual consistency
Multi Data Center Replication
Designed it as simple as possible
DC-3DC-2
StorageclusterManagerclusterClient
DC-1
Monitors and Replicates each “RING” and “System Configuration”
"Leo Storage Platform"
[# of replicas:1] [# of replicas:1][# of replicas:3]
"join cluster DC-2 and DC-3"
leo_rpcleo_rpc
Multi Data Center Replication
Executing “Join Cluster”
on Manager Console
Preparing MDC Replication
DC-3DC-2
StorageclusterManagerclusterClient
Monitors and Replicates each “RING” and “System Configuration”
"Leo Storage Platform"
[# of replicas:1] [# of replicas:1]
Request to
the Target Region
Application(s)
DC-1
[# of replicas:3]
Temporally Stacking objects
- One container's capacity is *32MB
- When capacity is full,
send it to remote cluster(s)
* 32MB: default capacity - able to set optional value
leo_rpcleo_rpc
Multi Data Center Replication
Stacking objects
DC-3DC-2
StorageclusterManagerclusterClient
Monitors and Replicates each “RING” and “System Configuration”
"Leo Storage Platform"
DC-1
Stacked an object with a metadata
Compress it with LZ4
Replicated an object
Request to
the Target Region
Application(s)
leo_rpc
leo_rpc
leo_rpc
Multi Data Center Replication
Transferring stacked objects
Stacked objects
DC-3DC-2
StorageclusterManagerclusterClient
Monitor and Replicate each “RING” and “System Configuration”
"Leo Storage Platform"
Request to
the Target Region
Application(s)
DC-1
1) Receive metadata of stored objects
2) Compare them at the local cluster
3) Fix inconsistent objects
leo_rpcleo_rpc
leo_rpc
leo_rpc
Multi Data Center Replication
Investigating stored objects
LeoFS

Administration
at Rakuten
Presented by Hiroki Matsue
Rakuten Software Engineer
Tokyo, JapanKyoto, Japan
Storage Platform
File Sharing Service
Others
Portal Site
Photo Storage
Background Storage of OpenStack
LeoFS Administration at Rakuten
Storage Platform
Storage Platform - Scaling the Storage Platform
(Movie)
Reduce Costs
High Reliability
Easy to Scale
S3-API
Using Various Services
Total Usage: 450TB	
# of Files: 600Million	
Daily Growth: 100GB	
Daily Reqs: 13Million
Storage Platform - Scaling the Storage Platform
E-Commerce
Blog
Insurance Calendar
Recruiting
Review Photo
share
Portal &
Contents
Bookmark
B
Storage
Platform
(Movie)
Monitor
GUI Console
( Erlang RPC)
( Erlang RPC)
( TCP/IP,SNMP )
Gatewayx4Storagex14
Manager x 2
Requests from
Web Applications / Browsers
w/HTTP over S3-API
Load Balancer / Cache Servers
Storage Platform - System Layout
Total disk space: 600TB
Number of Files: 600Million
Access Stats:
800Mbps (MAX)
400Mbps (AVG)
Monitor
GUI Console
( Erlang RPC)
( Erlang RPC)
( TCP/IP,SNMP )
Gatewayx4Storagex14
Manager x 2
Storage Platform - Monitor
Send Mail Alert
Ganglia and
Nagios Agent
Status Collection (Ganglia)
Status Check (Nagios)
Port + Threshold Check
Storage Platform - Spreading Globally
Covering All Services
with Multi DC Replication
File Sharing Service
+
https://owncloud.com/
+
File Sharing Service - Required Targets
Reduce Costs
Handle Confidential Files
Store Large Files
Scale Easily
+
Share Docs and Movies with Group Companies
Over 20 Companies, Over 10 Countries
Over 4,000 Users, Over 10,000 Teams
File Sharing Service - Usage
LDAP
Monitor
GUI Console
( Erlang RPC)
( Erlang RPC)
( TCP/IP,SNMP )
Manager x 2
Authenticate Users
Manage
Configurations
Manage
Login Session
(KVS)
File Sharing Service - System Layout
Web GUI File Browser
Cover 25 Countries/Regions
Over 20,000 Users
+
File Sharing Service - Future Plans
Empowering the Services and the Users
Through the Cloud Storage
Future Plans
Tokyo, JapanHokkaido, Japan
NFS Support
Future Plans
Data-HUB: Centralize unstructured data in LeoFS
Search / Analysis
PaaS / IaaS Photo-Storage
Many Kind of Data PhotoLog / Event Data
Loading Data
Analysis Data
Stream Processing
SavannaDB for Statistics Data
Retrieve
m
etrics
and
stats
from
SavannaDB's
Agents
Storage Cluster
ManagerGateway
The Lion of Storage Systems
REST-API (JSON)
Operate LeoFS
Notify
a
m
essage
of over #
of req
threshold
SavannaDB's Agent
Insight LeoFS
LeoInsight
Future Plans
+
Set Sail for “Cloud Storage”
Website: leo-project.net
Twitter: @LeoFastStorage
Facebook: www.facebook.com/org.leofs
1 of 55

Recommended

Introduction to Redis by
Introduction to RedisIntroduction to Redis
Introduction to RedisArnab Mitra
11K views31 slides
Real-Time Data Flows with Apache NiFi by
Real-Time Data Flows with Apache NiFiReal-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFiManish Gupta
19.8K views29 slides
Tame the small files problem and optimize data layout for streaming ingestion... by
Tame the small files problem and optimize data layout for streaming ingestion...Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Flink Forward
800 views68 slides
Redis cluster by
Redis clusterRedis cluster
Redis clusteriammutex
7K views17 slides
Apache Solr-Webinar by
Apache Solr-WebinarApache Solr-Webinar
Apache Solr-WebinarEdureka!
4.8K views41 slides
Autoscaling Flink with Reactive Mode by
Autoscaling Flink with Reactive ModeAutoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeFlink Forward
921 views17 slides

More Related Content

What's hot

Iceberg: a fast table format for S3 by
Iceberg: a fast table format for S3Iceberg: a fast table format for S3
Iceberg: a fast table format for S3DataWorks Summit
7.5K views30 slides
Introduction to Kafka Streams by
Introduction to Kafka StreamsIntroduction to Kafka Streams
Introduction to Kafka StreamsGuozhang Wang
29.7K views136 slides
A Thorough Comparison of Delta Lake, Iceberg and Hudi by
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
11.1K views27 slides
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i... by
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...Flink Forward
183 views13 slides
Introducing ELK by
Introducing ELKIntroducing ELK
Introducing ELKAllBits BVBA (freelancer)
2.9K views16 slides
Building Reliable Lakehouses with Apache Flink and Delta Lake by
Building Reliable Lakehouses with Apache Flink and Delta LakeBuilding Reliable Lakehouses with Apache Flink and Delta Lake
Building Reliable Lakehouses with Apache Flink and Delta LakeFlink Forward
564 views31 slides

What's hot(20)

Iceberg: a fast table format for S3 by DataWorks Summit
Iceberg: a fast table format for S3Iceberg: a fast table format for S3
Iceberg: a fast table format for S3
DataWorks Summit7.5K views
Introduction to Kafka Streams by Guozhang Wang
Introduction to Kafka StreamsIntroduction to Kafka Streams
Introduction to Kafka Streams
Guozhang Wang29.7K views
A Thorough Comparison of Delta Lake, Iceberg and Hudi by Databricks
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Databricks11.1K views
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i... by Flink Forward
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
Flink Forward183 views
Building Reliable Lakehouses with Apache Flink and Delta Lake by Flink Forward
Building Reliable Lakehouses with Apache Flink and Delta LakeBuilding Reliable Lakehouses with Apache Flink and Delta Lake
Building Reliable Lakehouses with Apache Flink and Delta Lake
Flink Forward564 views
Deep Dive into Spark SQL with Advanced Performance Tuning with Xiao Li & Wenc... by Databricks
Deep Dive into Spark SQL with Advanced Performance Tuning with Xiao Li & Wenc...Deep Dive into Spark SQL with Advanced Performance Tuning with Xiao Li & Wenc...
Deep Dive into Spark SQL with Advanced Performance Tuning with Xiao Li & Wenc...
Databricks10.8K views
Large Scale Lakehouse Implementation Using Structured Streaming by Databricks
Large Scale Lakehouse Implementation Using Structured StreamingLarge Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured Streaming
Databricks489 views
No instrumentation Golang Logging with eBPF (GoSF talk 11/11/20) by Zain Asgar
No instrumentation Golang Logging with eBPF (GoSF talk 11/11/20)No instrumentation Golang Logging with eBPF (GoSF talk 11/11/20)
No instrumentation Golang Logging with eBPF (GoSF talk 11/11/20)
Zain Asgar272 views
SeaweedFS introduction by chrislusf
SeaweedFS introductionSeaweedFS introduction
SeaweedFS introduction
chrislusf6.1K views
Common Patterns of Multi Data-Center Architectures with Apache Kafka by confluent
Common Patterns of Multi Data-Center Architectures with Apache KafkaCommon Patterns of Multi Data-Center Architectures with Apache Kafka
Common Patterns of Multi Data-Center Architectures with Apache Kafka
confluent21K views
Set your Data in Motion with Confluent & Apache Kafka Tech Talk Series LME by confluent
Set your Data in Motion with Confluent & Apache Kafka Tech Talk Series LMESet your Data in Motion with Confluent & Apache Kafka Tech Talk Series LME
Set your Data in Motion with Confluent & Apache Kafka Tech Talk Series LME
confluent348 views
Streaming Visualization by Guido Schmutz
Streaming VisualizationStreaming Visualization
Streaming Visualization
Guido Schmutz1.7K views
Kafka replication apachecon_2013 by Jun Rao
Kafka replication apachecon_2013Kafka replication apachecon_2013
Kafka replication apachecon_2013
Jun Rao21.3K views
Openshift Container Platform by DLT Solutions
Openshift Container PlatformOpenshift Container Platform
Openshift Container Platform
DLT Solutions4.7K views
Kafka Streams State Stores Being Persistent by confluent
Kafka Streams State Stores Being PersistentKafka Streams State Stores Being Persistent
Kafka Streams State Stores Being Persistent
confluent803 views
Flink vs. Spark by Slim Baltagi
Flink vs. SparkFlink vs. Spark
Flink vs. Spark
Slim Baltagi69.5K views
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J... by Databricks
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...
Databricks98.6K views
A visual introduction to Apache Kafka by Paul Brebner
A visual introduction to Apache KafkaA visual introduction to Apache Kafka
A visual introduction to Apache Kafka
Paul Brebner4.7K views
Redpanda and ClickHouse by Altinity Ltd
Redpanda and ClickHouseRedpanda and ClickHouse
Redpanda and ClickHouse
Altinity Ltd776 views

Viewers also liked

[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project by
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB ProjectRakuten Group, Inc.
45.5K views65 slides
Rakuten LeoFs - distributed file system by
Rakuten LeoFs - distributed file systemRakuten LeoFs - distributed file system
Rakuten LeoFs - distributed file systemRakuten Group, Inc.
8.3K views33 slides
The State of Ceph, Manila, and Containers in OpenStack by
The State of Ceph, Manila, and Containers in OpenStackThe State of Ceph, Manila, and Containers in OpenStack
The State of Ceph, Manila, and Containers in OpenStackSage Weil
5.5K views43 slides
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop by
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for HadoopSeungYong Baek
2K views42 slides
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech Talks by
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech TalksDeep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech Talks
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech TalksAmazon Web Services
14.8K views25 slides
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐 by
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐Terry Cho
46.8K views37 slides

Viewers also liked(6)

[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project by Rakuten Group, Inc.
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
Rakuten Group, Inc.45.5K views
The State of Ceph, Manila, and Containers in OpenStack by Sage Weil
The State of Ceph, Manila, and Containers in OpenStackThe State of Ceph, Manila, and Containers in OpenStack
The State of Ceph, Manila, and Containers in OpenStack
Sage Weil5.5K views
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop by SeungYong Baek
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop
SeungYong Baek2K views
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech Talks by Amazon Web Services
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech TalksDeep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech Talks
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech Talks
Amazon Web Services14.8K views
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐 by Terry Cho
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐
Terry Cho46.8K views

Similar to Scaling and High Performance Storage System: LeoFS

RakutenTechConf2013] [D-3_1] LeoFS - Open the New Door by
RakutenTechConf2013] [D-3_1] LeoFS - Open the New DoorRakutenTechConf2013] [D-3_1] LeoFS - Open the New Door
RakutenTechConf2013] [D-3_1] LeoFS - Open the New DoorRakuten Group, Inc.
3.1K views34 slides
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance Barriers by
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance BarriersCeph Day Berlin: Ceph on All Flash Storage - Breaking Performance Barriers
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance BarriersCeph Community
453 views30 slides
User-space Network Processing by
User-space Network ProcessingUser-space Network Processing
User-space Network ProcessingRyousei Takano
5.7K views22 slides
optimizing_ceph_flash by
optimizing_ceph_flashoptimizing_ceph_flash
optimizing_ceph_flashVijayendra Shamanna
1K views27 slides
Stephan Ewen - Experiences running Flink at Very Large Scale by
Stephan Ewen -  Experiences running Flink at Very Large ScaleStephan Ewen -  Experiences running Flink at Very Large Scale
Stephan Ewen - Experiences running Flink at Very Large ScaleVerverica
3.5K views76 slides
LLAP: Sub-Second Analytical Queries in Hive by
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveDataWorks Summit/Hadoop Summit
595 views24 slides

Similar to Scaling and High Performance Storage System: LeoFS(20)

RakutenTechConf2013] [D-3_1] LeoFS - Open the New Door by Rakuten Group, Inc.
RakutenTechConf2013] [D-3_1] LeoFS - Open the New DoorRakutenTechConf2013] [D-3_1] LeoFS - Open the New Door
RakutenTechConf2013] [D-3_1] LeoFS - Open the New Door
Rakuten Group, Inc.3.1K views
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance Barriers by Ceph Community
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance BarriersCeph Day Berlin: Ceph on All Flash Storage - Breaking Performance Barriers
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance Barriers
Ceph Community 453 views
User-space Network Processing by Ryousei Takano
User-space Network ProcessingUser-space Network Processing
User-space Network Processing
Ryousei Takano5.7K views
Stephan Ewen - Experiences running Flink at Very Large Scale by Ververica
Stephan Ewen -  Experiences running Flink at Very Large ScaleStephan Ewen -  Experiences running Flink at Very Large Scale
Stephan Ewen - Experiences running Flink at Very Large Scale
Ververica 3.5K views
[OpenStack Days Korea 2016] Track1 - All flash CEPH 구성 및 최적화 by OpenStack Korea Community
[OpenStack Days Korea 2016] Track1 - All flash CEPH 구성 및 최적화[OpenStack Days Korea 2016] Track1 - All flash CEPH 구성 및 최적화
[OpenStack Days Korea 2016] Track1 - All flash CEPH 구성 및 최적화
Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers by Ceph Community
Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance BarriersCeph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Community 667 views
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C... by Odinot Stanislas
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Odinot Stanislas23.5K views
Apache Big Data EU 2015 - HBase by Nick Dimiduk
Apache Big Data EU 2015 - HBaseApache Big Data EU 2015 - HBase
Apache Big Data EU 2015 - HBase
Nick Dimiduk3.3K views
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve... by Виталий Стародубцев
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ... by In-Memory Computing Summit
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
Five major tips to maximize performance on a 200+ SQL HBase/Phoenix cluster by mas4share
Five major tips to maximize performance on a 200+ SQL HBase/Phoenix clusterFive major tips to maximize performance on a 200+ SQL HBase/Phoenix cluster
Five major tips to maximize performance on a 200+ SQL HBase/Phoenix cluster
mas4share1.3K views
Near Real time Indexing Kafka Messages to Apache Blur using Spark Streaming by Dibyendu Bhattacharya
Near Real time Indexing Kafka Messages to Apache Blur using Spark StreamingNear Real time Indexing Kafka Messages to Apache Blur using Spark Streaming
Near Real time Indexing Kafka Messages to Apache Blur using Spark Streaming
Plank by FNian
PlankPlank
Plank
FNian641 views

More from Rakuten Group, Inc.

コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話 by
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話Rakuten Group, Inc.
112 views32 slides
楽天における安全な秘匿情報管理への道のり by
楽天における安全な秘匿情報管理への道のり楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のりRakuten Group, Inc.
172 views43 slides
What Makes Software Green? by
What Makes Software Green?What Makes Software Green?
What Makes Software Green?Rakuten Group, Inc.
138 views39 slides
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At... by
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Rakuten Group, Inc.
222 views33 slides
大規模なリアルタイム監視の導入と展開 by
大規模なリアルタイム監視の導入と展開大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開Rakuten Group, Inc.
524 views18 slides
楽天における大規模データベースの運用 by
楽天における大規模データベースの運用楽天における大規模データベースの運用
楽天における大規模データベースの運用Rakuten Group, Inc.
776 views20 slides

More from Rakuten Group, Inc.(20)

コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話 by Rakuten Group, Inc.
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
楽天における安全な秘匿情報管理への道のり by Rakuten Group, Inc.
楽天における安全な秘匿情報管理への道のり楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At... by Rakuten Group, Inc.
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
大規模なリアルタイム監視の導入と展開 by Rakuten Group, Inc.
大規模なリアルタイム監視の導入と展開大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開
楽天における大規模データベースの運用 by Rakuten Group, Inc.
楽天における大規模データベースの運用楽天における大規模データベースの運用
楽天における大規模データベースの運用
楽天サービスを支えるネットワークインフラストラクチャー by Rakuten Group, Inc.
楽天サービスを支えるネットワークインフラストラクチャー楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー
楽天の規模とクラウドプラットフォーム統括部の役割 by Rakuten Group, Inc.
楽天の規模とクラウドプラットフォーム統括部の役割楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割
The Data Platform Administration Handling the 100 PB.pdf by Rakuten Group, Inc.
The Data Platform Administration Handling the 100 PB.pdfThe Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdf
Supporting Internal Customers as Technical Account Managers.pdf by Rakuten Group, Inc.
Supporting Internal Customers as Technical Account Managers.pdfSupporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdf
Travel & Leisure Platform Department's tech info by Rakuten Group, Inc.
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info by Rakuten Group, Inc.
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
100PBを越えるデータプラットフォームの実情 by Rakuten Group, Inc.
100PBを越えるデータプラットフォームの実情100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情
社内エンジニアを支えるテクニカルアカウントマネージャー by Rakuten Group, Inc.
社内エンジニアを支えるテクニカルアカウントマネージャー社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー
モニタリングプラットフォーム開発の裏側 by Rakuten Group, Inc.
モニタリングプラットフォーム開発の裏側モニタリングプラットフォーム開発の裏側
モニタリングプラットフォーム開発の裏側

Recently uploaded

Melek BEN MAHMOUD.pdf by
Melek BEN MAHMOUD.pdfMelek BEN MAHMOUD.pdf
Melek BEN MAHMOUD.pdfMelekBenMahmoud
14 views1 slide
Transcript: The Details of Description Techniques tips and tangents on altern... by
Transcript: The Details of Description Techniques tips and tangents on altern...Transcript: The Details of Description Techniques tips and tangents on altern...
Transcript: The Details of Description Techniques tips and tangents on altern...BookNet Canada
130 views15 slides
6g - REPORT.pdf by
6g - REPORT.pdf6g - REPORT.pdf
6g - REPORT.pdfLiveplex
9 views23 slides
SAP Automation Using Bar Code and FIORI.pdf by
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdfVirendra Rai, PMP
19 views38 slides
Perth MeetUp November 2023 by
Perth MeetUp November 2023 Perth MeetUp November 2023
Perth MeetUp November 2023 Michael Price
15 views44 slides
Scaling Knowledge Graph Architectures with AI by
Scaling Knowledge Graph Architectures with AIScaling Knowledge Graph Architectures with AI
Scaling Knowledge Graph Architectures with AIEnterprise Knowledge
24 views15 slides

Recently uploaded(20)

Transcript: The Details of Description Techniques tips and tangents on altern... by BookNet Canada
Transcript: The Details of Description Techniques tips and tangents on altern...Transcript: The Details of Description Techniques tips and tangents on altern...
Transcript: The Details of Description Techniques tips and tangents on altern...
BookNet Canada130 views
6g - REPORT.pdf by Liveplex
6g - REPORT.pdf6g - REPORT.pdf
6g - REPORT.pdf
Liveplex9 views
SAP Automation Using Bar Code and FIORI.pdf by Virendra Rai, PMP
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdf
Perth MeetUp November 2023 by Michael Price
Perth MeetUp November 2023 Perth MeetUp November 2023
Perth MeetUp November 2023
Michael Price15 views
From chaos to control: Managing migrations and Microsoft 365 with ShareGate! by sammart93
From chaos to control: Managing migrations and Microsoft 365 with ShareGate!From chaos to control: Managing migrations and Microsoft 365 with ShareGate!
From chaos to control: Managing migrations and Microsoft 365 with ShareGate!
sammart939 views
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N... by James Anderson
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
James Anderson33 views
Case Study Copenhagen Energy and Business Central.pdf by Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana12 views
Piloting & Scaling Successfully With Microsoft Viva by Richard Harbridge
Piloting & Scaling Successfully With Microsoft VivaPiloting & Scaling Successfully With Microsoft Viva
Piloting & Scaling Successfully With Microsoft Viva
Special_edition_innovator_2023.pdf by WillDavies22
Special_edition_innovator_2023.pdfSpecial_edition_innovator_2023.pdf
Special_edition_innovator_2023.pdf
WillDavies2216 views
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by sugiuralab
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors
sugiuralab15 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software225 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Unit 1_Lecture 2_Physical Design of IoT.pdf by StephenTec
Unit 1_Lecture 2_Physical Design of IoT.pdfUnit 1_Lecture 2_Physical Design of IoT.pdf
Unit 1_Lecture 2_Physical Design of IoT.pdf
StephenTec11 views

Scaling and High Performance Storage System: LeoFS