SlideShare a Scribd company logo
1 of 21
Download to read offline
High Availability
.
Mission Statement
1. high availability business-level cloud data store
2. federated clouds = diversification
3. many DCs and/or cloud providers
4. we care mostly about performance = high availability
5. practical solutions are needed
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 2/21
...
2/21
.
haStore : The Short Story
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 3/21
...
3/21
.
haStore: One DC is Not Enough
• remember June 2013?
• most services today use vertical intergration -- no diversity
• Hitachi does not share DCs with NEC
• regional diversity of one provider is bad
◦ how many Amazon DCs in Japan?
.
(the only possible) Solution
..
.
... is to sign contracts with multiple DCs and manage on
client side
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 4/21
...
4/21
.
haStore: One DC is Not Enough
Kansai
DC1
OkinawaLocations
Data
Centers
DC2
Kyushu
Osaka Office DC1
DC1 DC2 Naha Office
Network
distance
Network
distance
storage
network
Employee A ….
Content / Social Metadata
High Availability Data Store
DC1 DC2 ….
DC1 DC2 Business trip
Store
APIs
Proposed
Software
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 5/21
...
5/21
.
haStore: Store Diversification
• store = sum of multiple substores
• in software: not a priority list -- optimization engine!
• realtime performance monitoring, read/write optimization, etc.
• sub-file data unit -- chunks
SSD
Growing network
distance
User
HDD DC1 DC2 …
Network
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 6/21
...
6/21
.
haStore: Socially Aware Store
• content relevance based on
social graph
• relevance is a distribution
• individual redundancy based on distribution
• other link types: same time, location,
filetype, ...
• link strength != 1
Descending
order
Relevance
Distribution
Redundancy
(user setting)
Physical limit
of redundancy
End of
content
There is
a link
When
a file
is …
Between
Created
Viewed
Edited
Deleted
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 7/21
...
7/21
.
hsStore: Software Design
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 8/21
...
8/21
.
Design: Specs
• many substores, heterogeneous e2e performance and capacity
• each substore has its own API (Dropbox, GDrive, SSD, etc.), but haStore exports a
generic API
• data unit: sub-file blobs, for now fixed 100kb size
• social graph is used to define priority lists of files
◦ different for each user
• optimization is key element of software engines
1. sync logic
2. redundancy logic
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 9/21
...
9/21
.
Design: API Stack
• Generic API starts from Level 2, similar to drivers
• the stack is implemented by each client = each user
Employee A ….
Content / Social Metadata
High Availability Data Store
DC1 DC2 …. Store
Proposed
Software
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 10/21
...
10/21
.
Design: Sync Engine
• optimization for throughput minimization
• same logic for SSD, HDD and over-the-network
haStore
Storage Sync
Engine
Optimization
Local
Cache
Check
1 2
Use
GUI,
Clients
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 11/21
...
11/21
.
Design: Sync Engine Logic
Bulk
Throughput
History Data
Increase timeout
Performance
Tradeoff
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 12/21
...
12/21
.
Design: Redundancy Logic (1)
Descending
order
Relevance
Distribution
Redundancy
(user setting)
Physical limit
of redundancy
End of
content
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 13/21
...
13/21
.
Design: Redundancy Logic (2)
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 14/21
...
14/21
.
haStore: Social Graph
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 15/21
...
15/21
.
Social Graph : Basics
• current version: only simple types of links
• no link strength
There is
a link
When
a file
is …
Between
Created
Viewed
Edited
Deleted
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 16/21
...
16/21
.
Social Graph : Advanced
• community detection
• files that could be linked:
1. touched at roughly the same time
2. touched by the same user
3. same location, filetype, size, etc.
• link strength, different for each kind of relation, variable e2e cost on
paths
• discovery based on e2e cost, not hop count
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 17/21
...
17/21
.
Implementation, Tests
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 18/21
...
18/21
.
Performance : Demo
A-san B-san
DBX GDR
2014-01-22 12:13:30 Block DONE
Block UPLOAD
Block DOWNLOAD
• also demo
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 19/21
...
19/21
.
Wrapup
• haStore: high availability cloud store
• main features
◦ throughput-awaresync/redundancy optimization
◦ sub-file blocks, smart distribution
◦ social graph
• current status: v1.0 in operation, v2.0 on the way
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 20/21
...
20/21
.
That’s all, thank you ...
Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 21/21
...
21/21

More Related Content

Similar to High Availability Cloud Storage as a Software Package with Social Graph, Throughput Awareness, and Smart Distribution Features

Simplifying and Future-Proofing Hadoop
Simplifying and Future-Proofing HadoopSimplifying and Future-Proofing Hadoop
Simplifying and Future-Proofing HadoopPrecisely
 
Strategies for Context Data Persistence
Strategies for Context Data PersistenceStrategies for Context Data Persistence
Strategies for Context Data PersistenceFIWARE
 
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentApache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentHostedbyConfluent
 
Michael stack -the state of apache h base
Michael stack -the state of apache h baseMichael stack -the state of apache h base
Michael stack -the state of apache h basehdhappy001
 
4D Pubs - Distributed Dynamic Document Dsplay
4D Pubs - Distributed Dynamic Document Dsplay4D Pubs - Distributed Dynamic Document Dsplay
4D Pubs - Distributed Dynamic Document DsplayChris Despopoulos
 
Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshConfluentInc1
 
FIWARE Wednesday Webinars - Strategies for Context Data Persistence
FIWARE Wednesday Webinars - Strategies for Context Data PersistenceFIWARE Wednesday Webinars - Strategies for Context Data Persistence
FIWARE Wednesday Webinars - Strategies for Context Data PersistenceFIWARE
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-HadoopNagarjuna D.N
 
Simplifying Big Data Integration with Syncsort DMX and DMX-h
Simplifying Big Data Integration with Syncsort DMX and DMX-hSimplifying Big Data Integration with Syncsort DMX and DMX-h
Simplifying Big Data Integration with Syncsort DMX and DMX-hPrecisely
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoopMohit Tare
 
Fog Cloud Caching at Network Edge via Local Hardware Awareness Spaces
Fog Cloud Caching at Network Edge via Local Hardware Awareness SpacesFog Cloud Caching at Network Edge via Local Hardware Awareness Spaces
Fog Cloud Caching at Network Edge via Local Hardware Awareness SpacesTokyo University of Science
 
Tachyon-2014-11-21-amp-camp5
Tachyon-2014-11-21-amp-camp5Tachyon-2014-11-21-amp-camp5
Tachyon-2014-11-21-amp-camp5Haoyuan Li
 
facebook architecture for 600M users
facebook architecture for 600M usersfacebook architecture for 600M users
facebook architecture for 600M usersJongyoon Choi
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Denodo
 
Data stream with cruise control
Data stream with cruise controlData stream with cruise control
Data stream with cruise controlBill Liu
 
From Single Purpose to Multi Purpose Data Lakes - Broadening End Users
From Single Purpose to Multi Purpose Data Lakes - Broadening End UsersFrom Single Purpose to Multi Purpose Data Lakes - Broadening End Users
From Single Purpose to Multi Purpose Data Lakes - Broadening End UsersDenodo
 
Interactive Hadoop via Flash and Memory
Interactive Hadoop via Flash and MemoryInteractive Hadoop via Flash and Memory
Interactive Hadoop via Flash and MemoryChris Nauroth
 

Similar to High Availability Cloud Storage as a Software Package with Social Graph, Throughput Awareness, and Smart Distribution Features (20)

Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Simplifying and Future-Proofing Hadoop
Simplifying and Future-Proofing HadoopSimplifying and Future-Proofing Hadoop
Simplifying and Future-Proofing Hadoop
 
Strategies for Context Data Persistence
Strategies for Context Data PersistenceStrategies for Context Data Persistence
Strategies for Context Data Persistence
 
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentApache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
 
Michael stack -the state of apache h base
Michael stack -the state of apache h baseMichael stack -the state of apache h base
Michael stack -the state of apache h base
 
4D Pubs - Distributed Dynamic Document Dsplay
4D Pubs - Distributed Dynamic Document Dsplay4D Pubs - Distributed Dynamic Document Dsplay
4D Pubs - Distributed Dynamic Document Dsplay
 
Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data Mesh
 
FIWARE Wednesday Webinars - Strategies for Context Data Persistence
FIWARE Wednesday Webinars - Strategies for Context Data PersistenceFIWARE Wednesday Webinars - Strategies for Context Data Persistence
FIWARE Wednesday Webinars - Strategies for Context Data Persistence
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
Simplifying Big Data Integration with Syncsort DMX and DMX-h
Simplifying Big Data Integration with Syncsort DMX and DMX-hSimplifying Big Data Integration with Syncsort DMX and DMX-h
Simplifying Big Data Integration with Syncsort DMX and DMX-h
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoop
 
Fog Cloud Caching at Network Edge via Local Hardware Awareness Spaces
Fog Cloud Caching at Network Edge via Local Hardware Awareness SpacesFog Cloud Caching at Network Edge via Local Hardware Awareness Spaces
Fog Cloud Caching at Network Edge via Local Hardware Awareness Spaces
 
Tachyon-2014-11-21-amp-camp5
Tachyon-2014-11-21-amp-camp5Tachyon-2014-11-21-amp-camp5
Tachyon-2014-11-21-amp-camp5
 
JahiaOne - Semantic Web with Jahia
JahiaOne - Semantic Web with JahiaJahiaOne - Semantic Web with Jahia
JahiaOne - Semantic Web with Jahia
 
facebook architecture for 600M users
facebook architecture for 600M usersfacebook architecture for 600M users
facebook architecture for 600M users
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
 
Data stream with cruise control
Data stream with cruise controlData stream with cruise control
Data stream with cruise control
 
From Single Purpose to Multi Purpose Data Lakes - Broadening End Users
From Single Purpose to Multi Purpose Data Lakes - Broadening End UsersFrom Single Purpose to Multi Purpose Data Lakes - Broadening End Users
From Single Purpose to Multi Purpose Data Lakes - Broadening End Users
 
Interactive Hadoop via Flash and Memory
Interactive Hadoop via Flash and MemoryInteractive Hadoop via Flash and Memory
Interactive Hadoop via Flash and Memory
 

More from Tokyo University of Science

A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...
A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...
A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...Tokyo University of Science
 
Ultrasound Relative Positioning for IoT Devices in Dense Wireless Spaces
Ultrasound Relative Positioning for IoT Devices in Dense Wireless SpacesUltrasound Relative Positioning for IoT Devices in Dense Wireless Spaces
Ultrasound Relative Positioning for IoT Devices in Dense Wireless SpacesTokyo University of Science
 
Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...
Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...
Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...Tokyo University of Science
 
What if We Atomize Student Data and Apps and Put Them on Docker Containers?
What if We Atomize Student Data and Apps and Put Them on Docker Containers?What if We Atomize Student Data and Apps and Put Them on Docker Containers?
What if We Atomize Student Data and Apps and Put Them on Docker Containers?Tokyo University of Science
 
Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...
Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...
Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...Tokyo University of Science
 
On Performance Under Hotspots in Hadoop versus Bigdata Replay Platforms
On Performance Under Hotspots in Hadoop versus Bigdata Replay PlatformsOn Performance Under Hotspots in Hadoop versus Bigdata Replay Platforms
On Performance Under Hotspots in Hadoop versus Bigdata Replay PlatformsTokyo University of Science
 
Taking the Step from Software to Product Development \\ when teaching PBL at ...
Taking the Step from Software to Product Development \\ when teaching PBL at ...Taking the Step from Software to Product Development \\ when teaching PBL at ...
Taking the Step from Software to Product Development \\ when teaching PBL at ...Tokyo University of Science
 
Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...
Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...
Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...Tokyo University of Science
 
The Switchboard Optimization Problem and Heuristics for Cut-Through Networking
The Switchboard Optimization Problem and Heuristics for Cut-Through NetworkingThe Switchboard Optimization Problem and Heuristics for Cut-Through Networking
The Switchboard Optimization Problem and Heuristics for Cut-Through NetworkingTokyo University of Science
 
The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...
The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...
The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...Tokyo University of Science
 
Bulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless Spaces
Bulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless SpacesBulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless Spaces
Bulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless SpacesTokyo University of Science
 
On a Hybrid Packets-and-Circuits Switching Logic
On a Hybrid Packets-and-Circuits Switching LogicOn a Hybrid Packets-and-Circuits Switching Logic
On a Hybrid Packets-and-Circuits Switching LogicTokyo University of Science
 
Image-Related Uses for Roadside Infrastructure \\ based on Wireless Beacons
Image-Related Uses for Roadside Infrastructure \\ based on Wireless BeaconsImage-Related Uses for Roadside Infrastructure \\ based on Wireless Beacons
Image-Related Uses for Roadside Infrastructure \\ based on Wireless BeaconsTokyo University of Science
 
Complexity Resolution Control for Context Based on Metromaps
Complexity Resolution Control for Context Based on MetromapsComplexity Resolution Control for Context Based on Metromaps
Complexity Resolution Control for Context Based on MetromapsTokyo University of Science
 
The Declarative-Coordinated Model for Self-Optimization of Service Networks
The Declarative-Coordinated Model for Self-Optimization of Service NetworksThe Declarative-Coordinated Model for Self-Optimization of Service Networks
The Declarative-Coordinated Model for Self-Optimization of Service NetworksTokyo University of Science
 
3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds
3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds
3-Way Scripts as a Practical Platform for Secure Distributed Code in CloudsTokyo University of Science
 
3-Way Scripts as a Base Unit for Flexible Scale-Out Code
3-Way Scripts as a Base Unit for Flexible Scale-Out Code3-Way Scripts as a Base Unit for Flexible Scale-Out Code
3-Way Scripts as a Base Unit for Flexible Scale-Out CodeTokyo University of Science
 
Towards Social Robotics on Smartphones with Simple XYZV Sensor Feedback
Towards Social Robotics on Smartphones with Simple XYZV Sensor FeedbackTowards Social Robotics on Smartphones with Simple XYZV Sensor Feedback
Towards Social Robotics on Smartphones with Simple XYZV Sensor FeedbackTokyo University of Science
 
Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...
Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...
Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...Tokyo University of Science
 
Browser Visualization using PNGs Generated by HTML5 Workers on Multicore
Browser Visualization using PNGs Generated by HTML5 Workers on MulticoreBrowser Visualization using PNGs Generated by HTML5 Workers on Multicore
Browser Visualization using PNGs Generated by HTML5 Workers on MulticoreTokyo University of Science
 

More from Tokyo University of Science (20)

A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...
A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...
A Method for Cloud-Assisted Secure Wireless Grouping of Client Devices at Net...
 
Ultrasound Relative Positioning for IoT Devices in Dense Wireless Spaces
Ultrasound Relative Positioning for IoT Devices in Dense Wireless SpacesUltrasound Relative Positioning for IoT Devices in Dense Wireless Spaces
Ultrasound Relative Positioning for IoT Devices in Dense Wireless Spaces
 
Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...
Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...
Towards a Packet Traffic Genome Project as a Method for Realtime Sub-Flow Tra...
 
What if We Atomize Student Data and Apps and Put Them on Docker Containers?
What if We Atomize Student Data and Apps and Put Them on Docker Containers?What if We Atomize Student Data and Apps and Put Them on Docker Containers?
What if We Atomize Student Data and Apps and Put Them on Docker Containers?
 
Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...
Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...
Large-Scale Crowdsourcing by Vehicular Data Packets in a Sparse Roadside Infr...
 
On Performance Under Hotspots in Hadoop versus Bigdata Replay Platforms
On Performance Under Hotspots in Hadoop versus Bigdata Replay PlatformsOn Performance Under Hotspots in Hadoop versus Bigdata Replay Platforms
On Performance Under Hotspots in Hadoop versus Bigdata Replay Platforms
 
Taking the Step from Software to Product Development \\ when teaching PBL at ...
Taking the Step from Software to Product Development \\ when teaching PBL at ...Taking the Step from Software to Product Development \\ when teaching PBL at ...
Taking the Step from Software to Product Development \\ when teaching PBL at ...
 
Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...
Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...
Design and Implementation of a 3-Party Cloud-Backed Handshake for Secure Grou...
 
The Switchboard Optimization Problem and Heuristics for Cut-Through Networking
The Switchboard Optimization Problem and Heuristics for Cut-Through NetworkingThe Switchboard Optimization Problem and Heuristics for Cut-Through Networking
The Switchboard Optimization Problem and Heuristics for Cut-Through Networking
 
The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...
The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...
The Switchboard Traffic Engineering Problem for Mixed Contention/Cut-Through ...
 
Bulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless Spaces
Bulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless SpacesBulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless Spaces
Bulk-n-Pick Method for One-to-Many Data Transfer in Dense Wireless Spaces
 
On a Hybrid Packets-and-Circuits Switching Logic
On a Hybrid Packets-and-Circuits Switching LogicOn a Hybrid Packets-and-Circuits Switching Logic
On a Hybrid Packets-and-Circuits Switching Logic
 
Image-Related Uses for Roadside Infrastructure \\ based on Wireless Beacons
Image-Related Uses for Roadside Infrastructure \\ based on Wireless BeaconsImage-Related Uses for Roadside Infrastructure \\ based on Wireless Beacons
Image-Related Uses for Roadside Infrastructure \\ based on Wireless Beacons
 
Complexity Resolution Control for Context Based on Metromaps
Complexity Resolution Control for Context Based on MetromapsComplexity Resolution Control for Context Based on Metromaps
Complexity Resolution Control for Context Based on Metromaps
 
The Declarative-Coordinated Model for Self-Optimization of Service Networks
The Declarative-Coordinated Model for Self-Optimization of Service NetworksThe Declarative-Coordinated Model for Self-Optimization of Service Networks
The Declarative-Coordinated Model for Self-Optimization of Service Networks
 
3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds
3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds
3-Way Scripts as a Practical Platform for Secure Distributed Code in Clouds
 
3-Way Scripts as a Base Unit for Flexible Scale-Out Code
3-Way Scripts as a Base Unit for Flexible Scale-Out Code3-Way Scripts as a Base Unit for Flexible Scale-Out Code
3-Way Scripts as a Base Unit for Flexible Scale-Out Code
 
Towards Social Robotics on Smartphones with Simple XYZV Sensor Feedback
Towards Social Robotics on Smartphones with Simple XYZV Sensor FeedbackTowards Social Robotics on Smartphones with Simple XYZV Sensor Feedback
Towards Social Robotics on Smartphones with Simple XYZV Sensor Feedback
 
Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...
Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...
Back to Rings but not Tokens: Physical and Logical Designs for Distributed Fi...
 
Browser Visualization using PNGs Generated by HTML5 Workers on Multicore
Browser Visualization using PNGs Generated by HTML5 Workers on MulticoreBrowser Visualization using PNGs Generated by HTML5 Workers on Multicore
Browser Visualization using PNGs Generated by HTML5 Workers on Multicore
 

Recently uploaded

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Recently uploaded (20)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

High Availability Cloud Storage as a Software Package with Social Graph, Throughput Awareness, and Smart Distribution Features

  • 2. . Mission Statement 1. high availability business-level cloud data store 2. federated clouds = diversification 3. many DCs and/or cloud providers 4. we care mostly about performance = high availability 5. practical solutions are needed Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 2/21 ... 2/21
  • 3. . haStore : The Short Story Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 3/21 ... 3/21
  • 4. . haStore: One DC is Not Enough • remember June 2013? • most services today use vertical intergration -- no diversity • Hitachi does not share DCs with NEC • regional diversity of one provider is bad ◦ how many Amazon DCs in Japan? . (the only possible) Solution .. . ... is to sign contracts with multiple DCs and manage on client side Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 4/21 ... 4/21
  • 5. . haStore: One DC is Not Enough Kansai DC1 OkinawaLocations Data Centers DC2 Kyushu Osaka Office DC1 DC1 DC2 Naha Office Network distance Network distance storage network Employee A …. Content / Social Metadata High Availability Data Store DC1 DC2 …. DC1 DC2 Business trip Store APIs Proposed Software Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 5/21 ... 5/21
  • 6. . haStore: Store Diversification • store = sum of multiple substores • in software: not a priority list -- optimization engine! • realtime performance monitoring, read/write optimization, etc. • sub-file data unit -- chunks SSD Growing network distance User HDD DC1 DC2 … Network Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 6/21 ... 6/21
  • 7. . haStore: Socially Aware Store • content relevance based on social graph • relevance is a distribution • individual redundancy based on distribution • other link types: same time, location, filetype, ... • link strength != 1 Descending order Relevance Distribution Redundancy (user setting) Physical limit of redundancy End of content There is a link When a file is … Between Created Viewed Edited Deleted Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 7/21 ... 7/21
  • 8. . hsStore: Software Design Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 8/21 ... 8/21
  • 9. . Design: Specs • many substores, heterogeneous e2e performance and capacity • each substore has its own API (Dropbox, GDrive, SSD, etc.), but haStore exports a generic API • data unit: sub-file blobs, for now fixed 100kb size • social graph is used to define priority lists of files ◦ different for each user • optimization is key element of software engines 1. sync logic 2. redundancy logic Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 9/21 ... 9/21
  • 10. . Design: API Stack • Generic API starts from Level 2, similar to drivers • the stack is implemented by each client = each user Employee A …. Content / Social Metadata High Availability Data Store DC1 DC2 …. Store Proposed Software Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 10/21 ... 10/21
  • 11. . Design: Sync Engine • optimization for throughput minimization • same logic for SSD, HDD and over-the-network haStore Storage Sync Engine Optimization Local Cache Check 1 2 Use GUI, Clients Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 11/21 ... 11/21
  • 12. . Design: Sync Engine Logic Bulk Throughput History Data Increase timeout Performance Tradeoff Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 12/21 ... 12/21
  • 13. . Design: Redundancy Logic (1) Descending order Relevance Distribution Redundancy (user setting) Physical limit of redundancy End of content Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 13/21 ... 13/21
  • 14. . Design: Redundancy Logic (2) Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 14/21 ... 14/21
  • 15. . haStore: Social Graph Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 15/21 ... 15/21
  • 16. . Social Graph : Basics • current version: only simple types of links • no link strength There is a link When a file is … Between Created Viewed Edited Deleted Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 16/21 ... 16/21
  • 17. . Social Graph : Advanced • community detection • files that could be linked: 1. touched at roughly the same time 2. touched by the same user 3. same location, filetype, size, etc. • link strength, different for each kind of relation, variable e2e cost on paths • discovery based on e2e cost, not hop count Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 17/21 ... 17/21
  • 18. . Implementation, Tests Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 18/21 ... 18/21
  • 19. . Performance : Demo A-san B-san DBX GDR 2014-01-22 12:13:30 Block DONE Block UPLOAD Block DOWNLOAD • also demo Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 19/21 ... 19/21
  • 20. . Wrapup • haStore: high availability cloud store • main features ◦ throughput-awaresync/redundancy optimization ◦ sub-file blocks, smart distribution ◦ social graph • current status: v1.0 in operation, v2.0 on the way Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 20/21 ... 20/21
  • 21. . That’s all, thank you ... Marat Zhanikeev -- maratishe@gmail.com High Availability Cloud Storage: Social, Throughput, Smart -- http://tinyurl.com/marat140417 21/21 ... 21/21