SlideShare a Scribd company logo
1 of 25
Download to read offline
ZCloud
Consensus on Hardware for Distributed Systems
3rd Workshop on Dependability,
May 8, Monday 2017, İYTE, https://goo.gl/fSVnZy
Gökhan Boranalp, gokhan@zetaops.io
“Simplicity is the ultimate sophistication.”
Leonardo Da Vinci
Road Map
➔ Problem Definition
➔ Our Solution
➔ ZCloud Components
◆ ZCloud Hardware
◆ ZCloud Cluster Management Tools
◆ ZCloud Protocol
➔ Benefits
➔ Similar Work
➔ Discussion
Problem Definition
Increasing,
➢ computing power,
➢ data storage, analysis and
➢ sophisticated network communication requirements
in modern “data centers”, reveals the strong need for
“distributed” operation for both networking devices and
applications.
Apache Mesos and Google Kubernetes, which are using
the container virtualization technique to develop, scale and
manage applications in distributed systems, have emerged.
These applications uses different "consensus" approaches
in their internal processes.
Problem Definition
Problem Definition
These approaches for distributed application development
seem to be efficient enough for the time being, yet new
approaches are needed in terms of latency, number of
transactions and throughput in distributed systems,
taking into account of physical boundaries and the increase
in the size of future applications and the number of cluster
members.
● With Mesos, practically 50,000 instance tests were
performed on 24,000 core 500 physical servers while
Kubernetes was tested on 500 physical servers.
Kubernetes Architecture
- Raft protocol,
- Based on etcd
Mesos Architecture
In distributed applications where the number of members in
the cluster increases, the separation of the consensus
related operations at the hardware level is essential for the
following reasons:
1. At the operating system level, messages broadcast on
the protocol stack cause latency.
2. It is necessary to increase the number of completed
transactions in the communication of distributed system
components and on the network unit (throughput).
Problem Definition
Problem Definition
3. For devices with limited storage and CPU computing
facilities that use embedded operating systems such as
IOT devices, it is also necessary to reduce the processing
burden due to "consensus" operations.
4. A better common consensus communication model is
needed for different applications that need to work together
in (BFT) environment.
Network Hardware-Accelerated Consensus, page 4, USI-INF-TR-2016-03
The Performance of Paxos in the Cloud, p. 46, DOI: 10.1109/SRDS.2014.15
ZooKeeper Throughput as the Read-Write Ratio Varies, https://goo.gl/8aR35E
ZCloud is an hardware and software solution for
distributed systems and conceived to provide
Byzantine fault tolerance (BFT) consensus primitives.
Our Solution
ZCloud came out as a solution for today's demands
such as distributed hardware and software solutions
and network components in data centers, distributed
software architectures with very high number of
components, IOT components for distributed
architecture, distributed data processing and
distributed data storage.
● ZCloud is ongoing research effort and is in early stage.
Our Solution
ZCloud Components - Hardware
* CPU, changeable, low energy consumption, high processor power CPU
* SoC and related hardware to manage distributed operations
* RAM, variable 4, 8, 16, 32, 64 GB, type will be determined
* 2 or 4 x Ethernet connections, changeable 40 GbE, 100 GbE ethernet
* SD card slot, Mini USB input, USB-C, EEPROM, Flash memory
* Wireless 802.11b, g, n, a Ethernet
Notes:
- The hardware components that can be used for IOT devices will be diversified
during the research.
- Storage is separate and distributed in cloud, see CEPH.
ZCloud Cluster Management Tool
ZCloud Cluster Manager enables ZCloud hardware to
be managed under Openstack for cloud adaptation
and use with existing systems.
ZCloud SDK will be developed for use with the Nova,
Neutron, Heath, Telemetry, Ironic, Manila, Magnum
components found in the Openstack system.
ZCloud Cluster Management Tool
Related information and operations in Openstack Horizon
panel will be available below;
● Add new hardware to the cluster,
● Remove hardware manually from cluster,
● Automatically mark and remove faulty hardware,
● Ensemble requested topologies,
● Collect logs (CPU, RAM, network load, parametric
values)
ZCloud Communication Protocol
● Newly designed Paxos based protocol.
○ Masquerade agents
○ Separation of request types
○ More to come, still in development.
Some Benefits
● Drop in usage with an existing cloud operated apps.
● Ability to develop fast applications for network communicating
devices.
● Ability to develop very *very* large applications.
● Reduction of electricity consumption (power consumption, cooling,
etc.) in data centers.
● The possibility of accelerating the analysis results by efficiently
distributing the analysis of the data derived from web applications
in a large cluster.
● Prevention of energy and resource losses due to inefficient
management of resource usage on the cloud.
Similar Work
● Network Hardware-Accelerated Consensus
○ CAANS provides a complete Paxos protocol, is a dropin
replacement for software-based implementations of Paxos,
makes no restrictions on network topologies, and is
implemented in a higher-level, data-plane programming
language, allowing for portability across a range of target
devices.
● Consensus in a Box: Inexpensive Coordination in
Hardware
○ Zookeeper’s atomic broadcast at the network level using an
FPGA.
Discussion
Dual-leader Master Election for Distributed Systems (Obiden), http://www.cse.scu.edu/~mwang2/projects/Distributed_dualLeaders_15s.pdf
The Performance of Paxos in the Cloud, http://sci-hub.cc/10.1109/SRDS.2014.15
The Performance of Paxos and Fast Paxos, http://www.ic.unicamp.br/~reltech/2008/08-35.pdf
Consensus in the Cloud: Paxos Systems Demystified, https://www.cse.buffalo.edu/tech-reports/2016-02.pdf
Seamless Paxos coordinators, http://sci-hub.cc/10.1007/s10586-013-0264-9
Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial
http://www-users.cselabs.umn.edu/classes/Spring-2014/csci8980-sds/Papers/ProcessReplication/p299-schneider.pdf
Holistic Configuration Management at Facebook, http://sigops.org/sosp/sosp15/current/2015-Monterey/printable/008-tang.pdf
Optimistic Replication, http://sci-hub.cc/10.1145/1057977.1057980
Dotted Version Vectors: Logical Clocks for Optimistic Replication, https://arxiv.org/pdf/1011.5808.pdf
In Search of an Understandable Consensus Algorithm, https://web.stanford.edu/~ouster/cgi-bin/papers/raft-atc14
Fast Quantum Byzantine Agreement, https://pdfs.semanticscholar.org/73ab/ef762dd61fdd388173f24f811e8693a79d7c.pdf
Asynchronous Consensus and Broadcast Protocols, http://zoo.cs.yale.edu/classes/cs426/2013/bib/bracha85asynchronous.pdf
Customizable and Extensible Deployment for Mobile/Cloud Applications, https://sapphire.cs.washington.edu/papers/sapphire-osdi14.pdf
Ovid: A Software-Defined Distributed Systems Framework,
https://www.usenix.org/system/files/conference/hotcloud16/hotcloud16_altinbuken.pdf
Resources
Resources
Network Hardware-Accelerated Consensus, https://arxiv.org/pdf/1605.05619.pdf
Consensus in a Box: Inexpensive Coordination in Hardware, https://www.usenix.org/system/files/conference/nsdi16/nsdi16-paper-istvan.pdf
HT-Paxos- High Throughput State-Machine Replication Protocol for Large Clustered Data Centers, https://arxiv.org/abs/1407.1237
Ring Paxos: A High-Throughput Atomic Broadcast Protocol, http://www.inf.usi.ch/phd/jalili/RingPaxos-DSN2010.pdf
http://sci-hub.cc/10.1109/SRDS.2014.15
https://infoscience.epfl.ch/record/49946/files/HUS+02b.pdf
http://libpaxos.sourceforge.net/files/Primim-SPLab08.pdf
http://www.ic.unicamp.br/~reltech/2008/08-35.pdf
http://www.inf.usi.ch/faculty/soule/2015-06-22-disn.pdf
Megastore: Providing Scalable, Highly Available Storage for Interactive Servicesx
https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/36971.pdf
A Beginner’s Guide to Understanding the Leaf-Spine Network Topology
http://blog.westmonroepartners.com/a-beginners-guide-to-understanding-the-leaf-spine-network-topology/
http://pbs.cs.berkeley.edu/
There Is More Consensus in Egalitarian Parliaments, https://www.cs.cmu.edu/~dga/papers/epaxos-sosp2013.pdf
Sinfonia: a new paradigm for building scalable distributed systems, http://www.sosp2007.org/papers/sosp064-aguilera.pdf
https://web.archive.org/web/20131017235612/http://www.temple.edu/cis/icdcs2013/data/5000a011.pdf
A Scalable Conflict-free Replicated Set Data Type,
https://web.archive.org/web/20131017233249/http://www.temple.edu/cis/icdcs2013/data/5000a186.pdf
FChain: Toward Black-box Online Fault Localization for Cloud Systems,
https://web.archive.org/web/20131017235525/http://www.temple.edu/cis/icdcs2013/data/5000a021.pdf
Diagnosing Data Center Behavior Flow by Flow,
https://web.archive.org/web/20131017235612/http://www.temple.edu/cis/icdcs2013/data/5000a011.pdf
Experimental Demonstration of a Quantum Protocol for Byzantine Agreement and Liar Detection, https://arxiv.org/pdf/0710.0290v2.pdf
https://tendermint.com/intro
Performance Comparison Between the Paxos and Chandra-Toueg Consensus Algorithms,
https://infoscience.epfl.ch/record/49946/files/HUS+02b.pdf
A Distributed Lock Manager Using Paxos Design and Implementation of Warlock, a Consensus Based Lock Manager,
http://uu.diva-portal.org/smash/get/diva2:615805/FULLTEXT01.pdf
Resources

More Related Content

What's hot

Docker 101 - all about Docker containers
Docker 101 - all about Docker containers Docker 101 - all about Docker containers
Docker 101 - all about Docker containers
Ian Lumb
 

What's hot (20)

Docker 101 - all about Docker containers
Docker 101 - all about Docker containers Docker 101 - all about Docker containers
Docker 101 - all about Docker containers
 
Pachyderm: Building a Big Data Beast On Kubernetes
Pachyderm: Building a Big Data Beast On KubernetesPachyderm: Building a Big Data Beast On Kubernetes
Pachyderm: Building a Big Data Beast On Kubernetes
 
KubeCon USA 2017 brief Overview - from Kubernetes meetup Bangalore
KubeCon USA 2017 brief Overview - from Kubernetes meetup BangaloreKubeCon USA 2017 brief Overview - from Kubernetes meetup Bangalore
KubeCon USA 2017 brief Overview - from Kubernetes meetup Bangalore
 
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
 
Beyond Ingresses - Better Traffic Management in Kubernetes
Beyond Ingresses - Better Traffic Management in KubernetesBeyond Ingresses - Better Traffic Management in Kubernetes
Beyond Ingresses - Better Traffic Management in Kubernetes
 
Google Anthos - Azure Stack - AWS Outposts :Comparison
Google Anthos - Azure Stack - AWS Outposts :ComparisonGoogle Anthos - Azure Stack - AWS Outposts :Comparison
Google Anthos - Azure Stack - AWS Outposts :Comparison
 
Containerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the CloudContainerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the Cloud
 
DevNexus 2015: Kubernetes & Container Engine
DevNexus 2015: Kubernetes & Container EngineDevNexus 2015: Kubernetes & Container Engine
DevNexus 2015: Kubernetes & Container Engine
 
Introduction to kubernetes
Introduction to kubernetesIntroduction to kubernetes
Introduction to kubernetes
 
Harnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceHarnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligence
 
Kubernetes: https://youtu.be/KnjnQj-FvfQ
Kubernetes: https://youtu.be/KnjnQj-FvfQKubernetes: https://youtu.be/KnjnQj-FvfQ
Kubernetes: https://youtu.be/KnjnQj-FvfQ
 
How Kubernetes make OpenStack & Ceph better
How Kubernetes make OpenStack & Ceph betterHow Kubernetes make OpenStack & Ceph better
How Kubernetes make OpenStack & Ceph better
 
Using Deep Learning Toolkits with Kubernetes clusters
Using Deep Learning Toolkits with Kubernetes clustersUsing Deep Learning Toolkits with Kubernetes clusters
Using Deep Learning Toolkits with Kubernetes clusters
 
How to Integrate Kubernetes in OpenStack
 How to Integrate Kubernetes in OpenStack  How to Integrate Kubernetes in OpenStack
How to Integrate Kubernetes in OpenStack
 
Cloud Native Applications on Kubernetes: a DevOps Approach
Cloud Native Applications on Kubernetes: a DevOps ApproachCloud Native Applications on Kubernetes: a DevOps Approach
Cloud Native Applications on Kubernetes: a DevOps Approach
 
What we Learned About Application Resiliency When the Data Center Burned Down
What we Learned About Application Resiliency When the Data Center Burned DownWhat we Learned About Application Resiliency When the Data Center Burned Down
What we Learned About Application Resiliency When the Data Center Burned Down
 
DDDP 2019 - Brown to Green
DDDP 2019  - Brown to GreenDDDP 2019  - Brown to Green
DDDP 2019 - Brown to Green
 
prodops.io k8s presentation
prodops.io k8s presentationprodops.io k8s presentation
prodops.io k8s presentation
 
Google Cloud Container Security Quick Overview
Google Cloud Container Security Quick OverviewGoogle Cloud Container Security Quick Overview
Google Cloud Container Security Quick Overview
 
Azure ai on premises with docker
Azure ai on premises with  dockerAzure ai on premises with  docker
Azure ai on premises with docker
 

Similar to ZCloud Consensus on Hardware for Distributed Systems

OpenPackProcessingAccelearation
OpenPackProcessingAccelearationOpenPackProcessingAccelearation
OpenPackProcessingAccelearation
Craig Nuzzo
 
CloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom ItaliaCloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom Italia
Gabriele Bozzi
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster Computing
NIKHIL NAIR
 
Seed block algorithm
Seed block algorithmSeed block algorithm
Seed block algorithm
Dipak Badhe
 

Similar to ZCloud Consensus on Hardware for Distributed Systems (20)

OpenPackProcessingAccelearation
OpenPackProcessingAccelearationOpenPackProcessingAccelearation
OpenPackProcessingAccelearation
 
Cluster computing
Cluster computingCluster computing
Cluster computing
 
ICCT2017: A user mode implementation of filtering rule management plane using...
ICCT2017: A user mode implementation of filtering rule management plane using...ICCT2017: A user mode implementation of filtering rule management plane using...
ICCT2017: A user mode implementation of filtering rule management plane using...
 
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
 
CloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom ItaliaCloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom Italia
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
Privacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storage
 
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
 
H017144148
H017144148H017144148
H017144148
 
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
 
云计算及其应用
云计算及其应用云计算及其应用
云计算及其应用
 
Introduction to Cloud Computing
Introduction to Cloud ComputingIntroduction to Cloud Computing
Introduction to Cloud Computing
 
Lecture_IIITD.pptx
Lecture_IIITD.pptxLecture_IIITD.pptx
Lecture_IIITD.pptx
 
Cluster cmputing
Cluster cmputingCluster cmputing
Cluster cmputing
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computers
 
8. 9590 1-pb
8. 9590 1-pb8. 9590 1-pb
8. 9590 1-pb
 
CPaaS.io Y1 Review Meeting - Cloud & Edge Programming
CPaaS.io Y1 Review Meeting - Cloud & Edge ProgrammingCPaaS.io Y1 Review Meeting - Cloud & Edge Programming
CPaaS.io Y1 Review Meeting - Cloud & Edge Programming
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster Computing
 
CC LECTURE NOTES (1).pdf
CC LECTURE NOTES (1).pdfCC LECTURE NOTES (1).pdf
CC LECTURE NOTES (1).pdf
 
Seed block algorithm
Seed block algorithmSeed block algorithm
Seed block algorithm
 

More from Gokhan Boranalp

More from Gokhan Boranalp (14)

Zkernel
ZkernelZkernel
Zkernel
 
Zetaops ZOPS presentation
Zetaops ZOPS presentationZetaops ZOPS presentation
Zetaops ZOPS presentation
 
Siz değil iş sizi nasıl bulur? GDG İzmir
Siz değil iş sizi nasıl bulur? GDG İzmir Siz değil iş sizi nasıl bulur? GDG İzmir
Siz değil iş sizi nasıl bulur? GDG İzmir
 
Kak depolama stratejiniz nasıl olmalı?
Kak depolama stratejiniz nasıl olmalı?Kak depolama stratejiniz nasıl olmalı?
Kak depolama stratejiniz nasıl olmalı?
 
Açık kaynak iş modeli nedir?
Açık kaynak iş modeli nedir?Açık kaynak iş modeli nedir?
Açık kaynak iş modeli nedir?
 
Buluta Ilk Adım Analizi
Buluta Ilk Adım AnaliziBuluta Ilk Adım Analizi
Buluta Ilk Adım Analizi
 
Acik Kaynak Kodlu Kurumsal Eposta Sistemleri
Acik Kaynak Kodlu Kurumsal Eposta SistemleriAcik Kaynak Kodlu Kurumsal Eposta Sistemleri
Acik Kaynak Kodlu Kurumsal Eposta Sistemleri
 
Kurumlar icin openstack rehberi
Kurumlar icin openstack rehberi Kurumlar icin openstack rehberi
Kurumlar icin openstack rehberi
 
Riak ve RiakCS
Riak ve RiakCSRiak ve RiakCS
Riak ve RiakCS
 
erlang 101
erlang 101erlang 101
erlang 101
 
Git 101
Git 101Git 101
Git 101
 
Scrum ve Redmine ile yazılım projesi yönetimi
Scrum ve Redmine ile yazılım projesi yönetimiScrum ve Redmine ile yazılım projesi yönetimi
Scrum ve Redmine ile yazılım projesi yönetimi
 
Nosql veritabanlari
Nosql veritabanlariNosql veritabanlari
Nosql veritabanlari
 
Erlang web-frameworks
Erlang web-frameworksErlang web-frameworks
Erlang web-frameworks
 

Recently uploaded

CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
anilsa9823
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 

Recently uploaded (20)

CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 

ZCloud Consensus on Hardware for Distributed Systems

  • 1. ZCloud Consensus on Hardware for Distributed Systems 3rd Workshop on Dependability, May 8, Monday 2017, İYTE, https://goo.gl/fSVnZy Gökhan Boranalp, gokhan@zetaops.io “Simplicity is the ultimate sophistication.” Leonardo Da Vinci
  • 2. Road Map ➔ Problem Definition ➔ Our Solution ➔ ZCloud Components ◆ ZCloud Hardware ◆ ZCloud Cluster Management Tools ◆ ZCloud Protocol ➔ Benefits ➔ Similar Work ➔ Discussion
  • 3. Problem Definition Increasing, ➢ computing power, ➢ data storage, analysis and ➢ sophisticated network communication requirements in modern “data centers”, reveals the strong need for “distributed” operation for both networking devices and applications.
  • 4. Apache Mesos and Google Kubernetes, which are using the container virtualization technique to develop, scale and manage applications in distributed systems, have emerged. These applications uses different "consensus" approaches in their internal processes. Problem Definition
  • 5. Problem Definition These approaches for distributed application development seem to be efficient enough for the time being, yet new approaches are needed in terms of latency, number of transactions and throughput in distributed systems, taking into account of physical boundaries and the increase in the size of future applications and the number of cluster members. ● With Mesos, practically 50,000 instance tests were performed on 24,000 core 500 physical servers while Kubernetes was tested on 500 physical servers.
  • 6. Kubernetes Architecture - Raft protocol, - Based on etcd
  • 8. In distributed applications where the number of members in the cluster increases, the separation of the consensus related operations at the hardware level is essential for the following reasons: 1. At the operating system level, messages broadcast on the protocol stack cause latency. 2. It is necessary to increase the number of completed transactions in the communication of distributed system components and on the network unit (throughput). Problem Definition
  • 9. Problem Definition 3. For devices with limited storage and CPU computing facilities that use embedded operating systems such as IOT devices, it is also necessary to reduce the processing burden due to "consensus" operations. 4. A better common consensus communication model is needed for different applications that need to work together in (BFT) environment.
  • 10. Network Hardware-Accelerated Consensus, page 4, USI-INF-TR-2016-03
  • 11. The Performance of Paxos in the Cloud, p. 46, DOI: 10.1109/SRDS.2014.15
  • 12. ZooKeeper Throughput as the Read-Write Ratio Varies, https://goo.gl/8aR35E
  • 13. ZCloud is an hardware and software solution for distributed systems and conceived to provide Byzantine fault tolerance (BFT) consensus primitives. Our Solution
  • 14. ZCloud came out as a solution for today's demands such as distributed hardware and software solutions and network components in data centers, distributed software architectures with very high number of components, IOT components for distributed architecture, distributed data processing and distributed data storage. ● ZCloud is ongoing research effort and is in early stage. Our Solution
  • 15.
  • 16. ZCloud Components - Hardware * CPU, changeable, low energy consumption, high processor power CPU * SoC and related hardware to manage distributed operations * RAM, variable 4, 8, 16, 32, 64 GB, type will be determined * 2 or 4 x Ethernet connections, changeable 40 GbE, 100 GbE ethernet * SD card slot, Mini USB input, USB-C, EEPROM, Flash memory * Wireless 802.11b, g, n, a Ethernet Notes: - The hardware components that can be used for IOT devices will be diversified during the research. - Storage is separate and distributed in cloud, see CEPH.
  • 17. ZCloud Cluster Management Tool ZCloud Cluster Manager enables ZCloud hardware to be managed under Openstack for cloud adaptation and use with existing systems. ZCloud SDK will be developed for use with the Nova, Neutron, Heath, Telemetry, Ironic, Manila, Magnum components found in the Openstack system.
  • 18. ZCloud Cluster Management Tool Related information and operations in Openstack Horizon panel will be available below; ● Add new hardware to the cluster, ● Remove hardware manually from cluster, ● Automatically mark and remove faulty hardware, ● Ensemble requested topologies, ● Collect logs (CPU, RAM, network load, parametric values)
  • 19. ZCloud Communication Protocol ● Newly designed Paxos based protocol. ○ Masquerade agents ○ Separation of request types ○ More to come, still in development.
  • 20. Some Benefits ● Drop in usage with an existing cloud operated apps. ● Ability to develop fast applications for network communicating devices. ● Ability to develop very *very* large applications. ● Reduction of electricity consumption (power consumption, cooling, etc.) in data centers. ● The possibility of accelerating the analysis results by efficiently distributing the analysis of the data derived from web applications in a large cluster. ● Prevention of energy and resource losses due to inefficient management of resource usage on the cloud.
  • 21. Similar Work ● Network Hardware-Accelerated Consensus ○ CAANS provides a complete Paxos protocol, is a dropin replacement for software-based implementations of Paxos, makes no restrictions on network topologies, and is implemented in a higher-level, data-plane programming language, allowing for portability across a range of target devices. ● Consensus in a Box: Inexpensive Coordination in Hardware ○ Zookeeper’s atomic broadcast at the network level using an FPGA.
  • 23. Dual-leader Master Election for Distributed Systems (Obiden), http://www.cse.scu.edu/~mwang2/projects/Distributed_dualLeaders_15s.pdf The Performance of Paxos in the Cloud, http://sci-hub.cc/10.1109/SRDS.2014.15 The Performance of Paxos and Fast Paxos, http://www.ic.unicamp.br/~reltech/2008/08-35.pdf Consensus in the Cloud: Paxos Systems Demystified, https://www.cse.buffalo.edu/tech-reports/2016-02.pdf Seamless Paxos coordinators, http://sci-hub.cc/10.1007/s10586-013-0264-9 Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial http://www-users.cselabs.umn.edu/classes/Spring-2014/csci8980-sds/Papers/ProcessReplication/p299-schneider.pdf Holistic Configuration Management at Facebook, http://sigops.org/sosp/sosp15/current/2015-Monterey/printable/008-tang.pdf Optimistic Replication, http://sci-hub.cc/10.1145/1057977.1057980 Dotted Version Vectors: Logical Clocks for Optimistic Replication, https://arxiv.org/pdf/1011.5808.pdf In Search of an Understandable Consensus Algorithm, https://web.stanford.edu/~ouster/cgi-bin/papers/raft-atc14 Fast Quantum Byzantine Agreement, https://pdfs.semanticscholar.org/73ab/ef762dd61fdd388173f24f811e8693a79d7c.pdf Asynchronous Consensus and Broadcast Protocols, http://zoo.cs.yale.edu/classes/cs426/2013/bib/bracha85asynchronous.pdf Customizable and Extensible Deployment for Mobile/Cloud Applications, https://sapphire.cs.washington.edu/papers/sapphire-osdi14.pdf Ovid: A Software-Defined Distributed Systems Framework, https://www.usenix.org/system/files/conference/hotcloud16/hotcloud16_altinbuken.pdf Resources
  • 24. Resources Network Hardware-Accelerated Consensus, https://arxiv.org/pdf/1605.05619.pdf Consensus in a Box: Inexpensive Coordination in Hardware, https://www.usenix.org/system/files/conference/nsdi16/nsdi16-paper-istvan.pdf HT-Paxos- High Throughput State-Machine Replication Protocol for Large Clustered Data Centers, https://arxiv.org/abs/1407.1237 Ring Paxos: A High-Throughput Atomic Broadcast Protocol, http://www.inf.usi.ch/phd/jalili/RingPaxos-DSN2010.pdf http://sci-hub.cc/10.1109/SRDS.2014.15 https://infoscience.epfl.ch/record/49946/files/HUS+02b.pdf http://libpaxos.sourceforge.net/files/Primim-SPLab08.pdf http://www.ic.unicamp.br/~reltech/2008/08-35.pdf http://www.inf.usi.ch/faculty/soule/2015-06-22-disn.pdf Megastore: Providing Scalable, Highly Available Storage for Interactive Servicesx https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/36971.pdf A Beginner’s Guide to Understanding the Leaf-Spine Network Topology http://blog.westmonroepartners.com/a-beginners-guide-to-understanding-the-leaf-spine-network-topology/ http://pbs.cs.berkeley.edu/ There Is More Consensus in Egalitarian Parliaments, https://www.cs.cmu.edu/~dga/papers/epaxos-sosp2013.pdf
  • 25. Sinfonia: a new paradigm for building scalable distributed systems, http://www.sosp2007.org/papers/sosp064-aguilera.pdf https://web.archive.org/web/20131017235612/http://www.temple.edu/cis/icdcs2013/data/5000a011.pdf A Scalable Conflict-free Replicated Set Data Type, https://web.archive.org/web/20131017233249/http://www.temple.edu/cis/icdcs2013/data/5000a186.pdf FChain: Toward Black-box Online Fault Localization for Cloud Systems, https://web.archive.org/web/20131017235525/http://www.temple.edu/cis/icdcs2013/data/5000a021.pdf Diagnosing Data Center Behavior Flow by Flow, https://web.archive.org/web/20131017235612/http://www.temple.edu/cis/icdcs2013/data/5000a011.pdf Experimental Demonstration of a Quantum Protocol for Byzantine Agreement and Liar Detection, https://arxiv.org/pdf/0710.0290v2.pdf https://tendermint.com/intro Performance Comparison Between the Paxos and Chandra-Toueg Consensus Algorithms, https://infoscience.epfl.ch/record/49946/files/HUS+02b.pdf A Distributed Lock Manager Using Paxos Design and Implementation of Warlock, a Consensus Based Lock Manager, http://uu.diva-portal.org/smash/get/diva2:615805/FULLTEXT01.pdf Resources