Strict-Data-Consistency-in-Distrbuted-Systems-With-Failures

Cacheonix:
Architecture for
Strict Data Consistency
in Distributed Systems
with Failures
Slava Imeshev
simeshev@cacheonix.org
July 29, 2015

Agenda

•  Strict data consistency
•  Lessons learned
•  Q&A

Introductions

Slava Imeshev:
•  Management style: my team is my family
•  For fun: sci-fi, hard rock, hiking, camping
•  Hobbies: software development, ham radio
•  E-mail: simeshev@cacheonix.org

Cacheonix
https://github.com/cacheonix/cacheonix-core

Cacheonix is an open source distributed Java
cache:
–  Strict data consistency
–  Horizontal scalability
–  Fault-tolerance
–  Concurrency
–  Distributed state sharing
–  Coherent front cache
–  Distributed locks
–  Compute grid with data affinity
–  Load balancing

Strict Data Consistency

•  A guarantee that once a key has been
updated, all members of the cluster see
the new value
•  Knowing where the key value is at all times

Architecture for Strict Data
Consistency

These key components working together…
•  Replicated state machine
•  Cluster management protocol
•  Reliable totally-ordered multicast protocol
•  State transfer on join
•  P2P protocol with re-transmits
… make it possible to know EXACTLY where
the data is in the cluster.

Replicated State Machine

•  Maintains a consistent replicated configuration
of the cluster by:
•  Executing cluster, cache and partition configuration
events
–  On all members of the cluster
–  In the same total order
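The idea on this slide can be illustrated with a minimal Java sketch. It is not Cacheonix code: the class `ClusterConfigSketch`, the `ConfigEvent` interface and the sequence-number check are hypothetical names and simplifications chosen for illustration. The point it shows is that replicas which execute the same deterministic events in the same order end up with the same configuration.

```java
import java.util.SortedSet;
import java.util.TreeSet;

/** Hypothetical sketch of a deterministic, replicated configuration state machine. */
final class ClusterConfigSketch {

    /** A configuration event; illustrative only, not a Cacheonix interface. */
    interface ConfigEvent {
        void applyTo(ClusterConfigSketch config);
    }

    private final SortedSet<String> members = new TreeSet<>();
    private long lastApplied = 0; // sequence number of the last executed event

    /** Executes the event only if it is the next one in the total order. */
    boolean execute(long sequence, ConfigEvent event) {
        if (sequence != lastApplied + 1) {
            return false; // gap or duplicate: events must arrive in order
        }
        event.applyTo(this);
        lastApplied = sequence;
        return true;
    }

    void addMember(String address) { members.add(address); }

    void removeMember(String address) { members.remove(address); }

    SortedSet<String> members() { return members; }

    long lastApplied() { return lastApplied; }

    /** Assumed event type: a node joined the cluster. */
    static ConfigEvent join(final String address) {
        return config -> config.addMember(address);
    }

    /** Assumed event type: a node left the cluster. */
    static ConfigEvent leave(final String address) {
        return config -> config.removeMember(address);
    }
}
```

Two instances fed the same numbered events report identical `members()`, while an event that arrives out of sequence is rejected rather than applied.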

Cluster Management Protocol

•  Detects nodes joining and failing
•  Maintains replicated cluster view
•  Feeds the cluster events in total order to
the reliable totally ordered multicast protocol
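The slide does not say how joins and failures are detected. One common approach is a heartbeat timeout, sketched below under that assumption. `HeartbeatDetectorSketch` and its methods are hypothetical and are not taken from Cacheonix.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical heartbeat-based detector; the timeout approach is an assumption. */
final class HeartbeatDetectorSketch {

    private final long timeoutMillis;
    private final Map<String, Long> lastSeen = new HashMap<>();

    HeartbeatDetectorSketch(long timeoutMillis) {
        this.timeoutMillis = timeoutMillis;
    }

    /** Records a heartbeat; returns true if the node was not known before (a join). */
    boolean heartbeat(String node, long nowMillis) {
        return lastSeen.put(node, nowMillis) == null;
    }

    /** Removes and returns, in sorted order, the nodes silent for longer than the timeout. */
    List<String> detectFailures(long nowMillis) {
        List<String> failed = new ArrayList<>();
        for (Map.Entry<String, Long> entry : lastSeen.entrySet()) {
            if (nowMillis - entry.getValue() > timeoutMillis) {
                failed.add(entry.getKey());
            }
        }
        lastSeen.keySet().removeAll(failed);
        Collections.sort(failed);
        return failed;
    }
}
```

In a design like the one on this slide, each detected join or failure would become an event fed into the total order, so that every member updates its cluster view at the same logical point.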

Reliable Totally Ordered
Multicast

•  Carries cache member events (leave/join)
•  Carries partition configuration messages
•  Executes the replicated bucket ownership
assignment table, which is part of the
replicated state machine
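The slides do not describe how the total order is implemented. The sketch below shows only the receiving side of one generic technique: messages carry a sequence number, and a hold-back queue delays delivery until every earlier message has arrived. `OrderedDeliverySketch` is a hypothetical name, and the sequence-number scheme is an assumption, not the Cacheonix protocol.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical receiver that delivers sequenced messages strictly in order. */
final class OrderedDeliverySketch {

    private final Map<Long, String> holdBack = new HashMap<>();
    private final List<String> delivered = new ArrayList<>();
    private long nextExpected = 1;

    /** Accepts a message in any arrival order; delivers as soon as there are no gaps. */
    void receive(long sequence, String message) {
        if (sequence < nextExpected) {
            return; // duplicate of an already delivered message
        }
        holdBack.put(sequence, message);
        String next;
        while ((next = holdBack.remove(nextExpected)) != null) {
            delivered.add(next);
            nextExpected++;
        }
    }

    /** Messages delivered so far, in total order. */
    List<String> delivered() {
        return delivered;
    }
}
```

Because every receiver delivers by sequence number, two members that see the network traffic in different arrival orders still execute join, leave and partition configuration messages in the same order.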

State Transfer on Join

•  When a node joins a cluster, it receives a
copy of the replicated state machine from its
join coordinator
•  Total order of events, including join / leave,
guarantees that events are executed in this
order on all members of the cluster:
•  At t0 there is no new node
•  At t1 there is a new member fully aware of cluster
topology and data bucket locations, and ready to
operate
•  At t2 the replicated state machine begins to execute
the repartitioning protocol to move data to the new
member of the cluster
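The t0 / t1 / t2 sequence can be sketched as a snapshot install followed by normal in-order execution. The types below (`StateTransferSketch`, `Snapshot`, `Replica`) are hypothetical and reduce the replicated state to a bucket-to-owner map for brevity; they are not the Cacheonix classes.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of state transfer on join. */
final class StateTransferSketch {

    /** Copy of the replicated state as of a known position in the total order. */
    static final class Snapshot {
        final long lastApplied;
        final Map<Integer, String> bucketOwners;

        Snapshot(long lastApplied, Map<Integer, String> bucketOwners) {
            this.lastApplied = lastApplied;
            this.bucketOwners = new HashMap<>(bucketOwners);
        }
    }

    /** One member's copy of the replicated state machine (simplified). */
    static final class Replica {
        private long lastApplied;
        private final Map<Integer, String> bucketOwners = new HashMap<>();

        /** Executes an ownership assignment event at its place in the total order. */
        void assign(long sequence, int bucket, String owner) {
            if (sequence != lastApplied + 1) {
                throw new IllegalStateException("out of order: " + sequence);
            }
            bucketOwners.put(bucket, owner);
            lastApplied = sequence;
        }

        /** Taken by the join coordinator (between t0 and t1). */
        Snapshot snapshot() {
            return new Snapshot(lastApplied, bucketOwners);
        }

        /** Installed by the joining node; afterwards it executes events like any member. */
        void install(Snapshot snapshot) {
            bucketOwners.clear();
            bucketOwners.putAll(snapshot.bucketOwners);
            lastApplied = snapshot.lastApplied;
        }

        Map<Integer, String> bucketOwners() { return bucketOwners; }

        long lastApplied() { return lastApplied; }
    }
}
```

After `install`, the joiner continues from the coordinator's position in the order, so the repartitioning events at t2 are executed identically on the old and the new members.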

P2P Protocol With Retransmits

•  Carries data modification messages in the
cluster (get, put, execute, etc.)
•  Automatically resends messages if a partition
is undergoing re-configuration (move, replicate,
restore, etc.)
•  Ensures that reads and writes to a key are
served only by the guaranteed owner of the
key.
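A rough sketch of the resend behavior follows, with in-process calls standing in for network messages. `RetransmitSketch`, `Owner` and `putWithRetry` are hypothetical names, and the bounded retry count and `betweenAttempts` hook are illustrative assumptions rather than the protocol's actual policy.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of a request that is resent while a bucket is reconfiguring. */
final class RetransmitSketch {

    enum BucketState { OPERATIONAL, RECONFIGURING }

    /** The member that currently owns the bucket holding the key. */
    static final class Owner {
        BucketState state = BucketState.OPERATIONAL;
        final Map<String, String> data = new HashMap<>();

        /** Returns false when the bucket is reconfiguring and the request must be resent. */
        boolean put(String key, String value) {
            if (state == BucketState.RECONFIGURING) {
                return false;
            }
            data.put(key, value);
            return true;
        }
    }

    /**
     * Resends the request until the owner accepts it or attempts run out.
     * Returns the number of attempts used, or -1 if the request was never accepted.
     */
    static int putWithRetry(Owner owner, String key, String value,
                            int maxAttempts, Runnable betweenAttempts) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            if (owner.put(key, value)) {
                return attempt;
            }
            betweenAttempts.run(); // e.g. wait, or re-resolve the owner of the key
        }
        return -1;
    }
}
```

The write is never applied by a member whose bucket is in transition; it lands only once the bucket is operational again, which is what keeps a single guaranteed owner serving the key.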

Member Failure Example

1.  Member fails; then, on all nodes, synchronously:
2.  Cluster management protocol executes the command Node
Left of the state machine ClusterView
3.  ClusterView executes the Remove Node command of the
state machine BucketOwnershipAssingmentTable
4.  BucketOwnershipAssingmentTable executes the
repartitioning algorithm
5.  Repartitioning algorithm marks buckets as
reconfiguring and sends P2P messages to move
buckets around
6.  On completion, P2P messages send a reliable multicast
message Move Complete
7.  BucketOwnershipAssingmentTable marks buckets as
operational
8.  All members of the cluster are in the same state.
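The steps above can be sketched as a small deterministic table. This is an illustration only: `FailureHandlingSketch` and its methods are hypothetical, the round-robin reassignment is an assumed stand-in for the real repartitioning algorithm, and the actual movement of bucket data over P2P is omitted.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch of the member failure sequence on one replica. */
final class FailureHandlingSketch {

    static final class Bucket {
        String owner;
        boolean reconfiguring;

        Bucket(String owner) { this.owner = owner; }
    }

    private final List<String> members = new ArrayList<>();       // simplified cluster view
    private final Map<Integer, Bucket> buckets = new TreeMap<>(); // simplified ownership table

    FailureHandlingSketch(List<String> initialMembers, int bucketCount) {
        members.addAll(initialMembers);
        for (int i = 0; i < bucketCount; i++) {
            buckets.put(i, new Bucket(members.get(i % members.size())));
        }
    }

    /** Steps 2-5: Node Left, Remove Node, repartition. Returns the buckets that must move. */
    List<Integer> nodeLeft(String failed) {
        members.remove(failed);
        if (members.isEmpty()) {
            throw new IllegalStateException("no surviving members");
        }
        List<Integer> toMove = new ArrayList<>();
        int next = 0;
        for (Map.Entry<Integer, Bucket> entry : buckets.entrySet()) {
            Bucket bucket = entry.getValue();
            if (bucket.owner.equals(failed)) {
                // Assumed policy: hand orphaned buckets to survivors round-robin.
                bucket.owner = members.get(next++ % members.size());
                bucket.reconfiguring = true;
                toMove.add(entry.getKey());
            }
        }
        return toMove;
    }

    /** Steps 6-7: Move Complete arrives, the bucket becomes operational again. */
    void moveComplete(int bucket) {
        buckets.get(bucket).reconfiguring = false;
    }

    String ownerOf(int bucket) { return buckets.get(bucket).owner; }

    boolean isReconfiguring(int bucket) { return buckets.get(bucket).reconfiguring; }
}
```

Since `nodeLeft` depends only on the table contents and the failed node, every member that executes it at the same point in the total order computes the same new owners, which is what step 8 relies on.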

Lessons Learned

•  Tackle hard problems first:
–  Hard problems define the architecture
–  Hard problems drive the schedule
–  Start with handling failure modes
•  Make unknowns known: do research

Cacheonix Roadmap

•  Fully-replicated cache
•  Weighted partitioning
•  Read/write affinity
•  Cluster-optimized serialization
•  Version-based clustering
•  Off-heap storage

Q&A

Ask me anything!

Slava Imeshev

simeshev@cacheonix.org
