There are many reasons to use an eventually-consistent database — like Riak, Voldemort, or Cassandra — including increased availability, lower latency, and fault-tolerance. However, doing so requires a mental shift in how to structure client applications, and certain types of traditional data-structures, like sets, registers, and counters, can’t be resolved simply in the face of race-conditions. It is difficult to achieve “logical monotonicity” except for the most trivial data-types.
That is, until the advent of Convergent Replicated Data Types (CRDTs). CRDTs are data-structures that tolerate eventual consistency. They replace traditional data-structure implementations and all have the property that, given any number of conflicting versions of the same datum, there is a single state on which they converge (monotonicity). This talk will discuss some of the most useful CRDTs and how to apply them to solve real-world data problems.
This talk introduces the Full Text Search plugin, which adds support for the Groonga full-text search engine to Redmine. With this plugin, you can quickly find the information you need even in a Redmine instance with hundreds of thousands of tickets. Beyond the current speed improvements, the talk also covers future plans for making better use of the data inside Redmine.
OSMC 2022 | VictoriaMetrics: scaling to 100 million metrics per second by Ali... (NETWAYS)
The growth of observability trends and Kubernetes adoption generates more demanding requirements for monitoring systems. Volumes of time series data increase exponentially, and old solutions just can’t keep up with the pace. The talk will cover how and why we created a new open source time series database from scratch, and which architectural decisions and trade-offs we had to make in order to meet the new expectations and handle 100 million metrics per second with VictoriaMetrics. The talk will be interesting for software engineers and DevOps practitioners familiar with observability and modern monitoring systems, and for those who are interested in building scalable, high-performance time series databases.
Prometheus has become the de facto monitoring system for cloud native applications, with systems like Kubernetes and Etcd natively exposing Prometheus metrics. In this talk Tom will explore all the moving parts of a working Prometheus-on-Kubernetes monitoring system, including kube-state-metrics, node-exporter, cAdvisor and Grafana. You will learn about the various methods for getting to a working setup: the manual approach, using CoreOS's Prometheus Operator, or using the Prometheus Ksonnet Mixin. Tom will also share some tips and tricks for getting the most out of your Prometheus monitoring, including the common pitfalls and what you should be alerting on.
Monitoring What Matters: The Prometheus Approach to Whitebox Monitoring (Berl... (Brian Brazil)
Often what you monitor and get alerted on is defined by your tools, rather than by what makes the most sense to you and your organisation. Alerts on metrics such as CPU usage are noisy and rarely spot real problems, while outages go undetected. Monitoring systems can also be challenging to maintain, and overall provide a poor return on investment.
In the past few years several new monitoring systems have appeared with more powerful semantics and which are easier to run, offering a way to vastly improve how your organisation operates. Prometheus is one such system. This talk will look at the monitoring ideal and how whitebox monitoring with a time series database, multi-dimensional labels and a powerful querying/alerting language can free you from midnight pages.
Getting Ready to Use Redis with Apache Spark with Dvir Volk (Spark Summit)
Getting Ready to use Redis with Apache Spark is a technical tutorial designed to address integrating Redis with an Apache Spark deployment to increase the performance of serving complex decision models. To set the context for the session, we start with a quick introduction to Redis and the capabilities Redis provides. We cover the basic data types provided by Redis and cover the module system. Using an ad serving use-case, we look at how Redis can improve the performance and reduce the cost of using complex ML models in production. Attendees will be guided through the key steps of setting up and integrating Redis with Spark, including how to train a model using Spark then load and serve it using Redis, as well as how to work with the Spark Redis module. The capabilities of the Redis Machine Learning Module (redis-ml) will be discussed, focusing primarily on decision trees and regression (linear and logistic), with code examples to demonstrate how to use these features. At the end of the session, developers should feel confident building a prototype/proof-of-concept application using Redis and Spark. Attendees will understand how Redis complements Spark and how to use Redis to serve complex ML models with high performance.
2018 Jul 25th LINE Developer Meetup #41 in Fukuoka
Session slides in English / These are the session slides.
Graal in GraalVM - A New JIT Compiler
Oracle has announced GraalVM, and it has been generating a lot of buzz. GraalVM combines a new JIT compiler, Graal, running on the HotSpot VM; Truffle, a language-implementation framework and AST interpreter; and a native-image generation capability together with SubstrateVM, which is used to run those images. Implementations of JavaScript, Ruby, R, and Python built on Truffle are already available, and these languages and Java can call each other from code. This session gives an overview of GraalVM and then focuses in particular on the Graal JIT compiler. Graal and Truffle are being researched jointly by Oracle Labs and Johannes Kepler University, and many papers have been published. While comparing performance and structure with HotSpot's JIT compilers, I will also touch on some of Graal's JIT compilation techniques. In short, I simply love Graal. With demos along the way, I hope to convey just how impressive Graal is.
HBaseCon 2012 | Storing and Manipulating Graphs in HBase (Cloudera, Inc.)
Google’s original use case for BigTable was the storage and processing of web graph information, represented as sparse matrices. However, many organizations tend to treat HBase as merely a “web scale” RDBMS. This session will cover several use cases for storing graph data in HBase, including social networks and web link graphs; MapReduce processes like cached traversal, PageRank, and clustering; and lastly some lower-level modeling details like row key and column qualifier design, using FullContact’s graph processing systems as a real-world use case.
Consistency without Consensus: CRDTs in Production at SoundCloud (C4Media)
Video and slides synchronized, mp3 and slide download available at URL http://bit.ly/1DKnwXr.
Peter Bourgon provides a practical introduction to Conflict-free Replicated Data Types (CRDTs) and describes a production CRDT system built at SoundCloud to serve several product features. Filmed at qconsf.com.
Peter Bourgon is a distributed systems engineer who has seen things. He works at SoundCloud, building and improving the infrastructure that powers the world's largest audio platform.
Guaranteeing Consensus in Distributed Systems with CRDTs (Sun-Li Beatteay)
Consensus in distributed systems has been a debated topic ever since programmers discovered they could run the same program on multiple machines. Researchers have been studying consensus for decades, resulting in numerous algorithms and white papers. Unfortunately, many of these algorithms are flawed and unreliable.
However, in 2011, a team of researchers published a paper on a novel approach to distributed consensus using Conflict-free Replicated Data Types (https://hal.inria.fr/inria-00609399v1...). This paper created quite a buzz as it showed that CRDTs were mathematically proven to guarantee consensus through "Strong Eventual Consistency." They also claimed to have solved the CAP conundrum.
This presentation dives into this seminal paper in order to answer the hard questions. What are CRDTs? How do they work? And most importantly, does it actually solve CAP? By the end of this talk, everyone in the audience will have a foundational understanding of CRDTs and how they can be applied to their own work.
Best of all, I will be explaining all of this in as simple language as possible. No advanced math degree required! Sound too good to be true? You'll just have to come see for yourself!
Embrace NoSQL and Eventual Consistency with Ripple (Sean Cribbs)
So, there's this "NoSQL" thing you may have heard of, and this related thing called "eventual consistency". Supposedly, they help you scale, but no one has ever explained why! Well, wonder no more! This talk will demystify NoSQL, eventual consistency, how they might help you scale, and -- most importantly -- why you should care.
We'll look closely at how Riak, a linearly-scalable, distributed and fault-tolerant NoSQL datastore, implements eventual consistency, and how you can harness it from Ruby via the slick Ripple client/ORM. When the talk is finished, you'll have the tools both to understand eventual consistency and to handle it like a pro inside your next Ruby application.
Here we describe how to "think" MapReduce, not just "code" MapReduce. We solve some interesting problems using MapReduce (e.g., how to compute similarity between all pairs of documents on the web, how to do k-means clustering using MapReduce, and how to find cliques in a graph using MapReduce). These solutions are simple and elegant, and open up new ways for people to actually use MapReduce for more than just simple number crunching.
Incremental View Maintenance for openCypher Queries (Gábor Szárnyas)
Presented at the Fourth openCypher Implementers Meeting
Numerous graph use cases require continuous evaluation of queries over a constantly changing data set, e.g. fraud detection in financial systems, recommendations, and checking integrity constraints. For relational systems, incremental view maintenance has been researched for three decades, resulting in a wide body of literature. The property graph data model and the openCypher language, however, are recent developments, and therefore lack established techniques to perform efficient view maintenance. In this talk, we give an overview of the view maintenance problem for property graphs, discuss why it is particularly difficult and present an approach that tackles a meaningful subset of the language.
This case study gives an inside look at optimization of the MongoDB Perl driver, including custom benchmarking tools, step-by-step changes and results that will surprise and amaze. If you ever needed to optimize some Perl and wondered how people go about it, this talk is for you.
Flowr offers a scatter-gather approach, and works by submitting a web of jobs to the computing cluster using dependencies. It requires two simple data.frames as inputs and supports platforms like LSF, SGE, Torque and SLURM.
Concurrency in Ruby is all the rage these days, and people can't seem to agree whether Threads, Fibers, event loops, or actors are the best solution. But did you ever consider that your *sequential* Ruby program might be concurrent, with nary a Thread, Fiber, or callback in sight? Well, it happened to me.
This is the story of how accidental concurrency (also known as re-entrancy) broke my brain multiple times over the course of two years and spawned flamewars on Twitter and long blog posts, and of the various solutions I tried. Along the way we'll illuminate some subtleties of concurrent programming in Ruby, differences between several Ruby implementations, and how we can all write code that is friendlier when accidental concurrency strikes.
A discussion of strategies for designing application schemas that use the Riak distributed key-value store.
Video available here: http://vimeo.com/17604126
Ruby on Rails is a powerful web framework that focuses on developer productivity. Riak is a friendly key-value store that is simple, flexible and scalable. Put them together and you have lots of exciting possibilities!
Moving to Riak involves a number of changes from the status quo of RDBMS systems, one of which is taking greater control over your schema design.
* How do you structure data when you don't have tables and foreign keys?
* When should you denormalize, add links, or create map-reduce queries?
* Where will Riak be a natural fit and where will it be challenging?
An introduction to Basho's Riak distributed data store, and the Ripple client in Ruby. Code samples from the demos are here: http://gist.github.com/365791
Many developers will be familiar with lex, flex, yacc, bison, ANTLR, and other related tools to generate parsers for use inside their own code. For recognizing computer-friendly languages, however, context-free grammars and their parser-generators leave a few things to be desired. This is about how the seemingly simple prospect of parsing some text turned into a new parser toolkit for Erlang, and why functional programming makes parsing fun and awesome.
Software projects are rarely on-spec, on-time and on-budget, and the primary cause is miscommunication. As Martin Fowler says, there is a "yawning crevasse of doom" between stakeholders and developers, full of misunderstanding. How do you make sure that you're building something that adds value? How do you know you're building the thing that was asked for? How does your bottom line affect user experience?
Into the fray leaps Cucumber, a business-readable DSL combined with an awesome Ruby library that lets domain experts express business requirements as executable user stories. We'll cover outside-in, story-driven development with Cucumber, how to write effective stories, and how to make Cucumber work for your project.
(as given to CharlotteRuby on Jan 6, 2010)
Most developers will be familiar with lex, flex, yacc, bison, ANTLR, and other tools to generate parsers for use inside their own code. Erlang, the concurrent functional programming language, has its own pair, leex and yecc, for accomplishing most complicated text-processing tasks. This talk is about how the seemingly simple prospect of parsing text turned into a new parser toolkit for Erlang, and why functional programming makes parsing fun and awesome.
Reviews grammar and parsers and discusses my personal path toward writing my own packrat parser-generator for Erlang called neotoma.
Given to "Evil Robot Conference" on September 12, 2009.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ... (James Anderson)
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with a passion for making things work, along with a knack for helping others understand how things work. He has around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations on CI/CD and application security integrated into the software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
UiPath Test Automation using UiPath Test Suite series, part 4 (DianaGray10)
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques.
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf (Peter Spielvogel)
Building better applications for business users with SAP Fiori.
• What is SAP Fiori and why it matters to you
• How a better user experience drives measurable business benefits
• How to get started with SAP Fiori today
• How SAP Fiori elements accelerates application development
• How SAP Build Code includes SAP Fiori tools and other generative artificial intelligence capabilities
• How SAP Fiori paves the way for using AI in SAP apps
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024 (Albert Hoitingh)
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview, including the concepts of Customer Key and Double Key Encryption.
GraphRAG is All You need? LLM & Knowledge Graph (Guy Korland)
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
DevOps and Testing slides at DASA Connect (Kari Kakkonen)
Slides by me and Rik Marselis at the DASA Connect conference on 30.5.2024. We discuss what testing is, then what agile testing is, and finally what testing in DevOps looks like. We finished with a lovely workshop in which the participants tried to find different ways to think about quality and testing in different parts of the DevOps infinity loop.
Generative AI Deep Dive: Advancing from Proof of Concept to Production (Aggregage)
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -... (DanBrown980551)
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses.
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024 (Neo4j)
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Climate Impact of Software Testing at Nordic Testing Days (Kari Kakkonen)
My slides from Nordic Testing Days, 6.6.2024.
The talk discusses the climate impact and sustainability of software testing. ICT and testing must carry their part of the global responsibility to help with climate warming. We can minimize the carbon footprint, but we can also have a carbon handprint, a positive impact on the climate. Sustainability can be added to the quality characteristics and then measured continuously. Test environments can be used less, at smaller scale, and on demand. Test techniques can be used to optimize or minimize the number of tests. Test automation can be used to speed up testing.
13-15. Safety / Liveness
Proving the Correctness of Multiprocess Programs - Leslie Lamport (March 1977)
•Safety: “nothing bad happens” (partial correctness)
•Liveness: “something good eventually happens” (termination)
“Safety and liveness: Eventual consistency is not safe” - Peter Bailis
http://www.bailis.org/blog/safety-and-liveness-eventual-consistency-is-not-safe/
22-23. Semantic Resolution
• Your app knows the domain - use business rules to resolve
• Amazon Dynamo’s shopping cart
“Ad hoc approaches have proven brittle and error-prone”
There’s no ACID! But don’t worry, there’s no need to be upset, despite what you may have heard.

I think the fear people have about giving up ACID is really just a tendency to see things in black and white, because subtlety is much harder to understand and accept. Every day in the wider technical community and on the Internet we are presented with binary choices which are often not really in conflict, but are either orthogonal albeit related concepts, or simply different ends of a spectrum. We too often perceive a Hegelian dialectic when one doesn’t exist (without the synthesis part!).

An important pair we need to understand, but which is not frequently discussed outside of academia, is safety and liveness.
Safety and liveness were terms defined for concurrent programs in this 1977 paper by Leslie Lamport. Colloquially, safety means that in the course of running your program “nothing bad will happen”, and liveness means that “something good will eventually happen”. Both are desirable properties, but sometimes enforcing one property may cause you to give up the other. I thought Peter Bailis stated this eloquently in his recent blog post: eventual consistency is not safe by itself, but a trivially satisfiable liveness property. That is, it helps keep your system available, but doesn’t make any guarantees about whether correct answers will be given at all. His larger point was that practical systems like Riak, Voldemort, and Cassandra do make safety guarantees but tend not to state them. It’s not all “garbage”.
In an eventually consistent system, you tend to have multiple copies of the same datum, which means that it’s replicated. Such systems also tend to allow loose coordination and things like sloppy quorums, since you don’t require expensive multi-phase commit protocols. This also makes them resilient to network partitions, which DO EXIST. Eventually consistent systems must also include means for state to move forward when staleness is detected. In Dynamo-like systems, this is usually done with read repair, that is, writing the newer value to stale replicas when reading.
While not as simple to understand as an ACID system, eventual consistency has many practical benefits. When encountering failures, especially network-related ones, the system can more often remain available to reads and writes despite the failures. In the same vein, relying on dynamic participation in operations lends itself to systems with low, consistent latency, because only promptly-responding replicas need to be considered.
Of course the tradeoff of those benefits, thanks to the CAP theorem, is that you sacrifice strict consistency. There is no total ordering of events in the system, you have no transactions, and you have weak guarantees of delivery at best. This means it’s incredibly difficult to decide who wins when there are concurrent writes in the system. The solutions to the problem are both non-ideal, but they are generally: first, to throw one version out by applying an arbitrary ordering, usually a timestamp of sorts; or second, to keep both values around and let the user decide. These are the approaches of Cassandra, and of Riak and Voldemort, respectively.
So maybe you chose Riak or Voldemort, and you get write conflicts (Riak calls them siblings). Now that you’ve got both values, how do you decide what the real state should be?
One strategy, which I call “semantic resolution”, is to say that your application encodes the domain of the problem, so it can use business rules to resolve the conflict. This is the strategy implemented by the “shopping cart” described in the Amazon Dynamo paper. It merges toward the maximum quantity of each item in the cart; however, it exhibits some problems, namely that sometimes items that were removed from the cart can reappear! From Amazon’s point of view this is okay because it might encourage the customer to buy more, but it is a bewildering user experience!

Fortunately, there is some interesting recent research about a more rigorous approach to eventual consistency.
...and that is Conflict-Free Replicated Data Types. This basically means that instead of strictly opaque values, the datastore provides useful abstract data structures. Since we’re in an eventually consistent system, the data structure is replicated to multiple locations, all of which act independently. But by far the most compelling part is that these data structures have the ability to resolve automatically toward a single value, given any number of conflicting values at individual replicas. CRDTs provide a strong safety property for eventually consistent systems that doesn’t sacrifice liveness in the process.
The theory behind what I’m going to talk about is the idea of bounded join semi-lattices, or “lattices” for short, and is rooted in the theory of monotonic logic. The definition I’m giving here comes from a recent paper by Neil Conway and others at UC Berkeley.
A lattice is a triple of a set, a function, and a value. S is a set (possibly infinite) representing the possible values of the lattice. The upside-down T (⊥) is the “least element” of the set. The “square U” (⊔) is a binary operator over S that produces a least upper bound of its operands that is also a member of S, also called the “join” or “merge” operator. The merge operator is commutative, associative, and idempotent. Finally, a lattice has the property that, for any two members of the set S, the merge operator creates a partial ordering over the set. This also means that merging any element with the least element is an identity operation.
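Transcribed into notation (my own rendering of the definition above, following Conway et al.):

```latex
% A bounded join semilattice: a triple of a set S, a merge (join) operator,
% and a least element \bot.
\langle S,\ \sqcup,\ \bot \rangle,
\qquad \sqcup : S \times S \to S, \qquad \text{for all } a, b, c \in S:
\begin{aligned}
  a \sqcup b &= b \sqcup a && \text{(commutative)} \\
  (a \sqcup b) \sqcup c &= a \sqcup (b \sqcup c) && \text{(associative)} \\
  a \sqcup a &= a && \text{(idempotent)} \\
  a \sqcup \bot &= a && \text{(least element is an identity)}
\end{aligned}
```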
Just for the sake of illustration, let’s look at one of the simpler lattices defined in Conway’s paper, the “lmax” lattice. The set of values in the lattice is the real numbers. The merge function is defined as taking the maximum of the two values. The minimum value is negative infinity. I hope you can see that this definition is a lattice: nothing is less than negative infinity, and merging any two values trends toward positive infinity without exceeding the seen values.
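As a minimal sketch (illustration only, not from the paper), lmax in code:

```python
# The lmax lattice: values are numbers, bottom is negative infinity,
# and merge takes the maximum of the two values.
NEG_INF = float("-inf")  # the least element

def lmax_merge(x, y):
    return max(x, y)

# Merge is commutative, associative, and idempotent,
# and merging with bottom is an identity:
assert lmax_merge(10, 15) == lmax_merge(15, 10) == 15                    # commutative
assert lmax_merge(lmax_merge(1, 2), 3) == lmax_merge(1, lmax_merge(2, 3))  # associative
assert lmax_merge(7, 7) == 7                                             # idempotent
assert lmax_merge(NEG_INF, 42) == 42                                     # identity
```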
Let’s take another example for those who might be visual learners, the lset lattice. The set of values for the lattice is all simple sets, with the empty set being the minimum value. The merge function is set union, which, as this diagram shows, allows any ordering of operation delivery to eventually converge on the same value. The diagram doesn’t even show all of the possible orderings, in fact.

Now why is this stuff important? Remember how we had conflicts and we needed a sane way to resolve those conflicts? Lattices are a generic type that gives us determinism in how we merge our conflicts. In the case of the “lmax” lattice, if one value has 10 and another has 15, you pick 15 because it’s the larger one. This foundation gives us what we need to understand the larger study of conflict resolution in eventual consistency.
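And lset as the same kind of sketch (again, just an illustration):

```python
# The lset lattice: values are sets, bottom is the empty set,
# and merge is set union.
def lset_merge(x, y):
    return x | y

# Union makes delivery order irrelevant: every ordering converges.
a, b, c = {1}, {2}, {3}
assert lset_merge(lset_merge(a, b), c) == lset_merge(c, lset_merge(b, a)) == {1, 2, 3}
assert lset_merge(a, a) == a          # idempotent
assert lset_merge(a, set()) == a      # merging with bottom is an identity
```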
The primary work on this research has been done by two researchers at INRIA and their colleagues in Portugal. Marc Shapiro also gave a great talk on the subject at Microsoft Research called “Strong Eventual Consistency”, which you can easily find online.

The paper above is where I’ve gotten most of the content and diagrams, but I’ve tried to simplify the content so that we can get through it in the scope of this talk. If you want the real thing, search for <title>; it’s free to download.
There are two flavors of CRDTs, as you might have noticed. They both provide the same conflict-free property, but differ in their implementation strategy.

Convergent types are based on a local modification of state, followed by forwarding the resulting state downstream, where a merge operation is performed at other replicas. The state itself encodes all information needed to converge. They are great for systems with weak message-delivery guarantees, for example a Dynamo-style system. Convergent types can also be resolved in clients, which is helpful for systems that do not provide rich datatypes.

Commutative types, on the other hand, replicate commutative operations rather than state, and tend to rely on systems with reliable broadcast (which assures operations reach all replicas). Operations are generally not required to have a total ordering; a local causal ordering is sufficient.
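A rough sketch of the contrast, using a counter (the helper names here are my own, not from the paper):

```python
# Convergent (state-based): replicas exchange their whole state, here a
# {replica_id: count} map, and merge with a pairwise max. Duplicated or
# reordered deliveries are harmless because merge is idempotent.
def merge_state(local, remote):
    return {rid: max(local.get(rid, 0), remote.get(rid, 0))
            for rid in local.keys() | remote.keys()}

# Commutative (op-based): replicas broadcast the operation itself. Each op
# must be delivered and applied exactly once, but since incr and decr
# commute, the order of application is free.
def apply_op(value, op):
    kind, amount = op  # e.g. ("incr", 2) or ("decr", 1)
    return value + amount if kind == "incr" else value - amount
```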
This diagram from the paper shows the basic format of a convergent, state-based CRDT. Note how the mutation is applied locally, then forwarded downstream as a merge operation. As long as all replicas eventually receive states that include all mutations, they will converge on the same value. (The merge function is basically the merge function of a lattice.)
Commutative types, again, forward operations to other replicas, not the state. Obviously, if an operation is not delivered, or is applied out of order locally, the states don’t converge; hence, unlike the convergent type, a reliable broadcast channel is required. As long as functions f() and g() commute, state will converge.
A register is the simplest type of data structure: a memory cell storing an opaque value. It supports only two operations, “assign” and “value” (set and get). Concurrent updates will not commute (who should win?). We’ve seen this problem before.
The two approaches to concurrent resolution are the same ones taken by Cassandra and Riak, respectively: Last-Write-Wins (called an LWW-Register) and Multi-Valued (called an MV-Register), which keeps all divergent values. For resolution, LWW tends to use timestamps with a reasonable guarantee of ordering (which is difficult in practice, but in some systems sufficient). MV, on the other hand, requires the more expensive version vector to resolve conflicts and produces the union of all divergent values (but it doesn’t behave like a set!)
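To make LWW concrete, here is a minimal sketch (an assumed structure, not any particular database’s implementation):

```python
import time

class LWWRegister:
    """Last-Write-Wins register: ties broken by writer id. Only as
    correct as the clocks that produced the timestamps."""

    def __init__(self, replica_id):
        self.replica_id = replica_id
        self.state = (0.0, "", None)  # (timestamp, writer_id, value)

    def assign(self, value):
        self.state = (time.time(), self.replica_id, value)

    def value(self):
        return self.state[2]

    def merge(self, other):
        # Keep the lexicographically larger (timestamp, writer_id) pair.
        # Commutative and idempotent, so replicas converge.
        self.state = max(self.state, other.state)
```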
Counters are simply integers that are replicated and support the increment and decrement operations. Counters are useful for things like tracking the number of logged-in users, or click-throughs on an advertisement.

The simplest type of counter is a commutative, or operation-based, type: since add and subtract commute, any delivery order is sufficient (ignoring over- and underflow). The state-based counters are more interesting, so we’ll look at those.
A G-Counter only counts up and is basically a version vector (vector clock). Each replica increments its own pair only; the value is computed by summing the counts of all replicas. Convergence is achieved by taking the maximum count for each replica. This is basically the Cassandra counters implementation.
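A minimal state-based sketch of that idea (the class shape is assumed, for illustration):

```python
class GCounter:
    """Grow-only counter: each replica increments only its own slot;
    merge takes the per-replica maximum."""

    def __init__(self, replica_id):
        self.replica_id = replica_id
        self.counts = {}  # replica_id -> highest count seen from that replica

    def increment(self, n=1):
        self.counts[self.replica_id] = self.counts.get(self.replica_id, 0) + n

    def value(self):
        return sum(self.counts.values())  # sum over all replicas' slots

    def merge(self, other):
        # Pairwise max per replica: commutative, associative, idempotent.
        for rid, count in other.counts.items():
            self.counts[rid] = max(self.counts.get(rid, 0), count)

a, b = GCounter("a"), GCounter("b")
a.increment(3); b.increment(2)
a.merge(b); b.merge(a)
assert a.value() == b.value() == 5
```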
A PN-Counter is composed of two G-Counters: P for increments and N for decrements. Its value is the difference between the values of the two G-Counters, and merging is simply the pairwise merge of the P and N counters.
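Given that, a PN-Counter sketch falls out almost for free, reusing the GCounter class from the sketch above (again, illustrative only):

    class PNCounter:
        def __init__(self, replica_id):
            self.p = GCounter(replica_id)  # increments
            self.n = GCounter(replica_id)  # decrements

        def increment(self, n=1):
            self.p.increment(n)

        def decrement(self, n=1):
            # a decrement is an increment of the N counter
            self.n.increment(n)

        def value(self):
            return self.p.value() - self.n.value()

        def merge(self, other):
            # pairwise merge of the two G-Counters
            self.p.merge(other.p)
            self.n.merge(other.n)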
Sets constitute one of the most basic data structures. Containers, Maps, and Graphs are all based on Sets. There are two operations: add and remove.
Like a G-Counter, a G-Set only grows in size. That is, it doesn't allow removal - its merge operation is a simple set-union, returning the maximal grouping without duplicates. Since add commutes with union, a G-Set can also be implemented as a commutative type. It's not an incredibly useful data-type on its own, but it can serve as a building-block for other data structures.
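A state-based G-Set sketch in the same illustrative Python style; the entire merge is a set-union:

    class GSet:
        def __init__(self):
            self.items = set()

        def add(self, element):
            self.items.add(element)

        def __contains__(self, element):
            return element in self.items

        def merge(self, other):
            # add commutes with union, so merge is just set-union
            self.items |= other.items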
The second type of Set is the two-phase set (2P-Set), in which a removed member cannot be re-added. It is basically two G-Sets, one for adds and one for removes; the removal set is often called the tombstone set. To prevent spurious states (e.g. a remove arriving before its add, making the add have no effect), remove carries a precondition: the local state must already contain the member.

A special case of the 2P-Set is the U-Set. If the system can reasonably guarantee uniqueness - that is, an element will never be added again after removal - then the tombstone set is unnecessary. Uniqueness could be satisfied with a Lamport clock or a suitably large random ID space.
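A 2P-Set sketch built from two of the G-Sets above; note how the remove precondition and the removal-wins lookup show up directly in code:

    class TwoPhaseSet:
        def __init__(self):
            self.added = GSet()    # A
            self.removed = GSet()  # R, the tombstone set

        def add(self, element):
            self.added.add(element)

        def remove(self, element):
            # precondition: the member must be locally present
            if element in self:
                self.removed.add(element)

        def __contains__(self, element):
            # once tombstoned, an element can never come back
            return element in self.added and element not in self.removed

        def merge(self, other):
            self.added.merge(other.added)
            self.removed.merge(other.removed)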
In the LWW-element-Set, each element in the add set (A) and remove set (R) is tagged with a timestamp, and the greatest timestamp wins for each individual element. This could be implemented with Cassandra super-columns.

Figure 12: LWW-element-Set; elements masked by one with a higher timestamp are elided (state-based)
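A state-based LWW-element-Set sketch; timestamps are supplied by the caller, and the tie-breaking bias toward adds here is an assumption - either bias yields a valid (but different) CRDT:

    class LWWSet:
        def __init__(self):
            self.adds = {}     # element -> latest add timestamp
            self.removes = {}  # element -> latest remove timestamp

        def add(self, element, ts):
            self.adds[element] = max(self.adds.get(element, 0), ts)

        def remove(self, element, ts):
            self.removes[element] = max(self.removes.get(element, 0), ts)

        def __contains__(self, element):
            # greatest timestamp wins per element; ties favour the add
            if element not in self.adds:
                return False
            return self.adds[element] >= self.removes.get(element, 0)

        def merge(self, other):
            for e, ts in other.adds.items():
                self.adds[e] = max(self.adds.get(e, 0), ts)
            for e, ts in other.removes.items():
                self.removes[e] = max(self.removes.get(e, 0), ts)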
In the Observed-Remove Set (OR-Set), each added element is tagged with a unique identifier (without exposing the tags). When removing, all locally-observed tags for that element are removed, and the operation is forwarded downstream along with those tags. A state-based version would be based on the U-Set.
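A state-based OR-Set sketch along those lines; the use of uuid4 for the hidden tags is an illustrative assumption standing in for any sufficiently unique identifier:

    import uuid

    class ORSet:
        def __init__(self):
            self.entries = set()     # (element, unique_tag) pairs
            self.tombstones = set()  # tags whose removal has been observed

        def add(self, element):
            # each add gets a fresh hidden tag, so a re-add after a
            # remove creates a new entry the old remove cannot kill
            self.entries.add((element, uuid.uuid4().hex))

        def remove(self, element):
            # kill only the tags observed locally ("observed-remove")
            self.tombstones |= {tag for (el, tag) in self.entries
                                if el == element}

        def __contains__(self, element):
            return any(el == element and tag not in self.tombstones
                       for (el, tag) in self.entries)

        def merge(self, other):
            self.entries |= other.entries
            self.tombstones |= other.tombstones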
You might notice we're going up in complexity in terms of the types of data structures. Graphs are incredibly useful for many problems, but they also harbour potential anomalies: concurrent adds/removes of vertices and edges may not converge, meaning global invariants can't be guaranteed - for example, in a DAG or linked list where elements can be removed and added concurrently. Some anomalies can be eliminated by restricting the semantics, for example by making a graph add-only. I'm not going to go into detail about how graphs are implemented, but a simple one is the 2P2P-Graph, based on a pair of 2P-Sets, one for vertices and one for edges. When a vertex is removed, the most reliable (and intuitive) solution is to remove all attached edges, so the 2P-Set paradigm works well for the components of a generic graph.
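For a flavour of how that composes, here is a minimal 2P2P-Graph sketch built from the TwoPhaseSet above; the edge precondition and the cascading edge removal are the interesting parts:

    class TwoPTwoPGraph:
        def __init__(self):
            self.vertices = TwoPhaseSet()
            self.edges = TwoPhaseSet()  # edges stored as (u, v) pairs

        def add_vertex(self, v):
            self.vertices.add(v)

        def add_edge(self, u, v):
            # precondition: both endpoints must exist locally
            if u in self.vertices and v in self.vertices:
                self.edges.add((u, v))

        def remove_vertex(self, v):
            # removing a vertex removes all attached edges first
            for (a, b) in list(self.edges.added.items):
                if v in (a, b):
                    self.edges.remove((a, b))
            self.vertices.remove(v)

        def merge(self, other):
            self.vertices.merge(other.vertices)
            self.edges.merge(other.edges)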
CRDTs tend to create a lot of garbage: tombstones grow and internal structures become unbalanced. In general, garbage collection is extremely difficult to do without synchronization. Luckily, this doesn't impact correctness, only efficiency and performance.
Client-side: you have to come up with a common representation across languages, allocation of actor IDs is problematic, and you can only use state-based CRDTs.
Server-side: practically no one implements them yet (Cassandra's counter has some anomalies), but we're working hard to bring them to Riak.