4. CQL3 examples
CREATE KEYSPACE shire WITH
REPLICATION = {'class': 'NetworkTopologyStrategy', 'eu' : 3, 'us-east' : 2};
SELECT * FROM emp WHERE empID IN (130,104) ORDER BY deptID DESC;
INSERT INTO excelsior.clicks (userid, url, date, name)
VALUES (
3715e600-2eb0-11e2-81c1-0800200c9a66,
'http://cassandra.apache.org',
'2013-10-09',
'Mary')
USING TTL 86400;
UPDATE users
SET email = 'charlie@wonka.com'
WHERE login = 'cbucket64'
IF email = 'cbucket@wonka.com';
CREATE USER bombadil WITH PASSWORD 'goldberry4ever' SUPERUSER;
GRANT ALTER ON KEYSPACE shire TO gandalf;
5. Ops Friendly
•Simple design
  •no special roles, no single point of failure
•Lots of exposed metrics via JMX
•Nodes and entire datacenters can go down with no loss of service
•Rapid read protection
•DataStax OpsCenter
  •Visual monitoring tool
  •REST interface to metric data
  •Free version
  •Hands-off services
8. Fully Distributed
•Distributed systems introduce complex problems
•What is “down”?
  •Individual server is down
  •Network link is down
  •Long server pause (e.g. GC pause)
  •Variable network latency
•What do I do when a server is overloaded?
•How can I stay available/reliable in such circumstances?
•How can I maintain consistency?
•How do I reconcile differences?
10. Eventual Consistency
•Individual server durability
  •Write to commitlog (batch or periodic sync)
  •Write to memtable (which gets flushed to disk)
•Achieving consistency level
  •ONE, QUORUM, ALL
  •LOCAL_ONE, LOCAL_QUORUM
  •ANY, EACH_QUORUM (for writes)
•Important to note:
  •All replicas always get a copy of the write
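The consistency level is chosen per request. As a minimal illustration, reusing the excelsior.clicks table from the earlier slide: CONSISTENCY is a cqlsh session command (not CQL proper), and QUORUM means a majority of replicas, floor(RF/2) + 1, must respond.

```sql
-- cqlsh session command: require a majority of replicas per request
CONSISTENCY QUORUM;

-- With RF = 3, this read succeeds once 2 of the 3 replicas respond
SELECT * FROM excelsior.clicks
WHERE userid = 3715e600-2eb0-11e2-81c1-0800200c9a66;
```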
12. Continually cleaning
•Hinted handoff
  •valid for a window of time
  •replays back to node restored to service
•Read repair
  •after a read, check that data for agreement (digest)
  •read_repair_chance defaults to 0.1
  •also dclocal_read_repair_chance
•Anti-entropy service (manual repair)
  •check for agreement across all data in a token range A–B
  •run manual repair at least every gc_grace_seconds
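A manual (anti-entropy) repair is typically kicked off with nodetool; the keyspace name below is carried over from the earlier examples, and the flag choice is illustrative.

```shell
# Repair only the primary ranges owned by this node (-pr),
# so a rolling repair across the cluster does not redo work
nodetool repair -pr excelsior
```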
13. Advanced Repair
•Manual repairs have limited resolution
  •“There is something different in these 1000 rows”
  •Therefore you have to stream all 1000 rows
  •Leads to overstreaming, waste
•You can specify start/end tokens for the repair
  •Get finer, row-level precision
  •More complicated to execute
•DataStax has a repair service to help
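Subrange repair can be sketched with nodetool's start/end token flags; the token values below are placeholders, not taken from the slides.

```shell
# Repair only the data in one narrow token range (-st/-et),
# instead of streaming everything a full-range repair would touch
nodetool repair -st -9223372036854775808 -et -9100000000000000000 excelsior
```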
15. Netflix Study
•Two datacenters (US-East and US-West)
•Wrote 500,000 records in each datacenter
•50k write operations per second in each DC
•Wrote at consistency level ONE
•All data read back correctly in other DC
•Tried 5 different runs, introduced failures along the way
See planetcassandra.org/blog/post/a-netflix-experiment-eventual-consistency-hopeful-consistency-by-christos-kalantzis/
16. Practical Consistency
•ONE is not suitable for all cases
•Review your requirements, SLA
•Do your own testing to get comfortable
•Flexibility translates into the best performance for your use case