Advantages of Cassandra's masterless architecture

•Download as PPTX, PDF•

0 likes•1,967 views

Duy Lâm

Outstanding Cassandra's features benefiting from masterless architecture

Software

Advantages of Cassandra’s
masterless architecture
Duy Lam @ RD
KMS TechCon 2015

Introduction
An Apache
Software
Foundation
project
Built on
Java
DataStax

Masterless Architecture
Peer-to-peer
Database
Automatic
Data
Distribution
Built-in
Replication

Masterless Architecture
Cassandra cluster
Background
service
Web
service
Application
Peer-to-peer
Database
Automatic Data
Distribution
Built-in Replication
No SPOF
A node

Masterless Architecture
Peer-to-peer
Database
Automatic Data
Distribution
Built-in Replication
partition id name
usa_1994 ae12f… The Shawshank Redemption
usa_1994 bb2ac… Forrest Gump
skorea_2002 31dcc… The Way Home
vn_2015 890af… Jackpot
2
usa_1994 ae… The Sha..
usa_1994 bb… Forrest..
1 skorea_2002 31… The Way …
3 vn_2015 89… Jackpot
Cassandra cluster
CREATE TABLE movie (
partition text ,
id uuid,
name text
)

Masterless Architecture
Peer-to-peer
Database
Automatic Data
Distribution
Built-in Replication
partition id name
(A)
usa_1994 ae12f… The Shawshank Redemption
usa_1994 bb2ac… Forrest Gump
(B) skorea_2002 31dcc… The Way Home
(C) vn_2015 890af… Jackpot
2
1
3
Cassandra cluster
CREATE TABLE movie (
partition text ,
id uuid,
name text
)
(B) (A’) (C’)
(A) (B’) (C’)
(C) (A’) (B’)

The Advantages
Peer-to-peer
Database
Automatic Data
Distribution
Built-in
Replication
?

The Advantages
Peer-to-peer
Database
Automatic Data
Distribution
Built-in
Replication
High
Availability
Fault
Tolerance
Scalability

Demo now
Fault
Tolerance
Scalability
High
Availability

Cautions
Cassandra is from NoSQL family
So
?

Cautions
Cassandra is from NoSQL family
So
 Different data modeling (try to google “nosql
data modeling”)
 No Atomicity for multiple tables (or
“documents”)  no transaction
 Not good at complex query (group by, having,
like, >=, etc.)

Cautions
CREATE TABLE movie (
partition text, id uuid,
name text, released_year int, country text,
PRIMARY KEY (partition, id)
)
Limited query capability in Cassandra
SELECT * FROM movie WHERE
release_year < 2000
name LIKE “Star Trek”
name IN ( “Tangled”, “Frozen” )
name = “Inside Out” and year = 2015
length = null
length = 120 OR release_year = 2015
UPDATE movie SET length=95 WHERE partition = “vn” AND id = 10
INSERT INTO movie(partition, id, length) VALUES(“usa”, 15, 120)
(1)
(2)
(3)
(4)
(5)
(6)

References
DataStax Cassandra Tutorials
https://www.youtube.com/playlist?list=PL3E5AC388940EEC0A
(thanks to Provision feature) Launch
Cassandra clustering system at
http://bit.ly/kms-techcon2015-cassandra-demo

Take Away
Masterless Architecture
– Peer-to-peer Database
– Automatic Data Distribution
– Built-in Replication
Cautions
– Cassandra is from NoSQL family
– Limited query capability

What's hot

Running Free with the Monadskenbot

ATT&CK Metaverse - Exploring the Limitations of Applying ATT&CKMITRE ATT&CK

Y2K and YouBDPA Education and Technology Foundation

Automating security hardeningUgljesa Novak, CISSP

Knowledge for the masses: Storytelling with ATT&CKMITRE ATT&CK

Tracking Noisy Behavior and Risk-Based Alerting with ATT&CKMITRE ATT&CK

Projects to Impact- Operationalizing Work from the CenterMITRE ATT&CK

Exploring how Students Map Social Engineering Techniques to the ATT&CK Framew...MITRE ATT&CK

What's hot (8)

Running Free with the Monads

ATT&CK Metaverse - Exploring the Limitations of Applying ATT&CK

Y2K and You

Automating security hardening

Knowledge for the masses: Storytelling with ATT&CK

Tracking Noisy Behavior and Risk-Based Alerting with ATT&CK

Projects to Impact- Operationalizing Work from the Center

Exploring how Students Map Social Engineering Techniques to the ATT&CK Framew...

Similar to Advantages of Cassandra's masterless architecture

Oracle to Cassandra Core Concepts Guide Pt. 2DataStax Academy

DataStax: Old Dogs, New Tricks. Teaching your Relational DBA to fetchDataStax Academy

Beyond the Query – Bringing Complex Access Patterns to NoSQL with DataStax - ...StampedeCon

Cassandra lesson learned - extendedAndrzej Ludwikowski

A Cassandra + Solr + Spark Love Triangle Using DataStax EnterprisePatrick McFadin

Cassandra, web scale no sql data platformMarko Švaljek

Introduction to Data Modeling with Apache CassandraDataStax Academy

Cassandra - lesson learnedAndrzej Ludwikowski

Starburnrafaelreckelberg

Beyond the Query: A Cassandra + Solr + Spark Love Triangle Using Datastax Ent...DataStax Academy

November Camp - Spec BDD with PHPSpec 2Kacper Gunia

Enter the Snake Pit for Fast and Easy SparkJon Haddad

Data Science - The Most Profitable Movie CharacteristicCheah Eng Soon

Advanced Data Modeling with Apache CassandraDataStax Academy

Building a Bridge between Terraform and ArgoCDCarlos Santana

Kief Morris - Infrastructure is terribleThoughtworks

Modify Assignment 5 toReplace the formatted output method (toStri.docxadelaidefarmer322

Atmosphere Conference 2015: Taming the Modern DatacenterPROIDEA

GraphFrames Access Methods in DSE GraphJim Hatcher

Using Stargate APIs for Cassandra-Backed ApplicationsNordic APIs

Similar to Advantages of Cassandra's masterless architecture (20)

Oracle to Cassandra Core Concepts Guide Pt. 2

DataStax: Old Dogs, New Tricks. Teaching your Relational DBA to fetch

Beyond the Query – Bringing Complex Access Patterns to NoSQL with DataStax - ...

Cassandra lesson learned - extended

A Cassandra + Solr + Spark Love Triangle Using DataStax Enterprise

Cassandra, web scale no sql data platform

Introduction to Data Modeling with Apache Cassandra

Cassandra - lesson learned

Starburn

Beyond the Query: A Cassandra + Solr + Spark Love Triangle Using Datastax Ent...

November Camp - Spec BDD with PHPSpec 2

Enter the Snake Pit for Fast and Easy Spark

Data Science - The Most Profitable Movie Characteristic

Advanced Data Modeling with Apache Cassandra

Building a Bridge between Terraform and ArgoCD

Kief Morris - Infrastructure is terrible

Modify Assignment 5 toReplace the formatted output method (toStri.docx

Atmosphere Conference 2015: Taming the Modern Datacenter

GraphFrames Access Methods in DSE Graph

Using Stargate APIs for Cassandra-Backed Applications

Recently uploaded

Professional Resume Template for Software DevelopersVinodh Ram

What is Fashion PLM and Why Do You Need ItWave PLM

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.

Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran

Asset Management Software - InfographicHr365.us smith

Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin

The Evolution of Karaoke From Analog to App.pdfPower Karaoke

chapter--4-software-project-planning.pptkotipi9215

Implementing Zero Trust strategy with AzureDinusha Kumarasiri

Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3

Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea

Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig

Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq

Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ

EY_Graph Database Powered SustainabilityNeo4j

Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed

Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko

Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.

Recently uploaded (20)

Professional Resume Template for Software Developers

What is Fashion PLM and Why Do You Need It

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...

Intelligent Home Wi-Fi Solutions | ThinkPalm

Asset Management Software - Infographic

Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide

The Evolution of Karaoke From Analog to App.pdf

chapter--4-software-project-planning.ppt

Implementing Zero Trust strategy with Azure

Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data

Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service

办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样

Automate your Kamailio Test Calls - Kamailio World 2024

Salesforce Certified Field Service Consultant

Cloud Data Center Network Construction - IEEE

EY_Graph Database Powered Sustainability

Unveiling Design Patterns: A Visual Guide with UML Diagrams

Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...

GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf

Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...

Advantages of Cassandra's masterless architecture

1. Advantages of Cassandra’s masterless architecture Duy Lam @ RD KMS TechCon 2015

2. Masterless Architecture Cautions

3. Introduction An Apache Software Foundation project Built on Java DataStax

4. Masterless Architecture Cautions

5. Masterless Architecture Peer-to-peer Database Automatic Data Distribution Built-in Replication

6. Masterless Architecture Cassandra cluster Background service Web service Application Peer-to-peer Database Automatic Data Distribution Built-in Replication No SPOF A node

7. Masterless Architecture Peer-to-peer Database Automatic Data Distribution Built-in Replication partition id name usa_1994 ae12f… The Shawshank Redemption usa_1994 bb2ac… Forrest Gump skorea_2002 31dcc… The Way Home vn_2015 890af… Jackpot 2 usa_1994 ae… The Sha.. usa_1994 bb… Forrest.. 1 skorea_2002 31… The Way … 3 vn_2015 89… Jackpot Cassandra cluster CREATE TABLE movie ( partition text , id uuid, name text )

8. Masterless Architecture Peer-to-peer Database Automatic Data Distribution Built-in Replication partition id name (A) usa_1994 ae12f… The Shawshank Redemption usa_1994 bb2ac… Forrest Gump (B) skorea_2002 31dcc… The Way Home (C) vn_2015 890af… Jackpot 2 1 3 Cassandra cluster CREATE TABLE movie ( partition text , id uuid, name text ) (B) (A’) (C’) (A) (B’) (C’) (C) (A’) (B’)

9. The Advantages Peer-to-peer Database Automatic Data Distribution Built-in Replication ?

10. The Advantages Peer-to-peer Database Automatic Data Distribution Built-in Replication High Availability Fault Tolerance Scalability

11. Demo now Fault Tolerance Scalability High Availability

12. Masterless Architecture Cautions

13. Cautions Cassandra is from NoSQL family So ?

14. Cautions Cassandra is from NoSQL family So  Different data modeling (try to google “nosql data modeling”)  No Atomicity for multiple tables (or “documents”)  no transaction  Not good at complex query (group by, having, like, >=, etc.)

15. Cautions CREATE TABLE movie ( partition text, id uuid, name text, released_year int, country text, PRIMARY KEY (partition, id) ) Limited query capability in Cassandra SELECT * FROM movie WHERE release_year < 2000 name LIKE “Star Trek” name IN ( “Tangled”, “Frozen” ) name = “Inside Out” and year = 2015 length = null length = 120 OR release_year = 2015 UPDATE movie SET length=95 WHERE partition = “vn” AND id = 10 INSERT INTO movie(partition, id, length) VALUES(“usa”, 15, 120) (1) (2) (3) (4) (5) (6)

16. Cautions CREATE TABLE movie ( partition text, id uuid, name text, released_year int, country text, PRIMARY KEY (partition, id) ) Limited query capability in Cassandra SELECT * FROM movie WHERE release_year < 2000 name LIKE “Star Trek” name IN ( “Tangled”, “Frozen” ) name = “Inside Out” and year = 2015 length = null length = 120 OR release_year = 2015 UPDATE movie SET length=95 WHERE partition = “vn” AND id = 10 INSERT INTO movie(partition, id, length) VALUES(“usa”, 15, 120) (1) (2) (3) (4) (5) (6)

17. References DataStax Cassandra Tutorials https://www.youtube.com/playlist?list=PL3E5AC388940EEC0A (thanks to Provision feature) Launch Cassandra clustering system at http://bit.ly/kms-techcon2015-cassandra-demo

18. Take Away Masterless Architecture – Peer-to-peer Database – Automatic Data Distribution – Built-in Replication Cautions – Cassandra is from NoSQL family – Limited query capability

19. THANKS YOU

Editor's Notes

Open source, backed by big sponsor, commercial use Hosted by any platform A company provides many things around Cassandra (documentation, tools, etc.) both free and commercial Note: products using Cassandra listed in slide are more about user-generated content
03 characteristic make Cassandra becoming preferable choice for Big Data
Eliminate SPOF and gain (single point of failure) App can connect to another node if the current node is shutdown Processing capability is spread out
Partitioner: a function for deriving a token representing a row from its partition key (typically by hashing). A partitioner determines how data is distributed across the nodes in the cluster (including replicas) Cassandra offers the following partitioners: (from Cassandra v1.2+) Murmur3Partitioner (default): uniformly distributes data across the cluster based on MurmurHash (better than MD5 in term of distribution purpose) hash values. RandomPartitioner: uniformly distributes data across the cluster based on MD5 hash values. ByteOrderedPartitioner: keeps an ordered distribution of data lexically by key bytes
Replicate factor: the total number of replicas across the cluster. A replication factor of 1 means that there is only one copy of each row on one node Replication strategy: a replication strategy determines which nodes to place replicas on
Summarize things
Peer-to-peer  High Availability Data Distribution  Scalability Replication  Fault Tolerance
Prepare Start guest OS: vagrant up Launch 5 terminal and remote to guest OS in all of them. From now on, below commands are executed in remote SSH terminals Launch Cassandra seed node. When it’s fully starting, launch remaining Cassandra nodes Explain setup.cql file Setup schema and seed data: $ ~/apps/cassandra/bin/cqlsh -f ~/provision/setup.cql High Availability Insert new row in node2 (127.0.0.2): $ ~/apps/cassandra/bin/cqlsh -e "USE techcon2015; INSERT INTO participant(department, employee_id, name) VALUES (‘Delivery', 100, 'Nguyen Van An');" 127.0.0.2 See new row in seed node (127.0.0.4): $ ~/apps/cassandra/bin/cqlsh -e "USE techcon2015; select * from participant;" 127.0.0.4  Data is available on any node Fault Tolerance Show the node status: $ ~/apps/cassandra/bin/nodetool status (explain the status and state) Shutdown node2 (127.0.0.2) Show the node status again, indicate the Down status Show that data is still available: $ ~/apps/cassandra/bin/cqlsh -e "USE techcon2015; select * from participant;" 127.0.0.x (x is the other alive IP) Insert new row in seed node (127.0.0.4): $ ~/apps/cassandra/bin/cqlsh -e "USE techcon2015; INSERT INTO participant(department, employee_id, name) VALUES (‘Delivery', 200, 'Nguyen Van Chau');" 127.0.0.4 See new row in seed node (127.0.0.4): $ ~/apps/cassandra/bin/cqlsh -e "USE techcon2015; select * from participant;" 127.0.0.x (x is the other alive IP) No downtime for disconnected node Scalability Show the node status: $ ~/apps/cassandra/bin/nodetool status Remove node 127.0.0.2: $ ~/apps/cassandra/bin/nodetool -h 127.0.0.1 -p 7182 decommission (1782 is node2 jmx port) Show the node status again (indicate node removed): $ ~/apps/cassandra/bin/nodetool status Shutdown node 127.0.0.2 Restart node 127.0.0.2 to join again Cluster can be effortlessly reduced (or increased)
google term “nosql data modeling” to see more explanation No transaction (commit, fallback) (from experience in Cassandra and mongo)

Advantages of Cassandra's masterless architecture

Recommended

Recommended

More Related Content

What's hot

What's hot (8)

Similar to Advantages of Cassandra's masterless architecture

Similar to Advantages of Cassandra's masterless architecture (20)

More from Duy Lâm

More from Duy Lâm (7)

Recently uploaded

Recently uploaded (20)

Advantages of Cassandra's masterless architecture

Editor's Notes