SlideShare a Scribd company logo
NoSQL Session II
Agenda
• Session I recap
- Why NoSQL/ Drawback of Relational DB
- Common Characteristics
- Storage Mechanism
- CAP Theorem & Advantages
• Data stax Apache Cassandra Installation
• Cassandra Concepts
Features of Cassandra
• Column based storage mechanism
• High Availability
• High Scalability/ Horizontal scaling
• Predictable performance
• No SPOF – Single point of failure
• Multi DC – Data Center/ Multi region availability
• Commodity Hardware
• Easy to manage operationally
ARCHITECTURE
• Node – One Cassandra instance
• Rack – A Logical set of Nodes
• Data Center – A Logical set of Racks
• Cluster - The full set of nodes which map to a
single complete token ring
CQL
• CREATE KEYSPACE “KeySpace Name” WITH
replication = {'class': ‘Strategy name’,
'replication_factor' : ‘No.Of replicas’}
• CREATE TABLE tablename( column1 name
datatype PRIMARYKEY, column2 name data
type, column3 name data type, PRIMARY KEY
(column1) )
Strategy name Description
Simple Strategy' Specifies a simple replication
factor for the cluster.
Network Topology
Strategy
Using this option, you can set
the replication factor for each
data-center independently.
The replication option is to specify the Replica
Placement strategy and the number of replicas
wanted. The following table lists all the replica
placement strategies.
CONSISTENCY
• Consistency levels are available for Read and
Write Operations.
• ANY, ALL, QUORUM([RF/2]+1), EACH, etc
• High Consistency – Low Availability
• Low Consistency – High Availability
SEED & CO-ORDINATOR NODE
• Seeds and Coordinators serve different purposes.
• Seed nodes: In general it is recommended to have 2 seeds for
the whole cluster. If you have multi-datacenter cluster then
you may want to distribute the seeds across each datacenter.
• Coordinator nodes: Every node can be a coordinator (as
designed by Cassandra). Coordinator is picked by Cassandra
per request and the only thing you can change is how it is
picked - for example Round-Robin (default) or DC-aware,
LatencyAware. This is found in the cassandra.yaml file.
• Maximum columns per row is 2 billion, but in
practical it is about 10 to 20 thousand max
used
• Maximum data size per cell (Column value) is
2 GB, but in practical it is about 10MB used.
CLUSTER TOPOLOGY
• Cluster communicated - SNITCH and Gossip
• Hinted- Handoff
• Write path
• Read path
• Read Repair
• Configuration – Cassandra.yaml file
SNITCHES & GOSSIP
• Snitch - Cassandra does its best not to have
more than one replica on the same rack to
avoid duplicate
• Determines the location of nodes by rack and
data center corresponding to the IP addresses
• Gossip – Once per second each node gossip’s
each other to update themselves
• Hinted Hand Off – A recovery mechanism for
writes targeting offline nodes
• Grace time can be maintained yaml file
• Property – max_hint_window_in_ms : 1000
• hinted_handoff enable: true
Write Path
SSTable – Static & Sorted Table
• Immutable data file for row storage
• Partition is spread across multiple SS Table
based on timestamp
• Easy Backup – Delete is marked as
“TombStones”
Read Path
• Read Repair – When any node is stale it is
marked as read-repair
• Property – read_repair_chance
Thank You !!!
Continue in Next session

More Related Content

What's hot

Kafka website activity architecture
Kafka website activity architectureKafka website activity architecture
Kafka website activity architecture
Omid Vahdaty
 
Using ZFS file system with MySQL
Using ZFS file system with MySQLUsing ZFS file system with MySQL
Using ZFS file system with MySQL
Mydbops
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real World
Jeremy Hanna
 
Hadoop2
Hadoop2Hadoop2
Hadoop2
Gagan Agrawal
 
Apache cassandra architecture internals
Apache cassandra architecture internalsApache cassandra architecture internals
Apache cassandra architecture internals
Bhuvan Rawal
 
MySQL HA
MySQL HAMySQL HA
MySQL HA
Kris Buytaert
 
Building AuroraObjects- Ceph Day Frankfurt
Building AuroraObjects- Ceph Day Frankfurt Building AuroraObjects- Ceph Day Frankfurt
Building AuroraObjects- Ceph Day Frankfurt
Ceph Community
 
Cassandra Workshop - Cassandra from scratch in one day
Cassandra Workshop - Cassandra from scratch in one dayCassandra Workshop - Cassandra from scratch in one day
Cassandra Workshop - Cassandra from scratch in one day
Carlos Alonso Pérez
 
UJUG Craftsmanship Roundup April 2017
UJUG Craftsmanship Roundup April 2017UJUG Craftsmanship Roundup April 2017
UJUG Craftsmanship Roundup April 2017
Derrick Isaacson
 
MEETUP - Unboxing Apache Cassandra 3.10
MEETUP - Unboxing Apache Cassandra 3.10MEETUP - Unboxing Apache Cassandra 3.10
MEETUP - Unboxing Apache Cassandra 3.10
Erick Ramirez
 
HBaseCon 2015- HBase @ Flipboard
HBaseCon 2015- HBase @ FlipboardHBaseCon 2015- HBase @ Flipboard
HBaseCon 2015- HBase @ Flipboard
Matthew Blair
 
Cassandra Redis
Cassandra RedisCassandra Redis
Cassandra Redis
Diego Pacheco
 
Orchestrating Cassandra with Kubernetes: Challenges and Opportunities
Orchestrating Cassandra with Kubernetes: Challenges and OpportunitiesOrchestrating Cassandra with Kubernetes: Challenges and Opportunities
Orchestrating Cassandra with Kubernetes: Challenges and Opportunities
Raghavendra Prabhu
 
ScyllaDB: NoSQL at Ludicrous Speed
ScyllaDB: NoSQL at Ludicrous SpeedScyllaDB: NoSQL at Ludicrous Speed
ScyllaDB: NoSQL at Ludicrous Speed
J On The Beach
 
CASSANDRA MEETUP - Choosing the right cloud instances for success
CASSANDRA MEETUP - Choosing the right cloud instances for successCASSANDRA MEETUP - Choosing the right cloud instances for success
CASSANDRA MEETUP - Choosing the right cloud instances for success
Erick Ramirez
 
Intro to cassandra
Intro to cassandraIntro to cassandra
Intro to cassandra
Aaron Ploetz
 
DataStax: Backup and Restore in Cassandra and OpsCenter
DataStax: Backup and Restore in Cassandra and OpsCenterDataStax: Backup and Restore in Cassandra and OpsCenter
DataStax: Backup and Restore in Cassandra and OpsCenter
DataStax Academy
 
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
DataStax
 
Hadoop Meetup Jan 2019 - Hadoop On Azure
Hadoop Meetup Jan 2019 - Hadoop On AzureHadoop Meetup Jan 2019 - Hadoop On Azure
Hadoop Meetup Jan 2019 - Hadoop On Azure
Erik Krogen
 
Seastar / ScyllaDB, or how we implemented a 10-times faster Cassandra
Seastar / ScyllaDB,  or how we implemented a 10-times faster CassandraSeastar / ScyllaDB,  or how we implemented a 10-times faster Cassandra
Seastar / ScyllaDB, or how we implemented a 10-times faster Cassandra
Tzach Livyatan
 

What's hot (20)

Kafka website activity architecture
Kafka website activity architectureKafka website activity architecture
Kafka website activity architecture
 
Using ZFS file system with MySQL
Using ZFS file system with MySQLUsing ZFS file system with MySQL
Using ZFS file system with MySQL
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real World
 
Hadoop2
Hadoop2Hadoop2
Hadoop2
 
Apache cassandra architecture internals
Apache cassandra architecture internalsApache cassandra architecture internals
Apache cassandra architecture internals
 
MySQL HA
MySQL HAMySQL HA
MySQL HA
 
Building AuroraObjects- Ceph Day Frankfurt
Building AuroraObjects- Ceph Day Frankfurt Building AuroraObjects- Ceph Day Frankfurt
Building AuroraObjects- Ceph Day Frankfurt
 
Cassandra Workshop - Cassandra from scratch in one day
Cassandra Workshop - Cassandra from scratch in one dayCassandra Workshop - Cassandra from scratch in one day
Cassandra Workshop - Cassandra from scratch in one day
 
UJUG Craftsmanship Roundup April 2017
UJUG Craftsmanship Roundup April 2017UJUG Craftsmanship Roundup April 2017
UJUG Craftsmanship Roundup April 2017
 
MEETUP - Unboxing Apache Cassandra 3.10
MEETUP - Unboxing Apache Cassandra 3.10MEETUP - Unboxing Apache Cassandra 3.10
MEETUP - Unboxing Apache Cassandra 3.10
 
HBaseCon 2015- HBase @ Flipboard
HBaseCon 2015- HBase @ FlipboardHBaseCon 2015- HBase @ Flipboard
HBaseCon 2015- HBase @ Flipboard
 
Cassandra Redis
Cassandra RedisCassandra Redis
Cassandra Redis
 
Orchestrating Cassandra with Kubernetes: Challenges and Opportunities
Orchestrating Cassandra with Kubernetes: Challenges and OpportunitiesOrchestrating Cassandra with Kubernetes: Challenges and Opportunities
Orchestrating Cassandra with Kubernetes: Challenges and Opportunities
 
ScyllaDB: NoSQL at Ludicrous Speed
ScyllaDB: NoSQL at Ludicrous SpeedScyllaDB: NoSQL at Ludicrous Speed
ScyllaDB: NoSQL at Ludicrous Speed
 
CASSANDRA MEETUP - Choosing the right cloud instances for success
CASSANDRA MEETUP - Choosing the right cloud instances for successCASSANDRA MEETUP - Choosing the right cloud instances for success
CASSANDRA MEETUP - Choosing the right cloud instances for success
 
Intro to cassandra
Intro to cassandraIntro to cassandra
Intro to cassandra
 
DataStax: Backup and Restore in Cassandra and OpsCenter
DataStax: Backup and Restore in Cassandra and OpsCenterDataStax: Backup and Restore in Cassandra and OpsCenter
DataStax: Backup and Restore in Cassandra and OpsCenter
 
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
 
Hadoop Meetup Jan 2019 - Hadoop On Azure
Hadoop Meetup Jan 2019 - Hadoop On AzureHadoop Meetup Jan 2019 - Hadoop On Azure
Hadoop Meetup Jan 2019 - Hadoop On Azure
 
Seastar / ScyllaDB, or how we implemented a 10-times faster Cassandra
Seastar / ScyllaDB,  or how we implemented a 10-times faster CassandraSeastar / ScyllaDB,  or how we implemented a 10-times faster Cassandra
Seastar / ScyllaDB, or how we implemented a 10-times faster Cassandra
 

Viewers also liked

Stations and Yards of Railway
Stations and Yards of RailwayStations and Yards of Railway
Stations and Yards of Railway
Souvik Mondal
 
Top 5 - Insight Report
Top 5 - Insight ReportTop 5 - Insight Report
Top 5 - Insight Report
Christina Eleen Riojas
 
La corrosión
La corrosiónLa corrosión
La corrosión
Mariana Fernandez
 
Ccn pm1ch02 v5
Ccn pm1ch02 v5Ccn pm1ch02 v5
Ccn pm1ch02 v5
sasuukee
 
Web Académica 2.0
Web Académica 2.0Web Académica 2.0
Web Académica 2.0
pedro0953
 
Phlipp, Franco, Nacho y Lucas
Phlipp, Franco, Nacho y LucasPhlipp, Franco, Nacho y Lucas
Phlipp, Franco, Nacho y Lucas
Bricktowers
 
Vfd drive motor load
Vfd drive motor loadVfd drive motor load
Vfd drive motor load
Zhaoyi Fang
 
Comparacion netiner linkedin aguilar-moron
Comparacion netiner linkedin aguilar-moronComparacion netiner linkedin aguilar-moron
Comparacion netiner linkedin aguilar-moron
Dirección General de Escuelas. Gobierno de Mendoza
 
Ud06 e04 comandos avanzados
Ud06 e04 comandos avanzadosUd06 e04 comandos avanzados
Ud06 e04 comandos avanzados
Ángela B
 
Errores Historicos
Errores HistoricosErrores Historicos
Errores Historicos
guestba90299
 
Legislación penal especial presentación sandimar
Legislación penal especial presentación sandimarLegislación penal especial presentación sandimar
Legislación penal especial presentación sandimar
Rainmar Cedeño
 
3Com 005686-03
3Com 005686-033Com 005686-03
3Com 005686-03
savomir
 
Vii+jornadas+de+historia+en+llerena
Vii+jornadas+de+historia+en+llerenaVii+jornadas+de+historia+en+llerena
Vii+jornadas+de+historia+en+llerena
arturogazulbiblioteca
 
Conferencia comunicacion-interna
Conferencia comunicacion-internaConferencia comunicacion-interna
Conferencia comunicacion-interna
Carlos Andrés Masso Lopez
 
16. columna vertebral torácica
16. columna vertebral torácica16. columna vertebral torácica
16. columna vertebral torácica
Emagister
 

Viewers also liked (15)

Stations and Yards of Railway
Stations and Yards of RailwayStations and Yards of Railway
Stations and Yards of Railway
 
Top 5 - Insight Report
Top 5 - Insight ReportTop 5 - Insight Report
Top 5 - Insight Report
 
La corrosión
La corrosiónLa corrosión
La corrosión
 
Ccn pm1ch02 v5
Ccn pm1ch02 v5Ccn pm1ch02 v5
Ccn pm1ch02 v5
 
Web Académica 2.0
Web Académica 2.0Web Académica 2.0
Web Académica 2.0
 
Phlipp, Franco, Nacho y Lucas
Phlipp, Franco, Nacho y LucasPhlipp, Franco, Nacho y Lucas
Phlipp, Franco, Nacho y Lucas
 
Vfd drive motor load
Vfd drive motor loadVfd drive motor load
Vfd drive motor load
 
Comparacion netiner linkedin aguilar-moron
Comparacion netiner linkedin aguilar-moronComparacion netiner linkedin aguilar-moron
Comparacion netiner linkedin aguilar-moron
 
Ud06 e04 comandos avanzados
Ud06 e04 comandos avanzadosUd06 e04 comandos avanzados
Ud06 e04 comandos avanzados
 
Errores Historicos
Errores HistoricosErrores Historicos
Errores Historicos
 
Legislación penal especial presentación sandimar
Legislación penal especial presentación sandimarLegislación penal especial presentación sandimar
Legislación penal especial presentación sandimar
 
3Com 005686-03
3Com 005686-033Com 005686-03
3Com 005686-03
 
Vii+jornadas+de+historia+en+llerena
Vii+jornadas+de+historia+en+llerenaVii+jornadas+de+historia+en+llerena
Vii+jornadas+de+historia+en+llerena
 
Conferencia comunicacion-interna
Conferencia comunicacion-internaConferencia comunicacion-interna
Conferencia comunicacion-interna
 
16. columna vertebral torácica
16. columna vertebral torácica16. columna vertebral torácica
16. columna vertebral torácica
 

Similar to NoSQL Session II

Cassandra - A Distributed Database System
Cassandra - A Distributed Database System Cassandra - A Distributed Database System
Cassandra - A Distributed Database System
Md. Shohel Rana
 
BigData Developers MeetUp
BigData Developers MeetUpBigData Developers MeetUp
BigData Developers MeetUp
Christian Johannsen
 
Apache Cassandra at the Geek2Geek Berlin
Apache Cassandra at the Geek2Geek BerlinApache Cassandra at the Geek2Geek Berlin
Apache Cassandra at the Geek2Geek Berlin
Christian Johannsen
 
04-Introduction-to-CassandraDB-.pdf
04-Introduction-to-CassandraDB-.pdf04-Introduction-to-CassandraDB-.pdf
04-Introduction-to-CassandraDB-.pdf
hothyfa
 
Cassandra for mission critical data
Cassandra for mission critical dataCassandra for mission critical data
Cassandra for mission critical data
Oleksandr Semenov
 
Cassandra - A decentralized storage system
Cassandra - A decentralized storage systemCassandra - A decentralized storage system
Cassandra - A decentralized storage system
Arunit Gupta
 
Talk about apache cassandra, TWJUG 2011
Talk about apache cassandra, TWJUG 2011Talk about apache cassandra, TWJUG 2011
Talk about apache cassandra, TWJUG 2011
Boris Yen
 
Talk About Apache Cassandra
Talk About Apache CassandraTalk About Apache Cassandra
Talk About Apache Cassandra
Jacky Chu
 
Trivadis TechEvent 2016 Big Data Cassandra, wieso brauche ich das? by Jan Ott
Trivadis TechEvent 2016 Big Data Cassandra, wieso brauche ich das? by Jan OttTrivadis TechEvent 2016 Big Data Cassandra, wieso brauche ich das? by Jan Ott
Trivadis TechEvent 2016 Big Data Cassandra, wieso brauche ich das? by Jan Ott
Trivadis
 
Appache Cassandra
Appache Cassandra  Appache Cassandra
Appache Cassandra
nehabsairam
 
NoSQL - Cassandra & MongoDB.pptx
NoSQL -  Cassandra & MongoDB.pptxNoSQL -  Cassandra & MongoDB.pptx
NoSQL - Cassandra & MongoDB.pptx
Naveen Kumar
 
CASSANDRA - Next to RDBMS
CASSANDRA - Next to RDBMSCASSANDRA - Next to RDBMS
CASSANDRA - Next to RDBMS
Vipul Thakur
 
4 use cases for C* to Scylla
4 use cases for C*  to Scylla4 use cases for C*  to Scylla
4 use cases for C* to Scylla
◄ ★ Jack Pavlov ★ ►
 
Cassndra (4).pptx
Cassndra (4).pptxCassndra (4).pptx
Cassndra (4).pptx
NikhilAmauriya
 
Using cassandra as a distributed logging to store pb data
Using cassandra as a distributed logging to store pb dataUsing cassandra as a distributed logging to store pb data
Using cassandra as a distributed logging to store pb data
Ramesh Veeramani
 
Yaroslav Nedashkovsky - "Data Engineering in Information Security: how to col...
Yaroslav Nedashkovsky - "Data Engineering in Information Security: how to col...Yaroslav Nedashkovsky - "Data Engineering in Information Security: how to col...
Yaroslav Nedashkovsky - "Data Engineering in Information Security: how to col...
Lviv Startup Club
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real World
Jeremy Hanna
 
M6d cassandrapresentation
M6d cassandrapresentationM6d cassandrapresentation
M6d cassandrapresentation
Edward Capriolo
 
Cassandra
CassandraCassandra
Cassandra
exsuns
 
Real-Time Analytics with Kafka, Cassandra and Storm
Real-Time Analytics with Kafka, Cassandra and StormReal-Time Analytics with Kafka, Cassandra and Storm
Real-Time Analytics with Kafka, Cassandra and Storm
John Georgiadis
 

Similar to NoSQL Session II (20)

Cassandra - A Distributed Database System
Cassandra - A Distributed Database System Cassandra - A Distributed Database System
Cassandra - A Distributed Database System
 
BigData Developers MeetUp
BigData Developers MeetUpBigData Developers MeetUp
BigData Developers MeetUp
 
Apache Cassandra at the Geek2Geek Berlin
Apache Cassandra at the Geek2Geek BerlinApache Cassandra at the Geek2Geek Berlin
Apache Cassandra at the Geek2Geek Berlin
 
04-Introduction-to-CassandraDB-.pdf
04-Introduction-to-CassandraDB-.pdf04-Introduction-to-CassandraDB-.pdf
04-Introduction-to-CassandraDB-.pdf
 
Cassandra for mission critical data
Cassandra for mission critical dataCassandra for mission critical data
Cassandra for mission critical data
 
Cassandra - A decentralized storage system
Cassandra - A decentralized storage systemCassandra - A decentralized storage system
Cassandra - A decentralized storage system
 
Talk about apache cassandra, TWJUG 2011
Talk about apache cassandra, TWJUG 2011Talk about apache cassandra, TWJUG 2011
Talk about apache cassandra, TWJUG 2011
 
Talk About Apache Cassandra
Talk About Apache CassandraTalk About Apache Cassandra
Talk About Apache Cassandra
 
Trivadis TechEvent 2016 Big Data Cassandra, wieso brauche ich das? by Jan Ott
Trivadis TechEvent 2016 Big Data Cassandra, wieso brauche ich das? by Jan OttTrivadis TechEvent 2016 Big Data Cassandra, wieso brauche ich das? by Jan Ott
Trivadis TechEvent 2016 Big Data Cassandra, wieso brauche ich das? by Jan Ott
 
Appache Cassandra
Appache Cassandra  Appache Cassandra
Appache Cassandra
 
NoSQL - Cassandra & MongoDB.pptx
NoSQL -  Cassandra & MongoDB.pptxNoSQL -  Cassandra & MongoDB.pptx
NoSQL - Cassandra & MongoDB.pptx
 
CASSANDRA - Next to RDBMS
CASSANDRA - Next to RDBMSCASSANDRA - Next to RDBMS
CASSANDRA - Next to RDBMS
 
4 use cases for C* to Scylla
4 use cases for C*  to Scylla4 use cases for C*  to Scylla
4 use cases for C* to Scylla
 
Cassndra (4).pptx
Cassndra (4).pptxCassndra (4).pptx
Cassndra (4).pptx
 
Using cassandra as a distributed logging to store pb data
Using cassandra as a distributed logging to store pb dataUsing cassandra as a distributed logging to store pb data
Using cassandra as a distributed logging to store pb data
 
Yaroslav Nedashkovsky - "Data Engineering in Information Security: how to col...
Yaroslav Nedashkovsky - "Data Engineering in Information Security: how to col...Yaroslav Nedashkovsky - "Data Engineering in Information Security: how to col...
Yaroslav Nedashkovsky - "Data Engineering in Information Security: how to col...
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real World
 
M6d cassandrapresentation
M6d cassandrapresentationM6d cassandrapresentation
M6d cassandrapresentation
 
Cassandra
CassandraCassandra
Cassandra
 
Real-Time Analytics with Kafka, Cassandra and Storm
Real-Time Analytics with Kafka, Cassandra and StormReal-Time Analytics with Kafka, Cassandra and Storm
Real-Time Analytics with Kafka, Cassandra and Storm
 

NoSQL Session II

  • 2. Agenda • Session I recap - Why NoSQL/ Drawback of Relational DB - Common Characteristics - Storage Mechanism - CAP Theorem & Advantages • Data stax Apache Cassandra Installation • Cassandra Concepts
  • 3. Features of Cassandra • Column based storage mechanism • High Availability • High Scalability/ Horizontal scaling • Predictable performance • No SPOF – Single point of failure • Multi DC – Data Center/ Multi region availability • Commodity Hardware • Easy to manage operationally
  • 5. • Node – One Cassandra instance • Rack – A Logical set of Nodes • Data Center – A Logical set of Racks • Cluster - The full set of nodes which map to a single complete token ring
  • 6. CQL • CREATE KEYSPACE “KeySpace Name” WITH replication = {'class': ‘Strategy name’, 'replication_factor' : ‘No.Of replicas’} • CREATE TABLE tablename( column1 name datatype PRIMARYKEY, column2 name data type, column3 name data type, PRIMARY KEY (column1) )
  • 7.
  • 8. Strategy name Description Simple Strategy' Specifies a simple replication factor for the cluster. Network Topology Strategy Using this option, you can set the replication factor for each data-center independently. The replication option is to specify the Replica Placement strategy and the number of replicas wanted. The following table lists all the replica placement strategies.
  • 9. CONSISTENCY • Consistency levels are available for Read and Write Operations. • ANY, ALL, QUORUM([RF/2]+1), EACH, etc • High Consistency – Low Availability • Low Consistency – High Availability
  • 10. SEED & CO-ORDINATOR NODE • Seeds and Coordinators serve different purposes. • Seed nodes: In general it is recommended to have 2 seeds for the whole cluster. If you have multi-datacenter cluster then you may want to distribute the seeds across each datacenter. • Coordinator nodes: Every node can be a coordinator (as designed by Cassandra). Coordinator is picked by Cassandra per request and the only thing you can change is how it is picked - for example Round-Robin (default) or DC-aware, LatencyAware. This is found in the cassandra.yaml file.
  • 11.
  • 12. • Maximum columns per row is 2 billion, but in practical it is about 10 to 20 thousand max used • Maximum data size per cell (Column value) is 2 GB, but in practical it is about 10MB used.
  • 13. CLUSTER TOPOLOGY • Cluster communicated - SNITCH and Gossip • Hinted- Handoff • Write path • Read path • Read Repair • Configuration – Cassandra.yaml file
  • 14. SNITCHES & GOSSIP • Snitch - Cassandra does its best not to have more than one replica on the same rack to avoid duplicate • Determines the location of nodes by rack and data center corresponding to the IP addresses • Gossip – Once per second each node gossip’s each other to update themselves
  • 15. • Hinted Hand Off – A recovery mechanism for writes targeting offline nodes • Grace time can be maintained yaml file • Property – max_hint_window_in_ms : 1000 • hinted_handoff enable: true
  • 17. SSTable – Static & Sorted Table • Immutable data file for row storage • Partition is spread across multiple SS Table based on timestamp • Easy Backup – Delete is marked as “TombStones”
  • 18. Read Path • Read Repair – When any node is stale it is marked as read-repair • Property – read_repair_chance
  • 19. Thank You !!! Continue in Next session