Hello Cronies,
Here are the slides from our recent meetup.
Title: It's about Time: Deep dive into event store using Apache Cassandra
Big Data At-a-Glance
· What is Big Data?
· What have we seen so far in the AJM Big Data series?
· Refresher/overview of basic terminology
· Where is it? Am I using it?
Introduction to Apache Cassandra
· What, When and Why of Apache Cassandra
· Protocol, Queries, Architecture and everything else
· Who is using Apache Cassandra?
· Interesting use cases of Apache Cassandra (Twitter, Disqus, etc.)
· Demo application walk-through
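As a taste of the "event store" idea the deck covers, here is a minimal, illustrative sketch of the classic time-series schema pattern in Cassandra. The table name, columns, and daily bucketing below are my assumptions for illustration, not taken from the slides:

```python
# Illustrative sketch (not from the slides) of a time-series event store
# schema in Cassandra: events are grouped into one partition per day so
# that no single partition grows without bound, and rows within a
# partition are clustered by event time for efficient range scans.
from datetime import datetime, timezone

CREATE_EVENTS_TABLE = """
CREATE TABLE IF NOT EXISTS events (
    day        text,       -- partition key: one partition per day
    event_time timestamp,  -- clustering column: events ordered by time
    event_id   uuid,       -- tie-breaker for events at the same instant
    payload    text,
    PRIMARY KEY ((day), event_time, event_id)
) WITH CLUSTERING ORDER BY (event_time DESC);
"""

def day_bucket(ts: datetime) -> str:
    """Partition-key value (YYYY-MM-DD) for a given event timestamp."""
    return ts.strftime("%Y-%m-%d")

# Example: an event from 5 Sep 2013 lands in the "2013-09-05" partition.
print(day_bucket(datetime(2013, 9, 5, 12, 30, tzinfo=timezone.utc)))
```

The design choice here is the usual Cassandra trade-off: reads for "events on day X" hit exactly one partition, at the cost of having to query multiple partitions for ranges that span days.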
1. It's about Time: Deep dive into event store using Apache Cassandra
by Nikunj Thakkar
2. Agenda
Big Data at-a-glance
● What is Big Data?
● So far in the AJM Big Data Series
● Where is it? Am I using it?
Introduction to Apache Cassandra
● What, When and Why of Cassandra
● Protocol, Architecture, Queries and Everything else
● Interesting Use-cases
● Demo
17. Big Data: Am I using it?
● Targeted marketing
● Public sector
● Health care
● Social media and web data
● Global personal location tracking
● Automated device-generated data
48. Apache Cassandra @ Disqus
➔ Disqus is a discussion platform for the web. It connects publishers with users and allows them to have public discourse in a medium that enables communication across the web.
49. Apache Cassandra @ Disqus
➔ Disqus uses Cassandra in a number of different places, mainly in the product: it is used for content recommendation and also a little bit of advertising. Say you are reading an article about the war in Syria and you notice another interesting article about what the British PMs have released as a public statement on whether or not it is legal to go to war, and maybe you are interested in reading that response. Cassandra powers the analytics and content engine behind how Disqus recommends content.
50. Apache Cassandra @ Disqus
➔ Main cluster: 24 nodes
➔ CPU: 6-core 3 GHz Xeons, the biggest spec, because CPU was turning out to be a small bottleneck at times
➔ 24 GB RAM per node, with an 8 GB heap size
➔ 32 or 48 GB of RAM wasn't helping much
➔ It handles a load of about 30,000 reads a second
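For a sense of scale, a quick back-of-the-envelope check on those cluster figures (assuming reads are spread evenly across the nodes, which the slides do not state):

```python
# Rough per-node read throughput for the Disqus cluster figures above,
# assuming the 30,000 reads/s are spread evenly over all 24 nodes.
nodes = 24
cluster_reads_per_second = 30_000

per_node_reads = cluster_reads_per_second / nodes
print(per_node_reads)  # 1250.0 reads/s per node
```

In practice the distribution depends on the replication factor, token assignment, and client load balancing, so treat this only as an order-of-magnitude estimate.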