Cassandra 101

Cassandra 101
Scala by the Lagoon
Treviso, 28 April 2016

THE SPEAKER
• Software engineer at Radicalbit
• Blogger at codingjam.it
• (Re)twitter guy with @supergiampaolo
Giampaolo

AGENDA
1.What’s Cassandra
2.The CAP Theorem
3.“Internals”
4.CQL
5.Scala/Phantom
Cassandra 101
(2.x)

What’s Cassandra
• NoSQL
• High availability
• Linear Scalability
• Predictable Performance
• No Single Point of Failure (peer to peer)
• Multi DataCenter (DC)
• A hybrid between a key-value and a column oriented
• Clients see data in tables
• Data manipulation through CQL (similar to SQL)
• Not a replacement for a RDBMS
“Apache Cassandra is an open
source distributed database
management system designed
to handle large amounts of data
across many servers, providing
high availability with no single
point of failure.”

What’s Cassandra
Who uses
Cassandra?

What’s Cassandra
• Internet of Things: time series from users,
device and sensors
• Personalization: ingest and analyze to
personalize user experience
• Messaging: storing, managing, and
analyzing messages
• Fraud detection: to analyze quickly patterns
• Playlist: storing collections of user selected
items
Use cases

What’s Cassandra
• Developed at Facebook for messaging
system
• From Google Big Table: the storage model
• From Amazon Dynamo: the distribution
backbone
• Open source since 2008
• Apache top level since 2010
• Most stable release: 2.2 (2015)
• Latest release 3.4 (March 2016)
• Tick-tock releases in 2016
Quick history so far

• No master/slave, no Zookeeper, no config server
• All nodes can reply to writes and reads
• Data is partitioned around the ring
• Location of data on ring is determined a function
• Data is replicated with a specific factor
What’s Cassandra
1
4 2
3
-128
-64
0
64
C* is a big hash ring

Three properties
of shared data
system operating
over a network
The CAP Theorem
Consistency
every read would get you the most recent write
Availability
every node (if not failed) always executes queries within
unbounded time (but finite)
Partition Tolerance
the system continues to operate despite arbitrary
partitioning due to network failures

The CAP Theorem (rough)
1.AP: the system is Partitioned and
Available to clients but not
Consistent
2.CP: the system is Consistent and
working during a Partition, but
there is no Availability
3.AC: the system is Consistent and
Available there is no Partition.
Does this system really exist?
“pick two”
“this is often considered to be a misleading
presentation, since you cannot build - or
choose - 'partition tolerance': your system
either might experience partitions or it won't”.

The CAP Theorem (finer)
1. Do you respond to the reads with
potentially stale information
2. Do you wait (potentially forever) to
hear from the other side of the
partition and compromise availability?
“pick one”
or what you give up?
If a client writes to one side of a partition, any
reads that go to the other side of that partition
can't possibly know about the most recent
write. We are in a critical condition

Consistency & Availability Tradeoff
Spectrum
Both C and A are strong
guarantees in a critical condition
But out of critical conditions, a
system can be happily consistent
and available
AP to CP is a spectrum and
the whole space is useful
Eventual consistency and high
availability are good enough
AVAILABILITY CONSISTENCY
AP CP

Beyond CAP: PACELC
P(A/C)E(L/C)
AP CP
(LESS) LATENCY
CONSISTENCY
If there is a partition (P)
how does the system trade off
availability and consistency (A and C)
else (E)
when the system is running normally
in the absence of partitions, how does
the system trade off latency (L) and
consistency (C)?

Cassandra & PACELC
C* has a tunable
consistency
Can move from PC/EC to PA/EL
Consistency can be tuned at query
level from max availability to
immediate consistency
AP CP
(LESS) LATENCY
CONSISTENCY
PA/EL
PC/EC

Internals
A cluster is a peer to peer set of servers
Organized in
• Data centers
• Racks
• Nodes
Cluster

Internals
A node needs
• Cluster name
• Seeds addresses
• Its listen address
Every node is aware of status of
cluster through gossip
Joining a
cluster

Internals
Replication Tables are grouped by keyspace
Every keyspace has its own replication properties
CREATE KEYSPACE books
WITH REPLICATION =
{'class': 'SimpleStrategy',
'replication_factor' : 3};
CREATE KEYSPACE books
WITH REPLICATION =
{'class': 'NetworkTopologyStrategy',
'dc-newyork': 3,
'dc-london': 4};

Internals
Request coordinator
SELECT * FROM authors
Where name = ‘Poe’
• The node that handles communication with client
• Every node can do the coordinator
• The driver decides which is the coordinator

Internals
Consistent
hashing
CREATE TABLE authors(
name text,
bio text ,
PRIMARY KEY (name));
INSERT INTO authors
(name, bio)
VALUES
('Poe', ’Poe was a writer');
• -263..263 tokens (3.4*1038) distributed in cluster
• Every node knows its tokens
• Partitioner is a function from a partition key to a token
• Nodes and drivers use the same partitioner

Internals
Replication
Poe, Poe was a writer
Poe, Poe was a
writer
name text,
bio text ,
INSERT INTO authors
(name, bio)
VALUES
('Poe', ’Poe was a writer');

Internals
Hinted
handoff
Poe, Poe was an
American writer
Poe, Poe was an
American writer
• What if node 1 is down? Do we give up writing?
• Hinted handoff allows to replay writes to a momentary
unavailable node
• It’s optional
• There are some limitations
• Coordinator keeps the data
• What about consistency?
UPDATE authors
SET bio = ‘Poe was an
American writer’
WHERE name = ‘Poe’

Internals
Consistency
level
CONSISTENCY = TWO;
WHERE name= 'Poe’;
Real read will happen for one node only, the other two will send a digest
Poe was an
American
writer
Poe was
an
American
writer
Poe was a
writer
• How many nodes must acknowledge a request to return a
positive response to the client (Read/Write)
• Set at query level
• What if replicas do not agree?

Internals
Consistency
levels
there are more
• ANY (writes only)
• ONE
• TWO
• THREE
• QUORUM
• LOCAL_ONE
• LOCAL_QUORUM
ALL for immediate consistency (high latency)
ONE/ANY for eventual consistency (lowest latency)

Internals
Immediate
consistency
• Combine read and write to get consistency as this
rule:
If (nodes_written + nodes_read) > RF
then immediate consistency
• Examples with RF = 3
• Write: ALL, Read: ONE
• Write: QUORUM, Read: QUORUM
• Write: ONE, Read: ALL
• Is stale data really a problem?

CQL
• CQL is declarative like SQL and the
very basic structure of the query
component of the language
(SELECTs, INSERTs, …) is the same.
• But there are enough differences that
one should not approach using it in the
same way as conventional SQL
(because Cassandra is not relational!)
Cassandra
Query
Language

CQL
Cassandra
Query
Language
name text,
gender text,
born int,
died int,
bio text,
genre set<text>,
ALTER TABLE authors ADD picture blob;
ALTER TABLE authors DROP gender;
DROP TABLE authors;

CQL
Cassandra
Query
Language
Cassandra has upserts, what
happens if I want to update the
name of an author?
SELECT * FROM authors LIMIT 10;
where NAME = 'Herman Melville';
INSERT INTO authors(name)
VALUES ('Edgar Alan Poe');
UPDATE authors SET born = 1797
WHERE name = 'Mary Shelley';
UPDATE authors SET genre = genre + {'Gothic
literature'} where name = 'Mary Shelley';
DELETE from authors
WHERE name = 'Charles Dickens';

CQL
Invalid queries
• SELECT * FROM authors WHERE born = 1797;
• SELECT * FROM authors WHERE name like
'Mar%';
• SELECT * FROM authors WHERE NOT name = 'Mary
Shelley';
• SELECT * FROM authors WHERE name = 'Mary
Shelley' or name = 'Edgar Alan Poe';
• SELECT max(born) FROM authors group by
genre;
• SELECT * FROM authors ORDER by name;
• SELECT * FROM authors JOIN books ON
author.name = books.author;

CQL
How to solve
the problem?
Create tables that responds to
specific queries
Logged batch to the rescue
Actually won’t work for two
authors of 1819, try to insert
(George Eliot, 1819)
CREATE TABLE authors_by_year (
born int,
name text,
PRIMARY KEY (born));
INSERT INTO authors_by_year (born, name)
VALUES (1797, 'Mary Shelley')
INSERT INTO authors_by_year (born, name)
VALUES (1819, 'Herman Melville')
SELECT * FROM authors_by_year
WHERE born = 1797;

CQL
Partition key
Primary key
Partition
Row
• Partition is a list of rows and has a Partition
key, simple or compound
• Every row has its primary key composed by
partition key and (eventual) other columns
(said clustering) and it’s ordered inside
partition
• Partition key and primary key, something like
a two level key
• Partition key is related to token so a partition
cannot be split between nodes

CQL
Partition
example
CREATE TABLE books(
author text,
born int static,
title text,
published int,
PRIMARY KEY ((author), title);
INSERT INTO books(author, title) VALUES
('Melville', 'White-Jacket');
UPDATE books SET born = 1819 WHERE author =
'Melville’;
INSERT INTO books(author, title, published)
VALUES ('Melville', 'Moby Dick' , 1851);

CQL
Filtering and
ordering is
possible on
clustered
columns
ORDER BY title DESC;
----------+--------------+------+-----------
Melville | Moby Dick | 1819 | 1851
AND title >= 'O';
----------+--------------+------+-----------

CQL
Filtering and
ordering is
possible on
clustered
columns
SELECT * FROM books WHERE title >= 'O';
InvalidRequest: code=2200 [Invalid query]
message="Cannot execute this query as it might
involve data filtering and thus may have
unpredictable performance. If you want to
execute this query despite the performance
unpredictability, use ALLOW FILTERING”
SELECT * FROM books WHERE title >= 'O’ ALLOW
FILTERING;
----------+--------------+------+-----------

PHANTOM
Phantom
• Type-safe Scala DSL for Cassandra
• Built atop Datastax Java Driver
• Adds type safety to CQL statements
• Add reactivity (Futures!)
• Performances comparable to direct Java implementation
• Freeware/Commercial versions
• https://github.com/outworkers/phantom
• Sample code at
https://github.com/giampaolotrapasso/cassandra-101-scala-
examples

PHANTOM
Books
case class Book(
author: String,
born: Int,
title: String,
published: Option[Int] = None
)

PHANTOM
Books
class BooksTable extends CassandraTable[BooksDAO, Book] {
override val tableName = "books"
object author extends StringColumn(this) with PartitionKey[String]
object born extends IntColumn(this) with StaticColumn[Int]
object title extends StringColumn(this) with ClusteringOrder[String]
object published extends OptionalIntColumn(this)
override def fromRow(row: Row): Book = {
Book(
author = author(row),
born = born(row),
title = title(row),
published = published(row)
)
}
}

PHANTOM
Books
abstract class BooksDAO extends BooksTable with RootConnector {
def insertNewStatement(book: Book) = {
insert
.value(_.author, book.author)
.value(_.born, book.born)
.value(_.title, book.title)
.value(_.published, book.published)
}
def insertNew(book: Book): Future[dsl.ResultSet] =
insertNewStatement(book).future()
def selectByAuthor(author: String): Future[List[Book]] = {
select.where(_.author eqs author).fetch()
}
def updateBookStatement(author: String, book: String, published: Option[Int]) = {
update
.where(_.author eqs author).and(_.title eqs book)
.modify(_.published setTo published)
}
}

PHANTOM
Books by
year
case class BookByYear(year: Int, book: String)
class BooksByYearTable extends CassandraTable[BooksByYearTable, BookByYear] {
override val tableName = "books_by_year”
object year extends IntColumn(this) with PartitionKey[Int]
object book extends StringColumn(this) with ClusteringOrder[String] with Ascending
override def fromRow(row: Row): BookByYear = {
BookByYear(
year = year(row),
book = book(row))
}
}
abstract class BooksByYearDAO extends BooksByYearTable with RootConnector {
def insertNewStatement(bookByYear: BookByYear) = {
insert
.value(_.year, bookByYear.year)
.value(_.book, bookByYear.book)
}
def insertNew(postByAuthor: BookByYear) = insertNewStatement(postByAuthor).future
def selectByYear(year: Int): Future[List[BookByYear]] = {
select.where(_.year eqs year).fetch()
}
}

PHANTOM
Configuration object Config {
val keySpace = KeySpace("books")
val contactPoint: KeySpaceBuilder = ContactPoint(
host = "localhost",
port = ContactPoint.DefaultPorts.live)
val keySpaceConnector: KeySpaceDef =
contactPoint.keySpace(keySpace.name)
}

PHANTOM
Database class Database(val keySpaceConnector: KeySpaceDef)
extends DatabaseImpl(keySpaceConnector) {
object books extends BooksDAO with keySpaceConnector.Connector
object booksByYear extends BooksByYearDAO
with keySpaceConnector.Connector
def insertBook(book: Book) = {
var batch = Batch.logged
.add(books.insertNewStatement(book))
book.published.map { year =>
val byYear = BookByYear(year, book.title)
batch = batch.add(booksByYear.insertNewStatement(byYear))
}
batch.future()
}
}
object BookDatabase extends Database(Config.keySpaceConnector)

PHANTOM
Run! implicit val space: KeySpace = BookDatabase.space
implicit val session: Session = BookDatabase.session
Await.result(BookDatabase.autocreate().future, 10 seconds)
val book1 = Book("Melville”, 1819, "White-Jacket”, Some(1850))
val book2 = book1.copy(title = "Moby Dick", published = Some(1851))
val book3 = Book("Hemingway”, 1899, "For Whom the Bell tolls”)
val book4 = Book("Dino Buzzati”, 1906, "Il deserto dei tartari”, Some(1940))
val books: List[Future[ResultSet]] =
List(book1, book2, book3, book4).map(p => BookDatabase.insertBook(p))
val operations: Future[List[ResultSet]] = Future.sequence(books)
Await.result(operations, 10.seconds)

PHANTOM
Run! val listOfMelville: Future[List[Book]] =
BookDatabase.books.selectByAuthor("Melville")
listOfMelville.onComplete {
case Success(list) => println(s"${list.size} books by Melville ")
case Failure(x) => println(x)
}
Await.result(listOfMelville, 3.seconds)
val booksOf1940 = BookDatabase.booksByYear.selectByYear(1940)
booksOf1940.onComplete {
case Success(list) => println(s"${list.size} books in 1940")
case Failure(x) => println(x)
}
Await.ready(booksOf1940, 3.seconds)

Thanks!
Giampaolo Trapasso
Software engineer
giampaolo.trapasso@radicalbit.io

Cassandra 101

Recommended

Recommended

More Related Content

Similar to Cassandra 101

Similar to Cassandra 101 (20)

Recently uploaded

Recently uploaded (20)

Cassandra 101