
Cassandra @ Sony: The good, the bad, and the ugly part 2


This talk covers scaling Cassandra to a fast-growing user base. Alex and Isaias will cover new best practices and how to work with the strengths and weaknesses of Cassandra at large scale. They will discuss how to adapt to bottlenecks while providing a rich feature set to the PlayStation community.



  1. 1. Cassandra:
  2. 2. Who is talking Alexander Filipchik (PSN: LaserToy) Principal Software Engineer at Sony Network Entertainment
  3. 3. Who is talking Alexander Filipchik (PSN: LaserToy) Principal Software Engineer at Sony Interactive Entertainment
  4. 4. Me
5. 5. The Rise of PlayStation 4 PlayStation Network is big and growing. – Over 65 million monthly active users. – Hundreds of millions of users. – A lot of services.
6. 6. PlayStation 4 growth • Pre-warm – November 2013, a couple thousand PS4s for Taco Bell. • Launch day – 1,000,000 PS4s several days later. • Adding 1.3 million devices a month.
7. 7. Let’s compare ourselves with
8. 8. Year | Unicorn’s Tech | Our Tech
2009 | MySQL | –
2011 | MongoDB/MySQL | –
2012 | Redis/MySQL | PS3: MySQL + Memcached
2013 | Redis/Postgres | MySQL + Memcached/Cassandra
2014 | Redis/Shards for Postgres + MySQL | MySQL + Memcached/Cassandra
2015 | Riak/Shards for Postgres + MySQL | MySQL + Memcached/Cassandra + Redis
2016 | ??? | MySQL + Memcached/Cassandra + Redis, ready for BigBang
9. 9. The Problem • The legacy system uses a well-known relational DB to handle our transactions. • It is state-of-the-art software that doesn’t scale well in our circumstances. • We wanted to allow clients to run any queries without consulting hundreds of DBAs first. • Sharding sounds like a pain. • Multiple regions should be easy.
  10. 10. Solution
11. 11. The Bad Axiom It is not easy to replace a relational database with Cassandra for user-facing traffic.
12. 12. Simple Digital Store Model Another hundred tables
13. 13. CQL Is Going to Save Us!!! • No joins. • No transactions. • No search. • Just weird.
  14. 14. What if we denormalize? Purchased
15. 15. Thrift Schema Account1: [Json 1 | Json 2 | …. | Json n] • Now it is horizontally scalable. • We have in-row transactions. • Reads are very fast – no joins. • But now we need to propagate user purchases from the DB to C*. • And figure out how to support queries. • And sometimes synchronize changes in related objects (metadata).
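A minimal sketch of what this wide-row layout could look like with Astyanax, the Thrift client mentioned later in the talk; the column family name, key layout, and JSON payload here are illustrative assumptions, not the actual schema:

```java
import com.netflix.astyanax.Keyspace;
import com.netflix.astyanax.MutationBatch;
import com.netflix.astyanax.model.ColumnFamily;
import com.netflix.astyanax.serializers.StringSerializer;

// Hypothetical wide-row layout: one row per account, one column per purchase.
public class WideRowWriter {
    private static final ColumnFamily<String, String> CF_PURCHASES =
            new ColumnFamily<>("purchases",
                    StringSerializer.get(),   // row key: account id
                    StringSerializer.get());  // column name: item id

    private final Keyspace keyspace;

    public WideRowWriter(Keyspace keyspace) {
        this.keyspace = keyspace;
    }

    // All of a user's purchases live in one row: a single-row read returns
    // everything without joins, and a mutation batch against one row is
    // applied atomically, which is the "in-row transactions" point above.
    public void recordPurchase(String accountId, String itemId, String itemJson) throws Exception {
        MutationBatch m = keyspace.prepareMutationBatch();
        m.withRow(CF_PURCHASES, accountId).putColumn(itemId, itemJson, null);
        m.execute();
    }
}
```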
16. 16. Solving the Puzzle • There are a number of ways to notify C* about account-level changes in the source of truth – let’s not talk about that for now. • The same applies to syncing metadata (I’d love to give a separate presentation on how we can use Apache Samza to do it). • Let’s talk about queries.
17. 17. Going deeper • What the client wants: – Search, sort, filter. • What we can do: – Use a secondary index. – Use Solr integration. – Fetch everything into memory and process it (sketched below).
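The third option is the simplest to picture. A hypothetical sketch (Java 16+), assuming the row’s JSON columns have already been deserialized into a toy Purchase record:

```java
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;

// Toy stand-in for one JSON column of the account row, already deserialized.
record Purchase(String itemId, String category, long purchasedAt) {}

class InMemoryQuery {
    // Search, sort and filter happen in the service, not in Cassandra.
    // This stays cheap because it only ever sees one account's purchases.
    static List<Purchase> gamesNewestFirst(List<Purchase> allPurchases) {
        return allPurchases.stream()
                .filter(p -> "game".equals(p.category()))
                .sorted(Comparator.comparingLong(Purchase::purchasedAt).reversed())
                .collect(Collectors.toList());
    }
}
```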
18. 18. Can We Do Better? • We can index, and writing an indexer sounds like a lot of fun. • Wait, someone already had the fun and made:
19. 19. Thrift Schema v2 Before: Account1: [Json 1 | Json 2 | …. | Json n] After: Account1: [Json 1 | … | Json n | Version] • Now we can search on anything inside the row that represents the user. • The index is small and it is fast to pull from C*. • But we are still pulling all these bytes all the time. • And what if 2 servers write to the same row?
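The deck never shows code for the version column, but its job can be sketched in a few lines; everything here (names, types, the re-read hook) is invented for illustration:

```java
import java.util.function.Supplier;

// Toy illustration of the Version column's job: a cached copy of the row is
// only trusted while its version matches what C* reports; otherwise the
// expensive full-row fetch is repeated. All names here are hypothetical.
class VersionedRowCache {
    private byte[] cachedRow;      // serialized jsons (+ index)
    private long cachedVersion = -1;

    byte[] get(long storedVersion, Supplier<byte[]> reReadRow) {
        if (storedVersion != cachedVersion) {
            cachedRow = reReadRow.get(); // pull the full row from C* again
            cachedVersion = storedVersion;
        }
        return cachedRow;                // fresh enough: skip the big fetch
    }
}
```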
20. 20. Distributed Cache? • It is nice to keep things as close to our microservice as possible. • In something that can do fast reads. • And we have a lot of RAM these days. • So we can have a beefy Memcached/Redis box. • And still pay the network penalty and think about scaling them. • What if…
21. 21. Semi-Sticky Approach • The cache lives inside the microservice, so no network penalty. • Requests for the same user are processed on the same instance, so we save a network round trip and can apply some optimizations (sequencing); a routing sketch follows below. • Changes to state are also replicated to storage (C*) and tagged with a version number. • If an instance goes down, the user’s session is moved to another live instance automatically. • It is much easier to scale out microservices than C*.
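How requests stay sticky isn’t specified in the deck; one common way to get this behavior is consistent hashing over the instance list, sketched here with Guava (the instance list itself is assumed to come from service discovery):

```java
import com.google.common.hash.Hashing;

import java.nio.charset.StandardCharsets;
import java.util.List;

// Routes every request for the same account to the same microservice
// instance so its local cache stays hot. Guava's consistentHash keeps most
// assignments stable when instances join or leave the list.
class StickyRouter {
    private final List<String> instances; // e.g. hostnames from service discovery

    StickyRouter(List<String> instances) {
        this.instances = instances;
    }

    String instanceFor(String accountId) {
        int bucket = Hashing.consistentHash(
                Hashing.murmur3_128().hashString(accountId, StandardCharsets.UTF_8),
                instances.size());
        return instances.get(bucket);
    }
}
```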
22. 22. Or in Other Words [Diagram: Instances 1–3 each hold a slice of the accounts (Account 1–6, each with a version) in local cache, while Cassandra holds every row: Account1 jsons + Version … Account n jsons + Version.]
  23. 23. My Fish Phrase Give a man a fish and you will have to give him one every day. Teach a man how to fish and move on to something more interesting.
24. 24. Personalized Search (Friends) [Diagram: a real-time indexer reads the friends graph and builds an in-memory personal index in the local cache; Get Friends queries are served from it.]
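The deck doesn’t show the indexer’s code, and slide 18 never names the library, but an embeddable engine like Lucene fits the description. A rough per-user in-memory index (Lucene 5-era API; the class and field names are made up):

```java
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.document.TextField;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.IndexWriterConfig;
import org.apache.lucene.queryparser.classic.QueryParser;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.TopDocs;
import org.apache.lucene.store.RAMDirectory;

// One tiny in-memory index per account, rebuilt by the real-time indexer
// from the friends graph. Hypothetical sketch, not the production code.
class PersonalIndex {
    private final RAMDirectory dir = new RAMDirectory();
    private final StandardAnalyzer analyzer = new StandardAnalyzer();

    void indexFriend(String onlineId) throws Exception {
        try (IndexWriter writer = new IndexWriter(dir, new IndexWriterConfig(analyzer))) {
            Document doc = new Document();
            doc.add(new TextField("onlineId", onlineId, Field.Store.YES));
            writer.addDocument(doc);
        }
    }

    TopDocs search(String queryText) throws Exception {
        IndexSearcher searcher = new IndexSearcher(DirectoryReader.open(dir));
        return searcher.search(new QueryParser("onlineId", analyzer).parse(queryText), 10);
    }
}
```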
25. 25. Some Stats • Around 1 million documents indexed per second. • Tens of thousands of searches per second. • A couple dozen moderately powered EC2 instances.
26. 26. Astyanax/Thrift. Memory Leak • Connections • Buffers • … • Max connections per node
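Since the leak involved per-connection buffers, the relevant knob is the pool’s connection cap. A hedged sketch of the Astyanax pool settings in play; the numbers are placeholders, not Sony’s values:

```java
import com.netflix.astyanax.connectionpool.impl.ConnectionPoolConfigurationImpl;

class PoolSetup {
    // Capping connections per node bounds how many Thrift connections (and
    // their buffers) each client can hold open against a single Cassandra host.
    static ConnectionPoolConfigurationImpl poolConfig() {
        return new ConnectionPoolConfigurationImpl("MyConnectionPool")
                .setPort(9160)           // Thrift port
                .setMaxConnsPerHost(10)  // the "max connections per node" knob
                .setSocketTimeout(5000);
    }
}
```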
  27. 27. Astyanax Row Slice …
  28. 28. Astyanax Row Slice. Improved …
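The code on these two slides was elided in the transcript, but the query shape they discuss is Astyanax’s multi-key read. A hedged sketch (column family name and keys are illustrative):

```java
import com.netflix.astyanax.Keyspace;
import com.netflix.astyanax.model.ColumnFamily;
import com.netflix.astyanax.model.Rows;
import com.netflix.astyanax.serializers.StringSerializer;

// Fetch several account rows in a single row-slice query instead of one
// round trip per key. Illustrative sketch, not the code from the slides.
class RowSliceExample {
    private static final ColumnFamily<String, String> CF_ACCOUNTS =
            new ColumnFamily<>("accounts", StringSerializer.get(), StringSerializer.get());

    static Rows<String, String> fetch(Keyspace keyspace, String... accountIds) throws Exception {
        return keyspace.prepareQuery(CF_ACCOUNTS)
                .getKeySlice(accountIds)
                .execute()
                .getResult();
    }
}
```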
29. 29. The most important link: https://issues.apache.org/jira/browse/cassandra/ Check it daily.
30. 30. Invisible Assassin • Small keyspace in a medium cluster (30 rows, 1 KB). • CQL: select * from BlockList. • Cached in a local cache for 5 minutes (sketched below). • CPU at 100%, timeouts across the cluster. • A cluster of 20 nodes DIED after 3 hours. • The root cause was never found.
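The caching in the second and third bullets could be as simple as a Guava loading cache with a five-minute expiry; in this sketch the actual select * from BlockList query is stubbed out:

```java
import com.google.common.cache.CacheBuilder;
import com.google.common.cache.CacheLoader;
import com.google.common.cache.LoadingCache;

import java.util.List;
import java.util.concurrent.TimeUnit;

// Local 5-minute cache over the tiny BlockList table, so Cassandra is hit
// at most once per expiry window per instance. Sketch with Guava.
class BlockListCache {
    private final LoadingCache<String, List<String>> cache =
            CacheBuilder.newBuilder()
                    .expireAfterWrite(5, TimeUnit.MINUTES)
                    .build(new CacheLoader<String, List<String>>() {
                        @Override
                        public List<String> load(String key) {
                            return loadBlockList();
                        }
                    });

    List<String> blockList() {
        return cache.getUnchecked("blockList");
    }

    private List<String> loadBlockList() {
        return List.of(); // placeholder for "select * from BlockList"
    }
}
```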
31. 31. Non-vnodes to vnodes migration [Diagram: dc1 with assigned tokens, dc2 with vnodes; applications talk to both.]
32. 32. Went Wrong for Cache [Diagram: applications read at CL_ONE with Local=dc1, but CL_ONE picks the fastest replica – which sits in the still-empty vnodes dc2.] The downstream dependency is now in trouble.
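One mitigation, sketched here as an assumption rather than what the team actually did, is to stop reading at CL_ONE and require a local-datacenter quorum, so reads cannot wander to the empty DC; the exact knobs depend on the Astyanax version:

```java
import com.netflix.astyanax.impl.AstyanaxConfigurationImpl;
import com.netflix.astyanax.model.ConsistencyLevel;

class DcPinnedConfig {
    // CL_LOCAL_QUORUM keeps reads and writes within the client's own
    // datacenter, instead of CL_ONE's "fastest replica anywhere".
    static AstyanaxConfigurationImpl pinnedToLocalDc() {
        return new AstyanaxConfigurationImpl()
                .setDefaultReadConsistencyLevel(ConsistencyLevel.CL_LOCAL_QUORUM)
                .setDefaultWriteConsistencyLevel(ConsistencyLevel.CL_LOCAL_QUORUM);
    }
}
```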
33. 33. Conclusion • Pretty stable and scalable. • Important link: https://issues.apache.org/jira/browse/cassandra/ • Keeps you in shape. • Easy to fork to experiment.
  34. 34. But How To Get Replication Info?
35. 35. Replication Logs Example
17:06:52 Received from DC1, R1: update KS Test CF test K 1000 C hello Size 76 Timestamp 1456333612729000 at 1456333612735000. Diff is: 6000
17:06:53 Received from DC2, R1: update KS Test CF test K 1000 C hello Size 76 Timestamp 1456333613344000 at 1456333613345000. Diff is: 10000
17:06:53 Received from DC1, R2: update KS Test CF test K 1000 C hello Size 76 Timestamp 1456333613698000 at 1456333613700000. Diff is: 2000
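The "Diff" at the end of each line is just the arrival time minus the mutation’s write timestamp, both in microseconds; a minimal sketch using the first entry’s values:

```java
// Replication lag as logged above: receive time minus write timestamp (us).
class ReplicationLag {
    static long lagMicros(long writeTimestampMicros, long receivedAtMicros) {
        return receivedAtMicros - writeTimestampMicros;
    }

    public static void main(String[] args) {
        // First log line: written at ...729000, received at ...735000 -> 6000
        System.out.println(lagMicros(1456333612729000L, 1456333612735000L));
    }
}
```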
