Declarative Benchmarking of Cassandra and Its Data Models

Monal Daxini @ monaldax
11/11/2019 ApacheCon, Las Vegas, 2019
https://www.linkedin.com/in/monaldaxini
Declarative Benchmarking of
Cassandra and Its Data Models

Profile
● Cloud Data Engineering @ Netflix, work on many data stores
● Help engineers build scalable solutions
● Built scalable data platforms using Apache Flink / Kafka / Docker
● Working with distributed systems for 18+ years

Netflix Cassandra Footprint
• 100s of applications using Cassandra (several unique data models / configs)
• 10s of thousands of instances
• 100s of global C* clusters
• > 6 PB of data
• Millions of requests per second

Structure Of The Talk
• Challenges developing a scalable data model (Cassandra)
• Declarative Cassandra benchmarking tool in action
• Tool’s philosophy, how it works, & how it can apply to other data stores

Developing a Scalable Cassandra Data Model
For each application:
1. Design data model & schema
2. Design application queries
3. Identify application load & query distribution
4. Prepare test data
5. Prepare query parameter values to run queries efficiently
6. Code an app to execute queries, and instrument it to capture metrics
7. Generate load against the application to run queries with the desired distribution
8. Analyze results (build dashboard)
9. If results are unsatisfactory, iterate from step 1

In addition, we may need to test the application workload on different
versions of Cassandra and/or data models.

That’s a lot of steps and duplicate effort, and it’s cumbersome!
We want it to be easy, quick, and ergonomic!

Developing a Scalable Cassandra Data Model
With tooling, for each application:
1. Design data model & schema
2. Design application queries
3. Identify the application load & query distribution
4. Prepare test data (generate)
9. Configure the tool and run the test; if results are unsatisfactory, iterate from step 1
Heavy Lifting in a Tool
5. Prepare query parameter values to run queries efficiently
6. Code an app to execute queries, and instrument it to capture metrics
7. Generate load against the application to run queries with the desired distribution
8. Analyze results (build dashboard)

What is NDBench?
● Generic benchmarking tool
● Support different data stores via plugin (available plugins)
● Dynamically tunable RPS and configuration
● Load patterns - random, time window, zipfian

NDBench In Action
[Diagram: the NDBench app UI and several NDBench nodes (EC2 instances). The nodes issue reads / writes against a test Cassandra cluster holding the schema & test data, and record metrics.]

Cassandra NDBench CQL Plugin
• Emulates application query logic; runs against real or generated data
• Specify the traffic % distribution
• Basic data type coercion for using a query result in another query
• Run any CQL statement (SELECT, UPDATE, INSERT, DELETE) & support all CQL types
• Support any Cassandra version with CQL support

What Do We Use It For / Plan To Use It For
• Validate scalability of a data model and application query workload
• Compare the performance of a data model on Cassandra versions 2.x & 3.x
• Help certify Cassandra updates / upgrades - test different data models and application workloads
• Generate data for a given schema before running queries

Walkthrough of NDBench
CQL Plugin In Action
Steps 1-4, 9

Cassandra Schema Of Sample Application (step 1)

Application CQL Queries For API 1 (steps 2, 3)
Query Group 1: 70%
SELECT user_id, profile_id FROM user WHERE user_id = ?;
SELECT foreign_keys FROM user_index WHERE type = 'profile_id' AND value = ?;

Application CQL Queries For API 2 (steps 2, 3)
Query Group 2: 30%
SELECT user_id, profile_id, acc_guid FROM user WHERE user_id = ?;
BEGIN BATCH
  INSERT INTO user_index (create_time, foreign_keys, type, value) VALUES (?, [ ?, ? ], 'profile_id', ?);
  INSERT INTO user_index (create_time, foreign_keys, type, value) VALUES (?, [ ? ], 'acc_guid', ?);
APPLY BATCH;
INSERT INTO map_test (id, uid_pid) VALUES ('1', {user_id : ?, profile_id: ?});
INSERT INTO set_test (id, uid_pid) VALUES ('2', {?});

NDBench CQL Plugin Overview
[Diagram: NDBench nodes (EC2 instances) with the CQL plugin use the perf test profile stored in ndb_perf_queries, run queries against the test Cassandra cluster (schema & test data), and record metrics.]

NDBench CQL Plugin Perf-Test-Profile Schema (step 9)
[Figure: perf-test-profile table schema. Callouts: CQL statements within a group are ordered (id); the var_* columns point to different sources for query parameter values, and only one is used.]

Modified App Query With Parameter Reference - Group 1 (70%)
SELECT user_id, profile_id FROM user WHERE user_id = ?user_id?;
SELECT foreign_keys FROM user_index WHERE type = 'profile_id' AND value = ?profile_id?;

Modified App Query With Parameter Reference - Group 2 (30%)
SELECT user_id, profile_id, acc_guid FROM user WHERE user_id = ?user_id?;
BEGIN BATCH
  INSERT INTO user_index (create_time, foreign_keys, type, value) VALUES (?:TS?, ?[user_id, profile_id]?, 'profile_id', ?profile_id?);
  INSERT INTO user_index (create_time, foreign_keys, type, value) VALUES (?:TS?, ?[user_id]?, 'acc_guid', ?acc_guid?);
APPLY BATCH;
INSERT INTO map_test (id, uid_pid) VALUES ('1', ?{user_id : user_id, profile_id: profile_id}?);
INSERT INTO set_test (id, uid_pid) VALUES ('2', ?s{user_id}s?);
Type Coercion
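The `?name?` references in the modified queries can be pictured as named slots that are turned into ordinary positional bind markers before the statement is prepared. The following is a minimal sketch of that idea, not the plugin's actual parser: the function name `to_prepared` and the regular expression are assumptions made for illustration, and the sketch deliberately ignores the generator form (`?:TS?`) and the collection forms (`?[...]?`, `?{...}?`, `?s{...}s?`).

```python
import re

# Hypothetical pattern: matches simple references such as ?user_id?.
# Generator and collection references are out of scope for this sketch.
PARAM_REF = re.compile(r"\?([A-Za-z_][A-Za-z0-9_]*)\?")


def to_prepared(cql):
    """Return (CQL with positional '?' markers, parameter names in bind order)."""
    names = []

    def swap(match):
        names.append(match.group(1))
        return "?"

    return PARAM_REF.sub(swap, cql), names


statement, params = to_prepared(
    "SELECT foreign_keys FROM user_index "
    "WHERE type = 'profile_id' AND value = ?profile_id?;"
)
print(statement)
print(params)
```

The returned name list tells the caller which column of the parameter source to bind at each position, which is one plausible way to keep the profile entry declarative.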

NDBench CQL Plugin Perf Test Profile - 2 Query Groups

NDBench CQL Plugin Perf Test Profile - Select source

NDBench CQL Plugin Perf Test Profile - Source Precedence

Summary - Ergonomic Perf Test Profile & Comprehensive Validation
• Total traffic % of query groups must add up to 100
• Supports a different consistency level for each statement
• Columns in a CQL statement are inferred, and available from the parameter source
• Parameter source - table, previous query results, SELECT statement
• Supports a large number of parameters to perf test CQL queries

Run Load Test
Spinnaker Pipeline

Run Load Test
Spinnaker Pipeline
Manual Judgement
Test Specific Link

NDBenchUI-CQLPlugin
CassCQLPlugin

NDBenchUI-CQLPlugin
CassCQLPlugin

30:00 (mm:ss)
25 min for the perf test profile table entry, 5 min to run the test

Dashboard - CQL Plugin Specific

Dashboard - Query Execution Latency Per Group

Testing C* Data Model For A Critical Service On 2.x & 3.x
• Test scale: up to 1.2 million ops / second (1.2 billion parameter rows)
• 96 nodes (i3.8xl), LCS (compaction), LZ4, mostly read heavy
• Found a data model bug that slowly led to wide rows
• Found client wrapper bugs - slow memory leak, metrics, prepared statement caching not working

We Would Like To Use Plugin To Test Cassandra @ Netflix
Use restores from prod data backups and definitions of
CQL Perf Test Profiles, exercised by the NDBench
CQL plugin, and triggered by Cassandra builds

Under The Hood Of
The CQL Plugin

NDBench CQL Plugin Architecture
[Diagram: NDBench nodes (EC2 instances) with the CQL plugin use the perf test profile stored in ndb_perf_queries, run queries against the test Cassandra cluster (schema & test data), and record metrics.]

High-level Architecture
[Diagram: NDBench UI, NDBench nodes with a SQLite param store, and a Cassandra cluster holding ndb_perf_queries plus the schema & test data. Metadata could live on any Cassandra cluster. Diagram label: randomize start.]
0. NDBench UI calls /init/ on all nodes (REST)
1. Parse metadata
2. Load from user & store on node in SQLite
3. /start/ all nodes
4. Run queries with param values from SQLite & record metrics
High-level Architecture (optimized)
[Diagram: NDBench UI, NDBench nodes with a SQLite param store, and a Cassandra cluster holding ndb_perf_queries plus the schema & test data. Metadata could live on any Cassandra cluster. Diagram label: randomize start.]
0. /init/ a node
1. Parse metadata
2. If ! user param on S3: load from & store on 1 node in SQLite
3. Upload SQLite file
4. NDBench UI calls /init/ on all nodes (REST)
5. Download SQLite file from each node
6. /start/ all nodes
7. Run queries with param values from SQLite & record metrics
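To make the role of the node-local SQLite param store more concrete, here is a minimal sketch of loading parameter rows once and then picking one row per query. The table name `params`, its columns, the generated values, and the helper `next_params` are all hypothetical; the talk does not show how the plugin lays out or samples its SQLite file.

```python
import random
import sqlite3

# Hypothetical local parameter store; an in-memory database stands in
# for the SQLite file that would live on each NDBench node.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE params (user_id TEXT, profile_id TEXT)")
conn.executemany(
    "INSERT INTO params (user_id, profile_id) VALUES (?, ?)",
    [(f"u{i}", f"p{i}") for i in range(1000)],
)
conn.commit()

row_count = conn.execute("SELECT COUNT(*) FROM params").fetchone()[0]


def next_params(rng):
    """Pick one parameter row by rowid instead of sorting the table randomly."""
    rowid = rng.randint(1, row_count)  # rowids are 1..row_count in a fresh table
    return conn.execute(
        "SELECT user_id, profile_id FROM params WHERE rowid = ?", (rowid,)
    ).fetchone()


user_id, profile_id = next_params(random.Random(42))
```

Looking rows up by `rowid` keeps each pick cheap even with a very large parameter table, which matters when a test replays over a billion parameter rows; whether the plugin does it this way is an assumption.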

Dashboard - Parameters Values Uploaded and Shared

Lock-free Randomized Deterministic % Query Distribution On Each Node
Query Group ID 1: 70%, Query Group ID 2: 30%
(1) 100-element array: 70 1s for Query Group 1, 30 2s for Query Group 2
    1 1 1 1 1 1 1 2 2 2 2

One-time Fisher-Yates shuffle
    1 2 1 1 2 1 2 1 2 1 1

Thread 1 ... Thread n: each thread has its own ThreadLocal array index
    1 2 1 1 2 1 2 1 2 1 1
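The three slides above describe the technique compactly: expand the percentages into a 100-element array, shuffle it once, and let every thread walk the shared array with its own index. A minimal Python sketch of that idea follows. The names `build_slots` and `next_group` are hypothetical, and starting every thread at index 0 is an assumption; the plugin itself is not shown in the talk.

```python
import random
import threading


def build_slots(group_percent, rng):
    """Expand {group_id: percent} into a 100-element array and shuffle it once."""
    if sum(group_percent.values()) != 100:
        raise ValueError("traffic percentages must add up to 100")
    slots = [gid for gid, pct in group_percent.items() for _ in range(pct)]
    # Fisher-Yates shuffle: swap each position with a random position at or before it.
    for i in range(len(slots) - 1, 0, -1):
        j = rng.randint(0, i)
        slots[i], slots[j] = slots[j], slots[i]
    return slots


SLOTS = build_slots({1: 70, 2: 30}, random.Random())
_cursor = threading.local()  # each thread keeps its own array index


def next_group():
    """Return the next query group id for the calling thread."""
    i = getattr(_cursor, "index", 0)
    _cursor.index = (i + 1) % len(SLOTS)
    return SLOTS[i]
```

Because the array is never modified after the shuffle and each index is thread-local, no lock is needed, and any 100 consecutive picks on a thread contain exactly 70 entries for group 1 and 30 for group 2.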

Data Generators And Generating Test Data
• ?:TS? - replaced by a timestamp
• Add more generators (future)
• Generation of non-collection (bigint, text, uuid, etc.) and collection types
• Use generators in INSERT to generate data for a new schema
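A generator reference such as `?:TS?` can be thought of as a lookup into a registry of value-producing functions. The sketch below illustrates that reading only: the registry `GENERATORS`, the function `expand_generators`, and the choice to bind the value as epoch milliseconds are assumptions, since the talk does not say whether the plugin binds the value or writes it into the statement text.

```python
import re
import time

# Hypothetical registry: generator name -> zero-argument function.
GENERATORS = {
    "TS": lambda: int(time.time() * 1000),  # current time in epoch milliseconds
}

GENERATOR_REF = re.compile(r"\?:([A-Z]+)\?")


def expand_generators(cql):
    """Replace each ?:NAME? with a bind marker; return the generated values in order."""
    values = []

    def swap(match):
        values.append(GENERATORS[match.group(1)]())
        return "?"

    return GENERATOR_REF.sub(swap, cql), values


statement, values = expand_generators(
    "INSERT INTO user_index (create_time, type) VALUES (?:TS?, 'profile_id');"
)
```

Adding a generator for another type would then be a matter of registering one more entry, which fits the "add more generators" item above.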

Summary
• Declarative benchmarking significantly reduces the overhead of iterating over schema and Cassandra config to achieve scale
• Used to test and benchmark against curated data sets and perf-test-profiles
• Supports all data types; LWT support (beta)
• Randomized deterministic percentage distribution of queries

Future Enhancements (Lazily)
• Open source NDBench CQL plugin (WIP)
• Add more generators
• Load sharded query parameter data on each NDBench node
• UDT support in dynamic collections
• Build support for other data stores - leverage same philosophy & reuse code

End of Season 1
Q & A
