Honest Performance Testing with "NDBench" (Vinay Chella, Netflix) | Cassandra Summit 2016

Honest Performance Testing with
NDBench
Netflix Data Benchmark

Vinay Chella
Cloud Data Architect
Cassandra MVP
Cloud Database Engineering @ Netflix

• CDE, Cloud Database
Engineering
• Providing data stores as a
service
○Cassandra
○Dynomite
○Elasticsearch and RDS
Who are we?

• 98% of streaming data is stored in
Cassandra
• Data ranges from customer
details to Viewing history /
streaming bookmarks to billing
and payment
Cassandra @ Netflix

Agenda
•Background
•Why NDBench?
•Architecture
•Usage
•Achievements
•Roadmap
•Take away

Perf testing persistence layer?

Capacity in my existing fleet?

Why?
Why not already existing Perf testing tools?

What is NDBench?
Netflix Data Benchmark (NDBench) is a Pluggable
cloud-enabled benchmarking tool that can be used
across any data store system.

Different driver/ software versions

Dynamically tunable configurations

Netflix homegrown
• Well integrated with
netflix OSS infrastructure

What is Pluggable?
• Load Patterns
• Load tests

Load Patterns
• Random
• Sliding Window

Load Tests
• Cassandra-JavaDriver
• Elastic Search
• Dynomite
• Cassandra-Astyanax
• In-Memory Test

What can be configured?
• ndbench.driver.numKeys - 1000000
• ndbench.driver.dataSize - 200 bytes
• ndbench.driver.numWriters - 1
• ndbench.driver.numReaders - 1
• ndbench.driver.writeRateLimit - 100
• ndbench.driver.readRateLimit - 200
• ndbench.driver.useVariableDataSize - false
• Many more….

How to use it
• REST API
• UI

REST API
• Initialization
- Initialize: /pappy/driver/init/{DriverName}
- Init Script: /pappy/driver/initfromscript
• Perf API
- Start writes: /pappy/driver/startWrites
- Start reads: /pappy/driver/startReads
- Stop everything: /pappy/driver/stop
• Sanity check
- Verify Read: /pappy/driver/readSingle/key
- Verify Write: /pappy/driver/writeSingle/key
- Verify Read: /pappy/driver/readSingle/key
• Backfill
- Data Backfill: /pappy/driver/startDataFill
- DataBackfill Async: /pappy/driver/startDataFillAsync
• Status API
- /pappy/driver/{getRead/Write}Status
- /pappy/driver/getserverstatus

NDBench @ Netflix
• As a
- Benchmarking Tool
- Integration Tests
- Deployment Validation

NDBench’s Achievements @ Netflix

C* 1.2 → C* 2.0, C*2.0 → C* 2.1

C* 2.0 vs C* 2.1 (Reads - Thrift)

C* 2.0 vs C* 2.1 (Writes - Thrift)

LCS on CentOS vs Trusty (writes)

LCS on CentOS vs Trusty (Reads)

C* on Java 7 vs Java 8 (Writes)

C* on Java 7 vs Java 8 (Reads)

Dynomite benchmarking
• Generating Millions of
Ops/ sec

Roadmap
• Performance profile management
• Automated metrics analysis
• Dynamic load generation based on destination
schema

https://github.com/Netflix/ndbench

Take away
“Test the honesty of your data models, persistence
layers in Cloud ecosystem using NDBench”

Honest Performance Testing with "NDBench" (Vinay Chella, Netflix) | Cassandra Summit 2016

More Related Content

What's hot

Similar to Honest Performance Testing with "NDBench" (Vinay Chella, Netflix) | Cassandra Summit 2016

More from DataStax

Recently uploaded

Honest Performance Testing with "NDBench" (Vinay Chella, Netflix) | Cassandra Summit 2016

Editor's Notes