ScyllaDB
Virtual Workshop
Felipe Cardeneti Mendes, Solutions Architect, ScyllaDB
+ For data-intensive applications that require high
throughput and predictable low latencies
+ Close-to-the-metal design takes full advantage of
modern infrastructure
+ >5x higher throughput
+ >20x lower latency
+ >75% TCO savings
+ Compatible with Apache Cassandra and Amazon
DynamoDB
+ DBaaS/Cloud, Enterprise and Open Source
solutions
The database for gamechangers
“ScyllaDB stands apart...It’s the rare product
that exceeds my expectations.”
– Martin Heller, InfoWorld contributing editor and reviewer
“For 99.9% of applications, ScyllaDB delivers all the
power a customer will ever need, on workloads that other
databases can’t touch – and at a fraction of the cost of
an in-memory solution.”
– Adrian Bridgewater, Forbes senior contributor
400+ gamechangers leverage ScyllaDB
Seamless experiences
across content + devices
Digital experiences at
massive scale
Corporate fleet
management
Real-time analytics
2,000,000 SKU e-commerce
management
Video recommendation
management
Threat intelligence service
using JanusGraph
Real-time fraud detection
across 6M transactions/day
Uber scale, mission critical
chat & messaging app
Network security threat
detection
Power ~50M X1 DVRs with
billions of reqs/day
Precision healthcare via
Edison AI
Inventory hub for retail
operations
Property listings and
updates
Unified ML feature store
across the business
Cryptocurrency exchange
app
Geography-based
recommendations
Global operations: Avon,
Body Shop + more
Predictable performance for
on-sale surges
GPS-based exercise
tracking
Serving dynamic live
streams at scale
Powering India's top
social media platform
Personalized
advertising to players
Distribution of game
assets in Unreal Engine
Presenter
Felipe Cardeneti Mendes
Felipe Mendes is an IT Specialist with years of experience with
Linux and distributed systems. Felipe has extensive
experience deploying workloads across a variety of
computing architectures, including mainframes. An open
source enthusiast, he has a passion for helping
businesses achieve their most challenging goals.
At ScyllaDB, he works as a Solutions Architect.
Agenda
+ Database Pain Points
+ Where ScyllaDB fits
+ Getting Started
+ Next Steps
Modern scaling challenges
Keep CapEx
& OpEx in check
Reduce complexity
Scale as the
data grows
Queries in
milliseconds
Leverage massive
amounts of data
Predictable, consistent
performance
Poll: What are your
existing
challenges?
What's needed?
Store and retrieve massive
amounts of data arriving at
high velocity
Consistent user experiences
for competitive advantages
Always available,
always on
Freedom to focus on
creating applications
Increase productivity
while decreasing cost
Anywhere, anytime
Data-intensive apps must be
distributed
Human data
“The size of the digital universe will double every two years.” — IDC
10X
Faster growth
than traditional
business data
4.4ZB | 44.4ZB
50X
.09ZB | 4.4ZB
Business data
Sensor data
Server sprawl
Increased administrative complexity and increased costs are a symptom when
databases do not take advantage of modern hardware.
Node
Sprawl
Tail latency problem
Refresh
User App Business Logic Database
API Calls
DB Calls
Slowest 1% dominates latency
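A quick calculation shows why the slowest 1% matters so much. This is a rough sketch that assumes each page refresh fans out into several independent database calls. The independence assumption and the call counts are illustrative, not figures from the workshop. Under that assumption, the chance that at least one call lands in the slow tail is 1 - 0.99^n:

```python
def chance_of_slow_call(calls_per_refresh: int, slow_fraction: float = 0.01) -> float:
    """Probability that at least one call in a refresh falls in the slow tail.

    Assumes the calls are independent, which is a simplification.
    """
    return 1.0 - (1.0 - slow_fraction) ** calls_per_refresh


# Illustrative fan-out values; real applications will differ.
for n in (1, 10, 50, 100):
    print(f"{n:>3} DB calls per refresh -> {chance_of_slow_call(n):.1%} of refreshes hit the tail")
```

With 100 calls per refresh, roughly 63% of refreshes would include at least one tail-latency call, so the p99 of the database becomes the typical experience of the user.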
What most people do
Refresh
User App Business Logic
Database
API Calls
DB Calls
Problem solved?
Cache
Cache Calls
What’s your biggest database headache?
High latency
Low throughput
Need 3 PhDs to manage
Too expensive
Database consolidation
Hard to scale
Survey of 1,850 attendees at AWS re:Invent
Where ScyllaDB fits
Database evolution
1970s
Mainframes: inception of the relational model
1980s
SQL, relational databases become de facto standard
1990s
LAN age: replication, external caching, ORMs
2000s
Web 2.0: NoSQL databases for scale
2010s
Cloud age: commoditization of NoSQL, NewSQL inception
Years marked on the timeline: 1978, 1995, 1996, 2008, 2015
Cloud evolution: Last ~15 years
SSD: $2500/TB
2008 2012
Typical instance 4 cores
SSD $100/TB - 1000x faster, 10x cheaper
96 core VMs - 20x more cores
100Gbps NICs - 100x more throughput
2015 2022
2000 CPU core systems and
beyond
Previous databases don’t suffice
SQL Database:
+ Structured data
+ Doesn’t scale
Document Store:
+ No structure
+ Limited scale
Key-Value Store:
+ Simple
+ Fast
+ No persistence
Column Store:
+ Some structure
+ Best HA/DR
What if we had it all in one…
Lower Node Count
Efficient hardware usage results in direct savings
Predictable, Low Latencies
Consistent single-digit millisecond p99 latencies
Less Complexity
Self-optimizing, smaller footprint, easy to use
Developer-friendly APIs
Ease of migration and fewer development cycles
ScyllaDB is the database for data-intensive apps
that require high performance and low latency
The database for gamechangers
Lower Your Cost of Ownership
Requires far fewer infrastructure resources
Operate in Real-Time All the Time
ScyllaDB consistently delivers low latencies
Eliminate Complexity from Your Systems
Self-optimizing & self-tuning reduces administration
Scale to Match Your Growth
ScyllaDB scales out and up to match any workload
Our technology
Horizontal & Vertical Scaling
Unique Close-to-Metal Architecture
Built in C++
(no Java overhead)
Everything Asynchronous
Shared Nothing
Shard per Core
Autonomous
Network
Processor NUMA
Storage
Ecosystem compatibility
+ Cassandra (CQL) protocol
+ DynamoDB HTTP protocol
+ Shard-aware drivers
+ Active-Active topology
+ Geo-replication built-in
+ Always-on deployments
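As a hedged illustration of what the DynamoDB HTTP protocol looks like on the wire, the sketch below builds (but does not send) a `GetItem` request using only the Python standard library. The endpoint address, port 8000, the table name `games`, and the key are assumptions made for the example. Real clients normally go through an SDK, which also signs requests; signing is omitted here.

```python
import json
from urllib.request import Request

# Assumption: a ScyllaDB node exposing its DynamoDB-compatible API on this address.
ENDPOINT = "http://localhost:8000/"


def build_get_item_request(table: str, key: dict) -> Request:
    """Build a DynamoDB-style GetItem request: a JSON body sent via HTTP POST."""
    body = json.dumps({"TableName": table, "Key": key}).encode("utf-8")
    return Request(
        ENDPOINT,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/x-amz-json-1.0",
            # The operation is selected by this header, not by the URL path.
            "X-Amz-Target": "DynamoDB_20120810.GetItem",
        },
    )


# Hypothetical table and key, for illustration only.
request = build_get_item_request("games", {"GameId": {"S": "42"}})
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```

Because the protocol is plain JSON over HTTP, an existing DynamoDB client can usually be redirected by changing only its endpoint URL.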
What a difference a database makes
ScyllaDB vs. DynamoDB
1/5th the cost
20x better throughput
ScyllaDB vs. Google Bigtable
1/5th the cost
26x better throughput
ScyllaDB vs. Cassandra
5x better throughput
2-20x better latency
Poll: Which databases
are you currently
working with?
Getting started
Deployment options
On-Prem
Cloud Hosted
ScyllaDB Cloud
Best High Availability in the industry
Best Disaster Recovery in the industry
Best scalability in the industry
Best Performance in the industry
Auto-tuned, out-of-the-box performance
Fully compatible with Cassandra & DynamoDB
Wide range of options to get started with ScyllaDB
ScyllaDB Monitoring Stack
▪ Based on Prometheus and Grafana
▪ CQL and DynamoDB dashboards
▪ Query Optimization panels
▪ Alerting integration via Alert Manager
ScyllaDB Manager
▪ Cluster management made easy
▪ Repairs and Backups without a hassle
▪ Restore with a single command
▪ Integrated with ScyllaDB Monitoring
[Diagram: sctool CLI -> (REST API) -> ScyllaDB Manager -> (Scylla REST API) -> ScyllaDB cluster]
Workshop Demo
Part 1: Real-time Streaming
CDC Streams
Token ring
CDC Go Driver
The CDC Go driver handles round-robin traversal.
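To make the idea of round-robin traversal concrete, here is a minimal in-memory sketch. It is not the driver's code: the stream IDs, the `deque` stand-ins for streams, and the `traverse_round_robin` helper are all hypothetical, and a real consumer would read changes from the cluster rather than from local queues.

```python
from collections import deque
from typing import Deque, Dict, List, Tuple

# Hypothetical stand-in for CDC streams: stream ID -> pending change records.
streams: Dict[str, Deque[str]] = {
    "stream-a": deque(["a1", "a2"]),
    "stream-b": deque(["b1"]),
    "stream-c": deque(["c1", "c2", "c3"]),
}


def traverse_round_robin(
    pending_by_stream: Dict[str, Deque[str]], batch_size: int = 1
) -> List[Tuple[str, str]]:
    """Visit every stream in turn, taking up to batch_size changes per visit,
    and repeat until all streams are drained."""
    consumed: List[Tuple[str, str]] = []
    while any(pending_by_stream.values()):
        for stream_id, pending in pending_by_stream.items():
            for _ in range(min(batch_size, len(pending))):
                consumed.append((stream_id, pending.popleft()))
    return consumed


order = traverse_round_robin(streams)
print(order)
```

Visiting streams in turn keeps any single busy stream from starving the others, which is the property a consumer wants when changes arrive unevenly.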
Part 2: Let's play a game!
Token ring
+ Tic-tac-toe sample using boto3 (the AWS SDK for Python)
+ Code is licensed under Apache-2.0 and was initially written by Amazon
+ Works out of the box with ScyllaDB's DynamoDB-compatible API
+ Online web multiplayer game
Book a session with us!
+ If you are interested in evaluating your current workloads to learn how you can
save more, you can sign up for a Technical Evaluation session with me.
Link: https://www.scylladb.com/product/technical-consultation/
+ Or email me directly if you have any questions.
+ felipemendes@scylladb.com
Summary
+ Start deploying ScyllaDB today; it is simple and effortless
+ Gather your applications and business requirements
+ Select the right infrastructure
+ Think about scalability, resiliency and high availability
We will send you these links via email along with the video for reference.
We value your feedback!
Resources
Learn More on ScyllaDB University
Join Slack and Talk to Our Community and Developers
Learn How Other People and Companies are Using ScyllaDB
NoSQL Concepts Whitepaper
Apache Cassandra Query Language (CQL)
Tips on How to Design an Application to Use ScyllaDB Efficiently
Maximizing Performance While Minimizing Timeouts
Next Steps
ScyllaDB Cloud
Start free trial
scylladb.com/cloud
ScyllaDB University
Free online learning
scylladb.com/university
ScyllaDB
Resources
Visit our resource center
resources.scylladb.com
Thank you
for joining us today.
@scylladb scylladb/
slack.scylladb.com
@scylladb company/scylladb/
scylladb/
