The document discusses distributed database systems and properties of the Riak database. It defines distributed systems and discusses key aspects like availability, fault tolerance, and latency. It explains Riak's masterless architecture and how it provides high availability and scalability through horizontal scaling on commodity servers. The document also covers consistency models and how Riak allows tuning availability and consistency based on use cases.
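The tunable consistency mentioned above can be sketched with Riak-style quorum arithmetic. This is an illustrative sketch, not Riak's API: the function name and values are assumptions, but the underlying rule (R + W > N guarantees read/write quorum overlap) is the standard one Riak's tunable parameters are built on.

```python
# Hypothetical sketch of Riak-style tunable quorums (names are illustrative).
# For N replicas, a read quorum R and write quorum W with R + W > N
# guarantee that every read overlaps the most recent successful write.

def quorums_overlap(n: int, r: int, w: int) -> bool:
    """Return True if read and write quorums are guaranteed to intersect."""
    if not (1 <= r <= n and 1 <= w <= n):
        raise ValueError("quorum sizes must be between 1 and N")
    return r + w > n

# Tuning trades consistency against availability and latency.
print(quorums_overlap(n=3, r=2, w=2))  # True  -> reads see the latest write
print(quorums_overlap(n=3, r=1, w=1))  # False -> fast, but may read stale data
```

Lowering R and W keeps the cluster responsive when nodes fail, at the cost of possibly stale reads; raising them does the opposite, which is the per-use-case tuning the document describes.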
Building A Diverse Geo-Architecture For Cloud Native Applications In One Day (VMware Tanzu)
Presenter: Ben Laplanche, Product Manager, Pivotal Cloud Foundry
Companies turn to PaaS and Cloud Native Applications to gain agility and speed. To provide customer value, a fault tolerant infrastructure is essential. But what happens if an entire data center, region, or even country should go offline? Cassandra holds the key to keeping application state in sync through replication, whilst Pivotal Cloud Foundry provides easy deployment to multiple IaaS providers. It also comes complete with a managed service offering for DataStax Enterprise. This talk will discuss how this setup can be deployed in one day, including demonstrations and a walkthrough of the key concepts, approaches, and considerations.
Join the product and cloud computing leaders of Netflix to discuss why and how the company moved to Amazon Web Services. From early experiments for media transcoding, to building the operational skills to optimize costs and the creation of the Simian Army, this session guides business leaders through real world examples of evaluating and adopting cloud computing.
NetflixOSS Meetup S6E1 - Titus & Containers (aspyker)
Come hear about our container management platform, Titus. Titus launches over 2 million containers per week for service and batch workloads. Come learn which applications are powered by Titus and what value developers get from containers. We will also cover some of Titus's unique approaches to reliability, control plane, scheduling, and container runtime technologies, as well as our integrations with Netflix systems such as Spinnaker and with Amazon concepts such as VPC and IAM.
https://www.meetup.com/Netflix-Open-Source-Platform/events/247776324/
QConSF18 - Disenchantment: Netflix Titus, its Feisty Team, and Daemons (aspyker)
Disenchantment is a Netflix show following the medieval misadventures of a hard-drinking princess, her feisty elf, and her personal demon. In this talk, we will follow the story of Netflix's container management platform, Titus, which powers critical aspects of the Netflix business (video encoding & streaming, big data, recommendations & machine learning, and other workloads). We'll cover the challenges of growing Titus from 10s to 1000s of workloads, our feisty team's work across container runtimes, scheduling & the control plane, and cloud infrastructure integration, and the demons we've found along the way in operability, security, reliability, and performance.
Engineering Leader opportunity @ Netflix - Playback Data Systems (Philip Fisher-Ogden)
Across the globe, 75M Netflix members love watching 125M hours per day of TV shows and movies. They love the ease of starting on one device and resuming on another, and the Playback Data Systems team makes that happen. We’re looking for a senior engineering manager to lead this high-impact team at Netflix.
Attributions for images:
https://www.flickr.com/photos/theholyllama/5738164504/ and https://www.flickr.com/photos/brewbooks/7780990192/, no changes made, https://creativecommons.org/licenses/by-sa/2.0/
https://www.flickr.com/photos/crschmidt/2956721498/, no changes made, https://creativecommons.org/licenses/by/2.0/
The Last Pickle: Distributed Tracing from Application to Database (DataStax Academy)
Monitoring provides information on overall system performance; however, tracing is necessary to understand the performance of individual requests. Detailed query tracing has been provided by Cassandra since version 1.2 and is invaluable when diagnosing problems, although knowing which queries to trace, and why the application makes them, still requires deep technical knowledge. By merging application tracing via Zipkin with Cassandra query tracing, we automate the process and make it easier to identify and resolve problems. In this talk Mick Semb Wever, Team Member at The Last Pickle, will introduce Cassandra query tracing and Zipkin. He will then propose an extension that allows clients to pass a trace identifier through to Cassandra, and a way to integrate Zipkin tracing into Cassandra. Driving all this is the desire to create one tracing view across the entire system.
In League of Legends, just as in any competitive team game, communication is essential to success. Therefore, when building Chat for the game we had to make sure that the new service would be absolutely rock solid in every respect. This includes not only guaranteed message delivery and consistent presence propagation across the system, but also maintenance of the created social network graph.
In this talk I would like to present how we achieved linear scalability for Chat, improved its overall fault tolerance, and got ready for the new features we wanted to ship. I will also discuss in detail why we migrated our data from MySQL to Riak and how we used CRDTs to deal with conflicting object updates.
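The CRDT approach mentioned above can be illustrated with a minimal grow-only counter. This is a generic sketch of the merge idea, not Riot's or Riak's actual implementation: concurrent updates are reconciled with a per-node element-wise maximum, so replicas converge without losing increments.

```python
# Minimal, illustrative G-Counter (grow-only counter) CRDT sketch.
# Each replica tracks increments per node; merging takes the per-node max,
# which is commutative, associative, and idempotent, so conflicting
# concurrent updates always converge to the same value.

class GCounter:
    def __init__(self):
        self.counts = {}  # node_id -> increments observed at that node

    def increment(self, node_id, by=1):
        self.counts[node_id] = self.counts.get(node_id, 0) + by

    def value(self):
        return sum(self.counts.values())

    def merge(self, other):
        """Element-wise max over both replicas' per-node counts."""
        merged = GCounter()
        for node in set(self.counts) | set(other.counts):
            merged.counts[node] = max(self.counts.get(node, 0),
                                      other.counts.get(node, 0))
        return merged

# Two replicas take conflicting updates, then converge on merge.
a, b = GCounter(), GCounter()
a.increment("node-a"); a.increment("node-a")
b.increment("node-b")
assert a.merge(b).value() == b.merge(a).value() == 3
```

Riak ships richer CRDTs (sets, maps, flags) built on the same convergence property, which is what makes them a good fit for conflicting object updates in a masterless store.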
Beaming Flink to the cloud @ Netflix FF 2016 (Monal Daxini)
Netflix is a data-driven company and we process over 700 billion streaming events per day with at-least-once processing semantics in the cloud. To make it easy to extract intelligence from this unbounded stream, we are building Stream Processing as a Service (SPaaS) infrastructure so that users can focus on extracting value rather than on boilerplate infrastructure and scale.
We will share our experience in building a scalable SPaaS using Flink, Apache Beam and Kafka as the foundation layer to process over 1.3 PB of event data without service disruption.
Netflix viewing data architecture evolution - EBJUG Nov 2014 (Philip Fisher-Ogden)
Netflix's architecture for viewing data has evolved as streaming usage has grown. Each generation was designed for the next order of magnitude, and was informed by learnings from the previous. From SQL to NoSQL, from data center to cloud, from proprietary to open source, look inside to learn how this system has evolved. (slides from a talk given at the East Bay Java Users Group MeetUp in Nov 2014)
In this episode, we will focus on continuous delivery and how Netflix uses Spinnaker and Kayenta to safely deliver changes to the cloud and beyond. Kayenta is a platform for Automated Canary Analysis (ACA). It is used by Spinnaker to enable automated canary deployments. We will also discuss how Spinnaker is used at Netflix to deploy targets beyond cloud VMs and containers --- batch jobs, CDNs, fast properties and Open Connect appliances.
Healthcare data comes in many shapes and sizes, making ingestion difficult for a variety of batch and near-real-time use cases. By evolving its architecture to adopt Apache Kafka, Cerner was able to build a modular architecture for current and future use cases. By reviewing the evolution of Cerner's usage, developers can avoid mistakes and set themselves up for success.
Keystone processes over 1 trillion events per day with at-least-once processing semantics in the cloud. We will explore in detail how we have modified and leveraged Kafka, Samza, Docker, and Linux at scale to implement a multi-tenant pipeline in the Amazon AWS cloud within a year.
Most Cassandra usages take advantage of its exceptional performance and ability to handle massive data sets. At PagerDuty, we use Cassandra for entirely different reasons: to reliably manage mutable application state and to maintain durability requirements even in the face of full data center outages. We achieve this by deploying Cassandra clusters with hosts in multiple WAN-separated data centers, configured with per-data-center replica placement requirements, and with significant application-level support to use Cassandra as a consistent datastore. Over several years of experience with this approach, we've learned how to accommodate the impact of WAN network latency on Cassandra queries, how to horizontally scale while maintaining our placement invariants, why asymmetric load is experienced by nodes in different data centers, and more. This talk will go over our workload and design goals, detail the resultant Cassandra system design, and explain a number of our unintuitive operational learnings about this novel Cassandra usage paradigm.
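The per-data-center replica placement described above can be sketched with simple quorum arithmetic. The function and replica counts below are illustrative assumptions, not PagerDuty's actual configuration: they only show why the placement of replicas across data centers determines whether a global quorum survives the loss of a whole data center.

```python
# Illustrative sketch: with Cassandra's NetworkTopologyStrategy placing a
# fixed number of replicas in each data center, a QUORUM of all replicas
# may or may not remain reachable after losing an entire data center.

def quorum_size(total_replicas: int) -> int:
    """Cassandra QUORUM: a strict majority of all replicas."""
    return total_replicas // 2 + 1

def survives_dc_loss(replicas_per_dc: dict) -> bool:
    """True if QUORUM is still reachable after losing the largest DC."""
    total = sum(replicas_per_dc.values())
    worst_loss = max(replicas_per_dc.values())
    return total - worst_loss >= quorum_size(total)

# Three DCs with 2 replicas each: total=6, quorum=4, losing a DC leaves 4. OK.
assert survives_dc_loss({"dc1": 2, "dc2": 2, "dc3": 2}) is True
# Two DCs with 3 replicas each: total=6, quorum=4, losing a DC leaves 3. Not OK.
assert survives_dc_loss({"dc1": 3, "dc2": 3}) is False
```

This is why spreading replicas across three or more data centers, rather than two, is the usual pattern when quorum reads and writes must continue through a full data-center outage.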
Streaming in Practice - Putting Apache Kafka in Production (confluent)
This presentation focuses on how to integrate Apache Kafka and its surrounding components into an enterprise environment, and what you need to consider as you move into production.
We will touch on the following topics:
- Patterns for integrating with existing data systems and applications
- Metadata management at enterprise scale
- Tradeoffs in performance, cost, availability and fault tolerance
- Choosing which cross-datacenter replication patterns fit with your application
- Considerations for operating Kafka-based data pipelines in production
Introduction To Streaming Data and Stream Processing with Apache Kafka (confluent)
Modern businesses have data at their core, and this data is changing continuously. How can we harness this torrent of continuously changing data in real time? The answer is stream processing, and one system that has become a core hub for streaming data is Apache Kafka.
This presentation will give a brief introduction to Apache Kafka and describe its usage as a platform for streaming data. It will explain how Kafka serves as a foundation for both streaming data pipelines and applications that consume and process real-time data streams. It will introduce some of the newer components of Kafka that help make this possible, including Kafka Connect, a framework for capturing continuous data streams, and Kafka Streams, a lightweight stream processing library.
This is talk 1 out of 6 from the Kafka Talk Series.
http://www.confluent.io/apache-kafka-talk-series/introduction-to-stream-processing-with-apache-kafka
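The stream-into-table idea behind Kafka Streams can be sketched in a few lines of plain Python. Kafka Streams itself is a Java library; `word_count` here is an illustrative stand-in showing how a stateful operator consumes an unbounded stream of records and maintains a continuously updated table of counts.

```python
# Pure-Python sketch of the canonical stream-processing word count:
# each incoming record updates per-word state, and every update is
# emitted downstream, like a changelog of the evolving counts table.

from collections import Counter

def word_count(stream):
    """Consume (key, line) records; yield the updated count per word."""
    counts = Counter()
    for _key, line in stream:
        for word in line.lower().split():
            counts[word] += 1
            yield word, counts[word]  # emit one changelog record per update

records = [(None, "hello world"), (None, "hello streams")]
updates = list(word_count(records))
assert updates == [("hello", 1), ("world", 1), ("hello", 2), ("streams", 1)]
```

In Kafka Streams the same shape appears as a KStream aggregated into a KTable, with the state and its changelog durably backed by Kafka topics rather than held in a local dict.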
A talk given on 2018-06-16 in HK Open Source Conference 2018.
The rise of Apache Kafka has started a new generation of data pipeline: the stream-processing pipeline.
In this talk, Dr. Mole Wong will walk you through the concept of the stream-processing data pipeline, and how this data pipeline can be set up. He will also discuss the use cases of such a data pipeline.
Using Kubernetes to deliver a “serverless” service (DoKC)
Serverless promises to change the way we consume software. It allows us to pay only for what we use, and it can help drive operational costs down to the minimum resources necessary.
Architecting for serverless requires a fresh look at application logic and the way it is deployed, combining the logical and physical worlds. An architectural pattern has emerged where ephemeral compute can scale separately from the services that need to persist.
We use Kubernetes to deliver exactly this. A “serverless” experience that is driven and enabled by compute pods and storage pods. We also have used our experience running thousands of database clusters on Kubernetes to automate the operational expertise of managing a distributed database.
In this talk, we will take a deep dive into the architecture of our application and share:
* A definition and outline of the challenges of serverless
* How we reworked our logic for a serverless approach
* How we use Kubernetes to gain serverless autoscaling
This talk was given by Jim Walker for DoK Day Europe @ KubeCon 2022.
Neutron Done the SDN Way
Dragonflow is an open source distributed control plane implementation of Neutron, which is an integral part of OpenStack. Dragonflow introduces innovative solutions and features to implement networking and distributed network services in a manner that is both lightweight and simple to extend, yet targeted at performance-intensive and latency-sensitive applications. Dragonflow aims to solve Neutron's performance and scalability challenges.
Patterns and Pains of Migrating Legacy Applications to Kubernetes (QAware GmbH)
Open Source Summit 2018, Vancouver (Canada): Talk by Josef Adersberger (@adersberger, CTO at QAware), Michael Frank (Software Architect at QAware) and Robert Bichler (IT Project Manager at Allianz Germany)
Abstract:
Running applications on Kubernetes can provide a lot of benefits: more dev speed, lower ops costs, and higher elasticity & resiliency in production. Kubernetes is the place to be for cloud-native apps. But what if you have no shiny new cloud-native apps, just a whole bunch of JEE legacy systems? No chance to leverage the advantages of Kubernetes? Yes you can!
We’re facing the challenge of migrating hundreds of JEE legacy applications of a German blue chip company onto a Kubernetes cluster within one year.
The talk will be about the lessons we've learned - the best practices and pitfalls we've discovered along our way.
Big Data LDN 2018: CHARTING ZERO LATENCY FUTURE WITH REDIS (Matt Stubbs)
Date: 13th November 2018
Location: Fast Data Theatre
Time: 10:30 - 11:00
Speaker: Manish Gupta
Organisation: Redis Labs
About: We live in a world of instant expectations, and the technologies underpinning our applications must meet this new demand. This session reviews the megatrends influencing modern application architecture. The talk will discuss the use of elements within Redis, the fastest in-memory multi-model database, to support use cases like recommendation engines and personalization that rely on a combination of high-speed analytics and transactions occurring at the same time. Data structures for simultaneous transactions and analytics, probabilistic counting mechanisms, and adaptable machine learning models will be explored. Real-world examples and customer implementations will also be shared. Cost-effectiveness and efficiency of database operations will also be covered in the context of practical enterprise requirements to compete in today's big data world.
Supporting Hadoop in containers takes much more than the very primitive support Docker provides via its storage plugin. A production-scale Hadoop deployment inside containers needs to honor affinity/anti-affinity, fault-domain, and data-locality policies. Kubernetes alone, with primitives such as StatefulSets and PersistentVolumeClaims, is not sufficient to support a complex, data-heavy application such as Hadoop. One needs to think about this problem more holistically across the container, networking, and storage stacks. Also, constructs around deployment, scaling, upgrades, etc. in traditional orchestration platforms are designed for applications that have adopted a microservices philosophy, which doesn't fit most big data applications across the ingest, store, process, serve, and visualization stages of the pipeline. Come to this technical session to learn how to run and manage the lifecycle of containerized Hadoop and other applications in the data analytics pipeline efficiently and effectively, far beyond simple container orchestration. #BigData, #NoSQL, #Hortonworks, #Cloudera, #Kafka, #Tensorflow, #Cassandra, #MongoDB, #Kudu, #Hive, #HBase. PARTHA SEETALA, CTO, Robin Systems.
AWS re:Invent 2016: Moving Mission Critical Apps from One Region to Multi-Reg... (Amazon Web Services)
In gaming, low latency and connectivity are the bare minimum expectations users have while playing online on PlayStation Network. Alex and Dustin share key architectural patterns for providing low-latency, multi-region services to global users. They discuss their testing methodologies and how to programmatically map out the dependencies of a large multi-region deployment with data-driven techniques. The patterns shared show how to adapt to changing bottlenecks and sudden spikes of several million requests. You'll walk away with several key architectural patterns that can serve users at global scale while being mindful of costs.
The Crown Jewels: Is Enterprise Data Ready for the Cloud? (Inside Analysis)
The Briefing Room with Dr. Robin Bloor and NuoDB
Live Webcast on March 25, 2014
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=ac6cb15c0aaaa6d044784969e4187696
Enterprise organizations are already deeply embedded in the cloud, whether it’s via Salesforce.com for customer relationship management or Marketo for marketing and lead generation. But frequently the most significant impediment to moving the crown jewels of corporate data to the cloud is the database. A cloud database must be secure, flexible enough to solve a variety of problems, easy to automate and administer, and able to run in multiple cloud data centers simultaneously. Plus, it should be consistently resilient in the face of failure, not to mention cost-effective, just like the cloud itself.
Register for this episode of The Briefing Room to hear veteran Analyst Dr. Robin Bloor as he explains how cloud deployments are the inevitable next step for information management. He will be joined by Jim Starkey, co-founder of NuoDB, who will discuss the common reasons enterprises shy away from leveraging a database in the cloud, as well as how next generation DBMS, purpose-built for the cloud, can create strategic organizational advantage.
Visit InsideAnalysis.com for more information.
In this slide deck, we explore today's database landscape and the common Lego blocks used to build these different flavours of databases. We dive into the internals of a database, explore some design choices, and toward the end examine some real-world database architectures in light of the concepts (Legos) explored earlier.
DataStax C*ollege Credit: What and Why NoSQL? (DataStax)
In the first of our bi-weekly C*ollege Credit series, Aaron Morton, DataStax MVP for Apache Cassandra and Apache Cassandra committer, and Robin Schumacher, VP of product management at DataStax, will take a look back at the history of NoSQL databases and provide a foundation of knowledge for people looking to get started with NoSQL, or just wanting to learn more about this growing trend. You will learn how to know whether NoSQL is right for your application, and how to pick a NoSQL database. This webinar is C* 101 level.
Webinar Slides: Geo-Distributed MySQL Clustering Done Right! (Continuent)
With Multiple Active Primary MySQL Databases
Watch this on-demand webinar to learn the right way to deploy geo-distributed databases. We look at the pitfalls of deploying a single site and passive sites, and from there we show how to provide the best user experience by leveraging geo-distributed MySQL.
When considering geo-distributed MySQL database environments it is important to understand the nuances of having multiple active clusters deployed across sites and clouds. This webinar walks through the proper planning of geo-distributed MySQL for success.
Finally, you’ll learn about our best practices for multiple primary clusters, as well as failover and disaster recovery for MySQL.
AGENDA
- Why Geo-Distributed Databases
- Geo-Distributed MySQL Starts With High Performance Local Clusters
- Extend The Cluster To Multiple Datacenters/Clouds
- Best Practices For Multiple Primary Clusters
- Failover & Disaster Recovery
- Key Benefits
PRESENTER
Matthew Lang, Customer Success Director – Americas, Continuent, has over 25 years of experience in database administration, database programming, and system architecture, including the creation of a database replication product that is still in use today. He has designed highly available, scalable systems that have allowed startups to quickly become enterprise organizations, utilizing a variety of technologies including open source projects, virtualization, and cloud.
Using Software-Defined WAN implementation to turn on advanced connectivity se... (RedHatTelco)
A presentation given by Red Hat and Juniper at OpenStack Summit Boston 2017 on May 7, 2017
How OpenStack enable SDN and NFV to easily work together
https://www.openstack.org/videos/boston-2017/using-software-defined-wan-implementation-to-turn-on-advanced-connectivity-services-in-openstack
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera ClusterContinuent
Galera Cluster vs. Continuent Tungsten Clusters
Building a Geo-Scale, Multi-Region and Highly Available MySQL Cloud Back-End
This second installment of our High Noon series of on-demand webinars is focused on Galera Cluster (including MariaDB Cluster & Percona XtraDB Cluster). It looks at some of the key characteristics of Galera Cluster and how it fares as a MySQL HA / DR / Geo-Scale solution, especially when compared to Continuent Tungsten Clustering.
Watch this webinar to learn how to do better MySQL HA / DR / Geo-Scale.
AGENDA
- Goals for the High Noon Webinar Series
- High Noon Series: Tungsten Clustering vs Others
- Galera Cluster (aka MariaDB Cluster & Percona XtraDB Cluster)
- Key Characteristics
- Certification-based Replication
- Galera Multi-Site Requirements
- Limitations Using Galera Cluster
- How to do better MySQL HA / DR / Geo-Scale?
- Galera Cluster vs Tungsten Clustering
- About Continuent & Its Solutions
PRESENTER
Matthew Lang - Customer Success Director – Americas, Continuent - has over 25 years of experience in database administration, database programming, and system architecture, including the creation of a database replication product that is still in use today. He has designed highly available, scalable systems that have allowed startups to quickly become enterprise organizations, utilizing a variety of technologies including open source projects, virtualization, and cloud.
This presentation, given by Dave Rosenthal at NoSQL Now! 2013, presents the case for why he believes NoSQL databases will need to support ACID transactions in order for developers to more easily build, deploy, and scale applications in the future.
Using Kubernetes to deliver a “serverless” serviceDoKC
Link: https://youtu.be/C4rlepOPk5o
https://go.dok.community/slack
https://dok.community/
From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE)
Serverless promises to change the way we consume software. It allows us to potentially pay for only that which we use and can help drive down operational costs to the minimal amount of resources necessary.
Architecting for serverless requires a unique look at app logic and the way it is deployed. It takes a combination of the logical and physical worlds. An architectural pattern has emerged where we can scale ephemeral compute separate from services that need to persist.
We use Kubernetes to deliver exactly this. A “serverless” experience that is driven and enabled by compute pods and storage pods. We also have used our experience running thousands of database clusters on Kubernetes to automate the operational expertise of managing a distributed database.
In this talk, we will take a dive deep into the architecture of our application and share:
* A definition and outline of the challenges of serverless
* How we reworked our logic for a serverless approach
* How we use Kubernetes to gain serverless autoscaling
-----
Jim is a recovering developer turned evangelist who loves useful, cool, cutting-edge tech. He loves to translate and distill complex concepts into compelling, more simple explanations that broader communities can consume. He is an advocate of the developer and an active participant in several open source communities.
ScyllaDB Open Source 5.0 is the latest evolution of our monstrously fast and scalable NoSQL database – powering instantaneous experiences with massive distributed datasets.
Join us to learn about ScyllaDB Open Source 5.0, which represents the first milestone in ScyllaDB V. ScyllaDB 5.0 introduces a host of functional, performance and stability improvements that resolve longstanding challenges of legacy NoSQL databases.
We’ll cover:
- New capabilities including a new IO model and scheduler, Raft-based schema updates, automated tombstone garbage collection, optimized reverse queries, and support for the latest AWS EC2 instances
- How ScyllaDB 5.0 fits into the evolution of ScyllaDB – and what to expect next
- The first look at benchmarks that quantify the impact of ScyllaDB 5.0's numerous optimizations
This will be an interactive session with ample time for Q & A – bring us your questions and feedback!
Time series data is proliferating with literally every step we take – just think of things like Fitbit bracelets that track your every move, and financial trading data, all of which is timestamped.
Time series data requires high performance reads and writes even with a huge number of data sources. Both speed and scale are integral to success, which makes for a unique challenge for your database.
A time series NoSQL data model requires flexibility to support unstructured, and semi-structured data as well as the ability to write range queries to analyze your time series data. So how can you tackle speed, scale and flexibility all at once?
Join Professional Services Architect Drew Kerrigan and Developer Advocate Matt Brender for a discussion of:
Examples of time series data sets, from IoT to Finance to jet engines
What makes time series queries different from other database queries
How to model your dataset to answer the right questions about your data
How to store, query and analyze a set of time series data points
Learn how a NoSQL database model and Riak TS can help you address the unique challenges of time series data.
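The time-window modeling idea above can be sketched in plain Python. This is a hypothetical key scheme for a generic key/value store, not Riak TS's actual API: samples are grouped into fixed windows so that a range query maps to a predictable list of keys.

```python
def ts_key(source_id, timestamp, bucket_seconds=3600):
    """Compose a key for a time series sample.

    Grouping samples into fixed time windows (here: hourly buckets)
    spreads writes across keys while still letting a range query be
    answered by fetching a small, predictable list of keys.
    """
    window = int(timestamp) - (int(timestamp) % bucket_seconds)
    return f"{source_id}:{window}"

def keys_for_range(source_id, start, end, bucket_seconds=3600):
    """List every window key that overlaps [start, end]."""
    first = int(start) - (int(start) % bucket_seconds)
    return [f"{source_id}:{w}" for w in range(first, int(end) + 1, bucket_seconds)]
```

With this layout, answering "what did sensor-1 report between 01:00 and 03:00?" becomes three key fetches rather than a scan.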
The Boston Riak meetup had Sean Kelly from Tapjoy digging into the message queue infrastructure at the company. They process billions of requests a day, and queuing is an important element of that scale.
To kick us off, we discussed the basics of message queues, distributed systems and why dual writes are evil. Here is that talk with a few links to get you started.
This is a presentation by Peter Coppola, VP of Product and Marketing at Basho Technologies and Matthew Aslett, Research Director at 451 Research. Join them as they discuss whether multi-model databases and polyglot persistence have increased operational complexity. They'll discuss the benefits and importance of NoSQL databases and how the Basho Data Platform helps enterprises leverage Big Data applications.
Here's a walkthrough of the set CRDT within Riak and a bucket strategy that makes Riak the best choice. You'll see that conflict is inevitable. The set bucket type lets developers rely on eventual consistency adding up to the data set they expect.
For more on sets and CRDTs see:
http://basho.com/distributed-data-types-riak-2-0/
http://basho.com/data-modeling-with-riak/
http://docs.basho.com/riak/latest/dev/using/data-types/
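To make the "conflict is inevitable, merges converge" idea concrete, here is a minimal sketch of the simplest set CRDT, a grow-only set (G-Set). Riak's actual set data type is the richer OR-set, which also supports removes; this sketch only shows why concurrent adds never conflict.

```python
class GSet:
    """Grow-only set CRDT: replicas accept adds independently and
    converge on merge, because merge is just set union."""

    def __init__(self):
        self.items = set()

    def add(self, item):
        self.items.add(item)

    def merge(self, other):
        # Union is commutative, associative, and idempotent, so
        # replicas converge regardless of merge order or repetition.
        merged = GSet()
        merged.items = self.items | other.items
        return merged
```

Two replicas that accepted different writes while partitioned produce the same set whichever side merges first.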
Here's an example of how to code against Riak using cURL and Ruby to do a basic PUT, GET, and more. We then index the data using the Apache Solr integration.
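The same basic PUT/GET can be sketched in Python against Riak's HTTP interface. The host and port are assumptions (8098 is Riak's default HTTP port); adjust for your cluster, and note this requires a running node to actually execute.

```python
from urllib import request

def riak_url(bucket, key, host="localhost", port=8098):
    """Riak exposes objects at /buckets/<bucket>/keys/<key>."""
    return f"http://{host}:{port}/buckets/{bucket}/keys/{key}"

def put(bucket, key, value, content_type="text/plain"):
    """Store a value under bucket/key via HTTP PUT."""
    req = request.Request(riak_url(bucket, key), data=value.encode(),
                          method="PUT",
                          headers={"Content-Type": content_type})
    return request.urlopen(req)

def get(bucket, key):
    """Fetch the stored value back via HTTP GET."""
    return request.urlopen(riak_url(bucket, key)).read()
```

This mirrors the cURL form of the same calls, e.g. `curl -X PUT .../buckets/test/keys/one -d "value"`.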
No matter what platform we’re discussing, we’re beyond the view of rows and columns. Data is more diverse than ever. More difficult to parse. Here is some of that story.
This is a presentation given by Matt Brender (@mjbrender) at Big Data TechCon 2015.
In this class, we will discuss why companies choose Riak over a relational database with a specific focus on availability, scalability, and the key/value data model. We then analyze the decision points that should be considered when choosing a non-relational solution and review data modeling, querying, and consistency guarantees. Finally, we end with simple patterns for building common applications in Riak using its key/value design, dealing with data conflicts that emerge in an eventually consistent system, and discuss multi-datacenter replication.
Here is Matt Brender's presentation at Big Data TechCon centered on understanding how distributed systems play a role in Big Data.
Full description:
Whether you’re an experienced user of Hadoop or a recent convert to Spark, you recognize that data is powerful when stored and analyzed. Analysis, as a workload, can be contrasted with the initial creation and storage of that data. These “active” workloads are what generate the data we covet.
Understanding this persistence of data as workload requires an appreciation of distributed systems. We will explore what factors affect your choice in database technology and particularly how to prioritize the choice in core architectural underpinnings present in NoSQL designs. We will also explore what these technologies solve and suggestions for how to align them with your business objectives.
You’ll leave this session with an understanding of the basic principles of NoSQL architectural design and a deeper understanding of the considerations when identifying a persistence solution for your active workloads.
Basho and Riak at GOTO Stockholm: "Don't Use My Database."Basho Technologies
What are common use cases for NoSQL? When should I avoid NoSQL? When is RDBMS just fine?
This presentation, delivered at the GOTO NoSQL Roadshow events in London and Stockholm in November of 2011 by Basho co-founder and COO Antony Falco, takes a no-BS look at the tradeoffs one must make to gain the advantages offered by distributed databases like Riak.
6. CHANGE IN ARCHITECTURAL DESIGN
[Diagram: virtualization packs small apps onto big servers in one location; aggregation spreads big apps across commodity servers in many locations]
7. In 2014, 20% of enterprise data projects add distributed processes into production
8. THE BENEFITS OF RIAK
Riak is an operationally friendly database that is:
• Fault-tolerant
• Highly-available
• Scalable
• Self-healing
9. THE PROPERTIES OF A DISTRIBUTED DB
Riak is a multi-model database that is:
• Open Source & Commercial
• Distributed
• Masterless
• Eventually Consistent
11. This is NOT about Riak. This is about design decisions in distributed systems.
12. This IS about Riak. And learning from Basho’s architectural decisions.
13. DISTRIBUTED SYSTEMS – A DEFINITION
“A distributed system is a software system in which components located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal.”
--Wikipedia
14. DISTRIBUTED SYSTEMS – A DEFINITION
“A distributed system is one in which the failure of a computer you didn't even know existed can render your own computer unusable.”
--Leslie Lamport
15. DISTRIBUTED SYSTEMS – A DEFINITION
“Everything works at small scale. Understand failure modalities to understand your realities.”
--Tyler Hannan
20. HARVEST AND YIELD
Harvest
• a fraction
• data available / complete data
Yield
• a probability
• queries completed / queries requested
Failure will cause a known linear reduction to one of these.
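The two fractions from the slide can be written out directly. This is a minimal sketch of the harvest/yield definitions, not anything Riak-specific:

```python
def harvest(data_available, complete_data):
    """Harvest: what fraction of the complete data a response reflects."""
    return data_available / complete_data

def yield_(queries_completed, queries_requested):
    """Yield: the probability that a query completes at all."""
    return queries_completed / queries_requested

# Losing 1 of 4 replicas degrades one metric or the other, not both:
# answer every query with 3/4 of the data (harvest drops), or answer
# 3/4 of queries with complete data (yield drops).
```

This is the trade-off the "known linear reduction" line refers to: a failure shaves a predictable fraction off harvest or off yield, and the system designer chooses which.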
33. FAULT TOLERANCE
How many hosts/replicas do you need to survive “F” failures?
• F + 1 – fundamental minimum
• 2F + 1 – a majority are alive
• 3F + 1 – Byzantine Fault Tolerance
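The three replica bounds above can be captured in a few lines. A sketch of the standard fault-tolerance arithmetic, with the model names as illustrative labels:

```python
def replicas_needed(f, model):
    """Minimum hosts/replicas to survive f failures under each model."""
    return {
        "crash": f + 1,         # fundamental minimum: someone still holds the data
        "majority": 2 * f + 1,  # a quorum of live nodes can still be formed
        "byzantine": 3 * f + 1, # tolerate f arbitrarily misbehaving nodes
    }[model]
```

For example, surviving one failure takes 2 replicas if nodes only crash, 3 if you need a live majority, and 4 under Byzantine fault tolerance.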
55. DISTRIBUTED SYSTEMS – A DEFINITION
“Everything works at small scale. Understand failure modalities to understand your realities.”
--Tyler Hannan
56. RELATIONAL & NOSQL
What’s the difference?
NoSQL Database:
• Unstructured data
• No pre-defined schema
• Small and large data sets on commodity HW
• Many models: K/V, document store, graph
• Variety of query methods
Relational Database:
• Structured data
• Defined schema
• Tables with rows/columns
• Indexed w/ table joins
• SQL
57. THE EVOLUTION OF NOSQL
• Unstructured Data Platforms
• Multi-Model Solutions
• Point Solutions
58. “42% of database decision makers admit they struggle to manage the NoSQL solutions deployed in their environments”
COMPLEX TECHNOLOGY STACK: Riak, Spark
64. MILLIONS OF RECORDS
• Information requested and amended more than 2.6 BILLION times a year
• 42 MILLION Summary Care Records
• 1.3 BILLION prescription messages
65. BILLIONS OF MOBILE DEVICES
• 10 BILLION data transactions a day – 150,000 a second
• Forecasting 2.8 BILLION locations around the world
• Generates 4GB OF DATA every second
“IF THE SYSTEM IS ‘DOWN’ AND NO ONE MAKES A REQUEST, IS IT REALLY DOWN?” ~ ME
CAP theorem – Dr. Eric Brewer, Professor at UC Berkeley, VP of Infrastructure at Google, Basho board member
Consistency
Always return the last written value. “Eventual consistency” seems scary. It is not – DNS, master/slave RDBMS log shipping, and caching layers are all eventually consistent.
Availability
Get a response even if portions of the system are down.
Partition Tolerance
Not a trade-off – Google “coda hale partition tolerance”.
Harvest: a fraction – data available / complete data
Yield: a probability – queries completed / queries requested
Master/Replica architecture: transaction log shipping, hot/hot backup. Datacenter failure vs. node failure.
Assumption of transactional consistency.
What happens under failure conditions?
Requests are routed to Nodes using standard load balancing techniques
Under the covers, keys are actually stored as a combination of the bucket name and the user-assigned key value.
Potentially add proxy/client visual
Riak’s uniform distribution and equal allocation of vnodes to machines allows you to think of each machine as being responsible for 1/Nth of the data and 1/Nth of the performance.
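The 1/Nth intuition can be sketched with a toy ring. This is illustrative only: the partition count and the round-robin claim are assumptions (Riak actually hashes bucket/key with SHA-1 onto a 2^160 ring and uses a claim algorithm, not a simple modulo):

```python
import hashlib

RING_SIZE = 64  # number of partitions (vnodes) on the toy ring

def partition_for(bucket, key):
    """Hash the bucket/key pair onto one of the ring's partitions."""
    h = int.from_bytes(hashlib.sha1(f"{bucket}/{key}".encode()).digest(), "big")
    return h % RING_SIZE

def vnodes_for_node(node_index, node_count):
    """With vnodes claimed evenly, each of N machines holds roughly
    RING_SIZE / N partitions -- hence 1/Nth of data and load."""
    return [p for p in range(RING_SIZE) if p % node_count == node_index]
```

With 4 machines, each claims 16 of the 64 partitions; adding a fifth machine shrinks every machine's share rather than resharding by hand.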
Hot spots: unevenly spread data and request patterns.
Resharding is operationally intensive and often manual.
Designed for vertical scale; cost considerations are a key element of vertical scaling.
Sharding
An approach is to use hashing algorithms to distribute data across the F instances. This will be a “ring free” presentation…
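A quick sketch of why naive hash-mod-N sharding makes the resharding pain mentioned above so acute: when the instance count changes, most keys map to a different instance and must be migrated. (MD5 is used here only as a stable hash; Python's built-in `hash()` is salted per process.)

```python
import hashlib

def hash_of(key):
    """Stable, deterministic hash of a string key."""
    return int.from_bytes(hashlib.md5(key.encode()).digest(), "big")

def shard(key, n):
    """Naive sharding: key goes to instance hash(key) mod n."""
    return hash_of(key) % n

keys = [f"user:{i}" for i in range(1000)]
# Count how many keys land on a different instance after growing 5 -> 6.
moved = sum(shard(k, 5) != shard(k, 6) for k in keys)
print(f"{moved} of {len(keys)} keys move when growing from 5 to 6 shards")
```

Roughly 5/6 of all keys relocate for a single added instance, which is why consistent-hashing rings (the thing this presentation is pointedly avoiding drawing) were invented: they limit movement to about 1/N of the keys.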
Latency is largely a function of the speed of light, which is 299,792,458 meters/second in vacuum. This would equate to a latency of 3.33 microseconds for every kilometer of path length. The index of refraction of most fibre optic cables is about 1.5, meaning that light travels about 1.5 times as fast in a vacuum as it does in the cable. This works out to about 4.9 microseconds of latency for every kilometer. In shorter metro networks, the latency performance rises a bit more due to building risers and cross-connects and can bring the latency as high as 5 microseconds per kilometer.
Our understanding of availability translates to the computational systems we build. Some systems use physical temporal clocks and timestamps (even Riak). Time is continuous, but we can only represent discrete moments in a computer. The granularity of the discretization depends on the probable frequency of events. We tend to choose a millisecond, which is like plucking a drop of water from a gushing river.
To make matters worse, we know for certain that information does not transmit instantaneously, the speed of light is finite. The distance across this nebula is probably many light-years. To put it on something closer to human scale: the distance between San Francisco and New York at the speed of light is 14 milliseconds, which is a long time in a computer!
Now add in a global footprint. The latency, round trip, from SF to Amsterdam can be as much as 200ms.
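The back-of-envelope arithmetic from the notes above can be checked in a few lines (the SF–NY distance is an approximate great-circle figure, an assumption for illustration):

```python
C = 299_792_458   # speed of light in vacuum, m/s
INDEX = 1.5       # approximate refractive index of fibre optic cable

# Latency per kilometer of path length.
us_per_km_vacuum = 1_000 / C * 1e6           # ~3.33 microseconds/km
us_per_km_fibre = us_per_km_vacuum * INDEX   # ~5.0 microseconds/km

# One-way SF -> NY at the speed of light in vacuum.
sf_to_ny_km = 4_130                          # approx. great-circle distance
one_way_ms = sf_to_ny_km * us_per_km_vacuum / 1_000  # ~14 ms

print(f"{us_per_km_fibre:.1f} us/km in fibre, {one_way_ms:.1f} ms SF->NY")
```

The point stands: even before queuing, routing, or software overhead, physics alone puts a double-digit-millisecond floor under cross-continent round trips.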
In-memory BigTable lookups
–data replicated in two in-memory tables
–issue requests for 1000 keys spread across 100 tablets
–measure elapsed time until data for last key arrives
And…more importantly…when?
They built a tool called basho_bench that’s designed for testing distributed key-value stores.
It allows users to specify options such as their key distribution, values, etc.
All of our tests used binary representations of integers for keys, and pseudorandom, incompressible data for values.
In our testing, load generation was spread across three nodes, with one node also acting as a test coordinator.
There were 256 virtual clients spread across those 3 nodes. All benchmarks were run with {mode, max}, a worst-case scenario for load generation. On connection termination, the client was configured to immediately reconnect and retry the op. The benchmark used the protocol buffers client, which maintains a long-lived connection to the Riak cluster. In order to enable us to reason about an elastic cluster, we added a new driver operation, reconnect. In every one of our tests, 1 in 10,000 operations per client driver resulted in a reconnect to a new node.
Each of our instances were of the type n1-standard-16 deployed in the us-central1-f zone. The instances themselves were running the image ‘backports-debian-7-wheezy-v20140904’ All Riak nodes were deployed in the same network. Network load balancing was used to distribute the traffic amongst the Riak nodes, with health checking doing a /ping on the HTTP interface with default healthcheck intervals.
One of the benefits of distributed, Dynamo-style databases is that cluster expansion is relatively easy. Oftentimes, unexpected load on traditional databases leaves customers and engineers in a crux where the entire system is unavailable. Upgrading and expanding hardware for those databases is typically a tenuous, multi-hour offline operation. In this test, we show the effects of taking a fully loaded cluster of 6 nodes and growing it by 3 nodes, 180 seconds after we started the test.
As you can see in this test, 5 minutes after cluster expansion was initiated, an increased throughput of 27% was realized. On the other hand, update latency took slightly longer to converge, as that was based on rebalancing the riak_api and protocol buffers coordination load across the rest of the cluster. During the entire rebalancing period, median latency slightly increased, but handoffs can be throttled, and fewer nodes can be added per handoff quantum, to make this process have fewer side effects.
Node failure is a normal part of life for operators today. This is really where Riak shines. We started the test, and 90 seconds in, we killed 1 of 9 nodes by forcing an immediate shutdown of Riak via pkill. After 420 seconds, an operator-initiated force-remove was executed, and 2100 seconds later the cluster had converged.
There was only one operation that actually returned an error; unfortunately, it didn’t show up on the graph. The rest of the operations immediately retried and succeeded. Realistically, convergence could have taken up to 2 seconds, given that’s how long Google’s default health-check window takes. Node failure is simply a non-event in a properly built and planned Riak cluster.
Reduce complexity with integrated NoSQL databases, caching, in-memory analytics, and search components
Enhance high availability and fault tolerance across components
Integrate real-time analytics with Apache Spark and Riak KV
Increase application performance with integrated Redis caching and Riak KV
Optimize search with Apache Solr and Riak KV integration
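The Redis-plus-Riak integration listed above is essentially a cache-aside read path. Here is a minimal sketch of that pattern; the `cache` and `store` objects are hypothetical stand-ins for real Redis and Riak client libraries:

```python
def cached_get(cache, store, bucket, key, ttl=300):
    """Cache-aside read: try the cache first, fall back to the
    authoritative store on a miss, then populate the cache."""
    cache_key = f"{bucket}:{key}"
    value = cache.get(cache_key)
    if value is None:
        value = store.get(bucket, key)      # authoritative read from Riak
        if value is not None:
            cache.set(cache_key, value, ttl)  # warm the cache with a TTL
    return value
```

The TTL bounds staleness: a cached value can lag the store by at most `ttl` seconds, which is the trade this architecture makes for the read-latency win.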
Xfinity
Run app on iOS or Android and program television remotely
User Profile - Stored as JSON document
Metadata – do you have rights to record?
Non-trivial (~80%) drop in calls to the support center