How in-memory technology will impact
Machine/Deep Learning services
(Redis Labs case study)
DECEMBER 2017
Who We Are
Open source. The leading in-memory database platform, supporting any high-performance operational, analytics, or hybrid use case.
The open source home and commercial provider of Redis Enterprise (Redisᵉ) technology, platform, products & services.
Why Redis?
• Performance: NoSQL benchmark leader
• Simplicity: Redis data structures (Strings, Lists, Hashes, Sets, Sorted Sets, Bitmaps, Bit fields, Hyperloglog, Geospatial Indexes, Streams)
• Extensibility: Redis Modules
The New Standard for Application Serving
• 100 msec: the new standard for end-to-end (E2E) application response time, under any load
• 50 msec: average roundtrip internet latency
• 50 msec: required round-trip app response time (includes processing & multi-DB access)
• 1 msec: required DB response time
[Diagram: App Servers exchanging requests with the Database]
The Machine / Deep Learning (ML/DL) World
(1) Training, (2) Creating a model, (3) Serving the model
The open question: how do you serve the model?
Challenges in Serving ML Models
– Models are becoming bigger and more complex
– Not standardized
– Can be very complex to deploy
– Do not scale well, in either speed or size
– Reliable serving is hard to build
– Can be very expensive
An Accurate Model is Important
Otherwise…
Random Forest Model
• A collection of decision trees
• Supports classification & regression
• A splitter node can be:
  • Categorical (e.g. day == "Sunday")
  • Numerical (e.g. age < 43)
• The decision is taken by the majority of the decision trees (see the sketch below)
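As a minimal illustration of those ideas, here is a toy random forest built with scikit-learn (an assumption for illustration; the deck does not name a training library, and the data and features below are invented). Each tree learns splits of the kind described above, and the ensemble prediction is the majority vote of the trees:

```python
# Toy sketch, assuming scikit-learn is installed; data and features are invented.
from sklearn.ensemble import RandomForestClassifier

# Titanic-like features: [is_male, age, siblings_spouses]
X = [
    [1, 34.0, 1],
    [0, 28.0, 0],
    [1, 6.0, 3],
    [0, 45.0, 1],
    [1, 52.0, 0],
    [0, 8.0, 2],
]
y = [0, 1, 0, 1, 0, 1]  # 1 = survived, 0 = died

# A small forest: each tree learns its own splits (e.g. age < 9.5, is_male == 1)
forest = RandomForestClassifier(n_estimators=3, max_depth=3, random_state=0)
forest.fit(X, y)

# The ensemble answer is the majority vote of the individual trees
print(forest.predict([[1, 34.0, 1]]))
```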
Titanic Survival Predictor on a Decision Tree
[Decision-tree diagram: splits on Sex = Male?, Age < 9.5?, and *Sibps > 2.5?, with each YES/NO branch ending in a Survived or Died leaf. *Sibps = siblings + spouses]
Titanic Survival Predictor on a Random Forest
[Random-forest diagram with three decision trees:
 Tree #1 splits on Sex = Male?, Age < 9.5?, *Sibps > 2.5?
 Tree #2 splits on Country = US?, State = CA?, Height > 1.60m?
 Tree #3 splits on Weight < 80kg?, I.Q < 100?, Eye color = blue?
 Each YES/NO branch ends in a Survived or Died leaf.]
Would John Survive The Titanic?
• John's features:
  {male, 34, married + 2, US, CA, 1.78m, 78kg, 110 IQ, blue eyes}
• Tree #1: Survived
• Tree #2: Died
• Tree #3: Survived
• Random forest decision (majority vote, sketched below): Survived
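The majority vote itself is trivial; a minimal sketch (the per-tree verdicts are taken from the slide above, everything else is illustrative):

```python
from collections import Counter

# Per-tree verdicts for John, as listed above
votes = {"Tree #1": "Survived", "Tree #2": "Died", "Tree #3": "Survived"}

# Random-forest classification = majority vote over the individual trees
decision, count = Counter(votes.values()).most_common(1)[0]
print(f"{decision} ({count}/{len(votes)} trees)")  # Survived (2/3 trees)
```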
Real-World Challenge
• An ads-serving company
• Needs to serve 20,000 ads/sec at 50 msec data-center latency
• Runs 1K campaigns → 1K random forests
• Each forest has 15K trees
• On average each tree has 7 levels (depth)
Accurate Models are Expensive to Serve!

Random forest ops/sec
  Calculation: 20K (ads/sec) × 1K (forests) × 15K (trees) × 7 × 0.5 (avg. levels traversed)
  Total: 1.05 trillion

Max ops/sec on the strongest AWS instance vcore
  Calculation: 2.6 GHz × 0.9 (OS overhead) × 0.1 (10 lines of code per op) × 0.1 (Java overhead)
  Total: 23.4 million

Number of vcores needed
  Calculation: 1.05 trillion / 23.4 million
  Total: 44,872

Number of c4.8xlarge instances needed (36 vcores each)
  Calculation: 44,872 / 36
  Total: 1,247

Total cost (reserved instances)
  Calculation: 1,247 × $9,213/yr
  Total: ~$11.5M/yr
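A quick back-of-the-envelope check of those numbers (Python; the overhead factors and the reserved-instance price are the slide's own assumptions, not measurements):

```python
# Reproduce the serving-cost estimate from the table above.
ads_per_sec      = 20_000
forests          = 1_000          # one random forest per campaign
trees_per_forest = 15_000
avg_levels       = 7 * 0.5        # roughly half of the 7 levels traversed per tree

forest_ops = ads_per_sec * forests * trees_per_forest * avg_levels
print(f"forest ops/sec: {forest_ops:.3g}")                 # ~1.05e12

vcore_ops = 2.6e9 * 0.9 * 0.1 * 0.1                        # clock x OS x code x Java overheads
print(f"ops/sec per vcore: {vcore_ops:.3g}")               # ~2.34e7

vcores    = forest_ops / vcore_ops                         # ~44,872
instances = vcores / 36                                    # c4.8xlarge has 36 vcores -> ~1,247
cost      = round(instances) * 9_213                       # ~$9,213/yr per reserved instance
print(f"{vcores:,.0f} vcores, {instances:,.0f} instances, ~${cost:,.0f}/yr")
```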
Redis-ML: a Redis Module for Serving ML Models
Supported models:
• Tree Ensembles
• Linear Regression
• Logistic Regression
• Matrix + Vector Operations
• K-Means
• Neural Networks
Redis-ML: Predictive Model Serving Engine
• Predictive models as native Redis types
• Perform evaluation directly in Redis
• Store the training output as a "hot model"
[Diagram: data is loaded into Spark (or any other training platform) for training; the trained model is saved into Redis-ML; client apps then query Redis-ML directly for serving.]
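From a client's point of view, serving looks roughly like the sketch below (redis-py; the ML.FOREST.ADD / ML.FOREST.RUN syntax follows the Redis-ML module's published examples but should be treated as approximate, and the key names, features, and values are invented):

```python
# Sketch only: requires a Redis server with the Redis-ML module loaded, plus redis-py.
import redis

r = redis.Redis(host="localhost", port=6379)

# Add one tiny decision tree (tree 0) to the forest "ads:forest:42":
# the root tests the numeric feature "age" < 9.5; leaves predict 1 (left) or 0 (right).
r.execute_command(
    "ML.FOREST.ADD", "ads:forest:42", 0,
    ".",  "NUMERIC", "age", 9.5,
    ".l", "LEAF", 1,
    ".r", "LEAF", 0,
)

# Serve a prediction directly from Redis: the sample is passed as feature:value pairs.
result = r.execute_command("ML.FOREST.RUN", "ads:forest:42", "age:34,sex:male")
print(result)
```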
Ads Serving Use-Case With and Without Redisᵉ + ML
• Homegrown serving: 1,247 × c4.8xlarge
• Redisᵉ + Redis-ML: 35 × c4.8xlarge
• Computing infrastructure cut by ~97%
[Chart: end-to-end serving latency in msec, homegrown vs Redisᵉ + ML; roughly 2,000× faster with Redisᵉ + ML.]
Why Serve ML/DL at the Data Layer?
Traditional ML Pipeline ("A cat or a dog?")
The incoming data makes two hops before it can even be classified:
1. Ingestion: the item to classify (e.g. an image) lands in a streams engine
2. Pre-processing: the streams engine pre-processes it and stores the data + metadata in a DBMS
Then, for every classification request:
1. Notification: a custom app is notified that new data has arrived
2. The app fetches the data + metadata from the DBMS
3. The app passes the data to the ML/DL engine for evaluation
4. The result travels back to the app
The cost: 4× data movement, O(3)× messages, and end-to-end latency well over 100 msec!
What if Streams + ML run on your in-memory database?
With Redis Enterprise (streams + ML modules), the same "a cat or a dog?" flow collapses:
1. The data is ingested directly into Redis Enterprise, and the custom app is notified
2. One API call from the app evaluates the model inside the database
3. The result is returned to the app
The payoff: 1× data movement, 3× messages, and single-digit latency (< 10 msec).
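A rough illustration of that single-call pattern with redis-py (the stream name, key names, and the Redis-ML command syntax are assumptions for illustration, not the deck's actual deployment):

```python
# Sketch: the data hand-off and the model run both happen inside Redis.
# Assumes a streams-capable Redis (5.0+) with the Redis-ML module loaded.
import redis

r = redis.Redis(host="localhost", port=6379)

# 1. Ingest the pre-processed feature vector into a Redis stream.
features = {"age": "34", "sex": "male", "sibsp": "1"}
entry_id = r.xadd("images:incoming", features)

# 2. One API call evaluates the forest inside the database (no bulk data export).
sample = ",".join(f"{k}:{v}" for k, v in features.items())
verdict = r.execute_command("ML.FOREST.RUN", "classifier:cat_vs_dog", sample)

# 3. The app only receives the small result, not the data itself.
print(entry_id, verdict)
```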
Thank you!