© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS re:INVENT
Large Scale Deep Learning
with BigDL
Tim Fox | Big Data and Machine Learning Consultant | Elephant Scale
ABOUT ME
Tim Fox,
Principal @ Elephant Scale
Practitioner and Trainer in Data Engineering and Data Science
Author of “Data Science in Python” on LinkedIn Learning
tim@elephantscale.com
LinkedIn: tim-fox-0063541
ABOUT Elephant Scale
• Training in Big Data and AI technologies
• Big Data: Spark, Hadoop, Cloud, NoSQL, Streaming
• AI: Machine Learning, Deep Learning, BigDL, TensorFlow
• BigDL training available!
• Public and private trainings available
• BigDL Sandbox: elephantscale.com/sandbox
elephantscale.com
info@elephantscale.com
Quick Roundup of AI / Machine
Learning / Deep Learning
AI / MACHINE LEARNING / DEEP LEARNING
Artificial Intelligence (AI):
The broader concept of machines being able to carry out 'smart' tasks
Machine Learning:
A type of AI that allows software to learn from data without being explicitly programmed
Deep Learning:
Using neural networks to solve some hard problems
[Diagram: Artificial Intelligence / Machine Learning / Deep Learning]
DEEP LEARNING APPLICATIONS
Self-Driving Cars
• ML system using image recognition
• Detects the edge of the road, road signs, and the car in front
Face recognition
• Facebook images
• System learns from manually tagged images and then automatically detects faces in uploaded photos
DEEP LEARNING HISTORY
Early attempts at Deep Learning did not succeed.
• Compute power was insufficient at the time.
• Training datasets were too small for good results.
• We lacked the ability to parallelize our work.
In the modern era, Deep Learning has been successful.
• 'Big Data' – we now have plenty of data to train our models
• 'Big Data ecosystem' – excellent big data platforms (Hadoop, Spark, NoSQL) are available as open source
• 'Big Compute' – cloud platforms have significantly lowered the barrier to massive compute power
• $1 buys you a 16-core, 128 GB, 10 Gigabit machine for 1 hr on AWS!
• So running a 100-node cluster for 5 hrs costs about $500
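A rough sketch of the cost arithmetic above, assuming the flat rate of $1 per node-hour quoted on the slide. Real AWS pricing varies by instance type, region, and purchase option, and the function name `cluster_cost` is only illustrative.

```python
def cluster_cost(nodes: int, hours: float, rate_per_node_hour: float = 1.0) -> float:
    """Return the total cost in dollars of running a cluster.

    rate_per_node_hour defaults to the $1/hour figure from the slide,
    which is an approximation rather than a quoted AWS price.
    """
    return nodes * hours * rate_per_node_hour


# 100 nodes for 5 hours at roughly $1 per node-hour
cost = cluster_cost(nodes=100, hours=5)
print(f"100 nodes for 5 hours: ${cost:,.0f}")
```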
AI SOFTWARE ECOSYSTEM
            | Machine Learning | Deep Learning
Java        | Weka, Mahout     | DeepLearning4J
Python      | SciKit           | TensorFlow, Theano, Caffe
R           | Many libraries   | Deepnet, Darch
Distributed | H2O, Spark       | H2O, Spark, BigDL
Cloud       | AWS              | AWS
MACHINE LEARNING AND BIG DATA
Until recently, most machine learning was done on a “single computer” (with lots of memory, 100s of GBs)
Most R/Python/Java libraries are “single node based”
Now Big Data tools make it possible to run machine learning algorithms at massive scale, distributed across a cluster
MODERN DEEP LEARNING FRAMEWORKS
TOOLS FOR SCALABLE MACHINE LEARNING
Apache Spark ML
• Runs on top of the popular Spark framework
• Massively scalable
• Can use memory (caching) effectively for iterative algorithms
• Language support: Scala, Java, Python, R
BigDL
• Built for Apache Spark and optimized for Intel Xeon
• Language support: Scala, Java, Python
TensorFlow
• Based on “data flow graphs”
• Language support: Python, C++
• https://www.tensorflow.org/
TOOLS FOR SCALABLE CLOUD MACHINE LEARNING
Amazon Machine Learning
• Ready-to-go algorithms
• Visualization tools
• Wizards to guide
• Scalable on Amazon Cloud
BigDL
WHAT IS BIGDL
A distributed deep learning library for Apache Spark
Feature parity with popular deep learning frameworks
• Caffe, Torch, TensorFlow
High Performance
• Powered by Intel Math Kernel Library (MKL) and multi-threaded programming
Can scale to huge datasets
• Using Apache Spark for scale
Open source! (Dec 2016)
Active Development
PRODUCTION ML/DL SYSTEMS ARE COMPLEX!
The actual ML/DL is only a small portion of a massive production system
BigDL running on a scalable platform like Spark helps simplify the complexity
BIGDL FILLS THE 'GAP' IN BIG DATA + DEEP LEARNING
Follows proven design patterns for dealing with Big Data
Sends 'compute to data' rather than reading massive data over the network.
Uses 'data locality' of HDFS (Hadoop Distributed File System)
Utilizes 'cluster managers' like YARN / Mesos
• Automatically handles hardware/software failures
• Elasticity and resource sharing in a cluster
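A minimal single-machine sketch of the 'compute to data' idea, not BigDL or Spark code. The partitions, `local_summary`, and `combine` are hypothetical names used for illustration. Each partition is summarized where it lives, and only the small per-partition result would travel over the network.

```python
# Hypothetical data set, already split into partitions that would
# live on different nodes of a cluster.
partitions = [
    [4.0, 8.0, 6.0],
    [10.0, 2.0],
    [5.0, 7.0, 9.0, 1.0],
]


def local_summary(partition):
    """Runs next to the data; returns only a tiny (sum, count) pair."""
    return sum(partition), len(partition)


def combine(summaries):
    """Runs on the driver; merges the small per-partition results."""
    total = sum(s for s, _ in summaries)
    count = sum(n for _, n in summaries)
    return total / count


# The raw rows never leave their partition; only the summaries move.
summaries = [local_summary(p) for p in partitions]
global_mean = combine(summaries)
print(global_mean)
```

On a real cluster the list comprehension would be replaced by a distributed map scheduled by the cluster manager; the structure of the computation stays the same.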
BIGDL & SPARK
Run BigDL applications as Spark applications
Scala, Java, and Python support
Use Spark's other features
• In-memory compute
• Integrate with Spark ML and Streaming
Easy development with Jupyter Notebook
BIGDL VS TENSORFLOW
                       | BigDL | TensorFlow
Runtime                | Scala engine with Python front-end | C++ engine with Python front-end
Hadoop compatibility   | Can run natively on Spark & Hadoop | Accesses Hadoop data as a client only
Distributed operation  | Scalable with Apache Spark for massive scale out of the box | Does not support massive distribution out of the box
Runs TensorFlow models | Yes | Yes
Acceleration           | CPU w/MKL | CPU/GPU
Summary                | Excellent for distributing deep-learning models to massive scale on big data. Great TCO value. | Excellent library for small-to-medium scale data, although GPU hardware costs can be significant.
BIGDL: BIG COMPUTE PLUS BIG DATA
BigDL helps us balance our needs
• Big Compute: Fast linear algebra, Intel MKL library
• Optimized for Intel Xeon
• Big Data: I/O parallelized to run on many CPUs
BigDL allows massive scalability
• Natively designed to run on Spark
• Works with the Hadoop ecosystem (via Spark)
Hadoop is THE Big Data platform for on-premise deployments
Plays nicely with other deep learning frameworks
• Use existing TensorFlow or Caffe models at scale in BigDL
• Train new models based on existing TF / Caffe models
BIGDL USE CASES
Fraud detection
Sentiment analysis
Image recognition
Find more at: https://github.com/intel-analytics/analytics-zoo/
GPUs and CPUs
GPUS (GRAPHICS PROCESSING UNITS)
GPUs have addressed past issues in training performance
• Example: TensorFlow is optimized to run well on GPUs.
In the past, CPUs were not vectorized for parallel compute
• This meant that GPUs were much faster for deep learning
Modern Intel Xeon CPUs have vectorized linear algebra
• Properly optimized, it approaches the speed of GPUs
• CPUs are now a credible alternative to running on GPUs
• Cost advantage and scalability
INTEL MATH KERNEL LIBRARY (MKL)
Features highly optimized, threaded, and vectorized math functions that maximize performance on each processor family.
Utilizes industry-standard C and Fortran APIs for compatibility with popular BLAS, LAPACK, and FFTW functions; no code changes required.
Dispatches optimized code for each processor automatically without the need to branch code.
Provides priority support, connecting you directly to Intel engineers for confidential answers to technical questions.
INTEL MKL PERFORMANCE
CPU VERSUS GPU FOR BIG DATA
CPU offers higher scalability at lower cost versus GPU
Optimized software and libraries on CPU allow single-node performance to approach GPU performance.
GPU plus CPU architectures can be effective for a smaller number of nodes, when cost is not a concern.
“Big Compute” versus “Big Data”
Running BigDL
RUNNING BIGDL
Developing:
Use the following to develop your BigDL apps effortlessly
• Docker
• VM Sandbox
Deploying:
Cloud-ready deployment
• Amazon AMI
DEMO: GETTING STARTED WITH BIGDL
We will provide:
• Docker
• Sandbox VM
• AWS Marketplace AMI
BigDL Summary
BigDL offers outstanding scalability and performance
BigDL optimizes TCO by being tuned and optimized for Intel Xeon processors
BigDL brings deep learning to Spark clusters and Hadoop datasets
BigDL can be used to deploy TensorFlow and Caffe models to big data.
IMAGE RECOGNITION WITH
APACHE SPARK AND BIGDL
Alex Kalinin | VP, AI / Machine Learning | Sizmek
ABOUT ME
Alex Kalinin
VP, AI / Machine Learning | Sizmek
alex.kalinin@sizmek.com
LinkedIn: linkedin.com/in/alexkalinin/
AI-POWERED MARKETING AND OPTIMIZATION
100,000,000,000 requests per day
70,000,000 / minute
1,200,000 / sec
PBs of training data
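A quick check of how the daily request volume relates to the per-minute and per-second figures, assuming traffic is spread evenly over the day. The slide's numbers appear to be the same quantities rounded up.

```python
REQUESTS_PER_DAY = 100_000_000_000

# Assumes a perfectly even load over 24 hours, which real traffic is not.
per_minute = REQUESTS_PER_DAY / (24 * 60)
per_second = REQUESTS_PER_DAY / (24 * 60 * 60)

print(f"{per_minute:,.0f} requests/minute")
print(f"{per_second:,.0f} requests/second")
```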
FEED-FORWARD NETWORK
[Animated diagram: forward pass through a small feed-forward network, computed one neuron at a time]
Inputs: 43, 37, 45, 40
Weighted sum: $y = \sum_i w_i x_i$
First hidden neuron, values on its connections: -0.53, 0.01, -0.17, 0.70, 0.51; weighted sum: -1.56
With activation: $y = \mathrm{ReLU}\left(\sum_i w_i x_i\right)$
First hidden neuron after ReLU: 0
Next hidden neuron, values on its connections: -0.12, 0.13, 0.21, -0.07, -0.05; output: 11.9
Next hidden neuron: weighted sum -0.11; after ReLU: 0
Last hidden neuron: output 0.15
Output layer: first neuron weighted sum -0.67, after ReLU 0; second neuron 0.52
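The first neuron of the walkthrough above can be reproduced in a few lines. This is a sketch under one assumption: the fifth value shown for the neuron (0.51) is treated as a bias term added to the weighted sum, which is the reading that reproduces the -1.56 on the slide. The function names are illustrative.

```python
def relu(z: float) -> float:
    """Rectified linear unit: negative values become 0."""
    return max(0.0, z)


def neuron(inputs, weights, bias):
    """Return (weighted sum, activation) for a single neuron."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return z, relu(z)


inputs = [43, 37, 45, 40]
weights = [-0.53, 0.01, -0.17, 0.70]
bias = 0.51  # assumption: the fifth number on the slide is the bias

z, y = neuron(inputs, weights, bias)
print(round(z, 2), y)
```

The weighted sum is negative, so ReLU clips the neuron's output to 0, matching the value the diagram fills in for that neuron.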
FULLY CONNECTED
Input size: 40,000 (200 × 200)
Connections: 1,600,000,000 (40,000 × 40,000)
10 layers: 16 billion
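The connection counts can be checked with simple arithmetic. The sketch assumes every layer has as many units as the input (40,000), which is what the 1.6 billion figure implies; `dense_connections` is an illustrative name.

```python
def dense_connections(n_in: int, n_out: int) -> int:
    """Number of weights between two fully connected layers (biases ignored)."""
    return n_in * n_out


pixels = 200 * 200                                # 200 x 200 input
per_layer = dense_connections(pixels, pixels)     # each unit sees every pixel
ten_layers = 10 * per_layer

print(f"{pixels:,} inputs, {per_layer:,} connections per layer, {ten_layers:,} for 10 layers")
```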
INVENTION OF CONVOLUTIONAL NEURAL NETWORKS
• LeNet-5 network developed in 1998 by Yann LeCun
• David Hubel and Torsten Wiesel
HUBEL & WIESEL
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1363130/
HIERARCHICAL & LOCAL VISUAL CORTEX
Lines, Dots
Orientation, Movement
High-Level Shapes
KEY FEATURES OF CONVOLUTIONAL NETWORK
• Convolution
• Pooling
FULLY CONNECTED
CONVOLUTION
Only four weights
Filter:
0.10 -0.06
0.24 0.17
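A small pure-Python sketch of how a 2 × 2 filter slides over an image, reusing the same four weights at every position. The filter values are the ones on the slide. The 3 × 3 input image is made up for illustration, and the sketch uses stride 1 with no padding.

```python
def convolve2d(image, kernel):
    """'Valid' 2D convolution as used in CNNs (no kernel flip, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(kernel[i][j] * image[r + i][c + j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


# Filter from the slide: four weights shared across the whole image
kernel = [[0.10, -0.06],
          [0.24, 0.17]]

# Hypothetical 3 x 3 input, not from the talk
image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]

feature_map = convolve2d(image, kernel)
for row in feature_map:
    print([round(v, 2) for v in row])
```

However large the image, the layer still learns only these four weights, in contrast to the billions of connections counted for the fully connected case.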
POOLING
Source: https://cs231n.github.io/convolutional-networks/
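A minimal sketch of pooling, assuming max pooling with a 2 × 2 window and stride 2 (the slide does not say which variant it shows). The 4 × 4 input is an illustrative example.

```python
def max_pool_2x2(feature_map):
    """2x2 max pooling with stride 2; assumes even height and width."""
    return [
        [
            max(feature_map[r][c], feature_map[r][c + 1],
                feature_map[r + 1][c], feature_map[r + 1][c + 1])
            for c in range(0, len(feature_map[0]), 2)
        ]
        for r in range(0, len(feature_map), 2)
    ]


# Illustrative 4 x 4 feature map
feature_map = [[1, 1, 2, 4],
               [5, 6, 7, 8],
               [3, 2, 1, 0],
               [1, 2, 3, 4]]

pooled = max_pool_2x2(feature_map)
print(pooled)
```

Each 2 × 2 block is replaced by its largest value, so the output has half the height and half the width of the input.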
CONVOLUTIONAL NETWORK
[Diagram: Input, Convolution, Pooling, Convolution, Pooling, FC, FC, then an output over the digit classes 0 to 9 with class 4 set to 1 and all others 0]
Source: https://www.clarifai.com/technology
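Reading the prediction off the network's output is a matter of picking the class with the largest value. The sketch below uses the output vector from the diagram; the variable names are illustrative.

```python
# Output of the final layer in the diagram: one value per digit class 0..9
output = [0, 0, 0, 0, 1, 0, 0, 0, 0, 0]

# The predicted digit is the index of the largest output value
predicted_digit = max(range(len(output)), key=output.__getitem__)
print(predicted_digit)
```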
http://scs.ryerson.ca/~aharley/vis/conv/flat.html
DEMO
GitHub: https://github.com/alex-kalinin/lenet-bigdl
Getting Started With BigDL
ABOUT ME
Sujee Maniyam
Founder / Principal @ Elephant Scale
Practitioner and Trainer in Data Engineering and Data Science
Author
- "Hadoop and Spark" video training on O'Reilly Media
- "HBase Design Patterns"
- "Hadoop illuminated"
sujee@elephantscale.com
LinkedIn: linkedin.com/in/sujeemaniyam
RUNNING BIGDL
Developing:
Use the following to develop your BigDL apps effortlessly
• Docker
• VM Sandbox
• Amazon AMI
Deploying:
Cloud-ready deployment
• Amazon AMI
GETTING STARTED WITH BIGDL
We will demonstrate
• Docker
• Sandbox VM
• AWS Marketplace AMI
Docker
Step 1: Install Docker on your laptop
Step 2: Get the Docker image
docker pull elephantscale/bigdl-sandbox
Step 3: Download the tutorials
git clone https://github.com/elephantscale/bigdl-tutorials
Step 4: Launch Docker
cd bigdl-tutorials
./run-bigdl-docker.sh elephantscale/bigdl-sandbox
Step 5: Go to the Jupyter notebook
VM-Sandbox
Step 1 : Install VMware Player or VirtualBox on your laptop
Step 2 : Download BigDL-Sandbox image
http://elephantscale.com/sandbox/
Step 3 : (In Sandbox) download tutorials
git clone https://github.com/elephantscale/bigdl-tutorials
Step 4 : (In Sandbox) Run BigDL natively
cd bigdl-tutorials
./run-bigdl-native.sh
Step 5 : (In Sandbox) Go to Jupyter notebook
Docker on AWS
Step 1: Spin up an AMI (Ubuntu recommended)
Step 2: Install Docker on the instance
Step 3: Get the Docker image
docker pull elephantscale/bigdl-sandbox
Step 4: Download the tutorials
git clone https://github.com/elephantscale/bigdl-tutorials
Step 5: Launch Docker
cd bigdl-tutorials
./run-bigdl-docker.sh elephantscale/bigdl-sandbox
Step 6: Go to the Jupyter notebook
AMI on AWS
Step 1: Spin up the BigDL AMI
Step 2: Download the tutorials
git clone https://github.com/elephantscale/bigdl-tutorials
Step 3: Run BigDL
cd bigdl-tutorials
./run-bigdl-native.sh
Step 4: Go to the Jupyter notebook
QUESTIONS
GitHub: https://github.com/alex-kalinin/lenet-bigdl
LinkedIn: https://www.linkedin.com/in/alexkalinin/
Notebooks and Resources
BigDL: software.intel.com/bigdl
Tutorials: github.com/dnielsen/bigdl-resources
Sandbox: elephantscale.com/sandbox
BigDL AMI: aws.amazon.com/marketplace/
Training: elephantscale.com
Slides: slideshare.net/dcnielsen/
Tim Fox (Elephant Scale)
Sujee Maniyam (Elephant Scale)
Alex Kalinin (Sizmek)
Dave Nielsen (Intel Software)
THANK YOU!

More Related Content

What's hot

ABD316_American Heart Association Finding Cures to Heart Disease Through the ...
ABD316_American Heart Association Finding Cures to Heart Disease Through the ...ABD316_American Heart Association Finding Cures to Heart Disease Through the ...
ABD316_American Heart Association Finding Cures to Heart Disease Through the ...
Amazon Web Services
 
How to build a data lake with aws glue data catalog (ABD213-R) re:Invent 2017
How to build a data lake with aws glue data catalog (ABD213-R)  re:Invent 2017How to build a data lake with aws glue data catalog (ABD213-R)  re:Invent 2017
How to build a data lake with aws glue data catalog (ABD213-R) re:Invent 2017
Amazon Web Services
 
Best Practices for Building a Data Lake in Amazon S3 and Amazon Glacier, with...
Best Practices for Building a Data Lake in Amazon S3 and Amazon Glacier, with...Best Practices for Building a Data Lake in Amazon S3 and Amazon Glacier, with...
Best Practices for Building a Data Lake in Amazon S3 and Amazon Glacier, with...
Amazon Web Services
 
Technology Trends in Data Processing - DAT311 - re:Invent 2017
Technology Trends in Data Processing - DAT311 - re:Invent 2017Technology Trends in Data Processing - DAT311 - re:Invent 2017
Technology Trends in Data Processing - DAT311 - re:Invent 2017
Amazon Web Services
 
Big Data Breakthroughs: Process and Query Data In Place with Amazon S3 Select...
Big Data Breakthroughs: Process and Query Data In Place with Amazon S3 Select...Big Data Breakthroughs: Process and Query Data In Place with Amazon S3 Select...
Big Data Breakthroughs: Process and Query Data In Place with Amazon S3 Select...
Amazon Web Services
 
ABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS GlueABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS Glue
Amazon Web Services
 
ABD311_Deploying Amazon QuickSight For Enterprise
ABD311_Deploying Amazon QuickSight For EnterpriseABD311_Deploying Amazon QuickSight For Enterprise
ABD311_Deploying Amazon QuickSight For Enterprise
Amazon Web Services
 
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
Amazon Web Services
 
Migrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data LakeMigrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data Lake
Amazon Web Services
 
ABD324_Migrating Your Oracle Data Warehouse to Amazon Redshift Using AWS DMS ...
ABD324_Migrating Your Oracle Data Warehouse to Amazon Redshift Using AWS DMS ...ABD324_Migrating Your Oracle Data Warehouse to Amazon Redshift Using AWS DMS ...
ABD324_Migrating Your Oracle Data Warehouse to Amazon Redshift Using AWS DMS ...
Amazon Web Services
 
DAT324_Expedia Flies with DynamoDB Lightning Fast Stream Processing for Trave...
DAT324_Expedia Flies with DynamoDB Lightning Fast Stream Processing for Trave...DAT324_Expedia Flies with DynamoDB Lightning Fast Stream Processing for Trave...
DAT324_Expedia Flies with DynamoDB Lightning Fast Stream Processing for Trave...
Amazon Web Services
 
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
Amazon Web Services
 
Graph and Amazon Neptune
Graph and Amazon NeptuneGraph and Amazon Neptune
Graph and Amazon Neptune
Amazon Web Services
 
(BDT306) Mission-Critical Stream Processing with Amazon EMR and Amazon Kinesi...
(BDT306) Mission-Critical Stream Processing with Amazon EMR and Amazon Kinesi...(BDT306) Mission-Critical Stream Processing with Amazon EMR and Amazon Kinesi...
(BDT306) Mission-Critical Stream Processing with Amazon EMR and Amazon Kinesi...
Amazon Web Services
 
Building Data Lakes with AWS
Building Data Lakes with AWSBuilding Data Lakes with AWS
Building Data Lakes with AWS
Amazon Web Services
 
How Twilio Scaled Its Data Driven Culture - ABD309 - re:Invent 2017
How Twilio Scaled Its Data Driven Culture - ABD309 - re:Invent 2017How Twilio Scaled Its Data Driven Culture - ABD309 - re:Invent 2017
How Twilio Scaled Its Data Driven Culture - ABD309 - re:Invent 2017
Amazon Web Services
 
The AWS Big Data Platform – Overview
The AWS Big Data Platform – OverviewThe AWS Big Data Platform – Overview
The AWS Big Data Platform – Overview
Amazon Web Services
 
Using Graph Databases
Using Graph DatabasesUsing Graph Databases
Using Graph Databases
Amazon Web Services
 
AWS 機器學習 I ─ 人工智慧 AI
AWS 機器學習 I ─ 人工智慧 AIAWS 機器學習 I ─ 人工智慧 AI
AWS 機器學習 I ─ 人工智慧 AI
Amazon Web Services
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
Amazon Web Services
 

What's hot (20)

ABD316_American Heart Association Finding Cures to Heart Disease Through the ...
ABD316_American Heart Association Finding Cures to Heart Disease Through the ...ABD316_American Heart Association Finding Cures to Heart Disease Through the ...
ABD316_American Heart Association Finding Cures to Heart Disease Through the ...
 
How to build a data lake with aws glue data catalog (ABD213-R) re:Invent 2017
How to build a data lake with aws glue data catalog (ABD213-R)  re:Invent 2017How to build a data lake with aws glue data catalog (ABD213-R)  re:Invent 2017
How to build a data lake with aws glue data catalog (ABD213-R) re:Invent 2017
 
Best Practices for Building a Data Lake in Amazon S3 and Amazon Glacier, with...
Best Practices for Building a Data Lake in Amazon S3 and Amazon Glacier, with...Best Practices for Building a Data Lake in Amazon S3 and Amazon Glacier, with...
Best Practices for Building a Data Lake in Amazon S3 and Amazon Glacier, with...
 
Technology Trends in Data Processing - DAT311 - re:Invent 2017
Technology Trends in Data Processing - DAT311 - re:Invent 2017Technology Trends in Data Processing - DAT311 - re:Invent 2017
Technology Trends in Data Processing - DAT311 - re:Invent 2017
 
Big Data Breakthroughs: Process and Query Data In Place with Amazon S3 Select...
Big Data Breakthroughs: Process and Query Data In Place with Amazon S3 Select...Big Data Breakthroughs: Process and Query Data In Place with Amazon S3 Select...
Big Data Breakthroughs: Process and Query Data In Place with Amazon S3 Select...
 
ABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS GlueABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS Glue
 
ABD311_Deploying Amazon QuickSight For Enterprise
ABD311_Deploying Amazon QuickSight For EnterpriseABD311_Deploying Amazon QuickSight For Enterprise
ABD311_Deploying Amazon QuickSight For Enterprise
 
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
 
Migrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data LakeMigrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data Lake
 
ABD324_Migrating Your Oracle Data Warehouse to Amazon Redshift Using AWS DMS ...
ABD324_Migrating Your Oracle Data Warehouse to Amazon Redshift Using AWS DMS ...ABD324_Migrating Your Oracle Data Warehouse to Amazon Redshift Using AWS DMS ...
ABD324_Migrating Your Oracle Data Warehouse to Amazon Redshift Using AWS DMS ...
 
DAT324_Expedia Flies with DynamoDB Lightning Fast Stream Processing for Trave...
DAT324_Expedia Flies with DynamoDB Lightning Fast Stream Processing for Trave...DAT324_Expedia Flies with DynamoDB Lightning Fast Stream Processing for Trave...
DAT324_Expedia Flies with DynamoDB Lightning Fast Stream Processing for Trave...
 
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
 
Graph and Amazon Neptune
Graph and Amazon NeptuneGraph and Amazon Neptune
Graph and Amazon Neptune
 
(BDT306) Mission-Critical Stream Processing with Amazon EMR and Amazon Kinesi...
(BDT306) Mission-Critical Stream Processing with Amazon EMR and Amazon Kinesi...(BDT306) Mission-Critical Stream Processing with Amazon EMR and Amazon Kinesi...
(BDT306) Mission-Critical Stream Processing with Amazon EMR and Amazon Kinesi...
 
Building Data Lakes with AWS
Building Data Lakes with AWSBuilding Data Lakes with AWS
Building Data Lakes with AWS
 
How Twilio Scaled Its Data Driven Culture - ABD309 - re:Invent 2017
How Twilio Scaled Its Data Driven Culture - ABD309 - re:Invent 2017How Twilio Scaled Its Data Driven Culture - ABD309 - re:Invent 2017
How Twilio Scaled Its Data Driven Culture - ABD309 - re:Invent 2017
 
The AWS Big Data Platform – Overview
The AWS Big Data Platform – OverviewThe AWS Big Data Platform – Overview
The AWS Big Data Platform – Overview
 
Using Graph Databases
Using Graph DatabasesUsing Graph Databases
Using Graph Databases
 
AWS 機器學習 I ─ 人工智慧 AI
AWS 機器學習 I ─ 人工智慧 AIAWS 機器學習 I ─ 人工智慧 AI
AWS 機器學習 I ─ 人工智慧 AI
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 

BigDL Deep Learning in Apache Spark - AWS re:invent 2017

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS re:INVENT Large Scale Deep Learning with BigDL
  • 2. Large Scale Deep Learning with BigDL. Tim Fox | Big Data and Machine Learning Consultant | Elephant Scale
  • 3. ABOUT ME: Tim Fox, Principal @ Elephant Scale. Practitioner and Trainer in Data Engineering and Data Science. Author of “Data Science in Python” on LinkedIn Learning. tim@elephantscale.com. LinkedIn: tim-fox-0063541
  • 4. ABOUT Elephant Scale • Training in Big Data and AI technologies • Big Data: Spark, Hadoop, Cloud, NoSQL, Streaming • AI: Machine Learning, Deep Learning, BigDL, Tensorflow • BigDL training available! • Public and Private trainings available • BigDL Sandbox: elephantscale.com/sandbox • Elephantscale.com • info@elephantscale.com
  • 5. Quick Roundup of AI / Machine Learning / Deep Learning
  • 6. AI / MACHINE LEARNING / DEEP LEARNING • Artificial Intelligence (AI): Broader concept of machines being able to carry out 'smart' tasks • Machine Learning: A type of AI that allows software to learn from data without being explicitly programmed • Deep Learning: Using Neural Networks to solve some hard problems • Diagram labels: Artificial Intelligence, Machine Learning, Deep Learning
  • 7. DEEP LEARNING APPLICATIONS • Self-Driving Cars: ML system using image recognition; detects where the edge of the road, a road sign, or the car in front is • Face recognition: Facebook images; the system learns from manually tagged images and then automatically detects faces in uploaded photos
  • 8. DEEP LEARNING HISTORY. Early attempts at Deep Learning did not succeed: • Compute power was insufficient at the time • Training datasets were too small for good results • We lacked the ability to parallelize our work. In the modern era, Deep Learning has been successful: • 'Big Data': we now have plenty of data to train our models • 'Big Data ecosystem': excellent big data platforms (Hadoop, Spark, NoSQL) are available as open source • 'Big Compute': cloud platforms have significantly lowered the barrier to massive compute power • $1 buys you a 16-core, 128 GB, 10 Gigabit machine for 1 hour on AWS! • So running a 100-node cluster for 5 hours costs about $500
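The cost figure on slide 8 is simple multiplication, and a minimal sketch makes it easy to vary. The $1 per node-hour price is the slide's own 2017 figure, not a current quote, and the function name `cluster_cost` is illustrative only.

```python
# Back-of-the-envelope cluster cost, using the figures quoted on slide 8.
# Assumption: a flat price per node-hour (the slide's $1 figure from 2017).
PRICE_PER_NODE_HOUR_USD = 1.0


def cluster_cost(nodes: int, hours: float,
                 price_per_node_hour: float = PRICE_PER_NODE_HOUR_USD) -> float:
    """Return the total cost in USD of running `nodes` machines for `hours`."""
    return nodes * hours * price_per_node_hour


total = cluster_cost(nodes=100, hours=5)
print(f"${total:,.0f}")  # prints "$500"
```

Changing `nodes`, `hours`, or the price shows how quickly the estimate moves; the sketch ignores storage, data transfer, and any discounted pricing.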
  • 9. AI SOFTWARE ECOSYSTEM (Machine Learning | Deep Learning) • Java: Weka, Mahout | DeepLearning4J • Python: SciKit | Tensorflow, Theano, Caffe • R: many libraries | Deepnet, Darch • Distributed: H2O, Spark | H2O, Spark, BigDL • Cloud: AWS | AWS
  • 10. MACHINE LEARNING AND BIG DATA • Until recently, most machine learning was done on a “single computer” (with lots of memory, 100s of GBs) • Most R/Python/Java libraries are “single node based” • Now Big Data tools make it possible to run machine learning algorithms at massive scale, distributed across a cluster
  • 11. MODERN DEEP LEARNING FRAMEWORKS
  • 12. TOOLS FOR SCALABLE MACHINE LEARNING • Apache Spark ML: runs on top of popular Spark framework; massively scalable; can use memory (caching) effectively for iterative algorithms; language support: Scala, Java, Python, R • BigDL: built for Apache Spark and optimized for Intel Xeon; language support: Scala, Java, Python • TensorFlow: based on “data flow graphs”; language support: Python, C++; https://www.tensorflow.org/
  • 13. TOOLS FOR SCALABLE CLOUD MACHINE LEARNING • Amazon Machine Learning: ready-to-go algorithms; visualization tools; wizards to guide; scalable on Amazon Cloud
  • 14. BigDL
  • 15. WHAT IS BIGDL • A distributed deep learning library for Apache Spark • Feature parity with popular deep learning frameworks: Caffe, Torch, Tensorflow • High performance: powered by Intel Math Kernel Library (MKL) and multi-threaded programming • Can scale to huge datasets: using Apache Spark for scale • Open source! (Dec 2016) • Active development
  • 16. PRODUCTION ML/DL SYSTEMS ARE COMPLEX! • Actual ML/DL is only a small portion of a massive production system • BigDL running on a scalable platform like Spark helps simplify the complexity
  • 17. BIGDL FILLS THE 'GAP' IN BIG DATA + DEEP LEARNING • Follows proven design patterns for dealing with Big Data • Sends 'compute to data' rather than reading massive data over the network • Uses 'data locality' of HDFS (Hadoop Distributed File System) • Utilizes 'cluster managers' like YARN / Mesos: automatically handles hardware/software failures; elasticity and resource sharing in a cluster
  • 18. BIGDL & SPARK • Run BigDL applications as Spark applications • Scala, Java, and Python support • Use Spark's other features: in-memory compute; integration with Spark ML and Streaming • Easy development with Jupyter Notebook
  • 19. BIGDL VS TENSORFLOW (BigDL | Tensorflow) • Runtime: Scala engine with Python front-end | C++ engine with Python front-end • Hadoop compatibility: can run natively on Spark & Hadoop | accesses Hadoop data as a client only • Distributed operation: scalable with Apache Spark for massive scale out of the box | does not support massive distribution out of the box • Runs Tensorflow models: yes | yes • Acceleration: CPU w/MKL | CPU/GPU • Summary: excellent for distributing deep-learning models to massive scale on big data, great TCO value | excellent library for small-to-medium scale data, although GPU hardware costs can be significant
  • 20. BIGDL: BIG COMPUTE PLUS BIG DATA • BigDL helps us balance our needs: Big Compute (fast linear algebra with the Intel MKL library, optimized for Intel Xeon) and Big Data (I/O parallelized to run on many CPUs) • BigDL allows massive scalability: natively designed to run on Spark; works with the Hadoop ecosystem (via Spark); Hadoop is THE Big Data platform for on-premise deployments • Plays nicely with other deep learning frameworks: use existing Tensorflow or Caffe models at scale in BigDL; train new models based on existing TF / Caffe models
  • 21. BIGDL USE CASES • Fraud detection • Sentiment analysis • Image recognition • Find more at: https://github.com/intel-analytics/analytics-zoo/
  • 22. GPUs and CPUs
  • 23. GPUS (GRAPHICS PROCESSING UNITS) • GPUs have addressed past issues in training performance; example: Tensorflow is optimized to run well on GPUs • In the past, CPUs were not vectorized for parallel compute, which meant that GPUs were much faster for deep learning • Modern Intel Xeon CPUs have vectorized linear algebra: properly optimized, they approach the speed of GPUs; CPUs are now a credible alternative to running on GPUs; cost advantage and scalability
  • 24. INTEL MATH KERNEL LIBRARY (MKL) • Features highly optimized, threaded, and vectorized math functions that maximize performance on each processor family • Utilizes industry-standard C and Fortran APIs for compatibility with popular BLAS, LAPACK, and FFTW functions; no code changes required • Dispatches optimized code for each processor automatically without the need to branch code • Provides priority support, connecting you directly to Intel engineers for confidential answers to technical questions
  • 25. INTEL MKL PERFORMANCE
  • 26. CPU VERSUS GPU FOR BIG DATA • CPU offers higher scalability at lower cost versus GPU • Optimized software and libraries on CPU allow single-node performance to approach GPU performance • GPU plus CPU architectures can be effective for a smaller number of nodes, when cost is not a concern • “Big Compute” versus “Big Data”
  • 27. Running BigDL
  • 28. RUNNING BIGDL • Developing: use the following to develop your BigDL apps effortlessly: Docker, VM Sandbox • Deploying: cloud-ready deployment: Amazon AMI
  • 29. DEMO: GETTING STARTED WITH BIGDL • We will provide: Docker, Sandbox VM, AWS Marketplace AMI
  • 30. BigDL Summary • BigDL offers outstanding scalability and performance • BigDL optimizes TCO by being tuned and optimized for Intel Xeon processors • BigDL brings deep learning to Spark clusters and Hadoop datasets • BigDL can be used to deploy Tensorflow and Caffe models to big data
  • 31. IMAGE RECOGNITION WITH APACHE SPARK AND BIGDL. Alex Kalinin | VP, AI/Machine Learning | Sizmek
  • 32. ABOUT ME: Alex Kalinin, VP, AI/Machine Learning | Sizmek. alex.kalinin@sizmek.com. LinkedIn: linkedin.com/in/alexkalinin/
  • 33. AI-POWERED MARKETING AND OPTIMIZATION • 100,000,000,000 requests per day • 70,000,000 / minute • 1,200,000 / sec • PBs of training data • https://www.linkedin.com/in/alexkalinin/ • alex.kalinin@sizmek.com
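The per-minute and per-second rates on slide 33 follow from the daily request count. A short sketch of the conversion, assuming the load is spread evenly over the day (real traffic has peaks, so these are averages):

```python
# Convert the slide's daily request volume into average per-minute and
# per-second rates. Assumption: uniform load across 24 hours.
REQUESTS_PER_DAY = 100_000_000_000

per_minute = REQUESTS_PER_DAY / (24 * 60)
per_second = REQUESTS_PER_DAY / (24 * 60 * 60)

print(f"{per_minute:,.0f} requests/minute")  # about 69.4 million
print(f"{per_second:,.0f} requests/second")  # about 1.16 million
```

The slide's 70,000,000 per minute and 1,200,000 per second are these averages rounded up.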
  • 34. FEED-FORWARD NETWORK
  • 35. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 43 37 45 40 𝑦 = (𝑤𝑖 ∗ 𝑥𝑖) ? ? ? ? ? ?
  • 36. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.53 0.01 -0.17 0.70 0.51 ? ? 𝑦 = (𝑤𝑖 ∗ 𝑥𝑖) 43 37 45 40 ? ? ? ?
  • 37. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.53 0.01 -0.17 0.70 0.51 ? ? 𝑦 = (𝑤𝑖 ∗ 𝑥𝑖) 43 37 45 40 -1.56 ? ? ?
  • 38. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.53 0.01 -0.17 0.70 0.51 ? ? 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 -1.56 ? ? ?
  • 39. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.53 0.01 -0.17 0.70 0.51 ? ? 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 ? ? ? 0
  • 40. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 43 37 45 40 -0.12 0.13 0.21 -0.07 -0.05 ? ? 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) ? ? 0 11.9
  • 41. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ? ? 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 -.11 ? 0 11.9
  • 42. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ? ? 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 ? 0 11.9 0
  • 43. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ? ? 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 0.15 0 11.9 0
  • 44. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.67 ? 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 0.15 0 11.9 0
  • 45. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 0 ? 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 0.15 0 11.9 0
  • 46. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 0 0.52 𝑦 = 𝑅𝑒𝐿𝑈( 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 0.15 0 11.9 0
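The first node computed in the walkthrough above can be reproduced with a short sketch. It assumes the first four numbers shown on the slide (-0.53, 0.01, -0.17, 0.70) are the weights for the four inputs and that the fifth number (0.51) is a bias term; the slides do not label them, but under that reading the weighted sum matches the -1.56 shown before ReLU is applied. The function names `relu` and `neuron` are illustrative only.

```python
def relu(z):
    """Rectified linear unit: negative values become 0."""
    return max(0.0, z)


def neuron(inputs, weights, bias=0.0):
    """Weighted sum of the inputs plus a bias, before any activation."""
    return sum(w * x for w, x in zip(weights, inputs)) + bias


inputs = [43, 37, 45, 40]
weights = [-0.53, 0.01, -0.17, 0.70]
bias = 0.51  # assumption: the fifth number on the slide is a bias term

pre_activation = neuron(inputs, weights, bias)
activation = relu(pre_activation)

print(round(pre_activation, 2))  # -1.56
print(activation)                # 0.0
```

Because the weighted sum is negative, ReLU clamps the node to 0, which is the value the later slides carry forward.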
  • 47. FEED-FORWARD NETWORK
  • 51. FULLY CONNECTED
  • 52. FULLY CONNECTED
  • 53. FULLY CONNECTED
  • 54. FULLY CONNECTED Input size: 40,000 (200 × 200). Connections: 1,600,000,000 (200 × 200 to 200 × 200). 10 layers: 16 billion
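The connection counts on this slide follow from simple arithmetic. The sketch below assumes each fully connected layer has as many units as the 200 × 200 input (40,000) and counts one weight per input-output pair, ignoring biases; `dense_connections` is a hypothetical helper name.

```python
def dense_connections(n_in, n_out):
    """Number of weights in a fully connected layer (biases ignored)."""
    return n_in * n_out


input_size = 200 * 200                                  # 40,000 pixels
per_layer = dense_connections(input_size, input_size)   # every pixel to every unit
total = 10 * per_layer                                  # ten such layers

print(f"{input_size:,}")  # 40,000
print(f"{per_layer:,}")   # 1,600,000,000
print(f"{total:,}")       # 16,000,000,000
```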
  • 55. INVENTION OF CONVOLUTIONAL NEURAL NETWORKS • LeNet-5 network developed in 1998 by Yann LeCun • David Hubel and Torsten Wiesel
  • 56. HUBEL & WIESEL https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1363130/
  • 57. HIERARCHICAL & LOCAL VISUAL CORTEX
  • 58. HIERARCHICAL & LOCAL VISUAL CORTEX Lines, Dots Orientation, Movement High-Level Shapes
  • 59. KEY FEATURES OF CONVOLUTIONAL NETWORK • Convolution • Pooling
  • 60. FULLY CONNECTED
  • 61. CONVOLUTION
  • 62. CONVOLUTION
  • 63. CONVOLUTION
  • 64. CONVOLUTION
  • 65. CONVOLUTION Only four weights
  • 66. CONVOLUTION Filter: 0.10, -0.06, 0.24, 0.17
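To illustrate how a filter with only four weights covers a whole image, here is a minimal sketch of a 2 × 2 convolution. It assumes the four filter values on the slide are laid out row by row, uses stride 1 with no padding, and applies the filter without flipping it (the cross-correlation form commonly used in CNNs). The 3 × 3 input image is hypothetical and not taken from the slides.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image (stride 1, no padding)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            # Same four weights are reused at every position.
            row.append(sum(kernel[i][j] * image[r + i][c + j]
                           for i in range(kh) for j in range(kw)))
        out.append(row)
    return out


# Assumption: slide values read row by row into a 2x2 filter.
kernel = [[0.10, -0.06],
          [0.24, 0.17]]

# Hypothetical 3x3 input image.
image = [[1, 0, 0],
         [0, 1, 0],
         [0, 0, 1]]

feature_map = conv2d(image, kernel)
for row in feature_map:
    print([round(v, 2) for v in row])
```

However large the image, the layer still learns only these four weights, in contrast to the billions of connections counted for the fully connected case.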
  • 67. POOLING Source: https://cs231n.github.io/convolutional-networks/
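The slide does not say which pooling variant is shown, so the sketch below assumes max pooling with a 2 × 2 window and stride 2, a common choice. The 4 × 4 input is a hypothetical feature map, and `max_pool` is an illustrative name.

```python
def max_pool(feature_map, size=2, stride=2):
    """Keep the largest value in each size x size window."""
    out = []
    for r in range(0, len(feature_map) - size + 1, stride):
        row = []
        for c in range(0, len(feature_map[0]) - size + 1, stride):
            row.append(max(feature_map[r + i][c + j]
                           for i in range(size) for j in range(size)))
        out.append(row)
    return out


# Hypothetical 4x4 feature map.
fmap = [[1, 1, 2, 4],
        [5, 6, 7, 8],
        [3, 2, 1, 0],
        [1, 2, 3, 4]]

pooled = max_pool(fmap)
print(pooled)  # [[6, 8], [3, 4]]
```

Pooling halves each spatial dimension here, so later layers see a smaller input while the strongest responses are kept.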
  • 68. CONVOLUTIONAL NETWORK Input, Convolution, Pooling, Convolution, Pooling, FC, FC. Output over classes 0 1 2 3 4 5 6 7 8 9: 0 0 0 0 1 0 0 0 0 0. Source: https://www.clarifai.com/technology
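A quick way to follow this layer sequence is to track how the spatial size changes from stage to stage. The sketch below makes several assumptions the slide does not state: a 28 × 28 input (as in MNIST digit images), 5 × 5 convolution kernels with stride 1 and no padding, and 2 × 2 pooling with stride 2. The helper names are illustrative.

```python
def conv_output_size(size, kernel, stride=1, padding=0):
    """Spatial size after a convolution along one dimension."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_output_size(size, window, stride=None):
    """Spatial size after pooling; stride defaults to the window size."""
    stride = window if stride is None else stride
    return (size - window) // stride + 1


# Assumed stack: conv 5x5, pool 2x2, conv 5x5, pool 2x2.
layers = [("conv", 5), ("pool", 2), ("conv", 5), ("pool", 2)]

size = 28  # assumption: 28x28 input image
sizes = [size]
for kind, k in layers:
    if kind == "conv":
        size = conv_output_size(size, k)
    else:
        size = pool_output_size(size, k)
    sizes.append(size)

print(sizes)  # [28, 24, 12, 8, 4]
```

Under these assumptions the fully connected layers at the end operate on 4 × 4 feature maps rather than on the full image.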
  • 69. http://scs.ryerson.ca/~aharley/vis/conv/flat.html
  • 70. DEMO GitHub: https://github.com/alex-kalinin/lenet-bigdl
  • 71. Getting Started With BigDL
  • 72. ABOUT ME Sujee Maniyam, Founder / Principal @ Elephant Scale. Practitioner and Trainer in Data Engineering and Data Science. Author: "Hadoop and Spark" video training on O'Reilly Media; "HBase Design Patterns"; "Hadoop illuminated". sujee@elephantscale.com LinkedIn: linkedin.com/in/sujeemaniyam
  • 73. RUNNING BIGDL Developing: use the following to develop your BigDL apps effortlessly • Docker • VM Sandbox • Amazon AMI Deploying: cloud-ready deployment • Amazon AMI
  • 74. GETTING STARTED WITH BIGDL We will demonstrate: • Docker • Sandbox VM • AWS Marketplace AMI
  • 75. Docker
    Step 1: Install Docker on your laptop
    Step 2: Get the Docker image
      docker pull elephantscale/bigdl-sandbox
    Step 3: Download the tutorials
      git clone https://github.com/elephantscale/bigdl-tutorials
    Step 4: Launch Docker
      cd bigdl-tutorials
      ./run-bigdl-docker.sh elephantscale/bigdl-sandbox
    Step 5: Go to the Jupyter notebook
  • 76. VM-Sandbox
    Step 1: Install VMware Player or VirtualBox on your laptop
    Step 2: Download the BigDL-Sandbox image
      http://elephantscale.com/sandbox/
    Step 3: (In Sandbox) Download the tutorials
      git clone https://github.com/elephantscale/bigdl-tutorials
    Step 4: (In Sandbox) Run BigDL natively
      cd bigdl-tutorials
      ./run-bigdl-native.sh
    Step 5: (In Sandbox) Go to the Jupyter notebook
  • 77. Docker on AWS
    Step 1: Spin up an AMI (Ubuntu recommended)
    Step 2: Install Docker on the instance
    Step 3: Get the Docker image
      docker pull elephantscale/bigdl-sandbox
    Step 4: Download the tutorials
      git clone https://github.com/elephantscale/bigdl-tutorials
    Step 5: Launch Docker
      cd bigdl-tutorials
      ./run-bigdl-docker.sh elephantscale/bigdl-sandbox
    Step 6: Go to the Jupyter notebook
  • 78. AMI on AWS
    Step 1: Spin up the BigDL AMI
    Step 2: Download the tutorials
      git clone https://github.com/elephantscale/bigdl-tutorials
    Step 3: Run BigDL
      cd bigdl-tutorials
      ./run-bigdl-native.sh
    Step 4: Go to the Jupyter notebook
  • 79. QUESTIONS GitHub: https://github.com/alex-kalinin/lenet-bigdl LinkedIn: https://www.linkedin.com/in/alexkalinin/
  • 80. Notebooks and Resources • BigDL: software.intel.com/bigdl • Tutorials: github.com/dnielsen/bigdl-resources • Sandbox: elephantscale.com/sandbox • BigDL AMI: aws.amazon.com/marketplace/ • Training: elephantscale.com • Slides: slideshare.net/dcnielsen/ Presenters: Tim Fox (Elephant Scale), Sujee Maniyam (Elephant Scale), Alex Kalinin (Sizmek), Dave Nielsen (Intel Software)
  • 81. THANK YOU!

Editor's Notes

  2. http://www.teglor.com/b/deep-learning-libraries-language-cm569/
  3. Image credit: Wikimedia, CCZero license: https://commons.wikimedia.org/wiki/File:Driver_free_car.jpg Image credit: Wikimedia, Creative Commons: https://commons.wikimedia.org/wiki/File:Face_detection.jpg
  4. http://www.kdnuggets.com/2016/04/top-15-frameworks-machine-learning-experts.html http://www.infoworld.com/article/2853707/machine-learning/11-open-source-tools-machine-learning.html https://aws.amazon.com/machine-learning/
  6. Image credit: laptop, pixabay, Creative Commons: https://www.pexels.com/photo/author-blog-create-creative-267569/ Image credit: Creative Commons: https://pixabay.com/en/cloud-computing-black-white-1924338/