SlideShare a Scribd company logo
deepscale.ai
F O R R E S T 	
   I A N D O L A
Co-­‐founder	
  and	
  CEO,	
  DeepScale
How	
  to	
  become
a	
  full-­‐stack	
  deep	
  learning	
  engineer
OUTLINE
• Part	
  1:	
  My	
  adventures	
  in	
  entrepreneurship	
  and	
  deep	
  learning
• How	
  I	
  became	
  a	
  full-­‐stack	
  DL	
  engineer
• DeepScale's	
  founding	
  story
• Part	
  2:	
  Full-­‐stack	
  deep	
  learning	
  engineering
• What	
  is	
  the	
  full	
  stack?
• How	
  to	
  organize	
  a	
  full-­‐stack	
  DL	
  team
• How	
  to	
  become	
  a	
  full-­‐stack	
  DL	
  engineer
PART	
  1
My	
  adventures	
  in	
  entrepreneurship	
  and	
  deep	
  learning
For	
  as	
  long	
  as	
  I	
  can	
  remember,	
  I	
  have	
  been	
  obsessed	
  with	
  cars	
  and	
  trucks
Fo's	
  
Truck	
  
Company
Illinois	
  Math	
  and	
  Science	
  Academy	
  (2005)
Mechanical	
  Engineering	
  Building
at	
  University	
  of	
  Illinois	
  (2008)
Siebel	
  Center	
  for	
  Computer	
  Science
at	
  University	
  of	
  Illinois	
  (2008)
What	
  my	
  classmates	
  planned	
  to	
  do	
  after	
  finishing	
  
their	
  undergraduate	
  degree	
  in	
  CS	
  at	
  University	
  of	
  Illinois	
  (2012)
Entry-­‐level	
  jobs	
  at	
  
Big	
  Companies
Starting	
  Companies
…and	
  countless	
  smartphone	
  
app	
  startups
Grad	
  
School
Starting	
  my	
  PhD	
  in	
  EECS	
  at	
  UC	
  Berkeley	
  (2012)
Kurt	
  Keutzer
Our	
  full-­‐stack	
  approach
FOR	
  DEEP	
  NEURAL	
  NETWORKS	
  v0.1	
  (2016)
Train	
  rapidly	
  using	
  
multiprocessor	
  scaling
Collect	
  and	
  annotate	
  
adequate	
  training	
  data
Develop	
  small	
  and	
  efficient	
  deep	
  
neural	
  network	
  architectures
Create	
  efficient	
  implementations	
  
for	
  embedded	
  hardware
F IREC AF F E	
  [ 1] T H E 	
   D E E P S C A L E 	
   C A R
SQ U EEZEN ET	
  [ 2]
B O DA	
  [ 3]
[1]	
  F.N.	
  Iandola,	
  K.	
  Ashraf,	
  M.W.	
  Moskewicz,	
  and	
  K.	
  Keutzer.	
  FireCaffe:	
  
near-­‐linear	
  acceleration	
  of	
  deep	
  neural	
  network	
  training	
  on	
  compute	
  
clusters.	
  CVPR,	
  2016.
[2]	
  F.N.	
  Iandola,	
  M.	
  Moskewicz,	
  K.	
  Ashraf,	
  S.	
  Han,	
  W.	
  Dally,	
  K.	
  Keutzer.	
  
SqueezeNet:	
  AlexNet-­‐level	
  accuracy	
  with	
  50x	
  fewer	
  parameters	
  and	
  
<1MB	
  model	
  size.	
  arXiv,	
  2016.
[3]	
  M.W.	
  Moskewicz,	
  F.N.	
  Iandola,	
  K.	
  Keutzer.	
  Boda-­‐RTC:	
  Productive	
  
Generation	
  of	
  Portable,	
  Efficient	
  Code	
  for	
  Convolutional	
  Neural	
  
Networks	
  on	
  Mobile	
  Computing	
  Platforms.	
  WiMob,	
  2016.	
  
DeepScaleTHE	
  BEGINNING	
  (2015)
50x	
  speedup
DeepScaleHOW	
  OUR	
  BUSINESS	
  MODEL	
  CRYSTALLIZED	
  (2016)
First	
  we	
  learned…
• Automakers	
  have	
  gathered	
  lots	
  of	
  data	
  for	
  training	
  computer	
  vision	
  models	
  – for	
  
use	
  in	
  autonomous	
  driving
• Training	
  deep	
  neural	
  nets	
  on	
  these	
  data	
  volumes	
  is	
  really	
  slow	
  (months	
  or	
  even	
  
years)
• DeepScale's	
  distributed	
  training	
  system	
  can	
  help
After	
  more	
  discussions,	
  we	
  learned…
• Automakers	
  would	
  prefer	
  to	
  just	
  buy	
  the	
  "right"	
  deep	
  neural	
  networks,	
  already	
  
trained
• For	
  mass-­‐production,	
  automakers	
  have	
  really small	
  processors	
  (100x	
  less	
  
computation	
  than	
  full-­‐size	
  NVIDIA	
  GPUs)
Our	
  full-­‐stack	
  approach
FOR	
  DEEP	
  NEURAL	
  NETWORKS	
  v0.1	
  (2016)
Train	
  rapidly	
  using	
  
multiprocessor	
  scaling
Collect	
  and	
  annotate	
  
adequate	
  training	
  data
Develop	
  small	
  and	
  efficient	
  deep	
  
neural	
  network	
  architectures
Create	
  efficient	
  implementations	
  
for	
  embedded	
  hardware
F IREC AF F E	
  [ 1] T H E 	
   D E E P S C A L E 	
   C A R
SQ U EEZEN ET	
  [ 2]
B O DA	
  [ 3]
[1]	
  F.N.	
  Iandola,	
  K.	
  Ashraf,	
  M.W.	
  Moskewicz,	
  and	
  K.	
  Keutzer.	
  FireCaffe:	
  
near-­‐linear	
  acceleration	
  of	
  deep	
  neural	
  network	
  training	
  on	
  compute	
  
clusters.	
  CVPR,	
  2016.
[2]	
  F.N.	
  Iandola,	
  M.	
  Moskewicz,	
  K.	
  Ashraf,	
  S.	
  Han,	
  W.	
  Dally,	
  K.	
  Keutzer.	
  
SqueezeNet:	
  AlexNet-­‐level	
  accuracy	
  with	
  50x	
  fewer	
  parameters	
  and	
  
<1MB	
  model	
  size.	
  arXiv,	
  2016.
[3]	
  M.W.	
  Moskewicz,	
  F.N.	
  Iandola,	
  K.	
  Keutzer.	
  Boda-­‐RTC:	
  Productive	
  
Generation	
  of	
  Portable,	
  Efficient	
  Code	
  for	
  Convolutional	
  Neural	
  
Networks	
  on	
  Mobile	
  Computing	
  Platforms.	
  WiMob,	
  2016.	
  
DeepScale
PERCEPTION	
  SYSTEMS	
  
FOR	
  AUTOMATED	
  VEHICLES
http://deepscale.ai
My	
  story	
  in	
  one	
  slide
1993:	
  Planned	
  to	
  start	
  "Forrest's	
  Truck	
  Company"	
  when	
  I	
  grow	
  up
2008:	
  Went	
  to	
  college	
  at	
  University	
  of	
  Illinois	
  with	
  the	
  plan	
  to	
  study	
  Mechanical	
  Engineering	
  and	
  then	
  go	
  
into	
  automotive	
  industry
2009:	
  Switched	
  to	
  computer	
  science;	
  gave	
  up	
  my	
  dreams	
  of	
  working	
  in	
  automotive
2012:	
  Considered	
  starting	
  a	
  startup,	
  but	
  choose	
  grad	
  school	
  instead
2013:	
  Started	
  doing	
  research	
  in	
  deep	
  learning
2015:	
  Started	
  DeepScale
2016:	
  Graduated	
  with	
  PhD	
  in	
  EECS
2016:	
  Focused	
  DeepScale	
  entirely	
  on	
  the	
  automotive	
  industry
2018:	
  Today,	
  DeepScale	
  is	
  on	
  track	
  to	
  supply	
  lifesaving	
  deep	
  learning	
  software	
  to	
  automakers
You	
  can't	
  connect	
  the	
  dots	
  looking	
  forward;	
  you	
  can	
  only	
  connect	
  them	
  looking	
  backwards.	
  
So,	
  you	
  have	
  to	
  trust	
  that	
  the	
  dots	
  will	
  somehow	
  connect	
  in	
  your	
  future."	
  – Steve	
  Jobs
PART	
  2
The	
  playbook	
  for	
  full-­‐stack	
  deep	
  learning
Data
Infrastructure
Models
Applications
The	
  full	
  system	
  stack
FOR	
  DEEP	
  NEURAL	
  NETWORKS	
  v1.0 (2018)
Data
Data	
  Collection Data	
  Annotation Task	
  Definition
Accuracy	
  MetricsData	
  Storage Simulation
Models
Design	
  Space	
  
ExplorationNew	
  Layer	
  Types
New	
  Objective	
  
Functions
DNN	
  Model	
  
Structure
Preprocessing	
  &	
  
Featurization
Quantization	
  &	
  
Compression
Infrastructure
DNN	
  Frameworks
Visualization	
  Tools
Deploy	
  on	
  cloud	
  &	
  embedded	
  hardware
Efficient	
  Computational	
  Kernels
Train	
  on	
  large-­‐scale	
  hardware
Distributed	
  Communication
Applications
GADGETSDATACENTERS
SOCIAL	
  MEDIA	
  ANALYSIS
WEB	
  INDEXING
GOVERNMENT	
  INTELLIGENCE
DRONES
SELF-­‐DRIVING	
  CARSSMARTPHONES
OTHER	
  ROBOTICS
Traditional	
  Workflow	
  for	
  
DL	
  Research	
  and	
  Engineering
Models Infrastructure
(FYI,	
  these	
  are	
  supposed	
  to	
  be	
  silos)
Applications
Data
Breaking	
  Down	
  Silos	
  in
DL	
  Research	
  and	
  Engineering
Data
Models
Infrastructure
Applications
How	
  full-­‐stack	
  DL	
  teams	
  produce	
  
full-­‐stack	
  engineers
Data
Models
Infrastructure
Applications
So,	
  how	
  do	
  I	
  get	
  started?
First	
  go	
  deep
Bring	
  your	
  skills	
  to	
  a	
  full-­‐stack	
  Deep	
  Learning	
  team
• I	
  bet	
  you're	
  already	
  pretty	
  good	
  some	
  piece	
  of	
  the	
  stack
• So,	
  start	
  by	
  applying	
  your	
  existing	
  skills to	
  an	
  DL-­‐related	
  project
• Work	
  with	
  a	
  group	
  who	
  has	
  complementary	
  skills
• Or,	
  choose	
  a	
  project	
  that	
  plays	
  to	
  your	
  strengths
Data
Models
Infrastructure
Applications
Then	
  go	
  wide
Becoming	
  a	
  full-­‐stack	
  Deep	
  Learning	
  engineer
• At	
  this	
  point,	
  you've	
  built	
  experience	
  applying	
  your	
  skills in	
  the	
  DL	
  field
• Now,	
  challenge	
  yourself	
  to	
  learn	
  new	
  skills
• Lots	
  of	
  places	
  to	
  learn
• If	
  you	
  worked	
  with	
  a	
  group,	
  you	
  probably	
  already	
  learned	
  from	
  your	
  team	
  
members
• If	
  you're	
  in	
  a	
  company,	
  hopefully	
  you	
  work	
  with	
  people	
  who	
  have	
  
complementary	
  skills
Data
Models
Infrastructure
Applications
Conclusions
• Full-­‐stack	
  Deep	
  Learning	
  spans	
  several	
  disciplines
• For	
  entrepreneurs	
  /	
  managers:	
  if	
  you	
  want	
  to	
  do	
  deep	
  learning	
  
at	
  scale,	
  you	
  may	
  need	
  to	
  build	
  a	
  full-­‐stack	
  team
• For	
  engineers:	
  there	
  are	
  lots	
  of	
  ways	
  to	
  get	
  into	
  deep	
  learning	
  
and	
  ultimately	
  to	
  be	
  come	
  a	
  full-­‐stack	
  DL	
  engineer

More Related Content

What's hot

Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Impetus Technologies
 
“Case Study: Facial Detection and Recognition for Always-On Applications,” a ...
“Case Study: Facial Detection and Recognition for Always-On Applications,” a ...“Case Study: Facial Detection and Recognition for Always-On Applications,” a ...
“Case Study: Facial Detection and Recognition for Always-On Applications,” a ...
Edge AI and Vision Alliance
 
Image transformation using grid(synopsis)
Image transformation using grid(synopsis)Image transformation using grid(synopsis)
Image transformation using grid(synopsis)Mumbai Academisc
 
“Applying the Right Deep Learning Model with the Right Data for Your Applicat...
“Applying the Right Deep Learning Model with the Right Data for Your Applicat...“Applying the Right Deep Learning Model with the Right Data for Your Applicat...
“Applying the Right Deep Learning Model with the Right Data for Your Applicat...
Edge AI and Vision Alliance
 
The deep learning tour - Q1 2017
The deep learning tour - Q1 2017 The deep learning tour - Q1 2017
The deep learning tour - Q1 2017
Eran Shlomo
 
State-of-the-art Image Processing across all domains
State-of-the-art Image Processing across all domainsState-of-the-art Image Processing across all domains
State-of-the-art Image Processing across all domains
Knoldus Inc.
 
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
Edge AI and Vision Alliance
 
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
Larry Smarr
 
A Distributed Deep Learning Approach for the Mitosis Detection from Big Medic...
A Distributed Deep Learning Approach for the Mitosis Detection from Big Medic...A Distributed Deep Learning Approach for the Mitosis Detection from Big Medic...
A Distributed Deep Learning Approach for the Mitosis Detection from Big Medic...
Databricks
 
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr..."Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
Edge AI and Vision Alliance
 
Yangqing Jia at AI Frontiers: Towards Better DL Frameworks
Yangqing Jia at AI Frontiers: Towards Better DL FrameworksYangqing Jia at AI Frontiers: Towards Better DL Frameworks
Yangqing Jia at AI Frontiers: Towards Better DL Frameworks
AI Frontiers
 
IaaS Cloud Benchmarking: Approaches, Challenges, and Experience
IaaS Cloud Benchmarking: Approaches, Challenges, and ExperienceIaaS Cloud Benchmarking: Approaches, Challenges, and Experience
IaaS Cloud Benchmarking: Approaches, Challenges, and Experience
Alexandru Iosup
 
“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung
“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung
“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung
Edge AI and Vision Alliance
 
Dl 0n mobile jeff shomaker_jan-2018_final
Dl 0n mobile jeff shomaker_jan-2018_finalDl 0n mobile jeff shomaker_jan-2018_final
Dl 0n mobile jeff shomaker_jan-2018_final
Jeffrey Shomaker
 
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems..."Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
Edge AI and Vision Alliance
 
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
Edge AI and Vision Alliance
 
Intel and Amazon - Powering your innovation together.
Intel and Amazon - Powering your innovation together. Intel and Amazon - Powering your innovation together.
Intel and Amazon - Powering your innovation together.
Eran Shlomo
 
On-Device AI
On-Device AIOn-Device AI
On-Device AI
LGCNSairesearch
 
Serguei “SB” Beloussov - Future Of Computing at SIT Insights in Technology 2019
Serguei “SB” Beloussov - Future Of Computing at SIT Insights in Technology 2019Serguei “SB” Beloussov - Future Of Computing at SIT Insights in Technology 2019
Serguei “SB” Beloussov - Future Of Computing at SIT Insights in Technology 2019
Schaffhausen Institute of Technology
 
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present..."Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
Edge AI and Vision Alliance
 

What's hot (20)

Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
 
“Case Study: Facial Detection and Recognition for Always-On Applications,” a ...
“Case Study: Facial Detection and Recognition for Always-On Applications,” a ...“Case Study: Facial Detection and Recognition for Always-On Applications,” a ...
“Case Study: Facial Detection and Recognition for Always-On Applications,” a ...
 
Image transformation using grid(synopsis)
Image transformation using grid(synopsis)Image transformation using grid(synopsis)
Image transformation using grid(synopsis)
 
“Applying the Right Deep Learning Model with the Right Data for Your Applicat...
“Applying the Right Deep Learning Model with the Right Data for Your Applicat...“Applying the Right Deep Learning Model with the Right Data for Your Applicat...
“Applying the Right Deep Learning Model with the Right Data for Your Applicat...
 
The deep learning tour - Q1 2017
The deep learning tour - Q1 2017 The deep learning tour - Q1 2017
The deep learning tour - Q1 2017
 
State-of-the-art Image Processing across all domains
State-of-the-art Image Processing across all domainsState-of-the-art Image Processing across all domains
State-of-the-art Image Processing across all domains
 
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
 
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
 
A Distributed Deep Learning Approach for the Mitosis Detection from Big Medic...
A Distributed Deep Learning Approach for the Mitosis Detection from Big Medic...A Distributed Deep Learning Approach for the Mitosis Detection from Big Medic...
A Distributed Deep Learning Approach for the Mitosis Detection from Big Medic...
 
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr..."Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
 
Yangqing Jia at AI Frontiers: Towards Better DL Frameworks
Yangqing Jia at AI Frontiers: Towards Better DL FrameworksYangqing Jia at AI Frontiers: Towards Better DL Frameworks
Yangqing Jia at AI Frontiers: Towards Better DL Frameworks
 
IaaS Cloud Benchmarking: Approaches, Challenges, and Experience
IaaS Cloud Benchmarking: Approaches, Challenges, and ExperienceIaaS Cloud Benchmarking: Approaches, Challenges, and Experience
IaaS Cloud Benchmarking: Approaches, Challenges, and Experience
 
“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung
“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung
“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung
 
Dl 0n mobile jeff shomaker_jan-2018_final
Dl 0n mobile jeff shomaker_jan-2018_finalDl 0n mobile jeff shomaker_jan-2018_final
Dl 0n mobile jeff shomaker_jan-2018_final
 
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems..."Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
 
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
 
Intel and Amazon - Powering your innovation together.
Intel and Amazon - Powering your innovation together. Intel and Amazon - Powering your innovation together.
Intel and Amazon - Powering your innovation together.
 
On-Device AI
On-Device AIOn-Device AI
On-Device AI
 
Serguei “SB” Beloussov - Future Of Computing at SIT Insights in Technology 2019
Serguei “SB” Beloussov - Future Of Computing at SIT Insights in Technology 2019Serguei “SB” Beloussov - Future Of Computing at SIT Insights in Technology 2019
Serguei “SB” Beloussov - Future Of Computing at SIT Insights in Technology 2019
 
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present..."Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
 

Similar to Forrest Iandola: How to become a full-stack deep learning engineer

Duc le CV
Duc le CVDuc le CV
Duc le CV
Duc Minh Le
 
Continuously Design your Continuous Deployment
Continuously Design your Continuous DeploymentContinuously Design your Continuous Deployment
Continuously Design your Continuous Deployment
Michael Elder
 
Kadir A_20160804_res_tea
Kadir A_20160804_res_teaKadir A_20160804_res_tea
Kadir A_20160804_res_teaKadir A Peker
 
CLOUD COMPUTING BY SIVASANKARI
CLOUD COMPUTING BY SIVASANKARICLOUD COMPUTING BY SIVASANKARI
CLOUD COMPUTING BY SIVASANKARI
SivaSankari36
 
Cc unit 1 ppt
Cc unit 1 pptCc unit 1 ppt
Cc unit 1 ppt
Dr VISU P
 
Resume Nimisha Jha Mainframe Developer 6 Years 5 Months
Resume Nimisha Jha Mainframe Developer 6 Years 5 MonthsResume Nimisha Jha Mainframe Developer 6 Years 5 Months
Resume Nimisha Jha Mainframe Developer 6 Years 5 MonthsNimisha Jha
 
Build, Scale, and Deploy Deep Learning Pipelines with Ease
Build, Scale, and Deploy Deep Learning Pipelines with EaseBuild, Scale, and Deploy Deep Learning Pipelines with Ease
Build, Scale, and Deploy Deep Learning Pipelines with Ease
Databricks
 
Developing and Operating Microservice-based Applications on Modern Cloud Plat...
Developing and Operating Microservice-based Applications on Modern Cloud Plat...Developing and Operating Microservice-based Applications on Modern Cloud Plat...
Developing and Operating Microservice-based Applications on Modern Cloud Plat...
Michael O'Sullivan
 
GOTO Amsterdam 2017 - Enterprise Fast Lane
GOTO Amsterdam 2017 - Enterprise Fast LaneGOTO Amsterdam 2017 - Enterprise Fast Lane
GOTO Amsterdam 2017 - Enterprise Fast Lane
Christian Deger
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIs
Rakuten Group, Inc.
 
Making Software and Software Engineering visible
Making Software and Software Engineering visibleMaking Software and Software Engineering visible
Making Software and Software Engineering visible
CREST
 
Deep learning on mobile
Deep learning on mobileDeep learning on mobile
Deep learning on mobile
Anirudh Koul
 
Cloud Architecture Tutorial - Why and What (1of 3)
Cloud Architecture Tutorial - Why and What (1of 3) Cloud Architecture Tutorial - Why and What (1of 3)
Cloud Architecture Tutorial - Why and What (1of 3)
Adrian Cockcroft
 
GOTO Berlin 2016
GOTO Berlin 2016GOTO Berlin 2016
GOTO Berlin 2016
Christian Deger
 
OA centre of excellence
OA centre of excellenceOA centre of excellence
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTK
Ashish Jaiman
 
Constructing Knowledge Graph for Social Networks in a Deep and Holistic Way
Constructing Knowledge Graph for Social Networks in a Deep and Holistic WayConstructing Knowledge Graph for Social Networks in a Deep and Holistic Way
Constructing Knowledge Graph for Social Networks in a Deep and Holistic Way
Baoxu Shi
 

Similar to Forrest Iandola: How to become a full-stack deep learning engineer (20)

Duc le CV
Duc le CVDuc le CV
Duc le CV
 
Continuously Design your Continuous Deployment
Continuously Design your Continuous DeploymentContinuously Design your Continuous Deployment
Continuously Design your Continuous Deployment
 
Kadir A_20160804_res_tea
Kadir A_20160804_res_teaKadir A_20160804_res_tea
Kadir A_20160804_res_tea
 
BDNSbrochure
BDNSbrochureBDNSbrochure
BDNSbrochure
 
sample-resume
sample-resumesample-resume
sample-resume
 
CLOUD COMPUTING BY SIVASANKARI
CLOUD COMPUTING BY SIVASANKARICLOUD COMPUTING BY SIVASANKARI
CLOUD COMPUTING BY SIVASANKARI
 
Cc unit 1 ppt
Cc unit 1 pptCc unit 1 ppt
Cc unit 1 ppt
 
Resume Nimisha Jha Mainframe Developer 6 Years 5 Months
Resume Nimisha Jha Mainframe Developer 6 Years 5 MonthsResume Nimisha Jha Mainframe Developer 6 Years 5 Months
Resume Nimisha Jha Mainframe Developer 6 Years 5 Months
 
Build, Scale, and Deploy Deep Learning Pipelines with Ease
Build, Scale, and Deploy Deep Learning Pipelines with EaseBuild, Scale, and Deploy Deep Learning Pipelines with Ease
Build, Scale, and Deploy Deep Learning Pipelines with Ease
 
Developing and Operating Microservice-based Applications on Modern Cloud Plat...
Developing and Operating Microservice-based Applications on Modern Cloud Plat...Developing and Operating Microservice-based Applications on Modern Cloud Plat...
Developing and Operating Microservice-based Applications on Modern Cloud Plat...
 
GOTO Amsterdam 2017 - Enterprise Fast Lane
GOTO Amsterdam 2017 - Enterprise Fast LaneGOTO Amsterdam 2017 - Enterprise Fast Lane
GOTO Amsterdam 2017 - Enterprise Fast Lane
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIs
 
Making Software and Software Engineering visible
Making Software and Software Engineering visibleMaking Software and Software Engineering visible
Making Software and Software Engineering visible
 
Deep learning on mobile
Deep learning on mobileDeep learning on mobile
Deep learning on mobile
 
Cloud Architecture Tutorial - Why and What (1of 3)
Cloud Architecture Tutorial - Why and What (1of 3) Cloud Architecture Tutorial - Why and What (1of 3)
Cloud Architecture Tutorial - Why and What (1of 3)
 
GOTO Berlin 2016
GOTO Berlin 2016GOTO Berlin 2016
GOTO Berlin 2016
 
OA centre of excellence
OA centre of excellenceOA centre of excellence
OA centre of excellence
 
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTK
 
Constructing Knowledge Graph for Social Networks in a Deep and Holistic Way
Constructing Knowledge Graph for Social Networks in a Deep and Holistic WayConstructing Knowledge Graph for Social Networks in a Deep and Holistic Way
Constructing Knowledge Graph for Social Networks in a Deep and Holistic Way
 
803042764
803042764803042764
803042764
 

Recently uploaded

在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
addressing modes in computer architecture
addressing modes  in computer architectureaddressing modes  in computer architecture
addressing modes in computer architecture
ShahidSultan24
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
Jayaprasanna4
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
ankuprajapati0525
 
Democratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek AryaDemocratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek Arya
abh.arya
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
Intella Parts
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
Kamal Acharya
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.
PrashantGoswami42
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
ViniHema
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
Vaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdfVaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdf
Kamal Acharya
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 

Recently uploaded (20)

在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
addressing modes in computer architecture
addressing modes  in computer architectureaddressing modes  in computer architecture
addressing modes in computer architecture
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
 
Democratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek AryaDemocratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek Arya
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
 
Automobile Management System Project Report.pdf
Automobile Management System Project Report.pdfAutomobile Management System Project Report.pdf
Automobile Management System Project Report.pdf
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
Vaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdfVaccine management system project report documentation..pdf
Vaccine management system project report documentation..pdf
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 

Forrest Iandola: How to become a full-stack deep learning engineer

  • 1. deepscale.ai F O R R E S T   I A N D O L A Co-­‐founder  and  CEO,  DeepScale How  to  become a  full-­‐stack  deep  learning  engineer
  • 2. OUTLINE • Part  1:  My  adventures  in  entrepreneurship  and  deep  learning • How  I  became  a  full-­‐stack  DL  engineer • DeepScale's  founding  story • Part  2:  Full-­‐stack  deep  learning  engineering • What  is  the  full  stack? • How  to  organize  a  full-­‐stack  DL  team • How  to  become  a  full-­‐stack  DL  engineer
  • 3. PART  1 My  adventures  in  entrepreneurship  and  deep  learning
  • 4. For  as  long  as  I  can  remember,  I  have  been  obsessed  with  cars  and  trucks
  • 6. Illinois  Math  and  Science  Academy  (2005)
  • 7. Mechanical  Engineering  Building at  University  of  Illinois  (2008)
  • 8. Siebel  Center  for  Computer  Science at  University  of  Illinois  (2008)
  • 9. What  my  classmates  planned  to  do  after  finishing   their  undergraduate  degree  in  CS  at  University  of  Illinois  (2012) Entry-­‐level  jobs  at   Big  Companies Starting  Companies …and  countless  smartphone   app  startups Grad   School
  • 10. Starting  my  PhD  in  EECS  at  UC  Berkeley  (2012) Kurt  Keutzer
  • 11. Our  full-­‐stack  approach FOR  DEEP  NEURAL  NETWORKS  v0.1  (2016) Train  rapidly  using   multiprocessor  scaling Collect  and  annotate   adequate  training  data Develop  small  and  efficient  deep   neural  network  architectures Create  efficient  implementations   for  embedded  hardware F IREC AF F E  [ 1] T H E   D E E P S C A L E   C A R SQ U EEZEN ET  [ 2] B O DA  [ 3] [1]  F.N.  Iandola,  K.  Ashraf,  M.W.  Moskewicz,  and  K.  Keutzer.  FireCaffe:   near-­‐linear  acceleration  of  deep  neural  network  training  on  compute   clusters.  CVPR,  2016. [2]  F.N.  Iandola,  M.  Moskewicz,  K.  Ashraf,  S.  Han,  W.  Dally,  K.  Keutzer.   SqueezeNet:  AlexNet-­‐level  accuracy  with  50x  fewer  parameters  and   <1MB  model  size.  arXiv,  2016. [3]  M.W.  Moskewicz,  F.N.  Iandola,  K.  Keutzer.  Boda-­‐RTC:  Productive   Generation  of  Portable,  Efficient  Code  for  Convolutional  Neural   Networks  on  Mobile  Computing  Platforms.  WiMob,  2016.  
  • 13. DeepScaleHOW  OUR  BUSINESS  MODEL  CRYSTALLIZED  (2016) First  we  learned… • Automakers  have  gathered  lots  of  data  for  training  computer  vision  models  – for   use  in  autonomous  driving • Training  deep  neural  nets  on  these  data  volumes  is  really  slow  (months  or  even   years) • DeepScale's  distributed  training  system  can  help After  more  discussions,  we  learned… • Automakers  would  prefer  to  just  buy  the  "right"  deep  neural  networks,  already   trained • For  mass-­‐production,  automakers  have  really small  processors  (100x  less   computation  than  full-­‐size  NVIDIA  GPUs)
  • 14. Our  full-­‐stack  approach FOR  DEEP  NEURAL  NETWORKS  v0.1  (2016) Train  rapidly  using   multiprocessor  scaling Collect  and  annotate   adequate  training  data Develop  small  and  efficient  deep   neural  network  architectures Create  efficient  implementations   for  embedded  hardware F IREC AF F E  [ 1] T H E   D E E P S C A L E   C A R SQ U EEZEN ET  [ 2] B O DA  [ 3] [1]  F.N.  Iandola,  K.  Ashraf,  M.W.  Moskewicz,  and  K.  Keutzer.  FireCaffe:   near-­‐linear  acceleration  of  deep  neural  network  training  on  compute   clusters.  CVPR,  2016. [2]  F.N.  Iandola,  M.  Moskewicz,  K.  Ashraf,  S.  Han,  W.  Dally,  K.  Keutzer.   SqueezeNet:  AlexNet-­‐level  accuracy  with  50x  fewer  parameters  and   <1MB  model  size.  arXiv,  2016. [3]  M.W.  Moskewicz,  F.N.  Iandola,  K.  Keutzer.  Boda-­‐RTC:  Productive   Generation  of  Portable,  Efficient  Code  for  Convolutional  Neural   Networks  on  Mobile  Computing  Platforms.  WiMob,  2016.  
  • 15. DeepScale PERCEPTION  SYSTEMS   FOR  AUTOMATED  VEHICLES http://deepscale.ai
  • 16. My  story  in  one  slide 1993:  Planned  to  start  "Forrest's  Truck  Company"  when  I  grow  up 2008:  Went  to  college  at  University  of  Illinois  with  the  plan  to  study  Mechanical  Engineering  and  then  go   into  automotive  industry 2009:  Switched  to  computer  science;  gave  up  my  dreams  of  working  in  automotive 2012:  Considered  starting  a  startup,  but  choose  grad  school  instead 2013:  Started  doing  research  in  deep  learning 2015:  Started  DeepScale 2016:  Graduated  with  PhD  in  EECS 2016:  Focused  DeepScale  entirely  on  the  automotive  industry 2018:  Today,  DeepScale  is  on  track  to  supply  lifesaving  deep  learning  software  to  automakers You  can't  connect  the  dots  looking  forward;  you  can  only  connect  them  looking  backwards.   So,  you  have  to  trust  that  the  dots  will  somehow  connect  in  your  future."  – Steve  Jobs
  • 17. PART  2 The  playbook  for  full-­‐stack  deep  learning
  • 18. Data Infrastructure Models Applications The  full  system  stack FOR  DEEP  NEURAL  NETWORKS  v1.0 (2018)
  • 19. Data Data  Collection Data  Annotation Task  Definition Accuracy  MetricsData  Storage Simulation
  • 20. Models Design  Space   ExplorationNew  Layer  Types New  Objective   Functions DNN  Model   Structure Preprocessing  &   Featurization Quantization  &   Compression
  • 21. Infrastructure DNN  Frameworks Visualization  Tools Deploy  on  cloud  &  embedded  hardware Efficient  Computational  Kernels Train  on  large-­‐scale  hardware Distributed  Communication
  • 22. Applications GADGETSDATACENTERS SOCIAL  MEDIA  ANALYSIS WEB  INDEXING GOVERNMENT  INTELLIGENCE DRONES SELF-­‐DRIVING  CARSSMARTPHONES OTHER  ROBOTICS
  • 23. Traditional  Workflow  for   DL  Research  and  Engineering Models Infrastructure (FYI,  these  are  supposed  to  be  silos) Applications Data
  • 24. Breaking  Down  Silos  in DL  Research  and  Engineering Data Models Infrastructure Applications
  • 25. How  full-­‐stack  DL  teams  produce   full-­‐stack  engineers Data Models Infrastructure Applications
  • 26.
  • 27. So,  how  do  I  get  started?
  • 28. First  go  deep Bring  your  skills  to  a  full-­‐stack  Deep  Learning  team • I  bet  you're  already  pretty  good  some  piece  of  the  stack • So,  start  by  applying  your  existing  skills to  an  DL-­‐related  project • Work  with  a  group  who  has  complementary  skills • Or,  choose  a  project  that  plays  to  your  strengths Data Models Infrastructure Applications
  • 29. Then  go  wide Becoming  a  full-­‐stack  Deep  Learning  engineer • At  this  point,  you've  built  experience  applying  your  skills in  the  DL  field • Now,  challenge  yourself  to  learn  new  skills • Lots  of  places  to  learn • If  you  worked  with  a  group,  you  probably  already  learned  from  your  team   members • If  you're  in  a  company,  hopefully  you  work  with  people  who  have   complementary  skills Data Models Infrastructure Applications
  • 30. Conclusions • Full-­‐stack  Deep  Learning  spans  several  disciplines • For  entrepreneurs  /  managers:  if  you  want  to  do  deep  learning   at  scale,  you  may  need  to  build  a  full-­‐stack  team • For  engineers:  there  are  lots  of  ways  to  get  into  deep  learning   and  ultimately  to  be  come  a  full-­‐stack  DL  engineer