Deep Learning and Its Applications: Computer Vision
Adam Gibson
deeplearning4j.org // skymind.io // zipfian academy

Computer Vision: A Primer
• Object Recognition
• Image Categorization
• Scene Parsing
• Face Recognition

What's currently done?
• OpenCV
• SIFT
• Filters/Edge Detection
• Feature Extraction

This is manual!
• Representation Learning
• More precise than hand-done features
• Non-linearities and higher-order trends
• Pretraining and Hessian-free optimization

Deep Learning and Images
• Representation Learning
• Position Invariance with convolutions
• Semantic Hashing

Different kinds of images
• Normal pixels (values 0-255): normalization
• Sparse: binarization (depending on pixel presence)
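
The two preprocessing options above can be sketched in a few lines. This is a minimal illustration, not DL4J code: the function names, the sample image, and the threshold of 0 for "pixel presence" are assumptions made for the example.

```python
def normalize(pixels, max_value=255):
    """Scale 0-255 intensities into the range [0.0, 1.0]."""
    return [[p / max_value for p in row] for row in pixels]


def binarize(pixels, threshold=0):
    """Map each pixel to 1 if it is present (above threshold), else 0."""
    return [[1 if p > threshold else 0 for p in row] for row in pixels]


# Hypothetical 2x3 grayscale image used only for illustration.
image = [[0, 128, 255],
         [0, 0, 64]]

normalized = normalize(image)  # dense input with real-valued pixels
binary = binarize(image)       # sparse input with on/off pixels
```

Normalization keeps the intensity information, while binarization keeps only where the ink is, which tends to suit sparse inputs such as handwriting.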

Facial recognition
• Faces = a collection of images with persistent patterns of pixels.
• Pixel patterns = features.
• Nets learn to identify features in data, to classify faces as faces and label them: John or Sarah.
• Nets train by reconstructing faces from features many times, measuring their work against a benchmark.

DL4J's Facial Reconstructions

Position Invariance - Convolutions
• Slices of a feature space (max pooling)
• Learns different portions for easily scalable and robust feature engineering.
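
A small sketch may help show why convolution followed by max pooling tolerates small shifts. Everything here is an assumption for illustration: the 5x5 images, the 2x2 "blob" kernel, and the helper names are not from the slides, and the kernel is fixed by hand rather than learned.

```python
def convolve2d(image, kernel):
    """Valid (no padding) 2D cross-correlation, as used in most conv nets."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(out_w)]
            for i in range(out_h)]


def max_pool(fmap, size=2):
    """Non-overlapping max pooling over size x size windows."""
    return [[max(fmap[i + a][j + b]
                 for a in range(size) for b in range(size))
             for j in range(0, len(fmap[0]) - size + 1, size)]
            for i in range(0, len(fmap) - size + 1, size)]


def blob_image(top, left, size=5):
    """Hypothetical helper: a size x size image with one 2x2 bright blob."""
    img = [[0] * size for _ in range(size)]
    for i in range(top, top + 2):
        for j in range(left, left + 2):
            img[i][j] = 1
    return img


kernel = [[1, 1],
          [1, 1]]  # responds most strongly to a 2x2 bright blob

pooled_a = max_pool(convolve2d(blob_image(0, 0), kernel))
pooled_b = max_pool(convolve2d(blob_image(1, 1), kernel))  # blob shifted by one pixel
```

The blob moves by one pixel between the two images, yet the strongest response lands in the same pooled cell both times. The pooled maps are not identical, so the invariance is approximate and only holds for shifts that stay within a pooling window.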

Visual Example - Convolutions

Pen Strokes

What are faces for?
• Facebook uses facial recognition to make itself stickier and know more about us.
• Government agencies use it to secure national borders.
• Video game makers use it to construct more realistic worlds.
• Stores use it to identify customers and track behavior.

Restricted Boltzmann Machines (RBMs)
• 2 layers of neuron-like nodes.
• The 1st is the visible, or input, layer.
• The 2nd is "hidden." It identifies features in the input.
• Symmetrically connected.
• "Restricted" = no visible-visible or hidden-hidden ties.
• All connections happen between layers.
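
The structure described above can be sketched as a toy RBM. This is an illustrative sketch, not the DL4J implementation: the class and method names are hypothetical, and the training rule shown is one step of contrastive divergence (CD-1), a common choice that the slides do not name.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class RBM:
    """Tiny restricted Boltzmann machine with binary units."""

    def __init__(self, n_visible, n_hidden, seed=0):
        self.rng = random.Random(seed)
        # One weight per visible-hidden pair. There is no visible-visible or
        # hidden-hidden matrix, which is what makes the machine "restricted".
        self.weights = [[self.rng.gauss(0.0, 0.1) for _ in range(n_hidden)]
                        for _ in range(n_visible)]
        self.visible_bias = [0.0] * n_visible
        self.hidden_bias = [0.0] * n_hidden

    def hidden_probs(self, visible):
        return [sigmoid(self.hidden_bias[j] +
                        sum(v * self.weights[i][j] for i, v in enumerate(visible)))
                for j in range(len(self.hidden_bias))]

    def visible_probs(self, hidden):
        # The same weights are reused in the other direction (symmetric ties).
        return [sigmoid(self.visible_bias[i] +
                        sum(h * self.weights[i][j] for j, h in enumerate(hidden)))
                for i in range(len(self.visible_bias))]

    def reconstruct(self, visible):
        return self.visible_probs(self.hidden_probs(visible))

    def cd1_step(self, v0, learning_rate=0.1):
        """One contrastive-divergence update (assumed training rule)."""
        h0_probs = self.hidden_probs(v0)
        h0 = [1 if self.rng.random() < p else 0 for p in h0_probs]
        v1 = self.visible_probs(h0)
        h1_probs = self.hidden_probs(v1)
        for i in range(len(v0)):
            for j in range(len(h0_probs)):
                self.weights[i][j] += learning_rate * (
                    v0[i] * h0_probs[j] - v1[i] * h1_probs[j])
            self.visible_bias[i] += learning_rate * (v0[i] - v1[i])
        for j in range(len(h0_probs)):
            self.hidden_bias[j] += learning_rate * (h0_probs[j] - h1_probs[j])


def squared_error(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


pattern = [1, 1, 1, 0, 0, 0]  # made-up 6-pixel input
rbm = RBM(n_visible=6, n_hidden=3)
error_before = squared_error(pattern, rbm.reconstruct(pattern))
for _ in range(200):
    rbm.cd1_step(pattern)
error_after = squared_error(pattern, rbm.reconstruct(pattern))
```

Training on a single pattern is only meant to show the reconstruction loop; a real model would cycle through many images.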

Deep-Belief Net (DBN)
• A stack of RBMs.
• Each RBM's hidden layer → next RBM's visible/input layer.
• DBNs learn more & more complex features.
• Example:
• 1) Pixels = input;
• 2) H1 learns an edge or line;
• 3) H2 learns a corner or set of lines;
• 4) H3 learns two groups of lines forming an object -- a face!
• Final layer classifies feature groups: sunset, elephant, flower, John, Sarah.
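
The stacking idea, where each hidden layer becomes the next layer's input, can be sketched as follows. The layer sizes, helper names, and random (untrained) weights are all assumptions for illustration, so the activations here do not correspond to real edges or corners; the point is only how data flows through the stack.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_layer(n_in, n_out, rng):
    """Hypothetical helper: random weights and zero hidden biases for one RBM."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_out)] for _ in range(n_in)]
    return weights, [0.0] * n_out


def hidden_activations(visible, layer):
    """Hidden-unit probabilities of one RBM given its visible input."""
    weights, bias = layer
    return [sigmoid(bias[j] + sum(v * weights[i][j] for i, v in enumerate(visible)))
            for j in range(len(bias))]


def propagate(pixels, layers):
    """Return the activations of every layer, each one feeding the next."""
    activations = [pixels]
    for layer in layers:
        activations.append(hidden_activations(activations[-1], layer))
    return activations


rng = random.Random(0)
sizes = [8, 6, 4, 2]  # pixels -> H1 -> H2 -> H3 (illustrative sizes)
layers = [make_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]
acts = propagate([1, 0, 1, 0, 1, 0, 1, 0], layers)
```

In a typical DBN each RBM would be trained on the output of the layer below before the next one is added, and a classifier would sit on top of the last hidden layer.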

Deep Autoencoder
• 2 DBNs.
• 1st DBN *encodes* data into a vector of 10-30 numbers = pre-training.
• 2nd DBN decodes data into its original state.
• Backprop only happens on the 2nd DBN.
• 2nd is the fine-tuning stage (reconstruction entropy).
• Reduces documents or images to compact vectors.
• Useful in search, QA and information retrieval.
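
Assuming "reconstruction entropy" refers to the cross-entropy between an input and its decoded reconstruction, the fine-tuning objective can be sketched as below. The function name and the sample vectors are hypothetical and not taken from the slides.

```python
import math


def reconstruction_cross_entropy(original, reconstruction, eps=1e-12):
    """Cross-entropy between inputs in [0, 1] and their reconstructions."""
    total = 0.0
    for x, z in zip(original, reconstruction):
        z = min(max(z, eps), 1.0 - eps)  # keep log() away from 0
        total -= x * math.log(z) + (1.0 - x) * math.log(1.0 - z)
    return total


original = [1, 0, 1, 1]                  # made-up binarized input
close_guess = [0.9, 0.1, 0.8, 0.95]      # decoder output near the input
uninformed_guess = [0.5, 0.5, 0.5, 0.5]  # decoder that has learned nothing

loss_close = reconstruction_cross_entropy(original, close_guess)
loss_uninformed = reconstruction_cross_entropy(original, uninformed_guess)
```

A lower value means the decoder recovers the input more faithfully from the compact code, which is the quantity fine-tuning tries to reduce.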

Deep Autoencoder Architecture

Image Search Results

Recursive Neural Tensor Net
• Top-down & hierarchical rather than feed-forward (DBNs).
• Handles sequence-based classification, windows of several events, entire scenes (multiple objects).
• Features themselves are vectors.
• A tensor = a multi-dimensional matrix, or multiple matrices of the same size.
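
The slides do not spell out the equations, so the following sketch uses one common RNTN formulation: two child vectors are concatenated into c, and each entry of the parent vector is tanh(cᵀ V_k c + W_k c), where V is a stack of same-sized matrices (the tensor). Bias terms are omitted, and the variable names, the dimension d = 2, and the scene-part vectors are invented for illustration.

```python
import math
import random


def compose(left, right, tensor, weights):
    """Combine two child vectors of size d into one parent vector of size d."""
    children = left + right  # list concatenation, length 2d
    n = len(children)
    parent = []
    for slice_k, row_k in zip(tensor, weights):
        # Bilinear term: one 2d x 2d slice of the tensor per output unit.
        bilinear = sum(children[i] * slice_k[i][j] * children[j]
                       for i in range(n) for j in range(n))
        # Ordinary linear term, as in a standard recursive net.
        linear = sum(w * c for w, c in zip(row_k, children))
        parent.append(math.tanh(bilinear + linear))
    return parent


d = 2
rng = random.Random(0)
tensor = [[[rng.gauss(0.0, 0.1) for _ in range(2 * d)] for _ in range(2 * d)]
          for _ in range(d)]
weights = [[rng.gauss(0.0, 0.1) for _ in range(2 * d)] for _ in range(d)]

# Made-up feature vectors for three scene parts.
sky, tree, grass = [0.2, -0.1], [0.5, 0.3], [-0.4, 0.1]

# Build the scene bottom-up: (tree, grass) first, then combine with sky.
scene = compose(sky, compose(tree, grass, tensor, weights), tensor, weights)
```

Because the parent has the same size as each child, the same function can be applied repeatedly up a tree, which is what lets the model represent whole scenes made of multiple objects.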

RNTNs & Scene Composition
