SlideShare a Scribd company logo
DYNAMIC ROUTING BETWEEN
CAPSULES
AND OTHER CAPSULE THINGS
PART 1
WHY CNNs SUCK
The pooling operation used in
convolutional neural networks is a big
mistake and the fact that it works so
well is a disaster.
- Hinton
Max pooling loses the spatial information
Max pooling loses the spatial information
- We don’t use the relationship between objects. Is this a face?
Equivariance and invariance
- CNNs without max pooling are equivariant regarding translation.
- That’s something we want! But max pooling breaks it.
We work with frames of reference. CNNs do not.
PART 2
THE KEY CONCEPTS OF CAPSULES
VISION IS THE INVERSE OF GRAPHICS
The fundamental idea
Computer
Graphics
From a parameter
vector and a
projection matrix,
we generate a 3D
image.
This parameter
vector is viewpoint
invariant!
Can’t we go the
other way around
and achieve
viewpoint
invariance?
Computer
Vision?
CAPSULES ENCODE AN ENTITY
A capsule votes to say if a certain entity
is in the image.
Layer L Layer L+1
building
tea cup
face
nose
window
leaf
window nose leaf eye
face tea cup building
Correspondence
between network and
graph structure
Layer L Layer L+1
building
tea cup
face
nose
window
leaf
nose eye
face
Correspondence
between network and
graph structure
This graph has been carved out from
the full graph.
CAPSULES OUTPUT A VECTOR
A capsule encodes an entity (and its
properties) via its output vector.
Layer L Layer L+1
i
0.456
Fully
Connected
Net
The output of a
node (neuron) is a
scalar value.
0.456
0.456
Layer L Layer L+1
i
Capsules
Net
The output of a
node (capsule) is a
vector.
Layer L Layer L+1
digit 6
Capsules
Net:
an example
The first dimension
of the output vector
encodes for the
scale and thickness
of the digit.
Layer L Layer L+1
digit 6
Capsules
Net:
an example
The second
dimension of the
output vector
encodes for the
roundness of the
top part of the digit.
ROUTING MECHANISM
The information flows towards the
neurons that are the most adapted to
use this type of information.
Layer L Layer L+1
j+1
j
j-1
i
Wi,j-1
Wi,j
Wi,j+1
Fully
Connected
Net
The information is
distributed
uniformly to every
other node in the
next layer.
Layer L Layer L+1
j+1
j
j-1
i
ci,j-1
Wi,j-1
ci,j
Wi,j
ci,j+1
Wi,j+1
Capsules
Net
The information is
distributed to a
specific node in the
next layer.
Routing mechanism (bonus slide)
- In a CNN, this routing mechanism is ‘inverted’.
- In a CapsNet, the routing is learned.
0.2
0.1
0.6
Layer L Layer L+1
building
tea cup
face
nose
ci,j-1
Wi,j-1
ci,j
Wi,j
ci,j+1
Wi,j+1
Capsules
Net:
an example
window
leaf
HOW CAPSULES VOTE
How is the presence of an entity
encoded in the network?
Layer L Layer L+1
Norm of the
output
The norm of the
vector encodes the
probability that the
entity is in the image.
nose
window
leaf
Layer L Layer L+1
Learning
the routing
Capsules that detect
similar pose (via
scalar product)
estimations tend to
be coupled.
j+1
j
j-1
i
PART 3
FORMALISATION
Layer L Layer L+1
j+1
j
j-1
i
Computing
the output
vector
i-1
i+1
Weighted sum of
the inputs (before
activation function).
Layer L Layer L+1
j+1
j
j-1
i
Computing
the output
vector
Squashing the
output vector to
fallback on a
probability (non
linear activation
function).
i-1
i+1
How
routing is
achieved
How do we obtain
the ?
1 Start with log priors:
2 Initialise with
3 Make a forward pass to obtain the
4 Update the :
PART 4
RESULTS
MNIST
Error rate (%)
Number of
parameters
CNN baseline 0.39 35.4M
CapsNet 0.25 8.2M
MultiMNIST
Error rate
(%)
Number of
parameters
CNN baseline 8.1 24.56M
CapsNet 5.2 11.36M
affNIST
Accuracy (%) on... Original MNIST affNIST
CNN baseline 99.22 66.
CapsNet 99.23 79.
smallNORB
Error rate
(%)
Azimuth Elevation
CNN CapsNet CNN CapsNet
Familiar
Viewpoints
3.7 3.7 4.3 4.3
Novel
viewpoints
20. 13.5 17.8 12.3
FURTHER READING
- https://medium.com/mlreview/deep-neural-network-capsules-137be2877d44
- https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part
-i-intuition-b4b559d1159b
- https://hackernoon.com/what-is-a-capsnet-or-capsule-network-2bfbe48769cc
- https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-us
e-them-c233a0971952
- https://www.reddit.com/r/MachineLearning/comments/7bz5x9/d_eli5_capsule_ne
tworks_how_are_they_unique_and/
- https://kndrck.co/posts/capsule_networks_explained/

More Related Content

What's hot

Convolution neural networks
Convolution neural networksConvolution neural networks
Convolution neural networks
Fares Hasan
 
(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...
Jacky Liu
 
[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions
JaeJun Yoo
 
Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)
Muhammad Haroon
 
Domain Invariant Representation Learning with Domain Density Transformations
Domain Invariant Representation Learning with Domain Density TransformationsDomain Invariant Representation Learning with Domain Density Transformations
Domain Invariant Representation Learning with Domain Density Transformations
HyunKyu Jeon
 
MNIST and machine learning - presentation
MNIST and machine learning - presentationMNIST and machine learning - presentation
MNIST and machine learning - presentation
Steve Dias da Cruz
 
[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques
JaeJun Yoo
 
Design and Implementation of EZW & SPIHT Image Coder for Virtual Images
Design and Implementation of EZW & SPIHT Image Coder for Virtual ImagesDesign and Implementation of EZW & SPIHT Image Coder for Virtual Images
Design and Implementation of EZW & SPIHT Image Coder for Virtual Images
CSCJournals
 
Mnist report ppt
Mnist report pptMnist report ppt
Mnist report ppt
RaghunandanJairam
 
Pixel Recurrent Neural Networks
Pixel Recurrent Neural NetworksPixel Recurrent Neural Networks
Pixel Recurrent Neural Networks
neouyghur
 
Image classification using cnn
Image classification using cnnImage classification using cnn
Image classification using cnn
SumeraHangi
 
Matching Network
Matching NetworkMatching Network
Matching Network
SuwhanBaek
 
Image compression using EZW encoding
Image compression using EZW encoding Image compression using EZW encoding
Deep learning for image super resolution
Deep learning for image super resolutionDeep learning for image super resolution
Deep learning for image super resolution
Prudhvi Raj
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
Itachi SK
 
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
Fellowship at Vodafone FutureLab
 
Conditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN DecodersConditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN Decoders
suga93
 
PR243: Designing Network Design Spaces
PR243: Designing Network Design SpacesPR243: Designing Network Design Spaces
PR243: Designing Network Design Spaces
Jinwon Lee
 
20141003.journal club
20141003.journal club20141003.journal club
20141003.journal club
Hayaru SHOUNO
 
Sift detector boosted by adaptive contrast threshold to improve matching robu...
Sift detector boosted by adaptive contrast threshold to improve matching robu...Sift detector boosted by adaptive contrast threshold to improve matching robu...
Sift detector boosted by adaptive contrast threshold to improve matching robu...
Venkat Projects
 

What's hot (20)

Convolution neural networks
Convolution neural networksConvolution neural networks
Convolution neural networks
 
(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...
 
[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions[PR12] Generative Models as Distributions of Functions
[PR12] Generative Models as Distributions of Functions
 
Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)
 
Domain Invariant Representation Learning with Domain Density Transformations
Domain Invariant Representation Learning with Domain Density TransformationsDomain Invariant Representation Learning with Domain Density Transformations
Domain Invariant Representation Learning with Domain Density Transformations
 
MNIST and machine learning - presentation
MNIST and machine learning - presentationMNIST and machine learning - presentation
MNIST and machine learning - presentation
 
[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques
 
Design and Implementation of EZW & SPIHT Image Coder for Virtual Images
Design and Implementation of EZW & SPIHT Image Coder for Virtual ImagesDesign and Implementation of EZW & SPIHT Image Coder for Virtual Images
Design and Implementation of EZW & SPIHT Image Coder for Virtual Images
 
Mnist report ppt
Mnist report pptMnist report ppt
Mnist report ppt
 
Pixel Recurrent Neural Networks
Pixel Recurrent Neural NetworksPixel Recurrent Neural Networks
Pixel Recurrent Neural Networks
 
Image classification using cnn
Image classification using cnnImage classification using cnn
Image classification using cnn
 
Matching Network
Matching NetworkMatching Network
Matching Network
 
Image compression using EZW encoding
Image compression using EZW encoding Image compression using EZW encoding
Image compression using EZW encoding
 
Deep learning for image super resolution
Deep learning for image super resolutionDeep learning for image super resolution
Deep learning for image super resolution
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
 
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
 
Conditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN DecodersConditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN Decoders
 
PR243: Designing Network Design Spaces
PR243: Designing Network Design SpacesPR243: Designing Network Design Spaces
PR243: Designing Network Design Spaces
 
20141003.journal club
20141003.journal club20141003.journal club
20141003.journal club
 
Sift detector boosted by adaptive contrast threshold to improve matching robu...
Sift detector boosted by adaptive contrast threshold to improve matching robu...Sift detector boosted by adaptive contrast threshold to improve matching robu...
Sift detector boosted by adaptive contrast threshold to improve matching robu...
 

Similar to Dynamic routing between capsules - A brief presentation

Deep Neural Network DNN.docx
Deep Neural Network DNN.docxDeep Neural Network DNN.docx
Deep Neural Network DNN.docx
jaffarbikat
 
Deep Learning Training at Intel
Deep Learning Training at IntelDeep Learning Training at Intel
Deep Learning Training at Intel
Atul Vaish
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learning
RADO7900
 
[Revised] Intro to CNN
[Revised] Intro to CNN[Revised] Intro to CNN
[Revised] Intro to CNN
Vincent Tatan
 
Classification case study + intro to cnn
Classification case study + intro to cnnClassification case study + intro to cnn
Classification case study + intro to cnn
Vincent Tatan
 
Machine learning and vulnerabilities
Machine learning and vulnerabilitiesMachine learning and vulnerabilities
Machine learning and vulnerabilities
galazzo
 
improving Profile detection using Deep Learning
improving Profile detection using Deep Learningimproving Profile detection using Deep Learning
improving Profile detection using Deep Learning
Sahil Kaw
 
Facebook Deep face
Facebook Deep faceFacebook Deep face
Facebook Deep face
Emanuele Santellani
 
NIPS2007: deep belief nets
NIPS2007: deep belief netsNIPS2007: deep belief nets
NIPS2007: deep belief nets
zukun
 
A Study On Deep Learning
A Study On Deep LearningA Study On Deep Learning
A Study On Deep Learning
Abdelrahman Hosny
 
Deep Learning Survey
Deep Learning SurveyDeep Learning Survey
Deep Learning Survey
Anthony Parziale
 
Face recognition using artificial neural network
Face recognition using artificial neural networkFace recognition using artificial neural network
Face recognition using artificial neural network
Sumeet Kakani
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
Hiroshi Fukui
 
A Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep LearningA Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep Learning
IRJET Journal
 
Deep learning for pose-invariant face detection in unconstrained environment
Deep learning for pose-invariant face detection in unconstrained environmentDeep learning for pose-invariant face detection in unconstrained environment
Deep learning for pose-invariant face detection in unconstrained environment
IJECEIAES
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networks
TayleeGray
 
introduction to deeplearning
introduction to deeplearningintroduction to deeplearning
introduction to deeplearning
Eyad Alshami
 
Project Presentation.pptx
Project Presentation.pptxProject Presentation.pptx
Project Presentation.pptx
BME62ThejeswarSeggam
 
Attention mechanism in brain and deep neural network
Attention mechanism in brain and deep neural networkAttention mechanism in brain and deep neural network
Attention mechanism in brain and deep neural network
Zahra Sadeghi
 
Understanding Deep Learning & Parameter Tuning with MXnet, H2o Package in R
Understanding Deep Learning & Parameter Tuning with MXnet, H2o Package in RUnderstanding Deep Learning & Parameter Tuning with MXnet, H2o Package in R
Understanding Deep Learning & Parameter Tuning with MXnet, H2o Package in R
Manish Saraswat
 

Similar to Dynamic routing between capsules - A brief presentation (20)

Deep Neural Network DNN.docx
Deep Neural Network DNN.docxDeep Neural Network DNN.docx
Deep Neural Network DNN.docx
 
Deep Learning Training at Intel
Deep Learning Training at IntelDeep Learning Training at Intel
Deep Learning Training at Intel
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learning
 
[Revised] Intro to CNN
[Revised] Intro to CNN[Revised] Intro to CNN
[Revised] Intro to CNN
 
Classification case study + intro to cnn
Classification case study + intro to cnnClassification case study + intro to cnn
Classification case study + intro to cnn
 
Machine learning and vulnerabilities
Machine learning and vulnerabilitiesMachine learning and vulnerabilities
Machine learning and vulnerabilities
 
improving Profile detection using Deep Learning
improving Profile detection using Deep Learningimproving Profile detection using Deep Learning
improving Profile detection using Deep Learning
 
Facebook Deep face
Facebook Deep faceFacebook Deep face
Facebook Deep face
 
NIPS2007: deep belief nets
NIPS2007: deep belief netsNIPS2007: deep belief nets
NIPS2007: deep belief nets
 
A Study On Deep Learning
A Study On Deep LearningA Study On Deep Learning
A Study On Deep Learning
 
Deep Learning Survey
Deep Learning SurveyDeep Learning Survey
Deep Learning Survey
 
Face recognition using artificial neural network
Face recognition using artificial neural networkFace recognition using artificial neural network
Face recognition using artificial neural network
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 
A Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep LearningA Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep Learning
 
Deep learning for pose-invariant face detection in unconstrained environment
Deep learning for pose-invariant face detection in unconstrained environmentDeep learning for pose-invariant face detection in unconstrained environment
Deep learning for pose-invariant face detection in unconstrained environment
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networks
 
introduction to deeplearning
introduction to deeplearningintroduction to deeplearning
introduction to deeplearning
 
Project Presentation.pptx
Project Presentation.pptxProject Presentation.pptx
Project Presentation.pptx
 
Attention mechanism in brain and deep neural network
Attention mechanism in brain and deep neural networkAttention mechanism in brain and deep neural network
Attention mechanism in brain and deep neural network
 
Understanding Deep Learning & Parameter Tuning with MXnet, H2o Package in R
Understanding Deep Learning & Parameter Tuning with MXnet, H2o Package in RUnderstanding Deep Learning & Parameter Tuning with MXnet, H2o Package in R
Understanding Deep Learning & Parameter Tuning with MXnet, H2o Package in R
 

Recently uploaded

aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
European Sustainable Phosphorus Platform
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
LengamoLAppostilic
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
MaheshaNanjegowda
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
Hitesh Sikarwar
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Texas Alliance of Groundwater Districts
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 

Recently uploaded (20)

aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 

Dynamic routing between capsules - A brief presentation

  • 3. The pooling operation used in convolutional neural networks is a big mistake and the fact that it works so well is a disaster. - Hinton
  • 4. Max pooling loses the spatial information
  • 5. Max pooling loses the spatial information - We don’t use the relationship between objects. Is this a face?
  • 6. Equivariance and invariance - CNNs without max pooling are equivariant regarding translation. - That’s something we want! But max pooling breaks it.
  • 7. We work with frames of reference. CNNs do not.
  • 8. PART 2 THE KEY CONCEPTS OF CAPSULES
  • 9. VISION IS THE INVERSE OF GRAPHICS The fundamental idea
  • 10. Computer Graphics From a parameter vector and a projection matrix, we generate a 3D image. This parameter vector is viewpoint invariant!
  • 11. Can’t we go the other way around and achieve viewpoint invariance? Computer Vision?
  • 12. CAPSULES ENCODE AN ENTITY A capsule votes to say if a certain entity is in the image.
  • 13. Layer L Layer L+1 building tea cup face nose window leaf window nose leaf eye face tea cup building Correspondence between network and graph structure
  • 14. Layer L Layer L+1 building tea cup face nose window leaf nose eye face Correspondence between network and graph structure This graph has been carved out from the full graph.
  • 15. CAPSULES OUTPUT A VECTOR A capsule encodes an entity (and its properties) via its output vector.
  • 16. Layer L Layer L+1 i 0.456 Fully Connected Net The output of a node (neuron) is a scalar value. 0.456 0.456
  • 17. Layer L Layer L+1 i Capsules Net The output of a node (capsule) is a vector.
  • 18. Layer L Layer L+1 digit 6 Capsules Net: an example The first dimension of the output vector encodes for the scale and thickness of the digit.
  • 19. Layer L Layer L+1 digit 6 Capsules Net: an example The second dimension of the output vector encodes for the roundness of the top part of the digit.
  • 20. ROUTING MECHANISM The information flows towards the neurons that are the most adapted to use this type of information.
  • 21. Layer L Layer L+1 j+1 j j-1 i Wi,j-1 Wi,j Wi,j+1 Fully Connected Net The information is distributed uniformly to every other node in the next layer.
  • 22. Layer L Layer L+1 j+1 j j-1 i ci,j-1 Wi,j-1 ci,j Wi,j ci,j+1 Wi,j+1 Capsules Net The information is distributed to a specific node in the next layer.
  • 23. Routing mechanism (bonus slide) - In a CNN, this routing mechanism is ‘inverted’. - In a CapsNet, the routing is learned. 0.2 0.1 0.6
  • 24. Layer L Layer L+1 building tea cup face nose ci,j-1 Wi,j-1 ci,j Wi,j ci,j+1 Wi,j+1 Capsules Net: an example window leaf
  • 25. HOW CAPSULES VOTE How is the presence of an entity encoded in the network?
  • 26. Layer L Layer L+1 Norm of the output The norm of the vector encodes the probability that the entity is in the image. nose window leaf
  • 27. Layer L Layer L+1 Learning the routing Capsules that detect similar pose (via scalar product) estimations tend to be coupled. j+1 j j-1 i
  • 29. Layer L Layer L+1 j+1 j j-1 i Computing the output vector i-1 i+1 Weighted sum of the inputs (before activation function).
  • 30. Layer L Layer L+1 j+1 j j-1 i Computing the output vector Squashing the output vector to fallback on a probability (non linear activation function). i-1 i+1
  • 31.
  • 32. How routing is achieved How do we obtain the ? 1 Start with log priors: 2 Initialise with 3 Make a forward pass to obtain the 4 Update the :
  • 34. MNIST Error rate (%) Number of parameters CNN baseline 0.39 35.4M CapsNet 0.25 8.2M MultiMNIST Error rate (%) Number of parameters CNN baseline 8.1 24.56M CapsNet 5.2 11.36M
  • 35. affNIST Accuracy (%) on... Original MNIST affNIST CNN baseline 99.22 66. CapsNet 99.23 79.
  • 36. smallNORB Error rate (%) Azimuth Elevation CNN CapsNet CNN CapsNet Familiar Viewpoints 3.7 3.7 4.3 4.3 Novel viewpoints 20. 13.5 17.8 12.3
  • 37. FURTHER READING - https://medium.com/mlreview/deep-neural-network-capsules-137be2877d44 - https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part -i-intuition-b4b559d1159b - https://hackernoon.com/what-is-a-capsnet-or-capsule-network-2bfbe48769cc - https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-us e-them-c233a0971952 - https://www.reddit.com/r/MachineLearning/comments/7bz5x9/d_eli5_capsule_ne tworks_how_are_they_unique_and/ - https://kndrck.co/posts/capsule_networks_explained/