Dynamic Routing Between Capsules
Sara Sabour, Nicholas Frosst, Geoffrey E. Hinton, arXiv, October 2017
LAB SEMINAR
1
2017.11.13
SNU DATAMINING CENTER
MINKI CHUNG
TABLE OF CONTENTS
▸ Intuition
  ▸ Problems of ConvNet
  ▸ How brain works, Inverse graphics
  ▸ Capsule Theory
▸ CapsNet
  ▸ Capsule
  ▸ CapsNet architecture
  ▸ Experiment
    ▸ Classification on MNIST
    ▸ Reconstruction on MNIST
    ▸ Dimension perturbation on MNIST
▸ Discussion
2
INTUITION
▸ Problems of ConvNet
▸ How brain works, Inverse graphics
▸ Capsule Theory
3
PROBLEMS OF CONVNET 4
▸ ConvNet Architecture
THE PROBLEM IS ‘POOLING’
https://hackernoon.com/what-is-a-capsnet-or-capsule-network-2bfbe48769cc
Pooling is used to obtain translational and rotational invariance
PROBLEMS OF CONVNET 5
▸ [Screenshot of Geoffrey Hinton's AMA comment]
@REDDIT, MACHINE LEARNING
https://www.reddit.com/r/MachineLearning/comments/2lmo0l/ama_geoffrey_hinton/clyj4jv/
PROBLEMS OF CONVNET 6
▸ [Image]
WHAT IS THIS PICTURE?
https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-use-them-c233a0971952
PROBLEMS OF CONVNET 7
▸ [Image]
HOW ABOUT THIS?
https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-use-them-c233a0971952
PROBLEMS OF CONVNET 8
▸ [Image]
NEED EQUIVARIANCE, NOT INVARIANCE
https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-use-them-c233a0971952
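The difference between invariance and equivariance can be sketched on a toy example. The 1-D signal and the `max_pool_1d` helper below are illustrative assumptions, not taken from the slides or the paper: pooling gives the same output when a feature shifts slightly (invariance), while a representation that also tracks position changes along with the input (equivariance).

```python
import numpy as np

def max_pool_1d(x, size):
    """Non-overlapping max pooling over a 1-D signal (length divisible by size)."""
    return x.reshape(-1, size).max(axis=1)

# A single "feature" (value 1.0) at position 1, then the same feature at position 2.
a = np.zeros(8)
a[1] = 1.0
b = np.roll(a, 1)  # shifted one step to the right

pooled_a = max_pool_1d(a, 4)
pooled_b = max_pool_1d(b, 4)

# Invariance: the pooled outputs are identical, so the shift is no longer visible.
same_after_pooling = np.array_equal(pooled_a, pooled_b)

# Equivariance: a representation that keeps the position changes with the input.
pos_a, pos_b = int(a.argmax()), int(b.argmax())

print(same_after_pooling, pos_a, pos_b)
```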
HOW BRAIN WORKS, INVERSE GRAPHICS 9
▸ Constructs a visual image from an internal hierarchical representation of geometric data
▸ The internal representation is stored in the computer’s memory as arrays of geometrical objects and matrices that represent the relative positions and orientations of these objects
▸ Special software takes that representation and converts it into an image on the screen. This is called rendering
▸ Brains, in fact, do the opposite of rendering. Hinton calls it inverse graphics: from the visual information received by the eyes, they deconstruct a hierarchical representation of the world around us and try to match it with already learned patterns and relationships stored in the brain
▸ Key idea: the representation of objects in the brain does not depend on the viewing angle
COMPUTER GRAPHICS
https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b
CAPSULE THEORY 10
▸ In 3D graphics, relationships between 3D objects can be represented by a so-called pose, which is in essence translation plus rotation
▸ Capsule approach: it incorporates the relative relationships between objects (the internal representation), represented numerically as a 4×4 pose matrix
▸ Done by ‘Dynamic Routing’ (more details later)
▸ This allows capsules to communicate with each other and create representations similar to scene graphs in computer graphics
https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b
YOU CAN EASILY RECOGNIZE THAT THIS IS THE STATUE OF LIBERTY,
EVEN THOUGH ALL THE IMAGES SHOW IT FROM DIFFERENT ANGLES
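As a rough illustration of "pose = translation plus rotation", the sketch below composes poses with homogeneous matrices. It is a simplified 2-D version (3×3 matrices instead of the 4×4 matrices used in 3D), and the face/mouth scene and the `pose_2d` helper are hypothetical, chosen only for illustration.

```python
import numpy as np

def pose_2d(angle_rad, tx, ty):
    """3x3 homogeneous matrix: rotate by angle_rad, then translate by (tx, ty)."""
    c, s = np.cos(angle_rad), np.sin(angle_rad)
    return np.array([[c, -s, tx],
                     [s,  c, ty],
                     [0.0, 0.0, 1.0]])

# Hypothetical scene: a face placed in the image, and a mouth placed relative to the face.
face_in_image = pose_2d(np.pi / 2, 10.0, 5.0)  # face rotated 90 degrees, centred at (10, 5)
mouth_in_face = pose_2d(0.0, 0.0, -2.0)        # mouth 2 units below the face centre

# Composing the two poses gives the mouth's pose in image coordinates.
mouth_in_image = face_in_image @ mouth_in_face

# The part-whole relation does not depend on the viewpoint:
# it can be recovered from the two image-space poses.
recovered = np.linalg.inv(face_in_image) @ mouth_in_image

print(mouth_in_image[:2, 2])
```

Changing `face_in_image` (a different viewpoint) changes `mouth_in_image` but leaves `mouth_in_face` untouched, which is the viewpoint-independent relationship the slide refers to.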
CAPSULE THEORY 11
▸ Benefits:
▸ Better understanding of 3D space
▸ Can achieve state-of-the-art performance using only a fraction of the data that a CNN would use
▸ To learn to tell digits apart, the human brain needs only a couple of dozen examples, hundreds at most, while a CNN needs tens of thousands of examples
https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b
CAPSNET
▸ Capsule
▸ CapsNet architecture
▸ Experiment
12
CAPSULE 13
▸ Comparison with traditional neuron
https://www.zhihu.com/question/67287444/answer/251460831
[Figure: capsule vs. traditional neuron. Labels: output vector v, whose length works like a probability; activation of the next capsule; dynamic routing]
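For reference, the comparison can be written out as equations. The notation follows the Dynamic Routing paper rather than the slide itself: a traditional neuron maps scalar inputs x_i to a scalar activation, while capsule j maps input vectors u_i to an output vector v_j whose length lies between 0 and 1.

```latex
% Traditional neuron (scalar in, scalar out)
a_j = f\Big(\sum_i w_{ij}\, x_i + b_j\Big)

% Capsule (vector in, vector out)
\hat{\mathbf{u}}_{j|i} = \mathbf{W}_{ij}\,\mathbf{u}_i, \qquad
\mathbf{s}_j = \sum_i c_{ij}\,\hat{\mathbf{u}}_{j|i}, \qquad
\mathbf{v}_j = \frac{\lVert \mathbf{s}_j \rVert^2}{1 + \lVert \mathbf{s}_j \rVert^2}\,
               \frac{\mathbf{s}_j}{\lVert \mathbf{s}_j \rVert}

% Coupling coefficients come from the routing logits b_{ij}
c_{ij} = \frac{\exp(b_{ij})}{\sum_k \exp(b_{ik})}
```

The last factor in the expression for v_j is the squashing nonlinearity: it keeps the direction of s_j and compresses its length into the range [0, 1), which is why the vector length can be read like a probability.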
CAPSNET ARCHITECTURE 14
ARCHITECTURE
Sara Sabour, Nicholas Frosst, Geoffrey E. Hinton. Dynamic Routing Between Capsules. arXiv, October 2017
[Figure: MNIST → CONV → CAPS.CONV (32 × 8) → CAPS.FC, with dynamic routing between CAPS.CONV and CAPS.FC]
▸ CONV: local feature detection
▸ CAPS.CONV: 6*6*32 = 1152 capsules, each has 8 properties
▸ CAPS.FC: 10 capsules (one per class), each has 16 properties
▸ Deeper means more complex, so the dimension should increase
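The capsule counts can be checked with a little arithmetic. The kernel sizes and strides below (9×9 with stride 1, then 9×9 with stride 2, no padding) are taken from the paper, not from the slide, and the `conv_out` helper is only an illustrative name.

```python
def conv_out(size, kernel, stride):
    """Output width of a 'valid' (no padding) convolution."""
    return (size - kernel) // stride + 1

image = 28                                     # MNIST images are 28x28
conv1 = conv_out(image, kernel=9, stride=1)    # width after the first conv layer
primary = conv_out(conv1, kernel=9, stride=2)  # width of the PrimaryCaps grid

num_primary_caps = primary * primary * 32      # 6*6*32 capsules
primary_dim, digit_caps, digit_dim = 8, 10, 16

# One 8x16 transformation matrix per (primary capsule, digit capsule) pair.
routing_weights = num_primary_caps * digit_caps * primary_dim * digit_dim

print(conv1, primary, num_primary_caps, routing_weights)
```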
CAPSNET ARCHITECTURE 15
▸ naturomics github: CapsNet-Tensorflow
https://github.com/naturomics/CapsNet-Tensorflow
[Figure: MNIST → CONV → CAPS.CONV (× 32, × 8) → CAPS.FC, with dynamic routing between CAPS.CONV and CAPS.FC]
CAPSNET ARCHITECTURE 16
▸ Place-coded capsules
▸ Concatenate the outputs of 8 different regular conv layers
▸ Treat each position of each feature map as a capsule (6*6*32 = 1152 capsules with 8 properties each)
CAPS.CONV, PRIMARYCAPS
https://github.com/naturomics/CapsNet-Tensorflow
[Figure: MNIST → CAPS.CONV (× 32, × 8); label: DIRECTION]
CAPSNET ARCHITECTURE 17
▸ Place-coded capsules
▸ Concatenate the outputs of 8 different regular conv layers
▸ Treat each position of each feature map as a capsule (6*6*32 = 1152 capsules with 8 properties each)
▸ Apply the squashing function at the end
CAPS.CONV, PRIMARYCAPS
https://github.com/naturomics/CapsNet-Tensorflow
[Figure: MNIST → CAPS.CONV (× 32, × 8)]
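The PrimaryCaps step can be sketched roughly as a reshape followed by squashing. This is a minimal NumPy sketch, not the code of the linked repository: the random array stands in for the concatenated conv output, and the memory layout (32 channels × 8 dimensions along the last axis) is an assumption made for illustration.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    """Shrink short vectors toward 0 and long vectors toward unit length."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

rng = np.random.default_rng(0)

# Stand-in for the concatenated conv output: batch of 2, 6x6 grid, 32 channels x 8 dims.
conv_output = rng.normal(size=(2, 6, 6, 32 * 8))

# Every 8 consecutive values at a grid position become one capsule vector.
primary_caps = squash(conv_output.reshape(2, 6 * 6 * 32, 8))

# One length per capsule; squashing keeps it in [0, 1).
lengths = np.linalg.norm(primary_caps, axis=-1)

print(primary_caps.shape, float(lengths.max()))
```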
CAPSNET ARCHITECTURE 18
▸ Rate-coded capsules
▸ Number of capsules: 1152 → 10
▸ Vector dimension: 8 → 16
▸ Dynamic Routing
CAPS.FC, DIGITCAPS
https://github.com/naturomics/CapsNet-Tensorflow
[Figure: MNIST → CAPS.CONV (× 32, × 8) → CAPS.FC, with dynamic routing in between]
CAPSNET ARCHITECTURE 19
▸ Dynamic Routing
▸ Top-down feedback: if a relation exists, the coupling coefficients increase
▸ Routing by agreement: if multiple predictions agree, the higher-level capsule becomes active
▸ Works like attention
CAPS.FC, DIGITCAPS
https://github.com/naturomics/CapsNet-Tensorflow
[Figure: routing between capsules. Labels: coupling coefficients; agreement; activation of next capsule; vector length works like probability]
CAPSNET ARCHITECTURE 20
▸ Dynamic Routing
▸ 3 iterations will do
CAPS.FC, DIGITCAPS
https://github.com/naturomics/CapsNet-Tensorflow
[Figure: MNIST → CAPS.CONV (× 32, × 8) → CAPS.FC, with dynamic routing in between]
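Routing by agreement with 3 iterations can be sketched as follows. This is a minimal NumPy sketch following the routing procedure described in the paper, not the code of the linked repository; the random weights `W` and inputs `u` are stand-ins, and the function names are illustrative.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    """Keep the direction of s, compress its length into [0, 1)."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def softmax(x, axis):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def dynamic_routing(u_hat, iterations=3):
    """u_hat: predictions of shape (num_in, num_out, dim_out). Returns (v, c)."""
    num_in, num_out, _ = u_hat.shape
    b = np.zeros((num_in, num_out))                     # routing logits start at 0
    for _ in range(iterations):
        c = softmax(b, axis=1)                          # coupling coefficients per input capsule
        s = np.sum(c[:, :, None] * u_hat, axis=0)       # weighted sum, shape (num_out, dim_out)
        v = squash(s)                                   # output capsule vectors
        b = b + np.sum(u_hat * v[None, :, :], axis=-1)  # agreement: dot(u_hat, v)
    return v, c

rng = np.random.default_rng(0)
u = rng.normal(size=(1152, 8))                      # stand-in PrimaryCaps outputs
W = rng.normal(scale=0.1, size=(1152, 10, 16, 8))   # stand-in transformation matrices
u_hat = np.einsum('ijkl,il->ijk', W, u)             # predictions, shape (1152, 10, 16)

v, c = dynamic_routing(u_hat)
class_scores = np.linalg.norm(v, axis=-1)           # vector length per digit capsule

print(v.shape, int(class_scores.argmax()))
```

Predictions that agree with the current output of a digit capsule raise its logit, so in the next iteration more of that input capsule's vote is routed there; the predicted class would be the digit capsule with the longest output vector.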
EXPERIMENT
▸ Classification on MNIST
▸ Reconstruction on MNIST
▸ Dimension perturbation on MNIST
21
EXPERIMENT 22
▸ This talk introduces the first three
▸ Classification on MNIST (99.75%, conv 99.61%)
▸ Reconstruction on MNIST
▸ Dimension Perturbation on MNIST
▸ Robustness to Affine Transformation (trained on MNIST, tested on affNIST: 79%, conv 66%)
▸ Classification on MultiMNIST (5% error)
▸ Classification on CIFAR-10 (10.6% error, similar to ZFNet)
▸ Classification on SVHN (4.3% error)
Sara Sabour, Nicholas Frosst, Geoffrey E. Hinton. Dynamic Routing Between Capsules. arXiv, October 2017
EXPERIMENT 23
▸ 99.75% test accuracy (conv baseline 99.61%)
1. CLASSIFICATION ON MNIST
Sara Sabour, Nicholas Frosst, Geoffrey E. Hinton. Dynamic Routing Between Capsules. arXiv, October 2017
EXPERIMENT 24
▸ [Figure from the paper]
2. RECONSTRUCTION ON MNIST
Sara Sabour, Nicholas Frosst, Geoffrey E. Hinton. Dynamic Routing Between Capsules. arXiv, October 2017
EXPERIMENT 25
▸ [Figure from the paper]
3. DIMENSION PERTURBATION ON MNIST
Sara Sabour, Nicholas Frosst, Geoffrey E. Hinton. Dynamic Routing Between Capsules. arXiv, October 2017
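The perturbation step itself can be sketched as below. The range [-0.25, 0.25] in steps of 0.05 is the one reported in the paper; the random vector is only a stand-in for a real 16-D DigitCaps output, `perturb_dimension` is an illustrative name, and the decoder that would turn each variant back into an image is not shown.

```python
import numpy as np

def perturb_dimension(capsule, dim, low=-0.25, high=0.25, steps=11):
    """Return `steps` copies of `capsule`, each with one dimension shifted."""
    deltas = np.linspace(low, high, steps)   # -0.25, -0.20, ..., 0.25
    variants = np.tile(capsule, (steps, 1))  # shape (steps, len(capsule))
    variants[:, dim] += deltas
    return variants

rng = np.random.default_rng(0)
digit_capsule = rng.normal(scale=0.1, size=16)  # stand-in for a 16-D DigitCaps output
variants = perturb_dimension(digit_capsule, dim=3)

print(variants.shape)
```

Feeding each row of `variants` to the reconstruction decoder would show what the chosen dimension encodes, for example stroke thickness or skew.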
DISCUSSION
26
_ 27
▸ Capsule (vector), not a conventional neuron (scalar)
NOVELTY
_ 28
▸ Still uses a regular conv layer first for local feature extraction
▸ Can capsules not extract local features themselves?
STILL USES A CONV LAYER
HOW TO CONSTRAIN CAPSULES TO LEARN CERTAIN FEATURES?
▸ Disentangling features
▸ How to obtain ‘certain features’?
ANY Q?
29
REFERENCE
▸ Sara Sabour, Nicholas Frosst, Geoffrey E. Hinton. Dynamic Routing Between Capsules. arXiv, October 2017 (https://arxiv.org/abs/1710.09829)
▸ Geoffrey Hinton et al. Matrix Capsules with EM Routing. Under review as a conference paper at ICLR 2018 (https://openreview.net/pdf?id=HJWLfGWRb)
▸ https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b
▸ https://hackernoon.com/what-is-a-capsnet-or-capsule-network-2bfbe48769cc
▸ https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-use-them-c233a0971952
▸ https://github.com/naturomics/CapsNet-Tensorflow
▸ https://www.zhihu.com/question/67287444/answer/251460831
▸ https://www.reddit.com/r/MachineLearning/comments/2lmo0l/ama_geoffrey_hinton/clyj4jv/
▸ Geoffrey Hinton, "Does the Brain do Inverse Graphics?" (https://www.youtube.com/watch?v=TFIMqt0yT2I&feature=youtu.be)
▸ Geoffrey Hinton, "What is wrong with convolutional neural nets?" (https://www.youtube.com/watch?v=rTawFwUvnLE&t=1214s)
▸ https://www.youtube.com/watch?v=u50nqWMQe1k
30
END OF
DOCUMENT
31

More Related Content

What's hot

Overview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep LearningOverview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep Learning
Khang Pham
 
Cheatsheet deep-learning
Cheatsheet deep-learningCheatsheet deep-learning
Cheatsheet deep-learning
Steve Nouri
 
[PR12] Capsule Networks - Jaejun Yoo
[PR12] Capsule Networks - Jaejun Yoo[PR12] Capsule Networks - Jaejun Yoo
[PR12] Capsule Networks - Jaejun Yoo
JaeJun Yoo
 
Convolutional Neural Network and Its Applications
Convolutional Neural Network and Its ApplicationsConvolutional Neural Network and Its Applications
Convolutional Neural Network and Its Applications
Kasun Chinthaka Piyarathna
 
Training Neural Networks
Training Neural NetworksTraining Neural Networks
Training Neural Networks
Databricks
 
Machine Learning and Inductive Inference
Machine Learning and Inductive InferenceMachine Learning and Inductive Inference
Machine Learning and Inductive Inferencebutest
 
Image classification with Deep Neural Networks
Image classification with Deep Neural NetworksImage classification with Deep Neural Networks
Image classification with Deep Neural Networks
Yogendra Tamang
 
An Introduction to Neural Architecture Search
An Introduction to Neural Architecture SearchAn Introduction to Neural Architecture Search
An Introduction to Neural Architecture Search
Bill Liu
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning Algorithms
Md. Main Uddin Rony
 
Introduction to Deep learning
Introduction to Deep learningIntroduction to Deep learning
Introduction to Deep learning
leopauly
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep Learning
Julien SIMON
 
An introduction to quantum machine learning.pptx
An introduction to quantum machine learning.pptxAn introduction to quantum machine learning.pptx
An introduction to quantum machine learning.pptx
Colleen Farrelly
 
Graph Neural Network - Introduction
Graph Neural Network - IntroductionGraph Neural Network - Introduction
Graph Neural Network - Introduction
Jungwon Kim
 
Unsupervised learning clustering
Unsupervised learning clusteringUnsupervised learning clustering
Unsupervised learning clustering
Arshad Farhad
 
ResNet basics (Deep Residual Network for Image Recognition)
ResNet basics (Deep Residual Network for Image Recognition)ResNet basics (Deep Residual Network for Image Recognition)
ResNet basics (Deep Residual Network for Image Recognition)
Sanjay Saha
 
Attention in Deep Learning
Attention in Deep LearningAttention in Deep Learning
Attention in Deep Learning
健程 杨
 
Recurrent Neural Networks. Part 1: Theory
Recurrent Neural Networks. Part 1: TheoryRecurrent Neural Networks. Part 1: Theory
Recurrent Neural Networks. Part 1: Theory
Andrii Gakhov
 
Deep Learning in Bio-Medical Imaging
Deep Learning in Bio-Medical ImagingDeep Learning in Bio-Medical Imaging
Deep Learning in Bio-Medical Imaging
Joonhyung Lee
 
Quantum computing in machine learning
Quantum computing in machine learningQuantum computing in machine learning
Quantum computing in machine learning
khalidhassan105
 

What's hot (20)

Overview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep LearningOverview on Optimization algorithms in Deep Learning
Overview on Optimization algorithms in Deep Learning
 
Cheatsheet deep-learning
Cheatsheet deep-learningCheatsheet deep-learning
Cheatsheet deep-learning
 
[PR12] Capsule Networks - Jaejun Yoo
[PR12] Capsule Networks - Jaejun Yoo[PR12] Capsule Networks - Jaejun Yoo
[PR12] Capsule Networks - Jaejun Yoo
 
Convolutional Neural Network and Its Applications
Convolutional Neural Network and Its ApplicationsConvolutional Neural Network and Its Applications
Convolutional Neural Network and Its Applications
 
Training Neural Networks
Training Neural NetworksTraining Neural Networks
Training Neural Networks
 
Machine Learning and Inductive Inference
Machine Learning and Inductive InferenceMachine Learning and Inductive Inference
Machine Learning and Inductive Inference
 
Image classification with Deep Neural Networks
Image classification with Deep Neural NetworksImage classification with Deep Neural Networks
Image classification with Deep Neural Networks
 
An Introduction to Neural Architecture Search
An Introduction to Neural Architecture SearchAn Introduction to Neural Architecture Search
An Introduction to Neural Architecture Search
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning Algorithms
 
Introduction to Deep learning
Introduction to Deep learningIntroduction to Deep learning
Introduction to Deep learning
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep Learning
 
An introduction to quantum machine learning.pptx
An introduction to quantum machine learning.pptxAn introduction to quantum machine learning.pptx
An introduction to quantum machine learning.pptx
 
Graph Neural Network - Introduction
Graph Neural Network - IntroductionGraph Neural Network - Introduction
Graph Neural Network - Introduction
 
Unsupervised learning clustering
Unsupervised learning clusteringUnsupervised learning clustering
Unsupervised learning clustering
 
ResNet basics (Deep Residual Network for Image Recognition)
ResNet basics (Deep Residual Network for Image Recognition)ResNet basics (Deep Residual Network for Image Recognition)
ResNet basics (Deep Residual Network for Image Recognition)
 
Attention in Deep Learning
Attention in Deep LearningAttention in Deep Learning
Attention in Deep Learning
 
Recurrent Neural Networks. Part 1: Theory
Recurrent Neural Networks. Part 1: TheoryRecurrent Neural Networks. Part 1: Theory
Recurrent Neural Networks. Part 1: Theory
 
Deep Learning in Bio-Medical Imaging
Deep Learning in Bio-Medical ImagingDeep Learning in Bio-Medical Imaging
Deep Learning in Bio-Medical Imaging
 
Quantum computing in machine learning
Quantum computing in machine learningQuantum computing in machine learning
Quantum computing in machine learning
 
Lecture5 - C4.5
Lecture5 - C4.5Lecture5 - C4.5
Lecture5 - C4.5
 

Similar to capsule network

[PR12] Inception and Xception - Jaejun Yoo
[PR12] Inception and Xception - Jaejun Yoo[PR12] Inception and Xception - Jaejun Yoo
[PR12] Inception and Xception - Jaejun Yoo
JaeJun Yoo
 
캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)
캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)
캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)
NAVER Engineering
 
Illustrative Introductory CNN
Illustrative Introductory CNNIllustrative Introductory CNN
Illustrative Introductory CNN
YasutoTamura1
 
Distributed deep learning
Distributed deep learningDistributed deep learning
Distributed deep learning
Mehdi Shibahara
 
Capsule Networks
Capsule NetworksCapsule Networks
Capsule Networks
Jyoti Prakash Maheswari
 
Distributed Systems Theory for Mere Mortals - Topconf Dusseldorf October 2017
Distributed Systems Theory for Mere Mortals - Topconf Dusseldorf October 2017Distributed Systems Theory for Mere Mortals - Topconf Dusseldorf October 2017
Distributed Systems Theory for Mere Mortals - Topconf Dusseldorf October 2017
Ensar Basri Kahveci
 
Capsules Network Overview
Capsules Network OverviewCapsules Network Overview
Capsules Network Overview
Hiep Pham
 
Deep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with styleDeep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with style
Roelof Pieters
 
Paper Reviews on Visual Attention
Paper Reviews on Visual AttentionPaper Reviews on Visual Attention
Paper Reviews on Visual Attention
민기 정
 
02.cnn - CNN 파헤치기 3탄
02.cnn - CNN 파헤치기 3탄02.cnn - CNN 파헤치기 3탄
02.cnn - CNN 파헤치기 3탄
Jeong-gyu Kim
 
(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...
Jacky Liu
 
Distributed Systems Theory for Mere Mortals - Software Craftsmanship Turkey
Distributed Systems Theory for Mere Mortals - Software Craftsmanship TurkeyDistributed Systems Theory for Mere Mortals - Software Craftsmanship Turkey
Distributed Systems Theory for Mere Mortals - Software Craftsmanship Turkey
Ensar Basri Kahveci
 
Lausanne 2019 #2
Lausanne 2019 #2Lausanne 2019 #2
Lausanne 2019 #2
Arthur Charpentier
 
IRJET-Breast Cancer Detection using Convolution Neural Network
IRJET-Breast Cancer Detection using Convolution Neural NetworkIRJET-Breast Cancer Detection using Convolution Neural Network
IRJET-Breast Cancer Detection using Convolution Neural Network
IRJET Journal
 
Intro. Capsule network
Intro. Capsule networkIntro. Capsule network
Intro. Capsule network
Joowon Moon
 
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al..."Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
Edge AI and Vision Alliance
 
Distance-based bias in model-directed optimization of additively decomposable...
Distance-based bias in model-directed optimization of additively decomposable...Distance-based bias in model-directed optimization of additively decomposable...
Distance-based bias in model-directed optimization of additively decomposable...
Martin Pelikan
 
Accelerating Science with Generative Adversarial Networks
Accelerating Science with Generative Adversarial NetworksAccelerating Science with Generative Adversarial Networks
Accelerating Science with Generative Adversarial Networks
Michela Paganini
 
Artificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part IArtificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part I
Ramez Abdalla, M.Sc
 
Neural Networks and Deep Learning: An Intro
Neural Networks and Deep Learning: An IntroNeural Networks and Deep Learning: An Intro
Neural Networks and Deep Learning: An Intro
Fariz Darari
 

Similar to capsule network (20)

[PR12] Inception and Xception - Jaejun Yoo
[PR12] Inception and Xception - Jaejun Yoo[PR12] Inception and Xception - Jaejun Yoo
[PR12] Inception and Xception - Jaejun Yoo
 
캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)
캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)
캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)
 
Illustrative Introductory CNN
Illustrative Introductory CNNIllustrative Introductory CNN
Illustrative Introductory CNN
 
Distributed deep learning
Distributed deep learningDistributed deep learning
Distributed deep learning
 
Capsule Networks
Capsule NetworksCapsule Networks
Capsule Networks
 
Distributed Systems Theory for Mere Mortals - Topconf Dusseldorf October 2017
Distributed Systems Theory for Mere Mortals - Topconf Dusseldorf October 2017Distributed Systems Theory for Mere Mortals - Topconf Dusseldorf October 2017
Distributed Systems Theory for Mere Mortals - Topconf Dusseldorf October 2017
 
Capsules Network Overview
Capsules Network OverviewCapsules Network Overview
Capsules Network Overview
 
Deep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with styleDeep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with style
 
Paper Reviews on Visual Attention
Paper Reviews on Visual AttentionPaper Reviews on Visual Attention
Paper Reviews on Visual Attention
 
02.cnn - CNN 파헤치기 3탄
02.cnn - CNN 파헤치기 3탄02.cnn - CNN 파헤치기 3탄
02.cnn - CNN 파헤치기 3탄
 
(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...(Research Note) Delving deeper into convolutional neural networks for camera ...
(Research Note) Delving deeper into convolutional neural networks for camera ...
 
Distributed Systems Theory for Mere Mortals - Software Craftsmanship Turkey
Distributed Systems Theory for Mere Mortals - Software Craftsmanship TurkeyDistributed Systems Theory for Mere Mortals - Software Craftsmanship Turkey
Distributed Systems Theory for Mere Mortals - Software Craftsmanship Turkey
 
Lausanne 2019 #2
Lausanne 2019 #2Lausanne 2019 #2
Lausanne 2019 #2
 
IRJET-Breast Cancer Detection using Convolution Neural Network
IRJET-Breast Cancer Detection using Convolution Neural NetworkIRJET-Breast Cancer Detection using Convolution Neural Network
IRJET-Breast Cancer Detection using Convolution Neural Network
 
Intro. Capsule network
Intro. Capsule networkIntro. Capsule network
Intro. Capsule network
 
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al..."Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
 
Distance-based bias in model-directed optimization of additively decomposable...
Distance-based bias in model-directed optimization of additively decomposable...Distance-based bias in model-directed optimization of additively decomposable...
Distance-based bias in model-directed optimization of additively decomposable...
 
Accelerating Science with Generative Adversarial Networks
Accelerating Science with Generative Adversarial NetworksAccelerating Science with Generative Adversarial Networks
Accelerating Science with Generative Adversarial Networks
 
Artificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part IArtificial Intelligence Applications in Petroleum Engineering - Part I
Artificial Intelligence Applications in Petroleum Engineering - Part I
 
Neural Networks and Deep Learning: An Intro
Neural Networks and Deep Learning: An IntroNeural Networks and Deep Learning: An Intro
Neural Networks and Deep Learning: An Intro
 

Recently uploaded

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
eddie19851
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 

Recently uploaded (20)

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 

capsule network

  • 1. Dynamic Routing Between Capsules Sara Sabour, Nicholas Frosst, Geoffrey E Hinton, 10, 2017, Arxiv LAB SEMINAR 1 2017.11.13 SNU DATAMINING CENTER MINKI CHUNG
  • 2. TABLE OF CONTENTS ▸ Intuition ▸ Problems of ConvNet ▸ How brain works, Inverse graphics ▸ Capsule Theory ▸ CapsNet ▸ Capsule ▸ CapsNet architecture ▸ Experiment ▸ Classification on MNIST ▸ Reconstruction on MNIST ▸ Dimension perturbation on MNIST ▸ Discussion 2
  • 3. INTUITION ▸ Problems of ConvNet ▸ How brain works, Inverse graphics ▸ Capsule Theory 3
  • 4. PROBLEMS OF CONVNET 4 ▸ ConvNet Architecture PROBLEMS IS ‘POOLING’ https://hackernoon.com/what-is-a-capsnet-or-capsule-network-2bfbe48769cc Obtain translational, rotational invariance
  • 5. PROBLEMS OF CONVNET 5 ▸ @REDDIT, MACHINE LEARNING https://www.reddit.com/r/MachineLearning/comments/2lmo0l/ama_geoffrey_hinton/clyj4jv/
  • 6. PROBLEMS OF CONVNET 6 ▸ WHAT IS THIS PICTURE? https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-use-them-c233a0971952
  • 7. PROBLEMS OF CONVNET 7 ▸ HOW ABOUT THIS? https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-use-them-c233a0971952
  • 8. PROBLEMS OF CONVNET 8 ▸ NEED EQUIVARIANCE, NOT INVARIANCE https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-use-them-c233a0971952
  • 9. HOW BRAIN WORKS, INVERSE GRAPHICS 9 ▸ Constructing a visual image from some internal hierarchical representation of geometric data ▸ Internal representation is stored in computer’s memory as arrays of geometrical objects and matrices that represent relative positions and orientation of these objects ▸ Special software takes that representation and converts it into an image on the screen. This is called rendering ▸ Brains, in fact, do the opposite of rendering. Hinton calls it inverse graphics: Visual information received by eyes, they deconstruct a hierarchical representation of the world around us and try to match it with already learned patterns and relationships stored in the brain ▸ Key idea is that representation of objects in the brain does not depend on view angle COMPUTER GRAPHICS https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b
  • 10. CAPSULE THEORY ▸ In 3D graphics, relationships between 3D objects can be represented by a so-called pose, which is in essence translation plus rotation ▸ Capsule approach: it incorporates the relative relationships between objects (the internal representation), represented numerically as a 4×4 pose matrix ▸ 'Dynamic routing' (more details later) allows capsules to communicate with each other and create representations similar to scene graphs in computer graphics ▸ You can easily recognize that this is the Statue of Liberty, even though all the images show it from different angles ▸ Source: https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b
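A pose as "translation plus rotation" can be sketched as a homogeneous 4×4 matrix, the usual convention in 3D graphics. The helper `pose` and the face/nose scene below are hypothetical illustrations, not taken from the slides.

```python
import numpy as np

def pose(angle_z, translation):
    """4x4 homogeneous pose: rotation about the z axis (radians) plus a translation."""
    c, s = np.cos(angle_z), np.sin(angle_z)
    m = np.eye(4)
    m[:3, :3] = [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]
    m[:3, 3] = translation
    return m

# Hypothetical scene: a face placed in the world, and a nose placed relative to the face.
world_from_face = pose(np.pi / 2, [1.0, 0.0, 0.0])
face_from_nose = pose(0.0, [0.0, 0.5, 0.0])

# Composing the two poses gives the nose pose in world coordinates,
# which is how a scene graph chains part-whole relationships.
world_from_nose = world_from_face @ face_from_nose
nose_position = world_from_nose[:3, 3]
```

If the viewpoint changes, only `world_from_face` changes. The part-whole relation `face_from_nose` stays the same, which is the viewpoint-independent knowledge the slide refers to.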
  • 11. CAPSULE THEORY ▸ Benefits: ▸ Better understanding of 3D space ▸ Achieves state-of-the-art performance using only a fraction of the data that a CNN would use ▸ To learn to tell digits apart, the human brain needs only a couple of dozen examples, hundreds at most, while a CNN needs tens of thousands of examples ▸ Source: https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b
  • 12. CAPSNET ▸ Capsule ▸ CapsNet architecture ▸ Experiment
  • 13. CAPSULE ▸ Comparison with a traditional neuron ▸ Figure callouts: v (output vector); vector length works like a probability; activation of the next capsule; dynamic routing ▸ Source: https://www.zhihu.com/question/67287444/answer/251460831
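For reference, the neuron versus capsule comparison can be written out as follows. The notation follows the Dynamic Routing Between Capsules paper rather than the slide figure, which is not recoverable from this transcript: u_i is the output of lower-level capsule i, W_ij a learned matrix, c_ij a coupling coefficient set by dynamic routing, and v_j the output of capsule j.

```latex
\begin{aligned}
\text{Neuron (scalar):}\quad & h_j = f\Big(\sum_i w_{ij}\, x_i + b_j\Big) \\
\text{Capsule (vector):}\quad & \hat{\mathbf{u}}_{j|i} = \mathbf{W}_{ij}\,\mathbf{u}_i, \qquad
\mathbf{s}_j = \sum_i c_{ij}\,\hat{\mathbf{u}}_{j|i}, \qquad
\mathbf{v}_j = \frac{\lVert \mathbf{s}_j \rVert^2}{1 + \lVert \mathbf{s}_j \rVert^2}\,\frac{\mathbf{s}_j}{\lVert \mathbf{s}_j \rVert}
\end{aligned}
```

The last expression is the squashing non-linearity: it keeps the direction of s_j and maps its length into the range [0, 1), which is why the length of v_j can be read like a probability.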
  • 14. CAPSNET ARCHITECTURE ▸ Architecture: MNIST → Conv → Caps.Conv (×32, ×8) → Caps.FC (dynamic routing) ▸ Conv: local feature detection ▸ Caps.Conv: 6*6*32 = 1152 capsules, each with 8 properties ▸ Caps.FC: 10 capsules (one per class), each with 16 properties ▸ Deeper means more complex, so the dimension should increase ▸ Source: Sara Sabour, Nicholas Frosst, Geoffrey E Hinton, 10, 2017, Arxiv. Dynamic Routing Between Capsules
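The capsule counts above can be checked with a little shape arithmetic. The kernel sizes and strides (9×9 stride 1 for Conv, 9×9 stride 2 for Caps.Conv, no padding) are taken from the paper and do not appear on the slide, so treat them as assumptions here; `conv_output_size` is an illustrative helper.

```python
def conv_output_size(size, kernel, stride):
    """Spatial output size of a convolution without padding."""
    return (size - kernel) // stride + 1

image = 28                                               # MNIST image side
conv1 = conv_output_size(image, kernel=9, stride=1)      # Conv layer (assumed 9x9, stride 1)
primary = conv_output_size(conv1, kernel=9, stride=2)    # Caps.Conv (assumed 9x9, stride 2)

primary_channels, primary_dim = 32, 8                    # 32 capsule channels, 8 properties each
num_primary_caps = primary * primary * primary_channels

digit_caps, digit_dim = 10, 16                           # one 16-property capsule per class
```

With these settings the spatial size goes 28 → 20 → 6, which gives the 6*6*32 = 1152 capsules quoted on the slide.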
  • 15. CAPSNET ARCHITECTURE ▸ naturomics GitHub: CapsNet-Tensorflow ▸ MNIST → Conv → Caps.Conv (×32, ×8) → Caps.FC (dynamic routing) ▸ Source: https://github.com/naturomics/CapsNet-Tensorflow
  • 16. CAPSNET ARCHITECTURE ▸ Caps.Conv (PrimaryCaps) ▸ Place-coded capsules ▸ Concatenate (= 8 different regular conv layers) ▸ Treat each position in each of the 32 feature maps as a capsule (6*6*32 = 1152 capsules with 8 properties) ▸ Figure callout: direction ▸ Source: https://github.com/naturomics/CapsNet-Tensorflow
  • 17. CAPSNET ARCHITECTURE ▸ Caps.Conv (PrimaryCaps) ▸ Place-coded capsules ▸ Concatenate (= 8 different regular conv layers) ▸ Treat each position in each of the 32 feature maps as a capsule (6*6*32 = 1152 capsules with 8 properties) ▸ Apply the squashing function at the end ▸ Source: https://github.com/naturomics/CapsNet-Tensorflow
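A minimal NumPy sketch of the PrimaryCaps step: reshape the conv output into 1152 capsules of 8 properties, then squash each one. The random array stands in for a real conv output, the grouping of 8 consecutive channels into one capsule is just one possible layout, and the small `eps` is an added numerical-stability assumption.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    """Squashing non-linearity: keeps direction, maps vector length into [0, 1)."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    scale = sq_norm / (1.0 + sq_norm)
    return scale * s / np.sqrt(sq_norm + eps)

rng = np.random.default_rng(0)
conv_out = rng.normal(size=(6, 6, 32 * 8))   # stand-in for the Caps.Conv output
capsules = conv_out.reshape(-1, 8)           # 6*6*32 = 1152 capsules, 8 properties each

v = squash(capsules)
lengths = np.linalg.norm(v, axis=-1)         # every length is below 1

# Worked value: a vector of length 5 is squashed to length 25/26.
example_length = np.linalg.norm(squash(np.array([3.0, 4.0])))
```

Short vectors shrink toward zero and long vectors approach unit length, so the length can be read as a "how likely is this entity present" score.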
  • 18. CAPSNET ARCHITECTURE ▸ Caps.FC (DigitCaps) ▸ Rate-coded capsules ▸ Number of capsules: 1152 → 10 ▸ Vector dimension: 8 → 16 ▸ Dynamic routing ▸ Source: https://github.com/naturomics/CapsNet-Tensorflow
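The 1152 → 10 capsule and 8 → 16 dimension change can be sketched as one learned matrix per (input capsule, output capsule) pair, as in the paper. The random values below are placeholders for trained weights and real PrimaryCaps outputs.

```python
import numpy as np

rng = np.random.default_rng(0)
num_in, dim_in = 1152, 8      # PrimaryCaps: 1152 capsules, 8 properties
num_out, dim_out = 10, 16     # DigitCaps: 10 capsules, 16 properties

u = rng.normal(size=(num_in, dim_in))                            # stand-in PrimaryCaps outputs
W = 0.01 * rng.normal(size=(num_in, num_out, dim_out, dim_in))   # one 16x8 matrix per (i, j)

# Prediction vectors: u_hat[i, j] = W[i, j] @ u[i]
u_hat = np.einsum("ijkl,il->ijk", W, u)
```

Each lower-level capsule thus makes a 16-dimensional "prediction" for every class capsule, and dynamic routing then decides how much weight each prediction receives.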
  • 19. CAPSNET ARCHITECTURE ▸ Caps.FC (DigitCaps) ▸ Dynamic routing ▸ Top-down feedback: if a relation exists, the coupling coefficients increase ▸ Routing by agreement: if multiple predictions agree, the higher-level capsule becomes active ▸ Works like attention ▸ Vector length works like a probability (activation of the next capsule) ▸ Source: https://github.com/naturomics/CapsNet-Tensorflow
  • 20. CAPSNET ARCHITECTURE ▸ Caps.FC (DigitCaps) ▸ Dynamic routing: 3 iterations will do ▸ Source: https://github.com/naturomics/CapsNet-Tensorflow
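Routing by agreement with 3 iterations can be sketched in NumPy as below. This follows the routing procedure described in the paper in simplified, single-example form. The random prediction vectors are placeholders, and the function names are illustrative rather than those of the naturomics implementation.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    """Keeps direction, maps vector length into [0, 1)."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def softmax(x, axis):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def dynamic_routing(u_hat, num_iterations=3):
    """u_hat: (num_in, num_out, dim_out) prediction vectors. Returns (v, c)."""
    num_in, num_out, _ = u_hat.shape
    b = np.zeros((num_in, num_out))                 # routing logits, start uniform
    for _ in range(num_iterations):
        c = softmax(b, axis=1)                      # coupling coefficients per input capsule
        s = np.einsum("ij,ijk->jk", c, u_hat)       # weighted sum of predictions
        v = squash(s)                               # output capsules, shape (num_out, dim_out)
        b = b + np.einsum("ijk,jk->ij", u_hat, v)   # agreement: dot(prediction, output)
    return v, c

rng = np.random.default_rng(0)
u_hat = rng.normal(scale=0.1, size=(1152, 10, 16))  # stand-in prediction vectors
v, c = dynamic_routing(u_hat)
```

Predictions that point the same way as an output capsule get larger coupling coefficients in the next iteration, which is the top-down feedback described on the slides.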
  • 21. EXPERIMENT ▸ Classification on MNIST ▸ Reconstruction on MNIST ▸ Dimension perturbation on MNIST
  • 22. EXPERIMENT ▸ This talk introduces the first three: ▸ Classification on MNIST (99.75% accuracy; conv baseline 99.61%) ▸ Reconstruction on MNIST ▸ Dimension perturbation on MNIST ▸ Robustness to affine transformation on MNIST (79% accuracy; conv baseline 66%) ▸ Classification on MultiMNIST (5% error) ▸ Classification on CIFAR-10 (10.6% error, roughly on par with ZFNet) ▸ Classification on SVHN (4.3% error)
  • 23. EXPERIMENT ▸ 1. Classification on MNIST ▸ 99.75% accuracy (conv baseline 99.61%)
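Since vector length works like a probability, classification amounts to picking the class capsule with the longest output vector. A tiny sketch with made-up capsule values (not real network outputs):

```python
import numpy as np

# Stand-in DigitCaps output: 10 class capsules, 16 properties each (made-up values).
digit_caps = np.full((10, 16), 0.01)
digit_caps[3] = 0.2                              # make the capsule for class 3 the longest

lengths = np.linalg.norm(digit_caps, axis=-1)    # one probability-like length per class
predicted_class = int(np.argmax(lengths))
```

Here the capsule for class 3 has length 0.8 and all others have length 0.04, so the predicted digit is 3.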
  • 24. EXPERIMENT ▸ 2. Reconstruction on MNIST
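The reconstruction experiment can be sketched as a small decoder that sees only the capsule of the target class. The layer sizes (512, 1024, 784) and the masking step follow the paper, not the slide. The weights here are random and untrained, so the output is only shape-correct, not a meaningful image.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
digit_caps = rng.normal(scale=0.1, size=(10, 16))   # stand-in DigitCaps output
target = 4                                          # class whose capsule is kept

# Mask out every capsule except the target, then flatten to 160 values.
mask = np.zeros((10, 1))
mask[target] = 1.0
decoder_in = (digit_caps * mask).reshape(-1)

# Untrained placeholder weights for three fully connected layers.
sizes = [160, 512, 1024, 784]
weights = [rng.normal(scale=0.05, size=(m, n)) for m, n in zip(sizes[:-1], sizes[1:])]

h = relu(decoder_in @ weights[0])
h = relu(h @ weights[1])
reconstruction = sigmoid(h @ weights[2]).reshape(28, 28)   # pixel intensities in (0, 1)
```

Because only 16 numbers pass through the mask, a good reconstruction implies that those 16 properties encode how the digit is drawn, not just which digit it is.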
  • 25. EXPERIMENT ▸ 3. Dimension perturbation on MNIST
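Dimension perturbation can be sketched as follows: take one 16-dimensional class capsule, vary a single dimension over a small range, and decode each variant to see what that dimension controls. The range [-0.25, 0.25] in steps of 0.05 follows the paper. The capsule values and the chosen dimension below are arbitrary placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)
capsule = rng.normal(scale=0.1, size=16)     # stand-in 16-D DigitCaps vector
dim = 5                                      # dimension to perturb (arbitrary choice)

offsets = np.linspace(-0.25, 0.25, 11)       # -0.25, -0.20, ..., 0.25
perturbed = np.tile(capsule, (len(offsets), 1))
perturbed[:, dim] += offsets                 # only one property changes per row
```

Each of the 11 rows would then be passed to the reconstruction decoder. In the paper, individual dimensions turn out to correspond to properties such as stroke thickness, skew, and width.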
  • 27. DISCUSSION ▸ Novelty: capsule (vector), not a conventional neuron (scalar)
  • 28. DISCUSSION ▸ Still uses a conv layer: a regular conv layer is still used first for local feature extraction ▸ Can a capsule not extract local features? ▸ How to restrict a capsule to capture a certain feature? ▸ Disentangling features ▸ How to obtain 'certain features'?
  • 30. REFERENCE ▸ Sara Sabour, Nicholas Frosst, Geoffrey E Hinton, 10, 2017, Arxiv. Dynamic Routing Between Capsules (https://arxiv.org/abs/1710.09829) ▸ Geoffrey Hinton et al., Matrix Capsules With EM Routing, Under review as a conference paper at ICLR 2018 (https://openreview.net/pdf?id=HJWLfGWRb) ▸ https://medium.com/@pechyonkin/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b ▸ https://hackernoon.com/what-is-a-capsnet-or-capsule-network-2bfbe48769cc ▸ https://hackernoon.com/capsule-networks-are-shaking-up-ai-heres-how-to-use-them-c233a0971952 ▸ https://github.com/naturomics/CapsNet-Tensorflow ▸ https://www.zhihu.com/question/67287444/answer/251460831 ▸ https://www.reddit.com/r/MachineLearning/comments/2lmo0l/ama_geoffrey_hinton/clyj4jv/ ▸ Geoffrey Hinton: "Does the Brain do Inverse Graphics?" (https://www.youtube.com/watch?v=TFIMqt0yT2I&feature=youtu.be) ▸ Geoffrey Hinton talk: "What is wrong with convolutional neural nets?" (https://www.youtube.com/watch?v=rTawFwUvnLE&t=1214s) ▸ https://www.youtube.com/watch?v=u50nqWMQe1k