Brief intro : Invariance and Equivariance

•Download as PPTX, PDF•

12 likes•5,949 views

홍

요즘 Image관련 Deep learning 관련 논문에서 많이 나오는 용어인 Invariance와 Equivariance의 차이를 알기쉽게 설명하는 자료를 만들어봤습니다. Image의 Transformation에 대해 Equivariant한 feature를 만들기 위하여 제안된 Group equivariant Convolutional. Neural Networks 와 Capsule Nets에 대하여 설명

Technology

Brief intro : Invariance and Equivariance

CovNet are translational Equivalent
This demonstrates LeNet-5's invariance to small rotations (+/-40 degrees).
How about Rotation ?
Limitation of Conventional CovNet

2D convolution is equivariant under translation, but not under rotation
Limitation of Conventional CovNet

Invariance
Φ
Image(X)
Feature(Z) Z1 = Z = Z2
𝑇𝑔
1
Mapping
ft’n(Φ(·))
Φ
Transformation
X1 X2
Z = Z1 = Φ(X1) = Z2 = Φ(X2) = Φ(𝑻 𝒈
𝟏
X1 )
: Mapping independent of transformation, 𝑇𝑔, for all 𝑇𝑔
X2 = 𝑇𝑔
1
X1

To make a Convolutional Neural Networks (CNN) transformation-
invariant, data augmentation with training samples is generally used
Invariance

Equivariance
Φ
Image(X)
Feature(Z) Z1 Z2
𝑇𝑔
2
𝑇𝑔
1
Φ
Transformation
X1 X2
Z2 = 𝑻 𝒈
𝟐
Z1 = 𝑻 𝒈
𝟐
Φ(X1) = Φ(𝑻 𝒈
𝟏
X1 )
: Invariance is special case of equivariance where 𝑇𝑔
2 is the identity.
X2 = 𝑇𝑔
1
X1
Z2 = 𝑇𝑔
2
Z1
: Mapping preserves algebraic structure of transformation
Z1 ≠ Z2 but keeps the relationship
Mapping
ft’n(Φ(·))

Equivariance : Group CovNet
To understand the rotation or proportion change of a given entity, a
group of filters(a combination of rotated and mirror reflected versions of
filter) is adopted.
For example, the group p4 which contains translations and rotations by
multiples of ninety degrees, or, which additionally contains mirror
reflections.
: Rotation
: Mirror reflections

A filter in a G-CNN detects co-occurrences of features that have the
preferred relative pose, and can match such a feature constellation in
every global pose through an operation called the G-convolution.
Equivariance : Group CovNet
Filter group 1
Filter group 2
Filter group N

Visualization of classic 2D convolution
Visualization of the G-Conv for the roto-translation group
G-Convolution
Equivariance : Group CovNet

G-convolution is equivariant under rotation
G-Convolution
Equivariance : Group CovNet

Equivariance : Group CovNet
Latent representations learnt by a CNN and a G-CNN.
- The left part is the result of a typical CNN while the right one is that of a G-
CNN.
- In both parts, the outer cycles consist of the rotated images while the inner
cycles consist of the learnt representations.
- Features produced by a G-CNN is equivariant to rotation while that produced
by a typical CNN is not.

What we need : EQUIVARIANCE (not invariance)
“Equivariance makes a CNN understand the rotation or proportion change”
Equivariance : Capsule Net

“A capsule is a group of neurons whose activity vector represents
the instantiation parameters of a specific type of entity such as an
object or an object part.”
Equivariance : Capsule Net

Equivariance of Capsules
“A capsule is a group of neurons whose activity vector represents the
instantiation parameters of a specific type of entity such as an object or
an object part.”
Activity vector map Object
Equivariance : Capsule Net

What's hot

最近のDeep Learning (NLP) 界隈におけるAttention事情Yuta Kikuchi

[DL輪読会]Wasserstein GAN/Towards Principled Methods for Training Generative Adv...Deep Learning JP

Skip Connection まとめ（Neural Network）Yamato OKAMOTO

[DL輪読会]Model-Based Reinforcement Learning via Meta-Policy OptimizationDeep Learning JP

[DL輪読会]Temporal Abstraction in NeurIPS2019Deep Learning JP

Sliced Wasserstein Distance for Learning Gaussian Mixture ModelsFujimoto Keisuke

2019年度チュートリアルBPE広樹本間

[DL輪読会]陰関数微分を用いた深層学習Deep Learning JP

[DL輪読会]Revisiting Deep Learning Models for Tabular Data (NeurIPS 2021) 表形式デー...Deep Learning JP

Sliced Wasserstein距離と生成モデルohken

Noisy Labels と戦う深層学習Plot Hong

劣モジュラ最適化と機械学習1章Hakky St

Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料Yusuke Uchida

[論文紹介] LSTM (LONG SHORT-TERM MEMORY)Tomoyuki Hioki

【DL輪読会】High-Resolution Image Synthesis with Latent Diffusion ModelsDeep Learning JP

ResNetの仕組みKota Nagasato

Dimensionality reduction with t-SNE(Rtsne) and UMAP(uwot) using R packages. Satoshi Kato

VAEs for multimodal disentanglementAntonio Tejero de Pablos

[DL輪読会]Learning Transferable Visual Models From Natural Language SupervisionDeep Learning JP

画像生成・生成モデルメタサーベイcvpaper. challenge

What's hot (20)

最近のDeep Learning (NLP) 界隈におけるAttention事情

[DL輪読会]Wasserstein GAN/Towards Principled Methods for Training Generative Adv...

Skip Connection まとめ（Neural Network）

[DL輪読会]Model-Based Reinforcement Learning via Meta-Policy Optimization

[DL輪読会]Temporal Abstraction in NeurIPS2019

Sliced Wasserstein Distance for Learning Gaussian Mixture Models

2019年度チュートリアルBPE

[DL輪読会]陰関数微分を用いた深層学習

[DL輪読会]Revisiting Deep Learning Models for Tabular Data (NeurIPS 2021) 表形式デー...

Sliced Wasserstein距離と生成モデル

Noisy Labels と戦う深層学習

劣モジュラ最適化と機械学習1章

Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料

[論文紹介] LSTM (LONG SHORT-TERM MEMORY)

【DL輪読会】High-Resolution Image Synthesis with Latent Diffusion Models

ResNetの仕組み

Dimensionality reduction with t-SNE(Rtsne) and UMAP(uwot) using R packages.

VAEs for multimodal disentanglement

[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision

画像生成・生成モデルメタサーベイ

Similar to Brief intro : Invariance and Equivariance

Applied Deep Learning 11/03 Convolutional Neural NetworksMark Chang

ARCHITECTURAL CONDITIONING FOR DISENTANGLEMENT OF OBJECT IDENTITY AND POSTURE...홍배 김

Frequency Analysis using Z Transform.pptxDrPVIngole

Z trasnform & Inverse Z-transform in matlabHasnain Yaseen

Unit iiChetan Selukar

Lossless image compression via by lifting schemeSubhashini Subramanian

Oleksandr Obiednikov “Affine transforms and how CNN lives with them”Lviv Startup Club

Convolutional Neural Networks (CNN)Gaurav Mittal

Design of IIR filtersop205

From RNN to neural networks for cyclic undirected graphstuxette

Digital signal processing part2Vaagdevi College of Engineering

Image transforms 2Ali Baig

Z transfrm pptSWATI MISHRA

PPT Image Analysis(IRDE, DRDO)Nidhi Gopal

Dsp U Lec06 The Z Transform And Its Applicationtaha25

Review-image-segmentation-by-deep-learningTrong-An Bui

Discrete control2cairo university

Z transforms and their applicationsRam Kumar K R

Z transform Day 1vijayanand Kandaswamy

Lec11.pptssuser637f3e1

Similar to Brief intro : Invariance and Equivariance (20)

Applied Deep Learning 11/03 Convolutional Neural Networks

ARCHITECTURAL CONDITIONING FOR DISENTANGLEMENT OF OBJECT IDENTITY AND POSTURE...

Frequency Analysis using Z Transform.pptx

Z trasnform & Inverse Z-transform in matlab

Unit ii

Lossless image compression via by lifting scheme

Oleksandr Obiednikov “Affine transforms and how CNN lives with them”

Convolutional Neural Networks (CNN)

Design of IIR filters

From RNN to neural networks for cyclic undirected graphs

Digital signal processing part2

Image transforms 2

Z transfrm ppt

PPT Image Analysis(IRDE, DRDO)

Dsp U Lec06 The Z Transform And Its Application

Review-image-segmentation-by-deep-learning

Discrete control2

Z transforms and their applications

Z transform Day 1

Lec11.ppt

Recently uploaded

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

How to convert PDF to text with Nanonetsnaman860154

The transition to renewables in India.pdfCompetition Advisory Services (India) LLP

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

Install Stable Diffusion in windows machinePadma Pradeep

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited

Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4

SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh

Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group

Understanding the Laravel MVC ArchitecturePixlogix Infotech

Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies

AI as an Interface for Commercial BuildingsMemoori

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Recently uploaded (20)

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...

Injustice - Developers Among Us (SciFiDevCon 2024)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

How to convert PDF to text with Nanonets

The transition to renewables in India.pdf

Presentation on how to chat with PDF using ChatGPT code interpreter

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

Next-generation AAM aircraft unveiled by Supernal, S-A2

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

Install Stable Diffusion in windows machine

Human Factors of XR: Using Human Factors to Design XR Systems

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365

Azure Monitor & Application Insight to monitor Infrastructure & Application

SIEMENS: RAPUNZEL – A Tale About Knowledge Graph

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi

Snow Chain-Integrated Tire for a Safe Drive on Winter Roads

Understanding the Laravel MVC Architecture

Benefits Of Flutter Compared To Other Frameworks

AI as an Interface for Commercial Buildings

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

Brief intro : Invariance and Equivariance

1. Brief intro : Invariance and Equivariance

2. CovNet are translational Equivalent This demonstrates LeNet-5's invariance to small rotations (+/-40 degrees). How about Rotation ? Limitation of Conventional CovNet

3. 2D convolution is equivariant under translation, but not under rotation Limitation of Conventional CovNet

4. Invariance Φ Image(X) Feature(Z) Z1 = Z = Z2 𝑇𝑔 1 Mapping ft’n(Φ(·)) Φ Transformation X1 X2 Z = Z1 = Φ(X1) = Z2 = Φ(X2) = Φ(𝑻 𝒈 𝟏 X1 ) : Mapping independent of transformation, 𝑇𝑔, for all 𝑇𝑔 X2 = 𝑇𝑔 1 X1

5. To make a Convolutional Neural Networks (CNN) transformation- invariant, data augmentation with training samples is generally used Invariance

6. Equivariance Φ Image(X) Feature(Z) Z1 Z2 𝑇𝑔 2 𝑇𝑔 1 Φ Transformation X1 X2 Z2 = 𝑻 𝒈 𝟐 Z1 = 𝑻 𝒈 𝟐 Φ(X1) = Φ(𝑻 𝒈 𝟏 X1 ) : Invariance is special case of equivariance where 𝑇𝑔 2 is the identity. X2 = 𝑇𝑔 1 X1 Z2 = 𝑇𝑔 2 Z1 : Mapping preserves algebraic structure of transformation Z1 ≠ Z2 but keeps the relationship Mapping ft’n(Φ(·))

7. Equivariance : Group CovNet To understand the rotation or proportion change of a given entity, a group of filters(a combination of rotated and mirror reflected versions of filter) is adopted. For example, the group p4 which contains translations and rotations by multiples of ninety degrees, or, which additionally contains mirror reflections. : Rotation : Mirror reflections

8. A filter in a G-CNN detects co-occurrences of features that have the preferred relative pose, and can match such a feature constellation in every global pose through an operation called the G-convolution. Equivariance : Group CovNet Filter group 1 Filter group 2 Filter group N

9. Visualization of classic 2D convolution Visualization of the G-Conv for the roto-translation group G-Convolution Equivariance : Group CovNet

10. G-convolution is equivariant under rotation G-Convolution Equivariance : Group CovNet

11. Equivariance : Group CovNet Latent representations learnt by a CNN and a G-CNN. - The left part is the result of a typical CNN while the right one is that of a G- CNN. - In both parts, the outer cycles consist of the rotated images while the inner cycles consist of the learnt representations. - Features produced by a G-CNN is equivariant to rotation while that produced by a typical CNN is not.

12. What we need : EQUIVARIANCE (not invariance) “Equivariance makes a CNN understand the rotation or proportion change” Equivariance : Capsule Net

13. “A capsule is a group of neurons whose activity vector represents the instantiation parameters of a specific type of entity such as an object or an object part.” Equivariance : Capsule Net

14. Equivariance of Capsules “A capsule is a group of neurons whose activity vector represents the instantiation parameters of a specific type of entity such as an object or an object part.” Activity vector map Object Equivariance : Capsule Net

Brief intro : Invariance and Equivariance

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Brief intro : Invariance and Equivariance

Similar to Brief intro : Invariance and Equivariance (20)

More from 홍배 김

More from 홍배 김 (20)

Recently uploaded

Recently uploaded (20)

Brief intro : Invariance and Equivariance