The Origin of Grad-CAM

•

0 likes•137 views

Shintaro Yoshida

Grad-CAM is most famous XAI Method in the world.

Engineering

The Origin of
Grad-CAM
AI Study Meeting #4 @Eaglys on 2020/10/25
Shintaro Yoshida
@sht_47

The Features of Grad-CAM
● Grad-CAM(Gradient-weighted Class Activation Mapping, 2016, Ramprasaath)
○ Most Famous Method in XAI ( I described the reason in later slide)
○ Update CAM(2015, Zhou) 、Generalize to Any Kind of CNN Architecture
● The Goal of XAI(Explainable Artificial Intelligence)
Identify the Mode of Failure (AI << Human)
Predict with more Confidence (AI ≒ Human)
AI teaches Human (AI >> Human)

The Content
- The referred Paper of Grad-CAM
-
-
- Grad-CAMのモデル中身
- Result and Discussion
- Implement with Pytorch and Google Colaboratory

NIN(Network In Network, 2014 Lin et al)
- Proficient Paper because of two great ideas
Introduce 1x1 Conv to reduce the calculation cost
( Applied to InceptionNet、ResNet Botttleneck Block)
Introduce GAP(Global Average Pooling)
→ Recently Adaptive Average Pooling is used
● GAP
Performed as a Structural Regularizer
○ More Native to the correspondence between Feature Map and Category
○ NO Added Parameter
○ Robust to Spatial Translation

Object Detectors Emerge In Deep Scene Cnns(2015 Zhou et al)
- CNN Model Scene Recognition → Object Detector Emerges
No Supervised Dataset of Object Classification and Detection
In Previous Research, Object Classification → Object Localization
Places Database (2014 Zhou et al )

CAM(Class Activation Mapping 2015 Zhou et al)
…
…
Final
Conv
GAP FC
K Featuer Maps K Element
…
C class
a
a
1
Generate CAM
Using

CAM(Class Activation Mapping)
…
…
Final
Conv
GAP FC
4096 Feature Maps 4096 Element
…
1000 Class
VGG16
(ImageNet)
7
7

Math Equation and Concept of CAM
Sum with
i, j
Weighted
Sum with k
Each Process is Independent
Z is size of Feature Map (Z=49)

Usage of CAM( After Inference)
Average
With i, j
(Image Source : Zhou et al 2015)
CAMWeighted
Sum with k
Inference Generate
CAM
Weighted
Sum with k

Guided Back-Propagation(2015 Springenberg)
- Deconvolutional Network (2011 Zeiler)
Opposite Process of Max Pooling
- Guided Backprop
Combine with DeconvNet and
ReLU BackPropagation

Result of Guided-Backprop
Batch Size : 64 Learning Rate : 0.01
Weight Decay : 0.001 Optimizer : SGD
Conv6
Conv9

Grad-CAM(2016 Ramprasaath)
CAM limits with GAP → Grad-CAM generalize to Any Architecture
Combine CAM(Corase) with Guided-Backprop(Fined-Grained)
Insert ReLU to CAM(Only Positive Value is enough)
No need to Architectural Change and Re-Train
Sum with
i and j
Weighted
Sum with
Weighted
Sum with

Result 1 of Grad-CAM
- Microsoft COCO
Dataset
- Sample from
Validation Dataset
- Mistake with
Ice Cream

Result 2 of Grad-CAM
Mistake at VGG@ImageNet Whether the model has bias or not

Implement
- Pytorch 1.6
https://github.com/sht47/grad-cam-Pytorch1.6
- Tensorflow 2.3 (Under Construction)
https://github.com/sht47/grad-cam-Tensorflow2.3

What's hot

動画認識サーベイv1（メタサーベイ）cvpaper. challenge

【チュートリアル】コンピュータビジョンによる動画認識 v2Hirokatsu Kataoka

Graph neural networks overviewRodion Kiryukhin

AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)Fellowship at Vodafone FutureLab

[DL輪読会]Swin Transformer: Hierarchical Vision Transformer using Shifted WindowsDeep Learning JP

SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用SSII

文献紹介：Swin Transformer: Hierarchical Vision Transformer Using Shifted WindowsToru Tamaki

Deep Learning in Bio-Medical ImagingJoonhyung Lee

Batch normalization effectiveness_20190206Masakazu Shinoda

[DL輪読会]ドメイン転移と不変表現に関するサーベイDeep Learning JP

Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料Yusuke Uchida

第52回SWO研究会チュートリアル資料Takanori Ugai

Introduction to Generative Adversarial Networks (GANs)Appsilon Data Science

SSII2022 [SS1] ニューラル3D表現の最新動向〜ニューラルネットでなんでも表せる？？〜SSII

ゼロから作るDeepLearning 5章輪読KCS Keio Computer Society

論文紹介 Semi-supervised Learning with Deep Generative ModelsSeiya Tokui

[DL輪読会]Vision Transformer with Deformable Attention （Deformable Attention Tra...Deep Learning JP

Domain Adaptation 発展と動向まとめ（サーベイ資料）Yamato OKAMOTO

Faster R-CNN: Towards real-time object detection with region proposal network...Universitat Politècnica de Catalunya

【DL輪読会】Segment AnythingDeep Learning JP

What's hot (20)

動画認識サーベイv1（メタサーベイ）

【チュートリアル】コンピュータビジョンによる動画認識 v2

Graph neural networks overview

AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)

[DL輪読会]Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用

文献紹介：Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows

Deep Learning in Bio-Medical Imaging

Batch normalization effectiveness_20190206

[DL輪読会]ドメイン転移と不変表現に関するサーベイ

Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料

第52回SWO研究会チュートリアル資料

Introduction to Generative Adversarial Networks (GANs)

SSII2022 [SS1] ニューラル3D表現の最新動向〜ニューラルネットでなんでも表せる？？〜

ゼロから作るDeepLearning 5章輪読

論文紹介 Semi-supervised Learning with Deep Generative Models

[DL輪読会]Vision Transformer with Deformable Attention （Deformable Attention Tra...

Domain Adaptation 発展と動向まとめ（サーベイ資料）

Faster R-CNN: Towards real-time object detection with region proposal network...

【DL輪読会】Segment Anything

Similar to The Origin of Grad-CAM

Large-scale Recommendation Systems on Just a PCAapo Kyrölä

# Can we trust ai. the dilemma of model adjustmentTerence Huang

Point-GNN: Graph Neural Network for 3D Object Detection in a Point CloudNuwan Sriyantha Bandara

Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...Universitat Politècnica de Catalunya

Deep image retrieval learning global representations for image searchUniversitat Politècnica de Catalunya

D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

Lec07 aggregation-and-retrieval-systemUnited States Air Force Academy

Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)Matthias Trapp

DiscoGANIl Gu Yi

PointNetPetteriTeikariPhD

Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI

The next generation of the Montage image mosaic engineG. Bruce Berriman

Deep image retrieval - learning global representations for image search - ub ...Universitat de Barcelona

ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph AnalysisJason Riedy

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

The Knowledge Graph Conference 2022 - Bo Wu's PresentationKatana Graph

Lec16 subspace optimizationUnited States Air Force Academy

Performance evaluation of GANs in a semisupervised OCR use caseFlorian Wilhelm

Performance evaluation of GANs in a semisupervised OCR use caseinovex GmbH

Content Based Image Retrieval (CBIR)Behzad Shomali

Similar to The Origin of Grad-CAM (20)

Large-scale Recommendation Systems on Just a PC

# Can we trust ai. the dilemma of model adjustment

Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud

Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...

Deep image retrieval learning global representations for image search

D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)

Lec07 aggregation-and-retrieval-system

Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)

DiscoGAN

PointNet

Semantic Segmentation on Satellite Imagery

The next generation of the Montage image mosaic engine

Deep image retrieval - learning global representations for image search - ub ...

ICIAM 2019: A New Algorithm Model for Massive-Scale Streaming Graph Analysis

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)

The Knowledge Graph Conference 2022 - Bo Wu's Presentation

Lec16 subspace optimization

Performance evaluation of GANs in a semisupervised OCR use case

Content Based Image Retrieval (CBIR)

Recently uploaded

Roadmap to Membership of RICS - Pathways and RoutesM Maged Hegazy, LLM, MBA, CCP, P3O

(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat

Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia

College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEslot gacor bisa pakai pulsa

247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile

HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat

Introduction to IEEE STANDARDS and its different types.pptxupamatechverse

UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan

VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor

Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona

(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat

Introduction and different types of Ethernet.pptxupamatechverse

IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95

Recently uploaded (20)

Roadmap to Membership of RICS - Pathways and Routes

(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...

Software Development Life Cycle By Team Orange (Dept. of Pharmacy)

College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE

247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik

HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS

Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...

HARMONY IN THE NATURE AND EXISTENCE - Unit-IV

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...

Introduction to IEEE STANDARDS and its different types.pptx

UNIT-III FMM. DIMENSIONAL ANALYSIS

VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130

Processing & Properties of Floor and Wall Tiles.pptx

(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts

Introduction and different types of Ethernet.pptx

IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...

The Origin of Grad-CAM

1. The Origin of Grad-CAM AI Study Meeting #4 @Eaglys on 2020/10/25 Shintaro Yoshida @sht_47

2. The Features of Grad-CAM ● Grad-CAM(Gradient-weighted Class Activation Mapping, 2016, Ramprasaath) ○ Most Famous Method in XAI ( I described the reason in later slide) ○ Update CAM(2015, Zhou) 、Generalize to Any Kind of CNN Architecture ● The Goal of XAI(Explainable Artificial Intelligence) Identify the Mode of Failure (AI << Human) Predict with more Confidence (AI ≒ Human) AI teaches Human (AI >> Human)

3. The Content - The referred Paper of Grad-CAM - - - Grad-CAMのモデル中身 - Result and Discussion - Implement with Pytorch and Google Colaboratory

4. NIN(Network In Network, 2014 Lin et al) - Proficient Paper because of two great ideas Introduce 1x1 Conv to reduce the calculation cost ( Applied to InceptionNet、ResNet Botttleneck Block) Introduce GAP(Global Average Pooling) → Recently Adaptive Average Pooling is used ● GAP Performed as a Structural Regularizer ○ More Native to the correspondence between Feature Map and Category ○ NO Added Parameter ○ Robust to Spatial Translation

5. Object Detectors Emerge In Deep Scene Cnns(2015 Zhou et al) - CNN Model Scene Recognition → Object Detector Emerges No Supervised Dataset of Object Classification and Detection In Previous Research, Object Classification → Object Localization Places Database (2014 Zhou et al )

6. CAM(Class Activation Mapping 2015 Zhou et al) … … Final Conv GAP FC K Featuer Maps K Element … C class a a 1 Generate CAM Using

7. CAM(Class Activation Mapping) … … Final Conv GAP FC 4096 Feature Maps 4096 Element … 1000 Class VGG16 (ImageNet) 7 7

8. Math Equation and Concept of CAM Sum with i, j Weighted Sum with k Each Process is Independent Z is size of Feature Map (Z=49)

9. Usage of CAM( After Inference) Average With i, j (Image Source : Zhou et al 2015) CAMWeighted Sum with k Inference Generate CAM Weighted Sum with k

10. Guided Back-Propagation(2015 Springenberg) - Deconvolutional Network (2011 Zeiler) Opposite Process of Max Pooling - Guided Backprop Combine with DeconvNet and ReLU BackPropagation

11. Result of Guided-Backprop Batch Size : 64 Learning Rate : 0.01 Weight Decay : 0.001 Optimizer : SGD Conv6 Conv9

12. Grad-CAM(2016 Ramprasaath) CAM limits with GAP → Grad-CAM generalize to Any Architecture Combine CAM(Corase) with Guided-Backprop(Fined-Grained) Insert ReLU to CAM(Only Positive Value is enough) No need to Architectural Change and Re-Train Sum with i and j Weighted Sum with Weighted Sum with

13. Result 1 of Grad-CAM - Microsoft COCO Dataset - Sample from Validation Dataset - Mistake with Ice Cream

14. Result 2 of Grad-CAM Mistake at VGG@ImageNet Whether the model has bias or not

15. Implement - Pytorch 1.6 https://github.com/sht47/grad-cam-Pytorch1.6 - Tensorflow 2.3 (Under Construction) https://github.com/sht47/grad-cam-Tensorflow2.3

The Origin of Grad-CAM

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to The Origin of Grad-CAM

Similar to The Origin of Grad-CAM (20)

Recently uploaded

Recently uploaded (20)

The Origin of Grad-CAM