SlideShare a Scribd company logo
1 of 17
Download to read offline
김 성 철
Contents
1. Introduction
2. Generalized Intersection over Union
3. Experimental Results
4. Conclusion
2
Introduction
• Loss function은 𝐿1, 𝐿2-norms / Metric은 𝐼𝑜𝑈 사용
• 𝑳 𝒏-norms는 높여야 할 𝑰𝒐𝑼와 strong correlation이 없음!
→ 𝑰𝒐𝑼와 직접적인 관계가 있는 loss function을 제시
3
Introduction
• 두 물체가 겹치지 않는다면
→ 𝑰𝒐𝑼 = 0
→ 두 물체가 얼마나 떨어져 있는지 모름!
→ 𝑰𝒐𝑼의 약점을 보완하는 loss function 제시
1. 𝑰𝒐𝑼와 같은 definition
2. scale invariant property 유지
3. 겹치는 물체간 strong correlation
4
Generalized Intersection over Union
• Intersection over Union (𝑰𝒐𝑼)
• Loss (ℒ 𝐼𝑜𝑈 = 1 − 𝐼𝑜𝑈)
• Scale invariant (independent from the scale)
• If 𝑨⋂𝑩 = 𝟎, 𝑰𝒐𝑼 𝑨, 𝑩 = 𝟎
→ 두 물체 간 거리가 반영되지 않음!
5
Generalized Intersection over Union
• Generalized Intersection over Union (𝑮𝑰𝒐𝑼)
For two arbitrary convex shapes (volumes) 𝑨, 𝑩 ⊆ 𝕊 ∈ ℝ 𝒏
1. Find the smallest convex shapes 𝑪 ⊆ 𝕊 ∈ ℝ 𝒏
enclosing both 𝑨, 𝑩
2. Calculate a ratio between the volume (area) occupied by 𝑪 excluding 𝑨, 𝑩
3. Divide by the total volume (area) occupied by 𝑪
4. Subtract this ratio from the 𝑰𝒐𝑼 value
6
Generalized Intersection over Union
• Generalized Intersection over Union (𝑮𝑰𝒐𝑼)
• Loss ℒ 𝐺𝐼𝑜𝑈 = 1 − 𝐺𝐼𝑜𝑈
• Scale invariant
• ∀𝐴, 𝐵 ⊆ 𝕊, 𝐺𝐼𝑜𝑈 𝐴, 𝐵 ≤ 𝐼𝑜𝑈(𝐴, 𝐵) → lim
𝐴→𝐵
𝐺𝐼𝑜𝑈 𝐴, 𝐵 = 𝐼𝑜𝑈(𝐴, 𝐵)
• ∀𝐴, 𝐵 ⊆ 𝕊, −1 ≤ 𝐺𝐼𝑜𝑈 𝐴, 𝐵 ≤ 1
→ if A⋃𝐵 = A⋂𝐵 , 𝐺𝐼𝑜𝑈 = 𝐼𝑜𝑈 = 1
→ lim
A⋃𝐵
|𝐶|
→0
𝐺𝐼𝑜𝑈 𝐴, 𝐵 = −1
7
Generalized Intersection over Union
• 𝑮𝑰𝒐𝑼 as Loss for Bounding Box Regression
• 𝐼𝑜𝑈 or 𝐺𝐼𝑜𝑈 can be directly used as a loss
→ 두 물체가 겹치지 않으면, 𝑰𝒐𝑼 has zero gradient
8
𝑰𝒐𝑼
Maximize this area 𝑮𝑰𝒐𝑼
Minimize this area
Generalized Intersection over Union
• 𝑮𝑰𝒐𝑼 as Loss for Bounding Box Regression
• Strong correlation with 𝑰𝒐𝑼
i.e. 𝐼𝑜𝑈 ≤ 0.2 and 𝐺𝐼𝑜𝑈 ≤ 0.2
→ 끝에서는 𝐺𝐼𝑜𝑈가 더 가파른 gradient를 가짐
9
Generalized Intersection over Union
• 𝑮𝑰𝒐𝑼 as Loss for Bounding Box Regression
• Loss Stability
Always 𝑩 𝒄
≥ 𝑩 𝒈
→
𝑨 𝒄−𝓤
𝑨 𝒄 > 𝟎 (𝑨 𝒄
≥ 𝑨 𝒈
, ∀𝑩 𝒑
∈ ℝ 𝟒
and 𝑨 𝒈
≥ 𝟎)
𝑨 𝒄 ≥ 𝓤, ∀𝑩 𝒑 ∈ ℝ 𝟒 → 𝟎 ≤ 𝓛 𝑮𝑰𝒐𝑼 ≤ 𝟐, ∀𝑩 𝒑 ∈ ℝ 𝟒
• ℒ 𝐺𝐼𝑜𝑈 behavior when 𝐼𝑜𝑈 = 0
𝓛 𝑮𝑰𝒐𝑼 = 𝟏 − 𝑮𝑰𝒐𝑼 = 𝟏 +
𝑨 𝒄−𝓤
𝑨 𝒄 − 𝑰𝒐𝑼 = 𝟐 −
𝓤
𝑨 𝒄
To minimize 𝓛 𝑮𝑰𝒐𝑼, maximize
𝓤
𝑨 𝒄 → 𝟎 ≤
𝓤
𝑨 𝒄 ≤ 𝟏 → minimize 𝑨 𝒄 or maximize 𝓤 (𝑨 𝒑)
10
Experimental Results
11
• YOLO v3
Experimental Results
• Faster R-CNN
12
Experimental Results
• Mask R-CNN
13
Experimental Results
14
15
Conclusions
• 𝑰𝒐𝑼의 약점을 보안한 𝑮𝑰𝒐𝑼 제안
• 𝑮𝑰𝒐𝑼를 bounding box regression loss로써 사용
• 회전 도형과 3D object detection에도 적용 예정
https://github.com/generalized-iou
16
감 사 합 니 다
17

More Related Content

What's hot

PR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental ImprovementPR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental ImprovementJinwon Lee
 
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object DetectionTaegyun Jeon
 
Feed forward ,back propagation,gradient descent
Feed forward ,back propagation,gradient descentFeed forward ,back propagation,gradient descent
Feed forward ,back propagation,gradient descentMuhammad Rasel
 
PR-217: EfficientDet: Scalable and Efficient Object Detection
PR-217: EfficientDet: Scalable and Efficient Object DetectionPR-217: EfficientDet: Scalable and Efficient Object Detection
PR-217: EfficientDet: Scalable and Efficient Object DetectionJinwon Lee
 
Single Image Super Resolution Overview
Single Image Super Resolution OverviewSingle Image Super Resolution Overview
Single Image Super Resolution OverviewLEE HOSEONG
 
Activation function
Activation functionActivation function
Activation functionAstha Jain
 
CONVOLUTIONAL NEURAL NETWORK
CONVOLUTIONAL NEURAL NETWORKCONVOLUTIONAL NEURAL NETWORK
CONVOLUTIONAL NEURAL NETWORKMd Rajib Bhuiyan
 
Object detection with deep learning
Object detection with deep learningObject detection with deep learning
Object detection with deep learningSushant Shrivastava
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network Yan Xu
 
Artificial Neural Networks Lect5: Multi-Layer Perceptron & Backpropagation
Artificial Neural Networks Lect5: Multi-Layer Perceptron & BackpropagationArtificial Neural Networks Lect5: Multi-Layer Perceptron & Backpropagation
Artificial Neural Networks Lect5: Multi-Layer Perceptron & BackpropagationMohammed Bennamoun
 
Multi-agent Inverse reinforcement learning: 相互作用する行動主体の報酬推定
Multi-agent Inverse reinforcement learning: 相互作用する行動主体の報酬推定Multi-agent Inverse reinforcement learning: 相互作用する行動主体の報酬推定
Multi-agent Inverse reinforcement learning: 相互作用する行動主体の報酬推定Keiichi Namikoshi
 
MIRU2016 チュートリアル
MIRU2016 チュートリアルMIRU2016 チュートリアル
MIRU2016 チュートリアルShunsuke Ono
 
Hyperparameter Tuning
Hyperparameter TuningHyperparameter Tuning
Hyperparameter TuningJon Lederman
 
Autoencoders
AutoencodersAutoencoders
AutoencodersCloudxLab
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural NetworksAshray Bhandare
 
Artificial Neural Networks Lect7: Neural networks based on competition
Artificial Neural Networks Lect7: Neural networks based on competitionArtificial Neural Networks Lect7: Neural networks based on competition
Artificial Neural Networks Lect7: Neural networks based on competitionMohammed Bennamoun
 

What's hot (20)

PR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental ImprovementPR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental Improvement
 
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
 
YOLO
YOLOYOLO
YOLO
 
Feed forward ,back propagation,gradient descent
Feed forward ,back propagation,gradient descentFeed forward ,back propagation,gradient descent
Feed forward ,back propagation,gradient descent
 
PR-217: EfficientDet: Scalable and Efficient Object Detection
PR-217: EfficientDet: Scalable and Efficient Object DetectionPR-217: EfficientDet: Scalable and Efficient Object Detection
PR-217: EfficientDet: Scalable and Efficient Object Detection
 
Single Image Super Resolution Overview
Single Image Super Resolution OverviewSingle Image Super Resolution Overview
Single Image Super Resolution Overview
 
Activation function
Activation functionActivation function
Activation function
 
CONVOLUTIONAL NEURAL NETWORK
CONVOLUTIONAL NEURAL NETWORKCONVOLUTIONAL NEURAL NETWORK
CONVOLUTIONAL NEURAL NETWORK
 
Object detection with deep learning
Object detection with deep learningObject detection with deep learning
Object detection with deep learning
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network
 
Gradient descent method
Gradient descent methodGradient descent method
Gradient descent method
 
Artificial Neural Networks Lect5: Multi-Layer Perceptron & Backpropagation
Artificial Neural Networks Lect5: Multi-Layer Perceptron & BackpropagationArtificial Neural Networks Lect5: Multi-Layer Perceptron & Backpropagation
Artificial Neural Networks Lect5: Multi-Layer Perceptron & Backpropagation
 
Autoencoder
AutoencoderAutoencoder
Autoencoder
 
Multi-agent Inverse reinforcement learning: 相互作用する行動主体の報酬推定
Multi-agent Inverse reinforcement learning: 相互作用する行動主体の報酬推定Multi-agent Inverse reinforcement learning: 相互作用する行動主体の報酬推定
Multi-agent Inverse reinforcement learning: 相互作用する行動主体の報酬推定
 
MIRU2016 チュートリアル
MIRU2016 チュートリアルMIRU2016 チュートリアル
MIRU2016 チュートリアル
 
Hyperparameter Tuning
Hyperparameter TuningHyperparameter Tuning
Hyperparameter Tuning
 
Autoencoders
AutoencodersAutoencoders
Autoencoders
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networks
 
Artificial Neural Networks Lect7: Neural networks based on competition
Artificial Neural Networks Lect7: Neural networks based on competitionArtificial Neural Networks Lect7: Neural networks based on competition
Artificial Neural Networks Lect7: Neural networks based on competition
 
Yolo
YoloYolo
Yolo
 

Similar to Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

Proving Decidability of Intuitionistic Propositional Calculus on Coq
Proving Decidability of Intuitionistic Propositional Calculus on CoqProving Decidability of Intuitionistic Propositional Calculus on Coq
Proving Decidability of Intuitionistic Propositional Calculus on CoqMasaki Hara
 
Hilbert space representation of the minimal length uncertainty relation
Hilbert space representation of the minimal length uncertainty relationHilbert space representation of the minimal length uncertainty relation
Hilbert space representation of the minimal length uncertainty relationMilad Hajebrahimi
 
Regularisation & Auxiliary Information in OOD Detection
Regularisation & Auxiliary Information in OOD DetectionRegularisation & Auxiliary Information in OOD Detection
Regularisation & Auxiliary Information in OOD Detectionkirk68
 
Page rank - from theory to application
Page rank - from theory to applicationPage rank - from theory to application
Page rank - from theory to applicationGAYO3
 
3). work & energy (finished)
3). work & energy (finished)3). work & energy (finished)
3). work & energy (finished)PhysicsLover
 
NIPS KANSAI Reading Group #5: State Aware Imitation Learning
NIPS KANSAI Reading Group #5: State Aware Imitation LearningNIPS KANSAI Reading Group #5: State Aware Imitation Learning
NIPS KANSAI Reading Group #5: State Aware Imitation LearningEiji Uchibe
 
Conformal Boundary conditions
Conformal Boundary conditionsConformal Boundary conditions
Conformal Boundary conditionsHassaan Saleem
 
Quadratic form and functional optimization
Quadratic form and functional optimizationQuadratic form and functional optimization
Quadratic form and functional optimizationJunpei Tsuji
 
積分 (人間科学のための基礎数学)
積分 (人間科学のための基礎数学)積分 (人間科学のための基礎数学)
積分 (人間科学のための基礎数学)Masahiro Okano
 
Review of Seiberg Witten duality.pptx
Review of Seiberg Witten duality.pptxReview of Seiberg Witten duality.pptx
Review of Seiberg Witten duality.pptxHassaan Saleem
 
Optimum Engineering Design - Day 2b. Classical Optimization methods
Optimum Engineering Design - Day 2b. Classical Optimization methodsOptimum Engineering Design - Day 2b. Classical Optimization methods
Optimum Engineering Design - Day 2b. Classical Optimization methodsSantiagoGarridoBulln
 
Section 11: Normal Subgroups
Section 11: Normal SubgroupsSection 11: Normal Subgroups
Section 11: Normal SubgroupsKevin Johnson
 
PR 113: The Perception Distortion Tradeoff
PR 113: The Perception Distortion TradeoffPR 113: The Perception Distortion Tradeoff
PR 113: The Perception Distortion TradeoffTaeoh Kim
 
DL_lecture3_regularization_I.pdf
DL_lecture3_regularization_I.pdfDL_lecture3_regularization_I.pdf
DL_lecture3_regularization_I.pdfsagayalavanya2
 
Bounded arithmetic in free logic
Bounded arithmetic in free logicBounded arithmetic in free logic
Bounded arithmetic in free logicYamagata Yoriyuki
 

Similar to Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression (20)

Presentation
PresentationPresentation
Presentation
 
GR analysis techniques
GR analysis techniquesGR analysis techniques
GR analysis techniques
 
Proving Decidability of Intuitionistic Propositional Calculus on Coq
Proving Decidability of Intuitionistic Propositional Calculus on CoqProving Decidability of Intuitionistic Propositional Calculus on Coq
Proving Decidability of Intuitionistic Propositional Calculus on Coq
 
Hilbert space representation of the minimal length uncertainty relation
Hilbert space representation of the minimal length uncertainty relationHilbert space representation of the minimal length uncertainty relation
Hilbert space representation of the minimal length uncertainty relation
 
Basic calculus (i)
Basic calculus (i)Basic calculus (i)
Basic calculus (i)
 
Regularisation & Auxiliary Information in OOD Detection
Regularisation & Auxiliary Information in OOD DetectionRegularisation & Auxiliary Information in OOD Detection
Regularisation & Auxiliary Information in OOD Detection
 
Page rank - from theory to application
Page rank - from theory to applicationPage rank - from theory to application
Page rank - from theory to application
 
3). work & energy (finished)
3). work & energy (finished)3). work & energy (finished)
3). work & energy (finished)
 
NIPS KANSAI Reading Group #5: State Aware Imitation Learning
NIPS KANSAI Reading Group #5: State Aware Imitation LearningNIPS KANSAI Reading Group #5: State Aware Imitation Learning
NIPS KANSAI Reading Group #5: State Aware Imitation Learning
 
Conformal Boundary conditions
Conformal Boundary conditionsConformal Boundary conditions
Conformal Boundary conditions
 
Quadratic form and functional optimization
Quadratic form and functional optimizationQuadratic form and functional optimization
Quadratic form and functional optimization
 
積分 (人間科学のための基礎数学)
積分 (人間科学のための基礎数学)積分 (人間科学のための基礎数学)
積分 (人間科学のための基礎数学)
 
Basic calculus (ii) recap
Basic calculus (ii) recapBasic calculus (ii) recap
Basic calculus (ii) recap
 
Review of Seiberg Witten duality.pptx
Review of Seiberg Witten duality.pptxReview of Seiberg Witten duality.pptx
Review of Seiberg Witten duality.pptx
 
Optimum Engineering Design - Day 2b. Classical Optimization methods
Optimum Engineering Design - Day 2b. Classical Optimization methodsOptimum Engineering Design - Day 2b. Classical Optimization methods
Optimum Engineering Design - Day 2b. Classical Optimization methods
 
Lecture_10_SVD.pptx
Lecture_10_SVD.pptxLecture_10_SVD.pptx
Lecture_10_SVD.pptx
 
Section 11: Normal Subgroups
Section 11: Normal SubgroupsSection 11: Normal Subgroups
Section 11: Normal Subgroups
 
PR 113: The Perception Distortion Tradeoff
PR 113: The Perception Distortion TradeoffPR 113: The Perception Distortion Tradeoff
PR 113: The Perception Distortion Tradeoff
 
DL_lecture3_regularization_I.pdf
DL_lecture3_regularization_I.pdfDL_lecture3_regularization_I.pdf
DL_lecture3_regularization_I.pdf
 
Bounded arithmetic in free logic
Bounded arithmetic in free logicBounded arithmetic in free logic
Bounded arithmetic in free logic
 

More from Sungchul Kim

PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionPR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionSungchul Kim
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksSungchul Kim
 
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersEmerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersSungchul Kim
 
PR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningPR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningSungchul Kim
 
Score based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsScore based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsSungchul Kim
 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningSungchul Kim
 
Revisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorRevisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorSungchul Kim
 
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Sungchul Kim
 
Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Sungchul Kim
 
Going Deeper with Convolutions
Going Deeper with ConvolutionsGoing Deeper with Convolutions
Going Deeper with ConvolutionsSungchul Kim
 
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationGrad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationSungchul Kim
 
Panoptic Segmentation
Panoptic SegmentationPanoptic Segmentation
Panoptic SegmentationSungchul Kim
 
On the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondOn the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondSungchul Kim
 
A Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksA Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksSungchul Kim
 
KDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksKDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksSungchul Kim
 
Designing Network Design Spaces
Designing Network Design SpacesDesigning Network Design Spaces
Designing Network Design SpacesSungchul Kim
 
Search to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSearch to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSungchul Kim
 
Supervised Constrastive Learning
Supervised Constrastive LearningSupervised Constrastive Learning
Supervised Constrastive LearningSungchul Kim
 
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningBootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningSungchul Kim
 
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...Sungchul Kim
 

More from Sungchul Kim (20)

PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionPR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural Networks
 
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersEmerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
 
PR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningPR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation Learning
 
Score based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsScore based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential Equations
 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation Learning
 
Revisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorRevisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object Detector
 
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
 
Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+
 
Going Deeper with Convolutions
Going Deeper with ConvolutionsGoing Deeper with Convolutions
Going Deeper with Convolutions
 
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationGrad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
 
Panoptic Segmentation
Panoptic SegmentationPanoptic Segmentation
Panoptic Segmentation
 
On the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondOn the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and Beyond
 
A Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksA Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural Networks
 
KDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksKDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial Networks
 
Designing Network Design Spaces
Designing Network Design SpacesDesigning Network Design Spaces
Designing Network Design Spaces
 
Search to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSearch to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the Eyes
 
Supervised Constrastive Learning
Supervised Constrastive LearningSupervised Constrastive Learning
Supervised Constrastive Learning
 
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningBootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
 
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
 

Recently uploaded

Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 

Recently uploaded (20)

Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

  • 2. Contents 1. Introduction 2. Generalized Intersection over Union 3. Experimental Results 4. Conclusion 2
  • 3. Introduction • Loss function은 𝐿1, 𝐿2-norms / Metric은 𝐼𝑜𝑈 사용 • 𝑳 𝒏-norms는 높여야 할 𝑰𝒐𝑼와 strong correlation이 없음! → 𝑰𝒐𝑼와 직접적인 관계가 있는 loss function을 제시 3
  • 4. Introduction • 두 물체가 겹치지 않는다면 → 𝑰𝒐𝑼 = 0 → 두 물체가 얼마나 떨어져 있는지 모름! → 𝑰𝒐𝑼의 약점을 보완하는 loss function 제시 1. 𝑰𝒐𝑼와 같은 definition 2. scale invariant property 유지 3. 겹치는 물체간 strong correlation 4
  • 5. Generalized Intersection over Union • Intersection over Union (𝑰𝒐𝑼) • Loss (ℒ 𝐼𝑜𝑈 = 1 − 𝐼𝑜𝑈) • Scale invariant (independent from the scale) • If 𝑨⋂𝑩 = 𝟎, 𝑰𝒐𝑼 𝑨, 𝑩 = 𝟎 → 두 물체 간 거리가 반영되지 않음! 5
  • 6. Generalized Intersection over Union • Generalized Intersection over Union (𝑮𝑰𝒐𝑼) For two arbitrary convex shapes (volumes) 𝑨, 𝑩 ⊆ 𝕊 ∈ ℝ 𝒏 1. Find the smallest convex shapes 𝑪 ⊆ 𝕊 ∈ ℝ 𝒏 enclosing both 𝑨, 𝑩 2. Calculate a ratio between the volume (area) occupied by 𝑪 excluding 𝑨, 𝑩 3. Divide by the total volume (area) occupied by 𝑪 4. Subtract this ratio from the 𝑰𝒐𝑼 value 6
  • 7. Generalized Intersection over Union • Generalized Intersection over Union (𝑮𝑰𝒐𝑼) • Loss ℒ 𝐺𝐼𝑜𝑈 = 1 − 𝐺𝐼𝑜𝑈 • Scale invariant • ∀𝐴, 𝐵 ⊆ 𝕊, 𝐺𝐼𝑜𝑈 𝐴, 𝐵 ≤ 𝐼𝑜𝑈(𝐴, 𝐵) → lim 𝐴→𝐵 𝐺𝐼𝑜𝑈 𝐴, 𝐵 = 𝐼𝑜𝑈(𝐴, 𝐵) • ∀𝐴, 𝐵 ⊆ 𝕊, −1 ≤ 𝐺𝐼𝑜𝑈 𝐴, 𝐵 ≤ 1 → if A⋃𝐵 = A⋂𝐵 , 𝐺𝐼𝑜𝑈 = 𝐼𝑜𝑈 = 1 → lim A⋃𝐵 |𝐶| →0 𝐺𝐼𝑜𝑈 𝐴, 𝐵 = −1 7
  • 8. Generalized Intersection over Union • 𝑮𝑰𝒐𝑼 as Loss for Bounding Box Regression • 𝐼𝑜𝑈 or 𝐺𝐼𝑜𝑈 can be directly used as a loss → 두 물체가 겹치지 않으면, 𝑰𝒐𝑼 has zero gradient 8 𝑰𝒐𝑼 Maximize this area 𝑮𝑰𝒐𝑼 Minimize this area
  • 9. Generalized Intersection over Union • 𝑮𝑰𝒐𝑼 as Loss for Bounding Box Regression • Strong correlation with 𝑰𝒐𝑼 i.e. 𝐼𝑜𝑈 ≤ 0.2 and 𝐺𝐼𝑜𝑈 ≤ 0.2 → 끝에서는 𝐺𝐼𝑜𝑈가 더 가파른 gradient를 가짐 9
  • 10. Generalized Intersection over Union • 𝑮𝑰𝒐𝑼 as Loss for Bounding Box Regression • Loss Stability Always 𝑩 𝒄 ≥ 𝑩 𝒈 → 𝑨 𝒄−𝓤 𝑨 𝒄 > 𝟎 (𝑨 𝒄 ≥ 𝑨 𝒈 , ∀𝑩 𝒑 ∈ ℝ 𝟒 and 𝑨 𝒈 ≥ 𝟎) 𝑨 𝒄 ≥ 𝓤, ∀𝑩 𝒑 ∈ ℝ 𝟒 → 𝟎 ≤ 𝓛 𝑮𝑰𝒐𝑼 ≤ 𝟐, ∀𝑩 𝒑 ∈ ℝ 𝟒 • ℒ 𝐺𝐼𝑜𝑈 behavior when 𝐼𝑜𝑈 = 0 𝓛 𝑮𝑰𝒐𝑼 = 𝟏 − 𝑮𝑰𝒐𝑼 = 𝟏 + 𝑨 𝒄−𝓤 𝑨 𝒄 − 𝑰𝒐𝑼 = 𝟐 − 𝓤 𝑨 𝒄 To minimize 𝓛 𝑮𝑰𝒐𝑼, maximize 𝓤 𝑨 𝒄 → 𝟎 ≤ 𝓤 𝑨 𝒄 ≤ 𝟏 → minimize 𝑨 𝒄 or maximize 𝓤 (𝑨 𝒑) 10
  • 15. 15
  • 16. Conclusions • 𝑰𝒐𝑼의 약점을 보안한 𝑮𝑰𝒐𝑼 제안 • 𝑮𝑰𝒐𝑼를 bounding box regression loss로써 사용 • 회전 도형과 3D object detection에도 적용 예정 https://github.com/generalized-iou 16
  • 17. 감 사 합 니 다 17