DeepLabv2 deeplabv2 machine learning description

DeepLab: Semantic Image
Segmentation with Deep
Convolutional Nets, Atrous
Convolution, and Fully
Connected CRFs
(A.K.A. DeepLabv2)
J. Miguel Valverde
@jmlipman
14/02/2021
Semantic Segmentation
Source: https://pixabay.com/photos/girl-kid-coloring-colors-art-2586719/

J. Miguel Valverde 2
The DeepLab family
DeepLabv1
DeepLabv2
DeepLabv3
DeepLabv3+
...

DeepLabv2 tackles three problems
1) Problem: Reduced feature resolution
- Maxpooling
Causes:
Source: computersciencewiki.org
Source: https://github.com/vdumoulin/conv_arithmetic
- Conv, strides = 2

- Maxpooling
Causes:
Source: computersciencewiki.org
Source: https://github.com/vdumoulin/conv_arithmetic
- Conv, strides = 2

Atrous conv. Regular conv.

2) Problem: Existence of multiple-scale objects
Same class
Different size (FoV)

3) Reduced accuracy (in borders)

3) Reduced accuracy (in borders)
Downsampling, maxpooling, useful
to achieve invariance in
classification
In Segmentation we want to preserve spatial information
Cause:

DeepLabv2 proposes three methods
1) Atrous convolution (or dilated convolution)
2) Atrous Spatial Pyramid Pooling (ASPP)
3) Conditional Random Fields (CRFs)
Backbone
ASPP
Upsample x8
CRF
Pipeline

Dilated Regular
Filter = 3x3 → Increases the field of view.

(Field of view, reminder)
Operation Field of view (size)
Conv 3x3 3

(Field of view, reminder)
Operation Field of view (size)
Conv 3x3 3
Conv 3x3 3 + 2 = 5
... ...

Dilated Regular
Filter = 3x3
● Same # of params
● Same amount of computation
● Adjust FoV with `rate`

FoV: 1 3 3
5
Inceptionv3
Conv
Conv
Conv
Conv
Conv Conv Conv
Conv
Conv
...

Source: http://www.jonathanfischer.net/lets-build-gameplaykit-grid-pathfinding/
Short-range / local CRFs Fully-connected CRFs
Image → Graph

CNN

p = coordinates
I = RGB intensity values

Always positive

Always positive
Always negative

Small distance, similar intensities

Small negative values → large penalty

Small negative values → large penalty
But such penalty → only if labels are different!

Large distance, different intensities

Large distance, different intensities
Large negative values → very small penalty

Experiments
Learning rate policy
vs. decreasing with a fixed step size

Experiments
Learning rate policy
Different FoV in the ASPP
Larger → Better
vs. decreasing with a fixed step size

Experiments
Different architectures
ResNet-101 > VGG-16
CRFs
With > Without

The code!

The original ideas were old
1989
2001
2014
DeepLabv2 paper: 2017

Do you want to learn more about CRFs?
“Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data”. J. Lafferty et al. (2001)
Original work
“An Introduction to Conditional Random Fields”. C. Sutton and A. McCallum (2011)
https://homepages.inf.ed.ac.uk/csutton/publications/crftut-fnt.pdf
Great introduction
Coursera course

DeepLabv2 deeplabv2 machine learning description

More Related Content

Similar to DeepLabv2 deeplabv2 machine learning description

Recently uploaded

DeepLabv2 deeplabv2 machine learning description