Making Convolutional Networks Shift-Invariant Again
Hyeongmin Lee
Image and Video Pattern Recognition LAB
Electrical and Electronic Engineering Dept, Yonsei University
4th Semester
2019.9.21
What is shift invariance?
◆ Shift-variant?
Does shifting the input change the network's performance?
Aliasing
◆ Fourier Transform & Frequency Domain
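[Figure: a signal x(t) in the time domain (t) and its spectrum X(F) in the frequency domain (F)]
$$X(F) = \int_{-\infty}^{\infty} x(t)\, e^{-j 2\pi F t}\, dt$$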
◆ Down-sampling in frequency domain
[Figure: two rows of plots, each showing a signal in the time domain (t) and in the frequency domain (F)]
◆ Discrete Signal in Frequency Domain
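[Figure: a discrete signal (sample index n) and its spectrum (frequency f), with −2π and 2π marked on the frequency axis]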
◆ Discrete Down-Sampling in Frequency Domain
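[Figure: two rows of plots, each showing a discrete signal (sample index n) and its spectrum (frequency f, with −2π and 2π marked); two regions of a spectrum are labeled "Aliasing"]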
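The effect of down-sampling without a low-pass filter can be checked numerically. The sketch below is only an illustration, not something taken from the slides: the signal length (64), the test frequency (0.375 cycles per sample), and the helper name `dominant_frequency` are all assumptions chosen for this example. After keeping every other sample, the sinusoid would need 0.75 cycles per sample, which exceeds the 0.5 limit, so it reappears at 0.25.

```python
import numpy as np

def dominant_frequency(x):
    """Return the strongest non-DC frequency of x in cycles per sample."""
    spectrum = np.abs(np.fft.rfft(x))
    spectrum[0] = 0.0  # ignore the DC component
    return np.argmax(spectrum) / len(x)

n = np.arange(64)
# Assumed test signal: 0.375 cycles/sample, below the 0.5 limit of the original grid.
x = np.cos(2 * np.pi * 0.375 * n)

# Naive down-sampling by 2: keep every other sample, no low-pass filtering.
y = x[::2]

f_in = dominant_frequency(x)   # frequency on the original grid
f_out = dominant_frequency(y)  # frequency on the down-sampled grid

print(f_in)   # 0.375
print(f_out)  # 0.25, not 0.75: the component has aliased
```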
Aliasing in CNNs
◆ Max pooling
◆ Average pooling
◆ Strided convolution
Input:         0 0 1 1 0 0 1 1 ➔ Max-pooling ➔ 0 1 0 1
Shifted input: 0 1 1 0 0 1 1 0 ➔ Max-pooling ➔ 1 1 1 1
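The max-pooling example on this slide (input 0 0 1 1 0 0 1 1 and the same signal shifted by one sample) can be reproduced in a few lines. This is a minimal sketch: the window size 2 and stride 2 are inferred from the 8-to-4 reduction in the example, and `max_pool_1d` is a hypothetical helper written for this illustration.

```python
import numpy as np

def max_pool_1d(x, k=2, stride=2):
    """Max over windows of k samples, taken every `stride` samples (no padding)."""
    x = np.asarray(x)
    return np.array([x[i:i + k].max() for i in range(0, len(x) - k + 1, stride)])

x = np.array([0, 0, 1, 1, 0, 0, 1, 1])
x_shifted = np.roll(x, -1)  # circular shift by one sample: 0 1 1 0 0 1 1 0

pooled = max_pool_1d(x)
pooled_shifted = max_pool_1d(x_shifted)

print(pooled.tolist())          # [0, 1, 0, 1]
print(pooled_shifted.tolist())  # [1, 1, 1, 1]
```

A one-sample shift of the input changes the pooled output entirely, rather than just shifting it.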
Anti-aliasing
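◆ Shift Invariance & Shift Equivariance
• Shift Equivariance: shifting the input shifts the output by the same amount, F(Shift(X)) = Shift(F(X))
• Shift Invariance: shifting the input leaves the output unchanged, F(Shift(X)) = F(X)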
Shift-Equivariant ➔ Shift-Invariant
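The two properties can be checked numerically on a toy 1-D example. The sketch below makes several assumptions that are not in the slides: circular (wrap-around) shifts, an arbitrary 3-tap kernel, and the hypothetical helpers `circular_conv` and `global_average`. It shows a stride-1 convolution behaving shift-equivariantly, and a global average on top of it giving a shift-invariant result.

```python
import numpy as np

def circular_conv(x, kernel):
    """Stride-1 cross-correlation with wrap-around padding (assumed for this sketch)."""
    return sum(w * np.roll(x, -i) for i, w in enumerate(kernel))

def global_average(x):
    """Collapse the whole feature map into one number."""
    return x.mean()

rng = np.random.default_rng(0)
x = rng.standard_normal(16)
kernel = [0.2, -0.5, 1.0]  # arbitrary illustrative weights
shift = 3
x_shifted = np.roll(x, shift)

features = circular_conv(x, kernel)
features_shifted = circular_conv(x_shifted, kernel)

# Equivariance: F(Shift(X)) == Shift(F(X))
equivariant = np.allclose(features_shifted, np.roll(features, shift))
# Invariance after global pooling: G(F(Shift(X))) == G(F(X))
invariant = np.isclose(global_average(features_shifted), global_average(features))

print(equivariant, invariant)  # True True
```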
◆ Anti-aliasing
[Figure: spectra (frequency f, with −2π and 2π marked) at three stages: original signal ➔ Low-Pass Filtering (Blurring) ➔ Sampling]
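The benefit of blurring before sampling can be measured on a synthetic signal. In this sketch the [1, 2, 1] kernel, the two test frequencies, circular padding, and the helper names `blur` and `amplitude` are all assumptions made for illustration; the slides do not specify a filter here. The high-frequency component aliases to 0.25 cycles per sample after down-sampling by 2, and the blur reduces how much of it survives.

```python
import numpy as np

def blur(x, kernel):
    """Circular convolution with a small symmetric, normalized low-pass kernel."""
    kernel = np.asarray(kernel, dtype=float)
    kernel = kernel / kernel.sum()
    half = len(kernel) // 2
    return sum(w * np.roll(x, half - i) for i, w in enumerate(kernel))

def amplitude(x, cycles_per_sample):
    """Amplitude of the sinusoidal component of x at an exact FFT-bin frequency."""
    k = int(round(cycles_per_sample * len(x)))
    return 2 * np.abs(np.fft.rfft(x)[k]) / len(x)

n = np.arange(64)
low = np.cos(2 * np.pi * 0.0625 * n)   # survives down-sampling by 2
high = np.cos(2 * np.pi * 0.375 * n)   # aliases to 0.25 after down-sampling by 2
x = low + high

naive = x[::2]                          # sampling only
antialiased = blur(x, [1, 2, 1])[::2]   # low-pass filtering, then sampling

alias_naive = amplitude(naive, 0.25)          # full-strength aliased component
alias_blurred = amplitude(antialiased, 0.25)  # attenuated aliased component
kept_low = amplitude(antialiased, 0.125)      # low-frequency content, mostly kept

print(round(alias_naive, 3), round(alias_blurred, 3), round(kept_low, 3))
```

With this particular kernel the aliased component is attenuated but not removed; a wider low-pass filter would suppress it further at the cost of more blurring.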
◆ Anti-aliasing for max pooling
Input:         0 0 1 1 0 0 1 1 ➔ Max ➔ 0 1 1 1 0 1 1 1 ➔ Sampling ➔ 0 1 0 1
Shifted input: 0 1 1 0 0 1 1 0 ➔ Max ➔ 1 1 1 0 1 1 1 0 ➔ Sampling ➔ 1 1 1 1
Max (stride 1) is Shift-Equivariant!!
Max output:           0 1 1 1 0 1 1 1 ➔ Blurring ➔ 0.5 1 1 0.5 0.5 1 1 0.5 ➔ Sampling ➔ 0.5 1 0.5 1
Max output (shifted): 1 1 1 0 1 1 1 0 ➔ Blurring ➔ 1 1 0.5 0.5 1 1 0.5 0.5 ➔ Sampling ➔ 1 0.5 1 0.5
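The Max ➔ Blurring ➔ Sampling pipeline can be sketched as follows. The numbers on this slide are consistent with a 2-tap box filter [1, 1]/2 and wrap-around padding, so the sketch assumes both; the slide itself does not state the filter, and the helper names `dense_max`, `box_blur`, and `max_blur_pool` are hypothetical.

```python
import numpy as np

def dense_max(x, k=2):
    """Stride-1 max over k neighboring samples, with wrap-around padding."""
    return np.max([np.roll(x, -i) for i in range(k)], axis=0)

def box_blur(x, k=2):
    """Stride-1 average over k neighboring samples (assumed 2-tap box filter)."""
    return np.mean([np.roll(x, -i) for i in range(k)], axis=0)

def max_blur_pool(x, stride=2):
    """Dense max, then blur, then keep every `stride`-th sample."""
    return box_blur(dense_max(x))[::stride]

x = np.array([0, 0, 1, 1, 0, 0, 1, 1], dtype=float)
x_shifted = np.roll(x, -1)  # 0 1 1 0 0 1 1 0

out = max_blur_pool(x)
out_shifted = max_blur_pool(x_shifted)

print(dense_max(x).tolist())  # [0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0]
print(out.tolist())           # [0.5, 1.0, 0.5, 1.0]
print(out_shifted.tolist())   # [1.0, 0.5, 1.0, 0.5]
```

Unlike plain max pooling, which maps the two inputs to 0 1 0 1 and 1 1 1 1, the blurred outputs contain the same values and differ only by a shift.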
Results
◆ Improvement in Consistency
◆ Improvement in Accuracy
◆ Improvement in Image Translation
Thank You!

[PR12] Making Convolutional Networks Shift-Invariant Again