Review EDSR

•Download as PPTX, PDF•

1 like•810 views

Woojin Jeong

Enhanced Deep Residual networks for Single Image Super-Resolution

Software

Enhanced Deep Residual Network for
Single Image Super-Resolution
NTIRE 2017 1st Place Award
(Challenge on Single Image Super-Resolution)
서울대 이경무 교수팀
arXiv 버전
발표자 : 정우진
한양대학교 컴퓨터 비전 및 패턴 인식 연구실

/ 20
• NTIRE 2017
– DIV2K
• Introduction
• Proposed Method
• 실험 및 분석
2
Contents

/ 20
NTIRE : New Trends in Image Restoration and Enhancement
workshop
• 기간
– 2017년 7월 21일
• 분야
– Papers addressing topics related to image restoration and
enhancement are invited. The topics include, but are not limited to:
• 2017년에는 초해상도 복원 경진대회(NTIRE challenge on
example-based single image super-resolution)를 진행
– 1위 : 서울대 이경무 교수팀
– 2위 : 중국팀
– 3위 : 카이스트 예종철 교수팀
– 4위 : 카이스트 김문철 교수팀
– 5위 :중국팀
3
NTIRE 2017

/ 20
• NTIRE 2017 초해상도 복원 경진 대회를 위해 준비한 데이터 셋
– 2040x1356 크기 HR
– 블러 없음
– Track 1 : Matlab bicubic 함수로 2배, 3배, 4배 축소한 LR
– Track 2 : 어떻게 저해상도가 되었는지 알 수 없는 LR, 2배, 3배, 4배 축
소
4
DIV2K

/ 20
• 기존 방법의 문제점
– 학습이 잘 안됨
– 주의 : 경진 대회 용 DNN이므로 매우
깊음. 그래서 학습이 잘 안됨
– 한번에 한가지 영상만 생성함
– 2배, 3배, 4배 용 DNN이 독립적으로
필요함
• 기여
– SRResNet을 기반으로
– 1. 필요없는 부분 제거
– 2. 3배, 4배 학습을 위해 2배 학습 결
과 사용함
– 3. 한번에 여러 배율 확대 가능
5
Introduction

/ 20
Baseline
• SRResNet에서 변화
• Batch normalization 층 제거
– BN이 초해상도를 방해
– 초해상도에서는 정규화가 악영향
– 대신 residual scaling을 사용
– 다음 슬라이드에서 설명…
6
Proposed Method

/ 20
※ Residual Scaling
• Google Inception-v4 에서 소
개
• 기존의 문제점
– 매우 깊은 레지듀얼 네트워크 훈
련 불가능(학습중 네트워크가
‘사망’, 0 값만 만들어 냄)
– BN으로 해결되지 않음
• 레지듀얼 스케일링
– Residual의 끝에 0.1~0.3 곱함
• 다른 해결책
– Training warm-up : 학습 초기
에 아주 작은 학습률로 학습
– 구글팀은 웜업으로도 불가능한
경우가 있다고 보고함
7

/ 20
EDSR
• Baseline 모델에서 깊이와 폭을 확장
MDSR
• 한번에 2배, 3배, 4배를 모두
처리할 수 잇는 구조
• 도입 부분과 재구성 부분이
각각 존재함
• 중간 부분은 공유
8
Proposed Method

/ 20
Geometric Self-ensemble
• 7개 입력 영상 추가 생성
– flip, rotation하여 7개 입력 영상 생성 + 원래 영상
• 결과 영상을 앙상블
• EDSR+, MDSR+ 로 표기
9
Proposed Method

/ 20
MDSR이 가능한 이유
• EDSR훈련중
• 3배, 4배 모델 학습을 위해
2배 학습 결과를 초기 가중치로
활용함
• 학습도 빠르고 결과도 우수함
• 따라서 SR은 배율에 상관없는
유사성이 있다고 판단
10
실험 및 분석

/ 20
모델 변화에 따른 성능 향상
• 발전과정
– SRResNet(L2 loss)
– SRResNet(L1 loss)
– Baseline
– EDSR, MDSR
– EDSR+, MDSR+
11
실험 및 분석

/ 20
NTIRE 2017 Track 1 : bicubic downscaling
13
실험 및 분석

/ 20
NTIRE 2017 Track 2 : unknown downscailing
14
실험 및 분석

What's hot

(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...MYEONGGYU LEE

Photo realistic single image super-resolution using a generative adversarial ...soul8085

PR-339: Maintaining discrimination and fairness in class incremental learningSunghoon Joo

PR-203: Class-Balanced Loss Based on Effective Number of SamplesSunghoon Joo

[신경망기초] 심층신경망개요jaypi Ko

Simple Review of Single Image Super Resolution TaskMYEONGGYU LEE

Survey on Monocular Depth Estimation범준 김

Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...Mad Scientists

[한국어] Neural Architecture Search with Reinforcement LearningKiho Suh

Designing more efficient convolution neural networkDongyi Kim

딥러닝 논문읽기 efficient netv2 논문리뷰taeseon ryu

Encoding in Style: a Style Encoder for Image-to-Image Translationtaeseon ryu

Detecting fake jpeg imagesNAVER Engineering

HistoryOfCNNTae Young Lee

Convolutional Neural NetworksSanghoon Yoon

PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...Sunghoon Joo

Deep learning seminar_snu_161031Jinwon Lee

"Learning transferable architectures for scalable image recognition" Paper Re...LEE HOSEONG

Dense sparse-dense training for dnn and Other ModelsDong Heon Cho

PR-218: MFAS: Multimodal Fusion Architecture SearchSunghoon Joo

What's hot (20)

(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...

Photo realistic single image super-resolution using a generative adversarial ...

PR-339: Maintaining discrimination and fairness in class incremental learning

PR-203: Class-Balanced Loss Based on Effective Number of Samples

[신경망기초] 심층신경망개요

Simple Review of Single Image Super Resolution Task

Survey on Monocular Depth Estimation

Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...

[한국어] Neural Architecture Search with Reinforcement Learning

Designing more efficient convolution neural network

딥러닝 논문읽기 efficient netv2 논문리뷰

Encoding in Style: a Style Encoder for Image-to-Image Translation

Detecting fake jpeg images

HistoryOfCNN

Convolutional Neural Networks

PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...

Deep learning seminar_snu_161031

"Learning transferable architectures for scalable image recognition" Paper Re...

Dense sparse-dense training for dnn and Other Models

PR-218: MFAS: Multimodal Fusion Architecture Search

Similar to Review EDSR

Deep learning super resolutionNAVER Engineering

Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...태엽 김

"simple does it weakly supervised instance and semantic segmentation" Paper r...LEE HOSEONG

Refinenet오 혜린

180212 normalization hyu_dakeDongGyun Hong

[Paper Review] Visualizing and understanding convolutional networksKorea, Sejong University.

Bag of Tricks for Image Classification with Convolutional Neural Networks (C...gohyunwoong

Learning how to explain neural networks: PatternNet and PatternAttributionGyubin Son

carrier of_tricks_for_image_classificationLEE HOSEONG

History of Vision AITae Young Lee

[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...Gyubin Son

PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...Sunghoon Joo

09_Bilateral filtering/Reprojection Cache 소개noerror

Deep Object Detectors #1 (~2016.6)Ildoo Kim

Similar to Review EDSR (14)

Deep learning super resolution

Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...

"simple does it weakly supervised instance and semantic segmentation" Paper r...

Refinenet

180212 normalization hyu_dake

[Paper Review] Visualizing and understanding convolutional networks

Bag of Tricks for Image Classification with Convolutional Neural Networks (C...

Learning how to explain neural networks: PatternNet and PatternAttribution

carrier of_tricks_for_image_classification

History of Vision AI

[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...

PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...

09_Bilateral filtering/Reprojection Cache 소개

Deep Object Detectors #1 (~2016.6)

Review EDSR

1. Enhanced Deep Residual Network for Single Image Super-Resolution NTIRE 2017 1st Place Award (Challenge on Single Image Super-Resolution) 서울대 이경무 교수팀 arXiv 버전 발표자 : 정우진 한양대학교 컴퓨터 비전 및 패턴 인식 연구실

2. / 20 • NTIRE 2017 – DIV2K • Introduction • Proposed Method • 실험 및 분석 2 Contents

3. / 20 NTIRE : New Trends in Image Restoration and Enhancement workshop • 기간 – 2017년 7월 21일 • 분야 – Papers addressing topics related to image restoration and enhancement are invited. The topics include, but are not limited to: • 2017년에는 초해상도 복원 경진대회(NTIRE challenge on example-based single image super-resolution)를 진행 – 1위 : 서울대 이경무 교수팀 – 2위 : 중국팀 – 3위 : 카이스트 예종철 교수팀 – 4위 : 카이스트 김문철 교수팀 – 5위 :중국팀 3 NTIRE 2017

4. / 20 • NTIRE 2017 초해상도 복원 경진 대회를 위해 준비한 데이터 셋 – 2040x1356 크기 HR – 블러 없음 – Track 1 : Matlab bicubic 함수로 2배, 3배, 4배 축소한 LR – Track 2 : 어떻게 저해상도가 되었는지 알 수 없는 LR, 2배, 3배, 4배 축 소 4 DIV2K

5. / 20 • 기존 방법의 문제점 – 학습이 잘 안됨 – 주의 : 경진 대회 용 DNN이므로 매우 깊음. 그래서 학습이 잘 안됨 – 한번에 한가지 영상만 생성함 – 2배, 3배, 4배 용 DNN이 독립적으로 필요함 • 기여 – SRResNet을 기반으로 – 1. 필요없는 부분 제거 – 2. 3배, 4배 학습을 위해 2배 학습 결 과 사용함 – 3. 한번에 여러 배율 확대 가능 5 Introduction

6. / 20 Baseline • SRResNet에서 변화 • Batch normalization 층 제거 – BN이 초해상도를 방해 – 초해상도에서는 정규화가 악영향 – 대신 residual scaling을 사용 – 다음 슬라이드에서 설명… 6 Proposed Method

7. / 20 ※ Residual Scaling • Google Inception-v4 에서 소 개 • 기존의 문제점 – 매우 깊은 레지듀얼 네트워크 훈 련 불가능(학습중 네트워크가 ‘사망’, 0 값만 만들어 냄) – BN으로 해결되지 않음 • 레지듀얼 스케일링 – Residual의 끝에 0.1~0.3 곱함 • 다른 해결책 – Training warm-up : 학습 초기 에 아주 작은 학습률로 학습 – 구글팀은 웜업으로도 불가능한 경우가 있다고 보고함 7

8. / 20 EDSR • Baseline 모델에서 깊이와 폭을 확장 MDSR • 한번에 2배, 3배, 4배를 모두 처리할 수 잇는 구조 • 도입 부분과 재구성 부분이 각각 존재함 • 중간 부분은 공유 8 Proposed Method

9. / 20 Geometric Self-ensemble • 7개 입력 영상 추가 생성 – flip, rotation하여 7개 입력 영상 생성 + 원래 영상 • 결과 영상을 앙상블 • EDSR+, MDSR+ 로 표기 9 Proposed Method

10. / 20 MDSR이 가능한 이유 • EDSR훈련중 • 3배, 4배 모델 학습을 위해 2배 학습 결과를 초기 가중치로 활용함 • 학습도 빠르고 결과도 우수함 • 따라서 SR은 배율에 상관없는 유사성이 있다고 판단 10 실험 및 분석

11. / 20 모델 변화에 따른 성능 향상 • 발전과정 – SRResNet(L2 loss) – SRResNet(L1 loss) – Baseline – EDSR, MDSR – EDSR+, MDSR+ 11 실험 및 분석

12. / 20 Benchmark Results 12 실험 및 분석

13. / 20 NTIRE 2017 Track 1 : bicubic downscaling 13 실험 및 분석

14. / 20 NTIRE 2017 Track 2 : unknown downscailing 14 실험 및 분석

15. / 20 NTIRE 2017 Track 1 & 2 15 실험 및 분석

16. / 20 감사합니다. 16

Review EDSR

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Review EDSR

Similar to Review EDSR (14)

Review EDSR