SlideShare a Scribd company logo
1 of 16
Enhanced Deep Residual Network for
Single Image Super-Resolution
NTIRE 2017 1st Place Award
(Challenge on Single Image Super-Resolution)
서울대 이경무 교수팀
arXiv 버전
발표자 : 정우진
한양대학교 컴퓨터 비전 및 패턴 인식 연구실
/ 20
• NTIRE 2017
– DIV2K
• Introduction
• Proposed Method
• 실험 및 분석
2
Contents
/ 20
NTIRE : New Trends in Image Restoration and Enhancement
workshop
• 기간
– 2017년 7월 21일
• 분야
– Papers addressing topics related to image restoration and
enhancement are invited. The topics include, but are not limited to:
• 2017년에는 초해상도 복원 경진대회(NTIRE challenge on
example-based single image super-resolution)를 진행
– 1위 : 서울대 이경무 교수팀
– 2위 : 중국팀
– 3위 : 카이스트 예종철 교수팀
– 4위 : 카이스트 김문철 교수팀
– 5위 :중국팀
3
NTIRE 2017
/ 20
• NTIRE 2017 초해상도 복원 경진 대회를 위해 준비한 데이터 셋
– 2040x1356 크기 HR
– 블러 없음
– Track 1 : Matlab bicubic 함수로 2배, 3배, 4배 축소한 LR
– Track 2 : 어떻게 저해상도가 되었는지 알 수 없는 LR, 2배, 3배, 4배 축
소
4
DIV2K
/ 20
• 기존 방법의 문제점
– 학습이 잘 안됨
– 주의 : 경진 대회 용 DNN이므로 매우
깊음. 그래서 학습이 잘 안됨
– 한번에 한가지 영상만 생성함
– 2배, 3배, 4배 용 DNN이 독립적으로
필요함
• 기여
– SRResNet을 기반으로
– 1. 필요없는 부분 제거
– 2. 3배, 4배 학습을 위해 2배 학습 결
과 사용함
– 3. 한번에 여러 배율 확대 가능
5
Introduction
/ 20
Baseline
• SRResNet에서 변화
• Batch normalization 층 제거
– BN이 초해상도를 방해
– 초해상도에서는 정규화가 악영향
– 대신 residual scaling을 사용
– 다음 슬라이드에서 설명…
6
Proposed Method
/ 20
※ Residual Scaling
• Google Inception-v4 에서 소
개
• 기존의 문제점
– 매우 깊은 레지듀얼 네트워크 훈
련 불가능(학습중 네트워크가
‘사망’, 0 값만 만들어 냄)
– BN으로 해결되지 않음
• 레지듀얼 스케일링
– Residual의 끝에 0.1~0.3 곱함
• 다른 해결책
– Training warm-up : 학습 초기
에 아주 작은 학습률로 학습
– 구글팀은 웜업으로도 불가능한
경우가 있다고 보고함
7
/ 20
EDSR
• Baseline 모델에서 깊이와 폭을 확장
MDSR
• 한번에 2배, 3배, 4배를 모두
처리할 수 잇는 구조
• 도입 부분과 재구성 부분이
각각 존재함
• 중간 부분은 공유
8
Proposed Method
/ 20
Geometric Self-ensemble
• 7개 입력 영상 추가 생성
– flip, rotation하여 7개 입력 영상 생성 + 원래 영상
• 결과 영상을 앙상블
• EDSR+, MDSR+ 로 표기
9
Proposed Method
/ 20
MDSR이 가능한 이유
• EDSR훈련중
• 3배, 4배 모델 학습을 위해
2배 학습 결과를 초기 가중치로
활용함
• 학습도 빠르고 결과도 우수함
• 따라서 SR은 배율에 상관없는
유사성이 있다고 판단
10
실험 및 분석
/ 20
모델 변화에 따른 성능 향상
• 발전과정
– SRResNet(L2 loss)
– SRResNet(L1 loss)
– Baseline
– EDSR, MDSR
– EDSR+, MDSR+
11
실험 및 분석
/ 20
Benchmark Results
12
실험 및 분석
/ 20
NTIRE 2017 Track 1 : bicubic downscaling
13
실험 및 분석
/ 20
NTIRE 2017 Track 2 : unknown downscailing
14
실험 및 분석
/ 20
NTIRE 2017 Track 1 & 2
15
실험 및 분석
/ 20
감사합니다.
16

More Related Content

What's hot

(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...MYEONGGYU LEE
 
Photo realistic single image super-resolution using a generative adversarial ...
Photo realistic single image super-resolution using a generative adversarial ...Photo realistic single image super-resolution using a generative adversarial ...
Photo realistic single image super-resolution using a generative adversarial ...soul8085
 
PR-339: Maintaining discrimination and fairness in class incremental learning
PR-339: Maintaining discrimination and fairness in class incremental learningPR-339: Maintaining discrimination and fairness in class incremental learning
PR-339: Maintaining discrimination and fairness in class incremental learningSunghoon Joo
 
PR-203: Class-Balanced Loss Based on Effective Number of Samples
PR-203: Class-Balanced Loss Based on Effective Number of SamplesPR-203: Class-Balanced Loss Based on Effective Number of Samples
PR-203: Class-Balanced Loss Based on Effective Number of SamplesSunghoon Joo
 
[신경망기초] 심층신경망개요
[신경망기초] 심층신경망개요[신경망기초] 심층신경망개요
[신경망기초] 심층신경망개요jaypi Ko
 
Simple Review of Single Image Super Resolution Task
Simple Review of Single Image Super Resolution TaskSimple Review of Single Image Super Resolution Task
Simple Review of Single Image Super Resolution TaskMYEONGGYU LEE
 
Survey on Monocular Depth Estimation
Survey on Monocular Depth EstimationSurvey on Monocular Depth Estimation
Survey on Monocular Depth Estimation범준 김
 
Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...
Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...
Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...Mad Scientists
 
[한국어] Neural Architecture Search with Reinforcement Learning
[한국어] Neural Architecture Search with Reinforcement Learning[한국어] Neural Architecture Search with Reinforcement Learning
[한국어] Neural Architecture Search with Reinforcement LearningKiho Suh
 
Designing more efficient convolution neural network
Designing more efficient convolution neural networkDesigning more efficient convolution neural network
Designing more efficient convolution neural networkDongyi Kim
 
딥러닝 논문읽기 efficient netv2 논문리뷰
딥러닝 논문읽기 efficient netv2  논문리뷰딥러닝 논문읽기 efficient netv2  논문리뷰
딥러닝 논문읽기 efficient netv2 논문리뷰taeseon ryu
 
Encoding in Style: a Style Encoder for Image-to-Image Translation
Encoding in Style: a Style Encoder for Image-to-Image TranslationEncoding in Style: a Style Encoder for Image-to-Image Translation
Encoding in Style: a Style Encoder for Image-to-Image Translationtaeseon ryu
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural NetworksSanghoon Yoon
 
PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...
PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...
PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...Sunghoon Joo
 
Deep learning seminar_snu_161031
Deep learning seminar_snu_161031Deep learning seminar_snu_161031
Deep learning seminar_snu_161031Jinwon Lee
 
"Learning transferable architectures for scalable image recognition" Paper Re...
"Learning transferable architectures for scalable image recognition" Paper Re..."Learning transferable architectures for scalable image recognition" Paper Re...
"Learning transferable architectures for scalable image recognition" Paper Re...LEE HOSEONG
 
Dense sparse-dense training for dnn and Other Models
Dense sparse-dense training for dnn and Other ModelsDense sparse-dense training for dnn and Other Models
Dense sparse-dense training for dnn and Other ModelsDong Heon Cho
 
PR-218: MFAS: Multimodal Fusion Architecture Search
PR-218: MFAS: Multimodal Fusion Architecture SearchPR-218: MFAS: Multimodal Fusion Architecture Search
PR-218: MFAS: Multimodal Fusion Architecture SearchSunghoon Joo
 

What's hot (20)

(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
 
Photo realistic single image super-resolution using a generative adversarial ...
Photo realistic single image super-resolution using a generative adversarial ...Photo realistic single image super-resolution using a generative adversarial ...
Photo realistic single image super-resolution using a generative adversarial ...
 
PR-339: Maintaining discrimination and fairness in class incremental learning
PR-339: Maintaining discrimination and fairness in class incremental learningPR-339: Maintaining discrimination and fairness in class incremental learning
PR-339: Maintaining discrimination and fairness in class incremental learning
 
PR-203: Class-Balanced Loss Based on Effective Number of Samples
PR-203: Class-Balanced Loss Based on Effective Number of SamplesPR-203: Class-Balanced Loss Based on Effective Number of Samples
PR-203: Class-Balanced Loss Based on Effective Number of Samples
 
[신경망기초] 심층신경망개요
[신경망기초] 심층신경망개요[신경망기초] 심층신경망개요
[신경망기초] 심층신경망개요
 
Simple Review of Single Image Super Resolution Task
Simple Review of Single Image Super Resolution TaskSimple Review of Single Image Super Resolution Task
Simple Review of Single Image Super Resolution Task
 
Survey on Monocular Depth Estimation
Survey on Monocular Depth EstimationSurvey on Monocular Depth Estimation
Survey on Monocular Depth Estimation
 
Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...
Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...
Face Feature Recognition System with Deep Belief Networks, for Korean/KIISE T...
 
[한국어] Neural Architecture Search with Reinforcement Learning
[한국어] Neural Architecture Search with Reinforcement Learning[한국어] Neural Architecture Search with Reinforcement Learning
[한국어] Neural Architecture Search with Reinforcement Learning
 
Designing more efficient convolution neural network
Designing more efficient convolution neural networkDesigning more efficient convolution neural network
Designing more efficient convolution neural network
 
딥러닝 논문읽기 efficient netv2 논문리뷰
딥러닝 논문읽기 efficient netv2  논문리뷰딥러닝 논문읽기 efficient netv2  논문리뷰
딥러닝 논문읽기 efficient netv2 논문리뷰
 
Encoding in Style: a Style Encoder for Image-to-Image Translation
Encoding in Style: a Style Encoder for Image-to-Image TranslationEncoding in Style: a Style Encoder for Image-to-Image Translation
Encoding in Style: a Style Encoder for Image-to-Image Translation
 
Detecting fake jpeg images
Detecting fake jpeg imagesDetecting fake jpeg images
Detecting fake jpeg images
 
HistoryOfCNN
HistoryOfCNNHistoryOfCNN
HistoryOfCNN
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networks
 
PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...
PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...
PR-313 Training BatchNorm and Only BatchNorm: On the Expressive Power of Rand...
 
Deep learning seminar_snu_161031
Deep learning seminar_snu_161031Deep learning seminar_snu_161031
Deep learning seminar_snu_161031
 
"Learning transferable architectures for scalable image recognition" Paper Re...
"Learning transferable architectures for scalable image recognition" Paper Re..."Learning transferable architectures for scalable image recognition" Paper Re...
"Learning transferable architectures for scalable image recognition" Paper Re...
 
Dense sparse-dense training for dnn and Other Models
Dense sparse-dense training for dnn and Other ModelsDense sparse-dense training for dnn and Other Models
Dense sparse-dense training for dnn and Other Models
 
PR-218: MFAS: Multimodal Fusion Architecture Search
PR-218: MFAS: Multimodal Fusion Architecture SearchPR-218: MFAS: Multimodal Fusion Architecture Search
PR-218: MFAS: Multimodal Fusion Architecture Search
 

Similar to Review EDSR

Deep learning super resolution
Deep learning super resolutionDeep learning super resolution
Deep learning super resolutionNAVER Engineering
 
Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...
Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...
Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...태엽 김
 
"simple does it weakly supervised instance and semantic segmentation" Paper r...
"simple does it weakly supervised instance and semantic segmentation" Paper r..."simple does it weakly supervised instance and semantic segmentation" Paper r...
"simple does it weakly supervised instance and semantic segmentation" Paper r...LEE HOSEONG
 
180212 normalization hyu_dake
180212 normalization hyu_dake180212 normalization hyu_dake
180212 normalization hyu_dakeDongGyun Hong
 
[Paper Review] Visualizing and understanding convolutional networks
[Paper Review] Visualizing and understanding convolutional networks[Paper Review] Visualizing and understanding convolutional networks
[Paper Review] Visualizing and understanding convolutional networksKorea, Sejong University.
 
Bag of Tricks for Image Classification with Convolutional Neural Networks (C...
Bag of Tricks for Image Classification  with Convolutional Neural Networks (C...Bag of Tricks for Image Classification  with Convolutional Neural Networks (C...
Bag of Tricks for Image Classification with Convolutional Neural Networks (C...gohyunwoong
 
Learning how to explain neural networks: PatternNet and PatternAttribution
Learning how to explain neural networks: PatternNet and PatternAttributionLearning how to explain neural networks: PatternNet and PatternAttribution
Learning how to explain neural networks: PatternNet and PatternAttributionGyubin Son
 
carrier of_tricks_for_image_classification
carrier of_tricks_for_image_classificationcarrier of_tricks_for_image_classification
carrier of_tricks_for_image_classificationLEE HOSEONG
 
History of Vision AI
History of Vision AIHistory of Vision AI
History of Vision AITae Young Lee
 
[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...
[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...
[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...Gyubin Son
 
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...Sunghoon Joo
 
09_Bilateral filtering/Reprojection Cache 소개
09_Bilateral filtering/Reprojection Cache 소개09_Bilateral filtering/Reprojection Cache 소개
09_Bilateral filtering/Reprojection Cache 소개noerror
 
Deep Object Detectors #1 (~2016.6)
Deep Object Detectors #1 (~2016.6)Deep Object Detectors #1 (~2016.6)
Deep Object Detectors #1 (~2016.6)Ildoo Kim
 

Similar to Review EDSR (14)

Deep learning super resolution
Deep learning super resolutionDeep learning super resolution
Deep learning super resolution
 
Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...
Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...
Progressive Growing of GANs for Improved Quality, Stability, and Variation Re...
 
"simple does it weakly supervised instance and semantic segmentation" Paper r...
"simple does it weakly supervised instance and semantic segmentation" Paper r..."simple does it weakly supervised instance and semantic segmentation" Paper r...
"simple does it weakly supervised instance and semantic segmentation" Paper r...
 
Refinenet
RefinenetRefinenet
Refinenet
 
180212 normalization hyu_dake
180212 normalization hyu_dake180212 normalization hyu_dake
180212 normalization hyu_dake
 
[Paper Review] Visualizing and understanding convolutional networks
[Paper Review] Visualizing and understanding convolutional networks[Paper Review] Visualizing and understanding convolutional networks
[Paper Review] Visualizing and understanding convolutional networks
 
Bag of Tricks for Image Classification with Convolutional Neural Networks (C...
Bag of Tricks for Image Classification  with Convolutional Neural Networks (C...Bag of Tricks for Image Classification  with Convolutional Neural Networks (C...
Bag of Tricks for Image Classification with Convolutional Neural Networks (C...
 
Learning how to explain neural networks: PatternNet and PatternAttribution
Learning how to explain neural networks: PatternNet and PatternAttributionLearning how to explain neural networks: PatternNet and PatternAttribution
Learning how to explain neural networks: PatternNet and PatternAttribution
 
carrier of_tricks_for_image_classification
carrier of_tricks_for_image_classificationcarrier of_tricks_for_image_classification
carrier of_tricks_for_image_classification
 
History of Vision AI
History of Vision AIHistory of Vision AI
History of Vision AI
 
[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...
[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...
[paper review] 손규빈 - Eye in the sky & 3D human pose estimation in video with ...
 
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
PR-383: Solving ImageNet: a Unified Scheme for Training any Backbone to Top R...
 
09_Bilateral filtering/Reprojection Cache 소개
09_Bilateral filtering/Reprojection Cache 소개09_Bilateral filtering/Reprojection Cache 소개
09_Bilateral filtering/Reprojection Cache 소개
 
Deep Object Detectors #1 (~2016.6)
Deep Object Detectors #1 (~2016.6)Deep Object Detectors #1 (~2016.6)
Deep Object Detectors #1 (~2016.6)
 

Review EDSR

  • 1. Enhanced Deep Residual Network for Single Image Super-Resolution NTIRE 2017 1st Place Award (Challenge on Single Image Super-Resolution) 서울대 이경무 교수팀 arXiv 버전 발표자 : 정우진 한양대학교 컴퓨터 비전 및 패턴 인식 연구실
  • 2. / 20 • NTIRE 2017 – DIV2K • Introduction • Proposed Method • 실험 및 분석 2 Contents
  • 3. / 20 NTIRE : New Trends in Image Restoration and Enhancement workshop • 기간 – 2017년 7월 21일 • 분야 – Papers addressing topics related to image restoration and enhancement are invited. The topics include, but are not limited to: • 2017년에는 초해상도 복원 경진대회(NTIRE challenge on example-based single image super-resolution)를 진행 – 1위 : 서울대 이경무 교수팀 – 2위 : 중국팀 – 3위 : 카이스트 예종철 교수팀 – 4위 : 카이스트 김문철 교수팀 – 5위 :중국팀 3 NTIRE 2017
  • 4. / 20 • NTIRE 2017 초해상도 복원 경진 대회를 위해 준비한 데이터 셋 – 2040x1356 크기 HR – 블러 없음 – Track 1 : Matlab bicubic 함수로 2배, 3배, 4배 축소한 LR – Track 2 : 어떻게 저해상도가 되었는지 알 수 없는 LR, 2배, 3배, 4배 축 소 4 DIV2K
  • 5. / 20 • 기존 방법의 문제점 – 학습이 잘 안됨 – 주의 : 경진 대회 용 DNN이므로 매우 깊음. 그래서 학습이 잘 안됨 – 한번에 한가지 영상만 생성함 – 2배, 3배, 4배 용 DNN이 독립적으로 필요함 • 기여 – SRResNet을 기반으로 – 1. 필요없는 부분 제거 – 2. 3배, 4배 학습을 위해 2배 학습 결 과 사용함 – 3. 한번에 여러 배율 확대 가능 5 Introduction
  • 6. / 20 Baseline • SRResNet에서 변화 • Batch normalization 층 제거 – BN이 초해상도를 방해 – 초해상도에서는 정규화가 악영향 – 대신 residual scaling을 사용 – 다음 슬라이드에서 설명… 6 Proposed Method
  • 7. / 20 ※ Residual Scaling • Google Inception-v4 에서 소 개 • 기존의 문제점 – 매우 깊은 레지듀얼 네트워크 훈 련 불가능(학습중 네트워크가 ‘사망’, 0 값만 만들어 냄) – BN으로 해결되지 않음 • 레지듀얼 스케일링 – Residual의 끝에 0.1~0.3 곱함 • 다른 해결책 – Training warm-up : 학습 초기 에 아주 작은 학습률로 학습 – 구글팀은 웜업으로도 불가능한 경우가 있다고 보고함 7
  • 8. / 20 EDSR • Baseline 모델에서 깊이와 폭을 확장 MDSR • 한번에 2배, 3배, 4배를 모두 처리할 수 잇는 구조 • 도입 부분과 재구성 부분이 각각 존재함 • 중간 부분은 공유 8 Proposed Method
  • 9. / 20 Geometric Self-ensemble • 7개 입력 영상 추가 생성 – flip, rotation하여 7개 입력 영상 생성 + 원래 영상 • 결과 영상을 앙상블 • EDSR+, MDSR+ 로 표기 9 Proposed Method
  • 10. / 20 MDSR이 가능한 이유 • EDSR훈련중 • 3배, 4배 모델 학습을 위해 2배 학습 결과를 초기 가중치로 활용함 • 학습도 빠르고 결과도 우수함 • 따라서 SR은 배율에 상관없는 유사성이 있다고 판단 10 실험 및 분석
  • 11. / 20 모델 변화에 따른 성능 향상 • 발전과정 – SRResNet(L2 loss) – SRResNet(L1 loss) – Baseline – EDSR, MDSR – EDSR+, MDSR+ 11 실험 및 분석
  • 13. / 20 NTIRE 2017 Track 1 : bicubic downscaling 13 실험 및 분석
  • 14. / 20 NTIRE 2017 Track 2 : unknown downscailing 14 실험 및 분석
  • 15. / 20 NTIRE 2017 Track 1 & 2 15 실험 및 분석