SlideShare a Scribd company logo
1 of 26
Download to read offline
RankSRGAN:
Generative Adversarial Networks with Ranker for Image
Super-Resolution
Wenlong Zhang, Yihao Liu, Chao Dong, Yu Qiao
ICCV 2019
Wonbeom Jang
Introduction
Single image super resolution aims at reconstructing/generating a high-resolution (HR)
image from a low resolution (LR) observation.
Single Image Super Resolution
PSNR
Introduction
SSIM
NIQE
Evaluation Index
Single Image Super Resolution
- MSE Loss
- Achieve high PSNR value
- Tend to produce overly smoothed/sharpened images.
- GAN Loss
- Generate realistic texture and details.
Introduction
Single Image Super Resolution
Most of these NR-IQA metrics are not differentiable, making them infeasible to serve as
loss functions.
To overcome this obstacle, we propose a general and differentiable model – Ranker, which
can mimic any NR-IQA metric and provide a clear goal (as loss function) for optimizing
perceptual quality.
Introduction
Overview
Overview
Stage 1: Generate pair-wise rank images.
1. Employ different SR methods
2. Apply a chosen perceptual matrix
3. Pick up two image of same contents
4. Rank the pair-wise image according to the quality score calculated by the
perceptual metric
SRGAN / NIQE / 3.82 ESRGAN / NIQE / 3.28
Overview
Stage 2: Train Ranker
The Ranker adopts a Siamese architecture to learn the behavior of perceptual metrics
—> use margin-ranking loss
Overview
Stage 3: Introduce rank-content loss
Use it to define a rank-content loss for a standard SRGAN
Ranker
The Ranker uses a Siamese-like architecture [5, 9, 38], which is effective for pair-wise inputs.
Siamese Architecture
Ranker
- Two input images y1, y2, the ranking scores s1 and s2
- ΘR represents the network weights and R(.) indicates the mapping function of Ranker.
- my1 and my2 represent the quality scores of image
Rank Dataset
Ranker
To train Ranker, we employ margin ranking loss where the s1 and s2 represent the ranking
scores of pairwise images.
Optimization
Label: x1 > x2
Prediction: x1 > x2, loss == 0
Prediction: x1 < x2, loss > 0
Ranker
To train Ranker, we employ margin ranking loss where the s1 and s2 represent the ranking
scores of pairwise images.
Optimization
RankSRGAN
- RankSRGAN consists of a standard SRGAN and the proposed Ranker
- Compared with existing SRGAN, our framework simply adds a well-trained Ranker to
constrain the generator in SR space.
Details
Loss
RankSRGAN
The perceptual loss is proposed to measure the perceptual similarity between two images
Perceptual loss
where φ(yi) and φ(yi) represent the feature maps of HR and SR images, respectively. Here φ is
obtained by the 5-th convolution (before maxpooling) layer within VGG19 network
RankSRGAN
- Adversarial training is recently used to produce natural-looking images
- Generator loss LG is defined based on the output of discriminator where xi is the LR image
Adversarial loss
Rank-content loss
- The generated image is put into the Ranker to predict the ranking score.
- R(G(xi)) is the ranking score of the generated image. A lower ranking score indicates
better perceptual quality.
Comparison with the-state-of-the-arts
Comparison with the-state-of-the-arts
Analysis of Ranker
1. Select the state-of-the-art perceptual SR methods – SRGAN and ESRGAN to build the
rank dataset
2. Use the perceptual metric NIQE for evaluation
Elaborately selecting the SR algorithms and the perceptual metric
Analysis of Ranker
Evaluation
Spearman Rank Order Correlation Coefficient (SROCC)
The larger value of SROCC represents the better accuracy of Ranker
For validation dataset, the ranker achieves a SROCC of 0.88
SROCC
PSNR -0.8197
SSIM -0.8522
Ranker 0.88 https://bskyvision.com/392
Ablation study
In the main experiments, we use (SRResNet, SRGAN, ESRGAN) to generate the rank dataset.
- Then what if we select other SR algorithms?
- Will we always obtain better results than SRGAN?
We train our Ranker with (SRResNet, SRGAN, HR) and obtain the new SR model – RankSRGAN-HR.
Effect of different rank datasets
Ablation study
Effect of different perceptual metrics
Ablation study
Effect of Ranker: Rank VS. Regression
Ablation study
Effect of different losses.
Expect to add MSE loss to achieve improvement in PSNR
User study
Effect of different losses.
- Two different SR images are shown at the same time where one is generated by
the proposed RankSRGAN and the other is generated by SRGAN or ESRGAN.
- The participants are required to pick the image that is more visually pleasant
Conclusion
- The key idea is introducing a Ranker to learn the behavior of the perceptual
metrics by learning to rank approach.
- Our proposed method can combine the strengths of different SR methods and
generate better results.

More Related Content

Similar to RankSRGAN

Similar to RankSRGAN (20)

IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...
IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...
IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...
 
SAR ICE IMAGE CLASSIFICATION USING PARALLELEPIPED CLASSIFIER BASED ON GRAM-SC...
SAR ICE IMAGE CLASSIFICATION USING PARALLELEPIPED CLASSIFIER BASED ON GRAM-SC...SAR ICE IMAGE CLASSIFICATION USING PARALLELEPIPED CLASSIFIER BASED ON GRAM-SC...
SAR ICE IMAGE CLASSIFICATION USING PARALLELEPIPED CLASSIFIER BASED ON GRAM-SC...
 
Sar ice image classification using parallelepiped classifier based on gram sc...
Sar ice image classification using parallelepiped classifier based on gram sc...Sar ice image classification using parallelepiped classifier based on gram sc...
Sar ice image classification using parallelepiped classifier based on gram sc...
 
Image super resolution based on
Image super resolution based onImage super resolution based on
Image super resolution based on
 
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
 
Rendering Techniques in Virtual Reality.pdf
Rendering Techniques in Virtual Reality.pdfRendering Techniques in Virtual Reality.pdf
Rendering Techniques in Virtual Reality.pdf
 
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNRAnalysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
 
2021 04-03-sean
2021 04-03-sean2021 04-03-sean
2021 04-03-sean
 
Video Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTVideo Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFT
 
cvpresentation-190812154654 (1).pptx
cvpresentation-190812154654 (1).pptxcvpresentation-190812154654 (1).pptx
cvpresentation-190812154654 (1).pptx
 
ppt 20BET1024.pptx
ppt 20BET1024.pptxppt 20BET1024.pptx
ppt 20BET1024.pptx
 
An attribute assisted reranking model
An attribute assisted reranking modelAn attribute assisted reranking model
An attribute assisted reranking model
 
A REGULARIZED ROBUST SUPER-RESOLUTION APPROACH FORALIASED IMAGES AND LOW RESO...
A REGULARIZED ROBUST SUPER-RESOLUTION APPROACH FORALIASED IMAGES AND LOW RESO...A REGULARIZED ROBUST SUPER-RESOLUTION APPROACH FORALIASED IMAGES AND LOW RESO...
A REGULARIZED ROBUST SUPER-RESOLUTION APPROACH FORALIASED IMAGES AND LOW RESO...
 
StarGAN
StarGANStarGAN
StarGAN
 
A Fully Progressive approach to Single image super-resolution
A Fully Progressive approach to Single image super-resolution A Fully Progressive approach to Single image super-resolution
A Fully Progressive approach to Single image super-resolution
 
MACHINE LEARNING.pptx
MACHINE LEARNING.pptxMACHINE LEARNING.pptx
MACHINE LEARNING.pptx
 
Gan
GanGan
Gan
 
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
 
Performance Evaluation of Illumination Invariant Face Recognition Algorthims
Performance Evaluation of Illumination Invariant Face Recognition AlgorthimsPerformance Evaluation of Illumination Invariant Face Recognition Algorthims
Performance Evaluation of Illumination Invariant Face Recognition Algorthims
 

Recently uploaded

Online crime reporting system project.pdf
Online crime reporting system project.pdfOnline crime reporting system project.pdf
Online crime reporting system project.pdf
Kamal Acharya
 
Seizure stage detection of epileptic seizure using convolutional neural networks
Seizure stage detection of epileptic seizure using convolutional neural networksSeizure stage detection of epileptic seizure using convolutional neural networks
Seizure stage detection of epileptic seizure using convolutional neural networks
IJECEIAES
 
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
drjose256
 
Maher Othman Interior Design Portfolio..
Maher Othman Interior Design Portfolio..Maher Othman Interior Design Portfolio..
Maher Othman Interior Design Portfolio..
MaherOthman7
 

Recently uploaded (20)

Raashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashid final report on Embedded Systems
Raashid final report on Embedded Systems
 
Dynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptxDynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptx
 
Research Methodolgy & Intellectual Property Rights Series 2
Research Methodolgy & Intellectual Property Rights Series 2Research Methodolgy & Intellectual Property Rights Series 2
Research Methodolgy & Intellectual Property Rights Series 2
 
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptxSLIDESHARE PPT-DECISION MAKING METHODS.pptx
SLIDESHARE PPT-DECISION MAKING METHODS.pptx
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 
Passive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptPassive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.ppt
 
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
NEWLETTER FRANCE HELICES/ SDS SURFACE DRIVES - MAY 2024
 
Online crime reporting system project.pdf
Online crime reporting system project.pdfOnline crime reporting system project.pdf
Online crime reporting system project.pdf
 
Piping and instrumentation diagram p.pdf
Piping and instrumentation diagram p.pdfPiping and instrumentation diagram p.pdf
Piping and instrumentation diagram p.pdf
 
AI in Healthcare Innovative use cases and applications.pdf
AI in Healthcare Innovative use cases and applications.pdfAI in Healthcare Innovative use cases and applications.pdf
AI in Healthcare Innovative use cases and applications.pdf
 
Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1
 
Seizure stage detection of epileptic seizure using convolutional neural networks
Seizure stage detection of epileptic seizure using convolutional neural networksSeizure stage detection of epileptic seizure using convolutional neural networks
Seizure stage detection of epileptic seizure using convolutional neural networks
 
Linux Systems Programming: Semaphores, Shared Memory, and Message Queues
Linux Systems Programming: Semaphores, Shared Memory, and Message QueuesLinux Systems Programming: Semaphores, Shared Memory, and Message Queues
Linux Systems Programming: Semaphores, Shared Memory, and Message Queues
 
analog-vs-digital-communication (concept of analog and digital).pptx
analog-vs-digital-communication (concept of analog and digital).pptxanalog-vs-digital-communication (concept of analog and digital).pptx
analog-vs-digital-communication (concept of analog and digital).pptx
 
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas SachpazisSeismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
Seismic Hazard Assessment Software in Python by Prof. Dr. Costas Sachpazis
 
handbook on reinforce concrete and detailing
handbook on reinforce concrete and detailinghandbook on reinforce concrete and detailing
handbook on reinforce concrete and detailing
 
Lab Manual Arduino UNO Microcontrollar.docx
Lab Manual Arduino UNO Microcontrollar.docxLab Manual Arduino UNO Microcontrollar.docx
Lab Manual Arduino UNO Microcontrollar.docx
 
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
 
Maher Othman Interior Design Portfolio..
Maher Othman Interior Design Portfolio..Maher Othman Interior Design Portfolio..
Maher Othman Interior Design Portfolio..
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdf
 

RankSRGAN

  • 1. RankSRGAN: Generative Adversarial Networks with Ranker for Image Super-Resolution Wenlong Zhang, Yihao Liu, Chao Dong, Yu Qiao ICCV 2019 Wonbeom Jang
  • 2. Introduction Single image super resolution aims at reconstructing/generating a high-resolution (HR) image from a low resolution (LR) observation. Single Image Super Resolution
  • 4. Single Image Super Resolution - MSE Loss - Achieve high PSNR value - Tend to produce overly smoothed/sharpened images. - GAN Loss - Generate realistic texture and details. Introduction
  • 5. Single Image Super Resolution Most of these NR-IQA metrics are not differentiable, making them infeasible to serve as loss functions. To overcome this obstacle, we propose a general and differentiable model – Ranker, which can mimic any NR-IQA metric and provide a clear goal (as loss function) for optimizing perceptual quality. Introduction
  • 7. Overview Stage 1: Generate pair-wise rank images. 1. Employ different SR methods 2. Apply a chosen perceptual matrix 3. Pick up two image of same contents 4. Rank the pair-wise image according to the quality score calculated by the perceptual metric SRGAN / NIQE / 3.82 ESRGAN / NIQE / 3.28
  • 8. Overview Stage 2: Train Ranker The Ranker adopts a Siamese architecture to learn the behavior of perceptual metrics —> use margin-ranking loss
  • 9. Overview Stage 3: Introduce rank-content loss Use it to define a rank-content loss for a standard SRGAN
  • 10. Ranker The Ranker uses a Siamese-like architecture [5, 9, 38], which is effective for pair-wise inputs. Siamese Architecture
  • 11. Ranker - Two input images y1, y2, the ranking scores s1 and s2 - ΘR represents the network weights and R(.) indicates the mapping function of Ranker. - my1 and my2 represent the quality scores of image Rank Dataset
  • 12. Ranker To train Ranker, we employ margin ranking loss where the s1 and s2 represent the ranking scores of pairwise images. Optimization Label: x1 > x2 Prediction: x1 > x2, loss == 0 Prediction: x1 < x2, loss > 0
  • 13. Ranker To train Ranker, we employ margin ranking loss where the s1 and s2 represent the ranking scores of pairwise images. Optimization
  • 14. RankSRGAN - RankSRGAN consists of a standard SRGAN and the proposed Ranker - Compared with existing SRGAN, our framework simply adds a well-trained Ranker to constrain the generator in SR space. Details Loss
  • 15. RankSRGAN The perceptual loss is proposed to measure the perceptual similarity between two images Perceptual loss where φ(yi) and φ(yi) represent the feature maps of HR and SR images, respectively. Here φ is obtained by the 5-th convolution (before maxpooling) layer within VGG19 network
  • 16. RankSRGAN - Adversarial training is recently used to produce natural-looking images - Generator loss LG is defined based on the output of discriminator where xi is the LR image Adversarial loss Rank-content loss - The generated image is put into the Ranker to predict the ranking score. - R(G(xi)) is the ranking score of the generated image. A lower ranking score indicates better perceptual quality.
  • 19. Analysis of Ranker 1. Select the state-of-the-art perceptual SR methods – SRGAN and ESRGAN to build the rank dataset 2. Use the perceptual metric NIQE for evaluation Elaborately selecting the SR algorithms and the perceptual metric
  • 20. Analysis of Ranker Evaluation Spearman Rank Order Correlation Coefficient (SROCC) The larger value of SROCC represents the better accuracy of Ranker For validation dataset, the ranker achieves a SROCC of 0.88 SROCC PSNR -0.8197 SSIM -0.8522 Ranker 0.88 https://bskyvision.com/392
  • 21. Ablation study In the main experiments, we use (SRResNet, SRGAN, ESRGAN) to generate the rank dataset. - Then what if we select other SR algorithms? - Will we always obtain better results than SRGAN? We train our Ranker with (SRResNet, SRGAN, HR) and obtain the new SR model – RankSRGAN-HR. Effect of different rank datasets
  • 22. Ablation study Effect of different perceptual metrics
  • 23. Ablation study Effect of Ranker: Rank VS. Regression
  • 24. Ablation study Effect of different losses. Expect to add MSE loss to achieve improvement in PSNR
  • 25. User study Effect of different losses. - Two different SR images are shown at the same time where one is generated by the proposed RankSRGAN and the other is generated by SRGAN or ESRGAN. - The participants are required to pick the image that is more visually pleasant
  • 26. Conclusion - The key idea is introducing a Ranker to learn the behavior of the perceptual metrics by learning to rank approach. - Our proposed method can combine the strengths of different SR methods and generate better results.