RankSRGAN

RankSRGAN:
Generative Adversarial Networks with Ranker for Image
Super-Resolution
Wenlong Zhang, Yihao Liu, Chao Dong, Yu Qiao
ICCV 2019
Wonbeom Jang

Introduction
Single image super resolution aims at reconstructing/generating a high-resolution (HR)
image from a low resolution (LR) observation.
Single Image Super Resolution

PSNR
Introduction
SSIM
NIQE
Evaluation Index

- MSE Loss
- Achieve high PSNR value
- Tend to produce overly smoothed/sharpened images.
- GAN Loss
- Generate realistic texture and details.
Introduction

Most of these NR-IQA metrics are not differentiable, making them infeasible to serve as
loss functions.
To overcome this obstacle, we propose a general and differentiable model – Ranker, which
can mimic any NR-IQA metric and provide a clear goal (as loss function) for optimizing
perceptual quality.
Introduction

Overview
Stage 1: Generate pair-wise rank images.
1. Employ different SR methods
2. Apply a chosen perceptual matrix
3. Pick up two image of same contents
4. Rank the pair-wise image according to the quality score calculated by the
perceptual metric
SRGAN / NIQE / 3.82 ESRGAN / NIQE / 3.28

Overview
Stage 2: Train Ranker
The Ranker adopts a Siamese architecture to learn the behavior of perceptual metrics
—> use margin-ranking loss

Overview
Stage 3: Introduce rank-content loss
Use it to deﬁne a rank-content loss for a standard SRGAN

Ranker
The Ranker uses a Siamese-like architecture [5, 9, 38], which is effective for pair-wise inputs.
Siamese Architecture

Ranker
- Two input images y1, y2, the ranking scores s1 and s2
- ΘR represents the network weights and R(.) indicates the mapping function of Ranker.
- my1 and my2 represent the quality scores of image
Rank Dataset

Ranker
To train Ranker, we employ margin ranking loss where the s1 and s2 represent the ranking
scores of pairwise images.
Optimization
Label: x1 > x2
Prediction: x1 > x2, loss == 0
Prediction: x1 < x2, loss > 0

Ranker
To train Ranker, we employ margin ranking loss where the s1 and s2 represent the ranking
scores of pairwise images.
Optimization

RankSRGAN
- RankSRGAN consists of a standard SRGAN and the proposed Ranker
- Compared with existing SRGAN, our framework simply adds a well-trained Ranker to
constrain the generator in SR space.
Details
Loss

RankSRGAN
The perceptual loss is proposed to measure the perceptual similarity between two images
Perceptual loss
where φ(yi) and φ(yi) represent the feature maps of HR and SR images, respectively. Here φ is
obtained by the 5-th convolution (before maxpooling) layer within VGG19 network

RankSRGAN
- Adversarial training is recently used to produce natural-looking images
- Generator loss LG is deﬁned based on the output of discriminator where xi is the LR image
Adversarial loss
Rank-content loss
- The generated image is put into the Ranker to predict the ranking score.
- R(G(xi)) is the ranking score of the generated image. A lower ranking score indicates
better perceptual quality.

Comparison with the-state-of-the-arts

Analysis of Ranker
1. Select the state-of-the-art perceptual SR methods – SRGAN and ESRGAN to build the
rank dataset
2. Use the perceptual metric NIQE for evaluation
Elaborately selecting the SR algorithms and the perceptual metric

Analysis of Ranker
Evaluation
Spearman Rank Order Correlation Coefﬁcient (SROCC)
The larger value of SROCC represents the better accuracy of Ranker
For validation dataset, the ranker achieves a SROCC of 0.88
SROCC
PSNR -0.8197
SSIM -0.8522
Ranker 0.88 https://bskyvision.com/392

Ablation study
In the main experiments, we use (SRResNet, SRGAN, ESRGAN) to generate the rank dataset.
- Then what if we select other SR algorithms?
- Will we always obtain better results than SRGAN?
We train our Ranker with (SRResNet, SRGAN, HR) and obtain the new SR model – RankSRGAN-HR.
Effect of different rank datasets

Ablation study
Effect of different perceptual metrics

Ablation study
Effect of Ranker: Rank VS. Regression

Ablation study
Effect of different losses.
Expect to add MSE loss to achieve improvement in PSNR

User study
Effect of different losses.
- Two different SR images are shown at the same time where one is generated by
the proposed RankSRGAN and the other is generated by SRGAN or ESRGAN.
- The participants are required to pick the image that is more visually pleasant

Conclusion
- The key idea is introducing a Ranker to learn the behavior of the perceptual
metrics by learning to rank approach.
- Our proposed method can combine the strengths of different SR methods and
generate better results.

RankSRGAN

Recommended

Recommended

More Related Content

Similar to RankSRGAN

Similar to RankSRGAN (20)

Recently uploaded

Recently uploaded (20)

RankSRGAN