This document discusses single image super-resolution (SISR) techniques. SISR aims to estimate a high-resolution image from a single low-resolution input. Early methods like bicubic interpolation were simple but yielded oversmoothed results. SRCNN introduced a three-layer convolutional network to learn a nonlinear mapping from low to high resolution. DRCN and SRResNet used deeper networks with skip connections. SRGAN introduced an adversarial loss to generate perceptually realistic textures: it trains a generator network against a discriminator to produce outputs indistinguishable from real images. Benchmark datasets and mean opinion scores are used to evaluate the state-of-the-art methods.
2. What is Single Image Super-Resolution
SISR
Estimating a high-resolution image from its low-resolution counterpart.
What distinguishes SISR is that it is more flexible and has a wider range of
applications than multiple-image super-resolution, since it needs only one
picture as input, though this also makes it more challenging.
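As a minimal sketch of this setting, assuming a simple average-pooling degradation model (the `downsample` helper is illustrative, not part of any method discussed here):

```python
import numpy as np

def downsample(x, s):
    """Average-pool an image by an integer factor s -- one common LR model."""
    h, w = x.shape
    return x[:h - h % s, :w - w % s].reshape(h // s, s, w // s, s).mean(axis=(1, 3))

hr = np.random.rand(32, 32)   # the unknown high-resolution image
lr = downsample(hr, 2)        # the ONLY input a SISR method gets to see
print(hr.shape, lr.shape)     # (32, 32) (16, 16)
```

A SISR method must invert this mapping from `lr` alone, which is ill-posed: many different HR images produce the same LR observation.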
3. Applications
Improves standard video resolution.
Improves surveillance video resolution.
Medical imaging.
Improves satellite image resolution.
4. Bicubic
A fixed interpolation filter (no learning involved).
Very fast.
Oversimplifies the picture.
Yields solutions with overly smooth textures.
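For illustration, SciPy's cubic spline `zoom` (with `order=3`) can serve as a stand-in for true bicubic convolution:

```python
import numpy as np
from scipy.ndimage import zoom

lr = np.random.rand(16, 16)
sr = zoom(lr, 2, order=3)   # cubic spline upscaling, close in spirit to bicubic
print(sr.shape)             # (32, 32)
```

Being a fixed filter, it cannot hallucinate high-frequency detail, hence the overly smooth textures noted above.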
5. SRCNN
Three layers.
Fully feed-forward, thus pretty fast.
The input is first upscaled to the desired size using bicubic interpolation.
The first convolutional layer of the SRCNN extracts a set of feature maps. The
second layer maps these feature maps nonlinearly to high-resolution patch
representations. The last layer combines the predictions within a spatial
neighborhood to produce the final high-resolution image.
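The three-layer pipeline can be sketched in plain NumPy/SciPy. The filter sizes follow the 9-1-5 configuration commonly used for SRCNN; the random weights are placeholders (in the real method they are learned from data):

```python
import numpy as np
from scipy.signal import correlate2d

rng = np.random.default_rng(0)

def conv(x, w):
    """x: (C, H, W), w: (F, C, k, k) -> (F, H, W), zero-padded 'same' output."""
    return np.stack([
        sum(correlate2d(x[c], w[f, c], mode="same") for c in range(x.shape[0]))
        for f in range(w.shape[0])
    ])

relu = lambda t: np.maximum(t, 0)

x = rng.standard_normal((1, 17, 17))  # bicubic-upscaled input (random stand-in)
y = relu(conv(x, rng.standard_normal((64, 1, 9, 9)) * 0.01))   # 1: feature extraction
y = relu(conv(y, rng.standard_normal((32, 64, 1, 1)) * 0.01))  # 2: nonlinear mapping
y = conv(y, rng.standard_normal((1, 32, 5, 5)) * 0.01)         # 3: reconstruction
print(y.shape)  # (1, 17, 17)
```

Note that the output has the same spatial size as the input: all upscaling happens in the bicubic pre-processing step, and the network only restores detail.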
7. DRCN
The network takes an interpolated input image (to the desired size).
Consists of three sub-networks: embedding, inference and reconstruction
networks.
Embedding net is used to represent the given image as feature maps ready for
inference.
The inference net solves the task itself, and the reconstruction net maps the
resulting feature maps back to an image.
8. DRCN Cont.
For the intermediate outputs, a loss l1(θ) averages the reconstruction error
over all recursions.
For the final output, a loss l2(θ) measures the error of the weighted
combination of the intermediate predictions.
9. DRCN Cont.
The final loss function L(θ) combines both terms. Training is regularized by
weight decay (an L2 penalty weighted by β).
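Written out following the formulation in the DRCN paper (the symbols here are reconstructed, not quoted from the slides: y is the ground truth, ŷ_d the d-th recursion's prediction, D the number of recursions, N the number of samples):

```latex
l_1(\theta) = \frac{1}{2DN}\sum_{d=1}^{D}\sum_{i=1}^{N}
  \left\| y^{(i)} - \hat{y}_d^{(i)} \right\|^2
  \qquad \text{(intermediate outputs)}

l_2(\theta) = \frac{1}{2N}\sum_{i=1}^{N}
  \Big\| y^{(i)} - \sum_{d=1}^{D} w_d\,\hat{y}_d^{(i)} \Big\|^2
  \qquad \text{(weighted final output)}

L(\theta) = \alpha\, l_1(\theta) + (1-\alpha)\, l_2(\theta) + \beta \|\theta\|^2
```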
10. SRResNet
A ResNet with 16 residual blocks.
Optimized for MSE.
Has skip-connections.
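A shape-level sketch of such residual blocks with identity skip-connections (channel mixing stands in for the real 3x3 convolutions, and batch-norm is omitted for brevity):

```python
import numpy as np

rng = np.random.default_rng(0)
prelu = lambda t, a=0.2: np.where(t > 0, t, a * t)
mix = lambda x, w: np.tensordot(w, x, axes=1)    # stand-in for a 3x3 conv

def residual_block(x, w1, w2):
    """conv -> activation -> conv, then add the identity skip-connection."""
    return x + mix(prelu(mix(x, w1)), w2)

x = rng.standard_normal((8, 16, 16))
weights = [(rng.standard_normal((8, 8)) * 0.1,
            rng.standard_normal((8, 8)) * 0.1) for _ in range(16)]
for w1, w2 in weights:                           # 16 blocks, as in SRResNet
    x = residual_block(x, w1, w2)
print(x.shape)  # (8, 16, 16)
```

The skip-connections let gradients flow directly to early layers, which is what makes a network this deep trainable under a plain MSE objective.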
11. SRGAN
Generator.
Deep network with B residual blocks.
Two trained sub-pixel convolutional layers with small 3x3 kernels.
Batch-normalization layers.
The activation function is Parametric ReLU (PReLU).
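The core rearrangement inside a sub-pixel convolutional layer (often called pixel shuffle) can be written directly in NumPy; the preceding conv produces C·r² channels, which are then folded into an r-times-larger image:

```python
import numpy as np

def pixel_shuffle(x, r):
    """Sub-pixel rearrangement: (C*r*r, H, W) -> (C, H*r, W*r)."""
    c2, h, w = x.shape
    c = c2 // (r * r)
    return (x.reshape(c, r, r, h, w)
             .transpose(0, 3, 1, 4, 2)   # interleave the r*r sub-grids spatially
             .reshape(c, h * r, w * r))

feat = np.random.rand(4, 8, 8)    # 1 output channel * 2^2 from a preceding conv
up = pixel_shuffle(feat, 2)
print(up.shape)  # (1, 16, 16)
```

Unlike bicubic pre-upscaling, this lets the generator work at low resolution throughout and upscale only at the end, which is much cheaper.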
16. SRGAN Cont.
Introduces Mean Opinion Score (MOS).
26 raters were asked to assign an integer score from 1 (bad quality) to 5
(excellent quality) to the super-resolved images.
Each rater rated 1128 images from the given datasets.
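With hypothetical ratings matching this protocol (real MOS values come from human raters, not random numbers), the score is a simple average over raters:

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical ratings: 26 raters x 1128 images, integer scores in 1..5.
scores = rng.integers(1, 6, size=(26, 1128))
mos = scores.mean(axis=0)    # average over raters -> one MOS per image
print(mos.shape)             # (1128,)
```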
17. Datasets
The models were tested on three of the best-known benchmark datasets for SISR:
Set5, Set14 and BSD100.