SlideShare a Scribd company logo
1 of 23
Download to read offline
ABV-IIITM
Semantic Image Completion
and Enhancement using
Deep Learning
ICCCNT - 2019
Priyansh Saxena
ICCCNT | 2019
Objectives
To develop a well trained Wasserstein
GAN model capable of completing
masked region in the image.
To further enhance the completed image
using enhancement network for better
computer vision applications.
Novelty of
the proposal
In most of the existing completion
techniques, the completed region is noisy
and blurry and not of satisfactory quality.
To overcome this, first the Wasserstein
GAN is trained to complete the image
and then the completed image is passed
through the enhancement network which
will enhance the quality of the completed
image further thus giving better
inpainting solution than the present
techniques.
OVERALL
IMPLEMENTATION
FLOWCHART
DATA
PREPROCESSING
IMAGE MASKING:
To create corrupted images for training
the Wasserstein GAN, a binary mask is
used with values 0 or 1. 0 corresponds to
the corrupted region while 1 corresponds
to the uncorrupted region in the image.
This binary mask is applied to all images
to make them corrupted which will serve
as input of the training process.
The method is evaluated on CelebA-hq dataset.1.
64 x 64 x 3 pixels to train the Wasserstein GAN
model.
2. Each face image in the dataset is resized to
3. The enhancement network is
trained using 1000 image pairs
containing blurry images and its
corresponding clean images.
Implementation
Details
2. Each face image in the dataset is
resized to 64* 64* 3 pixels to train the
Wasserstein GAN model.
THE METHODOLOGY COULD BE
DIVIDED INTO TWO DIFFERENT
STEPS:
In first step, Wasserstein GAN based
model is developed to complete the
missing pixels in the image. The image
completion GAN gives a completed
image with missing area filled and
having contextual similarity with the
input image.
In second step,the output of the
generator is passed through the
enhancement network to further refine
the completed image.
Implementation
Details
WASSERSTEIN GAN:
GAN trains two networks
simultaneously: the generator network
G to learn the distribution of training
data and the critic network C which
distinguishes the generated samples
from the original samples.
The GAN architeture used is
Wasserstein GAN which uses
Wasserstein distance as to train the
generator so that it can capture training
data distribution and generate images
similar to those in the training data.
(Contd.)
Implementation Details
The Wasserstein
distance loss function L
to train the generator
can be mathematically
represented as:
Here, the first term represents
the expectation of the
distribution generated by the
generator and the second
term represents the
expectation of the real
training data distribution.
Wasserstein distance as Wasserstein GAN loss function:
(Contd.)
Implementation
Details
WASSERSTEIN DISTANCE AS
WASSERSTEIN GAN LOSS
FUNCTION:
By minimizing the difference between
the two, the generator learns to
generate samples having probability
distribution similar to training data
distribution.
Now, to make the learning faster we
have added gradient penalty term. So,
the overall loss function L becomes:
(Contd.)
Implementation
Details
IMAGE COMPLETION WITH
WASSERSTEIN GAN:
After training the generator to generate
samples which look real, the next aim is to
ensure that the missing region generated
has similar context to the non-missing
region so that the model gives sensible
looking completed images as output.
(Contd.)
CONTEXTUAL
LOSS:
PERCEPTUAL
LOSS:
TOTAL LOSS:
Implementation Details
Image completion losses with Wasserstein GAN:
This loss function L(z) is minimized to ensure completed image is contextually similar to input image.
+ =
(Contd.)
Implementation
Details
ENHANCEMENT
NETWORK CONSISTS OF
FOLLOWING LAYERS:
Enhancement network using
residual learning:
Conv + ReLU: It creates feature maps,
and ReLU adds the non-linearity.
Conv + BN + ReLU: This layers contains
added batch normalization between
Conv and ReLU.
Conv: It is used to get the output
residual image.
(Contd.)
The input to the network is blurry image y = x + v,
Implementation Details
Training enhancement network :
Here, N represents total training images.
Here x is clear image, v represents the blur added.
The residual network is trained to grasp the mapping R(y)≈v , to get the
clear image x as x = y- R(y). The loss function to learn the trainable
parameters θ in the enhancement network is as follows:
(Contd.)
binary mask, y represents the original image and G(z') represents the an image
from the generator G for some z' that gives a reasonable reconstruction of the
missing portions.
Implementation Details
Image completion with Wasserstein GAN:
We find z' that suitably completes the image by minimizing L(z).
reconstructed represents the completed image through our model, M representsLet x
(Contd.)
Results
The following plot was obtained by
training the enhancement network
on 1000 celebA-hq image pairs of
clean and its corresponding blurr
images.
Figure: Enhancement network training plot
Results 
The following Wasserstein distance
values and plot was obtained while
training Wasserstein GAN on around
15000 Celeba-hq images for 10000
epochs and batch size of 128.
(Contd.)
Results
Contextual loss plot
for image completion
Perceptual loss plot
for image completion
Total loss plot for
image completion
(Contd.)
The above contextual, perceptual and total loss plots were obtained while training the Wasserstein GAN for
image completion for 1250 epochs on 15000 Celeba-hq images.
(Contd.)
Results
Peak Signal-to-Noise
Ratio (PSNR):
(Contd.)
The following two evaluation metrics were used to evaluate the quality of the
output images by the model:
PSNR is measured in decibels (dB).The
higher the PSNR, the better image has
been completed to match the original
image.
Results
Structural Similarity Index (SSIM):
(Contd.)
The following two evaluation metrics were used to evaluate the quality of the
output images by the model:
It is used to measure the similarity between two images.
where, C represents contrast, I represents luminance, S represents structural term,
x represents original, y represents completed images.
The parameters α > 0, β > 0, and γ > 0, are used to adjust the relative importance of
the three components.
Results
Comparison of PSNR values
(Contd.)
The following PSNR and SSIM values through the model:
Comparison of SSIM values
Output
ORIGINAL
Some of the results obtained through the presented image completion technique are as follows:
INPUT OUTPUT ORIGINAL INPUT OUTPUT
ORIGINAL INPUT OUTPUT ORIGINAL INPUT OUTPUT
CONCLUSION
In most of the existing completion techniques,the completed
images are blurry due to the noise which is inevitably mixed by
the generator.of PSNR values.
To overcome this, the WGAN is first trained to generate the
missing patches in the image and then passed the completed
image given by the WGAN through a enhancement network to
remove the blur and provide better inpainting solutions.
However, in this approach, overall training is highly depended on
the data used for training.
References
Philipp Krahenbuhl Deepak Pathak.
Context encoders: feature learning by inpainting.
Proceedings of the IEEE Conference on Computer Vision and
Pattern Recognition, pages 2436–2544, 2016.
Yizhen Chen and Haifeng Hu.
Neural Processing Letters, Springer, pages 1–13, Jun 2018.
Ruijun Liu, Rui Yang, Shanxi Li, Yuqian Shi, and Xin Jin.
Painting completion with generative translation models.
Multimedia Tools and Applications, Springer, pages 1–14, 2018.
An improved method for semantic image inpainting with
gans: Progressive inpainting.
Martin Arjovsky, Soumith Chintala and Leon Bottou
Wasserstein gan
In Courant Institute of Mathematical Sciences Facebook AI
Research, 2017

More Related Content

What's hot

Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standardsComparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standardsRishab2612
 
Basics of Image Processing using MATLAB
Basics of Image Processing using MATLABBasics of Image Processing using MATLAB
Basics of Image Processing using MATLABvkn13
 
Compression using JPEG
Compression using JPEGCompression using JPEG
Compression using JPEGSabih Hasan
 
Multimedia communication jpeg
Multimedia communication jpegMultimedia communication jpeg
Multimedia communication jpegDr. Kapil Gupta
 
COMPUTER GRAPHICS
COMPUTER GRAPHICSCOMPUTER GRAPHICS
COMPUTER GRAPHICSJagan Raja
 
Introductory Digital Image Processing using Matlab, IIT Roorkee
Introductory Digital Image Processing using Matlab, IIT RoorkeeIntroductory Digital Image Processing using Matlab, IIT Roorkee
Introductory Digital Image Processing using Matlab, IIT RoorkeeVinayak Sahai
 
DCT image compression
DCT image compressionDCT image compression
DCT image compressionyoussef ramzy
 
Dct,gibbs phen,oversampled adc,polyphase decomposition
Dct,gibbs phen,oversampled adc,polyphase decompositionDct,gibbs phen,oversampled adc,polyphase decomposition
Dct,gibbs phen,oversampled adc,polyphase decompositionMuhammad Younas
 
3.point operation and histogram based image enhancement
3.point operation and histogram based image enhancement3.point operation and histogram based image enhancement
3.point operation and histogram based image enhancementmukesh bhardwaj
 
Introduction to Image Processing with MATLAB
Introduction to Image Processing with MATLABIntroduction to Image Processing with MATLAB
Introduction to Image Processing with MATLABSriram Emarose
 
Image processing with matlab
Image processing with matlabImage processing with matlab
Image processing with matlabAman Gupta
 
Multimedia image compression standards
Multimedia image compression standardsMultimedia image compression standards
Multimedia image compression standardsMazin Alwaaly
 

What's hot (20)

Computer graphics notes watermark
Computer graphics notes watermarkComputer graphics notes watermark
Computer graphics notes watermark
 
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standardsComparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
 
Basics of Image Processing using MATLAB
Basics of Image Processing using MATLABBasics of Image Processing using MATLAB
Basics of Image Processing using MATLAB
 
JPEG
JPEGJPEG
JPEG
 
Clipping 22
Clipping 22Clipping 22
Clipping 22
 
Compression using JPEG
Compression using JPEGCompression using JPEG
Compression using JPEG
 
Multimedia communication jpeg
Multimedia communication jpegMultimedia communication jpeg
Multimedia communication jpeg
 
COMPUTER GRAPHICS
COMPUTER GRAPHICSCOMPUTER GRAPHICS
COMPUTER GRAPHICS
 
Introductory Digital Image Processing using Matlab, IIT Roorkee
Introductory Digital Image Processing using Matlab, IIT RoorkeeIntroductory Digital Image Processing using Matlab, IIT Roorkee
Introductory Digital Image Processing using Matlab, IIT Roorkee
 
DCT image compression
DCT image compressionDCT image compression
DCT image compression
 
Dct,gibbs phen,oversampled adc,polyphase decomposition
Dct,gibbs phen,oversampled adc,polyphase decompositionDct,gibbs phen,oversampled adc,polyphase decomposition
Dct,gibbs phen,oversampled adc,polyphase decomposition
 
3.point operation and histogram based image enhancement
3.point operation and histogram based image enhancement3.point operation and histogram based image enhancement
3.point operation and histogram based image enhancement
 
Ec section
Ec section Ec section
Ec section
 
point operations in image processing
point operations in image processingpoint operations in image processing
point operations in image processing
 
Introduction to Image Processing with MATLAB
Introduction to Image Processing with MATLABIntroduction to Image Processing with MATLAB
Introduction to Image Processing with MATLAB
 
Jpeg
JpegJpeg
Jpeg
 
Image processing with matlab
Image processing with matlabImage processing with matlab
Image processing with matlab
 
Multimedia image compression standards
Multimedia image compression standardsMultimedia image compression standards
Multimedia image compression standards
 
Point Processing
Point ProcessingPoint Processing
Point Processing
 
Presentation 1
Presentation 1Presentation 1
Presentation 1
 

Similar to Semantic image completion and enhancement using Deep Learning

SinGAN for Image Denoising
SinGAN for Image DenoisingSinGAN for Image Denoising
SinGAN for Image DenoisingKhalilBergaoui
 
Implementing Neural Style Transfer
Implementing Neural Style Transfer Implementing Neural Style Transfer
Implementing Neural Style Transfer Tahsin Mayeesha
 
Low Light Image Enhancement Using Zero-DCE algorithm
Low Light Image Enhancement Using Zero-DCE algorithmLow Light Image Enhancement Using Zero-DCE algorithm
Low Light Image Enhancement Using Zero-DCE algorithmIRJET Journal
 
nips report
nips reportnips report
nips report?? ?
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisNaeem Shehzad
 
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET Journal
 
Eugene Khvedchenya. State of the art Image Augmentations with Albumentations.
Eugene Khvedchenya. State of the art Image Augmentations with Albumentations.Eugene Khvedchenya. State of the art Image Augmentations with Albumentations.
Eugene Khvedchenya. State of the art Image Augmentations with Albumentations.Lviv Startup Club
 
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNNAutomatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNNZihao(Gerald) Zhang
 
CNN and its applications by ketaki
CNN and its applications by ketakiCNN and its applications by ketaki
CNN and its applications by ketakiKetaki Patwari
 
IRJET- 3D Vision System using Calibrated Stereo Camera
IRJET- 3D Vision System using Calibrated Stereo CameraIRJET- 3D Vision System using Calibrated Stereo Camera
IRJET- 3D Vision System using Calibrated Stereo CameraIRJET Journal
 
Traffic sign classification
Traffic sign classificationTraffic sign classification
Traffic sign classificationBill Kromydas
 
imageenhancementtechniques-140316011049-phpapp01 (1).pptx
imageenhancementtechniques-140316011049-phpapp01 (1).pptximageenhancementtechniques-140316011049-phpapp01 (1).pptx
imageenhancementtechniques-140316011049-phpapp01 (1).pptxsalutiontechnology
 
Survey paper on image compression techniques
Survey paper on image compression techniquesSurvey paper on image compression techniques
Survey paper on image compression techniquesIRJET Journal
 
Image De-Noising Using Deep Neural Network
Image De-Noising Using Deep Neural NetworkImage De-Noising Using Deep Neural Network
Image De-Noising Using Deep Neural Networkaciijournal
 
Image enhancement techniques
Image enhancement techniques Image enhancement techniques
Image enhancement techniques Arshad khan
 
Learning from Simulated and Unsupervised Images through Adversarial Training....
Learning from Simulated and Unsupervised Images through Adversarial Training....Learning from Simulated and Unsupervised Images through Adversarial Training....
Learning from Simulated and Unsupervised Images through Adversarial Training....eraser Juan José Calderón
 

Similar to Semantic image completion and enhancement using Deep Learning (20)

Real-Time Face Tracking with GPU Acceleration
Real-Time Face Tracking with GPU AccelerationReal-Time Face Tracking with GPU Acceleration
Real-Time Face Tracking with GPU Acceleration
 
CUDA Accelerated Face Recognition
CUDA Accelerated Face RecognitionCUDA Accelerated Face Recognition
CUDA Accelerated Face Recognition
 
SinGAN for Image Denoising
SinGAN for Image DenoisingSinGAN for Image Denoising
SinGAN for Image Denoising
 
Implementing Neural Style Transfer
Implementing Neural Style Transfer Implementing Neural Style Transfer
Implementing Neural Style Transfer
 
stylegan.pdf
stylegan.pdfstylegan.pdf
stylegan.pdf
 
Low Light Image Enhancement Using Zero-DCE algorithm
Low Light Image Enhancement Using Zero-DCE algorithmLow Light Image Enhancement Using Zero-DCE algorithm
Low Light Image Enhancement Using Zero-DCE algorithm
 
nips report
nips reportnips report
nips report
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
 
image enhancement.pptx
image enhancement.pptximage enhancement.pptx
image enhancement.pptx
 
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
 
Eugene Khvedchenya. State of the art Image Augmentations with Albumentations.
Eugene Khvedchenya. State of the art Image Augmentations with Albumentations.Eugene Khvedchenya. State of the art Image Augmentations with Albumentations.
Eugene Khvedchenya. State of the art Image Augmentations with Albumentations.
 
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNNAutomatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
 
CNN and its applications by ketaki
CNN and its applications by ketakiCNN and its applications by ketaki
CNN and its applications by ketaki
 
IRJET- 3D Vision System using Calibrated Stereo Camera
IRJET- 3D Vision System using Calibrated Stereo CameraIRJET- 3D Vision System using Calibrated Stereo Camera
IRJET- 3D Vision System using Calibrated Stereo Camera
 
Traffic sign classification
Traffic sign classificationTraffic sign classification
Traffic sign classification
 
imageenhancementtechniques-140316011049-phpapp01 (1).pptx
imageenhancementtechniques-140316011049-phpapp01 (1).pptximageenhancementtechniques-140316011049-phpapp01 (1).pptx
imageenhancementtechniques-140316011049-phpapp01 (1).pptx
 
Survey paper on image compression techniques
Survey paper on image compression techniquesSurvey paper on image compression techniques
Survey paper on image compression techniques
 
Image De-Noising Using Deep Neural Network
Image De-Noising Using Deep Neural NetworkImage De-Noising Using Deep Neural Network
Image De-Noising Using Deep Neural Network
 
Image enhancement techniques
Image enhancement techniques Image enhancement techniques
Image enhancement techniques
 
Learning from Simulated and Unsupervised Images through Adversarial Training....
Learning from Simulated and Unsupervised Images through Adversarial Training....Learning from Simulated and Unsupervised Images through Adversarial Training....
Learning from Simulated and Unsupervised Images through Adversarial Training....
 

Recently uploaded

AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesPrabhanshu Chaturvedi
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 

Recently uploaded (20)

AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 

Semantic image completion and enhancement using Deep Learning

  • 1. ABV-IIITM Semantic Image Completion and Enhancement using Deep Learning ICCCNT - 2019 Priyansh Saxena
  • 2. ICCCNT | 2019 Objectives To develop a well trained Wasserstein GAN model capable of completing masked region in the image. To further enhance the completed image using enhancement network for better computer vision applications.
  • 3. Novelty of the proposal In most of the existing completion techniques, the completed region is noisy and blurry and not of satisfactory quality. To overcome this, first the Wasserstein GAN is trained to complete the image and then the completed image is passed through the enhancement network which will enhance the quality of the completed image further thus giving better inpainting solution than the present techniques.
  • 5. DATA PREPROCESSING IMAGE MASKING: To create corrupted images for training the Wasserstein GAN, a binary mask is used with values 0 or 1. 0 corresponds to the corrupted region while 1 corresponds to the uncorrupted region in the image. This binary mask is applied to all images to make them corrupted which will serve as input of the training process. The method is evaluated on CelebA-hq dataset.1. 64 x 64 x 3 pixels to train the Wasserstein GAN model. 2. Each face image in the dataset is resized to 3. The enhancement network is trained using 1000 image pairs containing blurry images and its corresponding clean images.
  • 6. Implementation Details 2. Each face image in the dataset is resized to 64* 64* 3 pixels to train the Wasserstein GAN model. THE METHODOLOGY COULD BE DIVIDED INTO TWO DIFFERENT STEPS: In first step, Wasserstein GAN based model is developed to complete the missing pixels in the image. The image completion GAN gives a completed image with missing area filled and having contextual similarity with the input image. In second step,the output of the generator is passed through the enhancement network to further refine the completed image.
  • 7. Implementation Details WASSERSTEIN GAN: GAN trains two networks simultaneously: the generator network G to learn the distribution of training data and the critic network C which distinguishes the generated samples from the original samples. The GAN architeture used is Wasserstein GAN which uses Wasserstein distance as to train the generator so that it can capture training data distribution and generate images similar to those in the training data. (Contd.)
  • 8. Implementation Details The Wasserstein distance loss function L to train the generator can be mathematically represented as: Here, the first term represents the expectation of the distribution generated by the generator and the second term represents the expectation of the real training data distribution. Wasserstein distance as Wasserstein GAN loss function: (Contd.)
  • 9. Implementation Details WASSERSTEIN DISTANCE AS WASSERSTEIN GAN LOSS FUNCTION: By minimizing the difference between the two, the generator learns to generate samples having probability distribution similar to training data distribution. Now, to make the learning faster we have added gradient penalty term. So, the overall loss function L becomes: (Contd.)
  • 10. Implementation Details IMAGE COMPLETION WITH WASSERSTEIN GAN: After training the generator to generate samples which look real, the next aim is to ensure that the missing region generated has similar context to the non-missing region so that the model gives sensible looking completed images as output. (Contd.)
  • 11. CONTEXTUAL LOSS: PERCEPTUAL LOSS: TOTAL LOSS: Implementation Details Image completion losses with Wasserstein GAN: This loss function L(z) is minimized to ensure completed image is contextually similar to input image. + = (Contd.)
  • 12. Implementation Details ENHANCEMENT NETWORK CONSISTS OF FOLLOWING LAYERS: Enhancement network using residual learning: Conv + ReLU: It creates feature maps, and ReLU adds the non-linearity. Conv + BN + ReLU: This layers contains added batch normalization between Conv and ReLU. Conv: It is used to get the output residual image. (Contd.)
  • 13. The input to the network is blurry image y = x + v, Implementation Details Training enhancement network : Here, N represents total training images. Here x is clear image, v represents the blur added. The residual network is trained to grasp the mapping R(y)≈v , to get the clear image x as x = y- R(y). The loss function to learn the trainable parameters θ in the enhancement network is as follows: (Contd.)
  • 14. binary mask, y represents the original image and G(z') represents the an image from the generator G for some z' that gives a reasonable reconstruction of the missing portions. Implementation Details Image completion with Wasserstein GAN: We find z' that suitably completes the image by minimizing L(z). reconstructed represents the completed image through our model, M representsLet x (Contd.)
  • 15. Results The following plot was obtained by training the enhancement network on 1000 celebA-hq image pairs of clean and its corresponding blurr images. Figure: Enhancement network training plot
  • 16. Results  The following Wasserstein distance values and plot was obtained while training Wasserstein GAN on around 15000 Celeba-hq images for 10000 epochs and batch size of 128. (Contd.)
  • 17. Results Contextual loss plot for image completion Perceptual loss plot for image completion Total loss plot for image completion (Contd.) The above contextual, perceptual and total loss plots were obtained while training the Wasserstein GAN for image completion for 1250 epochs on 15000 Celeba-hq images. (Contd.)
  • 18. Results Peak Signal-to-Noise Ratio (PSNR): (Contd.) The following two evaluation metrics were used to evaluate the quality of the output images by the model: PSNR is measured in decibels (dB).The higher the PSNR, the better image has been completed to match the original image.
  • 19. Results Structural Similarity Index (SSIM): (Contd.) The following two evaluation metrics were used to evaluate the quality of the output images by the model: It is used to measure the similarity between two images. where, C represents contrast, I represents luminance, S represents structural term, x represents original, y represents completed images. The parameters α > 0, β > 0, and γ > 0, are used to adjust the relative importance of the three components.
  • 20. Results Comparison of PSNR values (Contd.) The following PSNR and SSIM values through the model: Comparison of SSIM values
  • 21. Output ORIGINAL Some of the results obtained through the presented image completion technique are as follows: INPUT OUTPUT ORIGINAL INPUT OUTPUT ORIGINAL INPUT OUTPUT ORIGINAL INPUT OUTPUT
  • 22. CONCLUSION In most of the existing completion techniques,the completed images are blurry due to the noise which is inevitably mixed by the generator.of PSNR values. To overcome this, the WGAN is first trained to generate the missing patches in the image and then passed the completed image given by the WGAN through a enhancement network to remove the blur and provide better inpainting solutions. However, in this approach, overall training is highly depended on the data used for training.
  • 23. References Philipp Krahenbuhl Deepak Pathak. Context encoders: feature learning by inpainting. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2436–2544, 2016. Yizhen Chen and Haifeng Hu. Neural Processing Letters, Springer, pages 1–13, Jun 2018. Ruijun Liu, Rui Yang, Shanxi Li, Yuqian Shi, and Xin Jin. Painting completion with generative translation models. Multimedia Tools and Applications, Springer, pages 1–14, 2018. An improved method for semantic image inpainting with gans: Progressive inpainting. Martin Arjovsky, Soumith Chintala and Leon Bottou Wasserstein gan In Courant Institute of Mathematical Sciences Facebook AI Research, 2017