SlideShare a Scribd company logo
1 of 22
Download to read offline
Semantic Segmentation
"Conditional Random Fields Meet Deep Neural Network for Semantic Segmentation"
: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8254255
๏‚ง Proposed the spatially-adaptive normalization
-> Effective layer for synthesizing photorealistic images given an input semantic layout.
๏‚ง Previous methods Problem : Normalization layers tend to "wash away" semantic information.
๏‚ง Finally, our model allows user control over both semantic and style.
Abstract
๐‘š๐‘š โˆˆ ๐•ƒ๐•ƒ๐ป๐ปร—๐‘Š๐‘Š
: Semantic Segmentation Mask
where ๐•ƒ๐•ƒ is a set of integers denoting the semantic labels, ๐ป๐ป and ๐‘Š๐‘Š are the image height and
width.
Each entry in ๐’Ž๐’Ž denotes the semantic label of a pixel.
We aim to learn a mapping function that can convert an input segmentation mask ๐’Ž๐’Ž to a
photorealistic image.
Semantic Image Synthesis
๏‚ง ์ž„์˜์˜ ์‚ผ๊ฐํ˜•์„ ์ž„์˜์˜ ์‚ผ๊ฐํ˜•์œผ๋กœ ๋งคํ•‘์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ๋ณ€ํ™˜.
๏‚ง ์ œ์•ฝ ์กฐ๊ฑด : ๊ฐ™์€ ํ‰๋ฉด์— ์žˆ์–ด์•ผํ•œ๋‹ค.
๏‚ง ํ‰ํ–‰ ์ด๋™, ํšŒ์ „, ํฌ๊ธฐ ๋ณ€ํ˜•, ๋น„ํ‹€๊ธฐ
Affine transformation
https://darkpgmr.tistory.com/79
(Unconditional) Batch Normalization
Conditional Batch Normalization
Adaptive Instance Normalization(AdalN)
style
content
Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
: https://arxiv.org/abs/1703.06868
SPatially-Adaptive DEnormalization(SPADE)
https://nvlabs.github.io/SPADE/
activaton
mean
std
SPADE generator
noise
downsampling
With the SPADE, there is no need to feed the segmentation map to the first layer of the generator, since
the learned modulation parameters have encoded enough information about the label layout.
SPADE generator
SPADE
output a mean vector ยต
output a variance vector ฯƒ
๏‚ง A short answer is that it can better preserve semantic information against common
normalization layers.
๏‚ง Specifically, while normalization layers such as the InstanceNorm are essential pieces in
almost all the state-of-the-art conditional image synthesis models, they tend to wash away
semantic information when applied to uniform or flat segmentation masks.
Why does the SPADE work better?
์ƒ์„ฑ์ž์— ๋ชจ๋“  ๋™์ผํ•œ ๋ผ๋ฒจ์„ ๊ฐ€์ง€๋Š” Mask๋ฅผ ๋„ฃ๋Š” ๊ฒฝ์šฐ
InstanceNorm์ด ํ†ต๊ณผ๋˜๋ฉด ๋ชจ๋“  ๊ฐ’์ด 0์ด ๋œ๋‹ค.
-> pix2pix generator๋Š” semantic mask๋ฅผ ์ž…๋ ฅ์œผ๋กœ ๋„ฃ
์–ด์„œ InstanceNorm์„ ํ†ต๊ณผํ•˜๊ฒŒ ๋œ๋‹ค.
-> SPADE generator๋Š” Semantic mask์˜ ์ •๊ทœํ™” ๋ถ€๋ถ„
์ด ์—†๋‹ค. ์ด์ „ layer์˜ ํ™œ์„ฑํ™”๋งŒ ์ •๊ทœํ™” ๋œ๋‹ค.
Why does the SPADE work better?
๏‚ง By using a random vector as the input of the
generator, our architecture provides a simple
way for multi-modal synthesis.
๏‚ง Namely, one can attach an encoder that
processes a real image into a random vector,
which will be then fed to the generator.
๏‚ง The encoder and generator form a VAE.
๏‚ง For training, we add a KL-Divergence loss
term.
Multi-modal synthesis
๋‘ ํ™•๋ฅ ๋ถ„ํฌ์˜ ๋‹ค๋ฅธ ์ •๋„๋ฅผ ์ธก์ •
Multi-modal synthesis
๏‚ง Apply the Spectral Norm to all the layers in the Generator and Discriminator.
๏‚ง Learning rate : Generator - 0.0001, Discriminator - 0.0004
๏‚ง ADAM : ๐›ฝ๐›ฝ1 = 0, ๐›ฝ๐›ฝ2 = 0.999
๏‚ง synchronized BatchNorm - these statistics are collected from all the GPUs.
Experiments - Implementation details
๏‚ง If the output images are realistic, a well-trained semantic segmentation model should be
able to predict the ground truth label.
๏‚ง Cascaded Refinement Network(CRN)
๏‚ง Semiparametric Image Synthesis method(SIMS)
๏‚ง ๊ณตํ‰ํ•œ ํ‰๊ฐ€๋ฅผ ์œ„ํ•ด
๏‚ง CRN, pix2pixHD models -> ์ €์ž๊ฐ€ ๊ตฌํ˜„ํ•œ ๋ชจ๋ธ์„ ์‚ฌ์šฉ
๏‚ง SIMS -> ๋…ผ๋ฌธ์—์„œ ๊ฒฐ๊ณผ๋ฅผ ๊ฐ€์ ธ์˜ด
Experiments - Performance metrics
Experiments - Qualitative results
Experiments - Human evaluation
We use the Amazon Mechanical Turk (AMT) to compare the perceived visual fidelity of our method against
existing approaches.
Specifically, we give the AMT workers an input segmentation mask and two synthesis outputs from
different methods and ask them to choose the output image that looks more like a corresponding image of
the segmentation mask.
Randomly generate 500 questions for each dataset
Experiments - Effectiveness of the SPADE
๏‚ง Proposed the spatially-adaptive normalization, which utilizes the input semantic layout while
performing the affine transformation in the normalization layers.
๏‚ง The proposed normalization leads to the first semantic image synthesis model that can
produce photorealistic outputs for diverse scenes including indoor, outdoor, landscape, and
street scenes.
๏‚ง We further demonstrate its application for multi-modal synthesis and guided image synthesis.
Conclusion

More Related Content

What's hot

2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptxJAEMINJEONG5
ย 
2020 12-2-detr
2020 12-2-detr2020 12-2-detr
2020 12-2-detrJAEMINJEONG5
ย 
2020 12-1-adam w
2020 12-1-adam w2020 12-1-adam w
2020 12-1-adam wJAEMINJEONG5
ย 
2020 12-04-shake shake
2020 12-04-shake shake2020 12-04-shake shake
2020 12-04-shake shakeJAEMINJEONG5
ย 
2021 03-02-transformer interpretability
2021 03-02-transformer interpretability2021 03-02-transformer interpretability
2021 03-02-transformer interpretabilityJAEMINJEONG5
ย 
MobileNet - PR044
MobileNet - PR044MobileNet - PR044
MobileNet - PR044Jinwon Lee
ย 
2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heartJAEMINJEONG5
ย 
2021 01-04-learning filter-basis
2021 01-04-learning filter-basis2021 01-04-learning filter-basis
2021 01-04-learning filter-basisJAEMINJEONG5
ย 
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksPR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksJinwon Lee
ย 
Deep Learning Fast MRI Using Channel Attention in Magnitude Domain
Deep Learning Fast MRI Using Channel Attention in Magnitude DomainDeep Learning Fast MRI Using Channel Attention in Magnitude Domain
Deep Learning Fast MRI Using Channel Attention in Magnitude DomainJoonhyung Lee
ย 
PR-108: MobileNetV2: Inverted Residuals and Linear Bottlenecks
PR-108: MobileNetV2: Inverted Residuals and Linear BottlenecksPR-108: MobileNetV2: Inverted Residuals and Linear Bottlenecks
PR-108: MobileNetV2: Inverted Residuals and Linear BottlenecksJinwon Lee
ย 
MobileNet V3
MobileNet V3MobileNet V3
MobileNet V3Wonbeom Jang
ย 
Introduction to Grad-CAM (short version)
Introduction to Grad-CAM (short version)Introduction to Grad-CAM (short version)
Introduction to Grad-CAM (short version)Hsing-chuan Hsieh
ย 
Efficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingEfficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingHsing-chuan Hsieh
ย 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...Jinwon Lee
ย 
Rethinking Attention with Performers
Rethinking Attention with PerformersRethinking Attention with Performers
Rethinking Attention with PerformersJoonhyung Lee
ย 
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object DetectorPR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object DetectorJinwon Lee
ย 
Machine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkMachine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkRichard Kuo
ย 
Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Mehryar (Mike) E., Ph.D.
ย 

What's hot (20)

2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx
ย 
2020 12-2-detr
2020 12-2-detr2020 12-2-detr
2020 12-2-detr
ย 
2020 12-1-adam w
2020 12-1-adam w2020 12-1-adam w
2020 12-1-adam w
ย 
2020 12-04-shake shake
2020 12-04-shake shake2020 12-04-shake shake
2020 12-04-shake shake
ย 
2021 03-02-transformer interpretability
2021 03-02-transformer interpretability2021 03-02-transformer interpretability
2021 03-02-transformer interpretability
ย 
MobileNet - PR044
MobileNet - PR044MobileNet - PR044
MobileNet - PR044
ย 
2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart
ย 
2021 01-04-learning filter-basis
2021 01-04-learning filter-basis2021 01-04-learning filter-basis
2021 01-04-learning filter-basis
ย 
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksPR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
ย 
Deep Learning Fast MRI Using Channel Attention in Magnitude Domain
Deep Learning Fast MRI Using Channel Attention in Magnitude DomainDeep Learning Fast MRI Using Channel Attention in Magnitude Domain
Deep Learning Fast MRI Using Channel Attention in Magnitude Domain
ย 
PR-108: MobileNetV2: Inverted Residuals and Linear Bottlenecks
PR-108: MobileNetV2: Inverted Residuals and Linear BottlenecksPR-108: MobileNetV2: Inverted Residuals and Linear Bottlenecks
PR-108: MobileNetV2: Inverted Residuals and Linear Bottlenecks
ย 
MobileNet V3
MobileNet V3MobileNet V3
MobileNet V3
ย 
Introduction to Grad-CAM (short version)
Introduction to Grad-CAM (short version)Introduction to Grad-CAM (short version)
Introduction to Grad-CAM (short version)
ย 
Mobilenetv1 v2 slide
Mobilenetv1 v2 slideMobilenetv1 v2 slide
Mobilenetv1 v2 slide
ย 
Efficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingEfficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketching
ย 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
ย 
Rethinking Attention with Performers
Rethinking Attention with PerformersRethinking Attention with Performers
Rethinking Attention with Performers
ย 
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object DetectorPR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
ย 
Machine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkMachine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural Network
ย 
Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...
ย 

Similar to 2021 03-02-spade

Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisNaeem Shehzad
ย 
A Novel Approach to Image Denoising and Image in Painting
A Novel Approach to Image Denoising and Image in PaintingA Novel Approach to Image Denoising and Image in Painting
A Novel Approach to Image Denoising and Image in PaintingEswar Publications
ย 
Kq3518291832
Kq3518291832Kq3518291832
Kq3518291832IJERA Editor
ย 
Reading group nfm - 20170312
Reading group  nfm - 20170312Reading group  nfm - 20170312
Reading group nfm - 20170312Shuai Zhang
ย 
Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Hsing-chuan Hsieh
ย 
Structure Unstructured Data
Structure Unstructured DataStructure Unstructured Data
Structure Unstructured DataCarmine Paolino
ย 
IRJET - Hand Gesture Recognition to Perform System Operations
IRJET -  	  Hand Gesture Recognition to Perform System OperationsIRJET -  	  Hand Gesture Recognition to Perform System Operations
IRJET - Hand Gesture Recognition to Perform System OperationsIRJET Journal
ย 
Image Masking.pdf
Image Masking.pdfImage Masking.pdf
Image Masking.pdffarin11
ย 
A Smart Camera Processing Pipeline for Image Applications Utilizing Marching ...
A Smart Camera Processing Pipeline for Image Applications Utilizing Marching ...A Smart Camera Processing Pipeline for Image Applications Utilizing Marching ...
A Smart Camera Processing Pipeline for Image Applications Utilizing Marching ...sipij
ย 
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNNAutomatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNNZihao(Gerald) Zhang
ย 
Video Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTVideo Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTIRJET Journal
ย 
Image attendance system
Image attendance systemImage attendance system
Image attendance systemMayank Garg
ย 
Realtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN ModelsRealtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN Modelsnithinsai2992
ย 
IRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET- Automatic Data Collection from Forms using Optical Character RecognitionIRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET- Automatic Data Collection from Forms using Optical Character RecognitionIRJET Journal
ย 
Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...
Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...
Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...SBGC
ย 
All projects
All projectsAll projects
All projectsKarishma Jain
ย 
Final year automobile projects in bangalore
Final year automobile projects in bangaloreFinal year automobile projects in bangalore
Final year automobile projects in bangaloreThirumal Krishnan
ย 
Final year automobile projects in bangalore
Final year automobile projects in bangaloreFinal year automobile projects in bangalore
Final year automobile projects in bangaloreThirumal Krishnan
ย 
DIGEST PODCAST
DIGEST PODCASTDIGEST PODCAST
DIGEST PODCASTIRJET Journal
ย 

Similar to 2021 03-02-spade (20)

Math of Explaining SAM
Math of Explaining SAMMath of Explaining SAM
Math of Explaining SAM
ย 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
ย 
A Novel Approach to Image Denoising and Image in Painting
A Novel Approach to Image Denoising and Image in PaintingA Novel Approach to Image Denoising and Image in Painting
A Novel Approach to Image Denoising and Image in Painting
ย 
Kq3518291832
Kq3518291832Kq3518291832
Kq3518291832
ย 
Reading group nfm - 20170312
Reading group  nfm - 20170312Reading group  nfm - 20170312
Reading group nfm - 20170312
ย 
Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)
ย 
Structure Unstructured Data
Structure Unstructured DataStructure Unstructured Data
Structure Unstructured Data
ย 
IRJET - Hand Gesture Recognition to Perform System Operations
IRJET -  	  Hand Gesture Recognition to Perform System OperationsIRJET -  	  Hand Gesture Recognition to Perform System Operations
IRJET - Hand Gesture Recognition to Perform System Operations
ย 
Image Masking.pdf
Image Masking.pdfImage Masking.pdf
Image Masking.pdf
ย 
A Smart Camera Processing Pipeline for Image Applications Utilizing Marching ...
A Smart Camera Processing Pipeline for Image Applications Utilizing Marching ...A Smart Camera Processing Pipeline for Image Applications Utilizing Marching ...
A Smart Camera Processing Pipeline for Image Applications Utilizing Marching ...
ย 
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNNAutomatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
Automatic Detection of Window Regions in Indoor Point Clouds Using R-CNN
ย 
Video Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFTVideo Stitching using Improved RANSAC and SIFT
Video Stitching using Improved RANSAC and SIFT
ย 
Image attendance system
Image attendance systemImage attendance system
Image attendance system
ย 
Realtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN ModelsRealtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN Models
ย 
IRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET- Automatic Data Collection from Forms using Optical Character RecognitionIRJET- Automatic Data Collection from Forms using Optical Character Recognition
IRJET- Automatic Data Collection from Forms using Optical Character Recognition
ย 
Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...
Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...
Java image processing ieee projects 2012 @ Seabirds ( Chennai, Bangalore, Hyd...
ย 
All projects
All projectsAll projects
All projects
ย 
Final year automobile projects in bangalore
Final year automobile projects in bangaloreFinal year automobile projects in bangalore
Final year automobile projects in bangalore
ย 
Final year automobile projects in bangalore
Final year automobile projects in bangaloreFinal year automobile projects in bangalore
Final year automobile projects in bangalore
ย 
DIGEST PODCAST
DIGEST PODCASTDIGEST PODCAST
DIGEST PODCAST
ย 

More from JAEMINJEONG5

Jaemin_230701_Simple_Copy_paste.pptx
Jaemin_230701_Simple_Copy_paste.pptxJaemin_230701_Simple_Copy_paste.pptx
Jaemin_230701_Simple_Copy_paste.pptxJAEMINJEONG5
ย 
2021 04-04-google nmt
2021 04-04-google nmt2021 04-04-google nmt
2021 04-04-google nmtJAEMINJEONG5
ย 
2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrasesJAEMINJEONG5
ย 
2021 01-02-linformer
2021 01-02-linformer2021 01-02-linformer
2021 01-02-linformerJAEMINJEONG5
ย 
2020 11 1_sleep_net
2020 11 1_sleep_net2020 11 1_sleep_net
2020 11 1_sleep_netJAEMINJEONG5
ย 
2020 11 3_face_detection
2020 11 3_face_detection2020 11 3_face_detection
2020 11 3_face_detectionJAEMINJEONG5
ย 
white blood cell classification
white blood cell classificationwhite blood cell classification
white blood cell classificationJAEMINJEONG5
ย 

More from JAEMINJEONG5 (7)

Jaemin_230701_Simple_Copy_paste.pptx
Jaemin_230701_Simple_Copy_paste.pptxJaemin_230701_Simple_Copy_paste.pptx
Jaemin_230701_Simple_Copy_paste.pptx
ย 
2021 04-04-google nmt
2021 04-04-google nmt2021 04-04-google nmt
2021 04-04-google nmt
ย 
2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases
ย 
2021 01-02-linformer
2021 01-02-linformer2021 01-02-linformer
2021 01-02-linformer
ย 
2020 11 1_sleep_net
2020 11 1_sleep_net2020 11 1_sleep_net
2020 11 1_sleep_net
ย 
2020 11 3_face_detection
2020 11 3_face_detection2020 11 3_face_detection
2020 11 3_face_detection
ย 
white blood cell classification
white blood cell classificationwhite blood cell classification
white blood cell classification
ย 

Recently uploaded

Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
ย 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
ย 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxnull - The Open Security Community
ย 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
ย 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
ย 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
ย 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
ย 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
ย 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
ย 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
ย 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
ย 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
ย 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
ย 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
ย 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
ย 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
ย 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
ย 
Model Call Girl in Narela Delhi reach out to us at ๐Ÿ”8264348440๐Ÿ”
Model Call Girl in Narela Delhi reach out to us at ๐Ÿ”8264348440๐Ÿ”Model Call Girl in Narela Delhi reach out to us at ๐Ÿ”8264348440๐Ÿ”
Model Call Girl in Narela Delhi reach out to us at ๐Ÿ”8264348440๐Ÿ”soniya singh
ย 

Recently uploaded (20)

Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
ย 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
ย 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
ย 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
ย 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
ย 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
ย 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
ย 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
ย 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
ย 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
ย 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
ย 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
ย 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
ย 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
ย 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
ย 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
ย 
โ˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
โ˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCRโ˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
โ˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
ย 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
ย 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
ย 
Model Call Girl in Narela Delhi reach out to us at ๐Ÿ”8264348440๐Ÿ”
Model Call Girl in Narela Delhi reach out to us at ๐Ÿ”8264348440๐Ÿ”Model Call Girl in Narela Delhi reach out to us at ๐Ÿ”8264348440๐Ÿ”
Model Call Girl in Narela Delhi reach out to us at ๐Ÿ”8264348440๐Ÿ”
ย 

2021 03-02-spade

  • 1.
  • 2. Semantic Segmentation "Conditional Random Fields Meet Deep Neural Network for Semantic Segmentation" : https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8254255
  • 3. ๏‚ง Proposed the spatially-adaptive normalization -> Effective layer for synthesizing photorealistic images given an input semantic layout. ๏‚ง Previous methods Problem : Normalization layers tend to "wash away" semantic information. ๏‚ง Finally, our model allows user control over both semantic and style. Abstract
  • 4. ๐‘š๐‘š โˆˆ ๐•ƒ๐•ƒ๐ป๐ปร—๐‘Š๐‘Š : Semantic Segmentation Mask where ๐•ƒ๐•ƒ is a set of integers denoting the semantic labels, ๐ป๐ป and ๐‘Š๐‘Š are the image height and width. Each entry in ๐’Ž๐’Ž denotes the semantic label of a pixel. We aim to learn a mapping function that can convert an input segmentation mask ๐’Ž๐’Ž to a photorealistic image. Semantic Image Synthesis
  • 5. ๏‚ง ์ž„์˜์˜ ์‚ผ๊ฐํ˜•์„ ์ž„์˜์˜ ์‚ผ๊ฐํ˜•์œผ๋กœ ๋งคํ•‘์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ๋ณ€ํ™˜. ๏‚ง ์ œ์•ฝ ์กฐ๊ฑด : ๊ฐ™์€ ํ‰๋ฉด์— ์žˆ์–ด์•ผํ•œ๋‹ค. ๏‚ง ํ‰ํ–‰ ์ด๋™, ํšŒ์ „, ํฌ๊ธฐ ๋ณ€ํ˜•, ๋น„ํ‹€๊ธฐ Affine transformation https://darkpgmr.tistory.com/79
  • 8. Adaptive Instance Normalization(AdalN) style content Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization : https://arxiv.org/abs/1703.06868
  • 10. SPADE generator noise downsampling With the SPADE, there is no need to feed the segmentation map to the first layer of the generator, since the learned modulation parameters have encoded enough information about the label layout.
  • 12. SPADE output a mean vector ยต output a variance vector ฯƒ
  • 13. ๏‚ง A short answer is that it can better preserve semantic information against common normalization layers. ๏‚ง Specifically, while normalization layers such as the InstanceNorm are essential pieces in almost all the state-of-the-art conditional image synthesis models, they tend to wash away semantic information when applied to uniform or flat segmentation masks. Why does the SPADE work better? ์ƒ์„ฑ์ž์— ๋ชจ๋“  ๋™์ผํ•œ ๋ผ๋ฒจ์„ ๊ฐ€์ง€๋Š” Mask๋ฅผ ๋„ฃ๋Š” ๊ฒฝ์šฐ InstanceNorm์ด ํ†ต๊ณผ๋˜๋ฉด ๋ชจ๋“  ๊ฐ’์ด 0์ด ๋œ๋‹ค. -> pix2pix generator๋Š” semantic mask๋ฅผ ์ž…๋ ฅ์œผ๋กœ ๋„ฃ ์–ด์„œ InstanceNorm์„ ํ†ต๊ณผํ•˜๊ฒŒ ๋œ๋‹ค. -> SPADE generator๋Š” Semantic mask์˜ ์ •๊ทœํ™” ๋ถ€๋ถ„ ์ด ์—†๋‹ค. ์ด์ „ layer์˜ ํ™œ์„ฑํ™”๋งŒ ์ •๊ทœํ™” ๋œ๋‹ค.
  • 14. Why does the SPADE work better?
  • 15. ๏‚ง By using a random vector as the input of the generator, our architecture provides a simple way for multi-modal synthesis. ๏‚ง Namely, one can attach an encoder that processes a real image into a random vector, which will be then fed to the generator. ๏‚ง The encoder and generator form a VAE. ๏‚ง For training, we add a KL-Divergence loss term. Multi-modal synthesis ๋‘ ํ™•๋ฅ ๋ถ„ํฌ์˜ ๋‹ค๋ฅธ ์ •๋„๋ฅผ ์ธก์ •
  • 17. ๏‚ง Apply the Spectral Norm to all the layers in the Generator and Discriminator. ๏‚ง Learning rate : Generator - 0.0001, Discriminator - 0.0004 ๏‚ง ADAM : ๐›ฝ๐›ฝ1 = 0, ๐›ฝ๐›ฝ2 = 0.999 ๏‚ง synchronized BatchNorm - these statistics are collected from all the GPUs. Experiments - Implementation details
  • 18. ๏‚ง If the output images are realistic, a well-trained semantic segmentation model should be able to predict the ground truth label. ๏‚ง Cascaded Refinement Network(CRN) ๏‚ง Semiparametric Image Synthesis method(SIMS) ๏‚ง ๊ณตํ‰ํ•œ ํ‰๊ฐ€๋ฅผ ์œ„ํ•ด ๏‚ง CRN, pix2pixHD models -> ์ €์ž๊ฐ€ ๊ตฌํ˜„ํ•œ ๋ชจ๋ธ์„ ์‚ฌ์šฉ ๏‚ง SIMS -> ๋…ผ๋ฌธ์—์„œ ๊ฒฐ๊ณผ๋ฅผ ๊ฐ€์ ธ์˜ด Experiments - Performance metrics
  • 20. Experiments - Human evaluation We use the Amazon Mechanical Turk (AMT) to compare the perceived visual fidelity of our method against existing approaches. Specifically, we give the AMT workers an input segmentation mask and two synthesis outputs from different methods and ask them to choose the output image that looks more like a corresponding image of the segmentation mask. Randomly generate 500 questions for each dataset
  • 22. ๏‚ง Proposed the spatially-adaptive normalization, which utilizes the input semantic layout while performing the affine transformation in the normalization layers. ๏‚ง The proposed normalization leads to the first semantic image synthesis model that can produce photorealistic outputs for diverse scenes including indoor, outdoor, landscape, and street scenes. ๏‚ง We further demonstrate its application for multi-modal synthesis and guided image synthesis. Conclusion