SlideShare a Scribd company logo
1 of 16
Download to read offline
Jaemin Jeong Seminar 2
Proposed Semantic rEgion-Adaptive Normalization
Control the style of each semantic region individually.
Good Performance
Reconstruction quality
Variabliity
Visual Quality
Abstract
SPADE is starting point for out research.
Jaemin Jeong Seminar 3
1. SPADE use only one style code to control the entire style of an
image, which is not sufficient for high quality synthesis or
detailed control.
2. we believe that inserting style information only in the beginning
of a network is not a good architecture choice.
We found two shorcomings ..
Jaemin Jeong Seminar 4
1. SEAN improves the quality of the synthesized images for
conditional GANs.
2. SEAN improves the per-region style encoding, so that
reconstructed images can be made more similar to the input style
images as measured by PSNR and visual inspection.
3. SEAN allows the user to select a different style input image for
each semantic region.
Advantages of SEAN
Jaemin Jeong Seminar 5
1. How to distill the per-region styles from the image according to
the mask ?
2. How to use the distilled per-region style codes to synthesize
photo-realistic images ?
Per Region Style Encoding and Control
Jaemin Jeong Seminar 6
Per-Region Style Encoder
 ST(Style Matrix) : πŸ“πŸ“πŸ“πŸ“πŸ“πŸ“ Γ— 𝒔𝒔
 𝒔𝒔 : number of semantic regions
set columns corresponding to regions that do not
exist in an input image to 0.
-> 항상 λͺ¨λ“  semantic regionsκ°€ λ™μΌν•˜κ²Œ μžˆλŠ”κ²Œ μ•„λ‹ˆλ‹ˆκΉŒ
μ—†λŠ” 건 0으둜 ν•©μ‹œλ‹€.
How to Encode Style?
Jaemin Jeong Seminar 7
 SEAN
 Semantic Region-Adaptive Normalization
𝛾𝛾𝑐𝑐,𝑦𝑦,π‘₯π‘₯ 𝑆𝑆𝑆𝑆, 𝑀𝑀
β„Žπ‘›π‘›,𝑐𝑐,𝑦𝑦,π‘₯π‘₯ βˆ’ πœ‡πœ‡π‘π‘
πœŽπœŽπ‘π‘
+ 𝛽𝛽𝑐𝑐,𝑦𝑦,π‘₯π‘₯(𝑆𝑆𝑆𝑆, 𝑀𝑀)
ST : encoding per-region style code
M : segmentation mask
𝛾𝛾𝑐𝑐,𝑦𝑦,π‘₯π‘₯ 𝑆𝑆𝑆𝑆, 𝑀𝑀 = 𝛼𝛼𝛾𝛾𝛾𝛾𝑐𝑐,𝑦𝑦,π‘₯π‘₯
𝑠𝑠
𝑆𝑆𝑆𝑆 + 1 βˆ’ 𝛼𝛼𝛾𝛾 𝛾𝛾𝑐𝑐,𝑦𝑦,π‘₯π‘₯
π‘œπ‘œ
𝛽𝛽𝑐𝑐,𝑦𝑦,π‘₯π‘₯ 𝑆𝑆𝑆𝑆, 𝑀𝑀 = 𝛼𝛼𝛾𝛾𝛽𝛽𝑐𝑐,𝑦𝑦,π‘₯π‘₯
𝑠𝑠
𝑆𝑆𝑆𝑆 + 1 βˆ’ 𝛼𝛼𝛾𝛾 𝛽𝛽𝑐𝑐,𝑦𝑦,π‘₯π‘₯
π‘œπ‘œ
How to Control Style?
Jaemin Jeong Seminar 8
Network Architecture
1x1
conv
Jaemin Jeong Seminar 9
Network Architecture
Generator
Discriminator
Encoder
Jaemin Jeong Seminar 10
 We observed that the initial transformations are indispensable components of the
architecture because they transform the style codes according to the different roles of each
neural network layer.
 For example, early layers might control the hair styles (e.g. wavy, straight) of human face
images while later layers might refer to the lighting and color. In addition, we observed that
adding noise to the input of SEAN can improve the quality of synthesized images.
 The scale of such noise is adjusted by a per-channel scaling factor B that is learnt during
training, similar to StyleGAN.
Experiments
Jaemin Jeong Seminar 11
Quantitative comparisons.
Quantitative comparison of reconstruction quality.
Quantitative comparison using semantic segmentation performance (Deeplab v3+)
Jaemin Jeong Seminar 12
Qualitative results
Jaemin Jeong Seminar 13
Qualitative results
Jaemin Jeong Seminar 14
Ablation study on CelebAMask-HQ dataset.
Jaemin Jeong Seminar 15
Appendix Experiments
AMT Benchmark
Jaemin Jeong Seminar 16
 Proposed semantic region-adaptive normalization (SEAN).
 Our main idea is to extend SPADE, the currently best network, to control the style of each
semantic region individually.
 In summary, SEAN is better suited to encode, transfer, and synthesize style than the best
previous method in terms of reconstruction quality, variability, and visual quality.
 SEAN also pushes the frontier of interactive image editing.
 In future work, we plan to extend SEAN to processing meshes, point clouds, and textures on
surfaces.
Conclusion

More Related Content

What's hot

2021 03-01-on the relationship between self-attention and convolutional layers
2021 03-01-on the relationship between self-attention and convolutional layers2021 03-01-on the relationship between self-attention and convolutional layers
2021 03-01-on the relationship between self-attention and convolutional layersJAEMINJEONG5
Β 
Swin transformer
Swin transformerSwin transformer
Swin transformerJAEMINJEONG5
Β 
2020 12-04-shake shake
2020 12-04-shake shake2020 12-04-shake shake
2020 12-04-shake shakeJAEMINJEONG5
Β 
2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heartJAEMINJEONG5
Β 
2021 03-02-transformer interpretability
2021 03-02-transformer interpretability2021 03-02-transformer interpretability
2021 03-02-transformer interpretabilityJAEMINJEONG5
Β 
2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptxJAEMINJEONG5
Β 
2021 01-04-learning filter-basis
2021 01-04-learning filter-basis2021 01-04-learning filter-basis
2021 01-04-learning filter-basisJAEMINJEONG5
Β 
PR-351: Adaptive Aggregation Networks for Class-Incremental Learning
PR-351: Adaptive Aggregation Networks for Class-Incremental LearningPR-351: Adaptive Aggregation Networks for Class-Incremental Learning
PR-351: Adaptive Aggregation Networks for Class-Incremental LearningSunghoon Joo
Β 
Deep Learning Fast MRI Using Channel Attention in Magnitude Domain
Deep Learning Fast MRI Using Channel Attention in Magnitude DomainDeep Learning Fast MRI Using Channel Attention in Magnitude Domain
Deep Learning Fast MRI Using Channel Attention in Magnitude DomainJoonhyung Lee
Β 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...Jinwon Lee
Β 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningSungchul Kim
Β 
Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Mehryar (Mike) E., Ph.D.
Β 
Rethinking Attention with Performers
Rethinking Attention with PerformersRethinking Attention with Performers
Rethinking Attention with PerformersJoonhyung Lee
Β 
Multimedia image compression standards
Multimedia image compression standardsMultimedia image compression standards
Multimedia image compression standardsMazin Alwaaly
Β 
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object DetectorPR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object DetectorJinwon Lee
Β 
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis taeseon ryu
Β 
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...JaeJun Yoo
Β 
Squeeze Excitation Networks, The simple idea that won the final ImageNet Chal...
Squeeze Excitation Networks, The simple idea that won the final ImageNet Chal...Squeeze Excitation Networks, The simple idea that won the final ImageNet Chal...
Squeeze Excitation Networks, The simple idea that won the final ImageNet Chal...Joonhyung Lee
Β 
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksPR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksJinwon Lee
Β 
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...Jinwon Lee
Β 

What's hot (20)

2021 03-01-on the relationship between self-attention and convolutional layers
2021 03-01-on the relationship between self-attention and convolutional layers2021 03-01-on the relationship between self-attention and convolutional layers
2021 03-01-on the relationship between self-attention and convolutional layers
Β 
Swin transformer
Swin transformerSwin transformer
Swin transformer
Β 
2020 12-04-shake shake
2020 12-04-shake shake2020 12-04-shake shake
2020 12-04-shake shake
Β 
2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart
Β 
2021 03-02-transformer interpretability
2021 03-02-transformer interpretability2021 03-02-transformer interpretability
2021 03-02-transformer interpretability
Β 
2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx
Β 
2021 01-04-learning filter-basis
2021 01-04-learning filter-basis2021 01-04-learning filter-basis
2021 01-04-learning filter-basis
Β 
PR-351: Adaptive Aggregation Networks for Class-Incremental Learning
PR-351: Adaptive Aggregation Networks for Class-Incremental LearningPR-351: Adaptive Aggregation Networks for Class-Incremental Learning
PR-351: Adaptive Aggregation Networks for Class-Incremental Learning
Β 
Deep Learning Fast MRI Using Channel Attention in Magnitude Domain
Deep Learning Fast MRI Using Channel Attention in Magnitude DomainDeep Learning Fast MRI Using Channel Attention in Magnitude Domain
Deep Learning Fast MRI Using Channel Attention in Magnitude Domain
Β 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
Β 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation Learning
Β 
Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...
Β 
Rethinking Attention with Performers
Rethinking Attention with PerformersRethinking Attention with Performers
Rethinking Attention with Performers
Β 
Multimedia image compression standards
Multimedia image compression standardsMultimedia image compression standards
Multimedia image compression standards
Β 
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object DetectorPR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
PR-270: PP-YOLO: An Effective and Efficient Implementation of Object Detector
Β 
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Β 
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...
Β 
Squeeze Excitation Networks, The simple idea that won the final ImageNet Chal...
Squeeze Excitation Networks, The simple idea that won the final ImageNet Chal...Squeeze Excitation Networks, The simple idea that won the final ImageNet Chal...
Squeeze Excitation Networks, The simple idea that won the final ImageNet Chal...
Β 
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning TasksPR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
PR095: Modularity Matters: Learning Invariant Relational Reasoning Tasks
Β 
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
Β 

Similar to Jaemin Jeong Seminar - Semantic Region-Adaptive Normalization for High-Quality Image Synthesis

IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...IRJET Journal
Β 
IRJET- American Sign Language Classification
IRJET- American Sign Language ClassificationIRJET- American Sign Language Classification
IRJET- American Sign Language ClassificationIRJET Journal
Β 
Final year embedded projects in bangalore
Final year embedded projects in bangaloreFinal year embedded projects in bangalore
Final year embedded projects in bangaloreAshok Kumar.k
Β 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) ijceronline
Β 
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...CSCJournals
Β 
A binarization technique for extraction of devanagari text from camera based ...
A binarization technique for extraction of devanagari text from camera based ...A binarization technique for extraction of devanagari text from camera based ...
A binarization technique for extraction of devanagari text from camera based ...sipij
Β 
Performance Evaluation of Illumination Invariant Face Recognition Algorthims
Performance Evaluation of Illumination Invariant Face Recognition AlgorthimsPerformance Evaluation of Illumination Invariant Face Recognition Algorthims
Performance Evaluation of Illumination Invariant Face Recognition AlgorthimsIRJET Journal
Β 
COLOUR BASED IMAGE SEGMENTATION USING HYBRID KMEANS WITH WATERSHED SEGMENTATION
COLOUR BASED IMAGE SEGMENTATION USING HYBRID KMEANS WITH WATERSHED SEGMENTATIONCOLOUR BASED IMAGE SEGMENTATION USING HYBRID KMEANS WITH WATERSHED SEGMENTATION
COLOUR BASED IMAGE SEGMENTATION USING HYBRID KMEANS WITH WATERSHED SEGMENTATIONIAEME Publication
Β 
Realtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN ModelsRealtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN Modelsnithinsai2992
Β 
IRJET- Deep Convolutional Neural Network for Natural Image Matting using Init...
IRJET- Deep Convolutional Neural Network for Natural Image Matting using Init...IRJET- Deep Convolutional Neural Network for Natural Image Matting using Init...
IRJET- Deep Convolutional Neural Network for Natural Image Matting using Init...IRJET Journal
Β 
Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...SamanthaGallone
Β 
Iaetsd multi-view and multi band face recognition
Iaetsd multi-view and multi band face recognitionIaetsd multi-view and multi band face recognition
Iaetsd multi-view and multi band face recognitionIaetsd Iaetsd
Β 
ROI Based Image Compression in Baseline JPEG
ROI Based Image Compression in Baseline JPEGROI Based Image Compression in Baseline JPEG
ROI Based Image Compression in Baseline JPEGIJERA Editor
Β 
Revealing the trace of high quality jpeg
Revealing the trace of high quality jpegRevealing the trace of high quality jpeg
Revealing the trace of high quality jpegPvrtechnologies Nellore
Β 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
Β 
A Review of Image Contrast Enhancement Techniques
A Review of Image Contrast Enhancement TechniquesA Review of Image Contrast Enhancement Techniques
A Review of Image Contrast Enhancement TechniquesIRJET Journal
Β 
Robust face name graph matching for movie character identification - Final PPT
 Robust face name graph matching for movie character identification - Final PPT Robust face name graph matching for movie character identification - Final PPT
Robust face name graph matching for movie character identification - Final PPTPriyadarshini Dasarathan
Β 

Similar to Jaemin Jeong Seminar - Semantic Region-Adaptive Normalization for High-Quality Image Synthesis (20)

IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
IRJET- Comparative Analysis of MSE and PSNR Q-Factor with Cameraman, Lena and...
Β 
IRJET- American Sign Language Classification
IRJET- American Sign Language ClassificationIRJET- American Sign Language Classification
IRJET- American Sign Language Classification
Β 
RankSRGAN
RankSRGANRankSRGAN
RankSRGAN
Β 
Final year embedded projects in bangalore
Final year embedded projects in bangaloreFinal year embedded projects in bangalore
Final year embedded projects in bangalore
Β 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
Β 
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Faster Training Algorithms in Neural Network Based Approach For Handwritten T...
Β 
A binarization technique for extraction of devanagari text from camera based ...
A binarization technique for extraction of devanagari text from camera based ...A binarization technique for extraction of devanagari text from camera based ...
A binarization technique for extraction of devanagari text from camera based ...
Β 
Performance Evaluation of Illumination Invariant Face Recognition Algorthims
Performance Evaluation of Illumination Invariant Face Recognition AlgorthimsPerformance Evaluation of Illumination Invariant Face Recognition Algorthims
Performance Evaluation of Illumination Invariant Face Recognition Algorthims
Β 
Seminarpaper
SeminarpaperSeminarpaper
Seminarpaper
Β 
COLOUR BASED IMAGE SEGMENTATION USING HYBRID KMEANS WITH WATERSHED SEGMENTATION
COLOUR BASED IMAGE SEGMENTATION USING HYBRID KMEANS WITH WATERSHED SEGMENTATIONCOLOUR BASED IMAGE SEGMENTATION USING HYBRID KMEANS WITH WATERSHED SEGMENTATION
COLOUR BASED IMAGE SEGMENTATION USING HYBRID KMEANS WITH WATERSHED SEGMENTATION
Β 
Realtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN ModelsRealtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN Models
Β 
S0450598102
S0450598102S0450598102
S0450598102
Β 
IRJET- Deep Convolutional Neural Network for Natural Image Matting using Init...
IRJET- Deep Convolutional Neural Network for Natural Image Matting using Init...IRJET- Deep Convolutional Neural Network for Natural Image Matting using Init...
IRJET- Deep Convolutional Neural Network for Natural Image Matting using Init...
Β 
Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...Evaluation of conditional images synthesis: generating a photorealistic image...
Evaluation of conditional images synthesis: generating a photorealistic image...
Β 
Iaetsd multi-view and multi band face recognition
Iaetsd multi-view and multi band face recognitionIaetsd multi-view and multi band face recognition
Iaetsd multi-view and multi band face recognition
Β 
ROI Based Image Compression in Baseline JPEG
ROI Based Image Compression in Baseline JPEGROI Based Image Compression in Baseline JPEG
ROI Based Image Compression in Baseline JPEG
Β 
Revealing the trace of high quality jpeg
Revealing the trace of high quality jpegRevealing the trace of high quality jpeg
Revealing the trace of high quality jpeg
Β 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
Β 
A Review of Image Contrast Enhancement Techniques
A Review of Image Contrast Enhancement TechniquesA Review of Image Contrast Enhancement Techniques
A Review of Image Contrast Enhancement Techniques
Β 
Robust face name graph matching for movie character identification - Final PPT
 Robust face name graph matching for movie character identification - Final PPT Robust face name graph matching for movie character identification - Final PPT
Robust face name graph matching for movie character identification - Final PPT
Β 

More from JAEMINJEONG5

Jaemin_230701_Simple_Copy_paste.pptx
Jaemin_230701_Simple_Copy_paste.pptxJaemin_230701_Simple_Copy_paste.pptx
Jaemin_230701_Simple_Copy_paste.pptxJAEMINJEONG5
Β 
2021 04-04-google nmt
2021 04-04-google nmt2021 04-04-google nmt
2021 04-04-google nmtJAEMINJEONG5
Β 
2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrasesJAEMINJEONG5
Β 
2021 01-02-linformer
2021 01-02-linformer2021 01-02-linformer
2021 01-02-linformerJAEMINJEONG5
Β 
2020 11 1_sleep_net
2020 11 1_sleep_net2020 11 1_sleep_net
2020 11 1_sleep_netJAEMINJEONG5
Β 
2020 11 3_face_detection
2020 11 3_face_detection2020 11 3_face_detection
2020 11 3_face_detectionJAEMINJEONG5
Β 
white blood cell classification
white blood cell classificationwhite blood cell classification
white blood cell classificationJAEMINJEONG5
Β 

More from JAEMINJEONG5 (7)

Jaemin_230701_Simple_Copy_paste.pptx
Jaemin_230701_Simple_Copy_paste.pptxJaemin_230701_Simple_Copy_paste.pptx
Jaemin_230701_Simple_Copy_paste.pptx
Β 
2021 04-04-google nmt
2021 04-04-google nmt2021 04-04-google nmt
2021 04-04-google nmt
Β 
2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases
Β 
2021 01-02-linformer
2021 01-02-linformer2021 01-02-linformer
2021 01-02-linformer
Β 
2020 11 1_sleep_net
2020 11 1_sleep_net2020 11 1_sleep_net
2020 11 1_sleep_net
Β 
2020 11 3_face_detection
2020 11 3_face_detection2020 11 3_face_detection
2020 11 3_face_detection
Β 
white blood cell classification
white blood cell classificationwhite blood cell classification
white blood cell classification
Β 

Recently uploaded

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
Β 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
Β 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
Β 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
Β 
Model Call Girl in Narela Delhi reach out to us at πŸ”8264348440πŸ”
Model Call Girl in Narela Delhi reach out to us at πŸ”8264348440πŸ”Model Call Girl in Narela Delhi reach out to us at πŸ”8264348440πŸ”
Model Call Girl in Narela Delhi reach out to us at πŸ”8264348440πŸ”soniya singh
Β 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
Β 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
Β 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
Β 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
Β 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
Β 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
Β 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
Β 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
Β 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
Β 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
Β 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
Β 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
Β 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
Β 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
Β 

Recently uploaded (20)

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Β 
β˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
β˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCRβ˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
β˜… CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
Β 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
Β 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
Β 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
Β 
Model Call Girl in Narela Delhi reach out to us at πŸ”8264348440πŸ”
Model Call Girl in Narela Delhi reach out to us at πŸ”8264348440πŸ”Model Call Girl in Narela Delhi reach out to us at πŸ”8264348440πŸ”
Model Call Girl in Narela Delhi reach out to us at πŸ”8264348440πŸ”
Β 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
Β 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
Β 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
Β 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Β 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Β 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
Β 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Β 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
Β 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
Β 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Β 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
Β 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Β 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Β 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
Β 

Jaemin Jeong Seminar - Semantic Region-Adaptive Normalization for High-Quality Image Synthesis

  • 1.
  • 2. Jaemin Jeong Seminar 2 Proposed Semantic rEgion-Adaptive Normalization Control the style of each semantic region individually. Good Performance Reconstruction quality Variabliity Visual Quality Abstract SPADE is starting point for out research.
  • 3. Jaemin Jeong Seminar 3 1. SPADE use only one style code to control the entire style of an image, which is not sufficient for high quality synthesis or detailed control. 2. we believe that inserting style information only in the beginning of a network is not a good architecture choice. We found two shorcomings ..
  • 4. Jaemin Jeong Seminar 4 1. SEAN improves the quality of the synthesized images for conditional GANs. 2. SEAN improves the per-region style encoding, so that reconstructed images can be made more similar to the input style images as measured by PSNR and visual inspection. 3. SEAN allows the user to select a different style input image for each semantic region. Advantages of SEAN
  • 5. Jaemin Jeong Seminar 5 1. How to distill the per-region styles from the image according to the mask ? 2. How to use the distilled per-region style codes to synthesize photo-realistic images ? Per Region Style Encoding and Control
  • 6. Jaemin Jeong Seminar 6 Per-Region Style Encoder  ST(Style Matrix) : πŸ“πŸ“πŸ“πŸ“πŸ“πŸ“ Γ— 𝒔𝒔  𝒔𝒔 : number of semantic regions set columns corresponding to regions that do not exist in an input image to 0. -> 항상 λͺ¨λ“  semantic regionsκ°€ λ™μΌν•˜κ²Œ μžˆλŠ”κ²Œ μ•„λ‹ˆλ‹ˆκΉŒ μ—†λŠ” 건 0으둜 ν•©μ‹œλ‹€. How to Encode Style?
  • 7. Jaemin Jeong Seminar 7  SEAN  Semantic Region-Adaptive Normalization 𝛾𝛾𝑐𝑐,𝑦𝑦,π‘₯π‘₯ 𝑆𝑆𝑆𝑆, 𝑀𝑀 β„Žπ‘›π‘›,𝑐𝑐,𝑦𝑦,π‘₯π‘₯ βˆ’ πœ‡πœ‡π‘π‘ πœŽπœŽπ‘π‘ + 𝛽𝛽𝑐𝑐,𝑦𝑦,π‘₯π‘₯(𝑆𝑆𝑆𝑆, 𝑀𝑀) ST : encoding per-region style code M : segmentation mask 𝛾𝛾𝑐𝑐,𝑦𝑦,π‘₯π‘₯ 𝑆𝑆𝑆𝑆, 𝑀𝑀 = 𝛼𝛼𝛾𝛾𝛾𝛾𝑐𝑐,𝑦𝑦,π‘₯π‘₯ 𝑠𝑠 𝑆𝑆𝑆𝑆 + 1 βˆ’ 𝛼𝛼𝛾𝛾 𝛾𝛾𝑐𝑐,𝑦𝑦,π‘₯π‘₯ π‘œπ‘œ 𝛽𝛽𝑐𝑐,𝑦𝑦,π‘₯π‘₯ 𝑆𝑆𝑆𝑆, 𝑀𝑀 = 𝛼𝛼𝛾𝛾𝛽𝛽𝑐𝑐,𝑦𝑦,π‘₯π‘₯ 𝑠𝑠 𝑆𝑆𝑆𝑆 + 1 βˆ’ 𝛼𝛼𝛾𝛾 𝛽𝛽𝑐𝑐,𝑦𝑦,π‘₯π‘₯ π‘œπ‘œ How to Control Style?
  • 8. Jaemin Jeong Seminar 8 Network Architecture 1x1 conv
  • 9. Jaemin Jeong Seminar 9 Network Architecture Generator Discriminator Encoder
  • 10. Jaemin Jeong Seminar 10  We observed that the initial transformations are indispensable components of the architecture because they transform the style codes according to the different roles of each neural network layer.  For example, early layers might control the hair styles (e.g. wavy, straight) of human face images while later layers might refer to the lighting and color. In addition, we observed that adding noise to the input of SEAN can improve the quality of synthesized images.  The scale of such noise is adjusted by a per-channel scaling factor B that is learnt during training, similar to StyleGAN. Experiments
  • 11. Jaemin Jeong Seminar 11 Quantitative comparisons. Quantitative comparison of reconstruction quality. Quantitative comparison using semantic segmentation performance (Deeplab v3+)
  • 12. Jaemin Jeong Seminar 12 Qualitative results
  • 13. Jaemin Jeong Seminar 13 Qualitative results
  • 14. Jaemin Jeong Seminar 14 Ablation study on CelebAMask-HQ dataset.
  • 15. Jaemin Jeong Seminar 15 Appendix Experiments AMT Benchmark
  • 16. Jaemin Jeong Seminar 16  Proposed semantic region-adaptive normalization (SEAN).  Our main idea is to extend SPADE, the currently best network, to control the style of each semantic region individually.  In summary, SEAN is better suited to encode, transfer, and synthesize style than the best previous method in terms of reconstruction quality, variability, and visual quality.  SEAN also pushes the frontier of interactive image editing.  In future work, we plan to extend SEAN to processing meshes, point clouds, and textures on surfaces. Conclusion