Sou Hasegawa
20191103 LT
Self-Introduction




https://medium.com/@crosssceneofwindff
My Deep Learning Work for Creativity
Premises
1. 

- ( )

- (CNN) 

2. 

What is Line Art Colorization ?
[Figure: example conditioning tags: Brown_hair, Pink_dress]
Line Art Colorization Method
1. Hint
- PaintsChainer
- Two-stage Sketch Colorization (style2paints v3)[1]
- User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks[2]
2. Tag
- Tag2Pix[3]
3. Reference
- Style Transfer for Anime Sketches with Enhanced Residual U-net and Auxiliary Classifier GAN (style2paints v1)[4]
Agenda
1. Basic Architecture

2. Hint

3. Tag

4. Reference
pix2pix: Architecture
pix2pix[5] uses a UNet as the generator.
The discriminator provides the adversarial loss.
pix2pix: Objective Function
1. Content loss
2. Adversarial loss
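The two loss terms above can be sketched in plain Python. This is a minimal sketch, not the speaker's implementation: it assumes the content loss is a mean L1 distance and the adversarial loss is the standard log-likelihood GAN loss, as in pix2pix[5]. The function names and the default `content_weight=100.0` are illustrative assumptions.

```python
import math


def l1_loss(pred, target):
    """Content loss: mean absolute error between two flat pixel lists."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def generator_adv_loss(d_fake):
    """Adversarial loss for the generator: -mean(log D(G(x))).

    d_fake holds discriminator probabilities in (0, 1] for generated images.
    """
    return -sum(math.log(p) for p in d_fake) / len(d_fake)


def discriminator_loss(d_real, d_fake):
    """Adversarial loss for the discriminator:
    -(mean(log D(y)) + mean(log(1 - D(G(x)))))."""
    real_term = sum(math.log(p) for p in d_real) / len(d_real)
    fake_term = sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return -(real_term + fake_term)


def generator_objective(pred, target, d_fake, content_weight=100.0):
    """Total generator objective: adversarial loss + weighted content loss.

    content_weight is an assumed hyperparameter, not a value from the slides.
    """
    return generator_adv_loss(d_fake) + content_weight * l1_loss(pred, target)
```

The content term keeps the output close to the ground-truth colors, while the adversarial term pushes it toward images the discriminator accepts as real.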
The Problems of Line Art Colorization
Kawaii Illustrations
style2paints V3: Method
UNet
- Draft stage: content loss, adversarial loss, and positive regulation loss
- Refinement stage (refines the draft): content loss and adversarial loss
x_{c,i}: the i-th element of channel c
m: width × height
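The positive regulation loss of the draft stage can be illustrated with a short sketch. It assumes the loss is the negative per-channel variance of the generated image, summed over channels, which is one reading of the definition in [1]; the function name and the list-of-channels image layout are hypothetical choices made for this example.

```python
def positive_regulation_loss(image):
    """Negative sum of per-channel pixel variances.

    image: list of channels, each a flat list of m = width * height values
    (x_{c,i} is image[c][i]). Minimizing this value rewards outputs whose
    colors vary more within each channel.
    """
    total_variance = 0.0
    for channel in image:
        m = len(channel)
        mean = sum(channel) / m
        total_variance += sum((x - mean) ** 2 for x in channel) / m
    return -total_variance
```

Under this assumption a uniformly gray draft scores 0, while a draft with strongly varying colors scores below 0, so the term discourages washed-out results.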
style2paints V3: Architecture
style2paints V3: Refinement
Refinement Draft 



3
1. ( etc)
a. 

2.
a. 

Fig. 7 

3.
a. Spatial Transformer
Tag2pix: Method
Tag
- Color Invariant Tag (CIT): 370 tags
- Color Variant Tag (CVT): 115 tags
- Semantic
- Content loss, Adversarial loss Content loss 

Real/Fake
-
- Content loss, Adversarial loss Classification loss 

CIT CVT
Tag2pix: Generator Architecture
UNet 

SECat-ResNeXt CVT
Tag2pix: SECat Architecture
Based on the Squeeze-and-Excitation Network[6]: the GAP output is concatenated with the encoded CVT

AdaIN SECat
FC: Fully-Connected Layer
GAP: Global Average Pooling
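A SECat-style block can be sketched as follows. This is a hedged illustration rather than the Tag2Pix code: it assumes the block squeezes the feature map with GAP, concatenates the result with an encoded CVT vector, passes it through two FC layers (ReLU, then sigmoid), and rescales each channel by the resulting gate. The function names, weight layout, and layer sizes are hypothetical.

```python
import math


def global_average_pool(feature_map):
    """GAP: one mean per channel. feature_map is a list of flat channels."""
    return [sum(channel) / len(channel) for channel in feature_map]


def fully_connected(vec, weights, bias):
    """FC layer: weights is a list of rows, one row per output unit."""
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, bias)]


def secat(feature_map, cvt_code, w1, b1, w2, b2):
    """Squeeze-and-excitation with the CVT code concatenated to the GAP output.

    w1 maps (channels + len(cvt_code)) inputs to the hidden units,
    w2 maps the hidden units to one gate per channel.
    """
    squeezed = global_average_pool(feature_map)
    hidden = [max(0.0, h)
              for h in fully_connected(squeezed + cvt_code, w1, b1)]
    gates = [1.0 / (1.0 + math.exp(-g))
             for g in fully_connected(hidden, w2, b2)]
    # Excitation: scale every activation of a channel by that channel's gate.
    return [[gate * a for a in channel]
            for gate, channel in zip(gates, feature_map)]
```

Because the gates depend on the CVT code, the same line-art features can be re-weighted differently for different color tags.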
Tag2pix: Discriminator Architecture
CVT
CIT
style2paints V1: Method
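Features of the reference image are extracted with Vgg19
The Vgg19 output (4096 dimensions) is fed into the UNet
Adversarial loss
- Discriminator target for generated images: 0
- Discriminator target for real images: the Vgg19 output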



- UNet Guided Decoder
-
- Fig.7 Guide Decoder
style2paints V1: Architecture
[Figure: Reference Image, Input Image, Output Image, Guide Decoder1 Output, Guide Decoder2 Output]
My Approach: Methods
style2paints V1 4096 

( ….)
→Adaptive Instance Normalization(AdaIN)
- 

- x, y
- FUNIT[7]
σ: standard deviation, µ: mean
My Approach: Architecture
[Figure: Line Art, Reference, Content Encoder, Style Encoder, AdaIN ResBlocks, Decoder, Colored Art, Discriminator]
Content Encoder Style Encoder
AdaIN ResBlocks
x: Output of content encoder
y: Output of style encoder
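With x and y defined as above, the AdaIN operation can be sketched in plain Python. This sketch follows the standard AdaIN definition, AdaIN(x, y) = σ(y) · (x − µ(x)) / σ(x) + µ(y), computed per channel. How the talk's model derives the style statistics in practice is not recoverable from the slides, so using the raw statistics of y is an assumption, as are the function names and the `eps` value.

```python
import math


def channel_stats(channel, eps=1e-5):
    """Mean and standard deviation of one flat channel (eps avoids /0)."""
    mean = sum(channel) / len(channel)
    variance = sum((v - mean) ** 2 for v in channel) / len(channel)
    return mean, math.sqrt(variance + eps)


def adain(x, y, eps=1e-5):
    """Adaptive Instance Normalization, applied channel by channel.

    x: content features, y: style features; both are lists of flat channels
    with the same number of channels.
    """
    out = []
    for x_channel, y_channel in zip(x, y):
        mu_x, sigma_x = channel_stats(x_channel, eps)
        mu_y, sigma_y = channel_stats(y_channel, eps)
        # Normalize the content channel, then apply the style statistics.
        out.append([sigma_y * (v - mu_x) / sigma_x + mu_y for v in x_channel])
    return out
```

The output keeps the spatial structure of x (the line art) while taking the per-channel mean and spread from y (the reference).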
Summary
1. pix2pix
a. UNet + Discriminator
b. 

2. Hint: style2paints V3
a.
b. 

3. Tag: Tag2pix
a. 

4. Reference: style2paints V1
a. Vgg19 4096 Discriminator 4096
b. AdaIN
End
Appendix
Line Extraction Method




(Tag2pix )
1. sketchKeras[8]

- UNet
2. Sketch Simplification[9]

- Fully-Convolutional Network sketchKeras
3. XDoG[10]

- Gaussian
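The Gaussian-based idea behind the third method can be illustrated with a plain difference-of-Gaussians line extractor. This is a simplification and not the full XDoG operator from [10]: it only blurs a grayscale image at two scales and thresholds the difference. The function names and the defaults `sigma=1.0`, `k=1.6`, and `threshold=-0.01` are assumptions chosen for the example.

```python
import math


def gaussian_kernel(sigma):
    """Normalized 1D Gaussian kernel with radius of about 3 * sigma."""
    radius = max(1, int(3 * sigma))
    weights = [math.exp(-(i * i) / (2 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def blur_rows(image, kernel):
    """Convolve every row with the kernel, clamping indices at the borders."""
    radius = len(kernel) // 2
    blurred = []
    for row in image:
        n = len(row)
        blurred.append([
            sum(k * row[min(max(j + d - radius, 0), n - 1)]
                for d, k in enumerate(kernel))
            for j in range(n)
        ])
    return blurred


def transpose(image):
    return [list(column) for column in zip(*image)]


def gaussian_blur(image, sigma):
    """Separable 2D Gaussian blur: rows first, then columns."""
    kernel = gaussian_kernel(sigma)
    return transpose(blur_rows(transpose(blur_rows(image, kernel)), kernel))


def dog_lines(image, sigma=1.0, k=1.6, threshold=-0.01):
    """Return a line image: 0.0 (line) where the fine blur is darker than
    the coarse blur by more than the threshold, 1.0 (paper) elsewhere."""
    fine = gaussian_blur(image, sigma)
    coarse = gaussian_blur(image, k * sigma)
    return [[0.0 if f - c < threshold else 1.0
             for f, c in zip(fine_row, coarse_row)]
            for fine_row, coarse_row in zip(fine, coarse)]
```

On an image with a dark-to-bright step, the pixels on the dark side of the step come out as line pixels, while flat regions stay white.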
References
[1] Lvmin Zhang, et al., “Two-stage Sketch Colorization”. SIGGRAPH Asia 2018
[2] Yuanzheng Ci, et al., “User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks”. ACM Multimedia Conference 2018
[3] Hyunsu Kim, et al., “Tag2Pix: Line Art Colorization Using Text Tag With SECat and Changing Loss”. ICCV 2019
[4] Lvmin Zhang, et al., “Style Transfer for Anime Sketches with Enhanced Residual U-net and Auxiliary Classifier GAN”. ACPR 2017
[5] Phillip Isola, et al., “Image-to-Image Translation with Conditional Adversarial Nets”. CVPR 2017
[6] Jie Hu, et al., “Squeeze-and-Excitation Networks”. CVPR 2018
[7] Ming-Yu Liu, et al., “Few-Shot Unsupervised Image-to-Image Translation”. ICCV 2019
[8] lllyasviel, “sketchKeras”. https://github.com/lllyasviel/sketchKeras
[9] Edgar Simo-Serra, et al., “Learning to Simplify: Fully Convolutional Networks for Rough Sketch Cleanup”. SIGGRAPH 2016
[10] Holger Winnemoeller, et al., “XDoG: An eXtended difference-of-Gaussians compendium including advanced image stylization”. Computers & Graphics 36(6):740-753, 2012

Colorization Methods That Stay Close to the Data (データに寄りそう着色の作法)