SlideShare a Scribd company logo
Semantic Image Inpainting
Semantic Image Inpainting with Deep Generative Models,
R Yeh et al. CVPR 2017
LAB SEMINAR
1
2018.02.13
SNU DATAMINING CENTER
MINKI CHUNG
TABLE OF CONTENTS
▸ Motivation
▸ What is image inpainting
▸ Problem statement
▸ Baseline
▸ Semantic image inpainting with Deep Generative
Models
▸ My work
▸ Discussion
2
MOTIVATION
3
MOTIVATION 4
▸ What is image inpainting?
https://www.youtube.com/watch?v=1F-6iRrgh1s
MOTIVATION 5
▸ Objective: Make attentive inpainter
IF BACKGROUND OF TARGET REMOVING OBJECT IS SIMPLE, EXISTING METHOD WORKS FINE
HOWEVER, IF BACKGROUND OF TARGET REMOVING OBJECT IS COMPLEX, BETTER NEED ANOTHER METHOD
BASELINE
▸ Semantic Image Inpainting with Deep Generative
Models, R Yeh et al. CVPR 2017
6
SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 7
▸ DCGAN-Based
▸ Not end-to-end:
▸ 1. Train generator first (uncorrupted data)
▸ 2. Find z_hat for inpainting
CONTEXTUAL LOSSPRIOR LOSS
https://arxiv.org/abs/1607.07539
SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 8
▸ Hypothesis: Trained G is efficient- image not from pdata (e.g., corrupted data) should
not lie on the learned encoding manifold, z
▸ Objective: Find encoding z_hat: “closest” to the corrupted image while being
constrained to the manifold,
▸ y: corrupted image
M: binary mask(size equal to the image)
https://arxiv.org/abs/1607.07539
PRIOR LOSSCONTEXTUAL LOSS
SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 9
▸ Contextual Loss: Not simply l1 norm between G(z) and uncorrupted portion of
input image y, do consider corrupted area
▸ Weighting term W,
▸ So,
Wi: importance weight at pixel location i
N(i): set of neighbors of pixel i in a local window
BIGGER WEIGHT
y: corrupted image
M: binary mask(size equal to the image)
https://arxiv.org/abs/1607.07539
SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 10
▸ Prior Loss: how realistic the generated image is
▸ Identical to the GAN loss for training the discriminator D,
▸
▸ Without Lp, the mapping from y to z may converge to a perceptually implausible
result
https://arxiv.org/abs/1607.07539
SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 11
▸ Tackling points:
▸ Object-level occlusion: Narrowing down for object removal
▸ Contextual loss: A pixel that is very far away from any holes plays very little role
in the inpainting process.
▸ What if..?
▸ Interpretation: Want to see the pixel which plays key role in deciding z_hat
▸ → Attention
1
2
3
MY WORK
12
MY WORK 13
▸ Object-level occlusion: Narrowing down to object removal
▸ MS-COCO Dataset
▸ Train set: 118287
▸ COCO Api: Get annotations(instance)
▸ Use images which have person instance such that
smaller than 1/4 of the image bigger than 1/20 of the
image
▸ 30830, (rescale to 256x256)
1
MY WORK 14
▸ Limitation of contextual loss: less influence of farther part on inpainting
▸ Naive approach: for each grid of image, find pixel influence(attention_ratio) on
finding optimal z_hat
▸ Do it subsequently, grid by grid
occlusio
0.1 0.1 0.1
0.4 0.6
0.4 0.3
0.1 0.7
0.5 0.3
0.8 0.7
0.6 0.7
0.2 0.40.1 0.3
0.1 0.1
0.2
0.1
0.7
0.1
occlusio
2
MY WORK 15
▸ After finding optimal attention_ratio for each grid
▸ Find noize z hat based on ‘Original * Attn_Ratio image’ to reconstruct image
▸ Visualization of pixel influence on inpainting
ORIGINAL ORIGINAL*ATTN_RATIOMASKED
3
MY WORK 16
▸ However… because of computation inefficiency, unable to learn
▸ (Current situation) Rethinking about the attention method..
WITHOUT ATTENTION, 1000 EPOCH WITH ATTENTION, 20 EPOCH
ANY Q?
17
REFERENCE
▸ Semantic image inpainting with Deep Generative Models, Raymond A.
Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-
Johnson, Minh N. Do, CVPR 2017, https://arxiv.org/abs/1607.07539
▸ MS COCO dataset, http://cocodataset.org/#home
18
END OF
DOCUMENT
19

More Related Content

Recently uploaded

06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 

Recently uploaded (20)

06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 

Featured

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
marketingartwork
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
Skeleton Technologies
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
SpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
Rajiv Jayarajah, MAppComm, ACC
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Christy Abraham Joy
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
MindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
GetSmarter
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
Project for Public Spaces & National Center for Biking and Walking
 

Featured (20)

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 

Semantic Image Inpainting

  • 1. Semantic Image Inpainting Semantic Image Inpainting with Deep Generative Models, R Yeh et al. CVPR 2017 LAB SEMINAR 1 2018.02.13 SNU DATAMINING CENTER MINKI CHUNG
  • 2. TABLE OF CONTENTS ▸ Motivation ▸ What is image inpainting ▸ Problem statement ▸ Baseline ▸ Semantic image inpainting with Deep Generative Models ▸ My work ▸ Discussion 2
  • 4. MOTIVATION 4 ▸ What is image inpainting? https://www.youtube.com/watch?v=1F-6iRrgh1s
  • 5. MOTIVATION 5 ▸ Objective: Make attentive inpainter IF BACKGROUND OF TARGET REMOVING OBJECT IS SIMPLE, EXISTING METHOD WORKS FINE HOWEVER, IF BACKGROUND OF TARGET REMOVING OBJECT IS COMPLEX, BETTER NEED ANOTHER METHOD
  • 6. BASELINE ▸ Semantic Image Inpainting with Deep Generative Models, R Yeh et al. CVPR 2017 6
  • 7. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 7 ▸ DCGAN-Based ▸ Not end-to-end: ▸ 1. Train generator first (uncorrupted data) ▸ 2. Find z_hat for inpainting CONTEXTUAL LOSSPRIOR LOSS https://arxiv.org/abs/1607.07539
  • 8. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 8 ▸ Hypothesis: Trained G is efficient- image not from pdata (e.g., corrupted data) should not lie on the learned encoding manifold, z ▸ Objective: Find encoding z_hat: “closest” to the corrupted image while being constrained to the manifold, ▸ y: corrupted image M: binary mask(size equal to the image) https://arxiv.org/abs/1607.07539 PRIOR LOSSCONTEXTUAL LOSS
  • 9. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 9 ▸ Contextual Loss: Not simply l1 norm between G(z) and uncorrupted portion of input image y, do consider corrupted area ▸ Weighting term W, ▸ So, Wi: importance weight at pixel location i N(i): set of neighbors of pixel i in a local window BIGGER WEIGHT y: corrupted image M: binary mask(size equal to the image) https://arxiv.org/abs/1607.07539
  • 10. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 10 ▸ Prior Loss: how realistic the generated image is ▸ Identical to the GAN loss for training the discriminator D, ▸ ▸ Without Lp, the mapping from y to z may converge to a perceptually implausible result https://arxiv.org/abs/1607.07539
  • 11. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS 11 ▸ Tackling points: ▸ Object-level occlusion: Narrowing down for object removal ▸ Contextual loss: A pixel that is very far away from any holes plays very little role in the inpainting process. ▸ What if..? ▸ Interpretation: Want to see the pixel which plays key role in deciding z_hat ▸ → Attention 1 2 3
  • 13. MY WORK 13 ▸ Object-level occlusion: Narrowing down to object removal ▸ MS-COCO Dataset ▸ Train set: 118287 ▸ COCO Api: Get annotations(instance) ▸ Use images which have person instance such that smaller than 1/4 of the image bigger than 1/20 of the image ▸ 30830, (rescale to 256x256) 1
  • 14. MY WORK 14 ▸ Limitation of contextual loss: less influence of farther part on inpainting ▸ Naive approach: for each grid of image, find pixel influence(attention_ratio) on finding optimal z_hat ▸ Do it subsequently, grid by grid occlusio 0.1 0.1 0.1 0.4 0.6 0.4 0.3 0.1 0.7 0.5 0.3 0.8 0.7 0.6 0.7 0.2 0.40.1 0.3 0.1 0.1 0.2 0.1 0.7 0.1 occlusio 2
  • 15. MY WORK 15 ▸ After finding optimal attention_ratio for each grid ▸ Find noize z hat based on ‘Original * Attn_Ratio image’ to reconstruct image ▸ Visualization of pixel influence on inpainting ORIGINAL ORIGINAL*ATTN_RATIOMASKED 3
  • 16. MY WORK 16 ▸ However… because of computation inefficiency, unable to learn ▸ (Current situation) Rethinking about the attention method.. WITHOUT ATTENTION, 1000 EPOCH WITH ATTENTION, 20 EPOCH
  • 18. REFERENCE ▸ Semantic image inpainting with Deep Generative Models, Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa- Johnson, Minh N. Do, CVPR 2017, https://arxiv.org/abs/1607.07539 ▸ MS COCO dataset, http://cocodataset.org/#home 18