Few-Shot Unsupervised Image-to-Image Translation
Ming-Yu Liu Xun Huang Arun Mallya Tero Karras Timo Aila Jaakko Lehtinen Jan Kautz
While unsupervised/unpaired image-to-image translation methods (e.g., Liu and Tuzel;
Liu et al.; Zhu et al.; and Huang et al.) have achieved remarkable success, they are
still limited in two aspects.
• First, they generally require seeing many images of the target class during
training, and they produce poor translation outputs if only a few images are available at training time.
• Second, a model trained for one translation task cannot be repurposed for another
translation task at test time; the learned model is limited to translating images
between the two classes it was trained on.
• The proposed FUNIT framework aims at mapping an image of a source
class to an analogous image of an unseen target class by leveraging a few
target class images that are made available at test time.
• During training, the FUNIT model learns to translate images between
any two classes sampled from a set of source classes. At test time, the
model is presented with a few images of a target class that the model has never
seen before. The model leverages these few example images to translate
an input image of a source class to the target class.
We assume the content image belongs to object class cx, while each of the K
class images belongs to object class cy. In general, K is a small number and cx
is different from cy.
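To make the few-shot setting concrete, here is a minimal interface sketch (not the paper's architecture; the function names `encode_class` and `translate` and the toy "decoder" are illustrative stand-ins): the generator consumes one content image x of class cx and K class images y_1..y_K of class cy.

```python
import numpy as np

def encode_class(class_images):
    # Hypothetical class encoder: compute a per-image code (here just the
    # per-channel mean) and average it over the K available target-class
    # examples. At test time K is small, e.g. 1-5 images.
    return np.mean([img.mean(axis=(0, 1)) for img in class_images], axis=0)

def translate(content_image, class_images):
    # Sketch of G(x, {y_1..y_K}): combine the content image with the
    # class code. A toy "decoder" that shifts the content channels
    # toward the class code stands in for the real generator network.
    class_code = encode_class(class_images)        # shape: (channels,)
    return content_image + class_code              # broadcasts over H, W
```

The important structural point is that the target class enters only through the small set `class_images`, so the same trained `translate` can be pointed at a new class at test time simply by swapping in a few example images.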
The training objective is the minimax problem

min_D max_G  L_GAN(D, G) + λ_R L_R(G) + λ_F L_F(G),

where L_GAN, L_R, and L_F are the GAN loss, the content image
reconstruction loss, and the feature matching loss, and λ_R, λ_F are scalar weights.
Content reconstruction loss: L_R(G) = E_x[ ||x − G(x, {x})||_1 ] — when the input content image itself is used as the (single) class image, G should reconstruct the input.
Feature matching loss: L_F(G) = E_{x, {y_1..y_K}}[ || D_f(x̄) − (1/K) Σ_k D_f(y_k) ||_1 ], where x̄ = G(x, {y_1..y_K}) and D_f denotes the feature extractor of the discriminator.
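As a rough sketch of these two terms (not the authors' implementation; the arrays stand in for images and for discriminator features `D_f(·)`), both losses reduce to L1 distances:

```python
import numpy as np

def content_reconstruction_loss(x, x_rec):
    # L_R: mean L1 distance between the input content image x and its
    # reconstruction G(x, {x}), computed with the input as the class image.
    return np.abs(x - x_rec).mean()

def feature_matching_loss(feat_translated, feats_class_images):
    # L_F: mean L1 distance between the discriminator features of the
    # translated image x̄ and the average features of the K class images.
    mean_class_feat = np.mean(feats_class_images, axis=0)  # (1/K) Σ_k D_f(y_k)
    return np.abs(feat_translated - mean_class_feat).mean()
```

In practice these would be averaged over a minibatch and added to the GAN term with the weights λ_R and λ_F; the sketch only shows the per-sample distances.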