VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization
Seunghwan Choi*, Sunghyun Park*, Minsoo Lee*, Jaegul Choo
Korea Advanced Institute of Science and Technology (KAIST)
Image-based Virtual Try-on
• Virtual try-on refers to an image generation task of changing the clothing item on a
person into a different item.
• Various image-based virtual try-on methods commonly follow two steps:
1. Warping the clothing image to fit the human body.
2. Fusing the warped clothes with the person image, including pixel-level refinement, to generate the final result.
Limitation of Existing Virtual Try-On Approach
• Misalignment occurs between the warped clothes and the person’s body.
• The misaligned regions become more noticeable as the image size increases.
• A U-Net alone is insufficient for synthesizing initially occluded body parts in the final high-resolution image.
Key Contributions
• We propose a novel virtual try-on network called VITON-HD, the first model to successfully synthesize high-resolution virtual try-on images.
• We introduce a clothing-agnostic person representation that removes the
dependency on the clothing item originally worn.
• To address the misalignment problem, we propose ALIAS normalization and the ALIAS generator, which are effective in preserving clothing details.
• We demonstrate the superior performance of our method through experiments with
baselines on the newly collected dataset.
VITON-HD
VITON-HD
• Given a reference image 𝐼 containing a target person, we predict the segmentation
map 𝑆 and pose map 𝑃.
• We utilize 𝑆 and 𝑃 to construct a clothing-agnostic person image 𝐼𝑎 and segmentation map 𝑆𝑎, in which the information of the originally worn clothing item is fully eliminated.
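A minimal sketch of how such a clothing-agnostic image and segmentation might be built. The label id and the extra arm mask here are illustrative assumptions, not the paper's exact recipe:

```python
import numpy as np

def clothing_agnostic_inputs(image, seg, arms_mask, clothes_label=5):
    """Build a clothing-agnostic person image/segmentation sketch:
    erase pixels labelled as the originally worn clothes (plus arm
    regions likely covered by it) so the model cannot copy the old
    garment.

    image         : (H, W, 3) reference person image
    seg           : (H, W) integer segmentation map S
    arms_mask     : (H, W) boolean mask of arm regions to also erase
    clothes_label : id of the upper-clothes class (assumed)
    """
    clothes_mask = (seg == clothes_label) | arms_mask
    I_a = image.copy()
    I_a[clothes_mask] = 0   # blank out the originally worn clothes
    S_a = seg.copy()
    S_a[clothes_mask] = 0   # relabel erased pixels as background
    return I_a, S_a
```

Erasing both the garment and the arms it may occlude is what removes the dependency on the original clothing's shape.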
VITON-HD
• Given the clothing-agnostic person representation (𝑆𝑎, 𝑃) and the target clothing item 𝑐, the segmentation generator predicts the segmentation map 𝑆̂.
VITON-HD
• Using the clothing-agnostic person representation (𝐼𝑎, 𝑃) and 𝑆̂𝑐 as inputs, the Geometric Matching Module predicts the TPS transformation parameters 𝜃, and then 𝑐 is warped by 𝜃.
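As a rough sketch of the warping step: the TPS transform itself is omitted here, and we assume a dense sampling grid has already been derived from 𝜃. Nearest-neighbour sampling keeps the sketch short, whereas in practice differentiable bilinear sampling is used:

```python
import numpy as np

def warp_with_grid(c, grid):
    """Warp clothing image c by sampling it at grid coordinates.

    c    : (H, W, 3) clothing image
    grid : (H, W, 2) per-pixel sampling coords (x, y) in pixel space,
           e.g. produced by evaluating a TPS transform with parameters
           theta at every output pixel (assumed precomputed here).
    """
    H, W = c.shape[:2]
    ys = np.clip(np.round(grid[..., 1]).astype(int), 0, H - 1)
    xs = np.clip(np.round(grid[..., 0]).astype(int), 0, W - 1)
    return c[ys, xs]   # advanced indexing gathers one pixel per output location
```

With the identity grid this returns the input unchanged; any smooth deformation of the grid (such as a fitted TPS) bends the garment onto the body.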
VITON-HD
• The ALIAS generator synthesizes the final output image 𝐼̂ via ALIAS normalization and multi-scale refinement.
• The warped clothing image 𝑊(𝑐, 𝜃) and (𝐼𝑎, 𝑃) are injected into each layer of the generator to preserve the clothing details.
• In addition, each layer of the generator utilizes ALIAS normalization.
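A simplified stand-in for the layer-wise injection: resize the external conditions to each feature resolution and concatenate them channel-wise, so clothing details are re-supplied at every scale. The function name and shapes are assumptions for illustration:

```python
import numpy as np

def inject_conditions(h, warped_cloth, agnostic_inputs):
    """Concatenate resized conditioning inputs onto a feature map.

    h               : (C, H, W) generator feature map at the current scale
    warped_cloth    : (3, H0, W0) warped clothing image W(c, theta)
    agnostic_inputs : (K, H0, W0) clothing-agnostic image/pose channels
    """
    _, H, W = h.shape

    def resize(x):
        # nearest-neighbour resize via index mapping (dependency-free sketch)
        ys = np.arange(H) * x.shape[1] // H
        xs = np.arange(W) * x.shape[2] // W
        return x[:, ys][:, :, xs]

    return np.concatenate(
        [h, resize(warped_cloth), resize(agnostic_inputs)], axis=0
    )
```

Feeding the conditions at every scale, rather than only at the input, is what lets fine garment texture survive through the upsampling stack.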
ALIAS Normalization
• ALIAS normalization standardizes the regions of 𝑀misalign and the other regions in ℎ𝑖 separately, and then modulates the standardized activations using affine transformation parameters.
• The rationale behind this strategy is to remove the misleading information in the misaligned regions.
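The region-wise standardization above can be sketched as follows. This is a minimal NumPy sketch, assuming the modulation parameters 𝛾 and 𝛽 are already computed (in the paper they are inferred from the synthetic segmentation map):

```python
import numpy as np

def alias_normalize(h, m_misalign, gamma, beta, eps=1e-5):
    """Standardize the misaligned region and the remaining region of an
    activation map separately (per channel), then apply a spatially
    varying affine modulation.

    h          : (C, H, W) activation map h_i
    m_misalign : (H, W) binary mask of misaligned pixels M_misalign
    gamma,beta : (C, H, W) modulation parameters (assumed precomputed)
    """
    out = np.empty_like(h)
    mask = m_misalign.astype(bool)
    for region in (mask, ~mask):          # misaligned region, then the rest
        if region.sum() == 0:
            continue
        vals = h[:, region]                      # (C, N) pixels in region
        mu = vals.mean(axis=1, keepdims=True)    # per-channel region mean
        sigma = vals.std(axis=1, keepdims=True)  # per-channel region std
        out[:, region] = (vals - mu) / (sigma + eps)
    return gamma * out + beta
```

Standardizing the misaligned region with its own statistics prevents its (misleading) activations from shifting the statistics of the well-aligned pixels, which is the stated rationale of the normalization.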
Experiments
Figure: Qualitative comparison of the segmentation generator of ACGPN and VITON-HD.
The clothing-agnostic segmentation map used by each model is also reported.
Experiments
Experiments
Table: Quantitative comparison with baselines across different resolutions. VITON-HD* is a VITON-HD variant where the standardization in ALIAS normalization is replaced by channel-wise standardization as in the original instance normalization. For SSIM, higher is better; for LPIPS and FID, lower is better.
Experiments
Figure: Effects of ALIAS normalization. The orange-colored areas in the enlarged images indicate the misaligned regions.
Thank you!

[CVPR 2021] VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization
