SlideShare a Scribd company logo
1 of 15
CVPR 2021
Citation: 581
Presentor: Jaemin Jeong
Abstract
Annotation for Instance segmentation is usually expensive and time-consuming.
=> For example, 22 worker hours were spent per 1000 instance mask for COCO
We can summarize their contribution
• They find that the simple mechanism of pasting objects randomly is good enough.
• On COCO instance segmentation, we achieve 49.1 mask AP and 57.3 box AP, an
improvement of +0.6 mask AP and +1.5 box AP over the previous state-of-the-art.
• They further demonstrate that Copy-Paste can lead to significant improvements on the
LVIS 2020 Challenge winning entry by + 3.6 mask AP on rare categories.
Method
Method
• They approach for generating new data using Copy-Paste is very simple.
• They randomly select two images and apply random scale jittering and random
horizontal flipping on each of them.
• Then they select a random subset of objects from one of the images and paste them
onto the other image. Lastly, we adjust the ground-truth annotations accordingly they
remove fully occluded objects and update the masks and bounding boxes of
partially occluded objects.
Method – Blending Pasted Objects
• For composing new objects into an image, they compute the binary mask (𝜶) of
pasted objects using ground-truth annotations and compute the new image as
𝑰𝟏 × 𝜶 + 𝑰𝟐 × (𝟏 − 𝜶) where 𝑰𝟏 is the pasted image and 𝑰𝟐 is the main image.
• To smooth out the edges of the
pasted objects we apply a
Gaussian filter to α similar to
“blending” in.
• But unlike, we also found that
simply composing without any
blending has similar performance.
Method – Large Scale Jittering
• We use two different types of augmentation methods in conjunction with Copy-Paste throughout the text:
standard scale jittering (SSJ) and large scale jittering (LSJ).
• These methods randomly resize and crop images.
Method – Self-training Copy-Paste
• In addition to studying Copy-Paste on supervised data, we also
experiment with it as a way of incorporating additional
unlabeled images.
• Our self-training Copy-Paste procedure is as follows:
(1) train a supervised model with Copy-Paste augmentation on
labeled data
(2) generate pseudo labels on unlabeled data
(3) paste ground-truth instance into pseudo labeled and
supervised labeled images and train a model on this new
data
Experiments - Experimental Settings
Architecture
They use Mask R-CNN with Efficient-Net or ResNet as the backbone architecture.
Dataset
They use the COCO dataset which has 118k training images.
For self-training experiments, they use the unlabeled COCO dataset (120k images)
and the Objects365 dataset (610k images) as unlabeled images.
For transfer learning experiments, they pre-train our models on the COCO dataset
and then fine-tune on the Pascal VOC dataset.
For semantic segmentation, they train our models on the train set (1.5k images) of the
PASCAL VOC 2012 segmentation dataset.
For detection, they train on the trainval set of PASCAL VOC 2007 and PASCAL VOC
2007 and PASCAL VOC 2012.
Experiments
Rand init : backbone by Random initialization
ImageNet init : backbone ImageNet pre-training initialization
Experiments
Experiments
Experiments
Experiments RFS: Repeat Factor Sampling
• We also benchmark Copy-Paste on LVIS v1.0 (100k training images) and report
results on LVIS v1.0 val (20k images).
• LVIS has 1203 classes to simulate the long-tail distribution of classes in natural
images
Repeat Factor Sampling
RFS is a simple and effective approach that balances the class distribution by
oversampling images containing rare classes.
For each category 𝒄, let 𝒇𝒄 be the fraction of training images that contain at least
one instance of 𝒄. Accordingly, a category-level repeat factor 𝒓𝒄 is defined as
𝑟𝑐 = max 1,
𝑡
𝑓𝑐
What 𝒕 is a hyper-parameter that intuitively controls the point at which
oversampling begins.
If 𝒇𝒄 is greater than or equal to 𝒕, then there is no oversampling for that category.
Conclusion
• Copy-Paste performs well across multiple experimental settings and provides
significant improvements on top of strong baselines, both on the COCO and LVIS
instance segmentation benchmarks.
• The Copy-Paste augmentation strategy is simple, easy to plug into any instance
segmentation codebase, and does not increase the training cost or inference time.
• We also showed that Copy-Paste is useful for incorporating extra unlabeled images
during training and is additive on top of successful self-training techniques.

More Related Content

Similar to Jaemin_230701_Simple_Copy_paste.pptx

Saptashwa_Mitra_Sitakanta_Mishra_Final_Project_Report
Saptashwa_Mitra_Sitakanta_Mishra_Final_Project_ReportSaptashwa_Mitra_Sitakanta_Mishra_Final_Project_Report
Saptashwa_Mitra_Sitakanta_Mishra_Final_Project_ReportSitakanta Mishra
 
Repairing and Inpainting Damaged Images using Adaptive Diffusion Technique
Repairing and Inpainting Damaged Images using Adaptive Diffusion TechniqueRepairing and Inpainting Damaged Images using Adaptive Diffusion Technique
Repairing and Inpainting Damaged Images using Adaptive Diffusion TechniqueIJMTST Journal
 
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...Sujit Pal
 
ResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksSeunghyun Hwang
 
Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...
Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...
Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...Alex Conway
 
深度學習在AOI的應用
深度學習在AOI的應用深度學習在AOI的應用
深度學習在AOI的應用CHENHuiMei
 
The deep bootstrap 논문 리뷰
The deep bootstrap 논문 리뷰The deep bootstrap 논문 리뷰
The deep bootstrap 논문 리뷰Seonghoon Jung
 
Avihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slidesAvihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slideswolf
 
Bayesian Model-Agnostic Meta-Learning
Bayesian Model-Agnostic Meta-LearningBayesian Model-Agnostic Meta-Learning
Bayesian Model-Agnostic Meta-LearningSangwoo Mo
 
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION cscpconf
 
PR-411: Model soups: averaging weights of multiple fine-tuned models improves...
PR-411: Model soups: averaging weights of multiple fine-tuned models improves...PR-411: Model soups: averaging weights of multiple fine-tuned models improves...
PR-411: Model soups: averaging weights of multiple fine-tuned models improves...Sunghoon Joo
 
CAPSULE NETWORK PERFORMANCE WITH AUTONOMOUS NAVIGATION
CAPSULE NETWORK PERFORMANCE WITH AUTONOMOUS NAVIGATIONCAPSULE NETWORK PERFORMANCE WITH AUTONOMOUS NAVIGATION
CAPSULE NETWORK PERFORMANCE WITH AUTONOMOUS NAVIGATIONgerogepatton
 
High performance large-scale image recognition without normalization
High performance large-scale image recognition without normalizationHigh performance large-scale image recognition without normalization
High performance large-scale image recognition without normalizationtaeseon ryu
 
教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...
教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...
教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...cvpaper. challenge
 
4 high performance large-scale image recognition without normalization
4 high performance large-scale image recognition without normalization4 high performance large-scale image recognition without normalization
4 high performance large-scale image recognition without normalizationDonghoon Park
 

Similar to Jaemin_230701_Simple_Copy_paste.pptx (20)

lec10svm.ppt
lec10svm.pptlec10svm.ppt
lec10svm.ppt
 
Svm ms
Svm msSvm ms
Svm ms
 
lec10svm.ppt
lec10svm.pptlec10svm.ppt
lec10svm.ppt
 
Saptashwa_Mitra_Sitakanta_Mishra_Final_Project_Report
Saptashwa_Mitra_Sitakanta_Mishra_Final_Project_ReportSaptashwa_Mitra_Sitakanta_Mishra_Final_Project_Report
Saptashwa_Mitra_Sitakanta_Mishra_Final_Project_Report
 
Mnist soln
Mnist solnMnist soln
Mnist soln
 
Repairing and Inpainting Damaged Images using Adaptive Diffusion Technique
Repairing and Inpainting Damaged Images using Adaptive Diffusion TechniqueRepairing and Inpainting Damaged Images using Adaptive Diffusion Technique
Repairing and Inpainting Damaged Images using Adaptive Diffusion Technique
 
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
 
ResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksResNeSt: Split-Attention Networks
ResNeSt: Split-Attention Networks
 
Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...
Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...
Convolutional Neural Networks for Image Classification (Cape Town Deep Learni...
 
深度學習在AOI的應用
深度學習在AOI的應用深度學習在AOI的應用
深度學習在AOI的應用
 
The deep bootstrap 논문 리뷰
The deep bootstrap 논문 리뷰The deep bootstrap 논문 리뷰
The deep bootstrap 논문 리뷰
 
Avihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slidesAvihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slides
 
lec6a.ppt
lec6a.pptlec6a.ppt
lec6a.ppt
 
Bayesian Model-Agnostic Meta-Learning
Bayesian Model-Agnostic Meta-LearningBayesian Model-Agnostic Meta-Learning
Bayesian Model-Agnostic Meta-Learning
 
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
 
PR-411: Model soups: averaging weights of multiple fine-tuned models improves...
PR-411: Model soups: averaging weights of multiple fine-tuned models improves...PR-411: Model soups: averaging weights of multiple fine-tuned models improves...
PR-411: Model soups: averaging weights of multiple fine-tuned models improves...
 
CAPSULE NETWORK PERFORMANCE WITH AUTONOMOUS NAVIGATION
CAPSULE NETWORK PERFORMANCE WITH AUTONOMOUS NAVIGATIONCAPSULE NETWORK PERFORMANCE WITH AUTONOMOUS NAVIGATION
CAPSULE NETWORK PERFORMANCE WITH AUTONOMOUS NAVIGATION
 
High performance large-scale image recognition without normalization
High performance large-scale image recognition without normalizationHigh performance large-scale image recognition without normalization
High performance large-scale image recognition without normalization
 
教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...
教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...
教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...
 
4 high performance large-scale image recognition without normalization
4 high performance large-scale image recognition without normalization4 high performance large-scale image recognition without normalization
4 high performance large-scale image recognition without normalization
 

More from JAEMINJEONG5

2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptxJAEMINJEONG5
 
2021 04-04-google nmt
2021 04-04-google nmt2021 04-04-google nmt
2021 04-04-google nmtJAEMINJEONG5
 
2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrasesJAEMINJEONG5
 
2021 03-02-transformer interpretability
2021 03-02-transformer interpretability2021 03-02-transformer interpretability
2021 03-02-transformer interpretabilityJAEMINJEONG5
 
2021 03-01-on the relationship between self-attention and convolutional layers
2021 03-01-on the relationship between self-attention and convolutional layers2021 03-01-on the relationship between self-attention and convolutional layers
2021 03-01-on the relationship between self-attention and convolutional layersJAEMINJEONG5
 
2021 01-04-learning filter-basis
2021 01-04-learning filter-basis2021 01-04-learning filter-basis
2021 01-04-learning filter-basisJAEMINJEONG5
 
2021 01-02-linformer
2021 01-02-linformer2021 01-02-linformer
2021 01-02-linformerJAEMINJEONG5
 
2020 12-04-shake shake
2020 12-04-shake shake2020 12-04-shake shake
2020 12-04-shake shakeJAEMINJEONG5
 
2020 11 4_bag_of_tricks
2020 11 4_bag_of_tricks2020 11 4_bag_of_tricks
2020 11 4_bag_of_tricksJAEMINJEONG5
 
2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heartJAEMINJEONG5
 
2020 11 1_sleep_net
2020 11 1_sleep_net2020 11 1_sleep_net
2020 11 1_sleep_netJAEMINJEONG5
 

More from JAEMINJEONG5 (20)

2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx2022-01-17-Rethinking_Bisenet.pptx
2022-01-17-Rethinking_Bisenet.pptx
 
Swin transformer
Swin transformerSwin transformer
Swin transformer
 
2021 06-02-tabnet
2021 06-02-tabnet2021 06-02-tabnet
2021 06-02-tabnet
 
2021 05-04-u2-net
2021 05-04-u2-net2021 05-04-u2-net
2021 05-04-u2-net
 
2021 04-04-google nmt
2021 04-04-google nmt2021 04-04-google nmt
2021 04-04-google nmt
 
2021 04-03-sean
2021 04-03-sean2021 04-03-sean
2021 04-03-sean
 
2021 04-01-dalle
2021 04-01-dalle2021 04-01-dalle
2021 04-01-dalle
 
2021 03-02-spade
2021 03-02-spade2021 03-02-spade
2021 03-02-spade
 
2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases2021 03-02-distributed representations-of_words_and_phrases
2021 03-02-distributed representations-of_words_and_phrases
 
2021 03-02-transformer interpretability
2021 03-02-transformer interpretability2021 03-02-transformer interpretability
2021 03-02-transformer interpretability
 
2021 03-01-on the relationship between self-attention and convolutional layers
2021 03-01-on the relationship between self-attention and convolutional layers2021 03-01-on the relationship between self-attention and convolutional layers
2021 03-01-on the relationship between self-attention and convolutional layers
 
2021 01-04-learning filter-basis
2021 01-04-learning filter-basis2021 01-04-learning filter-basis
2021 01-04-learning filter-basis
 
2021 01-02-linformer
2021 01-02-linformer2021 01-02-linformer
2021 01-02-linformer
 
2020 12-04-shake shake
2020 12-04-shake shake2020 12-04-shake shake
2020 12-04-shake shake
 
2020 12-03-vit
2020 12-03-vit2020 12-03-vit
2020 12-03-vit
 
2020 12-2-detr
2020 12-2-detr2020 12-2-detr
2020 12-2-detr
 
2020 11 4_bag_of_tricks
2020 11 4_bag_of_tricks2020 11 4_bag_of_tricks
2020 11 4_bag_of_tricks
 
2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart2020 11 2_automated sleep stage scoring of the sleep heart
2020 11 2_automated sleep stage scoring of the sleep heart
 
2020 11 1_sleep_net
2020 11 1_sleep_net2020 11 1_sleep_net
2020 11 1_sleep_net
 
2020 12-1-adam w
2020 12-1-adam w2020 12-1-adam w
2020 12-1-adam w
 

Recently uploaded

CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 

Recently uploaded (20)

CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 

Jaemin_230701_Simple_Copy_paste.pptx

  • 2. Abstract Annotation for Instance segmentation is usually expensive and time-consuming. => For example, 22 worker hours were spent per 1000 instance mask for COCO We can summarize their contribution • They find that the simple mechanism of pasting objects randomly is good enough. • On COCO instance segmentation, we achieve 49.1 mask AP and 57.3 box AP, an improvement of +0.6 mask AP and +1.5 box AP over the previous state-of-the-art. • They further demonstrate that Copy-Paste can lead to significant improvements on the LVIS 2020 Challenge winning entry by + 3.6 mask AP on rare categories.
  • 4. Method • They approach for generating new data using Copy-Paste is very simple. • They randomly select two images and apply random scale jittering and random horizontal flipping on each of them. • Then they select a random subset of objects from one of the images and paste them onto the other image. Lastly, we adjust the ground-truth annotations accordingly they remove fully occluded objects and update the masks and bounding boxes of partially occluded objects.
  • 5. Method – Blending Pasted Objects • For composing new objects into an image, they compute the binary mask (𝜶) of pasted objects using ground-truth annotations and compute the new image as 𝑰𝟏 × 𝜶 + 𝑰𝟐 × (𝟏 − 𝜶) where 𝑰𝟏 is the pasted image and 𝑰𝟐 is the main image. • To smooth out the edges of the pasted objects we apply a Gaussian filter to α similar to “blending” in. • But unlike, we also found that simply composing without any blending has similar performance.
  • 6. Method – Large Scale Jittering • We use two different types of augmentation methods in conjunction with Copy-Paste throughout the text: standard scale jittering (SSJ) and large scale jittering (LSJ). • These methods randomly resize and crop images.
  • 7. Method – Self-training Copy-Paste • In addition to studying Copy-Paste on supervised data, we also experiment with it as a way of incorporating additional unlabeled images. • Our self-training Copy-Paste procedure is as follows: (1) train a supervised model with Copy-Paste augmentation on labeled data (2) generate pseudo labels on unlabeled data (3) paste ground-truth instance into pseudo labeled and supervised labeled images and train a model on this new data
  • 8. Experiments - Experimental Settings Architecture They use Mask R-CNN with Efficient-Net or ResNet as the backbone architecture. Dataset They use the COCO dataset which has 118k training images. For self-training experiments, they use the unlabeled COCO dataset (120k images) and the Objects365 dataset (610k images) as unlabeled images. For transfer learning experiments, they pre-train our models on the COCO dataset and then fine-tune on the Pascal VOC dataset. For semantic segmentation, they train our models on the train set (1.5k images) of the PASCAL VOC 2012 segmentation dataset. For detection, they train on the trainval set of PASCAL VOC 2007 and PASCAL VOC 2007 and PASCAL VOC 2012.
  • 9. Experiments Rand init : backbone by Random initialization ImageNet init : backbone ImageNet pre-training initialization
  • 13. Experiments RFS: Repeat Factor Sampling • We also benchmark Copy-Paste on LVIS v1.0 (100k training images) and report results on LVIS v1.0 val (20k images). • LVIS has 1203 classes to simulate the long-tail distribution of classes in natural images
  • 14. Repeat Factor Sampling RFS is a simple and effective approach that balances the class distribution by oversampling images containing rare classes. For each category 𝒄, let 𝒇𝒄 be the fraction of training images that contain at least one instance of 𝒄. Accordingly, a category-level repeat factor 𝒓𝒄 is defined as 𝑟𝑐 = max 1, 𝑡 𝑓𝑐 What 𝒕 is a hyper-parameter that intuitively controls the point at which oversampling begins. If 𝒇𝒄 is greater than or equal to 𝒕, then there is no oversampling for that category.
  • 15. Conclusion • Copy-Paste performs well across multiple experimental settings and provides significant improvements on top of strong baselines, both on the COCO and LVIS instance segmentation benchmarks. • The Copy-Paste augmentation strategy is simple, easy to plug into any instance segmentation codebase, and does not increase the training cost or inference time. • We also showed that Copy-Paste is useful for incorporating extra unlabeled images during training and is additive on top of successful self-training techniques.

Editor's Notes

  1. Hello, today my presentation is Simple Copy-Paste that a Strong Data Augmentation Method for Instnce Segmentation. This paper propose Simple Copy-Paste Augmentation with many experiments They accepted CVPR 2021 They have a 581 citation Okay, Let’s start presentation
  2. What is it about instance segmentation problem. Instance segmentation models are often data-hungry. At the same time, annotating large datasets for instance segmentation is usually expensive and time-consuming. For example, 22 worker hours were spent per 1000 instance mask for COCO They propose the method called Copy-Paste for solving this problem. This is paper of abstract. We can summarize to 3 sentences their contribution. First, They find that the simple mechanism of pasting objects randomly is good enough. Second, They get more mask average precision in the previous state-of-the-art. Third, They demonstrate improvements of mask average precision on rare categories. Rare categories means small amount class.
  3. It’s just object copy and paste. It’s very simple
  4. Okay, now we should know copy-paste augmentation. Copy paste is very simple, They just randomly select two images and then apply random scale jittering and random horizontal flipping on each of them. Then we select a random subset of objects from one of the images and paste them onto the other image. That’s very simple
  5. They use alpha blending for Copy-Paste. The Alpha Blending can copy-paste with 2 rgb images and 1 gray scale mask image. We can calculate with simple equation
  6. They make a two different type of augmentation methods SSJ and LSJ SSJ mean standard scale jittering and LSJ mean large scale jittering Standard scale jittering mean augmentation method with resizes and crops an image with a resize range of 0.8 to 1.25 of the original image size. Large Scale Jittering mean augmentation method with resizes and crops an image with a resize range of 0.1 to 2.0 of the original image size.
  7. In addition, they experiment with Copy-Paste for self-training as well. The self-training is according to 3 step process. First train a supervised model with Copy-Paste augmentation on labeled data Second generate pseudo labels on unlabeled data Third paste ground-truth instance into pseudo labeled and supervised labeled images and train a model on this new data
  8. Architecture This Is about Model architecture They use Mask-R-CNN with Efficient Net or ResNet as the backbone architecture We also employ feature pyramid networks for multi-scale feature fusion. We use pyramid levels from 𝑃 2 to 𝑃 0 , with an anchor size of 8× 2 𝑙 and 3 anchors per pixel. Our strongest model uses Cascade R-CNN, EfficientNet-B7 as the backbone and NAS-FPN as the feature pyramid with levels from 𝑃 3 to 𝑃 7 . The anchor size is 4× 2 𝑙 and we have 9 anchors per pixel. Our NAS-FPN model uses 5 repeats and we replace convolution layers with ResNet bottleneck blocks.
  9. This left graph show mask average precision between Rand init and ImageNet init. This right graph show mask average precision between SSJ and LSJ. Random initialization is better than ImageNet initialization. LSJ is better than SSJ
  10. These graph show COCO average precision between mixup and Copy-Paste. Copy Paste is better than mixup
  11. Copy-paste works well across a variety of different model architectures, model sizes and image resolutions. And Copy-Paste with self-training works well especially when used together
  12. This table shows the average precision improvement in semantic segmentation and object detection and instance segmentation when applied to SOTA with COCO and PASCAL VOC 2007, 2012 dataset.
  13. Finally, they shows the average precision improvement in object detection and instance segmentation when applied to various model with LVIS dataset. And they compared the performance using RFS together. However, there was no significant performance improvement.
  14. What is