SlideShare a Scribd company logo
1 of 27
Redefining transfer learning
for computer vision
Sayak Paul
PyImageSearch
@RisingSayak
Doing it better with self-supervised learning
Souradip Chakraborty
Data Scientist @ WalmartLabs
Agenda
● The ImageNet pre-training era in Computer Vision
○ Success
○ Challenges
● Introduction to self-supervised learning
● Pre-text tasks
● Downstream tasks
● Examples
● Code demo with image inpainting
● Challenges and directions
The ImageNet pre-training era
Transfer learning
traditionally
ImageNet pre-training era:
Success● Less data
● Architectural decision making
● Faster prototyping
ImageNet pre-training era:
Success● Some architectures widely used for the purpose:
○ ResNet and its different variants
○ EfficientNet
○ MobileNet
○ ...
Challenges
Transfer learning
traditionally
ImageNet pre-training era:
Challenges● Knowledge of the pre-trained models not applicable to the target data
domain
● Source data distribution of the pre-trained models differ from the target
data distribution
● Shortage of labeled data
Can we leverage the power of
inherent patterns of
(unlabeled) data?
Self-supervised learning 101
● Pre-text tasks
● Downstream tasks
● Examples (one from both of the following domains)
○ Computer Vision
○ Natural Language Processing
Remarkable results
Remarkable results
The idea of training models with
masked inputs and having the
models learn to unmask them
(courtesy of Yann LeCun*)
The idea of training models with
masked inputs and having the
models learn to unmask them
(courtesy of Yann LeCun*)
Using image inpainting as a pre-
text task
Image inpainting as a pre-text task
● Image inpainting basics
● Our approach
○ Partial Convs
○ U-Net based architectures for information propagation
Image inpainting data preparation
Base dataset: CIFAR10
Image inpainting model schematics
● Partial convs
● U-Net like architecture
● Loss function
● Performance metric
Image inpainting results
Now, can we leverage the
knowledge of image inpainting
for other tasks like image
classification?
Using an image inpainting model for
other tasks
● Training an image classification model just like we train a text-
classification model with pre-trained embeddings
Image classification model schematics
WIP
Image classification results
WIP
Self-supervised learning
challenges
How to choose a pre-text task?
● Pre-text invariant representation
● Jeremy Howard’s suggestions via his blog - Self-supervised learning and
computer vision
Measuring performance
● How to evaluate a self-supervised learning system?
● Noise contrastive estimation/consistency loss
Self-supervised learning: “the
next step in AI” - Yann LeCun
Recent approaches
● FixMatch
● SimCLR

More Related Content

Similar to Transfer Learning CV - Souradip and Sayak

contrastive-learning2.pdf
contrastive-learning2.pdfcontrastive-learning2.pdf
contrastive-learning2.pdf
omogire
 
ICLR 2020 Recap
ICLR 2020 RecapICLR 2020 Recap
ICLR 2020 Recap
Sri Ambati
 

Similar to Transfer Learning CV - Souradip and Sayak (20)

Overview of machine learning
Overview of machine learning Overview of machine learning
Overview of machine learning
 
contrastive-learning2.pdf
contrastive-learning2.pdfcontrastive-learning2.pdf
contrastive-learning2.pdf
 
mPLUG
mPLUGmPLUG
mPLUG
 
Deep Generative Modelling
Deep Generative ModellingDeep Generative Modelling
Deep Generative Modelling
 
Project report
Project reportProject report
Project report
 
Real world machine learning with Java for Fumankaitori.com
Real world machine learning with Java for Fumankaitori.comReal world machine learning with Java for Fumankaitori.com
Real world machine learning with Java for Fumankaitori.com
 
Discrete Event Simulation, CASE tool built using C#
Discrete Event Simulation, CASE tool built using C#Discrete Event Simulation, CASE tool built using C#
Discrete Event Simulation, CASE tool built using C#
 
Interactive Image Processing Demos for the Web
Interactive Image Processing Demos for the WebInteractive Image Processing Demos for the Web
Interactive Image Processing Demos for the Web
 
Lecture_16_Self-supervised_Learning.pptx
Lecture_16_Self-supervised_Learning.pptxLecture_16_Self-supervised_Learning.pptx
Lecture_16_Self-supervised_Learning.pptx
 
Machine Learning - From Scratch to Production
Machine Learning - From Scratch to ProductionMachine Learning - From Scratch to Production
Machine Learning - From Scratch to Production
 
Teaching Data-driven Video Processing via Crowdsourced Data Collection
Teaching Data-driven Video Processing via Crowdsourced Data CollectionTeaching Data-driven Video Processing via Crowdsourced Data Collection
Teaching Data-driven Video Processing via Crowdsourced Data Collection
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or reality
 
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systemsBIG2016- Lessons Learned from building real-life user-focused Big Data systems
BIG2016- Lessons Learned from building real-life user-focused Big Data systems
 
Full-stack Data Scientist
Full-stack Data ScientistFull-stack Data Scientist
Full-stack Data Scientist
 
Meetup 18/10/2018 - Artificiële intelligentie en mobiliteit
Meetup 18/10/2018 - Artificiële intelligentie en mobiliteitMeetup 18/10/2018 - Artificiële intelligentie en mobiliteit
Meetup 18/10/2018 - Artificiële intelligentie en mobiliteit
 
ICLR 2020 Recap
ICLR 2020 RecapICLR 2020 Recap
ICLR 2020 Recap
 
L15.pptx
L15.pptxL15.pptx
L15.pptx
 
Unit 4
Unit 4Unit 4
Unit 4
 
jefferson-mae Masked Autoencoders based Pretraining
jefferson-mae Masked Autoencoders based Pretrainingjefferson-mae Masked Autoencoders based Pretraining
jefferson-mae Masked Autoencoders based Pretraining
 
cnn explainer.pptx
cnn explainer.pptxcnn explainer.pptx
cnn explainer.pptx
 

Recently uploaded

TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 

Transfer Learning CV - Souradip and Sayak

Editor's Notes

  1. During the first half of the talk, we would discuss a bit about the traditional means of doing transfer learning in the world of computer vision.
  2. Paper link: https://arxiv.org/abs/1911.04252
  3. BERT Paper link: https://arxiv.org/pdf/1810.04805
  4. * Source: Self-Supervised Learning: the next step in AI
  5. * Source: Self-Supervised Learning: the next step in AI