SlideShare a Scribd company logo
1 of 18
MINOR PROJECT REPORT
By: Druv Gera
DENOISING DIFFUSION
PROBABILISTIC MODEL
AGENDA
Introduction​
Methodology
​Diffusion Models
Training Results
​Results and Discussions
References​
INTRODUCTION
• Evolution of Generative Models:
• Rapid progress in recent years.
• Capable of generating human-like language, synthetic
images, and diverse audio.
• Challenges with Current Techniques:
• Existing computer vision techniques predict predetermined
object characteristics.
• Limitations in generality and usability.
• Alternative Approach - Learning from Raw Text:
• Proposal to learn directly from raw text describing the image.
• Overcomes limitations of predefined labels.
3
INTRODUCTION – CONT.…
• Introduction to CLIP:
• Contrastive models like CLIP as a key inspiration.
• Demonstrates robust image representations capturing
both semantics and style.
• Project Objectives:
• Two-stage model proposed:
• Prior generating a CLIP image embedding from a
given text.
• Decoder generating an image based on these CLIP
image embeddings.
4
METHODOLOGY
METHODOLOGY
• CLIP as a Representation Learner:
• CLIP (Contrastive Language-Image Pretraining) recognized for
robust image representations.
• Desirable properties, including robustness and applicability to
various tasks.
• Two-Stage Model: Prior and Decoder:
• Introduction of the proposed two-stage model for image generation.
• Prior Mechanism:
• Generates a CLIP image embedding from a given text caption.
• Decoder:
• Generates an image based on CLIP image embeddings.
• Leveraging CLIP Image Embeddings:
• CLIP image embeddings used as a bridge between textual
descriptions and image generation.
• Enhances generality and adaptability in the image generation
process.
6
METHODOLOGY
• Combining CLIP and Diffusion Models:
• Synergy between CLIP and diffusion models for improved
image synthesis.
• Language-guided image manipulations achieved through the
joint embedding space of CLIP.
• Language-Guided Image Manipulations:
• Application of CLIP for language-guided modifications in
generated images.
• Explores the joint embedding space of CLIP for enhanced
control and manipulation
7
DIFFUSION MODELS
• Introduction to Diffusion Models:
• Diffusion models represent a state-of-the-art family of deep
generative models.
• Break the long-time dominance of GANs in challenging
image synthesis tasks.
• Role of Denoising Diffusion Probabilistic Models (DDPM):
• DDPMs use two Markov chains: forward and reverse.
• Forward chain perturbs data to noise, while the reverse chain
converts noise back to data.
• Three Predominant Formulations:
• DDPMs, Score-Based Generative Models (SGMs), and
Stochastic Differential Equations (Score SDEs).
• Each formulation offers unique features and benefits for the
diffusion model.
8
DIFFUSION MODELS
• Training the Diffusion Model on Stanford Cars Dataset:
• Dataset: Stanford Cars dataset with approximately 8000
images in the train set.
• Training process involves progressively destructing data by
injecting noise.
• Forward and Reverse Markov Chains in Diffusion Models:
• Forward process injects noise into data until all structures are
lost.
• Reverse process gradually removes noise by running a
learnable Markov chain.
9
DENOISING DIFFUSION
PROBABILISTIC MODELS (DDPM)
A denoising diffusion probabilistic model (DDPM) makes use of two
Markov chains: a forward chain that perturbs data to noise, and a
reverse chain that converts noise back to data.
New data points are subsequently generated by first sampling a
random vector from the prior distribution, followed by ancestral
sampling through the reverse Markov chain.
Using the chain rule of probability and the Markov property, we can
factorize the joint
distribution of x1, x2 . . . xT conditioned on x0, denoted as q(x1, . . . ,
xT| x0), into
10
TRAINING
We trained the diffusion model to generate images of cars as per the
theory explained above. As a dataset, we used the Stanford Cars
dataset which consists of around 8000 images in the trainset
11
TRAINING RESULTS
• Training the Diffusion Model on Stanford Cars Dataset:
• Utilized the Stanford Cars dataset comprising around 8000
images in the training set.
• Focus on generating images of cars as per the project's
objectives.
• Results from the Forward Markov Chain:
• Forward process involves injecting noise into data
progressively.
• Reverse Markov Chain Using Simple UNet Architecture:
• Utilized a Simple UNet Architecture for the reverse Markov
chain.
• Trained on the dataset for 100 epochs to optimize the
learnable transition kernel.
12
TRAINING RESULTS
13
TRAINING RESULTS
14
RESULTS
15
• Explored diffusion models comprehensively.
• Showed diffusion models, as likelihood-based models,
outperform state-of-the-art GANs.
• Highlighted the stationary training objective's role in
achieving superior sample quality.
• Introduced an improved architecture, successfully
applied to unconditional image generation.
• Presented a classifier guidance technique for extending
quality to class-conditional tasks.
RESULTS
16
REFERENCES
17
[1] Aditya Ramesh, Prafulla Dhariwal, Alex Nicho, Casey Chu, Mark
Chen. Hierarchical
Text-Conditional Image Generation with CLIP Latents,
arXiv:2204.06125
[2] Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh,
Gabriel Goh, Sandhini
Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark,
Gretchen Krueger, Ilya
Sutskever. Learning Transferable Visual Models From Natural
Language Supervision,
arXiv:2103.000201
THANK YOU
DRUV GERA
COMPUTER SCIENCE
DELHI TECHNICAL
UNIVERSITY

More Related Content

Similar to Minor Project Report on Denoising Diffusion Probabilistic Model

Abstractions and Directives for Adapting Wavefront Algorithms to Future Archi...
Abstractions and Directives for Adapting Wavefront Algorithms to Future Archi...Abstractions and Directives for Adapting Wavefront Algorithms to Future Archi...
Abstractions and Directives for Adapting Wavefront Algorithms to Future Archi...
inside-BigData.com
 
A neural image caption generator
A neural image caption generatorA neural image caption generator
A neural image caption generator
heedaeKwon
 

Similar to Minor Project Report on Denoising Diffusion Probabilistic Model (20)

ANALYSIS OF INSTANCE SEGMENTATION APPROACH FOR LANE DETECTION
ANALYSIS OF INSTANCE SEGMENTATION APPROACH FOR LANE DETECTIONANALYSIS OF INSTANCE SEGMENTATION APPROACH FOR LANE DETECTION
ANALYSIS OF INSTANCE SEGMENTATION APPROACH FOR LANE DETECTION
 
How to use transfer learning to bootstrap image classification and question a...
How to use transfer learning to bootstrap image classification and question a...How to use transfer learning to bootstrap image classification and question a...
How to use transfer learning to bootstrap image classification and question a...
 
Talk@rmit 09112017
Talk@rmit 09112017Talk@rmit 09112017
Talk@rmit 09112017
 
DreamPose: Fashion Image to Video Synthesis via Stable Diffusion
DreamPose: Fashion Image to Video Synthesis via Stable DiffusionDreamPose: Fashion Image to Video Synthesis via Stable Diffusion
DreamPose: Fashion Image to Video Synthesis via Stable Diffusion
 
Transfer Learning: Breve introducción a modelos pre-entrenados.
Transfer Learning: Breve introducción a modelos pre-entrenados.Transfer Learning: Breve introducción a modelos pre-entrenados.
Transfer Learning: Breve introducción a modelos pre-entrenados.
 
OReilly AI Transfer Learning
OReilly AI Transfer LearningOReilly AI Transfer Learning
OReilly AI Transfer Learning
 
Bangla Hand Written Digit Recognition presentation slide .pptx
Bangla Hand Written Digit Recognition presentation slide .pptxBangla Hand Written Digit Recognition presentation slide .pptx
Bangla Hand Written Digit Recognition presentation slide .pptx
 
Computer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathonComputer vision-nit-silchar-hackathon
Computer vision-nit-silchar-hackathon
 
Build, Scale, and Deploy Deep Learning Pipelines with Ease
Build, Scale, and Deploy Deep Learning Pipelines with EaseBuild, Scale, and Deploy Deep Learning Pipelines with Ease
Build, Scale, and Deploy Deep Learning Pipelines with Ease
 
Transfer Learning (20230516)
Transfer Learning (20230516)Transfer Learning (20230516)
Transfer Learning (20230516)
 
StackNet Meta-Modelling framework
StackNet Meta-Modelling frameworkStackNet Meta-Modelling framework
StackNet Meta-Modelling framework
 
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
 
Abstractions and Directives for Adapting Wavefront Algorithms to Future Archi...
Abstractions and Directives for Adapting Wavefront Algorithms to Future Archi...Abstractions and Directives for Adapting Wavefront Algorithms to Future Archi...
Abstractions and Directives for Adapting Wavefront Algorithms to Future Archi...
 
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim HunterDeep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
 
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshopJRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop
JRs presentation-few-shot-learning-overview @ AI4Media WP5 workshop
 
IncQuery-D: Distributed Incremental Model Queries over the Cloud: Engineerin...
IncQuery-D: Distributed Incremental Model Queries over the Cloud: Engineerin...IncQuery-D: Distributed Incremental Model Queries over the Cloud: Engineerin...
IncQuery-D: Distributed Incremental Model Queries over the Cloud: Engineerin...
 
Generative Models for General Audiences
Generative Models for General AudiencesGenerative Models for General Audiences
Generative Models for General Audiences
 
YU CS Summer 2021 Project | TensorFlow Street Image Classification and Object...
YU CS Summer 2021 Project | TensorFlow Street Image Classification and Object...YU CS Summer 2021 Project | TensorFlow Street Image Classification and Object...
YU CS Summer 2021 Project | TensorFlow Street Image Classification and Object...
 
A neural image caption generator
A neural image caption generatorA neural image caption generator
A neural image caption generator
 
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
 

Recently uploaded

+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
Health
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
HenryBriggs2
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 

Recently uploaded (20)

Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Learn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic MarksLearn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic Marks
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Air Compressor reciprocating single stage
Air Compressor reciprocating single stageAir Compressor reciprocating single stage
Air Compressor reciprocating single stage
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to Computers
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 

Minor Project Report on Denoising Diffusion Probabilistic Model

  • 1. MINOR PROJECT REPORT By: Druv Gera DENOISING DIFFUSION PROBABILISTIC MODEL
  • 3. INTRODUCTION • Evolution of Generative Models: • Rapid progress in recent years. • Capable of generating human-like language, synthetic images, and diverse audio. • Challenges with Current Techniques: • Existing computer vision techniques predict predetermined object characteristics. • Limitations in generality and usability. • Alternative Approach - Learning from Raw Text: • Proposal to learn directly from raw text describing the image. • Overcomes limitations of predefined labels. 3
  • 4. INTRODUCTION – CONT.… • Introduction to CLIP: • Contrastive models like CLIP as a key inspiration. • Demonstrates robust image representations capturing both semantics and style. • Project Objectives: • Two-stage model proposed: • Prior generating a CLIP image embedding from a given text. • Decoder generating an image based on these CLIP image embeddings. 4
  • 6. METHODOLOGY • CLIP as a Representation Learner: • CLIP (Contrastive Language-Image Pretraining) recognized for robust image representations. • Desirable properties, including robustness and applicability to various tasks. • Two-Stage Model: Prior and Decoder: • Introduction of the proposed two-stage model for image generation. • Prior Mechanism: • Generates a CLIP image embedding from a given text caption. • Decoder: • Generates an image based on CLIP image embeddings. • Leveraging CLIP Image Embeddings: • CLIP image embeddings used as a bridge between textual descriptions and image generation. • Enhances generality and adaptability in the image generation process. 6
  • 7. METHODOLOGY • Combining CLIP and Diffusion Models: • Synergy between CLIP and diffusion models for improved image synthesis. • Language-guided image manipulations achieved through the joint embedding space of CLIP. • Language-Guided Image Manipulations: • Application of CLIP for language-guided modifications in generated images. • Explores the joint embedding space of CLIP for enhanced control and manipulation 7
  • 8. DIFFUSION MODELS • Introduction to Diffusion Models: • Diffusion models represent a state-of-the-art family of deep generative models. • Break the long-time dominance of GANs in challenging image synthesis tasks. • Role of Denoising Diffusion Probabilistic Models (DDPM): • DDPMs use two Markov chains: forward and reverse. • Forward chain perturbs data to noise, while the reverse chain converts noise back to data. • Three Predominant Formulations: • DDPMs, Score-Based Generative Models (SGMs), and Stochastic Differential Equations (Score SDEs). • Each formulation offers unique features and benefits for the diffusion model. 8
  • 9. DIFFUSION MODELS • Training the Diffusion Model on Stanford Cars Dataset: • Dataset: Stanford Cars dataset with approximately 8000 images in the train set. • Training process involves progressively destructing data by injecting noise. • Forward and Reverse Markov Chains in Diffusion Models: • Forward process injects noise into data until all structures are lost. • Reverse process gradually removes noise by running a learnable Markov chain. 9
  • 10. DENOISING DIFFUSION PROBABILISTIC MODELS (DDPM) A denoising diffusion probabilistic model (DDPM) makes use of two Markov chains: a forward chain that perturbs data to noise, and a reverse chain that converts noise back to data. New data points are subsequently generated by first sampling a random vector from the prior distribution, followed by ancestral sampling through the reverse Markov chain. Using the chain rule of probability and the Markov property, we can factorize the joint distribution of x1, x2 . . . xT conditioned on x0, denoted as q(x1, . . . , xT| x0), into 10
  • 11. TRAINING We trained the diffusion model to generate images of cars as per the theory explained above. As a dataset, we used the Stanford Cars dataset which consists of around 8000 images in the trainset 11
  • 12. TRAINING RESULTS • Training the Diffusion Model on Stanford Cars Dataset: • Utilized the Stanford Cars dataset comprising around 8000 images in the training set. • Focus on generating images of cars as per the project's objectives. • Results from the Forward Markov Chain: • Forward process involves injecting noise into data progressively. • Reverse Markov Chain Using Simple UNet Architecture: • Utilized a Simple UNet Architecture for the reverse Markov chain. • Trained on the dataset for 100 epochs to optimize the learnable transition kernel. 12
  • 15. RESULTS 15 • Explored diffusion models comprehensively. • Showed diffusion models, as likelihood-based models, outperform state-of-the-art GANs. • Highlighted the stationary training objective's role in achieving superior sample quality. • Introduced an improved architecture, successfully applied to unconditional image generation. • Presented a classifier guidance technique for extending quality to class-conditional tasks.
  • 17. REFERENCES 17 [1] Aditya Ramesh, Prafulla Dhariwal, Alex Nicho, Casey Chu, Mark Chen. Hierarchical Text-Conditional Image Generation with CLIP Latents, arXiv:2204.06125 [2] Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever. Learning Transferable Visual Models From Natural Language Supervision, arXiv:2103.000201
  • 18. THANK YOU DRUV GERA COMPUTER SCIENCE DELHI TECHNICAL UNIVERSITY