"Meta-Learned Confidence for Few-shot Learning" was presented at CVPR in 2020.
1. Meta-Learned Confidence for Few-shot Learning
Seong Min Kye1, Hae Beom Lee1, Hoirin Kim1, Sung Ju Hwang1,2
1KAIST, 2AITRICS, South Korea
Computer Vision and Pattern Recognition (CVPR 2020)
These slides were made by Minha Kim (kimminha@g.skku.edu)
2. Introduction
Few-shot learning is an important challenge under data scarcity.
When there is both data scarcity and a lot of unlabeled data, popular approaches are:
a) leveraging a nearest neighbor graph
b) using predicted soft or hard labels on unlabeled samples to update the class prototype.
However, the model confidence may be unreliable, which may lead to incorrect predictions.
The authors introduce a novel confidence-based transductive inference scheme for metric-based meta-learning models.
4. Meta Learning?
Support Set Query Set
Task 1
Task 2
Meta Learning
Meta Test
the data similarity between the support set and the query set
allows the model to derive learning patterns
5. Overview - Model and data perturbation
< Data perturbation >
1. apply a horizontal flip to the support set
2. apply a horizontal flip
+ shifting
+ RandAugment
+ CutOut
to the query set
Data perturbation achieves the same effect
as regularization without an explicit consistency loss
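The asymmetric augmentation above (light on the support set, heavy on the query set) can be sketched in a few lines of numpy. This is a minimal illustration with assumed shapes (5-way episode, 32x32 RGB images); the paper's full query chain additionally uses RandAugment and CutOut, which are omitted here for brevity.

```python
import numpy as np

def flip_horizontal(img):
    # Horizontal flip: reverse the width axis of an H x W x C image.
    return img[:, ::-1, :]

def shift(img, dy=2, dx=2):
    # Simple shift perturbation: roll pixels by (dy, dx) with wrap-around.
    return np.roll(img, shift=(dy, dx), axis=(0, 1))

# Hypothetical 5-way episode; shapes and counts are assumptions.
support = np.random.rand(5, 32, 32, 3)   # support set: flip only
query = np.random.rand(15, 32, 32, 3)    # query set: heavier augmentation chain

support_aug = np.stack([flip_horizontal(x) for x in support])
query_aug = np.stack([shift(flip_horizontal(x)) for x in query])
```

Because the two sets receive different augmentations, matching a query to its class prototype implicitly enforces consistency across perturbations without adding an explicit consistency loss term.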
6. < Model >
1. a sub-network generated by dropping a block (perturbation)
2. the full network (no perturbation)
so the meta-learned confidence can better account for
uncertainties at unseen tasks.
8. Overview
2. Confidence score using soft k-means
Notation: embedding function; D: network with a layer dropped; A: image with a horizontal flip applied
9. 3. Updating the prototype
Notation: embedding function; D: network with a layer dropped; A: image with a horizontal flip applied
13. Conclusion
• We proposed to tackle these issues by meta-learning confidence scores, such that the prototypes updated with the meta-learned scores optimize transductive inference performance.
• We proposed to meta-learn the parameters of the length-scaling function, such that the proper distance metric for the confidence scores can be automatically determined.
• To enhance the quality of the confidence scores, we suggest a consistency regularization over both data and embeddings.
• We validated our transductive inference model on four benchmark datasets and obtained state-of-the-art performance on both transductive and semi-supervised few-shot classification tasks.
Before I explain the details, here is an introduction and some background.
Few-shot learning, the problem of learning under data scarcity, is an important challenge in deep
learning, as a large number of training instances may not be available in many real-world settings.
When there is not only data scarcity but also a lot of unlabeled data, the usual approaches to this problem
are the nearest neighbor graph method or using predicted soft or hard labels on the unlabeled samples.
2. When there is a lot of unlabeled data together with data scarcity,
popular approaches include leveraging a nearest neighbor graph or using predicted soft or hard labels on unlabeled samples to update the class prototype.
3. A popular transductive inference technique for few-shot metric-based approaches is to update the prototype of each class with the mean of the most confident query examples, or a confidence-weighted average of all the query samples. However, a caveat here is that the model confidence may be unreliable, which may lead to incorrect predictions.
Aim:
To tackle this issue, we propose to meta-learn the confidence for each query sample, assigning optimal weights to unlabeled queries such that they improve the model's transductive inference performance on unseen tasks.
Before explaining this paper,
I'll explain the difference between transductive inference and inductive inference.
Then I'll explain meta-learning.
Transduction, or transductive inference, infers particular test data directly from particular observed data.
In contrast, inductive inference predicts test data from a model trained on a lot of training data; you can think of the classifiers we generally know.
This distinction matters when the model's prediction cannot be obtained by any inductive model.
For example, training a classifier on both dogs and cats so that a later test set can be well predicted is inductive inference.
In other words, if the training data is too small to be learned from, or much of it is unlabeled, you can use transductive inference.
So transductive inference is one form of semi-supervised learning.
In meta-learning, within each task, the data similarity between the support set and the query set allows the model to derive learning patterns on its own and improve generalization performance.
In this work, they use this unlabeled data itself as the query set for meta-learning,
and they propose a novel confidence-based transductive inference scheme for metric-based meta-learning models.
This is the overview of Meta-Learned Confidence for Few-shot Learning.
First, they use both data perturbation and model perturbation.
To further enhance the reliability of the learned confidence,
they introduce various types of model and data perturbations during meta-learning.
First, they apply various augmentations to disjoint sets rather than to the same instance, which achieves the same effect as regularization without an explicit consistency loss.
They also consider two confidence scores, one from the full network and the other from a sub-network generated by dropping a block.
These approaches allow the meta-learned confidence to better account for uncertainties at unseen tasks.
Then, after building embedding functions for the data with and without perturbation,
the Euclidean distance metric is computed.
The distance metric is defined as the Euclidean distance with normalization and either instance-wise metric scaling g_i or pair-wise metric scaling g_p.
Both scalings are used in semi-supervised learning for correctly assigning confidence scores to unlabeled data.
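A minimal numpy sketch of this normalized, scaled distance, assuming a single query embedding and a matrix of class prototypes. In the paper the length scale g is produced by a meta-learned scaling network (instance-wise g_i or pair-wise g_p); here it is a fixed scalar purely for illustration.

```python
import numpy as np

def scaled_distance(z, prototypes, g=1.0):
    # Normalize the query embedding and each prototype to unit length,
    # then compute squared Euclidean distance divided by length scale g.
    # z: (D,) query embedding; prototypes: (C, D) class prototypes.
    z = z / np.linalg.norm(z)
    p = prototypes / np.linalg.norm(prototypes, axis=1, keepdims=True)
    return np.sum((z - p) ** 2, axis=1) / g
```

A smaller g stretches distances apart, sharpening the confidence distribution that is computed from them; meta-learning g lets the model pick the right sharpness per task.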
Then we calculate the confidence score and the prototype.
This step modifies the prototype clusters using the support set and the unlabeled query set, using soft k-means clustering, which is differentiable.
Equation 2 is the confidence score obtained from soft k-means.
In other words, it is the probability of a query belonging to each class c.
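The soft k-means confidence described here amounts to a softmax over negative (scaled) distances to the class prototypes. A minimal sketch, with the function name chosen for illustration:

```python
import numpy as np

def confidence_scores(dists):
    # Soft k-means assignment: the probability of a query belonging to
    # each class c is a softmax over negative distances to the prototypes.
    logits = -dists
    logits = logits - logits.max()  # subtract max for numerical stability
    e = np.exp(logits)
    return e / e.sum()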
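```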
Finally,
using the embeddings with and without perturbation and the confidence score from Equation 2,
we obtain the updated prototype from Equation 3.
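The prototype update described here takes, per class, the weighted mean of the labeled support embeddings (weight 1) and all query embeddings weighted by their soft k-means confidence for that class. A minimal sketch; the function and argument names are illustrative, not the paper's:

```python
import numpy as np

def update_prototypes(support_emb, support_y, query_emb, conf, n_classes):
    # Transductive prototype update: each class prototype becomes the
    # confidence-weighted mean of its support embeddings (weight 1) and
    # all query embeddings (weight = confidence for that class).
    protos = np.zeros((n_classes, support_emb.shape[1]))
    for c in range(n_classes):
        mask = support_y == c
        num = support_emb[mask].sum(axis=0) \
            + (conf[:, c:c + 1] * query_emb).sum(axis=0)
        den = mask.sum() + conf[:, c].sum()
        protos[c] = num / den
    return protos
```

If the confidence on a query is near zero for a class, that query barely moves the class prototype, which is exactly why unreliable confidence scores are worth meta-learning.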
Algorithm 1 shows what I've explained so far.
In summary:
This table shows few-shot classification performance on miniImageNet and tieredImageNet,
both of which are used for meta-learning.
As you can see, the top rows of this table show the accuracy of MCI and the existing inductive inference methods for few-shot classification.
The bottom rows of this table show the accuracy of MCT and the transductive inference methods.
First, MCI (Meta Confidence Induction) is defined as the proposed metric with consistency regularization only.
The second result is about transductive inference:
MCT (Meta Confidence Transduction) performs transductive inference with the meta-learned confidence.
Both achieve new state-of-the-art results on all the datasets, with particularly good performance on one-shot classification.
Also, performance was highest when both data perturbation and model perturbation were applied.