SlideShare a Scribd company logo
1 of 26
Download to read offline
PR-331
Sungchul Kim
Contents
1. Introduction & Related Work
2. (Conclusion)
3. Empirical Evaluation
4. Pitfalls and Limitations
5. Conclusion
2
Introduction & Related Work
3
• What is ‘Model Calibration’?
• The accuracy with which the scores provided by the model reflect its predictive uncertainty
• It is important in safety-critical applications, such as autonomous driving, medical diagnosis, forecasting
• On Calibration of Modern Neural Networks, ICML 2017. (PR-075)
https://arxiv.org/abs/1706.04599
PR-075: On Calibration of Modern Neural Networks (2017)
Introduction & Related Work
4
• On Calibration of Modern Neural Networks, ICML 2017. (PR-075)
• Expected Calibration Error (ECE)
https://arxiv.org/abs/1706.04599
PR-075: On Calibration of Modern Neural Networks (2017)
Video by Geoff Pleiss
Neural
Network
Input 𝑥
Logits 𝑧 𝑥
𝑒𝑧 𝑥
σ𝑖 𝑒𝑧𝑖 𝑥
Probability Ƹ
𝑝 𝑥
0 ≤ Ƹ
𝑝 𝑥 < 0.2 0.2 ≤ Ƹ
𝑝 𝑥 < 0.4 0.4 ≤ Ƹ
𝑝 𝑥 < 0.6 0.6 ≤ Ƹ
𝑝 𝑥 < 0.8 0.8 ≤ Ƹ
𝑝 𝑥 < 1
Introduction & Related Work
5
• On Calibration of Modern Neural Networks, ICML 2017. (PR-075)
• Expected Calibration Error (ECE)
0 ≤ Ƹ
𝑝 𝑥 < 0.2 0.2 ≤ Ƹ
𝑝 𝑥 < 0.4 0.4 ≤ Ƹ
𝑝 𝑥 < 0.6 0.6 ≤ Ƹ
𝑝 𝑥 < 0.8 0.8 ≤ Ƹ
𝑝 𝑥 < 1
X O O O
O
O
O
https://arxiv.org/abs/1706.04599
PR-075: On Calibration of Modern Neural Networks (2017)
Video by Geoff Pleiss
Avg. conf : 0.86
Accuracy : 0.86
Introduction & Related Work
6
• On Calibration of Modern Neural Networks, ICML 2017. (PR-075)
• Expected Calibration Error (ECE)
0 ≤ Ƹ
𝑝 𝑥 < 0.2 0.2 ≤ Ƹ
𝑝 𝑥 < 0.4 0.4 ≤ Ƹ
𝑝 𝑥 < 0.6 0.6 ≤ Ƹ
𝑝 𝑥 < 0.8 0.8 ≤ Ƹ
𝑝 𝑥 < 1
X O O
O
O
O
Avg. conf : 0.86
Accuracy : 0.71
Gap : 0.15
X
https://arxiv.org/abs/1706.04599
PR-075: On Calibration of Modern Neural Networks (2017)
Video by Geoff Pleiss
Introduction & Related Work
7
• On Calibration of Modern Neural Networks, ICML 2017. (PR-075)
• Expected Calibration Error (ECE)
0 ≤ Ƹ
𝑝 𝑥 < 0.2 0.2 ≤ Ƹ
𝑝 𝑥 < 0.4 0.4 ≤ Ƹ
𝑝 𝑥 < 0.6 0.6 ≤ Ƹ
𝑝 𝑥 < 0.8 0.8 ≤ Ƹ
𝑝 𝑥 < 1
X O O
O
O
O
0.86
0.71
7 x 0.15
X
https://arxiv.org/abs/1706.04599
PR-075: On Calibration of Modern Neural Networks (2017)
Video by Geoff Pleiss
X O
X
O
O
O
O
X
Avg. conf :
Accuracy :
Gap :
0.74
0.67
6 x 0.07
0.55
0.50
2 x 0.05
15
Expected Calibration Error (ECE) : 0.11 (Naeini et al. 2015)
Introduction & Related Work
8
• On Calibration of Modern Neural Networks, ICML 2017. (PR-075)
• Reliability Diagrams
acc 𝐵𝑚 =
1
𝐵𝑚
σ𝑖∈𝐵𝑚
1 ො
𝑦𝑖 = 𝑦𝑖 , conf 𝐵𝑚 =
1
𝐵𝑚
σ𝑖∈𝐵𝑚
Ƹ
𝑝𝑖
• Expected Calibration Error (ECE)
ECE = ෍
𝑚=1
𝑀
𝐵𝑚
𝑛
acc 𝐵𝑚 − conf 𝐵𝑚
• Negative log likelihood (NLL)
ℒ = − ෍
𝑖=1
𝑛
log ො
𝜋 𝑦𝑖|𝑥𝑖
https://arxiv.org/abs/1706.04599
PR-075: On Calibration of Modern Neural Networks (2017)
Video by Geoff Pleiss
Introduction & Related Work
9
• On Calibration of Modern Neural Networks, ICML 2017. (PR-075)
• Temperature Scaling
ො
𝑞𝑖 = max
𝑘
𝜎SM 𝑧𝑖/𝑇 (𝑘)
https://arxiv.org/abs/1706.04599
PR-075: On Calibration of Modern Neural Networks (2017)
Video by Geoff Pleiss
Introduction & Related Work
10
• Why should ‘Model Calibration’ be considered in safety-critical applications?
Video by Geoff Pleiss
Model : plastic bag (55% conf.)
Other sensors : person (90% conf.)
Model : cyst (55% conf.)
Clinician : HCC (90% conf.)
Introduction & Related Work
11
• Some works suggest a trend for larger, more accurate models to be worse calibrated.
• Recent researches have focused on improving accuracy rather than calibration.
• architecture size, amount of training data, and computing power
• Architecture : MLP-Mixer, ViT, …
• Training approaches : SimCLR, ResNext-WSL, CLIP, …
Q. Do past results on calibration extend to current state-of-the-art models?
https://arxiv.org/abs/1706.04599
Empirical Evaluation
12
• Experimental Setup
• Model families
• MLP-Mixer (PR-317)
• ViT (PR-281)
• BiT
• ResNeXt-WSL
• SimCLR (PR-231)
• CLIP
• AlexNet
https://arxiv.org/abs/2105.01601
https://arxiv.org/abs/2010.11929
https://arxiv.org/abs/1912.11370
https://arxiv.org/abs/1805.00932
https://arxiv.org/abs/2002.05709
https://arxiv.org/abs/2103.00020
https://papers.nips.cc/paper/2012/file/c399862d3b9d6b76c8436e924a68c45b-Paper.pdf
Empirical Evaluation
13
• Experimental Setup
• Datasets
• ImageNet
• ImageNetV2
• ImageNet-C (PR-277)
• ImageNet-R
• ImageNet-A
• ImageNet-21K
• JFT-300
https://arxiv.org/abs/1902.10811
https://arxiv.org/abs/1903.12261
https://arxiv.org/abs/2006.16241
https://arxiv.org/abs/1907.07174
https://ieeexplore.ieee.org/document/5206848
https://arxiv.org/abs/1707.02968
(Conclusion)
14
• Modern image models are well calibrated across distribution shift despite being designed with a focus on
accuracy.
• Model size and pretraining amount do not fully account for the performance differences between families.
• Architecture is a major determinant of calibration!
• MLP-Mixer and ViT (non-conv) are among the best-calibrated models both in-distribution and OOD.
• Further work on the influence of architecture inductive biases on calibration and OOD robustness will be
necessary to tell whether these results generalize.
Empirical Evaluation
15
• In-Distribution Calibration
• Several recent model families (MLP-Mixer, ViT, and BiT) are both highly accurate and well-calibrated.
• A zero-shot model, CLIP, is well-calibrated given its accuracy.
Empirical Evaluation
16
• In-Distribution Calibration
• Temperature scaling reveals consistent properties of model families.
• The poor calibration of past models can often be remedied by post-hoc recalibration. (temperature scaling)
→ Does a difference between models remain after recalibration?
→ The most recent architectures are better calibrated than past models even after temperature scaling.
• SimCLR or ResNeXt-WSL tend to be worse calibrated than BiT.
• MLP-Mixer and ViT (non-conv) can perform as well.
Empirical Evaluation
17
• In-Distribution Calibration
• Differences between families are not explained by model size or pretraining amount
• Disentangle how the differences between model families affect their calibration properties
• Model size
• Prior work : Larger neural networks are worse calibrated and have lower classification error.
• This study : ViT models are better calibrated than BiT models.
• Changing the size of BiT model cannot move it into the Pareto set of ViT models.
Empirical Evaluation
18
• In-Distribution Calibration
• Differences between families are not explained by model size or pretraining amount
• Disentangle how the differences between model families affect their calibration properties
• Model pretraining
• BiT models pretrained on ImageNet (1.3M), ImageNet-21K (12.8M), and JFT-300 (300M)
• It has no consistent effect on calibration. (ViT, MLP-Mixer < BiT < SimCLR)
• The amount of pretraining data and the duration of pretraining
Empirical Evaluation
19
• Accuracy and Calibration Under Distribution Shift
• ImageNet-C
Empirical Evaluation
20
• Accuracy and Calibration Under Distribution Shift
• What degree the insights from in-distribution and ImageNet-C calibration transfer to natural OOD data
• Previous : Better in-distribution calibration and accuracy do not predict better calibration under distribution shift.
• This study : The performance on several natural OOD datasets is largely consistent with that on ImageNet.
Empirical Evaluation
21
• Relating Accuracy and Calibration Within Model Families
• How to compare individual models within a family, where one model is more accurate but worse
calibrated, and the other is less accurate but better calibrated.
→ Which model should a practitioner choose for a safety-critical application?
• The cost structure of the specific application
• The scenario of selective prediction
• One can choose to ignore the model prediction (“abstain”) at a fixed cost if the prediction confidence is low, rather
than risking a (more costly) prediction error.
Empirical Evaluation
22
• Relating Accuracy and Calibration Within Model Families
• Total cost : a linear combination of the misclassification cost and abstention cost at a given cost ratio (x-
axis) and abstention rate (y-axis).
• Blue regions : The higher-accuracy model (R152x4) achieves a lower cost that the better-calibrated model (R50x1).
Pitfalls and Limitations
23
• There are two sources of bias
• From estimating ECE by binning : always negative
• From the finite sample size used to estimate the per-bin statistics : always positive
• A larger number of bins → fewer points per bin → higher variance in each bin → positive bias of ECE
Pitfalls and Limitations
24
• More formally,
• To estimate accuracy 𝐵𝑖 precisely, we need a number of samples inversely proportional to the standard
deviation 𝒑𝒊 𝟏 − 𝒑𝒊 / 𝑩𝒊 , where 𝑝𝑖 is the expected accuracy in 𝐵𝑖.
• Bias : models with extreme average accuracies (i.e. close to 0 or 1) < models with an accuracy close to 0.5
• If we estimate 𝔼 accuracy 𝐵𝑖 − 𝔼 confidence 𝐵𝑖
2
for any bin 𝑖 with 𝑛𝑖 samples
𝑏𝑖𝑎𝑠 =
1
𝑛𝑖
𝕍 𝐴 + 𝕍 𝐶 − 2Cov 𝐶, 𝐴
• 𝐴 = 𝑌 ∈ arg max 𝑓 𝑋 , 𝐶 = max 𝑓 𝑋
• Higher accuracy models have a lower bias not only due to higher accuracy (lower 𝕍 𝑨 ), but also because their
outputs correlate more with the correct label (higher covariance).
Conclusion
25
• Modern image models are well calibrated across distribution shift despite being designed with a focus on
accuracy.
• Model size and pretraining amount do not fully account for the performance differences between families.
• Architecture is a major determinant of calibration!
• MLP-Mixer and ViT (non-conv) are among the best-calibrated models both in-distribution and OOD.
• Further work on the influence of architecture inductive biases on calibration and OOD robustness will be
necessary to tell whether these results generalize.
Thank you!
26

More Related Content

What's hot

Batch normalization presentation
Batch normalization presentationBatch normalization presentation
Batch normalization presentationOwin Will
 
Representational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual LearningRepresentational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual LearningMLAI2
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionDADAJONJURAKUZIEV
 
State of transformers in Computer Vision
State of transformers in Computer VisionState of transformers in Computer Vision
State of transformers in Computer VisionDeep Kayal
 
Object detection and Instance Segmentation
Object detection and Instance SegmentationObject detection and Instance Segmentation
Object detection and Instance SegmentationHichem Felouat
 
Focal loss for dense object detection
Focal loss for dense object detectionFocal loss for dense object detection
Focal loss for dense object detectionDaeHeeKim31
 
Object Detection Methods using Deep Learning
Object Detection Methods using Deep LearningObject Detection Methods using Deep Learning
Object Detection Methods using Deep LearningSungjoon Choi
 
Object Detection with Transformers
Object Detection with TransformersObject Detection with Transformers
Object Detection with TransformersDatabricks
 
The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...Evention
 
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image TransformationDeep Learning JP
 
Feature Engineering
Feature EngineeringFeature Engineering
Feature EngineeringHJ van Veen
 
論文紹介:Multimodal Learning with Transformers: A Survey
論文紹介:Multimodal Learning with Transformers: A Survey論文紹介:Multimodal Learning with Transformers: A Survey
論文紹介:Multimodal Learning with Transformers: A SurveyToru Tamaki
 
Attention is All You Need (Transformer)
Attention is All You Need (Transformer)Attention is All You Need (Transformer)
Attention is All You Need (Transformer)Jeong-Gwan Lee
 
Relational knowledge distillation
Relational knowledge distillationRelational knowledge distillation
Relational knowledge distillationNAVER Engineering
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...Jinwon Lee
 
【DL輪読会】ViT + Self Supervised Learningまとめ
【DL輪読会】ViT + Self Supervised Learningまとめ【DL輪読会】ViT + Self Supervised Learningまとめ
【DL輪読会】ViT + Self Supervised LearningまとめDeep Learning JP
 
Introduction to Visual transformers
Introduction to Visual transformers Introduction to Visual transformers
Introduction to Visual transformers leopauly
 
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用SSII
 

What's hot (20)

Batch normalization presentation
Batch normalization presentationBatch normalization presentation
Batch normalization presentation
 
Representational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual LearningRepresentational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual Learning
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection
 
State of transformers in Computer Vision
State of transformers in Computer VisionState of transformers in Computer Vision
State of transformers in Computer Vision
 
Object detection and Instance Segmentation
Object detection and Instance SegmentationObject detection and Instance Segmentation
Object detection and Instance Segmentation
 
Focal loss for dense object detection
Focal loss for dense object detectionFocal loss for dense object detection
Focal loss for dense object detection
 
Mask R-CNN
Mask R-CNNMask R-CNN
Mask R-CNN
 
Object Detection Methods using Deep Learning
Object Detection Methods using Deep LearningObject Detection Methods using Deep Learning
Object Detection Methods using Deep Learning
 
Object Detection with Transformers
Object Detection with TransformersObject Detection with Transformers
Object Detection with Transformers
 
The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...
 
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
 
Yolov5
Yolov5 Yolov5
Yolov5
 
Feature Engineering
Feature EngineeringFeature Engineering
Feature Engineering
 
論文紹介:Multimodal Learning with Transformers: A Survey
論文紹介:Multimodal Learning with Transformers: A Survey論文紹介:Multimodal Learning with Transformers: A Survey
論文紹介:Multimodal Learning with Transformers: A Survey
 
Attention is All You Need (Transformer)
Attention is All You Need (Transformer)Attention is All You Need (Transformer)
Attention is All You Need (Transformer)
 
Relational knowledge distillation
Relational knowledge distillationRelational knowledge distillation
Relational knowledge distillation
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
 
【DL輪読会】ViT + Self Supervised Learningまとめ
【DL輪読会】ViT + Self Supervised Learningまとめ【DL輪読会】ViT + Self Supervised Learningまとめ
【DL輪読会】ViT + Self Supervised Learningまとめ
 
Introduction to Visual transformers
Introduction to Visual transformers Introduction to Visual transformers
Introduction to Visual transformers
 
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
 

Similar to Revisiting the Calibration of Modern Neural Networks

深度學習在AOI的應用
深度學習在AOI的應用深度學習在AOI的應用
深度學習在AOI的應用CHENHuiMei
 
addressing tim/quality trade-off in view maintenance
addressing tim/quality trade-off in view maintenanceaddressing tim/quality trade-off in view maintenance
addressing tim/quality trade-off in view maintenanceSoheila Dehghanzadeh
 
Optimizing SPARQL Query Processing On Dynamic and Static Data Based on Query ...
Optimizing SPARQL Query Processing On Dynamic and Static Data Based on Query ...Optimizing SPARQL Query Processing On Dynamic and Static Data Based on Query ...
Optimizing SPARQL Query Processing On Dynamic and Static Data Based on Query ...Soheila Dehghanzadeh
 
Augmix review [cdm]
Augmix review [cdm]Augmix review [cdm]
Augmix review [cdm]Dongmin Choi
 
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- EvaluationBridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- EvaluationThomas Ploetz
 
MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly DetectionMVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly DetectionLEE HOSEONG
 
Hybrid predictive modelling of geometry with limited data in cold spray addit...
Hybrid predictive modelling of geometry with limited data in cold spray addit...Hybrid predictive modelling of geometry with limited data in cold spray addit...
Hybrid predictive modelling of geometry with limited data in cold spray addit...Daiki Ikeuchi
 
Application of machine learning and cognitive computing in intrusion detectio...
Application of machine learning and cognitive computing in intrusion detectio...Application of machine learning and cognitive computing in intrusion detectio...
Application of machine learning and cognitive computing in intrusion detectio...Mahdi Hosseini Moghaddam
 
quality control STUDY ON 3 POLE MCCB MBA SIP report
quality control STUDY ON 3 POLE MCCB MBA SIP report quality control STUDY ON 3 POLE MCCB MBA SIP report
quality control STUDY ON 3 POLE MCCB MBA SIP report Akshay Nair
 
Mx net image segmentation to predict and diagnose the cardiac diseases karp...
Mx net image segmentation to predict and diagnose the cardiac diseases   karp...Mx net image segmentation to predict and diagnose the cardiac diseases   karp...
Mx net image segmentation to predict and diagnose the cardiac diseases karp...KannanRamasamy25
 
|QAB> : Quantum Computing, AI and Blockchain
|QAB> : Quantum Computing, AI and Blockchain|QAB> : Quantum Computing, AI and Blockchain
|QAB> : Quantum Computing, AI and BlockchainKan Yuenyong
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)DonghyunKang12
 
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Chih-Chuan Cheng
 
Comparative Study of Pre-Trained Neural Network Models in Detection of Glaucoma
Comparative Study of Pre-Trained Neural Network Models in Detection of GlaucomaComparative Study of Pre-Trained Neural Network Models in Detection of Glaucoma
Comparative Study of Pre-Trained Neural Network Models in Detection of GlaucomaIRJET Journal
 
Blood Cell Image Classification for Detecting Malaria using CNN
Blood Cell Image Classification for Detecting Malaria using CNNBlood Cell Image Classification for Detecting Malaria using CNN
Blood Cell Image Classification for Detecting Malaria using CNNIRJET Journal
 
DATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITODATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITOMarcoMellia
 

Similar to Revisiting the Calibration of Modern Neural Networks (20)

OOD_PPT.pptx
OOD_PPT.pptxOOD_PPT.pptx
OOD_PPT.pptx
 
深度學習在AOI的應用
深度學習在AOI的應用深度學習在AOI的應用
深度學習在AOI的應用
 
addressing tim/quality trade-off in view maintenance
addressing tim/quality trade-off in view maintenanceaddressing tim/quality trade-off in view maintenance
addressing tim/quality trade-off in view maintenance
 
Optimizing SPARQL Query Processing On Dynamic and Static Data Based on Query ...
Optimizing SPARQL Query Processing On Dynamic and Static Data Based on Query ...Optimizing SPARQL Query Processing On Dynamic and Static Data Based on Query ...
Optimizing SPARQL Query Processing On Dynamic and Static Data Based on Query ...
 
Augmix review [cdm]
Augmix review [cdm]Augmix review [cdm]
Augmix review [cdm]
 
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- EvaluationBridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
 
MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly DetectionMVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
MVTec AD: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection
 
Hybrid predictive modelling of geometry with limited data in cold spray addit...
Hybrid predictive modelling of geometry with limited data in cold spray addit...Hybrid predictive modelling of geometry with limited data in cold spray addit...
Hybrid predictive modelling of geometry with limited data in cold spray addit...
 
Application of machine learning and cognitive computing in intrusion detectio...
Application of machine learning and cognitive computing in intrusion detectio...Application of machine learning and cognitive computing in intrusion detectio...
Application of machine learning and cognitive computing in intrusion detectio...
 
quality control STUDY ON 3 POLE MCCB MBA SIP report
quality control STUDY ON 3 POLE MCCB MBA SIP report quality control STUDY ON 3 POLE MCCB MBA SIP report
quality control STUDY ON 3 POLE MCCB MBA SIP report
 
Mx net image segmentation to predict and diagnose the cardiac diseases karp...
Mx net image segmentation to predict and diagnose the cardiac diseases   karp...Mx net image segmentation to predict and diagnose the cardiac diseases   karp...
Mx net image segmentation to predict and diagnose the cardiac diseases karp...
 
Generalization Ability of MOS Prediction Networks
Generalization Ability of MOS Prediction NetworksGeneralization Ability of MOS Prediction Networks
Generalization Ability of MOS Prediction Networks
 
|QAB> : Quantum Computing, AI and Blockchain
|QAB> : Quantum Computing, AI and Blockchain|QAB> : Quantum Computing, AI and Blockchain
|QAB> : Quantum Computing, AI and Blockchain
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)
 
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
 
Poster_group22
Poster_group22Poster_group22
Poster_group22
 
Comparative Study of Pre-Trained Neural Network Models in Detection of Glaucoma
Comparative Study of Pre-Trained Neural Network Models in Detection of GlaucomaComparative Study of Pre-Trained Neural Network Models in Detection of Glaucoma
Comparative Study of Pre-Trained Neural Network Models in Detection of Glaucoma
 
Blood Cell Image Classification for Detecting Malaria using CNN
Blood Cell Image Classification for Detecting Malaria using CNNBlood Cell Image Classification for Detecting Malaria using CNN
Blood Cell Image Classification for Detecting Malaria using CNN
 
Seminar nov2017
Seminar nov2017Seminar nov2017
Seminar nov2017
 
DATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITODATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITO
 

More from Sungchul Kim

PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionPR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionSungchul Kim
 
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersEmerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersSungchul Kim
 
PR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningPR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningSungchul Kim
 
Score based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsScore based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsSungchul Kim
 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningSungchul Kim
 
Revisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorRevisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorSungchul Kim
 
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Sungchul Kim
 
Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Sungchul Kim
 
Going Deeper with Convolutions
Going Deeper with ConvolutionsGoing Deeper with Convolutions
Going Deeper with ConvolutionsSungchul Kim
 
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationGrad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationSungchul Kim
 
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...Sungchul Kim
 
Panoptic Segmentation
Panoptic SegmentationPanoptic Segmentation
Panoptic SegmentationSungchul Kim
 
On the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondOn the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondSungchul Kim
 
A Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksA Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksSungchul Kim
 
KDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksKDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksSungchul Kim
 
Designing Network Design Spaces
Designing Network Design SpacesDesigning Network Design Spaces
Designing Network Design SpacesSungchul Kim
 
Search to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSearch to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSungchul Kim
 
Supervised Constrastive Learning
Supervised Constrastive LearningSupervised Constrastive Learning
Supervised Constrastive LearningSungchul Kim
 
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningBootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningSungchul Kim
 
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...Sungchul Kim
 

More from Sungchul Kim (20)

PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo SupervisionPR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
 
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersEmerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
 
PR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation LearningPR-305: Exploring Simple Siamese Representation Learning
PR-305: Exploring Simple Siamese Representation Learning
 
Score based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential EquationsScore based Generative Modeling through Stochastic Differential Equations
Score based Generative Modeling through Stochastic Differential Equations
 
Exploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation LearningExploring Simple Siamese Representation Learning
Exploring Simple Siamese Representation Learning
 
Revisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object DetectorRevisiting the Sibling Head in Object Detector
Revisiting the Sibling Head in Object Detector
 
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
Do Wide and Deep Networks Learn the Same Things: Uncovering How Neural Networ...
 
Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+Deeplabv1, v2, v3, v3+
Deeplabv1, v2, v3, v3+
 
Going Deeper with Convolutions
Going Deeper with ConvolutionsGoing Deeper with Convolutions
Going Deeper with Convolutions
 
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based LocalizationGrad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
 
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Reg...
 
Panoptic Segmentation
Panoptic SegmentationPanoptic Segmentation
Panoptic Segmentation
 
On the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and BeyondOn the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and Beyond
 
A Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural NetworksA Benchmark for Interpretability Methods in Deep Neural Networks
A Benchmark for Interpretability Methods in Deep Neural Networks
 
KDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial NetworksKDGAN: Knowledge Distillation with Generative Adversarial Networks
KDGAN: Knowledge Distillation with Generative Adversarial Networks
 
Designing Network Design Spaces
Designing Network Design SpacesDesigning Network Design Spaces
Designing Network Design Spaces
 
Search to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the EyesSearch to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the Eyes
 
Supervised Constrastive Learning
Supervised Constrastive LearningSupervised Constrastive Learning
Supervised Constrastive Learning
 
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised LearningBootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
 
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stoch...
 

Recently uploaded

chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacingjaychoudhary37
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 

Recently uploaded (20)

chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacing
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 

Revisiting the Calibration of Modern Neural Networks

  • 2. Contents 1. Introduction & Related Work 2. (Conclusion) 3. Empirical Evaluation 4. Pitfalls and Limitations 5. Conclusion 2
  • 3. Introduction & Related Work 3 • What is ‘Model Calibration’? • The accuracy with which the scores provided by the model reflect its predictive uncertainty • It is important in safety-critical applications, such as autonomous driving, medical diagnosis, forecasting • On Calibration of Modern Neural Networks, ICML 2017. (PR-075) https://arxiv.org/abs/1706.04599 PR-075: On Calibration of Modern Neural Networks (2017)
  • 4. Introduction & Related Work 4 • On Calibration of Modern Neural Networks, ICML 2017. (PR-075) • Expected Calibration Error (ECE) https://arxiv.org/abs/1706.04599 PR-075: On Calibration of Modern Neural Networks (2017) Video by Geoff Pleiss Neural Network Input 𝑥 Logits 𝑧 𝑥 𝑒𝑧 𝑥 σ𝑖 𝑒𝑧𝑖 𝑥 Probability Ƹ 𝑝 𝑥 0 ≤ Ƹ 𝑝 𝑥 < 0.2 0.2 ≤ Ƹ 𝑝 𝑥 < 0.4 0.4 ≤ Ƹ 𝑝 𝑥 < 0.6 0.6 ≤ Ƹ 𝑝 𝑥 < 0.8 0.8 ≤ Ƹ 𝑝 𝑥 < 1
  • 5. Introduction & Related Work 5 • On Calibration of Modern Neural Networks, ICML 2017. (PR-075) • Expected Calibration Error (ECE) 0 ≤ Ƹ 𝑝 𝑥 < 0.2 0.2 ≤ Ƹ 𝑝 𝑥 < 0.4 0.4 ≤ Ƹ 𝑝 𝑥 < 0.6 0.6 ≤ Ƹ 𝑝 𝑥 < 0.8 0.8 ≤ Ƹ 𝑝 𝑥 < 1 X O O O O O O https://arxiv.org/abs/1706.04599 PR-075: On Calibration of Modern Neural Networks (2017) Video by Geoff Pleiss Avg. conf : 0.86 Accuracy : 0.86
  • 6. Introduction & Related Work 6 • On Calibration of Modern Neural Networks, ICML 2017. (PR-075) • Expected Calibration Error (ECE) 0 ≤ Ƹ 𝑝 𝑥 < 0.2 0.2 ≤ Ƹ 𝑝 𝑥 < 0.4 0.4 ≤ Ƹ 𝑝 𝑥 < 0.6 0.6 ≤ Ƹ 𝑝 𝑥 < 0.8 0.8 ≤ Ƹ 𝑝 𝑥 < 1 X O O O O O Avg. conf : 0.86 Accuracy : 0.71 Gap : 0.15 X https://arxiv.org/abs/1706.04599 PR-075: On Calibration of Modern Neural Networks (2017) Video by Geoff Pleiss
  • 7. Introduction & Related Work 7 • On Calibration of Modern Neural Networks, ICML 2017. (PR-075) • Expected Calibration Error (ECE) 0 ≤ Ƹ 𝑝 𝑥 < 0.2 0.2 ≤ Ƹ 𝑝 𝑥 < 0.4 0.4 ≤ Ƹ 𝑝 𝑥 < 0.6 0.6 ≤ Ƹ 𝑝 𝑥 < 0.8 0.8 ≤ Ƹ 𝑝 𝑥 < 1 X O O O O O 0.86 0.71 7 x 0.15 X https://arxiv.org/abs/1706.04599 PR-075: On Calibration of Modern Neural Networks (2017) Video by Geoff Pleiss X O X O O O O X Avg. conf : Accuracy : Gap : 0.74 0.67 6 x 0.07 0.55 0.50 2 x 0.05 15 Expected Calibration Error (ECE) : 0.11 (Naeini et al. 2015)
  • 8. Introduction & Related Work 8 • On Calibration of Modern Neural Networks, ICML 2017. (PR-075) • Reliability Diagrams acc 𝐵𝑚 = 1 𝐵𝑚 σ𝑖∈𝐵𝑚 1 ො 𝑦𝑖 = 𝑦𝑖 , conf 𝐵𝑚 = 1 𝐵𝑚 σ𝑖∈𝐵𝑚 Ƹ 𝑝𝑖 • Expected Calibration Error (ECE) ECE = ෍ 𝑚=1 𝑀 𝐵𝑚 𝑛 acc 𝐵𝑚 − conf 𝐵𝑚 • Negative log likelihood (NLL) ℒ = − ෍ 𝑖=1 𝑛 log ො 𝜋 𝑦𝑖|𝑥𝑖 https://arxiv.org/abs/1706.04599 PR-075: On Calibration of Modern Neural Networks (2017) Video by Geoff Pleiss
  • 9. Introduction & Related Work 9 • On Calibration of Modern Neural Networks, ICML 2017. (PR-075) • Temperature Scaling ො 𝑞𝑖 = max 𝑘 𝜎SM 𝑧𝑖/𝑇 (𝑘) https://arxiv.org/abs/1706.04599 PR-075: On Calibration of Modern Neural Networks (2017) Video by Geoff Pleiss
  • 10. Introduction & Related Work 10 • Why should ‘Model Calibration’ be considered in safety-critical applications? Video by Geoff Pleiss Model : plastic bag (55% conf.) Other sensors : person (90% conf.) Model : cyst (55% conf.) Clinician : HCC (90% conf.)
  • 11. Introduction & Related Work 11 • Some works suggest a trend for larger, more accurate models to be worse calibrated. • Recent researches have focused on improving accuracy rather than calibration. • architecture size, amount of training data, and computing power • Architecture : MLP-Mixer, ViT, … • Training approaches : SimCLR, ResNext-WSL, CLIP, … Q. Do past results on calibration extend to current state-of-the-art models? https://arxiv.org/abs/1706.04599
  • 12. Empirical Evaluation 12 • Experimental Setup • Model families • MLP-Mixer (PR-317) • ViT (PR-281) • BiT • ResNeXt-WSL • SimCLR (PR-231) • CLIP • AlexNet https://arxiv.org/abs/2105.01601 https://arxiv.org/abs/2010.11929 https://arxiv.org/abs/1912.11370 https://arxiv.org/abs/1805.00932 https://arxiv.org/abs/2002.05709 https://arxiv.org/abs/2103.00020 https://papers.nips.cc/paper/2012/file/c399862d3b9d6b76c8436e924a68c45b-Paper.pdf
  • 13. Empirical Evaluation 13 • Experimental Setup • Datasets • ImageNet • ImageNetV2 • ImageNet-C (PR-277) • ImageNet-R • ImageNet-A • ImageNet-21K • JFT-300 https://arxiv.org/abs/1902.10811 https://arxiv.org/abs/1903.12261 https://arxiv.org/abs/2006.16241 https://arxiv.org/abs/1907.07174 https://ieeexplore.ieee.org/document/5206848 https://arxiv.org/abs/1707.02968
  • 14. (Conclusion) 14 • Modern image models are well calibrated across distribution shift despite being designed with a focus on accuracy. • Model size and pretraining amount do not fully account for the performance differences between families. • Architecture is a major determinant of calibration! • MLP-Mixer and ViT (non-conv) are among the best-calibrated models both in-distribution and OOD. • Further work on the influence of architecture inductive biases on calibration and OOD robustness will be necessary to tell whether these results generalize.
  • 15. Empirical Evaluation 15 • In-Distribution Calibration • Several recent model families (MLP-Mixer, ViT, and BiT) are both highly accurate and well-calibrated. • A zero-shot model, CLIP, is well-calibrated given its accuracy.
  • 16. Empirical Evaluation 16 • In-Distribution Calibration • Temperature scaling reveals consistent properties of model families. • The poor calibration of past models can often be remedied by post-hoc recalibration. (temperature scaling) → Does a difference between models remain after recalibration? → The most recent architectures are better calibrated than past models even after temperature scaling. • SimCLR or ResNeXt-WSL tend to be worse calibrated than BiT. • MLP-Mixer and ViT (non-conv) can perform as well.
  • 17. Empirical Evaluation 17 • In-Distribution Calibration • Differences between families are not explained by model size or pretraining amount • Disentangle how the differences between model families affect their calibration properties • Model size • Prior work : Larger neural networks are worse calibrated and have lower classification error. • This study : ViT models are better calibrated than BiT models. • Changing the size of BiT model cannot move it into the Pareto set of ViT models.
  • 18. Empirical Evaluation 18 • In-Distribution Calibration • Differences between families are not explained by model size or pretraining amount • Disentangle how the differences between model families affect their calibration properties • Model pretraining • BiT models pretrained on ImageNet (1.3M), ImageNet-21K (12.8M), and JFT-300 (300M) • It has no consistent effect on calibration. (ViT, MLP-Mixer < BiT < SimCLR) • The amount of pretraining data and the duration of pretraining
  • 19. Empirical Evaluation 19 • Accuracy and Calibration Under Distribution Shift • ImageNet-C
  • 20. Empirical Evaluation 20 • Accuracy and Calibration Under Distribution Shift • What degree the insights from in-distribution and ImageNet-C calibration transfer to natural OOD data • Previous : Better in-distribution calibration and accuracy do not predict better calibration under distribution shift. • This study : The performance on several natural OOD datasets is largely consistent with that on ImageNet.
  • 21. Empirical Evaluation 21 • Relating Accuracy and Calibration Within Model Families • How to compare individual models within a family, where one model is more accurate but worse calibrated, and the other is less accurate but better calibrated. → Which model should a practitioner choose for a safety-critical application? • The cost structure of the specific application • The scenario of selective prediction • One can choose to ignore the model prediction (“abstain”) at a fixed cost if the prediction confidence is low, rather than risking a (more costly) prediction error.
  • 22. Empirical Evaluation 22 • Relating Accuracy and Calibration Within Model Families • Total cost : a linear combination of the misclassification cost and abstention cost at a given cost ratio (x- axis) and abstention rate (y-axis). • Blue regions : The higher-accuracy model (R152x4) achieves a lower cost that the better-calibrated model (R50x1).
  • 23. Pitfalls and Limitations 23 • There are two sources of bias • From estimating ECE by binning : always negative • From the finite sample size used to estimate the per-bin statistics : always positive • A larger number of bins → fewer points per bin → higher variance in each bin → positive bias of ECE
  • 24. Pitfalls and Limitations 24 • More formally, • To estimate accuracy 𝐵𝑖 precisely, we need a number of samples inversely proportional to the standard deviation 𝒑𝒊 𝟏 − 𝒑𝒊 / 𝑩𝒊 , where 𝑝𝑖 is the expected accuracy in 𝐵𝑖. • Bias : models with extreme average accuracies (i.e. close to 0 or 1) < models with an accuracy close to 0.5 • If we estimate 𝔼 accuracy 𝐵𝑖 − 𝔼 confidence 𝐵𝑖 2 for any bin 𝑖 with 𝑛𝑖 samples 𝑏𝑖𝑎𝑠 = 1 𝑛𝑖 𝕍 𝐴 + 𝕍 𝐶 − 2Cov 𝐶, 𝐴 • 𝐴 = 𝑌 ∈ arg max 𝑓 𝑋 , 𝐶 = max 𝑓 𝑋 • Higher accuracy models have a lower bias not only due to higher accuracy (lower 𝕍 𝑨 ), but also because their outputs correlate more with the correct label (higher covariance).
  • 25. Conclusion 25 • Modern image models are well calibrated across distribution shift despite being designed with a focus on accuracy. • Model size and pretraining amount do not fully account for the performance differences between families. • Architecture is a major determinant of calibration! • MLP-Mixer and ViT (non-conv) are among the best-calibrated models both in-distribution and OOD. • Further work on the influence of architecture inductive biases on calibration and OOD robustness will be necessary to tell whether these results generalize.