Proposal for Linking Concept Drift and the Uncertainty of Machine Learning
1. PAPER – CONCEPT DRIFT + UNCERTAINTY
Wednesday, November 13, 2019 8:32 AM
BROAD AIM (within PhD):
• Study the relationship between concept drift and the uncertainty of ML models
Specific aims:
• Investigate the effectiveness of using ML uncertainties to detect
concept drift
○ Aleatoric, which can be further divided into:
▪ Homoscedastic [1]
▪ Heteroscedastic [1]
○ Epistemic [1]
• Two types of learning (supervised and unsupervised), as well as their
combination (semi-supervised), will be employed within the family of
Bayesian Neural Networks:
○ Supervised learning (vanilla BNN)
○ Unsupervised learning (Bayesian autoencoder)
○ Semi-supervised learning (a combination of both)
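The aleatoric/epistemic split above can be sketched with the decomposition from Kendall & Gal [1]: over T stochastic forward passes (e.g. MC Dropout samples), the mean of the predicted variances estimates aleatoric uncertainty, and the variance of the predicted means estimates epistemic uncertainty. The arrays below are simulated stand-ins for real network outputs, just to show the arithmetic:

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated outputs of T stochastic forward passes (e.g. MC Dropout)
# for n inputs: each pass yields a predicted mean mu_t(x) and a
# predicted variance sigma_t^2(x), as in Kendall & Gal [1].
T, n = 50, 5
means = rng.normal(loc=1.0, scale=0.3, size=(T, n))   # mu_t(x)
variances = rng.uniform(0.1, 0.2, size=(T, n))        # sigma_t^2(x)

aleatoric = variances.mean(axis=0)   # E_t[sigma_t^2]: data noise
epistemic = means.var(axis=0)        # Var_t[mu_t]: model uncertainty
total = aleatoric + epistemic        # total predictive variance
```

With a trained BNN, `means` and `variances` would come from repeated stochastic forward passes over the same batch rather than from a random generator.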
Background & Motivation
• In a manufacturing context, CPMS data are characterized by dynamic,
uncertain, and evolving distributions [2]. Batch training under the
assumption that the data distribution remains stationary is easily
challenged; ML systems are therefore required to detect these
change-points in the data distribution, also known as concept drifts,
and subsequently adapt the ML models.
• We propose the use of uncertainties of predictions in Bayesian
Neural Networks to detect concept drifts.
• The possible benefits are:
○ No need for external detection methods: uncertainty quantification
and calibration are incorporated during training of the BNN models, so
CD detection comes for 'free' in addition to accurate predictions.
○ Quantifying the severity of a CD and locating where it occurs in the
latent space (concept drift understanding) [2]
• However, the use of BNNs in this context is not straightforward:
there are many approximation methods for Bayesian inference, and to
date the best method remains an open question [3]. For instance, some
research has shown that the uncertainty of BNNs trained with
factorised Gaussian posteriors or MC Dropout is unreliable [4].
• The use of uncertainty for this purpose is relatively new; only
recently has a paper proposed using the uncertainty of an SVM for
concept drift detection [5].
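As a minimal illustration of the 'free' detection idea (an assumed detector sketch, not a method taken from the references): calibrate a threshold on the model's uncertainties over a drift-free reference window, then flag incoming points whose uncertainty exceeds that threshold. The `detect_drift` helper and the uniform reference window below are hypothetical:

```python
import numpy as np

def detect_drift(uncertainties, reference, quantile=0.99):
    """Return indices where uncertainty exceeds the reference quantile."""
    threshold = np.quantile(reference, quantile)   # calibrated on drift-free data
    return np.flatnonzero(np.asarray(uncertainties) > threshold)

# Hypothetical reference uncertainties from a drift-free window.
reference = np.random.default_rng(1).uniform(0.0, 1.0, size=500)
stream = [0.2, 0.5, 3.0, 0.4]            # uncertainty spikes at index 2
print(detect_drift(stream, reference))   # → [2]
```

In practice `stream` would hold the per-sample epistemic (or total) uncertainty produced by the BNN on incoming data; choosing the quantile is exactly the open thresholding question raised in the experiment setup below.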
Nomenclature:
• CPMS – Cyber-physical manufacturing systems
• CD – Concept Drift
• BNN – Bayesian Neural Network
2. Experiment setup
• All experiments will be implemented using the agentMET4FOF software
package
• Datasets (limited to time-series sensor data)
○ Synthetic data [2]
▪ SEA concepts
▪ Sine, waveform
○ Real data [2]
• ML Models (to be implemented)
○ BNN
○ Variational Autoencoder
• Variables
○ Types of Bayesian inference
▪ MC Dropout
▪ Ensembles
▪ Variational inference
▪ Etc...
○ Types of uncertainty
▪ Aleatoric uncertainty
▪ Epistemic uncertainty
○ Uncertainty threshold levels
▪ Q: How to determine the threshold level?
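One of the inference variants above, ensembles, can be sketched without any deep learning machinery: fit several models on bootstrap resamples and use their disagreement as epistemic uncertainty. The 1-D least-squares "model" below is a hypothetical stand-in for a neural network ensemble member:

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy regression data: y = 2x + noise.
x = rng.uniform(-1, 1, size=100)
y = 2.0 * x + rng.normal(scale=0.1, size=100)

# Ensemble of M "models", each a least-squares slope fit on a
# bootstrap resample of the data (stand-in for independently
# trained networks).
M = 10
slopes = []
for _ in range(M):
    idx = rng.integers(0, len(x), size=len(x))   # bootstrap sample
    xs, ys = x[idx], y[idx]
    slopes.append((xs @ ys) / (xs @ xs))         # least-squares slope

x_new = 0.5
preds = np.array(slopes) * x_new
mean_pred = preds.mean()     # ensemble prediction, ~1.0 here
epistemic = preds.var()      # spread across members = model uncertainty
```

Far from the training data (large |x_new|), the member predictions diverge and `epistemic` grows, which is the behavior a drift detector would exploit.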
• Performance metrics for using uncertainties as a drift detector [2]
[5]
○ True positive rate (sensitivity)
○ True negative rate (specificity)
○ Detection delay
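The metrics above can be computed by matching detected drift points to ground-truth drift points within a tolerance window; the definitions below are an assumed sketch for illustration, not taken verbatim from [2] or [5]:

```python
import numpy as np

def drift_metrics(detected, true_drifts, tolerance=50):
    """Return (sensitivity, false alarm count, mean detection delay)."""
    detected = sorted(detected)
    hits, delays = 0, []
    for t in true_drifts:
        # Detections landing within the tolerance window after a true drift.
        matches = [d for d in detected if t <= d <= t + tolerance]
        if matches:
            hits += 1
            delays.append(matches[0] - t)   # delay of earliest detection
    sensitivity = hits / len(true_drifts)   # true positive rate
    false_alarms = sum(
        1 for d in detected
        if not any(t <= d <= t + tolerance for t in true_drifts)
    )
    mean_delay = float(np.mean(delays)) if delays else None
    return sensitivity, false_alarms, mean_delay

# True drifts at t=100 and t=300; detector fires at t=105 (hit, delay 5)
# and t=400 (false alarm).
print(drift_metrics([105, 400], [100, 300]))  # → (0.5, 1, 5.0)
```

Specificity can be derived analogously from the non-drift windows; the tolerance window is a design choice that trades detection delay against false alarms.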
References
1. Kendall, Alex, and Yarin Gal. "What uncertainties do we need in Bayesian deep learning for
computer vision?" Advances in Neural Information Processing Systems. 2017.
2. Lu, Jie, et al. "Learning under concept drift: A review." IEEE Transactions on Knowledge and
Data Engineering (2018).
3. Yao, Jiayu, et al. "Quality of Uncertainty Quantification for Bayesian Neural Network
Inference." arXiv preprint arXiv:1906.09686 (2019).
4. Foong, Andrew YK, et al. "Pathologies of Factorised Gaussian and MC Dropout Posteriors in
Bayesian Neural Networks." arXiv preprint arXiv:1909.00719 (2019).
5. Yu, Shujian, Xiaoyang Wang, and José C. Príncipe. "Request-and-reverify: hierarchical
hypothesis testing for concept drift detection with expensive labels." arXiv preprint
arXiv:1806.10131 (2018).