SlideShare a Scribd company logo
Inbar Naor / Data Scientist, Taboola April 2018
Don't believe everything your
network tells you
Uncertainty in deep learning
Outline
• Why do we need uncertainty?
• Uncertainty Types
• Uncertainty in Recommendation Systems
• Bayesian Learning 101
• Data Uncertainty
• Model Uncertainty
DON'T BELIEVE EVERYTHING YOUR NETWORK TELLS YOU
Why Do We Need Uncertainty?
● High risk applications
● Out of data samples
● Improving the model:
○ mistakes with high confidence
○ true labels with low confidence
Capturing Model Blind spots is Crucial
UNCERTAINTY IN DEEP LEARNING
Why Do We Need Uncertainty?
● High risk applications
● Out of data samples
● Improving the model:
○ mistakes with high confidence
○ true labels with low confidence
SLIDE | 6
Why Do We Need Uncertainty?
● High risk applications
● Out of data samples
● Improving the model:
○ mistakes with high confidence
○ true labels with low confidence
Model Uncertainty
UNCERTAINTY IN DEEP LEARNING
More Data Please!
Data Uncertainty
UNCERTAINTY IN DEEP LEARNING
More Data Won’t Help – I want to
know how good my predictions are
Uncertainty in
Recommendation
Systems
Content Discovery
A DEEP LEARNING APPROACH TO DISCOVERY
Context
Metadata
User Historical Data
Rank N
recommendations by
CTR * CPC
~1M Possible
Recommendations Region-based
Location
Information
Machine Learning + Discovery = Hard
A DEEP LEARNING APPROACH TO DISCOVERY
Rank N
recommendations by
CTR * CPC
Implicit
Feedback
Data is Very
Sparse
No Human
Baseline
World is
Ever-changing
“Walmart cameras captured these awesome photos”“Walmart cameras captured these hilarious photos”
“15 rarely seen WW2 Photos Discovered”
Why Go Deep for Discovery?
• Cold start is a huge issue
• Many hard sub problems
– Language modeling
– Image classification
– User Profiling
• There are many complex relations
A DEEP LEARNING APPROACH TO DISCOVERY
Exploration/Exploitation in Recommender Systems
UNCERTAINTY IN DEEP LEARNING
Best Performing
Recommendations
Add new Information
Fight selection biasSearch for new stars
Exploration at random doesn’t work
Bayesian Learning
● Probability is a measure of belief
● Prior distribution represents
our belief about the world
● Likelihood
● Posterior
Introduction
Capturing Data Uncertainty
Know what you don’t know
Likelihood as loss
CAPTURING DATA UNCERTAINTY
h1
h2
… hn
LSTM LSTM LSTM LSTM
WordWordWord …
y
Likelihood as loss
CAPTURING DATA UNCERTAINTY
(*) In classification we can
assume a binomial
distribution with Beta prior
h1
h2
… hn
LSTM LSTM LSTM LSTM
WordWordWord …
Mu Std
Mixture Density Network
CAPTURING DATA UNCERTAINTY
Christopher M. Bishop, Mixture density networks (1994)
Capturing Data Uncertainty
CAPTURING DATA UNCERTAINTY
Data Uncertainty and Training Error
CAPTURING DATA UNCERTAINTY
Data Uncertainty and OOV
CAPTURING DATA UNCERTAINTY
h1
h2
… hn
LSTM LSTM LSTM LSTM
Word
OOV
Word1
Word …
Mu Std
OOV
Word2
Probably
higher
variance
Data Uncertainty and OOV
CAPTURING DATA UNCERTAINTY
Capturing Model
Uncertainty
Bayesian Learning
● Probability is a measure of belief
● Prior distribution represents
our belief about the world
● Likelihood
● Posterior
Introduction
Neural Network - A Probabilistic Perspective
Introduction
Neural Network is a probabilistic model
We learn W using MLE:
Adding regularization using MAP:
Bayesian Neural Networks
Model Uncertainty
Weights uncertainty
Prior:
Posterior:
Prediction:
is often intractable
Bayesian Posterior Inference
Model Uncertainty
Variational Inference Sampling Methods
High bias - low variance High variance - low bias
Dropouts – a regularization technique
CAPTURING MODEL UNCERTAINTY
Dropout variational inference as Bayesian Approximation
• Monte-Carlo Dropouts as Bayesian Approximation
• A little bit like bagging
CAPTURING MODEL UNCERTAINTY
gal & ghahramani, Bayesian convolutional neural networks with bernoulli approximate variational inference (2016)
Uncertainty as a function of amount of data
CAPTURING MODEL UNCERTAINTY
Image: Yarin Gal, “what my deep model doesn’t know”
Uncertainty as a function of amount of data
CAPTURING MODEL UNCERTAINTY
Uncertainty as a function of amount of data
CAPTURING MODEL UNCERTAINTY
Uncertainty in exploration
● Upper Confidence Bound - adding sigma to the score
● Thompson Sampling - sample from the distribution
Summary
• Two types of uncertainty: model and data
• Mixture Density Networks
• Captures the true variance of your prediction
• Monte-Carlo Dropouts Variational Inference
• Sheds light on where the model lacks data
• Why not both?
• Also interesting: uncertainty due to measurement noise
DON'T BELIEVE EVERYTHING YOUR NETWORK TELLS YOU
Model Uncertainty Data Uncertainty
Thank You
Feel free to contact with questions!
Inbar Naor - inbar.naor1@gmail.com
Taboola Engineering Blog
Unsupervised - A Podcast about Data Science in Israel
Bibliography
● What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? Kendall & Gal, 2017
● Dropout Inference In Bayesian Neural Networks with Alpha-Divergence. Li & Gal, ICML 2017
● On Modern Deep Learning and Variational Inference. Gal & Ghahramani, Advances in Approximate Bayesian Inference
workshop, NIPS 2015
● Bayesian Convolutional Neural Networks with Bernoulli Approximate Variational Inference. Gal & Ghahramani, ICLR 2016
● Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. Gal & Ghahramani, ICML 2016
● Dropout as a Bayesian Approximation: Insights and Applications. Gal & Ghahramani, Deep Learning Workshop, ICML
2015
● Weight Uncertainty in Neural Networks . Blundell, Cornebise, Kavakcuoglu, Wierstra 2015
● Training Deep Learning Neural Networks Based on Unreliable Labels. Bekker & Goldberger, ICASSP 2016
● Keeping Neural Networks Simple by Minimizing the description length of weights. Hinton & van Camp, Proceedings of
COLT- 1993
● Practical Variational Inference for Neural Networks. Graves, NIPS 2011
● Learning Weight Uncertainty with Stochastic Gradient MCMC for Shape Classification. Chunyuan et al. CVPR 2016

More Related Content

Similar to Uncertainty in deep learning sandaysky.pptx

Introduction to Visualizing Uncertainties
Introduction to Visualizing UncertaintiesIntroduction to Visualizing Uncertainties
Introduction to Visualizing UncertaintiesKai Li
 
Designing Data Visualization
Designing Data VisualizationDesigning Data Visualization
Designing Data Visualization
FITC
 
Measuring the Speed of the Red Queen's Race; Adaption and Evasion in Malware
Measuring the Speed of the Red Queen's Race; Adaption and Evasion in MalwareMeasuring the Speed of the Red Queen's Race; Adaption and Evasion in Malware
Measuring the Speed of the Red Queen's Race; Adaption and Evasion in Malware
Priyanka Aash
 
Rsqrd AI - ML Interpretability: Beyond Feature Importance
Rsqrd AI - ML Interpretability: Beyond Feature ImportanceRsqrd AI - ML Interpretability: Beyond Feature Importance
Rsqrd AI - ML Interpretability: Beyond Feature Importance
Alessya Visnjic
 
Neur ips yomikai_at_ridgei_aaron_jan312020
Neur ips yomikai_at_ridgei_aaron_jan312020Neur ips yomikai_at_ridgei_aaron_jan312020
Neur ips yomikai_at_ridgei_aaron_jan312020
Dr. Aaron C. Bell
 
CS194Lec0hbh6EDA.pptx
CS194Lec0hbh6EDA.pptxCS194Lec0hbh6EDA.pptx
CS194Lec0hbh6EDA.pptx
PrudhvirajEluri1
 
MLSEV. Models, Evaluations and Ensembles
MLSEV. Models, Evaluations and Ensembles MLSEV. Models, Evaluations and Ensembles
MLSEV. Models, Evaluations and Ensembles
BigML, Inc
 
Stance classification. Uni Cambridge 22 Jan 2021
Stance classification. Uni Cambridge 22 Jan 2021Stance classification. Uni Cambridge 22 Jan 2021
Stance classification. Uni Cambridge 22 Jan 2021
Weverify
 
Stance classification. Presentation at QMUL 11 Nov 2020
Stance classification. Presentation at QMUL 11 Nov 2020Stance classification. Presentation at QMUL 11 Nov 2020
Stance classification. Presentation at QMUL 11 Nov 2020
Weverify
 
Jarrod Lopiccolo - Big Data
Jarrod Lopiccolo - Big DataJarrod Lopiccolo - Big Data
Jarrod Lopiccolo - Big DataRenoTahoeAMA
 
The Dangers of Machine Learning
The Dangers of Machine LearningThe Dangers of Machine Learning
The Dangers of Machine Learning
tothepointIT
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Niko Vuokko
 
Max Shron, Thinking with Data at the NYC Data Science Meetup
Max Shron, Thinking with Data at the NYC Data Science MeetupMax Shron, Thinking with Data at the NYC Data Science Meetup
Max Shron, Thinking with Data at the NYC Data Science Meetup
mortardata
 
unba.se - ACM CSCW 2017 - IWCES15
unba.se - ACM CSCW 2017 - IWCES15unba.se - ACM CSCW 2017 - IWCES15
unba.se - ACM CSCW 2017 - IWCES15
Daniel Norman
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
Thinkful
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
Thinkful
 
PR-048: Towards Principled Methods for Training Generative Adversarial Networks
PR-048: Towards Principled Methods for Training Generative Adversarial NetworksPR-048: Towards Principled Methods for Training Generative Adversarial Networks
PR-048: Towards Principled Methods for Training Generative Adversarial Networks
Ji-Hoon Kim
 
Privacy, security and ethics in data science
Privacy, security and ethics in data sciencePrivacy, security and ethics in data science
Privacy, security and ethics in data science
Nikolaos Vasiloglou
 
Outliers and Inconsistency
Outliers and InconsistencyOutliers and Inconsistency
Outliers and Inconsistency
Neil Rubens
 
Digital Pragmatism with Business Intelligence, Big Data and Data Visualisation
Digital Pragmatism with Business Intelligence, Big Data and Data VisualisationDigital Pragmatism with Business Intelligence, Big Data and Data Visualisation
Digital Pragmatism with Business Intelligence, Big Data and Data Visualisation
Jen Stirrup
 

Similar to Uncertainty in deep learning sandaysky.pptx (20)

Introduction to Visualizing Uncertainties
Introduction to Visualizing UncertaintiesIntroduction to Visualizing Uncertainties
Introduction to Visualizing Uncertainties
 
Designing Data Visualization
Designing Data VisualizationDesigning Data Visualization
Designing Data Visualization
 
Measuring the Speed of the Red Queen's Race; Adaption and Evasion in Malware
Measuring the Speed of the Red Queen's Race; Adaption and Evasion in MalwareMeasuring the Speed of the Red Queen's Race; Adaption and Evasion in Malware
Measuring the Speed of the Red Queen's Race; Adaption and Evasion in Malware
 
Rsqrd AI - ML Interpretability: Beyond Feature Importance
Rsqrd AI - ML Interpretability: Beyond Feature ImportanceRsqrd AI - ML Interpretability: Beyond Feature Importance
Rsqrd AI - ML Interpretability: Beyond Feature Importance
 
Neur ips yomikai_at_ridgei_aaron_jan312020
Neur ips yomikai_at_ridgei_aaron_jan312020Neur ips yomikai_at_ridgei_aaron_jan312020
Neur ips yomikai_at_ridgei_aaron_jan312020
 
CS194Lec0hbh6EDA.pptx
CS194Lec0hbh6EDA.pptxCS194Lec0hbh6EDA.pptx
CS194Lec0hbh6EDA.pptx
 
MLSEV. Models, Evaluations and Ensembles
MLSEV. Models, Evaluations and Ensembles MLSEV. Models, Evaluations and Ensembles
MLSEV. Models, Evaluations and Ensembles
 
Stance classification. Uni Cambridge 22 Jan 2021
Stance classification. Uni Cambridge 22 Jan 2021Stance classification. Uni Cambridge 22 Jan 2021
Stance classification. Uni Cambridge 22 Jan 2021
 
Stance classification. Presentation at QMUL 11 Nov 2020
Stance classification. Presentation at QMUL 11 Nov 2020Stance classification. Presentation at QMUL 11 Nov 2020
Stance classification. Presentation at QMUL 11 Nov 2020
 
Jarrod Lopiccolo - Big Data
Jarrod Lopiccolo - Big DataJarrod Lopiccolo - Big Data
Jarrod Lopiccolo - Big Data
 
The Dangers of Machine Learning
The Dangers of Machine LearningThe Dangers of Machine Learning
The Dangers of Machine Learning
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Max Shron, Thinking with Data at the NYC Data Science Meetup
Max Shron, Thinking with Data at the NYC Data Science MeetupMax Shron, Thinking with Data at the NYC Data Science Meetup
Max Shron, Thinking with Data at the NYC Data Science Meetup
 
unba.se - ACM CSCW 2017 - IWCES15
unba.se - ACM CSCW 2017 - IWCES15unba.se - ACM CSCW 2017 - IWCES15
unba.se - ACM CSCW 2017 - IWCES15
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
PR-048: Towards Principled Methods for Training Generative Adversarial Networks
PR-048: Towards Principled Methods for Training Generative Adversarial NetworksPR-048: Towards Principled Methods for Training Generative Adversarial Networks
PR-048: Towards Principled Methods for Training Generative Adversarial Networks
 
Privacy, security and ethics in data science
Privacy, security and ethics in data sciencePrivacy, security and ethics in data science
Privacy, security and ethics in data science
 
Outliers and Inconsistency
Outliers and InconsistencyOutliers and Inconsistency
Outliers and Inconsistency
 
Digital Pragmatism with Business Intelligence, Big Data and Data Visualisation
Digital Pragmatism with Business Intelligence, Big Data and Data VisualisationDigital Pragmatism with Business Intelligence, Big Data and Data Visualisation
Digital Pragmatism with Business Intelligence, Big Data and Data Visualisation
 

Recently uploaded

LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 

Recently uploaded (20)

LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 

Uncertainty in deep learning sandaysky.pptx

  • 1. Inbar Naor / Data Scientist, Taboola April 2018 Don't believe everything your network tells you Uncertainty in deep learning
  • 2. Outline • Why do we need uncertainty? • Uncertainty Types • Uncertainty in Recommendation Systems • Bayesian Learning 101 • Data Uncertainty • Model Uncertainty DON'T BELIEVE EVERYTHING YOUR NETWORK TELLS YOU
  • 3. Why Do We Need Uncertainty? ● High risk applications ● Out of data samples ● Improving the model: ○ mistakes with high confidence ○ true labels with low confidence
  • 4. Capturing Model Blind spots is Crucial UNCERTAINTY IN DEEP LEARNING
  • 5. Why Do We Need Uncertainty? ● High risk applications ● Out of data samples ● Improving the model: ○ mistakes with high confidence ○ true labels with low confidence
  • 7. Why Do We Need Uncertainty? ● High risk applications ● Out of data samples ● Improving the model: ○ mistakes with high confidence ○ true labels with low confidence
  • 8. Model Uncertainty UNCERTAINTY IN DEEP LEARNING More Data Please!
  • 9. Data Uncertainty UNCERTAINTY IN DEEP LEARNING More Data Won’t Help – I want to know how good my predictions are
  • 11. Content Discovery A DEEP LEARNING APPROACH TO DISCOVERY Context Metadata User Historical Data Rank N recommendations by CTR * CPC ~1M Possible Recommendations Region-based Location Information
  • 12. Machine Learning + Discovery = Hard A DEEP LEARNING APPROACH TO DISCOVERY Rank N recommendations by CTR * CPC Implicit Feedback Data is Very Sparse No Human Baseline World is Ever-changing “Walmart cameras captured these awesome photos”“Walmart cameras captured these hilarious photos” “15 rarely seen WW2 Photos Discovered”
  • 13. Why Go Deep for Discovery? • Cold start is a huge issue • Many hard sub problems – Language modeling – Image classification – User Profiling • There are many complex relations A DEEP LEARNING APPROACH TO DISCOVERY
  • 14. Exploration/Exploitation in Recommender Systems UNCERTAINTY IN DEEP LEARNING Best Performing Recommendations Add new Information Fight selection biasSearch for new stars Exploration at random doesn’t work
  • 15. Bayesian Learning ● Probability is a measure of belief ● Prior distribution represents our belief about the world ● Likelihood ● Posterior Introduction
  • 16. Capturing Data Uncertainty Know what you don’t know
  • 17. Likelihood as loss CAPTURING DATA UNCERTAINTY h1 h2 … hn LSTM LSTM LSTM LSTM WordWordWord … y
  • 18. Likelihood as loss CAPTURING DATA UNCERTAINTY (*) In classification we can assume a binomial distribution with Beta prior h1 h2 … hn LSTM LSTM LSTM LSTM WordWordWord … Mu Std
  • 19. Mixture Density Network CAPTURING DATA UNCERTAINTY Christopher M. Bishop, Mixture density networks (1994)
  • 21. Data Uncertainty and Training Error CAPTURING DATA UNCERTAINTY
  • 22. Data Uncertainty and OOV CAPTURING DATA UNCERTAINTY h1 h2 … hn LSTM LSTM LSTM LSTM Word OOV Word1 Word … Mu Std OOV Word2 Probably higher variance
  • 23. Data Uncertainty and OOV CAPTURING DATA UNCERTAINTY
  • 25. Bayesian Learning ● Probability is a measure of belief ● Prior distribution represents our belief about the world ● Likelihood ● Posterior Introduction
  • 26. Neural Network - A Probabilistic Perspective Introduction Neural Network is a probabilistic model We learn W using MLE: Adding regularization using MAP:
  • 27. Bayesian Neural Networks Model Uncertainty Weights uncertainty Prior: Posterior: Prediction: is often intractable
  • 28. Bayesian Posterior Inference Model Uncertainty Variational Inference Sampling Methods High bias - low variance High variance - low bias
  • 29. Dropouts – a regularization technique CAPTURING MODEL UNCERTAINTY
  • 30. Dropout variational inference as Bayesian Approximation • Monte-Carlo Dropouts as Bayesian Approximation • A little bit like bagging CAPTURING MODEL UNCERTAINTY gal & ghahramani, Bayesian convolutional neural networks with bernoulli approximate variational inference (2016)
  • 31. Uncertainty as a function of amount of data CAPTURING MODEL UNCERTAINTY Image: Yarin Gal, “what my deep model doesn’t know”
  • 32. Uncertainty as a function of amount of data CAPTURING MODEL UNCERTAINTY
  • 33. Uncertainty as a function of amount of data CAPTURING MODEL UNCERTAINTY
  • 34. Uncertainty in exploration ● Upper Confidence Bound - adding sigma to the score ● Thompson Sampling - sample from the distribution
  • 35. Summary • Two types of uncertainty: model and data • Mixture Density Networks • Captures the true variance of your prediction • Monte-Carlo Dropouts Variational Inference • Sheds light on where the model lacks data • Why not both? • Also interesting: uncertainty due to measurement noise DON'T BELIEVE EVERYTHING YOUR NETWORK TELLS YOU Model Uncertainty Data Uncertainty
  • 36. Thank You Feel free to contact with questions! Inbar Naor - inbar.naor1@gmail.com Taboola Engineering Blog Unsupervised - A Podcast about Data Science in Israel
  • 37. Bibliography ● What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? Kendall & Gal, 2017 ● Dropout Inference In Bayesian Neural Networks with Alpha-Divergence. Li & Gal, ICML 2017 ● On Modern Deep Learning and Variational Inference. Gal & Ghahramani, Advances in Approximate Bayesian Inference workshop, NIPS 2015 ● Bayesian Convolutional Neural Networks with Bernoulli Approximate Variational Inference. Gal & Ghahramani, ICLR 2016 ● Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. Gal & Ghahramani, ICML 2016 ● Dropout as a Bayesian Approximation: Insights and Applications. Gal & Ghahramani, Deep Learning Workshop, ICML 2015 ● Weight Uncertainty in Neural Networks . Blundell, Cornebise, Kavakcuoglu, Wierstra 2015 ● Training Deep Learning Neural Networks Based on Unreliable Labels. Bekker & Goldberger, ICASSP 2016 ● Keeping Neural Networks Simple by Minimizing the description length of weights. Hinton & van Camp, Proceedings of COLT- 1993 ● Practical Variational Inference for Neural Networks. Graves, NIPS 2011 ● Learning Weight Uncertainty with Stochastic Gradient MCMC for Shape Classification. Chunyuan et al. CVPR 2016