SlideShare a Scribd company logo
1 of 17
Guiding Deep Learning System Testing Using
Surprise Adequacy
Authors: Jinhan Kim, Robert Feldt, Shin Yoo
Presented by
Fatemeh Ghorbani
2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE)
Outline
• Introduction
▫ Statement of the problem
▫ Related works
▫ Surprise Adequacy for Deep Learning Systems (SADL)
• SADL measurement
▫ Surprise Adequacy (SA)
▫ Likelihood-based SA (LSA)
▫ Distance-based SA (DSA)
▫ Surprise Coverage (SC)
• Experimental setup
• Research questions and results
• Conclusion
1/15
Introduction
• Statement of the problem
• Related works
• Surprise Adequacy for Deep Learning Systems (SADL)
2/15
Statement of the problem
• Unexpected behaviors of deep learning (DL) systems
▫ Adversarial examples
• Essential need to verify behaviors
• Testing DL systems correctness
3/15
• DeepTest
• DeepXplore
▫ Neuron Coverage (NC)
• Major limitation
▫ Convey little information
▫ Discretization
Related works and their limitations
4/15
SADL
• Based on the behavior of DL systems
▫ With respect to training data
▫ Data-flow
• The actual measure of surprise:
▫ The likelihood of new input and training data
▫ The distance between activation trace vectors of new input and training
data
• Quantitively measurement
5/15
Surprise Adequacy (SA)
• Activation traces of inputs and training data over neurons in N:
• Compare them
Fully captures the behaviors of the DL system
6/15
Likelihood-based SA (LSA)
• Applies KDE to estimate the probability density of each activation value:
• Obtains the surprise of a new input
• To reduce computational cost: consider layer L activation trace
7/15
Distance-based SA (DSA)
• Use the distances between activation traces
as the measure of surprise
• Only apply DSA for classification task
8/15
Surprise Coverage (SC)
• SC can only be measured with
respect to predefined upper band
• Sense of redundancy is weaker
LSA and DSA (continuous)
bucketing (discretise)
LSC and DSC
9/15
Experimental setup
• Data sets and DL systems:
▫ MNIST, CIFAR-10
▫ Self-driving car challenge
▫ Pre-trained Dave-2 and Chauffeur model
▫ Evaluation of SADL accuracy: CNN and MSE
• Adversarial examples and synthetic inputs
▫ Five attack strategies
▫ DeepXplore, DeepTest
10/15
Research questions and results
1) SADL capability of capturing the relative surprise
▫ Training adversarial example classifier using logistic
regression
▫ Quantitatively and visually: SADL can measure how
surprising the input is
▫ DSA from specific layer: produce higher accuracy
▫ Inputs with higher SA: harder to classify
▫ Adversarial examples: higher SA value
10000
adversarial
examples
1000 for
training
9000 for
evaluation
11/15
Research questions and results (Con.)
2) Selection of layer impact on accuracy
▫ With LSA: there is no strong evidence
▫ With DSA: deepest layer produces the most accurate
classifier
▫ The layer sensitivity varies across different attack strategies
12/15
Research questions and results (Con.)
3) Correlation between SC and
other criteria
• Most of the criteria increase as
additional inputs are added at
each step (exception: NC)
MNIST and
CIFAR-10
add
1000 adversarial
examples
Dave-2 add
700 synthetic
images
(by DeepXplore)
Chauffeur add
1000 synthetic
images
(by DeepTest)
13/15
Research questions and results (Con.)
Choose four sets of 100
images
Train existing models for
five additional epochs
Measure performance
4) Retraining guide
• Sampling from wider ranges
improves accuracy
• Best retraining performance: full
range
• SA can provide guidance
14/15
Conclusion
• SC and SA are good indicators of DL systems behavior
• SA is correlated with how difficult a DL system finds an input
• SC can be used to guide selection of inputs for effective training
• SA can classify adversarial examples accurately
15/15
Any Question?

More Related Content

What's hot

パターン認識と機械学習入門
パターン認識と機械学習入門パターン認識と機械学習入門
パターン認識と機械学習入門
Momoko Hayamizu
 
不均衡データのクラス分類
不均衡データのクラス分類不均衡データのクラス分類
不均衡データのクラス分類
Shintaro Fukushima
 
私は如何にして心配するのを止めてPyTorchを愛するようになったか
私は如何にして心配するのを止めてPyTorchを愛するようになったか私は如何にして心配するのを止めてPyTorchを愛するようになったか
私は如何にして心配するのを止めてPyTorchを愛するようになったか
Yuta Kashino
 

What's hot (20)

固有表現抽出について
固有表現抽出について固有表現抽出について
固有表現抽出について
 
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)
機械学習~データを予測に変える技術~で化学に挑む! (サイエンスアゴラ2021)
 
Gradient Tree Boosting はいいぞ
Gradient Tree Boosting はいいぞGradient Tree Boosting はいいぞ
Gradient Tree Boosting はいいぞ
 
RDRAモデリングを見てみよう
RDRAモデリングを見てみようRDRAモデリングを見てみよう
RDRAモデリングを見てみよう
 
リプシッツ連続性に基づく勾配法・ニュートン型手法の計算量解析
リプシッツ連続性に基づく勾配法・ニュートン型手法の計算量解析リプシッツ連続性に基づく勾配法・ニュートン型手法の計算量解析
リプシッツ連続性に基づく勾配法・ニュートン型手法の計算量解析
 
DataRobotによる予測モデルを用いた シミュレーションと最適化(抜粋)
DataRobotによる予測モデルを用いた シミュレーションと最適化(抜粋)DataRobotによる予測モデルを用いた シミュレーションと最適化(抜粋)
DataRobotによる予測モデルを用いた シミュレーションと最適化(抜粋)
 
【DL輪読会】Foundation Models for Decision Making: Problems, Methods, and Opportun...
【DL輪読会】Foundation Models for Decision Making: Problems, Methods, and Opportun...【DL輪読会】Foundation Models for Decision Making: Problems, Methods, and Opportun...
【DL輪読会】Foundation Models for Decision Making: Problems, Methods, and Opportun...
 
パターン認識と機械学習入門
パターン認識と機械学習入門パターン認識と機械学習入門
パターン認識と機械学習入門
 
不均衡データのクラス分類
不均衡データのクラス分類不均衡データのクラス分類
不均衡データのクラス分類
 
WBICによる混合正規分布の分離と抽出
WBICによる混合正規分布の分離と抽出WBICによる混合正規分布の分離と抽出
WBICによる混合正規分布の分離と抽出
 
Random Forestsとその応用
Random Forestsとその応用Random Forestsとその応用
Random Forestsとその応用
 
勾配ブースティングの基礎と最新の動向 (MIRU2020 Tutorial)
勾配ブースティングの基礎と最新の動向 (MIRU2020 Tutorial)勾配ブースティングの基礎と最新の動向 (MIRU2020 Tutorial)
勾配ブースティングの基礎と最新の動向 (MIRU2020 Tutorial)
 
機械学習の理論と実践
機械学習の理論と実践機械学習の理論と実践
機械学習の理論と実践
 
ひたすら楽してスライド作成
ひたすら楽してスライド作成ひたすら楽してスライド作成
ひたすら楽してスライド作成
 
【論文読み会】PiCO_Contrastive Label Disambiguation for Partial Label Learning.pptx
【論文読み会】PiCO_Contrastive Label Disambiguation for Partial Label Learning.pptx【論文読み会】PiCO_Contrastive Label Disambiguation for Partial Label Learning.pptx
【論文読み会】PiCO_Contrastive Label Disambiguation for Partial Label Learning.pptx
 
[DL輪読会]Understanding deep learning requires rethinking generalization
[DL輪読会]Understanding deep learning requires rethinking generalization[DL輪読会]Understanding deep learning requires rethinking generalization
[DL輪読会]Understanding deep learning requires rethinking generalization
 
Encoder-decoder 翻訳 (TISハンズオン資料)
Encoder-decoder 翻訳 (TISハンズオン資料)Encoder-decoder 翻訳 (TISハンズオン資料)
Encoder-decoder 翻訳 (TISハンズオン資料)
 
機械学習で泣かないためのコード設計 2018
機械学習で泣かないためのコード設計 2018機械学習で泣かないためのコード設計 2018
機械学習で泣かないためのコード設計 2018
 
ドメイン駆動設計 at DDD.rb #5
ドメイン駆動設計 at DDD.rb #5ドメイン駆動設計 at DDD.rb #5
ドメイン駆動設計 at DDD.rb #5
 
私は如何にして心配するのを止めてPyTorchを愛するようになったか
私は如何にして心配するのを止めてPyTorchを愛するようになったか私は如何にして心配するのを止めてPyTorchを愛するようになったか
私は如何にして心配するのを止めてPyTorchを愛するようになったか
 

Similar to Surprise Adequacy for Deep Learning Systems (SADL)

Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
Autonomous Systems: How to Address the Dilemma between Autonomy and SafetyAutonomous Systems: How to Address the Dilemma between Autonomy and Safety
Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
Lionel Briand
 
Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and...
Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and...Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and...
Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and...
Lionel Briand
 
Applications of Search-based Software Testing to Trustworthy Artificial Intel...
Applications of Search-based Software Testing to Trustworthy Artificial Intel...Applications of Search-based Software Testing to Trustworthy Artificial Intel...
Applications of Search-based Software Testing to Trustworthy Artificial Intel...
Lionel Briand
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
Sanghamitra Deb
 
From ensembles to computer networks
From ensembles to computer networksFrom ensembles to computer networks
From ensembles to computer networks
CSIRO
 

Similar to Surprise Adequacy for Deep Learning Systems (SADL) (20)

Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
Autonomous Systems: How to Address the Dilemma between Autonomy and SafetyAutonomous Systems: How to Address the Dilemma between Autonomy and Safety
Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
 
Icann2018ppt final
Icann2018ppt finalIcann2018ppt final
Icann2018ppt final
 
Testing Machine Learning-enabled Systems: A Personal Perspective
Testing Machine Learning-enabled Systems: A Personal PerspectiveTesting Machine Learning-enabled Systems: A Personal Perspective
Testing Machine Learning-enabled Systems: A Personal Perspective
 
Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and...
Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and...Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and...
Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and...
 
Applications of Search-based Software Testing to Trustworthy Artificial Intel...
Applications of Search-based Software Testing to Trustworthy Artificial Intel...Applications of Search-based Software Testing to Trustworthy Artificial Intel...
Applications of Search-based Software Testing to Trustworthy Artificial Intel...
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
 
MLSEV Virtual. Searching for Anomalies
MLSEV Virtual. Searching for AnomaliesMLSEV Virtual. Searching for Anomalies
MLSEV Virtual. Searching for Anomalies
 
Algorithm evaluation using item response theory
Algorithm evaluation using item response theoryAlgorithm evaluation using item response theory
Algorithm evaluation using item response theory
 
Ista presentation-dga
Ista presentation-dgaIsta presentation-dga
Ista presentation-dga
 
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
 
IDS for IoT.pptx
IDS for IoT.pptxIDS for IoT.pptx
IDS for IoT.pptx
 
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
 
From ensembles to computer networks
From ensembles to computer networksFrom ensembles to computer networks
From ensembles to computer networks
 
Kaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyKaggle Gold Medal Case Study
Kaggle Gold Medal Case Study
 
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
[IUI 2017] Criteria Chains: A Novel Multi-Criteria Recommendation Approach
 
Distributed Monte Carlo Feature Selection: Extracting Informative Features Ou...
Distributed Monte Carlo Feature Selection: Extracting Informative Features Ou...Distributed Monte Carlo Feature Selection: Extracting Informative Features Ou...
Distributed Monte Carlo Feature Selection: Extracting Informative Features Ou...
 
Long-term Face Tracking in the Wild using Deep Learning
Long-term Face Tracking in the Wild using Deep LearningLong-term Face Tracking in the Wild using Deep Learning
Long-term Face Tracking in the Wild using Deep Learning
 
Bangla Hand Written Digit Recognition presentation slide .pptx
Bangla Hand Written Digit Recognition presentation slide .pptxBangla Hand Written Digit Recognition presentation slide .pptx
Bangla Hand Written Digit Recognition presentation slide .pptx
 
Presentazione Tesi Laurea Triennale in Informatica
Presentazione Tesi Laurea Triennale in InformaticaPresentazione Tesi Laurea Triennale in Informatica
Presentazione Tesi Laurea Triennale in Informatica
 
DutchMLSchool 2022 - History and Developments in ML
DutchMLSchool 2022 - History and Developments in MLDutchMLSchool 2022 - History and Developments in ML
DutchMLSchool 2022 - History and Developments in ML
 

Recently uploaded

Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 

Surprise Adequacy for Deep Learning Systems (SADL)

  • 1. Guiding Deep Learning System Testing Using Surprise Adequacy Authors: Jinhan Kim, Robert Feldt, Shin Yoo Presented by Fatemeh Ghorbani 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE)
  • 2. Outline • Introduction ▫ Statement of the problem ▫ Related works ▫ Surprise Adequacy for Deep Learning Systems (SADL) • SADL measurement ▫ Surprise Adequacy (SA) ▫ Likelihood-based SA (LSA) ▫ Distance-based SA (DSA) ▫ Surprise Coverage (SC) • Experimental setup • Research questions and results • Conclusion 1/15
  • 3. Introduction • Statement of the problem • Related works • Surprise Adequacy for Deep Learning Systems (SADL) 2/15
  • 4. Statement of the problem • Unexpected behaviors of deep learning (DL) systems ▫ Adversarial examples • Essential need to verify behaviors • Testing DL systems correctness 3/15
  • 5. • DeepTest • DeepXplore ▫ Neuron Coverage (NC) • Major limitation ▫ Convey little information ▫ Discretization Related works and their limitations 4/15
  • 6. SADL • Based on the behavior of DL systems ▫ With respect to training data ▫ Data-flow • The actual measure of surprise: ▫ The likelihood of new input and training data ▫ The distance between activation trace vectors of new input and training data • Quantitively measurement 5/15
  • 7. Surprise Adequacy (SA) • Activation traces of inputs and training data over neurons in N: • Compare them Fully captures the behaviors of the DL system 6/15
  • 8. Likelihood-based SA (LSA) • Applies KDE to estimate the probability density of each activation value: • Obtains the surprise of a new input • To reduce computational cost: consider layer L activation trace 7/15
  • 9. Distance-based SA (DSA) • Use the distances between activation traces as the measure of surprise • Only apply DSA for classification task 8/15
  • 10. Surprise Coverage (SC) • SC can only be measured with respect to predefined upper band • Sense of redundancy is weaker LSA and DSA (continuous) bucketing (discretise) LSC and DSC 9/15
  • 11. Experimental setup • Data sets and DL systems: ▫ MNIST, CIFAR-10 ▫ Self-driving car challenge ▫ Pre-trained Dave-2 and Chauffeur model ▫ Evaluation of SADL accuracy: CNN and MSE • Adversarial examples and synthetic inputs ▫ Five attack strategies ▫ DeepXplore, DeepTest 10/15
  • 12. Research questions and results 1) SADL capability of capturing the relative surprise ▫ Training adversarial example classifier using logistic regression ▫ Quantitatively and visually: SADL can measure how surprising the input is ▫ DSA from specific layer: produce higher accuracy ▫ Inputs with higher SA: harder to classify ▫ Adversarial examples: higher SA value 10000 adversarial examples 1000 for training 9000 for evaluation 11/15
  • 13. Research questions and results (Con.) 2) Selection of layer impact on accuracy ▫ With LSA: there is no strong evidence ▫ With DSA: deepest layer produces the most accurate classifier ▫ The layer sensitivity varies across different attack strategies 12/15
  • 14. Research questions and results (Con.) 3) Correlation between SC and other criteria • Most of the criteria increase as additional inputs are added at each step (exception: NC) MNIST and CIFAR-10 add 1000 adversarial examples Dave-2 add 700 synthetic images (by DeepXplore) Chauffeur add 1000 synthetic images (by DeepTest) 13/15
  • 15. Research questions and results (Con.) Choose four sets of 100 images Train existing models for five additional epochs Measure performance 4) Retraining guide • Sampling from wider ranges improves accuracy • Best retraining performance: full range • SA can provide guidance 14/15
  • 16. Conclusion • SC and SA are good indicators of DL systems behavior • SA is correlated with how difficult a DL system finds an input • SC can be used to guide selection of inputs for effective training • SA can classify adversarial examples accurately 15/15

Editor's Notes

  1. How presentation will benefit audience: Adult learners are more interested in a subject if they know how or why it is important to them. Presenter’s level of expertise in the subject: Briefly state your credentials in this area, or explain why participants should listen to you.
  2. Example objectives At the end of this lesson, you will be able to: Save files to the team Web server. Move files to different locations on the team Web server. Share files on the team Web server.