SlideShare a Scribd company logo
1 of 32
Download to read offline
COMPARING INCREMENTAL LEARNING STRATEGIES
FOR
CONVOLUTIONAL NEURAL NETWORKS
Vincenzo Lomonaco & Davide Maltoni
{vincenzo.lomonaco, davide.maltoni}@unibo.it
Department of Computer Science and Engineering – DISI
University of Bologna
OUTLINE
1. Introduction
• CNNs and current limitations
• Incremental learning: Why?
2. Incremental learning Strategies
for CNNs
• Definitions
• Possible instantiations used during
the experimentations
3. Datasets
• iCubWorld28
• BigBrother
4. Experiments and Results
• Exp. design
• Results analysis
5. Conclusions and Future Works
OUTLINE
1. Introduction
• CNNs and current limitations
• Incremental learning: Why?
2. Incremental learning Strategies
for CNNs
• Definitions
• Possible instantiations used during
the experimentations
3. Datasets
• iCubWorld28
• BigBrother
4. Experiments and Results
• Exp. design
• Results analysis
5. Conclusions and Future Works
INTRODUCTION – CNNs and Current Limitations
State-of-the-art algorithm for many
tasks in CV, NLP, SR, etc..
Very general and adaptive
Works directy on raw data (no
hand-engineered features required)
Computational demanding
Tricky hyper-parametrization
Applicability in Incremental
Learning Scenario?
INTRODUCTION – Incremental learnig: Why?
𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛
. . .
Initial Batch Incremental Batches

INTRODUCTION – Incremental learnig: Why?
Constraints:
• Memory: We can’t afford to keep in memory all the batches.
• Computational power: We can’t afford to train our classification model from
scratch after each batch.
𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛
. . .

Initial Batch Incremental Batches
INTRODUCTION – Incremental learnig: Why?
Goal:
• Maximize the Accuracy % after each batch
• Going towards a more smooth and natural learning but still using CNNs
𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛
. . .

Initial Batch Incremental Batches
INTRODUCTION – Incremental learnig: Why?
𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛
. . .
𝑀0
INTRODUCTION – Incremental learnig: Why?
𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛
. . .
𝑀0
𝑀1
• We can free the memory
occupied by 𝐵𝑎𝑡𝑐ℎ0 and
get 𝑀1 just by updating
𝑀0 with the new coming
batch
• However, we risk to forget
what we’ve previously
learned
INTRODUCTION – Incremental learnig: Why?
𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛
. . .
𝑀0
𝑀1
INTRODUCTION – Incremental learnig: Why?
𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛
. . .
𝑀0
𝑀1
INTRODUCTION – Incremental learnig: Why?
𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛
. . .
𝑀0
𝑀1
. . . 𝑀 𝑛
OUTLINE
1. Introduction
• CNNs and current limitations
• Incremental learning: Why?
2. Incremental learning Strategies
for CNNs
• Definitions
• Possible instantiations used during
the experimentations
3. Datasets
• iCubWorld28
• BigBrother
4. Experiments and Results
• Exp. design
• Results analysis
5. Conclusions and Future Works
INC. LEARNING STRATEGIES FOR CNNS - Definitions
The different possibilities we explored to deal with an incremental
tuning/learning scenario, can be conveniently framed in three main strategies:
1. Training/tuning an ad hoc CNN architecture suitable for the problem.
2. Using an already trained CNN as a fixed feature extractor in conjunction with
an incremental classifier.
3. Fine-tuning an already trained CNN.
INC. LEARNING STRATEGIES FOR CNNS - Instantiations
In our experiments (with focus on image classification) we tested three instantiations of the
aforementioned strategies, respectively:
1. (Ad-hoc arch.)  LeNet7
Consists of the classical “LeNet7” proposed by Yan LeCun in 2004. Still competitive on low/medium
scale problems.
2. (CNN-fixed w. inc. Classifier)  CaffeNet + SVM
Consists of a pre-trained CNN provided in the Caffe library (“BVLC Reference CaffeNet”, based on
the “AlexNet” architecture; An incremental and linear SVM as Classifier.
3. (CNN-Finetuning)  CaffeNet + FT
Consists again of the “BVLC Reference CaffeNet” but instead of using it as a fixed feature extractor the
network is fine-tuned to suit the new task.
INC. LEARNING STRATEGIES FOR CNNS - Instantiations
Furthermore, for the “BigBrother” dataset we decided to test an additional pair of strategies:
4. (CNN-fixed w. inc. Classifier)  VGG_Face + SVM
Consists of a pre-trained (16-levels) CNN called “VGG Face” which has been trained on a very large
dataset of faces (2,622 Subjects and 2.6M images); Again, a incremental and linear SVM as Classifier.
5. (CNN-Finetuning)  VGG_Face + FT
Consists again of the “VGG_Face” CNN but instead of using it as a fixed feature extractor the network
is fine-tuned to suit the new task.
OUTLINE
1. Introduction
• CNNs and current limitations
• Incremental learning: Why?
2. Incremental Learning Strategies
for CNNs
• Definitions
• Possible instantiations used during
the experimentations
3. Datasets
• iCubWorld28
• BigBrother
4. Experiments and Results
• Exp. design
• Results analysis
5. Conclusions and Future Works
DATASETS
We were interested in datasets where:
• The objects of interest have been acquired in a number of successive sessions
• The environmental condition can change among the sessions.
We focused on two applicative fields where incremental learning is very relevant
(robotics and biometrics) and chose two datasets respectively:
• iCubWorld28
• BigBrother
DATASETS – iCubWorld28
Key Features:
• Img size: 128×128
• Num. classes: 7 (× 4 obj)
• Tot. imgs: 39,693
• Num batches: 9 +1 (test)
DATASETS – BigBrother
Key Features:
• Img size: 70×70
• Num. classes: 7
• Tot. imgs: 23,842
• Num batches: 56 +1 (test)
OUTLINE
1. Introduction
• CNNs and current limitations
• Incremental learning: Why?
2. Incremental Learning Strategies
for CNNs
• Definitions
• Possible instantiations used during
the experimentations
3. Datasets
• iCubWorld28
• BigBrother
4. Experiments and Results
• Exp. design
• Results analysis
5. Conclusions and Future Works
EXPERIMENTS AND RESULTS – Exp. Design
Experiments Policy:
• We trained the models until full convergence on the first batch of data
• We tuned them on the successive incremental batches, trying to balance the trade-off
between accuracy gain and forgetting.
EXPERIMENTS AND RESULTS – iCubWorld28 Results
EXPERIMENTS AND RESULTS – iCubWorld28 Results
• CaffeNet + SVM has a very
good recognition rate
increment
• CaffeNet + FT is the most
effective
• LeNet7 struggles to learn
complex invariant features
necessary for this problem
EXPERIMENTS AND RESULTS – BigBrother Results
EXPERIMENTS AND RESULTS – BigBrother Results
• LeNet7 model performs
slightly better than CaffeNet
+ SVM or CaffeNet + FT
• VGG_Face + SVM and
VGG_Face + FT have
impressive performance on
this problem
• VGG_Face + SVM seems to
be the best choice both for
the accuracy and the
stability
EXPERIMENTS AND RESULTS – Dealing with Forgetting
EXPERIMENTS AND RESULTS – Dealing with Forgetting
• An adjustable learning
rate is significantly more
stable
• A simple thresholding
approach has been used.
• We did not found any
significant difference using
a continuous approach
OUTLINE
1. Introduction
• CNNs and current limitations
• Incremental learning: Why?
2. Incremental Learning Strategies
for CNNs
• Definitions
• Possible instantiations used during
the experimentations
3. Datasets
• iCubWorld28
• BigBrother
4. Experiments and Results
• Exp. design
• Results analysis
5. Conclusions and Future Works
CONCLUSIONS AND FUTURE WORKS
• When possible (i.e., transfer learning from the same domain), it is preferable to use
CNN as a fixed feature extractor to feed an incremental classifier
• If the features are not optimized, the tuning of low level layers may be preferable
and the learning strength can be used to control forgetting.
• Training a CNN from scratch can be advantageous if the problem patterns (and
feature invariances) are highly specific and a sufficient number of samples are
available.
CONCLUSIONS AND FUTURE WORKS
In the near future we plan to extend this work by:
• Performing a more extensive experimental evaluation
• Finding a more principled way to control forgetting and adapting the tuning
parameters to the size (and bias) of each incremental batch.
• Studying real-world applications of semi-supervised incremental learning strategies
for CNNs.
COMPARING INCREMENTAL LEARNING STRATEGIES FOR
CONVOLUTIONAL NEURAL NETWORKS
Vincenzo Lomonaco & Davide Maltoni
{vincenzo.lomonaco, davide.maltoni}@unibo.it
Department of Computer Science and Engineering – DISI
University of Bologna
Thank you for your attention.
Any Questions?

More Related Content

What's hot

Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep LearningOleg Mygryn
 
Introduction to MAML (Model Agnostic Meta Learning) with Discussions
Introduction to MAML (Model Agnostic Meta Learning) with DiscussionsIntroduction to MAML (Model Agnostic Meta Learning) with Discussions
Introduction to MAML (Model Agnostic Meta Learning) with DiscussionsJoonyoung Yi
 
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep NetworksModel-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep NetworksYoonho Lee
 
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...MLAI2
 
[CVPR2022, LongVersion] Online Continual Learning on a Contaminated Data Stre...
[CVPR2022, LongVersion] Online Continual Learning on a Contaminated Data Stre...[CVPR2022, LongVersion] Online Continual Learning on a Contaminated Data Stre...
[CVPR2022, LongVersion] Online Continual Learning on a Contaminated Data Stre...Jihwan Bang
 
Continual Learning: why, how, and when
Continual Learning: why, how, and whenContinual Learning: why, how, and when
Continual Learning: why, how, and whenGabriele Graffieti
 
Self-supervised Learning Lecture Note
Self-supervised Learning Lecture NoteSelf-supervised Learning Lecture Note
Self-supervised Learning Lecture NoteSangwoo Mo
 
PR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networks
PR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networksPR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networks
PR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networksTaesu Kim
 
Overcoming catastrophic forgetting in neural network
Overcoming catastrophic forgetting in neural networkOvercoming catastrophic forgetting in neural network
Overcoming catastrophic forgetting in neural networkKaty Lee
 
Masked Autoencoders Are Scalable Vision Learners.pptx
Masked Autoencoders Are Scalable Vision Learners.pptxMasked Autoencoders Are Scalable Vision Learners.pptx
Masked Autoencoders Are Scalable Vision Learners.pptxSangmin Woo
 
Deep learning - what is it and why now?
Deep learning - what is it and why now?Deep learning - what is it and why now?
Deep learning - what is it and why now?Natalia Konstantinova
 
introduction to deep Learning with full detail
introduction to deep Learning with full detailintroduction to deep Learning with full detail
introduction to deep Learning with full detailsonykhan3
 
Deep Learning - RNN and CNN
Deep Learning - RNN and CNNDeep Learning - RNN and CNN
Deep Learning - RNN and CNNPradnya Saval
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep LearningJulien SIMON
 
Intro to deep learning
Intro to deep learning Intro to deep learning
Intro to deep learning David Voyles
 
Transfer learning-presentation
Transfer learning-presentationTransfer learning-presentation
Transfer learning-presentationBushra Jbawi
 
Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...
Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...
Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...Universitat Politècnica de Catalunya
 
Few shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learningFew shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learningﺁﺻﻒ ﻋﻠﯽ ﻣﯿﺮ
 

What's hot (20)

Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep Learning
 
Meta-Learning Presentation
Meta-Learning PresentationMeta-Learning Presentation
Meta-Learning Presentation
 
Introduction to MAML (Model Agnostic Meta Learning) with Discussions
Introduction to MAML (Model Agnostic Meta Learning) with DiscussionsIntroduction to MAML (Model Agnostic Meta Learning) with Discussions
Introduction to MAML (Model Agnostic Meta Learning) with Discussions
 
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep NetworksModel-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
 
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
 
[CVPR2022, LongVersion] Online Continual Learning on a Contaminated Data Stre...
[CVPR2022, LongVersion] Online Continual Learning on a Contaminated Data Stre...[CVPR2022, LongVersion] Online Continual Learning on a Contaminated Data Stre...
[CVPR2022, LongVersion] Online Continual Learning on a Contaminated Data Stre...
 
Continual Learning: why, how, and when
Continual Learning: why, how, and whenContinual Learning: why, how, and when
Continual Learning: why, how, and when
 
Self-supervised Learning Lecture Note
Self-supervised Learning Lecture NoteSelf-supervised Learning Lecture Note
Self-supervised Learning Lecture Note
 
PR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networks
PR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networksPR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networks
PR12-094: Model-Agnostic Meta-Learning for fast adaptation of deep networks
 
Overcoming catastrophic forgetting in neural network
Overcoming catastrophic forgetting in neural networkOvercoming catastrophic forgetting in neural network
Overcoming catastrophic forgetting in neural network
 
Masked Autoencoders Are Scalable Vision Learners.pptx
Masked Autoencoders Are Scalable Vision Learners.pptxMasked Autoencoders Are Scalable Vision Learners.pptx
Masked Autoencoders Are Scalable Vision Learners.pptx
 
Deep learning - what is it and why now?
Deep learning - what is it and why now?Deep learning - what is it and why now?
Deep learning - what is it and why now?
 
introduction to deep Learning with full detail
introduction to deep Learning with full detailintroduction to deep Learning with full detail
introduction to deep Learning with full detail
 
Deep Learning - RNN and CNN
Deep Learning - RNN and CNNDeep Learning - RNN and CNN
Deep Learning - RNN and CNN
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep Learning
 
Intro to deep learning
Intro to deep learning Intro to deep learning
Intro to deep learning
 
Transfer learning-presentation
Transfer learning-presentationTransfer learning-presentation
Transfer learning-presentation
 
Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...
Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...
Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...
 
Few shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learningFew shot learning/ one shot learning/ machine learning
Few shot learning/ one shot learning/ machine learning
 
Transfer Learning (D2L4 Insight@DCU Machine Learning Workshop 2017)
Transfer Learning (D2L4 Insight@DCU Machine Learning Workshop 2017)Transfer Learning (D2L4 Insight@DCU Machine Learning Workshop 2017)
Transfer Learning (D2L4 Insight@DCU Machine Learning Workshop 2017)
 

Similar to Comparing Incremental Learning Strategies for Convolutional Neural Networks

Large Scale GAN Training for High Fidelity Natural Image Synthesis
Large Scale GAN Training for High Fidelity Natural Image SynthesisLarge Scale GAN Training for High Fidelity Natural Image Synthesis
Large Scale GAN Training for High Fidelity Natural Image SynthesisSeunghyun Hwang
 
Centertrack and naver airush 2020 review
Centertrack and naver airush 2020 reviewCentertrack and naver airush 2020 review
Centertrack and naver airush 2020 review경훈 김
 
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...Aalto University
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingSangwoo Mo
 
ResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksSeunghyun Hwang
 
Transfer Learning: Breve introducción a modelos pre-entrenados.
Transfer Learning: Breve introducción a modelos pre-entrenados.Transfer Learning: Breve introducción a modelos pre-entrenados.
Transfer Learning: Breve introducción a modelos pre-entrenados.Fernando Constantino
 
SYNOPSIS on Parse representation and Linear SVM.
SYNOPSIS on Parse representation and Linear SVM.SYNOPSIS on Parse representation and Linear SVM.
SYNOPSIS on Parse representation and Linear SVM.bhavinecindus
 
Experimental Design for Distributed Machine Learning with Myles Baker
Experimental Design for Distributed Machine Learning with Myles BakerExperimental Design for Distributed Machine Learning with Myles Baker
Experimental Design for Distributed Machine Learning with Myles BakerDatabricks
 
Advanced Optimization for the Enterprise Webinar
Advanced Optimization for the Enterprise WebinarAdvanced Optimization for the Enterprise Webinar
Advanced Optimization for the Enterprise WebinarSigOpt
 
Using SigOpt to Tune Deep Learning Models with Nervana Cloud
Using SigOpt to Tune Deep Learning Models with Nervana CloudUsing SigOpt to Tune Deep Learning Models with Nervana Cloud
Using SigOpt to Tune Deep Learning Models with Nervana CloudSigOpt
 
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Strata 2016 -  Lessons Learned from building real-life Machine Learning SystemsStrata 2016 -  Lessons Learned from building real-life Machine Learning Systems
Strata 2016 - Lessons Learned from building real-life Machine Learning SystemsXavier Amatriain
 
Deep learning summary
Deep learning summaryDeep learning summary
Deep learning summaryankit_ppt
 
Bag of tricks for image classification with convolutional neural networks r...
Bag of tricks for image classification with convolutional neural networks   r...Bag of tricks for image classification with convolutional neural networks   r...
Bag of tricks for image classification with convolutional neural networks r...Dongmin Choi
 
Revisiting the Notion of Diversity in Software Testing
Revisiting the Notion of Diversity in Software TestingRevisiting the Notion of Diversity in Software Testing
Revisiting the Notion of Diversity in Software TestingLionel Briand
 
a deep reinforced model for abstractive summarization
a deep reinforced model for abstractive summarizationa deep reinforced model for abstractive summarization
a deep reinforced model for abstractive summarizationJEE HYUN PARK
 
Benchmarking Automated Machine Learning For Clustering
Benchmarking Automated Machine Learning For ClusteringBenchmarking Automated Machine Learning For Clustering
Benchmarking Automated Machine Learning For Clusteringbiagiolicari7
 
Discovering the New SuccessFactors LMS Admin Features
Discovering the New SuccessFactors LMS Admin FeaturesDiscovering the New SuccessFactors LMS Admin Features
Discovering the New SuccessFactors LMS Admin FeaturesAshton Plusquellec
 
Presentation of master thesis
Presentation of master thesisPresentation of master thesis
Presentation of master thesisSeoung-Ho Choi
 

Similar to Comparing Incremental Learning Strategies for Convolutional Neural Networks (20)

Large Scale GAN Training for High Fidelity Natural Image Synthesis
Large Scale GAN Training for High Fidelity Natural Image SynthesisLarge Scale GAN Training for High Fidelity Natural Image Synthesis
Large Scale GAN Training for High Fidelity Natural Image Synthesis
 
Centertrack and naver airush 2020 review
Centertrack and naver airush 2020 reviewCentertrack and naver airush 2020 review
Centertrack and naver airush 2020 review
 
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language Processing
 
nnUNet
nnUNetnnUNet
nnUNet
 
ResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksResNeSt: Split-Attention Networks
ResNeSt: Split-Attention Networks
 
Transfer Learning: Breve introducción a modelos pre-entrenados.
Transfer Learning: Breve introducción a modelos pre-entrenados.Transfer Learning: Breve introducción a modelos pre-entrenados.
Transfer Learning: Breve introducción a modelos pre-entrenados.
 
SYNOPSIS on Parse representation and Linear SVM.
SYNOPSIS on Parse representation and Linear SVM.SYNOPSIS on Parse representation and Linear SVM.
SYNOPSIS on Parse representation and Linear SVM.
 
Experimental Design for Distributed Machine Learning with Myles Baker
Experimental Design for Distributed Machine Learning with Myles BakerExperimental Design for Distributed Machine Learning with Myles Baker
Experimental Design for Distributed Machine Learning with Myles Baker
 
Advanced Optimization for the Enterprise Webinar
Advanced Optimization for the Enterprise WebinarAdvanced Optimization for the Enterprise Webinar
Advanced Optimization for the Enterprise Webinar
 
Using SigOpt to Tune Deep Learning Models with Nervana Cloud
Using SigOpt to Tune Deep Learning Models with Nervana CloudUsing SigOpt to Tune Deep Learning Models with Nervana Cloud
Using SigOpt to Tune Deep Learning Models with Nervana Cloud
 
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Strata 2016 -  Lessons Learned from building real-life Machine Learning SystemsStrata 2016 -  Lessons Learned from building real-life Machine Learning Systems
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
 
Spoofing-aware Attention Back-end with Multiple Enrollment and Novel Trials S...
Spoofing-aware Attention Back-end with Multiple Enrollment and Novel Trials S...Spoofing-aware Attention Back-end with Multiple Enrollment and Novel Trials S...
Spoofing-aware Attention Back-end with Multiple Enrollment and Novel Trials S...
 
Deep learning summary
Deep learning summaryDeep learning summary
Deep learning summary
 
Bag of tricks for image classification with convolutional neural networks r...
Bag of tricks for image classification with convolutional neural networks   r...Bag of tricks for image classification with convolutional neural networks   r...
Bag of tricks for image classification with convolutional neural networks r...
 
Revisiting the Notion of Diversity in Software Testing
Revisiting the Notion of Diversity in Software TestingRevisiting the Notion of Diversity in Software Testing
Revisiting the Notion of Diversity in Software Testing
 
a deep reinforced model for abstractive summarization
a deep reinforced model for abstractive summarizationa deep reinforced model for abstractive summarization
a deep reinforced model for abstractive summarization
 
Benchmarking Automated Machine Learning For Clustering
Benchmarking Automated Machine Learning For ClusteringBenchmarking Automated Machine Learning For Clustering
Benchmarking Automated Machine Learning For Clustering
 
Discovering the New SuccessFactors LMS Admin Features
Discovering the New SuccessFactors LMS Admin FeaturesDiscovering the New SuccessFactors LMS Admin Features
Discovering the New SuccessFactors LMS Admin Features
 
Presentation of master thesis
Presentation of master thesisPresentation of master thesis
Presentation of master thesis
 

More from Vincenzo Lomonaco

2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdfVincenzo Lomonaco
 
Toward Continual Learning on the Edge
Toward Continual Learning on the EdgeToward Continual Learning on the Edge
Toward Continual Learning on the EdgeVincenzo Lomonaco
 
Continual Learning: Another Step Towards Truly Intelligent Machines
Continual Learning: Another Step Towards Truly Intelligent MachinesContinual Learning: Another Step Towards Truly Intelligent Machines
Continual Learning: Another Step Towards Truly Intelligent MachinesVincenzo Lomonaco
 
Continual Reinforcement Learning in 3D Non-stationary Environments
Continual Reinforcement Learning in 3D Non-stationary EnvironmentsContinual Reinforcement Learning in 3D Non-stationary Environments
Continual Reinforcement Learning in 3D Non-stationary EnvironmentsVincenzo Lomonaco
 
Continual Learning for Robotics
Continual Learning for RoboticsContinual Learning for Robotics
Continual Learning for RoboticsVincenzo Lomonaco
 
Don't forget, there is more than forgetting: new metrics for Continual Learni...
Don't forget, there is more than forgetting: new metrics for Continual Learni...Don't forget, there is more than forgetting: new metrics for Continual Learni...
Don't forget, there is more than forgetting: new metrics for Continual Learni...Vincenzo Lomonaco
 
Open-Source Frameworks for Deep Learning: an Overview
Open-Source Frameworks for Deep Learning: an OverviewOpen-Source Frameworks for Deep Learning: an Overview
Open-Source Frameworks for Deep Learning: an OverviewVincenzo Lomonaco
 
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...Vincenzo Lomonaco
 
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...Vincenzo Lomonaco
 
Continuous Learning with Deep Architectures
Continuous Learning with Deep ArchitecturesContinuous Learning with Deep Architectures
Continuous Learning with Deep ArchitecturesVincenzo Lomonaco
 
CORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
CORe50: a New Dataset and Benchmark for Continuous Object Recognition PosterCORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
CORe50: a New Dataset and Benchmark for Continuous Object Recognition PosterVincenzo Lomonaco
 
Continuous Unsupervised Training of Deep Architectures
Continuous Unsupervised Training of Deep ArchitecturesContinuous Unsupervised Training of Deep Architectures
Continuous Unsupervised Training of Deep ArchitecturesVincenzo Lomonaco
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Vincenzo Lomonaco
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Vincenzo Lomonaco
 
A Framework for Deadlock Detection in Java
A Framework for Deadlock Detection in JavaA Framework for Deadlock Detection in Java
A Framework for Deadlock Detection in JavaVincenzo Lomonaco
 
Deep Learning libraries and first experiments with Theano
Deep Learning libraries and first experiments with TheanoDeep Learning libraries and first experiments with Theano
Deep Learning libraries and first experiments with TheanoVincenzo Lomonaco
 
Word2vec on the italian language: first experiments
Word2vec on the italian language: first experimentsWord2vec on the italian language: first experiments
Word2vec on the italian language: first experimentsVincenzo Lomonaco
 
Machine Learning for Automated Reasoning: An Overview
Machine Learning for Automated Reasoning: An OverviewMachine Learning for Automated Reasoning: An Overview
Machine Learning for Automated Reasoning: An OverviewVincenzo Lomonaco
 

More from Vincenzo Lomonaco (19)

2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
 
Toward Continual Learning on the Edge
Toward Continual Learning on the EdgeToward Continual Learning on the Edge
Toward Continual Learning on the Edge
 
Continual Learning: Another Step Towards Truly Intelligent Machines
Continual Learning: Another Step Towards Truly Intelligent MachinesContinual Learning: Another Step Towards Truly Intelligent Machines
Continual Learning: Another Step Towards Truly Intelligent Machines
 
Tutorial inns2019 full
Tutorial inns2019 fullTutorial inns2019 full
Tutorial inns2019 full
 
Continual Reinforcement Learning in 3D Non-stationary Environments
Continual Reinforcement Learning in 3D Non-stationary EnvironmentsContinual Reinforcement Learning in 3D Non-stationary Environments
Continual Reinforcement Learning in 3D Non-stationary Environments
 
Continual Learning for Robotics
Continual Learning for RoboticsContinual Learning for Robotics
Continual Learning for Robotics
 
Don't forget, there is more than forgetting: new metrics for Continual Learni...
Don't forget, there is more than forgetting: new metrics for Continual Learni...Don't forget, there is more than forgetting: new metrics for Continual Learni...
Don't forget, there is more than forgetting: new metrics for Continual Learni...
 
Open-Source Frameworks for Deep Learning: an Overview
Open-Source Frameworks for Deep Learning: an OverviewOpen-Source Frameworks for Deep Learning: an Overview
Open-Source Frameworks for Deep Learning: an Overview
 
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
 
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
 
Continuous Learning with Deep Architectures
Continuous Learning with Deep ArchitecturesContinuous Learning with Deep Architectures
Continuous Learning with Deep Architectures
 
CORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
CORe50: a New Dataset and Benchmark for Continuous Object Recognition PosterCORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
CORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
 
Continuous Unsupervised Training of Deep Architectures
Continuous Unsupervised Training of Deep ArchitecturesContinuous Unsupervised Training of Deep Architectures
Continuous Unsupervised Training of Deep Architectures
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
 
A Framework for Deadlock Detection in Java
A Framework for Deadlock Detection in JavaA Framework for Deadlock Detection in Java
A Framework for Deadlock Detection in Java
 
Deep Learning libraries and first experiments with Theano
Deep Learning libraries and first experiments with TheanoDeep Learning libraries and first experiments with Theano
Deep Learning libraries and first experiments with Theano
 
Word2vec on the italian language: first experiments
Word2vec on the italian language: first experimentsWord2vec on the italian language: first experiments
Word2vec on the italian language: first experiments
 
Machine Learning for Automated Reasoning: An Overview
Machine Learning for Automated Reasoning: An OverviewMachine Learning for Automated Reasoning: An Overview
Machine Learning for Automated Reasoning: An Overview
 

Recently uploaded

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad EscortsCall girls in Ahmedabad High profile
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 

Recently uploaded (20)

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 

Comparing Incremental Learning Strategies for Convolutional Neural Networks

  • 1. COMPARING INCREMENTAL LEARNING STRATEGIES FOR CONVOLUTIONAL NEURAL NETWORKS Vincenzo Lomonaco & Davide Maltoni {vincenzo.lomonaco, davide.maltoni}@unibo.it Department of Computer Science and Engineering – DISI University of Bologna
  • 2. OUTLINE 1. Introduction • CNNs and current limitations • Incremental learning: Why? 2. Incremental learning Strategies for CNNs • Definitions • Possible instantiations used during the experimentations 3. Datasets • iCubWorld28 • BigBrother 4. Experiments and Results • Exp. design • Results analysis 5. Conclusions and Future Works
  • 3. OUTLINE 1. Introduction • CNNs and current limitations • Incremental learning: Why? 2. Incremental learning Strategies for CNNs • Definitions • Possible instantiations used during the experimentations 3. Datasets • iCubWorld28 • BigBrother 4. Experiments and Results • Exp. design • Results analysis 5. Conclusions and Future Works
  • 4. INTRODUCTION – CNNs and Current Limitations State-of-the-art algorithm for many tasks in CV, NLP, SR, etc.. Very general and adaptive Works directy on raw data (no hand-engineered features required) Computational demanding Tricky hyper-parametrization Applicability in Incremental Learning Scenario?
  • 5. INTRODUCTION – Incremental learnig: Why? 𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛 . . . Initial Batch Incremental Batches 
  • 6. INTRODUCTION – Incremental learnig: Why? Constraints: • Memory: We can’t afford to keep in memory all the batches. • Computational power: We can’t afford to train our classification model from scratch after each batch. 𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛 . . .  Initial Batch Incremental Batches
  • 7. INTRODUCTION – Incremental learnig: Why? Goal: • Maximize the Accuracy % after each batch • Going towards a more smooth and natural learning but still using CNNs 𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛 . . .  Initial Batch Incremental Batches
  • 8. INTRODUCTION – Incremental learnig: Why? 𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛 . . . 𝑀0
  • 9. INTRODUCTION – Incremental learnig: Why? 𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛 . . . 𝑀0 𝑀1 • We can free the memory occupied by 𝐵𝑎𝑡𝑐ℎ0 and get 𝑀1 just by updating 𝑀0 with the new coming batch • However, we risk to forget what we’ve previously learned
  • 10. INTRODUCTION – Incremental learnig: Why? 𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛 . . . 𝑀0 𝑀1
  • 11. INTRODUCTION – Incremental learnig: Why? 𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛 . . . 𝑀0 𝑀1
  • 12. INTRODUCTION – Incremental learnig: Why? 𝐵𝑎𝑡𝑐ℎ0 𝐵𝑎𝑡𝑐ℎ1 𝐵𝑎𝑡𝑐ℎ 𝑛 . . . 𝑀0 𝑀1 . . . 𝑀 𝑛
  • 13. OUTLINE 1. Introduction • CNNs and current limitations • Incremental learning: Why? 2. Incremental learning Strategies for CNNs • Definitions • Possible instantiations used during the experimentations 3. Datasets • iCubWorld28 • BigBrother 4. Experiments and Results • Exp. design • Results analysis 5. Conclusions and Future Works
  • 14. INC. LEARNING STRATEGIES FOR CNNS - Definitions The different possibilities we explored to deal with an incremental tuning/learning scenario, can be conveniently framed in three main strategies: 1. Training/tuning an ad hoc CNN architecture suitable for the problem. 2. Using an already trained CNN as a fixed feature extractor in conjunction with an incremental classifier. 3. Fine-tuning an already trained CNN.
  • 15. INC. LEARNING STRATEGIES FOR CNNS - Instantiations In our experiments (with focus on image classification) we tested three instantiations of the aforementioned strategies, respectively: 1. (Ad-hoc arch.)  LeNet7 Consists of the classical “LeNet7” proposed by Yan LeCun in 2004. Still competitive on low/medium scale problems. 2. (CNN-fixed w. inc. Classifier)  CaffeNet + SVM Consists of a pre-trained CNN provided in the Caffe library (“BVLC Reference CaffeNet”, based on the “AlexNet” architecture; An incremental and linear SVM as Classifier. 3. (CNN-Finetuning)  CaffeNet + FT Consists again of the “BVLC Reference CaffeNet” but instead of using it as a fixed feature extractor the network is fine-tuned to suit the new task.
  • 16. INC. LEARNING STRATEGIES FOR CNNS - Instantiations Furthermore, for the “BigBrother” dataset we decided to test an additional pair of strategies: 4. (CNN-fixed w. inc. Classifier)  VGG_Face + SVM Consists of a pre-trained (16-levels) CNN called “VGG Face” which has been trained on a very large dataset of faces (2,622 Subjects and 2.6M images); Again, a incremental and linear SVM as Classifier. 5. (CNN-Finetuning)  VGG_Face + FT Consists again of the “VGG_Face” CNN but instead of using it as a fixed feature extractor the network is fine-tuned to suit the new task.
  • 17. OUTLINE 1. Introduction • CNNs and current limitations • Incremental learning: Why? 2. Incremental Learning Strategies for CNNs • Definitions • Possible instantiations used during the experimentations 3. Datasets • iCubWorld28 • BigBrother 4. Experiments and Results • Exp. design • Results analysis 5. Conclusions and Future Works
  • 18. DATASETS We were interested in datasets where: • The objects of interest have been acquired in a number of successive sessions • The environmental condition can change among the sessions. We focused on two applicative fields where incremental learning is very relevant (robotics and biometrics) and chose two datasets respectively: • iCubWorld28 • BigBrother
  • 19. DATASETS – iCubWorld28 Key Features: • Img size: 128×128 • Num. classes: 7 (× 4 obj) • Tot. imgs: 39,693 • Num batches: 9 +1 (test)
  • 20. DATASETS – BigBrother Key Features: • Img size: 70×70 • Num. classes: 7 • Tot. imgs: 23,842 • Num batches: 56 +1 (test)
  • 21. OUTLINE 1. Introduction • CNNs and current limitations • Incremental learning: Why? 2. Incremental Learning Strategies for CNNs • Definitions • Possible instantiations used during the experimentations 3. Datasets • iCubWorld28 • BigBrother 4. Experiments and Results • Exp. design • Results analysis 5. Conclusions and Future Works
  • 22. EXPERIMENTS AND RESULTS – Exp. Design Experiments Policy: • We trained the models until full convergence on the first batch of data • We tuned them on the successive incremental batches, trying to balance the trade-off between accuracy gain and forgetting.
  • 23. EXPERIMENTS AND RESULTS – iCubWorld28 Results
  • 24. EXPERIMENTS AND RESULTS – iCubWorld28 Results • CaffeNet + SVM has a very good recognition rate increment • CaffeNet + FT is the most effective • LeNet7 struggles to learn complex invariant features necessary for this problem
  • 25. EXPERIMENTS AND RESULTS – BigBrother Results
  • 26. EXPERIMENTS AND RESULTS – BigBrother Results • LeNet7 model performs slightly better than CaffeNet + SVM or CaffeNet + FT • VGG_Face + SVM and VGG_Face + FT have impressive performance on this problem • VGG_Face + SVM seems to be the best choice both for the accuracy and the stability
  • 27. EXPERIMENTS AND RESULTS – Dealing with Forgetting
  • 28. EXPERIMENTS AND RESULTS – Dealing with Forgetting • An adjustable learning rate is significantly more stable • A simple thresholding approach has been used. • We did not found any significant difference using a continuous approach
  • 29. OUTLINE 1. Introduction • CNNs and current limitations • Incremental learning: Why? 2. Incremental Learning Strategies for CNNs • Definitions • Possible instantiations used during the experimentations 3. Datasets • iCubWorld28 • BigBrother 4. Experiments and Results • Exp. design • Results analysis 5. Conclusions and Future Works
  • 30. CONCLUSIONS AND FUTURE WORKS • When possible (i.e., transfer learning from the same domain), it is preferable to use CNN as a fixed feature extractor to feed an incremental classifier • If the features are not optimized, the tuning of low level layers may be preferable and the learning strength can be used to control forgetting. • Training a CNN from scratch can be advantageous if the problem patterns (and feature invariances) are highly specific and a sufficient number of samples are available.
  • 31. CONCLUSIONS AND FUTURE WORKS In the near future we plan to extend this work by: • Performing a more extensive experimental evaluation • Finding a more principled way to control forgetting and adapting the tuning parameters to the size (and bias) of each incremental batch. • Studying real-world applications of semi-supervised incremental learning strategies for CNNs.
  • 32. COMPARING INCREMENTAL LEARNING STRATEGIES FOR CONVOLUTIONAL NEURAL NETWORKS Vincenzo Lomonaco & Davide Maltoni {vincenzo.lomonaco, davide.maltoni}@unibo.it Department of Computer Science and Engineering – DISI University of Bologna Thank you for your attention. Any Questions?