SlideShare a Scribd company logo
1 of 31
Learning without
forgetting
Introduction Neural network
Introduction
Neural network
Benefits are:
▫ To store information
▫ To retrieve lost
information
▫ To act when faced with
new situation
Introduction
Neural network Problem:
forgets the old task when new task is stored.
Introduction
Problem in
Applications
Image recognition
• new capabilities are to be added
• Assumes old data set is available
• Infeasible
• Vision system using CNN
• No old task data
• Train using new data
• Preserves old task
• A new method for Neural Nets
• To learn without forgetting
Introduction
CNN
Convolutional neural network
• Operates on volumes
• Convolution (filter):
• reduces to smaller size with specific information
• ReLu
• Pooling : reduces the spatial size to reduce computation and
parameters
• Maxpooling is to reduce to max parameter
Present Methods
Common approaches
Currently developed
Present methods Model
• Structure
• Parameter
• ᶿs : shared parameter
• ᶿo : old task parameter
• ᶿn : new task parameter
Present methods
Common
approaches
Feature extraction
• ᶿs & ᶿo is unchanged
• Layers used new task
Present methods
Common
approaches
Fine tuning
• ᶿs & ᶿn is optimized, ᶿo is fixed
• Low learning rate
• Can be duplicated
Present methods
Common
approaches
Joint training
• ᶿs , ᶿn & ᶿo are jointly optimised
• The tasks are interleaved
• Multi task learning
Present methods
Currently developed methods
 A-LTM (Active Long-Term Memory)
 Less forgetting Learning
 Cross-stitch Network
 WA-CNN
Present methods
Currently developed
methods
A-LTM:
• Identical to LwF
• Differ –
• weight decay regularization for training
• Warm-up step used after FT
• Dataset size
• Large loss
• Needs old task DS
Present methods
Currently developed
methods
Less Forgetting Learning:
• Similar
• Hinders change in
• Task specific decision boundary
• Shared representation
• Adds L2 loss :
• No change in ᶿs for new task
• ᶿo remains same
Present methods
Currently developed
methods
Cross-stitch Network:
• Works on MTL
• Introduces cross-stitch module
• Jointly learns:
• 2 same structure network blocks
• 2 pairs of weight -> same output(s)
• Outperforms joint training
• Needs old task DS
• Increases network size
Present methods
Currently developed
methods
WA-CNN:
• Expands the network (ᶿs)
• Improves new task performance
• Freezing ᶿo
• Maintains old-task
• Outperforms traditional fine-tuning
• But it increases network size faster than LwF
Learning
without
forgetting
The proposed method
• Uses only new task data to train
• Preserves the original capabilities
• Performs favourably :
• Feature extraction
• Fine tuning adaptation
• Similar to MTL that uses old DS
The proposed
method
18
• a Unified vision system
• The CNN has parameters:
• ᶿs : shared parameter
• ᶿo : old / specific task parameter
• The goal is to add ᶿn : new task parameter
• Learn parameters
• Works well on old & new task
• using only new & not old task DS
The proposed
method
19
• Advantage over common approaches :
• Classification performance
• Outperforms feature extraction & fine-tuning
• Computation efficiency
• Faster training & test time
• But slower than fine tuning
• Simplicity in deployment
• No need to retrain in adapting network
The proposed
method
20
Procedure
21
Phase I : Initialization
• The output is recorded (Yo) on old task for new data
• Response is a set of label probabilities
• A node for new class
• Weights initialised randomly
Procedure
22
Phase II : Training
• Train to minimize loss for all task
• Regularization using Stochastic gradient descent
• Two steps:
• Warm-up step
• Freeze ᶿo & ᶿs & trains ᶿn
• Joint optimization step
• Train all weights
Procedure
Phase II : Training
• Logistic loss
• Knowledge distillation loss
Procedure
Phase II : Training
• For calculating Knowledge distillation function
• Recorded & current probabilities
• T > 1 ; usually T = 2
• λo is a loss balance weight = 1
• Larger = greater old task performance
• Smaller = greater new task performance
Procedure
Experiment • Use
• Large dataset to train initial net
• Smaller dataset to add new task
• Old/Original task :
• ImageNet
• Contains 1000 object category
• more than 1000K training images
• Places365-standard
• Contains 365 scene classes
• ~1600K training images
• New task:
• PASCAL VOC (“VOC”) ~6K
• Caltech-UCSD Birds (“CUB”) ~6K
• MIT indoor scene (“Scenes”) ~6K
Experiment • Two scenario:
• Single new task scenario:
• On new task, LwF outperformed
• LFL, fine-tuning FC, feature extraction
& fine-tuning in most pair
• On old task, performs
• Better than fine-tuning
• Underperforms feature extraction,fine-
tuning FC & LFL
• Multiple new task scenario:
• LwF outperforms all except joint training
Experiment
Extension
of
LwF
• Network expansion
• Adds nodes to some layers
• Allows new-task-specific information to be stored
• Used along with LwF
• Performs better feature extraction
29
Limitations
30
• Can`t deal with change in domain
• All new-task data to be present before
computing their old task response
• Learning new task decreases old-task
recovery
The End
31
Learned something
without forgetting
Learned
something
without
forgetting

More Related Content

What's hot

Image net classification with deep convolutional neural networks
Image net classification with deep convolutional neural networksImage net classification with deep convolutional neural networks
Image net classification with deep convolutional neural networksheedaeKwon
 
2013 06-03 berlin buzzwords
2013 06-03 berlin buzzwords2013 06-03 berlin buzzwords
2013 06-03 berlin buzzwordsNitay Joffe
 
2013.09.10 Giraph at London Hadoop Users Group
2013.09.10 Giraph at London Hadoop Users Group2013.09.10 Giraph at London Hadoop Users Group
2013.09.10 Giraph at London Hadoop Users GroupNitay Joffe
 
Generalized Linear Models in Spark MLlib and SparkR
Generalized Linear Models in Spark MLlib and SparkRGeneralized Linear Models in Spark MLlib and SparkR
Generalized Linear Models in Spark MLlib and SparkRDatabricks
 
Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresCloudLightning
 
C-Cube: Elastic Continuous Clustering in the Cloud
C-Cube: Elastic Continuous Clustering in the CloudC-Cube: Elastic Continuous Clustering in the Cloud
C-Cube: Elastic Continuous Clustering in the CloudQian Lin
 
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...Spark Summit
 
Data Mining: Implementation of Data Mining Techniques using RapidMiner software
Data Mining: Implementation of Data Mining Techniques using RapidMiner softwareData Mining: Implementation of Data Mining Techniques using RapidMiner software
Data Mining: Implementation of Data Mining Techniques using RapidMiner softwareMohammed Kharma
 
"Эффективность и оптимизация кода в Java 8" Сергей Моренец
"Эффективность и оптимизация кода в Java 8" Сергей Моренец"Эффективность и оптимизация кода в Java 8" Сергей Моренец
"Эффективность и оптимизация кода в Java 8" Сергей МоренецFwdays
 
Sebastian Schelter – Distributed Machine Learing with the Samsara DSL
Sebastian Schelter – Distributed Machine Learing with the Samsara DSLSebastian Schelter – Distributed Machine Learing with the Samsara DSL
Sebastian Schelter – Distributed Machine Learing with the Samsara DSLFlink Forward
 
Thinking Functionally with Clojure
Thinking Functionally with ClojureThinking Functionally with Clojure
Thinking Functionally with ClojureJohn Stevenson
 
Spark Summit EU talk by Herman van Hovell
Spark Summit EU talk by Herman van HovellSpark Summit EU talk by Herman van Hovell
Spark Summit EU talk by Herman van HovellSpark Summit
 
KnittingBoar Toronto Hadoop User Group Nov 27 2012
KnittingBoar Toronto Hadoop User Group Nov 27 2012KnittingBoar Toronto Hadoop User Group Nov 27 2012
KnittingBoar Toronto Hadoop User Group Nov 27 2012Adam Muise
 
Sql saturday azure storage by Anton Vidishchev
Sql saturday azure storage by Anton VidishchevSql saturday azure storage by Anton Vidishchev
Sql saturday azure storage by Anton VidishchevAlex Tumanoff
 
Optimizing Terascale Machine Learning Pipelines with Keystone ML
Optimizing Terascale Machine Learning Pipelines with Keystone MLOptimizing Terascale Machine Learning Pipelines with Keystone ML
Optimizing Terascale Machine Learning Pipelines with Keystone MLSpark Summit
 
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...DB Tsai
 
Ufuc Celebi – Stream & Batch Processing in one System
Ufuc Celebi – Stream & Batch Processing in one SystemUfuc Celebi – Stream & Batch Processing in one System
Ufuc Celebi – Stream & Batch Processing in one SystemFlink Forward
 
Terascale Learning
Terascale LearningTerascale Learning
Terascale Learningpauldix
 

What's hot (19)

Image net classification with deep convolutional neural networks
Image net classification with deep convolutional neural networksImage net classification with deep convolutional neural networks
Image net classification with deep convolutional neural networks
 
2013 06-03 berlin buzzwords
2013 06-03 berlin buzzwords2013 06-03 berlin buzzwords
2013 06-03 berlin buzzwords
 
2013.09.10 Giraph at London Hadoop Users Group
2013.09.10 Giraph at London Hadoop Users Group2013.09.10 Giraph at London Hadoop Users Group
2013.09.10 Giraph at London Hadoop Users Group
 
Generalized Linear Models in Spark MLlib and SparkR
Generalized Linear Models in Spark MLlib and SparkRGeneralized Linear Models in Spark MLlib and SparkR
Generalized Linear Models in Spark MLlib and SparkR
 
Comparison between Cloud Mirror, Mesos Cluster, and Google Omega
Comparison between Cloud Mirror, Mesos Cluster, and Google OmegaComparison between Cloud Mirror, Mesos Cluster, and Google Omega
Comparison between Cloud Mirror, Mesos Cluster, and Google Omega
 
Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud Infrastructures
 
C-Cube: Elastic Continuous Clustering in the Cloud
C-Cube: Elastic Continuous Clustering in the CloudC-Cube: Elastic Continuous Clustering in the Cloud
C-Cube: Elastic Continuous Clustering in the Cloud
 
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
 
Data Mining: Implementation of Data Mining Techniques using RapidMiner software
Data Mining: Implementation of Data Mining Techniques using RapidMiner softwareData Mining: Implementation of Data Mining Techniques using RapidMiner software
Data Mining: Implementation of Data Mining Techniques using RapidMiner software
 
"Эффективность и оптимизация кода в Java 8" Сергей Моренец
"Эффективность и оптимизация кода в Java 8" Сергей Моренец"Эффективность и оптимизация кода в Java 8" Сергей Моренец
"Эффективность и оптимизация кода в Java 8" Сергей Моренец
 
Sebastian Schelter – Distributed Machine Learing with the Samsara DSL
Sebastian Schelter – Distributed Machine Learing with the Samsara DSLSebastian Schelter – Distributed Machine Learing with the Samsara DSL
Sebastian Schelter – Distributed Machine Learing with the Samsara DSL
 
Thinking Functionally with Clojure
Thinking Functionally with ClojureThinking Functionally with Clojure
Thinking Functionally with Clojure
 
Spark Summit EU talk by Herman van Hovell
Spark Summit EU talk by Herman van HovellSpark Summit EU talk by Herman van Hovell
Spark Summit EU talk by Herman van Hovell
 
KnittingBoar Toronto Hadoop User Group Nov 27 2012
KnittingBoar Toronto Hadoop User Group Nov 27 2012KnittingBoar Toronto Hadoop User Group Nov 27 2012
KnittingBoar Toronto Hadoop User Group Nov 27 2012
 
Sql saturday azure storage by Anton Vidishchev
Sql saturday azure storage by Anton VidishchevSql saturday azure storage by Anton Vidishchev
Sql saturday azure storage by Anton Vidishchev
 
Optimizing Terascale Machine Learning Pipelines with Keystone ML
Optimizing Terascale Machine Learning Pipelines with Keystone MLOptimizing Terascale Machine Learning Pipelines with Keystone ML
Optimizing Terascale Machine Learning Pipelines with Keystone ML
 
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
 
Ufuc Celebi – Stream & Batch Processing in one System
Ufuc Celebi – Stream & Batch Processing in one SystemUfuc Celebi – Stream & Batch Processing in one System
Ufuc Celebi – Stream & Batch Processing in one System
 
Terascale Learning
Terascale LearningTerascale Learning
Terascale Learning
 

Similar to Neural network learning ability

An Introduction to Deep Learning
An Introduction to Deep LearningAn Introduction to Deep Learning
An Introduction to Deep Learningmilad abbasi
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep LearningMehrnaz Faraz
 
Week 12 Dimensionality Reduction Bagian 1
Week 12 Dimensionality Reduction Bagian 1Week 12 Dimensionality Reduction Bagian 1
Week 12 Dimensionality Reduction Bagian 1khairulhuda242
 
ResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksSeunghyun Hwang
 
Neural network pruning with residual connections and limited-data review [cdm]
Neural network pruning with residual connections and limited-data review [cdm]Neural network pruning with residual connections and limited-data review [cdm]
Neural network pruning with residual connections and limited-data review [cdm]Dongmin Choi
 
Expanding HPCC Systems Deep Neural Network Capabilities
Expanding HPCC Systems Deep Neural Network CapabilitiesExpanding HPCC Systems Deep Neural Network Capabilities
Expanding HPCC Systems Deep Neural Network CapabilitiesHPCC Systems
 
Parallel Distributed Deep Learning on HPCC Systems
Parallel Distributed Deep Learning on HPCC SystemsParallel Distributed Deep Learning on HPCC Systems
Parallel Distributed Deep Learning on HPCC SystemsHPCC Systems
 
SOA with PHP and Symfony
SOA with PHP and SymfonySOA with PHP and Symfony
SOA with PHP and SymfonyMichalSchroeder
 
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digitsNVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digitsNVIDIA Taiwan
 
Deep learning summary
Deep learning summaryDeep learning summary
Deep learning summaryankit_ppt
 
Regularization in deep learning
Regularization in deep learningRegularization in deep learning
Regularization in deep learningKien Le
 
Application of machine learning and cognitive computing in intrusion detectio...
Application of machine learning and cognitive computing in intrusion detectio...Application of machine learning and cognitive computing in intrusion detectio...
Application of machine learning and cognitive computing in intrusion detectio...Mahdi Hosseini Moghaddam
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)DonghyunKang12
 
ICML 2017 Meta network
ICML 2017 Meta networkICML 2017 Meta network
ICML 2017 Meta networkKaty Lee
 
Deep_Learning_Frameworks_CNTK_PyTorch
Deep_Learning_Frameworks_CNTK_PyTorchDeep_Learning_Frameworks_CNTK_PyTorch
Deep_Learning_Frameworks_CNTK_PyTorchSubhashis Hazarika
 
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementWebinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementMongoDB
 

Similar to Neural network learning ability (20)

An Introduction to Deep Learning
An Introduction to Deep LearningAn Introduction to Deep Learning
An Introduction to Deep Learning
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep Learning
 
Deeplearning
Deeplearning Deeplearning
Deeplearning
 
Week 12 Dimensionality Reduction Bagian 1
Week 12 Dimensionality Reduction Bagian 1Week 12 Dimensionality Reduction Bagian 1
Week 12 Dimensionality Reduction Bagian 1
 
ResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksResNeSt: Split-Attention Networks
ResNeSt: Split-Attention Networks
 
Network recasting
Network recastingNetwork recasting
Network recasting
 
Neural network pruning with residual connections and limited-data review [cdm]
Neural network pruning with residual connections and limited-data review [cdm]Neural network pruning with residual connections and limited-data review [cdm]
Neural network pruning with residual connections and limited-data review [cdm]
 
Expanding HPCC Systems Deep Neural Network Capabilities
Expanding HPCC Systems Deep Neural Network CapabilitiesExpanding HPCC Systems Deep Neural Network Capabilities
Expanding HPCC Systems Deep Neural Network Capabilities
 
Parallel Distributed Deep Learning on HPCC Systems
Parallel Distributed Deep Learning on HPCC SystemsParallel Distributed Deep Learning on HPCC Systems
Parallel Distributed Deep Learning on HPCC Systems
 
SOA with PHP and Symfony
SOA with PHP and SymfonySOA with PHP and Symfony
SOA with PHP and Symfony
 
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digitsNVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
 
Deep learning summary
Deep learning summaryDeep learning summary
Deep learning summary
 
Neural Network Part-2
Neural Network Part-2Neural Network Part-2
Neural Network Part-2
 
Regularization in deep learning
Regularization in deep learningRegularization in deep learning
Regularization in deep learning
 
Application of machine learning and cognitive computing in intrusion detectio...
Application of machine learning and cognitive computing in intrusion detectio...Application of machine learning and cognitive computing in intrusion detectio...
Application of machine learning and cognitive computing in intrusion detectio...
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)
 
ICML 2017 Meta network
ICML 2017 Meta networkICML 2017 Meta network
ICML 2017 Meta network
 
OOW-IMC-final
OOW-IMC-finalOOW-IMC-final
OOW-IMC-final
 
Deep_Learning_Frameworks_CNTK_PyTorch
Deep_Learning_Frameworks_CNTK_PyTorchDeep_Learning_Frameworks_CNTK_PyTorch
Deep_Learning_Frameworks_CNTK_PyTorch
 
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementWebinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
 

Recently uploaded

HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Intelisync
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 

Recently uploaded (20)

HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 

Neural network learning ability

  • 3. Introduction Neural network Benefits are: ▫ To store information ▫ To retrieve lost information ▫ To act when faced with new situation
  • 4. Introduction Neural network Problem: forgets the old task when new task is stored.
  • 5. Introduction Problem in Applications Image recognition • new capabilities are to be added • Assumes old data set is available • Infeasible • Vision system using CNN • No old task data • Train using new data • Preserves old task • A new method for Neural Nets • To learn without forgetting
  • 6. Introduction CNN Convolutional neural network • Operates on volumes • Convolution (filter): • reduces to smaller size with specific information • ReLu • Pooling : reduces the spatial size to reduce computation and parameters • Maxpooling is to reduce to max parameter
  • 8. Present methods Model • Structure • Parameter • ᶿs : shared parameter • ᶿo : old task parameter • ᶿn : new task parameter
  • 9. Present methods Common approaches Feature extraction • ᶿs & ᶿo is unchanged • Layers used new task
  • 10. Present methods Common approaches Fine tuning • ᶿs & ᶿn is optimized, ᶿo is fixed • Low learning rate • Can be duplicated
  • 11. Present methods Common approaches Joint training • ᶿs , ᶿn & ᶿo are jointly optimised • The tasks are interleaved • Multi task learning
  • 12. Present methods Currently developed methods  A-LTM (Active Long-Term Memory)  Less forgetting Learning  Cross-stitch Network  WA-CNN
  • 13. Present methods Currently developed methods A-LTM: • Identical to LwF • Differ – • weight decay regularization for training • Warm-up step used after FT • Dataset size • Large loss • Needs old task DS
  • 14. Present methods Currently developed methods Less Forgetting Learning: • Similar • Hinders change in • Task specific decision boundary • Shared representation • Adds L2 loss : • No change in ᶿs for new task • ᶿo remains same
  • 15. Present methods Currently developed methods Cross-stitch Network: • Works on MTL • Introduces cross-stitch module • Jointly learns: • 2 same structure network blocks • 2 pairs of weight -> same output(s) • Outperforms joint training • Needs old task DS • Increases network size
  • 16. Present methods Currently developed methods WA-CNN: • Expands the network (ᶿs) • Improves new task performance • Freezing ᶿo • Maintains old-task • Outperforms traditional fine-tuning • But it increases network size faster than LwF
  • 18. • Uses only new task data to train • Preserves the original capabilities • Performs favourably : • Feature extraction • Fine tuning adaptation • Similar to MTL that uses old DS The proposed method 18
  • 19. • a Unified vision system • The CNN has parameters: • ᶿs : shared parameter • ᶿo : old / specific task parameter • The goal is to add ᶿn : new task parameter • Learn parameters • Works well on old & new task • using only new & not old task DS The proposed method 19
  • 20. • Advantage over common approaches : • Classification performance • Outperforms feature extraction & fine-tuning • Computation efficiency • Faster training & test time • But slower than fine tuning • Simplicity in deployment • No need to retrain in adapting network The proposed method 20
  • 22. Phase I : Initialization • The output is recorded (Yo) on old task for new data • Response is a set of label probabilities • A node for new class • Weights initialised randomly Procedure 22
  • 23. Phase II : Training • Train to minimize loss for all task • Regularization using Stochastic gradient descent • Two steps: • Warm-up step • Freeze ᶿo & ᶿs & trains ᶿn • Joint optimization step • Train all weights Procedure
  • 24. Phase II : Training • Logistic loss • Knowledge distillation loss Procedure
  • 25. Phase II : Training • For calculating Knowledge distillation function • Recorded & current probabilities • T > 1 ; usually T = 2 • λo is a loss balance weight = 1 • Larger = greater old task performance • Smaller = greater new task performance Procedure
  • 26. Experiment • Use • Large dataset to train initial net • Smaller dataset to add new task • Old/Original task : • ImageNet • Contains 1000 object category • more than 1000K training images • Places365-standard • Contains 365 scene classes • ~1600K training images • New task: • PASCAL VOC (“VOC”) ~6K • Caltech-UCSD Birds (“CUB”) ~6K • MIT indoor scene (“Scenes”) ~6K
  • 27. Experiment • Two scenario: • Single new task scenario: • On new task, LwF outperformed • LFL, fine-tuning FC, feature extraction & fine-tuning in most pair • On old task, performs • Better than fine-tuning • Underperforms feature extraction,fine- tuning FC & LFL • Multiple new task scenario: • LwF outperforms all except joint training
  • 29. Extension of LwF • Network expansion • Adds nodes to some layers • Allows new-task-specific information to be stored • Used along with LwF • Performs better feature extraction 29
  • 30. Limitations 30 • Can`t deal with change in domain • All new-task data to be present before computing their old task response • Learning new task decreases old-task recovery
  • 31. The End 31 Learned something without forgetting Learned something without forgetting

Editor's Notes

  1. We learn new things when new neurons are created for it in our brain Better the plasticity , better we remember As time goes, without refreshing these things tends to fade away This is same in case of ANN
  2. need to do : Explain about the neural network,The basics
  3. Notes: if I include lemur identification to the net , it forgets the dog functionality
  4. Forgets the old task Explain with an example of forgetting with basic neural network Need to add : forgetting example
  5. One of the well known victim of forgetting is in Image recognition Explain paper: slide 1 Introduction slide 1
  6. Explain Internet: Convolutional neural network :
  7. To tackle the problem of catastrophic forgetting
  8. Explain paper: model
  9. Explain paper: Feature extraction explain , disadvantages
  10. Explain paper: Fine tuning explain , disadvantages
  11. Explain paper: Joint training explain , disadvantages
  12. Explain paper: Concurrently developed method explain , disadvantages
  13. Explain paper: A-LTM explain , disadvantages
  14. Explain paper: Less forgetting learning, compare it with LwF
  15. Explain paper: Cross-stitch network Web: Cross-stitch network Cross-stitch module
  16. Explain paper: Cross-stitch network Web: Cross-stitch network Cross-stitch module