SlideShare a Scribd company logo
1 of 41
Download to read offline
Affiliated with the Novartis Institutes for BioMedical Research
Affiliated Institute of the University of Basel
Information processing with artificial spiking neural networks
Friedemann Zenke
Computational Neuroscience @ FMI
www.zenkelab.org
.ch www.zenkelab.org
Some ways to improve efficiency of deep nets for AI
●
Improve models (inductive biases, architecture)
●
Continual learning
●
Unsupervised learning (data efficiency)
●
Reduce data transfer and communications
(avoid von Neumann bottleneck)
●
Use of analog and mixed signal circuitry
(in-memory computing, analog neurons/synapses)
●
Sparsity of activity and connections
(e.g., event-based processing, network pruning)
.ch www.zenkelab.org
Today’s focus is on spiking neural networks
Training spiking neural networks
●
Network translation
Usually makes rate-coding assumption and results in high activity levels.
●
Direct training (surrogate gradients)
Very general. No coding assumptions. Truly end-to-end.
.ch www.zenkelab.org
Today’s focus is on spiking neural networks
●
Part 1: Brief intro on surrogate gradients
●
Part 2: Self-calibrating analog neuromorphic hardware
●
Part 3: Can we do without backprop through time?
What’s the optimal initialization for spiking nets?
.ch www.zenkelab.org
Task-optimizing spiking neural networks end-to-end
Sensory inputs
“Sophisticated function”
Intelligent
behavior
.ch www.zenkelab.org
Task-optimizing spiking neural networks end-to-end
“Sophisticated function”
1) Input data
Intelligent
behavior
.ch www.zenkelab.org
Task-optimizing spiking neural networks end-to-end
“Sophisticated function”
1) Input data 2) Desired output
.ch www.zenkelab.org
Task-optimizing spiking neural networks end-to-end
“Sophisticated function”
1) Input data 2) Desired output
3) Find connectivity (learning)
.ch www.zenkelab.org
Input: Spatiotemporal spike trains
Example for this talk
“Randman” Synthetic spike times from smooth random manifolds.
Binary discrimination task
Zenke and Vogels (2021)
.ch www.zenkelab.org
Output: Linear combination of filtered output
spike trains
Activation
Time (s)
Readout
.ch www.zenkelab.org
Output: Linear combination of filtered output
spike trains
Activation
Time (s)
Readout
Max over time idea from Tempotron
Gütig & Sompolinsky (2006); Gütig (2016)
.ch www.zenkelab.org
Learning: Training spiking networks end-to-end
●
Spiking neurons & networks are RNNs
●
They have implicit and explicit recurrence
●
Known training procedures for networks
with hidden units
●
Backpropagation-through time (BPTT)
●
Real-time recurrent learning (RTRL)
Forward Euler integration
implicit
explicit
Neuronal dynamics
Activity going through
synaptic connections
Neftci, Mostafa, & Zenke (2019)
.ch www.zenkelab.org
Activity snapshots (network has not learned anything)
Synthetic data set: 2000 samples from two smooth random manifolds
Example: A two-fold classification problem
.ch www.zenkelab.org
Example: A two-fold classification problem
Evolution of loss during gradient descent
Trials
.ch www.zenkelab.org
Learning: Training spiking networks end-to-end
●
Spiking neurons & networks are RNNs
●
They have implicit and explicit recurrence
●
Known training procedures for networks
with hidden units
●
Backpropagation-through time (BPTT)
●
Real-time recurrent learning (RTRL)
Problem
Forward Euler integration
Neftci, Mostafa, & Zenke (2019)
.ch www.zenkelab.org
Problem: The derivative of a spike train
is zero almost everywhere
??
almost everywhere
Mem. Parameter
Loss
Neftci, Mostafa, & Zenke (2019)
.ch www.zenkelab.org
??
Mem. Parameter
Loss
Option 1 (“classic”): Noise injection → gradient in expectation values.
e.g.: Pfister, Toyoizumi, Barber & Gerstner (2006), Gardner, Sporea & Grüning (2015)
Option 2: Make spikes differentiable.
Huh & Sejnowski (2018)
Option 3: “Know hidden layer targets”
Gilra & Gerstner (2017), Nicola & Clopath (2017)
Option 4: Use surrogate gradients. Bohte (2011),
Bellec, Salaj, Subramoney, Legenstein, and Maass (2018)
Shrestha & Orchard (2018), Zenke & Ganguli (2018), ...
In ML: “Straight-through estimators” Bengio et al. (2013)
Neftci, Mostafa, & Zenke (2019)
.ch www.zenkelab.org
Example: A two-fold classification problem
Evolution of loss during surrogate gradient descent
.ch www.zenkelab.org
Example: A two-fold classification problem
Activity snapshots (trained network)
.ch www.zenkelab.org
Example: Solving latency coded MNIST
Zenke & Vogels (2021)
.ch www.zenkelab.org
Surrogate gradients: A tool for task-optimizing spiking neural networks
100ms
Dataset: Spiking Heidelberg Digits (SHD) – Cramer, Stradmann, Schemmel, and Zenke (2020)
Available at www.compneuro.net
Input data
Linear decoder at output
~95% classification accuracy
Objective
Classify words
Area
1
Area
2
Area
3
Channel
/
Neurons
Exc
Inh
.ch www.zenkelab.org
Part 2
Neuromorphic engineers build hardware that seeks to
emulate neural networks instead of simulating them.
.ch www.zenkelab.org
Can analog neuromorphic substrates self-calibrate through
learning and thereby overcome the problem of device mismatch?
BrainScales Neuromorphic System
Karlheinz Meier
Sebastian
Billaudelle
Benjamin
Cramer
Johannes Schemmel
Uni Heidelberg
.ch www.zenkelab.org
Problem: Device mismatch – A stumbling block for
widespread use of analog neuromorphic hardware
Accuracy
100%
Software
simulation
Weight
transfer
→ costly calibration
.ch www.zenkelab.org
To study this question we used the BrainScaleS-2
analog neuromorphic hardware system
Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y.,
Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
Sebastian
Billaudelle
Benjamin
Cramer
.ch www.zenkelab.org
E-phys on analog neuromorphic hardware
Sebastian
Billaudelle
Benjamin
Cramer
.ch www.zenkelab.org
In-the-loop surrogate gradient training
Forward-pass on chip and backward pass in software
2) Measure on-chip
analog voltage traces
3) Inject voltage into computational graph
4) Compute surrogate gradients → update weights
1) Forward pass on chip
(in-the-loop training)
Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y.,
Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
.ch www.zenkelab.org
Functional spiking neural networks trained on accelerated
analog neuromorphic hardware
>80k
images/sec
@ 200mW
Sebastian
Billaudelle
Benjamin
Cramer
Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y.,
Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
.ch www.zenkelab.org
.ch www.zenkelab.org
Surrogate gradient learning self-calibrates the analog
neuromorphic substrate
Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y.,
Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
.ch www.zenkelab.org
Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y.,
Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
.ch www.zenkelab.org
Heidelberg spiking digits: Speech recognition
Sebastian
Billaudelle
Benjamin
Cramer
Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y.,
Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
.ch www.zenkelab.org
Part 3
Can we avoid back-propagation through time?
How should we initialize spiking nets?
.ch www.zenkelab.org
Ignoring explicit recurrence in gradient computation yields
plausible three-factor online learning rules*
(* we still have to worry about spatial credit assignment)
Backpropagation-through time
Real-time recurrent learning (Williams and Zipser; 1989)
Approximate RTRL
●
Sparse
Jacobians →
efficient comp.
●
Local rules
Great review:
Marschall, O., Cho, K., and Savin, C. (2019)
Zenke and Neftci (2021)
.ch www.zenkelab.org
Ignoring explicit recurrence in gradient computation
yields plausible three-factor online learning rules*
(* we still have to worry about spatial credit assignment)
●
Hebbian part
●
Voltage-based nonlinearity
●
Eligibility trace (“Ca transient”)
●
Spatial credit assignment (feedback alignment, burstprop, etc...)
Zenke and Ganguli (2018). Neural Computation
Zenke and Neftci (2021). Proceedings of the IEEE
Hebbian part
Presynaptic filter – e.g., presence
of neurotransmitter at the PSD
All recent online learning rules use this trick
i.e., SuperSpike, e-Prop, RFLO, OSTL, …
.ch www.zenkelab.org
End-to-end online learning
Zenke and Ganguli (2018). Neural Computation
Spiking network activity during training
.ch www.zenkelab.org
Approximate learning rules still take advantage of
recurrence and do almost as well as full BPTT
Zenke and Neftci (2021)
No recurrent connections
Recurrent (full BPTT)
Recurrent (ignoring explicit recurrence in gradient)
Speech processing problems (require memory)
.ch www.zenkelab.org
Part 3
Can we avoid back-propagation through time?
How should we initialize spiking nets?
.ch www.zenkelab.org
Fluctuation-driven firing as observed in the brain is an
ideal starting point for learning in spiking neural nets
Julian Rossbroich Julia Gygax
Based on balanced net theory by
van Vreeswijk, Sompolinski, Brunel, ...
Unpublished
Spiking Conv Nets
Firing rate homeostasis
.ch www.zenkelab.org
Conclusion and summary
●
Surrogate gradients allow flexible end-to-end training of
spiking neural networks
●
Voltage-dependent plasticity adds self-calibration
capabilities to analog neuromorphic substrates
●
Yes, we can avoid back-propagation through time!
Approximate RTRL→ three-factor learning rules
.ch www.zenkelab.org
Thanks!
Manu
●
Nikolaos Papanikolaou
●
Yue Kris Wu
●
Matthias Depoortere
●
Tianlin Liu
Benjamin
Artwork: kyadava.net
Julian
Sebastian
With Karlheinz Meier † &
Johannes Schemmel
Collaborators
Johannes Schemmel
Uni Heidelberg
Julia
Manvi
Axel
Alumni
Hiring post-docs and PhD students!
Peter
Emre Neftci
FZ Julich

More Related Content

Similar to Information processing with artificial spiking neural networks

Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)SungminYou
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspectiveAnirban Santara
 
Deep learning and image analytics using Python by Dr Sanparit
Deep learning and image analytics using Python by Dr SanparitDeep learning and image analytics using Python by Dr Sanparit
Deep learning and image analytics using Python by Dr SanparitBAINIDA
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network Yan Xu
 
Artificial Neural Network Implementation on FPGA – a Modular Approach
Artificial Neural Network Implementation on FPGA – a Modular ApproachArtificial Neural Network Implementation on FPGA – a Modular Approach
Artificial Neural Network Implementation on FPGA – a Modular ApproachRoee Levy
 
Feasibility of moment tensor inversion for a single-well microseismic data us...
Feasibility of moment tensor inversion for a single-well microseismic data us...Feasibility of moment tensor inversion for a single-well microseismic data us...
Feasibility of moment tensor inversion for a single-well microseismic data us...Oleg Ovcharenko
 
Artificial Neural Network Seminar Report
Artificial Neural Network Seminar ReportArtificial Neural Network Seminar Report
Artificial Neural Network Seminar ReportTodd Turner
 
  Brain-Inspired Computation based on Spiking Neural Networks ...
 Brain-Inspired Computation based on Spiking Neural Networks               ... Brain-Inspired Computation based on Spiking Neural Networks               ...
  Brain-Inspired Computation based on Spiking Neural Networks ...Jorge Pires
 
Details of Lazy Deep Learning for Images Recognition in ZZ Photo app
Details of Lazy Deep Learning for Images Recognition in ZZ Photo appDetails of Lazy Deep Learning for Images Recognition in ZZ Photo app
Details of Lazy Deep Learning for Images Recognition in ZZ Photo appPAY2 YOU
 
Neural Networks in Data Mining - “An Overview”
Neural Networks  in Data Mining -   “An Overview”Neural Networks  in Data Mining -   “An Overview”
Neural Networks in Data Mining - “An Overview”Dr.(Mrs).Gethsiyal Augasta
 
Accelerating stochastic gradient descent using adaptive mini batch size3
Accelerating stochastic gradient descent using adaptive mini batch size3Accelerating stochastic gradient descent using adaptive mini batch size3
Accelerating stochastic gradient descent using adaptive mini batch size3muayyad alsadi
 
Electricity price forecasting with Recurrent Neural Networks
Electricity price forecasting with Recurrent Neural NetworksElectricity price forecasting with Recurrent Neural Networks
Electricity price forecasting with Recurrent Neural NetworksTaegyun Jeon
 
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...Hiroki Nakahara
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101Felipe Prado
 
Web spam classification using supervised artificial neural network algorithms
Web spam classification using supervised artificial neural network algorithmsWeb spam classification using supervised artificial neural network algorithms
Web spam classification using supervised artificial neural network algorithmsaciijournal
 
artificial neural network
artificial neural networkartificial neural network
artificial neural networkPallavi Yadav
 
Artificial Neural Network for hand Gesture recognition
Artificial Neural Network for hand Gesture recognitionArtificial Neural Network for hand Gesture recognition
Artificial Neural Network for hand Gesture recognitionVigneshwer Dhinakaran
 
Deep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryDeep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryKenta Oono
 

Similar to Information processing with artificial spiking neural networks (20)

Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspective
 
Deep learning and image analytics using Python by Dr Sanparit
Deep learning and image analytics using Python by Dr SanparitDeep learning and image analytics using Python by Dr Sanparit
Deep learning and image analytics using Python by Dr Sanparit
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network
 
Artificial Neural Network Implementation on FPGA – a Modular Approach
Artificial Neural Network Implementation on FPGA – a Modular ApproachArtificial Neural Network Implementation on FPGA – a Modular Approach
Artificial Neural Network Implementation on FPGA – a Modular Approach
 
Feasibility of moment tensor inversion for a single-well microseismic data us...
Feasibility of moment tensor inversion for a single-well microseismic data us...Feasibility of moment tensor inversion for a single-well microseismic data us...
Feasibility of moment tensor inversion for a single-well microseismic data us...
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Artificial Neural Network Seminar Report
Artificial Neural Network Seminar ReportArtificial Neural Network Seminar Report
Artificial Neural Network Seminar Report
 
  Brain-Inspired Computation based on Spiking Neural Networks ...
 Brain-Inspired Computation based on Spiking Neural Networks               ... Brain-Inspired Computation based on Spiking Neural Networks               ...
  Brain-Inspired Computation based on Spiking Neural Networks ...
 
Details of Lazy Deep Learning for Images Recognition in ZZ Photo app
Details of Lazy Deep Learning for Images Recognition in ZZ Photo appDetails of Lazy Deep Learning for Images Recognition in ZZ Photo app
Details of Lazy Deep Learning for Images Recognition in ZZ Photo app
 
Neural Networks in Data Mining - “An Overview”
Neural Networks  in Data Mining -   “An Overview”Neural Networks  in Data Mining -   “An Overview”
Neural Networks in Data Mining - “An Overview”
 
Accelerating stochastic gradient descent using adaptive mini batch size3
Accelerating stochastic gradient descent using adaptive mini batch size3Accelerating stochastic gradient descent using adaptive mini batch size3
Accelerating stochastic gradient descent using adaptive mini batch size3
 
Electricity price forecasting with Recurrent Neural Networks
Electricity price forecasting with Recurrent Neural NetworksElectricity price forecasting with Recurrent Neural Networks
Electricity price forecasting with Recurrent Neural Networks
 
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
 
Web spam classification using supervised artificial neural network algorithms
Web spam classification using supervised artificial neural network algorithmsWeb spam classification using supervised artificial neural network algorithms
Web spam classification using supervised artificial neural network algorithms
 
artificial neural network
artificial neural networkartificial neural network
artificial neural network
 
Artificial Neural Network for hand Gesture recognition
Artificial Neural Network for hand Gesture recognitionArtificial Neural Network for hand Gesture recognition
Artificial Neural Network for hand Gesture recognition
 
Deep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryDeep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistry
 
Defense_thesis
Defense_thesisDefense_thesis
Defense_thesis
 

More from Advanced-Concepts-Team

2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf
2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf
2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdfAdvanced-Concepts-Team
 
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonics
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonicsIsabelle Diacaire - From Ariadnas to Industry R&D in optics and photonics
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonicsAdvanced-Concepts-Team
 
The ExoGRAVITY project - observations of exoplanets from the ground with opti...
The ExoGRAVITY project - observations of exoplanets from the ground with opti...The ExoGRAVITY project - observations of exoplanets from the ground with opti...
The ExoGRAVITY project - observations of exoplanets from the ground with opti...Advanced-Concepts-Team
 
Pablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Pablo Gomez - Solving Large-scale Challenges with ESA DatalabsPablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Pablo Gomez - Solving Large-scale Challenges with ESA DatalabsAdvanced-Concepts-Team
 
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...Advanced-Concepts-Team
 
Towards an Artificial Muse for new Ideas in Quantum Physics
Towards an Artificial Muse for new Ideas in Quantum PhysicsTowards an Artificial Muse for new Ideas in Quantum Physics
Towards an Artificial Muse for new Ideas in Quantum PhysicsAdvanced-Concepts-Team
 
EDEN ISS - A space greenhouse analogue in Antarctica
EDEN ISS - A space greenhouse analogue in AntarcticaEDEN ISS - A space greenhouse analogue in Antarctica
EDEN ISS - A space greenhouse analogue in AntarcticaAdvanced-Concepts-Team
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningAdvanced-Concepts-Team
 
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...Advanced-Concepts-Team
 
HORUS: Peering into Lunar Shadowed Regions with AI
HORUS: Peering into Lunar Shadowed Regions with AIHORUS: Peering into Lunar Shadowed Regions with AI
HORUS: Peering into Lunar Shadowed Regions with AIAdvanced-Concepts-Team
 
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...Advanced-Concepts-Team
 
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and Technologies
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and TechnologiesThe Large Interferometer For Exoplanets (LIFE) II: Key Methods and Technologies
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and TechnologiesAdvanced-Concepts-Team
 
In vitro simulation of spaceflight environment to elucidate combined effect o...
In vitro simulation of spaceflight environment to elucidate combined effect o...In vitro simulation of spaceflight environment to elucidate combined effect o...
In vitro simulation of spaceflight environment to elucidate combined effect o...Advanced-Concepts-Team
 
The Large Interferometer For Exoplanets (LIFE): the science of characterising...
The Large Interferometer For Exoplanets (LIFE): the science of characterising...The Large Interferometer For Exoplanets (LIFE): the science of characterising...
The Large Interferometer For Exoplanets (LIFE): the science of characterising...Advanced-Concepts-Team
 
Vernal pools a new ecosystem for astrobiology studies
Vernal pools a new ecosystem for astrobiology studiesVernal pools a new ecosystem for astrobiology studies
Vernal pools a new ecosystem for astrobiology studiesAdvanced-Concepts-Team
 
Keeping a Sentinel Eye on the Volcanoes – from Space!
Keeping a Sentinel Eye on the Volcanoes – from Space!Keeping a Sentinel Eye on the Volcanoes – from Space!
Keeping a Sentinel Eye on the Volcanoes – from Space!Advanced-Concepts-Team
 
AI4Space –Artificial Intelligence at ISTA - Hülsmann & Haser
AI4Space –Artificial Intelligence at ISTA - Hülsmann & HaserAI4Space –Artificial Intelligence at ISTA - Hülsmann & Haser
AI4Space –Artificial Intelligence at ISTA - Hülsmann & HaserAdvanced-Concepts-Team
 

More from Advanced-Concepts-Team (20)

2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf
2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf
2024.03.22 - Mike Heddes - Introduction to Hyperdimensional Computing.pdf
 
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonics
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonicsIsabelle Diacaire - From Ariadnas to Industry R&D in optics and photonics
Isabelle Diacaire - From Ariadnas to Industry R&D in optics and photonics
 
The ExoGRAVITY project - observations of exoplanets from the ground with opti...
The ExoGRAVITY project - observations of exoplanets from the ground with opti...The ExoGRAVITY project - observations of exoplanets from the ground with opti...
The ExoGRAVITY project - observations of exoplanets from the ground with opti...
 
MOND_famaey.pdf
MOND_famaey.pdfMOND_famaey.pdf
MOND_famaey.pdf
 
Pablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Pablo Gomez - Solving Large-scale Challenges with ESA DatalabsPablo Gomez - Solving Large-scale Challenges with ESA Datalabs
Pablo Gomez - Solving Large-scale Challenges with ESA Datalabs
 
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
Jonathan Sauder - Miniaturizing Mechanical Systems for CubeSats: Design Princ...
 
Towards an Artificial Muse for new Ideas in Quantum Physics
Towards an Artificial Muse for new Ideas in Quantum PhysicsTowards an Artificial Muse for new Ideas in Quantum Physics
Towards an Artificial Muse for new Ideas in Quantum Physics
 
EDEN ISS - A space greenhouse analogue in Antarctica
EDEN ISS - A space greenhouse analogue in AntarcticaEDEN ISS - A space greenhouse analogue in Antarctica
EDEN ISS - A space greenhouse analogue in Antarctica
 
How to give a robot a soul
How to give a robot a soulHow to give a robot a soul
How to give a robot a soul
 
Exploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine LearningExploring Architected Materials Using Machine Learning
Exploring Architected Materials Using Machine Learning
 
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...
Electromagnetically Actuated Systems for Modular, Self-Assembling and Self-Re...
 
HORUS: Peering into Lunar Shadowed Regions with AI
HORUS: Peering into Lunar Shadowed Regions with AIHORUS: Peering into Lunar Shadowed Regions with AI
HORUS: Peering into Lunar Shadowed Regions with AI
 
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...
META-SPACE: Psycho-physiologically Adaptive and Personalized Virtual Reality ...
 
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and Technologies
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and TechnologiesThe Large Interferometer For Exoplanets (LIFE) II: Key Methods and Technologies
The Large Interferometer For Exoplanets (LIFE) II: Key Methods and Technologies
 
Black Holes and Bright Quasars
Black Holes and Bright QuasarsBlack Holes and Bright Quasars
Black Holes and Bright Quasars
 
In vitro simulation of spaceflight environment to elucidate combined effect o...
In vitro simulation of spaceflight environment to elucidate combined effect o...In vitro simulation of spaceflight environment to elucidate combined effect o...
In vitro simulation of spaceflight environment to elucidate combined effect o...
 
The Large Interferometer For Exoplanets (LIFE): the science of characterising...
The Large Interferometer For Exoplanets (LIFE): the science of characterising...The Large Interferometer For Exoplanets (LIFE): the science of characterising...
The Large Interferometer For Exoplanets (LIFE): the science of characterising...
 
Vernal pools a new ecosystem for astrobiology studies
Vernal pools a new ecosystem for astrobiology studiesVernal pools a new ecosystem for astrobiology studies
Vernal pools a new ecosystem for astrobiology studies
 
Keeping a Sentinel Eye on the Volcanoes – from Space!
Keeping a Sentinel Eye on the Volcanoes – from Space!Keeping a Sentinel Eye on the Volcanoes – from Space!
Keeping a Sentinel Eye on the Volcanoes – from Space!
 
AI4Space –Artificial Intelligence at ISTA - Hülsmann & Haser
AI4Space –Artificial Intelligence at ISTA - Hülsmann & HaserAI4Space –Artificial Intelligence at ISTA - Hülsmann & Haser
AI4Space –Artificial Intelligence at ISTA - Hülsmann & Haser
 

Recently uploaded

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 

Recently uploaded (20)

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptx
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 

Information processing with artificial spiking neural networks

  • 1. Affiliated with the Novartis Institutes for BioMedical Research Affiliated Institute of the University of Basel Information processing with artificial spiking neural networks Friedemann Zenke Computational Neuroscience @ FMI www.zenkelab.org
  • 2. .ch www.zenkelab.org Some ways to improve efficiency of deep nets for AI ● Improve models (inductive biases, architecture) ● Continual learning ● Unsupervised learning (data efficiency) ● Reduce data transfer and communications (avoid von Neumann bottleneck) ● Use of analog and mixed signal circuitry (in-memory computing, analog neurons/synapses) ● Sparsity of activity and connections (e.g., event-based processing, network pruning)
  • 3. .ch www.zenkelab.org Today’s focus is on spiking neural networks Training spiking neural networks ● Network translation Usually makes rate-coding assumption and results in high activity levels. ● Direct training (surrogate gradients) Very general. No coding assumptions. Truly end-to-end.
  • 4. .ch www.zenkelab.org Today’s focus is on spiking neural networks ● Part 1: Brief intro on surrogate gradients ● Part 2: Self-calibrating analog neuromorphic hardware ● Part 3: Can we do without backprop through time? What’s the optimal initialization for spiking nets?
  • 5. .ch www.zenkelab.org Task-optimizing spiking neural networks end-to-end Sensory inputs “Sophisticated function” Intelligent behavior
  • 6. .ch www.zenkelab.org Task-optimizing spiking neural networks end-to-end “Sophisticated function” 1) Input data Intelligent behavior
  • 7. .ch www.zenkelab.org Task-optimizing spiking neural networks end-to-end “Sophisticated function” 1) Input data 2) Desired output
  • 8. .ch www.zenkelab.org Task-optimizing spiking neural networks end-to-end “Sophisticated function” 1) Input data 2) Desired output 3) Find connectivity (learning)
  • 9. .ch www.zenkelab.org Input: Spatiotemporal spike trains Example for this talk “Randman” Synthetic spike times from smooth random manifolds. Binary discrimination task Zenke and Vogels (2021)
  • 10. .ch www.zenkelab.org Output: Linear combination of filtered output spike trains Activation Time (s) Readout
  • 11. .ch www.zenkelab.org Output: Linear combination of filtered output spike trains Activation Time (s) Readout Max over time idea from Tempotron Gütig & Sompolinsky (2006); Gütig (2016)
  • 12. .ch www.zenkelab.org Learning: Training spiking networks end-to-end ● Spiking neurons & networks are RNNs ● They have implicit and explicit recurrence ● Known training procedures for networks with hidden units ● Backpropagation-through time (BPTT) ● Real-time recurrent learning (RTRL) Forward Euler integration implicit explicit Neuronal dynamics Activity going through synaptic connections Neftci, Mostafa, & Zenke (2019)
  • 13. .ch www.zenkelab.org Activity snapshots (network has not learned anything) Synthetic data set: 2000 samples from two smooth random manifolds Example: A two-fold classification problem
  • 14. .ch www.zenkelab.org Example: A two-fold classification problem Evolution of loss during gradient descent Trials
  • 15. .ch www.zenkelab.org Learning: Training spiking networks end-to-end ● Spiking neurons & networks are RNNs ● They have implicit and explicit recurrence ● Known training procedures for networks with hidden units ● Backpropagation-through time (BPTT) ● Real-time recurrent learning (RTRL) Problem Forward Euler integration Neftci, Mostafa, & Zenke (2019)
  • 16. .ch www.zenkelab.org Problem: The derivative of a spike train is zero almost everywhere ?? almost everywhere Mem. Parameter Loss Neftci, Mostafa, & Zenke (2019)
  • 17. .ch www.zenkelab.org ?? Mem. Parameter Loss Option 1 (“classic”): Noise injection → gradient in expectation values. e.g.: Pfister, Toyoizumi, Barber & Gerstner (2006), Gardner, Sporea & Grüning (2015) Option 2: Make spikes differentiable. Huh & Sejnowski (2018) Option 3: “Know hidden layer targets” Gilra & Gerstner (2017), Nicola & Clopath (2017) Option 4: Use surrogate gradients. Bohte (2011), Bellec, Salaj, Subramoney, Legenstein, and Maass (2018) Shrestha & Orchard (2018), Zenke & Ganguli (2018), ... In ML: “Straight-through estimators” Bengio et al. (2013) Neftci, Mostafa, & Zenke (2019)
  • 18. .ch www.zenkelab.org Example: A two-fold classification problem Evolution of loss during surrogate gradient descent
  • 19. .ch www.zenkelab.org Example: A two-fold classification problem Activity snapshots (trained network)
  • 20. .ch www.zenkelab.org Example: Solving latency coded MNIST Zenke & Vogels (2021)
  • 21. .ch www.zenkelab.org Surrogate gradients: A tool for task-optimizing spiking neural networks 100ms Dataset: Spiking Heidelberg Digits (SHD) – Cramer, Stradmann, Schemmel, and Zenke (2020) Available at www.compneuro.net Input data Linear decoder at output ~95% classification accuracy Objective Classify words Area 1 Area 2 Area 3 Channel / Neurons Exc Inh
  • 22. .ch www.zenkelab.org Part 2 Neuromorphic engineers build hardware that seeks to emulate neural networks instead of simulating them.
  • 23. .ch www.zenkelab.org Can analog neuromorphic substrates self-calibrate through learning and thereby overcome the problem of device mismatch? BrainScales Neuromorphic System Karlheinz Meier Sebastian Billaudelle Benjamin Cramer Johannes Schemmel Uni Heidelberg
  • 24. .ch www.zenkelab.org Problem: Device mismatch – A stumbling block for widespread use of analog neuromorphic hardware Accuracy 100% Software simulation Weight transfer → costly calibration
  • 25. .ch www.zenkelab.org To study this question we used the BrainScaleS-2 analog neuromorphic hardware system Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y., Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS Sebastian Billaudelle Benjamin Cramer
  • 26. .ch www.zenkelab.org E-phys on analog neuromorphic hardware Sebastian Billaudelle Benjamin Cramer
  • 27. .ch www.zenkelab.org In-the-loop surrogate gradient training Forward-pass on chip and backward pass in software 2) Measure on-chip analog voltage traces 3) Inject voltage into computational graph 4) Compute surrogate gradients → update weights 1) Forward pass on chip (in-the-loop training) Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y., Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
  • 28. .ch www.zenkelab.org Functional spiking neural networks trained on accelerated analog neuromorphic hardware >80k images/sec @ 200mW Sebastian Billaudelle Benjamin Cramer Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y., Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
  • 30. .ch www.zenkelab.org Surrogate gradient learning self-calibrates the analog neuromorphic substrate Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y., Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
  • 31. .ch www.zenkelab.org Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y., Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
  • 32. .ch www.zenkelab.org Heidelberg spiking digits: Speech recognition Sebastian Billaudelle Benjamin Cramer Cramer, B., Billaudelle, S., Kanya, S., Leibfried, A., Grübl, A., Karasenko, V., Pehle, C., Schreiber, K., Stradmann, Y., Weis, J., Schemmel, J., and Zenke, F. (2022). PNAS
  • 33. .ch www.zenkelab.org Part 3 Can we avoid back-propagation through time? How should we initialize spiking nets?
  • 34. .ch www.zenkelab.org Ignoring explicit recurrence in gradient computation yields plausible three-factor online learning rules* (* we still have to worry about spatial credit assignment) Backpropagation-through time Real-time recurrent learning (Williams and Zipser; 1989) Approximate RTRL ● Sparse Jacobians → efficient comp. ● Local rules Great review: Marschall, O., Cho, K., and Savin, C. (2019) Zenke and Neftci (2021)
  • 35. .ch www.zenkelab.org Ignoring explicit recurrence in gradient computation yields plausible three-factor online learning rules* (* we still have to worry about spatial credit assignment) ● Hebbian part ● Voltage-based nonlinearity ● Eligibility trace (“Ca transient”) ● Spatial credit assignment (feedback alignment, burstprop, etc...) Zenke and Ganguli (2018). Neural Computation Zenke and Neftci (2021). Proceedings of the IEEE Hebbian part Presynaptic filter – e.g., presence of neurotransmitter at the PSD All recent online learning rules use this trick i.e., SuperSpike, e-Prop, RFLO, OSTL, …
  • 36. .ch www.zenkelab.org End-to-end online learning Zenke and Ganguli (2018). Neural Computation Spiking network activity during training
  • 37. .ch www.zenkelab.org Approximate learning rules still take advantage of recurrence and do almost as well as full BPTT Zenke and Neftci (2021) No recurrent connections Recurrent (full BPTT) Recurrent (ignoring explicit recurrence in gradient) Speech processing problems (require memory)
  • 38. .ch www.zenkelab.org Part 3 Can we avoid back-propagation through time? How should we initialize spiking nets?
  • 39. .ch www.zenkelab.org Fluctuation-driven firing as observed in the brain is an ideal starting point for learning in spiking neural nets Julian Rossbroich Julia Gygax Based on balanced net theory by van Vreeswijk, Sompolinski, Brunel, ... Unpublished Spiking Conv Nets Firing rate homeostasis
  • 40. .ch www.zenkelab.org Conclusion and summary ● Surrogate gradients allow flexible end-to-end training of spiking neural networks ● Voltage-dependent plasticity adds self-calibration capabilities to analog neuromorphic substrates ● Yes, we can avoid back-propagation through time! Approximate RTRL→ three-factor learning rules
  • 41. .ch www.zenkelab.org Thanks! Manu ● Nikolaos Papanikolaou ● Yue Kris Wu ● Matthias Depoortere ● Tianlin Liu Benjamin Artwork: kyadava.net Julian Sebastian With Karlheinz Meier † & Johannes Schemmel Collaborators Johannes Schemmel Uni Heidelberg Julia Manvi Axel Alumni Hiring post-docs and PhD students! Peter Emre Neftci FZ Julich