Many state-of-the-art machine learning applications today are based on artificial neural networks. In this talk we explore several commonly used neural network architectures. We identify the ideas behind their design, describe their topologies, outline their properties, and discuss their uses.
You might enjoy this talk if you are interested in:
* Discovering some of the popular neural network types
* Learning about their design and how they work
* Understanding what they are good for
8. Disadvantages
● Large amount of training data
● Long time to train
● Computationally expensive
● Hard to interpret – black box
● Many possible architectures
10. Perceptron
● Simplified model of a neuron (1957)
● Linear binary classifier
● Multiple numeric inputs
● One boolean output
● Linearly separable classes only
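To make this concrete, here is a minimal NumPy sketch of a perceptron: a dot product of the numeric inputs with a weight vector, plus a bias, thresholded to a boolean output. The weights and the AND example below are hand-picked for illustration:

```python
import numpy as np

def perceptron(x, w, b):
    """Linear binary classifier: fires iff the weighted sum w.x + b is positive."""
    return bool(np.dot(w, x) + b > 0)

# Hand-picked weights that make the perceptron compute logical AND;
# AND is linearly separable, so a single perceptron can represent it.
w = np.array([1.0, 1.0])
b = -1.5
for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, perceptron(np.array(x, dtype=float), w, b))
```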
14. Multi-layer perceptron
● Nonlinear classification or regression
● Inputs
  ○ Features
● Hidden layers
  ○ Parallel neurons feeding the next layer
  ○ Dot product
  ○ Sigmoid activation function
● Output layer
  ○ Arbitrary activation function
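A hidden layer is just a bank of parallel neurons, each taking a dot product of the previous layer's output and passing it through a sigmoid. A bare-bones forward pass might look like the sketch below; the layer sizes and random weights are illustrative only, and for brevity the sigmoid is used on the output layer as well:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, layers):
    """Propagate the feature vector x through a list of (weights, bias) layers."""
    a = x
    for W, b in layers:
        a = sigmoid(W @ a + b)  # dot products of parallel neurons, then sigmoid
    return a

rng = np.random.default_rng(0)
# 3 input features -> 4 hidden neurons -> 1 output neuron
layers = [(rng.normal(size=(4, 3)), np.zeros(4)),
          (rng.normal(size=(1, 4)), np.zeros(1))]
print(forward(np.array([0.5, -1.2, 3.0]), layers))
```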
15. Training
● Calculate the output
● Apply a loss function
  ○ Must be differentiable
  ○ Should be minimized – an optimization problem
● Gradient descent to update the weights (sketch below)
  ○ Step size proportional to the learning rate
  ○ Stochastic approximations
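The update rule itself fits in one line: move every weight against the gradient of the loss, with a step proportional to the learning rate. A sketch for the simplest case, a linear model with mean squared error, where the toy data and learning rate are made up:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))          # toy inputs
y = X @ np.array([2.0, -1.0]) + 0.5    # toy targets from a known linear rule

w, b, lr = np.zeros(2), 0.0, 0.1       # lr is the learning rate
for _ in range(200):
    err = X @ w + b - y
    grad_w = 2 * X.T @ err / len(y)    # derivative of mean squared error w.r.t. w
    grad_b = 2 * err.mean()            # ... and w.r.t. b
    w -= lr * grad_w                   # step proportional to the learning rate
    b -= lr * grad_b

print(w, b)  # converges towards [2.0, -1.0] and 0.5
```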
16. Training
● Backpropagation (1974) – sketch below
  ○ Derivative of the loss with respect to the weights
  ○ Applied to earlier layers via the chain rule
● Regularization
  ○ Reduces overfitting
  ○ L1 or L2 norm penalty
  ○ Dropout – ignore random neurons during training
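As a sketch of the chain rule at work, here is backpropagation through one hidden layer with sigmoid activations and a squared-error loss; the layer sizes and data are arbitrary:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(2)
x, y = rng.normal(size=3), np.array([1.0])   # one sample, one target
W1, W2 = rng.normal(size=(4, 3)), rng.normal(size=(1, 4))

# Forward pass
h = sigmoid(W1 @ x)        # hidden layer
out = sigmoid(W2 @ h)      # output layer

# Backward pass: chain rule applied layer by layer
d_out = 2 * (out - y) * out * (1 - out)   # loss derivative at the output layer
grad_W2 = np.outer(d_out, h)
d_h = (W2.T @ d_out) * h * (1 - h)        # propagated back to the hidden layer
grad_W1 = np.outer(d_h, x)
print(grad_W1.shape, grad_W2.shape)       # gradients match the weight shapes
```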
20. Convolutional networks
● Convolutional layer
  ○ Filter that scans the image – convolution matrix
  ○ Receptive field – filter size
  ○ Depth – number of filters
  ○ Space invariant
● Pooling layer
  ○ Combines a cluster of neurons into one
  ○ Non-linear down-sampling
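Both layer types can be sketched in a few lines of NumPy: a filter scanned across the image (the convolution) and a pooling step that collapses each cluster of outputs into its maximum. The 8x8 image and the edge filter are made-up examples:

```python
import numpy as np

def conv2d(image, kernel):
    """Slide a filter over the image (valid padding, stride 1)."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = (image[i:i+kh, j:j+kw] * kernel).sum()
    return out

def max_pool(feature_map, size=2):
    """Non-linear down-sampling: keep the max of each size x size cluster."""
    h, w = feature_map.shape[0] // size, feature_map.shape[1] // size
    return feature_map[:h*size, :w*size].reshape(h, size, w, size).max(axis=(1, 3))

image = np.random.default_rng(3).random((8, 8))
edge_filter = np.array([[1.0, -1.0], [1.0, -1.0]])  # crude vertical-edge detector
print(max_pool(conv2d(image, edge_filter)).shape)   # (3, 3)
```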
21. Convolutional networks
● Fully connected layer
  ○ Dense
  ○ Just like in a multi-layer perceptron
● Activation function
  ○ Rectifier (ReLU) – linear, but removes negative values
  ○ Trains faster and reduces the vanishing gradient problem
● Output activation function
  ○ Softmax – single-label classification
  ○ Sigmoid – multi-label classification
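Both activation functions mentioned above are one-liners; a small sketch:

```python
import numpy as np

def relu(z):
    """Rectifier: linear for positive values, zero for negative ones."""
    return np.maximum(0.0, z)

def softmax(z):
    """Scores -> probabilities summing to 1; pick one class (single-label)."""
    e = np.exp(z - z.max())   # subtract the max for numerical stability
    return e / e.sum()

scores = np.array([2.0, -1.0, 0.5])
print(relu(scores))     # [2.  0.  0.5]
print(softmax(scores))  # a probability distribution over three classes
```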
26. Recurrent networks
● Multi-layer perceptron with back-connections
● Topology is a directed graph
● Internal state – memory
● Variable-length sequences with internal dependencies
● Training
  ○ Backpropagation through time
● Vanishing gradient problem reduced via gated state
  ○ Long short-term memory (1997)
  ○ Gated recurrent unit (2014)
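The defining feature is the back-connection: the hidden state computed at one time step feeds into the next, acting as memory. Below is a minimal ungated recurrent cell with made-up sizes; LSTM and GRU cells add gates around this same loop:

```python
import numpy as np

def rnn(sequence, W_x, W_h, b):
    """Plain recurrent cell: the hidden state is the network's memory."""
    h = np.zeros(W_h.shape[0])
    for x in sequence:                       # works for any sequence length
        h = np.tanh(W_x @ x + W_h @ h + b)   # back-connection: h feeds into itself
    return h

rng = np.random.default_rng(4)
W_x, W_h, b = rng.normal(size=(5, 3)), rng.normal(size=(5, 5)), np.zeros(5)
sequence = [rng.normal(size=3) for _ in range(7)]  # 7 time steps, 3 features each
print(rnn(sequence, W_x, W_h, b))
```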
30. Materials
● Deep Learning @ MIT Press
● Neural Networks and Deep Learning @ Michael Nielsen
● Practical Deep Learning @ Coursera
● Deep Learning Specialization @ Coursera
● Deep Learning Courses @ edX