2. Neural Network
● A neural network is the model used in deep learning
● Designed to simulate the human brain
● Consists of several layers; data is passed through them one by one
[Diagram: data flows from the input layer, through two hidden layers, to the output layer]
3. Perceptron
● In one perceptron, the output value is computed as
y = Σi wi xi
z = f(y)
● Where xi are the inputs, wi are the weights, and f is the activation function
[Diagram: inputs x1, x2, x3 are multiplied by their weights, summed into y, and passed through f to produce the output z]
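As a minimal sketch (not from the slides), a single perceptron fits in a few lines of NumPy; the input and weight values below are illustrative assumptions:

```python
import numpy as np

def perceptron(x, w, f):
    """Compute y = sum_i w_i * x_i, then z = f(y)."""
    y = np.dot(w, x)   # weighted sum of the inputs
    return f(y)        # apply the activation function

# Illustrative inputs and weights (not from the slides)
x = np.array([1.0, 0.5, -0.2])
w = np.array([0.4, -0.6, 0.9])
print(perceptron(x, w, np.tanh))  # tanh as an example activation
```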
4. One Layer
● For an input vector x, the output of one layer is computed as
y = Wx + b
z = f(y)
Where W is the weight matrix of the layer, b is the bias vector, and f is the activation function.
● W and b are called parameters because we modify them to optimize the model
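A minimal sketch of one layer's forward pass, assuming NumPy; the layer sizes and values are illustrative:

```python
import numpy as np

def layer_forward(x, W, b, f):
    """One layer: y = Wx + b, then z = f(y)."""
    y = W @ x + b
    return f(y)

# Illustrative shapes: 3 inputs -> 4 outputs (sizes are assumptions)
rng = np.random.default_rng(0)
W = rng.normal(size=(4, 3))      # weight matrix of the layer
b = np.zeros(4)                  # bias vector
x = np.array([0.2, -1.0, 0.5])   # input vector
z = layer_forward(x, W, b, np.tanh)
```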
5. Activation Function
● Without activation functions, multiple layers would be meaningless: a composition of linear maps is still just one linear map
● A good activation function is non-linear, differentiable, and monotonically increasing.
● Logistic function
● Hyperbolic Tangent
● Rectified linear function (ReLU)
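For reference, the three activation functions listed above can be written in NumPy as follows:

```python
import numpy as np

def logistic(y):
    """Logistic function: 1 / (1 + e^-y), squashes output into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-y))

def tanh(y):
    """Hyperbolic tangent, squashes output into (-1, 1)."""
    return np.tanh(y)

def relu(y):
    """Rectified linear function: max(0, y)."""
    return np.maximum(0.0, y)
```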
6. For Various Problems
● The activation function of the output layer depends on the type of problem
● For regression:
○ Activation function: Identity function
○ Length of output vector: Any
● For binary classification:
○ Activation function: Logistic function
○ Length of output vector: 1
● For multi-class classification (see the softmax sketch below):
○ Activation function: Softmax function
○ Length of output vector: Number of classes
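A minimal sketch of the softmax function; subtracting the maximum is a standard numerical-stability trick, not something the slides specify:

```python
import numpy as np

def softmax(y):
    """Map raw scores to a probability vector that sums to 1."""
    e = np.exp(y - np.max(y))  # subtract the max for numerical stability
    return e / e.sum()

print(softmax(np.array([2.0, 1.0, 0.1])))  # ≈ [0.659, 0.242, 0.099]
```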
7. Example Task
● Task setting:
○ Given a picture of a hand-written number 0-9, we want to tell which number it is
○ The training dataset consists of many pictures, each labeled with the correct answer.
● Analyze task:
○ Problem type: Multi-class classification
○ Activation function of output layer: Softmax function
8. Learn: Minimize Error
● Consider an error function E(xi, di; W, b) which represents how far off the model is from the true value for the i-th picture. Here xi is the vector representation of the picture.
● In our example we use the cross-entropy error
E(xi, di; W, b) = -Σk dik log zik
where zik is the k-th output of the network for picture xi. Here, dik is 1 only if xi is actually a picture of the number k, and 0 otherwise.
● We want to modify W and b to minimize the error function.
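A minimal sketch of this error for one picture, assuming the one-hot target d and softmax output z described above (the example values are illustrative):

```python
import numpy as np

def cross_entropy(z, d):
    """E = -sum_k d_k * log(z_k) for one picture.

    z: softmax output of the network (probabilities)
    d: one-hot target (d_k = 1 only for the correct class)
    """
    eps = 1e-12  # avoid log(0)
    return -np.sum(d * np.log(z + eps))

z = np.array([0.7, 0.2, 0.1])   # illustrative network output
d = np.array([1.0, 0.0, 0.0])   # the true class is 0
print(cross_entropy(z, d))      # -log(0.7) ≈ 0.357
```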
9. Learn: Gradient Descent
We use Gradient Descent to modify parameters W and b.
● Imagine you are on a mountain and want to reach the top, but you have lost your map and cannot see far because of fog. How do you reach the top?
➔Move in the direction that takes you highest.
● The vector that indicates the direction to move is called the gradient
● Since we want to minimize (instead of maximize), we update the parameters by subtracting the gradient.
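The update rule is simply "subtract the gradient", scaled by a learning rate; a sketch (the function name and learning-rate value are assumptions):

```python
def sgd_step(params, grads, lr=0.01):
    """Subtract the gradient from every parameter in place.

    params: list of NumPy arrays (e.g. [W1, b1, W2, b2])
    grads:  gradients of the error with respect to each parameter
    lr:     learning rate, scales the step size (value is an assumption)
    """
    for theta, g in zip(params, grads):
        theta -= lr * g  # theta <- theta - lr * dE/dtheta
```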
10. Learn: Backpropagation
● Neural networks with multiple layers are too complex for their gradients to be computed directly.
● This problem was one bottleneck in the early stages of deep learning's development.
● Backpropagation computes the gradient from the output layer back to the input layer (a lot of chain rules).
[Diagram: the same network; backpropagation runs from the output layer back toward the input layer]
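A minimal sketch of backpropagation for a tiny two-layer network with tanh hidden units and a softmax/cross-entropy output; the architecture, sizes, and data are illustrative assumptions, not the slides' exact setup:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=3)             # illustrative input vector
d = np.array([0.0, 1.0])           # one-hot target, 2 classes

# Parameters of a tiny 3 -> 4 -> 2 network (sizes are assumptions)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)
W2, b2 = rng.normal(size=(2, 4)), np.zeros(2)

# Forward pass
y1 = W1 @ x + b1
z1 = np.tanh(y1)                   # hidden activation
y2 = W2 @ z1 + b2
e = np.exp(y2 - y2.max())
z2 = e / e.sum()                   # softmax output

# Backward pass: apply the chain rule from output layer to input layer
g_y2 = z2 - d                      # dE/dy2 for softmax + cross-entropy
g_W2 = np.outer(g_y2, z1)          # dE/dW2
g_b2 = g_y2                        # dE/db2
g_z1 = W2.T @ g_y2                 # push the gradient back one layer
g_y1 = g_z1 * (1.0 - z1 ** 2)      # tanh'(y1) = 1 - tanh(y1)^2
g_W1 = np.outer(g_y1, x)           # dE/dW1
g_b1 = g_y1                        # dE/db1

# One gradient descent step on every parameter
lr = 0.1
W1 -= lr * g_W1; b1 -= lr * g_b1
W2 -= lr * g_W2; b2 -= lr * g_b2
```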
11. Why Deep Learning?
● Deep learning has tons of parameters (things that we can change to optimize
the model)
➔Better accuracy
➔Hard to optimize
➔Need a lot of data
● Flexible model
➔Can be applied to different types of problems
➔Easy to modify the model for various situations
12. Experiment
● Run a Python script for the example task
● The input vector has length 784 (28 × 28 pixels)
● The configuration (sketched below) is:
○ 2 hidden layers
○ 1000 perceptrons in each layer
○ ReLU as the activation function for the hidden layers
○ Softmax as the activation function for the output layer
○ Batch size: 100
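A sketch of this configuration in the style of Chainer's MNIST example; the code below is my paraphrase of that example, not the exact script that was run:

```python
import chainer
import chainer.functions as F
import chainer.links as L

class MLP(chainer.Chain):
    """784 -> 1000 -> 1000 -> 10, with ReLU in the hidden layers."""

    def __init__(self, n_units=1000, n_out=10):
        super().__init__()
        with self.init_scope():
            self.l1 = L.Linear(None, n_units)  # input size (784) inferred
            self.l2 = L.Linear(None, n_units)
            self.l3 = L.Linear(None, n_out)

    def __call__(self, x):
        h1 = F.relu(self.l1(x))
        h2 = F.relu(self.l2(h1))
        return self.l3(h2)  # softmax is applied inside the loss below

# L.Classifier applies softmax_cross_entropy to the raw outputs
model = L.Classifier(MLP())
```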
13. Result
● For this experiment, I used Chainer's example code
● Execution time: about 45 minutes
● Final Validation Loss: 0.107
● Final Validation Accuracy: 0.98
14. Use Cases of Deep Learning
● Convolutional Neural Network: Deep learning specialized for image data
○ Object identification
○ Face recognition
● Recurrent Neural Network: Deep learning for sequential data
○ Speech recognition
○ Text processing
● DQN: A combination of deep learning and Q-learning
○ AlphaGo, which defeated a top-level Go player, was built on deep reinforcement learning of this kind