SlideShare a Scribd company logo
1 of 19
CS295: Modern Systems:
Application Case Study
Neural Network Accelerator
Sang-Woo Jun
Spring 2019
Many slides adapted from
Hyoukjun Kwon‘s Gatech “Designing CNN Accelerators”
Usefulness of Deep Neural Networks
 No need to further emphasize the obvious
Convolutional Neural Network for
Image/Video Recognition
ImageNet Top-5 Classification Accuracy
Over the Years
image-net.org “ImageNet Large Scale Visual Recognition Challenge (ILSVRC) 2017,” 2017
AlexNet, The Beginning
15 million images 1000 classes in the ImageNet challenge
“The first* fast**
GPU-accelerated Deep Convolutional Neural Network
to win an image recognition contest
Convolutional Neural Networks Overview
Convolution
Layer
Fully
Connected
Layer
Convolution
Layer
Convolution
Layer
Fully
Connected
Layer
Fully
Connected
Layer
goldfish: 0.002%
shark: 0.08%
magpie: 0. 02%
Palace: 89%
…
…
Paper towel: 1.4%
Spatula: 0.001%
…
… …
“Convolution” “Neural Network”
…
Training vs. Inference
 Training: Tuning parameters using training data
o Backpropagation using stochastic gradient descent is the most popular algorithm
o Training in data centers and distributing trained data is a common model*
o Because training algorithm changes rapidly, GPU cluster is the most popular
hardware (Low demand for application-specific accelerators)
 Inference: Determining class of a new input data
o Using a trained model, determine class of a new input data
o Inference usually occurs close to clients
o Low-latency and power-efficiency is required
(High demand for application specific accelerators)
Deep Neural Networks (“Fully Connected”*)
Chris Edwards, “Deep Learning Hunts for Signals Among the Noise,” Communications of the ACM, June 2018
 Each layer may have a different number of neurons
goldfish: 0.002%
Palace: 89%
Paper towel: 1.4%
Spatula: 0.001%
An Artificial Neuron
 Effectively weight vector multiplied
by input vector to obtain a scalar
 May apply activation function to
output
o Adds non-linearity
Sigmoid Rectified Linear Unit
(ReLU)
Jed Fox, “Neural Networks 101,” 2017
Convolution Layer
31 7 44
65 35 40
46 29 32
33
46
30
24 49 8 64
65 46
46 64
Convolution layer Optional pooling layer
Convolution Example
1 2 3
-2 0 -1
5 -2 4
Channel partial sum[0][0] =
1 x 0 + 2 x 1 + 3 x 0
+ (-2) x 2 + 0 x 4 + (-1) x 3
+ 5 x 5 + (-2) x 2 + 4 x 7
= 44
44
0 1 0
2 4 3
5 2 7
1 0 1
1 0 0
2 1 5
4 1 8
5 0 1
0 0 0
4 2 8
5 8 3
5 2 6 Channel partial sum[0][0] =
1 x 0 + 2 x 1 + 3 x 0
+ (-2) x 2 + 0 x 4 + (-1) x 3
+ 5 x 5 + (-2) x 2 + 4 x 7
= 44
44 -1
Typically adds zero padding to source matrix
to maintain dimensions
Convolution
Filter
Input map Output map
× =
Multidimensional Convolution
 “Feature Map” usually has multiple layers
o An image has R, G, B layers, or “channels”
 One layer has many convolution filters, which create a multichannel
output map
1 2 3
-2 0 -1
5 -2 4
Input feature map 3x3x3 filter
×
Output feature map
=
Multiple Convolutions
Filter 0
Filter 1
Input feature map
Output feature map 0
Output feature map 1
Example Learned Convolution Filters
Alex Krizhevsky et al., “ImageNet Classification with Deep Convolutional Neural Networks,” NIPS, 2012
Multidimensional Convolution
Image found online. Original source unknown
Computation in the Convolution Layer
for(n=0; n<N; n++) { // Input feature maps (IFMaps)
for(m=0; m<M; m++) { // Weight Filters
for(c=0; c<C; c++) { // IFMap/Weight Channels
for(y=0; y<H; y++) { // Input feature map row
for(x=0; x<H; x++) { // Input feature map column
for(j=0; j<R; j++) { // Weight filter row
for(i=0; i<R; i++) { // Weight filter column
O[n][m][x][y] += W[m][c][i][j] * I[n][c][y+j][x+i]}}}}}}}
Pooling Layer
 Reduces size of the feature map
o Max pooling, Average pooling, …
31 7 44
65 35 40
46 29 32
33
46
30
24 49 8 64
65 46
46 64
Max pooling example
Real Convolutional Neural Network
-- AlexNet
Alex Krizhevsky et al., “ImageNet Classification with Deep Convolutional Neural Networks,” NIPS, 2012
96 11x11x3 kernels 256 5x5x48 384 3x3x128 …
Simplified intuition: Higher order information at later layer
Real Convolutional Neural Network
-- VGG 16
Heuritech blog (https://blog.heuritech.com/2016/02/29/a-brief-report-of-the-heuritech-deep-learning-meetup-5/)
Contains 138 million weights and
15.5G MACs to process one 224 × 224 input image
There are Many, Many Neural Networks
 GoogLeNet, ResNet, YOLO, …
o Share common building blocks, but look drastically different
GoogLeNet (ImageNet 2014 winner)
ResNet
(ImageNet 2015 winner)

More Related Content

Similar to Convolution Neural Network Lecture Slides

Deep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryDeep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryKenta Oono
 
Introduction to Neural Networks and Deep Learning
Introduction to Neural Networks and Deep LearningIntroduction to Neural Networks and Deep Learning
Introduction to Neural Networks and Deep LearningVahid Mirjalili
 
Find nuclei in images with U-net
Find nuclei in images with U-netFind nuclei in images with U-net
Find nuclei in images with U-netDing Li
 
Ieee 2016 nss mic poster N30-21
Ieee 2016 nss mic poster N30-21Ieee 2016 nss mic poster N30-21
Ieee 2016 nss mic poster N30-21Dae Woon Kim
 
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...MLconf
 
Diving into Deep Learning (Silicon Valley Code Camp 2017)
Diving into Deep Learning (Silicon Valley Code Camp 2017)Diving into Deep Learning (Silicon Valley Code Camp 2017)
Diving into Deep Learning (Silicon Valley Code Camp 2017)Oswald Campesato
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Mostafa G. M. Mostafa
 
ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21Dae Woon Kim
 
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)Hansol Kang
 
Tutorial on convolutional neural networks
Tutorial on convolutional neural networksTutorial on convolutional neural networks
Tutorial on convolutional neural networksHojin Yang
 
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
SimCLR: A Simple Framework for Contrastive Learning of Visual RepresentationsSimCLR: A Simple Framework for Contrastive Learning of Visual Representations
SimCLR: A Simple Framework for Contrastive Learning of Visual Representationsynxm25hpxp
 
ImageNet classification with deep convolutional neural networks(2012)
ImageNet classification with deep convolutional neural networks(2012)ImageNet classification with deep convolutional neural networks(2012)
ImageNet classification with deep convolutional neural networks(2012)WoochulShin10
 
Deep Learning, Keras, and TensorFlow
Deep Learning, Keras, and TensorFlowDeep Learning, Keras, and TensorFlow
Deep Learning, Keras, and TensorFlowOswald Campesato
 
Deep Neural Networks for Computer Vision
Deep Neural Networks for Computer VisionDeep Neural Networks for Computer Vision
Deep Neural Networks for Computer VisionAlex Conway
 
ECCV2010: feature learning for image classification, part 4
ECCV2010: feature learning for image classification, part 4ECCV2010: feature learning for image classification, part 4
ECCV2010: feature learning for image classification, part 4zukun
 

Similar to Convolution Neural Network Lecture Slides (20)

Deep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistryDeep learning for molecules, introduction to chainer chemistry
Deep learning for molecules, introduction to chainer chemistry
 
Introduction to Neural Networks and Deep Learning
Introduction to Neural Networks and Deep LearningIntroduction to Neural Networks and Deep Learning
Introduction to Neural Networks and Deep Learning
 
Find nuclei in images with U-net
Find nuclei in images with U-netFind nuclei in images with U-net
Find nuclei in images with U-net
 
Ieee 2016 nss mic poster N30-21
Ieee 2016 nss mic poster N30-21Ieee 2016 nss mic poster N30-21
Ieee 2016 nss mic poster N30-21
 
B.tech_project_ppt.pptx
B.tech_project_ppt.pptxB.tech_project_ppt.pptx
B.tech_project_ppt.pptx
 
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
 
Diving into Deep Learning (Silicon Valley Code Camp 2017)
Diving into Deep Learning (Silicon Valley Code Camp 2017)Diving into Deep Learning (Silicon Valley Code Camp 2017)
Diving into Deep Learning (Silicon Valley Code Camp 2017)
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)
 
ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21ieee nss mic 2016 poster N30-21
ieee nss mic 2016 poster N30-21
 
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
 
ann-ics320Part4.ppt
ann-ics320Part4.pptann-ics320Part4.ppt
ann-ics320Part4.ppt
 
ann-ics320Part4.ppt
ann-ics320Part4.pptann-ics320Part4.ppt
ann-ics320Part4.ppt
 
Tutorial on convolutional neural networks
Tutorial on convolutional neural networksTutorial on convolutional neural networks
Tutorial on convolutional neural networks
 
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
SimCLR: A Simple Framework for Contrastive Learning of Visual RepresentationsSimCLR: A Simple Framework for Contrastive Learning of Visual Representations
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
 
ImageNet classification with deep convolutional neural networks(2012)
ImageNet classification with deep convolutional neural networks(2012)ImageNet classification with deep convolutional neural networks(2012)
ImageNet classification with deep convolutional neural networks(2012)
 
Deep Learning, Keras, and TensorFlow
Deep Learning, Keras, and TensorFlowDeep Learning, Keras, and TensorFlow
Deep Learning, Keras, and TensorFlow
 
Deep learning
Deep learningDeep learning
Deep learning
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
 
Deep Neural Networks for Computer Vision
Deep Neural Networks for Computer VisionDeep Neural Networks for Computer Vision
Deep Neural Networks for Computer Vision
 
ECCV2010: feature learning for image classification, part 4
ECCV2010: feature learning for image classification, part 4ECCV2010: feature learning for image classification, part 4
ECCV2010: feature learning for image classification, part 4
 

Recently uploaded

原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 

Recently uploaded (20)

原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 

Convolution Neural Network Lecture Slides

  • 1. CS295: Modern Systems: Application Case Study Neural Network Accelerator Sang-Woo Jun Spring 2019 Many slides adapted from Hyoukjun Kwon‘s Gatech “Designing CNN Accelerators”
  • 2. Usefulness of Deep Neural Networks  No need to further emphasize the obvious
  • 3. Convolutional Neural Network for Image/Video Recognition
  • 4. ImageNet Top-5 Classification Accuracy Over the Years image-net.org “ImageNet Large Scale Visual Recognition Challenge (ILSVRC) 2017,” 2017 AlexNet, The Beginning 15 million images 1000 classes in the ImageNet challenge “The first* fast** GPU-accelerated Deep Convolutional Neural Network to win an image recognition contest
  • 5. Convolutional Neural Networks Overview Convolution Layer Fully Connected Layer Convolution Layer Convolution Layer Fully Connected Layer Fully Connected Layer goldfish: 0.002% shark: 0.08% magpie: 0. 02% Palace: 89% … … Paper towel: 1.4% Spatula: 0.001% … … … “Convolution” “Neural Network” …
  • 6. Training vs. Inference  Training: Tuning parameters using training data o Backpropagation using stochastic gradient descent is the most popular algorithm o Training in data centers and distributing trained data is a common model* o Because training algorithm changes rapidly, GPU cluster is the most popular hardware (Low demand for application-specific accelerators)  Inference: Determining class of a new input data o Using a trained model, determine class of a new input data o Inference usually occurs close to clients o Low-latency and power-efficiency is required (High demand for application specific accelerators)
  • 7. Deep Neural Networks (“Fully Connected”*) Chris Edwards, “Deep Learning Hunts for Signals Among the Noise,” Communications of the ACM, June 2018  Each layer may have a different number of neurons goldfish: 0.002% Palace: 89% Paper towel: 1.4% Spatula: 0.001%
  • 8. An Artificial Neuron  Effectively weight vector multiplied by input vector to obtain a scalar  May apply activation function to output o Adds non-linearity Sigmoid Rectified Linear Unit (ReLU) Jed Fox, “Neural Networks 101,” 2017
  • 9. Convolution Layer 31 7 44 65 35 40 46 29 32 33 46 30 24 49 8 64 65 46 46 64 Convolution layer Optional pooling layer
  • 10. Convolution Example 1 2 3 -2 0 -1 5 -2 4 Channel partial sum[0][0] = 1 x 0 + 2 x 1 + 3 x 0 + (-2) x 2 + 0 x 4 + (-1) x 3 + 5 x 5 + (-2) x 2 + 4 x 7 = 44 44 0 1 0 2 4 3 5 2 7 1 0 1 1 0 0 2 1 5 4 1 8 5 0 1 0 0 0 4 2 8 5 8 3 5 2 6 Channel partial sum[0][0] = 1 x 0 + 2 x 1 + 3 x 0 + (-2) x 2 + 0 x 4 + (-1) x 3 + 5 x 5 + (-2) x 2 + 4 x 7 = 44 44 -1 Typically adds zero padding to source matrix to maintain dimensions Convolution Filter Input map Output map × =
  • 11. Multidimensional Convolution  “Feature Map” usually has multiple layers o An image has R, G, B layers, or “channels”  One layer has many convolution filters, which create a multichannel output map 1 2 3 -2 0 -1 5 -2 4 Input feature map 3x3x3 filter × Output feature map =
  • 12. Multiple Convolutions Filter 0 Filter 1 Input feature map Output feature map 0 Output feature map 1
  • 13. Example Learned Convolution Filters Alex Krizhevsky et al., “ImageNet Classification with Deep Convolutional Neural Networks,” NIPS, 2012
  • 14. Multidimensional Convolution Image found online. Original source unknown
  • 15. Computation in the Convolution Layer for(n=0; n<N; n++) { // Input feature maps (IFMaps) for(m=0; m<M; m++) { // Weight Filters for(c=0; c<C; c++) { // IFMap/Weight Channels for(y=0; y<H; y++) { // Input feature map row for(x=0; x<H; x++) { // Input feature map column for(j=0; j<R; j++) { // Weight filter row for(i=0; i<R; i++) { // Weight filter column O[n][m][x][y] += W[m][c][i][j] * I[n][c][y+j][x+i]}}}}}}}
  • 16. Pooling Layer  Reduces size of the feature map o Max pooling, Average pooling, … 31 7 44 65 35 40 46 29 32 33 46 30 24 49 8 64 65 46 46 64 Max pooling example
  • 17. Real Convolutional Neural Network -- AlexNet Alex Krizhevsky et al., “ImageNet Classification with Deep Convolutional Neural Networks,” NIPS, 2012 96 11x11x3 kernels 256 5x5x48 384 3x3x128 … Simplified intuition: Higher order information at later layer
  • 18. Real Convolutional Neural Network -- VGG 16 Heuritech blog (https://blog.heuritech.com/2016/02/29/a-brief-report-of-the-heuritech-deep-learning-meetup-5/) Contains 138 million weights and 15.5G MACs to process one 224 × 224 input image
  • 19. There are Many, Many Neural Networks  GoogLeNet, ResNet, YOLO, … o Share common building blocks, but look drastically different GoogLeNet (ImageNet 2014 winner) ResNet (ImageNet 2015 winner)