SlideShare a Scribd company logo
1 of 33
Download to read offline
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Optimizing FPGA-Based Accelerator Design
for Deep Convolutional Neural Network
Chen Zhang, Peng Li, Guangyu Sun, et al
Presented by Zhuangwei Zhuang
South China University of Technology
August 14, 2017
1 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Content
1 Motivation
2 Background
3 Method
4 Exploration
5 Implementation
6 Experiment and Result
7 Conclusion
2 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Motivation
Acceleration Platforms
Convolutional neural network (CNN) has been widely employed
for image recognition for its ability to achieve high accuracy.
Recently, various FPGA-based accelerators for deep CNN have
been proposed.
Platforms to Accelerate CNN
FPGA - Field Programmable Gate Array
GPU - Graphics Processing Unit
ASIC - Application Specific Integrated Circuit
3 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Motivation
Compare FPGA with GPU
Table: Projected ImageNet 1K Performance and Power1
Platform
ImageNet 1K
(Image/s)
Max Device
Power (W)
Efficiency
(Image/Sec/W)
Catapult Server + Stratix V D5 134 25 5.36
Catapult Server + Arria 10 GX1150 233 25 9.32
Caffe + cuDNN on Tesla K20 376 235 1.6
Caffe + cuDNN on Tesla K40 500∼824 235 2.13∼3.51
Table: Projected AlexNet Performance and Power2
CNN Classification Platform Performance(Image/s) Power(W)
Efficiency
(Image/Sec/W)
E52699 Dual Xeon CPU (18
core per Xeon)
1320 321 4.11
PCIe w/Dual Arria 10 1150 1200 130 9.27
1
Kalin et al.Accelerating Deep Convolutional Neural Networks Using Specialized Hardware
2
Xilinxr
.Efficient Implementation of Neural Network Systems Built on FPGAs,Programmed with OpenCL
4 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Motivation
FPGA Structure
Figure: FPGA Structure
5 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Motivation
FPGA-based Accelerator for Deep CNN
Advantages
High performance
High energy efficiency
Capability of Reconfiguration
Fast development round
Disadvantages
Limited computation resource
Limited memory bandwidth
6 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Motivation
Problem
There would be as much as 90%
performance difference between two
different solutions with the same logic
resource utilization.
7 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Background
FPGA Design in Roofline Model
FPGA-based Accelerator Structure
8 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Background
FPGA Design in Roofline Model
9 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Background
FPGA Design in Roofline Model
10 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Background
Attainable Performance vs. Computation performance
11 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Background
Attainable Performance vs. Computation performance
12 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Background
Optimization
Optimizing Target:
Find a design with maximized attainable performance
Optimizing Methods
Loop Unroll
Pipeline
Loop Tiling
Data Reuse
13 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Background
Convolutional Neural Network
14 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Background
Convolutional Neural Network
Convolutional layers account for over
90% computation3,4
3
A. Krizhevsky, etc. Imagenet classification with deep convolutional neural networks. NIPS 2012.
4
J. Cong and B. Xiao. Minimizing computation in convolutional neural networks. ICANN 2014.
15 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Method
Loop Unroll
Y [m][r][c] =
N−1
X
n=0
K−1
X
i=0
K−1
X
j=0
W[m][n][i][j]×X[n][S×r+i][S×c+j]
(1)
16 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Method
Loop Unroll
Pseudo code of a convolutional layer
1 for(row=0;row<R;row++){
2 for(col=0;col<C;col++){
3 for(to=0;to<M;to++){
4 for(ti=0;ti<N;ti++){
5 for(i=0;i<K;i++){
6 for(j=0;j<K;j++){
output fm[to][row][col] +=
weights[to][i][j]*input fm[ti][S*row+i][S*col+j];
}}}}}}
17 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Method
Loop Tiling
Pseudo code of a tiled convolutional layer
External data transfer (Tile Loop)
1 for(row=0;row<R;row+=Tr){
2 for(col=0;col<C;col+=Tc){
3 for(to=0;to<M;to+=Tm){
4 for(ti=0;ti<N;ti+=Tn){
5 //load output feature map
6 //load weight
7 //load input feature map
On-chip data computation (Point Loop)
8 for(trr=row;trr<min(row+Tr,R);trr++){
9 for(tcc=col;tcc<min(col+Tc,C);tcc++){
10 for(too=to;too<min(To+Tm,M);too++){
11 for(tii=ti;ti<min(ti+Tn,N);tii++){
12 for(i=0;i<K;i++){
13 for(j=0;j<K;j++){
14 L: output fm[too][trr][tcc] += weight[too][tii][i][j]*
input fm[tii][S*trr+i][S*tcc+j];
}}}}}}//store output feature maps
}}}}
18 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Method
Computation Optimization
Different unrolling way will affect the complexity of generated
hardware, and even affect the hardware operation frequency.
On-chip data computation (Point Loop)
8 for(trr=row;trr<min(row+Tr,R);trr++){
9 for(tcc=col;tcc<min(col+Tc,C);tcc++){
10 for(too=to;too<min(To+Tm,M);too++){
11 for(tii=ti;ti<min(ti+Tn,N);tii++){
12 for(i=0;i<K;i++){
13 for(j=0;j<K;j++){
14 L: output fm[too][trr][tcc] +=
weight[too][tii][i][j]*
input fm[tii][S*trr+i][S*tcc+j];
}}}}}}
Before optimized
On-chip data computation (Point Loop)
8 for(i=0;i<K;i++){
9 for(j=0;j<K;j++){
10 for(trr=row;trr<min(row+Tr,R);trr++){
11 for(tcc=col;tcc<min(col+Tc,C);tcc++){
#pragma HLS pipeline
12 for(too=to;too<min(To+Tm,M);too++){
#pragma HLS UNROLL
13 for(tii=ti;ti<min(ti+Tn,N);tii++){
#pragma HLS UNROLL
14 L: output fm[too][trr][tcc] +=
weight[too][tii][i][j]*
input fm[tii][S*trr+i][S*tcc+j];
}}}}}}
After optimized
19 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Method
Computation Optimization
On-chip data computation (Point Loop)
8 for(i=0;i<K;i++){
9 for(j=0;j<K;j++){
10 for(trr=row;trr<min(row+Tr,R);trr++){
11 for(tcc=col;tcc<min(col+Tc,C);tcc++){
#pragma HLS pipeline
12 for(too=to;too<min(To+Tm,M);too++){
#pragma HLS UNROLL
13 for(tii=ti;ti<min(ti+Tn,N);tii++){
#pragma HLS UNROLL
14 L: output fm[too][trr][tcc] +=
weight[too][tii][i][j]*
input fm[tii][S*trr+i][S*tcc+j];
}}}}}}
20 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Method
Computational Roof
Computational roof
=
total number of operations
number of execution cycles
=
2 × R × C × M × N × K × K
[ M
Tm
] × [ N
Tn
] × [ R
Tr
] × [ C
Tc
] × (Tr × Tc × K × K + P)
≈
2 × R × C × M × N × K × K
[ M
Tm
] × [ N
Tn
] × R × C × K × K
(2)
≈ 2 × Tm × Tn
where 














P = pipeline depth − 1
0 < Tm ≤ M
0 < Tn ≤ N
0 < Tr ≤ R
0 < Tc ≤ C
21 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Method
Memory Access Optimization
External data transfer (Tile Loop)
1 for(row=0;row<R;row+=Tr){
2 for(col=0;col<C;col+=Tc){
3 for(to=0;to<M;to+=Tm){
4 for(ti=0;ti<N;ti+=Tn){
5 //load output feature map
6 //load weight
7 //load input feature map
8 L: foo(output fm(to,row,col),
weights(to,ti),
input fm(ti,to,col));
//store output feature maps
}}}}
Before local memory promotion
0output fm0 memory access
= 2× R
Tr
× C
Tc
× M
Tm
× N
Tn
×Sizearray
External data transfer (Tile Loop)
1 for(row=0;row<R;row+=Tr){
2 for(col=0;col<C;col+=Tc){
3 for(to=0;to<M;to+=Tm){
4 //load output feature map
4 for(ti=0;ti<N;ti+=Tn){
6 //load weight
7 //load input feature map
8 L: foo(output fm(to,row,col),
weights(to,ti),
input fm(ti,to,col));
}//store output feature maps
}}}
After local memory promotion
0output fm0 memory access
= R
Tr
× C
Tc
× M
Tm
× Sizearray
22 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Method
Computation to Communication Ratio
Computation to Communication Ratio
=
total number ofoperations
total amount of external data access
=
2 × R × C × M × N × K × K
αin × Bin + αweight × Bweight + αout × Bout
(3)
where
Bin = Tn(STr + K − S)(STc + K − S) (4)
Bwight = TmTnK2
(5)
Bout = TmTrTc (6)
0 < Bin + Bwight + Bout ≤ BRAMcapacity (7)
αin = αweight =
M
Tm
×
N
Tn
×
R
Tr
×
C
Tc
(8)
αout =
M
Tm
×
R
Tr
×
C
Tc
(9)
23 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Exploration
Design Space
Computation Engine:
Constrains for CNN configurations:
Tm ∈ (Integer, 1 < Tm < M) M = 128
Tn × Tm ∈ (Integer, 1 < Tn < N)N = 129
Constrains for FPGA resource:
Tn ×Tm ∈ (Integer, 1 < Tm ×Tn < #of PE)
#of PE = 450
Communication:
# of memory access methods
Legal Solutions of Tm&Tn:
2097
Legal Solutions:
3
Total Legal Solutions: 6291
24 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Exploration
Design Space Exploration
25 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Exploration
Multi-Layer CNN Accelerator Design
Optimal Unroll
Factor< Tm, Tn >
Execution
Cycles
Layer 1 < 48, 3 > 366025
Layer 2 < 20, 24 > 237185
Layer 3 < 96, 5 > 160264
Layer 4 < 95, 5 > 120198
Layer 5 < 32, 15 > 80132
Total - 963804
Cross-Layer Optimization < 64, 7 > 1008246
With unified unroll factors, the degradation is within 5% compared
to the total execution cycles of each optimized convolutional layer.
26 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Implementation
System Overview
27 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Implementation
Accelerator structure
28 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Implementation
Resource Utilization
Resource DSP BRAM LUT FF
Used 240 1024 186251 205704
Available 2800 2060 303600 607200
Utilization 80% 50% 61.3% 33.87%
29 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Experiment and Result
Vs. CPU
CPU
Xeon E5-2430
(32nm)
16 cores 2.2GHz
gcc 4.7.2 -O3
OpenMP 3.0
FPGA
Virtex7-485t
(28nm)
448 PEs 100MHz
Vivado 2013.4
Vivado HLS 2013.4
30 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Experiment and Result
Vs. Other Implementations
ICCD2013 ASAP2009 FPL2009 PACT2010 ISCA2010 Our Impl.
FPGA Chip Virtex6 Virtex5 Virtex4 Virtex5 Virtex5 Virtex7
Performance 17GOPS 6.74GOPS 5.25 GOPS 7.0GOPS 16GOPS 61.62GOPS
Performance
Density
4.5E-04
GOPS/Slice
1.3E-04
GOPS/Slice
3.42E-04
GOPS/Slice
1.9E-04
GOPS/Slice
4.3E-04
GOPS/Slice
8.12E-04
GOPS/Slice
31 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
Conclusion
An accelerator for convolutional neural network
Contribution:
Accurate analytical model for computation &
communication
Find the best solution with roofline model
Result:
On-board run implementation
3.5x better performance over other FPGA
implementations
80% performance/area improvement
32 / 33
FPGA
Chen
Zhang,et al
Motivation
Background
Method
Exploration
Implementation
Experiment
and Result
Conclusion
THANK YOU
Q & A?
33 / 33

More Related Content

Similar to FPGA-BASED-CNN.pdf

Introduction to datastructure and algorithm
Introduction to datastructure and algorithmIntroduction to datastructure and algorithm
Introduction to datastructure and algorithmPratik Mota
 
S4495-plasma-turbulence-sims-gyrokinetic-tokamak-solver
S4495-plasma-turbulence-sims-gyrokinetic-tokamak-solverS4495-plasma-turbulence-sims-gyrokinetic-tokamak-solver
S4495-plasma-turbulence-sims-gyrokinetic-tokamak-solverPraveen Narayanan
 
Hardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLHardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLinside-BigData.com
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdfFrangoCamila
 
Cache Optimization Techniques for General Purpose Graphic Processing Units
Cache Optimization Techniques for General Purpose Graphic Processing UnitsCache Optimization Techniques for General Purpose Graphic Processing Units
Cache Optimization Techniques for General Purpose Graphic Processing UnitsVajira Thambawita
 
The CAOS framework: democratize the acceleration of compute intensive applica...
The CAOS framework: democratize the acceleration of compute intensive applica...The CAOS framework: democratize the acceleration of compute intensive applica...
The CAOS framework: democratize the acceleration of compute intensive applica...NECST Lab @ Politecnico di Milano
 
Pycon 2016-open-space
Pycon 2016-open-spacePycon 2016-open-space
Pycon 2016-open-spaceChetan Khatri
 
20180920_DBTS_PGStrom_EN
20180920_DBTS_PGStrom_EN20180920_DBTS_PGStrom_EN
20180920_DBTS_PGStrom_ENKohei KaiGai
 
pgconfasia2016 plcuda en
pgconfasia2016 plcuda enpgconfasia2016 plcuda en
pgconfasia2016 plcuda enKohei KaiGai
 
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)Takahiro Harada
 
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-ATImpact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-ATTakahiro Katagiri
 
Profiling PyTorch for Efficiency & Sustainability
Profiling PyTorch for Efficiency & SustainabilityProfiling PyTorch for Efficiency & Sustainability
Profiling PyTorch for Efficiency & Sustainabilitygeetachauhan
 
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU SelectionMachine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU SelectionAkihiro Hayashi
 
Parallel Application Performance Prediction of Using Analysis Based Modeling
Parallel Application Performance Prediction of Using Analysis Based ModelingParallel Application Performance Prediction of Using Analysis Based Modeling
Parallel Application Performance Prediction of Using Analysis Based ModelingJason Liu
 
Distributed Deep Q-Learning
Distributed Deep Q-LearningDistributed Deep Q-Learning
Distributed Deep Q-LearningLyft
 
IEEE P2P 2013 - Bootstrapping Skynet: Calibration and Autonomic Self-Control ...
IEEE P2P 2013 - Bootstrapping Skynet: Calibration and Autonomic Self-Control ...IEEE P2P 2013 - Bootstrapping Skynet: Calibration and Autonomic Self-Control ...
IEEE P2P 2013 - Bootstrapping Skynet: Calibration and Autonomic Self-Control ...Kalman Graffi
 
TensorFrames: Google Tensorflow on Apache Spark
TensorFrames: Google Tensorflow on Apache SparkTensorFrames: Google Tensorflow on Apache Spark
TensorFrames: Google Tensorflow on Apache SparkDatabricks
 
Introduction to FPGA acceleration
Introduction to FPGA accelerationIntroduction to FPGA acceleration
Introduction to FPGA accelerationMarco77328
 
OXiGen: Automated FPGA design flow from C applications to dataflow kernels - ...
OXiGen: Automated FPGA design flow from C applications to dataflow kernels - ...OXiGen: Automated FPGA design flow from C applications to dataflow kernels - ...
OXiGen: Automated FPGA design flow from C applications to dataflow kernels - ...NECST Lab @ Politecnico di Milano
 
Architecture Aware Partitioning of Open-CL Programs
Architecture Aware Partitioning of Open-CL Programs Architecture Aware Partitioning of Open-CL Programs
Architecture Aware Partitioning of Open-CL Programs Ankit Singh
 

Similar to FPGA-BASED-CNN.pdf (20)

Introduction to datastructure and algorithm
Introduction to datastructure and algorithmIntroduction to datastructure and algorithm
Introduction to datastructure and algorithm
 
S4495-plasma-turbulence-sims-gyrokinetic-tokamak-solver
S4495-plasma-turbulence-sims-gyrokinetic-tokamak-solverS4495-plasma-turbulence-sims-gyrokinetic-tokamak-solver
S4495-plasma-turbulence-sims-gyrokinetic-tokamak-solver
 
Hardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLHardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and ML
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdf
 
Cache Optimization Techniques for General Purpose Graphic Processing Units
Cache Optimization Techniques for General Purpose Graphic Processing UnitsCache Optimization Techniques for General Purpose Graphic Processing Units
Cache Optimization Techniques for General Purpose Graphic Processing Units
 
The CAOS framework: democratize the acceleration of compute intensive applica...
The CAOS framework: democratize the acceleration of compute intensive applica...The CAOS framework: democratize the acceleration of compute intensive applica...
The CAOS framework: democratize the acceleration of compute intensive applica...
 
Pycon 2016-open-space
Pycon 2016-open-spacePycon 2016-open-space
Pycon 2016-open-space
 
20180920_DBTS_PGStrom_EN
20180920_DBTS_PGStrom_EN20180920_DBTS_PGStrom_EN
20180920_DBTS_PGStrom_EN
 
pgconfasia2016 plcuda en
pgconfasia2016 plcuda enpgconfasia2016 plcuda en
pgconfasia2016 plcuda en
 
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
 
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-ATImpact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
 
Profiling PyTorch for Efficiency & Sustainability
Profiling PyTorch for Efficiency & SustainabilityProfiling PyTorch for Efficiency & Sustainability
Profiling PyTorch for Efficiency & Sustainability
 
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU SelectionMachine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection
 
Parallel Application Performance Prediction of Using Analysis Based Modeling
Parallel Application Performance Prediction of Using Analysis Based ModelingParallel Application Performance Prediction of Using Analysis Based Modeling
Parallel Application Performance Prediction of Using Analysis Based Modeling
 
Distributed Deep Q-Learning
Distributed Deep Q-LearningDistributed Deep Q-Learning
Distributed Deep Q-Learning
 
IEEE P2P 2013 - Bootstrapping Skynet: Calibration and Autonomic Self-Control ...
IEEE P2P 2013 - Bootstrapping Skynet: Calibration and Autonomic Self-Control ...IEEE P2P 2013 - Bootstrapping Skynet: Calibration and Autonomic Self-Control ...
IEEE P2P 2013 - Bootstrapping Skynet: Calibration and Autonomic Self-Control ...
 
TensorFrames: Google Tensorflow on Apache Spark
TensorFrames: Google Tensorflow on Apache SparkTensorFrames: Google Tensorflow on Apache Spark
TensorFrames: Google Tensorflow on Apache Spark
 
Introduction to FPGA acceleration
Introduction to FPGA accelerationIntroduction to FPGA acceleration
Introduction to FPGA acceleration
 
OXiGen: Automated FPGA design flow from C applications to dataflow kernels - ...
OXiGen: Automated FPGA design flow from C applications to dataflow kernels - ...OXiGen: Automated FPGA design flow from C applications to dataflow kernels - ...
OXiGen: Automated FPGA design flow from C applications to dataflow kernels - ...
 
Architecture Aware Partitioning of Open-CL Programs
Architecture Aware Partitioning of Open-CL Programs Architecture Aware Partitioning of Open-CL Programs
Architecture Aware Partitioning of Open-CL Programs
 

Recently uploaded

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Recently uploaded (20)

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

FPGA-BASED-CNN.pdf