SlideShare a Scribd company logo
1 of 44
Download to read offline
Brad L. Hutchings Electrical and Computer EngineeringBYU
1
Run-Time Reconfiguration
Evolution of a New
Strategy for Computing
Brad L. Hutchings Electrical and Computer EngineeringBYU
2
Reconfigurable Logic at BYU
l Lab Resources:
l Splash-II, TeraMAC, CLAyFun, Xilinx, Altera.
l Applications developed for most major platforms:
l Run-Time Reconfigurable Neural Networks
l Application-Specific Processors
l Image Processing
l Genetic Algorithms
l Linear Algebra
l Processor/Configurable-Logic Integration.
Brad L. Hutchings Electrical and Computer EngineeringBYU
3
http://splish.ee.byu.edu
l List of work in progress.
l All BYU papers are downloadable.
l Bibliography with text abstracts:
» Keyword search: device, tool, app, system.
l Tutorials for FPGA Platforms and CAD
Tools.
Brad L. Hutchings Electrical and Computer EngineeringBYU
4
What This Talk is About
Computing with FPGAs
Brad L. Hutchings Electrical and Computer EngineeringBYU
5
Assumptions
l Things I won’t tell you:
+how wonderful FPGAs are.
+how cost-effective FPGAs are.
+how FPGAs will replace all of your home appliances.
Everyone knows how to spell “FPGA”
Brad L. Hutchings Electrical and Computer EngineeringBYU
6
Device Assumptions
l Assume an FPGA that
» reconfigures rapidly (<10 ms or so)
» reconfigures in-circuit
» reconfigures an infinite number of times.
Brad L. Hutchings Electrical and Computer EngineeringBYU
7
FPGAs are Big
FETs ‘R US
Brad L. Hutchings Electrical and Computer EngineeringBYU
8
FPGAs are Slow
2K
2K
2K
Brad L. Hutchings Electrical and Computer EngineeringBYU
9
Most FPGA Designs are Static
l Eternal prototypes.
l Low volume or low gate counts.
l Evolving standards or requirements.
Final Application is Still Static
Brad L. Hutchings Electrical and Computer EngineeringBYU
10
After the application is
finished...
What are those other 900,000
transistors doing right now?
Brad L. Hutchings Electrical and Computer EngineeringBYU
11
FPGAs are Reusable
Convolution
Skeletonization
Neural Networks
Edge Detection
ATR
FFT
Encryption
Compression
Same
Device
Brad L. Hutchings Electrical and Computer EngineeringBYU
12
Slightly Less-Static Examples
l Diagnostic-specific configurations.
– Test with internal circuitry.
l Mode-specific configurations.
– Pixel depths on monitors.
l I/O, data format-specific configurations.
– Bus-couplers, etc.
Unrelated configurations.
Occasional reconfiguration.
Brad L. Hutchings Electrical and Computer EngineeringBYU
13
Run-Time Reconfiguration (RTR)
Implement application as a
set of multiple, cooperative configurations
At run-time, execute and reconfigure
as necessary.
Brad L. Hutchings Electrical and Computer EngineeringBYU
14
Evolutionary Progression of RTR
l RRANN-1(Backpropagation)
» Global RTR
l RRANN-2 (Backpropagation)
» Fixed-Schedule, Local RTR
l DPCC (General Computing)
» Demand-Paged, Local RTR
Brad L. Hutchings Electrical and Computer EngineeringBYU
15
Neural Network
Eye Color
Income
Debt
Need
Make of Car
Do I Make Loan
Brad L. Hutchings Electrical and Computer EngineeringBYU
16
Training Mode
Eye Color
Income
Debt
Need
Make of Car
Do I Make Loan
Training
Output
Data[n]
Training
Input
Data[n]
Brad L. Hutchings Electrical and Computer EngineeringBYU
17
Operational Mode
Eye Color
Income
Debt
Need
Make of Car
Do I Make Loan
Loan
Application
Answer
Brad L. Hutchings Electrical and Computer EngineeringBYU
18
Backpropagation
neti = j A OjWji ∀ i : i B
Oi = f (neti) = 1
1+e−neti
δi = f (neti)(Ti − Oi) ∀ i : i C
δi = f (neti) j E δjWij ∀ i : i D
Wij = Cl Oiδj ∀ i, j : i A, j B
Wijt+1 = Wijt + Wij
Brad L. Hutchings Electrical and Computer EngineeringBYU
19
RRANN Temporal Phases
Feedforward
Error Propagation
Update
neti = j A OjWji ∀ i : i B
Oi = f (neti) = 1
1+e−neti
δi = f (neti)(Ti − Oi) ∀ i : i C δi = f (neti) j E δjWij ∀ i : i D
Wij = Cl Oiδj ∀ i, j : i A, j B
Wijt+1 = Wijt + Wij
Brad L. Hutchings Electrical and Computer EngineeringBYU
20
RRANN Implementation
Feedforward
Weight Update
Error Propagate Feedforward
Error Propagate
Weight Update
Unified
Implementation
RTR
Implementation
Brad L. Hutchings Electrical and Computer EngineeringBYU
21
RRANN-1
Implementation Details
l Global RTR
» All FPGA resources configured globally.
» All state stored external to FPGA.
l Xilinx XC3090 FPGAs
» Achieved 6 neurons per FPGA (RTR)
– Unified version achieved 1 neuron per FPGA
Brad L. Hutchings Electrical and Computer EngineeringBYU
22
RRANN-1
Training Performance
0 10 20 30 40 50 60
0
2
4
6
8
10
12
Statically Configured (1 Neuron per FPGA)
Run-Tim
e
Reconfigured
(6
Neurons
perFPG
A)
Number of FPGAs (XC3090s)
PerformanceRelativetoHP735-125
Peak Training Performance
Brad L. Hutchings Electrical and Computer EngineeringBYU
23
RRANN-1
Operational Performance
0 10 20 30 40 50 60
0
20
40
60
80
100
120
Statically Configured (1 Neuron per FPGA)
Run-Tim
e
Reconfigured
(6
Neurons per FPGA)
Number of FPGAs (XC3090s)
PerformanceRelativetoHP735-125 Peak Operational Performance
Brad L. Hutchings Electrical and Computer EngineeringBYU
24
RRANN-2
Implementation Details
l Local RTR
» Exploit commonality to reduce configuration.
» Partially configure FPGAs in each step.
l National Semiconductor Clay-31 FPGA.
» similar to CLI/Atmel.
» Achieved 9 neurons per FPGA.
– Unified version achieved 3 neurons per FPGA.
Brad L. Hutchings Electrical and Computer EngineeringBYU
25
RRANN-2
Training Performance
5 10 15 20 25 30 35 40
0
5
10
15
20
25
30
35
40
45
Peak Training Performance
Number of FPGAs
PerformanceRelativetoHP735-125
Unified
Complete
Partial
RRANN
Brad L. Hutchings Electrical and Computer EngineeringBYU
26
RRANN-2
Operational Performance
5 10 15 20 25 30 35 40
0
20
40
60
80
100
120
140
Peak Operational Performance
Number of FPGAs
PerformanceRelativetoHP735-125
Unified
Partial & Complete
RRANN
Brad L. Hutchings Electrical and Computer EngineeringBYU
27
RRANN-2 Demo
l 5 minutes
Brad L. Hutchings Electrical and Computer EngineeringBYU
28
RTR Point of Interest
A system computing only 30% of the time can
outperform a system computing 100% of the time
Brad L. Hutchings Electrical and Computer EngineeringBYU
29
DPCC
l Experiment with RTR architecture.
» Study impact of RTR on system architecture.
l FPGA-based processor architecture.
» NSC CLAy-31
» Partial configuration.
» Application-specific instruction sets.
Brad L. Hutchings Electrical and Computer EngineeringBYU
30
General DPCC Approach
l Library-based Approach
– Library contains application-specific circuit modules.
l Software Control
– Sequencing, complex control, I/O controlled by software.
l Linear Hardware Space.
– Hardware relocated at run-time to available space.
l Hardware Paging.
– Idle hardware replaced with active modules.
l Demand-Driven Execution
– Hardware modules loaded as required by application.
Brad L. Hutchings Electrical and Computer EngineeringBYU
31
DISC Internal Architecture
Static Control Circuitry
Dynamic Instruction Module A
Dynamic Instruction Module B
Brad L. Hutchings Electrical and Computer EngineeringBYU
32
DISC Programming Model
main() {
...
while(...) {
hist(image1);
median(image1,t);
thresh(t,image1);
copy(image1,i2);
erode(image1);
diff(i2, image1);
}
}
DISC
(FPGA)
Program/Data
RAM
others...
hist
median
add
inc
cmp
others...
Compile
+
Assemble
Circuit
Design
Application
Programming
Static Circuitry
thresh
add
Instruction Library
thresh
erode
Brad L. Hutchings Electrical and Computer EngineeringBYU
33
DISC Operation
Yes
No
Fetch
Instruction
Hardware
Module
Present?
Relocate & Load
Hardwareð Module
Execute
Hardware Module
Brad L. Hutchings Electrical and Computer EngineeringBYU
34
Demo Hardware
Brad L. Hutchings Electrical and Computer EngineeringBYU
35
DISC Demo
– 00:00 Opening
– 01:00 Picture
– 01:30 Demo (2 passes)
– 02:00 Library
– 05:00 Algorithm
– 07:00 Execution Profile
– 08:30 Edit and Compile
– 10:15 Library Picture (for 10 minutes)
Brad L. Hutchings Electrical and Computer EngineeringBYU
36
DPCC Conclusions
l Provides modular programming model.
l Isolates capacity/timing issues.
l Allows reuse at the design level.
l Automated reuse of silicon:
» demand paging,
» module relocation.
Brad L. Hutchings Electrical and Computer EngineeringBYU
37
Summary
l Illustrated RTR as it evolved at BYU:
» Global RTR
» Local RTR
» DPCC
l RTR exploits FPGA overhead
» Static approaches waste FPGA resources.
Brad L. Hutchings Electrical and Computer EngineeringBYU
38
The Future
l FPGAs are reusable, VHDL is not!
» Must be able to reuse optimized designs.
l CAD/Compilers are far behind.
» Very dependent on GPR.
l Devices are not well-suited.
» Better devices will expand application base.
Brad L. Hutchings Electrical and Computer EngineeringBYU
39
Acknowledgement
Thanks to National Semiconductor for
funding this work through an
ARPA contract.
Brad L. Hutchings Electrical and Computer EngineeringBYU
40
End of Presentation
Brad L. Hutchings Electrical and Computer EngineeringBYU
41
Initial Motivation
l I was new faculty.
» I could do something completely new.
l I enjoyed custom hardware.
» FPGAs looked like the ultimate “tinkertoy.”
l I had to restructure CompEng.
» FPGAs could impact the undergraduates.
Brad L. Hutchings Electrical and Computer EngineeringBYU
42
Brad L. Hutchings Electrical and Computer EngineeringBYU
43
Configurable Computing
l Most applications are still static.
» One configuration per application.
l Compile-Time Reconfiguration
» Configure once per application, prior to execution.
l Splash, Perle, Teramac
» Large ASIC emulators.
Brad L. Hutchings Electrical and Computer EngineeringBYU
44
Flexibility eases Development
l Software-like implementation strategy:
» Iterative
» Incremental
» Debuggable
» Execution versus Simulation
» ASIC results without ASIC development

More Related Content

What's hot

09_KHIN AYE MU.docx - Abstract
09_KHIN AYE MU.docx - Abstract09_KHIN AYE MU.docx - Abstract
09_KHIN AYE MU.docx - Abstractbutest
 
Reconfigurable Platform Composer Tool Project
Reconfigurable Platform Composer Tool ProjectReconfigurable Platform Composer Tool Project
Reconfigurable Platform Composer Tool ProjectMDC_UNICA
 
ICIAM 2019: Reproducible Linear Algebra from Application to Architecture
ICIAM 2019: Reproducible Linear Algebra from Application to ArchitectureICIAM 2019: Reproducible Linear Algebra from Application to Architecture
ICIAM 2019: Reproducible Linear Algebra from Application to ArchitectureJason Riedy
 
IRJET- Design and HW/SW Implementation of a Nonlinear Interpolator for Border...
IRJET- Design and HW/SW Implementation of a Nonlinear Interpolator for Border...IRJET- Design and HW/SW Implementation of a Nonlinear Interpolator for Border...
IRJET- Design and HW/SW Implementation of a Nonlinear Interpolator for Border...IRJET Journal
 
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGALOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGAVLSICS Design
 
High speed adder used in digital signal processing
High speed adder used in  digital signal processingHigh speed adder used in  digital signal processing
High speed adder used in digital signal processingSajan Sahu
 
Xilinx Cool Runner Architecture
Xilinx Cool Runner ArchitectureXilinx Cool Runner Architecture
Xilinx Cool Runner Architecturedragonpradeep
 

What's hot (9)

09_KHIN AYE MU.docx - Abstract
09_KHIN AYE MU.docx - Abstract09_KHIN AYE MU.docx - Abstract
09_KHIN AYE MU.docx - Abstract
 
Reconfigurable Platform Composer Tool Project
Reconfigurable Platform Composer Tool ProjectReconfigurable Platform Composer Tool Project
Reconfigurable Platform Composer Tool Project
 
ICIAM 2019: Reproducible Linear Algebra from Application to Architecture
ICIAM 2019: Reproducible Linear Algebra from Application to ArchitectureICIAM 2019: Reproducible Linear Algebra from Application to Architecture
ICIAM 2019: Reproducible Linear Algebra from Application to Architecture
 
IRJET- Design and HW/SW Implementation of a Nonlinear Interpolator for Border...
IRJET- Design and HW/SW Implementation of a Nonlinear Interpolator for Border...IRJET- Design and HW/SW Implementation of a Nonlinear Interpolator for Border...
IRJET- Design and HW/SW Implementation of a Nonlinear Interpolator for Border...
 
Vlsi titles
Vlsi titlesVlsi titles
Vlsi titles
 
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGALOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
 
High speed adder used in digital signal processing
High speed adder used in  digital signal processingHigh speed adder used in  digital signal processing
High speed adder used in digital signal processing
 
Xilinx Cool Runner Architecture
Xilinx Cool Runner ArchitectureXilinx Cool Runner Architecture
Xilinx Cool Runner Architecture
 
Cuda project paper
Cuda project paperCuda project paper
Cuda project paper
 

Viewers also liked

3 bhubaneshwari devi
3 bhubaneshwari devi3 bhubaneshwari devi
3 bhubaneshwari deviDheeraj Vasu
 
21 clever search engine marketing seo tips for global banking, finance & insu...
21 clever search engine marketing seo tips for global banking, finance & insu...21 clever search engine marketing seo tips for global banking, finance & insu...
21 clever search engine marketing seo tips for global banking, finance & insu...Social Bubble
 
Practica1
Practica1Practica1
Practica1nurds
 
06 febbraio 2011_mi_aiuti_sognare
06 febbraio 2011_mi_aiuti_sognare06 febbraio 2011_mi_aiuti_sognare
06 febbraio 2011_mi_aiuti_sognareCarmen Giordano
 
Dilip Suryavanshi Dilip Buildcon
Dilip Suryavanshi Dilip BuildconDilip Suryavanshi Dilip Buildcon
Dilip Suryavanshi Dilip BuildconDilip Buildcon
 
Experience Letter - Rajendran
Experience Letter - RajendranExperience Letter - Rajendran
Experience Letter - RajendranVyshakh Rajendran
 
красне літечко прийшло
красне літечко прийшлокрасне літечко прийшло
красне літечко прийшлоSashaAnna
 
F. Jafari *, A. Eslami **, M. Hasani*** and S.A. Hashemi***
F. Jafari *, A. Eslami **, M. Hasani*** and S.A. Hashemi***F. Jafari *, A. Eslami **, M. Hasani*** and S.A. Hashemi***
F. Jafari *, A. Eslami **, M. Hasani*** and S.A. Hashemi***Dheeraj Vasu
 
Trường cao đẳng công thương tp
Trường cao đẳng công thương tpTrường cao đẳng công thương tp
Trường cao đẳng công thương tpcaothisuong
 

Viewers also liked (11)

3 bhubaneshwari devi
3 bhubaneshwari devi3 bhubaneshwari devi
3 bhubaneshwari devi
 
21 clever search engine marketing seo tips for global banking, finance & insu...
21 clever search engine marketing seo tips for global banking, finance & insu...21 clever search engine marketing seo tips for global banking, finance & insu...
21 clever search engine marketing seo tips for global banking, finance & insu...
 
Practica1
Practica1Practica1
Practica1
 
06 febbraio 2011_mi_aiuti_sognare
06 febbraio 2011_mi_aiuti_sognare06 febbraio 2011_mi_aiuti_sognare
06 febbraio 2011_mi_aiuti_sognare
 
Dilip Suryavanshi Dilip Buildcon
Dilip Suryavanshi Dilip BuildconDilip Suryavanshi Dilip Buildcon
Dilip Suryavanshi Dilip Buildcon
 
Experience Letter - Rajendran
Experience Letter - RajendranExperience Letter - Rajendran
Experience Letter - Rajendran
 
красне літечко прийшло
красне літечко прийшлокрасне літечко прийшло
красне літечко прийшло
 
F. Jafari *, A. Eslami **, M. Hasani*** and S.A. Hashemi***
F. Jafari *, A. Eslami **, M. Hasani*** and S.A. Hashemi***F. Jafari *, A. Eslami **, M. Hasani*** and S.A. Hashemi***
F. Jafari *, A. Eslami **, M. Hasani*** and S.A. Hashemi***
 
Trường cao đẳng công thương tp
Trường cao đẳng công thương tpTrường cao đẳng công thương tp
Trường cao đẳng công thương tp
 
№7(07) ноябрь 2011
№7(07) ноябрь 2011№7(07) ноябрь 2011
№7(07) ноябрь 2011
 
Facebook
FacebookFacebook
Facebook
 

Similar to Cwf96 (1)

Deploying a Task-based Runtime System on Raspberry Pi Clusters
Deploying a Task-based Runtime System on Raspberry Pi ClustersDeploying a Task-based Runtime System on Raspberry Pi Clusters
Deploying a Task-based Runtime System on Raspberry Pi ClustersPatrick Diehl
 
To Design and simulate 3-Ø Induction motor drive
To Design and simulate 3-Ø Induction motor driveTo Design and simulate 3-Ø Induction motor drive
To Design and simulate 3-Ø Induction motor driveUmang Patel
 
Power Over Fiber_PCBProject
Power Over Fiber_PCBProjectPower Over Fiber_PCBProject
Power Over Fiber_PCBProjectInbar Kinarty
 
digitaldesign-s20-lecture3b-fpga-afterlecture.pdf
digitaldesign-s20-lecture3b-fpga-afterlecture.pdfdigitaldesign-s20-lecture3b-fpga-afterlecture.pdf
digitaldesign-s20-lecture3b-fpga-afterlecture.pdfDuy-Hieu Bui
 
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGAEFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGAVLSICS Design
 
Cygnus - World First Multi-Hybrid Accelerated Cluster with GPU and FPGA Coupling
Cygnus - World First Multi-Hybrid Accelerated Cluster with GPU and FPGA CouplingCygnus - World First Multi-Hybrid Accelerated Cluster with GPU and FPGA Coupling
Cygnus - World First Multi-Hybrid Accelerated Cluster with GPU and FPGA CouplingCarlos Reaño González
 
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGAEFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGAVLSICS Design
 
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET -  	  High Speed Inexact Speculative Adder using Carry Look Ahead Adder...IRJET -  	  High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...IRJET Journal
 
09_KHIN AYE MU.docx - Abstract
09_KHIN AYE MU.docx - Abstract09_KHIN AYE MU.docx - Abstract
09_KHIN AYE MU.docx - Abstractbutest
 
Design_amp_analysis_of_16_bit_RISC_processor_using_low_power_pipelining.pdf
Design_amp_analysis_of_16_bit_RISC_processor_using_low_power_pipelining.pdfDesign_amp_analysis_of_16_bit_RISC_processor_using_low_power_pipelining.pdf
Design_amp_analysis_of_16_bit_RISC_processor_using_low_power_pipelining.pdfssuser1e1bab
 
IRJET- A Survey on Reconstruct Structural Design of FPGA
IRJET-  	  A Survey on Reconstruct Structural Design of FPGAIRJET-  	  A Survey on Reconstruct Structural Design of FPGA
IRJET- A Survey on Reconstruct Structural Design of FPGAIRJET Journal
 
Adiabatic technique based low power synchronous counter design
Adiabatic technique based low power synchronous counter  designAdiabatic technique based low power synchronous counter  design
Adiabatic technique based low power synchronous counter designIJECEIAES
 
ECE321322 Electronics I & Lab Spring 2015 1 Final P.docx
ECE321322 Electronics I & Lab Spring 2015 1 Final P.docxECE321322 Electronics I & Lab Spring 2015 1 Final P.docx
ECE321322 Electronics I & Lab Spring 2015 1 Final P.docxjack60216
 
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)Sergey Karayev
 
D-Flip Flop Layout: Efficient in Terms of Area and Power
D-Flip Flop Layout: Efficient in Terms of  Area and Power D-Flip Flop Layout: Efficient in Terms of  Area and Power
D-Flip Flop Layout: Efficient in Terms of Area and Power IJEEE
 

Similar to Cwf96 (1) (20)

Deploying a Task-based Runtime System on Raspberry Pi Clusters
Deploying a Task-based Runtime System on Raspberry Pi ClustersDeploying a Task-based Runtime System on Raspberry Pi Clusters
Deploying a Task-based Runtime System on Raspberry Pi Clusters
 
To Design and simulate 3-Ø Induction motor drive
To Design and simulate 3-Ø Induction motor driveTo Design and simulate 3-Ø Induction motor drive
To Design and simulate 3-Ø Induction motor drive
 
Power Over Fiber_PCBProject
Power Over Fiber_PCBProjectPower Over Fiber_PCBProject
Power Over Fiber_PCBProject
 
CHANDAN RESUME
CHANDAN RESUMECHANDAN RESUME
CHANDAN RESUME
 
uday ppt pcb.pptx
uday ppt pcb.pptxuday ppt pcb.pptx
uday ppt pcb.pptx
 
digitaldesign-s20-lecture3b-fpga-afterlecture.pdf
digitaldesign-s20-lecture3b-fpga-afterlecture.pdfdigitaldesign-s20-lecture3b-fpga-afterlecture.pdf
digitaldesign-s20-lecture3b-fpga-afterlecture.pdf
 
ICTP Workshop 2022
ICTP Workshop 2022ICTP Workshop 2022
ICTP Workshop 2022
 
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGAEFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
 
Cygnus - World First Multi-Hybrid Accelerated Cluster with GPU and FPGA Coupling
Cygnus - World First Multi-Hybrid Accelerated Cluster with GPU and FPGA CouplingCygnus - World First Multi-Hybrid Accelerated Cluster with GPU and FPGA Coupling
Cygnus - World First Multi-Hybrid Accelerated Cluster with GPU and FPGA Coupling
 
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGAEFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
 
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET -  	  High Speed Inexact Speculative Adder using Carry Look Ahead Adder...IRJET -  	  High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
 
09_KHIN AYE MU.docx - Abstract
09_KHIN AYE MU.docx - Abstract09_KHIN AYE MU.docx - Abstract
09_KHIN AYE MU.docx - Abstract
 
slides
slidesslides
slides
 
blue gene report
blue gene  reportblue gene  report
blue gene report
 
Design_amp_analysis_of_16_bit_RISC_processor_using_low_power_pipelining.pdf
Design_amp_analysis_of_16_bit_RISC_processor_using_low_power_pipelining.pdfDesign_amp_analysis_of_16_bit_RISC_processor_using_low_power_pipelining.pdf
Design_amp_analysis_of_16_bit_RISC_processor_using_low_power_pipelining.pdf
 
IRJET- A Survey on Reconstruct Structural Design of FPGA
IRJET-  	  A Survey on Reconstruct Structural Design of FPGAIRJET-  	  A Survey on Reconstruct Structural Design of FPGA
IRJET- A Survey on Reconstruct Structural Design of FPGA
 
Adiabatic technique based low power synchronous counter design
Adiabatic technique based low power synchronous counter  designAdiabatic technique based low power synchronous counter  design
Adiabatic technique based low power synchronous counter design
 
ECE321322 Electronics I & Lab Spring 2015 1 Final P.docx
ECE321322 Electronics I & Lab Spring 2015 1 Final P.docxECE321322 Electronics I & Lab Spring 2015 1 Final P.docx
ECE321322 Electronics I & Lab Spring 2015 1 Final P.docx
 
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
 
D-Flip Flop Layout: Efficient in Terms of Area and Power
D-Flip Flop Layout: Efficient in Terms of  Area and Power D-Flip Flop Layout: Efficient in Terms of  Area and Power
D-Flip Flop Layout: Efficient in Terms of Area and Power
 

More from manoj g

Syllabus
SyllabusSyllabus
Syllabusmanoj g
 
Blooms taxonomy action_verbs
Blooms taxonomy action_verbsBlooms taxonomy action_verbs
Blooms taxonomy action_verbsmanoj g
 
Blooms taxonomy action_verbs
Blooms taxonomy action_verbsBlooms taxonomy action_verbs
Blooms taxonomy action_verbsmanoj g
 
Blooms taxonomy action_verbs
Blooms taxonomy action_verbsBlooms taxonomy action_verbs
Blooms taxonomy action_verbsmanoj g
 
Guidelines ph,d thesis
Guidelines ph,d thesisGuidelines ph,d thesis
Guidelines ph,d thesismanoj g
 
Guidelines for-jyrc-lr
Guidelines for-jyrc-lrGuidelines for-jyrc-lr
Guidelines for-jyrc-lrmanoj g
 
Permission letter for external visits
Permission letter for external visitsPermission letter for external visits
Permission letter for external visitsmanoj g
 
Extension programme cst
Extension programme cstExtension programme cst
Extension programme cstmanoj g
 

More from manoj g (10)

Syllabus
SyllabusSyllabus
Syllabus
 
Blooms taxonomy action_verbs
Blooms taxonomy action_verbsBlooms taxonomy action_verbs
Blooms taxonomy action_verbs
 
Blooms taxonomy action_verbs
Blooms taxonomy action_verbsBlooms taxonomy action_verbs
Blooms taxonomy action_verbs
 
Blooms taxonomy action_verbs
Blooms taxonomy action_verbsBlooms taxonomy action_verbs
Blooms taxonomy action_verbs
 
Success
SuccessSuccess
Success
 
Guidelines ph,d thesis
Guidelines ph,d thesisGuidelines ph,d thesis
Guidelines ph,d thesis
 
Guidelines for-jyrc-lr
Guidelines for-jyrc-lrGuidelines for-jyrc-lr
Guidelines for-jyrc-lr
 
Permission letter for external visits
Permission letter for external visitsPermission letter for external visits
Permission letter for external visits
 
Extension programme cst
Extension programme cstExtension programme cst
Extension programme cst
 
Dove
DoveDove
Dove
 

Recently uploaded

Internshala Student Partner 6.0 Jadavpur University Certificate
Internshala Student Partner 6.0 Jadavpur University CertificateInternshala Student Partner 6.0 Jadavpur University Certificate
Internshala Student Partner 6.0 Jadavpur University CertificateSoham Mondal
 
Experience Certificate - Marketing Analyst-Soham Mondal.pdf
Experience Certificate - Marketing Analyst-Soham Mondal.pdfExperience Certificate - Marketing Analyst-Soham Mondal.pdf
Experience Certificate - Marketing Analyst-Soham Mondal.pdfSoham Mondal
 
CFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector ExperienceCFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector ExperienceSanjay Bokadia
 
Dubai Call Girls Starlet O525547819 Call Girls Dubai Showen Dating
Dubai Call Girls Starlet O525547819 Call Girls Dubai Showen DatingDubai Call Girls Starlet O525547819 Call Girls Dubai Showen Dating
Dubai Call Girls Starlet O525547819 Call Girls Dubai Showen Datingkojalkojal131
 
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service CuttackLow Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service CuttackSuhani Kapoor
 
VIP High Profile Call Girls Jamshedpur Aarushi 8250192130 Independent Escort ...
VIP High Profile Call Girls Jamshedpur Aarushi 8250192130 Independent Escort ...VIP High Profile Call Girls Jamshedpur Aarushi 8250192130 Independent Escort ...
VIP High Profile Call Girls Jamshedpur Aarushi 8250192130 Independent Escort ...Suhani Kapoor
 
VIP Kolkata Call Girl Lake Gardens 👉 8250192130 Available With Room
VIP Kolkata Call Girl Lake Gardens 👉 8250192130  Available With RoomVIP Kolkata Call Girl Lake Gardens 👉 8250192130  Available With Room
VIP Kolkata Call Girl Lake Gardens 👉 8250192130 Available With Roomdivyansh0kumar0
 
Production Day 1.pptxjvjbvbcbcb bj bvcbj
Production Day 1.pptxjvjbvbcbcb bj bvcbjProduction Day 1.pptxjvjbvbcbcb bj bvcbj
Production Day 1.pptxjvjbvbcbcb bj bvcbjLewisJB
 
Delhi Call Girls In Atta Market 9711199012 Book Your One night Stand Call Girls
Delhi Call Girls In Atta Market 9711199012 Book Your One night Stand Call GirlsDelhi Call Girls In Atta Market 9711199012 Book Your One night Stand Call Girls
Delhi Call Girls In Atta Market 9711199012 Book Your One night Stand Call Girlsshivangimorya083
 
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...Suhani Kapoor
 
Zeeman Effect normal and Anomalous zeeman effect
Zeeman Effect normal and Anomalous zeeman effectZeeman Effect normal and Anomalous zeeman effect
Zeeman Effect normal and Anomalous zeeman effectPriyanshuRawat56
 
VIP Call Girls Service Cuttack Aishwarya 8250192130 Independent Escort Servic...
VIP Call Girls Service Cuttack Aishwarya 8250192130 Independent Escort Servic...VIP Call Girls Service Cuttack Aishwarya 8250192130 Independent Escort Servic...
VIP Call Girls Service Cuttack Aishwarya 8250192130 Independent Escort Servic...Suhani Kapoor
 
CALL ON ➥8923113531 🔝Call Girls Nishatganj Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Nishatganj Lucknow best sexual serviceCALL ON ➥8923113531 🔝Call Girls Nishatganj Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Nishatganj Lucknow best sexual serviceanilsa9823
 
Low Rate Call Girls Gorakhpur Anika 8250192130 Independent Escort Service Gor...
Low Rate Call Girls Gorakhpur Anika 8250192130 Independent Escort Service Gor...Low Rate Call Girls Gorakhpur Anika 8250192130 Independent Escort Service Gor...
Low Rate Call Girls Gorakhpur Anika 8250192130 Independent Escort Service Gor...Suhani Kapoor
 
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service 🧳
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service  🧳CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service  🧳
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service 🧳anilsa9823
 
VIP Call Girl Bhilai Aashi 8250192130 Independent Escort Service Bhilai
VIP Call Girl Bhilai Aashi 8250192130 Independent Escort Service BhilaiVIP Call Girl Bhilai Aashi 8250192130 Independent Escort Service Bhilai
VIP Call Girl Bhilai Aashi 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
PM Job Search Council Info Session - PMI Silver Spring Chapter
PM Job Search Council Info Session - PMI Silver Spring ChapterPM Job Search Council Info Session - PMI Silver Spring Chapter
PM Job Search Council Info Session - PMI Silver Spring ChapterHector Del Castillo, CPM, CPMM
 
Delhi Call Girls Preet Vihar 9711199171 ☎✔👌✔ Whatsapp Body to body massage wi...
Delhi Call Girls Preet Vihar 9711199171 ☎✔👌✔ Whatsapp Body to body massage wi...Delhi Call Girls Preet Vihar 9711199171 ☎✔👌✔ Whatsapp Body to body massage wi...
Delhi Call Girls Preet Vihar 9711199171 ☎✔👌✔ Whatsapp Body to body massage wi...shivangimorya083
 
Preventing and ending sexual harassment in the workplace.pptx
Preventing and ending sexual harassment in the workplace.pptxPreventing and ending sexual harassment in the workplace.pptx
Preventing and ending sexual harassment in the workplace.pptxGry Tina Tinde
 
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证obuhobo
 

Recently uploaded (20)

Internshala Student Partner 6.0 Jadavpur University Certificate
Internshala Student Partner 6.0 Jadavpur University CertificateInternshala Student Partner 6.0 Jadavpur University Certificate
Internshala Student Partner 6.0 Jadavpur University Certificate
 
Experience Certificate - Marketing Analyst-Soham Mondal.pdf
Experience Certificate - Marketing Analyst-Soham Mondal.pdfExperience Certificate - Marketing Analyst-Soham Mondal.pdf
Experience Certificate - Marketing Analyst-Soham Mondal.pdf
 
CFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector ExperienceCFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector Experience
 
Dubai Call Girls Starlet O525547819 Call Girls Dubai Showen Dating
Dubai Call Girls Starlet O525547819 Call Girls Dubai Showen DatingDubai Call Girls Starlet O525547819 Call Girls Dubai Showen Dating
Dubai Call Girls Starlet O525547819 Call Girls Dubai Showen Dating
 
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service CuttackLow Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
Low Rate Call Girls Cuttack Anika 8250192130 Independent Escort Service Cuttack
 
VIP High Profile Call Girls Jamshedpur Aarushi 8250192130 Independent Escort ...
VIP High Profile Call Girls Jamshedpur Aarushi 8250192130 Independent Escort ...VIP High Profile Call Girls Jamshedpur Aarushi 8250192130 Independent Escort ...
VIP High Profile Call Girls Jamshedpur Aarushi 8250192130 Independent Escort ...
 
VIP Kolkata Call Girl Lake Gardens 👉 8250192130 Available With Room
VIP Kolkata Call Girl Lake Gardens 👉 8250192130  Available With RoomVIP Kolkata Call Girl Lake Gardens 👉 8250192130  Available With Room
VIP Kolkata Call Girl Lake Gardens 👉 8250192130 Available With Room
 
Production Day 1.pptxjvjbvbcbcb bj bvcbj
Production Day 1.pptxjvjbvbcbcb bj bvcbjProduction Day 1.pptxjvjbvbcbcb bj bvcbj
Production Day 1.pptxjvjbvbcbcb bj bvcbj
 
Delhi Call Girls In Atta Market 9711199012 Book Your One night Stand Call Girls
Delhi Call Girls In Atta Market 9711199012 Book Your One night Stand Call GirlsDelhi Call Girls In Atta Market 9711199012 Book Your One night Stand Call Girls
Delhi Call Girls In Atta Market 9711199012 Book Your One night Stand Call Girls
 
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
VIP Russian Call Girls Amravati Chhaya 8250192130 Independent Escort Service ...
 
Zeeman Effect normal and Anomalous zeeman effect
Zeeman Effect normal and Anomalous zeeman effectZeeman Effect normal and Anomalous zeeman effect
Zeeman Effect normal and Anomalous zeeman effect
 
VIP Call Girls Service Cuttack Aishwarya 8250192130 Independent Escort Servic...
VIP Call Girls Service Cuttack Aishwarya 8250192130 Independent Escort Servic...VIP Call Girls Service Cuttack Aishwarya 8250192130 Independent Escort Servic...
VIP Call Girls Service Cuttack Aishwarya 8250192130 Independent Escort Servic...
 
CALL ON ➥8923113531 🔝Call Girls Nishatganj Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Nishatganj Lucknow best sexual serviceCALL ON ➥8923113531 🔝Call Girls Nishatganj Lucknow best sexual service
CALL ON ➥8923113531 🔝Call Girls Nishatganj Lucknow best sexual service
 
Low Rate Call Girls Gorakhpur Anika 8250192130 Independent Escort Service Gor...
Low Rate Call Girls Gorakhpur Anika 8250192130 Independent Escort Service Gor...Low Rate Call Girls Gorakhpur Anika 8250192130 Independent Escort Service Gor...
Low Rate Call Girls Gorakhpur Anika 8250192130 Independent Escort Service Gor...
 
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service 🧳
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service  🧳CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service  🧳
CALL ON ➥8923113531 🔝Call Girls Husainganj Lucknow best Female service 🧳
 
VIP Call Girl Bhilai Aashi 8250192130 Independent Escort Service Bhilai
VIP Call Girl Bhilai Aashi 8250192130 Independent Escort Service BhilaiVIP Call Girl Bhilai Aashi 8250192130 Independent Escort Service Bhilai
VIP Call Girl Bhilai Aashi 8250192130 Independent Escort Service Bhilai
 
PM Job Search Council Info Session - PMI Silver Spring Chapter
PM Job Search Council Info Session - PMI Silver Spring ChapterPM Job Search Council Info Session - PMI Silver Spring Chapter
PM Job Search Council Info Session - PMI Silver Spring Chapter
 
Delhi Call Girls Preet Vihar 9711199171 ☎✔👌✔ Whatsapp Body to body massage wi...
Delhi Call Girls Preet Vihar 9711199171 ☎✔👌✔ Whatsapp Body to body massage wi...Delhi Call Girls Preet Vihar 9711199171 ☎✔👌✔ Whatsapp Body to body massage wi...
Delhi Call Girls Preet Vihar 9711199171 ☎✔👌✔ Whatsapp Body to body massage wi...
 
Preventing and ending sexual harassment in the workplace.pptx
Preventing and ending sexual harassment in the workplace.pptxPreventing and ending sexual harassment in the workplace.pptx
Preventing and ending sexual harassment in the workplace.pptx
 
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
女王大学硕士毕业证成绩单(加急办理)认证海外毕业证
 

Cwf96 (1)

  • 1. Brad L. Hutchings Electrical and Computer EngineeringBYU 1 Run-Time Reconfiguration Evolution of a New Strategy for Computing
  • 2. Brad L. Hutchings Electrical and Computer EngineeringBYU 2 Reconfigurable Logic at BYU l Lab Resources: l Splash-II, TeraMAC, CLAyFun, Xilinx, Altera. l Applications developed for most major platforms: l Run-Time Reconfigurable Neural Networks l Application-Specific Processors l Image Processing l Genetic Algorithms l Linear Algebra l Processor/Configurable-Logic Integration.
  • 3. Brad L. Hutchings Electrical and Computer EngineeringBYU 3 http://splish.ee.byu.edu l List of work in progress. l All BYU papers are downloadable. l Bibliography with text abstracts: » Keyword search: device, tool, app, system. l Tutorials for FPGA Platforms and CAD Tools.
  • 4. Brad L. Hutchings Electrical and Computer EngineeringBYU 4 What This Talk is About Computing with FPGAs
  • 5. Brad L. Hutchings Electrical and Computer EngineeringBYU 5 Assumptions l Things I won’t tell you: +how wonderful FPGAs are. +how cost-effective FPGAs are. +how FPGAs will replace all of your home appliances. Everyone knows how to spell “FPGA”
  • 6. Brad L. Hutchings Electrical and Computer EngineeringBYU 6 Device Assumptions l Assume an FPGA that » reconfigures rapidly (<10 ms or so) » reconfigures in-circuit » reconfigures an infinite number of times.
  • 7. Brad L. Hutchings Electrical and Computer EngineeringBYU 7 FPGAs are Big FETs ‘R US
  • 8. Brad L. Hutchings Electrical and Computer EngineeringBYU 8 FPGAs are Slow 2K 2K 2K
  • 9. Brad L. Hutchings Electrical and Computer EngineeringBYU 9 Most FPGA Designs are Static l Eternal prototypes. l Low volume or low gate counts. l Evolving standards or requirements. Final Application is Still Static
  • 10. Brad L. Hutchings Electrical and Computer EngineeringBYU 10 After the application is finished... What are those other 900,000 transistors doing right now?
  • 11. Brad L. Hutchings Electrical and Computer EngineeringBYU 11 FPGAs are Reusable Convolution Skeletonization Neural Networks Edge Detection ATR FFT Encryption Compression Same Device
  • 12. Brad L. Hutchings Electrical and Computer EngineeringBYU 12 Slightly Less-Static Examples l Diagnostic-specific configurations. – Test with internal circuitry. l Mode-specific configurations. – Pixel depths on monitors. l I/O, data format-specific configurations. – Bus-couplers, etc. Unrelated configurations. Occasional reconfiguration.
  • 13. Brad L. Hutchings Electrical and Computer EngineeringBYU 13 Run-Time Reconfiguration (RTR) Implement application as a set of multiple, cooperative configurations At run-time, execute and reconfigure as necessary.
  • 14. Brad L. Hutchings Electrical and Computer EngineeringBYU 14 Evolutionary Progression of RTR l RRANN-1(Backpropagation) » Global RTR l RRANN-2 (Backpropagation) » Fixed-Schedule, Local RTR l DPCC (General Computing) » Demand-Paged, Local RTR
  • 15. Brad L. Hutchings Electrical and Computer EngineeringBYU 15 Neural Network Eye Color Income Debt Need Make of Car Do I Make Loan
  • 16. Brad L. Hutchings Electrical and Computer EngineeringBYU 16 Training Mode Eye Color Income Debt Need Make of Car Do I Make Loan Training Output Data[n] Training Input Data[n]
  • 17. Brad L. Hutchings Electrical and Computer EngineeringBYU 17 Operational Mode Eye Color Income Debt Need Make of Car Do I Make Loan Loan Application Answer
  • 18. Brad L. Hutchings Electrical and Computer EngineeringBYU 18 Backpropagation neti = j A OjWji ∀ i : i B Oi = f (neti) = 1 1+e−neti δi = f (neti)(Ti − Oi) ∀ i : i C δi = f (neti) j E δjWij ∀ i : i D Wij = Cl Oiδj ∀ i, j : i A, j B Wijt+1 = Wijt + Wij
  • 19. Brad L. Hutchings Electrical and Computer EngineeringBYU 19 RRANN Temporal Phases Feedforward Error Propagation Update neti = j A OjWji ∀ i : i B Oi = f (neti) = 1 1+e−neti δi = f (neti)(Ti − Oi) ∀ i : i C δi = f (neti) j E δjWij ∀ i : i D Wij = Cl Oiδj ∀ i, j : i A, j B Wijt+1 = Wijt + Wij
  • 20. Brad L. Hutchings Electrical and Computer EngineeringBYU 20 RRANN Implementation Feedforward Weight Update Error Propagate Feedforward Error Propagate Weight Update Unified Implementation RTR Implementation
  • 21. Brad L. Hutchings Electrical and Computer EngineeringBYU 21 RRANN-1 Implementation Details l Global RTR » All FPGA resources configured globally. » All state stored external to FPGA. l Xilinx XC3090 FPGAs » Achieved 6 neurons per FPGA (RTR) – Unified version achieved 1 neuron per FPGA
  • 22. Brad L. Hutchings Electrical and Computer EngineeringBYU 22 RRANN-1 Training Performance 0 10 20 30 40 50 60 0 2 4 6 8 10 12 Statically Configured (1 Neuron per FPGA) Run-Tim e Reconfigured (6 Neurons perFPG A) Number of FPGAs (XC3090s) PerformanceRelativetoHP735-125 Peak Training Performance
  • 23. Brad L. Hutchings Electrical and Computer EngineeringBYU 23 RRANN-1 Operational Performance 0 10 20 30 40 50 60 0 20 40 60 80 100 120 Statically Configured (1 Neuron per FPGA) Run-Tim e Reconfigured (6 Neurons per FPGA) Number of FPGAs (XC3090s) PerformanceRelativetoHP735-125 Peak Operational Performance
  • 24. Brad L. Hutchings Electrical and Computer EngineeringBYU 24 RRANN-2 Implementation Details l Local RTR » Exploit commonality to reduce configuration. » Partially configure FPGAs in each step. l National Semiconductor Clay-31 FPGA. » similar to CLI/Atmel. » Achieved 9 neurons per FPGA. – Unified version achieved 3 neurons per FPGA.
  • 25. Brad L. Hutchings Electrical and Computer EngineeringBYU 25 RRANN-2 Training Performance 5 10 15 20 25 30 35 40 0 5 10 15 20 25 30 35 40 45 Peak Training Performance Number of FPGAs PerformanceRelativetoHP735-125 Unified Complete Partial RRANN
  • 26. Brad L. Hutchings Electrical and Computer EngineeringBYU 26 RRANN-2 Operational Performance 5 10 15 20 25 30 35 40 0 20 40 60 80 100 120 140 Peak Operational Performance Number of FPGAs PerformanceRelativetoHP735-125 Unified Partial & Complete RRANN
  • 27. Brad L. Hutchings Electrical and Computer EngineeringBYU 27 RRANN-2 Demo l 5 minutes
  • 28. Brad L. Hutchings Electrical and Computer EngineeringBYU 28 RTR Point of Interest A system computing only 30% of the time can outperform a system computing 100% of the time
  • 29. Brad L. Hutchings Electrical and Computer EngineeringBYU 29 DPCC l Experiment with RTR architecture. » Study impact of RTR on system architecture. l FPGA-based processor architecture. » NSC CLAy-31 » Partial configuration. » Application-specific instruction sets.
  • 30. Brad L. Hutchings Electrical and Computer EngineeringBYU 30 General DPCC Approach l Library-based Approach – Library contains application-specific circuit modules. l Software Control – Sequencing, complex control, I/O controlled by software. l Linear Hardware Space. – Hardware relocated at run-time to available space. l Hardware Paging. – Idle hardware replaced with active modules. l Demand-Driven Execution – Hardware modules loaded as required by application.
  • 31. Brad L. Hutchings Electrical and Computer EngineeringBYU 31 DISC Internal Architecture Static Control Circuitry Dynamic Instruction Module A Dynamic Instruction Module B
  • 32. Brad L. Hutchings Electrical and Computer EngineeringBYU 32 DISC Programming Model main() { ... while(...) { hist(image1); median(image1,t); thresh(t,image1); copy(image1,i2); erode(image1); diff(i2, image1); } } DISC (FPGA) Program/Data RAM others... hist median add inc cmp others... Compile + Assemble Circuit Design Application Programming Static Circuitry thresh add Instruction Library thresh erode
  • 33. Brad L. Hutchings Electrical and Computer EngineeringBYU 33 DISC Operation Yes No Fetch Instruction Hardware Module Present? Relocate & Load Hardwareð Module Execute Hardware Module
  • 34. Brad L. Hutchings Electrical and Computer EngineeringBYU 34 Demo Hardware
  • 35. Brad L. Hutchings Electrical and Computer EngineeringBYU 35 DISC Demo – 00:00 Opening – 01:00 Picture – 01:30 Demo (2 passes) – 02:00 Library – 05:00 Algorithm – 07:00 Execution Profile – 08:30 Edit and Compile – 10:15 Library Picture (for 10 minutes)
  • 36. Brad L. Hutchings Electrical and Computer EngineeringBYU 36 DPCC Conclusions l Provides modular programming model. l Isolates capacity/timing issues. l Allows reuse at the design level. l Automated reuse of silicon: » demand paging, » module relocation.
  • 37. Brad L. Hutchings Electrical and Computer EngineeringBYU 37 Summary l Illustrated RTR as it evolved at BYU: » Global RTR » Local RTR » DPCC l RTR exploits FPGA overhead » Static approaches waste FPGA resources.
  • 38. Brad L. Hutchings Electrical and Computer EngineeringBYU 38 The Future l FPGAs are reusable, VHDL is not! » Must be able to reuse optimized designs. l CAD/Compilers are far behind. » Very dependent on GPR. l Devices are not well-suited. » Better devices will expand application base.
  • 39. Brad L. Hutchings Electrical and Computer EngineeringBYU 39 Acknowledgement Thanks to National Semiconductor for funding this work through an ARPA contract.
  • 40. Brad L. Hutchings Electrical and Computer EngineeringBYU 40 End of Presentation
  • 41. Brad L. Hutchings Electrical and Computer EngineeringBYU 41 Initial Motivation l I was new faculty. » I could do something completely new. l I enjoyed custom hardware. » FPGAs looked like the ultimate “tinkertoy.” l I had to restructure CompEng. » FPGAs could impact the undergraduates.
  • 42. Brad L. Hutchings Electrical and Computer EngineeringBYU 42
  • 43. Brad L. Hutchings Electrical and Computer EngineeringBYU 43 Configurable Computing l Most applications are still static. » One configuration per application. l Compile-Time Reconfiguration » Configure once per application, prior to execution. l Splash, Perle, Teramac » Large ASIC emulators.
  • 44. Brad L. Hutchings Electrical and Computer EngineeringBYU 44 Flexibility eases Development l Software-like implementation strategy: » Iterative » Incremental » Debuggable » Execution versus Simulation » ASIC results without ASIC development