SlideShare a Scribd company logo
MACHINE LEARNING AND
FPGA-BASED HARDWARE
ACCELERATION
Andreea-Ingrid Funie
PhD Candidate, Imperial College
London
0
Custom-Computing Group
Head: Prof. Wayne
Luk
1
Field Programmable Gate
Array
Next Generation Computing
• Existing computers:
- Slow
- Power hungry
- Complex to implement applications
• Our focus: custom computing
- Customise hardware/software to applications
- Enhance design quality and designer productivity
• Research strategy:
- FPGA: reconfigurable acceleration
- DFE: data flow engine = FPGA + memory + dataflow
2
Our Hardware Devices
• 3 MPCX nodes with 8 cards each with Stratix V FPGA. Each card has
a measured main memory throughput of 65GB/s :
 1.5 TB/s of potential access
 130PB/day of potential processing
• 10 FPGAs (e.g. : Altera Stratix V, Xilinx Virtex VI)
• 6 GPUs (e.g. : NVIDIA Tesla C2070 GPU: 448 cores running at
1.15GHz; NVIDIA Kepler k40/k80)
At Imperial College London HPC center: https://wiki.imperial.ac.uk/display/HPC/Systems
 ax4 (15 Tbytes of RAM & 1280 cores, 1.5PBytes fast raid
storage), cx2 (456 nodes & 5272 cores comprised of SGI Altix ICE
hardware), cx1 (1395 nodes & 13558 cores, 8 Nvidia K80, 4 Nvidia K40)
3
Financial simulation:
163 times faster1, 170 times less energy
Genomic analysis:
88 times faster, 3 times less energy
DFE: speed + energy efficiency
String MatchingLeast Square Monte Carlo Method
[1] Chow et al. (FPGA Conference,
2012)
[2] Arram et al. (FPGA Conference,
2015)
4
1faster than the equivalent single/multi-
core implementation
Climate modelling: 13 times faster1
DFE: speed + energy efficiency
5
Stencil Computation
Air traffic management:
17 times faster, 15 times less energy
Sequential Monte Carlo Method
[3] Russell et al. (FCCM Conference,
2015)
[4] Chau et al. (HEART Conference,
2013)1faster than the equivalent single/multi-
core implementation
DFE: speed + energy efficiency
Optimal architecture up to 47
times faster
(UoF Benchmark)
Iterative Sparse Linear Solvers
Computational
Fluid Dynamics
Power Systems
Simulation
6
[5] Grigoras et al.
(FPGA
Conference,
2016)
Machine Learning on DFEs
7
Multi Objective Machine Learning Optimizer
• Self-optimization of reconfigurable designs through automatic
analysis and adaptation of design parameters
• Can switch between a fast/power hungry design and a
relatively slow/low power alternative
• Uses:
- Gaussian Process Regression
- Support Vector Machine Classification
- Particle Swarm Optimization
[6] Kurek et al. (FCCM Conference,
2014)
Machine Learning on DFEs
Pipelined Genetic Propagation
Travelling salesman problem:
90 times faster
8
Neural Networks Simulation
Polychronous spiking neural network:
34 times faster1
[7] Cheung et al. (Frontiers in Neuroscience,
2016)
[8] Guo et al. (FCCM Conference,
2015)1faster than the equivalent single/multi-
core implementation
Incremental Support Vector Machine
Stock trading:
41 times faster1
One-class Support Vector Machine
Network anomaly detection:
6 times faster
Machine Learning on DFEs
9
[9] Shao et al. (FPT Conference,
2016)
[10] Bara et al. (FPT Conference,
2014)1faster than the equivalent single/multi-
core implementation
Machine Learning for
Financial Applications on
DFEs
Challenges:
• Quantity of data
• Speed of processing
• Accuracy of results
10
Genetic Programming
for Trading
needs
acceleration
[11] Funie et al.
(ASAP Conference, 201
11
DFE Speedup over CPU
DFE: Maxeler Maia DFE, 8 customised computing units
CPU: Dual Intel Xeon E5-2640, 12 cores 20 times speedup
992 expressions
12
Capability from acceleration
3.5x higher returns
20x speedup
Financial institution:
means:
Regulators analyze:
20x more rules
Return
s
Data
Points 13
Machine Learning on DFEs:
Future Work
• Deep Boltzmann Machine for financial market
direction prediction
• Support Vector Machines for satellite image
classification
• Data analysis and clustering methods such as
DBSCAN
14
Summary
15
• FPGAs accelerate many machine learning applications:
- Genetic Programming for optimized trading strategies
- Incremental Support Vector Machine for stock trading
- Deep Boltzmann Machine for financial market direction
prediction
- Support Vector Machine for satellite image classification
• Tools to enhance designer productivity:
- Aid users without electronic design experience
- Ensure high quality implementation: speed, accuracy, energy
efficiency.
References
16
[1] Gary C.T. Chow, Anson H.T. Tse, Qiwei Jin, Wayne Luk, Philip H.W. Leong,
David B. Thomas, “A Mixed Precision Monte Carlo Methodology for
Reconfigurable Accelerator Systems”, FPGA 2012.
[2] James Arram, Wayne Luk, Peiyong Jiang, "Ramethy: reconfigurable
acceleration of Bisulfite sequence alignment", FPGA, 2015.
[3] Francis P. Russell, Peter D. Duben, Xinyu Niu, Wayne Luk, T. N. Palmer,
“Architectures and precision analysis for modelling atmospheric variables with
chaotic behaviour”, FCCM 2015.
[4] Thomas C.P. Chau, James Targett, Marlon Wijeyasinghe,
Wayne Luk, Peter Y.K. Cheung, Benjamin Cope, Alison Eele, Jan Maciejowski,
“Accelerating Sequential Monte Carlo Method for Real-time Air Traffic
Management”, HEART 2013.
[5] Paul Grigoras, Pavel Burovskiy, Wayne Luk, “CASK – Open-Source
Custom Architectures for Sparse Kernels”, FPGA 2016.
References
17
[6] Maciej Kurek, Tobias Becker, Thomas P. Chau, Wayne Luk, “Automating
Optimization of Reconfigurable Designs”, FCCM 2014.
[7] Kit Cheung, Simon R. Schultz, Wayne Luk, “NeuroFlow: A general purpose
spiking neural network simulation platform using customizable processors”,
Frontiers in Neuroscience, 2016.
[8] Guo, Liucheng, Ce Guo, David B. Thomas, and Wayne Luk. “Pipelined
Genetic Propagation”, FCCM 2015.
[9] Shengjia Shao, Oskar Mencer, Wayne Luk, "Dataflow design for optimal
incremental SVM training", FPT, 2016
[10] Andrei bara, Xinyu Niu, Wayne Luk, “A dataflow system for anomaly
detection and analysis”, FPT 2014.
[11] Andreea-Ingrid Funie, Paul Grigoras, Pavel Burovskiy, Wayne Luk, Mark
Salmon, “Reconfigurable acceleration of fitness evaluation in trading
strategies”, ASAP 2015.

More Related Content

Viewers also liked

BDW16 London - Roland Major, Transport for London - Cloud Search Secured
BDW16 London - Roland Major, Transport for London - Cloud Search SecuredBDW16 London - Roland Major, Transport for London - Cloud Search Secured
BDW16 London - Roland Major, Transport for London - Cloud Search Secured
Big Data Week
 
BDW16 London - Amjad Zaim, Cognitro Analytics: How Deep is Your Learning
BDW16 London - Amjad Zaim, Cognitro Analytics: How Deep is Your Learning BDW16 London - Amjad Zaim, Cognitro Analytics: How Deep is Your Learning
BDW16 London - Amjad Zaim, Cognitro Analytics: How Deep is Your Learning
Big Data Week
 
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
Big Data Week
 
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
Big Data Week
 
BDW16 London - Vojta Rocek, Trologic - Challenging Big Data
BDW16 London - Vojta Rocek, Trologic - Challenging Big DataBDW16 London - Vojta Rocek, Trologic - Challenging Big Data
BDW16 London - Vojta Rocek, Trologic - Challenging Big Data
Big Data Week
 
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
Big Data Week
 
BDW16 London - Josh Partridge, Shazam - How Labels, Radio Stations and Brand...
BDW16 London - Josh Partridge, Shazam -  How Labels, Radio Stations and Brand...BDW16 London - Josh Partridge, Shazam -  How Labels, Radio Stations and Brand...
BDW16 London - Josh Partridge, Shazam - How Labels, Radio Stations and Brand...
Big Data Week
 
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven InnovatiomBDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
Big Data Week
 
BDW16 London - Charlie Ballard, TripAdvisor - TripAdvisor and Constant Change...
BDW16 London - Charlie Ballard, TripAdvisor - TripAdvisor and Constant Change...BDW16 London - Charlie Ballard, TripAdvisor - TripAdvisor and Constant Change...
BDW16 London - Charlie Ballard, TripAdvisor - TripAdvisor and Constant Change...
Big Data Week
 
BDW16 London - Marius Boeru, Bigstep - How to Automate Big Data with Ansible
BDW16 London -  Marius Boeru, Bigstep - How to Automate Big Data with AnsibleBDW16 London -  Marius Boeru, Bigstep - How to Automate Big Data with Ansible
BDW16 London - Marius Boeru, Bigstep - How to Automate Big Data with Ansible
Big Data Week
 
BDW16 London - Chris von Csefalvay, Helioserv - Cats and What They Tell us Ab...
BDW16 London - Chris von Csefalvay, Helioserv - Cats and What They Tell us Ab...BDW16 London - Chris von Csefalvay, Helioserv - Cats and What They Tell us Ab...
BDW16 London - Chris von Csefalvay, Helioserv - Cats and What They Tell us Ab...
Big Data Week
 
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word BingoBDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
Big Data Week
 
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
Big Data Week
 
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data PlatformBDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
Big Data Week
 
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the CloudBDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
Big Data Week
 
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
Big Data Week
 
BDW16 London - Nondas Sourlas, Bupa - Big Data in Healthcare
BDW16 London  - Nondas Sourlas, Bupa - Big Data in HealthcareBDW16 London  - Nondas Sourlas, Bupa - Big Data in Healthcare
BDW16 London - Nondas Sourlas, Bupa - Big Data in Healthcare
Big Data Week
 
BDW16 London - Rob Anderson, MapR - Big Data and Everyday Lives
BDW16 London - Rob Anderson, MapR - Big Data and Everyday LivesBDW16 London - Rob Anderson, MapR - Big Data and Everyday Lives
BDW16 London - Rob Anderson, MapR - Big Data and Everyday Lives
Big Data Week
 
BDW16 London - Mark van Rijmenam, Datafloq - Big Data is Dead, Long Live Big ...
BDW16 London - Mark van Rijmenam, Datafloq - Big Data is Dead, Long Live Big ...BDW16 London - Mark van Rijmenam, Datafloq - Big Data is Dead, Long Live Big ...
BDW16 London - Mark van Rijmenam, Datafloq - Big Data is Dead, Long Live Big ...
Big Data Week
 
BDW16 London - Harry Powell & Raffael Strassnig, Barclays UK - Graph-Based Re...
BDW16 London - Harry Powell & Raffael Strassnig, Barclays UK - Graph-Based Re...BDW16 London - Harry Powell & Raffael Strassnig, Barclays UK - Graph-Based Re...
BDW16 London - Harry Powell & Raffael Strassnig, Barclays UK - Graph-Based Re...
Big Data Week
 

Viewers also liked (20)

BDW16 London - Roland Major, Transport for London - Cloud Search Secured
BDW16 London - Roland Major, Transport for London - Cloud Search SecuredBDW16 London - Roland Major, Transport for London - Cloud Search Secured
BDW16 London - Roland Major, Transport for London - Cloud Search Secured
 
BDW16 London - Amjad Zaim, Cognitro Analytics: How Deep is Your Learning
BDW16 London - Amjad Zaim, Cognitro Analytics: How Deep is Your Learning BDW16 London - Amjad Zaim, Cognitro Analytics: How Deep is Your Learning
BDW16 London - Amjad Zaim, Cognitro Analytics: How Deep is Your Learning
 
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
BDW16 London - Mishal Patel, NHS - Modernising Routine Breast Cancer Using Bi...
 
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
BDW16 London - John Callan, Boxever - Data and Analytics - The Fuel Your Bran...
 
BDW16 London - Vojta Rocek, Trologic - Challenging Big Data
BDW16 London - Vojta Rocek, Trologic - Challenging Big DataBDW16 London - Vojta Rocek, Trologic - Challenging Big Data
BDW16 London - Vojta Rocek, Trologic - Challenging Big Data
 
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
BDW16 London - Scott Krueger, skyscanner - Does More Data Mean Better Decisio...
 
BDW16 London - Josh Partridge, Shazam - How Labels, Radio Stations and Brand...
BDW16 London - Josh Partridge, Shazam -  How Labels, Radio Stations and Brand...BDW16 London - Josh Partridge, Shazam -  How Labels, Radio Stations and Brand...
BDW16 London - Josh Partridge, Shazam - How Labels, Radio Stations and Brand...
 
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven InnovatiomBDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
BDW16 London - Wael Elrifai, Pentaho - Big Data-Driven Innovatiom
 
BDW16 London - Charlie Ballard, TripAdvisor - TripAdvisor and Constant Change...
BDW16 London - Charlie Ballard, TripAdvisor - TripAdvisor and Constant Change...BDW16 London - Charlie Ballard, TripAdvisor - TripAdvisor and Constant Change...
BDW16 London - Charlie Ballard, TripAdvisor - TripAdvisor and Constant Change...
 
BDW16 London - Marius Boeru, Bigstep - How to Automate Big Data with Ansible
BDW16 London -  Marius Boeru, Bigstep - How to Automate Big Data with AnsibleBDW16 London -  Marius Boeru, Bigstep - How to Automate Big Data with Ansible
BDW16 London - Marius Boeru, Bigstep - How to Automate Big Data with Ansible
 
BDW16 London - Chris von Csefalvay, Helioserv - Cats and What They Tell us Ab...
BDW16 London - Chris von Csefalvay, Helioserv - Cats and What They Tell us Ab...BDW16 London - Chris von Csefalvay, Helioserv - Cats and What They Tell us Ab...
BDW16 London - Chris von Csefalvay, Helioserv - Cats and What They Tell us Ab...
 
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word BingoBDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
BDW16 London - Jonny Voon, Innovate UK - Smart Cities and the Buzz Word Bingo
 
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
BDW16 London - John Belchamber, Telefonica - New Data, New Strategies, New Op...
 
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data PlatformBDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
 
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the CloudBDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
BDW16 London - Alex Bordei, Bigstep - Building Data Labs in the Cloud
 
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
BDW16 London - Deenar Toraskar, Think Reactive - Fast Data Key to Efficient C...
 
BDW16 London - Nondas Sourlas, Bupa - Big Data in Healthcare
BDW16 London  - Nondas Sourlas, Bupa - Big Data in HealthcareBDW16 London  - Nondas Sourlas, Bupa - Big Data in Healthcare
BDW16 London - Nondas Sourlas, Bupa - Big Data in Healthcare
 
BDW16 London - Rob Anderson, MapR - Big Data and Everyday Lives
BDW16 London - Rob Anderson, MapR - Big Data and Everyday LivesBDW16 London - Rob Anderson, MapR - Big Data and Everyday Lives
BDW16 London - Rob Anderson, MapR - Big Data and Everyday Lives
 
BDW16 London - Mark van Rijmenam, Datafloq - Big Data is Dead, Long Live Big ...
BDW16 London - Mark van Rijmenam, Datafloq - Big Data is Dead, Long Live Big ...BDW16 London - Mark van Rijmenam, Datafloq - Big Data is Dead, Long Live Big ...
BDW16 London - Mark van Rijmenam, Datafloq - Big Data is Dead, Long Live Big ...
 
BDW16 London - Harry Powell & Raffael Strassnig, Barclays UK - Graph-Based Re...
BDW16 London - Harry Powell & Raffael Strassnig, Barclays UK - Graph-Based Re...BDW16 London - Harry Powell & Raffael Strassnig, Barclays UK - Graph-Based Re...
BDW16 London - Harry Powell & Raffael Strassnig, Barclays UK - Graph-Based Re...
 

Similar to BDW16 London - Ingrid Funie, Imperial College London - Machine Learning and FPGA Based Hardware Acceleration

FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
Christian Plessl
 
Deep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLabDeep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLab
NECST Lab @ Politecnico di Milano
 
Parallex - The Supercomputer
Parallex - The SupercomputerParallex - The Supercomputer
Parallex - The Supercomputer
Ankit Singh
 
FPGA Hardware Accelerator for Machine Learning
FPGA Hardware Accelerator for Machine Learning FPGA Hardware Accelerator for Machine Learning
FPGA Hardware Accelerator for Machine Learning
Dr. Swaminathan Kathirvel
 
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
Scott Donald
 
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
confluent
 
Barcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de RiquezaBarcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de Riqueza
Facultad de Informática UCM
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
Larry Smarr
 
Designing HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale SystemsDesigning HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale Systems
inside-BigData.com
 
CloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use CaseCloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use Case
CloudLightning
 
Modern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High PerformanceModern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High Performance
inside-BigData.com
 
Adaptive Computing Seminar - Suyog Potdar
Adaptive Computing Seminar - Suyog PotdarAdaptive Computing Seminar - Suyog Potdar
Adaptive Computing Seminar - Suyog Potdar
Suyog Potdar
 
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
CSCJournals
 
The CAOS framework: Democratize the acceleration of compute intensive applica...
The CAOS framework: Democratize the acceleration of compute intensive applica...The CAOS framework: Democratize the acceleration of compute intensive applica...
The CAOS framework: Democratize the acceleration of compute intensive applica...
NECST Lab @ Politecnico di Milano
 
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation..."Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
Edge AI and Vision Alliance
 
Available HPC Resources at CSUC
Available HPC Resources at CSUCAvailable HPC Resources at CSUC
Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell High-Performance Computing solutions: Enable innovations, outperform exp...Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell World
 
Deep Learning for Fast Simulation
Deep Learning for Fast SimulationDeep Learning for Fast Simulation
Deep Learning for Fast Simulation
Helix Nebula The Science Cloud
 
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
Matteo Ferroni
 
Harnessing OpenCL in Modern Coprocessors
Harnessing OpenCL in Modern CoprocessorsHarnessing OpenCL in Modern Coprocessors
Harnessing OpenCL in Modern Coprocessors
Unai Lopez-Novoa
 

Similar to BDW16 London - Ingrid Funie, Imperial College London - Machine Learning and FPGA Based Hardware Acceleration (20)

FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
 
Deep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLabDeep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLab
 
Parallex - The Supercomputer
Parallex - The SupercomputerParallex - The Supercomputer
Parallex - The Supercomputer
 
FPGA Hardware Accelerator for Machine Learning
FPGA Hardware Accelerator for Machine Learning FPGA Hardware Accelerator for Machine Learning
FPGA Hardware Accelerator for Machine Learning
 
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
 
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
Enabling Insight to Support World-Class Supercomputing (Stefan Ceballos, Oak ...
 
Barcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de RiquezaBarcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de Riqueza
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
 
Designing HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale SystemsDesigning HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale Systems
 
CloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use CaseCloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use Case
 
Modern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High PerformanceModern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High Performance
 
Adaptive Computing Seminar - Suyog Potdar
Adaptive Computing Seminar - Suyog PotdarAdaptive Computing Seminar - Suyog Potdar
Adaptive Computing Seminar - Suyog Potdar
 
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
 
The CAOS framework: Democratize the acceleration of compute intensive applica...
The CAOS framework: Democratize the acceleration of compute intensive applica...The CAOS framework: Democratize the acceleration of compute intensive applica...
The CAOS framework: Democratize the acceleration of compute intensive applica...
 
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation..."Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
 
Available HPC Resources at CSUC
Available HPC Resources at CSUCAvailable HPC Resources at CSUC
Available HPC Resources at CSUC
 
Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell High-Performance Computing solutions: Enable innovations, outperform exp...Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell High-Performance Computing solutions: Enable innovations, outperform exp...
 
Deep Learning for Fast Simulation
Deep Learning for Fast SimulationDeep Learning for Fast Simulation
Deep Learning for Fast Simulation
 
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
 
Harnessing OpenCL in Modern Coprocessors
Harnessing OpenCL in Modern CoprocessorsHarnessing OpenCL in Modern Coprocessors
Harnessing OpenCL in Modern Coprocessors
 

More from Big Data Week

BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
 BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A... BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
Big Data Week
 
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
Big Data Week
 
BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference
BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal InferenceBDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference
BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference
Big Data Week
 
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
Big Data Week
 
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
Big Data Week
 
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
Big Data Week
 
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of DataBDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
Big Data Week
 
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
Big Data Week
 
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
Big Data Week
 
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
Big Data Week
 

More from Big Data Week (10)

BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
 BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A... BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
BDW17 London - Edward Kibardin - Mitie PLC - Learning and Topological Data A...
 
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
BDWW17 London - Steve Bradbury, GRSC - Big Data to the Rescue: A Fraud Case S...
 
BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference
BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal InferenceBDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference
BDW17 London - Totte Harinen, Uber - Why Big Data Didn’t End Causal Inference
 
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
BDW17 London - Rita Simoes, Boehringer Ingelheim - Big Data in Pharma: Sittin...
 
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
BDW17 London - Mick Ridley, Exterion Media & Dale Campbell , TfL - Transformi...
 
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
BDW17 London - Abed Ajraou - First Utility - Putting Data Science in your Bus...
 
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of DataBDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
BDW17 London - Steve Bradbury - GRSC - Making Sense of the Chaos of Data
 
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
BDW17 London - Andy Boura - Thomson Reuters - Does Big Data Have to Mean Big ...
 
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
BDW17 London - Tom Woolrich, Financial Times - What Does Big Data Mean for th...
 
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
BDW17 London - Andrew Fryer, Microsoft - Everybody Needs a Bit of Science in ...
 

Recently uploaded

Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
Federico Razzoli
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 

Recently uploaded (20)

Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 

BDW16 London - Ingrid Funie, Imperial College London - Machine Learning and FPGA Based Hardware Acceleration

  • 1. MACHINE LEARNING AND FPGA-BASED HARDWARE ACCELERATION Andreea-Ingrid Funie PhD Candidate, Imperial College London 0
  • 2. Custom-Computing Group Head: Prof. Wayne Luk 1 Field Programmable Gate Array
  • 3. Next Generation Computing • Existing computers: - Slow - Power hungry - Complex to implement applications • Our focus: custom computing - Customise hardware/software to applications - Enhance design quality and designer productivity • Research strategy: - FPGA: reconfigurable acceleration - DFE: data flow engine = FPGA + memory + dataflow 2
  • 4. Our Hardware Devices • 3 MPCX nodes with 8 cards each with Stratix V FPGA. Each card has a measured main memory throughput of 65GB/s :  1.5 TB/s of potential access  130PB/day of potential processing • 10 FPGAs (e.g. : Altera Stratix V, Xilinx Virtex VI) • 6 GPUs (e.g. : NVIDIA Tesla C2070 GPU: 448 cores running at 1.15GHz; NVIDIA Kepler k40/k80) At Imperial College London HPC center: https://wiki.imperial.ac.uk/display/HPC/Systems  ax4 (15 Tbytes of RAM & 1280 cores, 1.5PBytes fast raid storage), cx2 (456 nodes & 5272 cores comprised of SGI Altix ICE hardware), cx1 (1395 nodes & 13558 cores, 8 Nvidia K80, 4 Nvidia K40) 3
  • 5. Financial simulation: 163 times faster1, 170 times less energy Genomic analysis: 88 times faster, 3 times less energy DFE: speed + energy efficiency String MatchingLeast Square Monte Carlo Method [1] Chow et al. (FPGA Conference, 2012) [2] Arram et al. (FPGA Conference, 2015) 4 1faster than the equivalent single/multi- core implementation
  • 6. Climate modelling: 13 times faster1 DFE: speed + energy efficiency 5 Stencil Computation Air traffic management: 17 times faster, 15 times less energy Sequential Monte Carlo Method [3] Russell et al. (FCCM Conference, 2015) [4] Chau et al. (HEART Conference, 2013)1faster than the equivalent single/multi- core implementation
  • 7. DFE: speed + energy efficiency Optimal architecture up to 47 times faster (UoF Benchmark) Iterative Sparse Linear Solvers Computational Fluid Dynamics Power Systems Simulation 6 [5] Grigoras et al. (FPGA Conference, 2016)
  • 8. Machine Learning on DFEs 7 Multi Objective Machine Learning Optimizer • Self-optimization of reconfigurable designs through automatic analysis and adaptation of design parameters • Can switch between a fast/power hungry design and a relatively slow/low power alternative • Uses: - Gaussian Process Regression - Support Vector Machine Classification - Particle Swarm Optimization [6] Kurek et al. (FCCM Conference, 2014)
  • 9. Machine Learning on DFEs Pipelined Genetic Propagation Travelling salesman problem: 90 times faster 8 Neural Networks Simulation Polychronous spiking neural network: 34 times faster1 [7] Cheung et al. (Frontiers in Neuroscience, 2016) [8] Guo et al. (FCCM Conference, 2015)1faster than the equivalent single/multi- core implementation
  • 10. Incremental Support Vector Machine Stock trading: 41 times faster1 One-class Support Vector Machine Network anomaly detection: 6 times faster Machine Learning on DFEs 9 [9] Shao et al. (FPT Conference, 2016) [10] Bara et al. (FPT Conference, 2014)1faster than the equivalent single/multi- core implementation
  • 11. Machine Learning for Financial Applications on DFEs Challenges: • Quantity of data • Speed of processing • Accuracy of results 10
  • 12. Genetic Programming for Trading needs acceleration [11] Funie et al. (ASAP Conference, 201 11
  • 13. DFE Speedup over CPU DFE: Maxeler Maia DFE, 8 customised computing units CPU: Dual Intel Xeon E5-2640, 12 cores 20 times speedup 992 expressions 12
  • 14. Capability from acceleration 3.5x higher returns 20x speedup Financial institution: means: Regulators analyze: 20x more rules Return s Data Points 13
  • 15. Machine Learning on DFEs: Future Work • Deep Boltzmann Machine for financial market direction prediction • Support Vector Machines for satellite image classification • Data analysis and clustering methods such as DBSCAN 14
  • 16. Summary 15 • FPGAs accelerate many machine learning applications: - Genetic Programming for optimized trading strategies - Incremental Support Vector Machine for stock trading - Deep Boltzmann Machine for financial market direction prediction - Support Vector Machine for satellite image classification • Tools to enhance designer productivity: - Aid users without electronic design experience - Ensure high quality implementation: speed, accuracy, energy efficiency.
  • 17. References 16 [1] Gary C.T. Chow, Anson H.T. Tse, Qiwei Jin, Wayne Luk, Philip H.W. Leong, David B. Thomas, “A Mixed Precision Monte Carlo Methodology for Reconfigurable Accelerator Systems”, FPGA 2012. [2] James Arram, Wayne Luk, Peiyong Jiang, "Ramethy: reconfigurable acceleration of Bisulfite sequence alignment", FPGA, 2015. [3] Francis P. Russell, Peter D. Duben, Xinyu Niu, Wayne Luk, T. N. Palmer, “Architectures and precision analysis for modelling atmospheric variables with chaotic behaviour”, FCCM 2015. [4] Thomas C.P. Chau, James Targett, Marlon Wijeyasinghe, Wayne Luk, Peter Y.K. Cheung, Benjamin Cope, Alison Eele, Jan Maciejowski, “Accelerating Sequential Monte Carlo Method for Real-time Air Traffic Management”, HEART 2013. [5] Paul Grigoras, Pavel Burovskiy, Wayne Luk, “CASK – Open-Source Custom Architectures for Sparse Kernels”, FPGA 2016.
  • 18. References 17 [6] Maciej Kurek, Tobias Becker, Thomas P. Chau, Wayne Luk, “Automating Optimization of Reconfigurable Designs”, FCCM 2014. [7] Kit Cheung, Simon R. Schultz, Wayne Luk, “NeuroFlow: A general purpose spiking neural network simulation platform using customizable processors”, Frontiers in Neuroscience, 2016. [8] Guo, Liucheng, Ce Guo, David B. Thomas, and Wayne Luk. “Pipelined Genetic Propagation”, FCCM 2015. [9] Shengjia Shao, Oskar Mencer, Wayne Luk, "Dataflow design for optimal incremental SVM training", FPT, 2016 [10] Andrei bara, Xinyu Niu, Wayne Luk, “A dataflow system for anomaly detection and analysis”, FPT 2014. [11] Andreea-Ingrid Funie, Paul Grigoras, Pavel Burovskiy, Wayne Luk, Mark Salmon, “Reconfigurable acceleration of fitness evaluation in trading strategies”, ASAP 2015.