Christoforos Kachris, Elias Koromilas, Ioannis Stamelos, Dimitrios Soudris
kachris@microlab.ntua.gr
ICCS-National Technical University of Athens
ARC 2018, Santorini
Seamless FPGA deployment over Spark in cloud
computing: A use case on machine learning
hardware acceleration
www.vineyard-h2020.eu
Network traffic in the data centers
2
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
Power consumption in the data centers
3
• Currently Data
Centers consume
huge amounts of
energy
• Servers consume
around 30% of the
total power budget of
the IT infrastructure
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
Diverse Data Center Demands
4
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
FPGAs at the spotlight
5
April 2015
Submission of VINEYARD proposal
2015
2016
2017
December 2016
Overall, Intel now has five different AI
platforms; FPGAs, the Xeon Phi, the
Nervana NNP, the Myriad X, and its
traditional Core processor. The Core
processor still performs most AI tasks.
www.vineyard-h2020.eu
FPGAs in the news
Christoforos Kachris, ICCS, Greece 6
www.vineyard-h2020.eu
FPGAs in Data Center
• Intel: “Two orders of magnitude faster than GPU by 2020”
($16.7 billion bet)
Broadwel Xeon with Arria 10
• Microsoft Bing with Altera Stratix V
• IBM SupperVessel with Power8 + Xilinx
• Xilinx SDAccel on Nimbix Cloud
• Google has released TPU only for Tensorflow – ISCA 2017
7
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
Machine learning market size
• The machine learning market
size is expected to grow from
USD 1.41 Billion in 2017 to
USD 8.81 Billion by 2022, at a
Compound Annual Growth
Rate (CAGR) of 44.1%.
https://www.marketsandmarkets.com/PressReleas
es/machine-learning.asp
Christoforos Kachris, ICCS, Greece 8
www.vineyard-h2020.eu
Machine learning as a service
Christoforos Kachris, ICCS, Greece 9
www.vineyard-h2020.eu
Apache Spark
The largest open source project in
data processing.
• Structured Data
• Streaming Analytics
• Machine Learning
• Graph Computation
Provides an interface for
programming entire clusters with
implicit data parallelism and fault-
tolerance.
10
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
Contributions
• The FPGA driver API is packed in a shared object library and
can be used in a transparent way hiding all the low level
details.
• We implemented top level APIs in Python for standalone and
Apache Spark integrated use, that are easy to be used and are
also easily maintained since the middle layer, our shared
library remains the same for all of the above.
Christoforos Kachris, ICCS, Greece 11
www.vineyard-h2020.eu
System stack
• Application Layer: This layer
hosts users’ applications. The
applications can run natively
using Python.
• Vineyard Layer: This layer hosts
the whole functionality of our
framework. The key element of
this layer is the implemented
shared library
• SDSoC-HLS API and FPGA
layerhared library
Christoforos Kachris, ICCS, Greece 12
www.vineyard-h2020.eu
Flow for data movement - RDDS
• Flow of the original
and optimized
method for the
DMA transfers to
the accelerator
Christoforos Kachris, ICCS, Greece 13
www.vineyard-h2020.eu
Use case: K-means clustering
Christoforos Kachris, ICCS, Greece 14
www.vineyard-h2020.eu
Evaluation platforms
Christoforos Kachris, ICCS, Greece 15
www.vineyard-h2020.eu
Pynq: Python Productivity for Zynq
• An open-source project from Xilinx that
makes it easy to design embedded
systems with Zynq MPSoCs.
• The APSoC is programmed using
Python.
• The code is developed and tested
directly on the PYNQ-Z1 board.
• The programmable logic circuits are
imported as hardware libraries and
programmed through their APIs in
essentially the same way as the
software libraries.
16
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
Spark integration on Heterogeneous MPSoC
17
K-means clustering
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
Spynq: Spark on Pynq integration
18
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
Speedup
Christoforos Kachris, ICCS, Greece 19
www.vineyard-h2020.eu
Energy consumption
• Intel Xeon vs Pynq
cluster
• ARM vs Pynq cluster
Christoforos Kachris, ICCS, Greece 20
www.vineyard-h2020.eu
Available on github
21
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
Cluster of Zynq (Pynq devices) running Spark
22
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
SW on Intel Xeon
23
Accel on FPGA MPSoC
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu
VINEYARD Framework
24
• Accelerators stored
in an AppStore
• Cloud users request
accelerators based
on applications
requirements
• Decouple Hardware
– Software
designers
Cloud computing Applications
VINEYARD Cloud Resource Manager
3rd party IP
developersLibrary of Hardware
accelerators as IP
Blocks
Heterogeneous Data Center
DFE
Processors Dataflow Proc.+FPGA
IP Accelerator’s
App store
Cloud tenants
Acc
Acc
Acc
Acc
DFE
DFE
DFE
Accelerator Controller
Accelerator Virtualization
Scheduler
Accelerator API
Performance
Energy
Christoforos Kachris, ICCS, Greece
www.vineyard-h2020.eu 25
www.vineyard-h2020.eu
Consortium
26
Platform Evaluator
Data Centre Vendor
System Vendor (Dataflow Eng.)
System Software
Programming framework &
Hardware accelerators
Data Centre
Software developers
Data Centre End User
www.vineyard-h2020.eu
VINEYARD appears in Xilinx’s website
27
www.vineyard-h2020.eu
Ecosystem on FPGAs in the cloud
Christoforos Kachris, ICCS, Greece 28
www.vineyard-h2020.eu
Main goals
VINEYARD AIMS TO
• Build an integrated platform for energy-efficient data
centres based on novel programmable hardware
accelerators
• Develop a high-level programming framework and big
data infrastructure for allowing end-users to seamlessly
utilize these accelerators in heterogeneous computing
systems by employing typical data-centre programming
frameworks (i.e. Spark.).
• VINEYARD also foster the establishment of an
ecosystem that will empower open innovation based on
hardware accelerators as data-centre plugins for
marketplace, thereby facilitating innovative enterprises
(large industries, SMEs, and creative start-ups) to
develop novel solutions using VINEYARDS’s leading
edge developments.
29
Christoforos Kachris, ICCS, Greece
• Speedup your application seamlessly
• An integrated framework for the utilization of hardware
accelerators in HPC and data center seamlessly
Contact detais: kachris@microlab.ntua.gr

Seamless FPGA deployment over Spark in cloud computing: A use case on machine learning hardware acceleration

  • 1.
    Christoforos Kachris, EliasKoromilas, Ioannis Stamelos, Dimitrios Soudris kachris@microlab.ntua.gr ICCS-National Technical University of Athens ARC 2018, Santorini Seamless FPGA deployment over Spark in cloud computing: A use case on machine learning hardware acceleration
  • 2.
    www.vineyard-h2020.eu Network traffic inthe data centers 2 Christoforos Kachris, ICCS, Greece
  • 3.
    www.vineyard-h2020.eu Power consumption inthe data centers 3 • Currently Data Centers consume huge amounts of energy • Servers consume around 30% of the total power budget of the IT infrastructure Christoforos Kachris, ICCS, Greece
  • 4.
    www.vineyard-h2020.eu Diverse Data CenterDemands 4 Christoforos Kachris, ICCS, Greece
  • 5.
    www.vineyard-h2020.eu FPGAs at thespotlight 5 April 2015 Submission of VINEYARD proposal 2015 2016 2017 December 2016 Overall, Intel now has five different AI platforms; FPGAs, the Xeon Phi, the Nervana NNP, the Myriad X, and its traditional Core processor. The Core processor still performs most AI tasks.
  • 6.
    www.vineyard-h2020.eu FPGAs in thenews Christoforos Kachris, ICCS, Greece 6
  • 7.
    www.vineyard-h2020.eu FPGAs in DataCenter • Intel: “Two orders of magnitude faster than GPU by 2020” ($16.7 billion bet) Broadwel Xeon with Arria 10 • Microsoft Bing with Altera Stratix V • IBM SupperVessel with Power8 + Xilinx • Xilinx SDAccel on Nimbix Cloud • Google has released TPU only for Tensorflow – ISCA 2017 7 Christoforos Kachris, ICCS, Greece
  • 8.
    www.vineyard-h2020.eu Machine learning marketsize • The machine learning market size is expected to grow from USD 1.41 Billion in 2017 to USD 8.81 Billion by 2022, at a Compound Annual Growth Rate (CAGR) of 44.1%. https://www.marketsandmarkets.com/PressReleas es/machine-learning.asp Christoforos Kachris, ICCS, Greece 8
  • 9.
    www.vineyard-h2020.eu Machine learning asa service Christoforos Kachris, ICCS, Greece 9
  • 10.
    www.vineyard-h2020.eu Apache Spark The largestopen source project in data processing. • Structured Data • Streaming Analytics • Machine Learning • Graph Computation Provides an interface for programming entire clusters with implicit data parallelism and fault- tolerance. 10 Christoforos Kachris, ICCS, Greece
  • 11.
    www.vineyard-h2020.eu Contributions • The FPGAdriver API is packed in a shared object library and can be used in a transparent way hiding all the low level details. • We implemented top level APIs in Python for standalone and Apache Spark integrated use, that are easy to be used and are also easily maintained since the middle layer, our shared library remains the same for all of the above. Christoforos Kachris, ICCS, Greece 11
  • 12.
    www.vineyard-h2020.eu System stack • ApplicationLayer: This layer hosts users’ applications. The applications can run natively using Python. • Vineyard Layer: This layer hosts the whole functionality of our framework. The key element of this layer is the implemented shared library • SDSoC-HLS API and FPGA layerhared library Christoforos Kachris, ICCS, Greece 12
  • 13.
    www.vineyard-h2020.eu Flow for datamovement - RDDS • Flow of the original and optimized method for the DMA transfers to the accelerator Christoforos Kachris, ICCS, Greece 13
  • 14.
    www.vineyard-h2020.eu Use case: K-meansclustering Christoforos Kachris, ICCS, Greece 14
  • 15.
  • 16.
    www.vineyard-h2020.eu Pynq: Python Productivityfor Zynq • An open-source project from Xilinx that makes it easy to design embedded systems with Zynq MPSoCs. • The APSoC is programmed using Python. • The code is developed and tested directly on the PYNQ-Z1 board. • The programmable logic circuits are imported as hardware libraries and programmed through their APIs in essentially the same way as the software libraries. 16 Christoforos Kachris, ICCS, Greece
  • 17.
    www.vineyard-h2020.eu Spark integration onHeterogeneous MPSoC 17 K-means clustering Christoforos Kachris, ICCS, Greece
  • 18.
    www.vineyard-h2020.eu Spynq: Spark onPynq integration 18 Christoforos Kachris, ICCS, Greece
  • 19.
  • 20.
    www.vineyard-h2020.eu Energy consumption • IntelXeon vs Pynq cluster • ARM vs Pynq cluster Christoforos Kachris, ICCS, Greece 20
  • 21.
  • 22.
    www.vineyard-h2020.eu Cluster of Zynq(Pynq devices) running Spark 22 Christoforos Kachris, ICCS, Greece
  • 23.
    www.vineyard-h2020.eu SW on IntelXeon 23 Accel on FPGA MPSoC Christoforos Kachris, ICCS, Greece
  • 24.
    www.vineyard-h2020.eu VINEYARD Framework 24 • Acceleratorsstored in an AppStore • Cloud users request accelerators based on applications requirements • Decouple Hardware – Software designers Cloud computing Applications VINEYARD Cloud Resource Manager 3rd party IP developersLibrary of Hardware accelerators as IP Blocks Heterogeneous Data Center DFE Processors Dataflow Proc.+FPGA IP Accelerator’s App store Cloud tenants Acc Acc Acc Acc DFE DFE DFE Accelerator Controller Accelerator Virtualization Scheduler Accelerator API Performance Energy Christoforos Kachris, ICCS, Greece
  • 25.
  • 26.
    www.vineyard-h2020.eu Consortium 26 Platform Evaluator Data CentreVendor System Vendor (Dataflow Eng.) System Software Programming framework & Hardware accelerators Data Centre Software developers Data Centre End User
  • 27.
  • 28.
    www.vineyard-h2020.eu Ecosystem on FPGAsin the cloud Christoforos Kachris, ICCS, Greece 28
  • 29.
    www.vineyard-h2020.eu Main goals VINEYARD AIMSTO • Build an integrated platform for energy-efficient data centres based on novel programmable hardware accelerators • Develop a high-level programming framework and big data infrastructure for allowing end-users to seamlessly utilize these accelerators in heterogeneous computing systems by employing typical data-centre programming frameworks (i.e. Spark.). • VINEYARD also foster the establishment of an ecosystem that will empower open innovation based on hardware accelerators as data-centre plugins for marketplace, thereby facilitating innovative enterprises (large industries, SMEs, and creative start-ups) to develop novel solutions using VINEYARDS’s leading edge developments. 29 Christoforos Kachris, ICCS, Greece
  • 30.
    • Speedup yourapplication seamlessly • An integrated framework for the utilization of hardware accelerators in HPC and data center seamlessly Contact detais: kachris@microlab.ntua.gr