SlideShare a Scribd company logo
1 of 21
Download to read offline
Konstantinos Katsantonis, Christoforos Kachris, Dimitrios Soudris
kachris@microlab.ntua.gr
ICCS-National Technical University of Athens
ARC , May 2018
Efficient hardware acceleration of
recommendation engines: a use case on
collaborative filtering
www.vineyard-h2020.eu
Recommendation engines Market size
2
• The global recommendation engine market, is
expected to grow from USD 801.1 Million in 2017 to
USD 4414.8 Million by 2022, at a Compound Annual
Growth Rate (CAGR) of 40.7% during the forecast
period.
• The collaborative filtering technique uses a large
volume of information, such as users' behavior,
preferences, and activities from the past records to
segment users based on similarity of likings
• https://www.prnewswire.com/news-releases/global-recommendation-engine-market-2018-2022---key-
innovators-include-fuzzyai--infinite-analytics-300624258.html
Christoforos Kachris, ICCS-NTUA, ARC 2018
www.vineyard-h2020.eu
Objectives
Christoforos Kachris, ICCS-NTUA, ARC 2018 3
• Design Space Exploration of a Recommendation System using
Matrix Factorization trained by Alternating Least Squares.
• Efficient mapping in reconfigurable computing using High-
Level Synthesis (HLS).
• Performance and power evaluation.
• Creation of a python interface for the accelerator.
• Integration with the Spark framework through python.
www.vineyard-h2020.eu
Algorithm overview
• Predict The Missing/Unknown ratings using the
following model
Christoforos Kachris, ICCS-NTUA, ARC 2018 4
www.vineyard-h2020.eu
Architecture
• For every user and
every movie solve
the following system
• Efficient mapping on
BRAM and comput.
resources
Christoforos Kachris, ICCS-NTUA, ARC 2018 5
www.vineyard-h2020.eu
Prototype on Zedboard
Christoforos Kachris, ICCS-NTUA, ARC 2018 6
Software - Kernel
interface version 1
Software - Kernel
interface version 2
Software - Kernel
interface version 3
AXI4-Stream AXI4-Stream AXI4-Stream
Hand Written Driver Xilinx IP “Accelerator FIFO
Adapter”
Xilinx IP “Accelerator FIFO
Adapter”
2-dimensional Zero
Padding
2-dimensional Zero
Padding
1-dimensional Zero
Padding
Fixed Input Data Window
Dimensions
Fixed Input Data Window
Dimensions
Variable Size Input
Windows, Determined on
Runtime
www.vineyard-h2020.eu
Pynq: Python Productivity for Zynq
• An open-source project from Xilinx that
makes it easy to design embedded
systems with Zynq MPSoCs.
• The APSoC is programmed using
Python.
• The code is developed and tested
directly on the PYNQ-Z1 board.
• The programmable logic circuits are
imported as hardware libraries and
programmed through their APIs in
essentially the same way as the
software libraries.
7
Christoforos Kachris, ICCS-NTUA, ARC 2018
www.vineyard-h2020.eu
Apache Spark
The largest open source project in
data processing.
• Structured Data
• Streaming Analytics
• Machine Learning
• Graph Computation
Provides an interface for
programming entire clusters with
implicit data parallelism and fault-
tolerance.
8
Christoforos Kachris, ICCS-NTUA, ARC 2018
www.vineyard-h2020.eu
Spynq: Spark on Pynq integration
9
Christoforos Kachris, ICCS-NTUA, ARC 2018
www.vineyard-h2020.eu
Spark integration on Heterogeneous MPSoC
10
Recommendation engine
Christoforos Kachris, ICCS-NTUA, ARC 2018
www.vineyard-h2020.eu
Pynq and Spark integration
Christoforos Kachris, ICCS-NTUA, ARC 2018 11
www.vineyard-h2020.eu
Cluster of Zynq (Pynq devices) running Spark
12
Christoforos Kachris, ICCS-NTUA, ARC 2018
www.vineyard-h2020.eu
Results
Christoforos Kachris, ICCS-NTUA, ARC 2018 13
www.vineyard-h2020.eu
Results
• Kernel Integration with ALS algorithm. Acceleration
achieved using Zedboard with input datasets movielens
100k & 1m vs arm-only execution.
Christoforos Kachris, ICCS-NTUA, ARC 2018 14
www.vineyard-h2020.eu
Results
• System Integration with ALS algorithm. Acceleration
achieved using PyNQ with input datasets movielens
100k & 1m vs arm-only execution.
Christoforos Kachris, ICCS-NTUA, ARC 2018 15
www.vineyard-h2020.eu
Xeon vs 4-node Pynq cluster
• Xeon outperformed the Spark Cluster on the proposed
schema, due to the enormous data transfers required
by ALS, over Ethernet and the additional software
computations introduced by Spark.
Christoforos Kachris, ICCS-NTUA, ARC 2018 16
www.vineyard-h2020.eu
Power consumption
Christoforos Kachris, ICCS-NTUA, ARC 2018 17
12x Less Energy
27x Less Energy
www.vineyard-h2020.eu
Available on github 18
Christoforos Kachris, ICCS-NTUA, ARC 2018
www.vineyard-h2020.eu
VINEYARD Framework
19
• Accelerators stored
in an AppStore
• Cloud users request
accelerators based
on applications
requirements
• Decouple Hardware
– Software
designers
Cloud computing Applications
VINEYARD Cloud Resource Manager
3rd party IP
developersLibrary of Hardware
accelerators as IP
Blocks
Heterogeneous Data Center
DFE
Processors Dataflow Proc.+FPGA
IP Accelerator’s
App store
Cloud tenants
Acc
Acc
Acc
Acc
DFE
DFE
DFE
Accelerator Controller
Accelerator Virtualization
Scheduler
Accelerator API
Performance
Energy
Christoforos Kachris, ICCS-NTUA, ARC 2018
www.vineyard-h2020.eu
Main goals
VINEYARD AIMS TO
• Build an integrated platform for energy-efficient data
centres based on novel programmable hardware
accelerators
• Develop a high-level programming framework and big
data infrastructure for allowing end-users to seamlessly
utilize these accelerators in heterogeneous computing
systems by employing typical data-centre programming
frameworks (i.e. Spark.).
• VINEYARD also foster the establishment of an
ecosystem that will empower open innovation based on
hardware accelerators as data-centre plugins for
marketplace, thereby facilitating innovative enterprises
(large industries, SMEs, and creative start-ups) to
develop novel solutions using VINEYARDS’s leading
edge developments.
20
Christoforos Kachris, ICCS-NTUA, ARC 2018
• Speedup your application seamlessly
• An integrated framework for the utilization of hardware
accelerators in HPC and data center seamlessly
Contact detais: kachris@microlab.ntua.gr

More Related Content

What's hot

Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019
Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019
Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019Codit
 
Getting started with IoT
Getting started with IoTGetting started with IoT
Getting started with IoTCodit
 
SAP Leonardo Foundation IoT - A Practical Walkthrough sitWDF
SAP Leonardo Foundation IoT - A Practical Walkthrough sitWDFSAP Leonardo Foundation IoT - A Practical Walkthrough sitWDF
SAP Leonardo Foundation IoT - A Practical Walkthrough sitWDFFabian Lehmann
 
Real-time analytics in IoT by Sam Vanhoutte (@Building The Future 2019)
Real-time analytics in IoT by Sam Vanhoutte (@Building The Future 2019)Real-time analytics in IoT by Sam Vanhoutte (@Building The Future 2019)
Real-time analytics in IoT by Sam Vanhoutte (@Building The Future 2019)Codit
 
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...Ryft
 
How to connect a 30-year-old car to the cloud (Sam Vanhoutte @Techorama 2018)
How to connect a 30-year-old car to the cloud (Sam Vanhoutte @Techorama 2018)How to connect a 30-year-old car to the cloud (Sam Vanhoutte @Techorama 2018)
How to connect a 30-year-old car to the cloud (Sam Vanhoutte @Techorama 2018)Codit
 
SAP Leonardo Foundation IoT - A Practical Walkthrough
SAP Leonardo Foundation IoT - A Practical WalkthroughSAP Leonardo Foundation IoT - A Practical Walkthrough
SAP Leonardo Foundation IoT - A Practical WalkthroughFabian Lehmann
 
Roadshow Chicago - Introduction
Roadshow   Chicago - IntroductionRoadshow   Chicago - Introduction
Roadshow Chicago - IntroductionInfluxData
 
Introducing the Digital Repository for Museum Collections (DRMC)
Introducing the Digital Repository for Museum Collections (DRMC)Introducing the Digital Repository for Museum Collections (DRMC)
Introducing the Digital Repository for Museum Collections (DRMC)Artefactual Systems - AtoM
 
FIWARE Global Summit - Professional Dashboards for Dummies - Build Your Smart...
FIWARE Global Summit - Professional Dashboards for Dummies - Build Your Smart...FIWARE Global Summit - Professional Dashboards for Dummies - Build Your Smart...
FIWARE Global Summit - Professional Dashboards for Dummies - Build Your Smart...FIWARE
 
Introduction to Time Series Analytics with Microsoft Azure
Introduction to Time Series Analytics with Microsoft AzureIntroduction to Time Series Analytics with Microsoft Azure
Introduction to Time Series Analytics with Microsoft AzureCodit
 
SmartCity IoT on Kubernetes and OpenStack
SmartCity IoT on Kubernetes and OpenStackSmartCity IoT on Kubernetes and OpenStack
SmartCity IoT on Kubernetes and OpenStackJakub Pavlik
 
Next Generation of Data Integration with Azure Data Factory by Tom Kerkhove
Next Generation of Data Integration with Azure Data Factory by Tom KerkhoveNext Generation of Data Integration with Azure Data Factory by Tom Kerkhove
Next Generation of Data Integration with Azure Data Factory by Tom KerkhoveCodit
 
Shashi Raina [AWS] & Al Sargent [InfluxData] | Build Modern Monitoring with I...
Shashi Raina [AWS] & Al Sargent [InfluxData] | Build Modern Monitoring with I...Shashi Raina [AWS] & Al Sargent [InfluxData] | Build Modern Monitoring with I...
Shashi Raina [AWS] & Al Sargent [InfluxData] | Build Modern Monitoring with I...InfluxData
 
Big & Open Data: Challenges for Smartcity
Big & Open Data:  Challenges for SmartcityBig & Open Data:  Challenges for Smartcity
Big & Open Data: Challenges for SmartcityVictoria López
 
Accelerating Digital Transformation with App Modernization
Accelerating Digital Transformation with App ModernizationAccelerating Digital Transformation with App Modernization
Accelerating Digital Transformation with App ModernizationDavid J Rosenthal
 
Data Science at Netflix - Principles for Speed & Scale [Rev 2019 keynote]
Data Science at Netflix - Principles for Speed & Scale [Rev 2019 keynote]Data Science at Netflix - Principles for Speed & Scale [Rev 2019 keynote]
Data Science at Netflix - Principles for Speed & Scale [Rev 2019 keynote]Michelle Ufford
 
Extending Operations from On-premises Solutions Towards Hybrid and Cloud - Da...
Extending Operations from On-premises Solutions Towards Hybrid and Cloud - Da...Extending Operations from On-premises Solutions Towards Hybrid and Cloud - Da...
Extending Operations from On-premises Solutions Towards Hybrid and Cloud - Da...Codit
 
Upgrading Made Easy: Moving to InfluxDB 2.x or InfluxDB Cloud with Cribl LogS...
Upgrading Made Easy: Moving to InfluxDB 2.x or InfluxDB Cloud with Cribl LogS...Upgrading Made Easy: Moving to InfluxDB 2.x or InfluxDB Cloud with Cribl LogS...
Upgrading Made Easy: Moving to InfluxDB 2.x or InfluxDB Cloud with Cribl LogS...InfluxData
 

What's hot (20)

Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019
Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019
Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019
 
Getting started with IoT
Getting started with IoTGetting started with IoT
Getting started with IoT
 
SAP Leonardo Foundation IoT - A Practical Walkthrough sitWDF
SAP Leonardo Foundation IoT - A Practical Walkthrough sitWDFSAP Leonardo Foundation IoT - A Practical Walkthrough sitWDF
SAP Leonardo Foundation IoT - A Practical Walkthrough sitWDF
 
Real-time analytics in IoT by Sam Vanhoutte (@Building The Future 2019)
Real-time analytics in IoT by Sam Vanhoutte (@Building The Future 2019)Real-time analytics in IoT by Sam Vanhoutte (@Building The Future 2019)
Real-time analytics in IoT by Sam Vanhoutte (@Building The Future 2019)
 
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...
 
How to connect a 30-year-old car to the cloud (Sam Vanhoutte @Techorama 2018)
How to connect a 30-year-old car to the cloud (Sam Vanhoutte @Techorama 2018)How to connect a 30-year-old car to the cloud (Sam Vanhoutte @Techorama 2018)
How to connect a 30-year-old car to the cloud (Sam Vanhoutte @Techorama 2018)
 
SAP Leonardo Foundation IoT - A Practical Walkthrough
SAP Leonardo Foundation IoT - A Practical WalkthroughSAP Leonardo Foundation IoT - A Practical Walkthrough
SAP Leonardo Foundation IoT - A Practical Walkthrough
 
Roadshow Chicago - Introduction
Roadshow   Chicago - IntroductionRoadshow   Chicago - Introduction
Roadshow Chicago - Introduction
 
Introducing the Digital Repository for Museum Collections (DRMC)
Introducing the Digital Repository for Museum Collections (DRMC)Introducing the Digital Repository for Museum Collections (DRMC)
Introducing the Digital Repository for Museum Collections (DRMC)
 
FIWARE Global Summit - Professional Dashboards for Dummies - Build Your Smart...
FIWARE Global Summit - Professional Dashboards for Dummies - Build Your Smart...FIWARE Global Summit - Professional Dashboards for Dummies - Build Your Smart...
FIWARE Global Summit - Professional Dashboards for Dummies - Build Your Smart...
 
Introduction to Time Series Analytics with Microsoft Azure
Introduction to Time Series Analytics with Microsoft AzureIntroduction to Time Series Analytics with Microsoft Azure
Introduction to Time Series Analytics with Microsoft Azure
 
SmartCity IoT on Kubernetes and OpenStack
SmartCity IoT on Kubernetes and OpenStackSmartCity IoT on Kubernetes and OpenStack
SmartCity IoT on Kubernetes and OpenStack
 
Next Generation of Data Integration with Azure Data Factory by Tom Kerkhove
Next Generation of Data Integration with Azure Data Factory by Tom KerkhoveNext Generation of Data Integration with Azure Data Factory by Tom Kerkhove
Next Generation of Data Integration with Azure Data Factory by Tom Kerkhove
 
Shashi Raina [AWS] & Al Sargent [InfluxData] | Build Modern Monitoring with I...
Shashi Raina [AWS] & Al Sargent [InfluxData] | Build Modern Monitoring with I...Shashi Raina [AWS] & Al Sargent [InfluxData] | Build Modern Monitoring with I...
Shashi Raina [AWS] & Al Sargent [InfluxData] | Build Modern Monitoring with I...
 
BI + Big Data
BI + Big DataBI + Big Data
BI + Big Data
 
Big & Open Data: Challenges for Smartcity
Big & Open Data:  Challenges for SmartcityBig & Open Data:  Challenges for Smartcity
Big & Open Data: Challenges for Smartcity
 
Accelerating Digital Transformation with App Modernization
Accelerating Digital Transformation with App ModernizationAccelerating Digital Transformation with App Modernization
Accelerating Digital Transformation with App Modernization
 
Data Science at Netflix - Principles for Speed & Scale [Rev 2019 keynote]
Data Science at Netflix - Principles for Speed & Scale [Rev 2019 keynote]Data Science at Netflix - Principles for Speed & Scale [Rev 2019 keynote]
Data Science at Netflix - Principles for Speed & Scale [Rev 2019 keynote]
 
Extending Operations from On-premises Solutions Towards Hybrid and Cloud - Da...
Extending Operations from On-premises Solutions Towards Hybrid and Cloud - Da...Extending Operations from On-premises Solutions Towards Hybrid and Cloud - Da...
Extending Operations from On-premises Solutions Towards Hybrid and Cloud - Da...
 
Upgrading Made Easy: Moving to InfluxDB 2.x or InfluxDB Cloud with Cribl LogS...
Upgrading Made Easy: Moving to InfluxDB 2.x or InfluxDB Cloud with Cribl LogS...Upgrading Made Easy: Moving to InfluxDB 2.x or InfluxDB Cloud with Cribl LogS...
Upgrading Made Easy: Moving to InfluxDB 2.x or InfluxDB Cloud with Cribl LogS...
 

Similar to Efficient hardware acceleration of recommendation engines: a use case on collaborative filtering

Real-Time Analytics with Confluent and MemSQL
Real-Time Analytics with Confluent and MemSQLReal-Time Analytics with Confluent and MemSQL
Real-Time Analytics with Confluent and MemSQLSingleStore
 
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...Databricks
 
Apache Kafka for Smart Grid, Utilities and Energy Production
Apache Kafka for Smart Grid, Utilities and Energy ProductionApache Kafka for Smart Grid, Utilities and Energy Production
Apache Kafka for Smart Grid, Utilities and Energy ProductionKai Wähner
 
Ovh analytics data compute with apache spark as a service meetup ovh bordeaux
Ovh analytics data compute with apache spark as a service   meetup ovh bordeauxOvh analytics data compute with apache spark as a service   meetup ovh bordeaux
Ovh analytics data compute with apache spark as a service meetup ovh bordeauxMojtaba Imani
 
OVH Analytics Data Compute - Apache Spark Cluster as a Service
OVH Analytics Data Compute - Apache Spark Cluster as a ServiceOVH Analytics Data Compute - Apache Spark Cluster as a Service
OVH Analytics Data Compute - Apache Spark Cluster as a ServiceOVHcloud
 
How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?OVHcloud
 
Computaris builds analytics solution for large datacenter network traffic
Computaris builds analytics solution for large datacenter network trafficComputaris builds analytics solution for large datacenter network traffic
Computaris builds analytics solution for large datacenter network trafficComputaris
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes John Archer
 
Seven Ways to Boost Artificial Intelligence Research
Seven Ways to Boost Artificial Intelligence ResearchSeven Ways to Boost Artificial Intelligence Research
Seven Ways to Boost Artificial Intelligence ResearchNVIDIA
 
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes EverythingThe Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes EverythingKai Wähner
 
Apache kafka event_streaming___kai_waehner
Apache kafka event_streaming___kai_waehnerApache kafka event_streaming___kai_waehner
Apache kafka event_streaming___kai_waehnerconfluent
 
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022InfluxData
 
HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...
HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...
HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...Linaro
 
Immutable Infrastructure
Immutable InfrastructureImmutable Infrastructure
Immutable Infrastructurestrikr .
 
Real-time processing of large amounts of data
Real-time processing of large amounts of dataReal-time processing of large amounts of data
Real-time processing of large amounts of dataconfluent
 
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataBig Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataMatt Stubbs
 
Real-World, Open Source, End-to-End JavaScript in IoT
Real-World, Open Source, End-to-End JavaScript in IoTReal-World, Open Source, End-to-End JavaScript in IoT
Real-World, Open Source, End-to-End JavaScript in IoTAll Things Open
 
FIWARE Overview (University Cairo 20Aug2017)
FIWARE Overview (University Cairo 20Aug2017)FIWARE Overview (University Cairo 20Aug2017)
FIWARE Overview (University Cairo 20Aug2017)FIWARE
 

Similar to Efficient hardware acceleration of recommendation engines: a use case on collaborative filtering (20)

VINEYARD Spynq framework -SAMOS 2017
VINEYARD Spynq framework -SAMOS 2017VINEYARD Spynq framework -SAMOS 2017
VINEYARD Spynq framework -SAMOS 2017
 
Real-Time Analytics with Confluent and MemSQL
Real-Time Analytics with Confluent and MemSQLReal-Time Analytics with Confluent and MemSQL
Real-Time Analytics with Confluent and MemSQL
 
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...
 
Apache Kafka for Smart Grid, Utilities and Energy Production
Apache Kafka for Smart Grid, Utilities and Energy ProductionApache Kafka for Smart Grid, Utilities and Energy Production
Apache Kafka for Smart Grid, Utilities and Energy Production
 
Ovh analytics data compute with apache spark as a service meetup ovh bordeaux
Ovh analytics data compute with apache spark as a service   meetup ovh bordeauxOvh analytics data compute with apache spark as a service   meetup ovh bordeaux
Ovh analytics data compute with apache spark as a service meetup ovh bordeaux
 
OVH Analytics Data Compute - Apache Spark Cluster as a Service
OVH Analytics Data Compute - Apache Spark Cluster as a ServiceOVH Analytics Data Compute - Apache Spark Cluster as a Service
OVH Analytics Data Compute - Apache Spark Cluster as a Service
 
How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?
 
Computaris builds analytics solution for large datacenter network traffic
Computaris builds analytics solution for large datacenter network trafficComputaris builds analytics solution for large datacenter network traffic
Computaris builds analytics solution for large datacenter network traffic
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
 
Seven Ways to Boost Artificial Intelligence Research
Seven Ways to Boost Artificial Intelligence ResearchSeven Ways to Boost Artificial Intelligence Research
Seven Ways to Boost Artificial Intelligence Research
 
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes EverythingThe Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
 
Apache kafka event_streaming___kai_waehner
Apache kafka event_streaming___kai_waehnerApache kafka event_streaming___kai_waehner
Apache kafka event_streaming___kai_waehner
 
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022
 
HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...
HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...
HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...
 
Immutable Infrastructure
Immutable InfrastructureImmutable Infrastructure
Immutable Infrastructure
 
AI + E-commerce
AI + E-commerceAI + E-commerce
AI + E-commerce
 
Real-time processing of large amounts of data
Real-time processing of large amounts of dataReal-time processing of large amounts of data
Real-time processing of large amounts of data
 
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataBig Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
 
Real-World, Open Source, End-to-End JavaScript in IoT
Real-World, Open Source, End-to-End JavaScript in IoTReal-World, Open Source, End-to-End JavaScript in IoT
Real-World, Open Source, End-to-End JavaScript in IoT
 
FIWARE Overview (University Cairo 20Aug2017)
FIWARE Overview (University Cairo 20Aug2017)FIWARE Overview (University Cairo 20Aug2017)
FIWARE Overview (University Cairo 20Aug2017)
 

Recently uploaded

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 

Recently uploaded (20)

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 

Efficient hardware acceleration of recommendation engines: a use case on collaborative filtering

  • 1. Konstantinos Katsantonis, Christoforos Kachris, Dimitrios Soudris kachris@microlab.ntua.gr ICCS-National Technical University of Athens ARC , May 2018 Efficient hardware acceleration of recommendation engines: a use case on collaborative filtering
  • 2. www.vineyard-h2020.eu Recommendation engines Market size 2 • The global recommendation engine market, is expected to grow from USD 801.1 Million in 2017 to USD 4414.8 Million by 2022, at a Compound Annual Growth Rate (CAGR) of 40.7% during the forecast period. • The collaborative filtering technique uses a large volume of information, such as users' behavior, preferences, and activities from the past records to segment users based on similarity of likings • https://www.prnewswire.com/news-releases/global-recommendation-engine-market-2018-2022---key- innovators-include-fuzzyai--infinite-analytics-300624258.html Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 3. www.vineyard-h2020.eu Objectives Christoforos Kachris, ICCS-NTUA, ARC 2018 3 • Design Space Exploration of a Recommendation System using Matrix Factorization trained by Alternating Least Squares. • Efficient mapping in reconfigurable computing using High- Level Synthesis (HLS). • Performance and power evaluation. • Creation of a python interface for the accelerator. • Integration with the Spark framework through python.
  • 4. www.vineyard-h2020.eu Algorithm overview • Predict The Missing/Unknown ratings using the following model Christoforos Kachris, ICCS-NTUA, ARC 2018 4
  • 5. www.vineyard-h2020.eu Architecture • For every user and every movie solve the following system • Efficient mapping on BRAM and comput. resources Christoforos Kachris, ICCS-NTUA, ARC 2018 5
  • 6. www.vineyard-h2020.eu Prototype on Zedboard Christoforos Kachris, ICCS-NTUA, ARC 2018 6 Software - Kernel interface version 1 Software - Kernel interface version 2 Software - Kernel interface version 3 AXI4-Stream AXI4-Stream AXI4-Stream Hand Written Driver Xilinx IP “Accelerator FIFO Adapter” Xilinx IP “Accelerator FIFO Adapter” 2-dimensional Zero Padding 2-dimensional Zero Padding 1-dimensional Zero Padding Fixed Input Data Window Dimensions Fixed Input Data Window Dimensions Variable Size Input Windows, Determined on Runtime
  • 7. www.vineyard-h2020.eu Pynq: Python Productivity for Zynq • An open-source project from Xilinx that makes it easy to design embedded systems with Zynq MPSoCs. • The APSoC is programmed using Python. • The code is developed and tested directly on the PYNQ-Z1 board. • The programmable logic circuits are imported as hardware libraries and programmed through their APIs in essentially the same way as the software libraries. 7 Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 8. www.vineyard-h2020.eu Apache Spark The largest open source project in data processing. • Structured Data • Streaming Analytics • Machine Learning • Graph Computation Provides an interface for programming entire clusters with implicit data parallelism and fault- tolerance. 8 Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 9. www.vineyard-h2020.eu Spynq: Spark on Pynq integration 9 Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 10. www.vineyard-h2020.eu Spark integration on Heterogeneous MPSoC 10 Recommendation engine Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 11. www.vineyard-h2020.eu Pynq and Spark integration Christoforos Kachris, ICCS-NTUA, ARC 2018 11
  • 12. www.vineyard-h2020.eu Cluster of Zynq (Pynq devices) running Spark 12 Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 14. www.vineyard-h2020.eu Results • Kernel Integration with ALS algorithm. Acceleration achieved using Zedboard with input datasets movielens 100k & 1m vs arm-only execution. Christoforos Kachris, ICCS-NTUA, ARC 2018 14
  • 15. www.vineyard-h2020.eu Results • System Integration with ALS algorithm. Acceleration achieved using PyNQ with input datasets movielens 100k & 1m vs arm-only execution. Christoforos Kachris, ICCS-NTUA, ARC 2018 15
  • 16. www.vineyard-h2020.eu Xeon vs 4-node Pynq cluster • Xeon outperformed the Spark Cluster on the proposed schema, due to the enormous data transfers required by ALS, over Ethernet and the additional software computations introduced by Spark. Christoforos Kachris, ICCS-NTUA, ARC 2018 16
  • 17. www.vineyard-h2020.eu Power consumption Christoforos Kachris, ICCS-NTUA, ARC 2018 17 12x Less Energy 27x Less Energy
  • 18. www.vineyard-h2020.eu Available on github 18 Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 19. www.vineyard-h2020.eu VINEYARD Framework 19 • Accelerators stored in an AppStore • Cloud users request accelerators based on applications requirements • Decouple Hardware – Software designers Cloud computing Applications VINEYARD Cloud Resource Manager 3rd party IP developersLibrary of Hardware accelerators as IP Blocks Heterogeneous Data Center DFE Processors Dataflow Proc.+FPGA IP Accelerator’s App store Cloud tenants Acc Acc Acc Acc DFE DFE DFE Accelerator Controller Accelerator Virtualization Scheduler Accelerator API Performance Energy Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 20. www.vineyard-h2020.eu Main goals VINEYARD AIMS TO • Build an integrated platform for energy-efficient data centres based on novel programmable hardware accelerators • Develop a high-level programming framework and big data infrastructure for allowing end-users to seamlessly utilize these accelerators in heterogeneous computing systems by employing typical data-centre programming frameworks (i.e. Spark.). • VINEYARD also foster the establishment of an ecosystem that will empower open innovation based on hardware accelerators as data-centre plugins for marketplace, thereby facilitating innovative enterprises (large industries, SMEs, and creative start-ups) to develop novel solutions using VINEYARDS’s leading edge developments. 20 Christoforos Kachris, ICCS-NTUA, ARC 2018
  • 21. • Speedup your application seamlessly • An integrated framework for the utilization of hardware accelerators in HPC and data center seamlessly Contact detais: kachris@microlab.ntua.gr