SlideShare a Scribd company logo
1
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
Heterogeneous HPC Computing
in the DeepHealth Project
José Flich (UPV)
Monica Caballero (everis)
European Big Data Value Forum (EBDVF) 2019
15 October 2019, Helsinki
2
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
About DeepHealth
Aim & Goals
§ Facilitate the daily work and increase the productivity of medical personnel and IT professionals in terms of image
processing and the use and training of predictive models without the need of combining numerous tools.
§ Offer a unified framework adapted to exploit underlying heterogeneous HPC and Big Data architectures
supporting state-of-the-art and next-generation Deep Learning (AI) and Computer Vision algorithms to enhance
European-based medical software platforms.
§ Put HPC computing power at the service of biomedical applications with DL needs and, through an
interdisciplinary approach, apply DL techniques on large and complex image biomedical datasets to support new and
more efficient ways of diagnosis, monitoring and treatment of diseases.
Duration: 36 months
Starting date: Jan 2019
Budget 14.642.366 €
EU funding 12.774.824 €
21 partners from 9 countries: Research
centers, Health organizations, large industries
and SMEs
3
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
About DeepHealth
• The DeepHealth toolkit: Free and open-source software with two core technology libraries and a dedicated
front-end.
• EDDLL: The European Distributed Deep Learning Library
• ECVL: the European Computer Vision Library
• Ready to run algorithms on Hybrid HPC + Big Data architectures with heterogeneous hardware
• Seven biomedical and AI software platforms will integrate the DeepHealth libraries to improve their
potential.
Use-cases
• 14 pilot test-beds in 3 areas:
• Neurological diseases
• Tumor detection and early cancer prediction
• Digital pathology and automated image annotation.
• Pilots will allow to train models and evaluate the performance of the proposed solutions in terms of time
and accuracy.
Expected results
4
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
DeepHealth HPC Goals
5
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
DeepHealth Goals
• Develop a European Distributed Deep-Learning Library (EDDL)
• Develop a European Computer Vision Library (ECVL)
• Adapt EDDL/ECVL to HPC infrastructure
• Heterogeneous Architectures
• Apply the EDDL/ECVL to 7 European Platforms for Medical applications
• Apply the DeepHealth solution to 14 use cases (pilots) for medical diagnosis
development adaptation use
6
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
HPC Goals and Related Challenges
• Adapt EDDL and ECVL libraries to HPC infrastructure
• Computation
• CPUs, GPUs, FPGAs
• Communication
• Distribution of training process
• KPI
• 4X performance improvement and 7X better power efficiency for target
DeepHealth infrastructure with advanced HPC technologies
(combining manycores with vectorial units, GPUs, FPGAs, and low-
latency interconnects) compared to standard HPC infrastructure
7
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
Platform
Platform
Platform
Challenges
At different levels
EDDL
library
ECVL
library
Use case
Heterog.HPC
CPU CPU CPU GPU GPU GPU FPGA FPGA FPGA FPGA
Interconnect
Use caseUse case
Use caseUse caseUse case
• Develop EDDL/ECVL
• Adapt Platforms
• Adapt Use Cases
• Adapt HPC
• computation, runtime, distribution, interconnect
1
1
1
2 2
3
3 3
4
4 4
4 4 4
4
Implementation Challenge:
Adapting new libraries (for performance)
as they are being implemented and tested
8
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
Types of Systems
Heterogeneity support
CPU
GPU
Interconnect
CPU
GPU
CPU GPU CPU
GPU
CPU
GPU
Interconnect
CPU
GPU
CPU
FPGA
CPU
FPGA
CPU
Interconnect
CPU CPU CPU
CPU
GPU
Interconnect
GPU
CPU
GPU
GPU
CPU
GPU
GPU
CPU
GPU
Interconnect
CPU
GPU
CPU
FPGA
FPGA
GPU
9
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
DeepHealth HPC Goals
• Reinvest in FET-HPC projects (MANGO)
• Large FPGA cluster for heterogeneous HPC Exploration
10
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
Target HPC Systems
11
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
MareNostrum 4
Total	peak	performance:	13,7 Pflops
General	Purpose	Cluster:	 11.15	Pflops (1.07.2017)
CTE1-P9+Volta:	 1.57	Pflops (1.03.2018)
CTE2-Arm	V8:	 0.5	Pflops (????)
CTE3-KNH?:	 0.5	Pflops (????)
MareNostrum 1
2004	– 42,3	Tflops
1st Europe	/	4th World
New	technologies	
MareNostrum 2
2006	– 94,2	Tflops
1st Europe	/	5th World
New	technologies
MareNostrum 3
2012	– 1,1	Pflops
12th Europe	/	36th World
MareNostrum 4
2017	– 11,1	Pflops
2nd Europe	/	13th World
New	technologies
12
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
BSC HPC Infrastructures
PUT YOUR SMART SUBTITLE HERE
• General Purpose Cluster (in production)
• 48 racks with 3456 nodes, each with 2 Intel Xeon Platinum proc.
• Total of 11.15 PFLOPs in Double Precision
• System with total of 165888 processors and 390TB of main memory
• 29th fastest supercomputer in top500, 7th fastest supercomputer in Europe
• CTE1-P9+VOLTA (in production)
• 54 nodes, each with 2 POWER9 proc., 4 Volta GPUs, 6.4TB NVMe
• Total of 1.57 PFLOPs in Double Precision
• Same node as Sierra supercomputer at LLNL (2nd fastest supercomputer in
top500)
• Suitable for HPC and Machine Learning workloads
13
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
BSC HPC Infrastructures
PUT YOUR SMART SUBTITLE HERE
• CTE2-Arm v8 (to be deployed in 2020)
• Same processor as in the future post-K supercomputer in Japan
• Targets Exascale workloads: 2.7 TFLOPS double precision compute power,
5.4 TFLOPS in single precision; 10.8 TFLOPS in half-precision (16 bits)
• HPC and AI convergence: up to 21.6 TOPS in 8-bit int precision
• 7nm technology; 48 cores; 4 stacks of 8GB HBM2 (total of 32GB)
• Novel 512-bit SVE ext. with specific instructions for machine learning
• Might be interesting as a cutting edge system by the end of DeepHealth
• Mont-Blanc 3 prototype (in production)
• 48 nodes, 2 processors/node (96 processors in total)
• Cavium Thunder X2 processor: 32-core Arm v8, 4-way SMT, up to 2.5GHz
• Targets HPC workloads in datacenters
• System with up to 3K cores and 12K threads
• Liquid cooling
14
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
MANGO prototype
From FET-HPC MANGO project
• 16 (interconnected) clusters, each with
• One Server node
• 12 FPGAs (lego system)
• Xilinx 7–series, Zynq-7000, Kintex Ultrascale+
• Intel Stratix-10
• DDR3, DDR4 pluggable memory modules
• Connections: PCIe Express Gen 2/3 lanes, 40Gbps QSFP
prototype
onecluster
15
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
PROD: Development of a customized FPGA-
based PCIe Board
• Based on latest Intel or Xilinx FPGA
technology (TBD)
• High bandwidth and low latency PCIe
interface for data exchange with host
• Modular peripherals (memories,
interfaces) - TBD
16
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
The DeepHealth Computing Infrastructure
Overview
COMPSs
Global Resource Manage
(Slurm-based)
Distributed Programming Model
(e.g., M/R, task-based)
Non-functional
requirements description
API provided to ECVL and EDDLL developers (WP2/WP3)
Parallel
Run-time
Netlist Partitioning
Vivado tools
N2D2
framework
Mango
Run-time
Mango
Cluster
MareNostrum 4 (Intel)
Arm ThunderX2
POWER9+Voltas Cluster
Private (NVIDIA)
+ Public Cloud
DeepHealth HPC HW Resources DeepHealth Cloud HW Resources
OpenStack
platform
Parallel Programming Models
(e.g., CUDA, OpenCL, OpenMP)
Cloud
API
DeepHealth SW Architecture
Private Cloud
(x86+NVIDIA T4)Tailored FPGA PCIe card
1200 cores
cluster (x86)
BSC
UNITO
PROD
UPV
UNITOTREE
Programming models and access methods for
EDDLL and ECVL development
The DeepHealth computing infrastructure including
HPC and big-data cloud-based resources
Multiple Workloads Scheduling
Single Workload
Scheduling
Container-based
(Parallel) Programming Models
HW
EDDLL workload
(e.g., training)
EDDL workload
(e.g., inference)
Single Workload
Scheduling
17
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
COMPSs
• Framework (programming model + runtime system) to develop parallel
applications for distributed infrastructures
• Abstract model: exposes parallelism while hides the infrastructure
• Agnostic of computing platform
• Task-based programming model build on top of general purpose sequential
programming languages (Python, C, C++, Java)
def display(c):
…
def add(a, b, c):
c = a + b
for i in range(MSIZE):
add(A[i],B[i],C[i])
display(C)
@task(c=INOUT)
def display(c):
…
@task(a=IN,b=IN,c=OUT)
def add(a, b, c):
c = a + b
for i in range(MSIZE):
add(A[i],B[i],C[i])
display(C)
ad
d
ad
d
ad
d
dis
pla
y
…
MSIZE
18
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
EPFL: Multi-objective RM policies
• Power/performance/accuracy-aware
runtime resource management policies
• Automatic selection of the most efficient
resources
• Adding one new axis: accuracy!
• Heuristics, ML-based and hyper-heuristic
RM policies (algorithms)
• Single-node: selection of accelerators
(allocation), DVFS settings
• Multiple nodes (Global RM of MANGO)
• Integrated with DeepHealth SW stack
• MANGO API + COMPS + Slurm
19
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
Data Parallelism
• Training batch distribution
• Gradient collection and weights distribution
• AllReduce, Broadcast support to be exploited
• Different strategies will be implemented and evaluated
• Synchronization primitives (relaxed models)
CPU
GPU
Interconnect
CPU GPU CPU
FPGA
FPGA
GPU
High Pressure
on the
Interconnect
20
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
Netlist partitioning (CEA)
• Use a multi-FPGA platform as a single virtual large FPGA
• For very large inference networks that do not fit into a single
FPGA
• Direct IO-to-IO connection between FPGAs
• Optimized partitioning of the netlist into several netlists
• Combinatiorial optimization model, taking into account
critical paths & resource quantities in each FPGA
• Several state-of-the-art optimization methods, from
Kernighan-Lin to simulated annealing
• Execution of the design on the multi-FPGA platform
• Multiplexing of signals to deal with the limited
interconnection
21
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
Heterogeneous Computing
• DeepLearning and Computer Vision kernels to be deployed for
• CPU
• Math processing routines (MKL, Eigen)
• GPU
• CUDA vs OpenCL programming
• FPGA
• OpenCL vs HLS vs RTL programming
• Intel/Altera vs Xilinx platforms
22
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
HPC Things to Explore in DeepHealth
• Communication impact
• Will the network become the bottleneck?
• Use cases sizes
• Accuracy vs performance trade-off
• FPGA suitability for Training (Floating point precision requirement)
• Will be energy efficient for such large challenge?
• Which FPGA devices will perform better (accuracy vs. energy trade-off)
• Scalability of the solution (EDDL/ECVL)
• Will perform well on any end-used HPC-like platform?
• … so, ahead a challenging future for DeepHealth HPC teams!
23
The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.
José Flich (jflich@disca.upv.es)
Mónica Caballero (monica.caballero.galeote@everis.com)
Thank you!

More Related Content

Similar to Heterogeneous HPC Computing in the DeepHealth Project

Virtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Virtual BenchLearning - DeepHealth - Needs & Requirements for BenchmarkingVirtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Virtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Big Data Value Association
 
High-Performance Computing Research in Europe
High-Performance Computing Research in EuropeHigh-Performance Computing Research in Europe
High-Performance Computing Research in Europe
Govnet Events
 
Optalysis: Disruptive Optical Processing Technology for HPC
Optalysis: Disruptive Optical Processing Technology for HPCOptalysis: Disruptive Optical Processing Technology for HPC
Optalysis: Disruptive Optical Processing Technology for HPC
inside-BigData.com
 
e-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right jobe-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right job
David Wallom
 
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos
 
LCG project description
LCG project descriptionLCG project description
LCG project description
louisponcet
 
An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...
cneudecker
 
DATE 2018
DATE 2018DATE 2018
DATE 2018
LEGATO project
 
Available HPC Resources at CSUC
Available HPC Resources at CSUCAvailable HPC Resources at CSUC
OpenPOWER System Marconi100
OpenPOWER System Marconi100OpenPOWER System Marconi100
OpenPOWER System Marconi100
Ganesan Narayanasamy
 
Deep Hybrid DataCloud
Deep Hybrid DataCloudDeep Hybrid DataCloud
Deep Hybrid DataCloud
EOSC-hub project
 
eROSA Policy WS2: European Open Science Cloud (EOSC) - The Perspective of e-I...
eROSA Policy WS2: European Open Science Cloud (EOSC) - The Perspective of e-I...eROSA Policy WS2: European Open Science Cloud (EOSC) - The Perspective of e-I...
eROSA Policy WS2: European Open Science Cloud (EOSC) - The Perspective of e-I...
e-ROSA
 
DRAC: Designing RISC-V-based Accelerators for next generation Computers
DRAC: Designing RISC-V-based Accelerators for next generation ComputersDRAC: Designing RISC-V-based Accelerators for next generation Computers
DRAC: Designing RISC-V-based Accelerators for next generation Computers
Facultad de Informática UCM
 
Design phase kick-off event and Ceremony
Design phase kick-off event and CeremonyDesign phase kick-off event and Ceremony
Design phase kick-off event and Ceremony
Archiver
 
Reducing Infrastructure and Service Fragmentation
Reducing Infrastructure and Service Fragmentation Reducing Infrastructure and Service Fragmentation
Reducing Infrastructure and Service Fragmentation
EOSCpilot .eu
 
EUDAT Generic Execution Framework
EUDAT Generic Execution FrameworkEUDAT Generic Execution Framework
EUDAT Generic Execution Framework
EUDAT
 
2019 05-21 egi and eosc - final
2019 05-21 egi and eosc - final2019 05-21 egi and eosc - final
2019 05-21 egi and eosc - final
EOSC-hub project
 
EGI Engage: Impact & Results
EGI Engage: Impact & ResultsEGI Engage: Impact & Results
EGI Engage: Impact & Results
EGI Federation
 
Hopsworks - ExtremeEarth Open Workshop
Hopsworks - ExtremeEarth Open WorkshopHopsworks - ExtremeEarth Open Workshop
Hopsworks - ExtremeEarth Open Workshop
ExtremeEarth
 
The EOSC Compute Platform with the EGI-ACE project
The EOSC Compute Platform with the EGI-ACE project The EOSC Compute Platform with the EGI-ACE project
The EOSC Compute Platform with the EGI-ACE project
EGI Federation
 

Similar to Heterogeneous HPC Computing in the DeepHealth Project (20)

Virtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Virtual BenchLearning - DeepHealth - Needs & Requirements for BenchmarkingVirtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
Virtual BenchLearning - DeepHealth - Needs & Requirements for Benchmarking
 
High-Performance Computing Research in Europe
High-Performance Computing Research in EuropeHigh-Performance Computing Research in Europe
High-Performance Computing Research in Europe
 
Optalysis: Disruptive Optical Processing Technology for HPC
Optalysis: Disruptive Optical Processing Technology for HPCOptalysis: Disruptive Optical Processing Technology for HPC
Optalysis: Disruptive Optical Processing Technology for HPC
 
e-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right jobe-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right job
 
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...Gergely Sipos (EGI): Exploiting scientific data in the international context ...
Gergely Sipos (EGI): Exploiting scientific data in the international context ...
 
LCG project description
LCG project descriptionLCG project description
LCG project description
 
An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...An Experimental Workflow Development Platform for Historical Document Digitis...
An Experimental Workflow Development Platform for Historical Document Digitis...
 
DATE 2018
DATE 2018DATE 2018
DATE 2018
 
Available HPC Resources at CSUC
Available HPC Resources at CSUCAvailable HPC Resources at CSUC
Available HPC Resources at CSUC
 
OpenPOWER System Marconi100
OpenPOWER System Marconi100OpenPOWER System Marconi100
OpenPOWER System Marconi100
 
Deep Hybrid DataCloud
Deep Hybrid DataCloudDeep Hybrid DataCloud
Deep Hybrid DataCloud
 
eROSA Policy WS2: European Open Science Cloud (EOSC) - The Perspective of e-I...
eROSA Policy WS2: European Open Science Cloud (EOSC) - The Perspective of e-I...eROSA Policy WS2: European Open Science Cloud (EOSC) - The Perspective of e-I...
eROSA Policy WS2: European Open Science Cloud (EOSC) - The Perspective of e-I...
 
DRAC: Designing RISC-V-based Accelerators for next generation Computers
DRAC: Designing RISC-V-based Accelerators for next generation ComputersDRAC: Designing RISC-V-based Accelerators for next generation Computers
DRAC: Designing RISC-V-based Accelerators for next generation Computers
 
Design phase kick-off event and Ceremony
Design phase kick-off event and CeremonyDesign phase kick-off event and Ceremony
Design phase kick-off event and Ceremony
 
Reducing Infrastructure and Service Fragmentation
Reducing Infrastructure and Service Fragmentation Reducing Infrastructure and Service Fragmentation
Reducing Infrastructure and Service Fragmentation
 
EUDAT Generic Execution Framework
EUDAT Generic Execution FrameworkEUDAT Generic Execution Framework
EUDAT Generic Execution Framework
 
2019 05-21 egi and eosc - final
2019 05-21 egi and eosc - final2019 05-21 egi and eosc - final
2019 05-21 egi and eosc - final
 
EGI Engage: Impact & Results
EGI Engage: Impact & ResultsEGI Engage: Impact & Results
EGI Engage: Impact & Results
 
Hopsworks - ExtremeEarth Open Workshop
Hopsworks - ExtremeEarth Open WorkshopHopsworks - ExtremeEarth Open Workshop
Hopsworks - ExtremeEarth Open Workshop
 
The EOSC Compute Platform with the EGI-ACE project
The EOSC Compute Platform with the EGI-ACE project The EOSC Compute Platform with the EGI-ACE project
The EOSC Compute Platform with the EGI-ACE project
 

More from Big Data Value Association

Data Privacy, Security in personal data sharing
Data Privacy, Security in personal data sharingData Privacy, Security in personal data sharing
Data Privacy, Security in personal data sharing
Big Data Value Association
 
Key Modules for a trsuted and privacy preserving personal data marketplace
Key Modules for a trsuted and privacy preserving personal data marketplaceKey Modules for a trsuted and privacy preserving personal data marketplace
Key Modules for a trsuted and privacy preserving personal data marketplace
Big Data Value Association
 
GDPR and Data Ethics considerations in personal data sharing
GDPR and Data Ethics considerations in personal data sharingGDPR and Data Ethics considerations in personal data sharing
GDPR and Data Ethics considerations in personal data sharing
Big Data Value Association
 
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Big Data Value Association
 
Three pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Three pillars for building a Smart Data Ecosystem: Trust, Security and PrivacyThree pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Three pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Big Data Value Association
 
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Big Data Value Association
 
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
Big Data Value Association
 
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
Big Data Value Association
 
BDV Skills Accreditation - EIT labels for professionals
BDV Skills Accreditation - EIT labels for professionalsBDV Skills Accreditation - EIT labels for professionals
BDV Skills Accreditation - EIT labels for professionals
Big Data Value Association
 
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
Big Data Value Association
 
BDV Skills Accreditation - Objectives of the workshop
BDV Skills Accreditation - Objectives of the workshopBDV Skills Accreditation - Objectives of the workshop
BDV Skills Accreditation - Objectives of the workshop
Big Data Value Association
 
BDV Skills Accreditation - Welcome introduction to the workshop
BDV Skills Accreditation - Welcome introduction to the workshopBDV Skills Accreditation - Welcome introduction to the workshop
BDV Skills Accreditation - Welcome introduction to the workshop
Big Data Value Association
 
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
Big Data Value Association
 
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector WebinarBigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
Big Data Value Association
 
BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar
BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector WebinarBigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar
BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar
Big Data Value Association
 
Virtual BenchLearning - Data Bench Framework
Virtual BenchLearning - Data Bench FrameworkVirtual BenchLearning - Data Bench Framework
Virtual BenchLearning - Data Bench Framework
Big Data Value Association
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Big Data Value Association
 
Policy Cloud Data Driven Policies against Radicalisation - Technical Overview
Policy Cloud Data Driven Policies against Radicalisation - Technical OverviewPolicy Cloud Data Driven Policies against Radicalisation - Technical Overview
Policy Cloud Data Driven Policies against Radicalisation - Technical Overview
Big Data Value Association
 
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Big Data Value Association
 
Policy Cloud Data Driven Policies against Radicalisation
Policy Cloud Data Driven Policies against RadicalisationPolicy Cloud Data Driven Policies against Radicalisation
Policy Cloud Data Driven Policies against Radicalisation
Big Data Value Association
 

More from Big Data Value Association (20)

Data Privacy, Security in personal data sharing
Data Privacy, Security in personal data sharingData Privacy, Security in personal data sharing
Data Privacy, Security in personal data sharing
 
Key Modules for a trsuted and privacy preserving personal data marketplace
Key Modules for a trsuted and privacy preserving personal data marketplaceKey Modules for a trsuted and privacy preserving personal data marketplace
Key Modules for a trsuted and privacy preserving personal data marketplace
 
GDPR and Data Ethics considerations in personal data sharing
GDPR and Data Ethics considerations in personal data sharingGDPR and Data Ethics considerations in personal data sharing
GDPR and Data Ethics considerations in personal data sharing
 
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
Intro - Three pillars for building a Smart Data Ecosystem: Trust, Security an...
 
Three pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Three pillars for building a Smart Data Ecosystem: Trust, Security and PrivacyThree pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
Three pillars for building a Smart Data Ecosystem: Trust, Security and Privacy
 
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
Market into context - Three pillars for building a Smart Data Ecosystem: Trus...
 
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
BDV Skills Accreditation - Future of digital skills in Europe reskilling and ...
 
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
BDV Skills Accreditation - Big Data skilling in Emilia-Romagna
 
BDV Skills Accreditation - EIT labels for professionals
BDV Skills Accreditation - EIT labels for professionalsBDV Skills Accreditation - EIT labels for professionals
BDV Skills Accreditation - EIT labels for professionals
 
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
BDV Skills Accreditation - Recognizing Data Science Skills with BDV Data Scie...
 
BDV Skills Accreditation - Objectives of the workshop
BDV Skills Accreditation - Objectives of the workshopBDV Skills Accreditation - Objectives of the workshop
BDV Skills Accreditation - Objectives of the workshop
 
BDV Skills Accreditation - Welcome introduction to the workshop
BDV Skills Accreditation - Welcome introduction to the workshopBDV Skills Accreditation - Welcome introduction to the workshop
BDV Skills Accreditation - Welcome introduction to the workshop
 
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
BDV Skills Accreditation - Definition and ensuring of digital roles and compe...
 
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector WebinarBigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
BigDataPilotDemoDays - I BiDaaS Application to the Manufacturing Sector Webinar
 
BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar
BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector WebinarBigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar
BigDataPilotDemoDays - I-BiDaaS Application to the Financial Sector Webinar
 
Virtual BenchLearning - Data Bench Framework
Virtual BenchLearning - Data Bench FrameworkVirtual BenchLearning - Data Bench Framework
Virtual BenchLearning - Data Bench Framework
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
 
Policy Cloud Data Driven Policies against Radicalisation - Technical Overview
Policy Cloud Data Driven Policies against Radicalisation - Technical OverviewPolicy Cloud Data Driven Policies against Radicalisation - Technical Overview
Policy Cloud Data Driven Policies against Radicalisation - Technical Overview
 
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
Policy Cloud Data Driven Policies against Radicalisation - Participatory poli...
 
Policy Cloud Data Driven Policies against Radicalisation
Policy Cloud Data Driven Policies against RadicalisationPolicy Cloud Data Driven Policies against Radicalisation
Policy Cloud Data Driven Policies against Radicalisation
 

Recently uploaded

一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
theahmadsaood
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 

Recently uploaded (20)

一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 

Heterogeneous HPC Computing in the DeepHealth Project

  • 1. 1 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. Heterogeneous HPC Computing in the DeepHealth Project José Flich (UPV) Monica Caballero (everis) European Big Data Value Forum (EBDVF) 2019 15 October 2019, Helsinki
  • 2. 2 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. About DeepHealth Aim & Goals § Facilitate the daily work and increase the productivity of medical personnel and IT professionals in terms of image processing and the use and training of predictive models without the need of combining numerous tools. § Offer a unified framework adapted to exploit underlying heterogeneous HPC and Big Data architectures supporting state-of-the-art and next-generation Deep Learning (AI) and Computer Vision algorithms to enhance European-based medical software platforms. § Put HPC computing power at the service of biomedical applications with DL needs and, through an interdisciplinary approach, apply DL techniques on large and complex image biomedical datasets to support new and more efficient ways of diagnosis, monitoring and treatment of diseases. Duration: 36 months Starting date: Jan 2019 Budget 14.642.366 € EU funding 12.774.824 € 21 partners from 9 countries: Research centers, Health organizations, large industries and SMEs
  • 3. 3 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. About DeepHealth • The DeepHealth toolkit: Free and open-source software with two core technology libraries and a dedicated front-end. • EDDLL: The European Distributed Deep Learning Library • ECVL: the European Computer Vision Library • Ready to run algorithms on Hybrid HPC + Big Data architectures with heterogeneous hardware • Seven biomedical and AI software platforms will integrate the DeepHealth libraries to improve their potential. Use-cases • 14 pilot test-beds in 3 areas: • Neurological diseases • Tumor detection and early cancer prediction • Digital pathology and automated image annotation. • Pilots will allow to train models and evaluate the performance of the proposed solutions in terms of time and accuracy. Expected results
  • 4. 4 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. DeepHealth HPC Goals
  • 5. 5 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. DeepHealth Goals • Develop a European Distributed Deep-Learning Library (EDDL) • Develop a European Computer Vision Library (ECVL) • Adapt EDDL/ECVL to HPC infrastructure • Heterogeneous Architectures • Apply the EDDL/ECVL to 7 European Platforms for Medical applications • Apply the DeepHealth solution to 14 use cases (pilots) for medical diagnosis development adaptation use
  • 6. 6 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. HPC Goals and Related Challenges • Adapt EDDL and ECVL libraries to HPC infrastructure • Computation • CPUs, GPUs, FPGAs • Communication • Distribution of training process • KPI • 4X performance improvement and 7X better power efficiency for target DeepHealth infrastructure with advanced HPC technologies (combining manycores with vectorial units, GPUs, FPGAs, and low- latency interconnects) compared to standard HPC infrastructure
  • 7. 7 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. Platform Platform Platform Challenges At different levels EDDL library ECVL library Use case Heterog.HPC CPU CPU CPU GPU GPU GPU FPGA FPGA FPGA FPGA Interconnect Use caseUse case Use caseUse caseUse case • Develop EDDL/ECVL • Adapt Platforms • Adapt Use Cases • Adapt HPC • computation, runtime, distribution, interconnect 1 1 1 2 2 3 3 3 4 4 4 4 4 4 4 Implementation Challenge: Adapting new libraries (for performance) as they are being implemented and tested
  • 8. 8 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. Types of Systems Heterogeneity support CPU GPU Interconnect CPU GPU CPU GPU CPU GPU CPU GPU Interconnect CPU GPU CPU FPGA CPU FPGA CPU Interconnect CPU CPU CPU CPU GPU Interconnect GPU CPU GPU GPU CPU GPU GPU CPU GPU Interconnect CPU GPU CPU FPGA FPGA GPU
  • 9. 9 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. DeepHealth HPC Goals • Reinvest in FET-HPC projects (MANGO) • Large FPGA cluster for heterogeneous HPC Exploration
  • 10. 10 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. Target HPC Systems
  • 11. 11 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. MareNostrum 4 Total peak performance: 13,7 Pflops General Purpose Cluster: 11.15 Pflops (1.07.2017) CTE1-P9+Volta: 1.57 Pflops (1.03.2018) CTE2-Arm V8: 0.5 Pflops (????) CTE3-KNH?: 0.5 Pflops (????) MareNostrum 1 2004 – 42,3 Tflops 1st Europe / 4th World New technologies MareNostrum 2 2006 – 94,2 Tflops 1st Europe / 5th World New technologies MareNostrum 3 2012 – 1,1 Pflops 12th Europe / 36th World MareNostrum 4 2017 – 11,1 Pflops 2nd Europe / 13th World New technologies
  • 12. 12 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. BSC HPC Infrastructures PUT YOUR SMART SUBTITLE HERE • General Purpose Cluster (in production) • 48 racks with 3456 nodes, each with 2 Intel Xeon Platinum proc. • Total of 11.15 PFLOPs in Double Precision • System with total of 165888 processors and 390TB of main memory • 29th fastest supercomputer in top500, 7th fastest supercomputer in Europe • CTE1-P9+VOLTA (in production) • 54 nodes, each with 2 POWER9 proc., 4 Volta GPUs, 6.4TB NVMe • Total of 1.57 PFLOPs in Double Precision • Same node as Sierra supercomputer at LLNL (2nd fastest supercomputer in top500) • Suitable for HPC and Machine Learning workloads
  • 13. 13 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. BSC HPC Infrastructures PUT YOUR SMART SUBTITLE HERE • CTE2-Arm v8 (to be deployed in 2020) • Same processor as in the future post-K supercomputer in Japan • Targets Exascale workloads: 2.7 TFLOPS double precision compute power, 5.4 TFLOPS in single precision; 10.8 TFLOPS in half-precision (16 bits) • HPC and AI convergence: up to 21.6 TOPS in 8-bit int precision • 7nm technology; 48 cores; 4 stacks of 8GB HBM2 (total of 32GB) • Novel 512-bit SVE ext. with specific instructions for machine learning • Might be interesting as a cutting edge system by the end of DeepHealth • Mont-Blanc 3 prototype (in production) • 48 nodes, 2 processors/node (96 processors in total) • Cavium Thunder X2 processor: 32-core Arm v8, 4-way SMT, up to 2.5GHz • Targets HPC workloads in datacenters • System with up to 3K cores and 12K threads • Liquid cooling
  • 14. 14 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. MANGO prototype From FET-HPC MANGO project • 16 (interconnected) clusters, each with • One Server node • 12 FPGAs (lego system) • Xilinx 7–series, Zynq-7000, Kintex Ultrascale+ • Intel Stratix-10 • DDR3, DDR4 pluggable memory modules • Connections: PCIe Express Gen 2/3 lanes, 40Gbps QSFP prototype onecluster
  • 15. 15 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. PROD: Development of a customized FPGA- based PCIe Board • Based on latest Intel or Xilinx FPGA technology (TBD) • High bandwidth and low latency PCIe interface for data exchange with host • Modular peripherals (memories, interfaces) - TBD
  • 16. 16 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. The DeepHealth Computing Infrastructure Overview COMPSs Global Resource Manage (Slurm-based) Distributed Programming Model (e.g., M/R, task-based) Non-functional requirements description API provided to ECVL and EDDLL developers (WP2/WP3) Parallel Run-time Netlist Partitioning Vivado tools N2D2 framework Mango Run-time Mango Cluster MareNostrum 4 (Intel) Arm ThunderX2 POWER9+Voltas Cluster Private (NVIDIA) + Public Cloud DeepHealth HPC HW Resources DeepHealth Cloud HW Resources OpenStack platform Parallel Programming Models (e.g., CUDA, OpenCL, OpenMP) Cloud API DeepHealth SW Architecture Private Cloud (x86+NVIDIA T4)Tailored FPGA PCIe card 1200 cores cluster (x86) BSC UNITO PROD UPV UNITOTREE Programming models and access methods for EDDLL and ECVL development The DeepHealth computing infrastructure including HPC and big-data cloud-based resources Multiple Workloads Scheduling Single Workload Scheduling Container-based (Parallel) Programming Models HW EDDLL workload (e.g., training) EDDL workload (e.g., inference) Single Workload Scheduling
  • 17. 17 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. COMPSs • Framework (programming model + runtime system) to develop parallel applications for distributed infrastructures • Abstract model: exposes parallelism while hides the infrastructure • Agnostic of computing platform • Task-based programming model build on top of general purpose sequential programming languages (Python, C, C++, Java) def display(c): … def add(a, b, c): c = a + b for i in range(MSIZE): add(A[i],B[i],C[i]) display(C) @task(c=INOUT) def display(c): … @task(a=IN,b=IN,c=OUT) def add(a, b, c): c = a + b for i in range(MSIZE): add(A[i],B[i],C[i]) display(C) ad d ad d ad d dis pla y … MSIZE
  • 18. 18 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. EPFL: Multi-objective RM policies • Power/performance/accuracy-aware runtime resource management policies • Automatic selection of the most efficient resources • Adding one new axis: accuracy! • Heuristics, ML-based and hyper-heuristic RM policies (algorithms) • Single-node: selection of accelerators (allocation), DVFS settings • Multiple nodes (Global RM of MANGO) • Integrated with DeepHealth SW stack • MANGO API + COMPS + Slurm
  • 19. 19 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. Data Parallelism • Training batch distribution • Gradient collection and weights distribution • AllReduce, Broadcast support to be exploited • Different strategies will be implemented and evaluated • Synchronization primitives (relaxed models) CPU GPU Interconnect CPU GPU CPU FPGA FPGA GPU High Pressure on the Interconnect
  • 20. 20 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. Netlist partitioning (CEA) • Use a multi-FPGA platform as a single virtual large FPGA • For very large inference networks that do not fit into a single FPGA • Direct IO-to-IO connection between FPGAs • Optimized partitioning of the netlist into several netlists • Combinatiorial optimization model, taking into account critical paths & resource quantities in each FPGA • Several state-of-the-art optimization methods, from Kernighan-Lin to simulated annealing • Execution of the design on the multi-FPGA platform • Multiplexing of signals to deal with the limited interconnection
  • 21. 21 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. Heterogeneous Computing • DeepLearning and Computer Vision kernels to be deployed for • CPU • Math processing routines (MKL, Eigen) • GPU • CUDA vs OpenCL programming • FPGA • OpenCL vs HLS vs RTL programming • Intel/Altera vs Xilinx platforms
  • 22. 22 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. HPC Things to Explore in DeepHealth • Communication impact • Will the network become the bottleneck? • Use cases sizes • Accuracy vs performance trade-off • FPGA suitability for Training (Floating point precision requirement) • Will be energy efficient for such large challenge? • Which FPGA devices will perform better (accuracy vs. energy trade-off) • Scalability of the solution (EDDL/ECVL) • Will perform well on any end-used HPC-like platform? • … so, ahead a challenging future for DeepHealth HPC teams!
  • 23. 23 The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111.The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825111. José Flich (jflich@disca.upv.es) Mónica Caballero (monica.caballero.galeote@everis.com) Thank you!