SlideShare a Scribd company logo
1 of 14
Download to read offline
Kouichi Hirai
FUJITSU LIMITED
Dec 12th, 2017
Post-K:
Building the Arm HPC Ecosystem
0 Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017
Post-K: Building up Arm HPC Ecosystem
 Fujitsu’s approach for HPC
 For making the Post-K a resounding success
 The high performance compiler increases software portability
 Summary
Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 1
Fujitsu HPC Solutions to Meet Customer Demands
 Supercomputers, both Fujitsu-developed CPUs and x86
 Single system image operation w/ Fujitsu system software
 High performance, high availability, and high reliability
Copyright 2017 FUJITSU LIMITED
x86 Cluster
RX2530/RX2540 CX600CX400
High scalability
with Fujitsu-
developed CPU
and interconnect
PRIMERGY x86
cluster systems
support the
latest CPUs and
accelerators
Under Development
w/ RIKEN
High
-end
Divisional
Departmental
Workgroup
PRIMEHPC FX10 PRIMEHPC FX100 Post-KK computer
Co-developed with RIKEN
© RIKEN
Large-Scale
SMP System
RX900
Linaro Work Shop, Dec. 12, 2017 2
Fujitsu High-end Supercomputers Development
2011 2012 2013 2014 2015 2016 2017 2018 2019 2020
PRIMEHPC FX10
 1.8x CPU perf. of K
 Easier installation
 4x(DP) / 8x(SP) CPU per. of K, Tofu2
 High-density pkg & lower energy
App.
review
FS
projects
HPCI strategic apps program
Operation of K computerDevelopment
Japan’s National Projects
FUJITSU
Post-K computer development
PRIMEHPC FX100
K computer and PRIMEHPC
FX10/FX100 in operation
The CPU and interconnect of
FX10/FX100 inherit the K computer
architectural concept, featuring state-of-
the-art technologies
System software “TCS” supports Fujitsu
supercomputer with originally introduced
technologies
Many applications are currently running
and being developed for science and
various industries
RIKEN and Fujitsu are working together
to provide a successor to K computer
with application R&D teams using co-
design approach
Technical Computing Suite (TCS)
Handles millions of parallel jobs
FEFS: super scalable file system
MPI: Ultra scalable collective
communication libraries
 OS: Lower OS jitter w/
assistant core
Copyright 2017 FUJITSU LIMITED
Post-K supercomputer
Post-K
Linaro Work Shop, Dec. 12, 2017 3
Post-K Features and Status
 Fujitsu CPU core (w/ Arm SVE) and Tofu maintain the programming models
and provide high application performance
 RIKEN & Fujitsu system software enable high performance and low power
consumption with flexible operations
 Apps from 9 “priority issues” & many “exploratory challenges” are being
optimized for the Post-K
Functions & architecture
Post-K FX100 FX10 K
CPU Core
Instruction set architecture Armv8-A SPARC V9
SIMD width 512bit 256bit 128bit 128bit
Double precision (64bit) ✔ ✔ ✔ ✔
Single precision (32bit) ✔ ✔ ✔ ✔
Half precision (16bit) ✔ - - -
Interconnect Tofu interconnect Enhanced Tofu2 Tofu Tofu
Copyright 2017 FUJITSU LIMITED
Post-K
Linaro Work Shop, Dec. 12, 2017 4
Post-K Software Stack
 Valuable feedbacks through “co-design” from application R&D teams
Post-K System Hardware
FUJITSU Technical Computing Suite / RIKEN Advanced System Software
Linux OS / McKernel (Lightweight Kernel)
Post-K Applications
System management
for highly available & power
saving operation
Job management for higher
system utilization & power
efficiency
Lustre-based
distributed file system
FEFS
OpenMP, COARRAY, Math Libs
Compilers (C, C++, Fortran)
Debugging and tuning tools
Management Software Programming EnvironmentHierarchical File I/O Software
MPI (Open MPI, MPICH)
XcalableMP
Application-oriented
file I/O middleware
Post-K
Under Development
w/ RIKEN
Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 5
Post-K to be More Useful?
 More apps from OSS & ISVs
High performance on “real” applications
Lower TCO
• Low power consumption
• Water cooling
De-facto standards
• Lowering barriers in developing and porting
Ecosystem
• More Arm platforms
• More partners
• More knowledge/experience inside/outside of communities
Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 6
Making the Post-K a Resounding Success
 Recapping the goal & requirements
 High performance HW and SW complying open standards
 Apps in quality & variety
 Environments – rich, modern, and comprehensive
 Our approach
 Arm architecture (w/ Fujitsu’s proven microarchitecture)
• SBSA: Server Base System Architecture
• SBBR: Server Base Boot Requirements
• VLA: Vector-Length Agnostic
 Fujitsu enhanced/maintained system software
• Based on Linux & OSSs
• Single source for x86 & Arm
• Open MPI, OpenMP, Libraries,
• Performance analyzer, Debugger
 Powerful but original compilers --- will be aligned to be useful & popular
Copyright 2017 FUJITSU LIMITED
Assure binary compatibility
Lowering barriers for single
source development
Linaro Work Shop, Dec. 12, 2017 7
 Transform our original & powerful compilers to be all-around
 Working and contributing for the Clang project to satisfy both high
performance and portability
 Fujitsu’s back-end advantage
 Auto-parallelization for many-core architecture
 Auto-vectorization for Scalable Vector Extension
 Strong software pipelining with loop fission
Compilers to Increase Software Portability
Copyright 2017 FUJITSU LIMITED
Utilize Post-K μArch:
• Rich & wide SIMD
• Sector cache…
Software:
Apps, Middleware,
and Basics (written
in variety of styles)
Portable
binariesFujitsu original
front-end
Fujitsu original
back-end from
knowledge of
CPU
development
Clang front-end Clang back-end
Linaro Work Shop, Dec. 12, 2017 8
Auto-vectorization for Arm SVE
 4 Byte x 16 SIMD List Memory Access by utilizing 512bit Register
 Various Types of SIMD Optimization by Utilizing Predicate Registers
Copyright 2017 FUJITSU LIMITED
for (int i=0; i<n; ++i) {
if (mask[i] !=0) { a[i] = b[i]; }
}
for (int i=0; i<VL/2; ++i) {
a[i] = b[i] * c[i];
}
do {
b[i] = a[i];
} while(a[i++] != 0);
Loop including IF clause
Small Loop less
than SIMD length
While Loop with
Data Dependency
SVE
Reg. dest.
Reg. index
int index[n]
float P[n], Q[n];
for (i=0; i<n; ++i) {
P[i] = Q[index[i]];
}
Q[14] Q[1] ・ Q[13] ・ Q[0] Q[3] Q[15] Q[2]
14 1 ・ 13 ・ 0 3 15 2
Memory Q [15] [14] [13] ・ ・ [3] [2] [1] [0]1
2
3
4
5
6
7
1
2
3
1
2
3
1
2
3
Linaro Work Shop, Dec. 12, 2017 9
Fujitsu Compiler Back-end Optimization Flow
 Loop Fission reduces required resources, such as registers
 Software Pipelining and Register Allocation
 Best utilization of hardware functions and resources
Copyright 2017 FUJITSU LIMITED
Back-end optimization pipeline
Portable
Arm
binaries
SIMDize
Loop
Fission
Software
Pipelining
Register
Allocation
Instruction
Scheduling
for (...) {
}
// Reduced # of Regs.
for (...) {
}
// Reduced # of Regs.
for (...) {
}
// Higher ILP
for (...) {
}
// Higher ILP
for (...) {
}
Software pipelined #1
Software pipelined #2
Divided # 1
Divided# 2
Original
1
2
3
4
5
6
7
8
9
1
2
3
4
5
6
7
8
9
1
2
3
4
5
6
7
8
9
Linaro Work Shop, Dec. 12, 2017 10
Copyright 2017 FUJITSU LIMITED
Effectiveness of SWP w/ Loop Fission and SoA
 Runs on FX100 w/ 32 registers
 72% speed-up per core is observed
 >2x speed-up compared w/ K computer
 Software Pipelining w/ Loop Fission
utilizes CPU resources
 SoA-style layout extracts more
NICAM* single core performance on FX100 w/ 32 regs
(Source: http://www.riken.jp/pr/topics/2013/20130920_1/)
CPUclocksnormalized
byKcomputer
*NICAM-DC-MINI: Climate simulations with fine mesh, https://github.com/fiber-miniapp/nicam-dc-mini
SWP w/
Loop
fission
+ SoA
style
72% speedup w/ loop fission + SoA
Without
Loop
fission
Linaro Work Shop, Dec. 12, 2017 11
Summary
 Fujitsu’s Approach to HPC
 Supporting high-end supercomputers with original CPU & x86 clusters
 Developing the Post-K for app performance and low power consumption
 Expecting more apps from OSS & ISVs through growing ecosystem
 Keys for Post-K Success
 High performance standard-compliant HW and SW
 All-around high performance compiler with binary compatibility
 Many and varied high quality apps with x86 software compatibility
 Open & Highly Optimized Compilers
 Clang + Fujitsu technologies
 Tentative evaluation results are encouraging
Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 12
Copyright 2017 FUJITSU LIMITED

More Related Content

What's hot

Performance evaluation with Arm HPC tools for SVE
Performance evaluation with Arm HPC tools for SVEPerformance evaluation with Arm HPC tools for SVE
Performance evaluation with Arm HPC tools for SVELinaro
 
Exploring the Programming Models for the LUMI Supercomputer
Exploring the Programming Models for the LUMI Supercomputer Exploring the Programming Models for the LUMI Supercomputer
Exploring the Programming Models for the LUMI Supercomputer George Markomanolis
 
Porting and Optimization of Numerical Libraries for ARM SVE
Porting and Optimization of Numerical Libraries for ARM SVEPorting and Optimization of Numerical Libraries for ARM SVE
Porting and Optimization of Numerical Libraries for ARM SVELinaro
 
Evaluating GPU programming Models for the LUMI Supercomputer
Evaluating GPU programming Models for the LUMI SupercomputerEvaluating GPU programming Models for the LUMI Supercomputer
Evaluating GPU programming Models for the LUMI SupercomputerGeorge Markomanolis
 
Arm tools and roadmap for SVE compiler support
Arm tools and roadmap for SVE compiler supportArm tools and roadmap for SVE compiler support
Arm tools and roadmap for SVE compiler supportLinaro
 
An evaluation of LLVM compiler for SVE with fairly complicated loops
An evaluation of LLVM compiler for SVE with fairly complicated loopsAn evaluation of LLVM compiler for SVE with fairly complicated loops
An evaluation of LLVM compiler for SVE with fairly complicated loopsLinaro
 
SGI: Meeting Manufacturing's Need for Production Supercomputing
SGI: Meeting Manufacturing's Need for Production SupercomputingSGI: Meeting Manufacturing's Need for Production Supercomputing
SGI: Meeting Manufacturing's Need for Production Supercomputinginside-BigData.com
 
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...DigitalOcean
 
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Netronome
 
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...Linaro
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 
Introduction to OpenCL, 2010
Introduction to OpenCL, 2010Introduction to OpenCL, 2010
Introduction to OpenCL, 2010Tomasz Bednarz
 
BPF Hardware Offload Deep Dive
BPF Hardware Offload Deep DiveBPF Hardware Offload Deep Dive
BPF Hardware Offload Deep DiveNetronome
 
OpenCL Programming 101
OpenCL Programming 101OpenCL Programming 101
OpenCL Programming 101Yoss Cohen
 

What's hot (20)

Performance evaluation with Arm HPC tools for SVE
Performance evaluation with Arm HPC tools for SVEPerformance evaluation with Arm HPC tools for SVE
Performance evaluation with Arm HPC tools for SVE
 
Exploring the Programming Models for the LUMI Supercomputer
Exploring the Programming Models for the LUMI Supercomputer Exploring the Programming Models for the LUMI Supercomputer
Exploring the Programming Models for the LUMI Supercomputer
 
Porting and Optimization of Numerical Libraries for ARM SVE
Porting and Optimization of Numerical Libraries for ARM SVEPorting and Optimization of Numerical Libraries for ARM SVE
Porting and Optimization of Numerical Libraries for ARM SVE
 
Evaluating GPU programming Models for the LUMI Supercomputer
Evaluating GPU programming Models for the LUMI SupercomputerEvaluating GPU programming Models for the LUMI Supercomputer
Evaluating GPU programming Models for the LUMI Supercomputer
 
Arm tools and roadmap for SVE compiler support
Arm tools and roadmap for SVE compiler supportArm tools and roadmap for SVE compiler support
Arm tools and roadmap for SVE compiler support
 
Introduction to GPUs in HPC
Introduction to GPUs in HPCIntroduction to GPUs in HPC
Introduction to GPUs in HPC
 
Getting started with AMD GPUs
Getting started with AMD GPUsGetting started with AMD GPUs
Getting started with AMD GPUs
 
Circuit Simplifier
Circuit SimplifierCircuit Simplifier
Circuit Simplifier
 
An evaluation of LLVM compiler for SVE with fairly complicated loops
An evaluation of LLVM compiler for SVE with fairly complicated loopsAn evaluation of LLVM compiler for SVE with fairly complicated loops
An evaluation of LLVM compiler for SVE with fairly complicated loops
 
SGI: Meeting Manufacturing's Need for Production Supercomputing
SGI: Meeting Manufacturing's Need for Production SupercomputingSGI: Meeting Manufacturing's Need for Production Supercomputing
SGI: Meeting Manufacturing's Need for Production Supercomputing
 
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
 
IBM AI at Scale
IBM AI at ScaleIBM AI at Scale
IBM AI at Scale
 
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
 
ARM and Machine Learning
ARM and Machine LearningARM and Machine Learning
ARM and Machine Learning
 
Hands on OpenCL
Hands on OpenCLHands on OpenCL
Hands on OpenCL
 
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Introduction to OpenCL, 2010
Introduction to OpenCL, 2010Introduction to OpenCL, 2010
Introduction to OpenCL, 2010
 
BPF Hardware Offload Deep Dive
BPF Hardware Offload Deep DiveBPF Hardware Offload Deep Dive
BPF Hardware Offload Deep Dive
 
OpenCL Programming 101
OpenCL Programming 101OpenCL Programming 101
OpenCL Programming 101
 

Similar to Post-K: Building the Arm HPC Ecosystem

08 Supercomputer Fugaku
08 Supercomputer Fugaku08 Supercomputer Fugaku
08 Supercomputer FugakuRCCSRENKEI
 
Cisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Connect Toronto 2017 - UCS and Hyperflex updateCisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Connect Toronto 2017 - UCS and Hyperflex updateCisco Canada
 
Cisco Connect Halifax 2018 Compute infrastructure for a hybrid cloud ucs an...
Cisco Connect Halifax 2018   Compute infrastructure for a hybrid cloud ucs an...Cisco Connect Halifax 2018   Compute infrastructure for a hybrid cloud ucs an...
Cisco Connect Halifax 2018 Compute infrastructure for a hybrid cloud ucs an...Cisco Canada
 
Introduction of Fujitsu's HPC Processor for the Post-K Computer
Introduction of Fujitsu's HPC Processor for the Post-K ComputerIntroduction of Fujitsu's HPC Processor for the Post-K Computer
Introduction of Fujitsu's HPC Processor for the Post-K Computerinside-BigData.com
 
Fujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu World Tour 2017: Hyper Converged Data CentreFujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu World Tour 2017: Hyper Converged Data CentreFujitsu India
 
SUSE Expert Days 2017 FUJITSU
SUSE Expert Days 2017 FUJITSUSUSE Expert Days 2017 FUJITSU
SUSE Expert Days 2017 FUJITSUSUSE España
 
Hyper-convergence – The only way to the software-defined data center? - Gerno...
Hyper-convergence – The only way to the software-defined data center? - Gerno...Hyper-convergence – The only way to the software-defined data center? - Gerno...
Hyper-convergence – The only way to the software-defined data center? - Gerno...Fujitsu Middle East
 
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloudCisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloudCisco Canada
 
Fujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu World Tour 2017 - Compute Platform For The Digital WorldFujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu World Tour 2017 - Compute Platform For The Digital WorldFujitsu India
 
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?Fujitsu Central Europe
 
Cisco connect winnipeg 2018 compute infrastructure for a hybrid cloud
Cisco connect winnipeg 2018   compute infrastructure for a hybrid cloudCisco connect winnipeg 2018   compute infrastructure for a hybrid cloud
Cisco connect winnipeg 2018 compute infrastructure for a hybrid cloudCisco Canada
 
Compute Infrastructure for Hybrid Cloud
Compute Infrastructure for Hybrid CloudCompute Infrastructure for Hybrid Cloud
Compute Infrastructure for Hybrid CloudCisco Canada
 
Compute Infrastructure for a Hybrid Cloud
Compute Infrastructure for a Hybrid CloudCompute Infrastructure for a Hybrid Cloud
Compute Infrastructure for a Hybrid CloudCisco Canada
 
Netronome Corporate Brochure
Netronome Corporate BrochureNetronome Corporate Brochure
Netronome Corporate BrochureNetronome
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteLinaro
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...Daniel Martin
 
Presentazione IBM Flex System e System x Evento Venaria 14 ottobre
Presentazione IBM Flex System e System x Evento Venaria 14 ottobrePresentazione IBM Flex System e System x Evento Venaria 14 ottobre
Presentazione IBM Flex System e System x Evento Venaria 14 ottobrePRAGMA PROGETTI
 
C122 a015-en
C122 a015-enC122 a015-en
C122 a015-enTommy Lee
 

Similar to Post-K: Building the Arm HPC Ecosystem (20)

08 Supercomputer Fugaku
08 Supercomputer Fugaku08 Supercomputer Fugaku
08 Supercomputer Fugaku
 
Cisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Connect Toronto 2017 - UCS and Hyperflex updateCisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Connect Toronto 2017 - UCS and Hyperflex update
 
Cisco Connect Halifax 2018 Compute infrastructure for a hybrid cloud ucs an...
Cisco Connect Halifax 2018   Compute infrastructure for a hybrid cloud ucs an...Cisco Connect Halifax 2018   Compute infrastructure for a hybrid cloud ucs an...
Cisco Connect Halifax 2018 Compute infrastructure for a hybrid cloud ucs an...
 
IBM PureSystems
IBM PureSystemsIBM PureSystems
IBM PureSystems
 
Introduction of Fujitsu's HPC Processor for the Post-K Computer
Introduction of Fujitsu's HPC Processor for the Post-K ComputerIntroduction of Fujitsu's HPC Processor for the Post-K Computer
Introduction of Fujitsu's HPC Processor for the Post-K Computer
 
Fujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu World Tour 2017: Hyper Converged Data CentreFujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu World Tour 2017: Hyper Converged Data Centre
 
SUSE Expert Days 2017 FUJITSU
SUSE Expert Days 2017 FUJITSUSUSE Expert Days 2017 FUJITSU
SUSE Expert Days 2017 FUJITSU
 
Hyper-convergence – The only way to the software-defined data center? - Gerno...
Hyper-convergence – The only way to the software-defined data center? - Gerno...Hyper-convergence – The only way to the software-defined data center? - Gerno...
Hyper-convergence – The only way to the software-defined data center? - Gerno...
 
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloudCisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
 
Fujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu World Tour 2017 - Compute Platform For The Digital WorldFujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu World Tour 2017 - Compute Platform For The Digital World
 
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
 
Cisco connect winnipeg 2018 compute infrastructure for a hybrid cloud
Cisco connect winnipeg 2018   compute infrastructure for a hybrid cloudCisco connect winnipeg 2018   compute infrastructure for a hybrid cloud
Cisco connect winnipeg 2018 compute infrastructure for a hybrid cloud
 
Compute Infrastructure for Hybrid Cloud
Compute Infrastructure for Hybrid CloudCompute Infrastructure for Hybrid Cloud
Compute Infrastructure for Hybrid Cloud
 
Compute Infrastructure for a Hybrid Cloud
Compute Infrastructure for a Hybrid CloudCompute Infrastructure for a Hybrid Cloud
Compute Infrastructure for a Hybrid Cloud
 
Netronome Corporate Brochure
Netronome Corporate BrochureNetronome Corporate Brochure
Netronome Corporate Brochure
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
 
All in one
All in oneAll in one
All in one
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...
 
Presentazione IBM Flex System e System x Evento Venaria 14 ottobre
Presentazione IBM Flex System e System x Evento Venaria 14 ottobrePresentazione IBM Flex System e System x Evento Venaria 14 ottobre
Presentazione IBM Flex System e System x Evento Venaria 14 ottobre
 
C122 a015-en
C122 a015-enC122 a015-en
C122 a015-en
 

More from Linaro

Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloLinaro
 
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta VekariaArm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta VekariaLinaro
 
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua MoraHuawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua MoraLinaro
 
Bud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qaBud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qaLinaro
 
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018Linaro
 
HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018Linaro
 
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...Linaro
 
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Linaro
 
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...Linaro
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Linaro
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineLinaro
 
HKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP WorkshopHKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP WorkshopLinaro
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineLinaro
 
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and allHKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and allLinaro
 
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse HypervisorHKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse HypervisorLinaro
 
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMUHKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMULinaro
 
HKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8MHKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8MLinaro
 
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation Linaro
 
HKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted bootHKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted bootLinaro
 
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...Linaro
 

More from Linaro (20)

Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
 
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta VekariaArm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
 
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua MoraHuawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
 
Bud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qaBud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qa
 
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
 
HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018
 
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
 
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
 
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
 
HKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP WorkshopHKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP Workshop
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
 
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and allHKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
 
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse HypervisorHKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
 
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMUHKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMU
 
HKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8MHKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8M
 
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation
 
HKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted bootHKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted boot
 
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
 

Recently uploaded

Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Post-K: Building the Arm HPC Ecosystem

  • 1. Kouichi Hirai FUJITSU LIMITED Dec 12th, 2017 Post-K: Building the Arm HPC Ecosystem 0 Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017
  • 2. Post-K: Building up Arm HPC Ecosystem  Fujitsu’s approach for HPC  For making the Post-K a resounding success  The high performance compiler increases software portability  Summary Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 1
  • 3. Fujitsu HPC Solutions to Meet Customer Demands  Supercomputers, both Fujitsu-developed CPUs and x86  Single system image operation w/ Fujitsu system software  High performance, high availability, and high reliability Copyright 2017 FUJITSU LIMITED x86 Cluster RX2530/RX2540 CX600CX400 High scalability with Fujitsu- developed CPU and interconnect PRIMERGY x86 cluster systems support the latest CPUs and accelerators Under Development w/ RIKEN High -end Divisional Departmental Workgroup PRIMEHPC FX10 PRIMEHPC FX100 Post-KK computer Co-developed with RIKEN © RIKEN Large-Scale SMP System RX900 Linaro Work Shop, Dec. 12, 2017 2
  • 4. Fujitsu High-end Supercomputers Development 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 PRIMEHPC FX10  1.8x CPU perf. of K  Easier installation  4x(DP) / 8x(SP) CPU per. of K, Tofu2  High-density pkg & lower energy App. review FS projects HPCI strategic apps program Operation of K computerDevelopment Japan’s National Projects FUJITSU Post-K computer development PRIMEHPC FX100 K computer and PRIMEHPC FX10/FX100 in operation The CPU and interconnect of FX10/FX100 inherit the K computer architectural concept, featuring state-of- the-art technologies System software “TCS” supports Fujitsu supercomputer with originally introduced technologies Many applications are currently running and being developed for science and various industries RIKEN and Fujitsu are working together to provide a successor to K computer with application R&D teams using co- design approach Technical Computing Suite (TCS) Handles millions of parallel jobs FEFS: super scalable file system MPI: Ultra scalable collective communication libraries  OS: Lower OS jitter w/ assistant core Copyright 2017 FUJITSU LIMITED Post-K supercomputer Post-K Linaro Work Shop, Dec. 12, 2017 3
  • 5. Post-K Features and Status  Fujitsu CPU core (w/ Arm SVE) and Tofu maintain the programming models and provide high application performance  RIKEN & Fujitsu system software enable high performance and low power consumption with flexible operations  Apps from 9 “priority issues” & many “exploratory challenges” are being optimized for the Post-K Functions & architecture Post-K FX100 FX10 K CPU Core Instruction set architecture Armv8-A SPARC V9 SIMD width 512bit 256bit 128bit 128bit Double precision (64bit) ✔ ✔ ✔ ✔ Single precision (32bit) ✔ ✔ ✔ ✔ Half precision (16bit) ✔ - - - Interconnect Tofu interconnect Enhanced Tofu2 Tofu Tofu Copyright 2017 FUJITSU LIMITED Post-K Linaro Work Shop, Dec. 12, 2017 4
  • 6. Post-K Software Stack  Valuable feedbacks through “co-design” from application R&D teams Post-K System Hardware FUJITSU Technical Computing Suite / RIKEN Advanced System Software Linux OS / McKernel (Lightweight Kernel) Post-K Applications System management for highly available & power saving operation Job management for higher system utilization & power efficiency Lustre-based distributed file system FEFS OpenMP, COARRAY, Math Libs Compilers (C, C++, Fortran) Debugging and tuning tools Management Software Programming EnvironmentHierarchical File I/O Software MPI (Open MPI, MPICH) XcalableMP Application-oriented file I/O middleware Post-K Under Development w/ RIKEN Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 5
  • 7. Post-K to be More Useful?  More apps from OSS & ISVs High performance on “real” applications Lower TCO • Low power consumption • Water cooling De-facto standards • Lowering barriers in developing and porting Ecosystem • More Arm platforms • More partners • More knowledge/experience inside/outside of communities Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 6
  • 8. Making the Post-K a Resounding Success  Recapping the goal & requirements  High performance HW and SW complying open standards  Apps in quality & variety  Environments – rich, modern, and comprehensive  Our approach  Arm architecture (w/ Fujitsu’s proven microarchitecture) • SBSA: Server Base System Architecture • SBBR: Server Base Boot Requirements • VLA: Vector-Length Agnostic  Fujitsu enhanced/maintained system software • Based on Linux & OSSs • Single source for x86 & Arm • Open MPI, OpenMP, Libraries, • Performance analyzer, Debugger  Powerful but original compilers --- will be aligned to be useful & popular Copyright 2017 FUJITSU LIMITED Assure binary compatibility Lowering barriers for single source development Linaro Work Shop, Dec. 12, 2017 7
  • 9.  Transform our original & powerful compilers to be all-around  Working and contributing for the Clang project to satisfy both high performance and portability  Fujitsu’s back-end advantage  Auto-parallelization for many-core architecture  Auto-vectorization for Scalable Vector Extension  Strong software pipelining with loop fission Compilers to Increase Software Portability Copyright 2017 FUJITSU LIMITED Utilize Post-K μArch: • Rich & wide SIMD • Sector cache… Software: Apps, Middleware, and Basics (written in variety of styles) Portable binariesFujitsu original front-end Fujitsu original back-end from knowledge of CPU development Clang front-end Clang back-end Linaro Work Shop, Dec. 12, 2017 8
  • 10. Auto-vectorization for Arm SVE  4 Byte x 16 SIMD List Memory Access by utilizing 512bit Register  Various Types of SIMD Optimization by Utilizing Predicate Registers Copyright 2017 FUJITSU LIMITED for (int i=0; i<n; ++i) { if (mask[i] !=0) { a[i] = b[i]; } } for (int i=0; i<VL/2; ++i) { a[i] = b[i] * c[i]; } do { b[i] = a[i]; } while(a[i++] != 0); Loop including IF clause Small Loop less than SIMD length While Loop with Data Dependency SVE Reg. dest. Reg. index int index[n] float P[n], Q[n]; for (i=0; i<n; ++i) { P[i] = Q[index[i]]; } Q[14] Q[1] ・ Q[13] ・ Q[0] Q[3] Q[15] Q[2] 14 1 ・ 13 ・ 0 3 15 2 Memory Q [15] [14] [13] ・ ・ [3] [2] [1] [0]1 2 3 4 5 6 7 1 2 3 1 2 3 1 2 3 Linaro Work Shop, Dec. 12, 2017 9
  • 11. Fujitsu Compiler Back-end Optimization Flow  Loop Fission reduces required resources, such as registers  Software Pipelining and Register Allocation  Best utilization of hardware functions and resources Copyright 2017 FUJITSU LIMITED Back-end optimization pipeline Portable Arm binaries SIMDize Loop Fission Software Pipelining Register Allocation Instruction Scheduling for (...) { } // Reduced # of Regs. for (...) { } // Reduced # of Regs. for (...) { } // Higher ILP for (...) { } // Higher ILP for (...) { } Software pipelined #1 Software pipelined #2 Divided # 1 Divided# 2 Original 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9 Linaro Work Shop, Dec. 12, 2017 10
  • 12. Copyright 2017 FUJITSU LIMITED Effectiveness of SWP w/ Loop Fission and SoA  Runs on FX100 w/ 32 registers  72% speed-up per core is observed  >2x speed-up compared w/ K computer  Software Pipelining w/ Loop Fission utilizes CPU resources  SoA-style layout extracts more NICAM* single core performance on FX100 w/ 32 regs (Source: http://www.riken.jp/pr/topics/2013/20130920_1/) CPUclocksnormalized byKcomputer *NICAM-DC-MINI: Climate simulations with fine mesh, https://github.com/fiber-miniapp/nicam-dc-mini SWP w/ Loop fission + SoA style 72% speedup w/ loop fission + SoA Without Loop fission Linaro Work Shop, Dec. 12, 2017 11
  • 13. Summary  Fujitsu’s Approach to HPC  Supporting high-end supercomputers with original CPU & x86 clusters  Developing the Post-K for app performance and low power consumption  Expecting more apps from OSS & ISVs through growing ecosystem  Keys for Post-K Success  High performance standard-compliant HW and SW  All-around high performance compiler with binary compatibility  Many and varied high quality apps with x86 software compatibility  Open & Highly Optimized Compilers  Clang + Fujitsu technologies  Tentative evaluation results are encouraging Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 12