SlideShare a Scribd company logo
Kouichi Hirai
FUJITSU LIMITED
Dec 12th, 2017
Post-K:
Building the Arm HPC Ecosystem
0 Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017
Post-K: Building up Arm HPC Ecosystem
 Fujitsu’s approach for HPC
 For making the Post-K a resounding success
 The high performance compiler increases software portability
 Summary
Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 1
Fujitsu HPC Solutions to Meet Customer Demands
 Supercomputers, both Fujitsu-developed CPUs and x86
 Single system image operation w/ Fujitsu system software
 High performance, high availability, and high reliability
Copyright 2017 FUJITSU LIMITED
x86 Cluster
RX2530/RX2540 CX600CX400
High scalability
with Fujitsu-
developed CPU
and interconnect
PRIMERGY x86
cluster systems
support the
latest CPUs and
accelerators
Under Development
w/ RIKEN
High
-end
Divisional
Departmental
Workgroup
PRIMEHPC FX10 PRIMEHPC FX100 Post-KK computer
Co-developed with RIKEN
© RIKEN
Large-Scale
SMP System
RX900
Linaro Work Shop, Dec. 12, 2017 2
Fujitsu High-end Supercomputers Development
2011 2012 2013 2014 2015 2016 2017 2018 2019 2020
PRIMEHPC FX10
 1.8x CPU perf. of K
 Easier installation
 4x(DP) / 8x(SP) CPU per. of K, Tofu2
 High-density pkg & lower energy
App.
review
FS
projects
HPCI strategic apps program
Operation of K computerDevelopment
Japan’s National Projects
FUJITSU
Post-K computer development
PRIMEHPC FX100
K computer and PRIMEHPC
FX10/FX100 in operation
The CPU and interconnect of
FX10/FX100 inherit the K computer
architectural concept, featuring state-of-
the-art technologies
System software “TCS” supports Fujitsu
supercomputer with originally introduced
technologies
Many applications are currently running
and being developed for science and
various industries
RIKEN and Fujitsu are working together
to provide a successor to K computer
with application R&D teams using co-
design approach
Technical Computing Suite (TCS)
Handles millions of parallel jobs
FEFS: super scalable file system
MPI: Ultra scalable collective
communication libraries
 OS: Lower OS jitter w/
assistant core
Copyright 2017 FUJITSU LIMITED
Post-K supercomputer
Post-K
Linaro Work Shop, Dec. 12, 2017 3
Post-K Features and Status
 Fujitsu CPU core (w/ Arm SVE) and Tofu maintain the programming models
and provide high application performance
 RIKEN & Fujitsu system software enable high performance and low power
consumption with flexible operations
 Apps from 9 “priority issues” & many “exploratory challenges” are being
optimized for the Post-K
Functions & architecture
Post-K FX100 FX10 K
CPU Core
Instruction set architecture Armv8-A SPARC V9
SIMD width 512bit 256bit 128bit 128bit
Double precision (64bit) ✔ ✔ ✔ ✔
Single precision (32bit) ✔ ✔ ✔ ✔
Half precision (16bit) ✔ - - -
Interconnect Tofu interconnect Enhanced Tofu2 Tofu Tofu
Copyright 2017 FUJITSU LIMITED
Post-K
Linaro Work Shop, Dec. 12, 2017 4
Post-K Software Stack
 Valuable feedbacks through “co-design” from application R&D teams
Post-K System Hardware
FUJITSU Technical Computing Suite / RIKEN Advanced System Software
Linux OS / McKernel (Lightweight Kernel)
Post-K Applications
System management
for highly available & power
saving operation
Job management for higher
system utilization & power
efficiency
Lustre-based
distributed file system
FEFS
OpenMP, COARRAY, Math Libs
Compilers (C, C++, Fortran)
Debugging and tuning tools
Management Software Programming EnvironmentHierarchical File I/O Software
MPI (Open MPI, MPICH)
XcalableMP
Application-oriented
file I/O middleware
Post-K
Under Development
w/ RIKEN
Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 5
Post-K to be More Useful?
 More apps from OSS & ISVs
High performance on “real” applications
Lower TCO
• Low power consumption
• Water cooling
De-facto standards
• Lowering barriers in developing and porting
Ecosystem
• More Arm platforms
• More partners
• More knowledge/experience inside/outside of communities
Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 6
Making the Post-K a Resounding Success
 Recapping the goal & requirements
 High performance HW and SW complying open standards
 Apps in quality & variety
 Environments – rich, modern, and comprehensive
 Our approach
 Arm architecture (w/ Fujitsu’s proven microarchitecture)
• SBSA: Server Base System Architecture
• SBBR: Server Base Boot Requirements
• VLA: Vector-Length Agnostic
 Fujitsu enhanced/maintained system software
• Based on Linux & OSSs
• Single source for x86 & Arm
• Open MPI, OpenMP, Libraries,
• Performance analyzer, Debugger
 Powerful but original compilers --- will be aligned to be useful & popular
Copyright 2017 FUJITSU LIMITED
Assure binary compatibility
Lowering barriers for single
source development
Linaro Work Shop, Dec. 12, 2017 7
 Transform our original & powerful compilers to be all-around
 Working and contributing for the Clang project to satisfy both high
performance and portability
 Fujitsu’s back-end advantage
 Auto-parallelization for many-core architecture
 Auto-vectorization for Scalable Vector Extension
 Strong software pipelining with loop fission
Compilers to Increase Software Portability
Copyright 2017 FUJITSU LIMITED
Utilize Post-K μArch:
• Rich & wide SIMD
• Sector cache…
Software:
Apps, Middleware,
and Basics (written
in variety of styles)
Portable
binariesFujitsu original
front-end
Fujitsu original
back-end from
knowledge of
CPU
development
Clang front-end Clang back-end
Linaro Work Shop, Dec. 12, 2017 8
Auto-vectorization for Arm SVE
 4 Byte x 16 SIMD List Memory Access by utilizing 512bit Register
 Various Types of SIMD Optimization by Utilizing Predicate Registers
Copyright 2017 FUJITSU LIMITED
for (int i=0; i<n; ++i) {
if (mask[i] !=0) { a[i] = b[i]; }
}
for (int i=0; i<VL/2; ++i) {
a[i] = b[i] * c[i];
}
do {
b[i] = a[i];
} while(a[i++] != 0);
Loop including IF clause
Small Loop less
than SIMD length
While Loop with
Data Dependency
SVE
Reg. dest.
Reg. index
int index[n]
float P[n], Q[n];
for (i=0; i<n; ++i) {
P[i] = Q[index[i]];
}
Q[14] Q[1] ・ Q[13] ・ Q[0] Q[3] Q[15] Q[2]
14 1 ・ 13 ・ 0 3 15 2
Memory Q [15] [14] [13] ・ ・ [3] [2] [1] [0]1
2
3
4
5
6
7
1
2
3
1
2
3
1
2
3
Linaro Work Shop, Dec. 12, 2017 9
Fujitsu Compiler Back-end Optimization Flow
 Loop Fission reduces required resources, such as registers
 Software Pipelining and Register Allocation
 Best utilization of hardware functions and resources
Copyright 2017 FUJITSU LIMITED
Back-end optimization pipeline
Portable
Arm
binaries
SIMDize
Loop
Fission
Software
Pipelining
Register
Allocation
Instruction
Scheduling
for (...) {
}
// Reduced # of Regs.
for (...) {
}
// Reduced # of Regs.
for (...) {
}
// Higher ILP
for (...) {
}
// Higher ILP
for (...) {
}
Software pipelined #1
Software pipelined #2
Divided # 1
Divided# 2
Original
1
2
3
4
5
6
7
8
9
1
2
3
4
5
6
7
8
9
1
2
3
4
5
6
7
8
9
Linaro Work Shop, Dec. 12, 2017 10
Copyright 2017 FUJITSU LIMITED
Effectiveness of SWP w/ Loop Fission and SoA
 Runs on FX100 w/ 32 registers
 72% speed-up per core is observed
 >2x speed-up compared w/ K computer
 Software Pipelining w/ Loop Fission
utilizes CPU resources
 SoA-style layout extracts more
NICAM* single core performance on FX100 w/ 32 regs
(Source: http://www.riken.jp/pr/topics/2013/20130920_1/)
CPUclocksnormalized
byKcomputer
*NICAM-DC-MINI: Climate simulations with fine mesh, https://github.com/fiber-miniapp/nicam-dc-mini
SWP w/
Loop
fission
+ SoA
style
72% speedup w/ loop fission + SoA
Without
Loop
fission
Linaro Work Shop, Dec. 12, 2017 11
Summary
 Fujitsu’s Approach to HPC
 Supporting high-end supercomputers with original CPU & x86 clusters
 Developing the Post-K for app performance and low power consumption
 Expecting more apps from OSS & ISVs through growing ecosystem
 Keys for Post-K Success
 High performance standard-compliant HW and SW
 All-around high performance compiler with binary compatibility
 Many and varied high quality apps with x86 software compatibility
 Open & Highly Optimized Compilers
 Clang + Fujitsu technologies
 Tentative evaluation results are encouraging
Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 12
Copyright 2017 FUJITSU LIMITED

More Related Content

What's hot

Performance evaluation with Arm HPC tools for SVE
Performance evaluation with Arm HPC tools for SVEPerformance evaluation with Arm HPC tools for SVE
Performance evaluation with Arm HPC tools for SVE
Linaro
 
Exploring the Programming Models for the LUMI Supercomputer
Exploring the Programming Models for the LUMI Supercomputer Exploring the Programming Models for the LUMI Supercomputer
Exploring the Programming Models for the LUMI Supercomputer
George Markomanolis
 
Porting and Optimization of Numerical Libraries for ARM SVE
Porting and Optimization of Numerical Libraries for ARM SVEPorting and Optimization of Numerical Libraries for ARM SVE
Porting and Optimization of Numerical Libraries for ARM SVE
Linaro
 
Evaluating GPU programming Models for the LUMI Supercomputer
Evaluating GPU programming Models for the LUMI SupercomputerEvaluating GPU programming Models for the LUMI Supercomputer
Evaluating GPU programming Models for the LUMI Supercomputer
George Markomanolis
 
Arm tools and roadmap for SVE compiler support
Arm tools and roadmap for SVE compiler supportArm tools and roadmap for SVE compiler support
Arm tools and roadmap for SVE compiler support
Linaro
 
Introduction to GPUs in HPC
Introduction to GPUs in HPCIntroduction to GPUs in HPC
Introduction to GPUs in HPC
inside-BigData.com
 
Getting started with AMD GPUs
Getting started with AMD GPUsGetting started with AMD GPUs
Getting started with AMD GPUs
George Markomanolis
 
Circuit Simplifier
Circuit SimplifierCircuit Simplifier
Circuit Simplifier
Vineet Markan
 
An evaluation of LLVM compiler for SVE with fairly complicated loops
An evaluation of LLVM compiler for SVE with fairly complicated loopsAn evaluation of LLVM compiler for SVE with fairly complicated loops
An evaluation of LLVM compiler for SVE with fairly complicated loops
Linaro
 
SGI: Meeting Manufacturing's Need for Production Supercomputing
SGI: Meeting Manufacturing's Need for Production SupercomputingSGI: Meeting Manufacturing's Need for Production Supercomputing
SGI: Meeting Manufacturing's Need for Production Supercomputing
inside-BigData.com
 
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
DigitalOcean
 
IBM AI at Scale
IBM AI at ScaleIBM AI at Scale
IBM AI at Scale
Ganesan Narayanasamy
 
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Netronome
 
ARM and Machine Learning
ARM and Machine LearningARM and Machine Learning
ARM and Machine Learning
inside-BigData.com
 
Hands on OpenCL
Hands on OpenCLHands on OpenCL
Hands on OpenCL
Vladimir Starostenkov
 
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
Linaro
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Introduction to OpenCL, 2010
Introduction to OpenCL, 2010Introduction to OpenCL, 2010
Introduction to OpenCL, 2010
Tomasz Bednarz
 
BPF Hardware Offload Deep Dive
BPF Hardware Offload Deep DiveBPF Hardware Offload Deep Dive
BPF Hardware Offload Deep Dive
Netronome
 
OpenCL Programming 101
OpenCL Programming 101OpenCL Programming 101
OpenCL Programming 101
Yoss Cohen
 

What's hot (20)

Performance evaluation with Arm HPC tools for SVE
Performance evaluation with Arm HPC tools for SVEPerformance evaluation with Arm HPC tools for SVE
Performance evaluation with Arm HPC tools for SVE
 
Exploring the Programming Models for the LUMI Supercomputer
Exploring the Programming Models for the LUMI Supercomputer Exploring the Programming Models for the LUMI Supercomputer
Exploring the Programming Models for the LUMI Supercomputer
 
Porting and Optimization of Numerical Libraries for ARM SVE
Porting and Optimization of Numerical Libraries for ARM SVEPorting and Optimization of Numerical Libraries for ARM SVE
Porting and Optimization of Numerical Libraries for ARM SVE
 
Evaluating GPU programming Models for the LUMI Supercomputer
Evaluating GPU programming Models for the LUMI SupercomputerEvaluating GPU programming Models for the LUMI Supercomputer
Evaluating GPU programming Models for the LUMI Supercomputer
 
Arm tools and roadmap for SVE compiler support
Arm tools and roadmap for SVE compiler supportArm tools and roadmap for SVE compiler support
Arm tools and roadmap for SVE compiler support
 
Introduction to GPUs in HPC
Introduction to GPUs in HPCIntroduction to GPUs in HPC
Introduction to GPUs in HPC
 
Getting started with AMD GPUs
Getting started with AMD GPUsGetting started with AMD GPUs
Getting started with AMD GPUs
 
Circuit Simplifier
Circuit SimplifierCircuit Simplifier
Circuit Simplifier
 
An evaluation of LLVM compiler for SVE with fairly complicated loops
An evaluation of LLVM compiler for SVE with fairly complicated loopsAn evaluation of LLVM compiler for SVE with fairly complicated loops
An evaluation of LLVM compiler for SVE with fairly complicated loops
 
SGI: Meeting Manufacturing's Need for Production Supercomputing
SGI: Meeting Manufacturing's Need for Production SupercomputingSGI: Meeting Manufacturing's Need for Production Supercomputing
SGI: Meeting Manufacturing's Need for Production Supercomputing
 
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
Escape the Walls of PaaS: Unlock the Power & Flexibility of DigitalOcean App ...
 
IBM AI at Scale
IBM AI at ScaleIBM AI at Scale
IBM AI at Scale
 
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
 
ARM and Machine Learning
ARM and Machine LearningARM and Machine Learning
ARM and Machine Learning
 
Hands on OpenCL
Hands on OpenCLHands on OpenCL
Hands on OpenCL
 
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
HKG18-411 - Introduction to OpenAMP which is an open source solution for hete...
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Introduction to OpenCL, 2010
Introduction to OpenCL, 2010Introduction to OpenCL, 2010
Introduction to OpenCL, 2010
 
BPF Hardware Offload Deep Dive
BPF Hardware Offload Deep DiveBPF Hardware Offload Deep Dive
BPF Hardware Offload Deep Dive
 
OpenCL Programming 101
OpenCL Programming 101OpenCL Programming 101
OpenCL Programming 101
 

Similar to Post-K: Building the Arm HPC Ecosystem

08 Supercomputer Fugaku
08 Supercomputer Fugaku08 Supercomputer Fugaku
08 Supercomputer Fugaku
RCCSRENKEI
 
Cisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Connect Toronto 2017 - UCS and Hyperflex updateCisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Canada
 
Cisco Connect Halifax 2018 Compute infrastructure for a hybrid cloud ucs an...
Cisco Connect Halifax 2018   Compute infrastructure for a hybrid cloud ucs an...Cisco Connect Halifax 2018   Compute infrastructure for a hybrid cloud ucs an...
Cisco Connect Halifax 2018 Compute infrastructure for a hybrid cloud ucs an...
Cisco Canada
 
IBM PureSystems
IBM PureSystemsIBM PureSystems
IBM PureSystems
HelpSystems
 
Introduction of Fujitsu's HPC Processor for the Post-K Computer
Introduction of Fujitsu's HPC Processor for the Post-K ComputerIntroduction of Fujitsu's HPC Processor for the Post-K Computer
Introduction of Fujitsu's HPC Processor for the Post-K Computer
inside-BigData.com
 
Fujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu World Tour 2017: Hyper Converged Data CentreFujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu India
 
SUSE Expert Days 2017 FUJITSU
SUSE Expert Days 2017 FUJITSUSUSE Expert Days 2017 FUJITSU
SUSE Expert Days 2017 FUJITSU
SUSE España
 
Hyper-convergence – The only way to the software-defined data center? - Gerno...
Hyper-convergence – The only way to the software-defined data center? - Gerno...Hyper-convergence – The only way to the software-defined data center? - Gerno...
Hyper-convergence – The only way to the software-defined data center? - Gerno...
Fujitsu Middle East
 
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloudCisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Canada
 
Fujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu World Tour 2017 - Compute Platform For The Digital WorldFujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu India
 
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
Fujitsu Central Europe
 
Cisco connect winnipeg 2018 compute infrastructure for a hybrid cloud
Cisco connect winnipeg 2018   compute infrastructure for a hybrid cloudCisco connect winnipeg 2018   compute infrastructure for a hybrid cloud
Cisco connect winnipeg 2018 compute infrastructure for a hybrid cloud
Cisco Canada
 
Compute Infrastructure for a Hybrid Cloud
Compute Infrastructure for a Hybrid CloudCompute Infrastructure for a Hybrid Cloud
Compute Infrastructure for a Hybrid Cloud
Cisco Canada
 
Compute Infrastructure for Hybrid Cloud
Compute Infrastructure for Hybrid CloudCompute Infrastructure for Hybrid Cloud
Compute Infrastructure for Hybrid Cloud
Cisco Canada
 
Netronome Corporate Brochure
Netronome Corporate BrochureNetronome Corporate Brochure
Netronome Corporate Brochure
Netronome
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
Linaro
 
All in one
All in oneAll in one
All in one
Akash Sahoo
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...
Daniel Martin
 
Presentazione IBM Flex System e System x Evento Venaria 14 ottobre
Presentazione IBM Flex System e System x Evento Venaria 14 ottobrePresentazione IBM Flex System e System x Evento Venaria 14 ottobre
Presentazione IBM Flex System e System x Evento Venaria 14 ottobre
PRAGMA PROGETTI
 
C122 a015-en
C122 a015-enC122 a015-en
C122 a015-en
Tommy Lee
 

Similar to Post-K: Building the Arm HPC Ecosystem (20)

08 Supercomputer Fugaku
08 Supercomputer Fugaku08 Supercomputer Fugaku
08 Supercomputer Fugaku
 
Cisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Connect Toronto 2017 - UCS and Hyperflex updateCisco Connect Toronto 2017 - UCS and Hyperflex update
Cisco Connect Toronto 2017 - UCS and Hyperflex update
 
Cisco Connect Halifax 2018 Compute infrastructure for a hybrid cloud ucs an...
Cisco Connect Halifax 2018   Compute infrastructure for a hybrid cloud ucs an...Cisco Connect Halifax 2018   Compute infrastructure for a hybrid cloud ucs an...
Cisco Connect Halifax 2018 Compute infrastructure for a hybrid cloud ucs an...
 
IBM PureSystems
IBM PureSystemsIBM PureSystems
IBM PureSystems
 
Introduction of Fujitsu's HPC Processor for the Post-K Computer
Introduction of Fujitsu's HPC Processor for the Post-K ComputerIntroduction of Fujitsu's HPC Processor for the Post-K Computer
Introduction of Fujitsu's HPC Processor for the Post-K Computer
 
Fujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu World Tour 2017: Hyper Converged Data CentreFujitsu World Tour 2017: Hyper Converged Data Centre
Fujitsu World Tour 2017: Hyper Converged Data Centre
 
SUSE Expert Days 2017 FUJITSU
SUSE Expert Days 2017 FUJITSUSUSE Expert Days 2017 FUJITSU
SUSE Expert Days 2017 FUJITSU
 
Hyper-convergence – The only way to the software-defined data center? - Gerno...
Hyper-convergence – The only way to the software-defined data center? - Gerno...Hyper-convergence – The only way to the software-defined data center? - Gerno...
Hyper-convergence – The only way to the software-defined data center? - Gerno...
 
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloudCisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
Cisco Connect Vancouver 2017 - Compute infrastructure for a hybrid cloud
 
Fujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu World Tour 2017 - Compute Platform For The Digital WorldFujitsu World Tour 2017 - Compute Platform For The Digital World
Fujitsu World Tour 2017 - Compute Platform For The Digital World
 
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
BOS - Hyperkonvergenz – Der einzige Weg zum Software definierten Rechenzentrum?
 
Cisco connect winnipeg 2018 compute infrastructure for a hybrid cloud
Cisco connect winnipeg 2018   compute infrastructure for a hybrid cloudCisco connect winnipeg 2018   compute infrastructure for a hybrid cloud
Cisco connect winnipeg 2018 compute infrastructure for a hybrid cloud
 
Compute Infrastructure for a Hybrid Cloud
Compute Infrastructure for a Hybrid CloudCompute Infrastructure for a Hybrid Cloud
Compute Infrastructure for a Hybrid Cloud
 
Compute Infrastructure for Hybrid Cloud
Compute Infrastructure for Hybrid CloudCompute Infrastructure for Hybrid Cloud
Compute Infrastructure for Hybrid Cloud
 
Netronome Corporate Brochure
Netronome Corporate BrochureNetronome Corporate Brochure
Netronome Corporate Brochure
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
 
All in one
All in oneAll in one
All in one
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...
 
Presentazione IBM Flex System e System x Evento Venaria 14 ottobre
Presentazione IBM Flex System e System x Evento Venaria 14 ottobrePresentazione IBM Flex System e System x Evento Venaria 14 ottobre
Presentazione IBM Flex System e System x Evento Venaria 14 ottobre
 
C122 a015-en
C122 a015-enC122 a015-en
C122 a015-en
 

More from Linaro

Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Linaro
 
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta VekariaArm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Linaro
 
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua MoraHuawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Linaro
 
Bud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qaBud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qa
Linaro
 
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
Linaro
 
HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018
Linaro
 
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
Linaro
 
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Linaro
 
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Linaro
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Linaro
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
Linaro
 
HKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP WorkshopHKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP Workshop
Linaro
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
Linaro
 
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and allHKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
Linaro
 
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse HypervisorHKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
Linaro
 
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMUHKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMU
Linaro
 
HKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8MHKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8M
Linaro
 
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation
Linaro
 
HKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted bootHKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted boot
Linaro
 
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
Linaro
 

More from Linaro (20)

Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
 
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta VekariaArm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
 
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua MoraHuawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
 
Bud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qaBud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qa
 
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
 
HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018
 
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
 
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
 
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
 
HKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP WorkshopHKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP Workshop
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
 
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and allHKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
 
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse HypervisorHKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
 
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMUHKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMU
 
HKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8MHKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8M
 
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation
 
HKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted bootHKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted boot
 
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
HKG18-500K1 - Keynote: Dileep Bhandarkar - Emerging Computing Trends in the D...
 

Recently uploaded

RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 

Recently uploaded (20)

RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 

Post-K: Building the Arm HPC Ecosystem

  • 1. Kouichi Hirai FUJITSU LIMITED Dec 12th, 2017 Post-K: Building the Arm HPC Ecosystem 0 Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017
  • 2. Post-K: Building up Arm HPC Ecosystem  Fujitsu’s approach for HPC  For making the Post-K a resounding success  The high performance compiler increases software portability  Summary Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 1
  • 3. Fujitsu HPC Solutions to Meet Customer Demands  Supercomputers, both Fujitsu-developed CPUs and x86  Single system image operation w/ Fujitsu system software  High performance, high availability, and high reliability Copyright 2017 FUJITSU LIMITED x86 Cluster RX2530/RX2540 CX600CX400 High scalability with Fujitsu- developed CPU and interconnect PRIMERGY x86 cluster systems support the latest CPUs and accelerators Under Development w/ RIKEN High -end Divisional Departmental Workgroup PRIMEHPC FX10 PRIMEHPC FX100 Post-KK computer Co-developed with RIKEN © RIKEN Large-Scale SMP System RX900 Linaro Work Shop, Dec. 12, 2017 2
  • 4. Fujitsu High-end Supercomputers Development 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 PRIMEHPC FX10  1.8x CPU perf. of K  Easier installation  4x(DP) / 8x(SP) CPU per. of K, Tofu2  High-density pkg & lower energy App. review FS projects HPCI strategic apps program Operation of K computerDevelopment Japan’s National Projects FUJITSU Post-K computer development PRIMEHPC FX100 K computer and PRIMEHPC FX10/FX100 in operation The CPU and interconnect of FX10/FX100 inherit the K computer architectural concept, featuring state-of- the-art technologies System software “TCS” supports Fujitsu supercomputer with originally introduced technologies Many applications are currently running and being developed for science and various industries RIKEN and Fujitsu are working together to provide a successor to K computer with application R&D teams using co- design approach Technical Computing Suite (TCS) Handles millions of parallel jobs FEFS: super scalable file system MPI: Ultra scalable collective communication libraries  OS: Lower OS jitter w/ assistant core Copyright 2017 FUJITSU LIMITED Post-K supercomputer Post-K Linaro Work Shop, Dec. 12, 2017 3
  • 5. Post-K Features and Status  Fujitsu CPU core (w/ Arm SVE) and Tofu maintain the programming models and provide high application performance  RIKEN & Fujitsu system software enable high performance and low power consumption with flexible operations  Apps from 9 “priority issues” & many “exploratory challenges” are being optimized for the Post-K Functions & architecture Post-K FX100 FX10 K CPU Core Instruction set architecture Armv8-A SPARC V9 SIMD width 512bit 256bit 128bit 128bit Double precision (64bit) ✔ ✔ ✔ ✔ Single precision (32bit) ✔ ✔ ✔ ✔ Half precision (16bit) ✔ - - - Interconnect Tofu interconnect Enhanced Tofu2 Tofu Tofu Copyright 2017 FUJITSU LIMITED Post-K Linaro Work Shop, Dec. 12, 2017 4
  • 6. Post-K Software Stack  Valuable feedbacks through “co-design” from application R&D teams Post-K System Hardware FUJITSU Technical Computing Suite / RIKEN Advanced System Software Linux OS / McKernel (Lightweight Kernel) Post-K Applications System management for highly available & power saving operation Job management for higher system utilization & power efficiency Lustre-based distributed file system FEFS OpenMP, COARRAY, Math Libs Compilers (C, C++, Fortran) Debugging and tuning tools Management Software Programming EnvironmentHierarchical File I/O Software MPI (Open MPI, MPICH) XcalableMP Application-oriented file I/O middleware Post-K Under Development w/ RIKEN Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 5
  • 7. Post-K to be More Useful?  More apps from OSS & ISVs High performance on “real” applications Lower TCO • Low power consumption • Water cooling De-facto standards • Lowering barriers in developing and porting Ecosystem • More Arm platforms • More partners • More knowledge/experience inside/outside of communities Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 6
  • 8. Making the Post-K a Resounding Success  Recapping the goal & requirements  High performance HW and SW complying open standards  Apps in quality & variety  Environments – rich, modern, and comprehensive  Our approach  Arm architecture (w/ Fujitsu’s proven microarchitecture) • SBSA: Server Base System Architecture • SBBR: Server Base Boot Requirements • VLA: Vector-Length Agnostic  Fujitsu enhanced/maintained system software • Based on Linux & OSSs • Single source for x86 & Arm • Open MPI, OpenMP, Libraries, • Performance analyzer, Debugger  Powerful but original compilers --- will be aligned to be useful & popular Copyright 2017 FUJITSU LIMITED Assure binary compatibility Lowering barriers for single source development Linaro Work Shop, Dec. 12, 2017 7
  • 9.  Transform our original & powerful compilers to be all-around  Working and contributing for the Clang project to satisfy both high performance and portability  Fujitsu’s back-end advantage  Auto-parallelization for many-core architecture  Auto-vectorization for Scalable Vector Extension  Strong software pipelining with loop fission Compilers to Increase Software Portability Copyright 2017 FUJITSU LIMITED Utilize Post-K μArch: • Rich & wide SIMD • Sector cache… Software: Apps, Middleware, and Basics (written in variety of styles) Portable binariesFujitsu original front-end Fujitsu original back-end from knowledge of CPU development Clang front-end Clang back-end Linaro Work Shop, Dec. 12, 2017 8
  • 10. Auto-vectorization for Arm SVE  4 Byte x 16 SIMD List Memory Access by utilizing 512bit Register  Various Types of SIMD Optimization by Utilizing Predicate Registers Copyright 2017 FUJITSU LIMITED for (int i=0; i<n; ++i) { if (mask[i] !=0) { a[i] = b[i]; } } for (int i=0; i<VL/2; ++i) { a[i] = b[i] * c[i]; } do { b[i] = a[i]; } while(a[i++] != 0); Loop including IF clause Small Loop less than SIMD length While Loop with Data Dependency SVE Reg. dest. Reg. index int index[n] float P[n], Q[n]; for (i=0; i<n; ++i) { P[i] = Q[index[i]]; } Q[14] Q[1] ・ Q[13] ・ Q[0] Q[3] Q[15] Q[2] 14 1 ・ 13 ・ 0 3 15 2 Memory Q [15] [14] [13] ・ ・ [3] [2] [1] [0]1 2 3 4 5 6 7 1 2 3 1 2 3 1 2 3 Linaro Work Shop, Dec. 12, 2017 9
  • 11. Fujitsu Compiler Back-end Optimization Flow  Loop Fission reduces required resources, such as registers  Software Pipelining and Register Allocation  Best utilization of hardware functions and resources Copyright 2017 FUJITSU LIMITED Back-end optimization pipeline Portable Arm binaries SIMDize Loop Fission Software Pipelining Register Allocation Instruction Scheduling for (...) { } // Reduced # of Regs. for (...) { } // Reduced # of Regs. for (...) { } // Higher ILP for (...) { } // Higher ILP for (...) { } Software pipelined #1 Software pipelined #2 Divided # 1 Divided# 2 Original 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9 Linaro Work Shop, Dec. 12, 2017 10
  • 12. Copyright 2017 FUJITSU LIMITED Effectiveness of SWP w/ Loop Fission and SoA  Runs on FX100 w/ 32 registers  72% speed-up per core is observed  >2x speed-up compared w/ K computer  Software Pipelining w/ Loop Fission utilizes CPU resources  SoA-style layout extracts more NICAM* single core performance on FX100 w/ 32 regs (Source: http://www.riken.jp/pr/topics/2013/20130920_1/) CPUclocksnormalized byKcomputer *NICAM-DC-MINI: Climate simulations with fine mesh, https://github.com/fiber-miniapp/nicam-dc-mini SWP w/ Loop fission + SoA style 72% speedup w/ loop fission + SoA Without Loop fission Linaro Work Shop, Dec. 12, 2017 11
  • 13. Summary  Fujitsu’s Approach to HPC  Supporting high-end supercomputers with original CPU & x86 clusters  Developing the Post-K for app performance and low power consumption  Expecting more apps from OSS & ISVs through growing ecosystem  Keys for Post-K Success  High performance standard-compliant HW and SW  All-around high performance compiler with binary compatibility  Many and varied high quality apps with x86 software compatibility  Open & Highly Optimized Compilers  Clang + Fujitsu technologies  Tentative evaluation results are encouraging Copyright 2017 FUJITSU LIMITEDLinaro Work Shop, Dec. 12, 2017 12