SlideShare a Scribd company logo
1 of 18
Download to read offline
Scaling Green
Instrumentation to more than
10 million Cores
Satoshi Matsuoka
Director, Riken Center for Computational Science /
Professor, Tokyo Institute of Technology
SC18 EE-Panel
Dallas TX
20181115
RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE
Computer simulations create the future
2
Fumiyoshi Shoji
Operations and Computer Technologies Division
RIKEN AICS
Operation improvements of facility for the K computer
at RIKEN AICS
Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)@SC’17
12 November, 2017
RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE
Gas Co-generation System (CGS)
The K computer & the facility
3
• The K computer:
• Computing Capability 10.51 PetaFlops (HPL score)
• Power Consumption 12-15MW
• Water/Air cooling ratio 70 : 30
• Operation start 28 September, 2012
• Facility:
• Gas Turbine Power Generator 5MW x 2
• Chiller(absorption) 1700USRt(5.98MW) x 2
• Chiller(Centrifugal) 1400USRt(4.93MW) x 2, 700USRt(2.46MW) x 2
• Air handlers, voltage transformers, etc.
Gas
Gas turbine
power generator
~30%
~45% steam
Electricity
Absorption chillers
Chilled water
Electric power source balance
Provider: Generator = 70 : 30
energy reuse
RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE
Efforts for better contract
4
• Electric power
• A private contract with a regional power supply company until FY2016.
• A competitor appeared for FY2017 supply contract renewal, then we
introduced competing bidding.
• ---> The challenger beat the champion then the electric power cost
decreased by 10% and no minimal consumption constraint is included.
• ---> No quality degradation observed so far.
Gasrate
2015 2016 2017 2018
fixed rate
floating rate
• Gas
• A contract based on a floating gas rate until Jun.2015.
• A fixed rate service has started since Aug.2015.
• In our case, a fixed rate, which is determined 6 month ago,
is applied for 1 year.
• ---> Consequently, the fixed rate level was higher than the
floating for 2 years, but the trend was reversed recently.
• ---> If the fixed rate contract needs more cost, it is
beneficial for us because no shortage/residue is guaranteed.
fixed rate
started at
Aug.2015
RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE
Lessons learned for CGS operation
5
1. Generation efficiency of gas turbine
power generator increases when it is
driven at higher output level.
• ex. when generator is driven at 50%
output level, efficiency decreases by
30%.
2. When power generator is driven at
higher level, richer steam is also
generated then power consumption of
chillers can be saved.
• ex. when generator is driven at 50%
output level, power consumption of
chillers increases by 30%.
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
0 1,000 2,000 3,000 4,000 5,000 6,000 7,000
Generationefficiency
(kWh/m3)
Generation output level (kW)
-30%
Higherisbetter
Generation level vs efficiency
900
1100
1300
1500
1700
1900
2100
2300
2000 2500 3000 3500 4000 4500 5000 5500
Generation output level (kW)
Generation level vs power consumption of chillers
powerconsumption
ofchillers(kW)
+30%
Loweris
better
Power generator should be driven at higher output level.
100%50%
100%50%
RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE
Power consumption fluctuation on K
6
Typical power consumption history (4/14-15, 2016)
Large scale job mode Normal mode
• The power consumption trend of the K computer dominates that of site total.
• In the normal mode, the power consumption changes gently due to mixture
of many jobs which have different power consumption profile.
• In the large scale job mode, the width of fluctuation become wider than that
of normal mode because there is only one huge job.
powerconsumption(MW)
RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE
Cooling in the idle and peak (in future)
7
Electric chiller
~4.9MW
Absorption
chiller
~5.9MW
idle (??MW)
peak (??MW)
Absorption
chiller
~5.9MW
Electric chiller
~4.9MW
Absorption
chiller
~5.9MW
Absorption
chiller
~5.9MW
Electric chiller
~4.9MW
Electric chiller
~4.9MW
Absorption
chiller
~5.9MW
Absorption
chiller
~5.9MW
Electric chiller
~4.9MW
Absorption
chiller
~5.9MW
Absorption
chiller
~5.9MW
Electric chiller
~4.9MW
Absorption
chiller
~5.9MW
Absorption
chiller
~5.9MW
gap(>??MW)
1 ~ 2hours
• If the gap between the idle and peak become huge, cooling supply might not in
time due to the activation latency of absorption chillers.
• How can we bridge the gap?
activate 2nd generator & 3rd and 4th absorption chillers
during high load job
idle is too lower for 3rd
and 4th absorption
chillers operation
peak is too higher
without 3rd and 4th
absorption chillers
operation
RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE
The Good/Bad of K Power Mgmt
8
• Good: Achieved 40% reduction in Power OPEX cost
– Efficient use of CGS
– Monitoring Grid vs. CGS cost to see which is cheaper
– Prepping the cooling system in anticipation of jobs in the queue
– Etc.
• Bad: We cant measure the power of the independent nodes
– So since we cant measure we cannot to job/node/.. wise fine-
grained power control
– Some studies to correlate perf counters to power but we don’t
have the control plane
Japan Flagship 2020 “Post K”
Supercomputer
9
: Compute
Node
:Interconnect
Login
Servers
Login
Servers
Maitenance
Servers
Maitenance
Servers
I/O NetworkI/O Network
…
…
…
…
…
…
…
…
…
…
…
…
Hierarchical
Storage System
Hierarchical
Storage System
Portal
Servers
Portal
Servers
CPU
• Many core, Xeon-Class ARM v8 cores + 512
bit SVE (scalable vector extensions)
• Multi-hundred petaflops peak total
• Power Knob feature
Memory
3-D stacked DRAM, Terabyte/s BW /chip
Interconnect
• TOFU-D CPU-integrated 6-D torus network
• I/O acceleration with massive SDs
• 30MW+ Power, liquid cooled
• Riken co-design with Fujitsu
•? Million cores in system
Prime Minister Abe visiting K Computer 2013
• R
NOW
10
1. Heritage of the K-Computer, HP in simulation via extensive Co-Design
• High performance: up to x100 performance of K in real applications
• Multitudes of Scientific Breakthroughs via Post-K application programs
• Simultaneous high performance and ease-of-programming
2. New Technology Innovations of Post-K
• High Performance, esp. via high memory BW
Performance boost by “factors” c.f. mainstream CPUs in
many HPC & Society5.0 apps
• Very Green e.g. extreme power efficiency
Ultra Power efficient design & various power control knobs
• Arm Global Ecosystem & SVE contribution
Top CPU in ARM Ecosystem of 21 billion chips/year, SVE co-
design and world’s first implementation by Fujitsu
• High Perf. on Society5.0 apps incl. AI
Architectural features for high perf on Society 5.0 apps
based on Big Data, AI/ML, CAE/EDA, Blockchain security,
etc.
Post-K: The Game Changer
ARM: Massive ecosystem
from embedded to HPC
Global leadership not just in
the machine & apps, but as
cutting edge IT
Technology not just limited to Post-K, but into societal IT infrastructures e.g. Clouds
C P U
A64fx
Post K Processor is…
11
 an Many-Core ARM CPU…
 48 compute cores + 2 or 4 assistant (OS) cores
 Brand new core design
 Near Xeon-Class Integer performance core
 ARM V8 --- 64bit ARM ecosystem
 Tofu 3 + PCIe 3 external connection
 …but also a GPU-like processor
 SVE 512 bit vector extensions (ARM & Fujitsu)
 Integer (1, 2, 4, 8 bytes) + Float (16, 32, 64 bytes)
 Cache + scratchpad local memory (sector cache)
 Multi-stack 3D memory – Massive Mem BW (Bytes/DPF ~0.4)
 Streaming memory access, strided access, scatter/gather etc.
 Intra-chip barrier synch. and other memory enhancing features
 GPU-like High performance in HPC, AI/Big Data, Auto Driving…20018/3/13
Post-K A64fx A0 (ES) performance
Performance / CPU Machine Performance (HPC)
Peak TF
(DFP)
Peak Mem.
BW
Stream
Triad
Theor
etical
B/F
DGEMM
Efficiency
Linpack
Efficiency
GF/W
Network BW
Per Chip
Post-K A64fx
(A0 Eng.
Sample)
2.764/
3.072
1024GB/s 840GB/s
0.37/
0.33
94 % 87.7 % >15
TOFU-D
40.8GB/s
(6.8x 6)
Intel KNL 3.0464 600GB/s 490GB/s 0.20 66% 54.4 % 4.9 12.5 GB/s
Intel Skylake 1.6128 127.8GB/s 97 GB/s 0.08 80 % 66.7 % 4.5 6.2GB/s
NVIDIA V100
(DGX-2)
7.8 900 GB/s 855GB/s 0.12 76 % 15.113
160GB/s
6.2GB/s
Power Management
 “Energy monitor” / “Energy analyzer” for activity-based power estimation
 Energy monitor (per chip) : Node power via Power API* (~msec)
• Average power estimation of a node, CMG (cores, an L2 cache, a memory) etc.
Power
calc.
Core
activity
Core L2 cache
Power
calc.
L2 cache
activity
Memory
Power
calc.
Tofu PCIe
Power calc. for PCIePower calc. for Tofu
Node 12 Cores L2 cache HBM2
 Energy analyzer (per core) : Power profiler via PAPI** (~nsec)
• Fine grained power analysis of a core, an L2 cache and a memory
→ Enabling chip-level power monitoring and detailed power analysis of applications
<A64FX Energy monitor/ Energy analyzer>
Memory
activity
CMG#0 CMG#1 CMG#2 CMG#3
Energy Analyzer
Core
L2 Cache
Memory
Assistant
core
Energy Monitor CMG
15 All Rights Reserved. Copyright © FUJITSU LIMITED 2018
*Sandia National Laboratory
** Performance Application Programming Interface
A64FX
 “Power knob” for power optimization
 A64FX provides power management function called “Power Knob”
• Applications can change hardware configurations for power optimization
→ Power knobs and Energy monitor/analyzer will help users to optimize power consumption of
their applications
<A64FX Power Knob Diagram>
L1 I$
Branch
Predictor
Decode
& Issue
RSE0
RSA
RSE1
RSBR
PGPR
EAGA
EXC
EAGB
EXD
EXA
EXB
PFPR
Fetch
Port
Store
Port L1D$
HBM2 Controller
Fetch Issue Dispatch Reg-Read Execute Cache and Memory
CSE
Commit
PC
Control
Registers
L2$
HBM2
Write
Buffer
Tofu controller
FLA
PPR
FLB
PRX
Power Management (Cont.)
FL pipeline usage: FLA only
HBM2 B/W adjustment (units of 10%)
Frequency reductionDecode width: 2 EX pipeline usage : EXA only
PCI Controller
52 cores
16 All Rights Reserved. Copyright © FUJITSU LIMITED 2018
Node Power (48 cores)
DGEMM : ~160W
STREAM: ~190W
STREAM w/power optimized
for BW: ~160W
FUJITSU CONFIDENTIAL
Post-K Chassis, PCB (w/DLC), and CPU Package
CMU
CPU Package
A0 Chip Booted in June
Undergoing Tests
60
mm
60
mm
230 mm
280
mm
CPU
A64fx
Copyright 2018 FUJITSU LIMITED
W 800㎜
D1400㎜
H2000㎜
384 nodes
An Overview of Post-K Hardware
 Compute Node, Compute + I/O Node connected by
6D mesh/torus Interconnect
 3-level hierarchical storage system
 1st Layer
 Cache for global file system
 Temporary file systems
- Local file system for compute node
- Shared file system for a job
 2nd Layer
 Lustre-based global file system
 3rd Layer
 Storage for archive
 >100,000 nodes
 x100PF, >100PB/s mem BW
 Approaching 10 million
CPU cores 1620018/6/26
Rack-level FPGA to cap
power total < 37MW
RIKEN Center for Computational Science
From K to Post K
17
K computer Post K
Voltage 3-phase AC 200V ->
Peak Power 15-20MW 30-40MW
Idle Power ~10MW >10MW
Fluctuation width ~5MW 10~18MW
Rack weight ~1t ~2t
Cooling
ratio(water:air)
65:35 90:10
Water supply 44l/rack/min 117l/rack/min
Peak Power can be > 40 MW,
> facility machine cap (37MW)
Demand-Aware Power Management for PCAS
[Masaaki Kondo et. al., U-Tokyo/Riken R-CCS]
18
 Control jobs’ power-cap based on their QoS requirement
 Performance degradation prediction + fine grained power control
Available Power Job Power
perf. loss smaller
than expected
Power cap
perf. loss larger
than expected
Power
Time
Max power
demand Power cap
Time
Power
Adaptive cappingFixed aapping at ave.
Adaptively set
power cap
100.00%
68.45%
53.86% 50.72%
34.25% 31.37% 30.61%
16.27%
0.00%
25.00%
50.00%
75.00%
100.00%
TurnaroundTime
without backfilling with easy backfilling
 1.74x throughput
improvement
 83.7% turnaround
time reduction

More Related Content

What's hot

Small Scale Photovoltaic Installations - Use of RETScreen Software
Small Scale Photovoltaic Installations - Use of RETScreen SoftwareSmall Scale Photovoltaic Installations - Use of RETScreen Software
Small Scale Photovoltaic Installations - Use of RETScreen Softwarefernando nuño
 
Holistic District Heating Grid Design with SimulationX & Green City
Holistic District Heating Grid Design with SimulationX & Green CityHolistic District Heating Grid Design with SimulationX & Green City
Holistic District Heating Grid Design with SimulationX & Green CitySimulationX
 
Il CSP di piccola taglia e il calore di processo: esempi pratici e casi studi...
Il CSP di piccola taglia e il calore di processo: esempi pratici e casi studi...Il CSP di piccola taglia e il calore di processo: esempi pratici e casi studi...
Il CSP di piccola taglia e il calore di processo: esempi pratici e casi studi...Sardegna Ricerche
 
IEA DSM Task 17: Integration of DSM, DG, RES and ES – Outcome of Phase 1 and ...
IEA DSM Task 17: Integration of DSM, DG, RES and ES – Outcome of Phase 1 and ...IEA DSM Task 17: Integration of DSM, DG, RES and ES – Outcome of Phase 1 and ...
IEA DSM Task 17: Integration of DSM, DG, RES and ES – Outcome of Phase 1 and ...IEA DSM Implementing Agreement (IA)
 
Solar Power Generation 2011 (Las Vegas, NV)
Solar Power Generation 2011 (Las Vegas, NV)Solar Power Generation 2011 (Las Vegas, NV)
Solar Power Generation 2011 (Las Vegas, NV)MitchellDong
 
Benefits of csp with thermal storage
Benefits of csp with thermal storageBenefits of csp with thermal storage
Benefits of csp with thermal storageSolarReference
 
parabolic trough solar collectors power plants (designe)
parabolic trough solar collectors power plants (designe)parabolic trough solar collectors power plants (designe)
parabolic trough solar collectors power plants (designe)Eslam Zaki
 
A micro chp solution based on a micro gas turbine
A micro chp solution based on a micro gas turbineA micro chp solution based on a micro gas turbine
A micro chp solution based on a micro gas turbinedkdk444
 
solar power by molten salt technology
solar power by molten salt technologysolar power by molten salt technology
solar power by molten salt technologymadhu124521
 
DEWI Magazin49_Ingeteam_INGECON WIND FIX2VAR SPEED
DEWI Magazin49_Ingeteam_INGECON WIND FIX2VAR SPEEDDEWI Magazin49_Ingeteam_INGECON WIND FIX2VAR SPEED
DEWI Magazin49_Ingeteam_INGECON WIND FIX2VAR SPEEDIngeteam Wind Energy
 
Forecasting commercial services from S2M - Juan Liria (Sun2Market)
Forecasting commercial services from S2M - Juan Liria (Sun2Market)Forecasting commercial services from S2M - Juan Liria (Sun2Market)
Forecasting commercial services from S2M - Juan Liria (Sun2Market)IrSOLaV Pomares
 
Design of hybrid micro thermoelectric and stirling engine
Design of hybrid micro thermoelectric and stirling engineDesign of hybrid micro thermoelectric and stirling engine
Design of hybrid micro thermoelectric and stirling engineUNIVERSITI MALAYSIA PAHANG
 
Concentrated Solar Power Course - Session 4 - Thermal Storage and Hybridization
Concentrated Solar Power Course - Session 4 - Thermal Storage and HybridizationConcentrated Solar Power Course - Session 4 - Thermal Storage and Hybridization
Concentrated Solar Power Course - Session 4 - Thermal Storage and HybridizationLeonardo ENERGY
 
IECON Amjad Optimal planning and operatoin Management of a ship Electrical Po...
IECON Amjad Optimal planning and operatoin Management of a ship Electrical Po...IECON Amjad Optimal planning and operatoin Management of a ship Electrical Po...
IECON Amjad Optimal planning and operatoin Management of a ship Electrical Po...Juan C. Vasquez
 
Simon Taylor Dissertation
Simon Taylor DissertationSimon Taylor Dissertation
Simon Taylor Dissertationsimontaylor1990
 
Performance Presentation - NREL March 2010
Performance Presentation - NREL March 2010Performance Presentation - NREL March 2010
Performance Presentation - NREL March 2010Jacob McKee
 

What's hot (20)

Small Scale Photovoltaic Installations - Use of RETScreen Software
Small Scale Photovoltaic Installations - Use of RETScreen SoftwareSmall Scale Photovoltaic Installations - Use of RETScreen Software
Small Scale Photovoltaic Installations - Use of RETScreen Software
 
Holistic District Heating Grid Design with SimulationX & Green City
Holistic District Heating Grid Design with SimulationX & Green CityHolistic District Heating Grid Design with SimulationX & Green City
Holistic District Heating Grid Design with SimulationX & Green City
 
Il CSP di piccola taglia e il calore di processo: esempi pratici e casi studi...
Il CSP di piccola taglia e il calore di processo: esempi pratici e casi studi...Il CSP di piccola taglia e il calore di processo: esempi pratici e casi studi...
Il CSP di piccola taglia e il calore di processo: esempi pratici e casi studi...
 
IEA DSM Task 17: Integration of DSM, DG, RES and ES – Outcome of Phase 1 and ...
IEA DSM Task 17: Integration of DSM, DG, RES and ES – Outcome of Phase 1 and ...IEA DSM Task 17: Integration of DSM, DG, RES and ES – Outcome of Phase 1 and ...
IEA DSM Task 17: Integration of DSM, DG, RES and ES – Outcome of Phase 1 and ...
 
Solar Power Generation 2011 (Las Vegas, NV)
Solar Power Generation 2011 (Las Vegas, NV)Solar Power Generation 2011 (Las Vegas, NV)
Solar Power Generation 2011 (Las Vegas, NV)
 
Benefits of csp with thermal storage
Benefits of csp with thermal storageBenefits of csp with thermal storage
Benefits of csp with thermal storage
 
Prajwal Patil
Prajwal PatilPrajwal Patil
Prajwal Patil
 
Ajay ntpc
Ajay ntpcAjay ntpc
Ajay ntpc
 
parabolic trough solar collectors power plants (designe)
parabolic trough solar collectors power plants (designe)parabolic trough solar collectors power plants (designe)
parabolic trough solar collectors power plants (designe)
 
A micro chp solution based on a micro gas turbine
A micro chp solution based on a micro gas turbineA micro chp solution based on a micro gas turbine
A micro chp solution based on a micro gas turbine
 
18 deceglie modeling and monitoring rtsr
18 deceglie modeling and monitoring rtsr18 deceglie modeling and monitoring rtsr
18 deceglie modeling and monitoring rtsr
 
solar power by molten salt technology
solar power by molten salt technologysolar power by molten salt technology
solar power by molten salt technology
 
DEWI Magazin49_Ingeteam_INGECON WIND FIX2VAR SPEED
DEWI Magazin49_Ingeteam_INGECON WIND FIX2VAR SPEEDDEWI Magazin49_Ingeteam_INGECON WIND FIX2VAR SPEED
DEWI Magazin49_Ingeteam_INGECON WIND FIX2VAR SPEED
 
Forecasting commercial services from S2M - Juan Liria (Sun2Market)
Forecasting commercial services from S2M - Juan Liria (Sun2Market)Forecasting commercial services from S2M - Juan Liria (Sun2Market)
Forecasting commercial services from S2M - Juan Liria (Sun2Market)
 
Design of hybrid micro thermoelectric and stirling engine
Design of hybrid micro thermoelectric and stirling engineDesign of hybrid micro thermoelectric and stirling engine
Design of hybrid micro thermoelectric and stirling engine
 
Concentrated Solar Power Course - Session 4 - Thermal Storage and Hybridization
Concentrated Solar Power Course - Session 4 - Thermal Storage and HybridizationConcentrated Solar Power Course - Session 4 - Thermal Storage and Hybridization
Concentrated Solar Power Course - Session 4 - Thermal Storage and Hybridization
 
21 gi 8th_pvpmc_talk_abq_final_ext_170508t19_new
21 gi 8th_pvpmc_talk_abq_final_ext_170508t19_new21 gi 8th_pvpmc_talk_abq_final_ext_170508t19_new
21 gi 8th_pvpmc_talk_abq_final_ext_170508t19_new
 
IECON Amjad Optimal planning and operatoin Management of a ship Electrical Po...
IECON Amjad Optimal planning and operatoin Management of a ship Electrical Po...IECON Amjad Optimal planning and operatoin Management of a ship Electrical Po...
IECON Amjad Optimal planning and operatoin Management of a ship Electrical Po...
 
Simon Taylor Dissertation
Simon Taylor DissertationSimon Taylor Dissertation
Simon Taylor Dissertation
 
Performance Presentation - NREL March 2010
Performance Presentation - NREL March 2010Performance Presentation - NREL March 2010
Performance Presentation - NREL March 2010
 

Similar to Scaling Green Instrumentation to more than 10 Million Cores

Run-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsRun-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsNECST Lab @ Politecnico di Milano
 
Opportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIOpportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIRyousei Takano
 
Windows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__finalWindows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__finalBruce Worthington
 
Aged Data Center Infrastructure.pptx
Aged Data Center Infrastructure.pptxAged Data Center Infrastructure.pptx
Aged Data Center Infrastructure.pptxSchneider Electric
 
Designing Next Generation Data Centres
Designing Next Generation Data CentresDesigning Next Generation Data Centres
Designing Next Generation Data CentresSam Wood
 
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AIArm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AIinside-BigData.com
 
KBC decision making tool optimal planning scheduling utility
KBC decision making tool optimal planning scheduling utilityKBC decision making tool optimal planning scheduling utility
KBC decision making tool optimal planning scheduling utilityKBC (A Yokogawa Company)
 
Large-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC WorkloadsLarge-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC Workloadsinside-BigData.com
 
Optimizing High Performance Computing Applications for Energy
Optimizing High Performance Computing Applications for EnergyOptimizing High Performance Computing Applications for Energy
Optimizing High Performance Computing Applications for EnergyDavid Lecomber
 
インドネシア JCM事業 FS報告会
インドネシア JCM事業 FS報告会インドネシア JCM事業 FS報告会
インドネシア JCM事業 FS報告会Takeshi Mitsuishi
 
Lenovo HPC: Energy Efficiency and Water-Cool-Technology Innovations
Lenovo HPC: Energy Efficiency and Water-Cool-Technology InnovationsLenovo HPC: Energy Efficiency and Water-Cool-Technology Innovations
Lenovo HPC: Energy Efficiency and Water-Cool-Technology Innovationsinside-BigData.com
 
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift RegisterArea Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift RegisterIJMTST Journal
 
Nikravesh big datafeb2013bt
Nikravesh big datafeb2013btNikravesh big datafeb2013bt
Nikravesh big datafeb2013btMasoud Nikravesh
 
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGATO project
 
Mod05lec24(resource mgmt i)
Mod05lec24(resource mgmt i)Mod05lec24(resource mgmt i)
Mod05lec24(resource mgmt i)Ankit Gupta
 
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPCExceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPCinside-BigData.com
 

Similar to Scaling Green Instrumentation to more than 10 Million Cores (20)

Run-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsRun-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environments
 
Opportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIOpportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCI
 
Windows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__finalWindows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__final
 
Aged Data Center Infrastructure.pptx
Aged Data Center Infrastructure.pptxAged Data Center Infrastructure.pptx
Aged Data Center Infrastructure.pptx
 
Designing Next Generation Data Centres
Designing Next Generation Data CentresDesigning Next Generation Data Centres
Designing Next Generation Data Centres
 
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AIArm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
Arm A64fx and Post-K: Game-Changing CPU & Supercomputer for HPC, Big Data, & AI
 
KBC decision making tool optimal planning scheduling utility
KBC decision making tool optimal planning scheduling utilityKBC decision making tool optimal planning scheduling utility
KBC decision making tool optimal planning scheduling utility
 
Large-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC WorkloadsLarge-Scale Optimization Strategies for Typical HPC Workloads
Large-Scale Optimization Strategies for Typical HPC Workloads
 
Optimizing High Performance Computing Applications for Energy
Optimizing High Performance Computing Applications for EnergyOptimizing High Performance Computing Applications for Energy
Optimizing High Performance Computing Applications for Energy
 
Indonesia JCM
Indonesia JCM Indonesia JCM
Indonesia JCM
 
インドネシア JCM事業 FS報告会
インドネシア JCM事業 FS報告会インドネシア JCM事業 FS報告会
インドネシア JCM事業 FS報告会
 
Lenovo HPC: Energy Efficiency and Water-Cool-Technology Innovations
Lenovo HPC: Energy Efficiency and Water-Cool-Technology InnovationsLenovo HPC: Energy Efficiency and Water-Cool-Technology Innovations
Lenovo HPC: Energy Efficiency and Water-Cool-Technology Innovations
 
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift RegisterArea Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
 
Nikravesh big datafeb2013bt
Nikravesh big datafeb2013btNikravesh big datafeb2013bt
Nikravesh big datafeb2013bt
 
E3MV - Embedded Vision - Sundance
E3MV - Embedded Vision - SundanceE3MV - Embedded Vision - Sundance
E3MV - Embedded Vision - Sundance
 
HYPPO - NECSTTechTalk 23/04/2020
HYPPO - NECSTTechTalk 23/04/2020HYPPO - NECSTTechTalk 23/04/2020
HYPPO - NECSTTechTalk 23/04/2020
 
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
 
Build Energy Saving into Your Datacenter
Build Energy Saving into Your DatacenterBuild Energy Saving into Your Datacenter
Build Energy Saving into Your Datacenter
 
Mod05lec24(resource mgmt i)
Mod05lec24(resource mgmt i)Mod05lec24(resource mgmt i)
Mod05lec24(resource mgmt i)
 
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPCExceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
 

More from inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networksinside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networksinside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoringinside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Updateinside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODinside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Accelerationinside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficientlyinside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Erainside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computinginside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 

Scaling Green Instrumentation to more than 10 Million Cores

  • 1. Scaling Green Instrumentation to more than 10 million Cores Satoshi Matsuoka Director, Riken Center for Computational Science / Professor, Tokyo Institute of Technology SC18 EE-Panel Dallas TX 20181115
  • 2. RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE Computer simulations create the future 2 Fumiyoshi Shoji Operations and Computer Technologies Division RIKEN AICS Operation improvements of facility for the K computer at RIKEN AICS Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)@SC’17 12 November, 2017
  • 3. RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE Gas Co-generation System (CGS) The K computer & the facility 3 • The K computer: • Computing Capability 10.51 PetaFlops (HPL score) • Power Consumption 12-15MW • Water/Air cooling ratio 70 : 30 • Operation start 28 September, 2012 • Facility: • Gas Turbine Power Generator 5MW x 2 • Chiller(absorption) 1700USRt(5.98MW) x 2 • Chiller(Centrifugal) 1400USRt(4.93MW) x 2, 700USRt(2.46MW) x 2 • Air handlers, voltage transformers, etc. Gas Gas turbine power generator ~30% ~45% steam Electricity Absorption chillers Chilled water Electric power source balance Provider: Generator = 70 : 30 energy reuse
  • 4. RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE Efforts for better contract 4 • Electric power • A private contract with a regional power supply company until FY2016. • A competitor appeared for FY2017 supply contract renewal, then we introduced competing bidding. • ---> The challenger beat the champion then the electric power cost decreased by 10% and no minimal consumption constraint is included. • ---> No quality degradation observed so far. Gasrate 2015 2016 2017 2018 fixed rate floating rate • Gas • A contract based on a floating gas rate until Jun.2015. • A fixed rate service has started since Aug.2015. • In our case, a fixed rate, which is determined 6 month ago, is applied for 1 year. • ---> Consequently, the fixed rate level was higher than the floating for 2 years, but the trend was reversed recently. • ---> If the fixed rate contract needs more cost, it is beneficial for us because no shortage/residue is guaranteed. fixed rate started at Aug.2015
  • 5. RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE Lessons learned for CGS operation 5 1. Generation efficiency of gas turbine power generator increases when it is driven at higher output level. • ex. when generator is driven at 50% output level, efficiency decreases by 30%. 2. When power generator is driven at higher level, richer steam is also generated then power consumption of chillers can be saved. • ex. when generator is driven at 50% output level, power consumption of chillers increases by 30%. 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 0 1,000 2,000 3,000 4,000 5,000 6,000 7,000 Generationefficiency (kWh/m3) Generation output level (kW) -30% Higherisbetter Generation level vs efficiency 900 1100 1300 1500 1700 1900 2100 2300 2000 2500 3000 3500 4000 4500 5000 5500 Generation output level (kW) Generation level vs power consumption of chillers powerconsumption ofchillers(kW) +30% Loweris better Power generator should be driven at higher output level. 100%50% 100%50%
  • 6. RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE Power consumption fluctuation on K 6 Typical power consumption history (4/14-15, 2016) Large scale job mode Normal mode • The power consumption trend of the K computer dominates that of site total. • In the normal mode, the power consumption changes gently due to mixture of many jobs which have different power consumption profile. • In the large scale job mode, the width of fluctuation become wider than that of normal mode because there is only one huge job. powerconsumption(MW)
  • 7. RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE Cooling in the idle and peak (in future) 7 Electric chiller ~4.9MW Absorption chiller ~5.9MW idle (??MW) peak (??MW) Absorption chiller ~5.9MW Electric chiller ~4.9MW Absorption chiller ~5.9MW Absorption chiller ~5.9MW Electric chiller ~4.9MW Electric chiller ~4.9MW Absorption chiller ~5.9MW Absorption chiller ~5.9MW Electric chiller ~4.9MW Absorption chiller ~5.9MW Absorption chiller ~5.9MW Electric chiller ~4.9MW Absorption chiller ~5.9MW Absorption chiller ~5.9MW gap(>??MW) 1 ~ 2hours • If the gap between the idle and peak become huge, cooling supply might not in time due to the activation latency of absorption chillers. • How can we bridge the gap? activate 2nd generator & 3rd and 4th absorption chillers during high load job idle is too lower for 3rd and 4th absorption chillers operation peak is too higher without 3rd and 4th absorption chillers operation
  • 8. RIKEN ADVANCED INSTITUTE FOR COMPUTATIONAL SCIENCE The Good/Bad of K Power Mgmt 8 • Good: Achieved 40% reduction in Power OPEX cost – Efficient use of CGS – Monitoring Grid vs. CGS cost to see which is cheaper – Prepping the cooling system in anticipation of jobs in the queue – Etc. • Bad: We cant measure the power of the independent nodes – So since we cant measure we cannot to job/node/.. wise fine- grained power control – Some studies to correlate perf counters to power but we don’t have the control plane
  • 9. Japan Flagship 2020 “Post K” Supercomputer 9 : Compute Node :Interconnect Login Servers Login Servers Maitenance Servers Maitenance Servers I/O NetworkI/O Network … … … … … … … … … … … … Hierarchical Storage System Hierarchical Storage System Portal Servers Portal Servers CPU • Many core, Xeon-Class ARM v8 cores + 512 bit SVE (scalable vector extensions) • Multi-hundred petaflops peak total • Power Knob feature Memory 3-D stacked DRAM, Terabyte/s BW /chip Interconnect • TOFU-D CPU-integrated 6-D torus network • I/O acceleration with massive SDs • 30MW+ Power, liquid cooled • Riken co-design with Fujitsu •? Million cores in system Prime Minister Abe visiting K Computer 2013 • R NOW
  • 10. 10 1. Heritage of the K-Computer, HP in simulation via extensive Co-Design • High performance: up to x100 performance of K in real applications • Multitudes of Scientific Breakthroughs via Post-K application programs • Simultaneous high performance and ease-of-programming 2. New Technology Innovations of Post-K • High Performance, esp. via high memory BW Performance boost by “factors” c.f. mainstream CPUs in many HPC & Society5.0 apps • Very Green e.g. extreme power efficiency Ultra Power efficient design & various power control knobs • Arm Global Ecosystem & SVE contribution Top CPU in ARM Ecosystem of 21 billion chips/year, SVE co- design and world’s first implementation by Fujitsu • High Perf. on Society5.0 apps incl. AI Architectural features for high perf on Society 5.0 apps based on Big Data, AI/ML, CAE/EDA, Blockchain security, etc. Post-K: The Game Changer ARM: Massive ecosystem from embedded to HPC Global leadership not just in the machine & apps, but as cutting edge IT Technology not just limited to Post-K, but into societal IT infrastructures e.g. Clouds C P U A64fx
  • 11. Post K Processor is… 11  an Many-Core ARM CPU…  48 compute cores + 2 or 4 assistant (OS) cores  Brand new core design  Near Xeon-Class Integer performance core  ARM V8 --- 64bit ARM ecosystem  Tofu 3 + PCIe 3 external connection  …but also a GPU-like processor  SVE 512 bit vector extensions (ARM & Fujitsu)  Integer (1, 2, 4, 8 bytes) + Float (16, 32, 64 bytes)  Cache + scratchpad local memory (sector cache)  Multi-stack 3D memory – Massive Mem BW (Bytes/DPF ~0.4)  Streaming memory access, strided access, scatter/gather etc.  Intra-chip barrier synch. and other memory enhancing features  GPU-like High performance in HPC, AI/Big Data, Auto Driving…20018/3/13
  • 12. Post-K A64fx A0 (ES) performance Performance / CPU Machine Performance (HPC) Peak TF (DFP) Peak Mem. BW Stream Triad Theor etical B/F DGEMM Efficiency Linpack Efficiency GF/W Network BW Per Chip Post-K A64fx (A0 Eng. Sample) 2.764/ 3.072 1024GB/s 840GB/s 0.37/ 0.33 94 % 87.7 % >15 TOFU-D 40.8GB/s (6.8x 6) Intel KNL 3.0464 600GB/s 490GB/s 0.20 66% 54.4 % 4.9 12.5 GB/s Intel Skylake 1.6128 127.8GB/s 97 GB/s 0.08 80 % 66.7 % 4.5 6.2GB/s NVIDIA V100 (DGX-2) 7.8 900 GB/s 855GB/s 0.12 76 % 15.113 160GB/s 6.2GB/s
  • 13. Power Management  “Energy monitor” / “Energy analyzer” for activity-based power estimation  Energy monitor (per chip) : Node power via Power API* (~msec) • Average power estimation of a node, CMG (cores, an L2 cache, a memory) etc. Power calc. Core activity Core L2 cache Power calc. L2 cache activity Memory Power calc. Tofu PCIe Power calc. for PCIePower calc. for Tofu Node 12 Cores L2 cache HBM2  Energy analyzer (per core) : Power profiler via PAPI** (~nsec) • Fine grained power analysis of a core, an L2 cache and a memory → Enabling chip-level power monitoring and detailed power analysis of applications <A64FX Energy monitor/ Energy analyzer> Memory activity CMG#0 CMG#1 CMG#2 CMG#3 Energy Analyzer Core L2 Cache Memory Assistant core Energy Monitor CMG 15 All Rights Reserved. Copyright © FUJITSU LIMITED 2018 *Sandia National Laboratory ** Performance Application Programming Interface A64FX
  • 14.  “Power knob” for power optimization  A64FX provides power management function called “Power Knob” • Applications can change hardware configurations for power optimization → Power knobs and Energy monitor/analyzer will help users to optimize power consumption of their applications <A64FX Power Knob Diagram> L1 I$ Branch Predictor Decode & Issue RSE0 RSA RSE1 RSBR PGPR EAGA EXC EAGB EXD EXA EXB PFPR Fetch Port Store Port L1D$ HBM2 Controller Fetch Issue Dispatch Reg-Read Execute Cache and Memory CSE Commit PC Control Registers L2$ HBM2 Write Buffer Tofu controller FLA PPR FLB PRX Power Management (Cont.) FL pipeline usage: FLA only HBM2 B/W adjustment (units of 10%) Frequency reductionDecode width: 2 EX pipeline usage : EXA only PCI Controller 52 cores 16 All Rights Reserved. Copyright © FUJITSU LIMITED 2018 Node Power (48 cores) DGEMM : ~160W STREAM: ~190W STREAM w/power optimized for BW: ~160W
  • 15. FUJITSU CONFIDENTIAL Post-K Chassis, PCB (w/DLC), and CPU Package CMU CPU Package A0 Chip Booted in June Undergoing Tests 60 mm 60 mm 230 mm 280 mm CPU A64fx Copyright 2018 FUJITSU LIMITED W 800㎜ D1400㎜ H2000㎜ 384 nodes
  • 16. An Overview of Post-K Hardware  Compute Node, Compute + I/O Node connected by 6D mesh/torus Interconnect  3-level hierarchical storage system  1st Layer  Cache for global file system  Temporary file systems - Local file system for compute node - Shared file system for a job  2nd Layer  Lustre-based global file system  3rd Layer  Storage for archive  >100,000 nodes  x100PF, >100PB/s mem BW  Approaching 10 million CPU cores 1620018/6/26 Rack-level FPGA to cap power total < 37MW
  • 17. RIKEN Center for Computational Science From K to Post K 17 K computer Post K Voltage 3-phase AC 200V -> Peak Power 15-20MW 30-40MW Idle Power ~10MW >10MW Fluctuation width ~5MW 10~18MW Rack weight ~1t ~2t Cooling ratio(water:air) 65:35 90:10 Water supply 44l/rack/min 117l/rack/min Peak Power can be > 40 MW, > facility machine cap (37MW)
  • 18. Demand-Aware Power Management for PCAS [Masaaki Kondo et. al., U-Tokyo/Riken R-CCS] 18  Control jobs’ power-cap based on their QoS requirement  Performance degradation prediction + fine grained power control Available Power Job Power perf. loss smaller than expected Power cap perf. loss larger than expected Power Time Max power demand Power cap Time Power Adaptive cappingFixed aapping at ave. Adaptively set power cap 100.00% 68.45% 53.86% 50.72% 34.25% 31.37% 30.61% 16.27% 0.00% 25.00% 50.00% 75.00% 100.00% TurnaroundTime without backfilling with easy backfilling  1.74x throughput improvement  83.7% turnaround time reduction