Lenovo HPC
Strategy
Lenovo
Luigi Brochard – Distinguished Engineer, WW HPC & AI, DCPG
2
Lenovo Proven T1 HPC Partner
Co-Innovation with LRZ
First
WARM WATER
COOLED SUPER
COMPUTER
Number 12 on TOP500
World’s #3 KNL/OPA System
SUPERCOMPUTER IN
EMEA
2nd
Largest
#2 WW – 99 listings
#1 China
Multiple PFLOP+ SKL wins
Fastest
GROWING
TOP 500 HPC
VENDOR
2017 Lenovo. All rights reserved.
3
Lenovo Goal – Be the #1 Trusted HPC Partner
• MFG/Supply Chain
• Energy Mgmt
• DC IoT
• Warm Water Cooling
• Benchmark Centers
• Innovation centers
• Deep Skills/Experts
• STIC and R&T
• Global arch. reviews
• Open source focus
• Partner openess
• Open/Shared IP
• Lead w/ communities
Limited Budgets;
Higher Demands
Co-Design
is Mandatory
Resurgence of
Specialization
Open
Everything
• Artifical Intelligence
• Modular designs
• SW Defined HPC
• Be fast and agile
2017 Lenovo. All rights reserved.
4
Machine Learning workflow requires specialized architectures for each process step
AI and Machine Learning is part of HPC Lenovo
Data Train Deploy
model
Classification
Detection
Segmentation
Big data
• Storage
• Preparation
• Feeding
• Hadoop/Spark
Training
• HPC characteristics
- Compute intensive
- Scale up and out
- Accelerators/high-
speed networks
Inference/Scoring
• Enterprise/device
computing
• Scale out architectures
• Xeon+FPGA
Input
2017 Lenovo. All rights reserved.
552017 Lenovo confidential. All rights reserved.
HPC PORTFOLIO AND
ROADMAP
6
Lenovo Scalable Infrastructure (LeSI)
Lenovo Scalable Infrastructure (LeSI) is a framework for development,
configuration, build, delivery and support of integrated data center solutions
• Complete HPC data center portfolio with the best-of-breed partner technology
• Collaborate on OpenSource HPC software in true commitment to Openess
• End-to-end expert-designed, tested, integrated and supported HPC solutions
Software
Options
Networking
Storage
Server
Product Portfolio Scalable Infrastructure
Scalable Infrastructure
Components
HPC Network systems/options
Infrastructure options
Scalable Infrastructure
Solutions
ThinkAgile HPC
GPFS Storage Server
OpenSource HPC Software
(LiCO, xCAT/Confluent, Antilles, ...)
Distributed Storage Solution
for IBM Spectrum Scale
2017 Lenovo. All rights reserved.
7
From Complete Portfolio to Results Driven DesignsThinkAgile HPC
Best of the
Lenovo
Portfolio
ThinkAgile
HPC
Scalable
Infrastructure
OEM Portfolio
Scalable
Infrastructure
HPC Support
Factory
Integration
Services
+ + + =
1410 physical rack
ship racked/cabled
testing power &
reduncancy, p2p
cabling, best recipe,
factory settings, run
Linpack and load
client xCat image
Testing hardware
and software
interoperability
resulting in best
recipe code level:
SLES, RHEL
MOFED, IFS
xCat/Confluent,
Spectrum Scale
Bringing in best-of-
breed HPC partner
components,
especially HPC
switches and
adapters
• Mellanox FDR/
EDR Infiniband
• Intel Omnipath
100 Series
Selection from off-
the-shelf HPC
proven Lenovo
Portfolio
• Server
• Storage
• Networking
• Options
• Software
A single cluster,
end2end designed &
configured, built and
shipped ready to
accept the client
images through
xCAT or alternatives.
Value Add Choice:
Installation Services
Managed Services
Full HPC SW Suite
Customizing solutions for each client’s unique needs
2017 Lenovo. All rights reserved.
8
End to end services that span from basic to consultative engagements
Lenovo DCG Portfolio Spans the Entire Data Center
SAP BWA
Cloud
Big Data
Client Virtualization
Database & Analytics
Rack Scale Arch.
Application Optimized
Compute & Warm Storage
Project Scorpio
1.Solution-centric approach to IT
2.Rack-level solution
3.Ease of config, deploy, manage
4.Seamless mgmt. framework
5.Optimized ROI by workload
6.Reduced OPEX w/
consolidated, familiar
mgmt. tools
Discrete offerings
1.Broad datacenter offerings;
servers, storage, networking
2.Highest configuration flexibility
3.Open and optional networking
4.Simplified and open mgmt
World Class, end-to-end Data Center portfolio, delivered as Discrete or as Integrated offerings
Engineered Solutions HPC/AI Hyperscale
Server • Rack & Tower
• Mission Critical: X6
• Dense: NeXtScale &
ThinkServer
• Blades: Flex System
Storage • SAN: S Series, V Series
• SDS: DX8000C, DX8000N
• AFA: Coming soon
• Scale-out: GSS
• Direct Attach & Archive
Hyper-converged • ROBO: HX1000
• SMB: HX2000
• Compute Heavy: HX3000
• Storage Heavy: HX5000
• High-Performance: HX7000
Networking • Embedded
• Top of Rack
• Campus and Core
• Storage Switches
• Networking OS
CAE & EDA
Weather&Climate
Machine Learning
Academia
Cluster & Storage
2017 Lenovo. All rights reserved.
9
Moving forward into a decade of dense HPC
iDataPlex – Air / DWC
dx360 - Harpertown
dx360M2 - Nehalem
dx360M3 - Westmere
dx360M4 - Sandy Bridge
NeXtScale – Air / DWC
nx360M4 - Sandy Bridge/Ivy Bridge
nx360M5 - Haswell/Broadwell
Stark – Air
sd530 - Skylake/Icelake
NeXtScale – DWC
sd650- Skylake/Icelake
2017 Lenovo. All rights reserved.
10
Deliver Results Faster With HPC
Challenge What is Possible Best cost, speed and skill Deliver Unique Thinking
Helping Researchers and Companies Make Immediate Impact
2017 Lenovo. All rights reserved.
11112017 Lenovo confidential. All rights reserved.
COOLING TECHNOLOGIES
122017 Lenovo. All rights reserved.
Lenovo Choice of Cooling
 Standard air flow with internal fans
 Fits in any datacenter
 Maximum flexibility
 Broadest choice of configurable
options supported
 Supports Native Expansion nodes
(Storage NeX, PCI NeX)
PUE ~2 – 1.5
ERE ~2 – 1.5
 Air cool, supplemented with RDHX
door on rack
 Uses chilled water with economizer
(18°C water)
 Enables extremely tight rack
placement
PUE ~1.4 – 1.2
ERE ~ 1.4 – 1.2
 Direct water cooling without fans
 Higher performance per watt
 Free cooling (inlet up to 50°C water)
 Energy re-use
 Densest footprint and high TDP SKU
 Ideal for geos with high electricity
costs and new data centers
PUE ~ 1.1
ERE < 1 (with hot water)
Direct Water CooledAir Cooled Indirect Water Cooled
Choose for broadest choice
of customizable options
Choose for highest performance
and energy efficiency
Choose for compromise between
flexibility and energy efficiency
132017 Lenovo. All rights reserved.
NeXtScale System with Water Cool Technology (WCT)
• Water Cooled Node & Chassis
– Full wide, 2-node compute tray
– 6U Chassis, 6 bays (12 nodes/chassis)
– Manifolds deliver water directly to nodes
– Water circulated through cooling tubes for
component level cooling
– Intel E5-2600 v4 CPUs
– 16x DDR4 DIMM slots
– InfiniBand FDR/EDR and OPA support
– 6x 900W or 1300W PSU
– No fans except PSUs
– Drip sensor / Error LEDs
 Up to 85% heat to water ratio
nx360 M5 WCT Compute Tray (2 nodes)
Power,
LEDsDual-port ML2
(IB/Ethernet)
Labeling
tag
1 GbE
ports
PCI slot for
Connect IB
CPU
with
liquid
cooled
heatsink
Cooling
tubes
x16
DIMMs
1 GbE
ports
n1200 WCT Enclosure
6 full wide bays
12 compute nodes
n1200 WCT Manifold
142017 Lenovo. All rights reserved.
DWC reduces Processor Temperature on Xeon 2697v4
Conclusion: Direct Water Cooling lowers
processor power consumption by about
5% or allows higher performance
NXT with 2 socket 2697v4, 128 GB 2400 MHz DIMM Inlet Water temperature is 28°C
DC power and energy is measured at node level
15
Value of Direct Water Cooling
• Any temperature:
– Lower processor power consumption (~ 6%), No fan per node ( ~4%)
– Higher TDP processor
– Higher performance ( ~5%)
- In this case, processor power consumption is not reduced
• Hot water temperature
– Provides free cooling all year long
- ~ 20% power savings on compression chillers to generate chilled water
- potentially no/less chillers
DC energy is measured through aem DC energy accumulator
2017 Lenovo. All rights reserved.
16
TCO: return on investment for DWC vs RDHx
• New data centers: Water cooling has immediate payback.
• Existing air-cooled data center payback period strongly depends on electricity rate
DWC RDHx
$0.06/kWh $0.12/kWh $0.20/kWh
New Existing Existing Existing
2017 Lenovo. All rights reserved.
17
Lenovo references with DWC (2012-2016)
18.000 nodes up and running with DWC technology from Lenovo !
2017 Lenovo. All rights reserved.
Sites Nodes Country Instal date Max. In. Water
LRZ SuperMUC 9298 Germany 2012 45°C
LRZ SuperMUC 2 3096 Germany 2014 45°C
LRZ SuperCool2 438 Germany 2015 50°C
NTU 40 Singapore 2012 45°C
Enercon 136 Germany 2013 45°C
US Army 756 Hawai 2013 45°C
Exxon Research 504 NA 2014 45°C
NASA Goddard 80 NA 2014 45°C
PIK 312 Germany 2015 45°C
KIT 1152 Germany 2015 45°C
Birmingham U ph1 28 UK 2015 45°C
Birmingham U ph2 132 UK 2016 45°C
T-Systems 316 Germany 2016 45°C
MMD 296 Malaysia 2016 45°C
UNINET 964 Norway 2016 45°C
Peking U 204 China 2017 45°C
1818
2017 Lenovo confidential. All rights reserved.
LENOVO HPC STORAGE
Key Component of our overall Data Center Success
19
Storage is Essential and We Have the Tools
Taking our off-the-shelf server and storage portfolio
marrying it with leading HPC Storage Software
Distributed Storage
Solution
for
IBM Spectrum Scale
Defined Solution especially
for large capacity, high
performance workloads in
HPC environments
Distributed Storage
Architecture
for
Intel Lustre EE
Tested architecture as
entry point and mid range
Lustre offering in HPC
environments.
Distributed Storage
Solution
for
SUSE Enterprise Storage
Defined Solution especially
for interaction with Lenovo
scale-out HANA solutions.*
Distributed Storage
Architecture
for
SUSE Enterprise Storage
/ Red Hat Ceph Storage
Tested architecture as
entry point and mid range
CEPH offering in HPC
environments.
Future PlansMid April Announcement
DSS - G DSS - C DSA - LDSA - C
*formerly published as „ThinkStorage for SAP HANA TDI“2017 Lenovo. All rights reserved.
20
DSS Product Overview
A fully integrated data center solution fulfilled through Lenovo Scalable Infrastructure (LeSI)
Distributed Storage Solution for IBM Spectrum ScaleTM
DSS is an LeSI solution for a selection of file and object storage offerings delivering high storage
density and I/O performance with superior availability, reliability and resiliency.
Project DSS-G for IBM Spectrum Scale™
Solution Definition  2 x3650 M5 HPIO Servers
 Software: RedHat Enterprise Linux, IBM Spectrum Scale
for DSS Standard or Data Management Edition
 1-6 Storage Enclosures
 Lenovo D3284 12Gb JBOD (5U84) or
 4TB, 6TB, 8TB, 10TB
 Lenovo D1224 12Gb JBOD (2U24)
 0.3TB – 1.8TB SAS, 0.4TB – 3.8TB SSD
 Connectivity: 10GbE//25GbE/40GbE/100GbE/FDR IB/EDR IB/OPA
Target Market  HPC, BigData, Cloud
Target Workloads  HPC and Distributed File Systems
Value Proposition  High storage density and I/O performance with
superior availability, reliability and resiliency
License Model  Licensed by drive/capacity, no add. server/client licenses
Target GEO  World Wide Product
Target Launch  Q2 2017 (SC’16 Public Disclosure 11/14) – Mid April Avail.
DSS-G
x3650 M5
D3284
2017 Lenovo. All rights reserved.
21
DSS-G Overview
Distributed Storage Solution Configurations Examples for IBM Spectrum ScaleTM
x3650M5 HPIO
x3650M5 HPIO
D3284
D3284
D3284
D3284
D3284
D3284
D3284
D3284
670 x NL-SAS
DSS G280
x3650M5 HPIO
x3650M5 HPIO
D3284
D3284
D3284
D3284
D3284
D3284
502 x NL-SAS
DSS G260
x3650M5 HPIO
x3650M5 HPIO
D3284
D3284
D3284
D3284
334 x NL-SAS
DSS G240
x3650M5 HPIO
x3650M5 HPIO
D3284
D3284
164 x NL-SAS
DSS G220
x3650M5 HPIO
x3650M5 HPIO
x3650M5 HPIO
x3650M5 HPIO
D1224
D1224
D1224
D1224
D1224
D1224
DSS G204DSS G202
x3650M5 HPIO
x3650M5 HPIO
D1224
D1224
D1224
D1224
DSS G206
D1224
D1224
SSD / SAS Option for High Performance / IOPS
HPIO = High Performance I/O
Future Announce
2017 Lenovo. All rights reserved.
22
Intel EE Lustre on Lenovo S-Series Storage
X3550M5 Intel Lustre MDS Cluster
S3200
12GB SAS
X3550M5 Lustre Management
Intel Manager for Lustre
This module consists of a pair of clustered
servers, loaded with Linux and the Lustre
Metadata (MDS) component. The servers
connect to a dedicated S3200 array via SAS
connections, and to the file serving network via
high speed InfiniBand.
10/40 Gb Ethernet Infiniband / OPAOR
X3650M5 Intel Lustre OSS Cluster
12GB SAS
S3200
E1012
E1012
E1012
E1012
E1012
E1012
E1012
Cntrl A Cntrl B
12GB SAS
S3200
E1012
E1012
E1012
E1012
E1012
E1012
E1012
Cntrl A Cntrl B
2 SAS 2 SAS 2 SAS 2 SAS
RAID6 8+2P+4 Global Spares/Build. Block
Enclosures Drive Size Raw Nett 192 Total Drives
16 TB TB TB 180 RAID drives
Drives Raw 2 384 304 8 Global Spares
192 4 768 608 4 Rest
Drives Nett 6 1152 912
152 8 1536 1216
18 RAID Stripes2017 Lenovo. All rights reserved.
2323
2017 Lenovo confidential. All rights reserved.
LENOVO HPC SOFTWARE
Key Component of our overall Data Center Success
24
Lenovo Suite for ThinkAgile HPC Cluster
Lenovo OpenSource HPC Stack
An OpenSource IaaS suite to run and
manage optimally and transparently
HPC, Big Data and Workflows on a
virtualized infrastructure adjusting
dynamically to user and datacenter
needs through energy policies
• Bundle best-of-breed software
• Enhance with Lenovo configuration,
plugins and scripts
• Add in Energy Aware run time
• All open and flexible
A ready to use HPC Stack
* collaborating w/ Greg Kurtzer from Lawrence Berkeley National Lab
Lenovo HPC Stack Open Source / 3rd Party SW Lenovo / 3rd Party HW
Antilles Web Console GUI
Server Storage Network DataCenter
Ganglia, Icinga SLURM, Torque, Maui
xCAT / Confluent
OpenMPI, MVAPICH,
MPICH
Lustre, CEPH, Scale (GPFS)
OFED
Containers / Singularity *
RedHat, CentOS, SLES
Antilles
Management
Deployment
guide / scripts
Admin
guide / scripts
Customer Application
Eclipse PTP, debugger, gdb
GCC, Intel Parallel Studio,
MKL, …
Demo at
ISC‘17
2017 Lenovo. All rights reserved.
25
Lenovo OpenSource Initiatives and OpenHPC
• Status of Lenovo OpenHPC
Engagement
– Since May 2016, OpenHPC as an
organization have been having Technical
Steering Committee (TSC) meetings to
help create processes, rules and
procedures to help organize the group to
function smoothly.
- Meeting minutes are posted for
everyone to see
– xCAT and Confluent was submitted as
future system management components
into OpenHPC and this was accepted by
OpenHPC TSC in October 2016.
– Lenovo will help integrate and validate
xCAT and Confluent into OpenHPC
- Expected timeline is first half of 2017
• All Lenovo OpenSource HPC Engagements
– Antilles
– xCat
– Confluent
– Singularity/Capsules
Base (OS* without
Virtualized Network)
Libs/Tools App
requires
Application Space
2017 Lenovo. All rights reserved.
26
Antilles - Simplified Web Portal for HPC
• Antilles – Web Portal Goals
– Allow users to access the system
from any client environment
– Enable easy deployment of jobs to
the cluster
– Give a clean and simplified view of
jobs running on the cluster
• Integration with xCAT/Confluent
– Built on xCAT/Confluent
– Paves the way for future
enhancements of xCAT/Confluent
– Delivers a graphical user
experience
– Allows easier access for operators
and system administrators
2017 Lenovo. All rights reserved.
27
Confluent @Lenovo Stuttgart Innovation Center
2017 Lenovo. All rights reserved.
2828
2017 Lenovo confidential. All rights reserved.
LENOVO INNOVATIONS
Key Component of our overall Data Center Success
29
A Strong Footprint for Lenovo R&D
• $1.7B investment globally
• 8,000 Global Researchers/Engineers
2-tier R&D System2-tier R&D System
Corporate Level
BU Level
Research & Technology
BU development
R&T
 Leading Innovation
 Core Technology
 New Opportunity Incubation
 Build Tech Platform
BU development
 Lead System Integration
 Product Development
 Engineering & Quality Control
 Systems Technology Innovation
Center (STIC)
Japan
Raleigh
Beijing
Lenovo Development (& Systems Technology Innovation Center)
Chicago
Europe
Stuttgart
Silicon Valley
Shenzhen
Hong Kong
Shanghai
Brazil
Israel
Israel
Lenovo Research & Technology (co-located with
development)
Raleigh
2017 Lenovo. All rights reserved.
30
Forging Dreams into Reality
• HPC Innovation Centers
– Focused HPC team bringing best of breed
x86 technology and excellent talent with
passion to innovate
– Collaboration is Key to Success
– Industry leaders bring together the newest
technology and skills
– Visionary client partners bring focused
knowledge & deep skills in specific areas
of science
• Parallel Benchmarking System in
Stuttgart, Germany
– A benchmarking system available to
Lenovo, our clients and our partners.
– A broad ecosystem of technologies &
skills to help clients and partners
move/optimize/ demo their applications
Research Triangle Park Stuttgart Beijing
2017 Lenovo. All rights reserved.
31
Lenovo HPC Innovation Center Stuttgart - Partners
HPC Innovation
Center Europe
FZJ – HPC Storage Technologies
LRZ – Energy Efficient Systems
Oxford – HPC Software Stack
MPCDF – Advancing Material Science
STFC Hatree – New Technologies (ARM)
BSC – Capsules, Energy Runtime & Big Data
CINECA – ManyCores and FET4HPC
IBM – File System and Workload Mgmt
Mellanox – High Speed Networking
SUSE – Operating System
DDN – Lustre solutions
Samsung – High Performance Memory
Nvidia – Graphic Processing Acceleration
Seagate – Storage subsystems
NICE – Remote Visualization
Intel – Scalable Systems Framework
Allinea – Performance Analytics
Technology Partners (HW/SW) Client Partners and Projects
Focused knowledge and deep skills
advance the science of HPC
ARC – Open Source & Energy Efficiency
CERFACS – Code optimzation (AVBP)
2017 Lenovo. All rights reserved.
32
AI Innovation Center
• Goals
• Demonstrate leadership by collaborating with universities,
partners, and customers making advances in AI
• Make customers experience the value of AI through
demos
• Activities
• Give access to the latest hardware and software
• Provide training on machine/deep learning by experts
from Lenovo and partners
• Give visibility at important AI events
• Demonstrating AI applications to wider audience for
impact
• Demos at our briefing centers, events etc.
• Location
• Physical labs in Stuttgart and Morrisville. WW Remote
access
2017 Lenovo. All rights reserved.
33
Lenovo HPC & AI platform concept
2016 Lenovo
C/C++
Caffe
NVIDIA cuDNN
NVIDIA CUDA Intel® Math Kernal Library
Intel® MKL-DNN
Frameworks
Developer tools
Low-level libraries
System management
Hardware
Intel Deep Learning
SDKProgramming resources
Big data analytics Voice
recognition
Image/video
recognition
Text
recognition
Language
processing
Applications
Healthcare Finance Academia Hyperscale Manufacturing
Industry
verticals
Lenovoplatformfor
trainingworkloads
xCAT/Confluence/ Antilles
Lenovo Open HPC
2017 Lenovo. All rights reserved.
3434
2017 Lenovo confidential. All rights reserved.
LENOVO SERVICE
35
Services
• Implementation
– Project Management
– Physical Setup
– Software Setup
• Operation Support
– Onsite Managed Service
– Remote Managed Service
• Application Support
– Onsite Application Service
• Maintenance Support
– Based on the classification of components
2017 Lenovo. All rights reserved.
36
Services – Continuing a successful way
• We will subcontract IBM for maintenance services during whole contract lifecycle
– You still have ONE contract partner - Lenovo
Customer
opens a call
• via phone
• via internet
IBM Frontdesk
Searches for already
known solutions
Routes to 2nd/3rd level
2nd / 3rd Level
Finds the root cause
Routes to field service to
replace failing device
Field Service
Replaces failing device
Lenovo quality control / Customer satisfaction
= Tasks subcontracted to IBM = Tasks done by Lenovo
2017 Lenovo. All rights reserved.
Lenovo HPC Strategy Update

Lenovo HPC Strategy Update

  • 1.
    Lenovo HPC Strategy Lenovo Luigi Brochard– Distinguished Engineer, WW HPC & AI, DCPG
  • 2.
    2 Lenovo Proven T1HPC Partner Co-Innovation with LRZ First WARM WATER COOLED SUPER COMPUTER Number 12 on TOP500 World’s #3 KNL/OPA System SUPERCOMPUTER IN EMEA 2nd Largest #2 WW – 99 listings #1 China Multiple PFLOP+ SKL wins Fastest GROWING TOP 500 HPC VENDOR 2017 Lenovo. All rights reserved.
  • 3.
    3 Lenovo Goal –Be the #1 Trusted HPC Partner • MFG/Supply Chain • Energy Mgmt • DC IoT • Warm Water Cooling • Benchmark Centers • Innovation centers • Deep Skills/Experts • STIC and R&T • Global arch. reviews • Open source focus • Partner openess • Open/Shared IP • Lead w/ communities Limited Budgets; Higher Demands Co-Design is Mandatory Resurgence of Specialization Open Everything • Artifical Intelligence • Modular designs • SW Defined HPC • Be fast and agile 2017 Lenovo. All rights reserved.
  • 4.
    4 Machine Learning workflowrequires specialized architectures for each process step AI and Machine Learning is part of HPC Lenovo Data Train Deploy model Classification Detection Segmentation Big data • Storage • Preparation • Feeding • Hadoop/Spark Training • HPC characteristics - Compute intensive - Scale up and out - Accelerators/high- speed networks Inference/Scoring • Enterprise/device computing • Scale out architectures • Xeon+FPGA Input 2017 Lenovo. All rights reserved.
  • 5.
    552017 Lenovo confidential.All rights reserved. HPC PORTFOLIO AND ROADMAP
  • 6.
    6 Lenovo Scalable Infrastructure(LeSI) Lenovo Scalable Infrastructure (LeSI) is a framework for development, configuration, build, delivery and support of integrated data center solutions • Complete HPC data center portfolio with the best-of-breed partner technology • Collaborate on OpenSource HPC software in true commitment to Openess • End-to-end expert-designed, tested, integrated and supported HPC solutions Software Options Networking Storage Server Product Portfolio Scalable Infrastructure Scalable Infrastructure Components HPC Network systems/options Infrastructure options Scalable Infrastructure Solutions ThinkAgile HPC GPFS Storage Server OpenSource HPC Software (LiCO, xCAT/Confluent, Antilles, ...) Distributed Storage Solution for IBM Spectrum Scale 2017 Lenovo. All rights reserved.
  • 7.
    7 From Complete Portfolioto Results Driven DesignsThinkAgile HPC Best of the Lenovo Portfolio ThinkAgile HPC Scalable Infrastructure OEM Portfolio Scalable Infrastructure HPC Support Factory Integration Services + + + = 1410 physical rack ship racked/cabled testing power & reduncancy, p2p cabling, best recipe, factory settings, run Linpack and load client xCat image Testing hardware and software interoperability resulting in best recipe code level: SLES, RHEL MOFED, IFS xCat/Confluent, Spectrum Scale Bringing in best-of- breed HPC partner components, especially HPC switches and adapters • Mellanox FDR/ EDR Infiniband • Intel Omnipath 100 Series Selection from off- the-shelf HPC proven Lenovo Portfolio • Server • Storage • Networking • Options • Software A single cluster, end2end designed & configured, built and shipped ready to accept the client images through xCAT or alternatives. Value Add Choice: Installation Services Managed Services Full HPC SW Suite Customizing solutions for each client’s unique needs 2017 Lenovo. All rights reserved.
  • 8.
    8 End to endservices that span from basic to consultative engagements Lenovo DCG Portfolio Spans the Entire Data Center SAP BWA Cloud Big Data Client Virtualization Database & Analytics Rack Scale Arch. Application Optimized Compute & Warm Storage Project Scorpio 1.Solution-centric approach to IT 2.Rack-level solution 3.Ease of config, deploy, manage 4.Seamless mgmt. framework 5.Optimized ROI by workload 6.Reduced OPEX w/ consolidated, familiar mgmt. tools Discrete offerings 1.Broad datacenter offerings; servers, storage, networking 2.Highest configuration flexibility 3.Open and optional networking 4.Simplified and open mgmt World Class, end-to-end Data Center portfolio, delivered as Discrete or as Integrated offerings Engineered Solutions HPC/AI Hyperscale Server • Rack & Tower • Mission Critical: X6 • Dense: NeXtScale & ThinkServer • Blades: Flex System Storage • SAN: S Series, V Series • SDS: DX8000C, DX8000N • AFA: Coming soon • Scale-out: GSS • Direct Attach & Archive Hyper-converged • ROBO: HX1000 • SMB: HX2000 • Compute Heavy: HX3000 • Storage Heavy: HX5000 • High-Performance: HX7000 Networking • Embedded • Top of Rack • Campus and Core • Storage Switches • Networking OS CAE & EDA Weather&Climate Machine Learning Academia Cluster & Storage 2017 Lenovo. All rights reserved.
  • 9.
    9 Moving forward intoa decade of dense HPC iDataPlex – Air / DWC dx360 - Harpertown dx360M2 - Nehalem dx360M3 - Westmere dx360M4 - Sandy Bridge NeXtScale – Air / DWC nx360M4 - Sandy Bridge/Ivy Bridge nx360M5 - Haswell/Broadwell Stark – Air sd530 - Skylake/Icelake NeXtScale – DWC sd650- Skylake/Icelake 2017 Lenovo. All rights reserved.
  • 10.
    10 Deliver Results FasterWith HPC Challenge What is Possible Best cost, speed and skill Deliver Unique Thinking Helping Researchers and Companies Make Immediate Impact 2017 Lenovo. All rights reserved.
  • 11.
    11112017 Lenovo confidential.All rights reserved. COOLING TECHNOLOGIES
  • 12.
    122017 Lenovo. Allrights reserved. Lenovo Choice of Cooling  Standard air flow with internal fans  Fits in any datacenter  Maximum flexibility  Broadest choice of configurable options supported  Supports Native Expansion nodes (Storage NeX, PCI NeX) PUE ~2 – 1.5 ERE ~2 – 1.5  Air cool, supplemented with RDHX door on rack  Uses chilled water with economizer (18°C water)  Enables extremely tight rack placement PUE ~1.4 – 1.2 ERE ~ 1.4 – 1.2  Direct water cooling without fans  Higher performance per watt  Free cooling (inlet up to 50°C water)  Energy re-use  Densest footprint and high TDP SKU  Ideal for geos with high electricity costs and new data centers PUE ~ 1.1 ERE < 1 (with hot water) Direct Water CooledAir Cooled Indirect Water Cooled Choose for broadest choice of customizable options Choose for highest performance and energy efficiency Choose for compromise between flexibility and energy efficiency
  • 13.
    132017 Lenovo. Allrights reserved. NeXtScale System with Water Cool Technology (WCT) • Water Cooled Node & Chassis – Full wide, 2-node compute tray – 6U Chassis, 6 bays (12 nodes/chassis) – Manifolds deliver water directly to nodes – Water circulated through cooling tubes for component level cooling – Intel E5-2600 v4 CPUs – 16x DDR4 DIMM slots – InfiniBand FDR/EDR and OPA support – 6x 900W or 1300W PSU – No fans except PSUs – Drip sensor / Error LEDs  Up to 85% heat to water ratio nx360 M5 WCT Compute Tray (2 nodes) Power, LEDsDual-port ML2 (IB/Ethernet) Labeling tag 1 GbE ports PCI slot for Connect IB CPU with liquid cooled heatsink Cooling tubes x16 DIMMs 1 GbE ports n1200 WCT Enclosure 6 full wide bays 12 compute nodes n1200 WCT Manifold
  • 14.
    142017 Lenovo. Allrights reserved. DWC reduces Processor Temperature on Xeon 2697v4 Conclusion: Direct Water Cooling lowers processor power consumption by about 5% or allows higher performance NXT with 2 socket 2697v4, 128 GB 2400 MHz DIMM Inlet Water temperature is 28°C DC power and energy is measured at node level
  • 15.
    15 Value of DirectWater Cooling • Any temperature: – Lower processor power consumption (~ 6%), No fan per node ( ~4%) – Higher TDP processor – Higher performance ( ~5%) - In this case, processor power consumption is not reduced • Hot water temperature – Provides free cooling all year long - ~ 20% power savings on compression chillers to generate chilled water - potentially no/less chillers DC energy is measured through aem DC energy accumulator 2017 Lenovo. All rights reserved.
  • 16.
    16 TCO: return oninvestment for DWC vs RDHx • New data centers: Water cooling has immediate payback. • Existing air-cooled data center payback period strongly depends on electricity rate DWC RDHx $0.06/kWh $0.12/kWh $0.20/kWh New Existing Existing Existing 2017 Lenovo. All rights reserved.
  • 17.
    17 Lenovo references withDWC (2012-2016) 18.000 nodes up and running with DWC technology from Lenovo ! 2017 Lenovo. All rights reserved. Sites Nodes Country Instal date Max. In. Water LRZ SuperMUC 9298 Germany 2012 45°C LRZ SuperMUC 2 3096 Germany 2014 45°C LRZ SuperCool2 438 Germany 2015 50°C NTU 40 Singapore 2012 45°C Enercon 136 Germany 2013 45°C US Army 756 Hawai 2013 45°C Exxon Research 504 NA 2014 45°C NASA Goddard 80 NA 2014 45°C PIK 312 Germany 2015 45°C KIT 1152 Germany 2015 45°C Birmingham U ph1 28 UK 2015 45°C Birmingham U ph2 132 UK 2016 45°C T-Systems 316 Germany 2016 45°C MMD 296 Malaysia 2016 45°C UNINET 964 Norway 2016 45°C Peking U 204 China 2017 45°C
  • 18.
    1818 2017 Lenovo confidential.All rights reserved. LENOVO HPC STORAGE Key Component of our overall Data Center Success
  • 19.
    19 Storage is Essentialand We Have the Tools Taking our off-the-shelf server and storage portfolio marrying it with leading HPC Storage Software Distributed Storage Solution for IBM Spectrum Scale Defined Solution especially for large capacity, high performance workloads in HPC environments Distributed Storage Architecture for Intel Lustre EE Tested architecture as entry point and mid range Lustre offering in HPC environments. Distributed Storage Solution for SUSE Enterprise Storage Defined Solution especially for interaction with Lenovo scale-out HANA solutions.* Distributed Storage Architecture for SUSE Enterprise Storage / Red Hat Ceph Storage Tested architecture as entry point and mid range CEPH offering in HPC environments. Future PlansMid April Announcement DSS - G DSS - C DSA - LDSA - C *formerly published as „ThinkStorage for SAP HANA TDI“2017 Lenovo. All rights reserved.
  • 20.
    20 DSS Product Overview Afully integrated data center solution fulfilled through Lenovo Scalable Infrastructure (LeSI) Distributed Storage Solution for IBM Spectrum ScaleTM DSS is an LeSI solution for a selection of file and object storage offerings delivering high storage density and I/O performance with superior availability, reliability and resiliency. Project DSS-G for IBM Spectrum Scale™ Solution Definition  2 x3650 M5 HPIO Servers  Software: RedHat Enterprise Linux, IBM Spectrum Scale for DSS Standard or Data Management Edition  1-6 Storage Enclosures  Lenovo D3284 12Gb JBOD (5U84) or  4TB, 6TB, 8TB, 10TB  Lenovo D1224 12Gb JBOD (2U24)  0.3TB – 1.8TB SAS, 0.4TB – 3.8TB SSD  Connectivity: 10GbE//25GbE/40GbE/100GbE/FDR IB/EDR IB/OPA Target Market  HPC, BigData, Cloud Target Workloads  HPC and Distributed File Systems Value Proposition  High storage density and I/O performance with superior availability, reliability and resiliency License Model  Licensed by drive/capacity, no add. server/client licenses Target GEO  World Wide Product Target Launch  Q2 2017 (SC’16 Public Disclosure 11/14) – Mid April Avail. DSS-G x3650 M5 D3284 2017 Lenovo. All rights reserved.
  • 21.
    21 DSS-G Overview Distributed StorageSolution Configurations Examples for IBM Spectrum ScaleTM x3650M5 HPIO x3650M5 HPIO D3284 D3284 D3284 D3284 D3284 D3284 D3284 D3284 670 x NL-SAS DSS G280 x3650M5 HPIO x3650M5 HPIO D3284 D3284 D3284 D3284 D3284 D3284 502 x NL-SAS DSS G260 x3650M5 HPIO x3650M5 HPIO D3284 D3284 D3284 D3284 334 x NL-SAS DSS G240 x3650M5 HPIO x3650M5 HPIO D3284 D3284 164 x NL-SAS DSS G220 x3650M5 HPIO x3650M5 HPIO x3650M5 HPIO x3650M5 HPIO D1224 D1224 D1224 D1224 D1224 D1224 DSS G204DSS G202 x3650M5 HPIO x3650M5 HPIO D1224 D1224 D1224 D1224 DSS G206 D1224 D1224 SSD / SAS Option for High Performance / IOPS HPIO = High Performance I/O Future Announce 2017 Lenovo. All rights reserved.
  • 22.
    22 Intel EE Lustreon Lenovo S-Series Storage X3550M5 Intel Lustre MDS Cluster S3200 12GB SAS X3550M5 Lustre Management Intel Manager for Lustre This module consists of a pair of clustered servers, loaded with Linux and the Lustre Metadata (MDS) component. The servers connect to a dedicated S3200 array via SAS connections, and to the file serving network via high speed InfiniBand. 10/40 Gb Ethernet Infiniband / OPAOR X3650M5 Intel Lustre OSS Cluster 12GB SAS S3200 E1012 E1012 E1012 E1012 E1012 E1012 E1012 Cntrl A Cntrl B 12GB SAS S3200 E1012 E1012 E1012 E1012 E1012 E1012 E1012 Cntrl A Cntrl B 2 SAS 2 SAS 2 SAS 2 SAS RAID6 8+2P+4 Global Spares/Build. Block Enclosures Drive Size Raw Nett 192 Total Drives 16 TB TB TB 180 RAID drives Drives Raw 2 384 304 8 Global Spares 192 4 768 608 4 Rest Drives Nett 6 1152 912 152 8 1536 1216 18 RAID Stripes2017 Lenovo. All rights reserved.
  • 23.
    2323 2017 Lenovo confidential.All rights reserved. LENOVO HPC SOFTWARE Key Component of our overall Data Center Success
  • 24.
    24 Lenovo Suite forThinkAgile HPC Cluster Lenovo OpenSource HPC Stack An OpenSource IaaS suite to run and manage optimally and transparently HPC, Big Data and Workflows on a virtualized infrastructure adjusting dynamically to user and datacenter needs through energy policies • Bundle best-of-breed software • Enhance with Lenovo configuration, plugins and scripts • Add in Energy Aware run time • All open and flexible A ready to use HPC Stack * collaborating w/ Greg Kurtzer from Lawrence Berkeley National Lab Lenovo HPC Stack Open Source / 3rd Party SW Lenovo / 3rd Party HW Antilles Web Console GUI Server Storage Network DataCenter Ganglia, Icinga SLURM, Torque, Maui xCAT / Confluent OpenMPI, MVAPICH, MPICH Lustre, CEPH, Scale (GPFS) OFED Containers / Singularity * RedHat, CentOS, SLES Antilles Management Deployment guide / scripts Admin guide / scripts Customer Application Eclipse PTP, debugger, gdb GCC, Intel Parallel Studio, MKL, … Demo at ISC‘17 2017 Lenovo. All rights reserved.
  • 25.
    25 Lenovo OpenSource Initiativesand OpenHPC • Status of Lenovo OpenHPC Engagement – Since May 2016, OpenHPC as an organization have been having Technical Steering Committee (TSC) meetings to help create processes, rules and procedures to help organize the group to function smoothly. - Meeting minutes are posted for everyone to see – xCAT and Confluent was submitted as future system management components into OpenHPC and this was accepted by OpenHPC TSC in October 2016. – Lenovo will help integrate and validate xCAT and Confluent into OpenHPC - Expected timeline is first half of 2017 • All Lenovo OpenSource HPC Engagements – Antilles – xCat – Confluent – Singularity/Capsules Base (OS* without Virtualized Network) Libs/Tools App requires Application Space 2017 Lenovo. All rights reserved.
  • 26.
    26 Antilles - SimplifiedWeb Portal for HPC • Antilles – Web Portal Goals – Allow users to access the system from any client environment – Enable easy deployment of jobs to the cluster – Give a clean and simplified view of jobs running on the cluster • Integration with xCAT/Confluent – Built on xCAT/Confluent – Paves the way for future enhancements of xCAT/Confluent – Delivers a graphical user experience – Allows easier access for operators and system administrators 2017 Lenovo. All rights reserved.
  • 27.
    27 Confluent @Lenovo StuttgartInnovation Center 2017 Lenovo. All rights reserved.
  • 28.
    2828 2017 Lenovo confidential.All rights reserved. LENOVO INNOVATIONS Key Component of our overall Data Center Success
  • 29.
    29 A Strong Footprintfor Lenovo R&D • $1.7B investment globally • 8,000 Global Researchers/Engineers 2-tier R&D System2-tier R&D System Corporate Level BU Level Research & Technology BU development R&T  Leading Innovation  Core Technology  New Opportunity Incubation  Build Tech Platform BU development  Lead System Integration  Product Development  Engineering & Quality Control  Systems Technology Innovation Center (STIC) Japan Raleigh Beijing Lenovo Development (& Systems Technology Innovation Center) Chicago Europe Stuttgart Silicon Valley Shenzhen Hong Kong Shanghai Brazil Israel Israel Lenovo Research & Technology (co-located with development) Raleigh 2017 Lenovo. All rights reserved.
  • 30.
    30 Forging Dreams intoReality • HPC Innovation Centers – Focused HPC team bringing best of breed x86 technology and excellent talent with passion to innovate – Collaboration is Key to Success – Industry leaders bring together the newest technology and skills – Visionary client partners bring focused knowledge & deep skills in specific areas of science • Parallel Benchmarking System in Stuttgart, Germany – A benchmarking system available to Lenovo, our clients and our partners. – A broad ecosystem of technologies & skills to help clients and partners move/optimize/ demo their applications Research Triangle Park Stuttgart Beijing 2017 Lenovo. All rights reserved.
  • 31.
    31 Lenovo HPC InnovationCenter Stuttgart - Partners HPC Innovation Center Europe FZJ – HPC Storage Technologies LRZ – Energy Efficient Systems Oxford – HPC Software Stack MPCDF – Advancing Material Science STFC Hatree – New Technologies (ARM) BSC – Capsules, Energy Runtime & Big Data CINECA – ManyCores and FET4HPC IBM – File System and Workload Mgmt Mellanox – High Speed Networking SUSE – Operating System DDN – Lustre solutions Samsung – High Performance Memory Nvidia – Graphic Processing Acceleration Seagate – Storage subsystems NICE – Remote Visualization Intel – Scalable Systems Framework Allinea – Performance Analytics Technology Partners (HW/SW) Client Partners and Projects Focused knowledge and deep skills advance the science of HPC ARC – Open Source & Energy Efficiency CERFACS – Code optimzation (AVBP) 2017 Lenovo. All rights reserved.
  • 32.
    32 AI Innovation Center •Goals • Demonstrate leadership by collaborating with universities, partners, and customers making advances in AI • Make customers experience the value of AI through demos • Activities • Give access to the latest hardware and software • Provide training on machine/deep learning by experts from Lenovo and partners • Give visibility at important AI events • Demonstrating AI applications to wider audience for impact • Demos at our briefing centers, events etc. • Location • Physical labs in Stuttgart and Morrisville. WW Remote access 2017 Lenovo. All rights reserved.
  • 33.
    33 Lenovo HPC &AI platform concept 2016 Lenovo C/C++ Caffe NVIDIA cuDNN NVIDIA CUDA Intel® Math Kernal Library Intel® MKL-DNN Frameworks Developer tools Low-level libraries System management Hardware Intel Deep Learning SDKProgramming resources Big data analytics Voice recognition Image/video recognition Text recognition Language processing Applications Healthcare Finance Academia Hyperscale Manufacturing Industry verticals Lenovoplatformfor trainingworkloads xCAT/Confluence/ Antilles Lenovo Open HPC 2017 Lenovo. All rights reserved.
  • 34.
    3434 2017 Lenovo confidential.All rights reserved. LENOVO SERVICE
  • 35.
    35 Services • Implementation – ProjectManagement – Physical Setup – Software Setup • Operation Support – Onsite Managed Service – Remote Managed Service • Application Support – Onsite Application Service • Maintenance Support – Based on the classification of components 2017 Lenovo. All rights reserved.
  • 36.
    36 Services – Continuinga successful way • We will subcontract IBM for maintenance services during whole contract lifecycle – You still have ONE contract partner - Lenovo Customer opens a call • via phone • via internet IBM Frontdesk Searches for already known solutions Routes to 2nd/3rd level 2nd / 3rd Level Finds the root cause Routes to field service to replace failing device Field Service Replaces failing device Lenovo quality control / Customer satisfaction = Tasks subcontracted to IBM = Tasks done by Lenovo 2017 Lenovo. All rights reserved.