SlideShare a Scribd company logo
OpenHPC: A Cohesive and
Comprehensive System Software Stack
The Time is Right
IDC HPC User Forum
April 19, 2017
Dr. Robert W. Wisniewski
Chief Software Architect Extreme Scale Computing
Senior Principal Engineer, Intel
1
2
• Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or
service activation. Learn more at intel.com, or from the OEM or retailer.
• No computer system can be absolutely secure.
• Software and workloads used in performance tests may have been optimized for performance only on Intel
microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems,
components, software, operations and functions. Any change to any of those factors may cause the results to vary. You
should consult other information and performance tests to assist you in fully evaluating your contemplated purchases,
including the performance of that product when combined with other products. For more complete information visit
http://www.intel.com/performance.
• Cost reduction scenarios described are intended as examples of how a given Intel- based product, in the specified
circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does
not guarantee any costs or cost reduction.
• No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document.
• Intel, the Intel logo and others are trademarks of Intel Corporation in the U.S. and/or other countries.
• *Other names and brands may be claimed as the property of others.
• © 2017 Intel Corporation.
Legal Notices and Disclaimers
Agenda
•Trends and Challenges
•OpenHPC Vision
•OpenHPC Architecture and Implementation of Stack
•Intel® HPC Orchestrator
•Questions
3
Trends and Challenges
• Talk is software focused, but
–Hardware: scale, power, reliability, network bandwidth and latency, memory
bandwidth and latency
• Complexity of software stack
–Increase in classical HPC computing
–Richer environments, python
–New models: UQ, workflows
–Introduction of big data and analytics
–BDEC
–Multi tenancy
–Need for new frameworks
–AI: ML/DL
–Cloud 4
Trends and Challenges  Needs
• Talk is software focused, but
–Hardware: Value of co-designing and integrating cores, network, and memory
•Complexity drives the need to integrate and provide a coherent and
comprehensive system software stack rather than a bag of parts
–More components
–More components lead to a greater potential for incompatibilities
–Co-design applies within system software also
–Need to test and continuously integrate
–Over time fewer organizations could assemble the whole stack
–Increasing time going to just standing up
–Versus focusing on mission needs
5
Overview
Status
Governance:
 Technical steering committee active
since June 2016
 Technical submission process
published
 Currently there are 30 official members
in Platinum, Silver, Academic and
Technical Committees
 Held first face to face meeting, post-
formation, in June 2016 at ISC, one at
SC 2016, and one planned for ISC 2017
Goals
 Provide a common SW platform to the HPC
community that works across multiple
segments and on which end-users can
collaborate and innovate
 Simplify the complexity of installation,
configuration, and ongoing maintenance of a
custom software stack
 Receive contributions and feedback from
community to drive innovation
 Enable developers to focus on their
differentiation and unique value, rather than
having to spend on developing, testing, and
maintaining a core stack
 Deliver integrated hardware and software
innovations to ease the path to exascale
6Courtesy of OpenHPC*
*Other names and brands may be claimed as the property of others.
Background Motivation for Community Effort
7Courtesy of OpenHPC*
• Many sites spend considerable effort aggregating a large suite of open-
source projects to provide a capable HPC environment for their users:
–necessary to build/deploy HPC focused packages that are either absent or do not keep
pace from distro providers
–local packaging or customization frequently tries to give software versioning access to
users (e.g. via modules or similar equivalent)
–hierarchal packaging necessary for multiple compiler/mpi families
• On the developer front, many successful projects must engage in continual
triage and debugging regarding configuration and installation issues on HPC
systems
*Other names and brands may be claimed as the property of others.
What is OpenHPC?
8Courtesy of OpenHPC*
• OpenHPC is a community effort endeavoring to:
–provide collection(s) of pre-packaged components that can be used to
help install and manage flexible HPC systems throughout their lifecycle
–leverage standard Linux delivery model
to retain admin familiarity (ie. package repos)
–allow and promote multiple system configuration recipes that leverage
community reference
designs and best practices
–implement integration testing to
gain validation confidence
–provide additional distribution
mechanism for groups releasing
open-source software
–provide a stable platform for
new R&D initiatives
Install Guide - CentOS7.1 Version (v1.0)
eth1eth0
Data
Center
Network
high speed network
tcp networking
to compute eth interface
to compute BMC interface
compute
nodes
Lustre* storage system
Master
(SMS)
Figure 1: Overview of physical cluster architecture.
Typical cluster
architecture
*Other names and brands may be claimed as the property of others.
OpenHPC: Mission and Vision
9Courtesy of OpenHPC*
• Mission: to provide a reference collection of open-source HPC software
components and best practices, lowering barriers to deployment, advancement,
and use of modern HPC methods and tools.
• Vision: OpenHPC components and best practices will enable and accelerate
innovation and discoveries by broadening access to state-of-the-art, open-source
HPC methods and tools in a consistent environment, supported by a collaborative,
worldwide community of HPC users, developers, researchers, administrators, and
vendors.
*Other names and brands may be claimed as the property of others.
OpenHPC: Project Members
10Courtesy of OpenHPC*
Project member participation interest?
Contact Kevlin Husser or Jeff ErnstFriedman
jernstfriedman@linuxfoundation.org
Mixture of Academics, Labs, OEMs, and ISVs/OSVs
• Argonne National Laboratory • Center for Research in Extreme Scale Technologies
– Indiana University
30 Members
OpenHPC is a Linux Foundation Project
initiated by Intel and gained wide
participation right away
The goal is to collaboratively advance
the state of the software ecosystem
Governing board is composed of
Platinum members (Intel, Dell, HPE,
SUSE) plus reps from Silver &
Academic, Technical committees
WWW.OpenHPC.Community
*Other names and brands may be claimed as the property of others.
• University of Cambridge
Repository server metrics: monthly visitors
11Courtesy of OpenHPC*
v1.0
v1.0.1
v1.1
v1.1.1
v1.2
v1.2.1
v1.3
0
500
1000
1500
2000
Jul-15 Oct-15 Jan-16 May-16 Aug-16 Nov-16 Mar-17
#ofUniqueVisitors
Build Server Access: Unique Visitors
Releases
*Other names and brands may be claimed as the property of others.
Intel® HPC Orchestrator Modular View
• Intra-stack APIs to allow for customization/differentiation (OEMs enabling)
• Defined external APIs for consistency across versions (ISVs)
Node-specific OS Kernel(s)
Linux* Distro Runtime Libraries
Overlay & Pub-sub Networks, Identity
User Space
Utilities
SW
Development
Toolchain
Compiler &
Programming
Model
Runtimes
High
Performance
Parallel
LibrariesScalable
Debugging
& Perf
Analysis
Tools
Optimized
I/O
Libraries
I/O
Services
Data
Collection
And
System
Monitors
Workload
Manager
Resource
Mgmnt
Runtimes
DB
Schema
Scalable
DB
SystemManagement
(Confi,Inventory)
Provisioning
SystemDiagnostics
FabricMgmnt
Operator Interface Applications (not part of initial stack)
ISV Applications
Hardware
12*Other names and brands may be claimed as the property of others.
OpenHPC v1.3 - Current S/W components
13Courtesy of OpenHPC*
Functional Areas Components
Base OS CentOS 7.3, SLES12 SP2
Architecture x86_64, aarch64 (Tech Preview)
Administrative Tools
Conman, Ganglia, Lmod, LosF, Nagios, pdsh, prun, EasyBuild, ClusterShell,
mrsh, Genders, Shine, Spack, test-suite
Provisioning Warewulf
Resource Mgmt. SLURM, Munge, PBS Professional
Runtimes OpenMP, OCR
I/O Services Lustre client (community version)
Numerical/Scientific
Libraries
Boost, GSL, FFTW, Metis, PETSc, Trilinos, Hypre, SuperLU, SuperLU_Dist,
Mumps, OpenBLAS, Scalapack
I/O Libraries HDF5 (pHDF5), NetCDF (including C++ and Fortran interfaces), Adios
Compiler Families GNU (gcc, g++, gfortran)
MPI Families MVAPICH2, OpenMPI, MPICH
Development Tools Autotools (autoconf, automake, libtool), Valgrind,R, SciPy/NumPy
Performance Tools PAPI, IMB, mpiP, pdtoolkit TAU, Scalasca, ScoreP, SIONLib
Notes:
• Additional dependencies
that are not provided by
the BaseOS or community
repos (e.g. EPEL) are also
included
• 3rd Party libraries are built
for each compiler/MPI
family
• Resulting repositories
currently comprised of
~300 RPMs
Future additions approved
for inclusion:
• BeeGFS client
• hwloc
• Singularity
• xCAT
*Other names and brands may be claimed as the property of others.
Intel® HPC Orchestrator Framework
14
University
Community
OEM
CommunityGNU
Linux
Parallel File system
Upstream
source
Communities
Resource Manager
Upstream
source
Communities
Upstream
source
Communities
Upstream
source
Communities
Integrates and
tests HPC
stacks and
makes them
available as OS
Base
HPC Stack
OEM
Stack
University
Stack
Contributors include
Intel, OEMs, ISVs,
labs, academia
RRV
RRV
RRV
RRV
RRVs
Continuous Integration Environment
-Build Environment & Source Control
-Bug Tracking
-User & Dev Forums
-Collaboration tools
-Validation Environment
Cadence 6~12 mo
“RRV” = Relevant and Reliable Version
Intel® HPC Orchestrator
Core HPC Stack
PRODUCT
Supported HPC Stack
-Premium Features
-Advanced Integration Testing
-Testing at scale
-Validated updates
-Level3 Support across stack
OEM
Stack
PROJECT
*Other names and brands may be claimed as the property of others.
Base Stack and Derivatives
- Sufficient
performance
and scalability
- Ease of Install
- Performance &
scalability
- Energy efficiency
- Ease of use &
administration
- Auto-configuration
Ease of
administration
across multiple
tiers in the same
data center
Common Core
(same across all offerings)
Additions Targeting High End Market
Additions Targeting Volume Market
“CUSTOM”
AdditionsTargetingTop500
&Verticals
Provides
“TURNKEY”
Provides
15
“ADVANCED”
Conclusion
•Trends and challenges led to the need for a cohesive and
comprehensive system software stack for HPC
•OpenHPC provides a vehicle that facilitates collaboration,
removes replicative work, and provides a more efficient
ecosystem
•Intel® HPC Orchestrator provides a supported version of
OpenHPC analogous to CentOS and RHEL with three tiers for
different computing needs
•OpenHPC is gaining momentum with increased contributions
16
*Other names and brands may be claimed as the property of others.

More Related Content

What's hot

Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
Vietnam Open Infrastructure User Group
 
Apache Kafka Architecture & Fundamentals Explained
Apache Kafka Architecture & Fundamentals ExplainedApache Kafka Architecture & Fundamentals Explained
Apache Kafka Architecture & Fundamentals Explained
confluent
 
Prometheus in openstack-helm
Prometheus in openstack-helmPrometheus in openstack-helm
Prometheus in openstack-helm
성일 임
 
Performance Wins with eBPF: Getting Started (2021)
Performance Wins with eBPF: Getting Started (2021)Performance Wins with eBPF: Getting Started (2021)
Performance Wins with eBPF: Getting Started (2021)
Brendan Gregg
 
오픈스택 기반 클라우드 서비스 구축 방안 및 사례
오픈스택 기반 클라우드 서비스 구축 방안 및 사례오픈스택 기반 클라우드 서비스 구축 방안 및 사례
오픈스택 기반 클라우드 서비스 구축 방안 및 사례
SONG INSEOB
 
Anatomy of a Container: Namespaces, cgroups & Some Filesystem Magic - LinuxCon
Anatomy of a Container: Namespaces, cgroups & Some Filesystem Magic - LinuxConAnatomy of a Container: Namespaces, cgroups & Some Filesystem Magic - LinuxCon
Anatomy of a Container: Namespaces, cgroups & Some Filesystem Magic - LinuxCon
Jérôme Petazzoni
 
High performance computing for research
High performance computing for researchHigh performance computing for research
High performance computing for research
Esteban Hernandez
 
RedHat Linux
RedHat LinuxRedHat Linux
RedHat Linux
Apo
 
Alphorm.com Formation Ansible : Le Guide Complet du Débutant
Alphorm.com Formation Ansible : Le Guide Complet du DébutantAlphorm.com Formation Ansible : Le Guide Complet du Débutant
Alphorm.com Formation Ansible : Le Guide Complet du Débutant
Alphorm
 
Cloud Computing: Overview and Examples
Cloud Computing: Overview and ExamplesCloud Computing: Overview and Examples
Cloud Computing: Overview and Examples
Eueung Mulyana
 
Apache Kafka Fundamentals for Architects, Admins and Developers
Apache Kafka Fundamentals for Architects, Admins and DevelopersApache Kafka Fundamentals for Architects, Admins and Developers
Apache Kafka Fundamentals for Architects, Admins and Developers
confluent
 
Prometheus and Grafana
Prometheus and GrafanaPrometheus and Grafana
Prometheus and Grafana
Lhouceine OUHAMZA
 
Service Discovery In Kubernetes
Service Discovery In KubernetesService Discovery In Kubernetes
Service Discovery In Kubernetes
Knoldus Inc.
 
Autoscale a self-healing cluster in OpenStack with Heat
Autoscale a self-healing cluster in OpenStack with HeatAutoscale a self-healing cluster in OpenStack with Heat
Autoscale a self-healing cluster in OpenStack with Heat
Rico Lin
 
High-Performance Networking Using eBPF, XDP, and io_uring
High-Performance Networking Using eBPF, XDP, and io_uringHigh-Performance Networking Using eBPF, XDP, and io_uring
High-Performance Networking Using eBPF, XDP, and io_uring
ScyllaDB
 
Notes on NUMA architecture
Notes on NUMA architectureNotes on NUMA architecture
Notes on NUMA architecture
Intel Software Brasil
 
Deep dive into Kubernetes Networking
Deep dive into Kubernetes NetworkingDeep dive into Kubernetes Networking
Deep dive into Kubernetes Networking
Sreenivas Makam
 
BPF: Tracing and more
BPF: Tracing and moreBPF: Tracing and more
BPF: Tracing and more
Brendan Gregg
 
Grafana introduction
Grafana introductionGrafana introduction
Grafana introduction
Rico Chen
 
Process Scheduler and Balancer in Linux Kernel
Process Scheduler and Balancer in Linux KernelProcess Scheduler and Balancer in Linux Kernel
Process Scheduler and Balancer in Linux Kernel
Haifeng Li
 

What's hot (20)

Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
Room 1 - 6 - Trần Quốc Sang - Autoscaling for multi cloud platform based on S...
 
Apache Kafka Architecture & Fundamentals Explained
Apache Kafka Architecture & Fundamentals ExplainedApache Kafka Architecture & Fundamentals Explained
Apache Kafka Architecture & Fundamentals Explained
 
Prometheus in openstack-helm
Prometheus in openstack-helmPrometheus in openstack-helm
Prometheus in openstack-helm
 
Performance Wins with eBPF: Getting Started (2021)
Performance Wins with eBPF: Getting Started (2021)Performance Wins with eBPF: Getting Started (2021)
Performance Wins with eBPF: Getting Started (2021)
 
오픈스택 기반 클라우드 서비스 구축 방안 및 사례
오픈스택 기반 클라우드 서비스 구축 방안 및 사례오픈스택 기반 클라우드 서비스 구축 방안 및 사례
오픈스택 기반 클라우드 서비스 구축 방안 및 사례
 
Anatomy of a Container: Namespaces, cgroups & Some Filesystem Magic - LinuxCon
Anatomy of a Container: Namespaces, cgroups & Some Filesystem Magic - LinuxConAnatomy of a Container: Namespaces, cgroups & Some Filesystem Magic - LinuxCon
Anatomy of a Container: Namespaces, cgroups & Some Filesystem Magic - LinuxCon
 
High performance computing for research
High performance computing for researchHigh performance computing for research
High performance computing for research
 
RedHat Linux
RedHat LinuxRedHat Linux
RedHat Linux
 
Alphorm.com Formation Ansible : Le Guide Complet du Débutant
Alphorm.com Formation Ansible : Le Guide Complet du DébutantAlphorm.com Formation Ansible : Le Guide Complet du Débutant
Alphorm.com Formation Ansible : Le Guide Complet du Débutant
 
Cloud Computing: Overview and Examples
Cloud Computing: Overview and ExamplesCloud Computing: Overview and Examples
Cloud Computing: Overview and Examples
 
Apache Kafka Fundamentals for Architects, Admins and Developers
Apache Kafka Fundamentals for Architects, Admins and DevelopersApache Kafka Fundamentals for Architects, Admins and Developers
Apache Kafka Fundamentals for Architects, Admins and Developers
 
Prometheus and Grafana
Prometheus and GrafanaPrometheus and Grafana
Prometheus and Grafana
 
Service Discovery In Kubernetes
Service Discovery In KubernetesService Discovery In Kubernetes
Service Discovery In Kubernetes
 
Autoscale a self-healing cluster in OpenStack with Heat
Autoscale a self-healing cluster in OpenStack with HeatAutoscale a self-healing cluster in OpenStack with Heat
Autoscale a self-healing cluster in OpenStack with Heat
 
High-Performance Networking Using eBPF, XDP, and io_uring
High-Performance Networking Using eBPF, XDP, and io_uringHigh-Performance Networking Using eBPF, XDP, and io_uring
High-Performance Networking Using eBPF, XDP, and io_uring
 
Notes on NUMA architecture
Notes on NUMA architectureNotes on NUMA architecture
Notes on NUMA architecture
 
Deep dive into Kubernetes Networking
Deep dive into Kubernetes NetworkingDeep dive into Kubernetes Networking
Deep dive into Kubernetes Networking
 
BPF: Tracing and more
BPF: Tracing and moreBPF: Tracing and more
BPF: Tracing and more
 
Grafana introduction
Grafana introductionGrafana introduction
Grafana introduction
 
Process Scheduler and Balancer in Linux Kernel
Process Scheduler and Balancer in Linux KernelProcess Scheduler and Balancer in Linux Kernel
Process Scheduler and Balancer in Linux Kernel
 

Similar to OpenHPC: A Comprehensive System Software Stack

OpenPOWER Foundation Overview
OpenPOWER Foundation OverviewOpenPOWER Foundation Overview
OpenPOWER Foundation Overview
inside-BigData.com
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Intel® Software
 
What HPC can learn from DevOps?
What HPC can learn from DevOps?What HPC can learn from DevOps?
What HPC can learn from DevOps?
Walid Shaari
 
Benefits of disaggregation and open source networking in data centers
Benefits of disaggregation and open source networking in data centersBenefits of disaggregation and open source networking in data centers
Benefits of disaggregation and open source networking in data centers
Aruba, a Hewlett Packard Enterprise company
 
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated SystemsPetapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
dairsie
 
Pain points of agile development
Pain points of agile developmentPain points of agile development
Pain points of agile development
Perforce
 
Intel and Red Hat: Enhancing OpenStack for Enterprise Deployment
Intel and Red Hat: Enhancing OpenStack for Enterprise DeploymentIntel and Red Hat: Enhancing OpenStack for Enterprise Deployment
Intel and Red Hat: Enhancing OpenStack for Enterprise Deployment
Intel® Software
 
Introducing the Civil Infrastructure Platform Project
Introducing the Civil Infrastructure Platform ProjectIntroducing the Civil Infrastructure Platform Project
Introducing the Civil Infrastructure Platform Project
Yoshitake Kobayashi
 
IBM COE - AI /HPC/CLOUD at your university
IBM COE - AI /HPC/CLOUD at your university IBM COE - AI /HPC/CLOUD at your university
IBM COE - AI /HPC/CLOUD at your university
Ganesan Narayanasamy
 
Accelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing TechnologiesAccelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing Technologies
Intel® Software
 
Tracing the evolution - Open source & Embedded systems
Tracing the evolution - Open source & Embedded systemsTracing the evolution - Open source & Embedded systems
Tracing the evolution - Open source & Embedded systems
Emertxe Information Technologies Pvt Ltd
 
Introducing the Civil Infrastructure Platform
Introducing the Civil Infrastructure PlatformIntroducing the Civil Infrastructure Platform
Introducing the Civil Infrastructure Platform
Yoshitake Kobayashi
 
Software update for embedded systems
Software update for embedded systemsSoftware update for embedded systems
Software update for embedded systems
SZ Lin
 
Best Practices for Deploying Enterprise Applications on UNIX
Best Practices for Deploying Enterprise Applications on UNIXBest Practices for Deploying Enterprise Applications on UNIX
Best Practices for Deploying Enterprise Applications on UNIX
Noel McKeown
 
Develop Your Own Operating Systems using Cheap ARM Boards
Develop Your Own Operating Systems using Cheap ARM BoardsDevelop Your Own Operating Systems using Cheap ARM Boards
Develop Your Own Operating Systems using Cheap ARM Boards
National Cheng Kung University
 
Open source-options-v1
Open source-options-v1Open source-options-v1
Open source-options-v1
Hieu Le Trung
 
Tracing The Evolution Open Source & Embedded Systems - Mr. Jayakumar Balasubr...
Tracing The Evolution Open Source & Embedded Systems - Mr. Jayakumar Balasubr...Tracing The Evolution Open Source & Embedded Systems - Mr. Jayakumar Balasubr...
Tracing The Evolution Open Source & Embedded Systems - Mr. Jayakumar Balasubr...
Lounge47
 
KB Seminars: Working with Technology - Platforms; 10/13
KB Seminars: Working with Technology - Platforms; 10/13KB Seminars: Working with Technology - Platforms; 10/13
KB Seminars: Working with Technology - Platforms; 10/13
MDIF
 
Japan's post K Computer
Japan's post K ComputerJapan's post K Computer
Japan's post K Computer
inside-BigData.com
 
Helion meetup-2014
Helion meetup-2014Helion meetup-2014
Helion meetup-2014
Bruno Cornec
 

Similar to OpenHPC: A Comprehensive System Software Stack (20)

OpenPOWER Foundation Overview
OpenPOWER Foundation OverviewOpenPOWER Foundation Overview
OpenPOWER Foundation Overview
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
 
What HPC can learn from DevOps?
What HPC can learn from DevOps?What HPC can learn from DevOps?
What HPC can learn from DevOps?
 
Benefits of disaggregation and open source networking in data centers
Benefits of disaggregation and open source networking in data centersBenefits of disaggregation and open source networking in data centers
Benefits of disaggregation and open source networking in data centers
 
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated SystemsPetapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
 
Pain points of agile development
Pain points of agile developmentPain points of agile development
Pain points of agile development
 
Intel and Red Hat: Enhancing OpenStack for Enterprise Deployment
Intel and Red Hat: Enhancing OpenStack for Enterprise DeploymentIntel and Red Hat: Enhancing OpenStack for Enterprise Deployment
Intel and Red Hat: Enhancing OpenStack for Enterprise Deployment
 
Introducing the Civil Infrastructure Platform Project
Introducing the Civil Infrastructure Platform ProjectIntroducing the Civil Infrastructure Platform Project
Introducing the Civil Infrastructure Platform Project
 
IBM COE - AI /HPC/CLOUD at your university
IBM COE - AI /HPC/CLOUD at your university IBM COE - AI /HPC/CLOUD at your university
IBM COE - AI /HPC/CLOUD at your university
 
Accelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing TechnologiesAccelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing Technologies
 
Tracing the evolution - Open source & Embedded systems
Tracing the evolution - Open source & Embedded systemsTracing the evolution - Open source & Embedded systems
Tracing the evolution - Open source & Embedded systems
 
Introducing the Civil Infrastructure Platform
Introducing the Civil Infrastructure PlatformIntroducing the Civil Infrastructure Platform
Introducing the Civil Infrastructure Platform
 
Software update for embedded systems
Software update for embedded systemsSoftware update for embedded systems
Software update for embedded systems
 
Best Practices for Deploying Enterprise Applications on UNIX
Best Practices for Deploying Enterprise Applications on UNIXBest Practices for Deploying Enterprise Applications on UNIX
Best Practices for Deploying Enterprise Applications on UNIX
 
Develop Your Own Operating Systems using Cheap ARM Boards
Develop Your Own Operating Systems using Cheap ARM BoardsDevelop Your Own Operating Systems using Cheap ARM Boards
Develop Your Own Operating Systems using Cheap ARM Boards
 
Open source-options-v1
Open source-options-v1Open source-options-v1
Open source-options-v1
 
Tracing The Evolution Open Source & Embedded Systems - Mr. Jayakumar Balasubr...
Tracing The Evolution Open Source & Embedded Systems - Mr. Jayakumar Balasubr...Tracing The Evolution Open Source & Embedded Systems - Mr. Jayakumar Balasubr...
Tracing The Evolution Open Source & Embedded Systems - Mr. Jayakumar Balasubr...
 
KB Seminars: Working with Technology - Platforms; 10/13
KB Seminars: Working with Technology - Platforms; 10/13KB Seminars: Working with Technology - Platforms; 10/13
KB Seminars: Working with Technology - Platforms; 10/13
 
Japan's post K Computer
Japan's post K ComputerJapan's post K Computer
Japan's post K Computer
 
Helion meetup-2014
Helion meetup-2014Helion meetup-2014
Helion meetup-2014
 

More from inside-BigData.com

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 

Recently uploaded (20)

GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 

OpenHPC: A Comprehensive System Software Stack

  • 1. OpenHPC: A Cohesive and Comprehensive System Software Stack The Time is Right IDC HPC User Forum April 19, 2017 Dr. Robert W. Wisniewski Chief Software Architect Extreme Scale Computing Senior Principal Engineer, Intel 1
  • 2. 2 • Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn more at intel.com, or from the OEM or retailer. • No computer system can be absolutely secure. • Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance. • Cost reduction scenarios described are intended as examples of how a given Intel- based product, in the specified circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction. • No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. • Intel, the Intel logo and others are trademarks of Intel Corporation in the U.S. and/or other countries. • *Other names and brands may be claimed as the property of others. • © 2017 Intel Corporation. Legal Notices and Disclaimers
  • 3. Agenda •Trends and Challenges •OpenHPC Vision •OpenHPC Architecture and Implementation of Stack •Intel® HPC Orchestrator •Questions 3
  • 4. Trends and Challenges • Talk is software focused, but –Hardware: scale, power, reliability, network bandwidth and latency, memory bandwidth and latency • Complexity of software stack –Increase in classical HPC computing –Richer environments, python –New models: UQ, workflows –Introduction of big data and analytics –BDEC –Multi tenancy –Need for new frameworks –AI: ML/DL –Cloud 4
  • 5. Trends and Challenges  Needs • Talk is software focused, but –Hardware: Value of co-designing and integrating cores, network, and memory •Complexity drives the need to integrate and provide a coherent and comprehensive system software stack rather than a bag of parts –More components –More components lead to a greater potential for incompatibilities –Co-design applies within system software also –Need to test and continuously integrate –Over time fewer organizations could assemble the whole stack –Increasing time going to just standing up –Versus focusing on mission needs 5
  • 6. Overview Status Governance:  Technical steering committee active since June 2016  Technical submission process published  Currently there are 30 official members in Platinum, Silver, Academic and Technical Committees  Held first face to face meeting, post- formation, in June 2016 at ISC, one at SC 2016, and one planned for ISC 2017 Goals  Provide a common SW platform to the HPC community that works across multiple segments and on which end-users can collaborate and innovate  Simplify the complexity of installation, configuration, and ongoing maintenance of a custom software stack  Receive contributions and feedback from community to drive innovation  Enable developers to focus on their differentiation and unique value, rather than having to spend on developing, testing, and maintaining a core stack  Deliver integrated hardware and software innovations to ease the path to exascale 6Courtesy of OpenHPC* *Other names and brands may be claimed as the property of others.
  • 7. Background Motivation for Community Effort 7Courtesy of OpenHPC* • Many sites spend considerable effort aggregating a large suite of open- source projects to provide a capable HPC environment for their users: –necessary to build/deploy HPC focused packages that are either absent or do not keep pace from distro providers –local packaging or customization frequently tries to give software versioning access to users (e.g. via modules or similar equivalent) –hierarchal packaging necessary for multiple compiler/mpi families • On the developer front, many successful projects must engage in continual triage and debugging regarding configuration and installation issues on HPC systems *Other names and brands may be claimed as the property of others.
  • 8. What is OpenHPC? 8Courtesy of OpenHPC* • OpenHPC is a community effort endeavoring to: –provide collection(s) of pre-packaged components that can be used to help install and manage flexible HPC systems throughout their lifecycle –leverage standard Linux delivery model to retain admin familiarity (ie. package repos) –allow and promote multiple system configuration recipes that leverage community reference designs and best practices –implement integration testing to gain validation confidence –provide additional distribution mechanism for groups releasing open-source software –provide a stable platform for new R&D initiatives Install Guide - CentOS7.1 Version (v1.0) eth1eth0 Data Center Network high speed network tcp networking to compute eth interface to compute BMC interface compute nodes Lustre* storage system Master (SMS) Figure 1: Overview of physical cluster architecture. Typical cluster architecture *Other names and brands may be claimed as the property of others.
  • 9. OpenHPC: Mission and Vision 9Courtesy of OpenHPC* • Mission: to provide a reference collection of open-source HPC software components and best practices, lowering barriers to deployment, advancement, and use of modern HPC methods and tools. • Vision: OpenHPC components and best practices will enable and accelerate innovation and discoveries by broadening access to state-of-the-art, open-source HPC methods and tools in a consistent environment, supported by a collaborative, worldwide community of HPC users, developers, researchers, administrators, and vendors. *Other names and brands may be claimed as the property of others.
  • 10. OpenHPC: Project Members 10Courtesy of OpenHPC* Project member participation interest? Contact Kevlin Husser or Jeff ErnstFriedman jernstfriedman@linuxfoundation.org Mixture of Academics, Labs, OEMs, and ISVs/OSVs • Argonne National Laboratory • Center for Research in Extreme Scale Technologies – Indiana University 30 Members OpenHPC is a Linux Foundation Project initiated by Intel and gained wide participation right away The goal is to collaboratively advance the state of the software ecosystem Governing board is composed of Platinum members (Intel, Dell, HPE, SUSE) plus reps from Silver & Academic, Technical committees WWW.OpenHPC.Community *Other names and brands may be claimed as the property of others. • University of Cambridge
  • 11. Repository server metrics: monthly visitors 11Courtesy of OpenHPC* v1.0 v1.0.1 v1.1 v1.1.1 v1.2 v1.2.1 v1.3 0 500 1000 1500 2000 Jul-15 Oct-15 Jan-16 May-16 Aug-16 Nov-16 Mar-17 #ofUniqueVisitors Build Server Access: Unique Visitors Releases *Other names and brands may be claimed as the property of others.
  • 12. Intel® HPC Orchestrator Modular View • Intra-stack APIs to allow for customization/differentiation (OEMs enabling) • Defined external APIs for consistency across versions (ISVs) Node-specific OS Kernel(s) Linux* Distro Runtime Libraries Overlay & Pub-sub Networks, Identity User Space Utilities SW Development Toolchain Compiler & Programming Model Runtimes High Performance Parallel LibrariesScalable Debugging & Perf Analysis Tools Optimized I/O Libraries I/O Services Data Collection And System Monitors Workload Manager Resource Mgmnt Runtimes DB Schema Scalable DB SystemManagement (Confi,Inventory) Provisioning SystemDiagnostics FabricMgmnt Operator Interface Applications (not part of initial stack) ISV Applications Hardware 12*Other names and brands may be claimed as the property of others.
  • 13. OpenHPC v1.3 - Current S/W components 13Courtesy of OpenHPC* Functional Areas Components Base OS CentOS 7.3, SLES12 SP2 Architecture x86_64, aarch64 (Tech Preview) Administrative Tools Conman, Ganglia, Lmod, LosF, Nagios, pdsh, prun, EasyBuild, ClusterShell, mrsh, Genders, Shine, Spack, test-suite Provisioning Warewulf Resource Mgmt. SLURM, Munge, PBS Professional Runtimes OpenMP, OCR I/O Services Lustre client (community version) Numerical/Scientific Libraries Boost, GSL, FFTW, Metis, PETSc, Trilinos, Hypre, SuperLU, SuperLU_Dist, Mumps, OpenBLAS, Scalapack I/O Libraries HDF5 (pHDF5), NetCDF (including C++ and Fortran interfaces), Adios Compiler Families GNU (gcc, g++, gfortran) MPI Families MVAPICH2, OpenMPI, MPICH Development Tools Autotools (autoconf, automake, libtool), Valgrind,R, SciPy/NumPy Performance Tools PAPI, IMB, mpiP, pdtoolkit TAU, Scalasca, ScoreP, SIONLib Notes: • Additional dependencies that are not provided by the BaseOS or community repos (e.g. EPEL) are also included • 3rd Party libraries are built for each compiler/MPI family • Resulting repositories currently comprised of ~300 RPMs Future additions approved for inclusion: • BeeGFS client • hwloc • Singularity • xCAT *Other names and brands may be claimed as the property of others.
  • 14. Intel® HPC Orchestrator Framework 14 University Community OEM CommunityGNU Linux Parallel File system Upstream source Communities Resource Manager Upstream source Communities Upstream source Communities Upstream source Communities Integrates and tests HPC stacks and makes them available as OS Base HPC Stack OEM Stack University Stack Contributors include Intel, OEMs, ISVs, labs, academia RRV RRV RRV RRV RRVs Continuous Integration Environment -Build Environment & Source Control -Bug Tracking -User & Dev Forums -Collaboration tools -Validation Environment Cadence 6~12 mo “RRV” = Relevant and Reliable Version Intel® HPC Orchestrator Core HPC Stack PRODUCT Supported HPC Stack -Premium Features -Advanced Integration Testing -Testing at scale -Validated updates -Level3 Support across stack OEM Stack PROJECT *Other names and brands may be claimed as the property of others.
  • 15. Base Stack and Derivatives - Sufficient performance and scalability - Ease of Install - Performance & scalability - Energy efficiency - Ease of use & administration - Auto-configuration Ease of administration across multiple tiers in the same data center Common Core (same across all offerings) Additions Targeting High End Market Additions Targeting Volume Market “CUSTOM” AdditionsTargetingTop500 &Verticals Provides “TURNKEY” Provides 15 “ADVANCED”
  • 16. Conclusion •Trends and challenges led to the need for a cohesive and comprehensive system software stack for HPC •OpenHPC provides a vehicle that facilitates collaboration, removes replicative work, and provides a more efficient ecosystem •Intel® HPC Orchestrator provides a supported version of OpenHPC analogous to CentOS and RHEL with three tiers for different computing needs •OpenHPC is gaining momentum with increased contributions 16 *Other names and brands may be claimed as the property of others.