SlideShare a Scribd company logo
Supercomputing 2016
NCAR Globally Accessible
User Environment
DDN Booth
Supercomputing 2016
Pamela Hill, Manager, Data Analysis Services
Supercomputing 2016
National Center For Atmospheric Research - NCAR
•  Federally(Funded(Research(
and(Development(Center(
(FFRDC)(sponsored(by(the(
Na:onal(Science(Founda:on(
(NSF)(and(established(in(1959(
•  Operated(by(the(University(
Corpora:on(for(Atmospheric(
Research((UCAR),(a(nonEprofit(
consor:um(of(>(100(member(
universi:es,(academic(
affiliates,(and(interna:onal(
affiliates(
“I$have$a$very$strong$feeling$that$
science$exists$to$serve$human$welfare.$$
It’s$wonderful$to$have$the$opportunity$
given$us$by$society$to$do$basic$research,$
but$in$return,$we$have$a$very$important$
moral$responsibility$to$apply$that$
research$to$benefi?ng$humanity.”$
$
Walter(Orr(Roberts,((
(((((((NCAR(Founding(Director(
Supercomputing 2016
Computational & Information Systems Laboratory –
CISL Mission
•  To(support,(enhance,(and(extend(the(capabili:es(for(
transforma:ve(science(to(the(university(community(and(
the(broader(scien:fic(community,(na:onally(and(
interna:onally(
•  Provide(capacity(and(
capability(
supercompu:ng(
•  Develop(and(support(
robust,(accessible,(
innova:ve(and(
advanced(services(and(
tools(
•  Create(an(Earth(
System(Knowledge(
Environment(
Supercomputing 2016
Data Analysis Services Group
NCAR / CISL / HSS / DASG
•  Data Transfer and
Storage Services
•  Pamela Hill
•  Joey Mendoza
•  High-Performance File
Systems
•  Data Transfer Protocols
•  Science Gateway
Support
•  Innovative I/O
Solutions
•  Visualization Services
•  John Clyne
•  Scott Pearse
•  Samuel Li
•  VAPOR development
and support
•  3D visualization
•  Visualization User
Support
Supercomputing 2016
GLADE Mission
GLoballyAccessible Data Environment
•  Unified and consistent data environment for NCAR HPC
•  Supercomputers, Data Analysis and Visualization Clusters
•  Support for project work spaces
•  Support for shared data transfer interfaces
•  Support for Science Gateways and access to Research Data Archive (RDA) &
Earth Systems Grid (ESG) data sets
•  Data is available at high bandwidth to any server or supercomputer
within the GLADE environment
•  Resources outside the environment can manipulate data using
common interfaces
•  Choice of interfaces supports current projects; platform is flexible to
support future projects
Supercomputing 2016
GLADE Environment
GLADE$
HPSS$
Data$Transfer$
Gateways$
Science$
Gateways$
Project$Spaces$
Data$Collec?ons$
$HOME$$$$$$WORK$
$SCRATCH$
Remote$
Visualiza?on$
Computa?on$
Analysis$&$
Visualiza?on$
RDA$
ESG$
CDP$
geyser$
caldera$
pronghorn$
yellowstone$
cheyenne&
Globus$Online$
Globus+$Data$Share$
GridFTP$
HSI$/$HTAR$
scp,$sXp,$bbcp$
VirtualGL$
Supercomputing 2016
GLADE Today
•  90 GB/s bandwidth
•  16 PB useable capacity
•  76 IBM DCS3700
•  6840 3TB drives
•  shared data +
metadata
•  20 NSD servers
•  6 management nodes
•  File Services
•  FDR
•  10 GbE
Production Jan 2017
•  220 GB/s bandwidth
•  20 PB useable capacity
•  8 DDN SFA14KXE
•  3360 8TB drives
•  data only
•  48 800GB SSD
•  Metadata
•  File Services
•  EDR
•  40 GbE
Expansion Spring 2017
•  19 PB useable
capacity
Supercomputing 2016
DDN SFA14KXE Scalable System Unit
8 x SSU’s
•  (4) NSD VM’s
•  (4) Dual Port ConnectX-4 VPI
HBA’s
•  (4) EDR IB Connections
•  (4) 40 GbE Connections
•  (420) 8 TB NL-SAS
•  (6) 800 GB SSD
(42) 8 TB
(42) 8 TB
(42) 8 TB
(42) 8 TB
(42) 8 TB, (1) SSD
(42) 8 TB, (1) SSD
(42) 8 TB, (1) SSD
(42) 8 TB, (1) SSD
(42) 8 TB, (1) SSD
(42) 8 TB, (1) SSD
(2) NSD VM’s
(2) NSD VM’s
EDR IB
40 GbE
Supercomputing 2016
GLADE I/O Network
•  Network architecture providing global access to data storage from multiple
HPC resources
•  Flexibility provided by support of multiple connectivity options and multiple
compute network topologies
•  10GbE, 40GbE, FDR, EDR
•  Full Fat Tree, Quasi Fat Tree, Hypercube
•  Scalability allows for addition of new HPC or storage resources
•  Agnostic with respect to vendor and file system
•  Can support multiple solutions simultaneously
Supercomputing 2016
HPSS Archive
160PB, 12.5 GB/s
60PB holdings
HPSS
GLADE
16 PB, 90 GB/s, GPFS
20 PB, 220GB/s, GPFS
(4) DTN
Data Services
20GB/s
FRGP
Network
Science
Gateways
10 Gb Ethernet
40 Gb Ethernet
100 Gb Ethernet
Infiniband
100 GbE
Data Networks
Infiniband
FDR
Geyser, Caldera
Pronghorn
46 nodes
HPC / DAV
HPSS (Boulder)
Disaster Recovery
HPSS
Cheyenne
4,032 nodes
Infiniband
EDR
NWSC
Core
Ethern
et
Switch
10/40/10
0
10 GbE 10 GbE
40 GbE
40 GbE
Yellowstone
4,536 nodes
Infiniband
FDR
Supercomputing 2016
GLADE I/O Network Connections
•  All NSD servers are connected to 40GbE network
•  IBM NSD servers are connected to the FDR IB network
•  Yellowstone is a full fat tree network
•  Geyser/Caldera/Pronghorn quasi fat tree, up/dwn routing
•  NSD servers are up/dwn routing
•  DDN SFA’s are connected to the EDR IB network
•  Cheyenne is an enhanced hypercube
•  NSD VM’s are nodes in the hypercube
•  Each NSD VM is connected to two points in the hypercube
•  Data transfer gateways, RDA, ESG and CDP science gateways are
connected to 40GbE and 10GbE networks
•  NSD servers will route traffic over the 40GbE network to serve data to both
IB networks
Supercomputing 2016
GLADE Manager Nodes
•  Secondary GPFS cluster manager, quorum node
glademgt1
•  Primary GPFS cluster manager, quorum node
glademgt2
•  token manager, quorum node, file system manager
glademgt3
•  token manger, quorum node, file system manager
glademgt4
•  Primary GPFS configuration manager
•  token manager, quorum node, file system manager, multi-cluster
contact node
glademgt5
•  Secondary GPFS configuration manager
•  token manager, multi-cluster contact node
glademgt6
Supercomputing 2016
GLADE File System Structure
GLADE Spectrum Scale
Cluster
scratch
SSD metadata
SFA14KXE
15 PB
8M block, 4K
inode
p
DCS3700
10 PB
4M block
512 byte inode
p2
SSD metadata
SFA14KXE
5 PB
8M block, 4K
inode
DCS3700
5 PB
home
apps
SSD metadata
SSD applications
SFA7700
100 TB
1M block, 4K
inode
•  ILM rules to migrate between pools
•  Data collections pinned to DCS3700
Supercomputing 2016
Spectrum Scale Multi-Cluster
•  Clusters can serve file systems or be diskless
•  Each cluster is managed separately
•  can have different administrative domains
•  Each storage cluster controls who has access to which file systems
•  Any cluster mounting a file system must have TCP/IP access from all nodes to the
storage cluster
•  Diskless clusters only need TCP/IP access to the storage cluster hosting the file
system they wish to mount
•  No need for access to other diskless clusters
•  Each cluster is configured with a GPFS subnet which allows NSD servers in the
storage cluster to act as routers
•  Allows blocking of a subnet when necessary
•  User names and group need to be sync’d across the multi-cluster domain
Supercomputing 2016
Spectrum Scale Multi-Cluster
Geyser, Caldera
Pronghorn
46 nodes
Cheyenne
4,032 nodes
Yellowstone
4,536 nodes
n1 n2 n4536
n1 n2 n46
nsd21
nsd22
nsd54
nsd1
nsd2
nsd20
ESG / CDP
Science
Gateways
Data Transfer
Services
RDA
Science Gateway
web n1 n8
n1 n2 n6
web n1 n8
EDR Network
FDR Network
40GbE Network
. . .
. . .
. . .
. . .
. . .
. . .
Storage Cluster
YS
Cluster
SAGE
Cluster
RDA
Cluster
DAV
Cluster
CH
Cluster
DTN
Cluster
n1 n2 n10
CH PBS
Cluster
n1 n2 n4032
Cheyenne
PBS, Login Nodes
. . .
......
GLADE
Supercomputing 2016
HPC Futures Lab - I/O Innovation Lab
•  Provide infrastructure necessary for I/O hardware and software testing
•  Focused on future I/O architectures and how they might relate to NCAR HPC
•  Proof of Concept for future HPC procurements
•  Development environment for upcoming I/O environment changes
•  Domains
•  High-Bandwidth Networks (40/100 GbE, IB, OPA)
•  Parallel File Systems, early release & pre-production testing (Spectrum Scale, Lustre, ??)
•  Storage Hardware (block storage, object storage, SSD, NVMe, ??)
•  Hadoop Cluster (testing of local temporary storage)
•  Cloud Storage Clusters (targeted for Data Service Gateways)
•  Compute Cluster (to drive I/O tests)
Supercomputing 2016
HPC Futures Lab - I/O Innovation Lab
•  Boulder Colorado Lab
•  Network Infrastructure to support 40/100 GbE, EDR IB,
OPA
•  Ability to add additional network technology as
necessary
•  Management Infrastructure
•  Cluster management tools
•  Monitoring tools
•  Persistent storage for home directories and
application code
•  Storage and server hardware
•  Cheyenne Wyoming Lab
•  Leverages current test infrastructure
•  Production network support for 40/100 GbE, IB
•  Integration with current pre-production
compute clusters
•  Focused on ‘Burst Buffer’ and NVMe environments
•  Easily coupled to test compute clusters
Supercomputing 2016
I/O Innovation Lab – Boulder Lab
Parallel File System Research
Compute / Hadoop Cluster
Cloud
Cloud
Cloud
Cloud
40GbE
iolab
mgt1
iolab
mgt2
40GbE
40GbE
OPA
EDR
GPFS
Mgt
FDR
GPFS
NSD
GPFS
NSD
Lustre
OSS
Lustre
OSS
Lustre
MS
Lustre
Mgt
SSD
SFA 7700 Storage
40GbE
EDR
OPA
Hadoop Hadoop Hadoop Hadoop Hadoop Hadoop Hadoop Hadoop
Cloud Data
Services Research
Lab Infrastructure
Supercomputing 2016
I/O Innovation Lab – Cheyenne Lab
IBM
GPFS
IBM
DCS3700
DDN
GPFS
DDN
SFA14KXE
IBM
Jellystone
SGI
Laramie
IBM
yogi/booboo
DDN
IME
DDN
IME
SGI
UV300
SGI
NVMe
40GbE
FDR
EDR
GPFS I/O Routing from
FDR to EDR networks
Connected to
EDR at FDR rate
Leverages picnic
infrastructure
Leverages current
test systems for I/O
testing
Supercomputing 2016
QUESTIONS?
pjg@ucar.edu

More Related Content

What's hot

How to Increase Performance of Your Hadoop Cluster
How to Increase Performance of Your Hadoop ClusterHow to Increase Performance of Your Hadoop Cluster
How to Increase Performance of Your Hadoop Cluster
Altoros
 
UberCloud HPC Experiment Introduction for Beginners
UberCloud HPC Experiment Introduction for BeginnersUberCloud HPC Experiment Introduction for Beginners
UberCloud HPC Experiment Introduction for Beginners
hpcexperiment
 
Introduction to Hadoop part 2
Introduction to Hadoop part 2Introduction to Hadoop part 2
Introduction to Hadoop part 2
Giovanna Roda
 
Realistic Synthetic Generation Allows Secure Development
Realistic Synthetic Generation Allows Secure DevelopmentRealistic Synthetic Generation Allows Secure Development
Realistic Synthetic Generation Allows Secure Development
DataWorks Summit
 
Yarn by default (Spark on YARN)
Yarn by default (Spark on YARN)Yarn by default (Spark on YARN)
Yarn by default (Spark on YARN)
Ferran Galí Reniu
 
Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Hadoop Summit San Jose 2014: Costing Your Big Data Operations Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Sumeet Singh
 
Spark and Shark: Lightning-Fast Analytics over Hadoop and Hive Data
Spark and Shark: Lightning-Fast Analytics over Hadoop and Hive DataSpark and Shark: Lightning-Fast Analytics over Hadoop and Hive Data
Spark and Shark: Lightning-Fast Analytics over Hadoop and Hive Data
Jetlore
 
GLORIAD's New Measurement and Monitoring System
GLORIAD's New Measurement and Monitoring SystemGLORIAD's New Measurement and Monitoring System
GLORIAD's New Measurement and Monitoring System
Ed Dodds
 
Improving Hadoop Performance via Linux
Improving Hadoop Performance via LinuxImproving Hadoop Performance via Linux
Improving Hadoop Performance via Linux
Alex Moundalexis
 
NSCC Training Introductory Class
NSCC Training Introductory Class NSCC Training Introductory Class
NSCC Training Introductory Class
National Supercomputing Centre Singapore
 
NSCC Training - Introductory Class
NSCC Training - Introductory ClassNSCC Training - Introductory Class
NSCC Training - Introductory Class
National Supercomputing Centre Singapore
 
NSCC Training Introductory Class
NSCC Training  Introductory ClassNSCC Training  Introductory Class
NSCC Training Introductory Class
National Supercomputing Centre Singapore
 
Hadoop2.2
Hadoop2.2Hadoop2.2
Hadoop2.2
Sreejith P
 
Introduction to HDFS and MapReduce
Introduction to HDFS and MapReduceIntroduction to HDFS and MapReduce
Introduction to HDFS and MapReduce
Uday Vakalapudi
 
Implementation of k means algorithm on Hadoop
Implementation of k means algorithm on HadoopImplementation of k means algorithm on Hadoop
Implementation of k means algorithm on Hadoop
Lamprini Koutsokera
 
Optimizing your Infrastrucure and Operating System for Hadoop
Optimizing your Infrastrucure and Operating System for HadoopOptimizing your Infrastrucure and Operating System for Hadoop
Optimizing your Infrastrucure and Operating System for Hadoop
DataWorks Summit
 
An Introduction to Hadoop
An Introduction to HadoopAn Introduction to Hadoop
An Introduction to Hadoop
DerrekYoungDotCom
 
HPC Storage and IO Trends and Workflows
HPC Storage and IO Trends and WorkflowsHPC Storage and IO Trends and Workflows
HPC Storage and IO Trends and Workflows
inside-BigData.com
 
Database Research on Modern Computing Architecture
Database Research on Modern Computing ArchitectureDatabase Research on Modern Computing Architecture
Database Research on Modern Computing Architecture
Kyong-Ha Lee
 
HBase with MapR
HBase with MapRHBase with MapR
HBase with MapR
Tomer Shiran
 

What's hot (20)

How to Increase Performance of Your Hadoop Cluster
How to Increase Performance of Your Hadoop ClusterHow to Increase Performance of Your Hadoop Cluster
How to Increase Performance of Your Hadoop Cluster
 
UberCloud HPC Experiment Introduction for Beginners
UberCloud HPC Experiment Introduction for BeginnersUberCloud HPC Experiment Introduction for Beginners
UberCloud HPC Experiment Introduction for Beginners
 
Introduction to Hadoop part 2
Introduction to Hadoop part 2Introduction to Hadoop part 2
Introduction to Hadoop part 2
 
Realistic Synthetic Generation Allows Secure Development
Realistic Synthetic Generation Allows Secure DevelopmentRealistic Synthetic Generation Allows Secure Development
Realistic Synthetic Generation Allows Secure Development
 
Yarn by default (Spark on YARN)
Yarn by default (Spark on YARN)Yarn by default (Spark on YARN)
Yarn by default (Spark on YARN)
 
Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Hadoop Summit San Jose 2014: Costing Your Big Data Operations Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Hadoop Summit San Jose 2014: Costing Your Big Data Operations
 
Spark and Shark: Lightning-Fast Analytics over Hadoop and Hive Data
Spark and Shark: Lightning-Fast Analytics over Hadoop and Hive DataSpark and Shark: Lightning-Fast Analytics over Hadoop and Hive Data
Spark and Shark: Lightning-Fast Analytics over Hadoop and Hive Data
 
GLORIAD's New Measurement and Monitoring System
GLORIAD's New Measurement and Monitoring SystemGLORIAD's New Measurement and Monitoring System
GLORIAD's New Measurement and Monitoring System
 
Improving Hadoop Performance via Linux
Improving Hadoop Performance via LinuxImproving Hadoop Performance via Linux
Improving Hadoop Performance via Linux
 
NSCC Training Introductory Class
NSCC Training Introductory Class NSCC Training Introductory Class
NSCC Training Introductory Class
 
NSCC Training - Introductory Class
NSCC Training - Introductory ClassNSCC Training - Introductory Class
NSCC Training - Introductory Class
 
NSCC Training Introductory Class
NSCC Training  Introductory ClassNSCC Training  Introductory Class
NSCC Training Introductory Class
 
Hadoop2.2
Hadoop2.2Hadoop2.2
Hadoop2.2
 
Introduction to HDFS and MapReduce
Introduction to HDFS and MapReduceIntroduction to HDFS and MapReduce
Introduction to HDFS and MapReduce
 
Implementation of k means algorithm on Hadoop
Implementation of k means algorithm on HadoopImplementation of k means algorithm on Hadoop
Implementation of k means algorithm on Hadoop
 
Optimizing your Infrastrucure and Operating System for Hadoop
Optimizing your Infrastrucure and Operating System for HadoopOptimizing your Infrastrucure and Operating System for Hadoop
Optimizing your Infrastrucure and Operating System for Hadoop
 
An Introduction to Hadoop
An Introduction to HadoopAn Introduction to Hadoop
An Introduction to Hadoop
 
HPC Storage and IO Trends and Workflows
HPC Storage and IO Trends and WorkflowsHPC Storage and IO Trends and Workflows
HPC Storage and IO Trends and Workflows
 
Database Research on Modern Computing Architecture
Database Research on Modern Computing ArchitectureDatabase Research on Modern Computing Architecture
Database Research on Modern Computing Architecture
 
HBase with MapR
HBase with MapRHBase with MapR
HBase with MapR
 

Viewers also liked

Survey of clustered_parallel_file_systems_004_lanl.ppt
Survey of clustered_parallel_file_systems_004_lanl.pptSurvey of clustered_parallel_file_systems_004_lanl.ppt
Survey of clustered_parallel_file_systems_004_lanl.ppt
Rohn Wood
 
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Chris Dagdigian
 
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File System
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File SystemArchitecture of the Upcoming OrangeFS v3 Distributed Parallel File System
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File System
All Things Open
 
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
xKinAnx
 
HSM migration with EasyHSM and Nirvana
HSM migration with EasyHSM and NirvanaHSM migration with EasyHSM and Nirvana
HSM migration with EasyHSM and Nirvana
Igor Sfiligoi
 
BioIT World 2016 - HPC Trends from the Trenches
BioIT World 2016 - HPC Trends from the TrenchesBioIT World 2016 - HPC Trends from the Trenches
BioIT World 2016 - HPC Trends from the Trenches
Chris Dagdigian
 
EasyHSM Overview
EasyHSM OverviewEasyHSM Overview
EasyHSM Overview
Igor Sfiligoi
 
A escolha da profissão!
A escolha da profissão!   A escolha da profissão!
A escolha da profissão!
Marina Sorrentino
 
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral ProgramBig Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
inside-BigData.com
 
Ibm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ashIbm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ash
Ashutosh Mate
 
Ibm spectrum scale fundamentals workshop for americas part 1 components archi...
Ibm spectrum scale fundamentals workshop for americas part 1 components archi...Ibm spectrum scale fundamentals workshop for americas part 1 components archi...
Ibm spectrum scale fundamentals workshop for americas part 1 components archi...
xKinAnx
 

Viewers also liked (11)

Survey of clustered_parallel_file_systems_004_lanl.ppt
Survey of clustered_parallel_file_systems_004_lanl.pptSurvey of clustered_parallel_file_systems_004_lanl.ppt
Survey of clustered_parallel_file_systems_004_lanl.ppt
 
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
 
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File System
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File SystemArchitecture of the Upcoming OrangeFS v3 Distributed Parallel File System
Architecture of the Upcoming OrangeFS v3 Distributed Parallel File System
 
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
 
HSM migration with EasyHSM and Nirvana
HSM migration with EasyHSM and NirvanaHSM migration with EasyHSM and Nirvana
HSM migration with EasyHSM and Nirvana
 
BioIT World 2016 - HPC Trends from the Trenches
BioIT World 2016 - HPC Trends from the TrenchesBioIT World 2016 - HPC Trends from the Trenches
BioIT World 2016 - HPC Trends from the Trenches
 
EasyHSM Overview
EasyHSM OverviewEasyHSM Overview
EasyHSM Overview
 
A escolha da profissão!
A escolha da profissão!   A escolha da profissão!
A escolha da profissão!
 
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral ProgramBig Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
 
Ibm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ashIbm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ash
 
Ibm spectrum scale fundamentals workshop for americas part 1 components archi...
Ibm spectrum scale fundamentals workshop for americas part 1 components archi...Ibm spectrum scale fundamentals workshop for americas part 1 components archi...
Ibm spectrum scale fundamentals workshop for americas part 1 components archi...
 

Similar to Ncar globally accessible user environment

The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
Glenn K. Lockwood
 
Empower Data-Driven Organizations
Empower Data-Driven OrganizationsEmpower Data-Driven Organizations
Empower Data-Driven Organizations
DataWorks Summit/Hadoop Summit
 
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
inside-BigData.com
 
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic CloudCeph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Community
 
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic CloudCeph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Community
 
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
huguk
 
Pilot Hadoop Towards 2500 Nodes and Cluster Redundancy
Pilot Hadoop Towards 2500 Nodes and Cluster RedundancyPilot Hadoop Towards 2500 Nodes and Cluster Redundancy
Pilot Hadoop Towards 2500 Nodes and Cluster Redundancy
Stuart Pook
 
Hadoop Operations - Best practices from the field
Hadoop Operations - Best practices from the fieldHadoop Operations - Best practices from the field
Hadoop Operations - Best practices from the field
Uwe Printz
 
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudThe state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
DataWorks Summit/Hadoop Summit
 
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in ProductionTugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
 
Bruno Silva - eMedLab: Merging HPC and Cloud for Biomedical Research
Bruno Silva - eMedLab: Merging HPC and Cloud for Biomedical ResearchBruno Silva - eMedLab: Merging HPC and Cloud for Biomedical Research
Bruno Silva - eMedLab: Merging HPC and Cloud for Biomedical Research
Danny Abukalam
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
Ceph Community
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
Patrick McGarry
 
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Community
 
Introduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-SeqIntroduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-Seq
Enis Afgan
 
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Databricks
 
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
NoSQLmatters
 
OpenStack Toronto Q3 MeetUp - September 28th 2017
OpenStack Toronto Q3 MeetUp - September 28th 2017OpenStack Toronto Q3 MeetUp - September 28th 2017
OpenStack Toronto Q3 MeetUp - September 28th 2017
Stacy Véronneau
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
MapR Technologies
 
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Community
 

Similar to Ncar globally accessible user environment (20)

The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
 
Empower Data-Driven Organizations
Empower Data-Driven OrganizationsEmpower Data-Driven Organizations
Empower Data-Driven Organizations
 
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
 
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic CloudCeph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic Cloud
 
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic CloudCeph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic Cloud
 
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
 
Pilot Hadoop Towards 2500 Nodes and Cluster Redundancy
Pilot Hadoop Towards 2500 Nodes and Cluster RedundancyPilot Hadoop Towards 2500 Nodes and Cluster Redundancy
Pilot Hadoop Towards 2500 Nodes and Cluster Redundancy
 
Hadoop Operations - Best practices from the field
Hadoop Operations - Best practices from the fieldHadoop Operations - Best practices from the field
Hadoop Operations - Best practices from the field
 
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudThe state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
 
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in ProductionTugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
 
Bruno Silva - eMedLab: Merging HPC and Cloud for Biomedical Research
Bruno Silva - eMedLab: Merging HPC and Cloud for Biomedical ResearchBruno Silva - eMedLab: Merging HPC and Cloud for Biomedical Research
Bruno Silva - eMedLab: Merging HPC and Cloud for Biomedical Research
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
 
Introduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-SeqIntroduction to Galaxy and RNA-Seq
Introduction to Galaxy and RNA-Seq
 
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
 
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
Johnny Miller – Cassandra + Spark = Awesome- NoSQL matters Barcelona 2014
 
OpenStack Toronto Q3 MeetUp - September 28th 2017
OpenStack Toronto Q3 MeetUp - September 28th 2017OpenStack Toronto Q3 MeetUp - September 28th 2017
OpenStack Toronto Q3 MeetUp - September 28th 2017
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
 
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
 

More from inside-BigData.com

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
saastr
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
HarisZaheer8
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Jeffrey Haguewood
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Tatiana Kojar
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
flufftailshop
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 

Recently uploaded (20)

System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 

Ncar globally accessible user environment

  • 1. Supercomputing 2016 NCAR Globally Accessible User Environment DDN Booth Supercomputing 2016 Pamela Hill, Manager, Data Analysis Services
  • 2. Supercomputing 2016 National Center For Atmospheric Research - NCAR •  Federally(Funded(Research( and(Development(Center( (FFRDC)(sponsored(by(the( Na:onal(Science(Founda:on( (NSF)(and(established(in(1959( •  Operated(by(the(University( Corpora:on(for(Atmospheric( Research((UCAR),(a(nonEprofit( consor:um(of(>(100(member( universi:es,(academic( affiliates,(and(interna:onal( affiliates( “I$have$a$very$strong$feeling$that$ science$exists$to$serve$human$welfare.$$ It’s$wonderful$to$have$the$opportunity$ given$us$by$society$to$do$basic$research,$ but$in$return,$we$have$a$very$important$ moral$responsibility$to$apply$that$ research$to$benefi?ng$humanity.”$ $ Walter(Orr(Roberts,(( (((((((NCAR(Founding(Director(
  • 3. Supercomputing 2016 Computational & Information Systems Laboratory – CISL Mission •  To(support,(enhance,(and(extend(the(capabili:es(for( transforma:ve(science(to(the(university(community(and( the(broader(scien:fic(community,(na:onally(and( interna:onally( •  Provide(capacity(and( capability( supercompu:ng( •  Develop(and(support( robust,(accessible,( innova:ve(and( advanced(services(and( tools( •  Create(an(Earth( System(Knowledge( Environment(
  • 4. Supercomputing 2016 Data Analysis Services Group NCAR / CISL / HSS / DASG •  Data Transfer and Storage Services •  Pamela Hill •  Joey Mendoza •  High-Performance File Systems •  Data Transfer Protocols •  Science Gateway Support •  Innovative I/O Solutions •  Visualization Services •  John Clyne •  Scott Pearse •  Samuel Li •  VAPOR development and support •  3D visualization •  Visualization User Support
  • 5. Supercomputing 2016 GLADE Mission GLoballyAccessible Data Environment •  Unified and consistent data environment for NCAR HPC •  Supercomputers, Data Analysis and Visualization Clusters •  Support for project work spaces •  Support for shared data transfer interfaces •  Support for Science Gateways and access to Research Data Archive (RDA) & Earth Systems Grid (ESG) data sets •  Data is available at high bandwidth to any server or supercomputer within the GLADE environment •  Resources outside the environment can manipulate data using common interfaces •  Choice of interfaces supports current projects; platform is flexible to support future projects
  • 7. Supercomputing 2016 GLADE Today •  90 GB/s bandwidth •  16 PB useable capacity •  76 IBM DCS3700 •  6840 3TB drives •  shared data + metadata •  20 NSD servers •  6 management nodes •  File Services •  FDR •  10 GbE Production Jan 2017 •  220 GB/s bandwidth •  20 PB useable capacity •  8 DDN SFA14KXE •  3360 8TB drives •  data only •  48 800GB SSD •  Metadata •  File Services •  EDR •  40 GbE Expansion Spring 2017 •  19 PB useable capacity
  • 8. Supercomputing 2016 DDN SFA14KXE Scalable System Unit 8 x SSU’s •  (4) NSD VM’s •  (4) Dual Port ConnectX-4 VPI HBA’s •  (4) EDR IB Connections •  (4) 40 GbE Connections •  (420) 8 TB NL-SAS •  (6) 800 GB SSD (42) 8 TB (42) 8 TB (42) 8 TB (42) 8 TB (42) 8 TB, (1) SSD (42) 8 TB, (1) SSD (42) 8 TB, (1) SSD (42) 8 TB, (1) SSD (42) 8 TB, (1) SSD (42) 8 TB, (1) SSD (2) NSD VM’s (2) NSD VM’s EDR IB 40 GbE
  • 9. Supercomputing 2016 GLADE I/O Network •  Network architecture providing global access to data storage from multiple HPC resources •  Flexibility provided by support of multiple connectivity options and multiple compute network topologies •  10GbE, 40GbE, FDR, EDR •  Full Fat Tree, Quasi Fat Tree, Hypercube •  Scalability allows for addition of new HPC or storage resources •  Agnostic with respect to vendor and file system •  Can support multiple solutions simultaneously
  • 10. Supercomputing 2016 HPSS Archive 160PB, 12.5 GB/s 60PB holdings HPSS GLADE 16 PB, 90 GB/s, GPFS 20 PB, 220GB/s, GPFS (4) DTN Data Services 20GB/s FRGP Network Science Gateways 10 Gb Ethernet 40 Gb Ethernet 100 Gb Ethernet Infiniband 100 GbE Data Networks Infiniband FDR Geyser, Caldera Pronghorn 46 nodes HPC / DAV HPSS (Boulder) Disaster Recovery HPSS Cheyenne 4,032 nodes Infiniband EDR NWSC Core Ethern et Switch 10/40/10 0 10 GbE 10 GbE 40 GbE 40 GbE Yellowstone 4,536 nodes Infiniband FDR
  • 11. Supercomputing 2016 GLADE I/O Network Connections •  All NSD servers are connected to 40GbE network •  IBM NSD servers are connected to the FDR IB network •  Yellowstone is a full fat tree network •  Geyser/Caldera/Pronghorn quasi fat tree, up/dwn routing •  NSD servers are up/dwn routing •  DDN SFA’s are connected to the EDR IB network •  Cheyenne is an enhanced hypercube •  NSD VM’s are nodes in the hypercube •  Each NSD VM is connected to two points in the hypercube •  Data transfer gateways, RDA, ESG and CDP science gateways are connected to 40GbE and 10GbE networks •  NSD servers will route traffic over the 40GbE network to serve data to both IB networks
  • 12. Supercomputing 2016 GLADE Manager Nodes •  Secondary GPFS cluster manager, quorum node glademgt1 •  Primary GPFS cluster manager, quorum node glademgt2 •  token manager, quorum node, file system manager glademgt3 •  token manger, quorum node, file system manager glademgt4 •  Primary GPFS configuration manager •  token manager, quorum node, file system manager, multi-cluster contact node glademgt5 •  Secondary GPFS configuration manager •  token manager, multi-cluster contact node glademgt6
  • 13. Supercomputing 2016 GLADE File System Structure GLADE Spectrum Scale Cluster scratch SSD metadata SFA14KXE 15 PB 8M block, 4K inode p DCS3700 10 PB 4M block 512 byte inode p2 SSD metadata SFA14KXE 5 PB 8M block, 4K inode DCS3700 5 PB home apps SSD metadata SSD applications SFA7700 100 TB 1M block, 4K inode •  ILM rules to migrate between pools •  Data collections pinned to DCS3700
  • 14. Supercomputing 2016 Spectrum Scale Multi-Cluster •  Clusters can serve file systems or be diskless •  Each cluster is managed separately •  can have different administrative domains •  Each storage cluster controls who has access to which file systems •  Any cluster mounting a file system must have TCP/IP access from all nodes to the storage cluster •  Diskless clusters only need TCP/IP access to the storage cluster hosting the file system they wish to mount •  No need for access to other diskless clusters •  Each cluster is configured with a GPFS subnet which allows NSD servers in the storage cluster to act as routers •  Allows blocking of a subnet when necessary •  User names and group need to be sync’d across the multi-cluster domain
  • 15. Supercomputing 2016 Spectrum Scale Multi-Cluster Geyser, Caldera Pronghorn 46 nodes Cheyenne 4,032 nodes Yellowstone 4,536 nodes n1 n2 n4536 n1 n2 n46 nsd21 nsd22 nsd54 nsd1 nsd2 nsd20 ESG / CDP Science Gateways Data Transfer Services RDA Science Gateway web n1 n8 n1 n2 n6 web n1 n8 EDR Network FDR Network 40GbE Network . . . . . . . . . . . . . . . . . . Storage Cluster YS Cluster SAGE Cluster RDA Cluster DAV Cluster CH Cluster DTN Cluster n1 n2 n10 CH PBS Cluster n1 n2 n4032 Cheyenne PBS, Login Nodes . . . ...... GLADE
  • 16. Supercomputing 2016 HPC Futures Lab - I/O Innovation Lab •  Provide infrastructure necessary for I/O hardware and software testing •  Focused on future I/O architectures and how they might relate to NCAR HPC •  Proof of Concept for future HPC procurements •  Development environment for upcoming I/O environment changes •  Domains •  High-Bandwidth Networks (40/100 GbE, IB, OPA) •  Parallel File Systems, early release & pre-production testing (Spectrum Scale, Lustre, ??) •  Storage Hardware (block storage, object storage, SSD, NVMe, ??) •  Hadoop Cluster (testing of local temporary storage) •  Cloud Storage Clusters (targeted for Data Service Gateways) •  Compute Cluster (to drive I/O tests)
  • 17. Supercomputing 2016 HPC Futures Lab - I/O Innovation Lab •  Boulder Colorado Lab •  Network Infrastructure to support 40/100 GbE, EDR IB, OPA •  Ability to add additional network technology as necessary •  Management Infrastructure •  Cluster management tools •  Monitoring tools •  Persistent storage for home directories and application code •  Storage and server hardware •  Cheyenne Wyoming Lab •  Leverages current test infrastructure •  Production network support for 40/100 GbE, IB •  Integration with current pre-production compute clusters •  Focused on ‘Burst Buffer’ and NVMe environments •  Easily coupled to test compute clusters
  • 18. Supercomputing 2016 I/O Innovation Lab – Boulder Lab Parallel File System Research Compute / Hadoop Cluster Cloud Cloud Cloud Cloud 40GbE iolab mgt1 iolab mgt2 40GbE 40GbE OPA EDR GPFS Mgt FDR GPFS NSD GPFS NSD Lustre OSS Lustre OSS Lustre MS Lustre Mgt SSD SFA 7700 Storage 40GbE EDR OPA Hadoop Hadoop Hadoop Hadoop Hadoop Hadoop Hadoop Hadoop Cloud Data Services Research Lab Infrastructure
  • 19. Supercomputing 2016 I/O Innovation Lab – Cheyenne Lab IBM GPFS IBM DCS3700 DDN GPFS DDN SFA14KXE IBM Jellystone SGI Laramie IBM yogi/booboo DDN IME DDN IME SGI UV300 SGI NVMe 40GbE FDR EDR GPFS I/O Routing from FDR to EDR networks Connected to EDR at FDR rate Leverages picnic infrastructure Leverages current test systems for I/O testing