SlideShare a Scribd company logo
HPC Cluster Computing 
from 64 to 156,000 cores 
Jason Stowe, Founder
You’ll learn about: 
How continuous/instant delivery 
helps invention/discovery
You’ll learn about: 
Climate, Organic Semiconductors, 
Lean manufacturing 
Technical Computing
About Cycle Computing 
We believe utility access to technical 
computing accelerates discovery & 
invention
Our mission: 
Easy HPC applications 
for anyone, 
on any resources, 
at any scale
About real use cases: 
Life Sciences 
Life Sciences 
• 50,000-core utility supercomputer in the Cloud 
• Analyzed 21 million compounds in 3 hours 
• 12.5 processor years for < $15,000 
Nuclear engineering Utilities / Energy 
• Approximately 600-core MPI workloads run in Cloud 
• Ran workloads in months rather than years 
• Introduction to production in 3 weeks 
Rocket Design & 
Simulation 
Asset & Liability Modeling 
(Insurance / Risk Mgmt) 
• Completes monthly/quarterly runs 5-10x faster 
• Use 1600 cores in the cloud to shorten elapsed run time 
from ~ 10 days to ~ 1-2 days 
Manufacturing & Design 
• Enable end user on-demand access to 1000s of cores 
• Avoid cost of buying new servers 
• Accelerated science and CAD/CAM process 
• 39.5 years of drug compound computations in 9 hours, 
at a total cost of $4,372 
• 10,000 server cluster seamlessly spanned regions 
• Advanced 3 new otherwise unknown compounds in wetlab 
• Moving HPC workloads to cloud for burst capacity 
• Amazon GovCloud, Security, and key management 
• Up to 1000s of cores for burst / agility
About me: 
I’m a geek from the 90s
90s geek?? 
Whilst I like hacker news, reddit, etc. 
Slashdot is my homepage
It will affect everyone
What do you think?
Buy land in Canada?
Maybe, but hopefully you’re getting: 
this is a big deal
Just in case… 
Chairman Rajendra Pachauri 
Intergovernmental Panel on Climate Change 
"Nobody on this planet is going to be 
untouched by the impacts of climate 
change”
US Secretary of State John Kerry: 
"Unless we act dramatically and quickly, 
science tells us our climate and our way of 
life are literally in jeopardy. 
Denial of the science is malpractice."
Climate Engineering?
Possibilities? 
Wind 
power 
Nuclear 
Fusion 
GeoThermal 
Climate 
Engineering 
Nuclear 
Fission 
energy 
Solar 
Energy 
BioFuels
How do we simulate/engineer a solution?
How do they create these?
They use a supercomputer…
Supercomputers = great 
• Supercomputers have special 
features, like fast interconnects 
• So good they should never run R, 
genomics, monte carlo, etc. that 
don’t use *Fast Interconnect* 
• During peak usage, our users 
have wanted: 
– Latency sensitive 
– Large scale/grid MPI 
Running on bare metal, put other jobs 
elsewhere 
Flickr: 
LouisRix
Limitations of fixed infrastructure 
Too small when needed most, 
Too large every other time… 
• Upfront CapEx anchors you to 
aging servers 
• Costly administration 
• Miss opportunities to do better 
risk management, product 
design, science, engineering
Lets talk about 
materials science
Specifically…
Designing Better Solar Materials 
The challenge is efficiency - turning photons to electricity 
The number of possible materials is limitless: 
• Need to separate the right compounds from the useless ones 
• Researcher Mark Thompson, PhD: 
“If the 20th century was the century of silicon, the 21st will be 
all organic. Problem is, how do we find the right material 
without spending the entire 21st century looking for it?”
Needle in a Haystack 
Challenge: 
205,000 compound families 
totaling 2,312,959 core-hours 
or 264 core-years
Supercomputers = Fast & costly 
Flickr: 
LouisRix
Another tool in the box: Cloud 
• At peak load, use up to 
massive scale to run 
throughput workloads on 
cloud: 
– Needle in a Haystack 
– Whole Sample set 
– BigData/Throughput i/o 
– Integration, monte-carlo paths 
– Interactive, Service oriented 
workloads (master worker) 
• Workload run-time rather than 
individual job run-time 
Flickr: 
MaCDavis
Most of Cycle’s users use 
40 - 4000 cores, 
Mark Thompson’s problem 
wasn’t that problem
Needle in a Haystack 
Challenge: 
205,000 compounds 
totaling 2,312,959 core-hours 
or 264 core-years
205,000 molecules 
264 years of computing 
16,788 Spot Instances, 
156,314 cores
8-Region Deployment 
US-West-1 
US-East 
EU 
US-West-2 
Brazil 
Tokyo 
Singapore 
Australia
205,000 molecules 
264 years of computing 
156,314 cores = 
1.21 PetaFLOPS (~Rpeak)
1.21 PetaFLOPS (Rmax+Rpeak)/2, 
156,314 cores
Benchmark individual machines
205,000 molecules 
264 years of computing 
Done in 18 hours 
Access to $68M system 
for $33k
Resilient Workload Scheduling
How did we do this? 
Automated in 8 Cloud Regions, 
5 continents, double resiliency 
Auto-scaling 
Execute Nodes 
JUPITER 
Distributed 
Queue 
Data 
… 
14 nodes controlling 16,788
Continuous Software 
Deployment/Configuration
Now Dr. Mark Thompson 
Is 264 compute years 
closer to making 
efficient solar a reality 
using organic 
semiconductors…
Important?
Solving this problem…
History repeats itself
Lean Manufacturing 
(Japanese Manufacturing to turn 
raw materials to inventory)
Continuous Delivery 
(Modern software process to turn 
builds to production applications)
Users want to decrease 
Cycle time 
= 
(prep time + queue time + 
run time)
The Scientific Method 
Ask a 
Question 
Hypothesiz 
e Predict Experiment 
/ Test Analyze Final 
Results 
Test and Analyze stages 
require the most time, 
compute, and data
What’s holding us 
(Engineering, Science, Risk) 
back?
Utility HPC Access = 
Continuous HPC capacity 
delivery
The Innovation Bottleneck: 
Scientists/Engineers 
forced to size expectations to 
the 
infrastructure they have access 
to
How does cloud help?
Put the important, 
fast interconnect jobs on 
fast interconnect machines 
Put everything else on 
cloud capacity, when needed
Workloads are actually helping 
Serial or MPI job 
Serial or MPI job 
(may need 1 core or 
many servers) 
Serial or MPI job 
Serial or MPI job 
(may need 1 core or 
many servers) 
Serial or MPI job 
(may need 1 core or 
many servers) 
Serial or MPI job 
(may need 1 core or 
many servers) 
Fixed 
Cluster 
Workload 
can 
benefit 
from 
dynamic 
cluster 
For 
these 
workloads: 
$$$/science 
and 
throughput 
**are 
as 
important 
as** 
individual 
task 
performance 
RUN 
TIME
Utility Access is Revolutionizing clients 
Life Sciences 
Life Sciences 
• 156,000-core utility supercomputer in the Cloud 
• Used $68M in servers for 18 hours for $33,000 
• Simulated 205,000 materials (264 years) in 18 hours 
Nuclear engineering Utilities / Energy 
• Approximately 600-core MPI workloads run in Cloud 
• Ran workloads in months rather than years 
• Introduction to production in 3 weeks 
Rocket Design & 
Simulation 
Asset & Liability Modeling 
(Insurance / Risk Mgmt) 
• Completes monthly/quarterly runs 5-10x faster 
• Use 1600 cores in the cloud to shorten elapsed run time 
from ~10 days to ~ 1-2 days 
Manufacturing & Design 
• Enable end user on-demand access to 1000s of cores 
• Avoid cost of buying new servers 
• Accelerated science and CAD/CAM process 
• 39.5 years of drug compound computations in 9 hours, 
at a total cost of $4,372 
• 10,000 server cluster seamlessly spanned US/EU regions 
• Advanced 3 new otherwise unknown compounds in wet lab 
• Moving HPC workloads to cloud for burst capacity 
• Amazon GovCloud, Security, and key management 
• Up to 1000s of cores for burst / agility
Cycle HPC Deployment Architecture 
Internal & External 
Jobs & data 
Faster interconnect 
capability machine 
File System (PBs) 
Cloud Filer 
Cloud Filer 
Cloud Filer 
Blob data 
(S3) 
Glacier 
Auto-scaling 
external 
environment 
HPC 
Cluster 
Internal HPC 
Blob data 
(S3) 
Glacier 
Auto-scaling 
external 
environment 
HPC 
Blob data Cluster 
Cold Storage 
Auto-scaling 
external 
environment 
HPC 
Cluster 
Config 
Data
Example Use Cases 
Cycle software tools securely orchestrate scientific, 
engineering, and finance workloads across AWS 
10,600 instance 
Top 10 Pharma 
Molecular 
Modeling for 
Cancer Targets 
160 core 
MPI/Engineering 
Nuclear Reactor, 
Hard Disk, & 
Rocket 
1 Million hours 
RNA Analysis 
Genomics/Next- 
Gen Sequencing
Survey of Use Cases 
þ Drug Design 
þ CAE 
þ Genomics
(W.H.O./Globocan 2008)
Every day is ! 
crucial and costly
Challenge: Novartis run 341,700 hours of docking! 
against a cancer target (impossible to do in-house)
Novartis Instance 
Benchmarking
Utility technical computing 
server count:
AWS Console view:
Cycle Computing’s cluster view:
Metric 
Count 
Compute Hours of Science 
341,700 hours 
Compute Days of Science 
14,238 days 
Compute Years of Science 
39 years 
AWS Instance Count 
10,600 instances 
CycleCloud & Cloud! 
finished impossible run, ! 
$44 Million in servers, in…
39 years of drug design in 11 
hours 
on 10,600 servers for < $4,372
Does this lead to new 
drugs?
Novartis announced 
3 new compounds going 
into screening based 
upon this 1 run
Survey of Use Cases 
þ Drug Design 
þ CAE 
þ Genomics 
…
The Aerospace Corporation
Aerospace Corp @ AWS re:Invent
Reached out to Cycle 
Computing in Summer 2013
Aerospace Corp @ AWS re:Invent
Rocket Design on AWS/Cycle 
Remove caps on scale, speed time to result 
No QueueWait for users 
TBs FS Engineer 
File System Corporate 
Firewall 
Of 3 Quarters TC 
Cluster 
Of 3 Quarters TC 
Cluster Auto-scale Secure 
HPC Cluster 
Internal Cluster 
Application 
AWS GovCloud
HGST, A Western Digital 
Company
HGST, A Western Digital Company
Burst for TC workloads
Results 
• Gave 10 minutes to get 64-core to 512-core 
environments! 
• Parallel scaling greatly improved time-to-results 
• Simplified submission interface removed file 
transfer and remote computing complexity 
• Auto-scaling environments decreased costs 
while increasing science completed.
Survey of Use Cases 
þ Drug Design 
þ CAD/CAM 
þ Genomics 
…
IonTorrent on Cloud
IonTorrent on CycleCloud
IonTorrent on CycleCloud
IonTorrent on CycleCloud
Results 
• Continuously deliver up to 8,000 core of 
compute power to support online genomic 
analysis 
• Complicated application w/Web, NoSQL, TC 
clusters 
• Analysis as a Service for all IonTorrent users
Survey of Use Cases 
þ Drug Design 
þ CAD/CAM 
þ Genomics 
…
Now hopefully…
You believe 
utility access to technical computing 
accelerates invention
Supercomputers = great 
Flickr: 
LouisRix
And so is Cloud… 
Flickr: 
MaCDavis
Computing can revolutionize 
materials science
We need to engineer solutions for 
climate, and lower our footprint
Buy land in Canada?
We can solve this problem
Across many science, engineering, and risk 
Life Sciences 
Life Sciences 
• 156,000-core utility supercomputer in the Cloud 
• Used $68M in servers for 18 hours for $33,000 
• Simulated 205,000 materials (264 years) in 18 hours 
Nuclear engineering Utilities / Energy 
• Approximately 600-core MPI workloads run in Cloud 
• Ran workloads in months rather than years 
• Introduction to production in 3 weeks 
Rocket Design & 
Simulation 
Asset & Liability Modeling 
(Insurance / Risk Mgmt) 
• Completes monthly/quarterly runs 5-10x faster 
• Use 1600 cores in the cloud to shorten elapsed run time 
from ~10 days to ~ 1-2 days 
Manufacturing & Design 
• Enable end user on-demand access to 1000s of cores 
• Avoid cost of buying new servers 
• Accelerated science and CAD/CAM process 
• 39.5 years of drug compound computations in 9 hours, 
at a total cost of $4,372 
• 10,000 server cluster seamlessly spanned US/EU regions 
• Advanced 3 new otherwise unknown compounds in wet lab 
• Moving HPC workloads to cloud for burst capacity 
• Amazon GovCloud, Security, and key management 
• Up to 1000s of cores for burst / agility
To continuously deliver HPC to 
accelerate the human mind…
Does this resonate with you? 
We’re hiring like crazy: 
Software developers, 
HPC engineers, 
devops, sales, etc. 
jobs@ 
cyclecomputing.com

More Related Content

What's hot

Innovation Roundtable
Innovation RoundtableInnovation Roundtable
Innovation Roundtable
Alison B. Lowndes
 
Harnessing the virtual realm
Harnessing the virtual realmHarnessing the virtual realm
Harnessing the virtual realm
Alison B. Lowndes
 
AI + E-commerce
AI + E-commerceAI + E-commerce
AI + E-commerce
Alison B. Lowndes
 
EPSRC CDT Conference
EPSRC CDT ConferenceEPSRC CDT Conference
EPSRC CDT Conference
Alison B. Lowndes
 
The What, Who & Why of NVIDIA
The What, Who & Why of NVIDIAThe What, Who & Why of NVIDIA
The What, Who & Why of NVIDIA
Alison B. Lowndes
 
Enabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. LowndesEnabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. Lowndes
WithTheBest
 
Arm Neoverse solutions @Graviton2-AWS Japan Webinar Oct2020
Arm Neoverse solutions @Graviton2-AWS Japan Webinar Oct2020Arm Neoverse solutions @Graviton2-AWS Japan Webinar Oct2020
Arm Neoverse solutions @Graviton2-AWS Japan Webinar Oct2020
Toshinori Kujiraoka
 
HPC Top 5 Stories: May 18th, 2018
HPC Top 5 Stories: May 18th, 2018HPC Top 5 Stories: May 18th, 2018
HPC Top 5 Stories: May 18th, 2018
NVIDIA
 
Hardware in Space
Hardware in SpaceHardware in Space
Hardware in Space
Alison B. Lowndes
 
Shattering AI Performance Records
Shattering AI Performance RecordsShattering AI Performance Records
Shattering AI Performance Records
NVIDIA
 
NVIDIA Corporation Brochure: Who We Are
NVIDIA Corporation Brochure: Who We AreNVIDIA Corporation Brochure: Who We Are
NVIDIA Corporation Brochure: Who We Are
NVIDIA
 
Simplifying AI Infrastructure: Lessons in Scaling on DGX Systems
Simplifying AI Infrastructure: Lessons in Scaling on DGX SystemsSimplifying AI Infrastructure: Lessons in Scaling on DGX Systems
Simplifying AI Infrastructure: Lessons in Scaling on DGX Systems
Renee Yao
 
GTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era DawnsGTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era Dawns
NVIDIA
 
Opening Keynote at GTC 2015: Leaps in Visual Computing
Opening Keynote at GTC 2015: Leaps in Visual ComputingOpening Keynote at GTC 2015: Leaps in Visual Computing
Opening Keynote at GTC 2015: Leaps in Visual Computing
NVIDIA
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA
 
Top 5 Data Science Sessions from GTC 2019
Top 5 Data Science Sessions from GTC 2019Top 5 Data Science Sessions from GTC 2019
Top 5 Data Science Sessions from GTC 2019
NVIDIA
 
組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム
Shinnosuke Furuya
 
GTC 2012 Jen-Hsun Huang Keynote
GTC 2012 Jen-Hsun Huang KeynoteGTC 2012 Jen-Hsun Huang Keynote
GTC 2012 Jen-Hsun Huang Keynote
NVIDIA
 
GPU Technology Conference 2014 Keynote
GPU Technology Conference 2014 KeynoteGPU Technology Conference 2014 Keynote
GPU Technology Conference 2014 Keynote
NVIDIA
 
GTC China 2016
GTC China 2016GTC China 2016
GTC China 2016
NVIDIA
 

What's hot (20)

Innovation Roundtable
Innovation RoundtableInnovation Roundtable
Innovation Roundtable
 
Harnessing the virtual realm
Harnessing the virtual realmHarnessing the virtual realm
Harnessing the virtual realm
 
AI + E-commerce
AI + E-commerceAI + E-commerce
AI + E-commerce
 
EPSRC CDT Conference
EPSRC CDT ConferenceEPSRC CDT Conference
EPSRC CDT Conference
 
The What, Who & Why of NVIDIA
The What, Who & Why of NVIDIAThe What, Who & Why of NVIDIA
The What, Who & Why of NVIDIA
 
Enabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. LowndesEnabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. Lowndes
 
Arm Neoverse solutions @Graviton2-AWS Japan Webinar Oct2020
Arm Neoverse solutions @Graviton2-AWS Japan Webinar Oct2020Arm Neoverse solutions @Graviton2-AWS Japan Webinar Oct2020
Arm Neoverse solutions @Graviton2-AWS Japan Webinar Oct2020
 
HPC Top 5 Stories: May 18th, 2018
HPC Top 5 Stories: May 18th, 2018HPC Top 5 Stories: May 18th, 2018
HPC Top 5 Stories: May 18th, 2018
 
Hardware in Space
Hardware in SpaceHardware in Space
Hardware in Space
 
Shattering AI Performance Records
Shattering AI Performance RecordsShattering AI Performance Records
Shattering AI Performance Records
 
NVIDIA Corporation Brochure: Who We Are
NVIDIA Corporation Brochure: Who We AreNVIDIA Corporation Brochure: Who We Are
NVIDIA Corporation Brochure: Who We Are
 
Simplifying AI Infrastructure: Lessons in Scaling on DGX Systems
Simplifying AI Infrastructure: Lessons in Scaling on DGX SystemsSimplifying AI Infrastructure: Lessons in Scaling on DGX Systems
Simplifying AI Infrastructure: Lessons in Scaling on DGX Systems
 
GTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era DawnsGTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era Dawns
 
Opening Keynote at GTC 2015: Leaps in Visual Computing
Opening Keynote at GTC 2015: Leaps in Visual ComputingOpening Keynote at GTC 2015: Leaps in Visual Computing
Opening Keynote at GTC 2015: Leaps in Visual Computing
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
 
Top 5 Data Science Sessions from GTC 2019
Top 5 Data Science Sessions from GTC 2019Top 5 Data Science Sessions from GTC 2019
Top 5 Data Science Sessions from GTC 2019
 
組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム
 
GTC 2012 Jen-Hsun Huang Keynote
GTC 2012 Jen-Hsun Huang KeynoteGTC 2012 Jen-Hsun Huang Keynote
GTC 2012 Jen-Hsun Huang Keynote
 
GPU Technology Conference 2014 Keynote
GPU Technology Conference 2014 KeynoteGPU Technology Conference 2014 Keynote
GPU Technology Conference 2014 Keynote
 
GTC China 2016
GTC China 2016GTC China 2016
GTC China 2016
 

Viewers also liked

Multi-Tenant Pharma HPC Clusters
Multi-Tenant Pharma HPC ClustersMulti-Tenant Pharma HPC Clusters
Multi-Tenant Pharma HPC Clusters
Chris Dagdigian
 
Linux10 sendmail
Linux10 sendmailLinux10 sendmail
Linux10 sendmail
Jainul Musani
 
Linux12 clustering onlinux
Linux12 clustering onlinuxLinux12 clustering onlinux
Linux12 clustering onlinux
Jainul Musani
 
Linux05 DHCP Server
Linux05 DHCP ServerLinux05 DHCP Server
Linux05 DHCP Server
Jainul Musani
 
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
Amazon Web Services
 
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Amazon Web Services
 
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
Amazon Web Services
 
A Project Report on Linux Server Administration
A Project Report on Linux Server AdministrationA Project Report on Linux Server Administration
A Project Report on Linux Server Administration
Avinash Kumar
 
HPC in the Cloud
HPC in the CloudHPC in the Cloud
HPC in the Cloud
Amazon Web Services
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systems
Igor José F. Freitas
 
Linux Cluster Concepts
Linux Cluster ConceptsLinux Cluster Concepts
Linux Cluster Concepts
nixsavy
 
IT Change Management Using JIRA
IT Change Management Using JIRAIT Change Management Using JIRA
IT Change Management Using JIRA
Atlassian
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computers
shopnil786
 
DockerCon14 Cluster Management and Containerization
DockerCon14 Cluster Management and ContainerizationDockerCon14 Cluster Management and Containerization
DockerCon14 Cluster Management and Containerization
Docker, Inc.
 
Basic command ppt
Basic command pptBasic command ppt
Basic command ppt
Rohit Kumar
 
Linux introduction
Linux introductionLinux introduction
Linux introduction
Md. Zahid Hossain Shoeb
 
Linux command ppt
Linux command pptLinux command ppt
Linux command ppt
kalyanineve
 
Linux.ppt
Linux.ppt Linux.ppt
Linux.ppt
onu9
 

Viewers also liked (18)

Multi-Tenant Pharma HPC Clusters
Multi-Tenant Pharma HPC ClustersMulti-Tenant Pharma HPC Clusters
Multi-Tenant Pharma HPC Clusters
 
Linux10 sendmail
Linux10 sendmailLinux10 sendmail
Linux10 sendmail
 
Linux12 clustering onlinux
Linux12 clustering onlinuxLinux12 clustering onlinux
Linux12 clustering onlinux
 
Linux05 DHCP Server
Linux05 DHCP ServerLinux05 DHCP Server
Linux05 DHCP Server
 
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
 
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
 
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
 
A Project Report on Linux Server Administration
A Project Report on Linux Server AdministrationA Project Report on Linux Server Administration
A Project Report on Linux Server Administration
 
HPC in the Cloud
HPC in the CloudHPC in the Cloud
HPC in the Cloud
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systems
 
Linux Cluster Concepts
Linux Cluster ConceptsLinux Cluster Concepts
Linux Cluster Concepts
 
IT Change Management Using JIRA
IT Change Management Using JIRAIT Change Management Using JIRA
IT Change Management Using JIRA
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computers
 
DockerCon14 Cluster Management and Containerization
DockerCon14 Cluster Management and ContainerizationDockerCon14 Cluster Management and Containerization
DockerCon14 Cluster Management and Containerization
 
Basic command ppt
Basic command pptBasic command ppt
Basic command ppt
 
Linux introduction
Linux introductionLinux introduction
Linux introduction
 
Linux command ppt
Linux command pptLinux command ppt
Linux command ppt
 
Linux.ppt
Linux.ppt Linux.ppt
Linux.ppt
 

Similar to HPC Cluster Computing from 64 to 156,000 Cores 

Cycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC RunCycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC Run
inside-BigData.com
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
Ian Foster
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental science
inside-BigData.com
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011
Ian Foster
 
Better Information Faster: Programming the Continuum
Better Information Faster: Programming the ContinuumBetter Information Faster: Programming the Continuum
Better Information Faster: Programming the Continuum
Ian Foster
 
Future of hpc
Future of hpcFuture of hpc
Future of hpc
Putchong Uthayopas
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
BigDataEverywhere
 
Intro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudIntro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS Cloud
Amazon Web Services
 
Accelerating Time to Science: Transforming Research in the Cloud
Accelerating Time to Science: Transforming Research in the CloudAccelerating Time to Science: Transforming Research in the Cloud
Accelerating Time to Science: Transforming Research in the Cloud
Jamie Kinney
 
Utility HPC: Right Systems, Right Scale, Right Science
Utility HPC: Right Systems, Right Scale, Right ScienceUtility HPC: Right Systems, Right Scale, Right Science
Utility HPC: Right Systems, Right Scale, Right Science
Chef Software, Inc.
 
Cloud Computing Berkeley.pdf
Cloud Computing Berkeley.pdfCloud Computing Berkeley.pdf
Cloud Computing Berkeley.pdf
AtaulAzizIkram
 
(BDT311) MegaRun: Behind the 156,000 Core HPC Run on AWS and Experience of On...
(BDT311) MegaRun: Behind the 156,000 Core HPC Run on AWS and Experience of On...(BDT311) MegaRun: Behind the 156,000 Core HPC Run on AWS and Experience of On...
(BDT311) MegaRun: Behind the 156,000 Core HPC Run on AWS and Experience of On...
Amazon Web Services
 
Modern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High PerformanceModern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High Performance
inside-BigData.com
 
2015 04 bio it world
2015 04 bio it world2015 04 bio it world
2015 04 bio it world
Chris Dwan
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science Research
Robert Grossman
 
The Coming Age of Extreme Heterogeneity in HPC
The Coming Age of Extreme Heterogeneity in HPCThe Coming Age of Extreme Heterogeneity in HPC
The Coming Age of Extreme Heterogeneity in HPC
inside-BigData.com
 
Application Optimized Performance: Choosing the Right Instance (CPN212) | AWS...
Application Optimized Performance: Choosing the Right Instance (CPN212) | AWS...Application Optimized Performance: Choosing the Right Instance (CPN212) | AWS...
Application Optimized Performance: Choosing the Right Instance (CPN212) | AWS...
Amazon Web Services
 
Sc10 slide share
Sc10 slide shareSc10 slide share
Sc10 slide share
Guy Tel-Zur
 
Applying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System IntegrationsApplying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System Integrations
inside-BigData.com
 
Stories About Spark, HPC and Barcelona by Jordi Torres
Stories About Spark, HPC and Barcelona by Jordi TorresStories About Spark, HPC and Barcelona by Jordi Torres
Stories About Spark, HPC and Barcelona by Jordi Torres
Spark Summit
 

Similar to HPC Cluster Computing from 64 to 156,000 Cores  (20)

Cycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC RunCycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC Run
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental science
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011
 
Better Information Faster: Programming the Continuum
Better Information Faster: Programming the ContinuumBetter Information Faster: Programming the Continuum
Better Information Faster: Programming the Continuum
 
Future of hpc
Future of hpcFuture of hpc
Future of hpc
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
 
Intro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudIntro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS Cloud
 
Accelerating Time to Science: Transforming Research in the Cloud
Accelerating Time to Science: Transforming Research in the CloudAccelerating Time to Science: Transforming Research in the Cloud
Accelerating Time to Science: Transforming Research in the Cloud
 
Utility HPC: Right Systems, Right Scale, Right Science
Utility HPC: Right Systems, Right Scale, Right ScienceUtility HPC: Right Systems, Right Scale, Right Science
Utility HPC: Right Systems, Right Scale, Right Science
 
Cloud Computing Berkeley.pdf
Cloud Computing Berkeley.pdfCloud Computing Berkeley.pdf
Cloud Computing Berkeley.pdf
 
(BDT311) MegaRun: Behind the 156,000 Core HPC Run on AWS and Experience of On...
(BDT311) MegaRun: Behind the 156,000 Core HPC Run on AWS and Experience of On...(BDT311) MegaRun: Behind the 156,000 Core HPC Run on AWS and Experience of On...
(BDT311) MegaRun: Behind the 156,000 Core HPC Run on AWS and Experience of On...
 
Modern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High PerformanceModern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High Performance
 
2015 04 bio it world
2015 04 bio it world2015 04 bio it world
2015 04 bio it world
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science Research
 
The Coming Age of Extreme Heterogeneity in HPC
The Coming Age of Extreme Heterogeneity in HPCThe Coming Age of Extreme Heterogeneity in HPC
The Coming Age of Extreme Heterogeneity in HPC
 
Application Optimized Performance: Choosing the Right Instance (CPN212) | AWS...
Application Optimized Performance: Choosing the Right Instance (CPN212) | AWS...Application Optimized Performance: Choosing the Right Instance (CPN212) | AWS...
Application Optimized Performance: Choosing the Right Instance (CPN212) | AWS...
 
Sc10 slide share
Sc10 slide shareSc10 slide share
Sc10 slide share
 
Applying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System IntegrationsApplying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System Integrations
 
Stories About Spark, HPC and Barcelona by Jordi Torres
Stories About Spark, HPC and Barcelona by Jordi TorresStories About Spark, HPC and Barcelona by Jordi Torres
Stories About Spark, HPC and Barcelona by Jordi Torres
 

More from inside-BigData.com

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 

Recently uploaded (20)

Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 

HPC Cluster Computing from 64 to 156,000 Cores 

  • 1. HPC Cluster Computing from 64 to 156,000 cores Jason Stowe, Founder
  • 2. You’ll learn about: How continuous/instant delivery helps invention/discovery
  • 3. You’ll learn about: Climate, Organic Semiconductors, Lean manufacturing Technical Computing
  • 4. About Cycle Computing We believe utility access to technical computing accelerates discovery & invention
  • 5. Our mission: Easy HPC applications for anyone, on any resources, at any scale
  • 6. About real use cases: Life Sciences Life Sciences • 50,000-core utility supercomputer in the Cloud • Analyzed 21 million compounds in 3 hours • 12.5 processor years for < $15,000 Nuclear engineering Utilities / Energy • Approximately 600-core MPI workloads run in Cloud • Ran workloads in months rather than years • Introduction to production in 3 weeks Rocket Design & Simulation Asset & Liability Modeling (Insurance / Risk Mgmt) • Completes monthly/quarterly runs 5-10x faster • Use 1600 cores in the cloud to shorten elapsed run time from ~ 10 days to ~ 1-2 days Manufacturing & Design • Enable end user on-demand access to 1000s of cores • Avoid cost of buying new servers • Accelerated science and CAD/CAM process • 39.5 years of drug compound computations in 9 hours, at a total cost of $4,372 • 10,000 server cluster seamlessly spanned regions • Advanced 3 new otherwise unknown compounds in wetlab • Moving HPC workloads to cloud for burst capacity • Amazon GovCloud, Security, and key management • Up to 1000s of cores for burst / agility
  • 7. About me: I’m a geek from the 90s
  • 8. 90s geek?? Whilst I like hacker news, reddit, etc. Slashdot is my homepage
  • 9.
  • 10.
  • 11.
  • 12. It will affect everyone
  • 13.
  • 14.
  • 15. What do you think?
  • 16. Buy land in Canada?
  • 17. Maybe, but hopefully you’re getting: this is a big deal
  • 18. Just in case… Chairman Rajendra Pachauri Intergovernmental Panel on Climate Change "Nobody on this planet is going to be untouched by the impacts of climate change”
  • 19. US Secretary of State John Kerry: "Unless we act dramatically and quickly, science tells us our climate and our way of life are literally in jeopardy. Denial of the science is malpractice."
  • 20.
  • 21.
  • 23. Possibilities? Wind power Nuclear Fusion GeoThermal Climate Engineering Nuclear Fission energy Solar Energy BioFuels
  • 24. How do we simulate/engineer a solution?
  • 25. How do they create these?
  • 26. They use a supercomputer…
  • 27. Supercomputers = great • Supercomputers have special features, like fast interconnects • So good they should never run R, genomics, monte carlo, etc. that don’t use *Fast Interconnect* • During peak usage, our users have wanted: – Latency sensitive – Large scale/grid MPI Running on bare metal, put other jobs elsewhere Flickr: LouisRix
  • 28. Limitations of fixed infrastructure Too small when needed most, Too large every other time… • Upfront CapEx anchors you to aging servers • Costly administration • Miss opportunities to do better risk management, product design, science, engineering
  • 29. Lets talk about materials science
  • 31. Designing Better Solar Materials The challenge is efficiency - turning photons to electricity The number of possible materials is limitless: • Need to separate the right compounds from the useless ones • Researcher Mark Thompson, PhD: “If the 20th century was the century of silicon, the 21st will be all organic. Problem is, how do we find the right material without spending the entire 21st century looking for it?”
  • 32. Needle in a Haystack Challenge: 205,000 compound families totaling 2,312,959 core-hours or 264 core-years
  • 33. Supercomputers = Fast & costly Flickr: LouisRix
  • 34. Another tool in the box: Cloud • At peak load, use up to massive scale to run throughput workloads on cloud: – Needle in a Haystack – Whole Sample set – BigData/Throughput i/o – Integration, monte-carlo paths – Interactive, Service oriented workloads (master worker) • Workload run-time rather than individual job run-time Flickr: MaCDavis
  • 35. Most of Cycle’s users use 40 - 4000 cores, Mark Thompson’s problem wasn’t that problem
  • 36. Needle in a Haystack Challenge: 205,000 compounds totaling 2,312,959 core-hours or 264 core-years
  • 37. 205,000 molecules 264 years of computing 16,788 Spot Instances, 156,314 cores
  • 38. 8-Region Deployment US-West-1 US-East EU US-West-2 Brazil Tokyo Singapore Australia
  • 39. 205,000 molecules 264 years of computing 156,314 cores = 1.21 PetaFLOPS (~Rpeak)
  • 42. 205,000 molecules 264 years of computing Done in 18 hours Access to $68M system for $33k
  • 44. How did we do this? Automated in 8 Cloud Regions, 5 continents, double resiliency Auto-scaling Execute Nodes JUPITER Distributed Queue Data … 14 nodes controlling 16,788
  • 46. Now Dr. Mark Thompson Is 264 compute years closer to making efficient solar a reality using organic semiconductors…
  • 50. Lean Manufacturing (Japanese Manufacturing to turn raw materials to inventory)
  • 51. Continuous Delivery (Modern software process to turn builds to production applications)
  • 52. Users want to decrease Cycle time = (prep time + queue time + run time)
  • 53. The Scientific Method Ask a Question Hypothesiz e Predict Experiment / Test Analyze Final Results Test and Analyze stages require the most time, compute, and data
  • 54. What’s holding us (Engineering, Science, Risk) back?
  • 55. Utility HPC Access = Continuous HPC capacity delivery
  • 56. The Innovation Bottleneck: Scientists/Engineers forced to size expectations to the infrastructure they have access to
  • 57. How does cloud help?
  • 58. Put the important, fast interconnect jobs on fast interconnect machines Put everything else on cloud capacity, when needed
  • 59. Workloads are actually helping Serial or MPI job Serial or MPI job (may need 1 core or many servers) Serial or MPI job Serial or MPI job (may need 1 core or many servers) Serial or MPI job (may need 1 core or many servers) Serial or MPI job (may need 1 core or many servers) Fixed Cluster Workload can benefit from dynamic cluster For these workloads: $$$/science and throughput **are as important as** individual task performance RUN TIME
  • 60. Utility Access is Revolutionizing clients Life Sciences Life Sciences • 156,000-core utility supercomputer in the Cloud • Used $68M in servers for 18 hours for $33,000 • Simulated 205,000 materials (264 years) in 18 hours Nuclear engineering Utilities / Energy • Approximately 600-core MPI workloads run in Cloud • Ran workloads in months rather than years • Introduction to production in 3 weeks Rocket Design & Simulation Asset & Liability Modeling (Insurance / Risk Mgmt) • Completes monthly/quarterly runs 5-10x faster • Use 1600 cores in the cloud to shorten elapsed run time from ~10 days to ~ 1-2 days Manufacturing & Design • Enable end user on-demand access to 1000s of cores • Avoid cost of buying new servers • Accelerated science and CAD/CAM process • 39.5 years of drug compound computations in 9 hours, at a total cost of $4,372 • 10,000 server cluster seamlessly spanned US/EU regions • Advanced 3 new otherwise unknown compounds in wet lab • Moving HPC workloads to cloud for burst capacity • Amazon GovCloud, Security, and key management • Up to 1000s of cores for burst / agility
  • 61. Cycle HPC Deployment Architecture Internal & External Jobs & data Faster interconnect capability machine File System (PBs) Cloud Filer Cloud Filer Cloud Filer Blob data (S3) Glacier Auto-scaling external environment HPC Cluster Internal HPC Blob data (S3) Glacier Auto-scaling external environment HPC Blob data Cluster Cold Storage Auto-scaling external environment HPC Cluster Config Data
  • 62. Example Use Cases Cycle software tools securely orchestrate scientific, engineering, and finance workloads across AWS 10,600 instance Top 10 Pharma Molecular Modeling for Cancer Targets 160 core MPI/Engineering Nuclear Reactor, Hard Disk, & Rocket 1 Million hours RNA Analysis Genomics/Next- Gen Sequencing
  • 63. Survey of Use Cases þ Drug Design þ CAE þ Genomics
  • 65. Every day is ! crucial and costly
  • 66. Challenge: Novartis run 341,700 hours of docking! against a cancer target (impossible to do in-house)
  • 71. Metric Count Compute Hours of Science 341,700 hours Compute Days of Science 14,238 days Compute Years of Science 39 years AWS Instance Count 10,600 instances CycleCloud & Cloud! finished impossible run, ! $44 Million in servers, in…
  • 72. 39 years of drug design in 11 hours on 10,600 servers for < $4,372
  • 73. Does this lead to new drugs?
  • 74. Novartis announced 3 new compounds going into screening based upon this 1 run
  • 75. Survey of Use Cases þ Drug Design þ CAE þ Genomics …
  • 77. Aerospace Corp @ AWS re:Invent
  • 78. Reached out to Cycle Computing in Summer 2013
  • 79. Aerospace Corp @ AWS re:Invent
  • 80. Rocket Design on AWS/Cycle Remove caps on scale, speed time to result No QueueWait for users TBs FS Engineer File System Corporate Firewall Of 3 Quarters TC Cluster Of 3 Quarters TC Cluster Auto-scale Secure HPC Cluster Internal Cluster Application AWS GovCloud
  • 81. HGST, A Western Digital Company
  • 82. HGST, A Western Digital Company
  • 83. Burst for TC workloads
  • 84. Results • Gave 10 minutes to get 64-core to 512-core environments! • Parallel scaling greatly improved time-to-results • Simplified submission interface removed file transfer and remote computing complexity • Auto-scaling environments decreased costs while increasing science completed.
  • 85. Survey of Use Cases þ Drug Design þ CAD/CAM þ Genomics …
  • 90. Results • Continuously deliver up to 8,000 core of compute power to support online genomic analysis • Complicated application w/Web, NoSQL, TC clusters • Analysis as a Service for all IonTorrent users
  • 91. Survey of Use Cases þ Drug Design þ CAD/CAM þ Genomics …
  • 93. You believe utility access to technical computing accelerates invention
  • 94. Supercomputers = great Flickr: LouisRix
  • 95. And so is Cloud… Flickr: MaCDavis
  • 96. Computing can revolutionize materials science
  • 97. We need to engineer solutions for climate, and lower our footprint
  • 98. Buy land in Canada?
  • 99. We can solve this problem
  • 100. Across many science, engineering, and risk Life Sciences Life Sciences • 156,000-core utility supercomputer in the Cloud • Used $68M in servers for 18 hours for $33,000 • Simulated 205,000 materials (264 years) in 18 hours Nuclear engineering Utilities / Energy • Approximately 600-core MPI workloads run in Cloud • Ran workloads in months rather than years • Introduction to production in 3 weeks Rocket Design & Simulation Asset & Liability Modeling (Insurance / Risk Mgmt) • Completes monthly/quarterly runs 5-10x faster • Use 1600 cores in the cloud to shorten elapsed run time from ~10 days to ~ 1-2 days Manufacturing & Design • Enable end user on-demand access to 1000s of cores • Avoid cost of buying new servers • Accelerated science and CAD/CAM process • 39.5 years of drug compound computations in 9 hours, at a total cost of $4,372 • 10,000 server cluster seamlessly spanned US/EU regions • Advanced 3 new otherwise unknown compounds in wet lab • Moving HPC workloads to cloud for burst capacity • Amazon GovCloud, Security, and key management • Up to 1000s of cores for burst / agility
  • 101. To continuously deliver HPC to accelerate the human mind…
  • 102. Does this resonate with you? We’re hiring like crazy: Software developers, HPC engineers, devops, sales, etc. jobs@ cyclecomputing.com