SlideShare a Scribd company logo
1 of 14
Office of Instructional and
Research Technology
Very large computing and the real
world
a very few thoughts
Eric Marshall
Associate Director for Research Technology
Rutgers University
Office of Instructional and
Research Technology
Shock and awe
Bigger is better!
Office of Instructional and
Research Technology
The shiny future
• Newer is Better!
Office of Instructional and
Research Technology
The real world
• Bugs, warts, and the eternal problem of hindsight
Office of Instructional and
Research Technology
The problem of architecture
• Build as you go vs. predicting the future
Office of Instructional and
Research Technology
Where do you put and for how long?
• The problem of 2x foot print in the land of 24x7
Office of Instructional and
Research Technology
Who is expert?
• Is the architect, programmer, scientist, owner, vendor
or bottle washer expert? Complex problems are hard.
Office of Instructional and
Research Technology
“Anyone who understands the system isn’t doing
science!”
• The problem of users
Office of Instructional and
Research Technology
Supercomputers are disposable
• 3 to 5 year ‘shelf life’
Office of Instructional and
Research Technology
“This system sucks, the last one was better!”
(no matter how many systems)
• The problem of transition: porting, change and habits
Office of Instructional and
Research Technology
Goldlock’s paradox
• The problem of useful use: efficient programming, useful scaling,
overhead, keeping track of results, allocation, etc.
Office of Instructional and
Research Technology
Goldlock’s paradox (cont’d)
• Someone will always say the solution is around around the corner!
Office of Instructional and
Research Technology
Scaling is deadly
• Scaling problems: OS/SAN/code/people/etc.
Large Scale Cluster (LSC)
SGI Origin 3800 + 3900, 600MHz
2 Nodes x 512 PE + 512GB + 2.9TB disk
5 Nodes x 256 PE + 256GB + .9TB disk
1 Node x 128 PE + 128GB + .9TB disk
SAN Bandwidth: 2GB/s per LSC Node
CXFS, PCP, Workshop Pro,GridEngine, S-Plus,
TotalView, Matlab, NAG SMP, Mathmatica
Analysis Cluster (ANC)
SGI Origin 3900, 600 MHz, 2 Nodes x 96 PE + 96GB + 4.2TB disk
SAN Bandwidth: 2GB/s per ANC Node
GridEngine, CXFS, PCP, Workshop Pro
Tape SAN
4 x STK 9310 Tape Libraries
24 x 9940B Drives (200GB, 30MB/s)
22 x 9840A Drives (20GB, 10MB/s)
3.5PB Tape Storage On-Line
1.5PB Off-Line
LAN
Cisco Catalyst 6509
4 x 16 GbE
2 x 48 Fast Ethernet
SAN (FC) Switch
Brocade 2800 & 3800
Redundant Access
Dual-Ported
Fiber Channel
MetaData Server (MDS)
HFS & HSMS Server
SGI Origin 3800, 600 MHz,
2 Nodes x 64 PE + 64GB
Disk SAN: 4GB/s per MDS Node
Tape SAN: 1GB/s per MDS Node
2.8TB disk, Failsafe, DMF, CXFS
Onyx 3 - Infinite Reality 3
Computational Capability & Capacity
89 Coupled Climate Model Years
Per Computational Day
1 deg. Ocean Model
2 deg. Atmospheric
Disk SAN
23.6TB SAN Disk
TP9100B
5+P+HS RAID5
w/Dual Controllers
2Gbit/s Fibre
GFDL HPCS
July 2005
CCCI Cluster (IC)
SGI Altix 3700, 1.5GHz
2 Nodes x 256 PE + 512GB + 2TB disk
1 Node x 96 PE + 192GB + 3TB disk
SAN Bandwidth:
2GigE/Node, NFS mounted
PCP, Workshop Pro,GridEngine,
TotalView,
NAG
Office of Instructional and
Research Technology
Questions?
Eric Marshall
Office of Instructional and Research Technology
eric.marshall@rutgers.edu
732 445-2262

More Related Content

Viewers also liked

壮絶!さくらのレンタルサーバ構築・運用の舞台裏~The nightmare of onpremise~(第25回さくらの夕べ)
壮絶!さくらのレンタルサーバ構築・運用の舞台裏~The nightmare of onpremise~(第25回さくらの夕べ)壮絶!さくらのレンタルサーバ構築・運用の舞台裏~The nightmare of onpremise~(第25回さくらの夕べ)
壮絶!さくらのレンタルサーバ構築・運用の舞台裏~The nightmare of onpremise~(第25回さくらの夕べ)さくらインターネット株式会社
 
Azure: Docker Container orchestration, PaaS ( Service Farbic ) and High avail...
Azure: Docker Container orchestration, PaaS ( Service Farbic ) and High avail...Azure: Docker Container orchestration, PaaS ( Service Farbic ) and High avail...
Azure: Docker Container orchestration, PaaS ( Service Farbic ) and High avail...Alexey Bokov
 
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Spark Summit
 
Docker Swarm モード にゅうもん
Docker Swarm モード にゅうもんDocker Swarm モード にゅうもん
Docker Swarm モード にゅうもんMasahito Zembutsu
 
Service Fabric – building tomorrows applications today
Service Fabric – building tomorrows applications todayService Fabric – building tomorrows applications today
Service Fabric – building tomorrows applications todayBizTalk360
 
.NET Coreとツール類の今
.NET Coreとツール類の今.NET Coreとツール類の今
.NET Coreとツール類の今Yuki Igarashi
 
Apache Flink vs Apache Spark - Reproducible experiments on cloud.
Apache Flink vs Apache Spark - Reproducible experiments on cloud.Apache Flink vs Apache Spark - Reproducible experiments on cloud.
Apache Flink vs Apache Spark - Reproducible experiments on cloud.Shelan Perera
 
me-2.0-6-years-of-life
me-2.0-6-years-of-life me-2.0-6-years-of-life
me-2.0-6-years-of-life Kenji Hiranabe
 
45分で理解する SQL Serverでできることできないこと
45分で理解する SQL Serverでできることできないこと45分で理解する SQL Serverでできることできないこと
45分で理解する SQL ServerでできることできないことInsight Technology, Inc.
 
Windows10とRaspberryPiで MakerFaireTokyoに参加
Windows10とRaspberryPiでMakerFaireTokyoに参加Windows10とRaspberryPiでMakerFaireTokyoに参加
Windows10とRaspberryPiで MakerFaireTokyoに参加Yuki Ando
 
Introduction to ACI APIs
Introduction to ACI APIsIntroduction to ACI APIs
Introduction to ACI APIsCisco DevNet
 

Viewers also liked (15)

壮絶!さくらのレンタルサーバ構築・運用の舞台裏~The nightmare of onpremise~(第25回さくらの夕べ)
壮絶!さくらのレンタルサーバ構築・運用の舞台裏~The nightmare of onpremise~(第25回さくらの夕べ)壮絶!さくらのレンタルサーバ構築・運用の舞台裏~The nightmare of onpremise~(第25回さくらの夕べ)
壮絶!さくらのレンタルサーバ構築・運用の舞台裏~The nightmare of onpremise~(第25回さくらの夕べ)
 
【さくらのクラウド】DNSアプライアンス導入ガイド
【さくらのクラウド】DNSアプライアンス導入ガイド【さくらのクラウド】DNSアプライアンス導入ガイド
【さくらのクラウド】DNSアプライアンス導入ガイド
 
Actors evolved- Rotem Hermon
Actors evolved- Rotem HermonActors evolved- Rotem Hermon
Actors evolved- Rotem Hermon
 
BMP Test Results
BMP Test ResultsBMP Test Results
BMP Test Results
 
Azure: Docker Container orchestration, PaaS ( Service Farbic ) and High avail...
Azure: Docker Container orchestration, PaaS ( Service Farbic ) and High avail...Azure: Docker Container orchestration, PaaS ( Service Farbic ) and High avail...
Azure: Docker Container orchestration, PaaS ( Service Farbic ) and High avail...
 
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
 
MicroServices on Azure
MicroServices on AzureMicroServices on Azure
MicroServices on Azure
 
Docker Swarm モード にゅうもん
Docker Swarm モード にゅうもんDocker Swarm モード にゅうもん
Docker Swarm モード にゅうもん
 
Service Fabric – building tomorrows applications today
Service Fabric – building tomorrows applications todayService Fabric – building tomorrows applications today
Service Fabric – building tomorrows applications today
 
.NET Coreとツール類の今
.NET Coreとツール類の今.NET Coreとツール類の今
.NET Coreとツール類の今
 
Apache Flink vs Apache Spark - Reproducible experiments on cloud.
Apache Flink vs Apache Spark - Reproducible experiments on cloud.Apache Flink vs Apache Spark - Reproducible experiments on cloud.
Apache Flink vs Apache Spark - Reproducible experiments on cloud.
 
me-2.0-6-years-of-life
me-2.0-6-years-of-life me-2.0-6-years-of-life
me-2.0-6-years-of-life
 
45分で理解する SQL Serverでできることできないこと
45分で理解する SQL Serverでできることできないこと45分で理解する SQL Serverでできることできないこと
45分で理解する SQL Serverでできることできないこと
 
Windows10とRaspberryPiで MakerFaireTokyoに参加
Windows10とRaspberryPiでMakerFaireTokyoに参加Windows10とRaspberryPiでMakerFaireTokyoに参加
Windows10とRaspberryPiで MakerFaireTokyoに参加
 
Introduction to ACI APIs
Introduction to ACI APIsIntroduction to ACI APIs
Introduction to ACI APIs
 

Similar to high performance computing exposed

Convolutional neural networks for speech controlled prosthetic hands
Convolutional neural networks for speech controlled prosthetic handsConvolutional neural networks for speech controlled prosthetic hands
Convolutional neural networks for speech controlled prosthetic handsMohsen Jafarzadeh
 
Modern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High PerformanceModern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High Performanceinside-BigData.com
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceinside-BigData.com
 
Barcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de RiquezaBarcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de RiquezaFacultad de Informática UCM
 
Making Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and DistributedMaking Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and DistributedTuri, Inc.
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...BigDataEverywhere
 
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...Chester Chen
 
A Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesA Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesIchigaku Takigawa
 
Project Tungsten Phase II: Joining a Billion Rows per Second on a Laptop
Project Tungsten Phase II: Joining a Billion Rows per Second on a LaptopProject Tungsten Phase II: Joining a Billion Rows per Second on a Laptop
Project Tungsten Phase II: Joining a Billion Rows per Second on a LaptopDatabricks
 
Cluster Filesystems and the next 1000 human genomes
Cluster Filesystems and the next 1000 human genomesCluster Filesystems and the next 1000 human genomes
Cluster Filesystems and the next 1000 human genomesGuy Coates
 
OpenNebulaconf2017US: Rapid scaling of research computing to over 70,000 cor...
OpenNebulaconf2017US:  Rapid scaling of research computing to over 70,000 cor...OpenNebulaconf2017US:  Rapid scaling of research computing to over 70,000 cor...
OpenNebulaconf2017US: Rapid scaling of research computing to over 70,000 cor...OpenNebula Project
 
Implementing AI: Hardware Challenges
Implementing AI: Hardware ChallengesImplementing AI: Hardware Challenges
Implementing AI: Hardware ChallengesKTN
 
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Community
 
Ncar globally accessible user environment
Ncar globally accessible user environmentNcar globally accessible user environment
Ncar globally accessible user environmentinside-BigData.com
 
E Science As A Lens On The World Lazowska
E Science As A Lens On The World   LazowskaE Science As A Lens On The World   Lazowska
E Science As A Lens On The World Lazowskaguest43b4df3
 
E Science As A Lens On The World Lazowska
E Science As A Lens On The World   LazowskaE Science As A Lens On The World   Lazowska
E Science As A Lens On The World LazowskaWCET
 

Similar to high performance computing exposed (20)

Convolutional neural networks for speech controlled prosthetic hands
Convolutional neural networks for speech controlled prosthetic handsConvolutional neural networks for speech controlled prosthetic hands
Convolutional neural networks for speech controlled prosthetic hands
 
Modern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High PerformanceModern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High Performance
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental science
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Available HPC resources at CSUC
 
Barcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de RiquezaBarcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de Riqueza
 
Making Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and DistributedMaking Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and Distributed
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
 
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
 
A Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree EnsemblesA Modern Introduction to Decision Tree Ensembles
A Modern Introduction to Decision Tree Ensembles
 
Project Tungsten Phase II: Joining a Billion Rows per Second on a Laptop
Project Tungsten Phase II: Joining a Billion Rows per Second on a LaptopProject Tungsten Phase II: Joining a Billion Rows per Second on a Laptop
Project Tungsten Phase II: Joining a Billion Rows per Second on a Laptop
 
Cluster Filesystems and the next 1000 human genomes
Cluster Filesystems and the next 1000 human genomesCluster Filesystems and the next 1000 human genomes
Cluster Filesystems and the next 1000 human genomes
 
OpenNebulaconf2017US: Rapid scaling of research computing to over 70,000 cor...
OpenNebulaconf2017US:  Rapid scaling of research computing to over 70,000 cor...OpenNebulaconf2017US:  Rapid scaling of research computing to over 70,000 cor...
OpenNebulaconf2017US: Rapid scaling of research computing to over 70,000 cor...
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Available HPC resources at CSUC
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Available HPC resources at CSUC
 
Implementing AI: Hardware Challenges
Implementing AI: Hardware ChallengesImplementing AI: Hardware Challenges
Implementing AI: Hardware Challenges
 
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
 
Ncar globally accessible user environment
Ncar globally accessible user environmentNcar globally accessible user environment
Ncar globally accessible user environment
 
What is 3d torus
What is 3d torusWhat is 3d torus
What is 3d torus
 
E Science As A Lens On The World Lazowska
E Science As A Lens On The World   LazowskaE Science As A Lens On The World   Lazowska
E Science As A Lens On The World Lazowska
 
E Science As A Lens On The World Lazowska
E Science As A Lens On The World   LazowskaE Science As A Lens On The World   Lazowska
E Science As A Lens On The World Lazowska
 

Recently uploaded

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 

Recently uploaded (20)

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 

high performance computing exposed

  • 1. Office of Instructional and Research Technology Very large computing and the real world a very few thoughts Eric Marshall Associate Director for Research Technology Rutgers University
  • 2. Office of Instructional and Research Technology Shock and awe Bigger is better!
  • 3. Office of Instructional and Research Technology The shiny future • Newer is Better!
  • 4. Office of Instructional and Research Technology The real world • Bugs, warts, and the eternal problem of hindsight
  • 5. Office of Instructional and Research Technology The problem of architecture • Build as you go vs. predicting the future
  • 6. Office of Instructional and Research Technology Where do you put and for how long? • The problem of 2x foot print in the land of 24x7
  • 7. Office of Instructional and Research Technology Who is expert? • Is the architect, programmer, scientist, owner, vendor or bottle washer expert? Complex problems are hard.
  • 8. Office of Instructional and Research Technology “Anyone who understands the system isn’t doing science!” • The problem of users
  • 9. Office of Instructional and Research Technology Supercomputers are disposable • 3 to 5 year ‘shelf life’
  • 10. Office of Instructional and Research Technology “This system sucks, the last one was better!” (no matter how many systems) • The problem of transition: porting, change and habits
  • 11. Office of Instructional and Research Technology Goldlock’s paradox • The problem of useful use: efficient programming, useful scaling, overhead, keeping track of results, allocation, etc.
  • 12. Office of Instructional and Research Technology Goldlock’s paradox (cont’d) • Someone will always say the solution is around around the corner!
  • 13. Office of Instructional and Research Technology Scaling is deadly • Scaling problems: OS/SAN/code/people/etc. Large Scale Cluster (LSC) SGI Origin 3800 + 3900, 600MHz 2 Nodes x 512 PE + 512GB + 2.9TB disk 5 Nodes x 256 PE + 256GB + .9TB disk 1 Node x 128 PE + 128GB + .9TB disk SAN Bandwidth: 2GB/s per LSC Node CXFS, PCP, Workshop Pro,GridEngine, S-Plus, TotalView, Matlab, NAG SMP, Mathmatica Analysis Cluster (ANC) SGI Origin 3900, 600 MHz, 2 Nodes x 96 PE + 96GB + 4.2TB disk SAN Bandwidth: 2GB/s per ANC Node GridEngine, CXFS, PCP, Workshop Pro Tape SAN 4 x STK 9310 Tape Libraries 24 x 9940B Drives (200GB, 30MB/s) 22 x 9840A Drives (20GB, 10MB/s) 3.5PB Tape Storage On-Line 1.5PB Off-Line LAN Cisco Catalyst 6509 4 x 16 GbE 2 x 48 Fast Ethernet SAN (FC) Switch Brocade 2800 & 3800 Redundant Access Dual-Ported Fiber Channel MetaData Server (MDS) HFS & HSMS Server SGI Origin 3800, 600 MHz, 2 Nodes x 64 PE + 64GB Disk SAN: 4GB/s per MDS Node Tape SAN: 1GB/s per MDS Node 2.8TB disk, Failsafe, DMF, CXFS Onyx 3 - Infinite Reality 3 Computational Capability & Capacity 89 Coupled Climate Model Years Per Computational Day 1 deg. Ocean Model 2 deg. Atmospheric Disk SAN 23.6TB SAN Disk TP9100B 5+P+HS RAID5 w/Dual Controllers 2Gbit/s Fibre GFDL HPCS July 2005 CCCI Cluster (IC) SGI Altix 3700, 1.5GHz 2 Nodes x 256 PE + 512GB + 2TB disk 1 Node x 96 PE + 192GB + 3TB disk SAN Bandwidth: 2GigE/Node, NFS mounted PCP, Workshop Pro,GridEngine, TotalView, NAG
  • 14. Office of Instructional and Research Technology Questions? Eric Marshall Office of Instructional and Research Technology eric.marshall@rutgers.edu 732 445-2262

Editor's Notes

  1. Note title change Personal intro
  2. Human’s ability to plan and abstract is powerful and useful, however… Side effects happen – Aswan high dam, Egypt –> fish populations and salinity of the Mediterranean Sea
  3. First artificial heart lasted 50 minutes! (http://en.wikipedia.org/wiki/Artificial_heart)
  4. Heidemarie Stefanyshyn-Piper’stool box (http://www.google.com/hostednews/ap/article/ALeqM5h1W8dcUP9H70AmlSfDSenPteDT9gD94HJO401) Nov. 17th 2008 Human’s ability to plan and abstract is powerful and useful, however… Side effects happen – Aswan high dam, Egypt –> fish populations and salinity of the Mediterranean Sea First artificial heart lasted 50 minutes! (http://en.wikipedia.org/wiki/Artificial_heart)
  5. Solvable for repeatable tasks, not so much for the bleeding edge
  6. Systems are complex enough that computer scientists, IT/sys admins, and domain scientists are forced into each others domain. Most have no wish to do this! The result is ugly and wasteful.
  7. ENIAC My pocket has more computing power than the entire Allied forces of the Second World War. Yet supercomputers are not built to be replaced.
  8. Computers come and go – CODE is FOREVER! Also the user experience.
  9. Engineering moves ahead not always in sync with the users needs. Changing systems is a pain – bigger systems = bigger problems. Does not help the user experience.
  10. Humans are compelled to try big tasks Staff does not scale.