SlideShare a Scribd company logo
Convergence for scientific
method – HPC, AI, Simulation
and Experiment
HPC-AI Advisory Council
Presented by Alan Real, Durham University
17th September 2019
∂
• A City in the North East of England
• Steeped in Heritage and Culture
• UNESCO world heritage site & other heritage assets
• 3rd oldest University in UK, 1837 Royal Charter
• 7th Century history: Venerable Bede.
• 16 Colleges, Faculties of Sciences, Social Sciences
and Arts and Humanities
∂
• 18,000 students, 3000 staff
• HPC, high priority for science faculty.
• Most highly-cited papers are computational
• Created dedicated research computing
support Directorate, within Research function
∂
ARC@Durham
Advanced
Research
Computing
Research
&
Innovation
CIS / IT
Services
Applied
Research
Computer
Science
Enabling research to be at the forefront of computational
and data-intensive practice.
Key components
• People & Skills
• Research Aspirations
• eInfrastructure: Hardware & Software
Interdisciplinarity
• Collaborative culture
• Bridge between Academic Practice and operational
services
• Networks of cross-cutting methods, underpinning
disciplinary themes.
• Business development help maximise opportunity.
Team
• Research Software Engineering (3 FTE)
• Platforms (5 FTE)
• Business development
Values criticality and contribution of technical expertise.
∂
The research computing platform
Data
storage &
Networking
Cloud
HPC
Visualization
Data
repository
Database
Lab
equipment
Data sets
Advice
Software
Research
question Value
∂
HPC @Durham
18 Years of frankly heroic effort by 2 individuals running two facilities:
Lydia Heck – COSMA – Institute of computational cosmology
Henk Slim – Hamilton – general purpose, mainly science Faculty
Purpose built room: AHDC-1
Upgrades to power and cooling ~540kW capacity (2001-4)
Generator, cooling and UPS refurbishment (2009)
Second purpose built room: AHDC-2 (adjacent & Now Lydia Heck DC)
1MW electrical infrastructure, 800kW cooling load.
Have housed 7 generations of COSMA and Hamilton facilities
∂
COSMA-3 (2006)
∂
COSMA-4 (2010)
∂
COSMA-5 (2012)
∂
COSMA6&7 (2017-18)
COSMA6 relocated from Hartree centre & combined with COSMA5 ~11k cores
∂
Main driver for HPC strategy…
The scientific method requires it!
Key metric of success:
Were the objectives of the science case met?
∂
Virgo consortium outputs
Simulations run for months
Millennium:
3rd most cited paper in
Nature
Eagle: 4096 cores, 47 days
Most highly cited paper
in space science in
2015
5 years from inception to publication of the first paper
10-15 people designing and executing the simulations, exploited by 100s
∂
System stability
Understanding the formation of the Universe in unprecedented detail
Begins here…
∂
The knowledge continuum
Plumbing
Researchquestion
Application
Software
ComputerHardware
Domainknowledge
Systemdesign
Computercomponents
Workflow
Numerics
Operatingenvironment
Require building of intellectual capability across whole spectrum
Interplay of components is important
Ensure flow of knowledge across spectrum to enable research.
Algorithms
∂
Conveying computational challenges
What does successful research
computing look like?
Lots of conversations & conferences
Some internal business cases.
Not much cite-able literature.
If it matters:
Can we publish a peer-reviewed
publication?
Can we do this within the research
domain?
Encourage other members of this
community to do the same!
∂
N8 Research Partnership is the 8 most research intensive universities in
the North of England – Durham, Lancaster, Leeds, Liverpool,
Manchester, Newcastle, Sheffield and York
N8 Centre of Excellence for Computationally Intensive Research (N8 CIR):
– Established to build on this partnership to expand the art of the possible
within research themes by leveraging computational- and data-intensive
practice.
Derived from N8 HPC which offered access to a shared compute platform
for all 8 universities for 6 years.
∂
NICE19
Conversation around future of AI & accelerated computing
GPU card memory regarded as limiting factor – explore coherence.
Quantification of inferencing error through model uncertainty.
Conversation with the Bioscientists
Want to get the most out of lab experiment data
Identical challenges within advanced materials
Bring experiment and simulation data together to inform each other.
- Requires ability to drive the network.
- Combine observation, AI, models & simulation to advance
understanding.
- In AI, only the machine learns to predict.
- We build models to understand & use simulation to verify.
∂
Experiment & AI workflow
Experiment Observation
Data
Analysis
AI Prediction
Where is the hypothesis?
Where is the Understanding?
∂
Convergence…
Data
Observation
PredictionUnderstanding
Experiment
ComputationTheory
Instrumentation
SimulationModelling
∂
Final Remarks
HPC (& broader research computing) is part of scientific method.
Be willing to do something different!!
It is getting increasingly difficult to undertake modern experiment without
HPC.
Convergence, for me is not having a single platform to conquer them all:
It is about the need to explore phenomena from different perspectives.
Only through using simulations of models in conjunction with AI and data,
will we understand the models our machines learn.
∂
Acknowledgements
ICC @ Durham
Lydia Heck, Carlos Frenk, Richard Bower
DiRAC consortium
Mark Wilkinson, Clare Jenner, Jeremy Yates
Hamilton HPC
Stewart Clark, Henk Slim

More Related Content

What's hot

OpenCL
OpenCLOpenCL
DATE 2018
DATE 2018DATE 2018
DATE 2018
LEGATO project
 
Survey and Research Challenges in Big Data
Survey and Research Challenges in Big DataSurvey and Research Challenges in Big Data
Survey and Research Challenges in Big Data
Association of Scientists, Developers and Faculties
 
Использование стека Hadoop для построения сервиса сверки данных НДС
Использование стека Hadoop для построения сервиса сверки данных НДСИспользование стека Hadoop для построения сервиса сверки данных НДС
Использование стека Hadoop для построения сервиса сверки данных НДС
CEE-SEC(R)
 
When Holt-Winters is Better Than ML | Anais Dotis-Georgiou | InfluxData
When Holt-Winters is Better Than ML | Anais Dotis-Georgiou | InfluxDataWhen Holt-Winters is Better Than ML | Anais Dotis-Georgiou | InfluxData
When Holt-Winters is Better Than ML | Anais Dotis-Georgiou | InfluxData
InfluxData
 
The (hidden) patterns - networks
The (hidden) patterns - networksThe (hidden) patterns - networks
The (hidden) patterns - networks
Henk Harmsen
 
Document semantic characterization
Document semantic characterizationDocument semantic characterization
Document semantic characterization
Luca Mazzola
 
Advanced Systems Engineering
Advanced Systems EngineeringAdvanced Systems Engineering
Advanced Systems Engineering
FoCAS Initiative
 
Digipedia Pitch
Digipedia PitchDigipedia Pitch
Digipedia PitchTony Hirst
 
Pl data science october 2017
Pl data science october 2017Pl data science october 2017
Pl data science october 2017
Data Science Leuven
 
Data Science vs. Machine Learning vs. Artificial Intelligience
Data Science vs. Machine Learning vs. Artificial IntelligienceData Science vs. Machine Learning vs. Artificial Intelligience
Data Science vs. Machine Learning vs. Artificial Intelligience
Stefan Nica
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
ijistjournal
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
ijistjournal
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
ijistjournal
 
CZECH REPUBLIC IN THE EOSC ARENA
CZECH REPUBLIC IN THE EOSC ARENACZECH REPUBLIC IN THE EOSC ARENA
CZECH REPUBLIC IN THE EOSC ARENA
EOSC-hub project
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
ijistjournal
 

What's hot (16)

OpenCL
OpenCLOpenCL
OpenCL
 
DATE 2018
DATE 2018DATE 2018
DATE 2018
 
Survey and Research Challenges in Big Data
Survey and Research Challenges in Big DataSurvey and Research Challenges in Big Data
Survey and Research Challenges in Big Data
 
Использование стека Hadoop для построения сервиса сверки данных НДС
Использование стека Hadoop для построения сервиса сверки данных НДСИспользование стека Hadoop для построения сервиса сверки данных НДС
Использование стека Hadoop для построения сервиса сверки данных НДС
 
When Holt-Winters is Better Than ML | Anais Dotis-Georgiou | InfluxData
When Holt-Winters is Better Than ML | Anais Dotis-Georgiou | InfluxDataWhen Holt-Winters is Better Than ML | Anais Dotis-Georgiou | InfluxData
When Holt-Winters is Better Than ML | Anais Dotis-Georgiou | InfluxData
 
The (hidden) patterns - networks
The (hidden) patterns - networksThe (hidden) patterns - networks
The (hidden) patterns - networks
 
Document semantic characterization
Document semantic characterizationDocument semantic characterization
Document semantic characterization
 
Advanced Systems Engineering
Advanced Systems EngineeringAdvanced Systems Engineering
Advanced Systems Engineering
 
Digipedia Pitch
Digipedia PitchDigipedia Pitch
Digipedia Pitch
 
Pl data science october 2017
Pl data science october 2017Pl data science october 2017
Pl data science october 2017
 
Data Science vs. Machine Learning vs. Artificial Intelligience
Data Science vs. Machine Learning vs. Artificial IntelligienceData Science vs. Machine Learning vs. Artificial Intelligience
Data Science vs. Machine Learning vs. Artificial Intelligience
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
 
CZECH REPUBLIC IN THE EOSC ARENA
CZECH REPUBLIC IN THE EOSC ARENACZECH REPUBLIC IN THE EOSC ARENA
CZECH REPUBLIC IN THE EOSC ARENA
 
Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...Call for Papers - International Journal of Information Sciences and Technique...
Call for Papers - International Journal of Information Sciences and Technique...
 

Similar to Convergence for Scientific Method - HPC, AI, Simulation and Experiment

Practitioner's perspective on High Performance Computing services for innovat...
Practitioner's perspective on High Performance Computing services for innovat...Practitioner's perspective on High Performance Computing services for innovat...
Practitioner's perspective on High Performance Computing services for innovat...
Huawei Enterprise Hong Kong
 
Implementing AI: Hardware Challenges
Implementing AI: Hardware ChallengesImplementing AI: Hardware Challenges
Implementing AI: Hardware Challenges
KTN
 
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructures
guest0dc425
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
Ola Spjuth
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Alexandru Iosup
 
Pioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSCPioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSC
inside-BigData.com
 
AI for Science
AI for ScienceAI for Science
AI for Science
inside-BigData.com
 
National scale research computing and beyond pearc panel 2017
National scale research computing and beyond   pearc panel 2017National scale research computing and beyond   pearc panel 2017
National scale research computing and beyond pearc panel 2017
Gregory Newby
 
Future of hpc
Future of hpcFuture of hpc
Future of hpc
Putchong Uthayopas
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
Larry Smarr
 
2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf
LevLafayette1
 
Stories About Spark, HPC and Barcelona by Jordi Torres
Stories About Spark, HPC and Barcelona by Jordi TorresStories About Spark, HPC and Barcelona by Jordi Torres
Stories About Spark, HPC and Barcelona by Jordi Torres
Spark Summit
 
Introducing SURF
Introducing SURF Introducing SURF
Introducing SURF
annetteuva
 
Nicholas Dacre ai_pm_presentation
Nicholas Dacre ai_pm_presentationNicholas Dacre ai_pm_presentation
Nicholas Dacre ai_pm_presentation
Association for Project Management
 
Convergencia HPC - Big Data: Arquitectura y modelos de programación
Convergencia HPC - Big Data:  Arquitectura y modelos de programaciónConvergencia HPC - Big Data:  Arquitectura y modelos de programación
Convergencia HPC - Big Data: Arquitectura y modelos de programación
Facultad de Informática UCM
 
The Pacific Research Platform: a Science-Driven Big-Data Freeway System
The Pacific Research Platform: a Science-Driven Big-Data Freeway SystemThe Pacific Research Platform: a Science-Driven Big-Data Freeway System
The Pacific Research Platform: a Science-Driven Big-Data Freeway System
Larry Smarr
 
SC13 Diary
SC13 DiarySC13 Diary
SC13 Diary
Guy Tel-Zur
 
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Richard Zijdeman
 
NIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWGNIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWG
Geoffrey Fox
 

Similar to Convergence for Scientific Method - HPC, AI, Simulation and Experiment (20)

Practitioner's perspective on High Performance Computing services for innovat...
Practitioner's perspective on High Performance Computing services for innovat...Practitioner's perspective on High Performance Computing services for innovat...
Practitioner's perspective on High Performance Computing services for innovat...
 
Implementing AI: Hardware Challenges
Implementing AI: Hardware ChallengesImplementing AI: Hardware Challenges
Implementing AI: Hardware Challenges
 
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructures
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.
 
Pioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSCPioneering and Democratizing Scalable HPC+AI at PSC
Pioneering and Democratizing Scalable HPC+AI at PSC
 
AI for Science
AI for ScienceAI for Science
AI for Science
 
National scale research computing and beyond pearc panel 2017
National scale research computing and beyond   pearc panel 2017National scale research computing and beyond   pearc panel 2017
National scale research computing and beyond pearc panel 2017
 
Future of hpc
Future of hpcFuture of hpc
Future of hpc
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
 
2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf
 
Stories About Spark, HPC and Barcelona by Jordi Torres
Stories About Spark, HPC and Barcelona by Jordi TorresStories About Spark, HPC and Barcelona by Jordi Torres
Stories About Spark, HPC and Barcelona by Jordi Torres
 
Introducing SURF
Introducing SURF Introducing SURF
Introducing SURF
 
Nicholas Dacre ai_pm_presentation
Nicholas Dacre ai_pm_presentationNicholas Dacre ai_pm_presentation
Nicholas Dacre ai_pm_presentation
 
grid computing
grid computinggrid computing
grid computing
 
Convergencia HPC - Big Data: Arquitectura y modelos de programación
Convergencia HPC - Big Data:  Arquitectura y modelos de programaciónConvergencia HPC - Big Data:  Arquitectura y modelos de programación
Convergencia HPC - Big Data: Arquitectura y modelos de programación
 
The Pacific Research Platform: a Science-Driven Big-Data Freeway System
The Pacific Research Platform: a Science-Driven Big-Data Freeway SystemThe Pacific Research Platform: a Science-Driven Big-Data Freeway System
The Pacific Research Platform: a Science-Driven Big-Data Freeway System
 
SC13 Diary
SC13 DiarySC13 Diary
SC13 Diary
 
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
Linked Open Data: Combining Data for the Social Sciences and Humanities (and ...
 
NIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWGNIST Big Data Public Working Group NBD-PWG
NIST Big Data Public Working Group NBD-PWG
 

More from inside-BigData.com

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 

Recently uploaded (20)

Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 

Convergence for Scientific Method - HPC, AI, Simulation and Experiment

  • 1. Convergence for scientific method – HPC, AI, Simulation and Experiment HPC-AI Advisory Council Presented by Alan Real, Durham University 17th September 2019
  • 2. ∂ • A City in the North East of England • Steeped in Heritage and Culture • UNESCO world heritage site & other heritage assets • 3rd oldest University in UK, 1837 Royal Charter • 7th Century history: Venerable Bede. • 16 Colleges, Faculties of Sciences, Social Sciences and Arts and Humanities
  • 3. ∂ • 18,000 students, 3000 staff • HPC, high priority for science faculty. • Most highly-cited papers are computational • Created dedicated research computing support Directorate, within Research function
  • 4. ∂ ARC@Durham Advanced Research Computing Research & Innovation CIS / IT Services Applied Research Computer Science Enabling research to be at the forefront of computational and data-intensive practice. Key components • People & Skills • Research Aspirations • eInfrastructure: Hardware & Software Interdisciplinarity • Collaborative culture • Bridge between Academic Practice and operational services • Networks of cross-cutting methods, underpinning disciplinary themes. • Business development help maximise opportunity. Team • Research Software Engineering (3 FTE) • Platforms (5 FTE) • Business development Values criticality and contribution of technical expertise.
  • 5. ∂ The research computing platform Data storage & Networking Cloud HPC Visualization Data repository Database Lab equipment Data sets Advice Software Research question Value
  • 6. ∂ HPC @Durham 18 Years of frankly heroic effort by 2 individuals running two facilities: Lydia Heck – COSMA – Institute of computational cosmology Henk Slim – Hamilton – general purpose, mainly science Faculty Purpose built room: AHDC-1 Upgrades to power and cooling ~540kW capacity (2001-4) Generator, cooling and UPS refurbishment (2009) Second purpose built room: AHDC-2 (adjacent & Now Lydia Heck DC) 1MW electrical infrastructure, 800kW cooling load. Have housed 7 generations of COSMA and Hamilton facilities
  • 10. ∂ COSMA6&7 (2017-18) COSMA6 relocated from Hartree centre & combined with COSMA5 ~11k cores
  • 11. ∂ Main driver for HPC strategy… The scientific method requires it! Key metric of success: Were the objectives of the science case met?
  • 12. ∂ Virgo consortium outputs Simulations run for months Millennium: 3rd most cited paper in Nature Eagle: 4096 cores, 47 days Most highly cited paper in space science in 2015 5 years from inception to publication of the first paper 10-15 people designing and executing the simulations, exploited by 100s
  • 13. ∂ System stability Understanding the formation of the Universe in unprecedented detail Begins here…
  • 14. ∂ The knowledge continuum Plumbing Researchquestion Application Software ComputerHardware Domainknowledge Systemdesign Computercomponents Workflow Numerics Operatingenvironment Require building of intellectual capability across whole spectrum Interplay of components is important Ensure flow of knowledge across spectrum to enable research. Algorithms
  • 15. ∂ Conveying computational challenges What does successful research computing look like? Lots of conversations & conferences Some internal business cases. Not much cite-able literature. If it matters: Can we publish a peer-reviewed publication? Can we do this within the research domain? Encourage other members of this community to do the same!
  • 16. ∂ N8 Research Partnership is the 8 most research intensive universities in the North of England – Durham, Lancaster, Leeds, Liverpool, Manchester, Newcastle, Sheffield and York N8 Centre of Excellence for Computationally Intensive Research (N8 CIR): – Established to build on this partnership to expand the art of the possible within research themes by leveraging computational- and data-intensive practice. Derived from N8 HPC which offered access to a shared compute platform for all 8 universities for 6 years.
  • 17. ∂ NICE19 Conversation around future of AI & accelerated computing GPU card memory regarded as limiting factor – explore coherence. Quantification of inferencing error through model uncertainty. Conversation with the Bioscientists Want to get the most out of lab experiment data Identical challenges within advanced materials Bring experiment and simulation data together to inform each other. - Requires ability to drive the network. - Combine observation, AI, models & simulation to advance understanding. - In AI, only the machine learns to predict. - We build models to understand & use simulation to verify.
  • 18. ∂ Experiment & AI workflow Experiment Observation Data Analysis AI Prediction Where is the hypothesis? Where is the Understanding?
  • 20. ∂ Final Remarks HPC (& broader research computing) is part of scientific method. Be willing to do something different!! It is getting increasingly difficult to undertake modern experiment without HPC. Convergence, for me is not having a single platform to conquer them all: It is about the need to explore phenomena from different perspectives. Only through using simulations of models in conjunction with AI and data, will we understand the models our machines learn.
  • 21. ∂ Acknowledgements ICC @ Durham Lydia Heck, Carlos Frenk, Richard Bower DiRAC consortium Mark Wilkinson, Clare Jenner, Jeremy Yates Hamilton HPC Stewart Clark, Henk Slim