SlideShare a Scribd company logo
HPC at NIBR
Nick Holway
Scientific Computing Group, NIBR
HPC Advisory Council, Lugano
Twitter: @nickholway LinkedIn: https://www.linkedin.com/in/nickholway/
April 2017
Novartis Institutes for
Biomedical Research
(NIBR)
HPC means we can get
better, more targeted
drugs to patients quicker
Public
Novartis Institutes for Biomedical Research
Today’s talk
1. Introducing Novartis
2. A very quick introduction to drug discovery
3. What our HPC looks like
4. Examples of how we use HPC to accelerate drug discovery
5. Outlook
Public
Introducing Novartis
Novartis Institutes for Biomedical Research
Novartis
Public
Innovative Medicines
Sandoz
Pharmaceuticals
business unit
Oncology
business unit
Alcon
R&DR&D
Novartis Institutes for Biomedical Research
Drug discovery and early development
Public
~6,000
Scientists /
7 sites globally
~90
New Molecular Entities
~400
Research projects
>500
Ongoing
clinical trials>400 Computational Scientists
Novartis Institutes for Biomedical Research
NIBR
ATI
CVM
Other
ID
MSD
ONC
IO
OPH
RESP
NEURO
Public
Organised around prevalent Disease Areas
Note: Distribution of ~90 New Molecular Entities at NIBR
Immuno-Oncology
Oncology
Ophthalmology
Respiratory Diseases
Neuroscience
Autoimmunity, Transplantation & Inflammation
Cardiovascular & Metabolism
Infectious Diseases
Musculoskeletal
A very quick introduction to
drug discovery
Novartis Institutes for Biomedical Research
What’s a drug?
• ”A pharmaceutical drug is a drug used to diagnose, cure, treat, or prevent
disease” (https://en.wikipedia.org/wiki/Pharmaceutical_drug)
• Examples of some medicines and their “mechanisms”:
– Cancer: blocking cells dividing by disrupting DNA replication or the cells’ internal
skeletons
– Infectious diseases: disrupting bacterial cell walls
– Depression: Preventing serotonin re-uptake in nerve cells in the brain
Public
Novartis Institutes for Biomedical Research
The path to a new medicine
Public
Discovery Clinical trials Evaluation
Post-
approval
Target
selection
Drug
research and
design
Preclinical
research
Proof of
Concept
5–15
patients
Phase I
20–100
healthy
volunteers
and/or
patients
Phase II
100–500
patients
Phase III
1000–5000
patients
Submission
Review by
regulatory
authority
Phase IV
Post-marketing
surveillance and
research
Manufacturing
Investigational New Drug (IND)
Application submitted
NDA/ BLA*
submitted
Approval of
one new medicine
10 – 15 years
>10 000
Compounds
<250
Compounds
<5
Compounds
*New Drug Application / Biologics Licence Application
What our HPC looks like
Novartis Institutes for Biomedical Research
HPC at NIBR - Hardware
• x86 servers
– Intel Xeon CPUs
– 128-768GB RAM
– FDR Infiniband
– 10GigE
• Specialised nodes
– Nvidia GPUs
– >=1TB RAM
• Isilon storage
– CIFS/NFS
– 10GigE to Arista switches
• Lustre
– Scratch
Public
Novartis Institutes for Biomedical Research
HPC at NIBR - Software
• RHEL 6.x
• Univa Grid Engine for scheduling
• Software compilation & configuration
– Easybuild
– Modules
– GCC, Intel, Nvidia compilers
• Languages: C++, Fortran, CUDA, Python, R, Matlab
• Libraries: *MPI, MKL etc
• The software stack is identical on Linux desktops and “scientific servers”
Public
Novartis Institutes for Biomedical Research
HPC at NIBR - Humans
• HPC is provided by the Scientific Computing Group (SciComp)
• Global team (Europe, USA, Asia)
• Complementary backgrounds and skills
– Sysadmins
– Mathematicians
– Scientists
• HPCWire award winners in 2014
• Other teams in NIBR Informatics provide storage, Linux servers, etc.
Public
Novartis Institutes for Biomedical Research
HPC at NIBR: Community
• We’ve worked very hard to build an interdisciplinary group of informatics
scientists to share knowledge
• Various activities
– Fortnightly informal talks
– Social events
– Deep Learning “bootcamp”
– 24hr virtual multi-site workshop (Shanghai -> California!)
• This started out from the grassroots and has now been formally funded within
the Company
Public
Novartis Institutes for Biomedical Research
HPC elsewhere in Novartis
• Today’s talk covers Research; however HPC is used elsewhere in the
Company for
– Modelling Drug absorption, metabolism & secretion (PK/PD)
– Processing data from Clinical Trials
– Predicting where in the lungs inhaled drugs go (CFD)
• The cluster used for this work is much more tightly controlled and tested than
NIBR’s systems
Public
Examples of how we use HPC
to accelerate drug discovery
Novartis Institutes for Biomedical Research
Using HPC in early drug discovery
• There are many different ways NIBR scientists use HPC
– Molecular dynamics
– NGS analysis
– Ligand-protein docking
– Image analysis
– Cryo-EM analysis
• Our usage is similar to a university with biology and chemistry departments
• In today’s talk I’ll focus on using HPC to accelerate phenotypic assays
Public
Novartis Institutes for Biomedical Research
Phenotypic assays
• Traditionally our scientists have used biochemical assays in early stage drug
discovery
– Assays use an isolated enzyme or protein and measure fluorescence etc.
– This tells us very little about the cells and how they react
• Increasingly our scientists are using “phenotypic assays” using cells grown in a
lab
– Scientists can see the impact of their drug on an entire cell or population of
cells
Public
Novartis Institutes for Biomedical Research
Example: wound healing
Public
24 hrs
Images from http://cellprofiler.org/examples/#Wound
Novartis Institutes for Biomedical Research
What is High Content Screening
(HCS)
• A method for identifying molecules which alter the phenotype of cells (eg cell
shape, number etc) or small organisms (eg Malaria parasites)
• Using robotics & automated microscopes a large number of potential drugs can
be ”screened” in a few hours or days
• Assays can generate a lot of data
– Videos
– Millions of images
– >600TB/yr for some HCS instruments
Public
Novartis Institutes for Biomedical Research
Accelerating MND/ALS disease
research with GPUs
Public
Novartis Institutes for Biomedical Research
In-vitro model for neuromuscular
junctions
• Faulty junctions between motor neurons and muscle cells are implicated in
MND/ALS
• We’d like to create a drug which corrects this
• Motor neurons & myotube (muscle fibre) cells were “co-cultured” in a “plate” to
which drug candidates are added
• Cells were imaged in real time to measure their contractility
• This is very hard to see by eye and also hard to segment using computers
Public
Novartis Institutes for Biomedical Research
What do the cells look like?
Public
Figure: I Hossain
Novartis Institutes for Biomedical Research
Motion estimated with Optic Flow
Public
Different contracting regions
Total area under contraction
Figure: I Hossain
Novartis Institutes for Biomedical Research
Impact of HPC
• A good joint project between bench scientists, lab automation experts &
informaticians
• 80x increase of throughput compared to CPU
• NIBR scientists have access to new method of monitoring myotube contractility
Public
Novartis Institutes for Biomedical Research
Deep learning for HCS image
analysis
Public
Novartis Institutes for Biomedical Research
CNNs for HCS image analysis
• HCS analysis is traditionally performed using tools such as CellProfiler, Fiji or
commercial tools
• Deep Learning approaches are becoming increasingly used for image analysis
• A team has investigated Convolutional Neural Networks for deriving images’
phenotypes
• They used only the images’ pixel intensity values with no a priori knowledge
• They used public and Novartis datasets
Public
Novartis Institutes for Biomedical Research
Outcome
• The images were classified better than conventional methods
• This is included tracking a response to drugs
• There was no need to design a unique pipeline for the processing
Public
Novartis Institutes for Biomedical Research
Interested in knowing more?
• This work has been published (including some code) in Godinez et al: “A multi-
scale convolutional neural network for phenotyping high-content cellular
images”, Bioinformatics btx069, https://doi.org/10.1093/bioinformatics/btx069
Public
Novartis Institutes for Biomedical Research
Pushing HPC to non-technical
scientists
Public
Novartis Institutes for Biomedical Research
Why bench scientists need HPC
(and don’t realise it!)
• Bench scientists generally do not know how to programme or use the Linux
command line
• Many scientists’ data has grown too big to be processed on a single
workstation
• This means they have to wait a long time for the data to be processed and also
they may need to wait for an informatician to become available
• If you can give a scientist the tools to analyse their data at scale then they get
their data sooner and enables the informaticians to focus on more complex
tasks
Public
Novartis Institutes for Biomedical Research
Pushing HCS analysis to bench
scientists
• Our scientists create pipelines using CellProfiler (http://cellprofiler.org/) using
the normal GUI on their laptops
• The Pipeline is then uploaded to a central server at each site
• The scientist can kick off a analysis run on the cluster using the same webpage
that they use to visualise their images
Public
Novartis Institutes for Biomedical Research
Screenshots of the GUI
Public
Novartis Institutes for Biomedical Research
Also (ab)using Jenkins
• Our scientists have automated cluster submission using the continuous
integration tool, Jenkins, again with a web front end
• The work has been published: https://doi.org/10.1177/1087057116679993
• Source freely available at https://github.com/Novartis/Jenkins-LSCI
Public
Outlook
Novartis Institutes for Biomedical Research
HPC Trends
• GPUs / Intel Phi / FPGAs
– Deep learning
– Cryo-EM
• Real time collection & processing of data from clinical trials
• Integration of “big data” technologies such as Apache Spark into HPC
• HPC in the cloud
– Currently most useful for bursting or embarrassingly parallel jobs
Public
Thank you
Back up
HPC in the cloud
Novartis Institutes for Biomedical Research
HPC in the cloud
• NIBR have used Amazon EC2 for compute workloads
– Cycle computing
• ISVs eg DNANexus
– Bioinformatics NGS
Public
Novartis Institutes for Biomedical Research
Docking at scale in the cloud
• Ligand-protein docking is “to predict the position and orientation of a ligand (a
small molecule) when it is bound to a protein receptor or enzyme” (Wikipedia)
• Embarrassingly parallel - compute-heavy / data-light
• We used the cloud to screen 10 million molecules against a cancer target
Public
Novartis Institutes for Biomedical Research
How we did it
• Cycle computing’s software (Cycle server, Cyclecloud)
• Over 10,000 EC2 spot instances
– Extensive benchmarking to select instance type
• Licence files (licence servers cannot cope with the load)
• Proprietary compounds run in NIBR’s VPC, others in “public”
• See http://opensource.nibr.com/videos/aws-litster/ and
http://cyclecomputing.com/novartis-taps-cloud-hpc-for-faster-drug-discovery-
better-science/
Public
Novartis Institutes for Biomedical Research
Where we’re going in the cloud
• “Cloud by default” for many non-HPC applications
• Clinical data (subject to “informed consent”)
• HPC where appropriate
– IB etc for tightly-coupled parallel jobs usually unavailable
– Data locality challenging
Public

More Related Content

What's hot

Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
inside-BigData.com
 
Accelerating apache spark with rdma
Accelerating apache spark with rdmaAccelerating apache spark with rdma
Accelerating apache spark with rdma
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Linaro
 
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
inside-BigData.com
 
Modern Software Architecture
Modern Software Architecture Modern Software Architecture
Modern Software Architecture
Ahmed Marzouk
 
Systems Support for Many Task Computing
Systems Support for Many Task ComputingSystems Support for Many Task Computing
Systems Support for Many Task Computing
Eric Van Hensbergen
 
Windows Server 2008 R2 Dev Session 02
Windows Server 2008 R2 Dev Session 02Windows Server 2008 R2 Dev Session 02
Windows Server 2008 R2 Dev Session 02
Clint Edmonson
 
NNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for SupercomputingNNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for Supercomputing
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote
Ceph Community
 
The ECP Exascale Computing Project
The ECP Exascale Computing ProjectThe ECP Exascale Computing Project
The ECP Exascale Computing Project
inside-BigData.com
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Intel® Software
 
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC TechnologiesAccelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
DPDK In Depth
DPDK In DepthDPDK In Depth
DPDK In Depth
Kernel TLV
 
EBPF and Linux Networking
EBPF and Linux NetworkingEBPF and Linux Networking
EBPF and Linux Networking
PLUMgrid
 
Hardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLHardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and ML
inside-BigData.com
 
Foss Gadgematics
Foss GadgematicsFoss Gadgematics
Foss Gadgematics
Bud Siddhisena
 
TAU E4S ON OpenPOWER /POWER9 platform
TAU E4S ON OpenPOWER /POWER9 platformTAU E4S ON OpenPOWER /POWER9 platform
TAU E4S ON OpenPOWER /POWER9 platform
Ganesan Narayanasamy
 

What's hot (20)

Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 
Accelerating apache spark with rdma
Accelerating apache spark with rdmaAccelerating apache spark with rdma
Accelerating apache spark with rdma
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
 
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
 
Modern Software Architecture
Modern Software Architecture Modern Software Architecture
Modern Software Architecture
 
Systems Support for Many Task Computing
Systems Support for Many Task ComputingSystems Support for Many Task Computing
Systems Support for Many Task Computing
 
Windows Server 2008 R2 Dev Session 02
Windows Server 2008 R2 Dev Session 02Windows Server 2008 R2 Dev Session 02
Windows Server 2008 R2 Dev Session 02
 
NNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for SupercomputingNNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for Supercomputing
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote
 
The ECP Exascale Computing Project
The ECP Exascale Computing ProjectThe ECP Exascale Computing Project
The ECP Exascale Computing Project
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
 
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC TechnologiesAccelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
DPDK In Depth
DPDK In DepthDPDK In Depth
DPDK In Depth
 
EBPF and Linux Networking
EBPF and Linux NetworkingEBPF and Linux Networking
EBPF and Linux Networking
 
Hardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and MLHardware & Software Platforms for HPC, AI and ML
Hardware & Software Platforms for HPC, AI and ML
 
Foss Gadgematics
Foss GadgematicsFoss Gadgematics
Foss Gadgematics
 
TAU E4S ON OpenPOWER /POWER9 platform
TAU E4S ON OpenPOWER /POWER9 platformTAU E4S ON OpenPOWER /POWER9 platform
TAU E4S ON OpenPOWER /POWER9 platform
 

Similar to HPC at NIBR

Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Spark Summit
 
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Nolan Nichols
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
European Bioinformatics Institute
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
Maryann Martone
 
Big data's impact on healthcare
Big data's impact on healthcareBig data's impact on healthcare
Big data's impact on healthcare
René Kuipers
 
Supporting high throughput high-biotechnologies in today’s research environme...
Supporting high throughput high-biotechnologies in today’s research environme...Supporting high throughput high-biotechnologies in today’s research environme...
Supporting high throughput high-biotechnologies in today’s research environme...
Ed Dodds
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
Philippa Griffin
 
CLinical Research Office at Ponce Health Sciences Foundation/Ponce Research I...
CLinical Research Office at Ponce Health Sciences Foundation/Ponce Research I...CLinical Research Office at Ponce Health Sciences Foundation/Ponce Research I...
CLinical Research Office at Ponce Health Sciences Foundation/Ponce Research I...
Dr. Roberto Torres
 
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
National Cancer Institute National Cancer Informatics Program
 
The case for cloud computing in Life Sciences
The case for cloud computing in Life SciencesThe case for cloud computing in Life Sciences
The case for cloud computing in Life Sciences
Ola Spjuth
 
Data Landscapes: The Neuroscience Information Framework
Data Landscapes:  The Neuroscience Information FrameworkData Landscapes:  The Neuroscience Information Framework
Data Landscapes: The Neuroscience Information Framework
Maryann Martone
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
Bioinformatics and Computational Biosciences Branch
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association Studies
Bastian Greshake
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
Bonnie Hurwitz
 
PhRMA Some Early Thoughts
PhRMA Some Early ThoughtsPhRMA Some Early Thoughts
PhRMA Some Early Thoughts
Philip Bourne
 
Providing support for JC Bradleys vision of open science using RSC cheminform...
Providing support for JC Bradleys vision of open science using RSC cheminform...Providing support for JC Bradleys vision of open science using RSC cheminform...
Providing support for JC Bradleys vision of open science using RSC cheminform...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Medical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructuresMedical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructures
Institute of Information Systems (HES-SO)
 
Data Landscapes - Addiction
Data Landscapes - AddictionData Landscapes - Addiction
Data Landscapes - Addiction
Neuroscience Information Framework
 
Aug2014 giab intro slides
Aug2014 giab intro slidesAug2014 giab intro slides
Aug2014 giab intro slides
GenomeInABottle
 
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...CTSI at UCSF
 

Similar to HPC at NIBR (20)

Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
 
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
 
Big data's impact on healthcare
Big data's impact on healthcareBig data's impact on healthcare
Big data's impact on healthcare
 
Supporting high throughput high-biotechnologies in today’s research environme...
Supporting high throughput high-biotechnologies in today’s research environme...Supporting high throughput high-biotechnologies in today’s research environme...
Supporting high throughput high-biotechnologies in today’s research environme...
 
An Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data ResourceAn Oz Mammals Bioinformatics and Data Resource
An Oz Mammals Bioinformatics and Data Resource
 
CLinical Research Office at Ponce Health Sciences Foundation/Ponce Research I...
CLinical Research Office at Ponce Health Sciences Foundation/Ponce Research I...CLinical Research Office at Ponce Health Sciences Foundation/Ponce Research I...
CLinical Research Office at Ponce Health Sciences Foundation/Ponce Research I...
 
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
 
The case for cloud computing in Life Sciences
The case for cloud computing in Life SciencesThe case for cloud computing in Life Sciences
The case for cloud computing in Life Sciences
 
Data Landscapes: The Neuroscience Information Framework
Data Landscapes:  The Neuroscience Information FrameworkData Landscapes:  The Neuroscience Information Framework
Data Landscapes: The Neuroscience Information Framework
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association Studies
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
PhRMA Some Early Thoughts
PhRMA Some Early ThoughtsPhRMA Some Early Thoughts
PhRMA Some Early Thoughts
 
Providing support for JC Bradleys vision of open science using RSC cheminform...
Providing support for JC Bradleys vision of open science using RSC cheminform...Providing support for JC Bradleys vision of open science using RSC cheminform...
Providing support for JC Bradleys vision of open science using RSC cheminform...
 
Medical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructuresMedical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructures
 
Data Landscapes - Addiction
Data Landscapes - AddictionData Landscapes - Addiction
Data Landscapes - Addiction
 
Aug2014 giab intro slides
Aug2014 giab intro slidesAug2014 giab intro slides
Aug2014 giab intro slides
 
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
 

More from inside-BigData.com

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
inside-BigData.com
 
Data Parallel Deep Learning
Data Parallel Deep LearningData Parallel Deep Learning
Data Parallel Deep Learning
inside-BigData.com
 
Making Supernovae with Jets
Making Supernovae with JetsMaking Supernovae with Jets
Making Supernovae with Jets
inside-BigData.com
 
Adaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and EigensolversAdaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and Eigensolvers
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
 
Data Parallel Deep Learning
Data Parallel Deep LearningData Parallel Deep Learning
Data Parallel Deep Learning
 
Making Supernovae with Jets
Making Supernovae with JetsMaking Supernovae with Jets
Making Supernovae with Jets
 
Adaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and EigensolversAdaptive Linear Solvers and Eigensolvers
Adaptive Linear Solvers and Eigensolvers
 

Recently uploaded

How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 

Recently uploaded (20)

How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 

HPC at NIBR

  • 1. HPC at NIBR Nick Holway Scientific Computing Group, NIBR HPC Advisory Council, Lugano Twitter: @nickholway LinkedIn: https://www.linkedin.com/in/nickholway/ April 2017 Novartis Institutes for Biomedical Research (NIBR)
  • 2. HPC means we can get better, more targeted drugs to patients quicker Public
  • 3. Novartis Institutes for Biomedical Research Today’s talk 1. Introducing Novartis 2. A very quick introduction to drug discovery 3. What our HPC looks like 4. Examples of how we use HPC to accelerate drug discovery 5. Outlook Public
  • 5. Novartis Institutes for Biomedical Research Novartis Public Innovative Medicines Sandoz Pharmaceuticals business unit Oncology business unit Alcon R&DR&D
  • 6. Novartis Institutes for Biomedical Research Drug discovery and early development Public ~6,000 Scientists / 7 sites globally ~90 New Molecular Entities ~400 Research projects >500 Ongoing clinical trials>400 Computational Scientists
  • 7. Novartis Institutes for Biomedical Research NIBR ATI CVM Other ID MSD ONC IO OPH RESP NEURO Public Organised around prevalent Disease Areas Note: Distribution of ~90 New Molecular Entities at NIBR Immuno-Oncology Oncology Ophthalmology Respiratory Diseases Neuroscience Autoimmunity, Transplantation & Inflammation Cardiovascular & Metabolism Infectious Diseases Musculoskeletal
  • 8. A very quick introduction to drug discovery
  • 9. Novartis Institutes for Biomedical Research What’s a drug? • ”A pharmaceutical drug is a drug used to diagnose, cure, treat, or prevent disease” (https://en.wikipedia.org/wiki/Pharmaceutical_drug) • Examples of some medicines and their “mechanisms”: – Cancer: blocking cells dividing by disrupting DNA replication or the cells’ internal skeletons – Infectious diseases: disrupting bacterial cell walls – Depression: Preventing serotonin re-uptake in nerve cells in the brain Public
  • 10. Novartis Institutes for Biomedical Research The path to a new medicine Public Discovery Clinical trials Evaluation Post- approval Target selection Drug research and design Preclinical research Proof of Concept 5–15 patients Phase I 20–100 healthy volunteers and/or patients Phase II 100–500 patients Phase III 1000–5000 patients Submission Review by regulatory authority Phase IV Post-marketing surveillance and research Manufacturing Investigational New Drug (IND) Application submitted NDA/ BLA* submitted Approval of one new medicine 10 – 15 years >10 000 Compounds <250 Compounds <5 Compounds *New Drug Application / Biologics Licence Application
  • 11. What our HPC looks like
  • 12. Novartis Institutes for Biomedical Research HPC at NIBR - Hardware • x86 servers – Intel Xeon CPUs – 128-768GB RAM – FDR Infiniband – 10GigE • Specialised nodes – Nvidia GPUs – >=1TB RAM • Isilon storage – CIFS/NFS – 10GigE to Arista switches • Lustre – Scratch Public
  • 13. Novartis Institutes for Biomedical Research HPC at NIBR - Software • RHEL 6.x • Univa Grid Engine for scheduling • Software compilation & configuration – Easybuild – Modules – GCC, Intel, Nvidia compilers • Languages: C++, Fortran, CUDA, Python, R, Matlab • Libraries: *MPI, MKL etc • The software stack is identical on Linux desktops and “scientific servers” Public
  • 14. Novartis Institutes for Biomedical Research HPC at NIBR - Humans • HPC is provided by the Scientific Computing Group (SciComp) • Global team (Europe, USA, Asia) • Complementary backgrounds and skills – Sysadmins – Mathematicians – Scientists • HPCWire award winners in 2014 • Other teams in NIBR Informatics provide storage, Linux servers, etc. Public
  • 15. Novartis Institutes for Biomedical Research HPC at NIBR: Community • We’ve worked very hard to build an interdisciplinary group of informatics scientists to share knowledge • Various activities – Fortnightly informal talks – Social events – Deep Learning “bootcamp” – 24hr virtual multi-site workshop (Shanghai -> California!) • This started out from the grassroots and has now been formally funded within the Company Public
  • 16. Novartis Institutes for Biomedical Research HPC elsewhere in Novartis • Today’s talk covers Research; however HPC is used elsewhere in the Company for – Modelling Drug absorption, metabolism & secretion (PK/PD) – Processing data from Clinical Trials – Predicting where in the lungs inhaled drugs go (CFD) • The cluster used for this work is much more tightly controlled and tested than NIBR’s systems Public
  • 17. Examples of how we use HPC to accelerate drug discovery
  • 18. Novartis Institutes for Biomedical Research Using HPC in early drug discovery • There are many different ways NIBR scientists use HPC – Molecular dynamics – NGS analysis – Ligand-protein docking – Image analysis – Cryo-EM analysis • Our usage is similar to a university with biology and chemistry departments • In today’s talk I’ll focus on using HPC to accelerate phenotypic assays Public
  • 19. Novartis Institutes for Biomedical Research Phenotypic assays • Traditionally our scientists have used biochemical assays in early stage drug discovery – Assays use an isolated enzyme or protein and measure fluorescence etc. – This tells us very little about the cells and how they react • Increasingly our scientists are using “phenotypic assays” using cells grown in a lab – Scientists can see the impact of their drug on an entire cell or population of cells Public
  • 20. Novartis Institutes for Biomedical Research Example: wound healing Public 24 hrs Images from http://cellprofiler.org/examples/#Wound
  • 21. Novartis Institutes for Biomedical Research What is High Content Screening (HCS) • A method for identifying molecules which alter the phenotype of cells (eg cell shape, number etc) or small organisms (eg Malaria parasites) • Using robotics & automated microscopes a large number of potential drugs can be ”screened” in a few hours or days • Assays can generate a lot of data – Videos – Millions of images – >600TB/yr for some HCS instruments Public
  • 22. Novartis Institutes for Biomedical Research Accelerating MND/ALS disease research with GPUs Public
  • 23. Novartis Institutes for Biomedical Research In-vitro model for neuromuscular junctions • Faulty junctions between motor neurons and muscle cells are implicated in MND/ALS • We’d like to create a drug which corrects this • Motor neurons & myotube (muscle fibre) cells were “co-cultured” in a “plate” to which drug candidates are added • Cells were imaged in real time to measure their contractility • This is very hard to see by eye and also hard to segment using computers Public
  • 24. Novartis Institutes for Biomedical Research What do the cells look like? Public Figure: I Hossain
  • 25. Novartis Institutes for Biomedical Research Motion estimated with Optic Flow Public Different contracting regions Total area under contraction Figure: I Hossain
  • 26. Novartis Institutes for Biomedical Research Impact of HPC • A good joint project between bench scientists, lab automation experts & informaticians • 80x increase of throughput compared to CPU • NIBR scientists have access to new method of monitoring myotube contractility Public
  • 27. Novartis Institutes for Biomedical Research Deep learning for HCS image analysis Public
  • 28. Novartis Institutes for Biomedical Research CNNs for HCS image analysis • HCS analysis is traditionally performed using tools such as CellProfiler, Fiji or commercial tools • Deep Learning approaches are becoming increasingly used for image analysis • A team has investigated Convolutional Neural Networks for deriving images’ phenotypes • They used only the images’ pixel intensity values with no a priori knowledge • They used public and Novartis datasets Public
  • 29. Novartis Institutes for Biomedical Research Outcome • The images were classified better than conventional methods • This is included tracking a response to drugs • There was no need to design a unique pipeline for the processing Public
  • 30. Novartis Institutes for Biomedical Research Interested in knowing more? • This work has been published (including some code) in Godinez et al: “A multi- scale convolutional neural network for phenotyping high-content cellular images”, Bioinformatics btx069, https://doi.org/10.1093/bioinformatics/btx069 Public
  • 31. Novartis Institutes for Biomedical Research Pushing HPC to non-technical scientists Public
  • 32. Novartis Institutes for Biomedical Research Why bench scientists need HPC (and don’t realise it!) • Bench scientists generally do not know how to programme or use the Linux command line • Many scientists’ data has grown too big to be processed on a single workstation • This means they have to wait a long time for the data to be processed and also they may need to wait for an informatician to become available • If you can give a scientist the tools to analyse their data at scale then they get their data sooner and enables the informaticians to focus on more complex tasks Public
  • 33. Novartis Institutes for Biomedical Research Pushing HCS analysis to bench scientists • Our scientists create pipelines using CellProfiler (http://cellprofiler.org/) using the normal GUI on their laptops • The Pipeline is then uploaded to a central server at each site • The scientist can kick off a analysis run on the cluster using the same webpage that they use to visualise their images Public
  • 34. Novartis Institutes for Biomedical Research Screenshots of the GUI Public
  • 35. Novartis Institutes for Biomedical Research Also (ab)using Jenkins • Our scientists have automated cluster submission using the continuous integration tool, Jenkins, again with a web front end • The work has been published: https://doi.org/10.1177/1087057116679993 • Source freely available at https://github.com/Novartis/Jenkins-LSCI Public
  • 37. Novartis Institutes for Biomedical Research HPC Trends • GPUs / Intel Phi / FPGAs – Deep learning – Cryo-EM • Real time collection & processing of data from clinical trials • Integration of “big data” technologies such as Apache Spark into HPC • HPC in the cloud – Currently most useful for bursting or embarrassingly parallel jobs Public
  • 40. HPC in the cloud
  • 41. Novartis Institutes for Biomedical Research HPC in the cloud • NIBR have used Amazon EC2 for compute workloads – Cycle computing • ISVs eg DNANexus – Bioinformatics NGS Public
  • 42. Novartis Institutes for Biomedical Research Docking at scale in the cloud • Ligand-protein docking is “to predict the position and orientation of a ligand (a small molecule) when it is bound to a protein receptor or enzyme” (Wikipedia) • Embarrassingly parallel - compute-heavy / data-light • We used the cloud to screen 10 million molecules against a cancer target Public
  • 43. Novartis Institutes for Biomedical Research How we did it • Cycle computing’s software (Cycle server, Cyclecloud) • Over 10,000 EC2 spot instances – Extensive benchmarking to select instance type • Licence files (licence servers cannot cope with the load) • Proprietary compounds run in NIBR’s VPC, others in “public” • See http://opensource.nibr.com/videos/aws-litster/ and http://cyclecomputing.com/novartis-taps-cloud-hpc-for-faster-drug-discovery- better-science/ Public
  • 44. Novartis Institutes for Biomedical Research Where we’re going in the cloud • “Cloud by default” for many non-HPC applications • Clinical data (subject to “informed consent”) • HPC where appropriate – IB etc for tightly-coupled parallel jobs usually unavailable – Data locality challenging Public