
Building a GPU-enabled OpenStack Cloud for HPC - Lance Wilson, Monash University


Audience Level
Intermediate

Synopsis
M3 is the latest-generation system of the MASSIVE project, an HPC facility specializing in characterization science (imaging and visualization). Using OpenStack as the compute provisioning layer, M3 is a hybrid HPC/cloud system, custom-integrated by Monash’s R@CMon Research Cloud team. Built to support Monash University’s next-gen high-throughput instrument processing requirements, M3 is roughly half GPU-accelerated and half CPU-only.

We’ll discuss the design and technology used to build this innovative platform, as well as the approaches and challenges involved in building GPU-enabled and HPC clouds. We’ll also discuss some of the software and processing pipelines that this system supports and highlight the importance of tuning for these workloads.
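
As a rough illustration of how GPU capacity is typically exposed through OpenStack's compute layer (the exact M3 configuration is not covered here), the minimal Python sketch below publishes a PCI-passthrough-backed GPU flavor with python-novaclient. The flavor name, sizing, "k80" alias, and credentials are illustrative assumptions, and the hypervisors are assumed to already whitelist the GPUs and define the matching PCI alias in nova.conf.

# Minimal sketch, assumptions throughout: publish a GPU flavor backed by PCI
# passthrough. Assumes nova.conf on the compute nodes already whitelists the
# GPUs and defines a PCI alias named "k80"; names, sizes and credentials below
# are placeholders, not M3's actual settings.
from keystoneauth1 import loading, session
from novaclient import client

loader = loading.get_plugin_loader("password")
auth = loader.load_from_options(
    auth_url="https://keystone.example.org:5000/v3",  # placeholder endpoint
    username="admin", password="secret", project_name="admin",
    user_domain_name="Default", project_domain_name="Default",
)
nova = client.Client("2", session=session.Session(auth=auth))

# A CPU/memory shape large enough for the cryo-EM tools, plus one passthrough GPU.
flavor = nova.flavors.create(name="mon.gpu-k80", ram=65536, vcpus=12, disk=40)
flavor.set_keys({"pci_passthrough:alias": "k80:1"})  # request one GPU per instance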

Speaker Bio
Blair Bethwaite: Blair has worked in distributed computing at Monash University for 10 years, with OpenStack for half of that. He has served as team lead, architect, administrator, user, researcher, and occasional hacker, and his unique perspective as a science power-user, developer, and system architect has helped guide the evolution of the research computing engine central to Monash’s 21st Century Microscope.

Lance Wilson: Lance is a mechanical engineer who has been making tools to break things for the last 20 years. His career has moved through a number of engineering subdisciplines, from manufacturing to bioengineering. Now he supports the national characterisation research community in Melbourne, Australia, using OpenStack to create HPC systems that solve problems too large for your laptop.


Building a GPU-enabled OpenStack Cloud for HPC - Lance Wilson, Monash University

  1. Angstrom-Scale Microscopy on OpenStack. Dr Lance Wilson, Senior HPC Consultant @ MASSIVE
  2. Source: https://www.monash.edu/research/infrastructure/delivering-impact/research-outcomes/cryo-em/half-a-million-dollar-tick
  3. Figure 5. 3D structure of the Ebola spike. Beniac DR, Melito PL, deVarennes SL, Hiebert SL, Rabb MJ, et al. (2012) The Organisation of Ebola Virus Reveals a Capacity for Extensive, Modular Polyploidy. PLOS ONE 7(1): e29608. https://doi.org/10.1371/journal.pone.0029608 http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0029608
  4. openstack.org
  5. openstack.org
  6. “Seeing Molecular Interactions of Large Complexes by Cryo Electron Microscopy at Atomic Resolution” Laurie J. Wang Z.H. Zhou 2014
  7. 3D reconstruction of the electron density of the aE11 Fab’ polyC9 complex (openstack.org)
  8. What is the scope? Or: how big is the computer?
  9. Compute and Storage Requirements: ~1-4 TB raw data per sample (~2000-5000 files); pipeline analysis with internal and external tools; GPUs with large memory (> 8 GB); large system memory (> 64 GB); 200-400 CPU cores; parallel file reads and writes
  10. Why use OpenStack for this workflow?
  11. Typical workflow for processing using Relion2
  12. https://sites.google.com/site/emcloudprocessing/home/relion2#TOC-Benchmark-Tests (a sketch of launching a GPU-enabled Relion2 Class2D run appears after the slide list)
  13. Motion Correction: How to achieve maximum processing speed? # GPUs = # raw movies; storage I/O > processing I/O. The electron beam drifts during collection and the results need to be shifted to account for it (http://www.ncbi.nlm.nih.gov/pubmed/23644547). Software: motioncorr 2.1. Benchmark: 3.4 GB of raw movie data (15 movies) reduced to 218 MB of corrected micrographs (15 images); processing time 109 s on a single Nvidia K80
  14. Case study - Motion Correction: using a local desktop, ~3 hrs (limited by local storage and network access, single GPU); using a remote desktop on MASSIVE, ~45 mins (limited by GPU, 2 per desktop); using an in-house parallel scripted version, ~4.5 mins (limited by file system bandwidth, 1-2 GB/s). A 10x speedup! (The throughput arithmetic and one-GPU-per-movie pattern are sketched after the slide list.)
  15. How many GPUs do you need? Relion2 Class 2D step
  16. Chart: Relion Class 2D, effect of number of K80 GPUs; processing time (minutes) versus number of GPUs (4, 8, 12, 16)
  17. How does this hardware compare against workstations?
  18. Chart: processing time (hours) versus hardware configuration for the 3D classification step
  19. Insert picture of tweet
  20. Cloud HPC vs Commodity Cloud: what is the difference?
  21. Acknowledgments: Jon Mansour and Jafar Lie for help gathering benchmarking results, and Mike Wang (NVIDIA) for access to the NVIDIA DGX-1
  22. Questions?
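
Working from the figures quoted on slides 13 and 14 (3.4 GB of raw movies corrected in 109 s on one K80, with one GPU per raw movie as the ideal), the minimal Python sketch below estimates the aggregate storage read bandwidth needed to keep N GPUs busy and shows a generic one-worker-per-GPU scheduler. The motioncorr 2.1 invocation (binary name and -gpu flag) is an assumption for illustration, not taken from the talk.

import subprocess
from multiprocessing.pool import ThreadPool

# Slide 13: 3.4 GB of raw movies corrected in 109 s on a single K80.
BYTES_PER_GPU_PER_SEC = 3.4e9 / 109  # roughly 31 MB/s of reads per busy GPU

def required_read_bandwidth_mb(n_gpus):
    """Aggregate read bandwidth (MB/s) needed to keep n_gpus busy."""
    return n_gpus * BYTES_PER_GPU_PER_SEC / 1e6

def correct_on_gpu(args):
    """Motion-correct a list of movies on one GPU, one after another.
    The binary name and -gpu flag are illustrative assumptions."""
    gpu_id, movies = args
    for movie in movies:
        subprocess.run(["dosefgpu_driftcorr", movie, "-gpu", str(gpu_id)], check=True)

def correct_all(movies, n_gpus):
    # Slide 13's ideal is one GPU per raw movie (n_gpus == len(movies));
    # with fewer GPUs, stripe the movies across the available devices.
    work = [(gpu, movies[gpu::n_gpus]) for gpu in range(n_gpus)]
    with ThreadPool(n_gpus) as pool:  # one worker thread per GPU
        pool.map(correct_on_gpu, work)

if __name__ == "__main__":
    for n in (1, 4, 8, 16):
        print(f"{n:2d} GPUs need roughly {required_read_bandwidth_mb(n):.0f} MB/s of reads")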
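
Slides 11 to 16 centre on Relion2's GPU-accelerated 2D classification. As a rough sketch of launching such a run from a script, the example below assembles a relion_refine_mpi command line with GPU support enabled; the parameter values (class count, iterations, particle diameter, MPI/thread/GPU layout) are illustrative assumptions rather than the settings benchmarked on M3.

import subprocess

def run_class2d(particles_star, out_dir, n_mpi=5, n_threads=4, gpu_ids=""):
    """Launch a GPU-accelerated Relion2 Class2D job. Parameter values are
    illustrative assumptions, not the M3 benchmark settings."""
    cmd = [
        "mpirun", "-n", str(n_mpi), "relion_refine_mpi",
        "--i", particles_star,
        "--o", out_dir,
        "--K", "100",                  # number of 2D classes (assumption)
        "--iter", "25",
        "--particle_diameter", "200",  # in Angstroms (assumption)
        "--ctf", "--zero_mask", "--flatten_solvent",
        "--dont_combine_weights_via_disc",
        "--pool", "100",
        "--j", str(n_threads),
        "--gpu", gpu_ids,              # "" lets Relion use all visible GPUs
    ]
    subprocess.run(cmd, check=True)

if __name__ == "__main__":
    run_class2d("Particles/particles.star", "Class2D/run1/")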
