SlideShare a Scribd company logo
NGS Data Hardware Requirements 
© 2014 Knome, Inc.! 
and Considerations! 
Presenter: Michael J. McManus, PhD, SVP of Operations! 
Date: September 26, 2014!
© 2014 Knome, Inc.! 
Questions! 
If you have any questions during the 
webinar, please enter them in the 
GoToWebinar pane. 
We will answer as many as possible 
at the end.
© 2014 Knome, Inc.! 
[Poll]!
! ? 
© 2014 Knome, Inc.! 
During this webinar we will discuss 
four questions: 
" 
1. Why purchase hardware when you can 
process NGS data on the cloud?! 
2. What sort of hardware should be 
considered?! 
3. What hardware specifications are 
needed for conducting align + call 
versus interpretation?! 
4. How do I compare systems apples-to-apples?
Align!Call!Annotate!Filter!Classify!Report! 
© 2014 Knome, Inc.! 
NGS informatics and interpretation infrastructure! 
Flexible, fast 
bioinformatics 
2 
Comprehensive, 
customizable 
annotation 
3 
Indication-specific 
filtering, prioritization, and 
interpretation 
Bioinformaticians & 
Technologists 
Geneticists, Clinicians, & 
Genetic Counselors 
1
© 2014 Knome, Inc.! 
Why internal vs. using the cloud? ! 
§ Knome’s customers have 
expressed a strong 
preference for an internally 
installed solution over a 
cloud solution. Why? ! 
! 
§ Three reasons:! 
1. Security! 
2. Software Version Control! 
3. File Transfer Time!
! ? 
© 2014 Knome, Inc.! 
During this webinar we will discuss 
four questions: 
" 
1. Why purchase hardware when you can 
process NGS data on the cloud?! 
2. What sort of hardware should be 
considered?! 
3. What hardware specifications are 
needed for conducting align + call 
versus interpretation?! 
4. How do I compare systems apples-to-apples?
© 2014 Knome, Inc.! 
What type of hardware should be considered?! 
§ To process NGS data you need to understand many 
issues:!
© 2014 Knome, Inc.! 
Elements for NGS informatics ! 
Five elements must be balanced:" 
1. Compute! 
• Multiple nodes! 
• Grid Computing! 
! 
2. Database! 
! 
3. Storage! 
• Shared File System! 
4. Networks! 
• Storage! 
• Communications! 
• File upload/download! 
! 
5. Software! 
• Operating System! 
• Virtualization! 
• Open Source Tools! 
• Web Server!
© 2014 Knome, Inc.! 
knoSYS state diagram - node view! 
Application node" 
Grid node" 
Database node" 
File System Manager" 
Data 
nodes"
© 2014 Knome, Inc.! 
Shared File System! 
§ All files are stored in one place, not on separate nodes! 
§ Failure tolerance is a requirement! 
– RAID 6 protection is required ! 
– A minimum of 2 drive failures should be tolerated! 
– One “hot spare” should be provided per array! 
– Good array reliability rates (>90%)! 
§ Performance is a key need! 
– A file system that supports “striping” files across the storage array is 
desired! 
– A file system that gets faster as more disks are added to the storage array.! 
– A minimum of a 1 Gigabyte per second of sustained I/O rate!
! ? 
© 2014 Knome, Inc.! 
During this webinar we will discuss 
four questions: 
" 
1. Why purchase hardware when you can 
process NGS data on the cloud?! 
2. What sort of hardware should be 
considered?! 
3. What hardware specifications are 
needed for conducting align + call 
versus interpretation?! 
4. How do I compare systems apples-to-apples?
© 2014 Knome, Inc.! 
What hardware is needed for align/call vs. interpretation? ! 
§ Aligning & Calling:" 
– Aligning starts with a FASTQ, produces a BAM! 
– Calling takes the BAM and produces a VCF! 
– These processes require large amounts of RAM, disk 
space, and CPU cores! 
§ Interpretation:" 
– Starts with a VCF file! 
– The annotation and interpretation processes also benefit 
from ample amounts of RAM, disk space, and CPU 
cores, but can be done with far less. !
© 2014 Knome, Inc.! 
The knoSYS® system overview! 
§ End-to-end: reads to report! 
§ Flexible, fast, secure! 
§ Supports a multi-disciplinary 
team! 
§ Ideal for translational and 
clinical labs! 
§ Multiple configuration 
options ! 
k100
© 2014 Knome, Inc.! 
k100 model – for align/call, whole genomes! 
§ The knoSYS k100 model will 
efficiently process large numbers of 
whole genomes and exomes. !
© 2014 Knome, Inc.! 
k25 model – for interpretation! 
§ The knoSYS k25 model is designed to efficiently process panels, as well 
as smaller volumes of genomes and exomes.!
k100 Monthly Throughput" 
" FASTQ" VCF-Only" 
Sequence Type" Align/Call" Annotation" 
Genomes (37x)! 60! 1,440! 
Exomes (100x)! 270! 12,960! 
Panels (300x )! 3,600! 64,800! 
© 2014 Knome, Inc.! 
Specs and Throughput! 
k25 Specs" 
Server" 
# 
Nodes" 
CPU" 
# 
CPU" 
# 
Cores" 
RAM 
(GB)" 
Storage 
(TB)" 
1 GbE + 
card" 
10GbE 
card" 
IB" UPS" 
Compute" 1! E5-2640v2! 2! 16! 256! -! Yes! Yes! No! 
No! 
Database" -! -! -! -! -! -! -! -! -! 
Storage" -! -! -! -! -! 24! -! -! -! 
Total" 1" -" 2" 16" 256" 24" -" -" -" -" 
k25 Monthly Throughput" 
" FASTQ" VCF-Only" 
Sequence Type" Align/Call" Annotation" 
Genomes (37x)! 12! 360! 
Exomes (100x)! 54! 3,240! 
Panels (300x )! 720! 16,200! 
k100 Specs" 
Server" 
# 
Nodes" 
CPU" 
# 
CPU" 
# 
Cores" 
RAM 
(GB)" 
Storage 
(TB)" 
1 GbE + 
switch" 
10 GbE 
card" 
IB + 
switch" 
UPS" 
Compute" 4! E5-2560v2! 8! 64! 512! 16! 
Yes! 
Yes! 
Yes! Yes! 
Database" 1! E5-2640v2! 2! 16! 128! 4! No! 
Storage" 3! E5-2609! 3! 18! 48! 60! No! 
Total" 8" -" 13" 98" 688" 80" -" -" -" -"
Storage" Parity" 
© 2014 Knome, Inc.! 
Lustre® Shared File System for the k100! 
§ Two configurations:! 
– 60TB and 180TB! 
• 60 TB has 1 SSU! 
• 180TB has 1 SSU and 2 ESUs! 
§ Specs:! 
– RAID 6 configuration! 
– 20 x 4TB drives, plus 1 x 4TB hot spare ! 
for each SSU and each ESU! 
– Max I/O ! 
• 60TB array ≈ 2.5 GB/sec! 
• 180TB array ≈ 7.0 GB/sec! 
• Matches Infiniband peak I/O rate of 7GB/sec! 
– Array Reliability of 96.6%! 
knoSYS k100 ClusterStor 1+0 
TOTAL = 80TB / Usable = 60TB (4U) 
SSU 
0 
OST 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 
4TB 
OST 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 
knoSYS k100 ClusterStor 1+2 
TOTAL = 240TB / Usable = 180TB (12U) 
SSU 
0 
OST 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 
4TB 
OST 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 
ESU 
1 
OST 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 
4TB 
OST 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 
ESU 
2 
OST 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 
4TB 
OST 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB
© 2014 Knome, Inc.! 
RAID File System for the k25! 
§ One configuration! 
– 24TB usable / 32TB raw! 
§ Specs:! 
– RAID 6 configuration! 
– 8 x 4TB drives! 
• 6 x 4TB drives for storage! 
– Max I/O ! 
• ≈ 900MB/sec! 
– Array Reliability of 94.3%! 
4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 
Storage" Parity"
! ? 
© 2014 Knome, Inc.! 
During this webinar we will discuss 
four questions: 
" 
1. Why purchase hardware when you can 
process NGS data on the cloud?! 
2. What sort of hardware should be 
considered?! 
3. What hardware specifications are 
needed for conducting align + call 
versus interpretation?! 
4. How do I compare systems apples-to-apples?
© 2014 Knome, Inc.! 
How do I compare systems apples-to-apples?! 
§ All hardware sounds 
similar, but the benefit of 
the Knome solution is in:! 
! 
1. The unique combination 
of the various hardware 
elements! 
! 
2. The price-performance 
that Knome provides for 
its solution! 
! 
§ 5 Elements:! 
! 
– Compute! 
– Database! 
– Storage! 
– Network! 
– Software!
• Switch to manage and direct storage traffic" 
• Switch to manage and direct network traffic" 
• RDMS for managing storage of projects, sequences, etc. 
PostgreSQL running on Lustre FS" 
• Expanded Storage Unit (ESU) to add more capacity. Can 
use 2TB, 3TB or 4TB drives. " 
• Expanded Storage Unit (ESU) to add more capacity. 
© 2014 Knome, Inc.! 
knoSYS architecture – hardware! 
QDR/FDR Infiniband Switch" 
Gigabit Ethernet Switch" 
Database Server" 
ClusterStor Management Unit" 
Scalable Storage unit" 
30TB or 60TB usable" 
Expanded Storage Unit 1" 
30TB or 60TB usable" 
Back-up Power Supply" 
N 
E 
T 
W 
O 
R 
K 
" 
T 
R 
A 
F 
F 
I" 
C 
S 
T 
O 
R 
A 
G 
E 
" 
T 
R 
A 
F 
F 
I" 
C 
High Performance Computing Server" 
Expanded Storage Unit 2" 
30TB or 60TB usable" 
• GRID NODES (3) to align, call, annotate, compare 
genomes, exomes, and panels" 
• APPLICATION NODE (1) for web-based GUI" 
• ClusterStor Management Unit – Houses Metadata Server 
(MDS) and Management Server (MGS)" 
• Scalable Storage Unit (SSU) for a SHARED FILE SYSTEM 
for storage of genomes, exomes, panels; projects, 
analyses, etc." 
Can use 2TB, 3TB or 4TB drives " 
• BACK-UP POWER in case of power failure" 
• CONDITIONS incoming power to prevent spikes/dips"
© 2014 Knome, Inc.! 
knoSYS elements for NGS informatics - solution! 
Component" Model k100" Model k25" 
Compute and Database" 
Compute nodes ! 
4 physical nodes, (3 compute nodes, ! 
1 application node)! 
1 physical node with 3 virtual nodes ! 
(2 compute nodes, 1 application node)! 
Grid Computing! Open Grid Engine / Open Grid Scheduler! 
Database ! PostgreSQL node (physical)! PostgreSQL node (virtual)! 
Storage" 
Shared File System! Lustre! RAID 6 disk array! 
Network 
Storage Network! QDR/FDR Infiniband ! No network, uses SAS! 
Communications 
1Gb/s Ethernet for server-to-server communication! 
Network! 
10Gb/s Ethernet for file uploading and downloading! 
Software" 
Web Server! Tomcat (server-side), Java and Chrome (client-side)! 
Operating System! CentOS 6.3 or higher! 
Virtualization! N/A! VMWare vSphere ESXi! 
Open Source Tools! Many open source tools!
© 2014 Knome, Inc.! 
Conclusions! 
§ The cloud has great potential, but for 
today’s genomics needs, the focus is on an 
in-house solution! 
§ There is more to the decision than 
hardware alone. You need to consider the 
hardware and software when making your 
decision! 
§ There are many questions to be answered 
before you can decide on your hardware 
purchase! 
§ Hardware is fairly similar, but there are 
methods to combine hardware elements to 
maximize performance, but at a reasonable 
price. ! 
hardware 
? k100
© 2014 Knome, Inc.! 
What’s Next?! 
§ A recording 
of this 
webinar and 
the slides 
will be 
available on 
our website 
on Monday.! 
www.knome.com 
twitter.com/knome 
info@knome.com 
facebook.com/knomeinc 
linkedin.com/company/knome-inc 
617-715-1000
© 2014 Knome, Inc.! 
Questions! 
If you have any questions during the 
webinar, please enter them in the 
GoToWebinar pane. 
We will answer as many as possible 
at the end.

More Related Content

What's hot

Azure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Azure VM 101 - HomeGen by CloudGen Verona - Marco ObinuAzure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Azure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Marco Obinu
 
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Community
 
Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce
Ceph Community
 
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache TieringCeph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Community
 
Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage
Ceph Community
 
2016-JAN-28 -- High Performance Production Databases on Ceph
2016-JAN-28 -- High Performance Production Databases on Ceph2016-JAN-28 -- High Performance Production Databases on Ceph
2016-JAN-28 -- High Performance Production Databases on Ceph
Ceph Community
 
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph clusterCeph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Community
 
Walk Through a Software Defined Everything PoC
Walk Through a Software Defined Everything PoCWalk Through a Software Defined Everything PoC
Walk Through a Software Defined Everything PoC
Ceph Community
 
Aerospike DB and Storm for real-time analytics
Aerospike DB and Storm for real-time analyticsAerospike DB and Storm for real-time analytics
Aerospike DB and Storm for real-time analytics
Aerospike
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Ceph Community
 
Ceph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephCeph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for Ceph
Danielle Womboldt
 
Ceph Day Seoul - Ceph: a decade in the making and still going strong
Ceph Day Seoul - Ceph: a decade in the making and still going strong Ceph Day Seoul - Ceph: a decade in the making and still going strong
Ceph Day Seoul - Ceph: a decade in the making and still going strong
Ceph Community
 
Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale
Ceph Community
 
FDW-based Sharding Update and Future
FDW-based Sharding Update and FutureFDW-based Sharding Update and Future
FDW-based Sharding Update and Future
Masahiko Sawada
 
How to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
How to Get a Game Changing Performance Advantage with Intel SSDs and AerospikeHow to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
How to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
Aerospike, Inc.
 
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Danielle Womboldt
 
Ceph Day Taipei - Delivering cost-effective, high performance, Ceph cluster
Ceph Day Taipei - Delivering cost-effective, high performance, Ceph cluster Ceph Day Taipei - Delivering cost-effective, high performance, Ceph cluster
Ceph Day Taipei - Delivering cost-effective, high performance, Ceph cluster
Ceph Community
 
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Masahiko Sawada
 
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Danielle Womboldt
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community
 

What's hot (20)

Azure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Azure VM 101 - HomeGen by CloudGen Verona - Marco ObinuAzure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Azure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
 
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
 
Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce
 
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache TieringCeph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
 
Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage
 
2016-JAN-28 -- High Performance Production Databases on Ceph
2016-JAN-28 -- High Performance Production Databases on Ceph2016-JAN-28 -- High Performance Production Databases on Ceph
2016-JAN-28 -- High Performance Production Databases on Ceph
 
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph clusterCeph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
 
Walk Through a Software Defined Everything PoC
Walk Through a Software Defined Everything PoCWalk Through a Software Defined Everything PoC
Walk Through a Software Defined Everything PoC
 
Aerospike DB and Storm for real-time analytics
Aerospike DB and Storm for real-time analyticsAerospike DB and Storm for real-time analytics
Aerospike DB and Storm for real-time analytics
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
 
Ceph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephCeph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for Ceph
 
Ceph Day Seoul - Ceph: a decade in the making and still going strong
Ceph Day Seoul - Ceph: a decade in the making and still going strong Ceph Day Seoul - Ceph: a decade in the making and still going strong
Ceph Day Seoul - Ceph: a decade in the making and still going strong
 
Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale Ceph: Low Fail Go Scale
Ceph: Low Fail Go Scale
 
FDW-based Sharding Update and Future
FDW-based Sharding Update and FutureFDW-based Sharding Update and Future
FDW-based Sharding Update and Future
 
How to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
How to Get a Game Changing Performance Advantage with Intel SSDs and AerospikeHow to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
How to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
 
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
 
Ceph Day Taipei - Delivering cost-effective, high performance, Ceph cluster
Ceph Day Taipei - Delivering cost-effective, high performance, Ceph cluster Ceph Day Taipei - Delivering cost-effective, high performance, Ceph cluster
Ceph Day Taipei - Delivering cost-effective, high performance, Ceph cluster
 
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
 
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
 

Viewers also liked

Part 4 of 'Introduction to Linux for bioinformatics': Managing data
Part 4 of 'Introduction to Linux for bioinformatics': Managing data Part 4 of 'Introduction to Linux for bioinformatics': Managing data
Part 4 of 'Introduction to Linux for bioinformatics': Managing data
Joachim Jacob
 
Evolutionary arguments in medical genomics
Evolutionary arguments in medical genomicsEvolutionary arguments in medical genomics
Evolutionary arguments in medical genomics
Nikita Khromov-Borisov
 
2014 Wellcome Trust Advances Course: NGS Course - Lecture2
2014 Wellcome Trust Advances Course: NGS Course - Lecture22014 Wellcome Trust Advances Course: NGS Course - Lecture2
2014 Wellcome Trust Advances Course: NGS Course - Lecture2
Thomas Keane
 
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
Candy Smellie
 
Detection of heterogeneous flt3 itd mutant variants in
Detection of heterogeneous flt3  itd mutant variants inDetection of heterogeneous flt3  itd mutant variants in
Detection of heterogeneous flt3 itd mutant variants in
kamalmodi481
 
Managing multiple projects
Managing multiple projectsManaging multiple projects
Managing multiple projects
Project Management Solutions
 
Korte handleiding van de Partago app
Korte handleiding van de Partago appKorte handleiding van de Partago app
Korte handleiding van de Partago app
Joachim Jacob
 
Part 4 of RNA-seq for DE analysis: Extracting count table and QC
Part 4 of RNA-seq for DE analysis: Extracting count table and QCPart 4 of RNA-seq for DE analysis: Extracting count table and QC
Part 4 of RNA-seq for DE analysis: Extracting count table and QC
Joachim Jacob
 
Big Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowBig Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey Nislow
Knome_Inc
 
Ngs intro_v6_public
 Ngs intro_v6_public Ngs intro_v6_public
Ngs intro_v6_public
François PAILLIER
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
VHIR Vall d’Hebron Institut de Recerca
 

Viewers also liked (11)

Part 4 of 'Introduction to Linux for bioinformatics': Managing data
Part 4 of 'Introduction to Linux for bioinformatics': Managing data Part 4 of 'Introduction to Linux for bioinformatics': Managing data
Part 4 of 'Introduction to Linux for bioinformatics': Managing data
 
Evolutionary arguments in medical genomics
Evolutionary arguments in medical genomicsEvolutionary arguments in medical genomics
Evolutionary arguments in medical genomics
 
2014 Wellcome Trust Advances Course: NGS Course - Lecture2
2014 Wellcome Trust Advances Course: NGS Course - Lecture22014 Wellcome Trust Advances Course: NGS Course - Lecture2
2014 Wellcome Trust Advances Course: NGS Course - Lecture2
 
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
HDx™ Reference Standards and Reference Materials for Next Generation Sequenci...
 
Detection of heterogeneous flt3 itd mutant variants in
Detection of heterogeneous flt3  itd mutant variants inDetection of heterogeneous flt3  itd mutant variants in
Detection of heterogeneous flt3 itd mutant variants in
 
Managing multiple projects
Managing multiple projectsManaging multiple projects
Managing multiple projects
 
Korte handleiding van de Partago app
Korte handleiding van de Partago appKorte handleiding van de Partago app
Korte handleiding van de Partago app
 
Part 4 of RNA-seq for DE analysis: Extracting count table and QC
Part 4 of RNA-seq for DE analysis: Extracting count table and QCPart 4 of RNA-seq for DE analysis: Extracting count table and QC
Part 4 of RNA-seq for DE analysis: Extracting count table and QC
 
Big Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowBig Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey Nislow
 
Ngs intro_v6_public
 Ngs intro_v6_public Ngs intro_v6_public
Ngs intro_v6_public
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 

Similar to NGS Informatics and Interpretation - Hardware Considerations by Michael McManus

Introduction to Cassandra and CQL for Java developers
Introduction to Cassandra and CQL for Java developersIntroduction to Cassandra and CQL for Java developers
Introduction to Cassandra and CQL for Java developers
Julien Anguenot
 
Ceph Day Seoul - Ceph on All-Flash Storage
Ceph Day Seoul - Ceph on All-Flash Storage Ceph Day Seoul - Ceph on All-Flash Storage
Ceph Day Seoul - Ceph on All-Flash Storage
Ceph Community
 
Ceph Day Taipei - Ceph on All-Flash Storage
Ceph Day Taipei - Ceph on All-Flash Storage Ceph Day Taipei - Ceph on All-Flash Storage
Ceph Day Taipei - Ceph on All-Flash Storage
Ceph Community
 
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Severalnines
 
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Odinot Stanislas
 
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Community
 
Elasticsearch Arcihtecture & What's New in Version 5
Elasticsearch Arcihtecture & What's New in Version 5Elasticsearch Arcihtecture & What's New in Version 5
Elasticsearch Arcihtecture & What's New in Version 5
Burak TUNGUT
 
Evoluzione dello storage
Evoluzione dello storageEvoluzione dello storage
Evoluzione dello storage
Andrea Mauro
 
TDS-16489U-R2 0215 EN
TDS-16489U-R2 0215 ENTDS-16489U-R2 0215 EN
TDS-16489U-R2 0215 EN
QNAP Systems, Inc.
 
Delivering Apache Hadoop for the Modern Data Architecture
Delivering Apache Hadoop for the Modern Data Architecture Delivering Apache Hadoop for the Modern Data Architecture
Delivering Apache Hadoop for the Modern Data Architecture
Hortonworks
 
Running BSD on AWS
Running BSD on AWSRunning BSD on AWS
Running BSD on AWS
Julien SIMON
 
CloudOverviewAWS.pptx
CloudOverviewAWS.pptxCloudOverviewAWS.pptx
CloudOverviewAWS.pptx
ssuser73fa361
 
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
Glenn K. Lockwood
 
Performance analysis with_ceph
Performance analysis with_cephPerformance analysis with_ceph
Performance analysis with_ceph
Alex Lau
 
Ceph
CephCeph
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Odinot Stanislas
 
Challenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsChallenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data Genomics
Yasin Memari
 
Flexible compute
Flexible computeFlexible compute
Flexible compute
Peter Clapham
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticians
Peter Clapham
 
Empower Data-Driven Organizations
Empower Data-Driven OrganizationsEmpower Data-Driven Organizations
Empower Data-Driven Organizations
DataWorks Summit/Hadoop Summit
 

Similar to NGS Informatics and Interpretation - Hardware Considerations by Michael McManus (20)

Introduction to Cassandra and CQL for Java developers
Introduction to Cassandra and CQL for Java developersIntroduction to Cassandra and CQL for Java developers
Introduction to Cassandra and CQL for Java developers
 
Ceph Day Seoul - Ceph on All-Flash Storage
Ceph Day Seoul - Ceph on All-Flash Storage Ceph Day Seoul - Ceph on All-Flash Storage
Ceph Day Seoul - Ceph on All-Flash Storage
 
Ceph Day Taipei - Ceph on All-Flash Storage
Ceph Day Taipei - Ceph on All-Flash Storage Ceph Day Taipei - Ceph on All-Flash Storage
Ceph Day Taipei - Ceph on All-Flash Storage
 
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
 
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
 
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
 
Elasticsearch Arcihtecture & What's New in Version 5
Elasticsearch Arcihtecture & What's New in Version 5Elasticsearch Arcihtecture & What's New in Version 5
Elasticsearch Arcihtecture & What's New in Version 5
 
Evoluzione dello storage
Evoluzione dello storageEvoluzione dello storage
Evoluzione dello storage
 
TDS-16489U-R2 0215 EN
TDS-16489U-R2 0215 ENTDS-16489U-R2 0215 EN
TDS-16489U-R2 0215 EN
 
Delivering Apache Hadoop for the Modern Data Architecture
Delivering Apache Hadoop for the Modern Data Architecture Delivering Apache Hadoop for the Modern Data Architecture
Delivering Apache Hadoop for the Modern Data Architecture
 
Running BSD on AWS
Running BSD on AWSRunning BSD on AWS
Running BSD on AWS
 
CloudOverviewAWS.pptx
CloudOverviewAWS.pptxCloudOverviewAWS.pptx
CloudOverviewAWS.pptx
 
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
 
Performance analysis with_ceph
Performance analysis with_cephPerformance analysis with_ceph
Performance analysis with_ceph
 
Ceph
CephCeph
Ceph
 
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
 
Challenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsChallenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data Genomics
 
Flexible compute
Flexible computeFlexible compute
Flexible compute
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticians
 
Empower Data-Driven Organizations
Empower Data-Driven OrganizationsEmpower Data-Driven Organizations
Empower Data-Driven Organizations
 

Recently uploaded

加急办理美国南加州大学毕业证文凭毕业证原版一模一样
加急办理美国南加州大学毕业证文凭毕业证原版一模一样加急办理美国南加州大学毕业证文凭毕业证原版一模一样
加急办理美国南加州大学毕业证文凭毕业证原版一模一样
u0g33km
 
按照学校原版(Birmingham文凭证书)伯明翰大学|学院毕业证快速办理
按照学校原版(Birmingham文凭证书)伯明翰大学|学院毕业证快速办理按照学校原版(Birmingham文凭证书)伯明翰大学|学院毕业证快速办理
按照学校原版(Birmingham文凭证书)伯明翰大学|学院毕业证快速办理
6oo02s6l
 
LORRAINE ANDREI_LEQUIGAN_GOOGLE CALENDAR
LORRAINE ANDREI_LEQUIGAN_GOOGLE CALENDARLORRAINE ANDREI_LEQUIGAN_GOOGLE CALENDAR
LORRAINE ANDREI_LEQUIGAN_GOOGLE CALENDAR
lorraineandreiamcidl
 
按照学校原版(SUT文凭证书)斯威本科技大学毕业证快速办理
按照学校原版(SUT文凭证书)斯威本科技大学毕业证快速办理按照学校原版(SUT文凭证书)斯威本科技大学毕业证快速办理
按照学校原版(SUT文凭证书)斯威本科技大学毕业证快速办理
1jtj7yul
 
按照学校原版(Greenwich文凭证书)格林威治大学毕业证快速办理
按照学校原版(Greenwich文凭证书)格林威治大学毕业证快速办理按照学校原版(Greenwich文凭证书)格林威治大学毕业证快速办理
按照学校原版(Greenwich文凭证书)格林威治大学毕业证快速办理
yizxn4sx
 
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
nudduv
 
一比一原版(UOL文凭证书)利物浦大学毕业证如何办理
一比一原版(UOL文凭证书)利物浦大学毕业证如何办理一比一原版(UOL文凭证书)利物浦大学毕业证如何办理
一比一原版(UOL文凭证书)利物浦大学毕业证如何办理
eydeofo
 
一比一原版(Monash文凭证书)莫纳什大学毕业证如何办理
一比一原版(Monash文凭证书)莫纳什大学毕业证如何办理一比一原版(Monash文凭证书)莫纳什大学毕业证如何办理
一比一原版(Monash文凭证书)莫纳什大学毕业证如何办理
xuqdabu
 
按照学校原版(UOL文凭证书)利物浦大学毕业证快速办理
按照学校原版(UOL文凭证书)利物浦大学毕业证快速办理按照学校原版(UOL文凭证书)利物浦大学毕业证快速办理
按照学校原版(UOL文凭证书)利物浦大学毕业证快速办理
terpt4iu
 
按照学校原版(USD文凭证书)圣地亚哥大学毕业证快速办理
按照学校原版(USD文凭证书)圣地亚哥大学毕业证快速办理按照学校原版(USD文凭证书)圣地亚哥大学毕业证快速办理
按照学校原版(USD文凭证书)圣地亚哥大学毕业证快速办理
snfdnzl7
 
按照学校原版(UST文凭证书)圣托马斯大学毕业证快速办理
按照学校原版(UST文凭证书)圣托马斯大学毕业证快速办理按照学校原版(UST文凭证书)圣托马斯大学毕业证快速办理
按照学校原版(UST文凭证书)圣托马斯大学毕业证快速办理
zpc0z12
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
aozcue
 
Production.pptxd dddddddddddddddddddddddddddddddddd
Production.pptxd ddddddddddddddddddddddddddddddddddProduction.pptxd dddddddddddddddddddddddddddddddddd
Production.pptxd dddddddddddddddddddddddddddddddddd
DanielOliver74
 
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
xuqdabu
 
一比一原版(ANU文凭证书)澳大利亚国立大学毕业证如何办理
一比一原版(ANU文凭证书)澳大利亚国立大学毕业证如何办理一比一原版(ANU文凭证书)澳大利亚国立大学毕业证如何办理
一比一原版(ANU文凭证书)澳大利亚国立大学毕业证如何办理
nudduv
 
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
xuqdabu
 
按照学校原版(AU文凭证书)英国阿伯丁大学毕业证快速办理
按照学校原版(AU文凭证书)英国阿伯丁大学毕业证快速办理按照学校原版(AU文凭证书)英国阿伯丁大学毕业证快速办理
按照学校原版(AU文凭证书)英国阿伯丁大学毕业证快速办理
ei8c4cba
 
按照学校原版(QU文凭证书)皇后大学毕业证快速办理
按照学校原版(QU文凭证书)皇后大学毕业证快速办理按照学校原版(QU文凭证书)皇后大学毕业证快速办理
按照学校原版(QU文凭证书)皇后大学毕业证快速办理
8db3cz8x
 
一比一原版(KCL文凭证书)伦敦国王学院毕业证如何办理
一比一原版(KCL文凭证书)伦敦国王学院毕业证如何办理一比一原版(KCL文凭证书)伦敦国王学院毕业证如何办理
一比一原版(KCL文凭证书)伦敦国王学院毕业证如何办理
kuehcub
 
按照学校原版(Adelaide文凭证书)阿德莱德大学毕业证快速办理
按照学校原版(Adelaide文凭证书)阿德莱德大学毕业证快速办理按照学校原版(Adelaide文凭证书)阿德莱德大学毕业证快速办理
按照学校原版(Adelaide文凭证书)阿德莱德大学毕业证快速办理
terpt4iu
 

Recently uploaded (20)

加急办理美国南加州大学毕业证文凭毕业证原版一模一样
加急办理美国南加州大学毕业证文凭毕业证原版一模一样加急办理美国南加州大学毕业证文凭毕业证原版一模一样
加急办理美国南加州大学毕业证文凭毕业证原版一模一样
 
按照学校原版(Birmingham文凭证书)伯明翰大学|学院毕业证快速办理
按照学校原版(Birmingham文凭证书)伯明翰大学|学院毕业证快速办理按照学校原版(Birmingham文凭证书)伯明翰大学|学院毕业证快速办理
按照学校原版(Birmingham文凭证书)伯明翰大学|学院毕业证快速办理
 
LORRAINE ANDREI_LEQUIGAN_GOOGLE CALENDAR
LORRAINE ANDREI_LEQUIGAN_GOOGLE CALENDARLORRAINE ANDREI_LEQUIGAN_GOOGLE CALENDAR
LORRAINE ANDREI_LEQUIGAN_GOOGLE CALENDAR
 
按照学校原版(SUT文凭证书)斯威本科技大学毕业证快速办理
按照学校原版(SUT文凭证书)斯威本科技大学毕业证快速办理按照学校原版(SUT文凭证书)斯威本科技大学毕业证快速办理
按照学校原版(SUT文凭证书)斯威本科技大学毕业证快速办理
 
按照学校原版(Greenwich文凭证书)格林威治大学毕业证快速办理
按照学校原版(Greenwich文凭证书)格林威治大学毕业证快速办理按照学校原版(Greenwich文凭证书)格林威治大学毕业证快速办理
按照学校原版(Greenwich文凭证书)格林威治大学毕业证快速办理
 
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
 
一比一原版(UOL文凭证书)利物浦大学毕业证如何办理
一比一原版(UOL文凭证书)利物浦大学毕业证如何办理一比一原版(UOL文凭证书)利物浦大学毕业证如何办理
一比一原版(UOL文凭证书)利物浦大学毕业证如何办理
 
一比一原版(Monash文凭证书)莫纳什大学毕业证如何办理
一比一原版(Monash文凭证书)莫纳什大学毕业证如何办理一比一原版(Monash文凭证书)莫纳什大学毕业证如何办理
一比一原版(Monash文凭证书)莫纳什大学毕业证如何办理
 
按照学校原版(UOL文凭证书)利物浦大学毕业证快速办理
按照学校原版(UOL文凭证书)利物浦大学毕业证快速办理按照学校原版(UOL文凭证书)利物浦大学毕业证快速办理
按照学校原版(UOL文凭证书)利物浦大学毕业证快速办理
 
按照学校原版(USD文凭证书)圣地亚哥大学毕业证快速办理
按照学校原版(USD文凭证书)圣地亚哥大学毕业证快速办理按照学校原版(USD文凭证书)圣地亚哥大学毕业证快速办理
按照学校原版(USD文凭证书)圣地亚哥大学毕业证快速办理
 
按照学校原版(UST文凭证书)圣托马斯大学毕业证快速办理
按照学校原版(UST文凭证书)圣托马斯大学毕业证快速办理按照学校原版(UST文凭证书)圣托马斯大学毕业证快速办理
按照学校原版(UST文凭证书)圣托马斯大学毕业证快速办理
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
 
Production.pptxd dddddddddddddddddddddddddddddddddd
Production.pptxd ddddddddddddddddddddddddddddddddddProduction.pptxd dddddddddddddddddddddddddddddddddd
Production.pptxd dddddddddddddddddddddddddddddddddd
 
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
 
一比一原版(ANU文凭证书)澳大利亚国立大学毕业证如何办理
一比一原版(ANU文凭证书)澳大利亚国立大学毕业证如何办理一比一原版(ANU文凭证书)澳大利亚国立大学毕业证如何办理
一比一原版(ANU文凭证书)澳大利亚国立大学毕业证如何办理
 
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide文凭证书)阿德莱德大学毕业证如何办理
 
按照学校原版(AU文凭证书)英国阿伯丁大学毕业证快速办理
按照学校原版(AU文凭证书)英国阿伯丁大学毕业证快速办理按照学校原版(AU文凭证书)英国阿伯丁大学毕业证快速办理
按照学校原版(AU文凭证书)英国阿伯丁大学毕业证快速办理
 
按照学校原版(QU文凭证书)皇后大学毕业证快速办理
按照学校原版(QU文凭证书)皇后大学毕业证快速办理按照学校原版(QU文凭证书)皇后大学毕业证快速办理
按照学校原版(QU文凭证书)皇后大学毕业证快速办理
 
一比一原版(KCL文凭证书)伦敦国王学院毕业证如何办理
一比一原版(KCL文凭证书)伦敦国王学院毕业证如何办理一比一原版(KCL文凭证书)伦敦国王学院毕业证如何办理
一比一原版(KCL文凭证书)伦敦国王学院毕业证如何办理
 
按照学校原版(Adelaide文凭证书)阿德莱德大学毕业证快速办理
按照学校原版(Adelaide文凭证书)阿德莱德大学毕业证快速办理按照学校原版(Adelaide文凭证书)阿德莱德大学毕业证快速办理
按照学校原版(Adelaide文凭证书)阿德莱德大学毕业证快速办理
 

NGS Informatics and Interpretation - Hardware Considerations by Michael McManus

  • 1. NGS Data Hardware Requirements © 2014 Knome, Inc.! and Considerations! Presenter: Michael J. McManus, PhD, SVP of Operations! Date: September 26, 2014!
  • 2. © 2014 Knome, Inc.! Questions! If you have any questions during the webinar, please enter them in the GoToWebinar pane. We will answer as many as possible at the end.
  • 3. © 2014 Knome, Inc.! [Poll]!
  • 4. ! ? © 2014 Knome, Inc.! During this webinar we will discuss four questions: " 1. Why purchase hardware when you can process NGS data on the cloud?! 2. What sort of hardware should be considered?! 3. What hardware specifications are needed for conducting align + call versus interpretation?! 4. How do I compare systems apples-to-apples?
  • 5. Align!Call!Annotate!Filter!Classify!Report! © 2014 Knome, Inc.! NGS informatics and interpretation infrastructure! Flexible, fast bioinformatics 2 Comprehensive, customizable annotation 3 Indication-specific filtering, prioritization, and interpretation Bioinformaticians & Technologists Geneticists, Clinicians, & Genetic Counselors 1
  • 6. © 2014 Knome, Inc.! Why internal vs. using the cloud? ! § Knome’s customers have expressed a strong preference for an internally installed solution over a cloud solution. Why? ! ! § Three reasons:! 1. Security! 2. Software Version Control! 3. File Transfer Time!
  • 7. ! ? © 2014 Knome, Inc.! During this webinar we will discuss four questions: " 1. Why purchase hardware when you can process NGS data on the cloud?! 2. What sort of hardware should be considered?! 3. What hardware specifications are needed for conducting align + call versus interpretation?! 4. How do I compare systems apples-to-apples?
  • 8. © 2014 Knome, Inc.! What type of hardware should be considered?! § To process NGS data you need to understand many issues:!
  • 9. © 2014 Knome, Inc.! Elements for NGS informatics ! Five elements must be balanced:" 1. Compute! • Multiple nodes! • Grid Computing! ! 2. Database! ! 3. Storage! • Shared File System! 4. Networks! • Storage! • Communications! • File upload/download! ! 5. Software! • Operating System! • Virtualization! • Open Source Tools! • Web Server!
  • 10. © 2014 Knome, Inc.! knoSYS state diagram - node view! Application node" Grid node" Database node" File System Manager" Data nodes"
  • 11. © 2014 Knome, Inc.! Shared File System! § All files are stored in one place, not on separate nodes! § Failure tolerance is a requirement! – RAID 6 protection is required ! – A minimum of 2 drive failures should be tolerated! – One “hot spare” should be provided per array! – Good array reliability rates (>90%)! § Performance is a key need! – A file system that supports “striping” files across the storage array is desired! – A file system that gets faster as more disks are added to the storage array.! – A minimum of a 1 Gigabyte per second of sustained I/O rate!
  • 12. ! ? © 2014 Knome, Inc.! During this webinar we will discuss four questions: " 1. Why purchase hardware when you can process NGS data on the cloud?! 2. What sort of hardware should be considered?! 3. What hardware specifications are needed for conducting align + call versus interpretation?! 4. How do I compare systems apples-to-apples?
  • 13. © 2014 Knome, Inc.! What hardware is needed for align/call vs. interpretation? ! § Aligning & Calling:" – Aligning starts with a FASTQ, produces a BAM! – Calling takes the BAM and produces a VCF! – These processes require large amounts of RAM, disk space, and CPU cores! § Interpretation:" – Starts with a VCF file! – The annotation and interpretation processes also benefit from ample amounts of RAM, disk space, and CPU cores, but can be done with far less. !
  • 14. © 2014 Knome, Inc.! The knoSYS® system overview! § End-to-end: reads to report! § Flexible, fast, secure! § Supports a multi-disciplinary team! § Ideal for translational and clinical labs! § Multiple configuration options ! k100
  • 15. © 2014 Knome, Inc.! k100 model – for align/call, whole genomes! § The knoSYS k100 model will efficiently process large numbers of whole genomes and exomes. !
  • 16. © 2014 Knome, Inc.! k25 model – for interpretation! § The knoSYS k25 model is designed to efficiently process panels, as well as smaller volumes of genomes and exomes.!
  • 17. k100 Monthly Throughput" " FASTQ" VCF-Only" Sequence Type" Align/Call" Annotation" Genomes (37x)! 60! 1,440! Exomes (100x)! 270! 12,960! Panels (300x )! 3,600! 64,800! © 2014 Knome, Inc.! Specs and Throughput! k25 Specs" Server" # Nodes" CPU" # CPU" # Cores" RAM (GB)" Storage (TB)" 1 GbE + card" 10GbE card" IB" UPS" Compute" 1! E5-2640v2! 2! 16! 256! -! Yes! Yes! No! No! Database" -! -! -! -! -! -! -! -! -! Storage" -! -! -! -! -! 24! -! -! -! Total" 1" -" 2" 16" 256" 24" -" -" -" -" k25 Monthly Throughput" " FASTQ" VCF-Only" Sequence Type" Align/Call" Annotation" Genomes (37x)! 12! 360! Exomes (100x)! 54! 3,240! Panels (300x )! 720! 16,200! k100 Specs" Server" # Nodes" CPU" # CPU" # Cores" RAM (GB)" Storage (TB)" 1 GbE + switch" 10 GbE card" IB + switch" UPS" Compute" 4! E5-2560v2! 8! 64! 512! 16! Yes! Yes! Yes! Yes! Database" 1! E5-2640v2! 2! 16! 128! 4! No! Storage" 3! E5-2609! 3! 18! 48! 60! No! Total" 8" -" 13" 98" 688" 80" -" -" -" -"
  • 18. Storage" Parity" © 2014 Knome, Inc.! Lustre® Shared File System for the k100! § Two configurations:! – 60TB and 180TB! • 60 TB has 1 SSU! • 180TB has 1 SSU and 2 ESUs! § Specs:! – RAID 6 configuration! – 20 x 4TB drives, plus 1 x 4TB hot spare ! for each SSU and each ESU! – Max I/O ! • 60TB array ≈ 2.5 GB/sec! • 180TB array ≈ 7.0 GB/sec! • Matches Infiniband peak I/O rate of 7GB/sec! – Array Reliability of 96.6%! knoSYS k100 ClusterStor 1+0 TOTAL = 80TB / Usable = 60TB (4U) SSU 0 OST 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB OST 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB knoSYS k100 ClusterStor 1+2 TOTAL = 240TB / Usable = 180TB (12U) SSU 0 OST 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB OST 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB ESU 1 OST 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB OST 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB ESU 2 OST 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB OST 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB
  • 19. © 2014 Knome, Inc.! RAID File System for the k25! § One configuration! – 24TB usable / 32TB raw! § Specs:! – RAID 6 configuration! – 8 x 4TB drives! • 6 x 4TB drives for storage! – Max I/O ! • ≈ 900MB/sec! – Array Reliability of 94.3%! 4TB 4TB 4TB 4TB 4TB 4TB 4TB 4TB Storage" Parity"
  • 20. ! ? © 2014 Knome, Inc.! During this webinar we will discuss four questions: " 1. Why purchase hardware when you can process NGS data on the cloud?! 2. What sort of hardware should be considered?! 3. What hardware specifications are needed for conducting align + call versus interpretation?! 4. How do I compare systems apples-to-apples?
  • 21. © 2014 Knome, Inc.! How do I compare systems apples-to-apples?! § All hardware sounds similar, but the benefit of the Knome solution is in:! ! 1. The unique combination of the various hardware elements! ! 2. The price-performance that Knome provides for its solution! ! § 5 Elements:! ! – Compute! – Database! – Storage! – Network! – Software!
  • 22. • Switch to manage and direct storage traffic" • Switch to manage and direct network traffic" • RDMS for managing storage of projects, sequences, etc. PostgreSQL running on Lustre FS" • Expanded Storage Unit (ESU) to add more capacity. Can use 2TB, 3TB or 4TB drives. " • Expanded Storage Unit (ESU) to add more capacity. © 2014 Knome, Inc.! knoSYS architecture – hardware! QDR/FDR Infiniband Switch" Gigabit Ethernet Switch" Database Server" ClusterStor Management Unit" Scalable Storage unit" 30TB or 60TB usable" Expanded Storage Unit 1" 30TB or 60TB usable" Back-up Power Supply" N E T W O R K " T R A F F I" C S T O R A G E " T R A F F I" C High Performance Computing Server" Expanded Storage Unit 2" 30TB or 60TB usable" • GRID NODES (3) to align, call, annotate, compare genomes, exomes, and panels" • APPLICATION NODE (1) for web-based GUI" • ClusterStor Management Unit – Houses Metadata Server (MDS) and Management Server (MGS)" • Scalable Storage Unit (SSU) for a SHARED FILE SYSTEM for storage of genomes, exomes, panels; projects, analyses, etc." Can use 2TB, 3TB or 4TB drives " • BACK-UP POWER in case of power failure" • CONDITIONS incoming power to prevent spikes/dips"
  • 23. © 2014 Knome, Inc.! knoSYS elements for NGS informatics - solution! Component" Model k100" Model k25" Compute and Database" Compute nodes ! 4 physical nodes, (3 compute nodes, ! 1 application node)! 1 physical node with 3 virtual nodes ! (2 compute nodes, 1 application node)! Grid Computing! Open Grid Engine / Open Grid Scheduler! Database ! PostgreSQL node (physical)! PostgreSQL node (virtual)! Storage" Shared File System! Lustre! RAID 6 disk array! Network Storage Network! QDR/FDR Infiniband ! No network, uses SAS! Communications 1Gb/s Ethernet for server-to-server communication! Network! 10Gb/s Ethernet for file uploading and downloading! Software" Web Server! Tomcat (server-side), Java and Chrome (client-side)! Operating System! CentOS 6.3 or higher! Virtualization! N/A! VMWare vSphere ESXi! Open Source Tools! Many open source tools!
  • 24. © 2014 Knome, Inc.! Conclusions! § The cloud has great potential, but for today’s genomics needs, the focus is on an in-house solution! § There is more to the decision than hardware alone. You need to consider the hardware and software when making your decision! § There are many questions to be answered before you can decide on your hardware purchase! § Hardware is fairly similar, but there are methods to combine hardware elements to maximize performance, but at a reasonable price. ! hardware ? k100
  • 25. © 2014 Knome, Inc.! What’s Next?! § A recording of this webinar and the slides will be available on our website on Monday.! www.knome.com twitter.com/knome info@knome.com facebook.com/knomeinc linkedin.com/company/knome-inc 617-715-1000
  • 26. © 2014 Knome, Inc.! Questions! If you have any questions during the webinar, please enter them in the GoToWebinar pane. We will answer as many as possible at the end.