SlideShare a Scribd company logo
1 of 30
TakingPrecision
Medicine
Mainstream
Ketan Paranjape
GM Life Sciences
Intel Corp.
1
www.intel.com/healthcare/bigdata
Health & Life Sciences at Intel
Where information and care meet
Big Data Analytics
in Health and Life Sciences
Today: Many disparate
data types, streams…
Future: Integrated
computing and data
2
Genomics
Clinical
Claims &
transaction
s
Meds &
labs
Patient
experience
Personal
data
Health & Life Sciences at Intel
Where information and care meet
Vision for Precision Medicine
1 Patient visit
2
Genes causing disease
and key pathways
identified
3
Gene targeted drugs
identified, Treatment
begins in earnest
Imaging, parallelism, statistical
work, accelerated algorithms
Various algorithms, big data, parallelism, accelerated algorithms,
statistical and math puzzles, in memory processing
+ imaging + ML + drug sensitivity
assays + mechanistic learning and
systems + correlation work and
knowledge systems
1-4 days MonthsWeeks
PRIMARY ANALYSIS SECONDARY ANALYSIS, DNA/RNA PIPELINE + MORE PRECISION MEDICINE
HC
HC
HC
HC
HC
HC
HC
HC
HC
HC
Multiple sample
compute starts here
Joint
genotyping
Variant
store
Pop/Dis
studyIndividual Multiple individual
Predicted Actionable Variants
Data Driven Association
Clinically Actionable Variants
Knowledge Dtabase
Clinical trail groups
Data curation
3
GOAL :: Precision Medicine in a Day by 2020 !!
$1K-$5K $10K+$5K-$10K
Health & Life Sciences at Intel
Where information and care meet
Life Sciences World Map
4
$11B Japan investments
$100M ETRI Korea
Ontario LS Investments
Genomics France
Beijing Genomics Institute (PRC)
$1.58B line of credit
Geisinger Health
100,000 patients
NY Genome Center (US)
$200M investments
Mt Sinai (US)
Genomics England
100,000 patients
Genomics Qatar
400,000 patients
Broad Institute (US) Wellcome Trust Sanger (UK)
Charité Germany
100,000 patients
Moffitt Health
100,000 patients
We are barely scratching the surface …
Health & Life Sciences at Intel
Where information and care meet
Challenges in Life Sciences
5
Big Data in Life
Sciences
• Sequencer advances – 4x data in 50% less time
.5TB/device/day
• 4D molecular imaging produces 2TB/device/day
• Fragmented software ecosystem, lots of open source
Burdens of Data
Management
• Store, manage, share, ingest and move PBs of research
& clinical data
• Need to reliably ‘snapshot’ pipelines with archive to tiered
storage
Innovation Drives
Change
• Rapid iteration of algorithms far outpace IT, requiring
flexibility, agility
• Most applications do not fully leverage available
infrastructure
Converged
Infrastructure
• Workloads converging between local and cloud-based
HPC/Big Data
• Advanced orchestration required to maximize throughput
& efficiency
At the Intersection
of Transformative
Forces
10
18 Enabling extreme-
scale computing on
massive data sets
Helping enterprises
build open,
interoperable clouds
Contributing code
and fostering the
ecosystem
Health & Life Sciences at Intel
Where information and care meet
Intel Partnerships and Ecosystem Enablement
to resolve challenges
Need for Balanced Compute Infrastructure
*Other names and brands may be claimed as the property of others.
Health & Life Sciences at Intel
Where information and care meet
Optimizing Top Applications and Pipelines
Intel working with industry experts worldwide
• Genomics, Molecular Dynamics
and Molecular Imaging
applications targeting both Intel®
Xeon® processors and Xeon® Phi™
coprocessors
• Fine- and coarse-grained
optimization at the node and cluster
level
• Work with code authors to release
optimizations, disseminate best
practices
ABySS*
BLAST*
Bowtie*
TopHat*
Cufflinks*
BWA*
GATK*
Picard*
SAMtools*
MPI-HMMER*
Velvet*
*Some names and brands may be claimed as the property of others.
AMBER*
CAS-Soft Sphere*
CAS-IPE*
CP2K*
CPMD*
DLPOLY*
GAMESS*
Gaussian*
GROMACS*
LAMMPS*
NAMD*
NWChem*
Quantum Espresso*
VASP*
7
Health & Life Sciences at Intel
Where information and care meet
Profiling: Single Instance Run of GATK
GATK: Genome Analysis Toolkit (The Broad Institute)
• # of Machines = 1
• # of cores/Machine = 24
• Temporary Storage – RAID0 2x4TB HDD
• Input Dataset: G15512.HCC1954.1, coverage:
65x
Average CPU utilization is very low. Most cores not being used
Average I/O bandwidth is very low. Application not I/O bound
Average memory footprint is small. Application not using memory available in newer systems
There is a lot of room to improve
• Open Source Distribution:
https://01.org/workflow-profiler
Health & Life Sciences at Intel
Where information and care meet
GATK 3.0 with The Broad Institute
• Pair HMM Acceleration using Intel® AVX
resulted in 970x speedup
− Computation kernel and bottleneck in
GATK Haplotype Caller
− AVX enables 8 floating point SIMD
operations in parallel
9
Health & Life Sciences at Intel
Where information and care meet
Compression Libraries Tuned for Genomics
• Challenge:
− Data compression is a significant performance limiter for
genomics analytics
− With post-analytics archive of large datasets, very good
compression ratios are required
− Within an active workflow, transient data needs very fast
compression with “acceptable” compression ratio, especially
SAM and BAM formats
• Solution:
− igzip is a library for performing high-speed DEFLATE/gzip
compression
− For BAM & SAM files, the compression ratio of igzip is very close
to zlib -1
−igzip is ~4X the speed of zlib* (at fastest settings)
For more information including technical benchmark specification details: https://software.intel.com/en-us/articles/igzip-a-high-performance-deflate-compressor-with-
optimizations-for-genomic-data
10
Speedup and Compression Ratio delta of igzip0c* vs. gzip/Zlib -1
Compression Performance (Cycles/Byte) and Compression Ratio
*Some names and brands may be claimed as the property of others.
Health & Life Sciences at Intel
Where information and care meet
BLAST – Basic Local Alignment Search Tool
National Center for Biotechnology Information (NCBI)
Xeon BLASTp vs. GPU-BLASTp Xeon + Tesla k40x
• Application:
• blastn v.29. Basic Local Alignment Search Tool searching for
alignment in nucleotide query sequences against a known
nucleotide db volume set.
• Availability:
• blastn v.29:
ftp:://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST
• Highlights:
− throughput for this offload model has a wide sweet spot for a
sufficiently large query set.
• Results:
− Simulation rate with Xeon + Phi™ heterogeneous model is up
to 1.4X
• Code Optimization Strategy:
− Xeon: GAT and OFS parallelized (48T)
− KNC: GAT and OFS parallelized (180T)
11
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark
and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the
results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance
of that product when combined with other products. See benchmark tests and configurations in the speaker notes. For more information go to
http://www.intel.com/performance
1
1.3
1.4
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
1 Node
Speedup
(Higher is Better)
• 2S Intel® Xeon® processor E5-2697v2 (BLASTn v.29 Baseline)
• 2S E5-2697v2 + Intel® Xeon Phi™ 7120A OFS serial
• 2S E5-2697v2 + Intel® Xeon Phi™ 7120A OFS parallelized
Health & Life Sciences at Intel
Where information and care meet
GROMACS
Application: GROMACS 5.0-RC1
Description:
− GROMACS is a versatile package to perform molecular dynamics, i.e. simulate
the Newtonian equations of motion for systems with hundreds to millions of
particles. It is one of the fastest and the most popular Molecular Dynamics
packages
− Workload: 512K H2O with RF method
Availability:
− VERSION 5.0-rc1 is available from http://www.gromacs.org/Downloads &
− ftp://ftp.gromacs.org/pub/gromacs/gromacs-5.0-rc1.tar.gz
− Recipe: https://software.intel.com/en-us/articles/gromacs-for-intel-xeon-phi-
coprocessor
Results:
− Highly optimized for Intel® Xeon® Processors (AVX-intrinsics)
− Able to run full simulation on Intel® Xeon Phi™ coprocessor natively + host
processor using a symmetric model
− Optimized with intrinsics for 512-bit vectorization on Intel Xeon Phi coprocessors
12
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark
and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the
results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance
of that product when combined with other products. See benchmark tests and configurations in the speaker notes. For more information go to
http://www.intel.com/performance
1.0x
1.56x
1.79x
 2S Intel® Xeon® processor E5-2697 v2
 2S Intel® Xeon® processor E5-2697 v2 + 1 Intel® Xeon
Phi™ coprocessor 7120P/X
 2S Intel® Xeon® processor E5-2697 v2 + 2 Intel® Xeon
Phi™ coprocessor 7120P/X
Health & Life Sciences at Intel
Where information and care meet
Genomics Data Processing Pipeline
Lustre
*Some names and brands may be claimed as the property of others.
Health & Life Sciences at Intel
Where information and care meet
HPC Appliances for Life Sciences
• Challenge: Experiment processing takes 7 days with current
infrastructure. Delays treatment for sick patients
• Solution: Dell Next Generation Sequencing Appliance
− Single Rack Solution; 9 Teraflops, Lustre File Storage; Intel SW tools
• Benefits: RNA-Seq processing reduced to 4 hour
• Includes everything you need for NGS - compute, storage, software,
networking, infrastructure, installation, deployment, training, service &
supportDell HSS (Lustre)
(up to 360TB)
Dell NSS (NFS)
(up to 180TB)
Infrastructure:
Dell PE, PC & F10
M420 (Compute)
(up to 32 nodes)
2U Plenum
Actual placement in racks may vary.
NSS-HA Pair
NSS User Data
HSS Metadata
Pair
HSS OSS Pair
HSS User Data
** 2-socket Intel(R) Xeon(R) CPU E5-2687W / 3.1 GHz*Other names and brands may be claimed as the property of others.
Health & Life Sciences at Intel
Where information and care meet
HPC Appliances for Life Sciences
• Challenge: Set up personalized medicine competency – DNA seq. analysis, Finite
Element Analysis, Natural Language Processing, Image Processing, Computational
Fluid Dynamics
• Solution: Dell Next Generation Sequencing Appliance
− 400 cores derived from 40 Intel® Xeon® E5-2680v2 2.8 GHz Ivy Bridge processors in 20 Dell®
PowerEdge M620 nodes providing 9 teraflops
− Room for 12 additional nodes in the Dell M1000e blade enclosure
− 270 terabytes of usable Intel® Enterprise Edition for Lustre* software scalable file system in
a 60-drive Dell® PowerVault MD3260 with 60-drive MD3060e expansion enclosure
− Deployed with Intel® Parallel Studio XE development environment
− InfiniBand backplane and multiple 10 gigabit/ sec uplinks to 1 petabyte replicated grid storage
and the network backbone
• Software:
− CentOS Linux 6.5 64-bit; Bright Cluster Manager® 7.0
− MPI Library: OpenMPI 1.8.1; Intel® MKL; GCC; IEEL v2.1.0.0
− SLURM (Simple Linux Utility for Resource Management) v14.03.0-2462U Plenum
Actual placement in racks may vary.
NSS-HA Pair
NSS User Data
HSS Metadata
Pair
HSS OSS Pair
HSS User Data
** 2-socket Intel(R) Xeon(R) CPU E5-2687W / 3.1 GHz*Other names and brands may be claimed as the property of others.
Health & Life Sciences at Intel
Where information and care meet
IBM, CLC bio Genomics Sequencing Analytics
Solution • Challenge: Need for processing power and storage
capacity in order to correlate the variants in the genome
with the relevant patient symptoms
• Solution: IBM®, CLC Genomics server SW, Genomics
Workbench client SW; Small (48 Cores, 192 GB),
Medium, Large (192 Cores, 768 GB) Analytics Solutions
• Benefits:
– Reference Mapping for 37x coverage human genome – ~9hr (1
node) to ~30mins (37 nodes)
– Variant Calling and annotation for 37x coverage – ~40 hrs (1
node) to ~3hrs (23 nodes)
• Infrastructure
– IBM System x® 3550 M4, E5-2650; 48 CPU cores and 192 GBs of memory to 192 CPU
cores and 768 GBs of memory
– IBM Storwize® V7000
– CLC Genomics Server 5.0.2 , Workbench 6.0.1
– 7x 3TB SAS 6 Gbps HDD (16 TB usable)
http://www-148.ibm.com/bin/newsletter/tool/landingPage.cgi?lpId=6155
Health & Life Sciences at Intel
Where information and care meet
BIONANO Appliance
http://www.thinkmate.com/systems/solutions/bionanogenomics#specs17
System Specs
•2x Ten-Core Intel® Xeon® Processor
E5-2680 V2 2.80GHz 25MB Cache (115W)
•8x 16GB (128GB Total) PC3-14900 1866MHz
DDR3 ECC Registered DIMM
•2x 1.0TB SATA 6.0Gb/s 7200RPM 2.5"
Seagate Constellation.
•6x Intel® Xeon Phi Co-Processor 5110P
1.053GHz - 8GB - 60 Cores
•2x Intel 10-Gigabit Ethernet Ports
via Intel® X540 Chipset - 10GBase-T (RJ-45)
http://www.thinkmate.com/systems/solutions/bionanogenomics#spec
s
Health & Life Sciences at Intel
Where information and care meet
Genomics & Clinical Analytics Appliances
18
2U Plenum
Actual placement in racks may vary.
NSS-HA Pair
NSS User Data
HSS Metadata Pair
HSS OSS Pair
HSS User Data
Health & Life Sciences at Intel
Where information and care meet
Aspera* & Intel optimized solutions for Science DMZ
– Moving the World’s Data at Maximum Speed
• Challenge: Enterprise to Cloud Transmission of Terabyte Payloads
• Solution:
− Demonstrated effective throughput of 73.3Gbps
− Equivalent to downloading 254 whole human genomes per
hour (7.3x speed-up), as compared with baseline Aspera using
commodity 10GbE at 35 whole human genomes per hour
• Benefits with Intel® Xeon® E5 v3 product family:
− AES-NI encryption provides 2x faster inline data encryption,
securely transporting sensitive workloads from enterprise-to-cloud.
− Intel® DPDK, Intel® XL710 40GbE, Intel® NVMe SSDs and Intel®
Xeon® E5 v3 processors significantly boost overall system
performance by moving data closer to the processor, avoiding
unnecessary memory copies, and reducing protocol-related
latencies.
19
Infrastructure and Data Characteristics:
Aspera High-Throughput Transport, featuring Intel pre-production system (Intel® Server Board S2600WT) with
two Intel® Xeon® processor E5-2697 v3 (45M Cache, 2.30 GHz, Intel® Hyper-Threading Technology enabled),
128GB DDR4 memory (2133 MHz), Intel® Communications Chipset 89xx Series, 2x dual-port Intel® XL710
Ethernet Controller (40GbE), 5x Intel® DC P3700 PCIe NVMe Solid-State Drives (800GB), Intel® DPDK 1.7,
Intel® DDIO, Intel® AES-NI GCM encryption, Fedora 20 with custom kernel. Aspera A4 (based on Aspera
fasp* 4). Source: Aspera testing as of August 2014.
Health & Life Sciences at Intel
Where information and care meet
Charite “Real-time” Cancer Analysis – Matching proper
therapies to patients using in-memory techniques
• Challenge: Real-time analysis of cancer
patients using in-memory SAP HANA
Oncolyzer database running on Intel®
Xeon® family infrastructure. (3.5M Data
points per Patient, Up to 20 TB of
data/patient)
• Solution: Using structured and unstructured
data to collect and analyze tables used to
take up to two days -- now takes seconds
• Benefits: Improves medical quality in
disruptive way for Patient, Doctor, Hospital,
Research
http://moss.ger.ith.intel.com/sites/SAP/SAP%20account%20team%20documents/Marketing/SAP%20HANA/SAPHANA_Charite_case_study_HI.PDF
Health & Life Sciences at Intel
Where information and care meet
High Throughput Science: Embracing Cloud-based
Analytics for Computational Chemistry Simulation
• Challenge: Sustaining 50000+ compute
cores for large scale simulations, for less
than a week; CapEX v. OpX
• Solution: Novartis leveraged software from
AWS partner, Cycle Computing, and
MolSoft to provision a fully secured cluster
of 30,000 CPUs, powered by the Intel®
Xeon® processor E5 family.
− Completed screening of 3.2 million
compounds in approximately 9 hrs,
compared to 4 -14 days on existing
resources.
Powerof60.com
Health & Life Sciences at Intel
Where information and care meetIntel Confidential
• Solution: Intel Distribution for
Hadoop (IDH), Map Reduce, Hbase,
Hive
• Benefits: Ability to compare 14
million proteins and more, reducing
the processing time from days to
hours.
• Project Characteristics:
SLA: reducing processing time
from 30 days to less then a day
and scale to 4x4 million samples
comparison
Data: Multi-Terabyte database
Problem Statement:
Comparing all pairs of 14 million proteins,
using BLAST search.
Big Data, Bioinformatics
Team websiteBlast
Program
Genome data
Proteins comparison
High performance scalable
Hadoop/Hbase cluster
Health & Life Sciences at Intel
Where information and care meet
Drug Information Network Analysis
for Treatment & Chemoinformatics
Integration of heterogeneous data sets
reveals relationships among drug compounds,
enabling insight into treatment efficacy
Business Outcomes
• Faster, cheaper discovery of new drugs via focusing research
• Improving patient outcomes by reducing toxic drug reactions
• Reduce cost of treatment by predicting novel efficacy of existing
compounds
25 Pharmacological
Data sets
41 Million
Compounds
• Data preparation
• Feature engineering
• Network/graph modeling
• Visualization
• Machine Learning
• Query
Advanced Analytics
• Medical Research
• Toxicity
• Drug interactions
• Efficacy prediction
Health & Life Sciences at Intel
Where information and care meet
Accelerating Secondary Genomic Analysis Using
Intel Reference Architecture
• Challenge: How to accelerate Secondary Analysis and
Interpretation in Whole Genome Sequencing
• Solution: Intel Genomics Reference Architecture
(IGRA) utilizing Big Data Technologies (Cloudera
CDH5 Hadoop Distro/Impala/Hive/Python) and Intel
Hardware and Infrastructure. (E5-2650, 8 GB*16, 300
GB SSD, E10G42BFSR)
• Benefits:
− 732x acceleration in Query
− 242x acceleration in statistical function (histogram of indel
lengths)
− 15x acceleration in Annotation
− 11x acceleration in Intersect
* 1000 genome data set, comparison of VCFtools vs. IGRA, cumulative
runtime for 23 chromosomes
24
Health & Life Sciences at Intel
Where information and care meet
High-Performance Interconnect (InfiniBand)
Intel® True Scale Fabric
• Challenge: Can high performance interconnect
technology (InfiniBand) keep up with increase in
number of processor cores?
• Workloads: VASP, WIEN2K
• Benchmarks: MVAPICH (MPI over InfiniBand), IMPI
(Intel MPI)
• Results:
− Scale-up research – 5 to 10x speed up when
scaling from single node to 16 nodes
− Intel® True Scale Fabric QDR-40 shows
excellent price/performance results
25
High Performance Scale-out Storage
Challenge:
• Challenge: Need to accelerate and optimize
“time to results” clinical trial simulation
environment; resource allocation and job
prioritization was manual/ad-hoc
• Solution: Scale-Out” architecture: SAS Visual
Analytics, Enterprise Miner, Grid Manager, Red
Hat Enterprise Linux, Xeon E5 servers (HP)
• Benefits:
− Clinical trial simulation exercises reduced
from hours to < 5 minutes; registration
decisions accelerated with multi-hundred
million USD impact
Health & Life Sciences at Intel
Where information and care meet
Genomic Assembly of Non-Human Species
• Challenge: Complete assembly of non-human
genome on existing clusters
• Solution: Intel Xeon E7 4800 V3 + SSDs
(S3700)
• Benefits:
− 29.92 hours (on their cluster) vs. 4.03 hours
(on HSW-EX) for a dataset of 73GB.
High Performance Scale-out Storage
Challenge:
• Challenge: 10-15TB data added weekly, small
fraction of overall storage capacity and need a
system to scale, be flexible and efficient
• Solution: HPC-class storage, powered by Intel®
Enterprise Edition for Lustre software
• Benefits:
− Openess, global namespace
− Performance of upwards of 1 TB/s
− Virtually unlimited file system and per file
sizes, and management simplicity
26
www.intel.com/healthcare/bigdata
Intel Health & Life Sciences
Where information and care meet
Intel/Intermountain Collaboration
An Idea For New CDS Applications Combining
Clinical, Genetic/Genomic, and Family Health History
Data
• Goal - Promote widespread use of clinical decision
support that will help clinicians/counselors in
assessing risk and assist genetic counselors in
ordering genetic tests.
• Build a scalable CDS that leverages standardized data
that includes:
−Family Health History
−Clinical and Screening
−Genomic data
• The solution will:
• Be agnostic to data collection tools. The solution Be
scaled to different clinical domains (grow beyond
Breast Cancer) and other healthcare institutions.
• Be standards based where they exist
• Work across all EHRs, but starting with Cerner.
• Leverage Intel technologies (infrastructure, Intel Data
Platform etc.).
• Be flexible to incorporate other data sources (e.g.
Imaging data, personal device data)
Health & Life Sciences at Intel
Where information and care meet
Policy – United States, European
Union Snapshot of US, EU Recommendations
Develop an ICT-enabled European Strategy for Personalised
Medicine
2014-2020
Driving research to unleash the potential of ICT at the point-of-care
EU R&D initiatives must address:
 Interoperability of technical standards for managing and sharing sequence data in
research and clinical samples;
 Development of hardware, software and workflow algorithms to accelerate cost
efficient analysis of genetic abnormalities that cause cancer and other complex
diseases;
 Research to ensure convergence of Big Data and Cloud Computing infrastructure to
meet the requirements of High Performance Computing and data throughout the life
sciences and healthcare value chains
The eHealth Action Plan 2020 should include Personalised Medicine as a
priority
 Gain knowledge of the challenges and barriers (technical, organizational, legal and
political) to the adoption of ICT in support of Personalised Medicine leveraged by
genomic information;
 Evaluate how to change workflows and education requirements to facilitate adoption
of ICT mediated personalized medicine in clinical practice;
 Expand collaboration with other regions of the world in matters of common interest,
e.g. by leveraging the eHealth MoU with the United States of America;
 Study, evaluate and disseminate technology neutral risk assessment frameworks for
data privacy and security, covering the entire ICT enabled Personalised Medicine
delivery chain;
 Develop effective methods for enabling the use of medical information for public health
and research
Health & Life Sciences at Intel
Where information and care meet
29
Genes causing it
identified & disease
pathways determined
Precision
medicine regime
Precision Medicine: All in a day by 2050 2020
Intel Confidential — Do Not Forward

More Related Content

What's hot

Accelerating Real-Time Analytics Insights Through Hadoop Open Source Ecosystem
Accelerating Real-Time Analytics Insights Through Hadoop Open Source EcosystemAccelerating Real-Time Analytics Insights Through Hadoop Open Source Ecosystem
Accelerating Real-Time Analytics Insights Through Hadoop Open Source EcosystemDataWorks Summit
 
Uniting traditional GIS and mainstream IT
Uniting traditional GIS and mainstream ITUniting traditional GIS and mainstream IT
Uniting traditional GIS and mainstream ITgssg
 
ESG Lab Report - Catalogic Software DPX
ESG Lab Report - Catalogic Software DPXESG Lab Report - Catalogic Software DPX
ESG Lab Report - Catalogic Software DPXCatalogic Software
 
Whitepaper: Big Data - Infrastructure Considerations - Happiest Minds
Whitepaper: Big Data - Infrastructure Considerations - Happiest MindsWhitepaper: Big Data - Infrastructure Considerations - Happiest Minds
Whitepaper: Big Data - Infrastructure Considerations - Happiest MindsHappiest Minds Technologies
 
What is the Point of Hadoop
What is the Point of HadoopWhat is the Point of Hadoop
What is the Point of HadoopDataWorks Summit
 
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...Cognizant
 
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...grssieee
 
A Modern Data Strategy for Precision Medicine
A Modern Data Strategy for Precision MedicineA Modern Data Strategy for Precision Medicine
A Modern Data Strategy for Precision MedicineCloudera, Inc.
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKRajesh Jayarman
 
Database Archiving - Managing Data for Long Retention Periods
Database Archiving - Managing Data for Long Retention PeriodsDatabase Archiving - Managing Data for Long Retention Periods
Database Archiving - Managing Data for Long Retention PeriodsCraig Mullins
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceInformation Security Awareness Group
 
Privacy Preserving Data Analytics using Cryptographic Technique for Large Dat...
Privacy Preserving Data Analytics using Cryptographic Technique for Large Dat...Privacy Preserving Data Analytics using Cryptographic Technique for Large Dat...
Privacy Preserving Data Analytics using Cryptographic Technique for Large Dat...IRJET Journal
 
Big Data-Survey
Big Data-SurveyBig Data-Survey
Big Data-Surveyijeei-iaes
 
Faster Data Processing for healthcare system
Faster Data Processing for healthcare systemFaster Data Processing for healthcare system
Faster Data Processing for healthcare systemRolta
 
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE csandit
 
Data integrity proof techniques in cloud storage
Data integrity proof techniques in cloud storageData integrity proof techniques in cloud storage
Data integrity proof techniques in cloud storageIAEME Publication
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
Enhancing the Security for Clinical Document Architecture Generating System u...
Enhancing the Security for Clinical Document Architecture Generating System u...Enhancing the Security for Clinical Document Architecture Generating System u...
Enhancing the Security for Clinical Document Architecture Generating System u...IRJET Journal
 

What's hot (20)

Hadoop Enabled Healthcare
Hadoop Enabled HealthcareHadoop Enabled Healthcare
Hadoop Enabled Healthcare
 
Accelerating Real-Time Analytics Insights Through Hadoop Open Source Ecosystem
Accelerating Real-Time Analytics Insights Through Hadoop Open Source EcosystemAccelerating Real-Time Analytics Insights Through Hadoop Open Source Ecosystem
Accelerating Real-Time Analytics Insights Through Hadoop Open Source Ecosystem
 
Uniting traditional GIS and mainstream IT
Uniting traditional GIS and mainstream ITUniting traditional GIS and mainstream IT
Uniting traditional GIS and mainstream IT
 
ESG Lab Report - Catalogic Software DPX
ESG Lab Report - Catalogic Software DPXESG Lab Report - Catalogic Software DPX
ESG Lab Report - Catalogic Software DPX
 
Whitepaper: Big Data - Infrastructure Considerations - Happiest Minds
Whitepaper: Big Data - Infrastructure Considerations - Happiest MindsWhitepaper: Big Data - Infrastructure Considerations - Happiest Minds
Whitepaper: Big Data - Infrastructure Considerations - Happiest Minds
 
What is the Point of Hadoop
What is the Point of HadoopWhat is the Point of Hadoop
What is the Point of Hadoop
 
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
 
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
 
A Modern Data Strategy for Precision Medicine
A Modern Data Strategy for Precision MedicineA Modern Data Strategy for Precision Medicine
A Modern Data Strategy for Precision Medicine
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RK
 
Big data analysis concepts and references
Big data analysis concepts and referencesBig data analysis concepts and references
Big data analysis concepts and references
 
Database Archiving - Managing Data for Long Retention Periods
Database Archiving - Managing Data for Long Retention PeriodsDatabase Archiving - Managing Data for Long Retention Periods
Database Archiving - Managing Data for Long Retention Periods
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security Alliance
 
Privacy Preserving Data Analytics using Cryptographic Technique for Large Dat...
Privacy Preserving Data Analytics using Cryptographic Technique for Large Dat...Privacy Preserving Data Analytics using Cryptographic Technique for Large Dat...
Privacy Preserving Data Analytics using Cryptographic Technique for Large Dat...
 
Big Data-Survey
Big Data-SurveyBig Data-Survey
Big Data-Survey
 
Faster Data Processing for healthcare system
Faster Data Processing for healthcare systemFaster Data Processing for healthcare system
Faster Data Processing for healthcare system
 
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
 
Data integrity proof techniques in cloud storage
Data integrity proof techniques in cloud storageData integrity proof techniques in cloud storage
Data integrity proof techniques in cloud storage
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Enhancing the Security for Clinical Document Architecture Generating System u...
Enhancing the Security for Clinical Document Architecture Generating System u...Enhancing the Security for Clinical Document Architecture Generating System u...
Enhancing the Security for Clinical Document Architecture Generating System u...
 

Viewers also liked

Personalized medicine
Personalized medicinePersonalized medicine
Personalized medicinePeter Egorov
 
Precision medicine the future of_healthcare
Precision medicine  the future of_healthcarePrecision medicine  the future of_healthcare
Precision medicine the future of_healthcareThomas Wilckens
 
Personalized Medicine: Current and Future Perspectives Personalized Medicin...
Personalized Medicine: Current and Future Perspectives 	 Personalized Medicin...Personalized Medicine: Current and Future Perspectives 	 Personalized Medicin...
Personalized Medicine: Current and Future Perspectives Personalized Medicin...MedicineAndHealth
 
Personalized medicine
Personalized medicinePersonalized medicine
Personalized medicineMadiheh
 
Precision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsPrecision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsWarren Kibbe
 
InnVentis Precision Medicine 2014
InnVentis  Precision Medicine 2014InnVentis  Precision Medicine 2014
InnVentis Precision Medicine 2014Thomas Wilckens
 
Towards Personalized Medicine
Towards Personalized MedicineTowards Personalized Medicine
Towards Personalized MedicineMichel Dumontier
 
2015 09-14 Precision Medicine 2015, London, Alain van Gool
2015 09-14 Precision Medicine 2015, London, Alain van Gool2015 09-14 Precision Medicine 2015, London, Alain van Gool
2015 09-14 Precision Medicine 2015, London, Alain van GoolAlain van Gool
 
Precision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcarePrecision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcareData Science Thailand
 
Precision Medicine: Opportunities and Challenges for Clinical Trials
Precision Medicine: Opportunities and Challenges for Clinical TrialsPrecision Medicine: Opportunities and Challenges for Clinical Trials
Precision Medicine: Opportunities and Challenges for Clinical TrialsMedpace
 
Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Warren Kibbe
 
From Clinical Decision Support to Precision Medicine
From Clinical Decision Support to Precision MedicineFrom Clinical Decision Support to Precision Medicine
From Clinical Decision Support to Precision MedicineKent State University
 
Presentation on Research Reproducibility at Friends of the National Library o...
Presentation on Research Reproducibility at Friends of the National Library o...Presentation on Research Reproducibility at Friends of the National Library o...
Presentation on Research Reproducibility at Friends of the National Library o...University of California, San Francisco
 

Viewers also liked (20)

Personalized medicine
Personalized medicinePersonalized medicine
Personalized medicine
 
Precision medicine the future of_healthcare
Precision medicine  the future of_healthcarePrecision medicine  the future of_healthcare
Precision medicine the future of_healthcare
 
Personalized Medicine: Current and Future Perspectives Personalized Medicin...
Personalized Medicine: Current and Future Perspectives 	 Personalized Medicin...Personalized Medicine: Current and Future Perspectives 	 Personalized Medicin...
Personalized Medicine: Current and Future Perspectives Personalized Medicin...
 
Personalized medicine
Personalized medicinePersonalized medicine
Personalized medicine
 
Personalized medicine
Personalized medicinePersonalized medicine
Personalized medicine
 
Precision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsPrecision Medicine in Oncology Informatics
Precision Medicine in Oncology Informatics
 
InnVentis Precision Medicine 2014
InnVentis  Precision Medicine 2014InnVentis  Precision Medicine 2014
InnVentis Precision Medicine 2014
 
Towards Personalized Medicine
Towards Personalized MedicineTowards Personalized Medicine
Towards Personalized Medicine
 
2015 09-14 Precision Medicine 2015, London, Alain van Gool
2015 09-14 Precision Medicine 2015, London, Alain van Gool2015 09-14 Precision Medicine 2015, London, Alain van Gool
2015 09-14 Precision Medicine 2015, London, Alain van Gool
 
Personalized Medicine
Personalized MedicinePersonalized Medicine
Personalized Medicine
 
Precision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcarePrecision Medicine - The Future of Healthcare
Precision Medicine - The Future of Healthcare
 
Precision Medicine: Opportunities and Challenges for Clinical Trials
Precision Medicine: Opportunities and Challenges for Clinical TrialsPrecision Medicine: Opportunities and Challenges for Clinical Trials
Precision Medicine: Opportunities and Challenges for Clinical Trials
 
Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017
 
From Clinical Decision Support to Precision Medicine
From Clinical Decision Support to Precision MedicineFrom Clinical Decision Support to Precision Medicine
From Clinical Decision Support to Precision Medicine
 
New sources of big data for precision medicine: are we ready?
New sources of big data for precision medicine: are we ready?New sources of big data for precision medicine: are we ready?
New sources of big data for precision medicine: are we ready?
 
Dr. Leroy Hood Lecuture on P4 Medicine
Dr. Leroy Hood Lecuture on P4 MedicineDr. Leroy Hood Lecuture on P4 Medicine
Dr. Leroy Hood Lecuture on P4 Medicine
 
Precision Medicine World Conference 2017
Precision Medicine World Conference 2017Precision Medicine World Conference 2017
Precision Medicine World Conference 2017
 
Presentation on Research Reproducibility at Friends of the National Library o...
Presentation on Research Reproducibility at Friends of the National Library o...Presentation on Research Reproducibility at Friends of the National Library o...
Presentation on Research Reproducibility at Friends of the National Library o...
 
Intro: California Initiative to Advance Precision Medicine Workshop
Intro: California Initiative to Advance Precision Medicine WorkshopIntro: California Initiative to Advance Precision Medicine Workshop
Intro: California Initiative to Advance Precision Medicine Workshop
 
Pharmacogenetics Ppt
Pharmacogenetics PptPharmacogenetics Ppt
Pharmacogenetics Ppt
 

Similar to Intel precision medicine apr 2015

HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...HPC DAY
 
High Performance Computing: The Essential tool for a Knowledge Economy
High Performance Computing: The Essential tool for a Knowledge EconomyHigh Performance Computing: The Essential tool for a Knowledge Economy
High Performance Computing: The Essential tool for a Knowledge EconomyIntel IT Center
 
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura IntelTDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Inteltdc-globalcode
 
Driving Industrial InnovationOn the Path to Exascale
Driving Industrial InnovationOn the Path to ExascaleDriving Industrial InnovationOn the Path to Exascale
Driving Industrial InnovationOn the Path to ExascaleIntel IT Center
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Intel® Software
 
Big Data Intel® Platform
Big Data Intel® PlatformBig Data Intel® Platform
Big Data Intel® Platformxband
 
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
Deep Learning Training at Scale: Spring Crest Deep Learning AcceleratorDeep Learning Training at Scale: Spring Crest Deep Learning Accelerator
Deep Learning Training at Scale: Spring Crest Deep Learning Acceleratorinside-BigData.com
 
Big data intel platform commenting
Big data   intel platform commentingBig data   intel platform commenting
Big data intel platform commentingIntel IT Center
 
Accelerating AI Adoption with Partners
Accelerating AI Adoption with PartnersAccelerating AI Adoption with Partners
Accelerating AI Adoption with PartnersSri Ambati
 
Spring Hill (NNP-I 1000): Intel's Data Center Inference Chip
Spring Hill (NNP-I 1000): Intel's Data Center Inference ChipSpring Hill (NNP-I 1000): Intel's Data Center Inference Chip
Spring Hill (NNP-I 1000): Intel's Data Center Inference Chipinside-BigData.com
 
Crooke CWF Keynote FINAL final platinum
Crooke CWF Keynote FINAL final platinumCrooke CWF Keynote FINAL final platinum
Crooke CWF Keynote FINAL final platinumAlan Frost
 
Introducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
Introducing the Intel® Xeon Phi Coprocessor Architecture for DiscoveryIntroducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
Introducing the Intel® Xeon Phi Coprocessor Architecture for DiscoveryIntel IT Center
 
Python* Scalability in Production Environments
Python* Scalability in Production EnvironmentsPython* Scalability in Production Environments
Python* Scalability in Production EnvironmentsIntel® Software
 
Develop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster ReadyDevelop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster ReadyIntel IT Center
 
Intel xeon-scalable-processors-overview
Intel xeon-scalable-processors-overviewIntel xeon-scalable-processors-overview
Intel xeon-scalable-processors-overviewDESMOND YUEN
 
Inside story on Intel Data Center @ IDF 2013
Inside story on Intel Data Center @ IDF 2013Inside story on Intel Data Center @ IDF 2013
Inside story on Intel Data Center @ IDF 2013Intel IT Center
 
Best Practice of Compression/Decompression Codes in Apache Spark with Sophia...
 Best Practice of Compression/Decompression Codes in Apache Spark with Sophia... Best Practice of Compression/Decompression Codes in Apache Spark with Sophia...
Best Practice of Compression/Decompression Codes in Apache Spark with Sophia...Databricks
 
Intel’s Big Data and Hadoop Security Initiatives - StampedeCon 2014
Intel’s Big Data and Hadoop Security Initiatives - StampedeCon 2014Intel’s Big Data and Hadoop Security Initiatives - StampedeCon 2014
Intel’s Big Data and Hadoop Security Initiatives - StampedeCon 2014StampedeCon
 

Similar to Intel precision medicine apr 2015 (20)

HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
 
High Performance Computing: The Essential tool for a Knowledge Economy
High Performance Computing: The Essential tool for a Knowledge EconomyHigh Performance Computing: The Essential tool for a Knowledge Economy
High Performance Computing: The Essential tool for a Knowledge Economy
 
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura IntelTDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
 
Driving Industrial InnovationOn the Path to Exascale
Driving Industrial InnovationOn the Path to ExascaleDriving Industrial InnovationOn the Path to Exascale
Driving Industrial InnovationOn the Path to Exascale
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture
 
The Intel Xeon Scalable Processor and IoT
The Intel Xeon Scalable Processor and IoTThe Intel Xeon Scalable Processor and IoT
The Intel Xeon Scalable Processor and IoT
 
Big Data Intel® Platform
Big Data Intel® PlatformBig Data Intel® Platform
Big Data Intel® Platform
 
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
Deep Learning Training at Scale: Spring Crest Deep Learning AcceleratorDeep Learning Training at Scale: Spring Crest Deep Learning Accelerator
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
 
Big data intel platform commenting
Big data   intel platform commentingBig data   intel platform commenting
Big data intel platform commenting
 
Accelerating AI Adoption with Partners
Accelerating AI Adoption with PartnersAccelerating AI Adoption with Partners
Accelerating AI Adoption with Partners
 
Spring Hill (NNP-I 1000): Intel's Data Center Inference Chip
Spring Hill (NNP-I 1000): Intel's Data Center Inference ChipSpring Hill (NNP-I 1000): Intel's Data Center Inference Chip
Spring Hill (NNP-I 1000): Intel's Data Center Inference Chip
 
Crooke CWF Keynote FINAL final platinum
Crooke CWF Keynote FINAL final platinumCrooke CWF Keynote FINAL final platinum
Crooke CWF Keynote FINAL final platinum
 
Introducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
Introducing the Intel® Xeon Phi Coprocessor Architecture for DiscoveryIntroducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
Introducing the Intel® Xeon Phi Coprocessor Architecture for Discovery
 
Python* Scalability in Production Environments
Python* Scalability in Production EnvironmentsPython* Scalability in Production Environments
Python* Scalability in Production Environments
 
Intel python 2017
Intel python 2017Intel python 2017
Intel python 2017
 
Develop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster ReadyDevelop, Deploy, and Innovate with Intel® Cluster Ready
Develop, Deploy, and Innovate with Intel® Cluster Ready
 
Intel xeon-scalable-processors-overview
Intel xeon-scalable-processors-overviewIntel xeon-scalable-processors-overview
Intel xeon-scalable-processors-overview
 
Inside story on Intel Data Center @ IDF 2013
Inside story on Intel Data Center @ IDF 2013Inside story on Intel Data Center @ IDF 2013
Inside story on Intel Data Center @ IDF 2013
 
Best Practice of Compression/Decompression Codes in Apache Spark with Sophia...
 Best Practice of Compression/Decompression Codes in Apache Spark with Sophia... Best Practice of Compression/Decompression Codes in Apache Spark with Sophia...
Best Practice of Compression/Decompression Codes in Apache Spark with Sophia...
 
Intel’s Big Data and Hadoop Security Initiatives - StampedeCon 2014
Intel’s Big Data and Hadoop Security Initiatives - StampedeCon 2014Intel’s Big Data and Hadoop Security Initiatives - StampedeCon 2014
Intel’s Big Data and Hadoop Security Initiatives - StampedeCon 2014
 

Recently uploaded

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Recently uploaded (20)

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Intel precision medicine apr 2015

  • 1. TakingPrecision Medicine Mainstream Ketan Paranjape GM Life Sciences Intel Corp. 1 www.intel.com/healthcare/bigdata
  • 2. Health & Life Sciences at Intel Where information and care meet Big Data Analytics in Health and Life Sciences Today: Many disparate data types, streams… Future: Integrated computing and data 2 Genomics Clinical Claims & transaction s Meds & labs Patient experience Personal data
  • 3. Health & Life Sciences at Intel Where information and care meet Vision for Precision Medicine 1 Patient visit 2 Genes causing disease and key pathways identified 3 Gene targeted drugs identified, Treatment begins in earnest Imaging, parallelism, statistical work, accelerated algorithms Various algorithms, big data, parallelism, accelerated algorithms, statistical and math puzzles, in memory processing + imaging + ML + drug sensitivity assays + mechanistic learning and systems + correlation work and knowledge systems 1-4 days MonthsWeeks PRIMARY ANALYSIS SECONDARY ANALYSIS, DNA/RNA PIPELINE + MORE PRECISION MEDICINE HC HC HC HC HC HC HC HC HC HC Multiple sample compute starts here Joint genotyping Variant store Pop/Dis studyIndividual Multiple individual Predicted Actionable Variants Data Driven Association Clinically Actionable Variants Knowledge Dtabase Clinical trail groups Data curation 3 GOAL :: Precision Medicine in a Day by 2020 !! $1K-$5K $10K+$5K-$10K
  • 4. Health & Life Sciences at Intel Where information and care meet Life Sciences World Map 4 $11B Japan investments $100M ETRI Korea Ontario LS Investments Genomics France Beijing Genomics Institute (PRC) $1.58B line of credit Geisinger Health 100,000 patients NY Genome Center (US) $200M investments Mt Sinai (US) Genomics England 100,000 patients Genomics Qatar 400,000 patients Broad Institute (US) Wellcome Trust Sanger (UK) Charité Germany 100,000 patients Moffitt Health 100,000 patients We are barely scratching the surface …
  • 5. Health & Life Sciences at Intel Where information and care meet Challenges in Life Sciences 5 Big Data in Life Sciences • Sequencer advances – 4x data in 50% less time .5TB/device/day • 4D molecular imaging produces 2TB/device/day • Fragmented software ecosystem, lots of open source Burdens of Data Management • Store, manage, share, ingest and move PBs of research & clinical data • Need to reliably ‘snapshot’ pipelines with archive to tiered storage Innovation Drives Change • Rapid iteration of algorithms far outpace IT, requiring flexibility, agility • Most applications do not fully leverage available infrastructure Converged Infrastructure • Workloads converging between local and cloud-based HPC/Big Data • Advanced orchestration required to maximize throughput & efficiency At the Intersection of Transformative Forces 10 18 Enabling extreme- scale computing on massive data sets Helping enterprises build open, interoperable clouds Contributing code and fostering the ecosystem
  • 6. Health & Life Sciences at Intel Where information and care meet Intel Partnerships and Ecosystem Enablement to resolve challenges Need for Balanced Compute Infrastructure *Other names and brands may be claimed as the property of others.
  • 7. Health & Life Sciences at Intel Where information and care meet Optimizing Top Applications and Pipelines Intel working with industry experts worldwide • Genomics, Molecular Dynamics and Molecular Imaging applications targeting both Intel® Xeon® processors and Xeon® Phi™ coprocessors • Fine- and coarse-grained optimization at the node and cluster level • Work with code authors to release optimizations, disseminate best practices ABySS* BLAST* Bowtie* TopHat* Cufflinks* BWA* GATK* Picard* SAMtools* MPI-HMMER* Velvet* *Some names and brands may be claimed as the property of others. AMBER* CAS-Soft Sphere* CAS-IPE* CP2K* CPMD* DLPOLY* GAMESS* Gaussian* GROMACS* LAMMPS* NAMD* NWChem* Quantum Espresso* VASP* 7
  • 8. Health & Life Sciences at Intel Where information and care meet Profiling: Single Instance Run of GATK GATK: Genome Analysis Toolkit (The Broad Institute) • # of Machines = 1 • # of cores/Machine = 24 • Temporary Storage – RAID0 2x4TB HDD • Input Dataset: G15512.HCC1954.1, coverage: 65x Average CPU utilization is very low. Most cores not being used Average I/O bandwidth is very low. Application not I/O bound Average memory footprint is small. Application not using memory available in newer systems There is a lot of room to improve • Open Source Distribution: https://01.org/workflow-profiler
  • 9. Health & Life Sciences at Intel Where information and care meet GATK 3.0 with The Broad Institute • Pair HMM Acceleration using Intel® AVX resulted in 970x speedup − Computation kernel and bottleneck in GATK Haplotype Caller − AVX enables 8 floating point SIMD operations in parallel 9
  • 10. Health & Life Sciences at Intel Where information and care meet Compression Libraries Tuned for Genomics • Challenge: − Data compression is a significant performance limiter for genomics analytics − With post-analytics archive of large datasets, very good compression ratios are required − Within an active workflow, transient data needs very fast compression with “acceptable” compression ratio, especially SAM and BAM formats • Solution: − igzip is a library for performing high-speed DEFLATE/gzip compression − For BAM & SAM files, the compression ratio of igzip is very close to zlib -1 −igzip is ~4X the speed of zlib* (at fastest settings) For more information including technical benchmark specification details: https://software.intel.com/en-us/articles/igzip-a-high-performance-deflate-compressor-with- optimizations-for-genomic-data 10 Speedup and Compression Ratio delta of igzip0c* vs. gzip/Zlib -1 Compression Performance (Cycles/Byte) and Compression Ratio *Some names and brands may be claimed as the property of others.
  • 11. Health & Life Sciences at Intel Where information and care meet BLAST – Basic Local Alignment Search Tool National Center for Biotechnology Information (NCBI) Xeon BLASTp vs. GPU-BLASTp Xeon + Tesla k40x • Application: • blastn v.29. Basic Local Alignment Search Tool searching for alignment in nucleotide query sequences against a known nucleotide db volume set. • Availability: • blastn v.29: ftp:://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST • Highlights: − throughput for this offload model has a wide sweet spot for a sufficiently large query set. • Results: − Simulation rate with Xeon + Phi™ heterogeneous model is up to 1.4X • Code Optimization Strategy: − Xeon: GAT and OFS parallelized (48T) − KNC: GAT and OFS parallelized (180T) 11 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. See benchmark tests and configurations in the speaker notes. For more information go to http://www.intel.com/performance 1 1.3 1.4 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1 Node Speedup (Higher is Better) • 2S Intel® Xeon® processor E5-2697v2 (BLASTn v.29 Baseline) • 2S E5-2697v2 + Intel® Xeon Phi™ 7120A OFS serial • 2S E5-2697v2 + Intel® Xeon Phi™ 7120A OFS parallelized
  • 12. Health & Life Sciences at Intel Where information and care meet GROMACS Application: GROMACS 5.0-RC1 Description: − GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles. It is one of the fastest and the most popular Molecular Dynamics packages − Workload: 512K H2O with RF method Availability: − VERSION 5.0-rc1 is available from http://www.gromacs.org/Downloads & − ftp://ftp.gromacs.org/pub/gromacs/gromacs-5.0-rc1.tar.gz − Recipe: https://software.intel.com/en-us/articles/gromacs-for-intel-xeon-phi- coprocessor Results: − Highly optimized for Intel® Xeon® Processors (AVX-intrinsics) − Able to run full simulation on Intel® Xeon Phi™ coprocessor natively + host processor using a symmetric model − Optimized with intrinsics for 512-bit vectorization on Intel Xeon Phi coprocessors 12 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. See benchmark tests and configurations in the speaker notes. For more information go to http://www.intel.com/performance 1.0x 1.56x 1.79x  2S Intel® Xeon® processor E5-2697 v2  2S Intel® Xeon® processor E5-2697 v2 + 1 Intel® Xeon Phi™ coprocessor 7120P/X  2S Intel® Xeon® processor E5-2697 v2 + 2 Intel® Xeon Phi™ coprocessor 7120P/X
  • 13. Health & Life Sciences at Intel Where information and care meet Genomics Data Processing Pipeline Lustre *Some names and brands may be claimed as the property of others.
  • 14. Health & Life Sciences at Intel Where information and care meet HPC Appliances for Life Sciences • Challenge: Experiment processing takes 7 days with current infrastructure. Delays treatment for sick patients • Solution: Dell Next Generation Sequencing Appliance − Single Rack Solution; 9 Teraflops, Lustre File Storage; Intel SW tools • Benefits: RNA-Seq processing reduced to 4 hour • Includes everything you need for NGS - compute, storage, software, networking, infrastructure, installation, deployment, training, service & supportDell HSS (Lustre) (up to 360TB) Dell NSS (NFS) (up to 180TB) Infrastructure: Dell PE, PC & F10 M420 (Compute) (up to 32 nodes) 2U Plenum Actual placement in racks may vary. NSS-HA Pair NSS User Data HSS Metadata Pair HSS OSS Pair HSS User Data ** 2-socket Intel(R) Xeon(R) CPU E5-2687W / 3.1 GHz*Other names and brands may be claimed as the property of others.
  • 15. Health & Life Sciences at Intel Where information and care meet HPC Appliances for Life Sciences • Challenge: Set up personalized medicine competency – DNA seq. analysis, Finite Element Analysis, Natural Language Processing, Image Processing, Computational Fluid Dynamics • Solution: Dell Next Generation Sequencing Appliance − 400 cores derived from 40 Intel® Xeon® E5-2680v2 2.8 GHz Ivy Bridge processors in 20 Dell® PowerEdge M620 nodes providing 9 teraflops − Room for 12 additional nodes in the Dell M1000e blade enclosure − 270 terabytes of usable Intel® Enterprise Edition for Lustre* software scalable file system in a 60-drive Dell® PowerVault MD3260 with 60-drive MD3060e expansion enclosure − Deployed with Intel® Parallel Studio XE development environment − InfiniBand backplane and multiple 10 gigabit/ sec uplinks to 1 petabyte replicated grid storage and the network backbone • Software: − CentOS Linux 6.5 64-bit; Bright Cluster Manager® 7.0 − MPI Library: OpenMPI 1.8.1; Intel® MKL; GCC; IEEL v2.1.0.0 − SLURM (Simple Linux Utility for Resource Management) v14.03.0-2462U Plenum Actual placement in racks may vary. NSS-HA Pair NSS User Data HSS Metadata Pair HSS OSS Pair HSS User Data ** 2-socket Intel(R) Xeon(R) CPU E5-2687W / 3.1 GHz*Other names and brands may be claimed as the property of others.
  • 16. Health & Life Sciences at Intel Where information and care meet IBM, CLC bio Genomics Sequencing Analytics Solution • Challenge: Need for processing power and storage capacity in order to correlate the variants in the genome with the relevant patient symptoms • Solution: IBM®, CLC Genomics server SW, Genomics Workbench client SW; Small (48 Cores, 192 GB), Medium, Large (192 Cores, 768 GB) Analytics Solutions • Benefits: – Reference Mapping for 37x coverage human genome – ~9hr (1 node) to ~30mins (37 nodes) – Variant Calling and annotation for 37x coverage – ~40 hrs (1 node) to ~3hrs (23 nodes) • Infrastructure – IBM System x® 3550 M4, E5-2650; 48 CPU cores and 192 GBs of memory to 192 CPU cores and 768 GBs of memory – IBM Storwize® V7000 – CLC Genomics Server 5.0.2 , Workbench 6.0.1 – 7x 3TB SAS 6 Gbps HDD (16 TB usable) http://www-148.ibm.com/bin/newsletter/tool/landingPage.cgi?lpId=6155
  • 17. Health & Life Sciences at Intel Where information and care meet BIONANO Appliance http://www.thinkmate.com/systems/solutions/bionanogenomics#specs17 System Specs •2x Ten-Core Intel® Xeon® Processor E5-2680 V2 2.80GHz 25MB Cache (115W) •8x 16GB (128GB Total) PC3-14900 1866MHz DDR3 ECC Registered DIMM •2x 1.0TB SATA 6.0Gb/s 7200RPM 2.5" Seagate Constellation. •6x Intel® Xeon Phi Co-Processor 5110P 1.053GHz - 8GB - 60 Cores •2x Intel 10-Gigabit Ethernet Ports via Intel® X540 Chipset - 10GBase-T (RJ-45) http://www.thinkmate.com/systems/solutions/bionanogenomics#spec s
  • 18. Health & Life Sciences at Intel Where information and care meet Genomics & Clinical Analytics Appliances 18 2U Plenum Actual placement in racks may vary. NSS-HA Pair NSS User Data HSS Metadata Pair HSS OSS Pair HSS User Data
  • 19. Health & Life Sciences at Intel Where information and care meet Aspera* & Intel optimized solutions for Science DMZ – Moving the World’s Data at Maximum Speed • Challenge: Enterprise to Cloud Transmission of Terabyte Payloads • Solution: − Demonstrated effective throughput of 73.3Gbps − Equivalent to downloading 254 whole human genomes per hour (7.3x speed-up), as compared with baseline Aspera using commodity 10GbE at 35 whole human genomes per hour • Benefits with Intel® Xeon® E5 v3 product family: − AES-NI encryption provides 2x faster inline data encryption, securely transporting sensitive workloads from enterprise-to-cloud. − Intel® DPDK, Intel® XL710 40GbE, Intel® NVMe SSDs and Intel® Xeon® E5 v3 processors significantly boost overall system performance by moving data closer to the processor, avoiding unnecessary memory copies, and reducing protocol-related latencies. 19 Infrastructure and Data Characteristics: Aspera High-Throughput Transport, featuring Intel pre-production system (Intel® Server Board S2600WT) with two Intel® Xeon® processor E5-2697 v3 (45M Cache, 2.30 GHz, Intel® Hyper-Threading Technology enabled), 128GB DDR4 memory (2133 MHz), Intel® Communications Chipset 89xx Series, 2x dual-port Intel® XL710 Ethernet Controller (40GbE), 5x Intel® DC P3700 PCIe NVMe Solid-State Drives (800GB), Intel® DPDK 1.7, Intel® DDIO, Intel® AES-NI GCM encryption, Fedora 20 with custom kernel. Aspera A4 (based on Aspera fasp* 4). Source: Aspera testing as of August 2014.
  • 20. Health & Life Sciences at Intel Where information and care meet Charite “Real-time” Cancer Analysis – Matching proper therapies to patients using in-memory techniques • Challenge: Real-time analysis of cancer patients using in-memory SAP HANA Oncolyzer database running on Intel® Xeon® family infrastructure. (3.5M Data points per Patient, Up to 20 TB of data/patient) • Solution: Using structured and unstructured data to collect and analyze tables used to take up to two days -- now takes seconds • Benefits: Improves medical quality in disruptive way for Patient, Doctor, Hospital, Research http://moss.ger.ith.intel.com/sites/SAP/SAP%20account%20team%20documents/Marketing/SAP%20HANA/SAPHANA_Charite_case_study_HI.PDF
  • 21. Health & Life Sciences at Intel Where information and care meet High Throughput Science: Embracing Cloud-based Analytics for Computational Chemistry Simulation • Challenge: Sustaining 50000+ compute cores for large scale simulations, for less than a week; CapEX v. OpX • Solution: Novartis leveraged software from AWS partner, Cycle Computing, and MolSoft to provision a fully secured cluster of 30,000 CPUs, powered by the Intel® Xeon® processor E5 family. − Completed screening of 3.2 million compounds in approximately 9 hrs, compared to 4 -14 days on existing resources. Powerof60.com
  • 22. Health & Life Sciences at Intel Where information and care meetIntel Confidential • Solution: Intel Distribution for Hadoop (IDH), Map Reduce, Hbase, Hive • Benefits: Ability to compare 14 million proteins and more, reducing the processing time from days to hours. • Project Characteristics: SLA: reducing processing time from 30 days to less then a day and scale to 4x4 million samples comparison Data: Multi-Terabyte database Problem Statement: Comparing all pairs of 14 million proteins, using BLAST search. Big Data, Bioinformatics Team websiteBlast Program Genome data Proteins comparison High performance scalable Hadoop/Hbase cluster
  • 23. Health & Life Sciences at Intel Where information and care meet Drug Information Network Analysis for Treatment & Chemoinformatics Integration of heterogeneous data sets reveals relationships among drug compounds, enabling insight into treatment efficacy Business Outcomes • Faster, cheaper discovery of new drugs via focusing research • Improving patient outcomes by reducing toxic drug reactions • Reduce cost of treatment by predicting novel efficacy of existing compounds 25 Pharmacological Data sets 41 Million Compounds • Data preparation • Feature engineering • Network/graph modeling • Visualization • Machine Learning • Query Advanced Analytics • Medical Research • Toxicity • Drug interactions • Efficacy prediction
  • 24. Health & Life Sciences at Intel Where information and care meet Accelerating Secondary Genomic Analysis Using Intel Reference Architecture • Challenge: How to accelerate Secondary Analysis and Interpretation in Whole Genome Sequencing • Solution: Intel Genomics Reference Architecture (IGRA) utilizing Big Data Technologies (Cloudera CDH5 Hadoop Distro/Impala/Hive/Python) and Intel Hardware and Infrastructure. (E5-2650, 8 GB*16, 300 GB SSD, E10G42BFSR) • Benefits: − 732x acceleration in Query − 242x acceleration in statistical function (histogram of indel lengths) − 15x acceleration in Annotation − 11x acceleration in Intersect * 1000 genome data set, comparison of VCFtools vs. IGRA, cumulative runtime for 23 chromosomes 24
  • 25. Health & Life Sciences at Intel Where information and care meet High-Performance Interconnect (InfiniBand) Intel® True Scale Fabric • Challenge: Can high performance interconnect technology (InfiniBand) keep up with increase in number of processor cores? • Workloads: VASP, WIEN2K • Benchmarks: MVAPICH (MPI over InfiniBand), IMPI (Intel MPI) • Results: − Scale-up research – 5 to 10x speed up when scaling from single node to 16 nodes − Intel® True Scale Fabric QDR-40 shows excellent price/performance results 25 High Performance Scale-out Storage Challenge: • Challenge: Need to accelerate and optimize “time to results” clinical trial simulation environment; resource allocation and job prioritization was manual/ad-hoc • Solution: Scale-Out” architecture: SAS Visual Analytics, Enterprise Miner, Grid Manager, Red Hat Enterprise Linux, Xeon E5 servers (HP) • Benefits: − Clinical trial simulation exercises reduced from hours to < 5 minutes; registration decisions accelerated with multi-hundred million USD impact
  • 26. Health & Life Sciences at Intel Where information and care meet Genomic Assembly of Non-Human Species • Challenge: Complete assembly of non-human genome on existing clusters • Solution: Intel Xeon E7 4800 V3 + SSDs (S3700) • Benefits: − 29.92 hours (on their cluster) vs. 4.03 hours (on HSW-EX) for a dataset of 73GB. High Performance Scale-out Storage Challenge: • Challenge: 10-15TB data added weekly, small fraction of overall storage capacity and need a system to scale, be flexible and efficient • Solution: HPC-class storage, powered by Intel® Enterprise Edition for Lustre software • Benefits: − Openess, global namespace − Performance of upwards of 1 TB/s − Virtually unlimited file system and per file sizes, and management simplicity 26
  • 27. www.intel.com/healthcare/bigdata Intel Health & Life Sciences Where information and care meet Intel/Intermountain Collaboration An Idea For New CDS Applications Combining Clinical, Genetic/Genomic, and Family Health History Data • Goal - Promote widespread use of clinical decision support that will help clinicians/counselors in assessing risk and assist genetic counselors in ordering genetic tests. • Build a scalable CDS that leverages standardized data that includes: −Family Health History −Clinical and Screening −Genomic data • The solution will: • Be agnostic to data collection tools. The solution Be scaled to different clinical domains (grow beyond Breast Cancer) and other healthcare institutions. • Be standards based where they exist • Work across all EHRs, but starting with Cerner. • Leverage Intel technologies (infrastructure, Intel Data Platform etc.). • Be flexible to incorporate other data sources (e.g. Imaging data, personal device data)
  • 28. Health & Life Sciences at Intel Where information and care meet Policy – United States, European Union Snapshot of US, EU Recommendations Develop an ICT-enabled European Strategy for Personalised Medicine 2014-2020 Driving research to unleash the potential of ICT at the point-of-care EU R&D initiatives must address:  Interoperability of technical standards for managing and sharing sequence data in research and clinical samples;  Development of hardware, software and workflow algorithms to accelerate cost efficient analysis of genetic abnormalities that cause cancer and other complex diseases;  Research to ensure convergence of Big Data and Cloud Computing infrastructure to meet the requirements of High Performance Computing and data throughout the life sciences and healthcare value chains The eHealth Action Plan 2020 should include Personalised Medicine as a priority  Gain knowledge of the challenges and barriers (technical, organizational, legal and political) to the adoption of ICT in support of Personalised Medicine leveraged by genomic information;  Evaluate how to change workflows and education requirements to facilitate adoption of ICT mediated personalized medicine in clinical practice;  Expand collaboration with other regions of the world in matters of common interest, e.g. by leveraging the eHealth MoU with the United States of America;  Study, evaluate and disseminate technology neutral risk assessment frameworks for data privacy and security, covering the entire ICT enabled Personalised Medicine delivery chain;  Develop effective methods for enabling the use of medical information for public health and research
  • 29. Health & Life Sciences at Intel Where information and care meet 29 Genes causing it identified & disease pathways determined Precision medicine regime Precision Medicine: All in a day by 2050 2020
  • 30. Intel Confidential — Do Not Forward