SlideShare a Scribd company logo
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H
Partnering
to Advance
Clinical
Genomics
The Microsoft Cloud empowers organizations of
all sizes to re-envision the way they bring together
people, data, and processes that better engage
patients and customers, empower care teams
and employees, optimize clinical and operational
effectiveness, and digitally transform health.
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H
Advancing genomic understanding
Microsoft’s investment in genomics
Partnering for success
Curoverse
BC Platforms
DNAnexus
Democratizing genomics
Table of contents
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H 2
“We’ve done medicine the way
we’ve done medicine because
we haven’t had the benefit of
genomics,” said Dr. Simon Kos,
Chief Medical Officer at Microsoft.
“We prescribe a therapy based on how the average
population will respond, but for individual patients, that
might not be the right therapy—which is cold comfort
for you as an individual patient when you get a therapy
that doesn’t work. Over time, we’ve managed to
elevate medicine to a science, but right now that
science is driven by aggregate evidence rather than
personalized evidence.”
Advancing genomic understanding
The availability of more powerful tools is driving
advances in genomics research and applications that
are bringing tailored, personalized care plans for
patients into greater focus. “When you’ve got genomic
information about a patient, with all of its implications
understood, and add the patient’s phenotypic and
demographic information, then you start to get the
real picture,” said Kos.
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H 3
Infinitely salable, readily available resources
in the cloud
As they gather information from more and more
patients, researchers and clinicians may find that the
volume of data outstrips their in-house resources. A
paper published in the journal PLOS Biology reports
that “as much as 2–40 exabytes of storage capacity
will be needed by 2025 just for the human genomes.”*
Attempting to manage that level of information on-
premises can quickly become a perfect storm of
problems: massive storage requirements, massive
compute requirements, and strict requirements around
security compliance, privacy and data sovereignty.
Fortunately, the cloud, with its practically unlimited
ability to scale, presents a viable alternative. The cloud
can automatically provide additional storage in minutes
—in other words, as soon as it’s needed. In contrast,
the process of requisitioning, purchasing, and setting
up additional disk space on-premises can take months
in many organizations.
Computing resources are similarly scalable on demand.
Translating raw genomic data into useful information
requires extensive processing that only supercomputers
or cloud computing resources can handle. While only a
small number of elite research centers, major medical
centers, or large pharmaceutical companies have access
to super computers, any organization—from the smallest
startup to the largest enterprise—can access the cloud.
A more in-depth understanding of how individuals who
have certain genes might respond to certain therapies
will open the doors to precision medicine, a tantalizing
prospect to clinicians. Technology and medical research
advances are combining to dramatically improve
the timeliness and cost effectiveness of genomics,
making it relevant in clinical scenarios, even for
medical emergencies.
Of course, the ability to tailor treatments much more
specifically based on an individual’s genomic profile
requires more than their genetic data. “It’s not enough
to have the genome,” said Kos. “We’ve got to know
what the genome means, and that requires collaborative
research.” Since volumes of genomes must be
compared to yield insights, the more genomes that are
available, the more accurate and informative the
analysis will be.
“Any one organization may have petabytes of genomic
information, which is huge, but it’s just a fraction of all
of the genomes that have been sequenced in the
world,” said Kos. “Because these are large file sizes, be-
cause historically they’ve been held on-premises, and
because the data is highly sensitive, there’s no way to
aggregate the world’s genomic information to do
de-identified ethical studies into what these genomes
mean. We need large amounts of data, aggregated in
a single place, secure but accessible. That’s what our
cloud platforms afford.”
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H 4
heavy, long-running workloads that push Microsoft’s
cloud platform, Azure, to its limits. “We’re the only
cloud provider that has a research team dedicated to
genomics,” said Miller. “We understand the needs and
concerns of researchers because we’re researchers
ourselves.” To advance genomics, the team takes an
open-source approach, sharing much of its work with
the research community.
In addition to the work her team has done to improve
secondary and tertiary analysis, they’ve helped integrate
genomics-specific analytical tools and capabilities into
the Microsoft platform. “Combining these tools with the
research we’ve done makes our results significantly
more powerful,” Miller explained. “In essence, we’re
blending data science and artificial intelligence with
core bioinformatics.”
Cloud services can be resources, like utilities, that
genomics solutions draw on only as required, keeping
costs down. Industry experts who run datacenters for
a living keep the underlying software current with the
latest features and security patches as soon as they
become available, ensuring that clinicians and
researchers always have immediate access to the
latest technology. Moreover, cloud providers take
physical, technical, and operational measures to
achieve and maintain security and privacy standards
that meet compliance requirements.
Learn more about privacy, security, and compliance
in the Microsoft cloud
“Someday, with the added speeds and feeds from
cloud infrastructure, we’ll be able to sequence and
analyze a full genome for less than a hundred dollars,”
said Kos. “As more sequenced genomes become
available, genomics will move into the mainstream.
Your primary doctor will be able to use your genomic
information for practical purposes.”
Microsoft’s investment in genomics
In 2007, Microsoft established a team focused on
genomics in its Artificial Intelligence and Research
division. This team of engineers, data scientists, and
scientists, led by Geralyn Miller, works hand in hand
with Microsoft’s cloud computing team to create
genomics-specific platforms, consistently running
Project Premonition seeks to detect pathogens in the
wild before they cause outbreaks of disease by capturing
mosquitoes and sequencing the genomes of the mosquito,
the blood they have ingested, and any viruses they are carry-
ing. The project is using drones to find mosquito hotspots
and breeding sites and catches specimens with robotic traps.
A core focus of Microsoft’s genomics team is to
develop analytics, machine learning, and artificial
intelligence algorithms to aid clinicians who want to
make more accurate diagnoses more quickly, and
reduce trial-and-error when prescribing treatments.
Microsoft Research offers Factor Spectrally Transformed
Linear Mixed Models (FaST-LMM), a set of tools for genome-wide
association studies (GWAS). The tools help researchers handle huge
volumes of data by cataloging and comparing nucleotide variations
at specific positions.
Learn more here.
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H 5
The goal is not only to inform a patient’s current
condition, but also predict, based on data collected
from the patient and similar subpopulations, how their
condition may evolve, and help suggest the optimal
course of action.
“For example, just because you have lung cancer
doesn’t mean all lung cancers are the same,” Miller
explained. “Tumors don’t always identify with the tissue
of origin. Today, clinicians generally start with the treat-
ment that’s most commonly used. With genomics, you
can get more specific about the type of cancer and the
best type of treatment. If you can tell upfront whether a
person has an inherited tendency for an adverse reaction
to a drug, you can make a better choice as the doctor.
It’s all about taking the raw data and getting to an
insight that’s actionable in the context of a care situation.”
Partnering for success
Building on its collaboration with researchers, Microsoft
has developed a vibrant ecosystem of solution partners
who focus solely on genomics. As these partners build
solutions based on input from researchers and scientists,
they help Microsoft further develop and accelerate
commercialization of their cloud-based platforms,
optimizing services for storing and analyzing vast
amounts of genomics data.
Curoverse: pooling genomic data while
protecting privacy
“The number of genomes being sequenced today is
growing at three times the rate of Moore’s Law,” said
Keith Elliston, Chief Commercial Officer for Curoverse,
which provides open source genomics enabling tech-
nology. “You’re not done once you’ve sequenced a
single genome. The germ line genome is just the base
map. Things change. When you develop tumors, that’s
a change. Then there’s the whole microbiome, which
is all the bacteria living on your skin, in your gut and in
your hair, in your nose. These have a big impact on
what’s happening. You look at terminally differentiated
cells. Your immune system has rearrangements of the
whole genome in every cell. You have all these
genomes that become very important, so we’re not
talking about one genome per person. We’re talking
about literally thousands of genomes per person.”
Learn more about genomic and biomedical
computing solutions from Curoverse
Since single genomes don’t yield insight in isolation,
the ability to amass genomic data, and then share it
broadly and easily, is particularly important. But
moving data into a single repository—even within
the cloud—isn’t practical.
“If you have a hundred thousand genome centers in
the world, and they each have a hundred thousand
genomes, the model of bringing all of them into one
database will never work,” said Elliston. “It’s all going
to live in the hundred thousand different centers out
there. Your genome sequence is the most private and
personal data you have. It’s really the fundamental
blueprint that makes you you, and from that
perspective, there are lots of laws about how that
data must be protected. If you put data up on the
cloud securely and privately, you’re not going to mix
it with anybody else’s.”
The cloud makes it possible to assemble genome
data from multiple organizations into a data lake,
while retaining the appropriate privacy and security
controls. Cloud-based algorithms can run on
individual datasets regardless of whether they’re
cloud-based or stored in localized datacenters.
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H
Even with faster sequencing, however, the resources
required to process the millions of genomes that will
be necessary to yield a broad range of useful clinical
insights are gargantuan. “When you go from ten
thousand genotype patients to one hundred thousand
to one million patients,” said Nino Da Silva,
Executive Vice President of BC Platforms, a provider
of bioinformatics and genome data management
software headquartered in Switzerland, “it will take
2.4 million CPU core hours just to impute the raw data.
And at this point we haven’t even started analyzing.”
Fortunately, the cloud makes such intensive computing
requirements manageable without a large, upfront
investment in infrastructure. The calculations required
to sequence genomes, and then analyze them, can
take place in the cloud, even if sensitive patient data
remains on-premises. The role of companies like BC
Platforms is to offer end-to-end solutions that provide
ready access to genomic information, even for
clinicians who aren’t geneticists, or don’t have a high
level of training in genetics.
“We take the data and make it understandable,” Da
Silva explained. “When a patient comes in with a
referral for a specific question, such as whether they
have the BRCA breast cancer gene, our system will take
the raw data, align and create the usable file (gVCF)
and from there create the variant core report. We then
inject the report into the electronic medical record or
While the algorithms generate insight, the genome
data stays where it is. “When our platform, Arvados,
does genetic variation analysis,” Elliston explained, “it
federates the data in a way that encapsulates the
analysis in a workflow, which we can distribute across
the datacenters where those data live. We do the
analysis, aggregate the results, and only export
metadata. We don’t move the private individual data.
Even on the same cloud infrastructure, it’s stored in
distinct places. With Arvados, it can still be analyzed
with everyone else’s data, while respecting all that
privacy and ownership.”
lab system without any manual intervention, so
doctors can see the results where they need it, with
the rest of the patient data.” The genomic information
becomes one of many tools that clinicians use in
treating their patients.
Learn more about genomic data management and
analysis solutions from BC Platforms
“Once we’ve optimized the whole flow from the
diagnostics devices in the laboratory to the patient
records, we’ll need to form a standard of what’s going
to be reported and how it will be used in terms of the
actual patient care,” said Da Silva. “It’s also important
to understand that clinical decision makers, university
researchers and potentially commercial partners can
all benefit from different pieces of the same know-
ledge base. We must ensure that doctors and patients
always have the most up-to-date, personalized, valid
interpretation available.”
Microsoft’s method of running the Burrows-Wheeler Aligner
(BWA) and the Broad Institute’s Genome Analysis Toolkit (GATK)
on its Azure cloud computing system is seven times faster than
the previous version, allowing researchers and medical
professionals to get results in just four hours instead of 28.
Learn more here.
Microsoft has a unique ability to do hybrid cloud installations
using local datacenters in combination with Azure. Microsoft
can deploy the Azure Stack on-premises at the customer site or
at the facility of a hosting partner. The clinical data remains se-
curely in the local data warehouse to comply with data sover-
eignty requirements, while the processing and analysis takes
place in the Azure cloud.
BC Platforms: turning raw genomic data
into analytical insights
It took 13 years and USD $2.7 billion to sequence the
first genome. Steady improvements in algorithms
reduced sequencing time to 28 hours. Algorithms
Microsoft has developed with the Broad Institute can
yield results in just four hours.
6
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H
Da Silva believes that at some point, clinicians will rely
on genetic data about patients as a matter of course.
“Genomics is like any other medical system,” he said,
“and I think we should perceive it that way. Nobody
would today consider replacing a hip without doing
an X-ray or an MRI first and interpreting it. I think
genetics will become the same, that nobody will treat
a patient for cancer without having specific genetic
information. Not doing so will be considered as
uninformed as doing hip surgery without an X-ray.”
DNAnexus: employing genomics in
clinical settings
Dr. David Shaywitz, Chief Medical Officer at DNAnexus,
a Mountain View, California company that offers a
cloud-based genome informatics and data manage-
ment platform, agrees that genomics will become a
standard component of clinical evaluation. “There’s an
increased recognition of the relevance of genetics as
people have a better understanding of it and as the
tools have become basically faster, cheaper, and
better,” he said.
Yet many clinicians remain skeptical about genomics.
“For over a generation, physicians, even in medical
school, have been hearing about the promise of
genetics, and have caught glimpses of ways genetic
diagnosis and genetically informed treatments could
really be helpful,” Shaywitz explained. “But there’s
always been a disconnect between the promise
of genetic medicine and the reality of precision
or personalized medicine.” The successful use of
genomics to improve diagnoses, drug selection, and
outcomes for patients with specific cancers, such as
breast cancer, is encouraging skeptics to reevaluate.
“What’s so exciting now are the specific tools available
for patients,” said Shaywitz. “There are more and more
clear use cases—clear opportunities where you can
demonstrate that patients’ lives were dramatically
improved through the use of genetic information.”
For example, it’s estimated that about one-third of the
children under the age of one who are hospitalized
suffer from a genetic condition, and that 25 to 40
percent of mortality in high acuity neonatal intensive
care units (NICUs) may be attributable to a genetic
disorder. “An accurate genetic diagnosis—which in
some cases can now be achieved within about 24
hours —can immediately influence the direction of
care, and also avoid a prolonged and often excruciat-
ing diagnostic odyssey,” said Shaywitz. “While the
primary driver is, as always, humanistic, that is, it’s the
‘right thing to do,’ there are also immediate economic
benefits, given the high cost of keeping children in the
NICU. The result is that appropriate, early use of
genetics in these cases can be a win for everyone.”
Learn more about sharing and management of
genomic data and tools to accelerate genomics
from DNAnexus
DNAnexus has developed a platform for local and
distributed sequencing and analysis, built on the
Microsoft cloud, that enables an end-to-end, rapid
genomics solution to support both researchers and
clinicians. “Oncologists seeking to provide the best care
for their cancer patients are going to be performing
molecular evaluations,” Shaywitz predicted, “and cancer
centers that want to provide the best care for their
patients will want to collect and organize this
information—both the genetic information and the
patient journey information.” Over time, as more
medical centers and research facilities around the world
gather genetic data, it will become more useful for
identifying the best treatments options.
7
D I G I T A L T R A N S F O R M A T I O N I N H E A L T H
“Ideally, we’ll have a learning healthcare system,” said
Shaywitz. “We’d learn from the clinical journey of the first
patient integrating their clinical and genetic information,
and apply it intelligently so that we treat the next patient
in a fashion that reflects the knowledge gained from
each previous interaction. It will be a virtuous cycle.”
Democratizing genomics
The successful use of genomics to improve diagnoses
and outcomes for neonatal infants and patients with
specific cancers, such as breast cancer, is encouraging
efforts to develop more genomic information.
Microsoft and its partners are hard at work making
this information easier to sequence, store, analyze,
and digest, thus transforming the promise of genomics
into real-world evidence that clinicians can use to
improve care.
“We’re taking our research investment and getting the
IP into the hands of our customers and partners through
a series of Azure-based offerings,” said Miller. “Our goal
with our partners is to offer a turnkey way to do the
bioinformatics pipeline—to reduce complexity. When
you run analyses across large sample sets through the
same pipeline, you end up with more consistent data.
In essence, we’re democratizing genomics.”
Learn more about Microsoft’s investments in genomics.
8
Project Hanover, is a Microsoft Research effort that seeks to
empower biomedicine with cutting-edge computer science,
focusing on three areas: machine reading of the millions of bio-
medical articles to form queryable knowledge bases, cancer
decision support using AI with a current focus on Acute Myeloid
Leukemia (AML), and the use of AI in chronic disease management.
DNAnexus, in partnership with Microsoft, supports
The Stanford Center for Genomics and Personalized Medicine
(SCGPM), which is creating a new paradigm of patient-centered
medicine that encompasses the entire genome of individuals
“to improve disease prediction, prevention, and treatment of
conditions such as cancer, diabetes, heart disease, asthma,
schizophrenia, and many others.” Efforts include reducing the
time to achieve effective treatment, integrating bioinformatics
with the patient’s electronic medical record, eliminating trial
and error in prescribing drugs, identifying disease risks, offering
patients genetic counseling and engaging them to enhance
their compliance with prescribed treatments.
Scale. The cloud provides an almost infinite set of computing
resources, which is necessary for storing or processing massive
amounts of data. When you need more resources, you can scale
up instantly and automatically. When you need fewer resources,
you scale down instantly and automatically. In other words, you
get exactly what you need, when you need it—no more, no less.
Choosing a cloud offeringBenefits of cloud computing
Health organizations need cloud services that are not only
powerful, but also secure and compliant. Here is a short
checklist to consider when evaluating cloud offerings:
• Is it Enterprise-ready? Do the services it offers scale to large	
	 organizations with many users or customers?
• Does it offer more than infrastructure, such as platforms 	
	 that support core functions for productivity, 		
	 communications, relationship management, and analytics, 	
	 so you can easily tailor solutions to meet your needs?
• Does it work with on-premises environments and with 	
	 technologies from multiple cloud providers and device 	
	 manufacturers?
• Does it have certifications to show it complies with 		
	 health security and privacy regulations in the countries 	
	 where you operate, so you can safely store and process 	
	 personal health information?
Economics. Few organizations have the resources to set up
or manage very-largescale data centers. The cloud model, like a
utility, is pay-as-you-go. This way you never have to worry about
having fewer resources than you need, and your organization
never has to worry about paying for more than you’re using.
Speed. Traditionally, getting access to new or better tools, more
storage space, or more computing power has involved a lot of
waiting. This happens when IT has to get budget, purchase
hardware and/or software, and set it up before they can give you
what you need. With the cloud, it takes minutes instead of days,
weeks, or months, to get more storage space, compute cycles, or
updated software.
Trust. Security is a big concern when it comes to cloud adoption.
Health data needs to be protected. The industry experts who
manage cloud data centers take physical, technical, and
operational measures to achieve and maintain security and privacy
so your organization can meet the compliance requirements of
regulators. They deploy the latest security patches as soon as
they’re available, so you get them automatically.
9D I G I T A L T R A N S F O R M A T I O N I N H E A L T H
© 2017 Microsoft Corporation. All rights reserved. This document is provided
“as-is.” Information and views expressed in this document, including URL and
other Internet Web site references, may change without notice. You bear the
risk of using it. Some examples are for illustration only and are fictitious. No
real association is intended or inferred.
 
This document does not provide you with any legal rights to any intellectual
property in any Microsoft product. You may copy and use this document for
your internal, reference purposes. Microsoft and Windows are either
registered trademarks of Microsoft Corporation in the United States and/or
other countries.
microsoft.com/health
* Zachary D. Stephens, Skylar Y. Lee, Faraz Faghri, Roy H. Campbell, Chengziang
Zhai, Miles J. Efron, Ravishankar Iyer, Michael C. Schatz, Saurabh Sinha, Gene E.
Robinson. (2015, July 7). Big Data: Astronomical or Genomical? Retrieved from
PLOS Biology: http://journals.plos.org/plosbiology/article?id=10.1371/journal.
pbio.1002195

More Related Content

What's hot

NRNB Annual Report 2013
NRNB Annual Report 2013NRNB Annual Report 2013
NRNB Annual Report 2013
Alexander Pico
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
Ian Foster
 
NRNB Annual Report 2017
NRNB Annual Report 2017NRNB Annual Report 2017
NRNB Annual Report 2017
Alexander Pico
 
NRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallNRNB Annual Report 2016: Overall
NRNB Annual Report 2016: Overall
Alexander Pico
 
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
Pranavathiyani G
 
nm0915-965-2
nm0915-965-2nm0915-965-2
nm0915-965-2
Ekaterina Loginicheva
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
Alexander Pico
 
NetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang SuNetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang Su
Alexander Pico
 
Genomics2 Phenomics Complete
Genomics2 Phenomics CompleteGenomics2 Phenomics Complete
Genomics2 Phenomics Complete
InterpretOmics
 
AI for drug discovery
AI for drug discoveryAI for drug discovery
AI for drug discovery
Deakin University
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
Philip Bourne
 
NetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicNetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana Milenkovic
Alexander Pico
 
Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomi...
Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomi...Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomi...
Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomi...
Tom Connor
 
2011Field talk at iEVOBIO 2011
2011Field talk at iEVOBIO 20112011Field talk at iEVOBIO 2011
2011Field talk at iEVOBIO 2011
MIBBI Checklists
 
Pharma data analytics
Pharma data analyticsPharma data analytics
Pharma data analytics
Axon Lawyers
 
NetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno SchwikowskiNetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno Schwikowski
Alexander Pico
 
NetBioSIG2014-Talk by Tijana Milenkovic
NetBioSIG2014-Talk by Tijana MilenkovicNetBioSIG2014-Talk by Tijana Milenkovic
NetBioSIG2014-Talk by Tijana Milenkovic
Alexander Pico
 
Next Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemNext Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problem
Subhendu Dey
 
Towards a gold standard and regarding quality in public domain chemistry data...
Towards a gold standard and regarding quality in public domain chemistry data...Towards a gold standard and regarding quality in public domain chemistry data...
Towards a gold standard and regarding quality in public domain chemistry data...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

What's hot (20)

NRNB Annual Report 2013
NRNB Annual Report 2013NRNB Annual Report 2013
NRNB Annual Report 2013
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
NRNB Annual Report 2017
NRNB Annual Report 2017NRNB Annual Report 2017
NRNB Annual Report 2017
 
NRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallNRNB Annual Report 2016: Overall
NRNB Annual Report 2016: Overall
 
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
 
nm0915-965-2
nm0915-965-2nm0915-965-2
nm0915-965-2
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
 
NetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang SuNetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang Su
 
Genomics2 Phenomics Complete
Genomics2 Phenomics CompleteGenomics2 Phenomics Complete
Genomics2 Phenomics Complete
 
AI for drug discovery
AI for drug discoveryAI for drug discovery
AI for drug discovery
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
NetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicNetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana Milenkovic
 
Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomi...
Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomi...Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomi...
Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomi...
 
2011Field talk at iEVOBIO 2011
2011Field talk at iEVOBIO 20112011Field talk at iEVOBIO 2011
2011Field talk at iEVOBIO 2011
 
Pharma data analytics
Pharma data analyticsPharma data analytics
Pharma data analytics
 
NetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno SchwikowskiNetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno Schwikowski
 
NetBioSIG2014-Talk by Tijana Milenkovic
NetBioSIG2014-Talk by Tijana MilenkovicNetBioSIG2014-Talk by Tijana Milenkovic
NetBioSIG2014-Talk by Tijana Milenkovic
 
Next Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemNext Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problem
 
Towards a gold standard and regarding quality in public domain chemistry data...
Towards a gold standard and regarding quality in public domain chemistry data...Towards a gold standard and regarding quality in public domain chemistry data...
Towards a gold standard and regarding quality in public domain chemistry data...
 

Similar to Microsoft genomics to advance clinical science

Big data
Big dataBig data
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Conference – iHT2
 
ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014
Warren Kibbe
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
Robert Grossman
 
Big_Data_and_Machine_Learning_in_Plastic_Surgery__.45 (5)
Big_Data_and_Machine_Learning_in_Plastic_Surgery__.45 (5)Big_Data_and_Machine_Learning_in_Plastic_Surgery__.45 (5)
Big_Data_and_Machine_Learning_in_Plastic_Surgery__.45 (5)
Jonathan Kanevsky, MD, FRCSC
 
Caris Life Sciences
Caris Life SciencesCaris Life Sciences
Caris Life Sciences
Gorman K
 
Caris Life Sciences
Caris Life SciencesCaris Life Sciences
Caris Life Sciences
Kim Kozlik
 
Federal Research & Development for the Florida system Sept 2014
Federal Research & Development for the Florida system Sept 2014 Federal Research & Development for the Florida system Sept 2014
Federal Research & Development for the Florida system Sept 2014
Warren Kibbe
 
Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)
Robert Grossman
 
Role of data in precision oncology
Role of data in precision oncologyRole of data in precision oncology
Role of data in precision oncology
Warren Kibbe
 
The Catalyst April Edition
The Catalyst April EditionThe Catalyst April Edition
The Catalyst April Edition
Michael Leung
 
Block23 investor 27_04_2018
Block23 investor 27_04_2018Block23 investor 27_04_2018
Block23 investor 27_04_2018
Prem Couture
 
04 Exec Summ_Shivom
04 Exec Summ_Shivom04 Exec Summ_Shivom
04 Exec Summ_Shivom
Oscar Chan
 
Key Technologies for Genome Analysis
Key Technologies for Genome AnalysisKey Technologies for Genome Analysis
Key Technologies for Genome Analysis
Hannes Smárason
 
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
Yoon Sup Choi
 
Paper id 36201506
Paper id 36201506Paper id 36201506
Paper id 36201506
IJRAT
 
Possible Solution for Managing the Worlds Personal Genetic Data - DNA Guide, ...
Possible Solution for Managing the Worlds Personal Genetic Data - DNA Guide, ...Possible Solution for Managing the Worlds Personal Genetic Data - DNA Guide, ...
Possible Solution for Managing the Worlds Personal Genetic Data - DNA Guide, ...
DNA Compass
 
Benefits of Big Data in Health Care A Revolution
Benefits of Big Data in Health Care A RevolutionBenefits of Big Data in Health Care A Revolution
Benefits of Big Data in Health Care A Revolution
ijtsrd
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Robert Grossman
 
Rock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_Health
Rock Health
 

Similar to Microsoft genomics to advance clinical science (20)

Big data
Big dataBig data
Big data
 
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
Health IT Summit Austin 2013 - Presentation "The Impact of All Data on Health...
 
ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
Big_Data_and_Machine_Learning_in_Plastic_Surgery__.45 (5)
Big_Data_and_Machine_Learning_in_Plastic_Surgery__.45 (5)Big_Data_and_Machine_Learning_in_Plastic_Surgery__.45 (5)
Big_Data_and_Machine_Learning_in_Plastic_Surgery__.45 (5)
 
Caris Life Sciences
Caris Life SciencesCaris Life Sciences
Caris Life Sciences
 
Caris Life Sciences
Caris Life SciencesCaris Life Sciences
Caris Life Sciences
 
Federal Research & Development for the Florida system Sept 2014
Federal Research & Development for the Florida system Sept 2014 Federal Research & Development for the Florida system Sept 2014
Federal Research & Development for the Florida system Sept 2014
 
Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)
 
Role of data in precision oncology
Role of data in precision oncologyRole of data in precision oncology
Role of data in precision oncology
 
The Catalyst April Edition
The Catalyst April EditionThe Catalyst April Edition
The Catalyst April Edition
 
Block23 investor 27_04_2018
Block23 investor 27_04_2018Block23 investor 27_04_2018
Block23 investor 27_04_2018
 
04 Exec Summ_Shivom
04 Exec Summ_Shivom04 Exec Summ_Shivom
04 Exec Summ_Shivom
 
Key Technologies for Genome Analysis
Key Technologies for Genome AnalysisKey Technologies for Genome Analysis
Key Technologies for Genome Analysis
 
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
의료의 미래, 디지털 헬스케어: 신약개발을 중심으로
 
Paper id 36201506
Paper id 36201506Paper id 36201506
Paper id 36201506
 
Possible Solution for Managing the Worlds Personal Genetic Data - DNA Guide, ...
Possible Solution for Managing the Worlds Personal Genetic Data - DNA Guide, ...Possible Solution for Managing the Worlds Personal Genetic Data - DNA Guide, ...
Possible Solution for Managing the Worlds Personal Genetic Data - DNA Guide, ...
 
Benefits of Big Data in Health Care A Revolution
Benefits of Big Data in Health Care A RevolutionBenefits of Big Data in Health Care A Revolution
Benefits of Big Data in Health Care A Revolution
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
 
Rock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_Health
 

Recently uploaded

在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
HUMAN EYE By-R.M Class 10 phy best digital notes.pdf
HUMAN EYE By-R.M Class 10 phy best digital notes.pdfHUMAN EYE By-R.M Class 10 phy best digital notes.pdf
HUMAN EYE By-R.M Class 10 phy best digital notes.pdf
Ritik83251
 
Tissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptxTissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptx
muralinath2
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
Carl Bergstrom
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
Frédéric Baudron
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
Leonel Morgado
 
Summary Of transcription and Translation.pdf
Summary Of transcription and Translation.pdfSummary Of transcription and Translation.pdf
Summary Of transcription and Translation.pdf
vadgavevedant86
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
PirithiRaju
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
Sciences of Europe
 
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Sérgio Sacani
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
RDhivya6
 
cathode ray oscilloscope and its applications
cathode ray oscilloscope and its applicationscathode ray oscilloscope and its applications
cathode ray oscilloscope and its applications
sandertein
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 

Recently uploaded (20)

在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
HUMAN EYE By-R.M Class 10 phy best digital notes.pdf
HUMAN EYE By-R.M Class 10 phy best digital notes.pdfHUMAN EYE By-R.M Class 10 phy best digital notes.pdf
HUMAN EYE By-R.M Class 10 phy best digital notes.pdf
 
Tissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptxTissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptx
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
 
Summary Of transcription and Translation.pdf
Summary Of transcription and Translation.pdfSummary Of transcription and Translation.pdf
Summary Of transcription and Translation.pdf
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
 
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
 
cathode ray oscilloscope and its applications
cathode ray oscilloscope and its applicationscathode ray oscilloscope and its applications
cathode ray oscilloscope and its applications
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 

Microsoft genomics to advance clinical science

  • 1. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H Partnering to Advance Clinical Genomics The Microsoft Cloud empowers organizations of all sizes to re-envision the way they bring together people, data, and processes that better engage patients and customers, empower care teams and employees, optimize clinical and operational effectiveness, and digitally transform health.
  • 2. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H Advancing genomic understanding Microsoft’s investment in genomics Partnering for success Curoverse BC Platforms DNAnexus Democratizing genomics Table of contents
  • 3. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H 2 “We’ve done medicine the way we’ve done medicine because we haven’t had the benefit of genomics,” said Dr. Simon Kos, Chief Medical Officer at Microsoft. “We prescribe a therapy based on how the average population will respond, but for individual patients, that might not be the right therapy—which is cold comfort for you as an individual patient when you get a therapy that doesn’t work. Over time, we’ve managed to elevate medicine to a science, but right now that science is driven by aggregate evidence rather than personalized evidence.” Advancing genomic understanding The availability of more powerful tools is driving advances in genomics research and applications that are bringing tailored, personalized care plans for patients into greater focus. “When you’ve got genomic information about a patient, with all of its implications understood, and add the patient’s phenotypic and demographic information, then you start to get the real picture,” said Kos.
  • 4. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H 3 Infinitely salable, readily available resources in the cloud As they gather information from more and more patients, researchers and clinicians may find that the volume of data outstrips their in-house resources. A paper published in the journal PLOS Biology reports that “as much as 2–40 exabytes of storage capacity will be needed by 2025 just for the human genomes.”* Attempting to manage that level of information on- premises can quickly become a perfect storm of problems: massive storage requirements, massive compute requirements, and strict requirements around security compliance, privacy and data sovereignty. Fortunately, the cloud, with its practically unlimited ability to scale, presents a viable alternative. The cloud can automatically provide additional storage in minutes —in other words, as soon as it’s needed. In contrast, the process of requisitioning, purchasing, and setting up additional disk space on-premises can take months in many organizations. Computing resources are similarly scalable on demand. Translating raw genomic data into useful information requires extensive processing that only supercomputers or cloud computing resources can handle. While only a small number of elite research centers, major medical centers, or large pharmaceutical companies have access to super computers, any organization—from the smallest startup to the largest enterprise—can access the cloud. A more in-depth understanding of how individuals who have certain genes might respond to certain therapies will open the doors to precision medicine, a tantalizing prospect to clinicians. Technology and medical research advances are combining to dramatically improve the timeliness and cost effectiveness of genomics, making it relevant in clinical scenarios, even for medical emergencies. Of course, the ability to tailor treatments much more specifically based on an individual’s genomic profile requires more than their genetic data. “It’s not enough to have the genome,” said Kos. “We’ve got to know what the genome means, and that requires collaborative research.” Since volumes of genomes must be compared to yield insights, the more genomes that are available, the more accurate and informative the analysis will be. “Any one organization may have petabytes of genomic information, which is huge, but it’s just a fraction of all of the genomes that have been sequenced in the world,” said Kos. “Because these are large file sizes, be- cause historically they’ve been held on-premises, and because the data is highly sensitive, there’s no way to aggregate the world’s genomic information to do de-identified ethical studies into what these genomes mean. We need large amounts of data, aggregated in a single place, secure but accessible. That’s what our cloud platforms afford.”
  • 5. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H 4 heavy, long-running workloads that push Microsoft’s cloud platform, Azure, to its limits. “We’re the only cloud provider that has a research team dedicated to genomics,” said Miller. “We understand the needs and concerns of researchers because we’re researchers ourselves.” To advance genomics, the team takes an open-source approach, sharing much of its work with the research community. In addition to the work her team has done to improve secondary and tertiary analysis, they’ve helped integrate genomics-specific analytical tools and capabilities into the Microsoft platform. “Combining these tools with the research we’ve done makes our results significantly more powerful,” Miller explained. “In essence, we’re blending data science and artificial intelligence with core bioinformatics.” Cloud services can be resources, like utilities, that genomics solutions draw on only as required, keeping costs down. Industry experts who run datacenters for a living keep the underlying software current with the latest features and security patches as soon as they become available, ensuring that clinicians and researchers always have immediate access to the latest technology. Moreover, cloud providers take physical, technical, and operational measures to achieve and maintain security and privacy standards that meet compliance requirements. Learn more about privacy, security, and compliance in the Microsoft cloud “Someday, with the added speeds and feeds from cloud infrastructure, we’ll be able to sequence and analyze a full genome for less than a hundred dollars,” said Kos. “As more sequenced genomes become available, genomics will move into the mainstream. Your primary doctor will be able to use your genomic information for practical purposes.” Microsoft’s investment in genomics In 2007, Microsoft established a team focused on genomics in its Artificial Intelligence and Research division. This team of engineers, data scientists, and scientists, led by Geralyn Miller, works hand in hand with Microsoft’s cloud computing team to create genomics-specific platforms, consistently running Project Premonition seeks to detect pathogens in the wild before they cause outbreaks of disease by capturing mosquitoes and sequencing the genomes of the mosquito, the blood they have ingested, and any viruses they are carry- ing. The project is using drones to find mosquito hotspots and breeding sites and catches specimens with robotic traps. A core focus of Microsoft’s genomics team is to develop analytics, machine learning, and artificial intelligence algorithms to aid clinicians who want to make more accurate diagnoses more quickly, and reduce trial-and-error when prescribing treatments. Microsoft Research offers Factor Spectrally Transformed Linear Mixed Models (FaST-LMM), a set of tools for genome-wide association studies (GWAS). The tools help researchers handle huge volumes of data by cataloging and comparing nucleotide variations at specific positions. Learn more here.
  • 6. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H 5 The goal is not only to inform a patient’s current condition, but also predict, based on data collected from the patient and similar subpopulations, how their condition may evolve, and help suggest the optimal course of action. “For example, just because you have lung cancer doesn’t mean all lung cancers are the same,” Miller explained. “Tumors don’t always identify with the tissue of origin. Today, clinicians generally start with the treat- ment that’s most commonly used. With genomics, you can get more specific about the type of cancer and the best type of treatment. If you can tell upfront whether a person has an inherited tendency for an adverse reaction to a drug, you can make a better choice as the doctor. It’s all about taking the raw data and getting to an insight that’s actionable in the context of a care situation.” Partnering for success Building on its collaboration with researchers, Microsoft has developed a vibrant ecosystem of solution partners who focus solely on genomics. As these partners build solutions based on input from researchers and scientists, they help Microsoft further develop and accelerate commercialization of their cloud-based platforms, optimizing services for storing and analyzing vast amounts of genomics data. Curoverse: pooling genomic data while protecting privacy “The number of genomes being sequenced today is growing at three times the rate of Moore’s Law,” said Keith Elliston, Chief Commercial Officer for Curoverse, which provides open source genomics enabling tech- nology. “You’re not done once you’ve sequenced a single genome. The germ line genome is just the base map. Things change. When you develop tumors, that’s a change. Then there’s the whole microbiome, which is all the bacteria living on your skin, in your gut and in your hair, in your nose. These have a big impact on what’s happening. You look at terminally differentiated cells. Your immune system has rearrangements of the whole genome in every cell. You have all these genomes that become very important, so we’re not talking about one genome per person. We’re talking about literally thousands of genomes per person.” Learn more about genomic and biomedical computing solutions from Curoverse Since single genomes don’t yield insight in isolation, the ability to amass genomic data, and then share it broadly and easily, is particularly important. But moving data into a single repository—even within the cloud—isn’t practical. “If you have a hundred thousand genome centers in the world, and they each have a hundred thousand genomes, the model of bringing all of them into one database will never work,” said Elliston. “It’s all going to live in the hundred thousand different centers out there. Your genome sequence is the most private and personal data you have. It’s really the fundamental blueprint that makes you you, and from that perspective, there are lots of laws about how that data must be protected. If you put data up on the cloud securely and privately, you’re not going to mix it with anybody else’s.” The cloud makes it possible to assemble genome data from multiple organizations into a data lake, while retaining the appropriate privacy and security controls. Cloud-based algorithms can run on individual datasets regardless of whether they’re cloud-based or stored in localized datacenters.
  • 7. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H Even with faster sequencing, however, the resources required to process the millions of genomes that will be necessary to yield a broad range of useful clinical insights are gargantuan. “When you go from ten thousand genotype patients to one hundred thousand to one million patients,” said Nino Da Silva, Executive Vice President of BC Platforms, a provider of bioinformatics and genome data management software headquartered in Switzerland, “it will take 2.4 million CPU core hours just to impute the raw data. And at this point we haven’t even started analyzing.” Fortunately, the cloud makes such intensive computing requirements manageable without a large, upfront investment in infrastructure. The calculations required to sequence genomes, and then analyze them, can take place in the cloud, even if sensitive patient data remains on-premises. The role of companies like BC Platforms is to offer end-to-end solutions that provide ready access to genomic information, even for clinicians who aren’t geneticists, or don’t have a high level of training in genetics. “We take the data and make it understandable,” Da Silva explained. “When a patient comes in with a referral for a specific question, such as whether they have the BRCA breast cancer gene, our system will take the raw data, align and create the usable file (gVCF) and from there create the variant core report. We then inject the report into the electronic medical record or While the algorithms generate insight, the genome data stays where it is. “When our platform, Arvados, does genetic variation analysis,” Elliston explained, “it federates the data in a way that encapsulates the analysis in a workflow, which we can distribute across the datacenters where those data live. We do the analysis, aggregate the results, and only export metadata. We don’t move the private individual data. Even on the same cloud infrastructure, it’s stored in distinct places. With Arvados, it can still be analyzed with everyone else’s data, while respecting all that privacy and ownership.” lab system without any manual intervention, so doctors can see the results where they need it, with the rest of the patient data.” The genomic information becomes one of many tools that clinicians use in treating their patients. Learn more about genomic data management and analysis solutions from BC Platforms “Once we’ve optimized the whole flow from the diagnostics devices in the laboratory to the patient records, we’ll need to form a standard of what’s going to be reported and how it will be used in terms of the actual patient care,” said Da Silva. “It’s also important to understand that clinical decision makers, university researchers and potentially commercial partners can all benefit from different pieces of the same know- ledge base. We must ensure that doctors and patients always have the most up-to-date, personalized, valid interpretation available.” Microsoft’s method of running the Burrows-Wheeler Aligner (BWA) and the Broad Institute’s Genome Analysis Toolkit (GATK) on its Azure cloud computing system is seven times faster than the previous version, allowing researchers and medical professionals to get results in just four hours instead of 28. Learn more here. Microsoft has a unique ability to do hybrid cloud installations using local datacenters in combination with Azure. Microsoft can deploy the Azure Stack on-premises at the customer site or at the facility of a hosting partner. The clinical data remains se- curely in the local data warehouse to comply with data sover- eignty requirements, while the processing and analysis takes place in the Azure cloud. BC Platforms: turning raw genomic data into analytical insights It took 13 years and USD $2.7 billion to sequence the first genome. Steady improvements in algorithms reduced sequencing time to 28 hours. Algorithms Microsoft has developed with the Broad Institute can yield results in just four hours. 6
  • 8. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H Da Silva believes that at some point, clinicians will rely on genetic data about patients as a matter of course. “Genomics is like any other medical system,” he said, “and I think we should perceive it that way. Nobody would today consider replacing a hip without doing an X-ray or an MRI first and interpreting it. I think genetics will become the same, that nobody will treat a patient for cancer without having specific genetic information. Not doing so will be considered as uninformed as doing hip surgery without an X-ray.” DNAnexus: employing genomics in clinical settings Dr. David Shaywitz, Chief Medical Officer at DNAnexus, a Mountain View, California company that offers a cloud-based genome informatics and data manage- ment platform, agrees that genomics will become a standard component of clinical evaluation. “There’s an increased recognition of the relevance of genetics as people have a better understanding of it and as the tools have become basically faster, cheaper, and better,” he said. Yet many clinicians remain skeptical about genomics. “For over a generation, physicians, even in medical school, have been hearing about the promise of genetics, and have caught glimpses of ways genetic diagnosis and genetically informed treatments could really be helpful,” Shaywitz explained. “But there’s always been a disconnect between the promise of genetic medicine and the reality of precision or personalized medicine.” The successful use of genomics to improve diagnoses, drug selection, and outcomes for patients with specific cancers, such as breast cancer, is encouraging skeptics to reevaluate. “What’s so exciting now are the specific tools available for patients,” said Shaywitz. “There are more and more clear use cases—clear opportunities where you can demonstrate that patients’ lives were dramatically improved through the use of genetic information.” For example, it’s estimated that about one-third of the children under the age of one who are hospitalized suffer from a genetic condition, and that 25 to 40 percent of mortality in high acuity neonatal intensive care units (NICUs) may be attributable to a genetic disorder. “An accurate genetic diagnosis—which in some cases can now be achieved within about 24 hours —can immediately influence the direction of care, and also avoid a prolonged and often excruciat- ing diagnostic odyssey,” said Shaywitz. “While the primary driver is, as always, humanistic, that is, it’s the ‘right thing to do,’ there are also immediate economic benefits, given the high cost of keeping children in the NICU. The result is that appropriate, early use of genetics in these cases can be a win for everyone.” Learn more about sharing and management of genomic data and tools to accelerate genomics from DNAnexus DNAnexus has developed a platform for local and distributed sequencing and analysis, built on the Microsoft cloud, that enables an end-to-end, rapid genomics solution to support both researchers and clinicians. “Oncologists seeking to provide the best care for their cancer patients are going to be performing molecular evaluations,” Shaywitz predicted, “and cancer centers that want to provide the best care for their patients will want to collect and organize this information—both the genetic information and the patient journey information.” Over time, as more medical centers and research facilities around the world gather genetic data, it will become more useful for identifying the best treatments options. 7
  • 9. D I G I T A L T R A N S F O R M A T I O N I N H E A L T H “Ideally, we’ll have a learning healthcare system,” said Shaywitz. “We’d learn from the clinical journey of the first patient integrating their clinical and genetic information, and apply it intelligently so that we treat the next patient in a fashion that reflects the knowledge gained from each previous interaction. It will be a virtuous cycle.” Democratizing genomics The successful use of genomics to improve diagnoses and outcomes for neonatal infants and patients with specific cancers, such as breast cancer, is encouraging efforts to develop more genomic information. Microsoft and its partners are hard at work making this information easier to sequence, store, analyze, and digest, thus transforming the promise of genomics into real-world evidence that clinicians can use to improve care. “We’re taking our research investment and getting the IP into the hands of our customers and partners through a series of Azure-based offerings,” said Miller. “Our goal with our partners is to offer a turnkey way to do the bioinformatics pipeline—to reduce complexity. When you run analyses across large sample sets through the same pipeline, you end up with more consistent data. In essence, we’re democratizing genomics.” Learn more about Microsoft’s investments in genomics. 8 Project Hanover, is a Microsoft Research effort that seeks to empower biomedicine with cutting-edge computer science, focusing on three areas: machine reading of the millions of bio- medical articles to form queryable knowledge bases, cancer decision support using AI with a current focus on Acute Myeloid Leukemia (AML), and the use of AI in chronic disease management. DNAnexus, in partnership with Microsoft, supports The Stanford Center for Genomics and Personalized Medicine (SCGPM), which is creating a new paradigm of patient-centered medicine that encompasses the entire genome of individuals “to improve disease prediction, prevention, and treatment of conditions such as cancer, diabetes, heart disease, asthma, schizophrenia, and many others.” Efforts include reducing the time to achieve effective treatment, integrating bioinformatics with the patient’s electronic medical record, eliminating trial and error in prescribing drugs, identifying disease risks, offering patients genetic counseling and engaging them to enhance their compliance with prescribed treatments.
  • 10. Scale. The cloud provides an almost infinite set of computing resources, which is necessary for storing or processing massive amounts of data. When you need more resources, you can scale up instantly and automatically. When you need fewer resources, you scale down instantly and automatically. In other words, you get exactly what you need, when you need it—no more, no less. Choosing a cloud offeringBenefits of cloud computing Health organizations need cloud services that are not only powerful, but also secure and compliant. Here is a short checklist to consider when evaluating cloud offerings: • Is it Enterprise-ready? Do the services it offers scale to large organizations with many users or customers? • Does it offer more than infrastructure, such as platforms that support core functions for productivity, communications, relationship management, and analytics, so you can easily tailor solutions to meet your needs? • Does it work with on-premises environments and with technologies from multiple cloud providers and device manufacturers? • Does it have certifications to show it complies with health security and privacy regulations in the countries where you operate, so you can safely store and process personal health information? Economics. Few organizations have the resources to set up or manage very-largescale data centers. The cloud model, like a utility, is pay-as-you-go. This way you never have to worry about having fewer resources than you need, and your organization never has to worry about paying for more than you’re using. Speed. Traditionally, getting access to new or better tools, more storage space, or more computing power has involved a lot of waiting. This happens when IT has to get budget, purchase hardware and/or software, and set it up before they can give you what you need. With the cloud, it takes minutes instead of days, weeks, or months, to get more storage space, compute cycles, or updated software. Trust. Security is a big concern when it comes to cloud adoption. Health data needs to be protected. The industry experts who manage cloud data centers take physical, technical, and operational measures to achieve and maintain security and privacy so your organization can meet the compliance requirements of regulators. They deploy the latest security patches as soon as they’re available, so you get them automatically. 9D I G I T A L T R A N S F O R M A T I O N I N H E A L T H
  • 11. © 2017 Microsoft Corporation. All rights reserved. This document is provided “as-is.” Information and views expressed in this document, including URL and other Internet Web site references, may change without notice. You bear the risk of using it. Some examples are for illustration only and are fictitious. No real association is intended or inferred.   This document does not provide you with any legal rights to any intellectual property in any Microsoft product. You may copy and use this document for your internal, reference purposes. Microsoft and Windows are either registered trademarks of Microsoft Corporation in the United States and/or other countries. microsoft.com/health * Zachary D. Stephens, Skylar Y. Lee, Faraz Faghri, Roy H. Campbell, Chengziang Zhai, Miles J. Efron, Ravishankar Iyer, Michael C. Schatz, Saurabh Sinha, Gene E. Robinson. (2015, July 7). Big Data: Astronomical or Genomical? Retrieved from PLOS Biology: http://journals.plos.org/plosbiology/article?id=10.1371/journal. pbio.1002195