2019-06-21 YC Preso V5.pdf

Yue Cathy Chang
Artificial Intelligence Meets Genomics
Accelerating understanding of our genetic makeup and use
of genome editing to revolutionize medicine
Speaker
Yue Cathy Chang
TutumGene
What You Will Learn
• Discover genome sequencing fundamentals
• Learn different analysis stages of sequencing, how AI applies to
each, genome editing fundamentals, and AI's role
• Discuss questions and considerations in "optimizing" human health
Topics
• Why Genomics?
• What is Genomics and What Can We Do About It?
• Opportunity and Outlooks
Topics
• Why Genomics?
• World Wide Impact on Health Challenges
• What is Genomics and What Can We Do About It?
• Opportunity and Outlooks
• Gene mutations responsible for causing illnesses in:
• Chromosomal diseases (e.g. Down Syndrome)
• Single-gene disorders (e.g. Sickle-Cell Anaemia)
• Multifactorial disorders (e.g. Diabetes)
• Mitochondrial disorders (e.g. Leber’s or LHON)
• Genes play a role in:
• infectious (e.g. AIDS, tuberculosis)
• non commutable diseases (e.g. cancer)
World Wide Impact on Health Challenges
Why Genomics?
World Wide Impact on Health Challenges
• Ongoing genomics revolution promises
• Change how diseases are diagnosed, prevented, and treated
• Provide some of the most personalized and effective medical treatments
• Genomic information and technology has the potential to improve
healthcare outcomes, quality, and safety and reduce cost
Why Genomics?
Genomics and Personalization of Cancer Care at Roche
Why Genomics?
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883576/table/T1/?report=objectonly
Example: Genomics Applications Improve Healthcare
Why Genomics?
Preconception
Prenatal
Newborn
Screening
Disease Susceptibility
Screening and diagnosis
Prognosis and
therapeutic decisions
Monitoring disease
burden and recurrence
Topics
• Why Genomics ?
• What is Genomics and What Can We Do About It?
• Genomics Fundamentals
• Genetics vs. Genomics
• Genome Sequencing and Analysis
• AI Application to Genome Sequencing
• Genome Editing
• Genome Editing in Brief and Technologies
• Genome Editing Efforts
• AI’s Role in Genome Editing
• Opportunity and Outlooks
Concept Fundamentals
What is Genomics?
Sequencing Technology: Exponential Speed Increases
What is Genomics?
Sequencing Technology: Exponential Cost Decreases
What is Genomics?
Oxford Nanopore: MinION technology, 2016
http://blog.goldenhelix.com/grudy/a-hitchhikers-guide-to-next-generation-sequencing-part-2/
Base Calling
read alignment
variant calling
Interpretation
Sense Making
Read Alignment
Variant Calling
Interpretation
Sense Making
Sequencing Technology: 3 Stages of Analysis
What is Genomics?
Base Calling
Sequencing Technology: 3 Stages of Analysis
What is Genomics?
Sequencing Chip Illumina HiSeq 4000 Base calling software
Read Alignment
Sequencing Technology: 3 Stages of Analysis
What is Genomics?
https://www.thermofisher.com/cn/zh/home/life-science/sequencing/sanger-sequencing/sanger-dna-sequencing/
sanger-sequencing-data-analysis/minor-variant-finder-software.html.html
Variant Calling
Sequencing Technology: 3 Stages of Analysis
What is Genomics?
Interpretation and Sense Making
Sequencing Technology: 3 Stages of Analysis
What is Genomics?
For the biologist: analyze gene pathways
For the chemist: uncover structure-function relationships
For the pharmacologist: a first step in the drug discovery process
Base Calling
read alignment
variant calling
Interpretation
Sense Making
Read Alignment
Variant Calling
Interpretation
Sense Making
Sequencing Technology: 3 Stages of Analysis
What is Genomics?
Base Calling
read alignment
variant calling
Interpretation
Sense Making
Read Alignment
Variant Calling
Interpretation
Sense Making
Illumina Real-Time Analysis (RTA) software
Swift on Illumina Solexa Sequencing Platform
DeepVariant (Google)
PrimateAI (Illumina)
Diploid MOON
BaseSpace® Variant Interprator
Application of AI in Sequencing Technology
What is Genomics?
https://github.com/google/deepvariant
AI in Variant Calling: DeepVariant
What is Genomics?
DeepVariant (Google)
• Developed by Google
Brain team and Verily:
• Based on the same neural
network for image recognition
• Challenge: Sequencer errors
confound variant calling
• Published: 2016/12
https://www.biorxiv.org/content/biorxiv/early/
2016/12/21/092890.full.pdf
AI in Variant Calling: PrimateAI
What is Genomics?
• Developed by Illumina
• Challenge: Many variants can be
detected, but only some are
pathogenic in human
• Compared common missense
variants in other primate species
• Predict which may by pathogenic
• Published: 2018/08
https://github.com/Illumina/PrimateAI
https://www.nature.com/articles/
s41588-018-0167-z
BatchNormalization +
Relu +
1D Convolution
BatchNormalization +
Relu +
1D Convolution
1D Convolution
1D Convolution
1D Convolution
Only Primate
Conservation
Only Mammal
Conservation
Only Vertebrate
Conservation
1D Convolution
1D Convolution
Reference Sequence
Alternative Sequence
99 Vertebrate
Conservation
99 Vertebrate
Conservation
Pretrained Solvent
Accessibility Layers
Pretrained Secondary
Structure Layers
Concat
( 51,20)
( 51,20)
( 51,20)
( 51,20)
( 51,20)
( 51,20)
( 51,20)
( 51,40)
( 1,20,40)
( 1,20,40)
( 1,20,40)
( 1,20,40)
( 1,20,40)
( 5,40,40) ( 5,40,40)
( 51,40)
( 51,40)
BatchNormalization +
Relu +
1D Convolution
6 Layers
BatchNormalization +
Relu+
1D Convolution
+
+
( 51,40)
BatchNormalization +
Relu +
1D Convolution
( 5,40,40)
( 5,40,40)
( 1,40,40)
( 51,40)
( 51,40)
( 51,40)
( 51,40)
( 51,1)
BatchNormalization +
Sigmoid+
1D Convolution
( 1,1,1)
( 51,1)
BatchNormalization +
Relu +
1D Convolution
( 5,80,40)
( 51,80)
+ + ( 51,40)
( 51,40)
( 51,40)
( 51,40)
( 51,40)
( 51,40) ( 51,40)
( 51,40)
( 51,40)
( 51,40)
( 51,40)
( 51,40) ( 51,40)
AI in Interpretation: Diploid Moon
What is Genomics?
• Developed by Diploid, Belgium
• Challenge: identify the one or two
mutations responsible for the patient’s
condition hidden amongst 40,000 variants
• Autonomously diagnoses rare diseases
from NGS data using AI, reduce analysis
time from days or weeks to minutes
• Founded in 2014
http://www.diploid.com/
Topics
• Why Genomics?
• What is Genomics and What Can We Do About It?
• Genomics Fundamentals
• Genetics vs. Genomics
• Genome Sequencing and Analysis
• AI Application to Genome Sequencing
• Genome Editing
• Genome Editing Technologies
• Genome Editing Efforts
• AI’s Role in Genome Editing
• Opportunity and Outlooks
• A type of genetic
engineering in which
DNA is inserted,
deleted, modified or
replaced at a precise
location within a genome
of a living organism
The Mechanisms
Genome Editing
Illustrations from yourgenome.org
Insertion
Deletion
Modification
The Technologies
Genome Editing
Info or Comparing
Metrics 
Technologies
ssDNA-RecA-CPP
nucleoprotein filaments
CRISPR base-
editor system
CRISPR/Cas9 TALENS ZFNs
Full name
ssDNA (single strand DNA)-
RecA-CPP (cell-penetrating
peptide) nucleoprotein
mediated homology-directed
recombination (HDR)
CRISPR Base
Editors (CRISPR-
based base editor
system)
CRISPR/Cas9 {(Clustered
Regularly Interspaced Short
Palindromic Repeats
(CRISPR)-associated protein
9 nuclease (Cas9)}
TALENs
(Transcription
activator-like
effector
nucleases)
ZFNs (Zinc
finger
nucleases)
Technology
advancement
Next gen - bacterial
recombinase-based
technology
Fourth gen -
CRISPR-
deaminase based
technology
Third gen - nuclease-based
technology
Second gen -
nuclease-based
technology
First gen -
nuclease-based
technology
Avoids double-
stranded break (DSB)?
YES YES NO NO NO
Does not introduce
unintended mutations?
YES YES NO NO NO
Adapted From TutumGene 2018 Genome Editing Technology Comparison
Cell Types
Genome Editing
• Somatic cells
• All body cells in a multicellular organism that are not sperm or egg cells
• Germ(line) cells
• Cells responsible for reproduction - sperm or egg cells
• Stem cells
• Unspecialized cells with self-renewal capacity that can divide limitlessly to
produce new stem cells, and can differentiate to different cell types in the body
Genome Editing
Source: news.harvard.edu
Somatic, Stem Cell, and Germline Editing
Genome Editing
• Somatic genome editing: more mature; precise and regulated
• Stem cell editing can be used to treat or prevent disease or condition
• Germline genome editing may be needed for some genetic diseases
• that manifest themselves in a systemic way
• that are in a single but widespread tissue
• that are in a tissue not easily accessible
“Someday we may consider it unethical not to use germline editing to
alleviate human suffering.” – from A Crack in Creation
AI Tools - Examples
Genome Editing
• inDelphi, a Dash (a Python
framework) application, helps
scientists predict the
outcomes of end-joining
• Impact: make the editing
process more predictable,
controllable and useful
Published in Nature 2018
AI Tools - Examples
Genome Editing
• UT Austin’s CHAMP, or Chip Hybridized Affinity Mapping Platform,
built on NGS chip, spots editing mistakes
• Applications example: rapidly test a CRISPR molecule across a person’s entire
genome to foresee other DNA segments it might interact with besides its target
• Microsoft’s Elevation AI uses machine learning to predict so-called
off-target effects when editing genes with the CRISPR system
• Applications example: modify cells to combat cancer or produce high-yielding
drought-tolerant crops (e.g. corn and wheat)
Topics
• Why Genomics?
• What is Genomics and What Can We Do About It?
• Opportunity and Outlooks
• Questions and Considerations in Genome Editing Usage
• Market Opportunities
• Players in the field and discussions
Questions / Considerations in Genome Editing
Usage: Safety, Ethics, Regulations
Market Opportunities
Genome Editing market: 2017 $3.19B, CAGR 14.5%, 2022 est $6.28B
Future potential revenue from genome editing applications in:
• Research
• Medical fields
• Healthcare industry
• Food industry
• Agriculture
• Patent licensing
Estimated to be $25B by 2030
Source: grandviewresearch.com
Investments mainly in the US, China, and Europe. Some examples:
• Helix (US): health company that focuses on personal genomics and
connects people with insights into their own DNA
• Oxford Nanopore (UK):develops nanopore-based electronic systems
for analyzing single molecules, including DNA, RNA and proteins.
• iCarbon X (China):a China-based artificial intelligence platform for
health data company
• Prenetics (HK): leading genetic testing and digital health company in
southeast asia
AI in Genomics Players
Your Learning
• Discover genome sequencing fundamentals
• Learn different analysis stages of sequencing, how AI applies to
each, genome editing fundamentals, and AI's role
• Discuss questions and considerations in "optimizing" human health
1 of 36

Recommended

2015 04 22_time_labs_shared by
2015 04 22_time_labs_shared2015 04 22_time_labs_shared
2015 04 22_time_labs_sharedProf. Wim Van Criekinge
1.7K views90 slides
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011 by
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Adam Ford
527 views48 slides
2014 12-11 Skipr99 masterclass Arnhem by
2014 12-11 Skipr99 masterclass Arnhem2014 12-11 Skipr99 masterclass Arnhem
2014 12-11 Skipr99 masterclass ArnhemAlain van Gool
505 views39 slides
Genomics In Personal Care Product Development by
Genomics In Personal Care Product DevelopmentGenomics In Personal Care Product Development
Genomics In Personal Care Product DevelopmentGenemarkers
206 views40 slides
2015 03 13_puurs_v_public by
2015 03 13_puurs_v_public2015 03 13_puurs_v_public
2015 03 13_puurs_v_publicProf. Wim Van Criekinge
1.5K views96 slides
Big Data in Pharma - Overview and Use Cases by
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesJosef Scheiber
4.1K views58 slides

More Related Content

Similar to 2019-06-21 YC Preso V5.pdf

Open data genomics_palermo_2017_ver03 by
Open data genomics_palermo_2017_ver03Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03Neuro, McGill University
315 views100 slides
Life sciences big data use cases by
Life sciences big data use casesLife sciences big data use cases
Life sciences big data use casesGuy Coates
3.6K views24 slides
Gene Editing And Engineering Technology by
Gene Editing And Engineering TechnologyGene Editing And Engineering Technology
Gene Editing And Engineering TechnologyJennifer Mower
3 views41 slides
Genomics and Plant Genomics by
Genomics and Plant GenomicsGenomics and Plant Genomics
Genomics and Plant GenomicsHarshvardhan Gaikwad
12.2K views49 slides
171017 giab for giab grc workshop by
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshopGenome Reference Consortium
312 views27 slides
Bioinformatics issues and challanges presentation at s p college by
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p collegeSKUASTKashmir
4.4K views31 slides

Similar to 2019-06-21 YC Preso V5.pdf(20)

Life sciences big data use cases by Guy Coates
Life sciences big data use casesLife sciences big data use cases
Life sciences big data use cases
Guy Coates3.6K views
Gene Editing And Engineering Technology by Jennifer Mower
Gene Editing And Engineering TechnologyGene Editing And Engineering Technology
Gene Editing And Engineering Technology
Jennifer Mower3 views
Bioinformatics issues and challanges presentation at s p college by SKUASTKashmir
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
SKUASTKashmir4.4K views
GENOMICS AND BIOINFORMATICS by sandeshGM
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
sandeshGM8.6K views
01. Introduction to Bioinformatics.pptx by HussainTaqi1
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx
HussainTaqi110 views
Careers in bioinformatics, Scope, Skills and Jobs by M Abdullah Chaudhry
Careers in bioinformatics, Scope, Skills and JobsCareers in bioinformatics, Scope, Skills and Jobs
Careers in bioinformatics, Scope, Skills and Jobs
M Abdullah Chaudhry7.4K views
Comparative genomics and proteomics by Nikhil Aggarwal
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
Nikhil Aggarwal5K views
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao by William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
William Hsiao348 views
Best Practices for Building an End-to-End Workflow for Microbial Genomics by Jonathan Jacobs, PhD
 Best Practices for Building an End-to-End Workflow for Microbial Genomics Best Practices for Building an End-to-End Workflow for Microbial Genomics
Best Practices for Building an End-to-End Workflow for Microbial Genomics
Thesis ppt by smitabbsr
Thesis pptThesis ppt
Thesis ppt
smitabbsr8.8K views
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by InsideScientific
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
InsideScientific125 views

Recently uploaded

Business administration Project File.pdf by
Business administration Project File.pdfBusiness administration Project File.pdf
Business administration Project File.pdfKiranPrajapati91
10 views36 slides
Inawsidom - Data Journey by
Inawsidom - Data JourneyInawsidom - Data Journey
Inawsidom - Data JourneyPhilipBasford
9 views38 slides
Oral presentation (1).pdf by
Oral presentation (1).pdfOral presentation (1).pdf
Oral presentation (1).pdfreemalmazroui8
6 views10 slides
Best Home Security Systems.pptx by
Best Home Security Systems.pptxBest Home Security Systems.pptx
Best Home Security Systems.pptxmogalang
9 views16 slides
Custom Tag Manager Templates by
Custom Tag Manager TemplatesCustom Tag Manager Templates
Custom Tag Manager TemplatesMarkus Baersch
30 views17 slides
Underfunded.pptx by
Underfunded.pptxUnderfunded.pptx
Underfunded.pptxvgarcia19
15 views7 slides

Recently uploaded(20)

Business administration Project File.pdf by KiranPrajapati91
Business administration Project File.pdfBusiness administration Project File.pdf
Business administration Project File.pdf
KiranPrajapati9110 views
Best Home Security Systems.pptx by mogalang
Best Home Security Systems.pptxBest Home Security Systems.pptx
Best Home Security Systems.pptx
mogalang9 views
Underfunded.pptx by vgarcia19
Underfunded.pptxUnderfunded.pptx
Underfunded.pptx
vgarcia1915 views
DGIQ East 2023 AI Ethics SIG by Karen Lopez
DGIQ East 2023 AI Ethics SIGDGIQ East 2023 AI Ethics SIG
DGIQ East 2023 AI Ethics SIG
Karen Lopez5 views
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning by SARADINDU SENGUPTA
GDG Cloud Community Day 2022 -  Managing data quality in Machine LearningGDG Cloud Community Day 2022 -  Managing data quality in Machine Learning
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning
Product Research sample.pdf by AllenSingson
Product Research sample.pdfProduct Research sample.pdf
Product Research sample.pdf
AllenSingson35 views
[DSC Europe 23] Irena Cerovic - AI in International Development.pdf by DataScienceConferenc1
[DSC Europe 23] Irena Cerovic - AI in International Development.pdf[DSC Europe 23] Irena Cerovic - AI in International Development.pdf
[DSC Europe 23] Irena Cerovic - AI in International Development.pdf
OPPOTUS - Malaysians on Malaysia 3Q2023.pdf by Oppotus
OPPOTUS - Malaysians on Malaysia 3Q2023.pdfOPPOTUS - Malaysians on Malaysia 3Q2023.pdf
OPPOTUS - Malaysians on Malaysia 3Q2023.pdf
Oppotus34 views
4_4_WP_4_06_ND_Model.pptx by d6fmc6kwd4
4_4_WP_4_06_ND_Model.pptx4_4_WP_4_06_ND_Model.pptx
4_4_WP_4_06_ND_Model.pptx
d6fmc6kwd47 views
Lack of communication among family.pptx by ahmed164023
Lack of communication among family.pptxLack of communication among family.pptx
Lack of communication among family.pptx
ahmed16402316 views
6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf by 10urkyr34
6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf
6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf
10urkyr347 views
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion by Bertram Ludäscher
Games, Queries, and Argumentation Frameworks: Time for a Family ReunionGames, Queries, and Argumentation Frameworks: Time for a Family Reunion
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between... by DataScienceConferenc1
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...

2019-06-21 YC Preso V5.pdf

  • 1. Artificial Intelligence Meets Genomics Accelerating understanding of our genetic makeup and use of genome editing to revolutionize medicine
  • 3. What You Will Learn • Discover genome sequencing fundamentals • Learn different analysis stages of sequencing, how AI applies to each, genome editing fundamentals, and AI's role • Discuss questions and considerations in "optimizing" human health
  • 4. Topics • Why Genomics? • What is Genomics and What Can We Do About It? • Opportunity and Outlooks
  • 5. Topics • Why Genomics? • World Wide Impact on Health Challenges • What is Genomics and What Can We Do About It? • Opportunity and Outlooks
  • 6. • Gene mutations responsible for causing illnesses in: • Chromosomal diseases (e.g. Down Syndrome) • Single-gene disorders (e.g. Sickle-Cell Anaemia) • Multifactorial disorders (e.g. Diabetes) • Mitochondrial disorders (e.g. Leber’s or LHON) • Genes play a role in: • infectious (e.g. AIDS, tuberculosis) • non commutable diseases (e.g. cancer) World Wide Impact on Health Challenges Why Genomics?
  • 7. World Wide Impact on Health Challenges • Ongoing genomics revolution promises • Change how diseases are diagnosed, prevented, and treated • Provide some of the most personalized and effective medical treatments • Genomic information and technology has the potential to improve healthcare outcomes, quality, and safety and reduce cost Why Genomics?
  • 8. Genomics and Personalization of Cancer Care at Roche Why Genomics?
  • 9. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883576/table/T1/?report=objectonly Example: Genomics Applications Improve Healthcare Why Genomics? Preconception Prenatal Newborn Screening Disease Susceptibility Screening and diagnosis Prognosis and therapeutic decisions Monitoring disease burden and recurrence
  • 10. Topics • Why Genomics ? • What is Genomics and What Can We Do About It? • Genomics Fundamentals • Genetics vs. Genomics • Genome Sequencing and Analysis • AI Application to Genome Sequencing • Genome Editing • Genome Editing in Brief and Technologies • Genome Editing Efforts • AI’s Role in Genome Editing • Opportunity and Outlooks
  • 12. Sequencing Technology: Exponential Speed Increases What is Genomics?
  • 13. Sequencing Technology: Exponential Cost Decreases What is Genomics? Oxford Nanopore: MinION technology, 2016
  • 14. http://blog.goldenhelix.com/grudy/a-hitchhikers-guide-to-next-generation-sequencing-part-2/ Base Calling read alignment variant calling Interpretation Sense Making Read Alignment Variant Calling Interpretation Sense Making Sequencing Technology: 3 Stages of Analysis What is Genomics?
  • 15. Base Calling Sequencing Technology: 3 Stages of Analysis What is Genomics? Sequencing Chip Illumina HiSeq 4000 Base calling software
  • 16. Read Alignment Sequencing Technology: 3 Stages of Analysis What is Genomics?
  • 18. Interpretation and Sense Making Sequencing Technology: 3 Stages of Analysis What is Genomics? For the biologist: analyze gene pathways For the chemist: uncover structure-function relationships For the pharmacologist: a first step in the drug discovery process
  • 19. Base Calling read alignment variant calling Interpretation Sense Making Read Alignment Variant Calling Interpretation Sense Making Sequencing Technology: 3 Stages of Analysis What is Genomics?
  • 20. Base Calling read alignment variant calling Interpretation Sense Making Read Alignment Variant Calling Interpretation Sense Making Illumina Real-Time Analysis (RTA) software Swift on Illumina Solexa Sequencing Platform DeepVariant (Google) PrimateAI (Illumina) Diploid MOON BaseSpace® Variant Interprator Application of AI in Sequencing Technology What is Genomics?
  • 21. https://github.com/google/deepvariant AI in Variant Calling: DeepVariant What is Genomics? DeepVariant (Google) • Developed by Google Brain team and Verily: • Based on the same neural network for image recognition • Challenge: Sequencer errors confound variant calling • Published: 2016/12 https://www.biorxiv.org/content/biorxiv/early/ 2016/12/21/092890.full.pdf
  • 22. AI in Variant Calling: PrimateAI What is Genomics? • Developed by Illumina • Challenge: Many variants can be detected, but only some are pathogenic in human • Compared common missense variants in other primate species • Predict which may by pathogenic • Published: 2018/08 https://github.com/Illumina/PrimateAI https://www.nature.com/articles/ s41588-018-0167-z BatchNormalization + Relu + 1D Convolution BatchNormalization + Relu + 1D Convolution 1D Convolution 1D Convolution 1D Convolution Only Primate Conservation Only Mammal Conservation Only Vertebrate Conservation 1D Convolution 1D Convolution Reference Sequence Alternative Sequence 99 Vertebrate Conservation 99 Vertebrate Conservation Pretrained Solvent Accessibility Layers Pretrained Secondary Structure Layers Concat ( 51,20) ( 51,20) ( 51,20) ( 51,20) ( 51,20) ( 51,20) ( 51,20) ( 51,40) ( 1,20,40) ( 1,20,40) ( 1,20,40) ( 1,20,40) ( 1,20,40) ( 5,40,40) ( 5,40,40) ( 51,40) ( 51,40) BatchNormalization + Relu + 1D Convolution 6 Layers BatchNormalization + Relu+ 1D Convolution + + ( 51,40) BatchNormalization + Relu + 1D Convolution ( 5,40,40) ( 5,40,40) ( 1,40,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,1) BatchNormalization + Sigmoid+ 1D Convolution ( 1,1,1) ( 51,1) BatchNormalization + Relu + 1D Convolution ( 5,80,40) ( 51,80) + + ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40) ( 51,40)
  • 23. AI in Interpretation: Diploid Moon What is Genomics? • Developed by Diploid, Belgium • Challenge: identify the one or two mutations responsible for the patient’s condition hidden amongst 40,000 variants • Autonomously diagnoses rare diseases from NGS data using AI, reduce analysis time from days or weeks to minutes • Founded in 2014 http://www.diploid.com/
  • 24. Topics • Why Genomics? • What is Genomics and What Can We Do About It? • Genomics Fundamentals • Genetics vs. Genomics • Genome Sequencing and Analysis • AI Application to Genome Sequencing • Genome Editing • Genome Editing Technologies • Genome Editing Efforts • AI’s Role in Genome Editing • Opportunity and Outlooks
  • 25. • A type of genetic engineering in which DNA is inserted, deleted, modified or replaced at a precise location within a genome of a living organism The Mechanisms Genome Editing Illustrations from yourgenome.org Insertion Deletion Modification
  • 26. The Technologies Genome Editing Info or Comparing Metrics Technologies ssDNA-RecA-CPP nucleoprotein filaments CRISPR base- editor system CRISPR/Cas9 TALENS ZFNs Full name ssDNA (single strand DNA)- RecA-CPP (cell-penetrating peptide) nucleoprotein mediated homology-directed recombination (HDR) CRISPR Base Editors (CRISPR- based base editor system) CRISPR/Cas9 {(Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated protein 9 nuclease (Cas9)} TALENs (Transcription activator-like effector nucleases) ZFNs (Zinc finger nucleases) Technology advancement Next gen - bacterial recombinase-based technology Fourth gen - CRISPR- deaminase based technology Third gen - nuclease-based technology Second gen - nuclease-based technology First gen - nuclease-based technology Avoids double- stranded break (DSB)? YES YES NO NO NO Does not introduce unintended mutations? YES YES NO NO NO Adapted From TutumGene 2018 Genome Editing Technology Comparison
  • 27. Cell Types Genome Editing • Somatic cells • All body cells in a multicellular organism that are not sperm or egg cells • Germ(line) cells • Cells responsible for reproduction - sperm or egg cells • Stem cells • Unspecialized cells with self-renewal capacity that can divide limitlessly to produce new stem cells, and can differentiate to different cell types in the body
  • 29. Somatic, Stem Cell, and Germline Editing Genome Editing • Somatic genome editing: more mature; precise and regulated • Stem cell editing can be used to treat or prevent disease or condition • Germline genome editing may be needed for some genetic diseases • that manifest themselves in a systemic way • that are in a single but widespread tissue • that are in a tissue not easily accessible “Someday we may consider it unethical not to use germline editing to alleviate human suffering.” – from A Crack in Creation
  • 30. AI Tools - Examples Genome Editing • inDelphi, a Dash (a Python framework) application, helps scientists predict the outcomes of end-joining • Impact: make the editing process more predictable, controllable and useful Published in Nature 2018
  • 31. AI Tools - Examples Genome Editing • UT Austin’s CHAMP, or Chip Hybridized Affinity Mapping Platform, built on NGS chip, spots editing mistakes • Applications example: rapidly test a CRISPR molecule across a person’s entire genome to foresee other DNA segments it might interact with besides its target • Microsoft’s Elevation AI uses machine learning to predict so-called off-target effects when editing genes with the CRISPR system • Applications example: modify cells to combat cancer or produce high-yielding drought-tolerant crops (e.g. corn and wheat)
  • 32. Topics • Why Genomics? • What is Genomics and What Can We Do About It? • Opportunity and Outlooks • Questions and Considerations in Genome Editing Usage • Market Opportunities • Players in the field and discussions
  • 33. Questions / Considerations in Genome Editing Usage: Safety, Ethics, Regulations
  • 34. Market Opportunities Genome Editing market: 2017 $3.19B, CAGR 14.5%, 2022 est $6.28B Future potential revenue from genome editing applications in: • Research • Medical fields • Healthcare industry • Food industry • Agriculture • Patent licensing Estimated to be $25B by 2030 Source: grandviewresearch.com
  • 35. Investments mainly in the US, China, and Europe. Some examples: • Helix (US): health company that focuses on personal genomics and connects people with insights into their own DNA • Oxford Nanopore (UK):develops nanopore-based electronic systems for analyzing single molecules, including DNA, RNA and proteins. • iCarbon X (China):a China-based artificial intelligence platform for health data company • Prenetics (HK): leading genetic testing and digital health company in southeast asia AI in Genomics Players
  • 36. Your Learning • Discover genome sequencing fundamentals • Learn different analysis stages of sequencing, how AI applies to each, genome editing fundamentals, and AI's role • Discuss questions and considerations in "optimizing" human health