SlideShare a Scribd company logo
Imputation and de novo variant discovery from
low-pass whole genome sequencing data for
cost-effective and scalable trait mapping
Joe Pickrell
@joe_pickrell | joe@gencove.com
WHOLE GENOME SEQUENCING EXOME SEQUENCING
What technology to use to measure genotypes?
SNP ARRAY
O ($1000)
PROS: comprehensive
CONS: expensive, often overkill
O ($100)
PROS: comprehensive in exons
CONS: completely misses non-
coding variations
O ($10)
PROS: cost-effective, well-tested
CONS: no new variant discovery
(e.g. rare or population-specific),
for cost-effectiveness overall
genome coverage suffers
LOW-PASS SEQUENCING
What technology to use to measure genotypes?
SNP ARRAY
O ($10)
Sequencing technologies allow for new
variant discovery, high discovery power
across the genome.
O ($10)
PROS: cost-effective, well-tested
CONS: no new variant discovery
(e.g. rare or population-specific),
for cost-effectiveness overall
genome coverage suffers
+ 💻 =
Shotgun sequence a human genome to (usually) 0.4x or 1x coverage, and
use computational methods to ‘fill in’ anything we missed.
What is low-pass sequencing?
INTUITION
0.4x coverage = one sequencing read at ~30M SNPs
Genotyping array = excellent measurement of 0.5M SNPs
Why now?
2018: nominal price/Mb of sequence is <$0.01
The challenging part of low-pass sequencing is not sequencing per se
1. Cost of commercial library prep kits or outsourcing is higher
(sometimes considerably higher) than the cost of sequencing.
The challenges in low-pass sequencing are in sample
preparation and analysis
2. Going from a fastq file of low-pass sequences to genetic variant
calls is non-trivial and no standard software exists.
1. Divide the 1000 Genomes dataset in two
How does imputation from low-pass sequencing compare to
imputation from arrays?
2. Simulate low-pass sequencing (or genotyping from a few
commonly-used commercial arrays) from one half, impute from
the other
In an African population, low-pass sequencing increases
effective power by ~50-100%
In a European population, low-pass sequencing increases
effective power by ~10-20%
How does imputation from low-pass
sequencing compare to imputation
from arrays?
Up to now this is all simulations.
What about in practice?
How does imputation from low-pass
sequencing compare to imputation
from arrays?
79 European-ancestry individuals sequenced to
~1x coverage
Downsampled to 0.4x, 0.6x, 0.8x
Genotyped on the Affymetrix Axiom Biobank
Precision Medicine Research Array, around 800k
SNPs
Collaboration with Charlie Cox, GSK
High concordance between
genotyping array and imputed low-
pass genome sequences
Concordance at non-reference genotypes at non-reference genotypes
Low-pass sequencing increases power relative to the
PMR array
Can low-pass sequencing be used
to discover variants?
Ignore genotype data, call variants
from the sequencing reads alone
What fraction of polymorphic
variants are identified?
Can low-pass sequencing be used to discover variants?
With 1x sequencing, variants
present in >10 copies are
discovered.
The absolute number of copies
of the variant is more relevant
then the frequency per se; in
massive samples could profile
extremely rare variants.
| Low-pass sequencing increases association power by 10-100%
compared to commonly-used genotyping arrays, particularly in
non-European populations
Summary
| Low-pass sequencing allows for discovery of new/rare
variants, particularly at large sample sizes
| Additional applications: combining low-pass sequencing with
exon capture allows for joint clinical assays of rare and common
variation
THANKS!
Tomaz Berisa

Kaja Wasik

Maria Vazquez
Charlie Cox
Dana Fraser
Karen King
Joe Pickrell | @joe_pickrell | joe@gencove.com
Gencove-GSK results

More Related Content

Similar to Gencove-GSK results

OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
Sean Davis
 
Lecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generationLecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generation
MohamedHasan816582
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data Analysis
Ravi Gandham
 
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
Matthias Samwald
 
20160219 - S. De Toffol - Dal Sanger al NGS nello studio delle mutazioni BRCA
20160219 - S. De Toffol -  Dal Sanger al NGS nello studio delle mutazioni BRCA �20160219 - S. De Toffol -  Dal Sanger al NGS nello studio delle mutazioni BRCA �
20160219 - S. De Toffol - Dal Sanger al NGS nello studio delle mutazioni BRCA
Roberto Scarafia
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
VHIR Vall d’Hebron Institut de Recerca
 
Axiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array PlateAxiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array Plate
Affymetrix
 
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Ilya Klabukov
 
Axiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array PlateAxiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array Plate
Affymetrix
 
Microhaplotype, A Powerful New Type of Genetic Marker
Microhaplotype, A Powerful New Type of Genetic MarkerMicrohaplotype, A Powerful New Type of Genetic Marker
Microhaplotype, A Powerful New Type of Genetic Marker
Mojgan Talebian
 
The Origin of Ashkenazi Levites
The Origin of Ashkenazi Levites The Origin of Ashkenazi Levites
The Origin of Ashkenazi Levites
Family Tree DNA
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
Ulises Urzua
 
FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochure
Affymetrix
 
Aug2013 illumina platinum genomes
Aug2013 illumina platinum genomesAug2013 illumina platinum genomes
Aug2013 illumina platinum genomes
GenomeInABottle
 
Best practices for genotyping analysis of plant and animal genomes with Affym...
Best practices for genotyping analysis of plant and animal genomes with Affym...Best practices for genotyping analysis of plant and animal genomes with Affym...
Best practices for genotyping analysis of plant and animal genomes with Affym...
Affymetrix
 
Axiom® Biobank Genotyping Arrays
Axiom® Biobank Genotyping ArraysAxiom® Biobank Genotyping Arrays
Axiom® Biobank Genotyping Arrays
Affymetrix
 
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANACELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
Roberto Scarafia
 
Genotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataGenotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary data
FAO
 
Bioinformatics and NGS for advancing in hearing loss research
Bioinformatics and NGS for advancing in hearing loss researchBioinformatics and NGS for advancing in hearing loss research
Bioinformatics and NGS for advancing in hearing loss research
Joaquin Dopazo
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
Joaquin Dopazo
 

Similar to Gencove-GSK results (20)

OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
 
Lecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generationLecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generation
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data Analysis
 
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
 
20160219 - S. De Toffol - Dal Sanger al NGS nello studio delle mutazioni BRCA
20160219 - S. De Toffol -  Dal Sanger al NGS nello studio delle mutazioni BRCA �20160219 - S. De Toffol -  Dal Sanger al NGS nello studio delle mutazioni BRCA �
20160219 - S. De Toffol - Dal Sanger al NGS nello studio delle mutazioni BRCA
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
 
Axiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array PlateAxiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array Plate
 
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
 
Axiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array PlateAxiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array Plate
 
Microhaplotype, A Powerful New Type of Genetic Marker
Microhaplotype, A Powerful New Type of Genetic MarkerMicrohaplotype, A Powerful New Type of Genetic Marker
Microhaplotype, A Powerful New Type of Genetic Marker
 
The Origin of Ashkenazi Levites
The Origin of Ashkenazi Levites The Origin of Ashkenazi Levites
The Origin of Ashkenazi Levites
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
 
FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochure
 
Aug2013 illumina platinum genomes
Aug2013 illumina platinum genomesAug2013 illumina platinum genomes
Aug2013 illumina platinum genomes
 
Best practices for genotyping analysis of plant and animal genomes with Affym...
Best practices for genotyping analysis of plant and animal genomes with Affym...Best practices for genotyping analysis of plant and animal genomes with Affym...
Best practices for genotyping analysis of plant and animal genomes with Affym...
 
Axiom® Biobank Genotyping Arrays
Axiom® Biobank Genotyping ArraysAxiom® Biobank Genotyping Arrays
Axiom® Biobank Genotyping Arrays
 
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANACELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
 
Genotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataGenotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary data
 
Bioinformatics and NGS for advancing in hearing loss research
Bioinformatics and NGS for advancing in hearing loss researchBioinformatics and NGS for advancing in hearing loss research
Bioinformatics and NGS for advancing in hearing loss research
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 

Recently uploaded

Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
christinelarrosa
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Pitangent Analytics & Technology Solutions Pvt. Ltd
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
LizaNolte
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
Neo4j
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024
Vadym Kazulkin
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
christinelarrosa
 

Recently uploaded (20)

Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
 

Gencove-GSK results

  • 1. Imputation and de novo variant discovery from low-pass whole genome sequencing data for cost-effective and scalable trait mapping Joe Pickrell @joe_pickrell | joe@gencove.com
  • 2. WHOLE GENOME SEQUENCING EXOME SEQUENCING What technology to use to measure genotypes? SNP ARRAY O ($1000) PROS: comprehensive CONS: expensive, often overkill O ($100) PROS: comprehensive in exons CONS: completely misses non- coding variations O ($10) PROS: cost-effective, well-tested CONS: no new variant discovery (e.g. rare or population-specific), for cost-effectiveness overall genome coverage suffers
  • 3. LOW-PASS SEQUENCING What technology to use to measure genotypes? SNP ARRAY O ($10) Sequencing technologies allow for new variant discovery, high discovery power across the genome. O ($10) PROS: cost-effective, well-tested CONS: no new variant discovery (e.g. rare or population-specific), for cost-effectiveness overall genome coverage suffers + 💻 =
  • 4. Shotgun sequence a human genome to (usually) 0.4x or 1x coverage, and use computational methods to ‘fill in’ anything we missed. What is low-pass sequencing? INTUITION 0.4x coverage = one sequencing read at ~30M SNPs Genotyping array = excellent measurement of 0.5M SNPs
  • 5. Why now? 2018: nominal price/Mb of sequence is <$0.01 The challenging part of low-pass sequencing is not sequencing per se
  • 6. 1. Cost of commercial library prep kits or outsourcing is higher (sometimes considerably higher) than the cost of sequencing. The challenges in low-pass sequencing are in sample preparation and analysis 2. Going from a fastq file of low-pass sequences to genetic variant calls is non-trivial and no standard software exists.
  • 7. 1. Divide the 1000 Genomes dataset in two How does imputation from low-pass sequencing compare to imputation from arrays? 2. Simulate low-pass sequencing (or genotyping from a few commonly-used commercial arrays) from one half, impute from the other
  • 8. In an African population, low-pass sequencing increases effective power by ~50-100%
  • 9. In a European population, low-pass sequencing increases effective power by ~10-20%
  • 10. How does imputation from low-pass sequencing compare to imputation from arrays? Up to now this is all simulations. What about in practice?
  • 11. How does imputation from low-pass sequencing compare to imputation from arrays? 79 European-ancestry individuals sequenced to ~1x coverage Downsampled to 0.4x, 0.6x, 0.8x Genotyped on the Affymetrix Axiom Biobank Precision Medicine Research Array, around 800k SNPs Collaboration with Charlie Cox, GSK
  • 12. High concordance between genotyping array and imputed low- pass genome sequences Concordance at non-reference genotypes at non-reference genotypes
  • 13. Low-pass sequencing increases power relative to the PMR array
  • 14. Can low-pass sequencing be used to discover variants? Ignore genotype data, call variants from the sequencing reads alone What fraction of polymorphic variants are identified?
  • 15. Can low-pass sequencing be used to discover variants? With 1x sequencing, variants present in >10 copies are discovered. The absolute number of copies of the variant is more relevant then the frequency per se; in massive samples could profile extremely rare variants.
  • 16. | Low-pass sequencing increases association power by 10-100% compared to commonly-used genotyping arrays, particularly in non-European populations Summary | Low-pass sequencing allows for discovery of new/rare variants, particularly at large sample sizes | Additional applications: combining low-pass sequencing with exon capture allows for joint clinical assays of rare and common variation
  • 17. THANKS! Tomaz Berisa
 Kaja Wasik
 Maria Vazquez Charlie Cox Dana Fraser Karen King Joe Pickrell | @joe_pickrell | joe@gencove.com