SlideShare a Scribd company logo
1 of 18
Download to read offline
Imputation and de novo variant discovery from
low-pass whole genome sequencing data for
cost-effective and scalable trait mapping
Joe Pickrell
@joe_pickrell | joe@gencove.com
WHOLE GENOME SEQUENCING EXOME SEQUENCING
What technology to use to measure genotypes?
SNP ARRAY
O ($1000)
PROS: comprehensive
CONS: expensive, often overkill
O ($100)
PROS: comprehensive in exons
CONS: completely misses non-
coding variations
O ($10)
PROS: cost-effective, well-tested
CONS: no new variant discovery
(e.g. rare or population-specific),
for cost-effectiveness overall
genome coverage suffers
LOW-PASS SEQUENCING
What technology to use to measure genotypes?
SNP ARRAY
O ($10)
Sequencing technologies allow for new
variant discovery, high discovery power
across the genome.
O ($10)
PROS: cost-effective, well-tested
CONS: no new variant discovery
(e.g. rare or population-specific),
for cost-effectiveness overall
genome coverage suffers
+ 💻 =
Shotgun sequence a human genome to (usually) 0.4x or 1x coverage, and
use computational methods to ‘fill in’ anything we missed.
What is low-pass sequencing?
INTUITION
0.4x coverage = one sequencing read at ~30M SNPs
Genotyping array = excellent measurement of 0.5M SNPs
Why now?
2018: nominal price/Mb of sequence is <$0.01
The challenging part of low-pass sequencing is not sequencing per se
1. Cost of commercial library prep kits or outsourcing is higher
(sometimes considerably higher) than the cost of sequencing.
The challenges in low-pass sequencing are in sample
preparation and analysis
2. Going from a fastq file of low-pass sequences to genetic variant
calls is non-trivial and no standard software exists.
1. Divide the 1000 Genomes dataset in two
How does imputation from low-pass sequencing compare to
imputation from arrays?
2. Simulate low-pass sequencing (or genotyping from a few
commonly-used commercial arrays) from one half, impute from
the other
In an African population, low-pass sequencing increases
effective power by ~50-100%
In a European population, low-pass sequencing increases
effective power by ~10-20%
How does imputation from low-pass
sequencing compare to imputation
from arrays?
Up to now this is all simulations.
What about in practice?
How does imputation from low-pass
sequencing compare to imputation
from arrays?
79 European-ancestry individuals sequenced to
~1x coverage
Downsampled to 0.4x, 0.6x, 0.8x
Genotyped on the Affymetrix Axiom Biobank
Precision Medicine Research Array, around 800k
SNPs
Collaboration with Charlie Cox, GSK
High concordance between
genotyping array and imputed low-
pass genome sequences
Concordance at non-reference genotypes at non-reference genotypes
Low-pass sequencing increases power relative to the
PMR array
Can low-pass sequencing be used
to discover variants?
Ignore genotype data, call variants
from the sequencing reads alone
What fraction of polymorphic
variants are identified?
Can low-pass sequencing be used to discover variants?
With 1x sequencing, variants
present in >10 copies are
discovered.
The absolute number of copies
of the variant is more relevant
then the frequency per se; in
massive samples could profile
extremely rare variants.
| Low-pass sequencing increases association power by 10-100%
compared to commonly-used genotyping arrays, particularly in
non-European populations
Summary
| Low-pass sequencing allows for discovery of new/rare
variants, particularly at large sample sizes
| Additional applications: combining low-pass sequencing with
exon capture allows for joint clinical assays of rare and common
variation
THANKS!
Tomaz Berisa

Kaja Wasik

Maria Vazquez
Charlie Cox
Dana Fraser
Karen King
Joe Pickrell | @joe_pickrell | joe@gencove.com
Gencove-GSK results

More Related Content

Similar to Gencove-GSK results

OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009Sean Davis
 
Lecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generationLecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generationMohamedHasan816582
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data AnalysisRavi Gandham
 
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...Matthias Samwald
 
20160219 - S. De Toffol - Dal Sanger al NGS nello studio delle mutazioni BRCA
20160219 - S. De Toffol -  Dal Sanger al NGS nello studio delle mutazioni BRCA �20160219 - S. De Toffol -  Dal Sanger al NGS nello studio delle mutazioni BRCA �
20160219 - S. De Toffol - Dal Sanger al NGS nello studio delle mutazioni BRCA Roberto Scarafia
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...VHIR Vall d’Hebron Institut de Recerca
 
Axiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array PlateAxiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array PlateAffymetrix
 
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...Ilya Klabukov
 
Axiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array PlateAxiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array PlateAffymetrix
 
Microhaplotype, A Powerful New Type of Genetic Marker
Microhaplotype, A Powerful New Type of Genetic MarkerMicrohaplotype, A Powerful New Type of Genetic Marker
Microhaplotype, A Powerful New Type of Genetic MarkerMojgan Talebian
 
The Origin of Ashkenazi Levites
The Origin of Ashkenazi Levites The Origin of Ashkenazi Levites
The Origin of Ashkenazi Levites Family Tree DNA
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNAUlises Urzua
 
FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochureAffymetrix
 
Aug2013 illumina platinum genomes
Aug2013 illumina platinum genomesAug2013 illumina platinum genomes
Aug2013 illumina platinum genomesGenomeInABottle
 
Best practices for genotyping analysis of plant and animal genomes with Affym...
Best practices for genotyping analysis of plant and animal genomes with Affym...Best practices for genotyping analysis of plant and animal genomes with Affym...
Best practices for genotyping analysis of plant and animal genomes with Affym...Affymetrix
 
Axiom® Biobank Genotyping Arrays
Axiom® Biobank Genotyping ArraysAxiom® Biobank Genotyping Arrays
Axiom® Biobank Genotyping ArraysAffymetrix
 
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANACELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANARoberto Scarafia
 
Genotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataGenotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataFAO
 
Bioinformatics and NGS for advancing in hearing loss research
Bioinformatics and NGS for advancing in hearing loss researchBioinformatics and NGS for advancing in hearing loss research
Bioinformatics and NGS for advancing in hearing loss researchJoaquin Dopazo
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationJoaquin Dopazo
 

Similar to Gencove-GSK results (20)

OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
 
Lecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generationLecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generation
 
RNA Seq Data Analysis
RNA Seq Data AnalysisRNA Seq Data Analysis
RNA Seq Data Analysis
 
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
One man's *1 is another man's *13? Trouble with nomenclatures in personalized...
 
20160219 - S. De Toffol - Dal Sanger al NGS nello studio delle mutazioni BRCA
20160219 - S. De Toffol -  Dal Sanger al NGS nello studio delle mutazioni BRCA �20160219 - S. De Toffol -  Dal Sanger al NGS nello studio delle mutazioni BRCA �
20160219 - S. De Toffol - Dal Sanger al NGS nello studio delle mutazioni BRCA
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
 
Axiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array PlateAxiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array Plate
 
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
Next Generation Diagnostics: Potential Clinical Applications of Illumina’sTec...
 
Axiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array PlateAxiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array Plate
 
Microhaplotype, A Powerful New Type of Genetic Marker
Microhaplotype, A Powerful New Type of Genetic MarkerMicrohaplotype, A Powerful New Type of Genetic Marker
Microhaplotype, A Powerful New Type of Genetic Marker
 
The Origin of Ashkenazi Levites
The Origin of Ashkenazi Levites The Origin of Ashkenazi Levites
The Origin of Ashkenazi Levites
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
 
FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochure
 
Aug2013 illumina platinum genomes
Aug2013 illumina platinum genomesAug2013 illumina platinum genomes
Aug2013 illumina platinum genomes
 
Best practices for genotyping analysis of plant and animal genomes with Affym...
Best practices for genotyping analysis of plant and animal genomes with Affym...Best practices for genotyping analysis of plant and animal genomes with Affym...
Best practices for genotyping analysis of plant and animal genomes with Affym...
 
Axiom® Biobank Genotyping Arrays
Axiom® Biobank Genotyping ArraysAxiom® Biobank Genotyping Arrays
Axiom® Biobank Genotyping Arrays
 
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANACELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
CELL - FREE DNA TEST: ASPETTI EMERGENTI NELLA PRATICA QUOTIDIANA
 
Genotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataGenotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary data
 
Bioinformatics and NGS for advancing in hearing loss research
Bioinformatics and NGS for advancing in hearing loss researchBioinformatics and NGS for advancing in hearing loss research
Bioinformatics and NGS for advancing in hearing loss research
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 

Recently uploaded

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 

Recently uploaded (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

Gencove-GSK results

  • 1. Imputation and de novo variant discovery from low-pass whole genome sequencing data for cost-effective and scalable trait mapping Joe Pickrell @joe_pickrell | joe@gencove.com
  • 2. WHOLE GENOME SEQUENCING EXOME SEQUENCING What technology to use to measure genotypes? SNP ARRAY O ($1000) PROS: comprehensive CONS: expensive, often overkill O ($100) PROS: comprehensive in exons CONS: completely misses non- coding variations O ($10) PROS: cost-effective, well-tested CONS: no new variant discovery (e.g. rare or population-specific), for cost-effectiveness overall genome coverage suffers
  • 3. LOW-PASS SEQUENCING What technology to use to measure genotypes? SNP ARRAY O ($10) Sequencing technologies allow for new variant discovery, high discovery power across the genome. O ($10) PROS: cost-effective, well-tested CONS: no new variant discovery (e.g. rare or population-specific), for cost-effectiveness overall genome coverage suffers + 💻 =
  • 4. Shotgun sequence a human genome to (usually) 0.4x or 1x coverage, and use computational methods to ‘fill in’ anything we missed. What is low-pass sequencing? INTUITION 0.4x coverage = one sequencing read at ~30M SNPs Genotyping array = excellent measurement of 0.5M SNPs
  • 5. Why now? 2018: nominal price/Mb of sequence is <$0.01 The challenging part of low-pass sequencing is not sequencing per se
  • 6. 1. Cost of commercial library prep kits or outsourcing is higher (sometimes considerably higher) than the cost of sequencing. The challenges in low-pass sequencing are in sample preparation and analysis 2. Going from a fastq file of low-pass sequences to genetic variant calls is non-trivial and no standard software exists.
  • 7. 1. Divide the 1000 Genomes dataset in two How does imputation from low-pass sequencing compare to imputation from arrays? 2. Simulate low-pass sequencing (or genotyping from a few commonly-used commercial arrays) from one half, impute from the other
  • 8. In an African population, low-pass sequencing increases effective power by ~50-100%
  • 9. In a European population, low-pass sequencing increases effective power by ~10-20%
  • 10. How does imputation from low-pass sequencing compare to imputation from arrays? Up to now this is all simulations. What about in practice?
  • 11. How does imputation from low-pass sequencing compare to imputation from arrays? 79 European-ancestry individuals sequenced to ~1x coverage Downsampled to 0.4x, 0.6x, 0.8x Genotyped on the Affymetrix Axiom Biobank Precision Medicine Research Array, around 800k SNPs Collaboration with Charlie Cox, GSK
  • 12. High concordance between genotyping array and imputed low- pass genome sequences Concordance at non-reference genotypes at non-reference genotypes
  • 13. Low-pass sequencing increases power relative to the PMR array
  • 14. Can low-pass sequencing be used to discover variants? Ignore genotype data, call variants from the sequencing reads alone What fraction of polymorphic variants are identified?
  • 15. Can low-pass sequencing be used to discover variants? With 1x sequencing, variants present in >10 copies are discovered. The absolute number of copies of the variant is more relevant then the frequency per se; in massive samples could profile extremely rare variants.
  • 16. | Low-pass sequencing increases association power by 10-100% compared to commonly-used genotyping arrays, particularly in non-European populations Summary | Low-pass sequencing allows for discovery of new/rare variants, particularly at large sample sizes | Additional applications: combining low-pass sequencing with exon capture allows for joint clinical assays of rare and common variation
  • 17. THANKS! Tomaz Berisa
 Kaja Wasik
 Maria Vazquez Charlie Cox Dana Fraser Karen King Joe Pickrell | @joe_pickrell | joe@gencove.com