SlideShare a Scribd company logo
1/29/18
1
Human Structural Genomic Variation
Charles Lee, Ph.D., FACMG
SCIENTIFIC DIRECTOR AND PROFESSOR
The Jackson Laboratory for Genomic Medicine
January 25, 2018
Conclusion of the 1000G Project: Published September 30, 2015
1/29/18
2
Steven	McCarroll
Deanna	Church
Jonas	Korlach
Scott	Devine
Ali	Bashir Ryan	Mills
Ken	Chen
Kai	Ye
Mark	Gerstein
Gabor	Marth
Jonathan	Sebat
Michael	Talkowski
Goo	Jun
Li	Ding
Bing	Ren
Paul	Flicek Pui-Yan	Kwok
Peter	Lansdorp
Xinghua ShiTobias	Marschall
Jan	Korbel
Evan	Eichler
Chong	Lek Koh
Charles	Lee
HGSV: Human Genome Structural Variation Consortium
PRINCIPAL	INVESTIGATORS:
HGSV: Strategy
Illumina	Hiseq
Jumping	Libraries
Pac	Bio
Chrom (10X)
Illumina	(Moleculo)
Hi-C
Strand	Seq
BioNano
Affy Cytoscan
Short reads
High error rate
Low base calling accuracy
Not true long read
Cost
High library cost
Long run time
Not commercially available
WGS remains a challenge
Low sequence quality
GC bias
CONS
Minimal sample
FastCapable of long range phasing
True long reads
DNA-protein interactions
Easy sample prep
High quality de novo assembly
DNA methylation analysis
High quality
No assembly required
High consensus
Uniform coverage
PROS
1/29/18
3
HGSV: Selected Trios
GOAL:	to	utilize	multiple	platforms	for	comprehensive	discovery	
of	structural	variations	in	selected	individual	genomes
NA19240 HG00514 HG00733
NA19239 NA19238
Yoruban Han	Chinese Puerto	Rican
HG00512 HG00513 HG00731 HG00732
(high	genetic	diversity) (low	genetic	diversity) (population	admixture)
Parents		were	previously	sequenced	in	1000G
Children	sequenced	in	HGSV	to	223-fold	combined	coverage
Illumina HiSeq
Short	read	sequencing,	deep	coverage	PCR-free	(bridge	amplification)	
Sequences	reads	(approximately	150bp)	from	both	ends	of	a	DNA	fragment	=	PAIRED	END
Three	approaches	for	SV	detection	(using	paired-end	sequencing	data):
READ DEPTH
PAIRED END MAPPING
SPLIT MAPPING
* Short Read Sequencing
1/29/18
4
PacBio SMRT Sequencing
Single	Molecule	Real	Time	(SMRT)	long	insert	sequencing,	no	amplification	required
Sequences	long	and	complex	regions
* Long Read Sequencing
True	long	read	sequencing,	average	read	size	>10,000	bases
Bio Nano Video
Bio Nano Genomics Irys * Optical Mapping
Analyzes	single,	linearized	DNA	molecule	(no	amplification	required)
Optical	mapping	=	complimentary	way	to	interrogate	genome	for	SVs	
Whole	genome	maps	enable	detection	of	translocations,	repeats	and	deletions
through	de	novo	assembly
1/29/18
5
Illumina Moleculo
Generate	long	reads	from	existing	tagged	short	reads,	synthetic	length	is	~100kb
Genome	is	sheared	to	10kb	fragments,	which	are	tagged	with	unique	barcode
* Synthetic Long Reads
Long	reads	allow	for	de	novo	assembly
BARCODE:	A	series	of	known	bases	added	to	template	molecule	(through	ligation	or	amplification).
Chromium (10X Genomics)
Long	range	genomic	cell-by-cell	gene	expression	(low	DNA	input:	1ng)	
* Synthetic Long Reads
Long	range	information	enables	phasing,	SV	calling	and	copy	number	determination
10X Video
Standard	short	reads	are	attached	with	molecular	barcodes,	synthetic	length	is	~100kb
1/29/18
6
Strand-Seq
1 2 3 96
BrdU incorporated	
into	new	strands
MitosisReplication
Cell Daughter	Cells Isolate	and	deposit	single	daughter	
cell	into	single	well
MNase
digestion
Fragment	DNA
to	150bp
Ligate	adaptors	
Hoechst-UV	
Treatment
BrdU+	strands	nicked
Remove	BrdU+	strands
PCR
Amplification
Only	template	strands	
are	amplified
Introduce	multiplex	barcode	
during	PCR	amplification
Sequence	and	
Ideogram
B B B
Template
B B B
Template
B B B
Template
B B B
Template
A2 A1
Template
A2’ A1’
A1’ A2’
Template
A1 A2
Create	paired-end	
sequencing	libraries	
Create	Ideogram
Falconer	et	al.	Nature	Methods	9:	1107-12,	2012;	Sanders	et	al.	Genome	Res	26:	1575-1587,	2016
Haplotype	A
Haplotype	B
10kb	reads
Partitioning PacBio Reads to Haplotypes
67%	of	reads	could	be	haplotype	partitioned
1/29/18
7
Integrative Phasing
Illumina
Illumina-SLR
PacBio
Chro
HiC
Strand-Seq
WhatsHap
Dense,	global	(chromosomal	scale)	haplotypes
DENSE	+	LOCAL SPARSE	+	GLOBAL
Fraction of Phase Connection (power to call and resolve haplotypes)
heterozygous heterozygous heterozygous
Illumina
PacBio
Chromium	(10X)
Strand-Seq
F.P.C.%
68%
95%
92%
65%
C G A
C G A
C AG
G
G A
Chromium	(10X)	
+	Strand-Seq C G A
Lowest	mismatch	error	rate	(0.23%).		96.5%	of	all	heterozygous	SNVs	were	phased.
1/29/18
8
Discovering Indels (1-49bp)
Illumina
Short	reads
818,181
Indels/child
PacBio
Long	reads
Increase:
119,274
Phased-SV
698,907
Indels/child
GATK
FreeBays
Pindel
Discovering SVs (>50bp): Unified Callsets
Illumina
Short	reads
Increase:
20,963SVs
BNGPacBio
Long	reads Optical	Mapping
12,680 Deletions	
18,919 Insertions
31,599 SVs	(avg/individual)
Various	algorithms
(Assemble/Map	,	Call	Variants)
Haplotype-resolved	Phased	SV
6808 Deletions	
3035 Insertions
793 Duplications
10,636 SVs	(avg/individual)
11	Algorithms
(Assemble/Map	,	Call	Variants)
1/29/18
9
NA19240
Unified Callset for SV (DELETIONS)
7176
90
202
1186
5109
60
721
PacBio
(n=13066)
Illumina
(n=7218)
BioNano
(n=1073)
NA19240
Unified Callset for SV (INSERTIONS)
13443
435
121
1365
2939
1073
87
PacBio
(n=17542)
Illumina
(n=4512)
BioNano
(n=1716)
1/29/18
10
PacBio Assemblies have Reduced Ability to Detect CNVs
~341	genes/child	had	a	CNV	that	was	NOT	detected	by	PacBio assembly
Therefore	it’s	important	to	continue	read-depth	based	CNV	detection
24.9%of	all	SD’s	detected	by	PacBio
All	Known
Segmental	Duplications
All	CNVs	detected	
by	dCGH
All	CNVs	detected	
by	GenomeSTRiP
6.2%of	all	CNVs	detected	by	PacBio
27%of	all	CNVs	detected	by	PacBio
Discovering Inversions
BNGIllumina PacBio
Short	reads,	WGS Long	read
Various	algorithms
(Assemble/Map	,	Call	Variants)
Optical	Mapping
Illumina
Jumping	library
(liWGS)
Strand	Seq
126 91 118 28 170
306 Inversions
(across	all	9	individuals)
121 Simple	Inversions
(per	person)
1/29/18
11
Inversion Calls in Trios, Based on Technologies
81	heterozygous
38	homozygous
Smallest:	263	bp (ch1:187497343-187497605)
Largest:	1.28Mb	(chr8:	8230468-12095842)
121 Simple	Inversions
(per	person)
Technology
Illumina
Jumping
PacBio
Strand-seq
Bionano
Refining Meiotic Breakpoints
Strand-seq
~25	kbp
Meiotic	breakpoint
median	resolution:
~1.5	kbp
PacBio +	Strand-seq
(trio-aware	phasing)
Maternal	meiotic	recombination	events
Alu retrotransposons	(p=3.07E-7)
THE1-A,B	(p=0.08)	
15-mer	motif
1/29/18
12
Discovering Full-length L1s
On	average	~190	FL-L1	(with	2	ORFs)	per	child
124	of	these	FL-L1s	were	found	in	all	3	children
- Some	are	known	active	(hot)	L1	elements	associated	with	disease	such	as	cancer.
- 4-6	unique	active	(hot)	L1s	found	in	each	genome.	
Cumulative	differences	in	L1	mutagenesis	suggest	that	
such	diversity	may	translate	into	differential	risk	levels	
for	cancers	and	other	disorders.	
Take Home Message
Combining	different	technologies/algorithms,	we	have	identified	
7x	more	structural	variation	per	person	than	was	previously	appreciated
No	single	technology	/	algorithm	will	be	comprehensive
Application	to	disease	association	studies	and	clinical	interpretation	
of	genomes
• Average	of	818,181	indels (<50	bp)	per	person
• Average	of	31,599	SVs	(>50	bp)	per	person
• Average	of	121	inversions	per	person	(simple	+	complex)
• Active	LINE-1	elements
1/29/18
13
Discovery of Haplotype-resolved SVs - bioRxiv

More Related Content

What's hot

Lecaut et al 2012
Lecaut et al 2012Lecaut et al 2012
Lecaut et al 2012
Fran Flores
 
Editing genome
Editing genomeEditing genome
Editing genome
jennifferco
 
Mining heterogeneous data: Understanding systems at the level of complexes an...
Mining heterogeneous data: Understanding systems at the level of complexes an...Mining heterogeneous data: Understanding systems at the level of complexes an...
Mining heterogeneous data: Understanding systems at the level of complexes an...
Lars Juhl Jensen
 
Targeting the $100 genome | Insight, issue 5
Targeting the $100 genome | Insight, issue 5Targeting the $100 genome | Insight, issue 5
Targeting the $100 genome | Insight, issue 5
Team Consulting Ltd
 
Δρ Χαράλαμπος Πιτσαλίδης, 3rd Health Innovation Conference
Δρ Χαράλαμπος Πιτσαλίδης, 3rd Health Innovation ConferenceΔρ Χαράλαμπος Πιτσαλίδης, 3rd Health Innovation Conference
Δρ Χαράλαμπος Πιτσαλίδης, 3rd Health Innovation Conference
Starttech Ventures
 
The Genomics Revolution: The Good, The Bad, and The Ugly
The Genomics Revolution: The Good, The Bad, and The UglyThe Genomics Revolution: The Good, The Bad, and The Ugly
The Genomics Revolution: The Good, The Bad, and The Ugly
Emiliano De Cristofaro
 
2nd CRISPR Congress Boston, 23-25 February 2016
2nd CRISPR Congress Boston, 23-25 February 2016 2nd CRISPR Congress Boston, 23-25 February 2016
2nd CRISPR Congress Boston, 23-25 February 2016
Diane McKenna
 
Genetic engineers and molecular factories | Insight, issue 3
Genetic engineers and molecular factories | Insight, issue 3Genetic engineers and molecular factories | Insight, issue 3
Genetic engineers and molecular factories | Insight, issue 3
Team Consulting Ltd
 
Discuss an example of knockout mouse model used for disease modelling (Metast...
Discuss an example of knockout mouse model used for disease modelling (Metast...Discuss an example of knockout mouse model used for disease modelling (Metast...
Discuss an example of knockout mouse model used for disease modelling (Metast...
SaniikaRenganadan
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
QIAGEN
 
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practiceAug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
GenomeInABottle
 
Evolutionconnection replication
Evolutionconnection replicationEvolutionconnection replication
Evolutionconnection replication
Elsa von Licy
 
Novocus Legal LLP Nanomaterial news update 25-31 march 2018
Novocus Legal LLP Nanomaterial news update 25-31 march 2018Novocus Legal LLP Nanomaterial news update 25-31 march 2018
Novocus Legal LLP Nanomaterial news update 25-31 march 2018
Ruchica Kumar
 
10 oncologic research
10   oncologic research10   oncologic research
10 oncologic research
Juan R Farro
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
mhaendel
 
Achieve improved variant detection in single cell sequencing infographic
Achieve improved variant detection in single cell sequencing infographicAchieve improved variant detection in single cell sequencing infographic
Achieve improved variant detection in single cell sequencing infographic
QIAGEN
 
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi RehmSupporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Knome_Inc
 
Biosecurity Challenges in the Genome Engineering Space – Beth Vitalis
Biosecurity Challenges in the Genome Engineering Space – Beth VitalisBiosecurity Challenges in the Genome Engineering Space – Beth Vitalis
Biosecurity Challenges in the Genome Engineering Space – Beth Vitalis
Catalyst Biosummit
 
Pegs Europe 2015 Protein & Antibody Engineering Summit
Pegs Europe 2015   Protein & Antibody Engineering SummitPegs Europe 2015   Protein & Antibody Engineering Summit
Pegs Europe 2015 Protein & Antibody Engineering Summit
Nicole Proulx
 
Supercomputing Your Inner Microbiome
Supercomputing Your Inner MicrobiomeSupercomputing Your Inner Microbiome
Supercomputing Your Inner Microbiome
Nicole McLaughlin
 

What's hot (20)

Lecaut et al 2012
Lecaut et al 2012Lecaut et al 2012
Lecaut et al 2012
 
Editing genome
Editing genomeEditing genome
Editing genome
 
Mining heterogeneous data: Understanding systems at the level of complexes an...
Mining heterogeneous data: Understanding systems at the level of complexes an...Mining heterogeneous data: Understanding systems at the level of complexes an...
Mining heterogeneous data: Understanding systems at the level of complexes an...
 
Targeting the $100 genome | Insight, issue 5
Targeting the $100 genome | Insight, issue 5Targeting the $100 genome | Insight, issue 5
Targeting the $100 genome | Insight, issue 5
 
Δρ Χαράλαμπος Πιτσαλίδης, 3rd Health Innovation Conference
Δρ Χαράλαμπος Πιτσαλίδης, 3rd Health Innovation ConferenceΔρ Χαράλαμπος Πιτσαλίδης, 3rd Health Innovation Conference
Δρ Χαράλαμπος Πιτσαλίδης, 3rd Health Innovation Conference
 
The Genomics Revolution: The Good, The Bad, and The Ugly
The Genomics Revolution: The Good, The Bad, and The UglyThe Genomics Revolution: The Good, The Bad, and The Ugly
The Genomics Revolution: The Good, The Bad, and The Ugly
 
2nd CRISPR Congress Boston, 23-25 February 2016
2nd CRISPR Congress Boston, 23-25 February 2016 2nd CRISPR Congress Boston, 23-25 February 2016
2nd CRISPR Congress Boston, 23-25 February 2016
 
Genetic engineers and molecular factories | Insight, issue 3
Genetic engineers and molecular factories | Insight, issue 3Genetic engineers and molecular factories | Insight, issue 3
Genetic engineers and molecular factories | Insight, issue 3
 
Discuss an example of knockout mouse model used for disease modelling (Metast...
Discuss an example of knockout mouse model used for disease modelling (Metast...Discuss an example of knockout mouse model used for disease modelling (Metast...
Discuss an example of knockout mouse model used for disease modelling (Metast...
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
 
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practiceAug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
 
Evolutionconnection replication
Evolutionconnection replicationEvolutionconnection replication
Evolutionconnection replication
 
Novocus Legal LLP Nanomaterial news update 25-31 march 2018
Novocus Legal LLP Nanomaterial news update 25-31 march 2018Novocus Legal LLP Nanomaterial news update 25-31 march 2018
Novocus Legal LLP Nanomaterial news update 25-31 march 2018
 
10 oncologic research
10   oncologic research10   oncologic research
10 oncologic research
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
 
Achieve improved variant detection in single cell sequencing infographic
Achieve improved variant detection in single cell sequencing infographicAchieve improved variant detection in single cell sequencing infographic
Achieve improved variant detection in single cell sequencing infographic
 
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi RehmSupporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi Rehm
 
Biosecurity Challenges in the Genome Engineering Space – Beth Vitalis
Biosecurity Challenges in the Genome Engineering Space – Beth VitalisBiosecurity Challenges in the Genome Engineering Space – Beth Vitalis
Biosecurity Challenges in the Genome Engineering Space – Beth Vitalis
 
Pegs Europe 2015 Protein & Antibody Engineering Summit
Pegs Europe 2015   Protein & Antibody Engineering SummitPegs Europe 2015   Protein & Antibody Engineering Summit
Pegs Europe 2015 Protein & Antibody Engineering Summit
 
Supercomputing Your Inner Microbiome
Supercomputing Your Inner MicrobiomeSupercomputing Your Inner Microbiome
Supercomputing Your Inner Microbiome
 

Similar to Charles lee human genome structural variant consortium update

Systems biology - Understanding biology at the systems level
Systems biology - Understanding biology at the systems levelSystems biology - Understanding biology at the systems level
Systems biology - Understanding biology at the systems level
Lars Juhl Jensen
 
Large-scale integration of data and text
Large-scale integration of data and textLarge-scale integration of data and text
Large-scale integration of data and text
Lars Juhl Jensen
 
Hao Liu Resume 2017-02
Hao Liu Resume 2017-02Hao Liu Resume 2017-02
Hao Liu Resume 2017-02
Hao Liu
 
Integration of biomedical literature and databases
Integration of biomedical literature and databasesIntegration of biomedical literature and databases
Integration of biomedical literature and databases
Lars Juhl Jensen
 
Bio Lab Paper
Bio Lab PaperBio Lab Paper
Bio Lab Paper
tclythgoe84
 
20190615-資料科學與基因體研究的應用
20190615-資料科學與基因體研究的應用20190615-資料科學與基因體研究的應用
20190615-資料科學與基因體研究的應用
tsai cheng yu
 
Cell centered database for immunology and cancer research feb252016
Cell centered database for immunology and cancer research feb252016Cell centered database for immunology and cancer research feb252016
Cell centered database for immunology and cancer research feb252016
Ann-Marie Roche
 
Laura Ann Kelley_CV1
Laura Ann Kelley_CV1Laura Ann Kelley_CV1
Laura Ann Kelley_CV1
Laura Kelley
 
Integration of heterogeneous data
Integration of heterogeneous dataIntegration of heterogeneous data
Integration of heterogeneous data
Lars Juhl Jensen
 
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
jennomics
 
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
David Cook
 
OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
Sean Davis
 
Exploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCHExploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCH
biocs
 
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease DiscoveryData Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
mhaendel
 
Impact of advanced technology in biology
Impact of advanced technology in biologyImpact of advanced technology in biology
Impact of advanced technology in biology
The Knowledge Review
 
Caulder - DIVOS BioITWorld 2015
Caulder - DIVOS BioITWorld 2015Caulder - DIVOS BioITWorld 2015
Caulder - DIVOS BioITWorld 2015
Dana Caulder
 
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome SequencingMicrobial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Jonathan Eisen
 
Network biology: Large-scale data and text mining
Network biology: Large-scale data and text miningNetwork biology: Large-scale data and text mining
Network biology: Large-scale data and text mining
Lars Juhl Jensen
 
Systems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systemsSystems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systems
Lars Juhl Jensen
 
Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2
Robert Gellibolian, Ph.D
 

Similar to Charles lee human genome structural variant consortium update (20)

Systems biology - Understanding biology at the systems level
Systems biology - Understanding biology at the systems levelSystems biology - Understanding biology at the systems level
Systems biology - Understanding biology at the systems level
 
Large-scale integration of data and text
Large-scale integration of data and textLarge-scale integration of data and text
Large-scale integration of data and text
 
Hao Liu Resume 2017-02
Hao Liu Resume 2017-02Hao Liu Resume 2017-02
Hao Liu Resume 2017-02
 
Integration of biomedical literature and databases
Integration of biomedical literature and databasesIntegration of biomedical literature and databases
Integration of biomedical literature and databases
 
Bio Lab Paper
Bio Lab PaperBio Lab Paper
Bio Lab Paper
 
20190615-資料科學與基因體研究的應用
20190615-資料科學與基因體研究的應用20190615-資料科學與基因體研究的應用
20190615-資料科學與基因體研究的應用
 
Cell centered database for immunology and cancer research feb252016
Cell centered database for immunology and cancer research feb252016Cell centered database for immunology and cancer research feb252016
Cell centered database for immunology and cancer research feb252016
 
Laura Ann Kelley_CV1
Laura Ann Kelley_CV1Laura Ann Kelley_CV1
Laura Ann Kelley_CV1
 
Integration of heterogeneous data
Integration of heterogeneous dataIntegration of heterogeneous data
Integration of heterogeneous data
 
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
 
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
 
OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
 
Exploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCHExploring proteins, chemicals and their interactions with STRING and STITCH
Exploring proteins, chemicals and their interactions with STRING and STITCH
 
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease DiscoveryData Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
 
Impact of advanced technology in biology
Impact of advanced technology in biologyImpact of advanced technology in biology
Impact of advanced technology in biology
 
Caulder - DIVOS BioITWorld 2015
Caulder - DIVOS BioITWorld 2015Caulder - DIVOS BioITWorld 2015
Caulder - DIVOS BioITWorld 2015
 
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome SequencingMicrobial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
 
Network biology: Large-scale data and text mining
Network biology: Large-scale data and text miningNetwork biology: Large-scale data and text mining
Network biology: Large-scale data and text mining
 
Systems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systemsSystems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systems
 
Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2
 

More from GenomeInABottle

2023 GIAB AMP Update
2023 GIAB AMP Update2023 GIAB AMP Update
2023 GIAB AMP Update
GenomeInABottle
 
GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023
GenomeInABottle
 
Stratomod ASHG 2023
Stratomod ASHG 2023Stratomod ASHG 2023
Stratomod ASHG 2023
GenomeInABottle
 
GIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdfGIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdf
GenomeInABottle
 
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
GenomeInABottle
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
GenomeInABottle
 
Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...
GenomeInABottle
 
GIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussionGIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussion
GenomeInABottle
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GenomeInABottle
 
Giab agbt small_var_2020
Giab agbt small_var_2020Giab agbt small_var_2020
Giab agbt small_var_2020
GenomeInABottle
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM Forum
GenomeInABottle
 
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GHGa4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
GenomeInABottle
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant poster
GenomeInABottle
 
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATKGIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GenomeInABottle
 
GIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant poster
GenomeInABottle
 
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GenomeInABottle
 
Jason Chin MHC diploid assembly
Jason Chin MHC diploid assemblyJason Chin MHC diploid assembly
Jason Chin MHC diploid assembly
GenomeInABottle
 
GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015
GenomeInABottle
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
GenomeInABottle
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
GenomeInABottle
 

More from GenomeInABottle (20)

2023 GIAB AMP Update
2023 GIAB AMP Update2023 GIAB AMP Update
2023 GIAB AMP Update
 
GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023
 
Stratomod ASHG 2023
Stratomod ASHG 2023Stratomod ASHG 2023
Stratomod ASHG 2023
 
GIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdfGIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdf
 
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
 
Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...
 
GIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussionGIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussion
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
 
Giab agbt small_var_2020
Giab agbt small_var_2020Giab agbt small_var_2020
Giab agbt small_var_2020
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM Forum
 
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GHGa4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant poster
 
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATKGIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
 
GIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant poster
 
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
 
Jason Chin MHC diploid assembly
Jason Chin MHC diploid assemblyJason Chin MHC diploid assembly
Jason Chin MHC diploid assembly
 
GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
 

Recently uploaded

pharmacology for dummies free pdf download.pdf
pharmacology for dummies free pdf download.pdfpharmacology for dummies free pdf download.pdf
pharmacology for dummies free pdf download.pdf
KerlynIgnacio
 
PGx Analysis in VarSeq: A User’s Perspective
PGx Analysis in VarSeq: A User’s PerspectivePGx Analysis in VarSeq: A User’s Perspective
PGx Analysis in VarSeq: A User’s Perspective
Golden Helix
 
Tele Optometry (kunj'sppt) / Basics of tele optometry.
Tele Optometry (kunj'sppt) / Basics of tele optometry.Tele Optometry (kunj'sppt) / Basics of tele optometry.
Tele Optometry (kunj'sppt) / Basics of tele optometry.
Kunj Vihari
 
Ageing, the Elderly, Gerontology and Public Health
Ageing, the Elderly, Gerontology and Public HealthAgeing, the Elderly, Gerontology and Public Health
Ageing, the Elderly, Gerontology and Public Health
phuakl
 
Call Girls In Mumbai +91-7426014248 High Profile Call Girl Mumbai
Call Girls In Mumbai +91-7426014248 High Profile Call Girl MumbaiCall Girls In Mumbai +91-7426014248 High Profile Call Girl Mumbai
Call Girls In Mumbai +91-7426014248 High Profile Call Girl Mumbai
Mobile Problem
 
Pharmacology of 5-hydroxytryptamine and Antagonist
Pharmacology of 5-hydroxytryptamine and AntagonistPharmacology of 5-hydroxytryptamine and Antagonist
Pharmacology of 5-hydroxytryptamine and Antagonist
Dr. Nikhilkumar Sakle
 
Top Travel Vaccinations in Manchester
Top Travel Vaccinations in ManchesterTop Travel Vaccinations in Manchester
Top Travel Vaccinations in Manchester
NX Healthcare
 
Artificial Intelligence Symposium (THAIS)
Artificial Intelligence Symposium (THAIS)Artificial Intelligence Symposium (THAIS)
Artificial Intelligence Symposium (THAIS)
Josep Vidal-Alaball
 
Histololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptxHistololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptx
AyeshaZaid1
 
biomechanics of running. Dr.dhwani.pptx
biomechanics of running.   Dr.dhwani.pptxbiomechanics of running.   Dr.dhwani.pptx
biomechanics of running. Dr.dhwani.pptx
Dr. Dhwani kawedia
 
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.GawadHemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
NephroTube - Dr.Gawad
 
NARCOTICS- POLICY AND PROCEDURES FOR ITS USE
NARCOTICS- POLICY AND PROCEDURES FOR ITS USENARCOTICS- POLICY AND PROCEDURES FOR ITS USE
NARCOTICS- POLICY AND PROCEDURES FOR ITS USE
Dr. Ahana Haroon
 
Osvaldo Bernardo Muchanga-GASTROINTESTINAL INFECTIONS AND GASTRITIS-2024.pdf
Osvaldo Bernardo Muchanga-GASTROINTESTINAL INFECTIONS AND GASTRITIS-2024.pdfOsvaldo Bernardo Muchanga-GASTROINTESTINAL INFECTIONS AND GASTRITIS-2024.pdf
Osvaldo Bernardo Muchanga-GASTROINTESTINAL INFECTIONS AND GASTRITIS-2024.pdf
Osvaldo Bernardo Muchanga
 
Breast cancer: Post menopausal endocrine therapy
Breast cancer: Post menopausal endocrine therapyBreast cancer: Post menopausal endocrine therapy
Breast cancer: Post menopausal endocrine therapy
Dr. Sumit KUMAR
 
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấuK CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
HongBiThi1
 
NAVIGATING THE HORIZONS OF TIME LAPSE EMBRYO MONITORING.pdf
NAVIGATING THE HORIZONS OF TIME LAPSE EMBRYO MONITORING.pdfNAVIGATING THE HORIZONS OF TIME LAPSE EMBRYO MONITORING.pdf
NAVIGATING THE HORIZONS OF TIME LAPSE EMBRYO MONITORING.pdf
Rahul Sen
 
The Nervous and Chemical Regulation of Respiration
The Nervous and Chemical Regulation of RespirationThe Nervous and Chemical Regulation of Respiration
The Nervous and Chemical Regulation of Respiration
MedicoseAcademics
 
Acute Gout Care & Urate Lowering Therapy .pdf
Acute Gout Care & Urate Lowering Therapy .pdfAcute Gout Care & Urate Lowering Therapy .pdf
Acute Gout Care & Urate Lowering Therapy .pdf
Jim Jacob Roy
 
Travel Clinic Cardiff: Health Advice for International Travelers
Travel Clinic Cardiff: Health Advice for International TravelersTravel Clinic Cardiff: Health Advice for International Travelers
Travel Clinic Cardiff: Health Advice for International Travelers
NX Healthcare
 
Full Handwritten notes of RA by Ayush Kumar M pharm - Al ameen college of pha...
Full Handwritten notes of RA by Ayush Kumar M pharm - Al ameen college of pha...Full Handwritten notes of RA by Ayush Kumar M pharm - Al ameen college of pha...
Full Handwritten notes of RA by Ayush Kumar M pharm - Al ameen college of pha...
ayushrajshrivastava7
 

Recently uploaded (20)

pharmacology for dummies free pdf download.pdf
pharmacology for dummies free pdf download.pdfpharmacology for dummies free pdf download.pdf
pharmacology for dummies free pdf download.pdf
 
PGx Analysis in VarSeq: A User’s Perspective
PGx Analysis in VarSeq: A User’s PerspectivePGx Analysis in VarSeq: A User’s Perspective
PGx Analysis in VarSeq: A User’s Perspective
 
Tele Optometry (kunj'sppt) / Basics of tele optometry.
Tele Optometry (kunj'sppt) / Basics of tele optometry.Tele Optometry (kunj'sppt) / Basics of tele optometry.
Tele Optometry (kunj'sppt) / Basics of tele optometry.
 
Ageing, the Elderly, Gerontology and Public Health
Ageing, the Elderly, Gerontology and Public HealthAgeing, the Elderly, Gerontology and Public Health
Ageing, the Elderly, Gerontology and Public Health
 
Call Girls In Mumbai +91-7426014248 High Profile Call Girl Mumbai
Call Girls In Mumbai +91-7426014248 High Profile Call Girl MumbaiCall Girls In Mumbai +91-7426014248 High Profile Call Girl Mumbai
Call Girls In Mumbai +91-7426014248 High Profile Call Girl Mumbai
 
Pharmacology of 5-hydroxytryptamine and Antagonist
Pharmacology of 5-hydroxytryptamine and AntagonistPharmacology of 5-hydroxytryptamine and Antagonist
Pharmacology of 5-hydroxytryptamine and Antagonist
 
Top Travel Vaccinations in Manchester
Top Travel Vaccinations in ManchesterTop Travel Vaccinations in Manchester
Top Travel Vaccinations in Manchester
 
Artificial Intelligence Symposium (THAIS)
Artificial Intelligence Symposium (THAIS)Artificial Intelligence Symposium (THAIS)
Artificial Intelligence Symposium (THAIS)
 
Histololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptxHistololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptx
 
biomechanics of running. Dr.dhwani.pptx
biomechanics of running.   Dr.dhwani.pptxbiomechanics of running.   Dr.dhwani.pptx
biomechanics of running. Dr.dhwani.pptx
 
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.GawadHemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
 
NARCOTICS- POLICY AND PROCEDURES FOR ITS USE
NARCOTICS- POLICY AND PROCEDURES FOR ITS USENARCOTICS- POLICY AND PROCEDURES FOR ITS USE
NARCOTICS- POLICY AND PROCEDURES FOR ITS USE
 
Osvaldo Bernardo Muchanga-GASTROINTESTINAL INFECTIONS AND GASTRITIS-2024.pdf
Osvaldo Bernardo Muchanga-GASTROINTESTINAL INFECTIONS AND GASTRITIS-2024.pdfOsvaldo Bernardo Muchanga-GASTROINTESTINAL INFECTIONS AND GASTRITIS-2024.pdf
Osvaldo Bernardo Muchanga-GASTROINTESTINAL INFECTIONS AND GASTRITIS-2024.pdf
 
Breast cancer: Post menopausal endocrine therapy
Breast cancer: Post menopausal endocrine therapyBreast cancer: Post menopausal endocrine therapy
Breast cancer: Post menopausal endocrine therapy
 
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấuK CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
 
NAVIGATING THE HORIZONS OF TIME LAPSE EMBRYO MONITORING.pdf
NAVIGATING THE HORIZONS OF TIME LAPSE EMBRYO MONITORING.pdfNAVIGATING THE HORIZONS OF TIME LAPSE EMBRYO MONITORING.pdf
NAVIGATING THE HORIZONS OF TIME LAPSE EMBRYO MONITORING.pdf
 
The Nervous and Chemical Regulation of Respiration
The Nervous and Chemical Regulation of RespirationThe Nervous and Chemical Regulation of Respiration
The Nervous and Chemical Regulation of Respiration
 
Acute Gout Care & Urate Lowering Therapy .pdf
Acute Gout Care & Urate Lowering Therapy .pdfAcute Gout Care & Urate Lowering Therapy .pdf
Acute Gout Care & Urate Lowering Therapy .pdf
 
Travel Clinic Cardiff: Health Advice for International Travelers
Travel Clinic Cardiff: Health Advice for International TravelersTravel Clinic Cardiff: Health Advice for International Travelers
Travel Clinic Cardiff: Health Advice for International Travelers
 
Full Handwritten notes of RA by Ayush Kumar M pharm - Al ameen college of pha...
Full Handwritten notes of RA by Ayush Kumar M pharm - Al ameen college of pha...Full Handwritten notes of RA by Ayush Kumar M pharm - Al ameen college of pha...
Full Handwritten notes of RA by Ayush Kumar M pharm - Al ameen college of pha...
 

Charles lee human genome structural variant consortium update

  • 1. 1/29/18 1 Human Structural Genomic Variation Charles Lee, Ph.D., FACMG SCIENTIFIC DIRECTOR AND PROFESSOR The Jackson Laboratory for Genomic Medicine January 25, 2018 Conclusion of the 1000G Project: Published September 30, 2015
  • 2. 1/29/18 2 Steven McCarroll Deanna Church Jonas Korlach Scott Devine Ali Bashir Ryan Mills Ken Chen Kai Ye Mark Gerstein Gabor Marth Jonathan Sebat Michael Talkowski Goo Jun Li Ding Bing Ren Paul Flicek Pui-Yan Kwok Peter Lansdorp Xinghua ShiTobias Marschall Jan Korbel Evan Eichler Chong Lek Koh Charles Lee HGSV: Human Genome Structural Variation Consortium PRINCIPAL INVESTIGATORS: HGSV: Strategy Illumina Hiseq Jumping Libraries Pac Bio Chrom (10X) Illumina (Moleculo) Hi-C Strand Seq BioNano Affy Cytoscan Short reads High error rate Low base calling accuracy Not true long read Cost High library cost Long run time Not commercially available WGS remains a challenge Low sequence quality GC bias CONS Minimal sample FastCapable of long range phasing True long reads DNA-protein interactions Easy sample prep High quality de novo assembly DNA methylation analysis High quality No assembly required High consensus Uniform coverage PROS
  • 3. 1/29/18 3 HGSV: Selected Trios GOAL: to utilize multiple platforms for comprehensive discovery of structural variations in selected individual genomes NA19240 HG00514 HG00733 NA19239 NA19238 Yoruban Han Chinese Puerto Rican HG00512 HG00513 HG00731 HG00732 (high genetic diversity) (low genetic diversity) (population admixture) Parents were previously sequenced in 1000G Children sequenced in HGSV to 223-fold combined coverage Illumina HiSeq Short read sequencing, deep coverage PCR-free (bridge amplification) Sequences reads (approximately 150bp) from both ends of a DNA fragment = PAIRED END Three approaches for SV detection (using paired-end sequencing data): READ DEPTH PAIRED END MAPPING SPLIT MAPPING * Short Read Sequencing
  • 4. 1/29/18 4 PacBio SMRT Sequencing Single Molecule Real Time (SMRT) long insert sequencing, no amplification required Sequences long and complex regions * Long Read Sequencing True long read sequencing, average read size >10,000 bases Bio Nano Video Bio Nano Genomics Irys * Optical Mapping Analyzes single, linearized DNA molecule (no amplification required) Optical mapping = complimentary way to interrogate genome for SVs Whole genome maps enable detection of translocations, repeats and deletions through de novo assembly
  • 5. 1/29/18 5 Illumina Moleculo Generate long reads from existing tagged short reads, synthetic length is ~100kb Genome is sheared to 10kb fragments, which are tagged with unique barcode * Synthetic Long Reads Long reads allow for de novo assembly BARCODE: A series of known bases added to template molecule (through ligation or amplification). Chromium (10X Genomics) Long range genomic cell-by-cell gene expression (low DNA input: 1ng) * Synthetic Long Reads Long range information enables phasing, SV calling and copy number determination 10X Video Standard short reads are attached with molecular barcodes, synthetic length is ~100kb
  • 6. 1/29/18 6 Strand-Seq 1 2 3 96 BrdU incorporated into new strands MitosisReplication Cell Daughter Cells Isolate and deposit single daughter cell into single well MNase digestion Fragment DNA to 150bp Ligate adaptors Hoechst-UV Treatment BrdU+ strands nicked Remove BrdU+ strands PCR Amplification Only template strands are amplified Introduce multiplex barcode during PCR amplification Sequence and Ideogram B B B Template B B B Template B B B Template B B B Template A2 A1 Template A2’ A1’ A1’ A2’ Template A1 A2 Create paired-end sequencing libraries Create Ideogram Falconer et al. Nature Methods 9: 1107-12, 2012; Sanders et al. Genome Res 26: 1575-1587, 2016 Haplotype A Haplotype B 10kb reads Partitioning PacBio Reads to Haplotypes 67% of reads could be haplotype partitioned
  • 7. 1/29/18 7 Integrative Phasing Illumina Illumina-SLR PacBio Chro HiC Strand-Seq WhatsHap Dense, global (chromosomal scale) haplotypes DENSE + LOCAL SPARSE + GLOBAL Fraction of Phase Connection (power to call and resolve haplotypes) heterozygous heterozygous heterozygous Illumina PacBio Chromium (10X) Strand-Seq F.P.C.% 68% 95% 92% 65% C G A C G A C AG G G A Chromium (10X) + Strand-Seq C G A Lowest mismatch error rate (0.23%). 96.5% of all heterozygous SNVs were phased.
  • 8. 1/29/18 8 Discovering Indels (1-49bp) Illumina Short reads 818,181 Indels/child PacBio Long reads Increase: 119,274 Phased-SV 698,907 Indels/child GATK FreeBays Pindel Discovering SVs (>50bp): Unified Callsets Illumina Short reads Increase: 20,963SVs BNGPacBio Long reads Optical Mapping 12,680 Deletions 18,919 Insertions 31,599 SVs (avg/individual) Various algorithms (Assemble/Map , Call Variants) Haplotype-resolved Phased SV 6808 Deletions 3035 Insertions 793 Duplications 10,636 SVs (avg/individual) 11 Algorithms (Assemble/Map , Call Variants)
  • 9. 1/29/18 9 NA19240 Unified Callset for SV (DELETIONS) 7176 90 202 1186 5109 60 721 PacBio (n=13066) Illumina (n=7218) BioNano (n=1073) NA19240 Unified Callset for SV (INSERTIONS) 13443 435 121 1365 2939 1073 87 PacBio (n=17542) Illumina (n=4512) BioNano (n=1716)
  • 10. 1/29/18 10 PacBio Assemblies have Reduced Ability to Detect CNVs ~341 genes/child had a CNV that was NOT detected by PacBio assembly Therefore it’s important to continue read-depth based CNV detection 24.9%of all SD’s detected by PacBio All Known Segmental Duplications All CNVs detected by dCGH All CNVs detected by GenomeSTRiP 6.2%of all CNVs detected by PacBio 27%of all CNVs detected by PacBio Discovering Inversions BNGIllumina PacBio Short reads, WGS Long read Various algorithms (Assemble/Map , Call Variants) Optical Mapping Illumina Jumping library (liWGS) Strand Seq 126 91 118 28 170 306 Inversions (across all 9 individuals) 121 Simple Inversions (per person)
  • 11. 1/29/18 11 Inversion Calls in Trios, Based on Technologies 81 heterozygous 38 homozygous Smallest: 263 bp (ch1:187497343-187497605) Largest: 1.28Mb (chr8: 8230468-12095842) 121 Simple Inversions (per person) Technology Illumina Jumping PacBio Strand-seq Bionano Refining Meiotic Breakpoints Strand-seq ~25 kbp Meiotic breakpoint median resolution: ~1.5 kbp PacBio + Strand-seq (trio-aware phasing) Maternal meiotic recombination events Alu retrotransposons (p=3.07E-7) THE1-A,B (p=0.08) 15-mer motif
  • 12. 1/29/18 12 Discovering Full-length L1s On average ~190 FL-L1 (with 2 ORFs) per child 124 of these FL-L1s were found in all 3 children - Some are known active (hot) L1 elements associated with disease such as cancer. - 4-6 unique active (hot) L1s found in each genome. Cumulative differences in L1 mutagenesis suggest that such diversity may translate into differential risk levels for cancers and other disorders. Take Home Message Combining different technologies/algorithms, we have identified 7x more structural variation per person than was previously appreciated No single technology / algorithm will be comprehensive Application to disease association studies and clinical interpretation of genomes • Average of 818,181 indels (<50 bp) per person • Average of 31,599 SVs (>50 bp) per person • Average of 121 inversions per person (simple + complex) • Active LINE-1 elements