Genome-in-a-Bottle Consortium
August 2013
Reference Materials for Clinical Applications of
Human Genome Sequencing
Marc Salit, Ph.D. and Justin Zook, Ph.D.
National Institute of Standards and Technology
Genome in a Bottle
Consortium Development
• NIST met with sequencing
technology developers to assess
standards needs
– Stanford, June 2011
• Open, exploratory workshop
– ASHG, Montreal, Canada
– October 2011
• Small, invitational workshop at
NIST to develop consortium for
human genome reference
materials
– FDA, NCBI, NHGRI, NCI, CDC, Wash
U, Broad, technology developers,
clinical labs, CAP, PGP, Partners,
ABRF, others
– developed draft work plan
– April 2012
• Open, public meeting at NIST to
formally establish consortium,
present draft work plan
– formed working groups
– identified candidate genomes
– established principles of:
• reference material selection
• characterization
• informatics
• performance metrics
– August 2012
• Open, public workshop at XGen
Congress
– March 2013
• Website
– www.genomeinabottle.org
Well-characterized, stable RMs
• Obtain metrics for validation,
QC, QA, PT
• Determine sources and types
of bias/error
• Learn to resolve difficult
structural variants
• Improve reference genome
assembly
• Optimization
– integration of data from
multiple platforms
– sequencing and analysis
• Enable regulated applications
[Figure: Comparison of SNP calls for NA12878 on 2 platforms, 3 analysis methods]
Measurement Process
Sample
gDNA isolation
Library Prep
Sequencing
Alignment/Mapping
Variant Calling
Confidence Estimates
Downstream Analysis
• gDNA reference
materials will be
developed to
characterize
performance of a part
of process
– materials will be
certified for their
variants against a
reference sequence,
with confidence
estimates
[Figure label: generic measurement process]
• NIST working with GiaB
to select genomes
• Current plan
– NA12878 HapMap
sample as Pilot sample
• part of 17-member
pedigree
– trios from PGP as more
complete set
• 8 trios, focus on children
• varying biogeographic
ancestry
12889 12890 12891 12892
12877 12878
12879 12880 12881 12882 12883 12884 12885 12887 12886 12888 12893
CEPH Utah Pedigree 1463
Putting “Genomes” in Bottles
11 children, Birth Order Redacted
Genome in a Bottle Working Groups
Reference Material
Selection
& Design
Andrew Grupe,
Celera
•Develop prioritized list
of whole human
genomes for Reference
Materials
•Identify candidate
approaches and
materials for artificial
RMs
•Develop prioritized
list
Measurements for
Reference Material
Characterization
Mike Eberle, Illumina
•Develop consensus
plan for experimental
characterization of
Reference Materials
Bioinformatics,
Data Integration,
and Data
Representation
Steve Sherry, NCBI
•Develop plan for
integrating
experimental data and
forming consensus
variant calls and
confidence estimates
•Develop consensus
plan for data
representation
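A minimal sketch of what "forming consensus variant calls and confidence estimates" could look like, assuming each platform's calls are reduced to a set of (chrom, pos, ref, alt) tuples. The simple vote fraction used as a confidence value, the function name `consensus_calls`, and the example data are all hypothetical illustrations, not the consortium's integration method.

```python
from collections import Counter
from typing import Dict, Set, Tuple

# Hypothetical variant key: (chromosome, position, reference allele, alternate allele)
Variant = Tuple[str, int, str, str]


def consensus_calls(
    call_sets: Dict[str, Set[Variant]], min_fraction: float = 0.5
) -> Dict[Variant, float]:
    """Keep variants called by more than min_fraction of the data sets.

    The returned value is the fraction of data sets that made the call,
    used here as a crude stand-in for a confidence estimate.
    """
    n_sets = len(call_sets)
    if n_sets == 0:
        return {}
    # Count how many data sets report each variant.
    counts = Counter(v for calls in call_sets.values() for v in calls)
    return {v: c / n_sets for v, c in counts.items() if c / n_sets > min_fraction}


# Made-up example calls from three hypothetical data sets.
v1 = ("chr1", 1000, "A", "G")
v2 = ("chr1", 2000, "C", "T")
v3 = ("chr2", 500, "G", "GA")
platform_calls = {
    "platform_a": {v1, v2},
    "platform_b": {v1, v3},
    "platform_c": {v1, v2},
}
consensus = consensus_calls(platform_calls)
print(consensus)
```

A real integration would also need to weigh platform-specific bias and normalize variant representation before comparing calls; this sketch ignores both.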
Performance Metrics
& Figures of Merit
Justin Johnson
•User interface to the
Genome-in-a-Bottle
Reference Material
•“Dashboard”
•what an end user will
see and report to
understand and
describe the
performance of their
experiment
•variant call accuracy
•process performance
measures to enable
optimization
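As one illustration of a "variant call accuracy" metric, the sketch below compares a test call set against a reference call set and reports sensitivity and precision. It assumes exact matching on (chrom, pos, ref, alt) tuples; the function name `call_accuracy` and the example variants are hypothetical and are not taken from the working group's plan.

```python
from typing import NamedTuple, Set, Tuple

# Hypothetical variant key: (chromosome, position, reference allele, alternate allele)
Variant = Tuple[str, int, str, str]


class Accuracy(NamedTuple):
    true_pos: int
    false_pos: int
    false_neg: int
    sensitivity: float
    precision: float


def call_accuracy(reference: Set[Variant], test: Set[Variant]) -> Accuracy:
    """Compare a test call set with a reference call set by exact match."""
    tp = len(reference & test)   # calls present in both sets
    fp = len(test - reference)   # calls only in the test set
    fn = len(reference - test)   # reference calls the test set missed
    sensitivity = tp / (tp + fn) if tp + fn else float("nan")
    precision = tp / (tp + fp) if tp + fp else float("nan")
    return Accuracy(tp, fp, fn, sensitivity, precision)


# Made-up example data.
reference_calls = {
    ("chr1", 1000, "A", "G"),
    ("chr1", 2000, "C", "T"),
    ("chr2", 500, "G", "GA"),
}
test_calls = {
    ("chr1", 1000, "A", "G"),
    ("chr2", 500, "G", "GA"),
    ("chr3", 42, "T", "C"),
}
result = call_accuracy(reference_calls, test_calls)
print(result)
```

Exact tuple matching is the simplest choice; it will miscount variants that are written differently in the two sets, so a practical comparison would restrict itself to confidently characterized regions and normalize representations first.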
Agenda
Thursday
Welcome and Intro
Integrating large scale sequencing into clinical
practice
Heidi Rehm
Personal Genomics
Michael Snyder
Break/Poster Session
Update on GIAB Progress
Marc Salit
Comparison of NIST, Platinum Genomes, and
other NA12878 call-sets to understand
sequencing performance
Justin Zook
Presentations from related projects
Platinum Genomes
Michael Eberle
NA12878 Trio Analysis
Francisco De La Vega
GeT-RM Project and Genome Browser
Deanna Church
Lunch (on your own in NIST cafeteria)
Working Group Meetings
Reference Material Selection & Design (Lecture
Room E)
Measurements for Reference Material
Characterization (Dining Room A&B)
Bioinformatics, Data Integration, and Data
Representation (Lecture Room A)
Performance Metrics and Figures of Merit
(Lecture Room C)
Friday
Discussion between working groups
Working group reports (Green Auditorium)
Workplan refinement, timeline
Lunch (on your own in NIST cafeteria)
Discussion: Scope of consortium, how to make
decisions
Resource needs, how to meet them, and next
steps
Please Note
The plenary sessions of this workshop are being
webcast (audio & slides) – please use the
microphones when asking questions. Web
attendees can ask questions with chat. Slides will
be made available on SlideShare after the
workshop (see genomeinabottle.org).
Tweets are welcome unless the speaker requests
otherwise. Please use #giab as the hashtag.
Status Update,
Consortium Business
Marc Salit and Justin Zook
Consenting Genomes for use as
Reference Materials
• Risk of re-identification
– this is a real risk
– privacy
– implications for family members
• Meaning of possibility of
withdrawal
• Commercial application
– indirect, research
– direct, derived products
• PGP project currently state-of-art
– broad and direct
– test to demonstrate understanding
• “Wild West”
NIST Reference Materials
Pilot RM - NA12878
• 8300 10ug vials of NA12878
gDNA @ NIST 4/2013
– Available for sequencing by
GIAB participants
– target for release as NIST RM
2/2014
• SNPs, small indels
• Will be sequenced at ~10 labs
– ~4 technologies, multiple
modes
• Received “Human Subjects
Approval” for release of
NA12878 as NIST RM
Personal Genome Project
• Ashkenazim trio DNA expected
~Dec 2013
• Asian son DNA expected ~Dec
2013
– Parents’ cell lines in process at
Coriell
• “Human subjects review”
close to approval for release of
PGP genomes as NIST RMs
• Plan is 5-6 additional trios of
diverse ancestry
– Ideally, African, Asian, Hispanic
– What should we do if PGP
doesn’t have trios from each of
these groups?
Planned Measurements on NA12878
candidate RM
• NIST
– ~300x total 2x150bp Illumina
over 6 vials of NA12878
– ~100x SOLiD 5500W 2x50bp
coverage
– ~50x SOLiD 5500W 2x50bp
coverage of parents
• Illumina
– PCR-free
– Mate-pair
• Complete Genomics
– Normal pipeline
– LFR pipeline
• NCI
– Ion Proton
– Illumina
– Various libraries
• Garvan
– Illumina exome
• Celera
– Targeted panels
• Cornell Weill
– Illumina
• MTAs pending
– Univ. of Nebraska Medical
– Univ. of Michigan
HOW DO WE WANT TO FUNCTION
AS A CONSORTIUM?
What’s our scope?
How do we make decisions?
Spectrum of Possibilities
• NIST develops and
disseminates gDNA
RMs with consortium
input
• Consortium functions as
a Standards Body, with
dynamic portfolio and
broad influence
Scope
Basic Scope
• Develop/disseminate pilot
genome and 8 trios as RMs
– gDNA and reference data
• Develop/disseminate
“Performance Metrics
Suite”
– data repository?
• Documentary Standards to
describe methods?
– through a clinical SDO? CLSI?
IFCC?
Extended Scope
• Other RMs as part of GiaB
portfolio?
– tumor/normal pair
– artificial spike-in controls
• pDNA from NCI
– derived commercial materials
• Cell lines for which we have
reference gDNA
• Such cell lines embedded in
FFPE
– engineered cell lines
• designed as controls for
specific variants
Extended Scope
• need process to include
new material/product in
portfolio
– what does it mean to put the
GiaB imprimatur on
something?
• some possible requirements
– guidelines for usage
– methods for characterization
– conduct interlab studies to
establish utility
• how do we decide what to
do?
– need to be
• open, transparent, public
• form consensus
– pragmatic consensus
needs champion and
commitment
• e.g. proposer pilots interlab
• consortium members
participate in interlab
• how do we decide policy
matters?
– see draft data release policy
discussion on this tomorrow after lunch…

DMCC Future of Trade Web3 - Special Edition
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 

Aug2013 NIST program slides

  • 1. Genome-in-a-Bottle Consortium August 2013 Reference Materials for Clinical Applications of Human Genome Sequencing Marc Salit, Ph.D. and Justin Zook, Ph.D. National Institute of Standards and Technology
  • 2. Genome in a Bottle Consortium Development • NIST met with sequencing technology developers to assess standards needs – Stanford, June 2011 • Open, exploratory workshop – ASHG, Montreal, Canada – October 2011 • Small, invitational workshop at NIST to develop consortium for human genome reference materials – FDA, NCBI, NHGRI, NCI, CDC, Wash U, Broad, technology developers, clinical labs, CAP, PGP, Partners, ABRF, others – developed draft work plan – April 2012 • Open, public meeting at NIST to formally establish consortium, present draft work plan – formed working groups – identified candidate genomes – established principles of: • reference material selection • characterization • informatics • performance metrics – August 2012 • Open, public workshop at XGen Congress – March 2013 • Website – www.genomeinabottle.org
  • 3. Well-characterized, stable RMs • Obtain metrics for validation, QC, QA, PT • Determine sources and types of bias/error • Learn to resolve difficult structural variants • Improve reference genome assembly • Optimization – integration of data from multiple platforms – sequencing and analysis • Enable regulated applications Comparison of SNP Calls for NA12878 on 2 platforms, 3 analysis methods
  • 4. Measurement Process Sample gDNA isolation Library Prep Sequencing Alignment/Mapping Variant Calling Confidence Estimates Downstream Analysis • gDNA reference materials will be developed to characterize performance of a part of process – materials will be certified for their variants against a reference sequence, with confidence estimates
  • 5. Putting “Genomes” in Bottles • NIST working with GiaB to select genomes • Current plan – NA12878 HapMap sample as Pilot sample • part of 17-member pedigree – trios from PGP as more complete set • 8 trios, focus on children • varying biogeographic ancestry [Figure: CEPH Utah Pedigree 1463, sample IDs 12877 to 12893; 11 children, birth order redacted]
  • 6. Genome in a Bottle Working Groups Reference Material Selection & Design Andrew Grupe, Celera •Develop prioritized list of whole human genomes for Reference Materials •Identify candidate approaches and materials for artificial RMs •Develop prioritized list Measurements for Reference Material Characterization Mike Eberle, Illumina •Develop consensus plan for experimental characterization of Reference Materials Bioinformatics, Data Integration, and Data Representation Steve Sherry, NCBI •Develop plan for integrating experimental data and forming consensus variant calls and confidence estimates •Develop consensus plan for data representation Performance Metrics & Figures of Merit Justin Johnson •User interface to the Genome-in-a-Bottle Reference Material •“Dashboard” •what an end user will see and report to understand and describe the performance of their experiment •variant call accuracy •process performance measures to enable optimization
  • 7. Agenda Thursday Welcome and Intro Integrating large scale sequencing into clinical practice Heidi Rehm Personal Genomics Michael Snyder Break/Poster Session Update on GIAB Progress Marc Salit Comparison of NIST, Platinum Genomes, and other NA12878 call-sets to understand sequencing performance Justin Zook Presentations from related projects Platinum Genomes Michael Eberle NA12878 Trio Analysis Francisco De La Vega GeT-RM Project and Genome Browser Deanna Church Lunch (on your own in NIST cafeteria) Working Group Meetings Reference Material Selection & Design (Lecture Room E) Measurements for Reference Material Characterization (Dining Room A&B) Bioinformatics, Data Integration, and Data Representation (Lecture Room A) Performance Metrics and Figures of Merit (Lecture Room C) Friday Discussion between working groups Working group reports (Green Auditorium) Workplan refinement, timeline Lunch (on your own in NIST cafeteria) Discussion: Scope of consortium, how to make decisions Resource needs, how to meet them, and next steps
  • 8. Please Note The plenary sessions of this workshop are being webcast (audio & slides) – please use the microphones when asking questions. Web attendees can ask questions with chat. Slides will be made available on SlideShare after the workshop (see genomeinabottle.org). Tweets are welcome unless the speaker requests otherwise. Please use #giab as the hashtag.
  • 10. Consenting Genomes for use as Reference Materials • Risk of re-identification – this is a real risk – privacy – implications for family members • Meaning of possibility of withdrawal • Commercial application – indirect, research – direct, derived products • PGP project currently state-of-art – broad and direct – test to demonstrate understanding • “Wild West”
  • 11. NIST Reference Materials Pilot RM - NA12878 • 8300 10 µg vials of NA12878 gDNA @ NIST 4/2013 – Available for sequencing by GIAB participants – target for release as NIST RM 2/2014 • SNPs, small indels • Will be sequenced at ~10 labs – ~4 technologies, multiple modes • Received “Human Subjects Approval” for release of NA12878 as NIST RM Personal Genome Project • Ashkenazim trio DNA expected ~Dec 2013 • Asian son DNA expected ~Dec 2013 – Parents’ cell lines in process at Coriell • “Human subjects review” close to approval for release of PGP genomes as NIST RMs • Plan is 5-6 additional trios of diverse ancestry – Ideally, African, Asian, Hispanic – What should we do if PGP doesn’t have trios from each of these groups?
  • 12. Planned Measurements on NA12878 candidate RM • NIST – ~300x total 2x150bp Illumina over 6 vials of NA12878 – ~100x SOLiD 5500W 2x50bp coverage – ~50x SOLiD 5500W 2x50bp coverage of parents • Illumina – PCR-free – Mate-pair • Complete Genomics – Normal pipeline – LFR pipeline • NCI – Ion Proton – Illumina – Various libraries • Garvan – Illumina exome • Celera – Targeted panels • Cornell Weill – Illumina • MTAs pending – Univ. of Nebraska Medical – Univ. of Michigan
  • 13. HOW DO WE WANT TO FUNCTION AS A CONSORTIUM? What’s our scope? How do we make decisions?
  • 14. Spectrum of Possibilities • NIST develops and disseminates gDNA RMs with consortium input • Consortium functions as a Standards Body, with dynamic portfolio and broad influence
  • 16. Scope Basic Scope • Develop/disseminate pilot genome and 8 trios as RMs – gDNA and reference data • Develop/disseminate “Performance Metrics Suite” – data repository? • Documentary Standards to describe methods? – through a clinical SDO? CLSI? IFCC? Extended Scope • Other RMs as part of GiaB portfolio? – tumor/normal pair – artificial spike-in controls • pDNA from NCI – derived commercial materials • Cell lines for which we have reference gDNA • Such cell lines embedded in FFPE – engineered cell lines • designed as controls for specific variants
  • 17. Extended Scope • need process to include new material/product in portfolio – what does it mean to put the GiaB imprimatur on something? • some possible requirements – guidelines for usage – methods for characterization – conduct interlab studies to establish utility • how do we decide what to do? – need to be • open, transparent, public • form consensus – pragmatic consensus needs champion and commitment • e.g. proposer pilots interlab • consortium members participate in interlab • how do we decide policy matters? – see draft data release policy discussion on this tomorrow after lunch…