The Transforming Genetic Medicine Initiative (TGMI)

Genome Reference Consortium
Genome Reference ConsortiumGenome Reference Consortium
The Transforming Genetic
Medicine Initiative (TGMI)
£5.3M 4 year programme funded by
Wellcome
Designing, Developing and
Delivering Integrated Foundations
for Genetic Medicine
Nazneen Rahman
Paul Flicek Caroline Wright
Sian Ellard David Fitzpatrick
Ewan Birney
Fiona Cunningham
Graeme BlackHelen Firth
Gerton Lunter
Matthew Hurles
Patrick Chinnery
TGMI PIs
National and international collaborations
Transforming genetic medicine
Must ensure the wealth of existing medical
genetic knowledge informs our use of current
and future technology, if we are to do more
right and less wrong.
‘The past is never dead, it’s not even past.’ William Faulkner
GENOME PHENOME
Genomic medicine
GENOME PHENOME
TGMI is focussed on genes
GENE ‘MENDELIAN’
DISORDERS
Genetic medicine 1990-2010
GENE ‘MENDELIAN’
DISORDERS
Prior to NGS, genetic medicine was phenotype-driven.
Meticulous phenotyping used to decide which genes to test.
Genetic medicine 2020
GENE ‘MENDELIAN’
DISORDERS
With NGS, genetic medicine becomes genotype-driven and
can potentially be large-scale and routine.
Genetic medicine 2010-2016
GENE ‘MENDELIAN’
DISORDERS
With NGS, genetic medicine can be genotype-driven. But as
the processes are not well formed phenotyping often used
(often incorrectly) to decide which data is ‘relevant’.
TGMI aims to undertake conceptual,
foundational research to deliver
practical solutions to make genetic
medicine work
TGMI Aims
1. To provide robust, comprehensive information on
links between genes and human disease in a user-
friendly interface.
2. To develop standardised frameworks for consistent
clinical annotation and reporting of gene variation.
3. To develop approaches to deliver fast, automated,
high-throughput, large-scale variant interpretation.
4. To develop and validate flexible, multipurpose
analytical processes to maximise clinical and research
utilities of genetic testing.
GENES
Gene 1
Gene 20,000
For each gene ask qn:
Are germline mutations known to
‘cause’ a human disorder
YES – red (should not become
blue)
NO – blue (some will become
red)
All others – grey (further work
to classify to red or blue)
Gene Disease Map
DISEASES
Many complexities
at phenotype level.
‘Mendelian’
diseases
Why this is needed
Q: How many disease genes are there?
A: Depends who and how you ask.
OMIM: ‘genes phenotype-causing mutation’ = 3416
‘phenotype description, molecular basis known’ = 4482
BioMart: Ensembl Genes: + Swiss Prot IDs and OMIM
phenotype = 3268
Gene Cards: ‘disease genes’ = 9578
TGMI Aims
1. To provide robust, comprehensive information on
links between genes and human disease in a user-
friendly interface.
2. To develop standardised frameworks for consistent
clinical annotation and reporting of gene variation.
3. To develop approaches to deliver fast, automated,
high-throughput, large-scale variant interpretation.
4. To develop and validate flexible, multipurpose
analytical processes to maximise clinical and research
utilities of genetic testing.
TGMI – Aim 2
2.1 – Defining a Clinical Annotation Reference
System (CARS)
2.2 – Defining a Clinical Sequencing Notation
(CSN)
2.3 – Development and distribution of
conversion tools
Why this is needed
• In the clinic and research settings there is
huge variability in annotation of genetic
variation at every level (gene name, transcript
choice, variant annotation etc).
• This inevitably compromises data integration,
and clinical utility and fosters errors and
harms.
The CARS
• The Clinical Annotation Reference System
(CARS) encompasses the set of protein-coding
genes, the set of reference transcripts and
proteins corresponding to the genes, and a
Clinical Sequencing Notation (CSN) for
annotation of variation according to the
sequences.
• Defined against the reference human
genome.
TGMI gene set working criteria
• Has an HGNC ID
• Has an annotated start (which can be non-
methionine)
• Has an annotated stop
• Occurs on chromosomes 1-22, X, Y, or MT
• Has a gene and transcript biotype of “protein-
coding” from Ensembl (release 84)
The TGMI gene working set is
comprised of 18,885 genes
Clinical reference transcripts
1. Sequences must be based on the reference human
genome.
2. The system must allow flexible iteration without
compromising stability or clarity of sequence selection.
3. Reference transcripts must have durability, i.e. historical
sequences used for clinical reporting that are
subsequently superseded must remain available.
4. The reference transcript set should include as few
sequences as possible (one per gene for most genes) but
as many as required.
5. The reference transcript set must be easily available and
usable to encourage universal uptake.
CSN – Clinical Sequencing Notation
• Once transcript is selected, the observed variant
must be named according to its relative difference
from the reference.
• Fixed, standardised, automatic process for
annotation of sequence variation
• Consistent with historical HGVS guidelines
TGMI Aims
1. To provide robust, comprehensive information on
links between genes and human disease in a user-
friendly interface.
2. To develop standardised frameworks for consistent
clinical annotation and reporting of gene variation.
3. To develop approaches to deliver fast, automated,
high-throughput, large-scale variant interpretation.
4. To develop and validate flexible, multipurpose
analytical processes to maximise clinical and research
utilities of genetic testing.
Traditional interpretation process
1. Leveraging generic predictors, e.g. evolutionary
conservation, protein structural features, impact on
splicing etc to predict the functional consequences
of individual variants (done in lab).
2. Leveraging expert assessment of clinical impact
through disease and gene specific knowledge about
the phenotype, genetic architecture, genotype-
phenotype correlations, personal and family history
and variant segregation etc (done in clinic).
Interpretation requirements
1. High-throughput + large volume
2. Fast turnaround
3. Integrated into NGS pipelines
4. Integrated into clinical pipelines
5. Intelligible and usable by non-expert/patients
Variant Phenotype
Variant Phenotype
Frequency of phenotype
Mechanism of pathogenicity
Inheritance pattern
Attribution of gene for
phenotype
Penetrance of gene for
phenotype
Population variation
Variability of gene
Gene structure/function
Much useful information can be utilised and automated so
that the required manual curation can be focussed on the
~2-5% of variants where it is required.
TGMI Aims
1. To provide robust, comprehensive information on
links between genes and human disease in a user-
friendly interface.
2. To develop standardised frameworks for consistent
clinical annotation and reporting of gene variation.
3. To develop approaches to deliver fast, automated,
high-throughput, large-scale variant interpretation.
4. To develop and validate flexible, multipurpose
analytical processes to maximise clinical and research
utilities of genetic testing.
OpEx (Optimised Exome) pipeline
https://github.com/RahmanTeam/OpEx
www.icr.ac.uk/opex
OpEx pipeline
•Simple
•To clinical standards
•High-quality indel calling
http://icr.ac.uk/opex
https://github.com/RahmanTeam/OpEx
• Comparisons with ExAC
• Comparisons with clinical
exome pipelines
All input is welcome!
• The TGMI is keen to hear from and engage with anyone
interested in our aims. We are grateful for any input into
what is needed in genetic medicine, how those needs are
best met, and whether our solutions work.
• How to stay in touch:
– http://theTGMI.org
– info@theTGMI.org
– Weekly blog
– Twitter: @theTGMI
1 of 31

Recommended

Genome in a Bottle by
Genome in a BottleGenome in a Bottle
Genome in a BottleGenome Reference Consortium
1.4K views33 slides
170326 giab abrf by
170326 giab abrf170326 giab abrf
170326 giab abrfGenomeInABottle
547 views30 slides
GIAB GRC Workshop slides by
GIAB GRC Workshop slidesGIAB GRC Workshop slides
GIAB GRC Workshop slidesGenomeInABottle
1.5K views33 slides
Sept2016 plenary mercer_sequins by
Sept2016 plenary mercer_sequinsSept2016 plenary mercer_sequins
Sept2016 plenary mercer_sequinsGenomeInABottle
566 views26 slides
Sept2016 plenary nist_intro by
Sept2016 plenary nist_introSept2016 plenary nist_intro
Sept2016 plenary nist_introGenomeInABottle
869 views36 slides
170120 giab stanford genetics seminar by
170120 giab stanford genetics seminar170120 giab stanford genetics seminar
170120 giab stanford genetics seminarGenomeInABottle
443 views30 slides

More Related Content

What's hot

2016 ashg giab poster by
2016 ashg giab poster2016 ashg giab poster
2016 ashg giab posterGenomeInABottle
325 views1 slide
2017 agbt giab_poster by
2017 agbt giab_poster2017 agbt giab_poster
2017 agbt giab_posterGenomeInABottle
303 views1 slide
GIAB-GRC workshop oct2015 giab introduction 151005 by
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GenomeInABottle
12.7K views27 slides
161115 precision fda giab by
161115 precision fda giab161115 precision fda giab
161115 precision fda giabGenomeInABottle
550 views11 slides
Aug2013 illumina platinum genomes by
Aug2013 illumina platinum genomesAug2013 illumina platinum genomes
Aug2013 illumina platinum genomesGenomeInABottle
5.6K views14 slides
GIAB Sep2016 Lightning megan cleveland targeted seq by
GIAB Sep2016 Lightning megan cleveland targeted seqGIAB Sep2016 Lightning megan cleveland targeted seq
GIAB Sep2016 Lightning megan cleveland targeted seqGenomeInABottle
806 views12 slides

What's hot(20)

GIAB-GRC workshop oct2015 giab introduction 151005 by GenomeInABottle
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005
GenomeInABottle12.7K views
Aug2013 illumina platinum genomes by GenomeInABottle
Aug2013 illumina platinum genomesAug2013 illumina platinum genomes
Aug2013 illumina platinum genomes
GenomeInABottle5.6K views
GIAB Sep2016 Lightning megan cleveland targeted seq by GenomeInABottle
GIAB Sep2016 Lightning megan cleveland targeted seqGIAB Sep2016 Lightning megan cleveland targeted seq
GIAB Sep2016 Lightning megan cleveland targeted seq
GenomeInABottle806 views
171114 best practices for benchmarking variant calls justin by GenomeInABottle
171114 best practices for benchmarking variant calls justin171114 best practices for benchmarking variant calls justin
171114 best practices for benchmarking variant calls justin
GenomeInABottle891 views
Tools for Using NIST Reference Materials by GenomeInABottle
Tools for Using NIST Reference MaterialsTools for Using NIST Reference Materials
Tools for Using NIST Reference Materials
GenomeInABottle1.6K views
Aug2015 salit standards architecture by GenomeInABottle
Aug2015 salit standards architectureAug2015 salit standards architecture
Aug2015 salit standards architecture
GenomeInABottle517 views
Giab aug2015 intro and update 150821.pptx by GenomeInABottle
Giab aug2015 intro and update 150821.pptxGiab aug2015 intro and update 150821.pptx
Giab aug2015 intro and update 150821.pptx
GenomeInABottle846 views
Genome in a Bottle- reference materials to benchmark challenging variants and... by GenomeInABottle
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...
GenomeInABottle247 views
Aug2013 NIST highly confident genotype calls for NA12878 by GenomeInABottle
Aug2013 NIST highly confident genotype calls for NA12878Aug2013 NIST highly confident genotype calls for NA12878
Aug2013 NIST highly confident genotype calls for NA12878
GenomeInABottle2.4K views
Aug2015 Giab nist integration methods by GenomeInABottle
Aug2015 Giab nist integration methodsAug2015 Giab nist integration methods
Aug2015 Giab nist integration methods
GenomeInABottle426 views
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3 by GenomeInABottle
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
GenomeInABottle1.1K views
2017 amp benchmarking_poster_justin by GenomeInABottle
2017 amp benchmarking_poster_justin2017 amp benchmarking_poster_justin
2017 amp benchmarking_poster_justin
GenomeInABottle184 views
Genome in a bottle for next gen dx v2 180821 by GenomeInABottle
Genome in a bottle for next gen dx v2 180821Genome in a bottle for next gen dx v2 180821
Genome in a bottle for next gen dx v2 180821
GenomeInABottle137 views
160627 giab for festival sv workshop by GenomeInABottle
160627 giab for festival sv workshop160627 giab for festival sv workshop
160627 giab for festival sv workshop
GenomeInABottle373 views

Viewers also liked

Variation reference graphs and the variation graph toolkit vg by
Variation reference graphs and the variation graph toolkit vgVariation reference graphs and the variation graph toolkit vg
Variation reference graphs and the variation graph toolkit vgGenome Reference Consortium
1.6K views17 slides
Graph and assembly strategies for the MHC and ribosomal DNA regions by
Graph and assembly strategies for the MHC and ribosomal DNA regionsGraph and assembly strategies for the MHC and ribosomal DNA regions
Graph and assembly strategies for the MHC and ribosomal DNA regionsGenome Reference Consortium
564 views27 slides
agbt 2016 workshop lindsay by
agbt 2016 workshop lindsayagbt 2016 workshop lindsay
agbt 2016 workshop lindsayGenome Reference Consortium
1.2K views25 slides
Getting the most from the reference assembly by
Getting the most from the reference assemblyGetting the most from the reference assembly
Getting the most from the reference assemblyGenome Reference Consortium
738 views46 slides
Creating Reference-Grade Human Genome Assemblies by
Creating Reference-Grade Human Genome AssembliesCreating Reference-Grade Human Genome Assemblies
Creating Reference-Grade Human Genome AssembliesGenome Reference Consortium
1.6K views31 slides
Understanding the reference assembly: CSHL Hackathon by
Understanding the reference assembly: CSHL HackathonUnderstanding the reference assembly: CSHL Hackathon
Understanding the reference assembly: CSHL HackathonGenome Reference Consortium
483 views41 slides

Viewers also liked(18)

Exploiting long read sequencing technology to build a substantially improved ... by Genome Reference Consortium
Exploiting long read sequencing technology to build a substantially improved ...Exploiting long read sequencing technology to build a substantially improved ...
Exploiting long read sequencing technology to build a substantially improved ...

Similar to The Transforming Genetic Medicine Initiative (TGMI)

PadminiNarayanan-Intro-2018.pptx by
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxDESMONDEZIEKE1
6 views67 slides
INBIOMEDvision Workshop at MIE 2011. Victoria López by
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision
1.2K views29 slides
Accelerating the benefits of genomics worldwide by
Accelerating the benefits of genomics worldwideAccelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwideJoaquin Dopazo
637 views46 slides
NGS and the molecular basis of disease: a practical view by
NGS and the molecular basis of disease: a practical viewNGS and the molecular basis of disease: a practical view
NGS and the molecular basis of disease: a practical viewVall d'Hebron Institute of Research (VHIR)
2.6K views49 slides

Similar to The Transforming Genetic Medicine Initiative (TGMI)(20)

INBIOMEDvision Workshop at MIE 2011. Victoria López by INBIOMEDvision
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision1.2K views
Accelerating the benefits of genomics worldwide by Joaquin Dopazo
Accelerating the benefits of genomics worldwideAccelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwide
Joaquin Dopazo637 views
EuroBioForum2014_speaker_Manolio by EuroBioForum
EuroBioForum2014_speaker_ManolioEuroBioForum2014_speaker_Manolio
EuroBioForum2014_speaker_Manolio
EuroBioForum1K views
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica... by Emma Griffiths
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...
IRIDA's Genomic epidemiology application ontology (GenEpiO): Genomic, clinica...
Emma Griffiths917 views
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao by William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
William Hsiao348 views
Genomics and proteomics by shreeman by shreeman cs
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreeman
shreeman cs3.4K views
Reg Sci Lecture Dec 2016 by Rick Silva
Reg Sci Lecture Dec 2016Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016
Rick Silva74 views
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi... by Antoaneta Vladimirova
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
Gene hunting strategies by Ashfaq Ahmad
Gene hunting strategiesGene hunting strategies
Gene hunting strategies
Ashfaq Ahmad1.1K views
How to transform genomic big data into valuable clinical information by Joaquin Dopazo
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
Joaquin Dopazo1.6K views

More from Genome Reference Consortium

Previewing GRCm39: Assembly Updates from the GRC by
Previewing GRCm39: Assembly Updates from the GRCPreviewing GRCm39: Assembly Updates from the GRC
Previewing GRCm39: Assembly Updates from the GRCGenome Reference Consortium
7.5K views18 slides
What's new and what's next for the human reference assembly? by
What's new and what's next for the human reference assembly?What's new and what's next for the human reference assembly?
What's new and what's next for the human reference assembly?Genome Reference Consortium
2.3K views19 slides
Advancements in the human genome reference assembly (GRCh38) by
Advancements in the human genome reference assembly (GRCh38)Advancements in the human genome reference assembly (GRCh38)
Advancements in the human genome reference assembly (GRCh38)Genome Reference Consortium
2.4K views11 slides
Telomere-to-telomere assembly of a complete human chromosomes by
Telomere-to-telomere assembly of a complete human chromosomesTelomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomesGenome Reference Consortium
1.9K views48 slides
Genome variation graphs with the vg toolkit by
Genome variation graphs with the vg toolkitGenome variation graphs with the vg toolkit
Genome variation graphs with the vg toolkitGenome Reference Consortium
2.1K views17 slides
The Matched Annotation from NCBI and EMBL-EBI (MANE) Project by
The Matched Annotation from NCBI and EMBL-EBI (MANE) ProjectThe Matched Annotation from NCBI and EMBL-EBI (MANE) Project
The Matched Annotation from NCBI and EMBL-EBI (MANE) ProjectGenome Reference Consortium
1.4K views21 slides

More from Genome Reference Consortium(17)

Recently uploaded

Fetal and Neonatal Circulation - MBBS, Gandhi medical College Hyderabad by
Fetal and Neonatal Circulation - MBBS, Gandhi medical College Hyderabad Fetal and Neonatal Circulation - MBBS, Gandhi medical College Hyderabad
Fetal and Neonatal Circulation - MBBS, Gandhi medical College Hyderabad Swetha rani Savala
10 views16 slides
Cholera Romy W. (3).pptx by
Cholera Romy W. (3).pptxCholera Romy W. (3).pptx
Cholera Romy W. (3).pptxrweth613
50 views11 slides
eTEP -RS Dr.TVR.pptx by
eTEP -RS Dr.TVR.pptxeTEP -RS Dr.TVR.pptx
eTEP -RS Dr.TVR.pptxVarunraju9
141 views33 slides
CRANIAL NERVE EXAMINATION.pptx by
CRANIAL NERVE EXAMINATION.pptxCRANIAL NERVE EXAMINATION.pptx
CRANIAL NERVE EXAMINATION.pptxNerusu sai priyanka
177 views30 slides
Thrives Priority Areas: Behavioral Health by
Thrives Priority Areas: Behavioral HealthThrives Priority Areas: Behavioral Health
Thrives Priority Areas: Behavioral HealthCity of Chesapeake
53 views22 slides
Aminoglycosides by
AminoglycosidesAminoglycosides
AminoglycosidesDr. Ajmer Singh Grewal
7 views26 slides

Recently uploaded(20)

Fetal and Neonatal Circulation - MBBS, Gandhi medical College Hyderabad by Swetha rani Savala
Fetal and Neonatal Circulation - MBBS, Gandhi medical College Hyderabad Fetal and Neonatal Circulation - MBBS, Gandhi medical College Hyderabad
Fetal and Neonatal Circulation - MBBS, Gandhi medical College Hyderabad
Cholera Romy W. (3).pptx by rweth613
Cholera Romy W. (3).pptxCholera Romy W. (3).pptx
Cholera Romy W. (3).pptx
rweth61350 views
eTEP -RS Dr.TVR.pptx by Varunraju9
eTEP -RS Dr.TVR.pptxeTEP -RS Dr.TVR.pptx
eTEP -RS Dr.TVR.pptx
Varunraju9141 views
Depression PPT template by EmanMegahed6
Depression PPT templateDepression PPT template
Depression PPT template
EmanMegahed620 views
The Art of naming drugs.pptx by DanaKarem1
The Art of naming drugs.pptxThe Art of naming drugs.pptx
The Art of naming drugs.pptx
DanaKarem111 views
AntiAnxiety Drugs .pptx by Dr Dhanik Mk
AntiAnxiety Drugs .pptxAntiAnxiety Drugs .pptx
AntiAnxiety Drugs .pptx
Dr Dhanik Mk20 views
Complications & Solutions in Laparoscopic Hernia Surgery.pptx by Varunraju9
Complications & Solutions in Laparoscopic Hernia Surgery.pptxComplications & Solutions in Laparoscopic Hernia Surgery.pptx
Complications & Solutions in Laparoscopic Hernia Surgery.pptx
Varunraju9122 views
DRUG REPUROSING SEMINAR.pptx by Riya Gagnani
DRUG REPUROSING SEMINAR.pptxDRUG REPUROSING SEMINAR.pptx
DRUG REPUROSING SEMINAR.pptx
Riya Gagnani6 views

The Transforming Genetic Medicine Initiative (TGMI)

  • 1. The Transforming Genetic Medicine Initiative (TGMI) £5.3M 4 year programme funded by Wellcome
  • 2. Designing, Developing and Delivering Integrated Foundations for Genetic Medicine
  • 3. Nazneen Rahman Paul Flicek Caroline Wright Sian Ellard David Fitzpatrick Ewan Birney Fiona Cunningham Graeme BlackHelen Firth Gerton Lunter Matthew Hurles Patrick Chinnery TGMI PIs
  • 4. National and international collaborations
  • 5. Transforming genetic medicine Must ensure the wealth of existing medical genetic knowledge informs our use of current and future technology, if we are to do more right and less wrong. ‘The past is never dead, it’s not even past.’ William Faulkner
  • 7. GENOME PHENOME TGMI is focussed on genes GENE ‘MENDELIAN’ DISORDERS
  • 8. Genetic medicine 1990-2010 GENE ‘MENDELIAN’ DISORDERS Prior to NGS, genetic medicine was phenotype-driven. Meticulous phenotyping used to decide which genes to test.
  • 9. Genetic medicine 2020 GENE ‘MENDELIAN’ DISORDERS With NGS, genetic medicine becomes genotype-driven and can potentially be large-scale and routine.
  • 10. Genetic medicine 2010-2016 GENE ‘MENDELIAN’ DISORDERS With NGS, genetic medicine can be genotype-driven. But as the processes are not well formed phenotyping often used (often incorrectly) to decide which data is ‘relevant’.
  • 11. TGMI aims to undertake conceptual, foundational research to deliver practical solutions to make genetic medicine work
  • 12. TGMI Aims 1. To provide robust, comprehensive information on links between genes and human disease in a user- friendly interface. 2. To develop standardised frameworks for consistent clinical annotation and reporting of gene variation. 3. To develop approaches to deliver fast, automated, high-throughput, large-scale variant interpretation. 4. To develop and validate flexible, multipurpose analytical processes to maximise clinical and research utilities of genetic testing.
  • 13. GENES Gene 1 Gene 20,000 For each gene ask qn: Are germline mutations known to ‘cause’ a human disorder YES – red (should not become blue) NO – blue (some will become red) All others – grey (further work to classify to red or blue) Gene Disease Map DISEASES Many complexities at phenotype level. ‘Mendelian’ diseases
  • 14. Why this is needed Q: How many disease genes are there? A: Depends who and how you ask. OMIM: ‘genes phenotype-causing mutation’ = 3416 ‘phenotype description, molecular basis known’ = 4482 BioMart: Ensembl Genes: + Swiss Prot IDs and OMIM phenotype = 3268 Gene Cards: ‘disease genes’ = 9578
  • 15. TGMI Aims 1. To provide robust, comprehensive information on links between genes and human disease in a user- friendly interface. 2. To develop standardised frameworks for consistent clinical annotation and reporting of gene variation. 3. To develop approaches to deliver fast, automated, high-throughput, large-scale variant interpretation. 4. To develop and validate flexible, multipurpose analytical processes to maximise clinical and research utilities of genetic testing.
  • 16. TGMI – Aim 2 2.1 – Defining a Clinical Annotation Reference System (CARS) 2.2 – Defining a Clinical Sequencing Notation (CSN) 2.3 – Development and distribution of conversion tools
  • 17. Why this is needed • In the clinic and research settings there is huge variability in annotation of genetic variation at every level (gene name, transcript choice, variant annotation etc). • This inevitably compromises data integration, and clinical utility and fosters errors and harms.
  • 18. The CARS • The Clinical Annotation Reference System (CARS) encompasses the set of protein-coding genes, the set of reference transcripts and proteins corresponding to the genes, and a Clinical Sequencing Notation (CSN) for annotation of variation according to the sequences. • Defined against the reference human genome.
  • 19. TGMI gene set working criteria • Has an HGNC ID • Has an annotated start (which can be non- methionine) • Has an annotated stop • Occurs on chromosomes 1-22, X, Y, or MT • Has a gene and transcript biotype of “protein- coding” from Ensembl (release 84)
  • 20. The TGMI gene working set is comprised of 18,885 genes
  • 21. Clinical reference transcripts 1. Sequences must be based on the reference human genome. 2. The system must allow flexible iteration without compromising stability or clarity of sequence selection. 3. Reference transcripts must have durability, i.e. historical sequences used for clinical reporting that are subsequently superseded must remain available. 4. The reference transcript set should include as few sequences as possible (one per gene for most genes) but as many as required. 5. The reference transcript set must be easily available and usable to encourage universal uptake.
  • 22. CSN – Clinical Sequencing Notation • Once transcript is selected, the observed variant must be named according to its relative difference from the reference. • Fixed, standardised, automatic process for annotation of sequence variation • Consistent with historical HGVS guidelines
  • 23. TGMI Aims 1. To provide robust, comprehensive information on links between genes and human disease in a user- friendly interface. 2. To develop standardised frameworks for consistent clinical annotation and reporting of gene variation. 3. To develop approaches to deliver fast, automated, high-throughput, large-scale variant interpretation. 4. To develop and validate flexible, multipurpose analytical processes to maximise clinical and research utilities of genetic testing.
  • 24. Traditional interpretation process 1. Leveraging generic predictors, e.g. evolutionary conservation, protein structural features, impact on splicing etc to predict the functional consequences of individual variants (done in lab). 2. Leveraging expert assessment of clinical impact through disease and gene specific knowledge about the phenotype, genetic architecture, genotype- phenotype correlations, personal and family history and variant segregation etc (done in clinic).
  • 25. Interpretation requirements 1. High-throughput + large volume 2. Fast turnaround 3. Integrated into NGS pipelines 4. Integrated into clinical pipelines 5. Intelligible and usable by non-expert/patients
  • 27. Variant Phenotype Frequency of phenotype Mechanism of pathogenicity Inheritance pattern Attribution of gene for phenotype Penetrance of gene for phenotype Population variation Variability of gene Gene structure/function Much useful information can be utilised and automated so that the required manual curation can be focussed on the ~2-5% of variants where it is required.
  • 28. TGMI Aims 1. To provide robust, comprehensive information on links between genes and human disease in a user- friendly interface. 2. To develop standardised frameworks for consistent clinical annotation and reporting of gene variation. 3. To develop approaches to deliver fast, automated, high-throughput, large-scale variant interpretation. 4. To develop and validate flexible, multipurpose analytical processes to maximise clinical and research utilities of genetic testing.
  • 29. OpEx (Optimised Exome) pipeline https://github.com/RahmanTeam/OpEx www.icr.ac.uk/opex
  • 30. OpEx pipeline •Simple •To clinical standards •High-quality indel calling http://icr.ac.uk/opex https://github.com/RahmanTeam/OpEx • Comparisons with ExAC • Comparisons with clinical exome pipelines
  • 31. All input is welcome! • The TGMI is keen to hear from and engage with anyone interested in our aims. We are grateful for any input into what is needed in genetic medicine, how those needs are best met, and whether our solutions work. • How to stay in touch: – http://theTGMI.org – info@theTGMI.org – Weekly blog – Twitter: @theTGMI