Evaluating Oncogenicity in VSClinical

Evaluating Oncogenicity in VSClinical
Nathan Fortier, Ph.D., Director of Research
20 Most Promising Biotech
Technology Providers
Top 10 Analytics
Solution Providers

NIH Grant Funding Acknowledgments
• Research reported in this publication was supported by the National Institute Of
General Medical Sciences of the National Institutes of Health under:
• Award Number R43GM128485-01
• Award Number R43GM128485-02
• Award Number 2R44 GM125432-01
• Award Number 2R44 GM125432-02
• Montana SMIR/STTR Matching Funds Program Grant Agreement Number 19-51-RCSBIR-005
• PI is Dr. Andreas Scherer, CEO Golden Helix.
• The content is solely the responsibility of the authors and does not necessarily
represent the official views of the National Institutes of Health.

Filtering and Annotation
ACMG & AMP Guidelines
Clinical Reports
CNVAnalysis
Pipeline: Run Workflows
Variant Warehouse
CentralizedAnnotations
Hosted Reports
Sharing and Integration
CNVAnalysis
GWAS |Genomic Prediction
Large-NPopulation Studies
RNA-Seq
Large-NCNV-Analysis
Who Are We?
Golden Helix is a global bioinformatics company
founded in 1998

Cited in 1,000s of Peer-Reviewed Publications

SIMPLE, SUBSCRIPTION-
BASED BUSINESSMODEL
o Yearlyfee
o Unlimitedtraining&support
SOFTWARE ISVETTED
o 20,000+ usersat 400+ organizations
o Quality&feedback
DEEPLY ENGRAINED IN
SCIENTIFIC COMMUNITY
o Give backto thecommunity
o Contributecontentandsupport
INNOVATIVESOFTWARE SOLUTIONS
o Cited in1,000s ofpublications
When you choose Golden Helix,
you receive more than just the software

PDF
REPORT
WORD
REPORT
EXCEL
TABLE
B A M
Calling of CNVs
V C F
Annotating, filtering &
prioritizing of clinically
relevant SNPs and CNVs
‐ Clinical interpretation of SNPs &
CNVs
‐ ACMG & AMP guidelines assessing
germline and somatic variations
‐ Clinical reporting

VSClinical - AMP Guidelines: Analyzing Biomarkers
Haroche J. et al. Dramatic efficacy of vemurafenib in both
multisystemic and refractory Erdheim-Chester disease and
Langerhans cell histiocytosis harboring the BRAF V600E mutation.
Blood 2013 121
• Biomarker Definition
- Biological states with indications for
treatments, prognostic, or diagnostic
outcomes
- Presence or absence of proteins, antigens,
and specific genomic attributes of the
tumor
• Common Cancer Biomarkers
- HER2+: High levels of HER2 receptor
protein
- MSI-H: Microsatellite instability-high
- BRAFV600E: Activating mutation V600E
- ERBB2Amp: Amplification of ERBB2
- BCR-ABL1: Activation of ABL1 with BCR
fusion
- TP53WT: No significant alterations of
critical TSG

VSClinical – AMP and ACMG Guidelines: One Suite
• Increased lab throughput
• Consistent results
• Shorten learning curve
• Staying abreast of new
developments
Germlin
e
Somatic

Oncogenicity Scale
Oncogenic
Likely
Oncogenic
Likely
BenignBenign
-5 0 +3 +5-4 +2-2 +1 +4-1-3
Variant of
Unknown
Significance
• Germline Population Catalogs
• In-Silico Functional/Splicing
• Previous / Clinical Evaluations
• Somatic Catalogs
• Domain / Hotspot Analysis
• Gene Affinity to Variant Type

Oncogenicity Scoring
Applies To Criteria -5B -3B -2B -1B +1O +2O +3O
All
Population Frequency -5 -3 -1
Homozygous in Controls -2 -1
In Somatic Catalogs +1 +2 +3
Relevant Variant Assessments -1 +2 +3
Null
Damaging LoF +1 +2
LoF are Oncogenic Mutations in Gene +1
Missense
Nearby Pathogenic Missense Variants +2
In-Frame not in Repeat Region +1
Somatic Hotspot & Active Binding Sites +1 +2
Non-Null Computational Evidence -1 +1
All Splice Site Prediction +1 +2
Non-Coding Silent, Intronic, UTR, Intergenic Variants w/ No Splice Effect -3

Germline Population Frequency
• The maximum sub-population frequency is used.
• We use gnomAD and 1000 Genomes (choosing the maximum
frequency between both catalogs)
• Our thresholds are equivalent to those used in the ACMG
Guideline automation for BA1/BS1 but there is no PM2 (+2)
for being novel (not in germline catalogs)
• Recessive genes allow for higher frequency (two-hit)
Possible Scores:
Recessive Dominant
-5B 1.00% 0.50%
-3B 0.15% 0.05%
-1B 16 individuals (all) 16 individuals (all)
-5 -1-3

Present in Controls
• Controls include 1000 genomes and gnomAD “Controls”
subset.
• Score counts of being homozygous in recessive gene
• Score counts of being heterozygous / hemizygous in a
dominant / x-linked gene respectively
Possible Scores:
Number of Individuals
-2B Multiple individual
-1B Exactly one individual
-2 -1

In Somatic Catalogs
• Will look at COSMIC, ICGC and MSK-Impact
• Total sample count (tumor type agnostic)
• Thresholds chosen to match power law of mutation
occurrence in somatic catalogs
• +2D/+3D only apply if variant < 16 AC in germline catalogs
Possible Scores: +3+2+1
# Samples (At Least) Variants in COSMIC
+1D 1 3,296,000 (100%)
+2D 5 43,000 (1.4%)
+3D 35 1,000 (0.03%)

Relevant Variant Assessments
Possible Scores:
 Classified variants
- Internal Knowledge-Base of
classified variants
- ClinVar 1+ star Likely Pathogenic /
Pathogenic
- CIViC 1+ star variants
- Other “Consortium” sources
 Score
- +3 if Pathogenic Same Change
- +2 if Pathogenic Missense Same
Codon
- -1 if Benign Scored
+3+2-1

Variant Type Specific Criteria
Groups of Variant Types:
• Null variant: frameshift, stop gain, start loss
• Previously classified mutation?
• Does mutation result in null / truncated gene product?
• Are Null variants shown to be drivers in cancer for this gene?
• Missense variants: amino acid substations and length
polymorphisms
• Previously classified amino acid (same codon)?
• In local region of previously classified variants?
• In active binding site or mutation hot-spot?
• In-silico evidence: functional prediction and splicing?
• Non-coding variants: silent mutations, intronic, utr
• Predicted to disrupt canonical splice site?
Sequencing Ontology on Current Transcript (Selectable)

Damaging LoF
The p.K1358Dfs variant occurs in the last
exon of MSH6. There are no other pathogenic
loss of function variants downstream of the
variant p.K1358Dfs.
Possible Scores: +2+1
Truncating / Null Variant Evidence:
 +1 Relative position in protein coding sequence
- Not within 50bp of penultimate exon
- Not on last exon
 +1 Previously classified variant downstream
- Any LoF variant downstream of this variant’s position
- Sources of previously classified variants:
- Internal KnowledgeBase of classified / interpreted
variant
- ClinVar 1+ star Likely Pathogenic / Pathogenic
- CIViC variants with certain evidence threshold / star-
rating
- Other “consortium” sources

LoF are Oncogenic Mutations in Gene
Possible Scores: +1
Affinity with
Gene:
 Classified variants
- 1 or more LoF
Pathogenic / Likely
Pathogenic
 Proportion of COSMIC
mutations:
- 5% of variants are LoF
 LoF CIViC Evidence
- Statement about null
variants in CIViC
- 1 Star+ rating

Nearby Pathogenic Missense Variants
Possible Scores: +2
Using Previous Classified:
 There are no benign missense
variations within three amino acids
of the variant
 There are at least two pathogenic
missense variants within six amino
acids of the variant
 The number of pathogenic missense
variants within six amino acids
exceeds the number of benign
missense variants

In-Frame Not in Repeat Region
For In-Frame Insertions / Deletions:
• +1 If the inserted sequence is not repeated two or more
times
• Considering a version of “Nearby Pathogenic Inframe
Variants” for another +1 to boost variants in inframe indel
hotspots (i.e. EGFR exon 19)
The p.A3571_V3572del variant is a in-frame
deletion of an amino acid sequence that is
repeated 2 times in the surrounding region.
Possible Scores: +1

Somatic Hotspot & Active Binding Site
Exon 15 of BRAF shows regions designated as somatic
missense mutation hotpsots as well as key activating sites
and binding site annotations
Possible Scores: +2+1
Region Tracks:
 +1 Cancer hotspots
- Single residue and in-frame indel
mutation hotspots identified in 24,592
tumor samples by the algorithm
described in [Chang et al. 2017] and
[Chang et al. 2016]
 +1 binding sites / activating / active
sites
- Curated through InterPro
- Residue annotations from CDD
- More specific than large domain
annotations

Computational Evidence
In-Silico Evidence (for Non-LoF
Variants)
• +2: 3 or 4 out of 4 splice site predictions of damaging
• +1: In-silico predictions in agreement variant is damaging &
conserved
• -1: If variant amino acid present in mammalian species
• -1: In-silico predictions in agreement that variant is tolerated
& not conserved
Synonymous / UTR / Intronic Variants
• -3: Not predicted to disrupt a canonical splice site and no
Pathogenic clinical assessment
Possible Scores: +3-1

Example: BRAF V600E
General Scoring
• +0: novel in gnomAD
• +3: Somatic catalog of 28,263 samples in COSMIC
• +3: In ClinVar as Pathogenic, in CIViC 1+ star
Missense/Computational Evidence
• +2: Nearby pathogenic variants
• +2: In Cancer Hotspot and Active Binding Site
• +1: Functional & Conservation all agree
Final Score: +11

Example: SLX4 A1461Pfs*2
General Scoring
• +0: 0.0009% (1 of 109874 European) in gnomAD
• +1: Somatic catalog of 1 sample in COSMIC
• +0: Not in ClinVar or CIViC
Loss of Function
• +2: Not at end of gene, downstream pathogenic LoF
• There are 2 downstream pathogenic loss of function variants,
with the furthest variant being 283 residues downstream of the
variant p.A1461Pfs*2.
• +1: LoF are Driver Mutation in Gene
• The p.A1461Pfs*2 variant is a loss of function variant in the gene
SLX4, which is intolerant of Loss of Function variants, as indicated
by the presence of existing pathogenic loss of function variant
NP_115820.2:p.Leu20Argfs*24 and 5 others
Final Score: +4 (Likely Oncogenic)

Example: PTCH1 C454Y
General Scoring
• +0: novel in gnomAD
• +0 : Not in Cosmic
• +0: Not in ClinVar or CIViC
Missense / Computational Evidence
• +0 : Nearby pathogenic variants
• There are no classified pathogenic variants within 6 amino acid
positions of the variant p.C454Y, providing no evidence of being in
a mutation hot spot.
• +0 : In Cancer Hotspot and Active Binding Site
• +1: Functional & Conservation all agree
Final Score: +1 (VUS)

ProjectDemonstration
*Enter any questions you have into the questions pane while we transition*

Questions & Answers
All questions will be anonymous

COVID-19 Resources
• Bundle discounts will be ending on June 15th
• SVS Imputation Module w/CADD & OMIM
• VSClinical, AMP, CNV, Sentieon Tier 1
• Small Warehouse License: VS-CNV, VSClinical+ AMP, Sentieon Tier 1,
VSReports, VSPipeline
• If you are interested in reserving one of these bundles, you can
mention this in the Questions pane now.

COVID-19 Resources
• Head to bit.ly/covid19ghi
• Articles, eBooks, home licenses,
and more!

Thank you for attending!
Pleaseletus know ifyou have any further questions by emailing
info@goldenhelix.com.
Welookforward to seeingyou onthe nextwebcast.

Evaluating Oncogenicity in VSClinical

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Evaluating Oncogenicity in VSClinical

Similar to Evaluating Oncogenicity in VSClinical (20)

More from Golden Helix

More from Golden Helix (20)

Recently uploaded

Recently uploaded (20)

Evaluating Oncogenicity in VSClinical

Editor's Notes