SlideShare a Scribd company logo
1 of 1
Download to read offline
Innovative Research for a Sustainable Future
www.epa.gov/research
Background: In Silico Metabolism and its Potential
Use in Read-Across Frameworks
Results: Predictive Performance of MetSim Tools Discussion: Diversity of Tools Can Enhance
Metabolite Recall
ORCID: 0000-0002-6357-0508
Louis Groff l groff.louis@epa.gov l 919-541-4256
MetSim: Integrated Programmatic Access and Pathway Management for Xenobiotic
Metabolism Tools
Louis Groff1, Imran Shah1, and Grace Patlewicz1
1U.S. Environmental Protection Agency, Office of Research and Development
Center for Computational Toxicology and Exposure, Research Triangle Park, NC
• Substances were assigned into
Chemical classes using Wishart
Lab ClassyFire API
• Hierarchically clustered heatmap
on left shows mean recall rate per
chemical class, scaled white to red
from 0 to 1, respectively
• Histogram on right shows number
of compounds within each
chemical class, on a log 2 scale
• Pyrimidine nucleosides, fluorenes,
pyrenes, and benzimidazoles
classes reflected the highest mean
recall rates across all tools
• Classes vinyl halides, cinnamic
acids, organic oxoanionic
compounds, and flavonoids
showed the lowest mean recall
rates observed across all tools
• Highest frequency chemical
classes include benzenes and
substituted derivatives,
azobenzenes, naphthalenes, and
organooxygen compounds
• The predictions from each tool were
evaluated on a set of literature data for
pharmaceuticals and non-steroidal anti-
inflammatory drug (NSAID) chemicals
including their phase I metabolites
• Individual recall per tool was 0.6 for
BioTransformer, 0.4 for TIMES/Toolbox,
and combined recall is 0.8
• Of the five metabolites, all three tools
captured one hydroxylated aripiprazole
metabolite
• All tools missed the oxidized metabolite
dehydroaripiprazole
References
Upcoming Publication: Groff, L. C., et al. Chem. Res. Tox. 2023, In Final Preparation.
GenRA Tool: https://comptox.epa.gov/genra/
Cheminformatics Modules Standardizer: https://hazard-dev.sciencedataexperts.com/#/stdizer
ClassyFire API: http://classyfire.wishartlab.com/
Frequency
21 22
23 24
MetSim Model
Input
Data
DTXSID
SMILES
CASRN
Organism
Rat
Human
Type
of
Metabolism
Phase I Only
Phase I &
Phase II
Metabolism
Simulator
OECD Toolbox
TIMES
BioTransformer
Simulator
Outputs
Precursor
SMILES
Metabolite
SMILES
Reactions &
Enzymes
Cheminformatics
Modules
For Precursors &
Metabolites:
SMILES
InChIKey
CASRN
DTXSID
Chemical Names
Reactions &
Enzymes
Data
Storage
Harmonize
output
formats
Store in
Mongo
Database
Methods: Metabolism Simulation (MetSim) Algorithm
• Analysis of environmental samples via high-throughput screening methods has the potential to
uncover new or data-poor chemicals, or contaminants of emerging concern (CECs)
• Read-across methodologies such as EPA’s Generalized Read-Across (GenRA) can be helpful to
identify source analogue chemicals with available toxicity information including points of departure
(POD)
• Source analogues are typically identified on the basis of structural similarity
• We seek to extend this framework to incorporate metabolism similarity between CECs and their
analogues as an additional context for analogue identification within GenRA
Recall Rate Hierarchically Clustered on Chemical Class With Histogram of Class Frequency
Conclusions and Future Directions
MetSim Algorithm Summary Flow Chart
Example: Tool Specific Performance on Aripiprazole and its Phase I Metabolites
Chemical
Class
DSSTox 700
Phase I &
Phase II
Metabolites
BioTransformer
2x Phase I &
1x Phase II
TIMES
In Vitro
TIMES
In Vivo
OECD
Toolbox
In Vitro
OECD
Toolbox
In Vivo
True Positives 266 595 648 595 679
False Positives 58357 4416 6018​ 4485​ 6937​
False Negatives 1629 1305 1249​ 1304 1222
Total Predictions 58623​ 5410 6666​ 5080 7616​
Total Precision 0.004​ 0.11 0.10​ 0.12​ 0.09​
Total Recall 0.14 0.31 0.34 0.31 0.36
Total Reported
Metabolites
1895 1895 1895​ 1895​ 1895​
MetSim Performance Metrics by Choice of Tool/Model on DSSTox-Registered Metabolites
• Dataset consists of 700 drugs and
environmental chemicals of interest to
EPA
• Regardless of tool choice, recall rate
was low
• BioTransformer generates many
predictions, but yielded the lowest recall
for this dataset
• Toolbox/TIMES generated fewer
predictions, resulting in both higher
precision and higher recall rates than
BioTransformer for this dataset
• Toolbox In Vivo Rat Simulator had the
highest recall of all tools evaluated
• All metabolism prediction tools require either an application programming interface (API) or
command line interface (CLI) for incorporation into MetSim
• All tools in this study used SMILES as an input, which can be obtained from DSSTox Substance ID
(DTXSID) searches, if registered. CAS Registry Number (CASRN) is an alternate input that can be
used to query the OECD Toolbox WebAPI
• Human tissue metabolism is handled with BioTransformer, which performs both phase I & II
metabolism
• Both OECD Toolbox and TIMES contain simulators for In Vitro and In Vivo Rat tissue metabolism.
The Toolbox only performs phase I metabolism, TIMES includes phase II metabolism as well
• All tools were applied to a dataset of 59 common drugs to predict their phase I metabolites, and 700
EPA-relevant chemicals registered in DSSTox to predict their phase I and phase II metabolites
• Outputs are aggregated into a harmonized dictionary format containing precursor and metabolite
SMILES, as well as reaction and enzyme information from BioTransformer and TIMES
• DTXSID and CASRN for a metabolite were added to the dictionary where available by querying the
EPA Cheminformatics Modules Standardizer API using metabolite SMILES
• Output dictionaries for each parent are loaded into a Mongo database for subsequent use
• TIMES and OECD Toolbox captured the second of two aripiprazole hydroxides but missed the
other three metabolites
• BioTransformer identified 7-(4-hydroxybutoxy)-3,4-dihydroquinolin-2(1H)-one and 1-(2,3-
Dichlorophenyl) piperazine metabolites but missed the remaining two metabolites
Usefulness of Tools Applied in this Study
• Low precision of BioTransformer is offset by the fact that expansive sets of predictions can be
helpful for high-throughput screening studies aimed at identifying CECs, such as Non-Targeted
Analysis (NTA), where CECs could be metabolites in an environmental sample, and later correlated
with their respective parents
• Toolbox/TIMES training datasets contain more environmentally relevant chemicals
• Our MetSim algorithm successfully harmonized inputs and outputs between different in silico
metabolism tools, to facilitate storage and interaction of prediction data within a Mongo database
• MetSim could be extended to accommodate information from other in silico tools if they possess an
API or CLI
• Individual tools can be utilized to predict metabolism in human or rat tissues
• All tools yielded low recall on the dataset of 700 drugs and environmental chemicals of interest to
EPA
• Inspection of false negative predictions on a per chemical class basis can provide useful insights on
what refinements to the transformation rule sets are merited for each tool
• Future work in progress will focus on analyzing the graphs stored within MetSim to evaluate graph
similarity with different metrics to inform metabolic similarity within read-across (MetGraph)
Motivation for Metabolism Within Read-Across
Challenges to Address for Successful Implementation into GenRA
• A variety of in silico tools are available that enable metabolites to be simulated.
• Each tool outputs its prediction information in a different format. For comparison purposes, a
harmonized output data format is required.
• No single standardized database of xenobiotic metabolism pathways of environmental chemicals
exists at present.
• Develop a database to store prediction data to facilitate analysis of metabolic graphical networks
Disclaimer: The views expressed in this presentation are those of the
authors and do not necessarily reflect the views or policies of the U.S. EPA.

More Related Content

Similar to In Silico Metabolism Tools Predict Phase I & II Metabolites

Integrative omics approches
Integrative omics approches   Integrative omics approches
Integrative omics approches Sayali Magar
 
Designing a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardDesigning a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardEMBL-ABR
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityKamel Mansouri
 
Crofton Evolution of Toxicology
Crofton Evolution of ToxicologyCrofton Evolution of Toxicology
Crofton Evolution of ToxicologyKevinCrofton
 
Metabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plantsMetabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plantsN Poorin
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Syed Muhammad Ali Hasnain
 
hit identification.pptx
hit identification.pptxhit identification.pptx
hit identification.pptxashharnomani
 
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...Ellen Berg
 
AI and Machine Learning for Secondary Metabolite Prediction
AI and Machine Learning for Secondary Metabolite PredictionAI and Machine Learning for Secondary Metabolite Prediction
AI and Machine Learning for Secondary Metabolite PredictionYannick Djoumbou
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesElia Brodsky
 
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...Chris Southan
 
computer simulations.pptx
computer simulations.pptxcomputer simulations.pptx
computer simulations.pptxDr Pavani A
 
High throughput approaches to understanding gene function and mapping archite...
High throughput approaches to understanding gene function and mapping archite...High throughput approaches to understanding gene function and mapping archite...
High throughput approaches to understanding gene function and mapping archite...Tintumann
 
Bio Scope
Bio ScopeBio Scope
Bio ScopeStartup
 

Similar to In Silico Metabolism Tools Predict Phase I & II Metabolites (20)

MORPH-R article
MORPH-R articleMORPH-R article
MORPH-R article
 
HTS
HTSHTS
HTS
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
Integrative omics approches
Integrative omics approches   Integrative omics approches
Integrative omics approches
 
Designing a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardDesigning a community resource - Sandra Orchard
Designing a community resource - Sandra Orchard
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
 
Crofton Evolution of Toxicology
Crofton Evolution of ToxicologyCrofton Evolution of Toxicology
Crofton Evolution of Toxicology
 
Metabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plantsMetabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plants
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...
 
hit identification.pptx
hit identification.pptxhit identification.pptx
hit identification.pptx
 
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
 
AI and Machine Learning for Secondary Metabolite Prediction
AI and Machine Learning for Secondary Metabolite PredictionAI and Machine Learning for Secondary Metabolite Prediction
AI and Machine Learning for Secondary Metabolite Prediction
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and Approaches
 
T-bioinfo overview
T-bioinfo overviewT-bioinfo overview
T-bioinfo overview
 
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
 
computer simulations.pptx
computer simulations.pptxcomputer simulations.pptx
computer simulations.pptx
 
High throughput approaches to understanding gene function and mapping archite...
High throughput approaches to understanding gene function and mapping archite...High throughput approaches to understanding gene function and mapping archite...
High throughput approaches to understanding gene function and mapping archite...
 
Metabolomics seminarslides 013111final 110201
Metabolomics seminarslides 013111final 110201Metabolomics seminarslides 013111final 110201
Metabolomics seminarslides 013111final 110201
 
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
ChemSpider: Connecting Chemistry & Mass Spectrometry on the Internet
 
Bio Scope
Bio ScopeBio Scope
Bio Scope
 

Recently uploaded

G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionPriyansha Singh
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 

Recently uploaded (20)

The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorption
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptx
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 

In Silico Metabolism Tools Predict Phase I & II Metabolites

  • 1. Innovative Research for a Sustainable Future www.epa.gov/research Background: In Silico Metabolism and its Potential Use in Read-Across Frameworks Results: Predictive Performance of MetSim Tools Discussion: Diversity of Tools Can Enhance Metabolite Recall ORCID: 0000-0002-6357-0508 Louis Groff l groff.louis@epa.gov l 919-541-4256 MetSim: Integrated Programmatic Access and Pathway Management for Xenobiotic Metabolism Tools Louis Groff1, Imran Shah1, and Grace Patlewicz1 1U.S. Environmental Protection Agency, Office of Research and Development Center for Computational Toxicology and Exposure, Research Triangle Park, NC • Substances were assigned into Chemical classes using Wishart Lab ClassyFire API • Hierarchically clustered heatmap on left shows mean recall rate per chemical class, scaled white to red from 0 to 1, respectively • Histogram on right shows number of compounds within each chemical class, on a log 2 scale • Pyrimidine nucleosides, fluorenes, pyrenes, and benzimidazoles classes reflected the highest mean recall rates across all tools • Classes vinyl halides, cinnamic acids, organic oxoanionic compounds, and flavonoids showed the lowest mean recall rates observed across all tools • Highest frequency chemical classes include benzenes and substituted derivatives, azobenzenes, naphthalenes, and organooxygen compounds • The predictions from each tool were evaluated on a set of literature data for pharmaceuticals and non-steroidal anti- inflammatory drug (NSAID) chemicals including their phase I metabolites • Individual recall per tool was 0.6 for BioTransformer, 0.4 for TIMES/Toolbox, and combined recall is 0.8 • Of the five metabolites, all three tools captured one hydroxylated aripiprazole metabolite • All tools missed the oxidized metabolite dehydroaripiprazole References Upcoming Publication: Groff, L. C., et al. Chem. Res. Tox. 2023, In Final Preparation. GenRA Tool: https://comptox.epa.gov/genra/ Cheminformatics Modules Standardizer: https://hazard-dev.sciencedataexperts.com/#/stdizer ClassyFire API: http://classyfire.wishartlab.com/ Frequency 21 22 23 24 MetSim Model Input Data DTXSID SMILES CASRN Organism Rat Human Type of Metabolism Phase I Only Phase I & Phase II Metabolism Simulator OECD Toolbox TIMES BioTransformer Simulator Outputs Precursor SMILES Metabolite SMILES Reactions & Enzymes Cheminformatics Modules For Precursors & Metabolites: SMILES InChIKey CASRN DTXSID Chemical Names Reactions & Enzymes Data Storage Harmonize output formats Store in Mongo Database Methods: Metabolism Simulation (MetSim) Algorithm • Analysis of environmental samples via high-throughput screening methods has the potential to uncover new or data-poor chemicals, or contaminants of emerging concern (CECs) • Read-across methodologies such as EPA’s Generalized Read-Across (GenRA) can be helpful to identify source analogue chemicals with available toxicity information including points of departure (POD) • Source analogues are typically identified on the basis of structural similarity • We seek to extend this framework to incorporate metabolism similarity between CECs and their analogues as an additional context for analogue identification within GenRA Recall Rate Hierarchically Clustered on Chemical Class With Histogram of Class Frequency Conclusions and Future Directions MetSim Algorithm Summary Flow Chart Example: Tool Specific Performance on Aripiprazole and its Phase I Metabolites Chemical Class DSSTox 700 Phase I & Phase II Metabolites BioTransformer 2x Phase I & 1x Phase II TIMES In Vitro TIMES In Vivo OECD Toolbox In Vitro OECD Toolbox In Vivo True Positives 266 595 648 595 679 False Positives 58357 4416 6018​ 4485​ 6937​ False Negatives 1629 1305 1249​ 1304 1222 Total Predictions 58623​ 5410 6666​ 5080 7616​ Total Precision 0.004​ 0.11 0.10​ 0.12​ 0.09​ Total Recall 0.14 0.31 0.34 0.31 0.36 Total Reported Metabolites 1895 1895 1895​ 1895​ 1895​ MetSim Performance Metrics by Choice of Tool/Model on DSSTox-Registered Metabolites • Dataset consists of 700 drugs and environmental chemicals of interest to EPA • Regardless of tool choice, recall rate was low • BioTransformer generates many predictions, but yielded the lowest recall for this dataset • Toolbox/TIMES generated fewer predictions, resulting in both higher precision and higher recall rates than BioTransformer for this dataset • Toolbox In Vivo Rat Simulator had the highest recall of all tools evaluated • All metabolism prediction tools require either an application programming interface (API) or command line interface (CLI) for incorporation into MetSim • All tools in this study used SMILES as an input, which can be obtained from DSSTox Substance ID (DTXSID) searches, if registered. CAS Registry Number (CASRN) is an alternate input that can be used to query the OECD Toolbox WebAPI • Human tissue metabolism is handled with BioTransformer, which performs both phase I & II metabolism • Both OECD Toolbox and TIMES contain simulators for In Vitro and In Vivo Rat tissue metabolism. The Toolbox only performs phase I metabolism, TIMES includes phase II metabolism as well • All tools were applied to a dataset of 59 common drugs to predict their phase I metabolites, and 700 EPA-relevant chemicals registered in DSSTox to predict their phase I and phase II metabolites • Outputs are aggregated into a harmonized dictionary format containing precursor and metabolite SMILES, as well as reaction and enzyme information from BioTransformer and TIMES • DTXSID and CASRN for a metabolite were added to the dictionary where available by querying the EPA Cheminformatics Modules Standardizer API using metabolite SMILES • Output dictionaries for each parent are loaded into a Mongo database for subsequent use • TIMES and OECD Toolbox captured the second of two aripiprazole hydroxides but missed the other three metabolites • BioTransformer identified 7-(4-hydroxybutoxy)-3,4-dihydroquinolin-2(1H)-one and 1-(2,3- Dichlorophenyl) piperazine metabolites but missed the remaining two metabolites Usefulness of Tools Applied in this Study • Low precision of BioTransformer is offset by the fact that expansive sets of predictions can be helpful for high-throughput screening studies aimed at identifying CECs, such as Non-Targeted Analysis (NTA), where CECs could be metabolites in an environmental sample, and later correlated with their respective parents • Toolbox/TIMES training datasets contain more environmentally relevant chemicals • Our MetSim algorithm successfully harmonized inputs and outputs between different in silico metabolism tools, to facilitate storage and interaction of prediction data within a Mongo database • MetSim could be extended to accommodate information from other in silico tools if they possess an API or CLI • Individual tools can be utilized to predict metabolism in human or rat tissues • All tools yielded low recall on the dataset of 700 drugs and environmental chemicals of interest to EPA • Inspection of false negative predictions on a per chemical class basis can provide useful insights on what refinements to the transformation rule sets are merited for each tool • Future work in progress will focus on analyzing the graphs stored within MetSim to evaluate graph similarity with different metrics to inform metabolic similarity within read-across (MetGraph) Motivation for Metabolism Within Read-Across Challenges to Address for Successful Implementation into GenRA • A variety of in silico tools are available that enable metabolites to be simulated. • Each tool outputs its prediction information in a different format. For comparison purposes, a harmonized output data format is required. • No single standardized database of xenobiotic metabolism pathways of environmental chemicals exists at present. • Develop a database to store prediction data to facilitate analysis of metabolic graphical networks Disclaimer: The views expressed in this presentation are those of the authors and do not necessarily reflect the views or policies of the U.S. EPA.