SlideShare a Scribd company logo
Virtual screening of chemicals for endocrine disrupting activity through
CERAPP and CoMPARA projects
Kamel Mansouri1, Nicole Kleinstreuer2, Chris Grulke3, Ann Richard3, Imran Shah3, Antony J. Williams3 and Richard S. Judson3
1 ScitoVation LLC, RTP, NC, USA; 2 NTP Interagency Center for the Evaluation of AlternativeToxicological Methods, RTP, NC, USA; 3 National Center for ComputationalToxicology, U.S. EPA, RTP, NC, USA
Introduction
Endocrine disrupting chemicals (EDCs) are xenobiotics that mimic the interaction of natural hormones at the
receptor level and alter synthesis, transport and metabolism pathways. The prospect of EDCs causing
adverse health effects in humans and wildlife has led to the development of scientific and regulatory
approaches for evaluating bioactivity. This need is being partially addressed by the use of high-throughput
screening (HTS) in vitro approaches and computational modeling. In the framework of the Endocrine
Disruptor Screening Program (EDSP), the U.S. EPA led two worldwide consortiums to “virtually” (i.e., in
silico) screen chemicals for their potential estrogenic and androgenic activities. The Collaborative Estrogen
Receptor (ER) Activity Prediction Project (CERAPP) [1] predicted activities for 32,464 chemicals and the
Collaborative Modeling Project for Androgen Receptor (AR) Activity (CoMPARA) generated predictions on the
CERAPP list with additional simulated metabolites, totaling 55,450 unique structures. Modelers and
computational toxicology scientists from 30 international groups contributed structure-based models and
results for activity prediction to one or both projects, with methods ranging from QSARs to docking to
predict binding, agonism and antagonism activities. Models were based on a common training set of 1746
chemicals having ToxCast/Tox21 HTS in vitro assay results (18 assays for ER and 11 for AR) integrated into
computational networks. The models were then validated using curated literature data from different sources
(~7,000 results for ER and ~5,000 results for AR). To overcome the limitations of single approaches,
CERAPP and CoMPARA models were each combined into consensus models reaching high predictive accuracy.
These consensus models were extended beyond the initially designed datasets by implementing them into
the free and open-source application OPERA to avoid running every single model on new chemicals [2]. This
implementation was used to screen the entire EPA DSSTox database of ~750,000 chemicals and predicted
ER and AR activity is made freely available on the CompTox Chemistry dashboard
(https://comptox.epa.gov/dashboard) [3].
METHODS AND RESULTS
ACKNOWLEDGEMENTS
REFERENCES
This work received funding from:
• Participation Program at the EPA/ORD, administered by the Oak Ridge Institute for Science and Education
through an interagency agreement between the U.S. Department of Energy and EPA.
• The Lush Prize 2017, category of young researchers (Americas), supporting animal free testing.
Disclaimer: The views expressed in this presentation are those of the authors and do not necessarily represent the U.S. EPA or NIEHS policies.
Project planning
1) Mansouri, K. et al. CERAPP: Collaborative estrogen receptor activity prediction project.
Environ. Health. Perspect. 2016; 124: 1023-1033.
2) Mansouri, K. et al. OPERA models for predicting physicochemical properties and
environmental fate endpoints. Journal of Cheminformatics 2018 In Press, DOI:
10.1186/s13321-018-0263-1.
3) Williams, A. J. et al. The CompTox Chemistry Dashboard: a community data resource for
environmental chemistry. Journal of Cheminformatics 2017, 9, (1), 61.
4) Judson, R. et al. Integrated model of chemical perturbations of a biological pathway using 18
in vitro high-throughput screening assays for the estrogen receptor. Toxicological Sciences.
2015; 148: 137-154.
5) Kleinstreuer N. C. et al. Development and Validation of a Computational Model for Androgen
Receptor Activity. Chemical Research in Toxicology 2017 30 (4), 946-964.
Steps Tasks
1: Training and prioritization sets
Organizers
- ToxCast assays for training set data
- AUC values and discrete classes for continuous/classification modeling
- QSAR-ready training set and prioritization set
2: Experimental validation set
Organizers
- Collect and clean experimental data from the literature
- Prepare validation sets for qualitative and quantitative models
3: Modeling & predictions
All participants
- Train/refine the models based on the training set
- Deliver predictions and applicability domains for evaluation
4: Model evaluation
Organizers
- Evaluate the predictions of each model separately
- Assign a score for each model based on the evaluation step
5: Consensus modeling
Organizers
- Use the weighting scheme based on the scores to generate the consensus
- Use the same validation set to evaluate consensus predictions
6: Manuscript writing
All participants
- Descriptions of modeling approaches for each individual model
- Input of the participants on the draft of the manuscript
Participants
Tox21/ToxCast ER Pathway Model [4] Tox21/ToxCast AR Pathway Model [5]
Binding Agonist Antagonist
Training Validation Training Validation Training Validation
Sn 0.93 0.58 0.85 0.94 0.67 0.18
Sp 0.97 0.92 0.98 0.94 0.94 0.90
BA 0.95 0.75 0.92 0.94 0.80 0.54
Group ID Institution Country CERAPP CoMPARA
ATSDR Agency for Toxic Substances and Disease Registry. CDC. USA 
DTU Technical University of Denmark. Denmark  
ECUST East China University of Science and Technology. China 
IBMC Institute of Biomedical Chemistry. Russia 
IDEA IdeaConsult, Ltd. Bulgaria 
ILS Integrated Laboratory Systems, Inc. USA 
INSLA University of Insubria. Italy 
INSLA Lanzhou University. China
IRFMN Istituto di Ricerche Farmacologiche “Mario Negri”. Italy  
JRC Joint Research Centre of the European Commission. Italy 
LM Lockheed Martin IS&GS. USA  
MTI Molecules Theurapetiques In silico. France 
NCATS National Center for Advancing Translational Sciences. USA  
NCCT National Center for Computational Toxicology. EPA. USA  
NCI National Cancer Institute. USA 
NCSU North Carolina State University. USA  
NCTR National Center for Toxicological Research. FDA. USA  
NICEATM
NTP Interagency Center for the Evaluation of Alternative
Toxicological Methods.
USA  
NRMRL National Risk Management Research Laboratory. EPA. USA 
RIFM Research Institute for Fragrance Materials, Inc. USA 
SWETOX Swedish toxicology research center. Sweden 
TARTU University of Tartu. Estonia 
TUM Technical University Munich. Germany  
UFG Federal University of Golas. Brazil 
UMEA University of UMEA. Sweden  
UNC University of North Carolina. USA  
UNIBA University of Bari. Italy  
UNIMIB University of Milano-Bicocca. Italy  
UNISTRA University of Strasbourg. France  
VCCLAB Virtual Computational Chemistry Laboratory. Germany  
Categorical models CERAPP validation scores CoMPARA validation scores
Group ID Binding Agonist Antagonist Binding Agonist Antagonist
ATSDR_IRFMN_1 0.69
ATSDR_IRFMN_2 0.8
ATSDR_IRFMN_3 0.85
DTU_1 0.80 0.83 0.85 0.90 0.80
DTU_2 0.87
ECUST 0.67
IBMC_1 0.83 0.88 0.79
IBMC_2 0.86 0.87 0.86
IDEA 0.82
INSLA 0.82 0.88 0.80
IRFMN_1 0.77 0.70
IRFMN_2 0.77
JRC 0.74 0.81 0.66
LM_1 0.75 0.83 0.70 0.86 0.91 0.84
LM_2 0.70 0.77 0.62
MTI 0.69 0.69 0.73
NCATS 0.67 0.66 0.79 0.58
NCCT_1 0.78 0.73 0.72
NCCT_2 0.85 0.84
NCCT_3 0.57
NCI_1 0.84 0.91 0.77
NCI_2 0.76 0.83 0.64
NCSU 0.75 0.86 0.70
NCTR_1 0.84 0.89 0.78 0.84 0.89 0.82
NCTR_2 0.66
ILS_NICEATM 0.79
NRMRL_1 0.82 0.86 0.83
NRMRL_2 0.79 0.88 0.80
RIFM 0.69
SWETOX_1 0.85 0.88 0.85
SWETOX_2 0.86 0.88 0.81
TARTU_1 0.85 0.88 0.78
TARTU_2 0.84 0.89 0.82
TUM 0.66 0.88 0.75
UFG 0.73 0.89 0.76
UMEA 0.76 0.88 0.92 0.77
UNC 0.73 0.82
UNIBA 0.80 0.84 0.78 0.68
UNIMIB_1 0.68 0.51
UNIMIB_2 0.85
UNISTRA 0.73 0.79 0.77 0.88 0.67
VCCLAB 0.80 0.87 0.69 0.86 0.88 0.76
CERAPP participants models CoMPARA participants models
Categorical Continuous Total Categorical Continuous Total
Binding 21 3 24 32 5 37
Agonist 11 3 14 20 5 25
Antagonist 8 2 10 22 3 25
Total 40 8 48 74 13 87
ER validation data AR validation data
Active Inactive Total Active Inactive Total
Binding 1982 5301 7283 453 3429 3882
Agonist 350 5969 6319 167 4672 4839
Antagonist 284 6255 6539 355 3685 4040
Total 2017 7024 7522 487 4928 5273
ER training data AR training data
Active Inactive Total Active Inactive Total
Binding 237 1440 1677 198 1464 1662
Agonist 219 1458 1677 43 1616 1659
Antagonist 41 1636 1677 159 1366 1525
Total 237 1440 1677 198 1648 1688
Concordance of binding models on the active compounds of the prediction sets.
High general concordance across all binding models included in the consensus.
Distributions of the number of the predicted chemical structures by all binding models.
CERAPP CoMPARA
Active Inactive Active Inactive
Binding 4001 28463 8202 40656
Agonist 2475 29989 1764 47094
Antagonist 2793 29671 9899 38959
Total 4001 28463 10623 47613
ToxCast metabolites
Active Inactive
Binding 1609 4983
Agonist 428 6164
Antagonist 1820 4772
Total 1989 6325
Binding Agonist Antagonist
Training Validation Training Validation Training Validation
Sn 0.99 0.69 0.95 0.74 1.00 0.61
Sp 0.91 0.87 0.98 0.97 0.95 0.87
BA 0.95 0.78 0.97 0.86 0.97 0.74
CERAPP consensus validation CoMPARA consensus validation
Consensus models
The lists of chemicals prioritized for ER and AR activity:
• CERAPP list: 32,464 unique QSAR-ready structures
(standardized, organic, no mixtures…)
–EDSP Universe (10k)
–Chemicals with known use (40k) (CPCat & ACToR)
–Canadian Domestic Substances List (DSL) (23k)
–EPA DSSTox (version 1)– structures of EPA/FDA interest
(15k)
–ToxCast and Tox21 (In vitro ER data) (8k)
• CoMPARA: 55,450 unique QSAR-ready structures
–CERAPP list (32k)
– EINECS European Inventory (60k)
–ToxCast metabolites (6k)
757 chemicals
have >75%
positive
concordance
Most models predict most
chemicals as ER non-binders
1772 chemicals
have >75%
positive
concordance
Most models predict most
chemicals as AR non-binders
These consensus models predictions where model concordance is higher
than 85% are being implemented into the free and open-source application
OPERA in order to prioritize additional chemicals.
CERAPP models CoMPARA models
CoMPARA predictions
The score formula: 𝑆 = 0.3 ∗ 𝑔𝑜𝑜𝑑𝑛𝑒𝑠𝑠 𝑜𝑓 𝑓𝑖𝑡 + 0.45 ∗ 𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑖𝑣𝑖𝑡𝑦 + 0.25 ∗ (𝑟𝑜𝑏𝑢𝑠𝑡𝑛𝑒𝑠𝑠). Where the goodness of fit (training set) and
the predictivity (validation set) are functions of the balanced accuracy (BA) and the balance between the specificity (Sp) and Sensitivity (Sn):
0.7 ∗ 𝐵𝐴 + 0.3 ∗ (1 − 𝑆𝑛 − 𝑆𝑝 ). Robustness of the model is the balance between training and validation set statistics: 1 − 𝐵𝐴 𝑇𝑟 − 𝐵𝐴 𝑡𝑒𝑠𝑡 .

More Related Content

What's hot

Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Virtual screening of chemicals for endocrine disrupting activity: Case studie...
Virtual screening of chemicals for endocrine disrupting activity: Case studie...Virtual screening of chemicals for endocrine disrupting activity: Case studie...
Virtual screening of chemicals for endocrine disrupting activity: Case studie...
Kamel Mansouri
 
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
Kamel Mansouri
 
Data drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistryData drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistry
Ann-Marie Roche
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Exploiting enhanced non-testing approaches to meet the needs for sustainable ...
Exploiting enhanced non-testing approaches to meet the needs for sustainable ...Exploiting enhanced non-testing approaches to meet the needs for sustainable ...
Exploiting enhanced non-testing approaches to meet the needs for sustainable ...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Kamel Mansouri
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
New Approach Methods - What is That?
New Approach Methods - What is That?New Approach Methods - What is That?
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

What's hot (20)

Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Virtual screening of chemicals for endocrine disrupting activity: Case studie...
Virtual screening of chemicals for endocrine disrupting activity: Case studie...Virtual screening of chemicals for endocrine disrupting activity: Case studie...
Virtual screening of chemicals for endocrine disrupting activity: Case studie...
 
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
 
Data drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistryData drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistry
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Exploiting enhanced non-testing approaches to meet the needs for sustainable ...
Exploiting enhanced non-testing approaches to meet the needs for sustainable ...Exploiting enhanced non-testing approaches to meet the needs for sustainable ...
Exploiting enhanced non-testing approaches to meet the needs for sustainable ...
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
New Approach Methods - What is That?
New Approach Methods - What is That?New Approach Methods - What is That?
New Approach Methods - What is That?
 
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
 
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
 
The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...
 
How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...
 
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
 

Similar to Virtual screening of chemicals for endocrine disrupting activity through CERAPP and CoMPARA projects. SOT 2018 (San Antonio)

CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
Kamel Mansouri
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Kamel Mansouri
 
Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...
Sunghwan Kim
 
Unc slides on computational toxicology
Unc slides on computational toxicologyUnc slides on computational toxicology
Unc slides on computational toxicology
Sean Ekins
 
International Computational Collaborations to Solve Toxicology Problems
International Computational Collaborations to Solve Toxicology ProblemsInternational Computational Collaborations to Solve Toxicology Problems
International Computational Collaborations to Solve Toxicology Problems
Kamel Mansouri
 
Alternative methods to animal toxicity testing
Alternative methods to animal toxicity testingAlternative methods to animal toxicity testing
Alternative methods to animal toxicity testing
priyachhikara1
 
Using available tools for tiered assessments and rapid MoE
Using available tools for tiered assessments and rapid MoEUsing available tools for tiered assessments and rapid MoE
Using available tools for tiered assessments and rapid MoE
RebeccaClewell
 
Predictive tox proposal_ver1
Predictive tox proposal_ver1Predictive tox proposal_ver1
Predictive tox proposal_ver1
Ankur Khanna
 
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsDevelopment and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Sean Ekins
 
Nc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologyNc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational Toxicology
Sean Ekins
 
QSAR Modeling.pdf
QSAR Modeling.pdfQSAR Modeling.pdf
QSAR Modeling.pdf
NavnitKumarMishra
 
presentation on in silico studies
presentation on in silico studiespresentation on in silico studies
presentation on in silico studies
Shaik Sana
 
Basics of QSAR Modeling by Prof Rahul D. Jawarkar.pptx
Basics of QSAR Modeling by Prof Rahul D. Jawarkar.pptxBasics of QSAR Modeling by Prof Rahul D. Jawarkar.pptx
Basics of QSAR Modeling by Prof Rahul D. Jawarkar.pptx
Rahul Jawarkar
 
A comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction modelsA comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction models
Andrew McEachran
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpoints
Valery Tkachenko
 
SOT short course on computational toxicology
SOT short course on computational toxicology SOT short course on computational toxicology
SOT short course on computational toxicology
Sean Ekins
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Sean Ekins
 
Metabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plantsMetabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plants
N Poorin
 
In silico softwares
In silico softwaresIn silico softwares
In silico softwares
Sagar Savale
 
Ovanes Mekenyan - Lush Prize Conference 2014
Ovanes Mekenyan - Lush Prize Conference 2014Ovanes Mekenyan - Lush Prize Conference 2014
Ovanes Mekenyan - Lush Prize Conference 2014
LushPrize
 

Similar to Virtual screening of chemicals for endocrine disrupting activity through CERAPP and CoMPARA projects. SOT 2018 (San Antonio) (20)

CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
 
Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...
 
Unc slides on computational toxicology
Unc slides on computational toxicologyUnc slides on computational toxicology
Unc slides on computational toxicology
 
International Computational Collaborations to Solve Toxicology Problems
International Computational Collaborations to Solve Toxicology ProblemsInternational Computational Collaborations to Solve Toxicology Problems
International Computational Collaborations to Solve Toxicology Problems
 
Alternative methods to animal toxicity testing
Alternative methods to animal toxicity testingAlternative methods to animal toxicity testing
Alternative methods to animal toxicity testing
 
Using available tools for tiered assessments and rapid MoE
Using available tools for tiered assessments and rapid MoEUsing available tools for tiered assessments and rapid MoE
Using available tools for tiered assessments and rapid MoE
 
Predictive tox proposal_ver1
Predictive tox proposal_ver1Predictive tox proposal_ver1
Predictive tox proposal_ver1
 
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsDevelopment and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
 
Nc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologyNc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational Toxicology
 
QSAR Modeling.pdf
QSAR Modeling.pdfQSAR Modeling.pdf
QSAR Modeling.pdf
 
presentation on in silico studies
presentation on in silico studiespresentation on in silico studies
presentation on in silico studies
 
Basics of QSAR Modeling by Prof Rahul D. Jawarkar.pptx
Basics of QSAR Modeling by Prof Rahul D. Jawarkar.pptxBasics of QSAR Modeling by Prof Rahul D. Jawarkar.pptx
Basics of QSAR Modeling by Prof Rahul D. Jawarkar.pptx
 
A comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction modelsA comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction models
 
Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpoints
 
SOT short course on computational toxicology
SOT short course on computational toxicology SOT short course on computational toxicology
SOT short course on computational toxicology
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
 
Metabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plantsMetabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plants
 
In silico softwares
In silico softwaresIn silico softwares
In silico softwares
 
Ovanes Mekenyan - Lush Prize Conference 2014
Ovanes Mekenyan - Lush Prize Conference 2014Ovanes Mekenyan - Lush Prize Conference 2014
Ovanes Mekenyan - Lush Prize Conference 2014
 

More from Kamel Mansouri

Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)
Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)
Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)
Kamel Mansouri
 
Scoring and ranking of metabolic trees to computationally prioritize chemical...
Scoring and ranking of metabolic trees to computationally prioritize chemical...Scoring and ranking of metabolic trees to computationally prioritize chemical...
Scoring and ranking of metabolic trees to computationally prioritize chemical...
Kamel Mansouri
 
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
Kamel Mansouri
 
In-silico study of ToxCast GPCR assays by quantitative structure-activity rel...
In-silico study of ToxCast GPCR assays by quantitative structure-activity rel...In-silico study of ToxCast GPCR assays by quantitative structure-activity rel...
In-silico study of ToxCast GPCR assays by quantitative structure-activity rel...
Kamel Mansouri
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
Kamel Mansouri
 
In-silico structure activity relationship study of toxicity endpoints by QSAR...
In-silico structure activity relationship study of toxicity endpoints by QSAR...In-silico structure activity relationship study of toxicity endpoints by QSAR...
In-silico structure activity relationship study of toxicity endpoints by QSAR...
Kamel Mansouri
 

More from Kamel Mansouri (6)

Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)
Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)
Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)
 
Scoring and ranking of metabolic trees to computationally prioritize chemical...
Scoring and ranking of metabolic trees to computationally prioritize chemical...Scoring and ranking of metabolic trees to computationally prioritize chemical...
Scoring and ranking of metabolic trees to computationally prioritize chemical...
 
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
 
In-silico study of ToxCast GPCR assays by quantitative structure-activity rel...
In-silico study of ToxCast GPCR assays by quantitative structure-activity rel...In-silico study of ToxCast GPCR assays by quantitative structure-activity rel...
In-silico study of ToxCast GPCR assays by quantitative structure-activity rel...
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
In-silico structure activity relationship study of toxicity endpoints by QSAR...
In-silico structure activity relationship study of toxicity endpoints by QSAR...In-silico structure activity relationship study of toxicity endpoints by QSAR...
In-silico structure activity relationship study of toxicity endpoints by QSAR...
 

Recently uploaded

Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
RASHMI M G
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
Hitesh Sikarwar
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
AbdullaAlAsif1
 

Recently uploaded (20)

Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
 

Virtual screening of chemicals for endocrine disrupting activity through CERAPP and CoMPARA projects. SOT 2018 (San Antonio)

  • 1. Virtual screening of chemicals for endocrine disrupting activity through CERAPP and CoMPARA projects Kamel Mansouri1, Nicole Kleinstreuer2, Chris Grulke3, Ann Richard3, Imran Shah3, Antony J. Williams3 and Richard S. Judson3 1 ScitoVation LLC, RTP, NC, USA; 2 NTP Interagency Center for the Evaluation of AlternativeToxicological Methods, RTP, NC, USA; 3 National Center for ComputationalToxicology, U.S. EPA, RTP, NC, USA Introduction Endocrine disrupting chemicals (EDCs) are xenobiotics that mimic the interaction of natural hormones at the receptor level and alter synthesis, transport and metabolism pathways. The prospect of EDCs causing adverse health effects in humans and wildlife has led to the development of scientific and regulatory approaches for evaluating bioactivity. This need is being partially addressed by the use of high-throughput screening (HTS) in vitro approaches and computational modeling. In the framework of the Endocrine Disruptor Screening Program (EDSP), the U.S. EPA led two worldwide consortiums to “virtually” (i.e., in silico) screen chemicals for their potential estrogenic and androgenic activities. The Collaborative Estrogen Receptor (ER) Activity Prediction Project (CERAPP) [1] predicted activities for 32,464 chemicals and the Collaborative Modeling Project for Androgen Receptor (AR) Activity (CoMPARA) generated predictions on the CERAPP list with additional simulated metabolites, totaling 55,450 unique structures. Modelers and computational toxicology scientists from 30 international groups contributed structure-based models and results for activity prediction to one or both projects, with methods ranging from QSARs to docking to predict binding, agonism and antagonism activities. Models were based on a common training set of 1746 chemicals having ToxCast/Tox21 HTS in vitro assay results (18 assays for ER and 11 for AR) integrated into computational networks. The models were then validated using curated literature data from different sources (~7,000 results for ER and ~5,000 results for AR). To overcome the limitations of single approaches, CERAPP and CoMPARA models were each combined into consensus models reaching high predictive accuracy. These consensus models were extended beyond the initially designed datasets by implementing them into the free and open-source application OPERA to avoid running every single model on new chemicals [2]. This implementation was used to screen the entire EPA DSSTox database of ~750,000 chemicals and predicted ER and AR activity is made freely available on the CompTox Chemistry dashboard (https://comptox.epa.gov/dashboard) [3]. METHODS AND RESULTS ACKNOWLEDGEMENTS REFERENCES This work received funding from: • Participation Program at the EPA/ORD, administered by the Oak Ridge Institute for Science and Education through an interagency agreement between the U.S. Department of Energy and EPA. • The Lush Prize 2017, category of young researchers (Americas), supporting animal free testing. Disclaimer: The views expressed in this presentation are those of the authors and do not necessarily represent the U.S. EPA or NIEHS policies. Project planning 1) Mansouri, K. et al. CERAPP: Collaborative estrogen receptor activity prediction project. Environ. Health. Perspect. 2016; 124: 1023-1033. 2) Mansouri, K. et al. OPERA models for predicting physicochemical properties and environmental fate endpoints. Journal of Cheminformatics 2018 In Press, DOI: 10.1186/s13321-018-0263-1. 3) Williams, A. J. et al. The CompTox Chemistry Dashboard: a community data resource for environmental chemistry. Journal of Cheminformatics 2017, 9, (1), 61. 4) Judson, R. et al. Integrated model of chemical perturbations of a biological pathway using 18 in vitro high-throughput screening assays for the estrogen receptor. Toxicological Sciences. 2015; 148: 137-154. 5) Kleinstreuer N. C. et al. Development and Validation of a Computational Model for Androgen Receptor Activity. Chemical Research in Toxicology 2017 30 (4), 946-964. Steps Tasks 1: Training and prioritization sets Organizers - ToxCast assays for training set data - AUC values and discrete classes for continuous/classification modeling - QSAR-ready training set and prioritization set 2: Experimental validation set Organizers - Collect and clean experimental data from the literature - Prepare validation sets for qualitative and quantitative models 3: Modeling & predictions All participants - Train/refine the models based on the training set - Deliver predictions and applicability domains for evaluation 4: Model evaluation Organizers - Evaluate the predictions of each model separately - Assign a score for each model based on the evaluation step 5: Consensus modeling Organizers - Use the weighting scheme based on the scores to generate the consensus - Use the same validation set to evaluate consensus predictions 6: Manuscript writing All participants - Descriptions of modeling approaches for each individual model - Input of the participants on the draft of the manuscript Participants Tox21/ToxCast ER Pathway Model [4] Tox21/ToxCast AR Pathway Model [5] Binding Agonist Antagonist Training Validation Training Validation Training Validation Sn 0.93 0.58 0.85 0.94 0.67 0.18 Sp 0.97 0.92 0.98 0.94 0.94 0.90 BA 0.95 0.75 0.92 0.94 0.80 0.54 Group ID Institution Country CERAPP CoMPARA ATSDR Agency for Toxic Substances and Disease Registry. CDC. USA  DTU Technical University of Denmark. Denmark   ECUST East China University of Science and Technology. China  IBMC Institute of Biomedical Chemistry. Russia  IDEA IdeaConsult, Ltd. Bulgaria  ILS Integrated Laboratory Systems, Inc. USA  INSLA University of Insubria. Italy  INSLA Lanzhou University. China IRFMN Istituto di Ricerche Farmacologiche “Mario Negri”. Italy   JRC Joint Research Centre of the European Commission. Italy  LM Lockheed Martin IS&GS. USA   MTI Molecules Theurapetiques In silico. France  NCATS National Center for Advancing Translational Sciences. USA   NCCT National Center for Computational Toxicology. EPA. USA   NCI National Cancer Institute. USA  NCSU North Carolina State University. USA   NCTR National Center for Toxicological Research. FDA. USA   NICEATM NTP Interagency Center for the Evaluation of Alternative Toxicological Methods. USA   NRMRL National Risk Management Research Laboratory. EPA. USA  RIFM Research Institute for Fragrance Materials, Inc. USA  SWETOX Swedish toxicology research center. Sweden  TARTU University of Tartu. Estonia  TUM Technical University Munich. Germany   UFG Federal University of Golas. Brazil  UMEA University of UMEA. Sweden   UNC University of North Carolina. USA   UNIBA University of Bari. Italy   UNIMIB University of Milano-Bicocca. Italy   UNISTRA University of Strasbourg. France   VCCLAB Virtual Computational Chemistry Laboratory. Germany   Categorical models CERAPP validation scores CoMPARA validation scores Group ID Binding Agonist Antagonist Binding Agonist Antagonist ATSDR_IRFMN_1 0.69 ATSDR_IRFMN_2 0.8 ATSDR_IRFMN_3 0.85 DTU_1 0.80 0.83 0.85 0.90 0.80 DTU_2 0.87 ECUST 0.67 IBMC_1 0.83 0.88 0.79 IBMC_2 0.86 0.87 0.86 IDEA 0.82 INSLA 0.82 0.88 0.80 IRFMN_1 0.77 0.70 IRFMN_2 0.77 JRC 0.74 0.81 0.66 LM_1 0.75 0.83 0.70 0.86 0.91 0.84 LM_2 0.70 0.77 0.62 MTI 0.69 0.69 0.73 NCATS 0.67 0.66 0.79 0.58 NCCT_1 0.78 0.73 0.72 NCCT_2 0.85 0.84 NCCT_3 0.57 NCI_1 0.84 0.91 0.77 NCI_2 0.76 0.83 0.64 NCSU 0.75 0.86 0.70 NCTR_1 0.84 0.89 0.78 0.84 0.89 0.82 NCTR_2 0.66 ILS_NICEATM 0.79 NRMRL_1 0.82 0.86 0.83 NRMRL_2 0.79 0.88 0.80 RIFM 0.69 SWETOX_1 0.85 0.88 0.85 SWETOX_2 0.86 0.88 0.81 TARTU_1 0.85 0.88 0.78 TARTU_2 0.84 0.89 0.82 TUM 0.66 0.88 0.75 UFG 0.73 0.89 0.76 UMEA 0.76 0.88 0.92 0.77 UNC 0.73 0.82 UNIBA 0.80 0.84 0.78 0.68 UNIMIB_1 0.68 0.51 UNIMIB_2 0.85 UNISTRA 0.73 0.79 0.77 0.88 0.67 VCCLAB 0.80 0.87 0.69 0.86 0.88 0.76 CERAPP participants models CoMPARA participants models Categorical Continuous Total Categorical Continuous Total Binding 21 3 24 32 5 37 Agonist 11 3 14 20 5 25 Antagonist 8 2 10 22 3 25 Total 40 8 48 74 13 87 ER validation data AR validation data Active Inactive Total Active Inactive Total Binding 1982 5301 7283 453 3429 3882 Agonist 350 5969 6319 167 4672 4839 Antagonist 284 6255 6539 355 3685 4040 Total 2017 7024 7522 487 4928 5273 ER training data AR training data Active Inactive Total Active Inactive Total Binding 237 1440 1677 198 1464 1662 Agonist 219 1458 1677 43 1616 1659 Antagonist 41 1636 1677 159 1366 1525 Total 237 1440 1677 198 1648 1688 Concordance of binding models on the active compounds of the prediction sets. High general concordance across all binding models included in the consensus. Distributions of the number of the predicted chemical structures by all binding models. CERAPP CoMPARA Active Inactive Active Inactive Binding 4001 28463 8202 40656 Agonist 2475 29989 1764 47094 Antagonist 2793 29671 9899 38959 Total 4001 28463 10623 47613 ToxCast metabolites Active Inactive Binding 1609 4983 Agonist 428 6164 Antagonist 1820 4772 Total 1989 6325 Binding Agonist Antagonist Training Validation Training Validation Training Validation Sn 0.99 0.69 0.95 0.74 1.00 0.61 Sp 0.91 0.87 0.98 0.97 0.95 0.87 BA 0.95 0.78 0.97 0.86 0.97 0.74 CERAPP consensus validation CoMPARA consensus validation Consensus models The lists of chemicals prioritized for ER and AR activity: • CERAPP list: 32,464 unique QSAR-ready structures (standardized, organic, no mixtures…) –EDSP Universe (10k) –Chemicals with known use (40k) (CPCat & ACToR) –Canadian Domestic Substances List (DSL) (23k) –EPA DSSTox (version 1)– structures of EPA/FDA interest (15k) –ToxCast and Tox21 (In vitro ER data) (8k) • CoMPARA: 55,450 unique QSAR-ready structures –CERAPP list (32k) – EINECS European Inventory (60k) –ToxCast metabolites (6k) 757 chemicals have >75% positive concordance Most models predict most chemicals as ER non-binders 1772 chemicals have >75% positive concordance Most models predict most chemicals as AR non-binders These consensus models predictions where model concordance is higher than 85% are being implemented into the free and open-source application OPERA in order to prioritize additional chemicals. CERAPP models CoMPARA models CoMPARA predictions The score formula: 𝑆 = 0.3 ∗ 𝑔𝑜𝑜𝑑𝑛𝑒𝑠𝑠 𝑜𝑓 𝑓𝑖𝑡 + 0.45 ∗ 𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑖𝑣𝑖𝑡𝑦 + 0.25 ∗ (𝑟𝑜𝑏𝑢𝑠𝑡𝑛𝑒𝑠𝑠). Where the goodness of fit (training set) and the predictivity (validation set) are functions of the balanced accuracy (BA) and the balance between the specificity (Sp) and Sensitivity (Sn): 0.7 ∗ 𝐵𝐴 + 0.3 ∗ (1 − 𝑆𝑛 − 𝑆𝑝 ). Robustness of the model is the balance between training and validation set statistics: 1 − 𝐵𝐴 𝑇𝑟 − 𝐵𝐴 𝑡𝑒𝑠𝑡 .