Alexandre Borrel, PhD
Postdoctoral Research Fellow,
National Institute of Environmental Health Sciences,
RTP, North Carolina, USA
Exploring the Chemical Universe
using www.ChemMaps.com
@AlBorrel
ORCID: 0000-0001-6499-4540
National Institutes of Health
U.S. Department of Health and Human Services
More than 1 × 10^60 accessible molecules (1, 2, 3)
(1) Hann, M.M., and Oprea, T.I. (2004). Curr. Opin. Chem. Biol. 8: 255–263.
(2) Ursu, O., Rayan, A., Goldblum, A., and Oprea, T.I. (2011). Rev. Comput. Mol. Sci. 1: 760–781.
(3) Drew, K. L. M., Baiman, H., Khwaounjoo, P., Yu, B., & Reynisson, J. (2012). Journal of Pharmacy and Pharmacology, 64(4), 490–495.
Introduction
Chemical space: “…‘Chemical space’ is a term often used in place of ‘multi-dimensional descriptor space’: it is a region defined by a particular choice of descriptors…”
Dobson CM (2004) Nature 432:824–828
Chemical space
Lipinski C, Hopkins A (2004) Nature 432:855–861.
Chemical space
• Locate chemical of interest
• Optimization, define analogue, replacement, …
• Investigate ADME/Tox properties
• Drug repurposing
• Define, visualize domains
Investigate new area
Efficient navigation tool
Google Maps
Google Maps approach
• Interactive
• Easy to use
• Informative
• Responsive
• ….
Chemical space
DrugMap: compounds
~8,000 drug entries (release 12-2018):
• ~2,500 FDA-approved small molecule drugs
• Over 5,000 experimental drugs.
https://www.drugbank.ca/
DrugMap: descriptors
1D descriptors: chemical formula (C23H34O5)
• Molecular weight
• Count of atoms
• …
2D descriptors: connectivity
• Pharmacophore based
• …
3D descriptors: spatial coordinates (LigPrep to generate 3D)
• Volume
• Surface
• …
Descriptor selection:
• Remove null variance
• Pearson’s correlation coefficient < 0.9
https://www.drugbank.ca/
RDKit: http://www.rdkit.org/
PaDEL: http://www.yapcwsoft.com/dd/padeldescriptor/
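The two descriptor-selection rules can be sketched in a few lines of NumPy. This is an illustration only, not the ChemMaps code: the function name `select_descriptors`, the greedy keep-first strategy, and the use of the absolute value of the correlation are assumptions; the slide gives only the two rules and the 0.9 threshold.

```python
import numpy as np

def select_descriptors(X, max_abs_corr=0.9):
    """Return the column indices kept after the two filters (hypothetical helper)."""
    X = np.asarray(X, dtype=float)
    # Rule 1: remove descriptors with null (zero) variance.
    nonconstant = np.flatnonzero(X.var(axis=0) > 0)
    if nonconstant.size == 0:
        return []
    # Rule 2: keep a descriptor only if its absolute Pearson correlation with
    # every descriptor already kept stays below the threshold (assumed greedy pass).
    corr = np.atleast_2d(np.abs(np.corrcoef(X[:, nonconstant], rowvar=False)))
    kept = []
    for j in range(nonconstant.size):
        if all(corr[j, k] < max_abs_corr for k in kept):
            kept.append(j)
    return nonconstant[kept].tolist()

# Toy matrix: column 1 duplicates column 0 (scaled), column 2 is constant.
X_demo = [[1, 2, 5, 1],
          [2, 4, 5, 0],
          [3, 6, 5, 1],
          [4, 8, 5, 0]]
print(select_descriptors(X_demo))
```

Which descriptor of a correlated pair is dropped depends on column order in this sketch; a real pipeline may use a different tie-breaking rule.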
DrugMap: descriptors space
Samples (compounds) in rows, variables (descriptors) in columns:
        X1         X2         ...   X238
1       X1,1       X1,2       ...   X1,238
2       X2,1       X2,2       ...   X2,238
...     ...        ...        ...   ...
8550    X8550,1    X8550,2    ...   X8550,238
116 1D/2D descriptors
122 3D descriptors
DrugMap: descriptors space
Multiple PCA
• x, y = 116 1D-2D descriptors
• z = 70 3D descriptors
PC1 = 14%; PC2 = 9%; PC3 = 26%
• Sufficient coverage of variance
• Understandable
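One reading of the "multiple PCA" layout is two independent PCAs: the first two components of the 1D-2D block give x and y, and the first component of the 3D block gives z. The sketch below follows that reading. The function names, the autoscaling step, and the SVD-based PCA are assumptions made for illustration, not details confirmed by the slides.

```python
import numpy as np

def pca_scores(X, n_components):
    """Scores and explained-variance ratios of the leading PCs (PCA via SVD)."""
    X = np.asarray(X, dtype=float)
    centered = X - X.mean(axis=0)
    std = centered.std(axis=0)
    std[std == 0] = 1.0              # avoid dividing by zero for constant columns
    Z = centered / std               # autoscaling is an assumption of this sketch
    _, S, Vt = np.linalg.svd(Z, full_matrices=False)
    explained = S**2 / np.sum(S**2)
    return Z @ Vt[:n_components].T, explained[:n_components]

def multiple_pca_coordinates(X_1d2d, X_3d):
    """x, y from the 1D-2D descriptor block; z from the 3D descriptor block."""
    xy, var_xy = pca_scores(X_1d2d, 2)
    z, var_z = pca_scores(X_3d, 1)
    return np.hstack([xy, z]), np.concatenate([var_xy, var_z])

# Random stand-in data: 50 compounds, 6 1D-2D descriptors, 4 3D descriptors.
rng = np.random.default_rng(0)
coords, variance = multiple_pca_coordinates(rng.normal(size=(50, 6)),
                                            rng.normal(size=(50, 4)))
print(coords.shape, variance.round(2))
```

Because z comes from a separate PCA, its explained variance is relative to the 3D block only, which would be consistent with a third axis that covers more variance than the second.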
Environmental Chemical Space
~48,000 chemicals with 3D descriptors
Informed by regulatory lists*:
• Endocrine Disruptor Screening Program
• Toxic Substances Control Act Inventory
• Canadian Domestic Substances List
• Swedish Chemicals Agency
~12,000 chemicals with acute systemic toxicity data
• Rat oral LD50 values
• GHS/EPA classifications
*not inclusive
https://comptox.epa.gov/dashboard/
www.chemmaps.com
version 1.0
ACS New Orleans (March 2018)
www.ChemMaps.com
~30,000 unique users since March 2018 (ACS - New Orleans)
~1,000 new users each month (~ 8,000 in June 2018)
@SpaceChemMaps
Challenges for version 2
• Extended universe: Distributed Structure-Searchable Toxicity (DSSTox) Database (EPA CompTox)
• > 800,000 chemicals (chemical infrastructure for EPA’s Safer Chemicals Research, including the ToxCast and Tox21 high-throughput toxicology efforts)
https://www.epa.gov
ChemMaps v2 work-in-progress
• Extended universe: Distributed Structure-Searchable Toxicity
(DSSTox) Database (EPA comptox dashboard)
• > 800,000 chemicals
• chemical infrastructure for EPA’s Safer Chemicals Research,
including ToxCast and Tox21 high-throughput toxicology efforts
• OPERA model predictions
https://comptox.epa.gov/dashboard
• More interactive
• Users can input their data
• Customizable
• Accessibility
Live demo (video)
Future Vision
• Select option on the map
• Compute distances between several chemicals using various metrics
• Project a map on the fly
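As an illustration of distances under "various metrics", the sketch below builds a pairwise distance matrix from map coordinates. The slides do not say which metrics are planned; the three shown here (Euclidean, Manhattan, Chebyshev) and the function name `pairwise_distances` are assumptions.

```python
import numpy as np

def pairwise_distances(coords, metric="euclidean"):
    """Distance matrix between chemicals given their map coordinates."""
    P = np.asarray(coords, dtype=float)
    # Absolute coordinate differences for every pair, shape (n, n, dims).
    diff = np.abs(P[:, None, :] - P[None, :, :])
    if metric == "euclidean":
        return np.sqrt((diff**2).sum(axis=-1))
    if metric == "manhattan":
        return diff.sum(axis=-1)
    if metric == "chebyshev":
        return diff.max(axis=-1)
    raise ValueError(f"unknown metric: {metric}")

# Three hypothetical chemicals placed on a 3D map.
points = [[0, 0, 0], [3, 4, 0], [0, 0, 2]]
for name in ("euclidean", "manhattan", "chebyshev"):
    print(name, pairwise_distances(points, name)[0, 1])
```

The resulting matrix is symmetric with a zero diagonal, so it can be exported directly as the distance matrix of a selected area.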
Conclusions
• www.chemmaps.com (version 2)
• Project the whole DSSTox database
• Update DrugBank (release 12-2018)
• Customize the map
• Upload chemicals on the fly
• Accessibility
Requirements
• Multiplatform (phone, tablet, computer)
• Firefox >59, Chrome >65, Safari >5 (WebGL technology)
• 1 GB of GPU memory
Beta version on the NIEHS network
Release planned for the end of April 2019
@SpaceChemMaps
Fourches’ lab
Dr. Denis Fourches
NIEHS
Dr. Nicole Kleinstreuer
Office of Data Science
Dr. Kamel Mansouri
(contractor, ILS)
www.chemmaps.com
@SpaceChemMaps
Annexes
EnvMap: projections
• Principal component analysis: 216 descriptors; PC1 = 14%, PC2 = 11%, PC3 = 9%
• Independent component analysis
DrugMap: projections
• Principal component analysis: 186 descriptors; PC1 = 16%, PC2 = 11%, PC3 = 8%
• Independent component analysis: 186 descriptors
• Multidimensional scaling (3D): 186 descriptors, Euclidean distance
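To make the multidimensional-scaling option concrete, here is a minimal sketch of classical (Torgerson) MDS on Euclidean distances, embedding compounds in 3D. The slides do not state which MDS variant was used, so the classical algorithm and the function name `classical_mds` are assumptions.

```python
import numpy as np

def classical_mds(X, n_components=3):
    """Embed rows of X in n_components dimensions from Euclidean distances."""
    X = np.asarray(X, dtype=float)
    # Squared Euclidean distance matrix between all compounds.
    sq = (X**2).sum(axis=1)
    D2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    # Double centering turns squared distances into a Gram matrix.
    n = D2.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * J @ D2 @ J
    # Leading eigenvectors, scaled by sqrt(eigenvalue), give the coordinates.
    eigval, eigvec = np.linalg.eigh(B)
    order = np.argsort(eigval)[::-1][:n_components]
    return eigvec[:, order] * np.sqrt(np.clip(eigval[order], 0.0, None))

# Stand-in data: 20 compounds whose 6 descriptors span only 3 dimensions,
# so a 3D embedding can reproduce the original distances.
rng = np.random.default_rng(0)
X_demo_mds = np.hstack([rng.normal(size=(20, 3)), np.zeros((20, 3))])
Y_demo_mds = classical_mds(X_demo_mds, 3)
print(Y_demo_mds.shape)
```

With real descriptor matrices the 3D embedding only approximates the original distances; the quality of that approximation is what separates MDS from the PCA and ICA projections listed above.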
DrugMap: compounds
MolVS (RDKit):
- SMILES standardization, normalize
- Remove salts
- Remove hydrogen
- Remove fragments (mixture)
8,752 SMILES
8,551 canonical SMILES
C[S@@](=O)CC[C@H](N)C(O)=O (DB02235)
C[S+]([O-])CCC(N)C(=O)O (DB02165)
[Zn2+] …
MolVS: https://molvs.readthedocs.io/en/latest/
Fourches, D. et al. (2016). J. Chem. Inf. Model., 56, 1243–1252.
DrugMap: descriptors space
1D descriptors: chemical formula (C23H34O5)
• Molecular weight
• Count of atoms
• …
2D descriptors: connectivity
• Pharmacophore based
• …
1D/2D RDKit descriptors (648)
https://www.drugbank.ca/
RDKit: http://www.rdkit.org/
DrugMap: 3D generation
3D generation:
• Riniker and Landrum (RDKit)
• SMILES → SDF
RDKit: http://www.rdkit.org/
https://www.schrodinger.com/ligprep
Riniker, S.; Landrum, G. A. J. Chem. Inf. Model. 55:2562–2574 (2015)
DrugMap: 3D descriptors
Cao,D.-S. et al. (2013) J. Chem. Inf. Model., 53, 3086–3096
3D descriptors:
spatial coordinates
• Volume
• Surface
• Charge distribution
• …
3D PyDPI descriptors (420)
DrugMap: descriptors space
Samples (compounds) in rows, variables (descriptors) in columns:
        X1         X2         ...   X1068
1       X1,1       X1,2       ...   X1,1068
2       X2,1       X2,2       ...   X2,1068
...     ...        ...        ...   ...
8550    X8550,1    X8550,2    ...   X8550,1068
Descriptor selection:
• Remove null variance
• Pairwise Pearson’s correlation coefficient < 0.9
Projection
Multiple PCA
• x, y = 138 1D-2D descriptors
• z = 78 3D descriptors
Coverage of variance:
• PC1: 13%
• PC2: 9%
• PC3: 24%
Future Vision: Environmental Maps
• Define and project several domains
• Add entire DSSTox Inventory (>700,000 chemicals)
• Incorporate diverse biological datasets (e.g. ToxRefDB, HTT)
Future Vision: map on the fly
• Define new map on the fly
• Chemical databases
• Precomputed coordinates
• Local version
Future Vision: Virtual reality
Future Vision: Navigation
• Select and redefine part of the chemical map on the fly
• Compute distances between several chemicals using various metrics
• Project new chemical lists on map
• Add your chemicals/data/model predictions on the map
• Download area and matrix of distance

More Related Content

What's hot (7)

Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
Applying Royal Society of Chemistry cheminformatics skills to support the Pha...
 
PubChem LCSS
PubChem LCSSPubChem LCSS
PubChem LCSS
 
CINF 55: SureChEMBL: An open patent chemistry resource
CINF 55: SureChEMBL: An open patent chemistry resourceCINF 55: SureChEMBL: An open patent chemistry resource
CINF 55: SureChEMBL: An open patent chemistry resource
 
!Coughlin at GMA Science Forum - Proposition 65_April 2017
!Coughlin at GMA Science Forum - Proposition 65_April 2017!Coughlin at GMA Science Forum - Proposition 65_April 2017
!Coughlin at GMA Science Forum - Proposition 65_April 2017
 
SureChEMBL and Open PHACTS
SureChEMBL and Open PHACTSSureChEMBL and Open PHACTS
SureChEMBL and Open PHACTS
 
Sourcing high quality online data resources for computational toxicology
Sourcing high quality online data resources for computational toxicologySourcing high quality online data resources for computational toxicology
Sourcing high quality online data resources for computational toxicology
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
 

Similar to ChemMaps version 2, ACS Orlando 2019

EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...Kamel Mansouri
 
Toxicological information in PubChem
Toxicological information in PubChemToxicological information in PubChem
Toxicological information in PubChemSunghwan Kim
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityKamel Mansouri
 
Health Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven EdwardsHealth Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven EdwardsHealth Data Consortium
 

Similar to ChemMaps version 2, ACS Orlando 2019 (20)

EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
 
Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...
 
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
 
Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
 
New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Toxicological information in PubChem
Toxicological information in PubChemToxicological information in PubChem
Toxicological information in PubChem
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
 
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
 
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
The US-EPA CompTox Chemicals Dashboard – an online data integration hub suppo...
 
Animal Testing - Science or Tradition
Animal Testing - Science or TraditionAnimal Testing - Science or Tradition
Animal Testing - Science or Tradition
 
Health Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven EdwardsHealth Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven Edwards
 
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
 
US-EPA CompTox Chemicals Dashboard: Bioactivity Data for Endocrine Assays
US-EPA CompTox Chemicals Dashboard: Bioactivity Data for Endocrine AssaysUS-EPA CompTox Chemicals Dashboard: Bioactivity Data for Endocrine Assays
US-EPA CompTox Chemicals Dashboard: Bioactivity Data for Endocrine Assays
 

Recently uploaded

SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...Lokesh Kothari
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 

Recently uploaded (20)

SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 

ChemMaps version 2, ACS Orlando 2019

  • 1. Alexandre Borrel, PhD Postdoctoral Research Fellow, National Institute of Environment Health Sciences, RTP, North Carolina, USA Exploring the Chemical Universe using www.ChemMaps.com @AlBorrel0000-0001-6499-4540
  • 2. National Institutes of Health U.S. Department of Health and Human Services2 More than 1.1060 accessible molecules 1,2,3 (1) Hann, M.M., and Oprea, T.I. (2004). Curr. Opin. Chem. Biol. 8: 255–263. (2) Ursu, O., Rayan, A., Goldblum, A., and Oprea, T.I. (2011). Rev. Comput. Mol. Sci. 1: 760–781. (3) Drew, K. L. M., Baiman, H., Khwaounjoo, P., Yu, B., & Reynisson, J. (2012). Journal of Pharmacy and Pharmacology, 64(4), 490–495. Introduction
  • 3. National Institutes of Health U.S. Department of Health and Human Services3 More than 1.1060 accessible molecules 1,2,3 (1) Hann, M.M., and Oprea, T.I. (2004). Curr. Opin. Chem. Biol. 8: 255–263. (2) Ursu, O., Rayan, A., Goldblum, A., and Oprea, T.I. (2011). Rev. Comput. Mol. Sci. 1: 760–781. (3) Drew, K. L. M., Baiman, H., Khwaounjoo, P., Yu, B., & Reynisson, J. (2012). Journal of Pharmacy and Pharmacology, 64(4), 490–495. Introduction
  • 4. National Institutes of Health U.S. Department of Health and Human Services4 Chemical space: “…Chemical space’ is a term often used in place of ‘multi- dimensional descriptor space’: it is a region defined by a particular choice of descriptors…” Dobson CM (2004) Nature 432:824–828 Chemical space
  • 5. National Institutes of Health U.S. Department of Health and Human Services5 Lipinski C, Hopkins A (2004) Nature 432:855–861. Chemical space
  • 7. Locate chemical of interest
  • 8. Locate chemical of interest. Optimization, define analogue, replacement, …
  • 9. Locate chemical of interest. Optimization, define analogue, replacement, … Investigate ADME/Tox properties.
  • 10. Locate chemical of interest. Optimization, define analogue, replacement, … Investigate ADME/Tox properties. Drug repurposing.
  • 11. Locate chemical of interest. Optimization, define analogue, replacement, … Investigate ADME/Tox properties. Drug repurposing. Define, visualize domains.
  • 12. Investigate new area
  • 13. Efficient navigation tool
  • 15. Chemical space: Google Maps approach. Interactive; easy to use; informative; responsive; …
  • 16. DrugMap: compounds. ~8,000 drug entries (release 12-2018): ~2,500 FDA-approved small-molecule drugs; over 5,000 experimental drugs. https://www.drugbank.ca/
  • 17. DrugMap: descriptors. 1D descriptors (chemical formula, e.g. C23H34O5): molecular weight, count of atoms, … 2D descriptors (connectivity): pharmacophore based, … 3D descriptors (spatial coordinates; LigPrep to generate 3D): volume, surface, … Descriptor selection: remove null variance; Pearson’s correlation coefficient < 0.9. https://www.drugbank.ca/ RDKit: http://www.rdkit.org/ PaDEL: http://www.yapcwsoft.com/dd/padeldescriptor/
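The descriptor selection named on slide 17 (drop descriptors with null variance, then keep only descriptors whose pairwise Pearson correlation stays below 0.9) can be sketched as follows. This is a minimal pure-Python illustration, not the ChemMaps pipeline: the greedy keep-first strategy, the function names `pearson` and `select_descriptors`, and the toy descriptor table are all assumptions made for the example.

```python
from math import sqrt
from statistics import pvariance


def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length columns."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def select_descriptors(table, threshold=0.9):
    """table maps descriptor name -> list of values, one per compound.

    Returns the names kept after both filters, in their original order.
    """
    # Step 1: remove descriptors with null variance (constant columns).
    varying = [name for name, col in table.items() if pvariance(col) > 0]
    # Step 2 (assumed greedy rule): keep a descriptor only if its absolute
    # correlation with every descriptor already kept is below the threshold.
    kept = []
    for name in varying:
        if all(abs(pearson(table[name], table[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical toy table: four compounds, four descriptors.
descriptors = {
    "mol_weight": [180.2, 151.2, 206.3, 194.2],
    "heavy_atoms": [13, 11, 15, 14],   # almost collinear with mol_weight
    "n_rings": [1, 1, 1, 1],           # constant, so null variance
    "logp": [1.2, 0.5, 3.5, -0.1],
}
selected = select_descriptors(descriptors)
```

With a greedy rule like this one, which member of a correlated pair survives depends on column order, so a real pipeline may prefer a different tie-break.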
  • 18. DrugMap: descriptor space. Matrix of 8,550 samples (compounds, rows 1 … 8550) by 238 variables (descriptors, columns X1 … X238), with entry Xi,j for compound i and descriptor j. 116 1D/2D descriptors; 122 3D descriptors.
  • 19. DrugMap: descriptor space. Multiple PCA: x, y = 116 1D-2D descriptors; z = 70 3D descriptors. PC1 = 14%; PC2 = 9%; PC3 = 26%. Sufficient coverage of variance; understandable.
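The "multiple PCA" layout on slide 19 (x and y from the 1D-2D descriptors, z from the 3D descriptors) can be sketched as two separate PCAs whose leading components are combined into one 3D point per compound. The code below is a hedged pure-Python sketch under stated assumptions: eigenvectors are found by power iteration with deflation, descriptor scaling is omitted, the variance shares are computed within each descriptor block, and the toy matrices are hypothetical. It is not the ChemMaps implementation.

```python
from math import sqrt


def center(rows):
    """Subtract each column mean so every descriptor has mean zero."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    return [[r[j] - means[j] for j in range(p)] for r in rows]


def covariance(centered):
    """Sample covariance matrix of already-centered rows."""
    n, p = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(p)]
            for i in range(p)]


def matvec(m, v):
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def top_components(cov, k, iters=500):
    """Leading k (eigenvalue, eigenvector) pairs via power iteration.

    Assumes the start vector is not orthogonal to the wanted eigenvector.
    """
    p = len(cov)
    mat = [row[:] for row in cov]
    pairs = []
    for _ in range(k):
        v = [1.0 / sqrt(p)] * p
        for _ in range(iters):
            w = matvec(mat, v)
            norm = sqrt(sum(x * x for x in w))
            if norm == 0.0:
                break
            v = [x / norm for x in w]
        lam = sum(a * b for a, b in zip(v, matvec(mat, v)))
        pairs.append((lam, v))
        # Deflation: remove the component just found before the next pass.
        mat = [[mat[i][j] - lam * v[i] * v[j] for j in range(p)]
               for i in range(p)]
    return pairs


def pca_project(rows, k):
    """Coordinates on the first k components and their variance shares."""
    centered = center(rows)
    cov = covariance(centered)
    total = sum(cov[i][i] for i in range(len(cov)))
    pairs = top_components(cov, k)
    coords = [[sum(a * b for a, b in zip(r, v)) for _, v in pairs]
              for r in centered]
    return coords, [lam / total for lam, _ in pairs]


def multiple_pca(desc_1d2d, desc_3d):
    """x, y from the 1D-2D block; z from the 3D block."""
    xy, ratio_xy = pca_project(desc_1d2d, 2)
    z, ratio_z = pca_project(desc_3d, 1)
    points = [(a[0], a[1], b[0]) for a, b in zip(xy, z)]
    return points, ratio_xy + ratio_z


# Hypothetical toy data: 5 compounds, 3 1D-2D descriptors, 2 3D descriptors.
block_1d2d = [[1.0, 2.0, 0.5], [2.0, 4.1, 0.4], [3.0, 5.9, 0.7],
              [4.0, 8.2, 0.3], [5.0, 9.8, 0.6]]
block_3d = [[10.0, 1.0], [12.0, 1.5], [9.0, 0.8], [15.0, 2.1], [11.0, 1.1]]
points, ratios = multiple_pca(block_1d2d, block_3d)
```

Because the z axis comes from its own PCA in this sketch, its variance share is relative to the 3D block only, which would allow a third percentage larger than the first two.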
  • 20. Environmental Chemical Space. ~48,000 chemicals with 3D descriptors, informed by regulatory lists*: Endocrine Disruptor Screening Program; Toxic Substances Control Act Inventory; Canadian Domestic Substances List; Swedish Chemicals Agency. ~12,000 chemicals with acute systemic toxicity data: rat oral LD50 values; GHS/EPA classifications. *not inclusive. https://comptox.epa.gov/dashboard/
  • 21. www.chemmaps.com version 1.0 ACS New Orleans (March 2018)
  • 22. www.ChemMaps.com: ~30,000 unique users since March 2018 (ACS New Orleans); ~1,000 new users each month (~8,000 in June 2018). @SpaceChemMaps
  • 23. Challenges for version 2. Extended universe: Distributed Structure-Searchable Toxicity (DSSTox) Database (EPA CompTox); > 800,000 chemicals (chemical infrastructure for EPA’s Safer Chemicals Research, including the ToxCast and Tox21 high-throughput toxicology efforts). https://www.epa.gov
  • 24. ChemMaps v2 work-in-progress. Extended universe: Distributed Structure-Searchable Toxicity (DSSTox) Database (EPA CompTox dashboard); > 800,000 chemicals; chemical infrastructure for EPA’s Safer Chemicals Research, including ToxCast and Tox21 high-throughput toxicology efforts; OPERA model predictions. https://comptox.epa.gov/dashboard. More interactive: users can input their data; customizable; accessibility.
  • 25. Live demo (video)
  • 26. Future Vision. Project a whole map on the fly.
  • 27. Future Vision. Select option on the map; project a map on the fly.
  • 28. Future Vision. Select option on the map; compute distances between several chemicals using various metrics; project a map on the fly.
  • 29. Conclusions. www.chemmaps.com (version 2): project the whole DSSTox database; update DrugBank (release 12-2018); customize the map; upload chemical on the fly; accessibility.
  • 30. Conclusions. www.chemmaps.com (version 2): project the whole DSSTox database; update DrugBank (release 12-2018); customize the map; upload chemical on the fly; accessibility. Requirements: multiplatform (phone, tablet, computer); Firefox >59, Chrome >65, Safari >5 (WebGL technology); 1 GB of GPU memory.
  • 31. Conclusions. www.chemmaps.com (version 2): project the whole DSSTox database; update DrugBank (release 12-2018); customize the map; upload chemical on the fly; accessibility. Requirements: multiplatform (phone, tablet, computer); Firefox >59, Chrome >65, Safari >5 (WebGL technology); 1 GB of GPU memory. Beta version on the NIEHS network; release for the end of April 2019. @SpaceChemMaps
  • 32. Fourches’ lab: Dr. Denis Fourches. NIEHS: Dr. Nicole Kleinstreuer, Office of Data Science, Dr. Kamel Mansouri (contractor, ILS)
  • 34. Annexes
  • 35. EnvMap: projections. Principal component analysis, 216 descriptors: PC1 = 14%, PC2 = 11%, PC3 = 9%. Independent component analysis.
  • 36. DrugMap: projections. Principal component analysis, 186 descriptors: PC1 = 16%, PC2 = 11%, PC3 = 8%. Independent component analysis, 186 descriptors. Multidimensional scaling (3D), 186 descriptors, Euclidean distance.
  • 37. DrugMap: compounds. Input: 8,752 SMILES. MolVS (RDKit): SMILES standardization, normalize; remove salts; remove hydrogen; remove fragments (mixture). Output: 8,551 canonical SMILES. MolVS: https://molvs.readthedocs.io/en/latest/ Fourches, D. et al. (2016). J. Chem. Inf. Model., 56, 1243–1252.
  • 38. DrugMap: compounds. Input: 8,752 SMILES. MolVS (RDKit): SMILES standardization, normalize; remove salts; remove hydrogen; remove fragments (mixture). Output: 8,551 canonical SMILES. Examples: C[S@@](=O)CC[C@H](N)C(O)=O DB02235 C[S+]([O-])CCC(N)C(=O)O DB02165 [Zn2+] … MolVS: https://molvs.readthedocs.io/en/latest/ Fourches, D. et al. (2016). J. Chem. Inf. Model., 56, 1243–1252.
  • 39. DrugMap: descriptor space. 1D descriptors (chemical formula, e.g. C23H34O5): molecular weight, count of atoms, … 2D descriptors (connectivity): pharmacophore based, … 1D/2D RDKit descriptors (648). https://www.drugbank.ca/ RDKit: http://www.rdkit.org/
  • 40. DrugMap: 3D generation. SMILES to SDF. 3D generation: Riniker and Landrum (RDKit). Riniker, S.; Landrum, G. A. J. Chem. Inf. Model. 55:2562–74 (2015). RDKit: http://www.rdkit.org/ https://www.schrodinger.com/ligprep
  • 41. DrugMap: 3D descriptors. 3D descriptors (spatial coordinates): volume, surface, charge distribution, … 3D PyDPI descriptors (420). Cao, D.-S. et al. (2013) J. Chem. Inf. Model., 53, 3086–3096
  • 42. DrugMap: descriptor space. Matrix of 8,550 samples (compounds, rows 1 … 8550) by 1,068 variables (descriptors, columns X1 … X1068), with entry Xi,j for compound i and descriptor j.
  • 43. DrugMap: descriptor space. Matrix of 8,550 samples (compounds, rows 1 … 8550) by 1,068 variables (descriptors, columns X1 … X1068), with entry Xi,j for compound i and descriptor j. Descriptor selection: remove null variance; pairwise Pearson’s correlation coefficient < 0.9.
  • 44. Projection. Multiple PCA: x, y = 138 1D-2D descriptors; z = 78 3D descriptors. Coverage of variance: PC1: 13%, PC2: 9%, PC3: 24%.
  • 45. Future Vision: Environmental Maps
  • 46. Future Vision: Environmental Maps. Define and project several domains.
  • 47. Future Vision: Environmental Maps. Define and project several domains; add entire DSSTox Inventory (>700,000 chemicals).
  • 48. Future Vision: Environmental Maps. Define and project several domains; add entire DSSTox Inventory (>700,000 chemicals); incorporate diverse biological datasets (e.g. ToxRefDB, HTT).
  • 49. Future Vision: map on the fly. Define new map on the fly; chemical databases; precomputed coordinates; local version.
  • 50. Future Vision: Virtual reality
  • 51. Future Vision: Navigation
  • 52. Future Vision: Navigation. Project new chemical lists on map; add your chemicals/data/model predictions on the map.
  • 53. Future Vision: Navigation. Select and redefine part of the chemical map on the fly; project new chemical lists on map; add your chemicals/data/model predictions on the map.
  • 54. Future Vision: Navigation. Select and redefine part of the chemical map on the fly; compute distances between several chemicals using various metrics; project new chemical lists on map; add your chemicals/data/model predictions on the map; download area and matrix of distance.
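The navigation items "compute distances between several chemicals using various metrics" and "download ... matrix of distance" could look roughly like the sketch below. It is an illustration only: the three metrics, the function names, the CSV layout, and the map coordinates for `chem_A`, `chem_B` and `chem_C` are assumptions, since the slides do not say which metrics or export format ChemMaps uses.

```python
import csv
import io
from math import sqrt


def euclidean(a, b):
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine(a, b):
    """Cosine distance (1 - cosine similarity); undefined for zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


METRICS = {"euclidean": euclidean, "manhattan": manhattan, "cosine": cosine}


def distance_matrix(coords, metric="euclidean"):
    """coords maps chemical name -> (x, y, z) position on the map."""
    dist = METRICS[metric]
    names = list(coords)
    matrix = [[dist(coords[a], coords[b]) for b in names] for a in names]
    return names, matrix


def matrix_to_csv(names, matrix):
    """Serialize the matrix as CSV text, e.g. for a download button."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["chemical"] + names)
    for name, row in zip(names, matrix):
        writer.writerow([name] + [f"{d:.3f}" for d in row])
    return buffer.getvalue()


# Hypothetical map coordinates for three selected chemicals.
coords = {
    "chem_A": (1.0, 0.0, 0.0),
    "chem_B": (4.0, 4.0, 0.0),
    "chem_C": (1.0, 1.0, 1.0),
}
names, euclid = distance_matrix(coords, "euclidean")
_, manhat = distance_matrix(coords, "manhattan")
csv_text = matrix_to_csv(names, euclid)
```

Keeping the metrics in a dictionary lets a user interface offer them as a simple drop-down choice; distances computed on projected coordinates will generally differ from distances in the full descriptor space.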