EPA CompTox Chemicals
Dashboard
A chemical integration hub
Antony Williams
US-EPA National Center for Computational Toxicology
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of
the U.S. EPA
The CompTox Chemicals Dashboard
https://comptox.epa.gov/dashboard
• Integration hub for >875,000 chemical
substances
• Managing the complex nature of substances and
identifiers – synonyms, CAS Registry Numbers
• Each substance has a unique ID mapped to
structure (when available), CASRN(s), multiple
synonyms
• Aggregated sets of chemical lists of interest
• UVCB substances - unknown or variable
composition, complex reaction products or
biological materials – includes mappings to
CASRN or Name Search for 875,000
chemicals
Type-ahead search for millions of
identifiers including systematic
nomenclature, synonyms and CAS
Registry Numbers
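A type-ahead search of this kind can be sketched as a prefix lookup over a sorted list of identifiers. This is a minimal illustration only, not the Dashboard's implementation: the function names `build_index` and `type_ahead` are hypothetical, and the sample identifiers are synonyms taken from the ergocalciferol slide in this deck.

```python
from bisect import bisect_left

def build_index(identifiers):
    """Sort identifiers by their case-folded form so prefixes are contiguous."""
    return sorted((name.casefold(), name) for name in identifiers)

def type_ahead(index, prefix, limit=10):
    """Return up to `limit` identifiers that start with `prefix` (case-insensitive)."""
    p = prefix.casefold()
    # The 1-tuple (p,) sorts before every (key, name) pair whose key starts with p.
    start = bisect_left(index, (p,))
    matches = []
    for key, name in index[start:]:
        if not key.startswith(p) or len(matches) == limit:
            break
        matches.append(name)
    return matches

# Toy data: a few ergocalciferol synonyms listed later in this deck.
SYNONYMS = ["Oleovitamin D", "Oleovitamin D2", "Ostelin", "Radiostol",
            "Radsterin", "Viosterol", "Vitavel-D"]
INDEX = build_index(SYNONYMS)

print(type_ahead(INDEX, "rad"))   # ['Radiostol', 'Radsterin']
print(type_ahead(INDEX, "oleo"))  # ['Oleovitamin D', 'Oleovitamin D2']
```

A sorted list with binary search keeps the sketch short; a production system indexing millions of identifiers would more likely rely on a database or search-engine index.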
Example Record - Ergocalciferol
Ergocalciferol – identifiers include names and CAS Registry Numbers
Total of 112 identifiers for this chemical
Systematic names are complex and rules-based
Synonyms are assigned simply by declaration
• D-Tracetten
• Ercalciol
• Ergorone
• Fortodyl
• Geltabs
• Hi-Deratol
• Metadee
• Mulsiferol
• Mykostin
• NSC 62792
• Oleovitamin D
• Oleovitamin D2
• Ostelin
• Radiostol
• Radsterin
• Rodine C
• Shock-ferol
• Sterogyl
• Uvesterol D
• Viosterol
• Vitavel-D
• Activated ergosterol
• Calciferolum
• EINECS 200-014-9
• Hyperkil
• Rules-based systematic
names can be converted to
structures (in general)
• Synonyms and many UVCB
identifiers are vague and
ambiguous, and are commonly
used for “look-up” only
Converting Systematic Names to Structures
e.g. https://opsin.ch.cam.ac.uk/
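As a hedged sketch of how a systematic name might be sent to a name-to-structure service such as OPSIN: the code below builds a request URL and, optionally, fetches a SMILES string. The `/opsin/<name>.smi` path pattern is an assumption about the OPSIN web service and should be checked against its documentation; the function names are hypothetical.

```python
from urllib.error import HTTPError
from urllib.parse import quote
from urllib.request import urlopen

# Host taken from the slide; the "/opsin/<name>.<format>" layout is an assumption.
OPSIN_BASE = "https://opsin.ch.cam.ac.uk/opsin/"

def opsin_url(name, fmt="smi"):
    """Build a request URL for a systematic name, percent-encoding every reserved character."""
    return f"{OPSIN_BASE}{quote(name, safe='')}.{fmt}"

def name_to_smiles(name, timeout=10):
    """Fetch a SMILES string for `name`, or None if the service rejects the name.

    Requires network access; nothing in this block calls it automatically.
    """
    try:
        with urlopen(opsin_url(name), timeout=timeout) as response:
            return response.read().decode("utf-8").strip()
    except HTTPError:
        return None

print(opsin_url("ethanol"))  # https://opsin.ch.cam.ac.uk/opsin/ethanol.smi
```

Only rules-based systematic names are expected to convert; trade names and other declared synonyms generally need a look-up table instead.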
A chemical can have many CAS Registry
Numbers
• There are various flavors of CAS Registry
Numbers
• Active
• Alternate
• Deleted
• Some chemical substances have hundreds of
deleted CAS numbers!
• There are ways to map distinct structures but
this is difficult for mixtures and ambiguous
chemicals
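One way to picture the active, alternate and deleted flavors is a lookup table that resolves any CASRN to its substance record and status. This is a sketch under stated assumptions, not the Dashboard's data model: the class and field names are hypothetical, a single active CASRN per substance is assumed, and the identifiers are deliberate placeholders rather than real CASRNs.

```python
from dataclasses import dataclass, field

@dataclass
class SubstanceRecord:
    """Hypothetical record: one active CASRN plus alternate and deleted ones."""
    substance_id: str
    active_casrn: str
    alternate_casrns: set = field(default_factory=set)
    deleted_casrns: set = field(default_factory=set)

def build_casrn_lookup(records):
    """Map every known CASRN to (substance_id, status, active_casrn)."""
    lookup = {}
    for rec in records:
        lookup[rec.active_casrn] = (rec.substance_id, "active", rec.active_casrn)
        for casrn in rec.alternate_casrns:
            lookup[casrn] = (rec.substance_id, "alternate", rec.active_casrn)
        for casrn in rec.deleted_casrns:
            lookup[casrn] = (rec.substance_id, "deleted", rec.active_casrn)
    return lookup

# Placeholder identifiers only; these are not real CASRNs or Dashboard IDs.
record = SubstanceRecord(
    substance_id="SUBSTANCE-1",
    active_casrn="ACTIVE-RN",
    alternate_casrns={"ALTERNATE-RN-1"},
    deleted_casrns={"DELETED-RN-1", "DELETED-RN-2"},
)
LOOKUP = build_casrn_lookup([record])

print(LOOKUP["DELETED-RN-2"])  # ('SUBSTANCE-1', 'deleted', 'ACTIVE-RN')
```

A flat dictionary resolves any historical number in one step, which matters when a substance carries hundreds of deleted CASRNs. It does not address the harder case noted above, where mixtures and ambiguous chemicals have no single structure to map to.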
Dealing with Complex Substances – e.g.
PCBs
PCBs – members of the category
How does the dashboard deal with UVCBs?
• n-Nonylphenol – indicates a mixture of
substituted forms using “Markush structure”
representation
But there are various nonylphenols…
• 25154-52-3 General class
• 104-40-5 4-n-Nonyl phenol
• 84852-15-3 Branched 4-Nonyl phenols
• 11066-49-2 Isononylphenols
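The CASRNs listed above can be sanity-checked with the standard CAS check-digit rule: the final digit equals the weighted sum of the preceding digits (weights 1, 2, 3, ... counted from the right) modulo 10. The sketch below applies that rule; the function name is hypothetical. A valid check digit only shows the number is well-formed; it says nothing about whether the CASRN is active, alternate or deleted, or which nonylphenol it denotes.

```python
import re

def casrn_is_valid(casrn):
    """Check the format (2-7 digits, 2 digits, 1 check digit) and the check digit."""
    match = re.fullmatch(r"(\d{2,7})-(\d{2})-(\d)", casrn)
    if match is None:
        return False
    digits = match.group(1) + match.group(2)
    # Weight the digits 1, 2, 3, ... starting from the rightmost one.
    total = sum(weight * int(d) for weight, d in enumerate(reversed(digits), start=1))
    return total % 10 == int(match.group(3))

# The four nonylphenol CASRNs from the slide above.
NONYLPHENOL_CASRNS = ["25154-52-3", "104-40-5", "84852-15-3", "11066-49-2"]

for casrn in NONYLPHENOL_CASRNS:
    print(casrn, casrn_is_valid(casrn))  # each prints True
```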
The branched form is of interest as an endocrine disruptor
Structures shown: estradiol, branched nonylphenol
Dodecene is a family of substances
“Propylene Tetramer” substring search
• STRING Search gives two
substances
• A search for “propylene tetramer”
substring gives 7 substances
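The difference between the two search modes can be sketched as follows. This is an illustration only: the function names are hypothetical, and because the deck does not list the propylene tetramer hits, the toy data reuses ergocalciferol synonyms from an earlier slide, so it does not reproduce the counts of two and seven quoted above.

```python
def exact_search(names, query):
    """Return names equal to the query, ignoring case."""
    q = query.casefold()
    return [name for name in names if name.casefold() == q]

def substring_search(names, query):
    """Return names that contain the query anywhere, ignoring case."""
    q = query.casefold()
    return [name for name in names if q in name.casefold()]

# Toy data: ergocalciferol synonyms from the synonyms slide.
NAMES = ["Oleovitamin D", "Oleovitamin D2", "Ostelin", "Viosterol"]

print(exact_search(NAMES, "oleovitamin d"))      # ['Oleovitamin D']
print(substring_search(NAMES, "oleovitamin d"))  # ['Oleovitamin D', 'Oleovitamin D2']
```

A substring search widens recall at the cost of precision, which helps when a family of related substances shares a common name fragment.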
Dashboard Segregated Lists
• Lists of chemicals to serve specific
interests in chemical substance
sets
• TSCA Inventory, active, non-confidential
• Pesticides
• Lists assembled to serve projects
and communities as needed
Conclusions
• The Dashboard is a collection of
substances and data: ~875,000
substances and growing
• Data is under constant curation
• Mapped relationships between
chemicals are of value for complex
substances
• Support for UVCB chemicals is built-in
• Chemistry is complex, and informatics
solutions are more complete for
distinct structures than for complex
substances
