The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
Cheminformatics tools supporting
dissemination of data associated with US EPA
Clean Water Act hazardous substances
Antony Williams¹, Fran Kremer², Jason Lambert¹, Jace Cuje³ and Valery Tkachenko⁴
1. Center for Computational Toxicology and Exposure, US-EPA
2. Center for Environmental Solutions and Emergency Response, US-EPA
3. Office of Science Advisor, Policy and Engagement, US-EPA
4. ScienceDataExperts Inc.
March 2024: Spring Meeting, New Orleans, LA
Data, Model and Tool Development
• I work for the Center for Computational Toxicology and Exposure
in the Computational Chemistry and Cheminformatics Branch
• There are many tools developed by our cheminformatics team and
across other centers in EPA. I will represent ours only…
• We have production level public-facing tools, proof-of-concept
public-facing tools, and many tools in development…
• We focus on FAIR data, releasing it to the community and making
it available through public APIs
Free-Access Cheminformatics Tools
• The Center for Computational Toxicology and Exposure has
delivered many tools including
– CompTox Chemicals Dashboard
– Proof-of-Concept cheminformatics modules
• Chemicals Hazard Profiling
• Chemical Transformations Database
• Analytical Methods and Spectra
• Chemical Safety Profiling
Research Projects we apply them to
Curating Chemistry into the DSSTox Database
• Chemistry underpins all of our tools
• Data assembly and curation are critical
• DSSTox assembled over 25 years
CompTox Chemicals Dashboard
https://comptox.epa.gov/dashboard/
The Charge for the Dashboard
• Develop a “first-stop-shop” for environmental chemical data to
support EPA and partner decision making:
– Centralized location for relevant chemical data
– Chemistry, exposure, hazard and dosimetry
– Combination of existing data and predictive models
– Publicly accessible, periodically updated, curated
• Easy access to data improves efficiency and ultimately
accelerates chemical risk assessment
Detailed Chemical Pages
Experimental and Predicted Data
• Physchem and Fate & Transport
experimental and predicted data
• Data can be downloaded as Excel,
TSV and CSV files
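A downloaded TSV or CSV export can be read with a few lines of Python. The sketch below is illustrative only: the column names (PROPERTY, VALUE, TYPE) and the values are hypothetical placeholders and are not the Dashboard's actual export schema.

```python
import csv
import io
from collections import defaultdict


def load_export(text, delimiter=","):
    """Parse exported table text into a list of row dictionaries."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    return list(reader)


def group_by_column(rows, column):
    """Group rows by the value found in one column."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row[column]].append(row)
    return dict(grouped)


# Placeholder TSV content; headers and numbers are made up for illustration.
sample_tsv = (
    "PROPERTY\tVALUE\tTYPE\n"
    "Water solubility\t1.2\texperimental\n"
    "Water solubility\t1.5\tpredicted\n"
)

rows = load_export(sample_tsv, delimiter="\t")
by_type = group_by_column(rows, "TYPE")
print(sorted(by_type))
```

Passing `delimiter="\t"` handles TSV files; the default comma handles CSV. Excel files would need a separate reader.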
Chemical Hazard Data
Hazard Data for Copper
• 2246 rows of human/eco hazard data harvested with three
clicks
Sources of Exposure to Chemicals
Chemical Lists
• Chemical lists are focused on regulations, specific research
efforts and categories
• 425 lists and growing
– TSCA Inventory
– Clean Water Act Hazardous Substances
– Consumer Products database
– Chemicals of Emerging Concern
– PFAS lists
– Extractables and Leachables
– …lists are versioned and updated and new lists added
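Because lists are versioned, a common task is to see what changed between two versions of the same list. A minimal sketch, assuming each version has already been exported as a collection of identifiers; the identifiers shown are placeholders, not real list members.

```python
def diff_list_versions(old_members, new_members):
    """Compare two versions of a chemical list by membership."""
    old_set, new_set = set(old_members), set(new_members)
    return {
        "added": sorted(new_set - old_set),
        "removed": sorted(old_set - new_set),
        "unchanged": sorted(old_set & new_set),
    }


# Placeholder identifiers for illustration only.
version_1 = ["CHEM-A", "CHEM-B", "CHEM-C"]
version_2 = ["CHEM-B", "CHEM-C", "CHEM-D"]

changes = diff_list_versions(version_1, version_2)
print(changes["added"], changes["removed"])
```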
Remember those Research Projects?
Some Research Projects…
Clean Water Act Hazardous Substances
Harvesting Data en masse
• Harvesting data for CWAHS-related chemicals
– Physicochemical properties
– Fate and transport
– Toxicity values
– Exposure data
– Chemical identifiers
– Links to regulatory assessments
Batch Searching
Batch Searching is a big enabler
https://pubs.acs.org/doi/10.1021/acs.jcim.0c01273
Batch Search
Batch Search – Excel, CSV, SDF file
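Before submitting a batch search it helps to clean the identifier list. The sketch below removes blanks and duplicates and separates CAS Registry Numbers from other identifiers using the standard CAS check-digit rule. The function names are illustrative and are not part of any EPA tool.

```python
import re

# CAS RN layout: 2 to 7 digits, 2 digits, 1 check digit.
CAS_PATTERN = re.compile(r"^(\d{2,7})-(\d{2})-(\d)$")


def is_valid_casrn(identifier):
    """Return True if the string is a well-formed CAS RN with a correct check digit."""
    match = CAS_PATTERN.match(identifier.strip())
    if not match:
        return False
    digits = match.group(1) + match.group(2)
    check_digit = int(match.group(3))
    # Weight the digits 1, 2, 3, ... from right to left, then take modulo 10.
    total = sum(
        weight * int(digit)
        for weight, digit in enumerate(reversed(digits), start=1)
    )
    return total % 10 == check_digit


def prepare_batch_input(raw_identifiers):
    """Strip, de-duplicate (case-insensitively) and split into CAS RNs and others."""
    seen = set()
    casrns, others = [], []
    for raw in raw_identifiers:
        identifier = raw.strip()
        key = identifier.upper()
        if not identifier or key in seen:
            continue
        seen.add(key)
        if is_valid_casrn(identifier):
            casrns.append(identifier)
        else:
            others.append(identifier)
    return casrns, others


casrns, others = prepare_batch_input(["71-43-2", " benzene", "BENZENE", "71-43-2", ""])
print(casrns, others)
```

Identifiers that fail the check-digit test end up in `others`, so a mistyped CAS RN is treated as a name or synonym rather than silently dropped.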
We supply predicted data for many endpoints
• Property prediction – e.g., water solubility, vapor pressure
• Fate and Transport – e.g., bioaccumulation, bioconcentration
• Bioactivity – e.g., endocrine disruption
• Models are constantly updated with fresh data, are transparent
in their data, and are open source
QSAR Modeled Data are available
• We build models, then apply them to our curated datasets
for release, PLUS deliver the models for real-time use
Where do we use predictions like this?
• Models are used in many places in our computational
toxicology research
• They are used in the analytical labs to help guide
non-targeted analysis
• Stakeholders use them for hazard
profiling of chemicals
• Predictions for breakdown
products in the environment
Lots of “proof-of-concept” tools in development
• PoCs are research software builds used to prove approaches
before moving into production software environments
• PoCs help us figure out how to address specific questions
• Assemble data, develop data model(s), test user interface
approaches, work with test user base to garner feedback
• Since PoCs are internal access, data refreshes and application
updates can be more frequent
• Underlying APIs are being used in our research
How to compare Hazard Data?
How to compare Hazard Data?
NOT Easy to interpret…
Hazard Profile
• Hazard Comparison module profiles toxicity across chemicals
https://www.epa.gov/chemical-research/cheminformatics
Hazard Profile
On-Hover view of trumping scheme call
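The slide does not spell out how the trumping scheme works. One plausible reading is that, when several sources report a call for the same hazard endpoint, the call from the highest-ranked source is the one displayed. The sketch below assumes exactly that; the source names, ranks and record fields are hypothetical and are not taken from the module.

```python
def trumped_call(records, source_rank):
    """Pick the record whose source has the best (lowest) rank.

    Records from sources missing in source_rank are ignored.
    Returns None when no ranked record is available.
    """
    ranked = [record for record in records if record["source"] in source_rank]
    if not ranked:
        return None
    return min(ranked, key=lambda record: source_rank[record["source"]])


# Hypothetical ranking: 1 trumps 2, which trumps 3.
example_rank = {"authoritative": 1, "screening": 2, "predicted": 3}

example_records = [
    {"source": "predicted", "call": "High"},
    {"source": "screening", "call": "Medium"},
    {"source": "authoritative", "call": "Low"},
]

winner = trumped_call(example_records, example_rank)
print(winner["source"], winner["call"])
```

Under this assumption the displayed call follows the most trusted source rather than the most severe score, which is a design choice a real scheme may or may not share.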
Hazard Profile
On-click view of underlying data
Data to Excel in <60s
Linked to Chemical Transformation Simulator
Where can our tools be applied?
• Emergency Response utility is obvious…
• Consider East Palestine
https://www.cleveland19.com/2023/02/14/ntsb-announces-preliminary-malfunction-that-caused-east-palestine-train-derailment/
POLYPROPYLENE
POLYETHYLENE
Residue lube oil
VINYL CHLORIDE
DIPROPYLENE GLYCOL
PROPYLENE GLYCOL
DIETHYLENE GLYCOL
COMBUSTIBLE LIQ., NOS (ETHYLENE GLYCOL MONOBUTYL ETHER)
SEMOLINA
COMBUSTIBLE LIQ., NOS (ETHYLHEXYL ACRYLATE)
POLYVINYL
PETROLEUM LUBEOIL
POLYPROPYL GLYCOL
ISOBUTYLENE
BUTYL ACRYLATES, STABILIZED
PETRO OIL, NEC
ADDITIVES, FUEL
BALLS,CTN,M EDCL
SHEET STEEL
VEGTABLE, FROZEN
BENZENE
PARAFFIN WAX
FLAKES, POWDER
HYDRAULIC CEMENT
AUTOS PASSENGER
MALT LIQUORS
Hazard Comparison Profiling
Perfect Example of FAIR Data and APIs
• We owe a lot to FAIR data and availability of information
• We curate a lot of our chemistry data using public resources
such as PubChem, ChEBI, Common Chemistry and others
• The availability of Public APIs takes things to another level!
• We have been using the PubChem API to harvest data so
we can build new applications, like the Safety Module
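As an illustration of harvesting through a public API, the sketch below builds a PubChem PUG REST request for a few computed properties by chemical name. It is not the team's harvesting code; the chosen properties and the function names are assumptions made for this example, and the fetch function needs network access.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

PUG_REST_BASE = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"
DEFAULT_PROPERTIES = ("MolecularFormula", "MolecularWeight", "InChIKey")


def build_property_url(name, properties=DEFAULT_PROPERTIES):
    """Build a PUG REST URL that looks up properties by chemical name."""
    encoded_name = quote(name, safe="")
    property_list = ",".join(properties)
    return f"{PUG_REST_BASE}/compound/name/{encoded_name}/property/{property_list}/JSON"


def fetch_properties(name, properties=DEFAULT_PROPERTIES, timeout=30):
    """Call PubChem and return the list of property records (requires network)."""
    with urlopen(build_property_url(name, properties), timeout=timeout) as response:
        payload = json.load(response)
    return payload["PropertyTable"]["Properties"]


url = build_property_url("vinyl chloride")
print(url)
```

When harvesting many chemicals, requests should be spaced out to respect PubChem's usage policy.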
Cheminformatics Safety Module (NOT PUBLIC)
Integrate multiple data streams…
Some Related Publications of Interest
You want to know more…
• Lots of resources available
– Presentations: https://tinyurl.com/w5hqs55
– Communities of Practice Videos: https://rb.gy/qsbno1
– Manual: https://rb.gy/4fgydc
– Latest News: https://comptox.epa.gov/dashboard/news_info
This talk is an overview
• This talk is a high-level overview only. We
can provide training on the individual
modules and data as required
• LOTS of training materials are available
https://www.epa.gov/chemical-research/new-approach-methods-nams-training
Conclusions
• Underpinning chemistry data is from the DSSTox database
• CompTox Chemicals Dashboard provides public access to DSSTox
and other related databases
• Proof-of-Concept (PoC) tools are built to prove approaches
• Everything is increasingly API driven and APIs are now public
Contact Information
• Contact info: williams.antony@epa.gov
• Slides available at: https://www.slideshare.net/AntonyWilliams/
• Obtain articles from Google Scholar Profile