SlideShare a Scribd company logo
US-EPA Cheminformatics Support for
Delivering Data Related to Chemicals
of Emerging Concern
Antony Williams
Center for Computational Toxicology and Exposure, US-EPA, RTP, NC
The views expressed in this presentation are those of the author
and do not necessarily reflect the views or policies of the U.S. EPA
The role of cheminformatics at EPA
• I am from the EPA Center for Computational Toxicology and Exposure
• We develop lots of prediction models and web-based applications
• Today’s presentation: how do our efforts support data dissemination
regarding chemicals of emerging concern and MS-NTA
2
2
Chemical Monitoring Needs
Exposure
Assessment
Dose-
Response
Assessment
Risk
Characterization
Hazard
Identification
Free-Access Cheminformatics Tools
• The Center for Computational Toxicology and Exposure many tools
• CompTox Chemicals Dashboard
• Proof-of-Concept cheminformatics modules
• Chemicals Hazard Profiling
• Chemical Transformations database (ChET)
• Analytical Methods and Open Spectra database (AMOS)
• All chemicals are stored/curated in DSSTox
3
DSSTox Database
4
Accessing DSSTox chemistry:
CompTox Chemicals Dashboard
•A publicly accessible website delivering:
• 1.2M chemicals with related property data
• Related substances: transformation products, mono/polymer
• Experimental/predicted physicochemical property data
• Experimental Human and Ecological hazard data
• Integration to “biological assay data”
• Information regarding chemicals in consumer products
• Links to other agency websites and public data resources
• “Batch searching” for tens to thousands of chemicals
5
CompTox Chemicals Dashboard
https://comptox.epa.gov/dashboard
6
1 of ~1.2M Chemical Pages
7
Physicochemical Properties
8
Experimental Data
9
• Experimental data harvested
from public domain databases
and journal articles
• Data link back to provenance
• Data are used to build QSAR
models for real time predictions
• Data are available for download
and reuse
What is PFOS Called?
Synonyms, CASRNs and more
10
Substance Relationship Mappings
• Similar compounds -
based on structure
“fingerprints”
11
Relationships in the data
12
• Structure mappings -
between parent and
salts, multicomponent
chemicals, isotopomers
• Related substances –
monomer to polymer,
parent to transformation
products
Batch Searching is a big enabler
https://pubs.acs.org/doi/10.1021/acs.jcim.0c01273
13
Batch Searching
• Singleton searches are useful but people work with groups of chemicals
• Typical questions
• Find me all data based on the input of 1000 CASRNs, or 1000 names
• What are the physicochemical properties for a set of identifiers?
• What is the list of chemicals for the formula CxHyOz?
• What is the list of chemicals for a mass +/- error?
• Can I get chemical lists in Excel files? In SDF files?
• Can I include properties in the download file?
14
Batch Search
Batch Search
Batch Search
• All data can be
downloaded into Excel
files, CSV files or SDF
files and reused
• All data are Open
Chemical Lists
https://comptox.epa.gov/dashboard/chemical_lists
• Chemical lists are focused on regulations, research efforts and categories
• 425 lists and growing
• TSCA Inventory
• Clean Water Act Hazardous Substances
• Consumer Products database
• Chemicals of Emerging Concern
• PFAS lists
• Extractables and Leachables
• Lists are versioned and updated and new lists added regularly
18
Extractables
19
Tire Crumb Rubber
20
Hydraulic Fracturing
21
Disinfection By-Products
22
PFAS Lists of Chemicals (51/426)
23
Consumer Products Database
24
Applications at the EPA
•We have ongoing efforts applying NTA to multiple
challenges including
• PFAS identification
• Pesticides in various matrices
• CECs in water
• Biosolids
•Examples include…
25
Example 1: Consumer Product Analysis
26
27
Many chemicals observed in
consumer product extracts
More observed chemicals not
known to be in consumer
products
Why might the ‘other’
chemicals be in the products?
Many observed chemicals
known to be in consumer
products
Example 1: Consumer Product Analysis
28
Example 2: Recycled Product Analysis
29
Significant differences between
chemicals in recycled vs. virgin products
for certain product & use categories
Most differences observed in paper
products and construction materials
Some uses (e.g., fragrances) highly
represented across all product/use
categories
Example 2: Recycled Product Analysis
Example 3: Placental Tissue Analysis
30
Lots of “proof-of-concept” tools in development
• PoCs are research software builds to prove approaches before moving
into production software environments
• Assemble data, develop data model(s), test user interface approaches,
work with test user base to garner feedback
• Since PoCs are internal access data refreshes and application updates
can be more
31
32
Cheminformatics PoC Modules
https://www.epa.gov/chemical-research/cheminformatics
Easy Export of all data to Excel
33
AMOS: Analytical Methods and Spectra Database
• Three types of data in the database:
• Methods (regulatory, lab manuals and SOPs, publications, tech notes)
• Spectra (from public domain and our own laboratories)
• Fact Sheets (harvested from SWGDRUG and other sites)
• Currently contains >210,000 spectra, >700,000 external links, 4000
“Fact Sheets” and ~4000 methods
• ALL data are growing in number weekly at present
34
Embedded Method PDFs
35
Literature articles, SOPs, Protocols
36
Linking to actual spectra
37
Linking to actual spectra
38
• We are doing a lot of chemical curation as we
build the database
Why not just Regulatory Methods?
39
Why not just Regulatory Methods?
Because we need methods faster
40
Full presentation
https://t.ly/4MxFe
41
Our Data via services
https://api-ccte.epa.gov/docs/
42
Conclusions
• Our data resources underpin our research efforts – data quality is key
• Our web-based applications deliver our data to the community
• Our support for identifying chemicals of emerging concern is multi-fold
• Curated chemistry data streams
• Non-targeted analysis tool development and cheminformatics support
• NTA WebApp in development uses all data streams to support analysis
43
Acknowledgements and Contact Information
• The work presented here represents an enormous team of contributors
• Chemical curators
• Software developers and contractors
• Postdocs, SMEs and PIs
• Contact info: williams.antony@epa.gov
• Slides will be available at: https://www.slideshare.net/AntonyWilliams/
44

More Related Content

Similar to US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of Emerging Concern

How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Delivering access to chemistry and bioassay data from the National Center for...
Delivering access to chemistry and bioassay data from the National Center for...Delivering access to chemistry and bioassay data from the National Center for...
Delivering access to chemistry and bioassay data from the National Center for...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
New Approach Methods - What is That?
New Approach Methods - What is That?New Approach Methods - What is That?
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data DashboardsAccessing Environmental Chemistry Data via Data Dashboards
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA Chemicals Dashboard and Applications to Digital Design of Molecules
US-EPA Chemicals Dashboard and Applications to Digital Design  of MoleculesUS-EPA Chemicals Dashboard and Applications to Digital Design  of Molecules
US-EPA Chemicals Dashboard and Applications to Digital Design of Molecules
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Progress in delivering transparency in research data
Progress in delivering transparency in research dataProgress in delivering transparency in research data
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
Andrew McEachran
 
Incorporating new technologies and High Throughput Screening in the design an...
Incorporating new technologies and High Throughput Screening in the design an...Incorporating new technologies and High Throughput Screening in the design an...
Incorporating new technologies and High Throughput Screening in the design an...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Similar to US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of Emerging Concern (20)

How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
Delivering access to chemistry and bioassay data from the National Center for...
Delivering access to chemistry and bioassay data from the National Center for...Delivering access to chemistry and bioassay data from the National Center for...
Delivering access to chemistry and bioassay data from the National Center for...
 
New Approach Methods - What is That?
New Approach Methods - What is That?New Approach Methods - What is That?
New Approach Methods - What is That?
 
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data DashboardsAccessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
 
US-EPA Chemicals Dashboard and Applications to Digital Design of Molecules
US-EPA Chemicals Dashboard and Applications to Digital Design  of MoleculesUS-EPA Chemicals Dashboard and Applications to Digital Design  of Molecules
US-EPA Chemicals Dashboard and Applications to Digital Design of Molecules
 
Progress in delivering transparency in research data
Progress in delivering transparency in research dataProgress in delivering transparency in research data
Progress in delivering transparency in research data
 
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
 
Incorporating new technologies and High Throughput Screening in the design an...
Incorporating new technologies and High Throughput Screening in the design an...Incorporating new technologies and High Throughput Screening in the design an...
Incorporating new technologies and High Throughput Screening in the design an...
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 

Recently uploaded

Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
frank0071
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
alishadewangan1
 
ISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
ISI 2024: Application Form (Extended), Exam Date (Out), EligibilityISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
ISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
SciAstra
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
zeex60
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 

Recently uploaded (20)

Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
 
ISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
ISI 2024: Application Form (Extended), Exam Date (Out), EligibilityISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
ISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 

US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of Emerging Concern

  • 1. US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of Emerging Concern Antony Williams Center for Computational Toxicology and Exposure, US-EPA, RTP, NC The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
  • 2. The role of cheminformatics at EPA • I am from the EPA Center for Computational Toxicology and Exposure • We develop lots of prediction models and web-based applications • Today’s presentation: how do our efforts support data dissemination regarding chemicals of emerging concern and MS-NTA 2 2 Chemical Monitoring Needs Exposure Assessment Dose- Response Assessment Risk Characterization Hazard Identification
  • 3. Free-Access Cheminformatics Tools • The Center for Computational Toxicology and Exposure many tools • CompTox Chemicals Dashboard • Proof-of-Concept cheminformatics modules • Chemicals Hazard Profiling • Chemical Transformations database (ChET) • Analytical Methods and Open Spectra database (AMOS) • All chemicals are stored/curated in DSSTox 3
  • 5. Accessing DSSTox chemistry: CompTox Chemicals Dashboard •A publicly accessible website delivering: • 1.2M chemicals with related property data • Related substances: transformation products, mono/polymer • Experimental/predicted physicochemical property data • Experimental Human and Ecological hazard data • Integration to “biological assay data” • Information regarding chemicals in consumer products • Links to other agency websites and public data resources • “Batch searching” for tens to thousands of chemicals 5
  • 7. 1 of ~1.2M Chemical Pages 7
  • 9. Experimental Data 9 • Experimental data harvested from public domain databases and journal articles • Data link back to provenance • Data are used to build QSAR models for real time predictions • Data are available for download and reuse
  • 10. What is PFOS Called? Synonyms, CASRNs and more 10
  • 11. Substance Relationship Mappings • Similar compounds - based on structure “fingerprints” 11
  • 12. Relationships in the data 12 • Structure mappings - between parent and salts, multicomponent chemicals, isotopomers • Related substances – monomer to polymer, parent to transformation products
  • 13. Batch Searching is a big enabler https://pubs.acs.org/doi/10.1021/acs.jcim.0c01273 13
  • 14. Batch Searching • Singleton searches are useful but people work with groups of chemicals • Typical questions • Find me all data based on the input of 1000 CASRNs, or 1000 names • What are the physicochemical properties for a set of identifiers? • What is the list of chemicals for the formula CxHyOz? • What is the list of chemicals for a mass +/- error? • Can I get chemical lists in Excel files? In SDF files? • Can I include properties in the download file? 14
  • 17. Batch Search • All data can be downloaded into Excel files, CSV files or SDF files and reused • All data are Open
  • 18. Chemical Lists https://comptox.epa.gov/dashboard/chemical_lists • Chemical lists are focused on regulations, research efforts and categories • 425 lists and growing • TSCA Inventory • Clean Water Act Hazardous Substances • Consumer Products database • Chemicals of Emerging Concern • PFAS lists • Extractables and Leachables • Lists are versioned and updated and new lists added regularly 18
  • 23. PFAS Lists of Chemicals (51/426) 23
  • 25. Applications at the EPA •We have ongoing efforts applying NTA to multiple challenges including • PFAS identification • Pesticides in various matrices • CECs in water • Biosolids •Examples include… 25
  • 26. Example 1: Consumer Product Analysis 26
  • 27. 27 Many chemicals observed in consumer product extracts More observed chemicals not known to be in consumer products Why might the ‘other’ chemicals be in the products? Many observed chemicals known to be in consumer products Example 1: Consumer Product Analysis
  • 28. 28 Example 2: Recycled Product Analysis
  • 29. 29 Significant differences between chemicals in recycled vs. virgin products for certain product & use categories Most differences observed in paper products and construction materials Some uses (e.g., fragrances) highly represented across all product/use categories Example 2: Recycled Product Analysis
  • 30. Example 3: Placental Tissue Analysis 30
  • 31. Lots of “proof-of-concept” tools in development • PoCs are research software builds to prove approaches before moving into production software environments • Assemble data, develop data model(s), test user interface approaches, work with test user base to garner feedback • Since PoCs are internal access data refreshes and application updates can be more 31
  • 33. Easy Export of all data to Excel 33
  • 34. AMOS: Analytical Methods and Spectra Database • Three types of data in the database: • Methods (regulatory, lab manuals and SOPs, publications, tech notes) • Spectra (from public domain and our own laboratories) • Fact Sheets (harvested from SWGDRUG and other sites) • Currently contains >210,000 spectra, >700,000 external links, 4000 “Fact Sheets” and ~4000 methods • ALL data are growing in number weekly at present 34
  • 37. Linking to actual spectra 37
  • 38. Linking to actual spectra 38 • We are doing a lot of chemical curation as we build the database
  • 39. Why not just Regulatory Methods? 39
  • 40. Why not just Regulatory Methods? Because we need methods faster 40
  • 42. Our Data via services https://api-ccte.epa.gov/docs/ 42
  • 43. Conclusions • Our data resources underpin our research efforts – data quality is key • Our web-based applications deliver our data to the community • Our support for identifying chemicals of emerging concern is multi-fold • Curated chemistry data streams • Non-targeted analysis tool development and cheminformatics support • NTA WebApp in development uses all data streams to support analysis 43
  • 44. Acknowledgements and Contact Information • The work presented here represents an enormous team of contributors • Chemical curators • Software developers and contractors • Postdocs, SMEs and PIs • Contact info: williams.antony@epa.gov • Slides will be available at: https://www.slideshare.net/AntonyWilliams/ 44