SlideShare a Scribd company logo
1 of 1
Download to read offline
Innovative Research for a Sustainable Futurewww.epa.gov/research
Developing tools for high resolution mass spectrometry-based screening via the EPA’s
CompTox Dashboard
Andrew D. McEachran1,2*, Kamel Mansouri3, Hussein Al-Ghoul4, Alex Chao4, Chris Grulke2, Jon R. Sobus5, and Antony J. Williams2
1Oak Ridge Institute of Science and Education (ORISE) Research Participant, Research Triangle Park, NC
2U.S. Environmental Protection Agency, Office of Research and Development, National Center for Computational Toxicology, Research Triangle Park, NC
3Integrated Laboratory Systems, Inc., Morrisville, NC | 4Oak Ridge Associated Universities, Research Triangle Park, NC
5U.S. Environmental Protection Agency, Office of Research and Development, National Exposure Research Laboratory, Research Triangle Park, NC
* mceachran.andrew@epa.gov l ORCiD: 0000-0003-1423-330X
Problem Definition and Goals
The CompTox Dashboard
MS-Ready Structures for Database Searching MS/MS Data in NTA/SSA
References
Acknowledgements
1. Williams, AJ, et al. 2017. The CompTox Chemistry Dashboard: a community data resource for environmental chemistry. J
Cheminf. https://doi.org/10.1186/s13321-017-0247-6
2. McEachran, AD, et al. 2017. Identifying known unknowns using the US EPA's CompTox Chemistry Dashboard. Anal. Bioanal.
Chem. 409(7): 1729-1735. doi:10.1007/s00216-016-0139-z
3. McEachran, AD, et al. 2017. A comparison of three chromatographic retention time prediction models. Talanta.
https://doi.org/10.1016/j.talanta.2018.01.022
4. McEachran, AD, et al. 2018. “MS-Ready” structures for non-targeted high-resolution mass spectrometry screening studies. J
Cheminf. Accepted for publication.
5. Allen, et al. 2015. Competitive fragmentation modeling of ESI-MS/MS spectra for putative metabolite identification.
Metabolomics. https://doi.org/10.1007/s11306-014-0676-4
6. Newton, SR, et al. 2018. Suspect screening and non-targeted analysis of drinking water using point-of-use filters. Environ Poll.
https://doi.org/10.1016/j.envpol.2017.11.033
Applications
The authors would like to thank Emma Schymanski and Christoph Ruttkies for their valuable contributions on the MS-Ready Structures,
associated publication, and integration to MetFrag. The authors would also like to thank Reza Aalizadeh and Nikolaos Thomaidis for use
of the UOA-RTI software for retention time index prediction (manuscript submitted for publication). The authors would like to
acknowledge the Dashboard development team for their efforts in surfacing all functionality. This work was supported in part by an
appointment to the ORISE Research Participation Program at the Office of Research and Development, U.S. EPA, through an
interagency agreement between the US EPA and DOE.
This presentation does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Chemical metadata for ranking
Problem: Non-targeted and suspect screening studies using high
resolution mass spectrometry (HRMS) have revolutionized the detection of
chemicals in complex matrices. However, data processing remains
challenging due to the vast number of chemicals detected in samples,
software and computational requirements of data processing, and inherent
uncertainty in confidently identifying chemicals from candidate lists.
Goals: Develop tools, data, and visualization approaches within an open
chemistry resource to provide a freely available software tool to support
structure identification and non-targeted analysis.
Home page. The CompTox Dashboard (https://comptox.epa.gov) is a comprehensive
chemistry resource containing chemistry data on more than 760,000 chemical substances.
(Left and Below) To facilitate database searching, structures
in DSSTox were processed into their “MS-Ready” form [4].
This processing workflow removes salts and stereochemistry
and separates components of mixtures while retaining
linkages to the original structure and substance. This enables
the form of a structure observed via HRMS to be related to all
variants of a structure in the database.
(Left) The results of an MS-Ready query include
all those substances where one component of a
substance matches the input query terms [4]. In
this example, C10H14N2 returns single component
chemicals, mixtures, and salts. Returning all
linked DTXSIDs enables access to varied
metadata.
Percentage of chemicals identified using data source ranking
combining multiple metadata sources. Total n=1748 unique
chemicals from the ENTACT trial and CASMI 2016 contest (training
and test sets). DS=DSSTox Data Sources, PC= PubChem Data
Sources, PM=PubMed Reference Counts, STOFF= Presence in
STOFF-IDENT, KEMI_half= Swedish Chemicals Agency (KEMI)
Market List, weighted by 0.5.
# Identified % of Total
#1 Hits 89 43%
Top 5 154 74%
Top 10 174 84%
Top 20 190 91%
# Identified % of Total
#1 Hits 154 74%
Top 5 195 94%
Top 10 198 95%
Top 20 202 97%
CFM-ID only CFM-ID +DSSTox Data Sources
Using CFM-ID command line tools [5], MS/MS spectra were predicted for >700,000 structures in
ESI+, ESI-, and EI mode (manuscript in prep). The ESI+/- data were used to analyze the CASMI
2016 datasets. Data below are structure identification ranks for the challenge set (n=208).
Retention Time Prediction
logP ChromGenius OPERA-RT
Combined Test and
Training Set (n=97)
R2 0.66 0.83 0.86
RMSE (min) 5.50 3.93 3.60
Absolute Mean Error
(min)
4.65 3.03 2.93
Suspect screening of drinking water using point of
use filters [6].
• Dashboard tools used for identification (data
source ranking and batch searching)
• Dashboard tools used for prioritization of
identified compounds (exposure, bioactivity)
EPA’s Non-targeted Analysis Collaborative Trial
(ENTACT):
• MS-Ready structures underpinning analyses
and provided to participants, improving
accessibility to data
• In-house analysis utilizing data source
ranking, metadata in Dashboard
0 2 0 0 4 0 0 6 0 0 8 0 0 1 0 0 0
0
2 0 0
4 0 0
6 0 0
8 0 0
1 0 0 0
A C N M e th o d
E xp e rim e n ta l R T I
PredictedRTI
(Top) Results from [2] indicated that an in-house QSRR model
termed OPERA-RT performed on par with the commercial
software ACD/ChromGenius.
(Bottom) Evaluation of retention time index (RTI) modeling
developed by Aalizadeh et al (submitted manuscript).
Experimental vs. predicted RTI value from ENTACT mixtures.
Batch Search Page. The Batch Search page enables users to query the underlying
database using data collected from HRMS studies as molecular formulae or monoisotopic
masses (i.e. 10s to 100s at a time). Metadata, structural information, presence in chemical
lists, and many more pieces of data can be included in the download.
Single Chemical Page. A single chemical page includes chemical structures,
experimental and predicted physicochemical and toxicity data, exposure data, and much
more.

More Related Content

What's hot

US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Sharing chemical structures with peer reviewed publications
Sharing chemical structures with peer reviewed publications Sharing chemical structures with peer reviewed publications
Sharing chemical structures with peer reviewed publications
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Translating research into practical tools: A case study of GenRA, a new read...
Translating research into practical tools: A case study of GenRA,  a new read...Translating research into practical tools: A case study of GenRA,  a new read...
Translating research into practical tools: A case study of GenRA, a new read...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
How predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarHow predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinar
Ann-Marie Roche
 

What's hot (20)

US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
 
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
 
Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Overview of open resources to support automated structure verification and e...
Overview of open resources to support automated structure verification  and e...Overview of open resources to support automated structure verification  and e...
Overview of open resources to support automated structure verification and e...
 
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
 
Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Data analysis workflows part 2 2015
Data analysis workflows part 2 2015
 
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
 
Sharing chemical structures with peer reviewed publications
Sharing chemical structures with peer reviewed publications Sharing chemical structures with peer reviewed publications
Sharing chemical structures with peer reviewed publications
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Translating research into practical tools: A case study of GenRA, a new read...
Translating research into practical tools: A case study of GenRA,  a new read...Translating research into practical tools: A case study of GenRA,  a new read...
Translating research into practical tools: A case study of GenRA, a new read...
 
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
 
Multivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysisMultivarite and network tools for biological data analysis
Multivarite and network tools for biological data analysis
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
Prote-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and VisualizationProte-OMIC Data Analysis and Visualization
Prote-OMIC Data Analysis and Visualization
 
How predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarHow predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinar
 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
 

Similar to Developing tools for high resolution mass spectrometry-based screening via the EPA’s CompTox Dashboard

The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Kamel Mansouri
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Kamel Mansouri
 
Free online access to experimental and predicted chemical properties through ...
Free online access to experimental and predicted chemical properties through ...Free online access to experimental and predicted chemical properties through ...
Free online access to experimental and predicted chemical properties through ...
Kamel Mansouri
 
An integrated data hub for per- and polyfluoroalkyl (PFAS) chemicals to suppo...
An integrated data hub for per- and polyfluoroalkyl (PFAS) chemicals to suppo...An integrated data hub for per- and polyfluoroalkyl (PFAS) chemicals to suppo...
An integrated data hub for per- and polyfluoroalkyl (PFAS) chemicals to suppo...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
SETAC HTS SFlood
SETAC HTS SFlood SETAC HTS SFlood
SETAC HTS SFlood
Stacie Flood
 
Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
Kamel Mansouri
 
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
A comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction modelsA comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction models
Andrew McEachran
 
Predictive tox proposal_ver1
Predictive tox proposal_ver1Predictive tox proposal_ver1
Predictive tox proposal_ver1
Ankur Khanna
 

Similar to Developing tools for high resolution mass spectrometry-based screening via the EPA’s CompTox Dashboard (20)

The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
 
Hdat pdf-draft
Hdat pdf-draftHdat pdf-draft
Hdat pdf-draft
 
Free online access to experimental and predicted chemical properties through ...
Free online access to experimental and predicted chemical properties through ...Free online access to experimental and predicted chemical properties through ...
Free online access to experimental and predicted chemical properties through ...
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
 
An integrated data hub for per- and polyfluoroalkyl (PFAS) chemicals to suppo...
An integrated data hub for per- and polyfluoroalkyl (PFAS) chemicals to suppo...An integrated data hub for per- and polyfluoroalkyl (PFAS) chemicals to suppo...
An integrated data hub for per- and polyfluoroalkyl (PFAS) chemicals to suppo...
 
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
 
SETAC HTS SFlood
SETAC HTS SFlood SETAC HTS SFlood
SETAC HTS SFlood
 
Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
 
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
 
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
Medical science
Medical scienceMedical science
Medical science
 
A comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction modelsA comparison of three chromatographic retention time prediction models
A comparison of three chromatographic retention time prediction models
 
D1803012022
D1803012022D1803012022
D1803012022
 
GRC-MS: A GENETIC RULE-BASED CLASSIFIER MODEL FOR ANALYSIS OF MASS SPECTRA DATA
GRC-MS: A GENETIC RULE-BASED CLASSIFIER MODEL FOR ANALYSIS OF MASS SPECTRA DATAGRC-MS: A GENETIC RULE-BASED CLASSIFIER MODEL FOR ANALYSIS OF MASS SPECTRA DATA
GRC-MS: A GENETIC RULE-BASED CLASSIFIER MODEL FOR ANALYSIS OF MASS SPECTRA DATA
 
Predictive tox proposal_ver1
Predictive tox proposal_ver1Predictive tox proposal_ver1
Predictive tox proposal_ver1
 

More from Andrew McEachran

Consensus ranking and fragmentation prediction for identification of unknowns...
Consensus ranking and fragmentation prediction for identification of unknowns...Consensus ranking and fragmentation prediction for identification of unknowns...
Consensus ranking and fragmentation prediction for identification of unknowns...
Andrew McEachran
 
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
Andrew McEachran
 
Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...
Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...
Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...
Andrew McEachran
 
Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...
Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...
Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...
Andrew McEachran
 

More from Andrew McEachran (6)

Consensus ranking and fragmentation prediction for identification of unknowns...
Consensus ranking and fragmentation prediction for identification of unknowns...Consensus ranking and fragmentation prediction for identification of unknowns...
Consensus ranking and fragmentation prediction for identification of unknowns...
 
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
 
Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...
Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...
Leveraging chemistry data to improve exposure analyses using the EPA’s CompTo...
 
Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...
Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...
Using the US EPA's CompTox Chemistry Dashboard to advance non-targeted analys...
 
Using the US EPA's CompTox Chemistry Dashboard to support identification and ...
Using the US EPA's CompTox Chemistry Dashboard to support identification and ...Using the US EPA's CompTox Chemistry Dashboard to support identification and ...
Using the US EPA's CompTox Chemistry Dashboard to support identification and ...
 
Structure identification using high resolution mass spectrometry data and the...
Structure identification using high resolution mass spectrometry data and the...Structure identification using high resolution mass spectrometry data and the...
Structure identification using high resolution mass spectrometry data and the...
 

Recently uploaded

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 

Recently uploaded (20)

Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 

Developing tools for high resolution mass spectrometry-based screening via the EPA’s CompTox Dashboard

  • 1. Innovative Research for a Sustainable Futurewww.epa.gov/research Developing tools for high resolution mass spectrometry-based screening via the EPA’s CompTox Dashboard Andrew D. McEachran1,2*, Kamel Mansouri3, Hussein Al-Ghoul4, Alex Chao4, Chris Grulke2, Jon R. Sobus5, and Antony J. Williams2 1Oak Ridge Institute of Science and Education (ORISE) Research Participant, Research Triangle Park, NC 2U.S. Environmental Protection Agency, Office of Research and Development, National Center for Computational Toxicology, Research Triangle Park, NC 3Integrated Laboratory Systems, Inc., Morrisville, NC | 4Oak Ridge Associated Universities, Research Triangle Park, NC 5U.S. Environmental Protection Agency, Office of Research and Development, National Exposure Research Laboratory, Research Triangle Park, NC * mceachran.andrew@epa.gov l ORCiD: 0000-0003-1423-330X Problem Definition and Goals The CompTox Dashboard MS-Ready Structures for Database Searching MS/MS Data in NTA/SSA References Acknowledgements 1. Williams, AJ, et al. 2017. The CompTox Chemistry Dashboard: a community data resource for environmental chemistry. J Cheminf. https://doi.org/10.1186/s13321-017-0247-6 2. McEachran, AD, et al. 2017. Identifying known unknowns using the US EPA's CompTox Chemistry Dashboard. Anal. Bioanal. Chem. 409(7): 1729-1735. doi:10.1007/s00216-016-0139-z 3. McEachran, AD, et al. 2017. A comparison of three chromatographic retention time prediction models. Talanta. https://doi.org/10.1016/j.talanta.2018.01.022 4. McEachran, AD, et al. 2018. “MS-Ready” structures for non-targeted high-resolution mass spectrometry screening studies. J Cheminf. Accepted for publication. 5. Allen, et al. 2015. Competitive fragmentation modeling of ESI-MS/MS spectra for putative metabolite identification. Metabolomics. https://doi.org/10.1007/s11306-014-0676-4 6. Newton, SR, et al. 2018. Suspect screening and non-targeted analysis of drinking water using point-of-use filters. Environ Poll. https://doi.org/10.1016/j.envpol.2017.11.033 Applications The authors would like to thank Emma Schymanski and Christoph Ruttkies for their valuable contributions on the MS-Ready Structures, associated publication, and integration to MetFrag. The authors would also like to thank Reza Aalizadeh and Nikolaos Thomaidis for use of the UOA-RTI software for retention time index prediction (manuscript submitted for publication). The authors would like to acknowledge the Dashboard development team for their efforts in surfacing all functionality. This work was supported in part by an appointment to the ORISE Research Participation Program at the Office of Research and Development, U.S. EPA, through an interagency agreement between the US EPA and DOE. This presentation does not necessarily represent the views or policies of the U.S. Environmental Protection Agency. Chemical metadata for ranking Problem: Non-targeted and suspect screening studies using high resolution mass spectrometry (HRMS) have revolutionized the detection of chemicals in complex matrices. However, data processing remains challenging due to the vast number of chemicals detected in samples, software and computational requirements of data processing, and inherent uncertainty in confidently identifying chemicals from candidate lists. Goals: Develop tools, data, and visualization approaches within an open chemistry resource to provide a freely available software tool to support structure identification and non-targeted analysis. Home page. The CompTox Dashboard (https://comptox.epa.gov) is a comprehensive chemistry resource containing chemistry data on more than 760,000 chemical substances. (Left and Below) To facilitate database searching, structures in DSSTox were processed into their “MS-Ready” form [4]. This processing workflow removes salts and stereochemistry and separates components of mixtures while retaining linkages to the original structure and substance. This enables the form of a structure observed via HRMS to be related to all variants of a structure in the database. (Left) The results of an MS-Ready query include all those substances where one component of a substance matches the input query terms [4]. In this example, C10H14N2 returns single component chemicals, mixtures, and salts. Returning all linked DTXSIDs enables access to varied metadata. Percentage of chemicals identified using data source ranking combining multiple metadata sources. Total n=1748 unique chemicals from the ENTACT trial and CASMI 2016 contest (training and test sets). DS=DSSTox Data Sources, PC= PubChem Data Sources, PM=PubMed Reference Counts, STOFF= Presence in STOFF-IDENT, KEMI_half= Swedish Chemicals Agency (KEMI) Market List, weighted by 0.5. # Identified % of Total #1 Hits 89 43% Top 5 154 74% Top 10 174 84% Top 20 190 91% # Identified % of Total #1 Hits 154 74% Top 5 195 94% Top 10 198 95% Top 20 202 97% CFM-ID only CFM-ID +DSSTox Data Sources Using CFM-ID command line tools [5], MS/MS spectra were predicted for >700,000 structures in ESI+, ESI-, and EI mode (manuscript in prep). The ESI+/- data were used to analyze the CASMI 2016 datasets. Data below are structure identification ranks for the challenge set (n=208). Retention Time Prediction logP ChromGenius OPERA-RT Combined Test and Training Set (n=97) R2 0.66 0.83 0.86 RMSE (min) 5.50 3.93 3.60 Absolute Mean Error (min) 4.65 3.03 2.93 Suspect screening of drinking water using point of use filters [6]. • Dashboard tools used for identification (data source ranking and batch searching) • Dashboard tools used for prioritization of identified compounds (exposure, bioactivity) EPA’s Non-targeted Analysis Collaborative Trial (ENTACT): • MS-Ready structures underpinning analyses and provided to participants, improving accessibility to data • In-house analysis utilizing data source ranking, metadata in Dashboard 0 2 0 0 4 0 0 6 0 0 8 0 0 1 0 0 0 0 2 0 0 4 0 0 6 0 0 8 0 0 1 0 0 0 A C N M e th o d E xp e rim e n ta l R T I PredictedRTI (Top) Results from [2] indicated that an in-house QSRR model termed OPERA-RT performed on par with the commercial software ACD/ChromGenius. (Bottom) Evaluation of retention time index (RTI) modeling developed by Aalizadeh et al (submitted manuscript). Experimental vs. predicted RTI value from ENTACT mixtures. Batch Search Page. The Batch Search page enables users to query the underlying database using data collected from HRMS studies as molecular formulae or monoisotopic masses (i.e. 10s to 100s at a time). Metadata, structural information, presence in chemical lists, and many more pieces of data can be included in the download. Single Chemical Page. A single chemical page includes chemical structures, experimental and predicted physicochemical and toxicity data, exposure data, and much more.