SlideShare a Scribd company logo
AMOS: the EPA database of analytical methods
and open mass spectral database supporting
non-targeted analysis
Gregory Janesch1, Erik Carr1, Vicente Samano2, James McCord3,
Jacqueline Bangma3, Jon Sobus4 and Antony Williams4
1. ORAU Student Services Contractor 2. Senior Environmental Employment Program
3. Center for Environmental Measurement and Modeling and 4. Center for Computational Toxicology & Exposure,
ALL at the U.S. Environmental Protection Agency
October 2023: FDA Cheminformatics Workshop
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
Background
• A huge number of openly available sources exist for spectra,
documentation of analytical procedures, etc.
• Search engines can easily find high-traffic sources, but
maybe not niche-but-high-quality ones
– Most are not “structurally-enabled”
• Useful to have complementary types of data alongside each
other, especially with consistent substance identifiers
• Non-targeted analysis can benefit from a broad, high-quality
experimental database as a reference
2
About AMOS - General
• AMOS is a cheminformatics application integrating spectra
and analytical methods with consistent substance identifiers
• Provides mappings between substances and records (method
documents, experimental spectra, etc.)
• Under development for ~18 months as a “proof-of-concept”;
not yet available publicly
3
About AMOS - Data
• Three categories of records:
– Spectra (~210,000)
– Methods (~4100)
– Fact Sheets (>3000)
• Most data are open access, some are just external links
– All data links back to the original source, if possible
• Data is being continually updated (new datasets & updates)
• Many chemicals of interest to EPA – PFAS, pesticides, etc.
4
About AMOS - Curation
• Identifiers vary between different sources so we must curate
5
• A single chemical can have dozens
of names
– FTOH 10:1
– 10:1 FTOH
– 10:1 Fluorotelomer alcohol
– 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,11
-Henicosafluoroundecan-1-ol
– 1-Undecanol, 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,
10,10,11,11,11-heneicosafluoro-
About AMOS – Curation Issue Example
6
About AMOS – Curating Methods
• Often have a table of
substances
– Can be extracted with scripts
• Sample matrix, limits of
detection, etc. still need to be
manually collected
• Some are old, scanned
documents that require fully
manual work
7
Spectra
• About 210,000 experimental spectra covering about 21,500
substances (not including externally-linked ones)
• Most are from external sources
– About 90% between MassBank EU, MoNA, & HMDB
• EPA labs now providing spectra (especially PFAS)
• Includes metadata like instrument settings (when possible)
8
Methods
• Almost 4100 in AMOS so far from an assortment of
vendors, publications, and government agencies
– Agencies including US-EPA, DEA, CDC, FDA, OSHA, USGS, USDA
– Vendors including Agilent, Shimadzu, LECO, Sciex
• Searchable on analytes, matrix, analytical methodology,
source
• Methods can be linked to sets of spectra
9
• Search by DTXSID,
InChIKey, CAS number,
name
• Can filter on record types or
other information
• InChIKeys and some names
will prompt disambiguation
10
General Search
General Search – Disambiguation
11
InChIKey example search:
General Search – Spectra
12
General Search – Spectra
13
General Search – Methods & Fact Sheets
14
General Search – Methods & Fact Sheets
15
Method with Spectra
16
Batch Search
17
• Search a set of DTXSIDs,
download info on spectra and
methods and links to original data
Methods List
18
Methods List – Filtering
19
Similar Method Search
20
Similar Method Search
21
Spectrum Search
22
Connections to Other Applications
• Other apps often deal with
focused subsets of
chemicals; AMOS’s data
can augment that
• API endpoints have been
built for an NTA application
– Originally just in silico spectra
23
Future Work
• Add more data assembled from EPA labs (standards)
• Improvements to spectral searching – in testing
– Structure, substructure, and similarity searching
• Expand spectral and chromatographic metadata
• Additional integration with other EPA applications
– Mostly just simple links to AMOS pages at the moment
• Hoping to release to the public in 2024
24
Summary
• AMOS combines multiple kinds of analytical chemistry
data
– Primarily mass spectrometry data
– Growing steadily for the foreseeable future
• Data can be queried via a cheminformatically-oriented
application
• Intended to be useful as both an independent application
and a way of augmenting other EPA applications 25
Acknowledgements
• Greg Janesch – Database and App Development
• Sakuntala Sivasupramaniam – curation
• Tyler Carr – curation, visualizations
• Joshua Powell, Asif Rashid, Freddie Valone – assorted
technical support
If you want to help, send information regarding analytical
methods and method articles to williams.antony@epa.gov
26

More Related Content

Similar to AMOS: the EPA database of analytical methods and open mass spectral database supporting non-targeted analysis

Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Delivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applicationsDelivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applications
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Cheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting ExposomicsCheminformatics Support for MS Supporting Exposomics
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Drug Discovery and Development Using AI
Drug Discovery and Development Using AIDrug Discovery and Development Using AI
Drug Discovery and Development Using AI
Databricks
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Kamel Mansouri
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
PFAS Chemistry: Range, Complexity, Groupings, and the CompTox Chemicals Dash...
PFAS Chemistry: Range, Complexity, Groupings, and the CompTox  Chemicals Dash...PFAS Chemistry: Range, Complexity, Groupings, and the CompTox  Chemicals Dash...
PFAS Chemistry: Range, Complexity, Groupings, and the CompTox Chemicals Dash...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
4th Annual Advancing the Pace of Chemical Risk Assessment
4th Annual Advancing the Pace of Chemical Risk Assessment4th Annual Advancing the Pace of Chemical Risk Assessment
4th Annual Advancing the Pace of Chemical Risk Assessment
Michelle Angrish
 

Similar to AMOS: the EPA database of analytical methods and open mass spectral database supporting non-targeted analysis (20)

Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
 
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
 
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
 
Delivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applicationsDelivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applications
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
Cheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting ExposomicsCheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting Exposomics
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
 
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
 
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Drug Discovery and Development Using AI
Drug Discovery and Development Using AIDrug Discovery and Development Using AI
Drug Discovery and Development Using AI
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
 
PFAS Chemistry: Range, Complexity, Groupings, and the CompTox Chemicals Dash...
PFAS Chemistry: Range, Complexity, Groupings, and the CompTox  Chemicals Dash...PFAS Chemistry: Range, Complexity, Groupings, and the CompTox  Chemicals Dash...
PFAS Chemistry: Range, Complexity, Groupings, and the CompTox Chemicals Dash...
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
4th Annual Advancing the Pace of Chemical Risk Assessment
4th Annual Advancing the Pace of Chemical Risk Assessment4th Annual Advancing the Pace of Chemical Risk Assessment
4th Annual Advancing the Pace of Chemical Risk Assessment
 

Recently uploaded

Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 

Recently uploaded (20)

Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 

AMOS: the EPA database of analytical methods and open mass spectral database supporting non-targeted analysis

  • 1. AMOS: the EPA database of analytical methods and open mass spectral database supporting non-targeted analysis Gregory Janesch1, Erik Carr1, Vicente Samano2, James McCord3, Jacqueline Bangma3, Jon Sobus4 and Antony Williams4 1. ORAU Student Services Contractor 2. Senior Environmental Employment Program 3. Center for Environmental Measurement and Modeling and 4. Center for Computational Toxicology & Exposure, ALL at the U.S. Environmental Protection Agency October 2023: FDA Cheminformatics Workshop The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
  • 2. Background • A huge number of openly available sources exist for spectra, documentation of analytical procedures, etc. • Search engines can easily find high-traffic sources, but maybe not niche-but-high-quality ones – Most are not “structurally-enabled” • Useful to have complementary types of data alongside each other, especially with consistent substance identifiers • Non-targeted analysis can benefit from a broad, high-quality experimental database as a reference 2
  • 3. About AMOS - General • AMOS is a cheminformatics application integrating spectra and analytical methods with consistent substance identifiers • Provides mappings between substances and records (method documents, experimental spectra, etc.) • Under development for ~18 months as a “proof-of-concept”; not yet available publicly 3
  • 4. About AMOS - Data • Three categories of records: – Spectra (~210,000) – Methods (~4100) – Fact Sheets (>3000) • Most data are open access, some are just external links – All data links back to the original source, if possible • Data is being continually updated (new datasets & updates) • Many chemicals of interest to EPA – PFAS, pesticides, etc. 4
  • 5. About AMOS - Curation • Identifiers vary between different sources so we must curate 5 • A single chemical can have dozens of names – FTOH 10:1 – 10:1 FTOH – 10:1 Fluorotelomer alcohol – 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,11 -Henicosafluoroundecan-1-ol – 1-Undecanol, 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9, 10,10,11,11,11-heneicosafluoro-
  • 6. About AMOS – Curation Issue Example 6
  • 7. About AMOS – Curating Methods • Often have a table of substances – Can be extracted with scripts • Sample matrix, limits of detection, etc. still need to be manually collected • Some are old, scanned documents that require fully manual work 7
  • 8. Spectra • About 210,000 experimental spectra covering about 21,500 substances (not including externally-linked ones) • Most are from external sources – About 90% between MassBank EU, MoNA, & HMDB • EPA labs now providing spectra (especially PFAS) • Includes metadata like instrument settings (when possible) 8
  • 9. Methods • Almost 4100 in AMOS so far from an assortment of vendors, publications, and government agencies – Agencies including US-EPA, DEA, CDC, FDA, OSHA, USGS, USDA – Vendors including Agilent, Shimadzu, LECO, Sciex • Searchable on analytes, matrix, analytical methodology, source • Methods can be linked to sets of spectra 9
  • 10. • Search by DTXSID, InChIKey, CAS number, name • Can filter on record types or other information • InChIKeys and some names will prompt disambiguation 10 General Search
  • 11. General Search – Disambiguation 11 InChIKey example search:
  • 12. General Search – Spectra 12
  • 13. General Search – Spectra 13
  • 14. General Search – Methods & Fact Sheets 14
  • 15. General Search – Methods & Fact Sheets 15
  • 17. Batch Search 17 • Search a set of DTXSIDs, download info on spectra and methods and links to original data
  • 19. Methods List – Filtering 19
  • 23. Connections to Other Applications • Other apps often deal with focused subsets of chemicals; AMOS’s data can augment that • API endpoints have been built for an NTA application – Originally just in silico spectra 23
  • 24. Future Work • Add more data assembled from EPA labs (standards) • Improvements to spectral searching – in testing – Structure, substructure, and similarity searching • Expand spectral and chromatographic metadata • Additional integration with other EPA applications – Mostly just simple links to AMOS pages at the moment • Hoping to release to the public in 2024 24
  • 25. Summary • AMOS combines multiple kinds of analytical chemistry data – Primarily mass spectrometry data – Growing steadily for the foreseeable future • Data can be queried via a cheminformatically-oriented application • Intended to be useful as both an independent application and a way of augmenting other EPA applications 25
  • 26. Acknowledgements • Greg Janesch – Database and App Development • Sakuntala Sivasupramaniam – curation • Tyler Carr – curation, visualizations • Joshua Powell, Asif Rashid, Freddie Valone – assorted technical support If you want to help, send information regarding analytical methods and method articles to williams.antony@epa.gov 26