SlideShare a Scribd company logo
1 of 41
Using Cheminformatics Approaches to
Develop a Structure Searchable
Database of Analytical Methods
Antony Williams, Greg Janesch, Sakuntala
Sivasupramaniam, Brian Meyer and Erik Carr
Center for Computational Toxicology & Exposure, U.S. Environmental Protection Agency
http://www.orcid.org/0000-0002-2668-4821
May 2023: American Oil Chemists Society meeting, Denver, CO.
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
Disclaimer
• The views expressed in this presentation are those of the
author and do not necessarily reflect the views or policies of
the U.S. EPA
• This presentation is on a proof-of-concept tool in development
– NOT yet publicly available
1
Building a Methods Database
• Simple Vision: I want to find the best method(s) associated with a
chemical and/or class of chemicals
• Answer the question “I cannot find a method for my chemical” - HELP
• The Approach:
– Aggregate MS method documents (and adjust the definition of “what is a useful method”)
– Extract chemistry (mostly CASRN and Names)
– Map CASRN and Names to structures
– Deliver a proof-of-concept application to search a database by names, CASRNs, InChIKeys
and ultimately structure
2
Most people start with Google
• People have many places to search for methods, and
there is no one integration hub, except for search engines
• Search engines can return so many hits – then you filter
by analytes, matrix, analytical methodology, so many
synonyms and abbreviations for so many chemicals
3
Synonyms, Abbreviations and Chemicals
4
• CAS Numbers, Names
and Abbreviations can
limit what’s possible…
Might this be a better view?
5
Might this be a better view?
6
When methods are mapped to chemistry…
• The advantages of mapping chemicals directly to methods
– When chemicals are mapped it opens access to many other tools
– Chemical structures allow for QSAR modeling
7
…and what if we could then profile toxicity?
8
…and what if we could then profile toxicity?
9
…or simply harvest data from the
CompTox Chemicals Dashboard
10
What data would you like???
11
Introducing AMOS
Analytical Methods and Spectra Database
• Three types of data in the database:
– Methods (regulatory, lab manuals and SOPs, publications, tech notes)
– Spectra (from public domain and our own laboratories)
– Monographs (harvested from SWGDRUG and other sites)
• Some methods have associated spectra
• Some data are just externally linked
• Currently contains around 135,000 spectra, 600,000
external links, 650 monographs, and ~2000 methods
• ALL data are growing in number
12
Where are there methods?
• Agency-based methods
– EPA
– USGS
– USDA
– CDC
– FDA
• Vendor application notes – Thermo, Waters, Agilent, Sciex,
Shimadzu, LECO, ….
• Peer-reviewed articles
• Laboratory Documents – lab manuals, SOPs
13
A view of the methods list
14
Filtering the list for interests…
• Look for pesticides studied in water, by GC/MS, after 1990
15
Where are there methods?
• 900 method documents from the EPA harvested
16
Many Scanned Documents!!!
17
• Methods generally developed by
the agrochemical companies
• Include parents plus degradation
products
• Lots of scanned, old, documents
but the historical records are still
of significant use
• Electronic document forms of old
documents still of benefit
Embedding the old Method PDFs
18
Embedding New Method PDFs
19
When Methods are OPEN Access
20
When Methods are PubMed OPEN Access
21
Proprietary Methods for INTERNAL Access
22
If there is no method for your chemical
• Use “Chemical Similarity Searching” so that you can find
chemicals that are similar in structure space
• Use the “Tanimoto Similarity Search
23
Searching for a chemical – CASRN, Name
Direct structure searching coming
24
When Methods are Not Enough
• EPA is highly active in the field of non-targeted analysis
• We have been applying lots of cheminformatics approaches
25
Building a spectrum library to search against
26
Linking to actual spectra
27
Linking to actual spectra
28
There are errors EVERYWHERE: 110-75-8
It can be challenging
9/365 chemicals…
There are MANY errors in public
spectral databases
31
There are MANY errors in public
spectral databases
32
There are MANY errors in public
spectral databases
33
There are MANY errors in public
spectral databases
34
• Chemical structure representations would ideally
be standardized…consider tautomeric forms
• Not all substances are explicit and can be
ambiguous representations
My first time at AOCS…what did I learn?
• Some of your chemicals of interest: TAGs, PAHs, Toxins
of different types
• Methods for PAHs, Aflatoxins and Microsystins, and tri-
acyl glycerols all extracted and added to database
35
My first time at AOCS…what did I learn?
• Index Mark Collison’s methods 
36
My first time at AOCS…what did I learn?
37
General Comments
• It is possible we can extract chemicals from AOCS methods
and structure enable the method, WITHOUT making the
method itself publicly accessible – like ASTM and ISO
• Our focus right now is on mass spectrometry methods but
can support many other methods
38
Conclusions
• We have built (I think) the first chemical structure indexed
database of “methods”
• Methods are not just “approved methods” but also
standard operating procedures, application notes, lab
manuals, regulatory methods etc.
• Integrating methods to experimental spectral data will
serve our non-targeted analysis efforts
39
If you want to help…
• Send information regarding analytical methods and method
articles to williams.antony@epa.gov
• If anyone is interested in a live demo I can do one in the break
40

More Related Content

What's hot

Evaluation of medlars
Evaluation of medlarsEvaluation of medlars
Evaluation of medlarssilambu111
 
Tutarlı bir katalog için otorite kontrol
 Tutarlı bir katalog için otorite kontrol Tutarlı bir katalog için otorite kontrol
Tutarlı bir katalog için otorite kontrolEmine Gür
 
Literature search techniques
Literature search techniquesLiterature search techniques
Literature search techniquesAhmed Elfaitury
 
Subject Indexing & Techniques
Subject Indexing  & TechniquesSubject Indexing  & Techniques
Subject Indexing & TechniquesDr. Utpal Das
 
search-strategy.ppt
search-strategy.pptsearch-strategy.ppt
search-strategy.pptAidaMohamed9
 
Collection Development Policy in College Library
Collection Development Policy in College LibraryCollection Development Policy in College Library
Collection Development Policy in College LibraryPallavi Belkar
 
Embase for pharmacovigilance: Search and validation March 22 2017
Embase for pharmacovigilance: Search and validation March 22 2017Embase for pharmacovigilance: Search and validation March 22 2017
Embase for pharmacovigilance: Search and validation March 22 2017Ann-Marie Roche
 
Let's Get Serious: Promoting Library Online Services
Let's Get Serious: Promoting Library Online ServicesLet's Get Serious: Promoting Library Online Services
Let's Get Serious: Promoting Library Online ServicesLesley Williams
 
Subject analysis
Subject analysisSubject analysis
Subject analysisJohan Koren
 
BOĞAZİÇİ ÜNİVERSİTESİ KÜTÜPHANESİ RDA UYGULAMASINDAN ÖRNEKLER
BOĞAZİÇİ ÜNİVERSİTESİ KÜTÜPHANESİ RDA UYGULAMASINDAN ÖRNEKLERBOĞAZİÇİ ÜNİVERSİTESİ KÜTÜPHANESİ RDA UYGULAMASINDAN ÖRNEKLER
BOĞAZİÇİ ÜNİVERSİTESİ KÜTÜPHANESİ RDA UYGULAMASINDAN ÖRNEKLEREmine Gür
 
Cloud computing and library services
Cloud computing and library servicesCloud computing and library services
Cloud computing and library servicesErik Mitchell
 
Pharmacovigilance - a regulator's perspective
Pharmacovigilance - a regulator's perspectivePharmacovigilance - a regulator's perspective
Pharmacovigilance - a regulator's perspectiveTGA Australia
 
Linked data for Libraries, Archives, Museums
Linked data for Libraries, Archives, MuseumsLinked data for Libraries, Archives, Museums
Linked data for Libraries, Archives, Museumsljsmart
 
ICEGOV - Tutorial 1 - Information Policy Concepts and Principles
ICEGOV - Tutorial 1 - Information Policy Concepts and PrinciplesICEGOV - Tutorial 1 - Information Policy Concepts and Principles
ICEGOV - Tutorial 1 - Information Policy Concepts and PrinciplesICEGOV
 

What's hot (20)

Electronic resource management system (ERM)
Electronic resource management system (ERM)Electronic resource management system (ERM)
Electronic resource management system (ERM)
 
Medlars
MedlarsMedlars
Medlars
 
Evaluation of medlars
Evaluation of medlarsEvaluation of medlars
Evaluation of medlars
 
Tutarlı bir katalog için otorite kontrol
 Tutarlı bir katalog için otorite kontrol Tutarlı bir katalog için otorite kontrol
Tutarlı bir katalog için otorite kontrol
 
Literature search techniques
Literature search techniquesLiterature search techniques
Literature search techniques
 
Subject Indexing & Techniques
Subject Indexing  & TechniquesSubject Indexing  & Techniques
Subject Indexing & Techniques
 
Cataloguing
CataloguingCataloguing
Cataloguing
 
Publishing: Everything you wanted to know but were afraid to ask
Publishing: Everything you wanted to know but were afraid to askPublishing: Everything you wanted to know but were afraid to ask
Publishing: Everything you wanted to know but were afraid to ask
 
search-strategy.ppt
search-strategy.pptsearch-strategy.ppt
search-strategy.ppt
 
Collection Development Policy in College Library
Collection Development Policy in College LibraryCollection Development Policy in College Library
Collection Development Policy in College Library
 
Embase for pharmacovigilance: Search and validation March 22 2017
Embase for pharmacovigilance: Search and validation March 22 2017Embase for pharmacovigilance: Search and validation March 22 2017
Embase for pharmacovigilance: Search and validation March 22 2017
 
Let's Get Serious: Promoting Library Online Services
Let's Get Serious: Promoting Library Online ServicesLet's Get Serious: Promoting Library Online Services
Let's Get Serious: Promoting Library Online Services
 
Subject analysis
Subject analysisSubject analysis
Subject analysis
 
LISTA Database Analysis
LISTA Database AnalysisLISTA Database Analysis
LISTA Database Analysis
 
BOĞAZİÇİ ÜNİVERSİTESİ KÜTÜPHANESİ RDA UYGULAMASINDAN ÖRNEKLER
BOĞAZİÇİ ÜNİVERSİTESİ KÜTÜPHANESİ RDA UYGULAMASINDAN ÖRNEKLERBOĞAZİÇİ ÜNİVERSİTESİ KÜTÜPHANESİ RDA UYGULAMASINDAN ÖRNEKLER
BOĞAZİÇİ ÜNİVERSİTESİ KÜTÜPHANESİ RDA UYGULAMASINDAN ÖRNEKLER
 
Predatory journals
Predatory journalsPredatory journals
Predatory journals
 
Cloud computing and library services
Cloud computing and library servicesCloud computing and library services
Cloud computing and library services
 
Pharmacovigilance - a regulator's perspective
Pharmacovigilance - a regulator's perspectivePharmacovigilance - a regulator's perspective
Pharmacovigilance - a regulator's perspective
 
Linked data for Libraries, Archives, Museums
Linked data for Libraries, Archives, MuseumsLinked data for Libraries, Archives, Museums
Linked data for Libraries, Archives, Museums
 
ICEGOV - Tutorial 1 - Information Policy Concepts and Principles
ICEGOV - Tutorial 1 - Information Policy Concepts and PrinciplesICEGOV - Tutorial 1 - Information Policy Concepts and Principles
ICEGOV - Tutorial 1 - Information Policy Concepts and Principles
 

Similar to Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods

Similar to Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods (20)

Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
 
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
 
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
 
AMOS: the EPA database of analytical methods and open mass spectral database ...
AMOS: the EPA database of analytical methods and open mass spectral database ...AMOS: the EPA database of analytical methods and open mass spectral database ...
AMOS: the EPA database of analytical methods and open mass spectral database ...
 
Cheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting ExposomicsCheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting Exposomics
 
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data DashboardsAccessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
 
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 

Recently uploaded

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 

Recently uploaded (20)

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 

Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods

  • 1. Using Cheminformatics Approaches to Develop a Structure Searchable Database of Analytical Methods Antony Williams, Greg Janesch, Sakuntala Sivasupramaniam, Brian Meyer and Erik Carr Center for Computational Toxicology & Exposure, U.S. Environmental Protection Agency http://www.orcid.org/0000-0002-2668-4821 May 2023: American Oil Chemists Society meeting, Denver, CO. The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
  • 2. Disclaimer • The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA • This presentation is on a proof-of-concept tool in development – NOT yet publicly available 1
  • 3. Building a Methods Database • Simple Vision: I want to find the best method(s) associated with a chemical and/or class of chemicals • Answer the question “I cannot find a method for my chemical” - HELP • The Approach: – Aggregate MS method documents (and adjust the definition of “what is a useful method”) – Extract chemistry (mostly CASRN and Names) – Map CASRN and Names to structures – Deliver a proof-of-concept application to search a database by names, CASRNs, InChIKeys and ultimately structure 2
  • 4. Most people start with Google • People have many places to search for methods, and there is no one integration hub, except for search engines • Search engines can return so many hits – then you filter by analytes, matrix, analytical methodology, so many synonyms and abbreviations for so many chemicals 3
  • 5. Synonyms, Abbreviations and Chemicals 4 • CAS Numbers, Names and Abbreviations can limit what’s possible…
  • 6. Might this be a better view? 5
  • 7. Might this be a better view? 6
  • 8. When methods are mapped to chemistry… • The advantages of mapping chemicals directly to methods – When chemicals are mapped it opens access to many other tools – Chemical structures allow for QSAR modeling 7
  • 9. …and what if we could then profile toxicity? 8
  • 10. …and what if we could then profile toxicity? 9
  • 11. …or simply harvest data from the CompTox Chemicals Dashboard 10
  • 12. What data would you like??? 11
  • 13. Introducing AMOS Analytical Methods and Spectra Database • Three types of data in the database: – Methods (regulatory, lab manuals and SOPs, publications, tech notes) – Spectra (from public domain and our own laboratories) – Monographs (harvested from SWGDRUG and other sites) • Some methods have associated spectra • Some data are just externally linked • Currently contains around 135,000 spectra, 600,000 external links, 650 monographs, and ~2000 methods • ALL data are growing in number 12
  • 14. Where are there methods? • Agency-based methods – EPA – USGS – USDA – CDC – FDA • Vendor application notes – Thermo, Waters, Agilent, Sciex, Shimadzu, LECO, …. • Peer-reviewed articles • Laboratory Documents – lab manuals, SOPs 13
  • 15. A view of the methods list 14
  • 16. Filtering the list for interests… • Look for pesticides studied in water, by GC/MS, after 1990 15
  • 17. Where are there methods? • 900 method documents from the EPA harvested 16
  • 18. Many Scanned Documents!!! 17 • Methods generally developed by the agrochemical companies • Include parents plus degradation products • Lots of scanned, old, documents but the historical records are still of significant use • Electronic document forms of old documents still of benefit
  • 19. Embedding the old Method PDFs 18
  • 21. When Methods are OPEN Access 20
  • 22. When Methods are PubMed OPEN Access 21
  • 23. Proprietary Methods for INTERNAL Access 22
  • 24. If there is no method for your chemical • Use “Chemical Similarity Searching” so that you can find chemicals that are similar in structure space • Use the “Tanimoto Similarity Search 23
  • 25. Searching for a chemical – CASRN, Name Direct structure searching coming 24
  • 26. When Methods are Not Enough • EPA is highly active in the field of non-targeted analysis • We have been applying lots of cheminformatics approaches 25
  • 27. Building a spectrum library to search against 26
  • 28. Linking to actual spectra 27
  • 29. Linking to actual spectra 28
  • 30. There are errors EVERYWHERE: 110-75-8
  • 31. It can be challenging 9/365 chemicals…
  • 32. There are MANY errors in public spectral databases 31
  • 33. There are MANY errors in public spectral databases 32
  • 34. There are MANY errors in public spectral databases 33
  • 35. There are MANY errors in public spectral databases 34 • Chemical structure representations would ideally be standardized…consider tautomeric forms • Not all substances are explicit and can be ambiguous representations
  • 36. My first time at AOCS…what did I learn? • Some of your chemicals of interest: TAGs, PAHs, Toxins of different types • Methods for PAHs, Aflatoxins and Microsystins, and tri- acyl glycerols all extracted and added to database 35
  • 37. My first time at AOCS…what did I learn? • Index Mark Collison’s methods  36
  • 38. My first time at AOCS…what did I learn? 37
  • 39. General Comments • It is possible we can extract chemicals from AOCS methods and structure enable the method, WITHOUT making the method itself publicly accessible – like ASTM and ISO • Our focus right now is on mass spectrometry methods but can support many other methods 38
  • 40. Conclusions • We have built (I think) the first chemical structure indexed database of “methods” • Methods are not just “approved methods” but also standard operating procedures, application notes, lab manuals, regulatory methods etc. • Integrating methods to experimental spectral data will serve our non-targeted analysis efforts 39
  • 41. If you want to help… • Send information regarding analytical methods and method articles to williams.antony@epa.gov • If anyone is interested in a live demo I can do one in the break 40