This presentation was given at the "Department of Defense's (DoD) Energy and Environment Innovation Symposium" in Arlington, Virginia on December 1st 2023 (https://serdp-estcp.org/events/details/04d444f1-aa19-4e66-bb5c-5163964cc4dd/symposium-2023)
Mass spectrometry analyses at the US-EPA, especially non-targeted analysis studies, are highly dependent on the cheminformatics efforts which have been underway within the agency for almost a decade. These research efforts have resulted in a rich data infrastructure based on the DSSTox database, data integration approaches based on a structure standardization approach to produce “MS-ready” structures, and a number of supporting data types to facilitate ranking of non-targeted analysis candidates. This presentation will provide an overview of all tools in development and the integrated nature of the applications based on the underlying chemistry data. This includes the development of the underlying chemistry database of >1.2 million chemical substances (DSSTox), approaches to structure standardization to facilitate structure-substance mapping, development of a spectral database of >150,000 spectra for >25,000 chemicals, a database of >3000 analytical methods, prediction models for LCMS amenability, and an application for the profiling of toxicity hazards for batches of chemical substances. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
This presentation was given at the conference: "Cheminformatics Resources of U.S. Governmental Organizations" and focuses on the CompTox Chemicals Dashboard and the sharing of our data https://www.fda.gov/news-events/fda-meetings-conferences-and-workshops/cheminformatics-resources-us-governmental-organizations-05092022#event-information
A presentation given at the 5th Metabolomics of North America webinar on September 8th 2023. Provides an overview of the cheminformatics support provided by the DSSTox database, CompTox Chemicals Dashboard and multiple other web-based applications in development
This presentation was made to the University of North Carolina in Chapel Hill on 9/20/21. The presentation was a general introduction to cheminformatics prior to how to navigate the Dashboard.
• An introduction to the dashboard
• Substances vs structures
• Structure formats for data exchange and connectivity (SMILES, InChIs, molfiles)
• Identifiers – CASRN, chemical names, systematic names
• Data curation approaches: substance-structure ambiguity
• ChemReg: substance registration
• Data gathering for systematic reviews
• Curated lists
• Properties/Fate and Transport
• Access to Exposure Data
• Hazard data in the dashboard – ToxVal data (sourced from >40 databases, >50,000 chemicals, >900,000 data points)
• The Executive Summary of data
• Single chemical searches vs Batch searches
In recent years, the growth of scientific data and the increasing need for data sharing and collaboration in the field of environmental chemistry has led to the creation of various software and databases that facilitate research and development into the safety and toxicity of chemicals. The US-EPA Center for Computational Toxicology and Exposure has been developing software and databases that serve the chemistry community for many years. This presentation will focus on several web-based software applications which have been developed at the USEPA and made available to the community. While the primary software application from the Center is the CompTox Chemicals Dashboard almost a dozen proof-of-concept applications have been built serving various capabilities. The publicly accessible Cheminformatics Modules (https://www.epa.gov/chemicalresearch/cheminformatics) provides access to six individual modules to allow for hazard comparison for sets of chemicals, structure-substructure-similarity searching, structure alerts and batch QSAR prediction of both physicochemical and toxicity endpoints. A number of other applications in development include a chemical transformations database (ChET) and a database of analytical methods and open mass spectral data (AMOS). Each of these depends on the underlying DSSTox chemicals database, a rich source of chemistry data for over 1.2 million chemical substances. I will provide an overview of all tools in development and the integrated nature of the applications based on the underlying chemistry data. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
As part of its mission the Center for Computational Toxicology and Exposure (CCTE) delivers access to chemicals related data via online Dashboards. The CompTox Chemicals Dashboard (available at https://comptox.epa.gov/dashboard) provides access to >900,000 chemicals and associated data including experimental and predicted property data, in vivo hazard data, in vitro bioactivity data, exposure data, and various other data types. The application provides a set of flexible searches allowing for search, visualization and downloads of the data to the desktop for further interrogation. This presentation will provide an overview of the Dashboard and other proof-of-concept applications that are now publicly available (https://www.epa.gov/chemical-research/cheminformatics). For example, the Hazard Comparison module (shown in the figure below) allows profiling of chemicals based on toxicity types (https://doi.org/10.1007/s10098-019-01795-w). This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
In recent years, the growth of scientific data and the increasing need for data sharing and collaboration in the field of environmental chemistry has led to the creation of various software and databases that facilitate research and development into the safety and toxicity of chemicals. The US-EPA Center for Computational Toxicology and Exposure has been developing software and databases that have served the chemistry community for many years. Several web-based software applications have been developed at the US-EPA and made available to the community to provide access to information regarding mycotoxins. This includes related structures, experimental and predicted properties, hazard data and mass spectrometry analytical data and methods. While the primary software application from the Center is the CompTox Chemicals Dashboard almost a dozen proof-of-concept applications have been built serving various capabilities. The publicly accessible Cheminformatics Modules (https://www.epa.gov/chemical-research/cheminformatics) provides access to modules to allow for hazard comparison for sets of chemicals, structure-substructure-similarity searching and batch QSAR prediction of both physicochemical and toxicity endpoints. This presentation will provide an overview of all tools in development that provide access to mycotoxin related data and the integrated nature of the applications based on the underlying chemistry data set. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
This presentation was given at the XPAND2022 Annual Meeting for the Household & Commercial Products Association and focused non-targeted analysis mass spectrometry and applications of the CompTox Chemicals Dashboard.
Mass spectrometry analyses at the US-EPA, especially non-targeted analysis studies, are highly dependent on the cheminformatics efforts which have been underway within the agency for almost a decade. These research efforts have resulted in a rich data infrastructure based on the DSSTox database, data integration approaches based on a structure standardization approach to produce “MS-ready” structures, and a number of supporting data types to facilitate ranking of non-targeted analysis candidates. This presentation will provide an overview of all tools in development and the integrated nature of the applications based on the underlying chemistry data. This includes the development of the underlying chemistry database of >1.2 million chemical substances (DSSTox), approaches to structure standardization to facilitate structure-substance mapping, development of a spectral database of >150,000 spectra for >25,000 chemicals, a database of >3000 analytical methods, prediction models for LCMS amenability, and an application for the profiling of toxicity hazards for batches of chemical substances. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
This presentation was given at the conference: "Cheminformatics Resources of U.S. Governmental Organizations" and focuses on the CompTox Chemicals Dashboard and the sharing of our data https://www.fda.gov/news-events/fda-meetings-conferences-and-workshops/cheminformatics-resources-us-governmental-organizations-05092022#event-information
A presentation given at the 5th Metabolomics of North America webinar on September 8th 2023. Provides an overview of the cheminformatics support provided by the DSSTox database, CompTox Chemicals Dashboard and multiple other web-based applications in development
This presentation was made to the University of North Carolina in Chapel Hill on 9/20/21. The presentation was a general introduction to cheminformatics prior to how to navigate the Dashboard.
• An introduction to the dashboard
• Substances vs structures
• Structure formats for data exchange and connectivity (SMILES, InChIs, molfiles)
• Identifiers – CASRN, chemical names, systematic names
• Data curation approaches: substance-structure ambiguity
• ChemReg: substance registration
• Data gathering for systematic reviews
• Curated lists
• Properties/Fate and Transport
• Access to Exposure Data
• Hazard data in the dashboard – ToxVal data (sourced from >40 databases, >50,000 chemicals, >900,000 data points)
• The Executive Summary of data
• Single chemical searches vs Batch searches
In recent years, the growth of scientific data and the increasing need for data sharing and collaboration in the field of environmental chemistry has led to the creation of various software and databases that facilitate research and development into the safety and toxicity of chemicals. The US-EPA Center for Computational Toxicology and Exposure has been developing software and databases that serve the chemistry community for many years. This presentation will focus on several web-based software applications which have been developed at the USEPA and made available to the community. While the primary software application from the Center is the CompTox Chemicals Dashboard almost a dozen proof-of-concept applications have been built serving various capabilities. The publicly accessible Cheminformatics Modules (https://www.epa.gov/chemicalresearch/cheminformatics) provides access to six individual modules to allow for hazard comparison for sets of chemicals, structure-substructure-similarity searching, structure alerts and batch QSAR prediction of both physicochemical and toxicity endpoints. A number of other applications in development include a chemical transformations database (ChET) and a database of analytical methods and open mass spectral data (AMOS). Each of these depends on the underlying DSSTox chemicals database, a rich source of chemistry data for over 1.2 million chemical substances. I will provide an overview of all tools in development and the integrated nature of the applications based on the underlying chemistry data. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
As part of its mission the Center for Computational Toxicology and Exposure (CCTE) delivers access to chemicals related data via online Dashboards. The CompTox Chemicals Dashboard (available at https://comptox.epa.gov/dashboard) provides access to >900,000 chemicals and associated data including experimental and predicted property data, in vivo hazard data, in vitro bioactivity data, exposure data, and various other data types. The application provides a set of flexible searches allowing for search, visualization and downloads of the data to the desktop for further interrogation. This presentation will provide an overview of the Dashboard and other proof-of-concept applications that are now publicly available (https://www.epa.gov/chemical-research/cheminformatics). For example, the Hazard Comparison module (shown in the figure below) allows profiling of chemicals based on toxicity types (https://doi.org/10.1007/s10098-019-01795-w). This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
In recent years, the growth of scientific data and the increasing need for data sharing and collaboration in the field of environmental chemistry has led to the creation of various software and databases that facilitate research and development into the safety and toxicity of chemicals. The US-EPA Center for Computational Toxicology and Exposure has been developing software and databases that have served the chemistry community for many years. Several web-based software applications have been developed at the US-EPA and made available to the community to provide access to information regarding mycotoxins. This includes related structures, experimental and predicted properties, hazard data and mass spectrometry analytical data and methods. While the primary software application from the Center is the CompTox Chemicals Dashboard almost a dozen proof-of-concept applications have been built serving various capabilities. The publicly accessible Cheminformatics Modules (https://www.epa.gov/chemical-research/cheminformatics) provides access to modules to allow for hazard comparison for sets of chemicals, structure-substructure-similarity searching and batch QSAR prediction of both physicochemical and toxicity endpoints. This presentation will provide an overview of all tools in development that provide access to mycotoxin related data and the integrated nature of the applications based on the underlying chemistry data set. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
This presentation was given at the XPAND2022 Annual Meeting for the Household & Commercial Products Association and focused non-targeted analysis mass spectrometry and applications of the CompTox Chemicals Dashboard.
Presentation for Texas A&M Superfund Research Center virtual learning series, Big Data in Environmental Science and Toxicology. More details at https://superfund.tamu.edu/big-data-session-1-july-14-2021/
The Center for Computational Toxicology and Exposure (CCTE) is part of the Office of Research and Development at the US Environmental Protection Agency. As part of its mission the center delivers access to chemicals related data via web-based freely accessible online Dashboards to disseminate data generated within the center as well as harvested and integrated from open databases around the world. The CompTox Chemicals Dashboard (available at https://comptox.epa.gov/dashboard) provides access to >1.2 million chemicals and associated data including experimental and predicted property data, in vivo hazard data, in vitro bioactivity data, exposure data, and various other data types. The curation of the chemicals dataset has included the development of over 400 segregated lists of chemicals that represent specific research areas of interest including disinfectant by-products, per- and polyfluoroalkyl substances (PFAS), extractables and leachables, and chemicals of emerging concern. The chemicals collection, the associated data, the lists and searches for mass and formulae makes the Dashboard an ideal foundation technology to support our colleagues working in the field of mass spectrometry, especially in targeted and non-targeted analysis. This presentation will provide an overview of the Dashboard, its value to the community in terms of providing access to the integrated and highly curated data, and its utility to support researchers in the field of mass spectrometry. New proof-of-concept projects will also be introduced including the development of a cheminformatics enabled methods database. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
High resolution mass spectrometry (HRMS) and non-targeted analysis (NTA) are utilized to identify emerging contaminants and chemical signatures of interest detected in various media. At the US Environmental Protection Agency the CompTox Chemicals Dashboard (https://comptox.epa.gov/dashboard) is an open chemistry resource and web-based application containing data for ~900,000 substances and supports non-targeted and suspect screening analyses. Searching functionality includes identifier searches (e.g. systematic names, trade names and CAS Registry Numbers), mass and formula-based searches and prototype developments include combined substructure-mass/formula searches and searching experimental mass spectral data against predicted fragmentation spectra. A specific type of data mapping in the database uses “MS-Ready” structures, a way to process all registered substances to separate multi-component chemicals into their individual components, removal of stereochemical bonds and desalting and neutralization. This MS-Ready processing supports batch-searching using either mass or formulae to identify candidate chemicals and their mapped substances. A number of chemical lists (https://comptox.epa.gov/dashboard/chemical_lists) have also been developed to support the identification of chemicals related to agrochemistry, specifically pesticides (both active and inert constituents), insecticides and their metabolites and environmental breakdown products). This presentation will provide an overview of how the CompTox Chemicals Dashboard supports mass spectrometry based structure identification and non-targeted analysis of chemicals in agrochemistry. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
The U.S. Environmental Protection Agency (EPA) Computational Toxicology Program utilizes computational and data-driven approaches that integrate chemistry, exposure and biological data to help characterize potential risks from chemical exposure. The National Center for Computational Toxicology (NCCT) has measured, assembled and delivered an enormous quantity and diversity of data for the environmental sciences, including high-throughput in vitro screening data, in vivo and functional use data, exposure models and chemical databases with associated properties. The CompTox Chemicals Dashboard website provides access to data associated with ~900,000 chemical substances. New data are added on an ongoing basis, including the registration of new and emerging chemicals, data extracted from the literature, chemicals studied in our labs, and data of interest to specific research projects at the EPA. Hazard and exposure data have been assembled from a large number of public databases and as a result the dashboard surfaces hundreds of thousands of data points. Other data includes experimental and predicted physicochemical property data, in vitro bioassay data and millions of chemical identifiers (names and CAS Registry Numbers) to facilitate searching. Other integrated modules include real-time physicochemical and toxicity endpoint prediction and an integrated search to PubMed. This presentation will provide an overview of the CompTox Chemicals Dashboard and how it has developed into an integrated data hub for environmental data. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Non-targeted analysis (NTA) uses high-resolution mass spectrometry to better understand the identity of a wide variety of chemicals present in environmental samples (and other matrices). However, data processing remains challenging due to the vast number of chemicals detected in samples, software and computational requirements of data processing, and inherent uncertainty in confidently identifying chemicals from candidate lists. Analysis of the resultant mass spectrometry information relies on cheminformatics to identify and rank chemicals and the US EPA has developed functionality within the CompTox Chemicals Dashboard (https://comptox.epa.gov/dashboard) to address challenges related to this analysis. These tools include the generation of “MS-Ready” structures to optimize database searching, retention time prediction for candidate reduction, consensus ranking using chemical metadata, and in silico MS/MS fragmentation prediction for spectral matching. Combining these tools into a comprehensive workflow improves certainty in candidate identification. This presentation will review how the CompTox Chemicals Dashboard via its flexible search capabilities, rich data for ~875,000 chemical substances, and visualization approaches within this open chemistry resource provides a freely available software tool to support structure identification and NTA. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Researchers at EPA’s National Center for Computational Toxicology integrate advances in biology, chemistry, and computer science to examine the toxicity of chemicals and help prioritize chemicals for further research based on potential human health risks. The goal of this research program is to quickly evaluate thousands of chemicals, but at a much reduced cost and shorter time frame relative to traditional approaches. The data generated by the Center includes characterization of thousands of chemicals across hundreds of high-throughput screening assays, consumer use and production information, pharmacokinetic properties, literature data, physical-chemical properties as well as the predictive computational modeling of toxicity and exposure. We have developed a number of databases and applications to deliver the data to the public, academic community, industry stakeholders, and regulators. This presentation will provide an overview of our work to develop an architecture that integrates diverse large-scale data from the chemical and biological domains, our approaches to disseminate these data, and the delivery of models supporting predictive computational toxicology. In particular, this presentation will review our new CompTox Chemistry Dashboard and the developing software architecture. This abstract does not reflect U.S. EPA policy.
Presentation for Texas A&M Superfund Research Center virtual learning series, Big Data in Environmental Science and Toxicology. More details at https://superfund.tamu.edu/big-data-session-2-aug-18-2021/
This presentation was given at a TRIANGLE AREA MASS SPECTOMETRY meeting on 01/29/2019 in Research Triangle Park, North Carolina to provide a general overview of the CompTox Chemicals Dashboard to an audience of mass spectrometrists and people interested in the capabilities of the dashboard for chemical forensics, structure identification etc.
The U.S. Environmental Protection Agency (EPA) Computational Toxicology Program utilizes computational and data-driven approaches that integrate chemistry, exposure and biological data to help characterize potential risks from chemical exposure. The National Center for Computational Toxicology (NCCT) has measured, assembled and delivered an enormous quantity and diversity of data for the environmental sciences, including high-throughput in vitro screening data, in vivo and functional use data, exposure models and chemical databases with associated properties. The CompTox Chemicals Dashboard website provides access to data associated with ~900,000 chemical substances. New data are added on an ongoing basis, including the registration of new and emerging chemicals, data extracted from the literature, chemicals studied in our labs, and data of interest to specific research projects at the EPA. Hazard and exposure data have been assembled from a large number of public databases and as a result the dashboard surfaces hundreds of thousands of data points. Other data includes experimental and predicted physicochemical property data, in vitro bioassay data for over 4000 chemicals and ~1500 assays, and millions of chemical identifiers (names and CAS Registry Numbers) to facilitate searching. Other integrated modules include an interactive read-across module, real-time physicochemical and toxicity endpoint prediction and an integrated search to PubMed. This presentation will provide an overview of the CompTox Chemicals Dashboard and how it has developed into an integrated data hub for environmental data. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
This presentation was given at the ASMS Sanibel Conference "Unraveling the Exposome" and provided a general overview of the dashboard and how it integrates to many of the projects that we support but with a special focus on list generation, mass and formula searching based on MS-Ready structures and some of the prototypes that we have been developing to support non-targeted analysis.
Presentation given at the Federal Environment Symposium on March 28th 2022.
As part of its mission the Center for Computational Toxicology and Exposure (CCTE) delivers access to chemicals related data via online Dashboards. The CompTox Chemicals Dashboard (available at https://comptox.epa.gov/dashboard) provides access to >900,000 chemicals and associated data including experimental and predicted property data, in vivo hazard data, in vitro bioactivity data, exposure data, and various other data types. The application provides a set of flexible searches allowing for search, visualization and downloads of the data to the desktop for further interrogation. This presentation will provide an overview of the Dashboard and other proof-of-concept applications. For example, the Hazard Comparison Dashboard has a module which allows profiling of chemicals based on toxicity types (https://doi.org/10.1007/s10098-019-01795-w). This presentation will also introduce a number of proof-of-concept modules in development. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
High resolution mass spectrometry (HRMS) and non-targeted analysis (NTA) are of increasing interest in chemical forensics for the identification of emerging contaminants and chemical signatures of interest. At the US Environmental Protection Agency, our research using HRMS for non-targeted and suspect screening analyses utilizes databases and cheminformatics approaches that are applicable to chemical forensics. The CompTox Chemicals Dashboard is an open chemistry resource and web-based application containing data for ~900,000 substances. Basic functionality for searching through the data is provided through identifier searches, such as systematic name, trade names and CAS Registry Numbers. Advanced Search capabilities supporting mass spectrometry include mass and formula-based searches, combined substructure-mass searches and searching experimental mass spectral data against predicted fragmentation spectra. A specific type of data mapping in the underpinning database, using “MS-Ready” structures, has proven to be a valuable approach for structure identification that links structures that can be identified via HRMS with related substances in the form of salts, and other multi-component mixtures that are available in commerce. These MS-Ready structures have been used as an input set for computational MS-fragmentation to provide a database against which to search experimental data for spectral matching. This presentation will provide an overview of how the CompTox Chemicals Dashboard supports structure identification and non-targeted analysis in chemical forensics. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Analytical methods can vary in nature from detailed regulatory methods to more summary in nature. Regulatory method documents can include details of analytes which can be studied, supported matrices, reagents, methodological details, statistical performance, interlaboratory validation and other details. Summary methods provide a general overview of reagents, instrumentation and commonly a short list of analytes. Regulatory bodies including the US Environmental Protection Agency (US-EPA), US Geological Survey (USGS), US Department of Agriculture (USDA) and others provide detailed analytical methods and collections of summary methods from the agrochemical industry, such as the US-EPA Environmental Chemistry Methods (https://www.epa.gov/pesticide-analytical-methods/environmental-chemistry-methods-ecm). Instrument vendors also provide access to many hundreds of application notes which can be considered as summary methods. We have developed a cheminformatically enabled database of methods whereby chemicals have been extracted from the methods, with the identifiers (names and/or chemical abstracts registry numbers) mapped to chemical structures. The resulting database of almost 3000 methods can be searched by chemical name, CASRN, structure and similarity of chemical structure. The resulting database has been integrated into a web-based application and includes integration to public domain mass spectral data and filtering of the methods based on analyte, chemical class, method source and other related metadata. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Presentation at: Digital design of molecules and formulations (https://www.soci.org/events/ai-and-digitalisation/digital-design-of-molecules-and-formulations). The Chemicals Dashboard is a free web-based application from the United States Environmental Protection Agency that provides various types of data for ~900,000 chemicals. These data include physicochemical properties, in vivo and in vitro toxicity data and information regarding the presence of chemicals in commercial products, including formulation data where available. The dashboard allows for sourcing of data associated with singleton chemicals or a batch mode for downloading data for thousands of chemicals at a time. This presentation will provide an overview of the Dashboard, the myriad of data sources underpinning the application and potential applications of the dashboard.
The US EPA’s National Center for Computational Toxicology (NCCT) has been both measuring and aggregating data to support our research efforts for over a decade. We have delivered these data via a number of publicly accessible websites, so-called dashboards, to provide transparent access to the outputs of the center. Since the inception of our research, software projects technologies have changed dramatically, as have the expectations regarding the methods by which to access data. Our informatics efforts provide access to millions of dollars of high-throughput screening data available in open, downloadable formats, via web services and through a rich web interface. Similarly, we provide access to experimental and predicted data associated with ~760,000 substances to serve the environmental chemistry community, and open source code for predictive models. This presentation will provide an overview of the efforts of NCCT to provide transparent access to our research and data via our publications (and accompanying supplementary data), via our Open Data policies, and through our databases, software tools and web services. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...Andrew McEachran
The U.S. Environmental Protection Agency (EPA) Computational Toxicology Program integrates advances in biology, chemistry, and computer science to help prioritize chemicals for further research based on potential human health risks. This involves computational and data-driven approaches that integrate chemistry, exposure and biological data. The National Center for Computational Toxicology (NCCT) has measured, assembled and delivered an enormous quantity and diversity of data for the environmental sciences, including high-throughput in vitro screening data, in vivo and functional use data, exposure models and chemical databases with associated properties. The CompTox Chemistry Dashboard is a web-based application providing access to data associated with ~760,000 chemical substances. New data are continuously added to the database on an ongoing basis, along with registration of new and emerging chemicals. This includes data extracted from the literature, identified by our analytical labs, and otherwise of interest to support specific research projects to the agency. By adding these data, with their associated chemical identifiers (names and CAS Registry Numbers), the dashboard uses linking approaches to allow for automated searching of PubMed, Google Scholar and an array of public databases. This presentation will provide an overview of the Dashboard, how it has developed into an integrated data hub for environmental data, and how it can be used for the analysis of emerging chemicals in terms of sourcing related chemicals of interest, and deriving read-across as well as QSAR predictions in real time. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Tens of thousands of chemicals are currently in commerce, and hundreds more are introduced every year. Because current chemical testing is resource intensive, only a small fraction of chemicals have been adequately evaluated for potential human health effects. New technologies and computational tools have shown promise for closing this knowledge gap. In the U.S. EPA’s ToxCast effort, the use of ~700 high-throughput in vitro assays has broadly characterized the biological activity and potential mechanisms of ~1,800 chemicals. Coupling the high-throughput in vitro assays with additional in vitro pharmacokinetic assays and in vitro-to-in vivo extrapolation modeling allows conversion of in vitro bioactive concentrations to estimates to an administered dose (mg/kg/day). High throughput exposure models are generating exposure estimates based on key aspects of chemical production, fate, transport, and personal use. The path for incorporating new approach methods and technologies for prioritization and assessment of chemical alternatives poses multiple scientific challenges. These challenges include sufficient coverage of toxicological mechanisms to meaningfully interpret negative test results, development of increasingly relevant test systems, computational modeling to integrate experimental data, characterizing uncertainty, and efficient validation of the test systems and computational models. The presentation will cover progress at the U.S. EPA in the development and application of these technologies and approaches in evaluating alternatives and systematically addressing each of these challenges. This abstract does not necessarily reflect U.S. EPA policy.
High resolution mass spectrometry (HRMS) and non-targeted analysis (NTA) are advancing the identification of emerging contaminants in environmental matrices, improving the means by which exposure analyses can be conducted. However, confidence in structure identification of unknowns in NTA presents challenges to analytical chemists. Structure identification requires integration of complementary data types such as reference databases (either commercial or open databases), fragmentation prediction tools, and retention time prediction models. One goal of our research is to optimize and implement structure identification functionality within the US EPA’s CompTox Chemicals Dashboard, an open chemistry resource and web application containing data for ~900,000 substances. Database searching using mass or formula-based inputs has been optimized for structure identification using “MS-Ready Structures”: de-salted, stripped of stereochemistry, and mixture separated to replicate the form of a chemical observed via HRMS. Functionality to conduct batch searching of molecular formulae and monoisotopic masses has also been implemented. While the increasing number of free online databases are of value to support chemical structure verification and elucidation there are known issues regarding data quality and careful data curation is a very necessary part of the development of these resources. This presentation will provide an overview of our latest enhancements to the dashboard to support mass spectrometry, incorporation of specific datasets (i.e. to support breath research and household dust analysis) and the value of metadata and predicted fragmentation spectral matching to support structure identification. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Researchers at EPA’s National Center for Computational Toxicology integrate advances in biology, chemistry, and computer science to examine the toxicity of chemicals and help prioritize chemicals for further research based on potential human health risks. The goal of this research program is to quickly evaluate thousands of chemicals, but at a much reduced cost and shorter time frame relative to traditional approaches. The data generated by the Center includes characterization of thousands of chemicals across hundreds of high-throughput screening assays, consumer use and production information, pharmacokinetic properties, literature data, physical-chemical properties as well as the predictive computational modeling of toxicity and exposure. We have developed a number of databases and applications to deliver the data to the public, academic community, industry stakeholders, and regulators. This presentation will provide an overview of our work to develop an architecture that integrates diverse large-scale data from the chemical and biological domains, our approaches to disseminate these data, and the delivery of models supporting predictive computational toxicology. In particular, this presentation will review our new CompTox Chemistry Dashboard and the developing architecture to support real-time property and toxicity endpoint prediction. This abstract does not reflect U.S. EPA policy.
High resolution mass spectrometry (HRMS) and non-targeted analysis (NTA) are of increasing interest in chemical forensics for the identification of emerging contaminants and chemical signatures of interest. At the US Environmental Protection Agency, our research using HRMS for non-targeted and suspect screening analyses utilizes databases and cheminformatics approaches that are applicable to chemical forensics. The CompTox Chemistry Dashboard is an open chemistry resource and web-based application containing data for ~760,000 substances. Basic functionality for searching through the data is provided through identifier searches, such as systematic name, trade names and CAS Registry Numbers. Advanced Search capabilities supporting mass spectrometry include mass and formula-based searches, combined substructure-mass searches and searching experimental mass spectral data against predicted fragmentation spectra. A specific type of data mapping in the underpinning database, using “MS-Ready” structures, has proven to be a valuable approach for structure identification that links structures that can be identified via HRMS with related substances in the form of salts, and other multi-component mixtures that are available in commerce. This presentation will provide an overview of the CompTox Chemistry Dashboard and demonstrate its utility for supporting structure identification and NTA in chemical forensics. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
What is greenhouse gasses and how many gasses are there to affect the Earth.moosaasad1975
What are greenhouse gasses how they affect the earth and its environment what is the future of the environment and earth how the weather and the climate effects.
More Related Content
Similar to US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of Emerging Concern
Presentation for Texas A&M Superfund Research Center virtual learning series, Big Data in Environmental Science and Toxicology. More details at https://superfund.tamu.edu/big-data-session-1-july-14-2021/
The Center for Computational Toxicology and Exposure (CCTE) is part of the Office of Research and Development at the US Environmental Protection Agency. As part of its mission the center delivers access to chemicals related data via web-based freely accessible online Dashboards to disseminate data generated within the center as well as harvested and integrated from open databases around the world. The CompTox Chemicals Dashboard (available at https://comptox.epa.gov/dashboard) provides access to >1.2 million chemicals and associated data including experimental and predicted property data, in vivo hazard data, in vitro bioactivity data, exposure data, and various other data types. The curation of the chemicals dataset has included the development of over 400 segregated lists of chemicals that represent specific research areas of interest including disinfectant by-products, per- and polyfluoroalkyl substances (PFAS), extractables and leachables, and chemicals of emerging concern. The chemicals collection, the associated data, the lists and searches for mass and formulae makes the Dashboard an ideal foundation technology to support our colleagues working in the field of mass spectrometry, especially in targeted and non-targeted analysis. This presentation will provide an overview of the Dashboard, its value to the community in terms of providing access to the integrated and highly curated data, and its utility to support researchers in the field of mass spectrometry. New proof-of-concept projects will also be introduced including the development of a cheminformatics enabled methods database. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
High resolution mass spectrometry (HRMS) and non-targeted analysis (NTA) are utilized to identify emerging contaminants and chemical signatures of interest detected in various media. At the US Environmental Protection Agency the CompTox Chemicals Dashboard (https://comptox.epa.gov/dashboard) is an open chemistry resource and web-based application containing data for ~900,000 substances and supports non-targeted and suspect screening analyses. Searching functionality includes identifier searches (e.g. systematic names, trade names and CAS Registry Numbers), mass and formula-based searches and prototype developments include combined substructure-mass/formula searches and searching experimental mass spectral data against predicted fragmentation spectra. A specific type of data mapping in the database uses “MS-Ready” structures, a way to process all registered substances to separate multi-component chemicals into their individual components, removal of stereochemical bonds and desalting and neutralization. This MS-Ready processing supports batch-searching using either mass or formulae to identify candidate chemicals and their mapped substances. A number of chemical lists (https://comptox.epa.gov/dashboard/chemical_lists) have also been developed to support the identification of chemicals related to agrochemistry, specifically pesticides (both active and inert constituents), insecticides and their metabolites and environmental breakdown products). This presentation will provide an overview of how the CompTox Chemicals Dashboard supports mass spectrometry based structure identification and non-targeted analysis of chemicals in agrochemistry. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
The U.S. Environmental Protection Agency (EPA) Computational Toxicology Program utilizes computational and data-driven approaches that integrate chemistry, exposure and biological data to help characterize potential risks from chemical exposure. The National Center for Computational Toxicology (NCCT) has measured, assembled and delivered an enormous quantity and diversity of data for the environmental sciences, including high-throughput in vitro screening data, in vivo and functional use data, exposure models and chemical databases with associated properties. The CompTox Chemicals Dashboard website provides access to data associated with ~900,000 chemical substances. New data are added on an ongoing basis, including the registration of new and emerging chemicals, data extracted from the literature, chemicals studied in our labs, and data of interest to specific research projects at the EPA. Hazard and exposure data have been assembled from a large number of public databases and as a result the dashboard surfaces hundreds of thousands of data points. Other data includes experimental and predicted physicochemical property data, in vitro bioassay data and millions of chemical identifiers (names and CAS Registry Numbers) to facilitate searching. Other integrated modules include real-time physicochemical and toxicity endpoint prediction and an integrated search to PubMed. This presentation will provide an overview of the CompTox Chemicals Dashboard and how it has developed into an integrated data hub for environmental data. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Non-targeted analysis (NTA) uses high-resolution mass spectrometry to better understand the identity of a wide variety of chemicals present in environmental samples (and other matrices). However, data processing remains challenging due to the vast number of chemicals detected in samples, software and computational requirements of data processing, and inherent uncertainty in confidently identifying chemicals from candidate lists. Analysis of the resultant mass spectrometry information relies on cheminformatics to identify and rank chemicals and the US EPA has developed functionality within the CompTox Chemicals Dashboard (https://comptox.epa.gov/dashboard) to address challenges related to this analysis. These tools include the generation of “MS-Ready” structures to optimize database searching, retention time prediction for candidate reduction, consensus ranking using chemical metadata, and in silico MS/MS fragmentation prediction for spectral matching. Combining these tools into a comprehensive workflow improves certainty in candidate identification. This presentation will review how the CompTox Chemicals Dashboard via its flexible search capabilities, rich data for ~875,000 chemical substances, and visualization approaches within this open chemistry resource provides a freely available software tool to support structure identification and NTA. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Researchers at EPA’s National Center for Computational Toxicology integrate advances in biology, chemistry, and computer science to examine the toxicity of chemicals and help prioritize chemicals for further research based on potential human health risks. The goal of this research program is to quickly evaluate thousands of chemicals, but at a much reduced cost and shorter time frame relative to traditional approaches. The data generated by the Center includes characterization of thousands of chemicals across hundreds of high-throughput screening assays, consumer use and production information, pharmacokinetic properties, literature data, physical-chemical properties as well as the predictive computational modeling of toxicity and exposure. We have developed a number of databases and applications to deliver the data to the public, academic community, industry stakeholders, and regulators. This presentation will provide an overview of our work to develop an architecture that integrates diverse large-scale data from the chemical and biological domains, our approaches to disseminate these data, and the delivery of models supporting predictive computational toxicology. In particular, this presentation will review our new CompTox Chemistry Dashboard and the developing software architecture. This abstract does not reflect U.S. EPA policy.
Presentation for Texas A&M Superfund Research Center virtual learning series, Big Data in Environmental Science and Toxicology. More details at https://superfund.tamu.edu/big-data-session-2-aug-18-2021/
This presentation was given at a TRIANGLE AREA MASS SPECTOMETRY meeting on 01/29/2019 in Research Triangle Park, North Carolina to provide a general overview of the CompTox Chemicals Dashboard to an audience of mass spectrometrists and people interested in the capabilities of the dashboard for chemical forensics, structure identification etc.
The U.S. Environmental Protection Agency (EPA) Computational Toxicology Program utilizes computational and data-driven approaches that integrate chemistry, exposure and biological data to help characterize potential risks from chemical exposure. The National Center for Computational Toxicology (NCCT) has measured, assembled and delivered an enormous quantity and diversity of data for the environmental sciences, including high-throughput in vitro screening data, in vivo and functional use data, exposure models and chemical databases with associated properties. The CompTox Chemicals Dashboard website provides access to data associated with ~900,000 chemical substances. New data are added on an ongoing basis, including the registration of new and emerging chemicals, data extracted from the literature, chemicals studied in our labs, and data of interest to specific research projects at the EPA. Hazard and exposure data have been assembled from a large number of public databases and as a result the dashboard surfaces hundreds of thousands of data points. Other data includes experimental and predicted physicochemical property data, in vitro bioassay data for over 4000 chemicals and ~1500 assays, and millions of chemical identifiers (names and CAS Registry Numbers) to facilitate searching. Other integrated modules include an interactive read-across module, real-time physicochemical and toxicity endpoint prediction and an integrated search to PubMed. This presentation will provide an overview of the CompTox Chemicals Dashboard and how it has developed into an integrated data hub for environmental data. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
This presentation was given at the ASMS Sanibel Conference "Unraveling the Exposome" and provided a general overview of the dashboard and how it integrates to many of the projects that we support but with a special focus on list generation, mass and formula searching based on MS-Ready structures and some of the prototypes that we have been developing to support non-targeted analysis.
Presentation given at the Federal Environment Symposium on March 28th 2022.
As part of its mission the Center for Computational Toxicology and Exposure (CCTE) delivers access to chemicals related data via online Dashboards. The CompTox Chemicals Dashboard (available at https://comptox.epa.gov/dashboard) provides access to >900,000 chemicals and associated data including experimental and predicted property data, in vivo hazard data, in vitro bioactivity data, exposure data, and various other data types. The application provides a set of flexible searches allowing for search, visualization and downloads of the data to the desktop for further interrogation. This presentation will provide an overview of the Dashboard and other proof-of-concept applications. For example, the Hazard Comparison Dashboard has a module which allows profiling of chemicals based on toxicity types (https://doi.org/10.1007/s10098-019-01795-w). This presentation will also introduce a number of proof-of-concept modules in development. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
High resolution mass spectrometry (HRMS) and non-targeted analysis (NTA) are of increasing interest in chemical forensics for the identification of emerging contaminants and chemical signatures of interest. At the US Environmental Protection Agency, our research using HRMS for non-targeted and suspect screening analyses utilizes databases and cheminformatics approaches that are applicable to chemical forensics. The CompTox Chemicals Dashboard is an open chemistry resource and web-based application containing data for ~900,000 substances. Basic functionality for searching through the data is provided through identifier searches, such as systematic name, trade names and CAS Registry Numbers. Advanced Search capabilities supporting mass spectrometry include mass and formula-based searches, combined substructure-mass searches and searching experimental mass spectral data against predicted fragmentation spectra. A specific type of data mapping in the underpinning database, using “MS-Ready” structures, has proven to be a valuable approach for structure identification that links structures that can be identified via HRMS with related substances in the form of salts, and other multi-component mixtures that are available in commerce. These MS-Ready structures have been used as an input set for computational MS-fragmentation to provide a database against which to search experimental data for spectral matching. This presentation will provide an overview of how the CompTox Chemicals Dashboard supports structure identification and non-targeted analysis in chemical forensics. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Analytical methods can vary in nature from detailed regulatory methods to more summary in nature. Regulatory method documents can include details of analytes which can be studied, supported matrices, reagents, methodological details, statistical performance, interlaboratory validation and other details. Summary methods provide a general overview of reagents, instrumentation and commonly a short list of analytes. Regulatory bodies including the US Environmental Protection Agency (US-EPA), US Geological Survey (USGS), US Department of Agriculture (USDA) and others provide detailed analytical methods and collections of summary methods from the agrochemical industry, such as the US-EPA Environmental Chemistry Methods (https://www.epa.gov/pesticide-analytical-methods/environmental-chemistry-methods-ecm). Instrument vendors also provide access to many hundreds of application notes which can be considered as summary methods. We have developed a cheminformatically enabled database of methods whereby chemicals have been extracted from the methods, with the identifiers (names and/or chemical abstracts registry numbers) mapped to chemical structures. The resulting database of almost 3000 methods can be searched by chemical name, CASRN, structure and similarity of chemical structure. The resulting database has been integrated into a web-based application and includes integration to public domain mass spectral data and filtering of the methods based on analyte, chemical class, method source and other related metadata. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Presentation at: Digital design of molecules and formulations (https://www.soci.org/events/ai-and-digitalisation/digital-design-of-molecules-and-formulations). The Chemicals Dashboard is a free web-based application from the United States Environmental Protection Agency that provides various types of data for ~900,000 chemicals. These data include physicochemical properties, in vivo and in vitro toxicity data and information regarding the presence of chemicals in commercial products, including formulation data where available. The dashboard allows for sourcing of data associated with singleton chemicals or a batch mode for downloading data for thousands of chemicals at a time. This presentation will provide an overview of the Dashboard, the myriad of data sources underpinning the application and potential applications of the dashboard.
The US EPA’s National Center for Computational Toxicology (NCCT) has been both measuring and aggregating data to support our research efforts for over a decade. We have delivered these data via a number of publicly accessible websites, so-called dashboards, to provide transparent access to the outputs of the center. Since the inception of our research, software projects technologies have changed dramatically, as have the expectations regarding the methods by which to access data. Our informatics efforts provide access to millions of dollars of high-throughput screening data available in open, downloadable formats, via web services and through a rich web interface. Similarly, we provide access to experimental and predicted data associated with ~760,000 substances to serve the environmental chemistry community, and open source code for predictive models. This presentation will provide an overview of the efforts of NCCT to provide transparent access to our research and data via our publications (and accompanying supplementary data), via our Open Data policies, and through our databases, software tools and web services. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...Andrew McEachran
The U.S. Environmental Protection Agency (EPA) Computational Toxicology Program integrates advances in biology, chemistry, and computer science to help prioritize chemicals for further research based on potential human health risks. This involves computational and data-driven approaches that integrate chemistry, exposure and biological data. The National Center for Computational Toxicology (NCCT) has measured, assembled and delivered an enormous quantity and diversity of data for the environmental sciences, including high-throughput in vitro screening data, in vivo and functional use data, exposure models and chemical databases with associated properties. The CompTox Chemistry Dashboard is a web-based application providing access to data associated with ~760,000 chemical substances. New data are continuously added to the database on an ongoing basis, along with registration of new and emerging chemicals. This includes data extracted from the literature, identified by our analytical labs, and otherwise of interest to support specific research projects to the agency. By adding these data, with their associated chemical identifiers (names and CAS Registry Numbers), the dashboard uses linking approaches to allow for automated searching of PubMed, Google Scholar and an array of public databases. This presentation will provide an overview of the Dashboard, how it has developed into an integrated data hub for environmental data, and how it can be used for the analysis of emerging chemicals in terms of sourcing related chemicals of interest, and deriving read-across as well as QSAR predictions in real time. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Tens of thousands of chemicals are currently in commerce, and hundreds more are introduced every year. Because current chemical testing is resource intensive, only a small fraction of chemicals have been adequately evaluated for potential human health effects. New technologies and computational tools have shown promise for closing this knowledge gap. In the U.S. EPA’s ToxCast effort, the use of ~700 high-throughput in vitro assays has broadly characterized the biological activity and potential mechanisms of ~1,800 chemicals. Coupling the high-throughput in vitro assays with additional in vitro pharmacokinetic assays and in vitro-to-in vivo extrapolation modeling allows conversion of in vitro bioactive concentrations to estimates to an administered dose (mg/kg/day). High throughput exposure models are generating exposure estimates based on key aspects of chemical production, fate, transport, and personal use. The path for incorporating new approach methods and technologies for prioritization and assessment of chemical alternatives poses multiple scientific challenges. These challenges include sufficient coverage of toxicological mechanisms to meaningfully interpret negative test results, development of increasingly relevant test systems, computational modeling to integrate experimental data, characterizing uncertainty, and efficient validation of the test systems and computational models. The presentation will cover progress at the U.S. EPA in the development and application of these technologies and approaches in evaluating alternatives and systematically addressing each of these challenges. This abstract does not necessarily reflect U.S. EPA policy.
High resolution mass spectrometry (HRMS) and non-targeted analysis (NTA) are advancing the identification of emerging contaminants in environmental matrices, improving the means by which exposure analyses can be conducted. However, confidence in structure identification of unknowns in NTA presents challenges to analytical chemists. Structure identification requires integration of complementary data types such as reference databases (either commercial or open databases), fragmentation prediction tools, and retention time prediction models. One goal of our research is to optimize and implement structure identification functionality within the US EPA’s CompTox Chemicals Dashboard, an open chemistry resource and web application containing data for ~900,000 substances. Database searching using mass or formula-based inputs has been optimized for structure identification using “MS-Ready Structures”: de-salted, stripped of stereochemistry, and mixture separated to replicate the form of a chemical observed via HRMS. Functionality to conduct batch searching of molecular formulae and monoisotopic masses has also been implemented. While the increasing number of free online databases are of value to support chemical structure verification and elucidation there are known issues regarding data quality and careful data curation is a very necessary part of the development of these resources. This presentation will provide an overview of our latest enhancements to the dashboard to support mass spectrometry, incorporation of specific datasets (i.e. to support breath research and household dust analysis) and the value of metadata and predicted fragmentation spectral matching to support structure identification. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Researchers at EPA’s National Center for Computational Toxicology integrate advances in biology, chemistry, and computer science to examine the toxicity of chemicals and help prioritize chemicals for further research based on potential human health risks. The goal of this research program is to quickly evaluate thousands of chemicals, but at a much reduced cost and shorter time frame relative to traditional approaches. The data generated by the Center includes characterization of thousands of chemicals across hundreds of high-throughput screening assays, consumer use and production information, pharmacokinetic properties, literature data, physical-chemical properties as well as the predictive computational modeling of toxicity and exposure. We have developed a number of databases and applications to deliver the data to the public, academic community, industry stakeholders, and regulators. This presentation will provide an overview of our work to develop an architecture that integrates diverse large-scale data from the chemical and biological domains, our approaches to disseminate these data, and the delivery of models supporting predictive computational toxicology. In particular, this presentation will review our new CompTox Chemistry Dashboard and the developing architecture to support real-time property and toxicity endpoint prediction. This abstract does not reflect U.S. EPA policy.
High resolution mass spectrometry (HRMS) and non-targeted analysis (NTA) are of increasing interest in chemical forensics for the identification of emerging contaminants and chemical signatures of interest. At the US Environmental Protection Agency, our research using HRMS for non-targeted and suspect screening analyses utilizes databases and cheminformatics approaches that are applicable to chemical forensics. The CompTox Chemistry Dashboard is an open chemistry resource and web-based application containing data for ~760,000 substances. Basic functionality for searching through the data is provided through identifier searches, such as systematic name, trade names and CAS Registry Numbers. Advanced Search capabilities supporting mass spectrometry include mass and formula-based searches, combined substructure-mass searches and searching experimental mass spectral data against predicted fragmentation spectra. A specific type of data mapping in the underpinning database, using “MS-Ready” structures, has proven to be a valuable approach for structure identification that links structures that can be identified via HRMS with related substances in the form of salts, and other multi-component mixtures that are available in commerce. This presentation will provide an overview of the CompTox Chemistry Dashboard and demonstrate its utility for supporting structure identification and NTA in chemical forensics. This abstract does not necessarily represent the views or policies of the U.S. Environmental Protection Agency.
Similar to US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of Emerging Concern (20)
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
What is greenhouse gasses and how many gasses are there to affect the Earth.moosaasad1975
What are greenhouse gasses how they affect the earth and its environment what is the future of the environment and earth how the weather and the climate effects.
The ability to recreate computational results with minimal effort and actionable metrics provides a solid foundation for scientific research and software development. When people can replicate an analysis at the touch of a button using open-source software, open data, and methods to assess and compare proposals, it significantly eases verification of results, engagement with a diverse range of contributors, and progress. However, we have yet to fully achieve this; there are still many sociotechnical frictions.
Inspired by David Donoho's vision, this talk aims to revisit the three crucial pillars of frictionless reproducibility (data sharing, code sharing, and competitive challenges) with the perspective of deep software variability.
Our observation is that multiple layers — hardware, operating systems, third-party libraries, software versions, input data, compile-time options, and parameters — are subject to variability that exacerbates frictions but is also essential for achieving robust, generalizable results and fostering innovation. I will first review the literature, providing evidence of how the complex variability interactions across these layers affect qualitative and quantitative software properties, thereby complicating the reproduction and replication of scientific studies in various fields.
I will then present some software engineering and AI techniques that can support the strategic exploration of variability spaces. These include the use of abstractions and models (e.g., feature models), sampling strategies (e.g., uniform, random), cost-effective measurements (e.g., incremental build of software configurations), and dimensionality reduction methods (e.g., transfer learning, feature selection, software debloating).
I will finally argue that deep variability is both the problem and solution of frictionless reproducibility, calling the software science community to develop new methods and tools to manage variability and foster reproducibility in software systems.
Exposé invité Journées Nationales du GDR GPL 2024
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...Wasswaderrick3
In this book, we use conservation of energy techniques on a fluid element to derive the Modified Bernoulli equation of flow with viscous or friction effects. We derive the general equation of flow/ velocity and then from this we derive the Pouiselle flow equation, the transition flow equation and the turbulent flow equation. In the situations where there are no viscous effects , the equation reduces to the Bernoulli equation. From experimental results, we are able to include other terms in the Bernoulli equation. We also look at cases where pressure gradients exist. We use the Modified Bernoulli equation to derive equations of flow rate for pipes of different cross sectional areas connected together. We also extend our techniques of energy conservation to a sphere falling in a viscous medium under the effect of gravity. We demonstrate Stokes equation of terminal velocity and turbulent flow equation. We look at a way of calculating the time taken for a body to fall in a viscous medium. We also look at the general equation of terminal velocity.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Phenomics assisted breeding in crop improvementIshaGoswami9
As the population is increasing and will reach about 9 billion upto 2050. Also due to climate change, it is difficult to meet the food requirement of such a large population. Facing the challenges presented by resource shortages, climate
change, and increasing global population, crop yield and quality need to be improved in a sustainable way over the coming decades. Genetic improvement by breeding is the best way to increase crop productivity. With the rapid progression of functional
genomics, an increasing number of crop genomes have been sequenced and dozens of genes influencing key agronomic traits have been identified. However, current genome sequence information has not been adequately exploited for understanding
the complex characteristics of multiple gene, owing to a lack of crop phenotypic data. Efficient, automatic, and accurate technologies and platforms that can capture phenotypic data that can
be linked to genomics information for crop improvement at all growth stages have become as important as genotyping. Thus,
high-throughput phenotyping has become the major bottleneck restricting crop breeding. Plant phenomics has been defined as the high-throughput, accurate acquisition and analysis of multi-dimensional phenotypes
during crop growing stages at the organism level, including the cell, tissue, organ, individual plant, plot, and field levels. With the rapid development of novel sensors, imaging technology,
and analysis methods, numerous infrastructure platforms have been developed for phenotyping.
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Sérgio Sacani
Since volcanic activity was first discovered on Io from Voyager images in 1979, changes
on Io’s surface have been monitored from both spacecraft and ground-based telescopes.
Here, we present the highest spatial resolution images of Io ever obtained from a groundbased telescope. These images, acquired by the SHARK-VIS instrument on the Large
Binocular Telescope, show evidence of a major resurfacing event on Io’s trailing hemisphere. When compared to the most recent spacecraft images, the SHARK-VIS images
show that a plume deposit from a powerful eruption at Pillan Patera has covered part
of the long-lived Pele plume deposit. Although this type of resurfacing event may be common on Io, few have been detected due to the rarity of spacecraft visits and the previously low spatial resolution available from Earth-based telescopes. The SHARK-VIS instrument ushers in a new era of high resolution imaging of Io’s surface using adaptive
optics at visible wavelengths.
ISI 2024: Application Form (Extended), Exam Date (Out), EligibilitySciAstra
The Indian Statistical Institute (ISI) has extended its application deadline for 2024 admissions to April 2. Known for its excellence in statistics and related fields, ISI offers a range of programs from Bachelor's to Junior Research Fellowships. The admission test is scheduled for May 12, 2024. Eligibility varies by program, generally requiring a background in Mathematics and English for undergraduate courses and specific degrees for postgraduate and research positions. Application fees are ₹1500 for male general category applicants and ₹1000 for females. Applications are open to Indian and OCI candidates.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of Emerging Concern
1. US-EPA Cheminformatics Support for
Delivering Data Related to Chemicals
of Emerging Concern
Antony Williams
Center for Computational Toxicology and Exposure, US-EPA, RTP, NC
The views expressed in this presentation are those of the author
and do not necessarily reflect the views or policies of the U.S. EPA
2. The role of cheminformatics at EPA
• I am from the EPA Center for Computational Toxicology and Exposure
• We develop lots of prediction models and web-based applications
• Today’s presentation: how do our efforts support data dissemination
regarding chemicals of emerging concern and MS-NTA
2
2
Chemical Monitoring Needs
Exposure
Assessment
Dose-
Response
Assessment
Risk
Characterization
Hazard
Identification
3. Free-Access Cheminformatics Tools
• The Center for Computational Toxicology and Exposure many tools
• CompTox Chemicals Dashboard
• Proof-of-Concept cheminformatics modules
• Chemicals Hazard Profiling
• Chemical Transformations database (ChET)
• Analytical Methods and Open Spectra database (AMOS)
• All chemicals are stored/curated in DSSTox
3
5. Accessing DSSTox chemistry:
CompTox Chemicals Dashboard
•A publicly accessible website delivering:
• 1.2M chemicals with related property data
• Related substances: transformation products, mono/polymer
• Experimental/predicted physicochemical property data
• Experimental Human and Ecological hazard data
• Integration to “biological assay data”
• Information regarding chemicals in consumer products
• Links to other agency websites and public data resources
• “Batch searching” for tens to thousands of chemicals
5
9. Experimental Data
9
• Experimental data harvested
from public domain databases
and journal articles
• Data link back to provenance
• Data are used to build QSAR
models for real time predictions
• Data are available for download
and reuse
10. What is PFOS Called?
Synonyms, CASRNs and more
10
12. Relationships in the data
12
• Structure mappings -
between parent and
salts, multicomponent
chemicals, isotopomers
• Related substances –
monomer to polymer,
parent to transformation
products
13. Batch Searching is a big enabler
https://pubs.acs.org/doi/10.1021/acs.jcim.0c01273
13
14. Batch Searching
• Singleton searches are useful but people work with groups of chemicals
• Typical questions
• Find me all data based on the input of 1000 CASRNs, or 1000 names
• What are the physicochemical properties for a set of identifiers?
• What is the list of chemicals for the formula CxHyOz?
• What is the list of chemicals for a mass +/- error?
• Can I get chemical lists in Excel files? In SDF files?
• Can I include properties in the download file?
14
17. Batch Search
• All data can be
downloaded into Excel
files, CSV files or SDF
files and reused
• All data are Open
18. Chemical Lists
https://comptox.epa.gov/dashboard/chemical_lists
• Chemical lists are focused on regulations, research efforts and categories
• 425 lists and growing
• TSCA Inventory
• Clean Water Act Hazardous Substances
• Consumer Products database
• Chemicals of Emerging Concern
• PFAS lists
• Extractables and Leachables
• Lists are versioned and updated and new lists added regularly
18
25. Applications at the EPA
•We have ongoing efforts applying NTA to multiple
challenges including
• PFAS identification
• Pesticides in various matrices
• CECs in water
• Biosolids
•Examples include…
25
27. 27
Many chemicals observed in
consumer product extracts
More observed chemicals not
known to be in consumer
products
Why might the ‘other’
chemicals be in the products?
Many observed chemicals
known to be in consumer
products
Example 1: Consumer Product Analysis
29. 29
Significant differences between
chemicals in recycled vs. virgin products
for certain product & use categories
Most differences observed in paper
products and construction materials
Some uses (e.g., fragrances) highly
represented across all product/use
categories
Example 2: Recycled Product Analysis
31. Lots of “proof-of-concept” tools in development
• PoCs are research software builds to prove approaches before moving
into production software environments
• Assemble data, develop data model(s), test user interface approaches,
work with test user base to garner feedback
• Since PoCs are internal access data refreshes and application updates
can be more
31
34. AMOS: Analytical Methods and Spectra Database
• Three types of data in the database:
• Methods (regulatory, lab manuals and SOPs, publications, tech notes)
• Spectra (from public domain and our own laboratories)
• Fact Sheets (harvested from SWGDRUG and other sites)
• Currently contains >210,000 spectra, >700,000 external links, 4000
“Fact Sheets” and ~4000 methods
• ALL data are growing in number weekly at present
34
42. Our Data via services
https://api-ccte.epa.gov/docs/
42
43. Conclusions
• Our data resources underpin our research efforts – data quality is key
• Our web-based applications deliver our data to the community
• Our support for identifying chemicals of emerging concern is multi-fold
• Curated chemistry data streams
• Non-targeted analysis tool development and cheminformatics support
• NTA WebApp in development uses all data streams to support analysis
43
44. Acknowledgements and Contact Information
• The work presented here represents an enormous team of contributors
• Chemical curators
• Software developers and contractors
• Postdocs, SMEs and PIs
• Contact info: williams.antony@epa.gov
• Slides will be available at: https://www.slideshare.net/AntonyWilliams/
44