SlideShare a Scribd company logo
1 of 51
In Silico Approaches for Predicting Hazards
from Chemical Structure and Existing Data
Lyle D. Burgoon, Ph.D.
Leader, Bioinformatics and Computational Toxicology
US Army Engineer Research and Development Center
Opinions expressed are those of the author and do not necessarily reflect
US Army policy.
DECIDE FASTER!
DECIDE FASTER!
WITH SAME OR FEWER RESOURCES
DECIDE FASTER!
WITH SAME OR FEWER RESOURCES
WITH LESS DATA
Why????!!!!
• About 80,000 data poor chemicals in the environment
• Threatened and Endangered Species
• Ethics of testing
• Permits, practicality
• Human health
• Ethics of testing
• Species extrapolation issues
• Ecological species and population impacts
• Species extrapolation issues
• Ethics of testing
• Cost
• Which species get tested
There has got to be a better way…
What Should I Choose?
What Should I Choose?
Match your time constraints with what information you have
What Should I Choose?
Match your time constraints with what information you have
Emergency Response?
Military Intelligence?
Site cleanup?
Prioritization?
docking.
Capturing:
- Affinity
- Model protein crystal
- Any modifications to the
crystal
- Chemical structure
- Version of DAMSL model
DAMSL:
Digital Automated Molecular Screening Library
Capturing:
- Affinity
- Model protein crystal
- Any modifications to the
crystal
- Chemical structure
- Version of DAMSL model
DAMSL:
Digital Automated Molecular Screening Library
Downside: Accuracy tends to not be as great a structure-based model
qsar.
qsar.
in a nutshell.
qsar.
in a nutshell (spoiler alert: there’s a little math).
f(x) = hazard (yes/no)
f(x) = LD50
some mathematical function applied on x
some mathematical function applied on x
f( ) = hazard
f( ) = LD50
qsar.
deep learning.
deep learning.
its not just for cat
pictures any
more
Briefly, what is deep learning?
• Artificial intelligence approach
• Misconceptions
• Always requires a lot of data
• Not necessarily – relative to a lot of things, and what you’re trying to do
• Always overfits when you don’t give it a lot of data
• Not necessarily – depends on a lot of things; simpler methods can also overfit
• The architecture of your neural networks are important
• What is true…
• There’s a lot of art to designing the optimal network
• Like any technique or approach, it’s best to get training before you
jump in
• Lots of free training on the web, lots of tutorials
1 or more
hidden layers
of neurons
Probability of Hazard
1 or more
hidden layers
of neurons
Probability of Hazard
If you want to start learning deep
learning…
• Kaggle is a great place to learn – several tutorials
• Lots of blogs with tutorials
• Online and traditional courses are popping up a lot
Deep Learning Approach to
Predict PPAR-gamma Ligands
• Ground Truth Dataset: 796 chemicals
• Ligands: 33 chemicals
• Not Ligands: 763
• This is pretty typical – very few chemicals will be ligands
• Accuracy (10-fold cross-validation): 94.5%
assay data integration.
assay data integration.
bayesian network approach.
assay data integration.
deep learning approach.
APECS
Autoencoder Predicting Estrogenic Chemical
Substances
Capture:
• APECS version
• Estrogenicity
prediction
• Chemical information
• ToxCast data version
and assays used for
training APECS
• Sensitivity and
Specificity data
Burgoon, L.D. Computational Toxicology 2: 45-49. https://doi.org/10.1016/j.comtox.2017.03.002
APECS
Autoencoder Predicting Estrogenic Chemical
Substances
In Vivo Model
Sensitivity: 97%
Specificity: 80%
Accuracy: 91%
In Vitro Model
Sensitivity: 100%
Specificity: 75%
Accuracy: 93%
Burgoon, L.D. Computational Toxicology 2: 45-49. https://doi.org/10.1016/j.comtox.2017.03.002
adverse outcome pathway bayesian networks.
predict probability of adverse outcomes.
Example Workflow (steroidogenesis)
AOPXplorer
Visualize results using AOPXplorer – a
Cytoscape App that facilitates AOP-based
data visualization
https://github.com/DataSciBurgoon/bisct/releases/tag/1.1.2
Screenshot of BISCT following analysis of the ToxCast H295R prochloraz screening dataset (Karmaus, et al. (2016) ToxSci
150(2): 323-332).
Steatosis
We fed this data into our AOPBN
Angrish, M.M., et al (2017). Mechanistic Toxicity Tests Based on an Adverse Outcome Pathway Network for Hepatic
Steatosis. Toxicol. Sci. 159, 159–169.
We got these results
Why I like AOPBNs
• Causal networks
• Use maths to identify the Minimally Sufficient Set of
Key Events (MinSSKEs)
• Minimal set of key events sufficient to infer an adverse
outcome
• Devise scenarios to measure the value of information
associated with each key event and sets of key events
• Devise test batteries that maximize value of information while
minimizing resource costs
Value of key event analysis
practical advice.
What Should I Choose?
Match your time constraints with what information you have
Emergency Response?
Military Intelligence?
Site cleanup?
Prioritization?
Tools
• I’m developing freely available, open source,
‘government off the shelf’ software for everything
presented here
• If you are interested in learning how to do this stuff
on your own, chat me up
Acknowledgements
• Shannon Bell (ILS)
• Ed Perkins (Army ERDC)
• Natalia Vinas (Army ERDC)
• Agnes Karmaus (ILS)
• Michelle Angrish (EPA)
• Ingrid Druwe (formerly ORISE, currently EPA)
• Erin Yost (formerly ORISE, currently EPA)
• Kyle Painter (formerly ORISE)
• Supported by the US Army Environmental Quality and
Installations Program
Contact me for more!
• Email: lyle.d.burgoon@usace.army.mil
• Twitter: @DataSciBurgoon
• ORCID: https://orcid.org/0000-0003-4977-5352

More Related Content

What's hot

Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013Functional Genomics Data Society
 
Ngs workshop passarelli-mapping-1
Ngs workshop passarelli-mapping-1Ngs workshop passarelli-mapping-1
Ngs workshop passarelli-mapping-1Shaojun Xie
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Ola Spjuth
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsSean Ekins
 
Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)AllSeq
 
Why are we still doing industrial age drug
Why are we still doing industrial age drugWhy are we still doing industrial age drug
Why are we still doing industrial age drugSean Ekins
 
Using Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentUsing Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentEleanor Howe
 
Gene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialGene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialDmitry Grapov
 
Giab jan2016 analysis team breakout summary
Giab jan2016 analysis team breakout summaryGiab jan2016 analysis team breakout summary
Giab jan2016 analysis team breakout summaryGenomeInABottle
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Databricks
 
Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Greg Landrum
 
Understanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputUnderstanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputIddo
 
Considerations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflowConsiderations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflowEagle Genomics
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Ankur Khanna
 
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
BIOLINK 2008:    Linking database submissions to primary citations with PubMe...BIOLINK 2008:    Linking database submissions to primary citations with PubMe...
BIOLINK 2008: Linking database submissions to primary citations with PubMe...Heather Piwowar
 
Ilya Kupershmidt speaks at the Molecular Medicine Tri-Conference
Ilya Kupershmidt speaks at the Molecular Medicine Tri-ConferenceIlya Kupershmidt speaks at the Molecular Medicine Tri-Conference
Ilya Kupershmidt speaks at the Molecular Medicine Tri-ConferenceNextBio
 

What's hot (20)

Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
 
Ngs workshop passarelli-mapping-1
Ngs workshop passarelli-mapping-1Ngs workshop passarelli-mapping-1
Ngs workshop passarelli-mapping-1
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
 
Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)Practical Guide to the $1000 Genome (2014)
Practical Guide to the $1000 Genome (2014)
 
Why are we still doing industrial age drug
Why are we still doing industrial age drugWhy are we still doing industrial age drug
Why are we still doing industrial age drug
 
Using Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentUsing Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and development
 
Gene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -TutorialGene Ontology Enrichment Network Analysis -Tutorial
Gene Ontology Enrichment Network Analysis -Tutorial
 
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
 
Giab jan2016 analysis team breakout summary
Giab jan2016 analysis team breakout summaryGiab jan2016 analysis team breakout summary
Giab jan2016 analysis team breakout summary
 
Jan2016 bina giab
Jan2016 bina giabJan2016 bina giab
Jan2016 bina giab
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
 
Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...
 
Understanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputUnderstanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low Output
 
Phylogenetics: Making publication-quality tree figures
Phylogenetics: Making publication-quality tree figuresPhylogenetics: Making publication-quality tree figures
Phylogenetics: Making publication-quality tree figures
 
Considerations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflowConsiderations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflow
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
 
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
BIOLINK 2008:    Linking database submissions to primary citations with PubMe...BIOLINK 2008:    Linking database submissions to primary citations with PubMe...
BIOLINK 2008: Linking database submissions to primary citations with PubMe...
 
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
 
Ilya Kupershmidt speaks at the Molecular Medicine Tri-Conference
Ilya Kupershmidt speaks at the Molecular Medicine Tri-ConferenceIlya Kupershmidt speaks at the Molecular Medicine Tri-Conference
Ilya Kupershmidt speaks at the Molecular Medicine Tri-Conference
 

Similar to In Silico Approaches for Predicting Hazards from Chemical Structure and Existing Data

BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataPhilip Cheung
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...geraintduck
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisUniversity of California, Davis
 
Building bioinformatics resources for the global community
Building bioinformatics resources for the global communityBuilding bioinformatics resources for the global community
Building bioinformatics resources for the global communityExternalEvents
 
Jillian ms defense-4-14-14-ja-novid3
Jillian ms defense-4-14-14-ja-novid3Jillian ms defense-4-14-14-ja-novid3
Jillian ms defense-4-14-14-ja-novid3Jillian Aurisano
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08Russ Altman
 
Jillian ms defense-4-14-14-ja-novid2
Jillian ms defense-4-14-14-ja-novid2Jillian ms defense-4-14-14-ja-novid2
Jillian ms defense-4-14-14-ja-novid2Jillian Aurisano
 
Jillian ms defense-4-14-14-ja-novid2
Jillian ms defense-4-14-14-ja-novid2Jillian ms defense-4-14-14-ja-novid2
Jillian ms defense-4-14-14-ja-novid2Jillian Aurisano
 
Too good to be true? How validate your data
Too good to be true? How validate your dataToo good to be true? How validate your data
Too good to be true? How validate your dataAlex Henderson
 
sience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studysience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studywolf vanpaemel
 
Mining Big datasets to create and validate machine learning models
Mining Big datasets to create and validate machine learning modelsMining Big datasets to create and validate machine learning models
Mining Big datasets to create and validate machine learning modelsSean Ekins
 
Life sciences big data use cases
Life sciences big data use casesLife sciences big data use cases
Life sciences big data use casesGuy Coates
 

Similar to In Silico Approaches for Predicting Hazards from Chemical Structure and Existing Data (20)

BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
ChemSpider as an integration hub for interlinked chemistry data
ChemSpider as an integration hub for interlinked chemistry dataChemSpider as an integration hub for interlinked chemistry data
ChemSpider as an integration hub for interlinked chemistry data
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
 
Incorporating new technologies and High Throughput Screening in the design an...
Incorporating new technologies and High Throughput Screening in the design an...Incorporating new technologies and High Throughput Screening in the design an...
Incorporating new technologies and High Throughput Screening in the design an...
 
Building bioinformatics resources for the global community
Building bioinformatics resources for the global communityBuilding bioinformatics resources for the global community
Building bioinformatics resources for the global community
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Jillian ms defense-4-14-14-ja-novid3
Jillian ms defense-4-14-14-ja-novid3Jillian ms defense-4-14-14-ja-novid3
Jillian ms defense-4-14-14-ja-novid3
 
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08
 
New Approach Methods - What is That?
New Approach Methods - What is That?New Approach Methods - What is That?
New Approach Methods - What is That?
 
Jillian ms defense-4-14-14-ja-novid2
Jillian ms defense-4-14-14-ja-novid2Jillian ms defense-4-14-14-ja-novid2
Jillian ms defense-4-14-14-ja-novid2
 
Jillian ms defense-4-14-14-ja-novid2
Jillian ms defense-4-14-14-ja-novid2Jillian ms defense-4-14-14-ja-novid2
Jillian ms defense-4-14-14-ja-novid2
 
Too good to be true? How validate your data
Too good to be true? How validate your dataToo good to be true? How validate your data
Too good to be true? How validate your data
 
sience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real studysience 2.0 : an illustration of good research practices in a real study
sience 2.0 : an illustration of good research practices in a real study
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Mining Big datasets to create and validate machine learning models
Mining Big datasets to create and validate machine learning modelsMining Big datasets to create and validate machine learning models
Mining Big datasets to create and validate machine learning models
 
Life sciences big data use cases
Life sciences big data use casesLife sciences big data use cases
Life sciences big data use cases
 

More from U.S. Army Engineer Research and Development Center

More from U.S. Army Engineer Research and Development Center (6)

What's the Harm in Toxicological Fallacies and Misrepresentations
What's the Harm in Toxicological Fallacies and MisrepresentationsWhat's the Harm in Toxicological Fallacies and Misrepresentations
What's the Harm in Toxicological Fallacies and Misrepresentations
 
Use of Artificial Intelligence for Literature Screening
Use of Artificial Intelligence for Literature ScreeningUse of Artificial Intelligence for Literature Screening
Use of Artificial Intelligence for Literature Screening
 
Perspective on the US Army's Uses for Predictive Models of Acute Oral Toxicity
Perspective on the US Army's Uses for Predictive Models of Acute Oral ToxicityPerspective on the US Army's Uses for Predictive Models of Acute Oral Toxicity
Perspective on the US Army's Uses for Predictive Models of Acute Oral Toxicity
 
Safe Environmental Exposure Levels for RDX and TNT Using Toxicogenomic Data
Safe Environmental Exposure Levels for RDX and TNT Using Toxicogenomic DataSafe Environmental Exposure Levels for RDX and TNT Using Toxicogenomic Data
Safe Environmental Exposure Levels for RDX and TNT Using Toxicogenomic Data
 
Integrating Toxicological Data with BISCT and GRAVEE for Better Chemical Asse...
Integrating Toxicological Data with BISCT and GRAVEE for Better Chemical Asse...Integrating Toxicological Data with BISCT and GRAVEE for Better Chemical Asse...
Integrating Toxicological Data with BISCT and GRAVEE for Better Chemical Asse...
 
Using Machine Intelligence to Perform Predictive Toxicology
Using Machine Intelligence to Perform Predictive ToxicologyUsing Machine Intelligence to Perform Predictive Toxicology
Using Machine Intelligence to Perform Predictive Toxicology
 

Recently uploaded

Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)DHURKADEVIBASKAR
 
Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Jshifa
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaPraksha3
 
Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.k64182334
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 

Recently uploaded (20)

Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)
 
Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
 
Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 

In Silico Approaches for Predicting Hazards from Chemical Structure and Existing Data

  • 1. In Silico Approaches for Predicting Hazards from Chemical Structure and Existing Data Lyle D. Burgoon, Ph.D. Leader, Bioinformatics and Computational Toxicology US Army Engineer Research and Development Center Opinions expressed are those of the author and do not necessarily reflect US Army policy.
  • 3. DECIDE FASTER! WITH SAME OR FEWER RESOURCES
  • 4. DECIDE FASTER! WITH SAME OR FEWER RESOURCES WITH LESS DATA
  • 5. Why????!!!! • About 80,000 data poor chemicals in the environment • Threatened and Endangered Species • Ethics of testing • Permits, practicality • Human health • Ethics of testing • Species extrapolation issues • Ecological species and population impacts • Species extrapolation issues • Ethics of testing • Cost • Which species get tested
  • 6. There has got to be a better way…
  • 7.
  • 8. What Should I Choose?
  • 9. What Should I Choose? Match your time constraints with what information you have
  • 10. What Should I Choose? Match your time constraints with what information you have Emergency Response? Military Intelligence? Site cleanup? Prioritization?
  • 12.
  • 13. Capturing: - Affinity - Model protein crystal - Any modifications to the crystal - Chemical structure - Version of DAMSL model DAMSL: Digital Automated Molecular Screening Library
  • 14. Capturing: - Affinity - Model protein crystal - Any modifications to the crystal - Chemical structure - Version of DAMSL model DAMSL: Digital Automated Molecular Screening Library Downside: Accuracy tends to not be as great a structure-based model
  • 15. qsar.
  • 17. qsar. in a nutshell (spoiler alert: there’s a little math).
  • 18. f(x) = hazard (yes/no) f(x) = LD50 some mathematical function applied on x some mathematical function applied on x
  • 19.
  • 20. f( ) = hazard
  • 21. f( ) = LD50
  • 22.
  • 24. deep learning. its not just for cat pictures any more
  • 25. Briefly, what is deep learning? • Artificial intelligence approach • Misconceptions • Always requires a lot of data • Not necessarily – relative to a lot of things, and what you’re trying to do • Always overfits when you don’t give it a lot of data • Not necessarily – depends on a lot of things; simpler methods can also overfit • The architecture of your neural networks are important • What is true… • There’s a lot of art to designing the optimal network • Like any technique or approach, it’s best to get training before you jump in • Lots of free training on the web, lots of tutorials
  • 26. 1 or more hidden layers of neurons Probability of Hazard
  • 27. 1 or more hidden layers of neurons Probability of Hazard
  • 28. If you want to start learning deep learning… • Kaggle is a great place to learn – several tutorials • Lots of blogs with tutorials • Online and traditional courses are popping up a lot
  • 29. Deep Learning Approach to Predict PPAR-gamma Ligands • Ground Truth Dataset: 796 chemicals • Ligands: 33 chemicals • Not Ligands: 763 • This is pretty typical – very few chemicals will be ligands • Accuracy (10-fold cross-validation): 94.5%
  • 31. assay data integration. bayesian network approach.
  • 32.
  • 33.
  • 34. assay data integration. deep learning approach.
  • 35. APECS Autoencoder Predicting Estrogenic Chemical Substances Capture: • APECS version • Estrogenicity prediction • Chemical information • ToxCast data version and assays used for training APECS • Sensitivity and Specificity data Burgoon, L.D. Computational Toxicology 2: 45-49. https://doi.org/10.1016/j.comtox.2017.03.002
  • 36. APECS Autoencoder Predicting Estrogenic Chemical Substances In Vivo Model Sensitivity: 97% Specificity: 80% Accuracy: 91% In Vitro Model Sensitivity: 100% Specificity: 75% Accuracy: 93% Burgoon, L.D. Computational Toxicology 2: 45-49. https://doi.org/10.1016/j.comtox.2017.03.002
  • 37. adverse outcome pathway bayesian networks. predict probability of adverse outcomes.
  • 38. Example Workflow (steroidogenesis) AOPXplorer Visualize results using AOPXplorer – a Cytoscape App that facilitates AOP-based data visualization
  • 40. Screenshot of BISCT following analysis of the ToxCast H295R prochloraz screening dataset (Karmaus, et al. (2016) ToxSci 150(2): 323-332).
  • 41.
  • 43. We fed this data into our AOPBN Angrish, M.M., et al (2017). Mechanistic Toxicity Tests Based on an Adverse Outcome Pathway Network for Hepatic Steatosis. Toxicol. Sci. 159, 159–169.
  • 44. We got these results
  • 45. Why I like AOPBNs • Causal networks • Use maths to identify the Minimally Sufficient Set of Key Events (MinSSKEs) • Minimal set of key events sufficient to infer an adverse outcome • Devise scenarios to measure the value of information associated with each key event and sets of key events • Devise test batteries that maximize value of information while minimizing resource costs
  • 46. Value of key event analysis
  • 48. What Should I Choose? Match your time constraints with what information you have Emergency Response? Military Intelligence? Site cleanup? Prioritization?
  • 49. Tools • I’m developing freely available, open source, ‘government off the shelf’ software for everything presented here • If you are interested in learning how to do this stuff on your own, chat me up
  • 50. Acknowledgements • Shannon Bell (ILS) • Ed Perkins (Army ERDC) • Natalia Vinas (Army ERDC) • Agnes Karmaus (ILS) • Michelle Angrish (EPA) • Ingrid Druwe (formerly ORISE, currently EPA) • Erin Yost (formerly ORISE, currently EPA) • Kyle Painter (formerly ORISE) • Supported by the US Army Environmental Quality and Installations Program
  • 51. Contact me for more! • Email: lyle.d.burgoon@usace.army.mil • Twitter: @DataSciBurgoon • ORCID: https://orcid.org/0000-0003-4977-5352