SlideShare a Scribd company logo
1 of 19
Download to read offline
TITLE OF PRESENTATION |
Presented By
Date
CLARK Matthew
August 2018
Exploring the use of conditional generative
adversarial networks (cGAN) to analyze chemical
reactions via electron density fields
Matthew CLARK
Frederik van den BROEK
ACS 256th Annual Meeting 2018 Boston MA
CINF 149
PAPER ID: 2967997
TITLE OF PRESENTATION |
• Published chemistry -> descriptors -> predictive system
• Other workers use a variety of descriptors
• SMILES strings
• Connectivity matrix/graphs
• These have strengths and weaknesses
• Chemistry is based on movement of electrons
• Bonds are subjective and not limited to those used in sketching
programs
Chemistry Deep Learning
TITLE OF PRESENTATION | 3
Examples of problems with “normal” stick chemical structure
drawing
Charge is a molecular attribute, not
atomic
Representation of bond order is
approximate
How will you draw this?
These are challenging
Solid state –
catalysis/polymerization
Challenges
TITLE OF PRESENTATION |
• Electrons define chemistry
• Independent of the lines drawn for bond order
• Better representation of what is happening
4
Lets look at electrons for studying reactivity
Woodward , Fukui, and Hoffmann were pretty successful with
this kind of analysis ☺
TITLE OF PRESENTATION | 5
Electron Density Computed with Extended Hückel Theory
Electron density represented by
density values in a 3D regular
field
These allow point-by-point
comparisons among molecules
FORTICON8 – QCPE571 – ‘77
PSI1EHT – Severance ’86
Visualized with Pymol™, colored by
density written as b-factors in pdb
Essentially the same
FORTRAN program
Hoffman used
TITLE OF PRESENTATION | 6
Generative Adversarial Network – short explanation
Network
generates
proposed output
Training process
Discriminator
network trains
To distinguish
good/bad
output
“Real” output
used for training
Final output
Adversarial Feedback
Train on these
density changes
TITLE OF PRESENTATION | 7
Training and output
Predicted outputTraining transformation
Input for
prediction
TITLE OF PRESENTATION | 8
Other applications of generative adversarial networks
Image to Image Translation with Conditional Adversarial Networks ArXiv, Nov 2017 Isola, Zhu,
Zhou, Efros https://arxiv.org/pdf/1611.07004v2.pdf
• Used to colorize photos and movies
• Good for ‘high dimensionality’ problems
TITLE OF PRESENTATION | 9
Generating network
3D fields
TITLE OF PRESENTATION | 10
Subsampling finds local electron density patterns
Sampling
subregion
over whole
region
• In movie colorization this is how it correctly renders a
hand regardless of position in the frame
• In chemistry it can create models regardless of the
orientation or exact location of a functional group in the
molecule
TITLE OF PRESENTATION | 11
Processing reactions to produce training set
A -> B transformations
3,278 reactions that met
MW, Yield criteria
KNIME workflow used to
process RD files into
reactants and products for
training
TITLE OF PRESENTATION | 12
High level process description
Select
reactions
Compute
EHMO
Reactant/product
Convert to
3D ED field
Train Deep
learning
Select
compound
Compute
EHMO
Convert to
3D ED field
Predict using
DL model
Training
Prediction
TITLE OF PRESENTATION |
• If trained on rotations can
produce a ‘canonical form’
• Insensitive to rotations
• Can be trained to deconvolute
conformations
• This should be explored more; is
a useful capability
13
Experiments with basic cheminformatics
TITLE OF PRESENTATION | 14
Training Examples
Tungsten complex chemistry – not addressable by most methods
TITLE OF PRESENTATION | 15
Training
TITLE OF PRESENTATION | 16
Reactants to Products Training
• Training to answer: what can this molecule become?
• Look at metabolites
• Transformations that could happen to a given starting
material
TITLE OF PRESENTATION | 17
Products to Reactants Training
• Training to answer: what molecule could become this molecule?
• Useful for retrosynthesis
TITLE OF PRESENTATION |
• The method appears to have promise
• Must interpret electron density to understand compounds
• Similar to crystallography!
• Output can be indistinct when several possibilities exist
• Koes group (Pitt) has some interesting solutions
• Gives insight into transformations
• Where transformations are likely to take place in a molecule
• Scoring results – how?
• Many to one relationship is not represented well
• System predicts only one output from an input structure
• Should include the nuclei!
• Difficult to distinguish among electronegative elements.
• Select a better test/reaction selection to demonstrate efficacy
• Autoencoding input
• Reproduce Woodward-Hoffmann rules with AI?
• Work with A + B -> C in addition to A -> B
18
Learnings
Good
Challenges
Next Step
TITLE OF PRESENTATION |
• Elsevier Machine Learning Services Team
• Frederik “Flying Dutchman” van den Broek
• Markus “Blockchain” Bussen
• Sally “Renzo” Makady
• Jabe “The Machine” Wilson
• Anton “Pathway” Yuryev
• Maria “Animal” Shkrob
• Tom “Running Man” Woodcock
• Tim “Hardcore” Hoctor
• You?
• Recruiting now in Boston/Cambridge area!
19

More Related Content

Similar to Use of GAN's to analyze chemical reactions

The Impact of Applying LabVIEW Simulation Platform in Teaching Power Electron...
The Impact of Applying LabVIEW Simulation Platform in Teaching Power Electron...The Impact of Applying LabVIEW Simulation Platform in Teaching Power Electron...
The Impact of Applying LabVIEW Simulation Platform in Teaching Power Electron...Yashodhan Mohan Bhatnagar
 
A new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataA new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataMark Heckmann
 
ResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksSeunghyun Hwang
 
Monte Carlo Modeling:Solar Cell
Monte Carlo Modeling:Solar Cell Monte Carlo Modeling:Solar Cell
Monte Carlo Modeling:Solar Cell Daniel Lee
 
Research Presentation.pptx
Research Presentation.pptxResearch Presentation.pptx
Research Presentation.pptxcomplab3
 
Course amplificadores I
Course amplificadores ICourse amplificadores I
Course amplificadores Ijaroldane
 
Quantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modelingQuantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modelingAkshay Kank
 
CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)Pinky Vincent
 
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendati...
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendati...LightGCN: Simplifying and Powering Graph Convolution Network for Recommendati...
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendati...ssuser2624f71
 
Your Classifier is Secretly an Energy based model and you should treat it lik...
Your Classifier is Secretly an Energy based model and you should treat it lik...Your Classifier is Secretly an Energy based model and you should treat it lik...
Your Classifier is Secretly an Energy based model and you should treat it lik...Seunghyun Hwang
 
Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...André Gonçalves
 

Similar to Use of GAN's to analyze chemical reactions (20)

The Impact of Applying LabVIEW Simulation Platform in Teaching Power Electron...
The Impact of Applying LabVIEW Simulation Platform in Teaching Power Electron...The Impact of Applying LabVIEW Simulation Platform in Teaching Power Electron...
The Impact of Applying LabVIEW Simulation Platform in Teaching Power Electron...
 
paper
paperpaper
paper
 
A new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataA new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid data
 
ResNeSt: Split-Attention Networks
ResNeSt: Split-Attention NetworksResNeSt: Split-Attention Networks
ResNeSt: Split-Attention Networks
 
Monte Carlo Modeling:Solar Cell
Monte Carlo Modeling:Solar Cell Monte Carlo Modeling:Solar Cell
Monte Carlo Modeling:Solar Cell
 
Computational chemistry
Computational chemistryComputational chemistry
Computational chemistry
 
Research Presentation.pptx
Research Presentation.pptxResearch Presentation.pptx
Research Presentation.pptx
 
docking
docking docking
docking
 
Course amplificadores I
Course amplificadores ICourse amplificadores I
Course amplificadores I
 
FinalReport
FinalReportFinalReport
FinalReport
 
Applications of Computational Quantum Chemistry
Applications of Computational Quantum ChemistryApplications of Computational Quantum Chemistry
Applications of Computational Quantum Chemistry
 
Quantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modelingQuantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modeling
 
The Importance of Open Data and Models for Energy Systems Analysis
The Importance of Open Data and Models for Energy Systems AnalysisThe Importance of Open Data and Models for Energy Systems Analysis
The Importance of Open Data and Models for Energy Systems Analysis
 
CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)
 
Viva Voce poster
Viva Voce posterViva Voce poster
Viva Voce poster
 
oo testing.pptx
oo testing.pptxoo testing.pptx
oo testing.pptx
 
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendati...
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendati...LightGCN: Simplifying and Powering Graph Convolution Network for Recommendati...
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendati...
 
Fahroo - Computational Mathematics - Spring Review 2012
Fahroo - Computational Mathematics - Spring Review 2012 Fahroo - Computational Mathematics - Spring Review 2012
Fahroo - Computational Mathematics - Spring Review 2012
 
Your Classifier is Secretly an Energy based model and you should treat it lik...
Your Classifier is Secretly an Energy based model and you should treat it lik...Your Classifier is Secretly an Energy based model and you should treat it lik...
Your Classifier is Secretly an Energy based model and you should treat it lik...
 
Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...
 

Recently uploaded

Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...Monika Rani
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLkantirani197
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxDiariAli
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body Areesha Ahmad
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Silpa
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxSilpa
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.Silpa
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.Silpa
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptxArvind Kumar
 

Recently uploaded (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 

Use of GAN's to analyze chemical reactions

  • 1. TITLE OF PRESENTATION | Presented By Date CLARK Matthew August 2018 Exploring the use of conditional generative adversarial networks (cGAN) to analyze chemical reactions via electron density fields Matthew CLARK Frederik van den BROEK ACS 256th Annual Meeting 2018 Boston MA CINF 149 PAPER ID: 2967997
  • 2. TITLE OF PRESENTATION | • Published chemistry -> descriptors -> predictive system • Other workers use a variety of descriptors • SMILES strings • Connectivity matrix/graphs • These have strengths and weaknesses • Chemistry is based on movement of electrons • Bonds are subjective and not limited to those used in sketching programs Chemistry Deep Learning
  • 3. TITLE OF PRESENTATION | 3 Examples of problems with “normal” stick chemical structure drawing Charge is a molecular attribute, not atomic Representation of bond order is approximate How will you draw this? These are challenging Solid state – catalysis/polymerization Challenges
  • 4. TITLE OF PRESENTATION | • Electrons define chemistry • Independent of the lines drawn for bond order • Better representation of what is happening 4 Lets look at electrons for studying reactivity Woodward , Fukui, and Hoffmann were pretty successful with this kind of analysis ☺
  • 5. TITLE OF PRESENTATION | 5 Electron Density Computed with Extended Hückel Theory Electron density represented by density values in a 3D regular field These allow point-by-point comparisons among molecules FORTICON8 – QCPE571 – ‘77 PSI1EHT – Severance ’86 Visualized with Pymol™, colored by density written as b-factors in pdb Essentially the same FORTRAN program Hoffman used
  • 6. TITLE OF PRESENTATION | 6 Generative Adversarial Network – short explanation Network generates proposed output Training process Discriminator network trains To distinguish good/bad output “Real” output used for training Final output Adversarial Feedback Train on these density changes
  • 7. TITLE OF PRESENTATION | 7 Training and output Predicted outputTraining transformation Input for prediction
  • 8. TITLE OF PRESENTATION | 8 Other applications of generative adversarial networks Image to Image Translation with Conditional Adversarial Networks ArXiv, Nov 2017 Isola, Zhu, Zhou, Efros https://arxiv.org/pdf/1611.07004v2.pdf • Used to colorize photos and movies • Good for ‘high dimensionality’ problems
  • 9. TITLE OF PRESENTATION | 9 Generating network 3D fields
  • 10. TITLE OF PRESENTATION | 10 Subsampling finds local electron density patterns Sampling subregion over whole region • In movie colorization this is how it correctly renders a hand regardless of position in the frame • In chemistry it can create models regardless of the orientation or exact location of a functional group in the molecule
  • 11. TITLE OF PRESENTATION | 11 Processing reactions to produce training set A -> B transformations 3,278 reactions that met MW, Yield criteria KNIME workflow used to process RD files into reactants and products for training
  • 12. TITLE OF PRESENTATION | 12 High level process description Select reactions Compute EHMO Reactant/product Convert to 3D ED field Train Deep learning Select compound Compute EHMO Convert to 3D ED field Predict using DL model Training Prediction
  • 13. TITLE OF PRESENTATION | • If trained on rotations can produce a ‘canonical form’ • Insensitive to rotations • Can be trained to deconvolute conformations • This should be explored more; is a useful capability 13 Experiments with basic cheminformatics
  • 14. TITLE OF PRESENTATION | 14 Training Examples Tungsten complex chemistry – not addressable by most methods
  • 15. TITLE OF PRESENTATION | 15 Training
  • 16. TITLE OF PRESENTATION | 16 Reactants to Products Training • Training to answer: what can this molecule become? • Look at metabolites • Transformations that could happen to a given starting material
  • 17. TITLE OF PRESENTATION | 17 Products to Reactants Training • Training to answer: what molecule could become this molecule? • Useful for retrosynthesis
  • 18. TITLE OF PRESENTATION | • The method appears to have promise • Must interpret electron density to understand compounds • Similar to crystallography! • Output can be indistinct when several possibilities exist • Koes group (Pitt) has some interesting solutions • Gives insight into transformations • Where transformations are likely to take place in a molecule • Scoring results – how? • Many to one relationship is not represented well • System predicts only one output from an input structure • Should include the nuclei! • Difficult to distinguish among electronegative elements. • Select a better test/reaction selection to demonstrate efficacy • Autoencoding input • Reproduce Woodward-Hoffmann rules with AI? • Work with A + B -> C in addition to A -> B 18 Learnings Good Challenges Next Step
  • 19. TITLE OF PRESENTATION | • Elsevier Machine Learning Services Team • Frederik “Flying Dutchman” van den Broek • Markus “Blockchain” Bussen • Sally “Renzo” Makady • Jabe “The Machine” Wilson • Anton “Pathway” Yuryev • Maria “Animal” Shkrob • Tom “Running Man” Woodcock • Tim “Hardcore” Hoctor • You? • Recruiting now in Boston/Cambridge area! 19