SlideShare a Scribd company logo
1 of 12
Download to read offline
A Quantum Chemist Meets Cheminformatics
@janhjensen
Jan H. Jensen, University of Copenhagen
Feel free to tweet, record, …
#WATOC
2018
2017
2016
How I met RDKit
RDKit
N
H
N + Br+
-H+
N
H
N N
H
N N
H
N
Br
Br
Br
or ?
N
H
N
N
H
N H
H
H
N
H
N
H
N
H
N
169.4182.2 181.1
H HH
H
PM3/COSMOheat of formation
Kcal/mol
90% success rate for 520 compounds
Workflow / Automation
Molecule Protonated
isomers
Conformational
search
Find lowest
energy isomer
Display
result
Check for
proton transfer
ChemDraw RDKit
SMILES
c1cnc(cc1c1n(c(c(n1)c1ccc(cc1Cl)Cl)C(=O)OC)COCC[Si](C)(C)C)NC(=O)C
RDKitRDKit
N
N
N
Cl
Cl
O
O
O
Si
N
H
O
1
RegioSQM
23
4
5
6
6 isomers x 20 confs
SMILES
github.com/jensengroup/RegioSQM
Web server
regiosqm.org
xyz2mol
Molecule Protonated
isomers
Conformational
search
Find lowest
energy isomer
Display
result
Check for
proton transfer
ChemDraw RDKit
SMILES
RDKitRDKit
xyz2mol
6 isomers x 20 confs
N
N
N
Cl
Cl
O
O
O
Si
N
H
O N
N
N
Cl
Cl
O
O
O
Si
N
H
O
xyx2mol converts an xyz file to an RDKit mol object
(needs the molecular charge)
github.com/jensengroup/xyz2mol
2019
2004
2013
github.com/jensengroup/GB-GA
An RDKit implementation of
and
Last example
crossover
Unpublished: finding molecules that absorb at 600 nm
Mating pool size: 20, mutation rate: 0.05, sTDA-xTB, 10 runs
Starting from random molecule in the ZINC data base
O
N
NH
O
O
F
O
NH2
N
O
NH
N
H
O
O
N
N
N
NH2
Does it work?
H
N
O
N
N
N
O
O
O
N
O
Summary/Outlook
RDKit changed my research life
Quantum chemical studies need to be automated
Manual work replacing CPU power as rate limiting step
Mistakes become increasingly common
QM students need to learn Python/RDKit
“QM-needs” for RDKit
Conf search for finding global minimum
Generalized Born solvation model

More Related Content

Similar to A Quantum Chemist Meets Cheminformatics

Using semiempirical methods for fast and automated predictions
Using semiempirical methods for fast and automated predictionsUsing semiempirical methods for fast and automated predictions
Using semiempirical methods for fast and automated predictionsmolmodbasics
 
LSSC2011 Optimization of intermolecular interaction potential energy paramete...
LSSC2011 Optimization of intermolecular interaction potential energy paramete...LSSC2011 Optimization of intermolecular interaction potential energy paramete...
LSSC2011 Optimization of intermolecular interaction potential energy paramete...Dragan Sahpaski
 
Pseudo Random Number Generators
Pseudo Random Number GeneratorsPseudo Random Number Generators
Pseudo Random Number GeneratorsDarshini Parikh
 
The PubChemQC Project
The PubChemQC ProjectThe PubChemQC Project
The PubChemQC ProjectMaho Nakata
 
General Cryptography :WHY JOHNNY THE DEVELOPER CAN’T WORK WITH PUBLIC KEY CER...
General Cryptography :WHY JOHNNY THE DEVELOPER CAN’T WORK WITH PUBLIC KEY CER...General Cryptography :WHY JOHNNY THE DEVELOPER CAN’T WORK WITH PUBLIC KEY CER...
General Cryptography :WHY JOHNNY THE DEVELOPER CAN’T WORK WITH PUBLIC KEY CER...Priyanka Aash
 
Kobeworkshop pubchemqc project
Kobeworkshop pubchemqc projectKobeworkshop pubchemqc project
Kobeworkshop pubchemqc projectMaho Nakata
 
Predicting accurate absolute binding energies in aqueous solution: thermodyn...
Predicting accurate absolute binding energies in aqueous solution: thermodyn...Predicting accurate absolute binding energies in aqueous solution: thermodyn...
Predicting accurate absolute binding energies in aqueous solution: thermodyn...molmodbasics
 
IB Chemistry Equilibrium constant, Kc and Reaction quotient, Qc.
IB Chemistry Equilibrium constant, Kc and  Reaction quotient, Qc.IB Chemistry Equilibrium constant, Kc and  Reaction quotient, Qc.
IB Chemistry Equilibrium constant, Kc and Reaction quotient, Qc.Lawrence kok
 
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...ChemAxon
 
10 problems in nmr
10 problems in nmr10 problems in nmr
10 problems in nmrUday Deokate
 
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...NextMove Software
 
2006-01-0240
2006-01-02402006-01-0240
2006-01-0240Emad Amin
 
Qm Treatment AECOM Presentation
Qm Treatment AECOM PresentationQm Treatment AECOM Presentation
Qm Treatment AECOM PresentationCain Morano
 

Similar to A Quantum Chemist Meets Cheminformatics (19)

Biochip
BiochipBiochip
Biochip
 
some_other_API
some_other_APIsome_other_API
some_other_API
 
Using semiempirical methods for fast and automated predictions
Using semiempirical methods for fast and automated predictionsUsing semiempirical methods for fast and automated predictions
Using semiempirical methods for fast and automated predictions
 
LSSC2011 Optimization of intermolecular interaction potential energy paramete...
LSSC2011 Optimization of intermolecular interaction potential energy paramete...LSSC2011 Optimization of intermolecular interaction potential energy paramete...
LSSC2011 Optimization of intermolecular interaction potential energy paramete...
 
Understanding and predicting CO2 properties - Presentation by Richard Graham ...
Understanding and predicting CO2 properties - Presentation by Richard Graham ...Understanding and predicting CO2 properties - Presentation by Richard Graham ...
Understanding and predicting CO2 properties - Presentation by Richard Graham ...
 
Pseudo Random Number Generators
Pseudo Random Number GeneratorsPseudo Random Number Generators
Pseudo Random Number Generators
 
The PubChemQC Project
The PubChemQC ProjectThe PubChemQC Project
The PubChemQC Project
 
General Cryptography :WHY JOHNNY THE DEVELOPER CAN’T WORK WITH PUBLIC KEY CER...
General Cryptography :WHY JOHNNY THE DEVELOPER CAN’T WORK WITH PUBLIC KEY CER...General Cryptography :WHY JOHNNY THE DEVELOPER CAN’T WORK WITH PUBLIC KEY CER...
General Cryptography :WHY JOHNNY THE DEVELOPER CAN’T WORK WITH PUBLIC KEY CER...
 
Kobeworkshop pubchemqc project
Kobeworkshop pubchemqc projectKobeworkshop pubchemqc project
Kobeworkshop pubchemqc project
 
Predicting accurate absolute binding energies in aqueous solution: thermodyn...
Predicting accurate absolute binding energies in aqueous solution: thermodyn...Predicting accurate absolute binding energies in aqueous solution: thermodyn...
Predicting accurate absolute binding energies in aqueous solution: thermodyn...
 
Avinash_PPT
Avinash_PPTAvinash_PPT
Avinash_PPT
 
RDKit Gems
RDKit GemsRDKit Gems
RDKit Gems
 
IB Chemistry Equilibrium constant, Kc and Reaction quotient, Qc.
IB Chemistry Equilibrium constant, Kc and  Reaction quotient, Qc.IB Chemistry Equilibrium constant, Kc and  Reaction quotient, Qc.
IB Chemistry Equilibrium constant, Kc and Reaction quotient, Qc.
 
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
 
10 problems in nmr
10 problems in nmr10 problems in nmr
10 problems in nmr
 
PhD work
PhD workPhD work
PhD work
 
Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...Can we agree on the structure represented by a SMILES string? A benchmark dat...
Can we agree on the structure represented by a SMILES string? A benchmark dat...
 
2006-01-0240
2006-01-02402006-01-0240
2006-01-0240
 
Qm Treatment AECOM Presentation
Qm Treatment AECOM PresentationQm Treatment AECOM Presentation
Qm Treatment AECOM Presentation
 

More from molmodbasics

xyz2mol for organometallic compounds
xyz2mol for organometallic compoundsxyz2mol for organometallic compounds
xyz2mol for organometallic compoundsmolmodbasics
 
Chemical Space Exploration
Chemical Space ExplorationChemical Space Exploration
Chemical Space Explorationmolmodbasics
 
ChemRxiv, Plan S, and OA publishing
ChemRxiv, Plan S, and OA publishingChemRxiv, Plan S, and OA publishing
ChemRxiv, Plan S, and OA publishingmolmodbasics
 
Proteiner du kan regne med
Proteiner du kan regne med Proteiner du kan regne med
Proteiner du kan regne med molmodbasics
 
Jan H. Jensen: profile
Jan H. Jensen: profileJan H. Jensen: profile
Jan H. Jensen: profilemolmodbasics
 
Can semiempirical methods be used for high throughput screening (for enzyme m...
Can semiempirical methods be used for high throughput screening (for enzyme m...Can semiempirical methods be used for high throughput screening (for enzyme m...
Can semiempirical methods be used for high throughput screening (for enzyme m...molmodbasics
 
Thermodynamics for Biochemists: a YouTube textbook
Thermodynamics for Biochemists: a YouTube textbookThermodynamics for Biochemists: a YouTube textbook
Thermodynamics for Biochemists: a YouTube textbookmolmodbasics
 
Teaching Tools and Tips
Teaching Tools and TipsTeaching Tools and Tips
Teaching Tools and Tipsmolmodbasics
 
Short answer questions on thermodynamics
Short answer questions on thermodynamicsShort answer questions on thermodynamics
Short answer questions on thermodynamicsmolmodbasics
 
Different kinds of peer instruction questions for thermodynamics
Different kinds of peer instruction questions for thermodynamicsDifferent kinds of peer instruction questions for thermodynamics
Different kinds of peer instruction questions for thermodynamicsmolmodbasics
 
Teaching Tools and Tips
Teaching Tools and TipsTeaching Tools and Tips
Teaching Tools and Tipsmolmodbasics
 
Quantum Biochemistry: the rise of semiempirical methods
Quantum Biochemistry: the rise of semiempirical methodsQuantum Biochemistry: the rise of semiempirical methods
Quantum Biochemistry: the rise of semiempirical methodsmolmodbasics
 
Peer instruction questions on thermodynamics part 1
Peer instruction questions on thermodynamics part 1Peer instruction questions on thermodynamics part 1
Peer instruction questions on thermodynamics part 1molmodbasics
 
Protein structure determination & refinement using QM-derived chemical shifts
Protein structure determination & refinement using QM-derived chemical shiftsProtein structure determination & refinement using QM-derived chemical shifts
Protein structure determination & refinement using QM-derived chemical shiftsmolmodbasics
 
The 2013 Bjerrum-Brønsted-Lang Lecture: quantum biochemistry and the rise of ...
The 2013 Bjerrum-Brønsted-Lang Lecture: quantum biochemistry and the rise of ...The 2013 Bjerrum-Brønsted-Lang Lecture: quantum biochemistry and the rise of ...
The 2013 Bjerrum-Brønsted-Lang Lecture: quantum biochemistry and the rise of ...molmodbasics
 
Quantum biochemistry
Quantum biochemistryQuantum biochemistry
Quantum biochemistrymolmodbasics
 

More from molmodbasics (20)

xyz2mol for organometallic compounds
xyz2mol for organometallic compoundsxyz2mol for organometallic compounds
xyz2mol for organometallic compounds
 
Chemical Space Exploration
Chemical Space ExplorationChemical Space Exploration
Chemical Space Exploration
 
ChemRxiv, Plan S, and OA publishing
ChemRxiv, Plan S, and OA publishingChemRxiv, Plan S, and OA publishing
ChemRxiv, Plan S, and OA publishing
 
Open is Better
Open is BetterOpen is Better
Open is Better
 
Proteiner du kan regne med
Proteiner du kan regne med Proteiner du kan regne med
Proteiner du kan regne med
 
Jan H. Jensen: profile
Jan H. Jensen: profileJan H. Jensen: profile
Jan H. Jensen: profile
 
Why I blog
Why I blogWhy I blog
Why I blog
 
Why I tweet
Why I tweetWhy I tweet
Why I tweet
 
Can semiempirical methods be used for high throughput screening (for enzyme m...
Can semiempirical methods be used for high throughput screening (for enzyme m...Can semiempirical methods be used for high throughput screening (for enzyme m...
Can semiempirical methods be used for high throughput screening (for enzyme m...
 
Thermodynamics for Biochemists: a YouTube textbook
Thermodynamics for Biochemists: a YouTube textbookThermodynamics for Biochemists: a YouTube textbook
Thermodynamics for Biochemists: a YouTube textbook
 
I lecture nomore
I lecture nomoreI lecture nomore
I lecture nomore
 
Teaching Tools and Tips
Teaching Tools and TipsTeaching Tools and Tips
Teaching Tools and Tips
 
Short answer questions on thermodynamics
Short answer questions on thermodynamicsShort answer questions on thermodynamics
Short answer questions on thermodynamics
 
Different kinds of peer instruction questions for thermodynamics
Different kinds of peer instruction questions for thermodynamicsDifferent kinds of peer instruction questions for thermodynamics
Different kinds of peer instruction questions for thermodynamics
 
Teaching Tools and Tips
Teaching Tools and TipsTeaching Tools and Tips
Teaching Tools and Tips
 
Quantum Biochemistry: the rise of semiempirical methods
Quantum Biochemistry: the rise of semiempirical methodsQuantum Biochemistry: the rise of semiempirical methods
Quantum Biochemistry: the rise of semiempirical methods
 
Peer instruction questions on thermodynamics part 1
Peer instruction questions on thermodynamics part 1Peer instruction questions on thermodynamics part 1
Peer instruction questions on thermodynamics part 1
 
Protein structure determination & refinement using QM-derived chemical shifts
Protein structure determination & refinement using QM-derived chemical shiftsProtein structure determination & refinement using QM-derived chemical shifts
Protein structure determination & refinement using QM-derived chemical shifts
 
The 2013 Bjerrum-Brønsted-Lang Lecture: quantum biochemistry and the rise of ...
The 2013 Bjerrum-Brønsted-Lang Lecture: quantum biochemistry and the rise of ...The 2013 Bjerrum-Brønsted-Lang Lecture: quantum biochemistry and the rise of ...
The 2013 Bjerrum-Brønsted-Lang Lecture: quantum biochemistry and the rise of ...
 
Quantum biochemistry
Quantum biochemistryQuantum biochemistry
Quantum biochemistry
 

Recently uploaded

THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxANSARKHAN96
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxseri bangash
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxSilpa
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.Silpa
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptxSilpa
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLkantirani197
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfSumit Kumar yadav
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Silpa
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body Areesha Ahmad
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxSilpa
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Silpa
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 

Recently uploaded (20)

THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 

A Quantum Chemist Meets Cheminformatics

  • 1. A Quantum Chemist Meets Cheminformatics @janhjensen Jan H. Jensen, University of Copenhagen Feel free to tweet, record, … #WATOC
  • 3. N H N + Br+ -H+ N H N N H N N H N Br Br Br or ? N H N N H N H H H N H N H N H N 169.4182.2 181.1 H HH H PM3/COSMOheat of formation Kcal/mol 90% success rate for 520 compounds
  • 4. Workflow / Automation Molecule Protonated isomers Conformational search Find lowest energy isomer Display result Check for proton transfer ChemDraw RDKit SMILES c1cnc(cc1c1n(c(c(n1)c1ccc(cc1Cl)Cl)C(=O)OC)COCC[Si](C)(C)C)NC(=O)C RDKitRDKit N N N Cl Cl O O O Si N H O 1 RegioSQM 23 4 5 6 6 isomers x 20 confs SMILES github.com/jensengroup/RegioSQM
  • 6. xyz2mol Molecule Protonated isomers Conformational search Find lowest energy isomer Display result Check for proton transfer ChemDraw RDKit SMILES RDKitRDKit xyz2mol 6 isomers x 20 confs N N N Cl Cl O O O Si N H O N N N Cl Cl O O O Si N H O xyx2mol converts an xyz file to an RDKit mol object (needs the molecular charge) github.com/jensengroup/xyz2mol
  • 9.
  • 10.
  • 11. Unpublished: finding molecules that absorb at 600 nm Mating pool size: 20, mutation rate: 0.05, sTDA-xTB, 10 runs Starting from random molecule in the ZINC data base O N NH O O F O NH2 N O NH N H O O N N N NH2 Does it work? H N O N N N O O O N O
  • 12. Summary/Outlook RDKit changed my research life Quantum chemical studies need to be automated Manual work replacing CPU power as rate limiting step Mistakes become increasingly common QM students need to learn Python/RDKit “QM-needs” for RDKit Conf search for finding global minimum Generalized Born solvation model