AI in Science Research
How can modern AI help to push the boundary of science
Ding Li 2022.1
MATHEMATICS
AI Aids Intuition in Mathematical Discovery
The cycle of developing mathematical theories by
studying examples.
• After recognizing a possible pattern in the properties of
mathematical objects, such as convex polyhedra (3D
shapes with flat faces, straight edges and vertices that all
point outwards), mathematicians typically go through a
cycle to understand this pattern.
• They first compute the properties of some simple
examples and analyze the possible relationships
between these properties.
• The researchers then refine these relationships. For
example, they might come up with Euler’s polyhedron
formula, which posits that the number of vertices (V)
minus the number of edges (E) plus the number of faces
(F) of a convex polyhedron is always equal to two:
V − E + F = 2.
• They then test this suggested relationship on more
complicated examples, discard irrelevant properties and
attempt to understand why the relationship holds. If it
remains unclear, mathematicians then consider different
examples, and the cycle continues.
• Davies et al. show that machine-learning techniques
can help researchers with the refinement step, which
usually relies strongly on human intuition.
Stump 2021
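The "compute the properties of some simple examples" step can be illustrated with a minimal sketch that checks Euler's formula on the five Platonic solids. The vertex, edge, and face counts are the standard values; the function name `euler_characteristic` is chosen here for illustration and does not come from the slides.

```python
# Vertex, edge, and face counts (V, E, F) for the five Platonic solids.
PLATONIC_SOLIDS = {
    "tetrahedron": (4, 6, 4),
    "cube": (8, 12, 6),
    "octahedron": (6, 12, 8),
    "dodecahedron": (20, 30, 12),
    "icosahedron": (12, 30, 20),
}


def euler_characteristic(vertices: int, edges: int, faces: int) -> int:
    """Return V - E + F for a polyhedron."""
    return vertices - edges + faces


for name, (v, e, f) in PLATONIC_SOLIDS.items():
    # Euler's polyhedron formula says this should be 2 for every convex polyhedron.
    print(f"{name}: V - E + F = {euler_characteristic(v, e, f)}")
```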
Advancing mathematics by guiding human intuition with AI Davies 2021
As an illustrative example: let z be a convex polyhedron, let
X(z) ∈ ℤ² × ℝ² be the number of vertices and edges of z together with its
volume and surface area, and let Y(z) ∈ ℤ be the number of faces of z.
Euler’s formula states that there is an exact relationship between X(z)
and Y(z) in this case: X(z) · (−1, 1, 0, 0) + 2 = Y(z).
The framework helps guide the intuition of mathematicians in two
ways: by verifying the hypothesized existence of structure/patterns in
mathematical objects through the use of supervised machine learning;
and by helping in the understanding of these patterns through the use
of attribution techniques.
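The two steps of the framework can be sketched on the polyhedron example. This is a toy illustration, not the method of Davies et al.: it assumes a plain linear model fitted by least squares in place of a neural network, uses n-gon prisms and pyramids as the examples, and fills in volume and surface area with random placeholder numbers rather than real geometry. For a linear model the gradient of the prediction with respect to the input is simply the weight vector, so the weights play the role of the attribution.

```python
import numpy as np

rng = np.random.default_rng(0)


def prism(n):
    """n-gon prism: V = 2n, E = 3n, F = n + 2."""
    return 2 * n, 3 * n, n + 2


def pyramid(n):
    """n-gon pyramid: V = n + 1, E = 2n, F = n + 1."""
    return n + 1, 2 * n, n + 1


rows, faces = [], []
for n in range(3, 11):
    for v, e, f in (prism(n), pyramid(n)):
        # Placeholder volume and surface area (random values, NOT real geometry).
        volume, area = rng.uniform(1.0, 10.0, size=2)
        rows.append([v, e, volume, area])
        faces.append(f)

X = np.array(rows, dtype=float)   # X(z) = (V, E, volume, area)
Y = np.array(faces, dtype=float)  # Y(z) = F

# Supervised step: fit Y ≈ X · w + b by ordinary least squares.
design = np.hstack([X, np.ones((len(X), 1))])
coef, *_ = np.linalg.lstsq(design, Y, rcond=None)
w, b = coef[:4], coef[4]

# Attribution step: for a linear model dY/dX equals w, so near-zero
# entries mark features (volume, area) that do not influence the prediction.
print("weights:", np.round(w, 6), "bias:", round(float(b), 6))
```

Under these assumptions the fit recovers weights close to (−1, 1, 0, 0) and a bias close to 2, which is Euler's formula rearranged for F.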
Quantum
Chemistry
Pushing the Frontiers of Density Functionals
by Solving the Fractional Electron Problem
Kirkpatrick 2021
• Computing electronic energies underpins theoretical chemistry and materials science, and
density functional theory (DFT) promises an exact and efficient approach.
• But the approach has limitations and is known to give wrong results for certain types of
molecules.
• “It’s sort of the ideal problem for machine learning: you know the answer, but not the
formula you want to apply.”
• The functional was evaluated by integrating local energies computed by a multilayer
perceptron (MLP), which took as input both local and nonlocal features of the occupied
Kohn-Sham (KS) orbitals and can be described as a local range-separated hybrid.
• To train the functional, the sum of two objective functions was used: a regression loss and a
gradient regularization term that ensured that the functional derivatives can be used in
self-consistent field (SCF) calculations after training.
Castelvecchi 2021
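The phrase "integrating local energies computed by a multilayer perceptron" can be made concrete with a toy sketch. Everything below is an assumption for illustration: the grid size, the feature dimension, the random features and quadrature weights, and the untrained two-layer network. It is not the functional from Kirkpatrick et al. and it omits the training objective entirely; it only shows the shape of the computation, a weighted sum of per-grid-point MLP outputs.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: grid points, features per point, hidden units.
N_GRID, N_FEATURES, N_HIDDEN = 200, 8, 16

# Placeholder inputs: per-grid-point features and quadrature weights.
features = rng.normal(size=(N_GRID, N_FEATURES))
quad_weights = rng.uniform(0.0, 1.0, size=N_GRID)

# Randomly initialised (untrained) two-layer MLP parameters.
W1 = rng.normal(scale=0.1, size=(N_FEATURES, N_HIDDEN))
b1 = np.zeros(N_HIDDEN)
W2 = rng.normal(scale=0.1, size=(N_HIDDEN, 1))
b2 = np.zeros(1)


def local_energy(feats):
    """MLP mapping each grid point's features to one local energy value."""
    hidden = np.tanh(feats @ W1 + b1)
    return (hidden @ W2 + b2)[:, 0]


def total_energy(feats, weights):
    """Numerical integration: quadrature-weighted sum of local energies."""
    return float(np.sum(weights * local_energy(feats)))


energy = total_energy(features, quad_weights)
print("toy integrated energy:", energy)
```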
BIOLOGY
Primary Structure
Amino acids (20)
Peptide bond
Secondary Structure
Tertiary Structure
Quaternary Structure
Multiple Sequence Alignments (MSA), Nseq × Nres
• Evolutionary constraints
• MSA clustering
• Cluster deletion
• Evolutionary correlations
Pairwise Features, Nres × Nres
• Physical and geometric constraints
• Target feat (amino acids), residue index
• Structural templates
• Template distogram
Near experimental accuracy in
most cases for CASP14 assessment
(May-July 2020)
Jumper 2021 GitHub
AlphaFold Protein Structure Database (JAK2)
Blog
Colab
UniProt (JAK2)
A BERT-style transformer was applied to predict randomly masked
individual residues within the MSA, which encourages the network to
learn to interpret phylogenetic and covariation relationships without
hardcoding a particular correlation statistic into the features.
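The masking step of such a BERT-style objective can be sketched as follows. This is a minimal illustration, not AlphaFold's implementation: the token indices, the `mask_msa` helper, the toy random alignment, and the masking rate passed in are all assumptions made for the example.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
GAP_TOKEN = len(AMINO_ACIDS)          # index 20: alignment gap (assumed encoding)
MASK_TOKEN = len(AMINO_ACIDS) + 1     # index 21: hypothetical mask token


def mask_msa(msa, mask_rate, rng):
    """Hide a random subset of MSA positions behind MASK_TOKEN.

    Returns (masked_msa, mask), where mask is True at hidden positions.
    """
    mask = rng.random(msa.shape) < mask_rate
    masked = np.where(mask, MASK_TOKEN, msa)
    return masked, mask


rng = np.random.default_rng(0)
n_seq, n_res = 6, 12
# Toy random alignment of shape (Nseq, Nres); real MSAs come from sequence search.
msa = rng.integers(0, GAP_TOKEN + 1, size=(n_seq, n_res))
masked_msa, mask = mask_msa(msa, mask_rate=0.15, rng=rng)

# The network would be trained to predict these original residues
# from the masked alignment.
targets = msa[mask]
print("masked positions:", int(mask.sum()), "of", msa.size)
```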
The MSA and pair representations exchange information iteratively
to enable direct reasoning about
the spatial and evolutionary
relationships in the protein.
Combination of the bioinformatics and physical approaches
We hope that AlphaFold—and computational approaches that apply its techniques
for other biophysical problems—will become essential tools of modern biology.
“Do not quench your inspiration
and your imagination; do not
become the slave of your
model.”
– Vincent van Gogh
