SlideShare a Scribd company logo
1 of 28
Alphafold2 - Protein Structural
Bioinformatics After CASP14
Daisuke Kihara
Department of Biological Sciences, Computer Science
Purdue University
https://kiharalab.org
How it started: CASP13 – CASP14
1
2
Alphafold1 (CASP13) & Alphafold2 (CAPSP14)
3
Sum of the Z-score of GDT-TS
November 30, 2020
Alphafold2:
Differences and Novelty
2
4
Alphafold1 (CASP13)
5
(Jumper J et al., Nature 2020)
Jan. 15, 2020
Incomplete code release at Github
6
(From CASP13 presentations by Jumper & Senior)
trRosetta
7
Submitted: Aug. 22, 2019
Appeared: Jan. 2, 2020
(Yang et al., PNAS, 2020)
AttentiveDist (Kiharalab)
8
Submitted (& bioRxiv) : Nov. 25, 2020
Appeared: April 7, 2021
(Jain A, et al., Scientific Reports, 2021)
1D Features:
1-hot encoding (20 aa)
PSSM, HMM
Predicted 2ndary structures & ASA (Spot-1D)
2D Features:
Predicted contacts (CCMPred)
Mutual Information
Statistical pairwise potential
CASP13, 43 FM and FM/TBM targets
Alphafold2
9
April 7, 2021
(Jumper J et al., Nature 2021)
Input Sequence Embedding Structure building
(Inference) code at Github & well-documented Suppl. Info. Released
End-to-End
Input MSA
◉ DB Search
○ JackHMMER > Mgnify (Metagenome DB)
○ JackHMMER > Uniref90
○ Hhblits > Uniclust30 + BFD (Metagenome DB)
○ Above are simply stacked.
◉ MSA processing (to reduce the memory usage)
○ MSA block deletion (MSA-level dropout)
○ Nseq sequences are randomly selected as MSA cluster centers
○ 15% are masked
○ Among masked: 10%, replaced to random amino acids; 10%,
replaced from MSA profile; 10%, unchanged; 70%: targets for
prediction.
○ Nextra_Seq selected for extra MSA sequences
10
11
Processing Input Embedding
12
Evoformer Block (Sequence Embedding)
“tri-amino acid contact (potential)”
i
j
k
“MSA Transformer”
13
Structure Module (IPA)
14
Structure Module (IPA)
8 blocks (shared weights)
Model Training & Loss Function
◉ Recycling (training & inference)
◉ MSA resampling (training & inference)
◉ Noisy Student Self-distillation
◉ Loss Function
15
Atom
coordinates
Main-chain atom
coord. & angles Dist-ogram
Predicted
error
If atoms
are
experiment
ally
determined
Crystal
structure
statistics
Masked
MSA
RoseTTafold
16
April 7, 2021
(Baek M et al., Science 2021)
Preparing input of SE(3)
“MSA transformer”
Time Course
3
17
CASP13
18
December 1-4, 2018
Alphafold US Patent
19
June 3, 2021
Alphafold2 Paper & Code Release
20
July 15, 2021
(Jumper J et al., Nature 2021)
Input Sequence Embedding Structure building
Alphafold Database
21
July 22, 2021
(Tunyasuvunakool K. et al., Nature 2021)
Database of protein models of 21 model organisms
Alphafold Use Case Reports (Twitter)
◉ Installation of the software
◉ Surprisingly accurate models of specific
proteins
◉ Modeling of protein complexes
○ With Gly-linker; X-linker, manipulating residue
number index
◉ Peptide docking
◉ Indication of disordered region, mutation..
22
July - August, 2021
Alphafold-Related Papers (166 papers on bioRxiv
since 12/01/2020)
23
July - August, 2021
Peptide docking
Protein-protein docking
Mutation
Protein Design
Folding Pathway
Molecular Replacement
Application
ColabFold
24
August 15, 2021
(Mirdita M et al., bioRxiv 2021)
Future Directions …
4
25
Future Directions for Structural
Bioinformatics works after Alphafold2
◉ Reuse the AF2 code
○ Docking, interaction
○ Data-assisted protein structure modeling
○ Protein design
○ Structure modeling of other molecules
◉ Application of AF2 structure models
○ Drug screening
○ Function (binding pocket classification ..)
◉ Explore new topics
○ Cryo-ET
○ Dynamics 26
Alphafold2 as BLAST
◉ Use Alphafold2 as the base-line tool, just like
BLAST in sequence analysis
◉ Use it in a pipeline
◉ Modify the code, concepts, techniques
27
Kihara Lab Research Team
28
https://kiharalab.org
@kiharalab @d_kihara
Sai Raghavendra Maddhuri
Venkata Subramaniya
Zicong Zhang
Special Thanks to Sai and Zicong for discussion

More Related Content

What's hot

Protein Structural predection
Protein Structural predectionProtein Structural predection
Protein Structural predectionSantu Chall
 
WGCNA: an R package for weighted correlation network analysis
WGCNA: an R package for weighted  correlation network analysisWGCNA: an R package for weighted  correlation network analysis
WGCNA: an R package for weighted correlation network analysisAlireza Doustmohammadi
 
Protein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural AlignmentProtein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural AlignmentSaramita De Chakravarti
 
De novo str_prediction
De novo str_predictionDe novo str_prediction
De novo str_predictionShwetA Kumari
 
NGS - Basic principles and sequencing platforms
NGS - Basic principles and sequencing platformsNGS - Basic principles and sequencing platforms
NGS - Basic principles and sequencing platformsAnnelies Haegeman
 
PAM : Point Accepted Mutation
PAM : Point Accepted MutationPAM : Point Accepted Mutation
PAM : Point Accepted MutationAmit Kyada
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentationRida Khalid
 
FindMod
FindModFindMod
FindModSobia
 
Protein databases
Protein databasesProtein databases
Protein databasessarumalay
 
Mass Spectrometry Based Proteomics Analysis
Mass Spectrometry Based Proteomics AnalysisMass Spectrometry Based Proteomics Analysis
Mass Spectrometry Based Proteomics AnalysisSijo A
 
Machine Learning in Bioinformatics
Machine Learning in BioinformaticsMachine Learning in Bioinformatics
Machine Learning in BioinformaticsDmytro Fishman
 
Biotechnology information system in india (btis net)
Biotechnology information system in india (btis net)Biotechnology information system in india (btis net)
Biotechnology information system in india (btis net)KAUSHAL SAHU
 
Methods of Protein structure determination
Methods of  Protein structure determination Methods of  Protein structure determination
Methods of Protein structure determination EL Sayed Sabry
 

What's hot (20)

Protein Structural predection
Protein Structural predectionProtein Structural predection
Protein Structural predection
 
WGCNA: an R package for weighted correlation network analysis
WGCNA: an R package for weighted  correlation network analysisWGCNA: an R package for weighted  correlation network analysis
WGCNA: an R package for weighted correlation network analysis
 
Protein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural AlignmentProtein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural Alignment
 
MASCOT
MASCOTMASCOT
MASCOT
 
NGS - QC & Dataformat
NGS - QC & Dataformat NGS - QC & Dataformat
NGS - QC & Dataformat
 
De novo str_prediction
De novo str_predictionDe novo str_prediction
De novo str_prediction
 
NGS - Basic principles and sequencing platforms
NGS - Basic principles and sequencing platformsNGS - Basic principles and sequencing platforms
NGS - Basic principles and sequencing platforms
 
PAM : Point Accepted Mutation
PAM : Point Accepted MutationPAM : Point Accepted Mutation
PAM : Point Accepted Mutation
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
FindMod
FindModFindMod
FindMod
 
Proteomics
ProteomicsProteomics
Proteomics
 
Reactome Pathways Portal
Reactome Pathways PortalReactome Pathways Portal
Reactome Pathways Portal
 
Protein databases
Protein databasesProtein databases
Protein databases
 
String.pptx
String.pptxString.pptx
String.pptx
 
Mass Spectrometry Based Proteomics Analysis
Mass Spectrometry Based Proteomics AnalysisMass Spectrometry Based Proteomics Analysis
Mass Spectrometry Based Proteomics Analysis
 
Machine Learning in Bioinformatics
Machine Learning in BioinformaticsMachine Learning in Bioinformatics
Machine Learning in Bioinformatics
 
Protein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modelingProtein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modeling
 
I-Tasser
I-TasserI-Tasser
I-Tasser
 
Biotechnology information system in india (btis net)
Biotechnology information system in india (btis net)Biotechnology information system in india (btis net)
Biotechnology information system in india (btis net)
 
Methods of Protein structure determination
Methods of  Protein structure determination Methods of  Protein structure determination
Methods of Protein structure determination
 

Similar to Alphafold2 - Protein Structural Bioinformatics After CASP14

Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationKAMAL CHOUDHARY
 
NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignKAMAL CHOUDHARY
 
Thesis def
Thesis defThesis def
Thesis defJay Vyas
 
Graphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials ScienceGraphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials Scienceaimsnist
 
RAD: Christian J. Stockert Jr, Junmin Liu
RAD: Christian J. Stockert Jr, Junmin LiuRAD: Christian J. Stockert Jr, Junmin Liu
RAD: Christian J. Stockert Jr, Junmin Liuniranabey
 
Space Technologies for Experimental Astronomy
Space Technologies for Experimental AstronomySpace Technologies for Experimental Astronomy
Space Technologies for Experimental AstronomyFrancesco Lazzarotto
 
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...ijbbjournal
 
Machine Learning for Molecules
Machine Learning for MoleculesMachine Learning for Molecules
Machine Learning for MoleculesIchigaku Takigawa
 
CIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptxCIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptxFatma Sayed Ibrahim
 
(Very) Basic graphing with R
(Very) Basic graphing with R(Very) Basic graphing with R
(Very) Basic graphing with RKazuki Yoshida
 
Kiharalab Bioinformatics Projects 2019
Kiharalab Bioinformatics Projects 2019Kiharalab Bioinformatics Projects 2019
Kiharalab Bioinformatics Projects 2019Purdue University
 
The TIR revolution in plant stress biology
The TIR revolution in plant stress biologyThe TIR revolution in plant stress biology
The TIR revolution in plant stress biologyDmitryLapin2
 
Joe parker-benchmarking-bioinformatics
Joe parker-benchmarking-bioinformaticsJoe parker-benchmarking-bioinformatics
Joe parker-benchmarking-bioinformaticsJoe Parker
 
Error of Multileaf collimator prediction using recurrent neural network (LSTM)
Error of Multileaf collimator prediction using recurrent neural network (LSTM)Error of Multileaf collimator prediction using recurrent neural network (LSTM)
Error of Multileaf collimator prediction using recurrent neural network (LSTM)WonjoongCheon
 
2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible researchYannick Wurm
 
A North-South partnership Identifies Genetic Biodiversity for Durable Rust Re...
A North-South partnership Identifies Genetic Biodiversity for Durable Rust Re...A North-South partnership Identifies Genetic Biodiversity for Durable Rust Re...
A North-South partnership Identifies Genetic Biodiversity for Durable Rust Re...Borlaug Global Rust Initiative
 

Similar to Alphafold2 - Protein Structural Bioinformatics After CASP14 (20)

Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum Computation
 
NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials Design
 
Thesis def
Thesis defThesis def
Thesis def
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
 
Graphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials ScienceGraphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials Science
 
RAD: Christian J. Stockert Jr, Junmin Liu
RAD: Christian J. Stockert Jr, Junmin LiuRAD: Christian J. Stockert Jr, Junmin Liu
RAD: Christian J. Stockert Jr, Junmin Liu
 
Resume
ResumeResume
Resume
 
Space Technologies for Experimental Astronomy
Space Technologies for Experimental AstronomySpace Technologies for Experimental Astronomy
Space Technologies for Experimental Astronomy
 
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
 
Machine Learning for Molecules
Machine Learning for MoleculesMachine Learning for Molecules
Machine Learning for Molecules
 
CIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptxCIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptx
 
(Very) Basic graphing with R
(Very) Basic graphing with R(Very) Basic graphing with R
(Very) Basic graphing with R
 
A*STAR Webinar on The AI Revolution in Materials Science
A*STAR Webinar on The AI Revolution in Materials ScienceA*STAR Webinar on The AI Revolution in Materials Science
A*STAR Webinar on The AI Revolution in Materials Science
 
Kiharalab Bioinformatics Projects 2019
Kiharalab Bioinformatics Projects 2019Kiharalab Bioinformatics Projects 2019
Kiharalab Bioinformatics Projects 2019
 
3302 3305
3302 33053302 3305
3302 3305
 
The TIR revolution in plant stress biology
The TIR revolution in plant stress biologyThe TIR revolution in plant stress biology
The TIR revolution in plant stress biology
 
Joe parker-benchmarking-bioinformatics
Joe parker-benchmarking-bioinformaticsJoe parker-benchmarking-bioinformatics
Joe parker-benchmarking-bioinformatics
 
Error of Multileaf collimator prediction using recurrent neural network (LSTM)
Error of Multileaf collimator prediction using recurrent neural network (LSTM)Error of Multileaf collimator prediction using recurrent neural network (LSTM)
Error of Multileaf collimator prediction using recurrent neural network (LSTM)
 
2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research
 
A North-South partnership Identifies Genetic Biodiversity for Durable Rust Re...
A North-South partnership Identifies Genetic Biodiversity for Durable Rust Re...A North-South partnership Identifies Genetic Biodiversity for Durable Rust Re...
A North-South partnership Identifies Genetic Biodiversity for Durable Rust Re...
 

More from Purdue University

CASP14 Data Assisted Modeling (KIharalab)
CASP14 Data Assisted Modeling (KIharalab)CASP14 Data Assisted Modeling (KIharalab)
CASP14 Data Assisted Modeling (KIharalab)Purdue University
 
Predicting Assembly Order of Multimeric Protein Complexes
Predicting Assembly Order of Multimeric Protein ComplexesPredicting Assembly Order of Multimeric Protein Complexes
Predicting Assembly Order of Multimeric Protein ComplexesPurdue University
 
Structure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsStructure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsPurdue University
 
DextMP: Text mining for finding moonlighting proteins
DextMP: Text mining for finding moonlighting proteinsDextMP: Text mining for finding moonlighting proteins
DextMP: Text mining for finding moonlighting proteinsPurdue University
 
Discovery of Ligand-Protein Interactome
Discovery of Ligand-Protein InteractomeDiscovery of Ligand-Protein Interactome
Discovery of Ligand-Protein InteractomePurdue University
 
Kihara Lab protein structure prediction performance in CASP11
Kihara Lab protein structure prediction performance in CASP11Kihara Lab protein structure prediction performance in CASP11
Kihara Lab protein structure prediction performance in CASP11Purdue University
 
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016Purdue University
 
Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016Purdue University
 
Flexscore: Ensemble-based evaluation for protein Structure models
Flexscore: Ensemble-based evaluation for protein Structure modelsFlexscore: Ensemble-based evaluation for protein Structure models
Flexscore: Ensemble-based evaluation for protein Structure modelsPurdue University
 

More from Purdue University (9)

CASP14 Data Assisted Modeling (KIharalab)
CASP14 Data Assisted Modeling (KIharalab)CASP14 Data Assisted Modeling (KIharalab)
CASP14 Data Assisted Modeling (KIharalab)
 
Predicting Assembly Order of Multimeric Protein Complexes
Predicting Assembly Order of Multimeric Protein ComplexesPredicting Assembly Order of Multimeric Protein Complexes
Predicting Assembly Order of Multimeric Protein Complexes
 
Structure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsStructure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein Interactions
 
DextMP: Text mining for finding moonlighting proteins
DextMP: Text mining for finding moonlighting proteinsDextMP: Text mining for finding moonlighting proteins
DextMP: Text mining for finding moonlighting proteins
 
Discovery of Ligand-Protein Interactome
Discovery of Ligand-Protein InteractomeDiscovery of Ligand-Protein Interactome
Discovery of Ligand-Protein Interactome
 
Kihara Lab protein structure prediction performance in CASP11
Kihara Lab protein structure prediction performance in CASP11Kihara Lab protein structure prediction performance in CASP11
Kihara Lab protein structure prediction performance in CASP11
 
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
 
Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016
 
Flexscore: Ensemble-based evaluation for protein Structure models
Flexscore: Ensemble-based evaluation for protein Structure modelsFlexscore: Ensemble-based evaluation for protein Structure models
Flexscore: Ensemble-based evaluation for protein Structure models
 

Recently uploaded

FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayupadhyaymani499
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPirithiRaju
 
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxSulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxnoordubaliya2003
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxFarihaAbdulRasheed
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 

Recently uploaded (20)

FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyay
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdf
 
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptxSulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
Sulphur & Phosphrus Cycle PowerPoint Presentation (2) [Autosaved]-3-1.pptx
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 

Alphafold2 - Protein Structural Bioinformatics After CASP14

  • 1. Alphafold2 - Protein Structural Bioinformatics After CASP14 Daisuke Kihara Department of Biological Sciences, Computer Science Purdue University https://kiharalab.org
  • 2. How it started: CASP13 – CASP14 1 2
  • 3. Alphafold1 (CASP13) & Alphafold2 (CAPSP14) 3 Sum of the Z-score of GDT-TS November 30, 2020
  • 5. Alphafold1 (CASP13) 5 (Jumper J et al., Nature 2020) Jan. 15, 2020 Incomplete code release at Github
  • 6. 6 (From CASP13 presentations by Jumper & Senior)
  • 7. trRosetta 7 Submitted: Aug. 22, 2019 Appeared: Jan. 2, 2020 (Yang et al., PNAS, 2020)
  • 8. AttentiveDist (Kiharalab) 8 Submitted (& bioRxiv) : Nov. 25, 2020 Appeared: April 7, 2021 (Jain A, et al., Scientific Reports, 2021) 1D Features: 1-hot encoding (20 aa) PSSM, HMM Predicted 2ndary structures & ASA (Spot-1D) 2D Features: Predicted contacts (CCMPred) Mutual Information Statistical pairwise potential CASP13, 43 FM and FM/TBM targets
  • 9. Alphafold2 9 April 7, 2021 (Jumper J et al., Nature 2021) Input Sequence Embedding Structure building (Inference) code at Github & well-documented Suppl. Info. Released End-to-End
  • 10. Input MSA ◉ DB Search ○ JackHMMER > Mgnify (Metagenome DB) ○ JackHMMER > Uniref90 ○ Hhblits > Uniclust30 + BFD (Metagenome DB) ○ Above are simply stacked. ◉ MSA processing (to reduce the memory usage) ○ MSA block deletion (MSA-level dropout) ○ Nseq sequences are randomly selected as MSA cluster centers ○ 15% are masked ○ Among masked: 10%, replaced to random amino acids; 10%, replaced from MSA profile; 10%, unchanged; 70%: targets for prediction. ○ Nextra_Seq selected for extra MSA sequences 10
  • 12. 12 Evoformer Block (Sequence Embedding) “tri-amino acid contact (potential)” i j k “MSA Transformer”
  • 14. 14 Structure Module (IPA) 8 blocks (shared weights)
  • 15. Model Training & Loss Function ◉ Recycling (training & inference) ◉ MSA resampling (training & inference) ◉ Noisy Student Self-distillation ◉ Loss Function 15 Atom coordinates Main-chain atom coord. & angles Dist-ogram Predicted error If atoms are experiment ally determined Crystal structure statistics Masked MSA
  • 16. RoseTTafold 16 April 7, 2021 (Baek M et al., Science 2021) Preparing input of SE(3) “MSA transformer”
  • 20. Alphafold2 Paper & Code Release 20 July 15, 2021 (Jumper J et al., Nature 2021) Input Sequence Embedding Structure building
  • 21. Alphafold Database 21 July 22, 2021 (Tunyasuvunakool K. et al., Nature 2021) Database of protein models of 21 model organisms
  • 22. Alphafold Use Case Reports (Twitter) ◉ Installation of the software ◉ Surprisingly accurate models of specific proteins ◉ Modeling of protein complexes ○ With Gly-linker; X-linker, manipulating residue number index ◉ Peptide docking ◉ Indication of disordered region, mutation.. 22 July - August, 2021
  • 23. Alphafold-Related Papers (166 papers on bioRxiv since 12/01/2020) 23 July - August, 2021 Peptide docking Protein-protein docking Mutation Protein Design Folding Pathway Molecular Replacement Application
  • 24. ColabFold 24 August 15, 2021 (Mirdita M et al., bioRxiv 2021)
  • 26. Future Directions for Structural Bioinformatics works after Alphafold2 ◉ Reuse the AF2 code ○ Docking, interaction ○ Data-assisted protein structure modeling ○ Protein design ○ Structure modeling of other molecules ◉ Application of AF2 structure models ○ Drug screening ○ Function (binding pocket classification ..) ◉ Explore new topics ○ Cryo-ET ○ Dynamics 26
  • 27. Alphafold2 as BLAST ◉ Use Alphafold2 as the base-line tool, just like BLAST in sequence analysis ◉ Use it in a pipeline ◉ Modify the code, concepts, techniques 27
  • 28. Kihara Lab Research Team 28 https://kiharalab.org @kiharalab @d_kihara Sai Raghavendra Maddhuri Venkata Subramaniya Zicong Zhang Special Thanks to Sai and Zicong for discussion