SlideShare a Scribd company logo
1 of 31
(ITERATIVE THREADING ASSEMBLY REFINEMENT)
By:
Animesh kumar
M. Sc. (Bioinformatics)
IASRI, New Delhi
Mob: 8512800911
 Background
• Structure modeling processes often involve human interventions because the
human-expert knowledge combined with biochemical information (function,
mutagenesis, catalytic residues, etc.) could help in both structural assembly
and model selection.
• Development of fully automated algorithms allows non-experts to generate
structural models for their own sequences through Internet services.
• I-TASSER (as 'Zhang-Server') was ranked as the no. 1 server in recent
CASP7 and CASP8 experiments.
THE ZHANG LAB ON-LINE SERVICE SYSTEM CONTAINS:
• ON-LINE SERVERS - [FOLDING, DOCKING, DESIGN, DOMAINS ETC; SOME ARE
DOWNLOADABLE]
• BIOINFORMATICS TOOLS - [ALIGNMENT, IMAGE, CLUSTERING ETC; ALL ARE
DOWNLOADABLE]
• DATABASES - [LIGAND, GPCR, GENOME, DECOY, POTENTIAL, CASP ETC; ALL ARE
DOWNLOADABLE]
I. Protein Structure and Function Prediction Services (folding, threading, potential,
contact, torsion, docking etc.)
II. Bioinformatics Tools (structure alignment, sequence alignment,
3D visualization, surface, and clustering, etc.)
III. Databases and Potentials
Introduction:
• I-TASSER server is an Internet service for protein structure and function
predictions.
• It allows academic users to automatically generate high-quality predictions
of 3D structure and biological function of protein molecules from their amino
acid sequences.
• Models are built based on multiple-threading alignments by LOMETS and
iterative TASSER simulations.
• I-TASSER (as 'Zhang-Server') was ranked as the No 1 server in recent
CASP7 and CASP8 experiments.
I-TASSER method
• I-TASSER is a hierarchical protein structure modeling approach.
• It is based on the Profile-Profile threading Alignment (PPA) and the iterative
implementation of the Threading ASSEmbly Refinement (TASSER) program.
• The target sequences are first threaded through a representative PDB
structure library (with a pair-wise sequence identity cut-off of 70%) to search
for the possible folds.
• It is done by four simple variants of PPA methods, with different
combinations of the hidden Markov model, PSI-BLAST profiles, Needleman-
Wunsch and Smith-Waterman alignment algorithms.
• Threading aligned regions are used to reassemble full-length models while
the threading unaligned regions (mainly loops) are built by ab initio
modeling.
• conformational space is searched by replica-exchange Monte Carlo
simulations.
• Structure trajectories are clustered by SPICKER and the cluster centroids
are obtained by the averaging the coordinates of all clustered structures.
• Fragment assembly simulation to rule out the steric clashes on the centroid
structures and to refine the models further.
• Spatial restraints are extracted from the centroids and the PDB structures
searched by the structure alignment program TM-align.
• Finally, the structure decoys are clustered and the lowest energy structure
in each cluster is selected, which has the Cα atoms and the side-chain
centers of mass specified.
• Pulchra is used to add backbone atoms (N, C, O) and Scwrl_3.0 to build
side-chain rotamers.
How does I-TASSER generate structure and function predictions?
submission of an amino acid sequence
server first tries to retrieve template proteins of similar folds (or super-secondary
structures) from the PDB library by LOMETS, (a locally installed meta-threading
approach).
if fragments found
then are reassembled into full-length
models by replica-exchange Monte Carlo
simulations by threading unaligned regions
(mainly loops) through ab initio modeling.
If not
I-TASSER will build the whole structures
by ab initio modeling. The low free-energy
states are identified by SPICKER through
clustering the simulation decoys.
• Third step, the fragment assembly simulation is performed again starting from
the SPICKER cluster centroids, where the spatial restrains collected from
both the LOMETS templates and the PDB structures by TM-align are used to
guide the simulations.
• The purpose of the second iteration is to remove the steric clash as well as to
refine the global topology of the cluster centroids.
• The decoys generated in the second simulations are then clustered and the
lowest energy structures are selected.
• The final full-atomic models are obtained by REMO which builds the atomic
details from the selected I-TASSER decoys through the optimization of the
hydrogen-bonding network.
Binding sites
• If any region with >80 residues has no aligned residues in at least two strong PPA
alignments of Z-score > Z0 , the target will be judged as a multiple domain protein and
domain boundaries are automatically assigned based on the borders of the large gaps.
• I-TASSER simulations will be run for the full chain as well as the separate domains. The
final full-length models are generated by docking the model of domains together.
• The domain docking is performed by a quick Metropolis Monte Carlo simulation where the
energy is defined as the RMSD of domain models to the full-chain model plus the
reciprocal of the number of steric clashes between domains.
• The goal of the docking is to find the domain orientation that is closest to the I-TASSER
full-chain model but has the minimum steric clashes.
• This procedure does not influence the multiple domain proteins which have all domains
completely aligned by the PPAs.
Server setting
• Project name: I-TASSER server
• Home page: http://zhanglab.ccmb.med.umich.edu/I-TASSER/
• I-TASSER Standalone Package (Version 4.0)
 Operating system(s): Windows, Linux, Mac
 Programming language: Perl, Fortran77
 License: GPL
o Input: Amino acid sequence of the proteins(10–1,500 residues in FASTA format)
o Output: Email sent to the users, include the PDB format files of up to 5 predicted
models, C-score of the models, and the predicted RMSD and Tm-score of the first
model. A brief explanation of the RMSD, TM-score, and C-score
URL: http://zhanglab.ccmb.med.umich.edu/I-TASSER/
Running:
• Log on to I-Tasser web page.
• Copy & paste or directlt upload the a.a. sequence int the provided box also
provide e-mail address and name of the job.
• There is also option for specifying the inter residue contact or distance
restraints to exclude some tempelate.
• To submit click on “Run I-Tasser”
• Check the status of submitted job by visiting the I-Tasser queue page.
• Click on search to find out the submitted job by providing the job-Id no.
• After the structure and modelling is finished a notification e-mail containing
image of the predicted structure and web link will be sent.
• Click to view and download the result.
Structure analysis
1. Predicted secondary structure
• Displayed H as -helix, S for β-strand and C for coil.
• Also consider the confidence score for each residue.
• Look for region for long stretches of secondary structure to estimate
the core region in the protein.
2. Predicted Solvent Accessibility
• To predict the buried and exposed region in the query.
• It is “0” for buried residue and “9” for residue.
• The region with solvent exposed and hydrophilic residue are
potential hydration or functional site.
3. Top 5 Models predicted by I-TASSER
• There is predicted tertiary structure of protein in the interactive j-mol
appellate.
• Left click to change the appearance of displayed structure (style,
zoom, select etc.)
• C-score
 To analyze the quality of the predict.
 its range is -5 to 2.
 Higher score reflect a model of better quality.
• TM score and RMSD
 standards for measuring structural similarity between query and
tempelate
 used to measure the accuracy of structure modeling when the native
structure is known
 TM-score >0.5 indicates a model of correct topology and a TM-
score<0.17 means a random similarity
4. Top 10 templates used by I-TASSER
• Analyze sequence identity in the threading aligned region and for the
whole chain to access the homology between query and template.
• Z-score
 Used to analyze the quality of threading alignment
 Z-score>1 reflect confident alignment and most likely to have the
same fold as query protein
• High sequence identity 1 in the threading aligned region as compared to
whole chain to access the homology between query and template
• High sequence identity 2 represent the evolutionary relationship between
the query and template.
• Colored residue shows the conserved residue or motif in the query protein
and template protein.
• High sequence identity in the threading aligned region as compared to
whole chain alignment indicate presence of conserved motif or domain in
the sequence.
• Access the coverage of the alignment by inspecting the alignment.
• If the coverage of top alignment is low and confined to a small region of a
query protein shows absence of long segment of query segment, which
indicate query protein contain more than one domain.
• In this case it is suggested to split the sequence and model the domain
individually.
5. Structural analog in PDB and Enzyme commission number prediction
• To determine the top 10 structural analog of the first predicted models as identified by the
alignment program TM-align
• TM-score>0.5 indicate the detected homology and the model have a similar topology and
used to determine structural class of protein family of query sequence
• TM-score<0.3 signify random structure.
• Analyze the sequence identity and RMSD to access the conservational special motif of the
model and structural analog for that (see colored portion
6. Function prediction using COFACTOR
• EC number
 Gives the potential homologue of the query protein
 Confidence level is shown as EC score
 CscoreEC is the confidence score for the Enzyme Classification (EC) number
prediction
 CscoreEC values range in between [0-1]
 A higher score indicates a more reliable EC number prediction.
 RMSDa is the RMSD between residues that are structurally aligned by TM-align
while IDENa is the percentage sequence identity in the structurally aligned region.
 Cov. represents the coverage of global structural alignment and is equal to the
number of structurally aligned residues divided by length of the query protein.
 If the EC score is very high then there is lack of consensus against identified hits,
the prediction becomes less reliable then we go for gene ontology prediction.
• Gene ontology (GO) term and protein- ligand binding site prediction
 CscoreGO, which is a combined measure for evaluating global and local
similarity between query and template protein.
 CscoreGO values range in between [0-1]
 A higher value indicates a better confidence in predicting the function
using the template.
 Each protein is associated with multiple GO term describing its molecular
function, biological process and cellular location (each term is linked to
respective amigo website and lineage)
 Analyze the Fh score(functional homology score) column to access the
functional homology between query and template
 GO score >0.5 indicates a reliable prediction.
7. Template proteins with similar binding site
• They are ranked based on the no. of predicted ligand conformation.
• They share a common binding pocket. The best indentified ligand are already
displayed in the J-mol applet.
• Click on the other radio buttons to visualize predicted binding site and ligand
interacting residues.
• CscoreLB
 is the confidence score of predicted binding site
 Its values range in between [0-1]
 A higher score indicates a more reliable ligand-binding site prediction.
• BS-score
 measure of local similarity (sequence & structure) between template
binding site and predicted binding site in the query structure.
 BS-score >1 reflects a significant local match(structural similarity) between
the predicted and template binding site.
Advantage
• Main advantage over the existing structural modelling method is the
inherent structure fragment assembly approach which consistently drive
the threading alignment close to the native state.
I- Tasser

More Related Content

What's hot

Protien Structure Prediction
Protien Structure PredictionProtien Structure Prediction
Protien Structure PredictionSelimReza76
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)ShivaniShewale2
 
Yeast two hybrid
Yeast two hybridYeast two hybrid
Yeast two hybridhina ojha
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENTMariya Raju
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methodsratanvishwas
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure predictionkaramveer prajapat
 
gene prediction programs
gene prediction programsgene prediction programs
gene prediction programsMugdhaSharma11
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSsandeshGM
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)talhakhat
 

What's hot (20)

Protien Structure Prediction
Protien Structure PredictionProtien Structure Prediction
Protien Structure Prediction
 
(Expasy)
(Expasy)(Expasy)
(Expasy)
 
BLAST
BLASTBLAST
BLAST
 
I-Tasser
I-TasserI-Tasser
I-Tasser
 
Prosite
PrositeProsite
Prosite
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
Pymol
PymolPymol
Pymol
 
Yeast two hybrid
Yeast two hybridYeast two hybrid
Yeast two hybrid
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methods
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
 
Clustal
ClustalClustal
Clustal
 
gene prediction programs
gene prediction programsgene prediction programs
gene prediction programs
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Homology Modelling
Homology ModellingHomology Modelling
Homology Modelling
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
 

Similar to I- Tasser

Tertiary structure prediction- MODELLER, RASMOL
Tertiary structure prediction- MODELLER, RASMOLTertiary structure prediction- MODELLER, RASMOL
Tertiary structure prediction- MODELLER, RASMOLkaverinair1
 
In silico structure prediction
In silico structure predictionIn silico structure prediction
In silico structure predictionSubin E K
 
Session ii g2 overview metabolic network modeling mcc
Session ii g2 overview metabolic network modeling mccSession ii g2 overview metabolic network modeling mcc
Session ii g2 overview metabolic network modeling mccUSD Bioinformatics
 
HOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAYHOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAYShikha Popali
 
Pretzel: optimized Machine Learning framework for low-latency and high throug...
Pretzel: optimized Machine Learning framework for low-latency and high throug...Pretzel: optimized Machine Learning framework for low-latency and high throug...
Pretzel: optimized Machine Learning framework for low-latency and high throug...NECST Lab @ Politecnico di Milano
 
New Microsoft PowerPoint Presentation (2).pptx
New Microsoft PowerPoint Presentation (2).pptxNew Microsoft PowerPoint Presentation (2).pptx
New Microsoft PowerPoint Presentation (2).pptxpraveen kumar
 
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...Jinwon Lee
 
Distributed approach for Peptide Identification
Distributed approach for Peptide IdentificationDistributed approach for Peptide Identification
Distributed approach for Peptide Identificationabhinav vedanbhatla
 
Large Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate DescentLarge Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate DescentShaleen Kumar Gupta
 
AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfH K Yoon
 
Computational Analysis with ICM
Computational Analysis with ICMComputational Analysis with ICM
Computational Analysis with ICMVernon D Dutch Jr
 
Homology Modeling.pptx
Homology Modeling.pptxHomology Modeling.pptx
Homology Modeling.pptxAmnaAkram29
 
Optimization of Test Pattern Using Genetic Algorithm for Testing SRAM
Optimization of Test Pattern Using Genetic Algorithm for Testing SRAMOptimization of Test Pattern Using Genetic Algorithm for Testing SRAM
Optimization of Test Pattern Using Genetic Algorithm for Testing SRAMIJERA Editor
 
modelling assignment
modelling assignmentmodelling assignment
modelling assignmentShwetA Kumari
 
L1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptxL1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptxkigaruantony
 
AN EFFICIENT ALGORITHM FOR WRAPPER AND TAM CO-OPTIMIZATION TO REDUCE TEST APP...
AN EFFICIENT ALGORITHM FOR WRAPPER AND TAM CO-OPTIMIZATION TO REDUCE TEST APP...AN EFFICIENT ALGORITHM FOR WRAPPER AND TAM CO-OPTIMIZATION TO REDUCE TEST APP...
AN EFFICIENT ALGORITHM FOR WRAPPER AND TAM CO-OPTIMIZATION TO REDUCE TEST APP...IAEME Publication
 

Similar to I- Tasser (20)

Tertiary structure prediction- MODELLER, RASMOL
Tertiary structure prediction- MODELLER, RASMOLTertiary structure prediction- MODELLER, RASMOL
Tertiary structure prediction- MODELLER, RASMOL
 
In silico structure prediction
In silico structure predictionIn silico structure prediction
In silico structure prediction
 
Session ii g2 overview metabolic network modeling mcc
Session ii g2 overview metabolic network modeling mccSession ii g2 overview metabolic network modeling mcc
Session ii g2 overview metabolic network modeling mcc
 
HOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAYHOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAY
 
Protein Threading
Protein ThreadingProtein Threading
Protein Threading
 
Pretzel: optimized Machine Learning framework for low-latency and high throug...
Pretzel: optimized Machine Learning framework for low-latency and high throug...Pretzel: optimized Machine Learning framework for low-latency and high throug...
Pretzel: optimized Machine Learning framework for low-latency and high throug...
 
Machine Learning @NECST
Machine Learning @NECSTMachine Learning @NECST
Machine Learning @NECST
 
New Microsoft PowerPoint Presentation (2).pptx
New Microsoft PowerPoint Presentation (2).pptxNew Microsoft PowerPoint Presentation (2).pptx
New Microsoft PowerPoint Presentation (2).pptx
 
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
PR-344: A Battle of Network Structures: An Empirical Study of CNN, Transforme...
 
Distributed approach for Peptide Identification
Distributed approach for Peptide IdentificationDistributed approach for Peptide Identification
Distributed approach for Peptide Identification
 
Jr3417541760
Jr3417541760Jr3417541760
Jr3417541760
 
Large Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate DescentLarge Scale Kernel Learning using Block Coordinate Descent
Large Scale Kernel Learning using Block Coordinate Descent
 
AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdf
 
Computational Analysis with ICM
Computational Analysis with ICMComputational Analysis with ICM
Computational Analysis with ICM
 
Homology Modeling.pptx
Homology Modeling.pptxHomology Modeling.pptx
Homology Modeling.pptx
 
Optimization of Test Pattern Using Genetic Algorithm for Testing SRAM
Optimization of Test Pattern Using Genetic Algorithm for Testing SRAMOptimization of Test Pattern Using Genetic Algorithm for Testing SRAM
Optimization of Test Pattern Using Genetic Algorithm for Testing SRAM
 
Swaati modeling
Swaati modeling Swaati modeling
Swaati modeling
 
modelling assignment
modelling assignmentmodelling assignment
modelling assignment
 
L1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptxL1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptx
 
AN EFFICIENT ALGORITHM FOR WRAPPER AND TAM CO-OPTIMIZATION TO REDUCE TEST APP...
AN EFFICIENT ALGORITHM FOR WRAPPER AND TAM CO-OPTIMIZATION TO REDUCE TEST APP...AN EFFICIENT ALGORITHM FOR WRAPPER AND TAM CO-OPTIMIZATION TO REDUCE TEST APP...
AN EFFICIENT ALGORITHM FOR WRAPPER AND TAM CO-OPTIMIZATION TO REDUCE TEST APP...
 

Recently uploaded

Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 

Recently uploaded (20)

Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 

I- Tasser

  • 1. (ITERATIVE THREADING ASSEMBLY REFINEMENT) By: Animesh kumar M. Sc. (Bioinformatics) IASRI, New Delhi Mob: 8512800911
  • 2.  Background • Structure modeling processes often involve human interventions because the human-expert knowledge combined with biochemical information (function, mutagenesis, catalytic residues, etc.) could help in both structural assembly and model selection. • Development of fully automated algorithms allows non-experts to generate structural models for their own sequences through Internet services. • I-TASSER (as 'Zhang-Server') was ranked as the no. 1 server in recent CASP7 and CASP8 experiments.
  • 3. THE ZHANG LAB ON-LINE SERVICE SYSTEM CONTAINS: • ON-LINE SERVERS - [FOLDING, DOCKING, DESIGN, DOMAINS ETC; SOME ARE DOWNLOADABLE] • BIOINFORMATICS TOOLS - [ALIGNMENT, IMAGE, CLUSTERING ETC; ALL ARE DOWNLOADABLE] • DATABASES - [LIGAND, GPCR, GENOME, DECOY, POTENTIAL, CASP ETC; ALL ARE DOWNLOADABLE]
  • 4. I. Protein Structure and Function Prediction Services (folding, threading, potential, contact, torsion, docking etc.) II. Bioinformatics Tools (structure alignment, sequence alignment, 3D visualization, surface, and clustering, etc.) III. Databases and Potentials
  • 5. Introduction: • I-TASSER server is an Internet service for protein structure and function predictions. • It allows academic users to automatically generate high-quality predictions of 3D structure and biological function of protein molecules from their amino acid sequences. • Models are built based on multiple-threading alignments by LOMETS and iterative TASSER simulations. • I-TASSER (as 'Zhang-Server') was ranked as the No 1 server in recent CASP7 and CASP8 experiments.
  • 6. I-TASSER method • I-TASSER is a hierarchical protein structure modeling approach. • It is based on the Profile-Profile threading Alignment (PPA) and the iterative implementation of the Threading ASSEmbly Refinement (TASSER) program. • The target sequences are first threaded through a representative PDB structure library (with a pair-wise sequence identity cut-off of 70%) to search for the possible folds. • It is done by four simple variants of PPA methods, with different combinations of the hidden Markov model, PSI-BLAST profiles, Needleman- Wunsch and Smith-Waterman alignment algorithms.
  • 7. • Threading aligned regions are used to reassemble full-length models while the threading unaligned regions (mainly loops) are built by ab initio modeling. • conformational space is searched by replica-exchange Monte Carlo simulations. • Structure trajectories are clustered by SPICKER and the cluster centroids are obtained by the averaging the coordinates of all clustered structures. • Fragment assembly simulation to rule out the steric clashes on the centroid structures and to refine the models further.
  • 8. • Spatial restraints are extracted from the centroids and the PDB structures searched by the structure alignment program TM-align. • Finally, the structure decoys are clustered and the lowest energy structure in each cluster is selected, which has the Cα atoms and the side-chain centers of mass specified. • Pulchra is used to add backbone atoms (N, C, O) and Scwrl_3.0 to build side-chain rotamers.
  • 9. How does I-TASSER generate structure and function predictions? submission of an amino acid sequence server first tries to retrieve template proteins of similar folds (or super-secondary structures) from the PDB library by LOMETS, (a locally installed meta-threading approach). if fragments found then are reassembled into full-length models by replica-exchange Monte Carlo simulations by threading unaligned regions (mainly loops) through ab initio modeling. If not I-TASSER will build the whole structures by ab initio modeling. The low free-energy states are identified by SPICKER through clustering the simulation decoys.
  • 10. • Third step, the fragment assembly simulation is performed again starting from the SPICKER cluster centroids, where the spatial restrains collected from both the LOMETS templates and the PDB structures by TM-align are used to guide the simulations. • The purpose of the second iteration is to remove the steric clash as well as to refine the global topology of the cluster centroids. • The decoys generated in the second simulations are then clustered and the lowest energy structures are selected. • The final full-atomic models are obtained by REMO which builds the atomic details from the selected I-TASSER decoys through the optimization of the hydrogen-bonding network.
  • 12. • If any region with >80 residues has no aligned residues in at least two strong PPA alignments of Z-score > Z0 , the target will be judged as a multiple domain protein and domain boundaries are automatically assigned based on the borders of the large gaps. • I-TASSER simulations will be run for the full chain as well as the separate domains. The final full-length models are generated by docking the model of domains together. • The domain docking is performed by a quick Metropolis Monte Carlo simulation where the energy is defined as the RMSD of domain models to the full-chain model plus the reciprocal of the number of steric clashes between domains. • The goal of the docking is to find the domain orientation that is closest to the I-TASSER full-chain model but has the minimum steric clashes. • This procedure does not influence the multiple domain proteins which have all domains completely aligned by the PPAs.
  • 13. Server setting • Project name: I-TASSER server • Home page: http://zhanglab.ccmb.med.umich.edu/I-TASSER/ • I-TASSER Standalone Package (Version 4.0)  Operating system(s): Windows, Linux, Mac  Programming language: Perl, Fortran77  License: GPL o Input: Amino acid sequence of the proteins(10–1,500 residues in FASTA format) o Output: Email sent to the users, include the PDB format files of up to 5 predicted models, C-score of the models, and the predicted RMSD and Tm-score of the first model. A brief explanation of the RMSD, TM-score, and C-score
  • 15. Running: • Log on to I-Tasser web page. • Copy & paste or directlt upload the a.a. sequence int the provided box also provide e-mail address and name of the job. • There is also option for specifying the inter residue contact or distance restraints to exclude some tempelate. • To submit click on “Run I-Tasser” • Check the status of submitted job by visiting the I-Tasser queue page. • Click on search to find out the submitted job by providing the job-Id no. • After the structure and modelling is finished a notification e-mail containing image of the predicted structure and web link will be sent. • Click to view and download the result.
  • 16. Structure analysis 1. Predicted secondary structure • Displayed H as -helix, S for β-strand and C for coil. • Also consider the confidence score for each residue. • Look for region for long stretches of secondary structure to estimate the core region in the protein.
  • 17. 2. Predicted Solvent Accessibility • To predict the buried and exposed region in the query. • It is “0” for buried residue and “9” for residue. • The region with solvent exposed and hydrophilic residue are potential hydration or functional site.
  • 18. 3. Top 5 Models predicted by I-TASSER • There is predicted tertiary structure of protein in the interactive j-mol appellate. • Left click to change the appearance of displayed structure (style, zoom, select etc.) • C-score  To analyze the quality of the predict.  its range is -5 to 2.  Higher score reflect a model of better quality.
  • 19. • TM score and RMSD  standards for measuring structural similarity between query and tempelate  used to measure the accuracy of structure modeling when the native structure is known  TM-score >0.5 indicates a model of correct topology and a TM- score<0.17 means a random similarity 4. Top 10 templates used by I-TASSER • Analyze sequence identity in the threading aligned region and for the whole chain to access the homology between query and template. • Z-score  Used to analyze the quality of threading alignment  Z-score>1 reflect confident alignment and most likely to have the same fold as query protein
  • 20. • High sequence identity 1 in the threading aligned region as compared to whole chain to access the homology between query and template • High sequence identity 2 represent the evolutionary relationship between the query and template. • Colored residue shows the conserved residue or motif in the query protein and template protein. • High sequence identity in the threading aligned region as compared to whole chain alignment indicate presence of conserved motif or domain in the sequence. • Access the coverage of the alignment by inspecting the alignment. • If the coverage of top alignment is low and confined to a small region of a query protein shows absence of long segment of query segment, which indicate query protein contain more than one domain. • In this case it is suggested to split the sequence and model the domain individually.
  • 21.
  • 22. 5. Structural analog in PDB and Enzyme commission number prediction • To determine the top 10 structural analog of the first predicted models as identified by the alignment program TM-align • TM-score>0.5 indicate the detected homology and the model have a similar topology and used to determine structural class of protein family of query sequence • TM-score<0.3 signify random structure. • Analyze the sequence identity and RMSD to access the conservational special motif of the model and structural analog for that (see colored portion
  • 23.
  • 24. 6. Function prediction using COFACTOR • EC number  Gives the potential homologue of the query protein  Confidence level is shown as EC score  CscoreEC is the confidence score for the Enzyme Classification (EC) number prediction  CscoreEC values range in between [0-1]  A higher score indicates a more reliable EC number prediction.  RMSDa is the RMSD between residues that are structurally aligned by TM-align while IDENa is the percentage sequence identity in the structurally aligned region.  Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein.  If the EC score is very high then there is lack of consensus against identified hits, the prediction becomes less reliable then we go for gene ontology prediction.
  • 25.
  • 26. • Gene ontology (GO) term and protein- ligand binding site prediction  CscoreGO, which is a combined measure for evaluating global and local similarity between query and template protein.  CscoreGO values range in between [0-1]  A higher value indicates a better confidence in predicting the function using the template.  Each protein is associated with multiple GO term describing its molecular function, biological process and cellular location (each term is linked to respective amigo website and lineage)  Analyze the Fh score(functional homology score) column to access the functional homology between query and template  GO score >0.5 indicates a reliable prediction.
  • 27.
  • 28. 7. Template proteins with similar binding site • They are ranked based on the no. of predicted ligand conformation. • They share a common binding pocket. The best indentified ligand are already displayed in the J-mol applet. • Click on the other radio buttons to visualize predicted binding site and ligand interacting residues. • CscoreLB  is the confidence score of predicted binding site  Its values range in between [0-1]  A higher score indicates a more reliable ligand-binding site prediction. • BS-score  measure of local similarity (sequence & structure) between template binding site and predicted binding site in the query structure.  BS-score >1 reflects a significant local match(structural similarity) between the predicted and template binding site.
  • 29.
  • 30. Advantage • Main advantage over the existing structural modelling method is the inherent structure fragment assembly approach which consistently drive the threading alignment close to the native state.