PRESENTATION ON PredictPROTEIN
BY:
ARVIND KUMAR
MAJID JAMAAL
JAMIA MILLIA ISLAMIA
(CENTRAL UNIVERSITY)
OVERVIEW
• Introduction to PredictProtein
• Attempts to simplify outputs
• Different methods incorporated in PredictProtein
• Performance of the methods
• Novel methods
• Discussion of its input and output
• Demonstration
INTRODUCTION
PredictProtein (PP) is an Internet service for sequence analysis
and the prediction of protein structure and function.
PP is an automatic service that searches up-to-date public
sequence databases, creates alignments, and predicts aspects of
protein structure and function.
PP went online in 1992 at the European Molecular Biology Laboratory (EMBL,
Heidelberg); since 1999 it has operated from Columbia University (New York). PP
remains the most widely used public server for structure prediction: over 1.5 million
requests from users in 104 countries have been handled; over 13 000 users submitted
10 or more different queries. PP web pages are mirrored in 17 countries on 4 continents.
INTRODUCTION….
Input:
• Protein sequences or alignments, submitted in the query box.
Output:
• multiple sequence alignments,
• PROSITE sequence motifs,
• low-complexity regions (LCRs),
• nuclear-localization signals,
• regions lacking regular structure (NORS),
• predictions of secondary structure,
• solvent accessibility, globular regions,
INTRODUCTION….
Output:
• transmembrane helices,
• coiled-coil regions,
• structural switch regions,
• disulfide bonds,
• sub-cellular localization,
• functional annotations.
Upon request, fold recognition by prediction-based threading,
CHOP domain assignments, predictions of transmembrane
strands and inter-residue contacts are also available.
OUTPUT
Attempt to simplify output by
incorporating a hierarchy of
thresholds
By default, PP returns only those proteins found in the database
that are very likely to have a similar structure to the query
protein.
Particular predictions, such as those for membrane helices,
coiled-coil regions, signal peptides and nuclear localization
signals, are not returned if found to be below given probability
thresholds.
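The thresholding idea can be illustrated with a minimal Python sketch. The record layout, the prediction names, and every cutoff value below are assumptions made up for illustration; they are not PredictProtein's actual data format or thresholds.

```python
from dataclasses import dataclass

# Hypothetical cutoffs, for illustration only (not PredictProtein's real values).
THRESHOLDS = {
    "membrane_helix": 0.8,
    "coiled_coil": 0.9,
    "signal_peptide": 0.7,
    "nuclear_localization": 0.95,
}


@dataclass
class Prediction:
    kind: str           # type of predicted feature
    start: int          # first residue of the predicted region
    end: int            # last residue of the predicted region
    probability: float  # confidence reported by the method


def filter_predictions(predictions, thresholds=THRESHOLDS):
    """Keep predictions whose probability meets the cutoff for their kind.

    Kinds without a configured cutoff are passed through unchanged.
    """
    return [
        p for p in predictions
        if p.probability >= thresholds.get(p.kind, 0.0)
    ]


raw = [
    Prediction("membrane_helix", 12, 30, 0.91),
    Prediction("coiled_coil", 45, 80, 0.42),
    Prediction("signal_peptide", 1, 22, 0.70),
]
kept = filter_predictions(raw)
print([p.kind for p in kept])  # ['membrane_helix', 'signal_peptide']
```

The low-probability coiled-coil prediction is dropped, so the user sees only the features that pass their cutoff.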
DIFFERENT METHODS IN
PredictProtein
PERFORMANCE OF THE
METHODS
Alignment methods:
The dynamic-programming method MaxHom still appears best at
aligning pairs of proteins.
The iterated PSI-BLAST tends to be more sensitive in
detecting more distantly related proteins, and also in aligning
them correctly, provided the underlying profiles contain enough
information.
PERFORMANCE OF THE
METHODS
Protein domains and unusual regions:
Like SMART, for instance, ProDom tends to identify regions
that are significantly shorter than structural domains.
This is not the case for CHOP. However, CHOP misses many domain
boundaries since it relies heavily on similarities to domains
annotated by others (PrISM, Pfam-A).
Note that short regions of low-complexity (SEG) are fairly
common and are not necessarily informative.
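To give a feel for what "low complexity" means, here is a rough Python sketch that flags sequence windows with low Shannon entropy. This is not the SEG algorithm, which uses its own complexity measure; the window length and cutoff are arbitrary assumptions chosen for illustration.

```python
import math
from collections import Counter


def shannon_entropy(segment: str) -> float:
    """Shannon entropy (in bits) of the residue composition of a segment."""
    counts = Counter(segment)
    total = len(segment)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def low_entropy_windows(sequence: str, window: int = 12, cutoff: float = 2.2):
    """Return start positions of windows whose entropy is at or below cutoff.

    window and cutoff are illustrative defaults, not SEG parameters.
    """
    return [
        i for i in range(len(sequence) - window + 1)
        if shannon_entropy(sequence[i:i + window]) <= cutoff
    ]


# A serine-rich stretch embedded in an otherwise mixed sequence.
example = "MKTAYIAKQRQISFVKSHFSRQ" + "SSSSSGSSSSSS" + "LEERLGLIEVQAPILSRVGDGT"
print(low_entropy_windows(example))
```

Short hits from a scan like this are common in real proteins, which is why they should not be over-interpreted.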
PERFORMANCE OF THE
METHODS
Structure for an up-to-date evaluation of structure prediction:
(a) PROFsec secondary structure prediction: on average, 76% of all
residues are correctly predicted (only 71% by PHDsec).
(b) PROFacc accessibility prediction: almost 80% of all residues are
correctly predicted as either buried or exposed, and >80% of the surface
residues are correct.
(c) PHDhtm membrane helix prediction: 80% of the membrane helices are
correctly predicted; for 66% of all tested proteins.
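Per-residue figures such as the 76% quoted for PROFsec are commonly computed as the fraction of residues whose predicted state matches the observed state. The slides do not spell out the calculation, so the Python sketch below is an assumption about the usual definition. The one-letter state strings (H = helix, E = strand, L = other) are made-up example data.

```python
def per_residue_accuracy(observed: str, predicted: str) -> float:
    """Percentage of positions where the predicted state equals the observed one."""
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    if not observed:
        raise ValueError("sequences must not be empty")
    correct = sum(o == p for o, p in zip(observed, predicted))
    return 100.0 * correct / len(observed)


observed = "HHHHEEEELLLL"   # hypothetical observed states
predicted = "HHHLEEEELLLH"  # hypothetical predicted states
print(f"{per_residue_accuracy(observed, predicted):.1f}%")  # 83.3%
```

The same function works for the two-state buried/exposed case by using a two-letter alphabet instead.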
PERFORMANCE OF THE
METHODS
Structure for an up-to-date evaluation of structure
prediction:
(d) PROFtmb membrane barrel prediction: >80% of the membrane
strand residues are correctly predicted.
(e) GLOBE: not accurate enough to identify domain boundaries. GLOBE
often correctly captures trends such as ‘very unlike a globular protein’.
Multi-domain proteins with globular and non-globular domains—such as
NORS regions—are misclassified.
(f) COILS: perceived to be correct most of the time.
PERFORMANCE OF THE
METHODS
(g) CYSPRED: most disulfide-bonding residues are correctly identified;
however, most predicted bonds are wrong.
(h) ASP: if the protein has a structural switching region, this is usually detected
correctly.
(i) PROFcon08: most inter-residue contacts that are predicted are wrong; in
fact, even at a coverage of 10%, only 27–40% of all contacts are correctly
predicted.
(j) NORSp predictions of non-regular regions: so far, there is no example of a
protein with regular structure that we predicted to be irregular.
Note that the PROF and PHD series and CYSPRED are all based on artificial
neural networks (except for PROFtmb, which is based on a hidden Markov
model).
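The contact-prediction figures in (i) relate two quantities: accuracy (the share of predicted contacts that are real) and coverage (the share of real contacts that are predicted). The slides do not define either term, so the Python sketch below assumes these common definitions. The residue pairs are made-up example data.

```python
def contact_accuracy_and_coverage(predicted, observed):
    """Return (accuracy, coverage) for predicted versus observed contacts.

    accuracy = correct predictions / all predictions
    coverage = correct predictions / all observed contacts
    Pairs are treated as unordered, so (1, 10) and (10, 1) are the same contact.
    """
    predicted = {tuple(sorted(pair)) for pair in predicted}
    observed = {tuple(sorted(pair)) for pair in observed}
    if not predicted or not observed:
        raise ValueError("both contact lists must be non-empty")
    correct = predicted & observed
    return len(correct) / len(predicted), len(correct) / len(observed)


observed_contacts = [(1, 10), (2, 9), (3, 8), (4, 7), (5, 12)]  # hypothetical
predicted_contacts = [(10, 1), (2, 9), (6, 20), (7, 30)]        # hypothetical
accuracy, coverage = contact_accuracy_and_coverage(
    predicted_contacts, observed_contacts
)
print(accuracy, coverage)  # 0.5 0.4
```

Returning fewer, higher-confidence predictions typically raises accuracy while lowering coverage, which is why the two are quoted together.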
PERFORMANCE OF THE
METHODS
Protein function:
Our signal-motif-based predictions range in accuracy from close to 100% (NLS)
down to 50% (endoplasmic reticulum and Golgi apparatus).
Homology transfer and keyword-based annotations are returned at
>70% accuracy.
Our system for de novo prediction of sub-cellular localization reaches levels of
60% accuracy (extra-cellular space, cytoplasm, nucleus, mitochondria, other).
SUMMARY
• PredictProtein is an Internet service for predicting protein structure and
function.
• It performs its jobs by combining many methods.
• It uses a hierarchy of thresholds to simplify its output.
Thank you!!
