SlideShare a Scribd company logo
1 of 9
Knowledge Engineering Course (SCOM7348)
                                                                         University of Birzeit, Palestine
                                                                                       December, 2012




                                   Synthesis Paper Talk




 Bioinformatics and protein ontologies

                                   Malak Eshtawi.
                                           (Student)
                              Master of Computing, Birzeit University
                                      meshtawi@ymail.com
This is a student talk, at the Knowledge Engineering course, each student is is
                    asked to present his/her synthesis paper.
      Course Page: http://jarrar-courses.blogspot.com/2011/09/knowledgeengineering-fall2011.html


                                                                                                        1
Content


Introduction.

The protein phosphatase family.

Phosphatase ontology.

Materials and methods.

Translation into OWL.


                                   2
Introduction
 Classification of proteins is a central process in
  understanding the molecular biology of an
  organism. Sequencing is a first step in revealing
  the molecular machinery of a cell, but the
  sequences need to be characterised and
  classified, at DNA and protein levels, before
  biologists can start more thorough investigations.
 Techniques involved in sequencing, especially the
  high throughput sequencing of whole genomes,
  have improved dramatically in recent years.
  Consequently, classification and analysis of data
  is now the rate-limiting step.

                                                  3
The protein phosphatase family

   Protein phosphatases and protein kinases
   control phosphorylation events in the cell, which
   regulate many different aspects of cell life and
   cell interactions with the environment.Recent
   reviews on the protein phosphatase family focus
   on either :-
i. tyrosine phosphatases .
ii. serine/threonine phosphatases.



                                                       4
Phosphatase ontology


 An ontology attempts to describe what exists in
  the world; an ontology of protein phosphatases
  describes what protein phosphatases exist. In
  computer science, an ontology creates a model of
  what a community understands about its domain
  as a highly interconnected hierarch of concepts
  and relationships. By agreeing upon an ontology
  and the terms within it, a community can create a
  shared understanding of their domain of study.



                                                 5
Materials and methods
The bioinformatics analyses necessary for classification of
a protein sequence as a protein phosphatase can be divided
    into the following stages :-
I. Extract the protein phosphatase gene products from the
      genome in a pre-screening step, without extracting any
      non-phosphatase proteins .
II. Perform an InterproScan on each protein phosphatase to
      determine its p-domain composition.
III. Use the pattern of p-domain composition to identify to
      which class of phosphatases each protein belongs.




                                                         6
Translation into OWL

 It requires implicit knowledge of the ontology. In
  this case, the use of naming conventions within
  the ontology made this transformation simple.
  Once translated, all descriptions of protein
  instances were loaded into the Instance Store.
  We then systematically asked the instance store
  which proteins belonged to which class of
  phosphatase.


                                                       7
References

• Please make sure you list all references,
  and to specify them correctly
 Allen,J.E., Pertea,M. and Salzberg,S.L. (2004) Computational gene prediction using
  multiple sources of evidence.. Genome Res., 14(1), 142–8.

   Alonso,A., Sasin,J., Bottini,N., Friedberg,I., Friedberg,I., Osterman,A., Godzik,A.,
    Hunter,T., Dixon,J. and Mustelin,T. (2004) Protein tyrosine phosphatases in the
    human genome. Cell, 117(6), 699–711, Review.

 OWL Web Ontology Language Reference http://www.w3.org/TR/owl-ref/
  Pillutla,R.C., Shimamoto,A., Furuichi,Y. and Shatkin,A.J. (1998) Human mRNA
  capping enzyme (RNGTT), cap methyltransferase (RNMT) map to 6q16 and
  18p11.22-p11.23, respectively. Genomics, 154(2), 351–3.




                                                                                           8
Thank You




            9

More Related Content

Viewers also liked

Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016Purdue University
 
Protein function and bioinformatics
Protein function and bioinformaticsProtein function and bioinformatics
Protein function and bioinformaticsNeil Saunders
 
Protein structure 2
Protein structure 2Protein structure 2
Protein structure 2Rainu Rajeev
 
2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekingeProf. Wim Van Criekinge
 
Chemistry of amino acids
Chemistry of amino acidsChemistry of amino acids
Chemistry of amino acidsNamrata Chhabra
 
amino acids
amino acidsamino acids
amino acidsAmeenah
 

Viewers also liked (7)

Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016
 
Protein function and bioinformatics
Protein function and bioinformaticsProtein function and bioinformatics
Protein function and bioinformatics
 
Protein structure 2
Protein structure 2Protein structure 2
Protein structure 2
 
Bioalgo 2012-03-massspec
Bioalgo 2012-03-massspecBioalgo 2012-03-massspec
Bioalgo 2012-03-massspec
 
2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge
 
Chemistry of amino acids
Chemistry of amino acidsChemistry of amino acids
Chemistry of amino acids
 
amino acids
amino acidsamino acids
amino acids
 

Similar to Presentation(malak eshtawi)

Proteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsProteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsClaudine83
 
Proteomics, definatio , general concept, signficance
Proteomics,  definatio , general concept, signficanceProteomics,  definatio , general concept, signficance
Proteomics, definatio , general concept, signficanceKAUSHAL SAHU
 
Selective Protein Staining On Native Gel
Selective Protein Staining On Native GelSelective Protein Staining On Native Gel
Selective Protein Staining On Native GelMartina Bertsch
 
Functional proteomics, methods and tools
Functional proteomics, methods and toolsFunctional proteomics, methods and tools
Functional proteomics, methods and toolsKAUSHAL SAHU
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and toolsKAUSHAL SAHU
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirKAUSHAL SAHU
 
protein microarray.pptx
protein microarray.pptxprotein microarray.pptx
protein microarray.pptxAtulSingh77625
 
Genomics and proteomics II
Genomics and proteomics IIGenomics and proteomics II
Genomics and proteomics IINikolay Vyahhi
 
Predicting peptide interactions using protein building blocks
Predicting peptide interactions using protein building blocksPredicting peptide interactions using protein building blocks
Predicting peptide interactions using protein building blocksPeter Vanhee
 
Protein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyProtein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyZac Darcy
 
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYPROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYZac Darcy
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSHEETHUMOLKS
 
Integrative omics approches
Integrative omics approches   Integrative omics approches
Integrative omics approches Sayali Magar
 
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...Jim Jimenez
 

Similar to Presentation(malak eshtawi) (20)

Proteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsProteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomics
 
Proteomics, definatio , general concept, signficance
Proteomics,  definatio , general concept, signficanceProteomics,  definatio , general concept, signficance
Proteomics, definatio , general concept, signficance
 
Selective Protein Staining On Native Gel
Selective Protein Staining On Native GelSelective Protein Staining On Native Gel
Selective Protein Staining On Native Gel
 
Functional proteomics, methods and tools
Functional proteomics, methods and toolsFunctional proteomics, methods and tools
Functional proteomics, methods and tools
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
 
Proteomic and metabolomic
Proteomic and metabolomicProteomic and metabolomic
Proteomic and metabolomic
 
protein microarray.pptx
protein microarray.pptxprotein microarray.pptx
protein microarray.pptx
 
Genomics and proteomics II
Genomics and proteomics IIGenomics and proteomics II
Genomics and proteomics II
 
Predicting peptide interactions using protein building blocks
Predicting peptide interactions using protein building blocksPredicting peptide interactions using protein building blocks
Predicting peptide interactions using protein building blocks
 
Protein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyProtein sequence classification in data mining– a study
Protein sequence classification in data mining– a study
 
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYPROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
www.ijerd.com
www.ijerd.comwww.ijerd.com
www.ijerd.com
 
Resume
ResumeResume
Resume
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 
Introduction to Proteogenomics
Introduction to Proteogenomics Introduction to Proteogenomics
Introduction to Proteogenomics
 
Integrative omics approches
Integrative omics approches   Integrative omics approches
Integrative omics approches
 
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
 

Presentation(malak eshtawi)

  • 1. Knowledge Engineering Course (SCOM7348) University of Birzeit, Palestine December, 2012 Synthesis Paper Talk Bioinformatics and protein ontologies Malak Eshtawi. (Student) Master of Computing, Birzeit University meshtawi@ymail.com This is a student talk, at the Knowledge Engineering course, each student is is asked to present his/her synthesis paper. Course Page: http://jarrar-courses.blogspot.com/2011/09/knowledgeengineering-fall2011.html 1
  • 2. Content Introduction. The protein phosphatase family. Phosphatase ontology. Materials and methods. Translation into OWL. 2
  • 3. Introduction  Classification of proteins is a central process in understanding the molecular biology of an organism. Sequencing is a first step in revealing the molecular machinery of a cell, but the sequences need to be characterised and classified, at DNA and protein levels, before biologists can start more thorough investigations.  Techniques involved in sequencing, especially the high throughput sequencing of whole genomes, have improved dramatically in recent years. Consequently, classification and analysis of data is now the rate-limiting step. 3
  • 4. The protein phosphatase family Protein phosphatases and protein kinases control phosphorylation events in the cell, which regulate many different aspects of cell life and cell interactions with the environment.Recent reviews on the protein phosphatase family focus on either :- i. tyrosine phosphatases . ii. serine/threonine phosphatases. 4
  • 5. Phosphatase ontology  An ontology attempts to describe what exists in the world; an ontology of protein phosphatases describes what protein phosphatases exist. In computer science, an ontology creates a model of what a community understands about its domain as a highly interconnected hierarch of concepts and relationships. By agreeing upon an ontology and the terms within it, a community can create a shared understanding of their domain of study. 5
  • 6. Materials and methods The bioinformatics analyses necessary for classification of a protein sequence as a protein phosphatase can be divided into the following stages :- I. Extract the protein phosphatase gene products from the genome in a pre-screening step, without extracting any non-phosphatase proteins . II. Perform an InterproScan on each protein phosphatase to determine its p-domain composition. III. Use the pattern of p-domain composition to identify to which class of phosphatases each protein belongs. 6
  • 7. Translation into OWL  It requires implicit knowledge of the ontology. In this case, the use of naming conventions within the ontology made this transformation simple. Once translated, all descriptions of protein instances were loaded into the Instance Store. We then systematically asked the instance store which proteins belonged to which class of phosphatase. 7
  • 8. References • Please make sure you list all references, and to specify them correctly  Allen,J.E., Pertea,M. and Salzberg,S.L. (2004) Computational gene prediction using multiple sources of evidence.. Genome Res., 14(1), 142–8.  Alonso,A., Sasin,J., Bottini,N., Friedberg,I., Friedberg,I., Osterman,A., Godzik,A., Hunter,T., Dixon,J. and Mustelin,T. (2004) Protein tyrosine phosphatases in the human genome. Cell, 117(6), 699–711, Review.  OWL Web Ontology Language Reference http://www.w3.org/TR/owl-ref/ Pillutla,R.C., Shimamoto,A., Furuichi,Y. and Shatkin,A.J. (1998) Human mRNA capping enzyme (RNGTT), cap methyltransferase (RNMT) map to 6q16 and 18p11.22-p11.23, respectively. Genomics, 154(2), 351–3. 8