SlideShare a Scribd company logo
1 of 24
Biological Databases
• Primary databases
• Secondary databases
• Specialized databases
Biological Databases
Primary Databases: contains original biological data
(raw nucleic acid sequence data produced and submitted
by researchers worldwide)
-Genbank (https://www.ncbi.nlm.nih.gov/genbank/)
-EMBL (European Molecular Biology Laboratory)
(https://www.ebi.ac.uk)
-DDBJ (DNA Data Bank of Japan)
(https://www.ddbj.nig.ac.jp/index-e.html)
Secondary Databases: which contain computationally processed
sequence information derived from the primary databases
-SWISS-PROT: which provides detailed sequence annotation that
includes structure, function, and protein family assignment
-TrEMBL, a database of translated nucleic acid sequences stored
in the EMBL database
-UniProt: (SWISS-PROT + TrEMBL + PIR), which has larger
coverage than any one of the three databases
-Pfam and Blocks: aligned protein sequence information, motifs,
patterns
DALI: protein secondary structure database that is vital for protein
structure classification and threading analysis
Major Biological Databases Available Via the
World Wide Web
SWISS-Prot Curated protein sequence database www.ebi.ac.uk/swissprot/acces
s.html
AceDB Genome database for Caenorhabditis elegans www.acedb.org
DDBJ Primary nucleotide sequence database in Japan www.ddbj.nig.ac.jp
EMBL Primary nucleotide sequence database in Europe www.ebi.ac.uk/embl/index.html
Entrez NCBI portal for a variety of biological databases www.ncbi.nlm.nih.gov/gquery/g
query.fcgi
ExPASY Proteomics database http://us.expasy.org/
FlyBase A database of the Drosophila genome http://flybase.bio.indiana.edu/
FSSP Protein secondary structures www.bioinfo.biocenter.helsinki.f
i:8080/dali/index.html
GenBank Primary nucleotide sequence database in NCBI www.ncbi.nlm.nih.gov/Genban
k
HIV Databases HIV sequence data and related immunologic
information
www.hiv.lanl.gov/content/index
Microarray gene
expression data
base
DNA microarray data and analysis tools www.ebi.ac.uk/microarray
OMIM Genetic information of human diseases www.ncbi.nlm.nih.gov/entrez/q
uery.fcgi?db=OMIM
SRS General sequence retrieval system http://srs6.ebi.ac.uk
PubMed Biomedical literature information www.ncbi.nlm.nih.gov/PubMed
TAIR Arabidopsis information database www.arabidopsis.org
Entrez
NCBI advanced search builder
Main file formats used in Bioinformatics
•GenBank/GenPept
•ASN.1
•EMBL, Swiss Prot
•FASTA
•GCG
•PHYLIP
•PIR
(https://www.ebi.ac.uk/Tools/sfc/emboss_seqret/)
EMBOSS Seqret
Algorithms for pairwise alignments
Web resources
• LALIGN - pairwise sequence alignment
• Global alignment: Needle (EMBOSS): https://www.ebi.ac.uk/Tools/psa/
• Local alignment: Water (EMBOSS): https://www.ebi.ac.uk/Tools/psa/
• The BLAST program was developed by Stephen Altschul of NCBI in 1990
and has since become one of the most popular programs for sequence
analysis
• BLAST uses heuristics to align a query sequence with all sequences in a
database
• The objective is to find high-scoring ungapped segments among related
sequences
• The existence of such segments above a given threshold indicates pairwise
similarity beyond random chance, which helps to discriminate related
sequences from unrelated sequences in a database
BLAST (Basic Local Alignment Search Tool)
(www.ncbi.nlm.nih.gov/BLAST/)
NCBI search for spike surface glycoprotein
BLAST Search against Protein Data Bank
Low Complexity Regions
PSI-BLAST Contd.
• An iterative search in which sequences found in one round of
searching are used to build a score model for the next round of
searching
• An important tool for predicting both biochemical activity and
function
• Identify week homologies (distant relatives of a proteins, which are
not found in FASTA or BLAST.
Information:
https://www.ncbi.nlm.nih.gov/BLAST/tutorial/Altschul-2.html
BLAST QuickStart
Example-Driven Web-Based BLAST Tutorial
https://www.ncbi.nlm.nih.gov/books/NBK1734/
NCBI BLAST tutorial – YouTube
https://www.youtube.com/watch?v=HXEpBnUbAMo
NCBI PSI-BLAST Tutorial - YouTube
https://www.youtube.com/watch?v=T3kHEieyylk
MEME suite 5.1.1: http://meme-suite.org/tools/meme
PROSITE: https://prosite.expasy.org
ScanProsite: https://prosite.expasy.org/scanprosite/
Patterns can be generated from multiple sequences using PRATT
https://web.expasy.org/pratt/
PRATT - ExPASy
PHI-BLAST
PROSITE pattern for the kinase active site, starting from the conserved DRH and
making use of the very conserved DFG region: D-R-H-[NS]-[DS]-N-[IL]-x-[IV]-x-[DEK]-
[DGST]-G-[NQR]-L-F-H-I-D-F-G
The above query sequence and the PROSITE pattern used as inputs for the PHI-
BLAST search (see next slide)
PHI-BLAST

More Related Content

Similar to Hands on training_biological_databases.ppt

Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEPrashantSharma807
 
Data retreival system
Data retreival systemData retreival system
Data retreival systemShikha Thakur
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.pptSanthiyaAK
 
Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdfnedalalazzwy
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databasesfarwa fayaz
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databasesPranavathiyani G
 
Databases, bioinformatics, sequence analysis
Databases, bioinformatics, sequence analysisDatabases, bioinformatics, sequence analysis
Databases, bioinformatics, sequence analysisMohamedHasan816582
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...VHIR Vall d’Hebron Institut de Recerca
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformaticsVinaKhan1
 

Similar to Hands on training_biological_databases.ppt (20)

Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Protein database
Protein databaseProtein database
Protein database
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
 
Protein database
Protein  databaseProtein  database
Protein database
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdf
 
Biological databases
Biological databases Biological databases
Biological databases
 
Structural database and their classification by abdul qahar
Structural database and their classification by abdul qaharStructural database and their classification by abdul qahar
Structural database and their classification by abdul qahar
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databases
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Databases, bioinformatics, sequence analysis
Databases, bioinformatics, sequence analysisDatabases, bioinformatics, sequence analysis
Databases, bioinformatics, sequence analysis
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
 
Databases_L2.pptx
Databases_L2.pptxDatabases_L2.pptx
Databases_L2.pptx
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
 

Recently uploaded

Efficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence accelerationEfficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence accelerationSérgio Sacani
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCherry
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACherry
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Pteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecyclePteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecycleCherry
 
COMPOSTING : types of compost, merits and demerits
COMPOSTING : types of compost, merits and demeritsCOMPOSTING : types of compost, merits and demerits
COMPOSTING : types of compost, merits and demeritsCherry
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptxMuhammadRazzaq31
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...Monika Rani
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry Areesha Ahmad
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
Understanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution MethodsUnderstanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution Methodsimroshankoirala
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cherry
 

Recently uploaded (20)

Efficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence accelerationEfficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence acceleration
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNA
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Pteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecyclePteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecycle
 
COMPOSTING : types of compost, merits and demerits
COMPOSTING : types of compost, merits and demeritsCOMPOSTING : types of compost, merits and demerits
COMPOSTING : types of compost, merits and demerits
 
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Understanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution MethodsUnderstanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution Methods
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 

Hands on training_biological_databases.ppt

  • 1. Biological Databases • Primary databases • Secondary databases • Specialized databases
  • 2. Biological Databases Primary Databases: contains original biological data (raw nucleic acid sequence data produced and submitted by researchers worldwide) -Genbank (https://www.ncbi.nlm.nih.gov/genbank/) -EMBL (European Molecular Biology Laboratory) (https://www.ebi.ac.uk) -DDBJ (DNA Data Bank of Japan) (https://www.ddbj.nig.ac.jp/index-e.html)
  • 3. Secondary Databases: which contain computationally processed sequence information derived from the primary databases -SWISS-PROT: which provides detailed sequence annotation that includes structure, function, and protein family assignment -TrEMBL, a database of translated nucleic acid sequences stored in the EMBL database -UniProt: (SWISS-PROT + TrEMBL + PIR), which has larger coverage than any one of the three databases -Pfam and Blocks: aligned protein sequence information, motifs, patterns DALI: protein secondary structure database that is vital for protein structure classification and threading analysis
  • 4. Major Biological Databases Available Via the World Wide Web SWISS-Prot Curated protein sequence database www.ebi.ac.uk/swissprot/acces s.html AceDB Genome database for Caenorhabditis elegans www.acedb.org DDBJ Primary nucleotide sequence database in Japan www.ddbj.nig.ac.jp EMBL Primary nucleotide sequence database in Europe www.ebi.ac.uk/embl/index.html Entrez NCBI portal for a variety of biological databases www.ncbi.nlm.nih.gov/gquery/g query.fcgi ExPASY Proteomics database http://us.expasy.org/ FlyBase A database of the Drosophila genome http://flybase.bio.indiana.edu/ FSSP Protein secondary structures www.bioinfo.biocenter.helsinki.f i:8080/dali/index.html GenBank Primary nucleotide sequence database in NCBI www.ncbi.nlm.nih.gov/Genban k HIV Databases HIV sequence data and related immunologic information www.hiv.lanl.gov/content/index Microarray gene expression data base DNA microarray data and analysis tools www.ebi.ac.uk/microarray OMIM Genetic information of human diseases www.ncbi.nlm.nih.gov/entrez/q uery.fcgi?db=OMIM SRS General sequence retrieval system http://srs6.ebi.ac.uk PubMed Biomedical literature information www.ncbi.nlm.nih.gov/PubMed TAIR Arabidopsis information database www.arabidopsis.org
  • 6.
  • 7.
  • 9. Main file formats used in Bioinformatics •GenBank/GenPept •ASN.1 •EMBL, Swiss Prot •FASTA •GCG •PHYLIP •PIR
  • 11. Algorithms for pairwise alignments Web resources • LALIGN - pairwise sequence alignment • Global alignment: Needle (EMBOSS): https://www.ebi.ac.uk/Tools/psa/ • Local alignment: Water (EMBOSS): https://www.ebi.ac.uk/Tools/psa/
  • 12. • The BLAST program was developed by Stephen Altschul of NCBI in 1990 and has since become one of the most popular programs for sequence analysis • BLAST uses heuristics to align a query sequence with all sequences in a database • The objective is to find high-scoring ungapped segments among related sequences • The existence of such segments above a given threshold indicates pairwise similarity beyond random chance, which helps to discriminate related sequences from unrelated sequences in a database BLAST (Basic Local Alignment Search Tool) (www.ncbi.nlm.nih.gov/BLAST/)
  • 13.
  • 14. NCBI search for spike surface glycoprotein
  • 15. BLAST Search against Protein Data Bank
  • 17. PSI-BLAST Contd. • An iterative search in which sequences found in one round of searching are used to build a score model for the next round of searching • An important tool for predicting both biochemical activity and function • Identify week homologies (distant relatives of a proteins, which are not found in FASTA or BLAST. Information: https://www.ncbi.nlm.nih.gov/BLAST/tutorial/Altschul-2.html
  • 18. BLAST QuickStart Example-Driven Web-Based BLAST Tutorial https://www.ncbi.nlm.nih.gov/books/NBK1734/ NCBI BLAST tutorial – YouTube https://www.youtube.com/watch?v=HXEpBnUbAMo NCBI PSI-BLAST Tutorial - YouTube https://www.youtube.com/watch?v=T3kHEieyylk
  • 19. MEME suite 5.1.1: http://meme-suite.org/tools/meme
  • 22. Patterns can be generated from multiple sequences using PRATT https://web.expasy.org/pratt/ PRATT - ExPASy
  • 23. PHI-BLAST PROSITE pattern for the kinase active site, starting from the conserved DRH and making use of the very conserved DFG region: D-R-H-[NS]-[DS]-N-[IL]-x-[IV]-x-[DEK]- [DGST]-G-[NQR]-L-F-H-I-D-F-G The above query sequence and the PROSITE pattern used as inputs for the PHI- BLAST search (see next slide)