SlideShare a Scribd company logo
Biological Databases
• Primary databases
• Secondary databases
• Specialized databases
Biological Databases
Primary Databases: contains original biological data
(raw nucleic acid sequence data produced and submitted
by researchers worldwide)
-Genbank (https://www.ncbi.nlm.nih.gov/genbank/)
-EMBL (European Molecular Biology Laboratory)
(https://www.ebi.ac.uk)
-DDBJ (DNA Data Bank of Japan)
(https://www.ddbj.nig.ac.jp/index-e.html)
Secondary Databases: which contain computationally processed
sequence information derived from the primary databases
-SWISS-PROT: which provides detailed sequence annotation that
includes structure, function, and protein family assignment
-TrEMBL, a database of translated nucleic acid sequences stored
in the EMBL database
-UniProt: (SWISS-PROT + TrEMBL + PIR), which has larger
coverage than any one of the three databases
-Pfam and Blocks: aligned protein sequence information, motifs,
patterns
DALI: protein secondary structure database that is vital for protein
structure classification and threading analysis
Major Biological Databases Available Via the
World Wide Web
SWISS-Prot Curated protein sequence database www.ebi.ac.uk/swissprot/acces
s.html
AceDB Genome database for Caenorhabditis elegans www.acedb.org
DDBJ Primary nucleotide sequence database in Japan www.ddbj.nig.ac.jp
EMBL Primary nucleotide sequence database in Europe www.ebi.ac.uk/embl/index.html
Entrez NCBI portal for a variety of biological databases www.ncbi.nlm.nih.gov/gquery/g
query.fcgi
ExPASY Proteomics database http://us.expasy.org/
FlyBase A database of the Drosophila genome http://flybase.bio.indiana.edu/
FSSP Protein secondary structures www.bioinfo.biocenter.helsinki.f
i:8080/dali/index.html
GenBank Primary nucleotide sequence database in NCBI www.ncbi.nlm.nih.gov/Genban
k
HIV Databases HIV sequence data and related immunologic
information
www.hiv.lanl.gov/content/index
Microarray gene
expression data
base
DNA microarray data and analysis tools www.ebi.ac.uk/microarray
OMIM Genetic information of human diseases www.ncbi.nlm.nih.gov/entrez/q
uery.fcgi?db=OMIM
SRS General sequence retrieval system http://srs6.ebi.ac.uk
PubMed Biomedical literature information www.ncbi.nlm.nih.gov/PubMed
TAIR Arabidopsis information database www.arabidopsis.org
Entrez
NCBI advanced search builder
Main file formats used in Bioinformatics
•GenBank/GenPept
•ASN.1
•EMBL, Swiss Prot
•FASTA
•GCG
•PHYLIP
•PIR
(https://www.ebi.ac.uk/Tools/sfc/emboss_seqret/)
EMBOSS Seqret
Algorithms for pairwise alignments
Web resources
• LALIGN - pairwise sequence alignment
• Global alignment: Needle (EMBOSS): https://www.ebi.ac.uk/Tools/psa/
• Local alignment: Water (EMBOSS): https://www.ebi.ac.uk/Tools/psa/
• The BLAST program was developed by Stephen Altschul of NCBI in 1990
and has since become one of the most popular programs for sequence
analysis
• BLAST uses heuristics to align a query sequence with all sequences in a
database
• The objective is to find high-scoring ungapped segments among related
sequences
• The existence of such segments above a given threshold indicates pairwise
similarity beyond random chance, which helps to discriminate related
sequences from unrelated sequences in a database
BLAST (Basic Local Alignment Search Tool)
(www.ncbi.nlm.nih.gov/BLAST/)
NCBI search for spike surface glycoprotein
BLAST Search against Protein Data Bank
Low Complexity Regions
PSI-BLAST Contd.
• An iterative search in which sequences found in one round of
searching are used to build a score model for the next round of
searching
• An important tool for predicting both biochemical activity and
function
• Identify week homologies (distant relatives of a proteins, which are
not found in FASTA or BLAST.
Information:
https://www.ncbi.nlm.nih.gov/BLAST/tutorial/Altschul-2.html
BLAST QuickStart
Example-Driven Web-Based BLAST Tutorial
https://www.ncbi.nlm.nih.gov/books/NBK1734/
NCBI BLAST tutorial – YouTube
https://www.youtube.com/watch?v=HXEpBnUbAMo
NCBI PSI-BLAST Tutorial - YouTube
https://www.youtube.com/watch?v=T3kHEieyylk
MEME suite 5.1.1: http://meme-suite.org/tools/meme
PROSITE: https://prosite.expasy.org
ScanProsite: https://prosite.expasy.org/scanprosite/
Patterns can be generated from multiple sequences using PRATT
https://web.expasy.org/pratt/
PRATT - ExPASy
PHI-BLAST
PROSITE pattern for the kinase active site, starting from the conserved DRH and
making use of the very conserved DFG region: D-R-H-[NS]-[DS]-N-[IL]-x-[IV]-x-[DEK]-
[DGST]-G-[NQR]-L-F-H-I-D-F-G
The above query sequence and the PROSITE pattern used as inputs for the PHI-
BLAST search (see next slide)
PHI-BLAST

More Related Content

Similar to Hands on training_biological_databases.ppt

Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
PrashantSharma807
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
Vidya Kalaivani Rajkumar
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
Shikha Thakur
 
Protein Databases
Protein DatabasesProtein Databases
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
Saramita De Chakravarti
 
Protein database
Protein databaseProtein database
Protein database
Rajpal Choudhary
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
SanthiyaAK
 
Protein database
Protein  databaseProtein  database
Protein database
KAUSHAL SAHU
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
Hafiz Muhammad Zeeshan Raza
 
Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdf
nedalalazzwy
 
Biological databases
Biological databases Biological databases
Biological databases
SEKHARREDDYAMBATI
 
Structural database and their classification by abdul qahar
Structural database and their classification by abdul qaharStructural database and their classification by abdul qahar
Structural database and their classification by abdul qahar
Abdul Qahar {{Abdul Wali Khan University Mardan}} (Buner Campus)
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databases
farwa fayaz
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
Pranavathiyani G
 
Databases, bioinformatics, sequence analysis
Databases, bioinformatics, sequence analysisDatabases, bioinformatics, sequence analysis
Databases, bioinformatics, sequence analysis
MohamedHasan816582
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
VHIR Vall d’Hebron Institut de Recerca
 
Databases_L2.pptx
Databases_L2.pptxDatabases_L2.pptx
Databases_L2.pptx
kigaruantony
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
Elufer Akram
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
VinaKhan1
 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
PagudalaSangeetha
 

Similar to Hands on training_biological_databases.ppt (20)

Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Protein database
Protein databaseProtein database
Protein database
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
 
Protein database
Protein  databaseProtein  database
Protein database
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdf
 
Biological databases
Biological databases Biological databases
Biological databases
 
Structural database and their classification by abdul qahar
Structural database and their classification by abdul qaharStructural database and their classification by abdul qahar
Structural database and their classification by abdul qahar
 
Nucleic Acid Sequence Databases
Nucleic Acid Sequence DatabasesNucleic Acid Sequence Databases
Nucleic Acid Sequence Databases
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Databases, bioinformatics, sequence analysis
Databases, bioinformatics, sequence analysisDatabases, bioinformatics, sequence analysis
Databases, bioinformatics, sequence analysis
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
 
Databases_L2.pptx
Databases_L2.pptxDatabases_L2.pptx
Databases_L2.pptx
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
 

Recently uploaded

(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
Scintica Instrumentation
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
frank0071
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
Advanced-Concepts-Team
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
PirithiRaju
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Selcen Ozturkcan
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
PirithiRaju
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
Areesha Ahmad
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
hozt8xgk
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
Shashank Shekhar Pandey
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 

Recently uploaded (20)

(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 

Hands on training_biological_databases.ppt

  • 1. Biological Databases • Primary databases • Secondary databases • Specialized databases
  • 2. Biological Databases Primary Databases: contains original biological data (raw nucleic acid sequence data produced and submitted by researchers worldwide) -Genbank (https://www.ncbi.nlm.nih.gov/genbank/) -EMBL (European Molecular Biology Laboratory) (https://www.ebi.ac.uk) -DDBJ (DNA Data Bank of Japan) (https://www.ddbj.nig.ac.jp/index-e.html)
  • 3. Secondary Databases: which contain computationally processed sequence information derived from the primary databases -SWISS-PROT: which provides detailed sequence annotation that includes structure, function, and protein family assignment -TrEMBL, a database of translated nucleic acid sequences stored in the EMBL database -UniProt: (SWISS-PROT + TrEMBL + PIR), which has larger coverage than any one of the three databases -Pfam and Blocks: aligned protein sequence information, motifs, patterns DALI: protein secondary structure database that is vital for protein structure classification and threading analysis
  • 4. Major Biological Databases Available Via the World Wide Web SWISS-Prot Curated protein sequence database www.ebi.ac.uk/swissprot/acces s.html AceDB Genome database for Caenorhabditis elegans www.acedb.org DDBJ Primary nucleotide sequence database in Japan www.ddbj.nig.ac.jp EMBL Primary nucleotide sequence database in Europe www.ebi.ac.uk/embl/index.html Entrez NCBI portal for a variety of biological databases www.ncbi.nlm.nih.gov/gquery/g query.fcgi ExPASY Proteomics database http://us.expasy.org/ FlyBase A database of the Drosophila genome http://flybase.bio.indiana.edu/ FSSP Protein secondary structures www.bioinfo.biocenter.helsinki.f i:8080/dali/index.html GenBank Primary nucleotide sequence database in NCBI www.ncbi.nlm.nih.gov/Genban k HIV Databases HIV sequence data and related immunologic information www.hiv.lanl.gov/content/index Microarray gene expression data base DNA microarray data and analysis tools www.ebi.ac.uk/microarray OMIM Genetic information of human diseases www.ncbi.nlm.nih.gov/entrez/q uery.fcgi?db=OMIM SRS General sequence retrieval system http://srs6.ebi.ac.uk PubMed Biomedical literature information www.ncbi.nlm.nih.gov/PubMed TAIR Arabidopsis information database www.arabidopsis.org
  • 6.
  • 7.
  • 9. Main file formats used in Bioinformatics •GenBank/GenPept •ASN.1 •EMBL, Swiss Prot •FASTA •GCG •PHYLIP •PIR
  • 11. Algorithms for pairwise alignments Web resources • LALIGN - pairwise sequence alignment • Global alignment: Needle (EMBOSS): https://www.ebi.ac.uk/Tools/psa/ • Local alignment: Water (EMBOSS): https://www.ebi.ac.uk/Tools/psa/
  • 12. • The BLAST program was developed by Stephen Altschul of NCBI in 1990 and has since become one of the most popular programs for sequence analysis • BLAST uses heuristics to align a query sequence with all sequences in a database • The objective is to find high-scoring ungapped segments among related sequences • The existence of such segments above a given threshold indicates pairwise similarity beyond random chance, which helps to discriminate related sequences from unrelated sequences in a database BLAST (Basic Local Alignment Search Tool) (www.ncbi.nlm.nih.gov/BLAST/)
  • 13.
  • 14. NCBI search for spike surface glycoprotein
  • 15. BLAST Search against Protein Data Bank
  • 17. PSI-BLAST Contd. • An iterative search in which sequences found in one round of searching are used to build a score model for the next round of searching • An important tool for predicting both biochemical activity and function • Identify week homologies (distant relatives of a proteins, which are not found in FASTA or BLAST. Information: https://www.ncbi.nlm.nih.gov/BLAST/tutorial/Altschul-2.html
  • 18. BLAST QuickStart Example-Driven Web-Based BLAST Tutorial https://www.ncbi.nlm.nih.gov/books/NBK1734/ NCBI BLAST tutorial – YouTube https://www.youtube.com/watch?v=HXEpBnUbAMo NCBI PSI-BLAST Tutorial - YouTube https://www.youtube.com/watch?v=T3kHEieyylk
  • 19. MEME suite 5.1.1: http://meme-suite.org/tools/meme
  • 22. Patterns can be generated from multiple sequences using PRATT https://web.expasy.org/pratt/ PRATT - ExPASy
  • 23. PHI-BLAST PROSITE pattern for the kinase active site, starting from the conserved DRH and making use of the very conserved DFG region: D-R-H-[NS]-[DS]-N-[IL]-x-[IV]-x-[DEK]- [DGST]-G-[NQR]-L-F-H-I-D-F-G The above query sequence and the PROSITE pattern used as inputs for the PHI- BLAST search (see next slide)