Introduction to bioinformatics.pptx

Introduction to bioinformatics
Morteza Ghandadi
PhD. of Pharmaceutical Biotechnology
Mazandaran university of medical sciences
1

Bioinformatics is an interdisciplinary field
that develops methods and software tools
for understanding biological data
3

History
• The first protein was sequenced (1955)
• Margaret O.Dayhoff
• Atlas of Protein Sequence and Structure (1965)
• Paulien Hogeweg and Ben Hesper coined “bioinformatics” in 1970
• The protein data bank (1973)
• DNA sequencing (1977)
• NCBI( 1988)
• Human genome project lunched (1990)
• The human genome is published (2001) https://www.smithsonianmag.com/science-
nature/how-margaret-dayhoff-helped-bring-
computing-scientific-research-180971904/
4

Unites Of Information In Biological Macromolecules
DNA
Sequence analysis
Mutation and polymorphism studies
Identification of regulatory regions
Gene finding
Genome annotation
Comparative genomics
RNA
RNA sequencing
Splice variants
Tissue expression level
MicroArray
Single gene analysis
Sequence contigs
Protein
Homology modeling
Structure function prediction
Ligand docking
Protein-protein interaction
Protein expression
Phylogenic analysis 7

Applications of Bioinformatics
8

Develop templates to develop potent drug molecules
• Structural analysis of secretory phospholipase A2 from Clonorchis sinensis
https://link.springer.com/article/10.1007/s00894-011-1333-8
9

Phylogenic analysis
https://link.springer.com/article/10.1007/s00894-013-1913-x 11

Personalized medicine
https://link.springer.com/article/10.1007/s12291-013-0377-1 12

Protein interaction studies and pathway analysis
https://pubmed.ncbi.nlm.nih.gov/26997873/ 13

Deciphering the molecular basis of disease
https://pubmed.ncbi.nlm.nih.gov/27585677/
14

Pharmacophore modeling and docking
15

Pharmacophore modeling and docking
16

Pathogenesis of drug toxicity
https://link.springer.com/article/10.1007/s00894-011-1289-8 17

Vaccine designing and development
18

Tumor antigens
• The ideal antigen for a cancer vaccine should be highly immunogenic,
explicitly expressed in all cancer cells (not in normal cells) and
necessary for the survival of cancer cells
• Tumor-associated antigens (TAAs)
• Tumor-specific antigens (TSAs)

Bioinformatics journals
• Briefings in Bioinformatics
• Bioinformatics
• Genomics, Proteomics & Bioinformatics
• Current Bioinformatics
• BMC Bioinformatics
• Computers in Biology and Medicine
• Journal of Computer-Aided Molecular Design
• In Silico Biology
• Journal of molecular modeling
22

Bioinformatics for beginners
23

Gene
• A searchable database of genes, focusing on genomes that have been
completely sequenced and that have an active research community to
contribute gene-specific data. Information includes nomenclature,
chromosomal localization, gene products and their attributes (e.g.,
protein interactions), associated markers, phenotypes, interactions,
and links to citations, sequences, variation details, maps, expression
reports, homologs, protein domain content, and external databases
26

Genome
• Contains sequence and map data from the whole genomes of over
1000 organisms. The genomes represent both completely sequenced
organisms and those for which sequencing is in progress. All three
main domains of life (bacteria, archaea, and eukaryota) are
represented, as well as many viruses, phages, viroids, plasmids, and
organelles.
28

Nucleotide
• A collection of nucleotide sequences from several sources, including
GenBank, RefSeq, the Third Party Annotation (TPA) database, and
PDB. Searching the Nucleotide Database will yield available results
from each of its component databases.
30

RefSeq: NCBI Reference Sequence Database
• A comprehensive, integrated, non-redundant, well-annotated set of
reference sequences including genomic, transcript, and protein.
32

What is the difference between RefSeq and GenBank?
• GenBank sequence records are owned by the original submitter and
cannot be altered by a third party. RefSeq sequences are not part of
the INSDC but are derived from INSDC sequences to provide non-
redundant curated data representing our current knowledge of
known genes
33

BLAST
• The Basic Local Alignment Search Tool (BLAST) finds regions of local
similarity between sequences. The program compares nucleotide or
protein sequences to sequence databases and calculates the statistical
significance of matches. BLAST can be used to infer functional and
evolutionary relationships between sequences as well as help identify
members of gene families.
34

Protein
• The Protein database is a collection of sequences from several sources,
including translations from annotated coding regions in GenBank,
RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
Protein sequences are the fundamental determinants of biological
structure and function.
36

UniProt
• Universal Protein Resource (UniProt) is a comprehensive resource for
protein sequence and annotation data. The UniProt databases are the
UniProt Knowledgebase (UniProtKB), the UniProt Reference Clusters
(UniRef), and the UniProt Archive (UniParc). UniProt is a collaboration
between the European Bioinformatics Institute (EMBL-EBI), the SIB Swiss
Institute of Bioinformatics and the Protein Information Resource (PIR).
37

Protein Data Bank (PDB)
• The Protein Data Bank (PDB) is a database for the three-dimensional
structural data of large biological molecules, such as proteins and
nucleic acids.
41

Introduction to bioinformatics.pptx

Recommended

Recommended

More Related Content

Similar to Introduction to bioinformatics.pptx

Similar to Introduction to bioinformatics.pptx (20)

Recently uploaded

Recently uploaded (20)

Introduction to bioinformatics.pptx

Editor's Notes