SlideShare a Scribd company logo
1 of 20
Bioinformatics
• What is bioinformatics?
• Why bioinformatics?
• The major molecular biology facts
• Brief history of bioinformatics
• Typical problems of bioinformatics:
collection and retrieval of data
alignment and similarity search
prediction and classification
• Expectations and the level of requirements
Lecture 1
What is Bioinformatics?
Mathematics
and Statistics
Biology
Computer
Science
A working definition is that of House of
Representatives Standing Committee on Primary
Industries and Regional Services Inquiry :-
"All aspects of gathering, storing, handling,
analyzing, interpreting and spreading vast amounts
of biological information in databases. The
information involved includes gene sequences,
biological activity/function, pharmacological activity,
biological structure, molecular structure, protein-
protein interactions, and gene expression.
Bioinformatics uses powerful computers and
statistical techniques to accomplish research
objectives, for example, to discover a new
pharmaceutical or herbicide."
What is bioinformatics?
• Molecular biology and genetics
• Phylogenetic and evolutionary sciences
• Different aspects of biotechnology including
pharmaceutical and microbiological industries
• Medicine
• Agriculture
•Eco-management
Areas of current and future development of
bioinformatics
• Exponential growth of investments
• Constant deficit of trained professionals
• Diversification of bioinformatics applications
• Need in different types of bioinformaticians
Why bioinformatics?
Central Dogma of Molecular Biology
GENOTYPE (i.e. Aa)
PHENOTYPE (pink)
GENE (DNA)
MESSENGER (RNA)
PROTEIN
TRAIT
ATGCAAGTCCACTGTATTCCA
UACGUUCAGGUGACAUAAGGG
transcription reverse tr
translation
replication
DNA
Symbol Meaning Explanation
G G Guanine
A A Adenine
T T Thymine
C C Cytosine
R A or G puRine
Y C or T pYrimidine
N A, C, G or T Any base
Double helix
5’
3’
3’
5’
A C G T C A T G
T G C A G T A C
RNA
5’ 3’
A C G U C A U G
template
U U Uracil
Genetic Code
1. Amino acids are coded by codons – triplets of
nucleotides, e.g. |ACG|TAT|….
2. There are 43 = 64 codons for ~20 amino acids, the
code is degenerate
3. Codons do not overlap
4. Deletions or insertions of one or few nucleotides (not
equal to 3 x N) usually destroy a message by shifting
a reading frame
5. Three specific codons (stop codons) do not code any
amino acid and are always located at the very end of
the protein coding part of a gene
The genetic code
The 20 amino acids common in living
organisms
PROTEINS
Green Fluorecent Protein (GFP)
1 mcgkkfelki dnvrfvghpt llqpphtiqa sktdpspkre lptmilfsvv falranadas
61 viscmhnlsr riaialqhee rrcqyltrea klmlamqdev ttiidsdgsp qspfrqilpk
121 cklardlkea ydslcttgvv rlhinnwlev sfclphkihr vggkhiplea lerslkairp
Genomic Hierarchy in Eukaryotes
Genome nuclear (1)
Chromosomes (23x2)
DNA molecules (23x2)
Genes (~30,000); only a small fraction of genome
Nucleotides (~3x109)
Eukaryotic genes are complex
Promoter Exon 1 Exon 2 Exon 3 Exon 4
Start codon Intron 1 Intron 2 Intron 3 Stop codon
Protein coding regions
• The first biological database - Protein Identification Resource
was established in 1972 by Margaret Dayhoff
• Dayhoff and co-workers organized the proteins into families and
superfamilies based on degree of sequence similarity
• Idea of sequence alignment was introduced as well as special
tables that reflected the frequency of changes observed in the
sequences of a group of closely related proteins
• Currently there are several huge Protein Banks : SwissProt, PIR
International, etc.
• The first DNA database was established in 1979. Currently there
are several powerful databases: GenBank, EMBL, DDBJ, etc.
Brief history of bioinformatics: Databases
Brief history of bioinformatics:
evolutionary reconsructions
Brief history of bioinformatics: other
important steps
• Development of sequence retrieval methods (1970-80s)
• Development of principles of sequence alignment (1980s)
• Prediction of RNA secondary structure (1980s)
• Prediction of protein secondary structure and 3D (1980-90s)
• The FASTA and BLAST methods for DB search (1980-90s)
• Prediction of genes (1990s)
• Studies of complete genome sequences (late 1990s –2000s)
Collection and retrieval of data.
Alignment methods.
• Sequencing (DNA, proteins)
• Submission of sequences to the databases
• Computer storage of sequences
• Development of sequence formats
• Conversion of one sequence format to another
• Development of retrieval and alignment methods
Prediction, reconstruction and
classification
• Prediction of secondary and 3D structure of RNA and proteins
• Gene prediction in prokaryotes and eukaryotes
• Prediction of promoters and other functional sites
• Reconstruction of phylogeny
• Genome analysis
• Classification of proteins and genes
Prediction of RNA secondary structure:
an example
A. Single stranded RNA 5’ 3’
5’
3’
B. Stem and loop or hairpin loop
Expectations of students’ performance
• Basic understanding of general principles of molecular biology
• Some mathematical and computer science background
• Focus on using computational methods and understanding
general ideas of analysis used in bioinformatics
• Formal description of algorithms and complex methodology
will not be the core elements of this unit
• The core requirement is understanding of foundations of
bioinformatics and “hands on” approach

More Related Content

Similar to BIOINFORMATICS.ppt

DNA, CHROMOSOMES & GENES
DNA, CHROMOSOMES & GENESDNA, CHROMOSOMES & GENES
DNA, CHROMOSOMES & GENESjagan vana
 
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
Lecture 1 Introduction to Bioinformatics BCH 433.pptLecture 1 Introduction to Bioinformatics BCH 433.ppt
Lecture 1 Introduction to Bioinformatics BCH 433.pptKelechiChukwuemeka
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSsandeshGM
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics Senthil Natesan
 
BASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxBASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxDevaprasadPanda
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxDESMONDEZIEKE1
 
1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymesGetachew Birhanu
 
Shriram belge (exome sequencing) 27 2003
Shriram belge (exome sequencing) 27  2003Shriram belge (exome sequencing) 27  2003
Shriram belge (exome sequencing) 27 2003Shriram Belge
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformaticsjaumebp
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptxSilpa87
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfkigaruantony
 
Introduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxIntroduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxRAJESHKUMAR428748
 
introduction of Bioinformatics
introduction of Bioinformaticsintroduction of Bioinformatics
introduction of BioinformaticsVinaKhan1
 
Pcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture iPcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture iMuhammad Younis
 
Basics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxBasics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxMohdkaifkhan18
 

Similar to BIOINFORMATICS.ppt (20)

DNA, CHROMOSOMES & GENES
DNA, CHROMOSOMES & GENESDNA, CHROMOSOMES & GENES
DNA, CHROMOSOMES & GENES
 
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
Lecture 1 Introduction to Bioinformatics BCH 433.pptLecture 1 Introduction to Bioinformatics BCH 433.ppt
Lecture 1 Introduction to Bioinformatics BCH 433.ppt
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
 
BASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxBASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptx
 
Genomics experimental-methods
Genomics experimental-methodsGenomics experimental-methods
Genomics experimental-methods
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptx
 
1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes1.introduction to genetic engineering and restriction enzymes
1.introduction to genetic engineering and restriction enzymes
 
2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented
 
Shriram belge (exome sequencing) 27 2003
Shriram belge (exome sequencing) 27  2003Shriram belge (exome sequencing) 27  2003
Shriram belge (exome sequencing) 27 2003
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdf
 
Molecular profiling 2013
Molecular profiling 2013Molecular profiling 2013
Molecular profiling 2013
 
Introduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptxIntroduction to Biological database ppt(1).pptx
Introduction to Biological database ppt(1).pptx
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
introduction of Bioinformatics
introduction of Bioinformaticsintroduction of Bioinformatics
introduction of Bioinformatics
 
Pcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture iPcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture i
 
Basics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxBasics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptx
 

Recently uploaded

EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxAnaBeatriceAblay2
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonJericReyAuditor
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 

Recently uploaded (20)

EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lesson
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 

BIOINFORMATICS.ppt

  • 1. Bioinformatics • What is bioinformatics? • Why bioinformatics? • The major molecular biology facts • Brief history of bioinformatics • Typical problems of bioinformatics: collection and retrieval of data alignment and similarity search prediction and classification • Expectations and the level of requirements Lecture 1
  • 2. What is Bioinformatics? Mathematics and Statistics Biology Computer Science
  • 3. A working definition is that of House of Representatives Standing Committee on Primary Industries and Regional Services Inquiry :- "All aspects of gathering, storing, handling, analyzing, interpreting and spreading vast amounts of biological information in databases. The information involved includes gene sequences, biological activity/function, pharmacological activity, biological structure, molecular structure, protein- protein interactions, and gene expression. Bioinformatics uses powerful computers and statistical techniques to accomplish research objectives, for example, to discover a new pharmaceutical or herbicide." What is bioinformatics?
  • 4. • Molecular biology and genetics • Phylogenetic and evolutionary sciences • Different aspects of biotechnology including pharmaceutical and microbiological industries • Medicine • Agriculture •Eco-management Areas of current and future development of bioinformatics
  • 5. • Exponential growth of investments • Constant deficit of trained professionals • Diversification of bioinformatics applications • Need in different types of bioinformaticians Why bioinformatics?
  • 6. Central Dogma of Molecular Biology GENOTYPE (i.e. Aa) PHENOTYPE (pink) GENE (DNA) MESSENGER (RNA) PROTEIN TRAIT ATGCAAGTCCACTGTATTCCA UACGUUCAGGUGACAUAAGGG transcription reverse tr translation replication
  • 7. DNA Symbol Meaning Explanation G G Guanine A A Adenine T T Thymine C C Cytosine R A or G puRine Y C or T pYrimidine N A, C, G or T Any base Double helix 5’ 3’ 3’ 5’ A C G T C A T G T G C A G T A C RNA 5’ 3’ A C G U C A U G template U U Uracil
  • 8. Genetic Code 1. Amino acids are coded by codons – triplets of nucleotides, e.g. |ACG|TAT|…. 2. There are 43 = 64 codons for ~20 amino acids, the code is degenerate 3. Codons do not overlap 4. Deletions or insertions of one or few nucleotides (not equal to 3 x N) usually destroy a message by shifting a reading frame 5. Three specific codons (stop codons) do not code any amino acid and are always located at the very end of the protein coding part of a gene
  • 10. The 20 amino acids common in living organisms
  • 11. PROTEINS Green Fluorecent Protein (GFP) 1 mcgkkfelki dnvrfvghpt llqpphtiqa sktdpspkre lptmilfsvv falranadas 61 viscmhnlsr riaialqhee rrcqyltrea klmlamqdev ttiidsdgsp qspfrqilpk 121 cklardlkea ydslcttgvv rlhinnwlev sfclphkihr vggkhiplea lerslkairp
  • 12. Genomic Hierarchy in Eukaryotes Genome nuclear (1) Chromosomes (23x2) DNA molecules (23x2) Genes (~30,000); only a small fraction of genome Nucleotides (~3x109)
  • 13. Eukaryotic genes are complex Promoter Exon 1 Exon 2 Exon 3 Exon 4 Start codon Intron 1 Intron 2 Intron 3 Stop codon Protein coding regions
  • 14. • The first biological database - Protein Identification Resource was established in 1972 by Margaret Dayhoff • Dayhoff and co-workers organized the proteins into families and superfamilies based on degree of sequence similarity • Idea of sequence alignment was introduced as well as special tables that reflected the frequency of changes observed in the sequences of a group of closely related proteins • Currently there are several huge Protein Banks : SwissProt, PIR International, etc. • The first DNA database was established in 1979. Currently there are several powerful databases: GenBank, EMBL, DDBJ, etc. Brief history of bioinformatics: Databases
  • 15. Brief history of bioinformatics: evolutionary reconsructions
  • 16. Brief history of bioinformatics: other important steps • Development of sequence retrieval methods (1970-80s) • Development of principles of sequence alignment (1980s) • Prediction of RNA secondary structure (1980s) • Prediction of protein secondary structure and 3D (1980-90s) • The FASTA and BLAST methods for DB search (1980-90s) • Prediction of genes (1990s) • Studies of complete genome sequences (late 1990s –2000s)
  • 17. Collection and retrieval of data. Alignment methods. • Sequencing (DNA, proteins) • Submission of sequences to the databases • Computer storage of sequences • Development of sequence formats • Conversion of one sequence format to another • Development of retrieval and alignment methods
  • 18. Prediction, reconstruction and classification • Prediction of secondary and 3D structure of RNA and proteins • Gene prediction in prokaryotes and eukaryotes • Prediction of promoters and other functional sites • Reconstruction of phylogeny • Genome analysis • Classification of proteins and genes
  • 19. Prediction of RNA secondary structure: an example A. Single stranded RNA 5’ 3’ 5’ 3’ B. Stem and loop or hairpin loop
  • 20. Expectations of students’ performance • Basic understanding of general principles of molecular biology • Some mathematical and computer science background • Focus on using computational methods and understanding general ideas of analysis used in bioinformatics • Formal description of algorithms and complex methodology will not be the core elements of this unit • The core requirement is understanding of foundations of bioinformatics and “hands on” approach