SlideShare a Scribd company logo
1 of 53
Introduction to Bioinformatics
Molecular Biology Primer
1
Genetic Material
• DNA (deoxyribonucleic acid) is the genetic
material
• Information stored in DNA
– the basis of inheritance
– distinguishes living things from nonliving
things
• Genes
– various units that govern living thing’s
characteristics at the genetic level
2
Nucleotides
• Genes themselves contain their information as a specific
sequence of nucleotides found in DNA molecules
• Only four different bases in DNA molecules
– Guanine (G)
– Adenine (A)
– Thymine (T)
– Cytosine (C)
• Each base is attached to a phosphate group and a
deoxyribose sugar to form a nucleotide.
• The only thing that makes one nucleotide different from
another is which nitrogenous base it contains
Sugar
P
Base
3
Nucleoside
Purine:
Pyrimidine:
4
Nucleotides
• Complicated genes can be many
thousands of nucleotides long
• All of an organism’s genetic instructions,
its genome, can be maintained in millions
or even billions of nucleotides
5
Orientation
• Strings of nucleotides can be attached to
each other to make long polynucleotide
chains
• 5’ (5 prime) end
– The end of a string of nucleotides with a 5'
carbon not attached to another nucleotide
• 3’ (3 prime) end
– The other end of the molecule with an
unattached 3' carbon
6
1’
2’
3’
4’
5’
7
Base Pairing
• Structure of DNA
– Double helix
– Seminal paper by Watson and Crick in 1953
– Rosalind Franklin’s contribution
• Information content on one of those strands
essentially redundant with the information on the
other
– Not exactly the same—it is complementary
• Base pair
– G paired with C (G  C)
– A paired with T (A = T)
8
9
Base Pairing
• Reverse complements
– 5' end of one strand corresponding to the 3' end of its
complementary strand and vice versa
• Example
– one strand: 5'-GTATCC-3'
the other strand: 3'-CATAGG-5'  5'-GGATAC-3'
• Upstream: Sequence features that are 5' to a
particular reference point
• Downstream: Sequence features that are 3' to a
particular reference point
5' 3'
Upstream Downstream
10
DNA Structure
11
DNA Structure
12
Chromosome
• Threadlike "packages" of genes and other
DNA in the nucleus of a cell
13
14
Chromosome
• Different kinds of organisms have different
numbers of chromosomes
• Humans
– 23 pairs
– 46 in all
15
Central Dogma of Molecular
Biology
• DNA: information storage
• Protein: function unit, such as enzyme
• Gene: instructions needed to make protein
• Central dogma
16
Central Dogma of Molecular
Biology
• Central dogma
reverse transcription
(reverse transcriptase)
replication
(DNA polymerase)
• DNA obtained from reverse transcription is
called complementary DNA (cDNA)
 Difference between DNA and cDNA will be
discussed later 17
Central Dogma of Molecular
Biology
• RNA (ribonucleic acid)
– Single-stranded polynucleotide
– Bases
• A
• G
• C
• U (uracil), instead of T
• Transcription (simplified …)
– A  A, G G, C  C, T  U
Sugar
P
Base
Sugar
P
Base
H
OH
DNA
RNA
18
19
20
DNA Replication (DNA  DNA)
21
DNA Replication (DNA  DNA)
22
DNA Replication Animation
Courtesy of Rob Rutherford, St. Olaf University
23
Transcription (DNA  RNA)
• Messenger RNA (mRNA)
– carries information to be
translated
• Ribosomal RNA (rRNA)
– the working “spine” of
the ribosome
• Transfer RNA (tRNA)
– the “decoder keys” that
will translate nucleic
acids to amino acids
24
Transcription Animation
Courtesy of Rob Rutherford, St. Olaf University
25
Peptides and Proteins
• mRNA  Sequence of amino
acids connected by peptide
bond
• Amino acid sequence
– Peptide: < 30 – 50 amino acids
– Protein: longer peptide
26
27
28
29
Genetic Code – Codon
Stop
codons
Start
codon
Codon:
3-base
RNA
sequence
List of Amino Acids
Amino acid Symbol Codon
A Alanine Ala GC*
C Cysteine Cys UGU, UGC
D Aspartic Acid Asp GAU, GAC
E Glutamic Acid Glu GAA, GAG
F Phenylalanine Phe UUU, UUC
G Glycine Gly GG*
H Histidine His CAU, CAC
I Isoleucine Ile AUU, AUC, AUA
K Lysine Lys AAA, AAG
L Leucine Leu UUA, UUG, CU*
30
List of Amino Acids
Amino acid Symbol Codon
M Methionine Met AUG
N Asparagine Asn AAU, AAC
P Proline Pro CC*
Q Glutamine Gln CAA, CAG
R Arginine Arg CG*, AGA, AGG
S Serine Ser UC*, AGU, AGC
T Threonine Thr AC*
V Valine Val GU*
W Tryptophan Trp UGG
Y Tyrosine Tyr UAU, UAC
20 letters, no B J O U X Z 31
Codon and Reading Frame
• 4 AA letters  43 = 64 triplet possibilities
• 20 (< 64) known amino acids
• Wobbling 3rd base
• Redundant  Resistant to mutation
• Reading frame: linear sequence of codons in a
gene
• Open Reading Frame (ORF), definition varies:
– a reading frame that begins with a start codon and
end at a stop codon
– a series of codons in a DNA sequence uninterrupted
by the presence of a stop codon
 a potential protein-coding region of DNA sequence
32
Open Reading Frame
• Given a nucleotide sequence
– How many reading frames? __
• __ forward and __ backward
• Example: Given a DNA sequence,
5’-ATGACCGTGGGCTCTTAA-3’
– ATG ACC GTG GGC TCT TAA  M T V G S *
– TGA CCG TGG GCT CTT AA  * P W A L
– GAC CGT GGG CTC TTA A  D R G L L
– Figure out the three backward reading frames
• In random sequence, a stop codon will follow a Met in
~20 AAs
• Substantially longer ORFs are often genes or parts of
them
33
Translation (RNA  Protein)
34
Translation Animation
Courtesy of Rob Rutherford, St. Olaf University
35
Gene Expression
• Gene expression
– Process of using the information stored in
DNA to make an RNA molecule and then a
corresponding protein
• Cells controlling gene expression by
– reliably distinguishing between those parts of
an organism’s genome that correspond to the
beginnings of genes and those that do not
– determining which genes code for proteins
that are needed at any particular time.
36
Promoter
• The probability (P) that a string of nucleotides will occur
by chance alone if all nucleotides are present at the
same frequency P = (1/4)n, where n is the string’s length
• Promoter sequences
– Sequences recognized by RNA polymerases as being
associated with a gene
• Example
– Prokaryotic RNA polymerases scan along DNA looking for a
specific set of approximately 13 nucleotides marking the
beginning of genes
– 1 nucleotide that serves as a transcriptional start site
– 6 that are 10 nucleotides 5' to the start site, and
– 6 more that are 35 nucleotides 5' to the start site
– What is the frequency for the sequence to occur?
37
Gene Regulation
• Regulatory proteins
– Capable of binding to a cell’s DNA near the promoter
of the genes
– Control gene expression in some circumstances but
not in others
• Positive regulation
– binding of regulatory proteins makes it easier for an
RNA polymerase to initiate transcription
• Negative regulation
– binding of the regulatory proteins prevents
transcription from occurring
38
Promoter and Regulatory Example
• Low tryptophan concentration
 RNA polymerase binds to promoter
 genes transcribed
• High tryptophan concentration
 repressor protein becomes active and binds to operator
 blocks the binding of RNA polymerase to the promoter
• Tryptophan concentration drops
 repressor releases its tryptophan and is released from DNA
 polymerase again transcribes genes
39
Gene Structure
40
Exons and Introns
41
Exons and Introns Example
42
Protein Structure and Function
• Genes encode the recipes for proteins
43
44
Protein Structure and Function
• Proteins are amino acid polymers
Proteins: Molecular Machines
 Proteins in your muscles allows you to move:
myosin
and
actin
45
Proteins: Molecular Machines
 Digestion, catalysis
(enzymes)
 Structure (collagen)
46
Proteins: Molecular Machines
 Signaling
(hormones,
kinases)
 Transport
(energy,
oxygen)
47
Protein
Structures
48
Information Flow in Nucleated Cell
49
Point Mutation Example:
Sickle-cell Disease
• Wild-type hemoglobin
DNA
3’----CTT----5’
mRNA
5’----GAA----3’
Normal hemoglobin
------[Glu]------
• Mutant hemoglobin
DNA
3’----CAT----5’
mRNA
5’----GUA----3’
Mutant hemoglobin
------[Val]------
50
image credit: U.S. Department of Energy Human Genome Program, http://www.ornl.gov/hgmis.
51
50% is high copy number repeats
About 10% is transcribed
(made into RNA)
Only 1.5% actually codes for protein
98.5% Junk DNA
Thinking about the Human
Genome
52
Thinking about the Human
Genome
~ 3 X 109 bps
(3 billion base pairs)
If each base were one mm long…
2000 miles, across the center of Africa
Average gene about 30 meters long
Occur about every 270 meters between them
Once spliced the message would only be
~1 meter long
53

More Related Content

Similar to BioPrimer.ppt

Nucleic Acid Chemistry Of Purine and Pyrimidine Ribonucleotide Part I MD1 By ...
Nucleic Acid Chemistry Of Purine and Pyrimidine Ribonucleotide Part I MD1 By ...Nucleic Acid Chemistry Of Purine and Pyrimidine Ribonucleotide Part I MD1 By ...
Nucleic Acid Chemistry Of Purine and Pyrimidine Ribonucleotide Part I MD1 By ...princeprefa
 
104 Genetics and cellular functionLearning Objective.docx
104 Genetics and cellular functionLearning Objective.docx104 Genetics and cellular functionLearning Objective.docx
104 Genetics and cellular functionLearning Objective.docxaulasnilda
 
Genes and proteins updated
Genes and proteins updatedGenes and proteins updated
Genes and proteins updatedlumenalexis
 
Genes and proteins updated
Genes and proteins updatedGenes and proteins updated
Genes and proteins updatedLumen Learning
 
Biol102 chp17-pp-spr10-100508132228-phpapp02
Biol102 chp17-pp-spr10-100508132228-phpapp02Biol102 chp17-pp-spr10-100508132228-phpapp02
Biol102 chp17-pp-spr10-100508132228-phpapp02Cleophas Rwemera
 
Biol102 chp17-pp-spr10-100508132228-phpapp02
Biol102 chp17-pp-spr10-100508132228-phpapp02Biol102 chp17-pp-spr10-100508132228-phpapp02
Biol102 chp17-pp-spr10-100508132228-phpapp02Cleophas Rwemera
 
Presentation Protein Synthesis
Presentation Protein SynthesisPresentation Protein Synthesis
Presentation Protein Synthesisangelsalaman
 
111 ch 15 online.pptx
111 ch 15 online.pptx111 ch 15 online.pptx
111 ch 15 online.pptxSondiHoffman1
 
Dna chapter12 13
Dna chapter12 13Dna chapter12 13
Dna chapter12 13Oliver Pham
 
Basics of Molecular Biology.ppt
Basics of Molecular Biology.pptBasics of Molecular Biology.ppt
Basics of Molecular Biology.ppteman badr
 
Mira_Nucleotides_NucleicA_Chemistry_Metabolism.pptx
Mira_Nucleotides_NucleicA_Chemistry_Metabolism.pptxMira_Nucleotides_NucleicA_Chemistry_Metabolism.pptx
Mira_Nucleotides_NucleicA_Chemistry_Metabolism.pptxMira426412
 
BACTERIAL GENETICS-1.pptx
BACTERIAL GENETICS-1.pptxBACTERIAL GENETICS-1.pptx
BACTERIAL GENETICS-1.pptxHeloEng
 
Polymerase Chain Reaction
Polymerase Chain ReactionPolymerase Chain Reaction
Polymerase Chain ReactionMehulJain143
 
Unit III (Nucleic acid and genetic code).ppt
Unit III (Nucleic acid and genetic code).pptUnit III (Nucleic acid and genetic code).ppt
Unit III (Nucleic acid and genetic code).pptDrJoginderSingh2
 

Similar to BioPrimer.ppt (20)

Nucleic Acid Chemistry Of Purine and Pyrimidine Ribonucleotide Part I MD1 By ...
Nucleic Acid Chemistry Of Purine and Pyrimidine Ribonucleotide Part I MD1 By ...Nucleic Acid Chemistry Of Purine and Pyrimidine Ribonucleotide Part I MD1 By ...
Nucleic Acid Chemistry Of Purine and Pyrimidine Ribonucleotide Part I MD1 By ...
 
104 Genetics and cellular functionLearning Objective.docx
104 Genetics and cellular functionLearning Objective.docx104 Genetics and cellular functionLearning Objective.docx
104 Genetics and cellular functionLearning Objective.docx
 
Genes and proteins updated
Genes and proteins updatedGenes and proteins updated
Genes and proteins updated
 
Genes and proteins updated
Genes and proteins updatedGenes and proteins updated
Genes and proteins updated
 
Biol102 chp17-pp-spr10-100508132228-phpapp02
Biol102 chp17-pp-spr10-100508132228-phpapp02Biol102 chp17-pp-spr10-100508132228-phpapp02
Biol102 chp17-pp-spr10-100508132228-phpapp02
 
Biol102 chp17-pp-spr10-100508132228-phpapp02
Biol102 chp17-pp-spr10-100508132228-phpapp02Biol102 chp17-pp-spr10-100508132228-phpapp02
Biol102 chp17-pp-spr10-100508132228-phpapp02
 
Presentation Protein Synthesis
Presentation Protein SynthesisPresentation Protein Synthesis
Presentation Protein Synthesis
 
111 ch 15 online.pptx
111 ch 15 online.pptx111 ch 15 online.pptx
111 ch 15 online.pptx
 
Dna chapter12 13
Dna chapter12 13Dna chapter12 13
Dna chapter12 13
 
Basics of Molecular Biology.ppt
Basics of Molecular Biology.pptBasics of Molecular Biology.ppt
Basics of Molecular Biology.ppt
 
Microbial genetics441
Microbial genetics441Microbial genetics441
Microbial genetics441
 
Mira_Nucleotides_NucleicA_Chemistry_Metabolism.pptx
Mira_Nucleotides_NucleicA_Chemistry_Metabolism.pptxMira_Nucleotides_NucleicA_Chemistry_Metabolism.pptx
Mira_Nucleotides_NucleicA_Chemistry_Metabolism.pptx
 
BACTERIAL GENETICS-1.pptx
BACTERIAL GENETICS-1.pptxBACTERIAL GENETICS-1.pptx
BACTERIAL GENETICS-1.pptx
 
BASICS OF MOLECULAR BIOLOGY
BASICS OF MOLECULAR BIOLOGYBASICS OF MOLECULAR BIOLOGY
BASICS OF MOLECULAR BIOLOGY
 
Lecture 8
Lecture 8Lecture 8
Lecture 8
 
Lecture 1.pptx
Lecture 1.pptxLecture 1.pptx
Lecture 1.pptx
 
Polymerase Chain Reaction
Polymerase Chain ReactionPolymerase Chain Reaction
Polymerase Chain Reaction
 
Lecture 4.ppt
Lecture 4.pptLecture 4.ppt
Lecture 4.ppt
 
Unit III (Nucleic acid and genetic code).ppt
Unit III (Nucleic acid and genetic code).pptUnit III (Nucleic acid and genetic code).ppt
Unit III (Nucleic acid and genetic code).ppt
 
Genetic Code and Translation.pdf
Genetic Code and Translation.pdfGenetic Code and Translation.pdf
Genetic Code and Translation.pdf
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 

Recently uploaded (20)

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 

BioPrimer.ppt

  • 2. Genetic Material • DNA (deoxyribonucleic acid) is the genetic material • Information stored in DNA – the basis of inheritance – distinguishes living things from nonliving things • Genes – various units that govern living thing’s characteristics at the genetic level 2
  • 3. Nucleotides • Genes themselves contain their information as a specific sequence of nucleotides found in DNA molecules • Only four different bases in DNA molecules – Guanine (G) – Adenine (A) – Thymine (T) – Cytosine (C) • Each base is attached to a phosphate group and a deoxyribose sugar to form a nucleotide. • The only thing that makes one nucleotide different from another is which nitrogenous base it contains Sugar P Base 3
  • 5. Nucleotides • Complicated genes can be many thousands of nucleotides long • All of an organism’s genetic instructions, its genome, can be maintained in millions or even billions of nucleotides 5
  • 6. Orientation • Strings of nucleotides can be attached to each other to make long polynucleotide chains • 5’ (5 prime) end – The end of a string of nucleotides with a 5' carbon not attached to another nucleotide • 3’ (3 prime) end – The other end of the molecule with an unattached 3' carbon 6
  • 8. Base Pairing • Structure of DNA – Double helix – Seminal paper by Watson and Crick in 1953 – Rosalind Franklin’s contribution • Information content on one of those strands essentially redundant with the information on the other – Not exactly the same—it is complementary • Base pair – G paired with C (G  C) – A paired with T (A = T) 8
  • 9. 9
  • 10. Base Pairing • Reverse complements – 5' end of one strand corresponding to the 3' end of its complementary strand and vice versa • Example – one strand: 5'-GTATCC-3' the other strand: 3'-CATAGG-5'  5'-GGATAC-3' • Upstream: Sequence features that are 5' to a particular reference point • Downstream: Sequence features that are 3' to a particular reference point 5' 3' Upstream Downstream 10
  • 13. Chromosome • Threadlike "packages" of genes and other DNA in the nucleus of a cell 13
  • 14. 14
  • 15. Chromosome • Different kinds of organisms have different numbers of chromosomes • Humans – 23 pairs – 46 in all 15
  • 16. Central Dogma of Molecular Biology • DNA: information storage • Protein: function unit, such as enzyme • Gene: instructions needed to make protein • Central dogma 16
  • 17. Central Dogma of Molecular Biology • Central dogma reverse transcription (reverse transcriptase) replication (DNA polymerase) • DNA obtained from reverse transcription is called complementary DNA (cDNA)  Difference between DNA and cDNA will be discussed later 17
  • 18. Central Dogma of Molecular Biology • RNA (ribonucleic acid) – Single-stranded polynucleotide – Bases • A • G • C • U (uracil), instead of T • Transcription (simplified …) – A  A, G G, C  C, T  U Sugar P Base Sugar P Base H OH DNA RNA 18
  • 19. 19
  • 20. 20
  • 21. DNA Replication (DNA  DNA) 21
  • 22. DNA Replication (DNA  DNA) 22
  • 23. DNA Replication Animation Courtesy of Rob Rutherford, St. Olaf University 23
  • 24. Transcription (DNA  RNA) • Messenger RNA (mRNA) – carries information to be translated • Ribosomal RNA (rRNA) – the working “spine” of the ribosome • Transfer RNA (tRNA) – the “decoder keys” that will translate nucleic acids to amino acids 24
  • 25. Transcription Animation Courtesy of Rob Rutherford, St. Olaf University 25
  • 26. Peptides and Proteins • mRNA  Sequence of amino acids connected by peptide bond • Amino acid sequence – Peptide: < 30 – 50 amino acids – Protein: longer peptide 26
  • 27. 27
  • 28. 28
  • 29. 29 Genetic Code – Codon Stop codons Start codon Codon: 3-base RNA sequence
  • 30. List of Amino Acids Amino acid Symbol Codon A Alanine Ala GC* C Cysteine Cys UGU, UGC D Aspartic Acid Asp GAU, GAC E Glutamic Acid Glu GAA, GAG F Phenylalanine Phe UUU, UUC G Glycine Gly GG* H Histidine His CAU, CAC I Isoleucine Ile AUU, AUC, AUA K Lysine Lys AAA, AAG L Leucine Leu UUA, UUG, CU* 30
  • 31. List of Amino Acids Amino acid Symbol Codon M Methionine Met AUG N Asparagine Asn AAU, AAC P Proline Pro CC* Q Glutamine Gln CAA, CAG R Arginine Arg CG*, AGA, AGG S Serine Ser UC*, AGU, AGC T Threonine Thr AC* V Valine Val GU* W Tryptophan Trp UGG Y Tyrosine Tyr UAU, UAC 20 letters, no B J O U X Z 31
  • 32. Codon and Reading Frame • 4 AA letters  43 = 64 triplet possibilities • 20 (< 64) known amino acids • Wobbling 3rd base • Redundant  Resistant to mutation • Reading frame: linear sequence of codons in a gene • Open Reading Frame (ORF), definition varies: – a reading frame that begins with a start codon and end at a stop codon – a series of codons in a DNA sequence uninterrupted by the presence of a stop codon  a potential protein-coding region of DNA sequence 32
  • 33. Open Reading Frame • Given a nucleotide sequence – How many reading frames? __ • __ forward and __ backward • Example: Given a DNA sequence, 5’-ATGACCGTGGGCTCTTAA-3’ – ATG ACC GTG GGC TCT TAA  M T V G S * – TGA CCG TGG GCT CTT AA  * P W A L – GAC CGT GGG CTC TTA A  D R G L L – Figure out the three backward reading frames • In random sequence, a stop codon will follow a Met in ~20 AAs • Substantially longer ORFs are often genes or parts of them 33
  • 34. Translation (RNA  Protein) 34
  • 35. Translation Animation Courtesy of Rob Rutherford, St. Olaf University 35
  • 36. Gene Expression • Gene expression – Process of using the information stored in DNA to make an RNA molecule and then a corresponding protein • Cells controlling gene expression by – reliably distinguishing between those parts of an organism’s genome that correspond to the beginnings of genes and those that do not – determining which genes code for proteins that are needed at any particular time. 36
  • 37. Promoter • The probability (P) that a string of nucleotides will occur by chance alone if all nucleotides are present at the same frequency P = (1/4)n, where n is the string’s length • Promoter sequences – Sequences recognized by RNA polymerases as being associated with a gene • Example – Prokaryotic RNA polymerases scan along DNA looking for a specific set of approximately 13 nucleotides marking the beginning of genes – 1 nucleotide that serves as a transcriptional start site – 6 that are 10 nucleotides 5' to the start site, and – 6 more that are 35 nucleotides 5' to the start site – What is the frequency for the sequence to occur? 37
  • 38. Gene Regulation • Regulatory proteins – Capable of binding to a cell’s DNA near the promoter of the genes – Control gene expression in some circumstances but not in others • Positive regulation – binding of regulatory proteins makes it easier for an RNA polymerase to initiate transcription • Negative regulation – binding of the regulatory proteins prevents transcription from occurring 38
  • 39. Promoter and Regulatory Example • Low tryptophan concentration  RNA polymerase binds to promoter  genes transcribed • High tryptophan concentration  repressor protein becomes active and binds to operator  blocks the binding of RNA polymerase to the promoter • Tryptophan concentration drops  repressor releases its tryptophan and is released from DNA  polymerase again transcribes genes 39
  • 42. Exons and Introns Example 42
  • 43. Protein Structure and Function • Genes encode the recipes for proteins 43
  • 44. 44 Protein Structure and Function • Proteins are amino acid polymers
  • 45. Proteins: Molecular Machines  Proteins in your muscles allows you to move: myosin and actin 45
  • 46. Proteins: Molecular Machines  Digestion, catalysis (enzymes)  Structure (collagen) 46
  • 47. Proteins: Molecular Machines  Signaling (hormones, kinases)  Transport (energy, oxygen) 47
  • 49. Information Flow in Nucleated Cell 49
  • 50. Point Mutation Example: Sickle-cell Disease • Wild-type hemoglobin DNA 3’----CTT----5’ mRNA 5’----GAA----3’ Normal hemoglobin ------[Glu]------ • Mutant hemoglobin DNA 3’----CAT----5’ mRNA 5’----GUA----3’ Mutant hemoglobin ------[Val]------ 50
  • 51. image credit: U.S. Department of Energy Human Genome Program, http://www.ornl.gov/hgmis. 51
  • 52. 50% is high copy number repeats About 10% is transcribed (made into RNA) Only 1.5% actually codes for protein 98.5% Junk DNA Thinking about the Human Genome 52
  • 53. Thinking about the Human Genome ~ 3 X 109 bps (3 billion base pairs) If each base were one mm long… 2000 miles, across the center of Africa Average gene about 30 meters long Occur about every 270 meters between them Once spliced the message would only be ~1 meter long 53