SlideShare a Scribd company logo
1 of 6
Lecture 7:Codon Usage analysis and plotting open reading frames
Dr. Naulikha Kituyi
Department of Biological Sciences
University of Embu
Biochemistry-
2021
Codon Usage
SELF-TEST QUESTIONS
•Explain why some genes may not be fully expressed in
certain host organisms during recombinant DNA
technologies.
• State the relevance of ORF finding in bioinformatics
Codon usage refers to the frequency with which a particular
organism uses the available CODONS in genes. The majority of
AMINOACIDS are coded for by more than one codon (see
GENETIC CODE and there are marked preferences for the use
of the alternative codons amongst different species. For
example, in bacteria CCG is the preferred codon for the amino
acid proline, rather than CCU, CCC or CCA. Codon usage can
affect the efficiency of translation, particularly when cloned
heterologous genes are being expressed. If the cloned gene
contains a large number of unfavoured codons the tRNA
molecules of the host may not effectively recognize them, so
reducing translation and hence the amount of protein
synthesized.
Codon usage bias refers to differences in the
frequency of occurrence
of synonymous codons in coding DNA. A
codon is a series of three nucleotides (a
triplet) that encodes a specific amino
acid residue in a polypeptide chain or for the
termination of translation (stop codons).
Illustration of Codon Usage bias
Although codon usage is not the only
barrier to success in heterologous gene
expression, this tool provides a very
easy way to check the compatibility of
your gene and expression system before
you start In this diagram is an illustration
of Codon usage bias in P. patens as
compared to other selected organisms.
Codon usage differs between all genes
and highly expressed genes. Codons
significantly over represented in highly
expressed genes are colored dark
green/marked by a (+) and significantly
under represented codons are colored
light blue/marked by a (–) (p-adjusted
<0.05, Fisher’s exact test).
Open Reading Frames
An open reading frame is a portion of a DNA molecule that,
when translated into amino acids, contains no stop codons.
The genetic code reads DNA sequences in groups of three
base pairs, which means that a double-stranded DNA
molecule can read in any of six possible reading frames--
three in the forward direction and three in the reverse. A long
open reading frame is likely part of a gene.
Target genes can be identified by searching online databases for long
stretches of DNA that could potentially code for protein
 These sequences – called open reading frames (ORF) – will be
preceded by a start codon and uninterrupted by stop codons
 Open reading frames will typically consist of at least 100 codons
(300 nucleotides)
 Searches can be refined by looking at regions downstream of
known promoter sequences and upstream of termination sites
While open reading frames may predict potential coding regions, they
do not automatically guarantee the presence of a gene
 Some long and uninterrupted sequences DNA may not actually be
translated, whilst other short sequences may code protein
Identifying Open Reading Frames
To identify an open reading frame:
 Locate a sequence corresponding to a start codon in order
to determine the reading frame – this will be ATG (sense
strand)
 Read this sequence in base triplets until a stop codon is
reached (TGA, TAG or TAA)
 The longer the sequence, the more significant the
likelihood that the sequence corresponds to an open reading
frame
Certain bioinformatic programs can automatically identify
potential ORFs when provided with a candidate sequence
 Gene sequences are largely conserved – so if an ORF
sequence is present in multiple genomes, it likely
represents a gene
Identification of an Open Reading Frame
The Open Reading Frame Finder
ORF finder searches for open reading frames (ORFs) in the DNA
sequence you enter. The program returns the range of each ORF,
along with its protein translation. Use ORF finder to search newly
sequenced DNA for potential protein encoding segments, verify
predicted protein using newly developed SMART BLAST or regular
BLASTP.
Examples (click to set values, then click Submit button) :
 NC_011604 Salmonella enterica plasmid pWES-1; genetic code:
11; 'ATG' and alternative initiation codons; minimal ORF length: 300
nt
 NM_000059; genetic code: 1; start codon: 'ATG only'; minimal ORF
length: 150 nt
Top of Form
Enter Query Sequence
Enter accession number, gi, or nucleotide sequence in FASTA format:
From: To:
Choose Search Parameters
Use this link to attempt an illustration
https://www.ncbi.nlm.nih.gov/orffinder/B
ottom of Form

More Related Content

Similar to LECTURE 7.pptx

Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination NetworkMonica Munoz-Torres
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityMonica Munoz-Torres
 
Marker devt. workshop 27022012
Marker devt. workshop 27022012Marker devt. workshop 27022012
Marker devt. workshop 27022012Koppolu Ravi
 
Forensic dna typing by John M Butler
Forensic dna typing by John M ButlerForensic dna typing by John M Butler
Forensic dna typing by John M ButlerMuhammad Ahmad
 
Bio305 genome analysis and annotation 2012
Bio305 genome analysis and annotation 2012Bio305 genome analysis and annotation 2012
Bio305 genome analysis and annotation 2012Mark Pallen
 
Codon p referance and factor influences it
Codon p referance and factor influences itCodon p referance and factor influences it
Codon p referance and factor influences itAhmed Madni
 
Central dogma
Central dogmaCentral dogma
Central dogmaneizylah
 

Similar to LECTURE 7.pptx (20)

Gene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptxGene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptx
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination Network
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research community
 
Marker devt. workshop 27022012
Marker devt. workshop 27022012Marker devt. workshop 27022012
Marker devt. workshop 27022012
 
DNA fingerprinting
DNA fingerprintingDNA fingerprinting
DNA fingerprinting
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 
Forensic dna typing by John M Butler
Forensic dna typing by John M ButlerForensic dna typing by John M Butler
Forensic dna typing by John M Butler
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 
Lecture 4.ppt
Lecture 4.pptLecture 4.ppt
Lecture 4.ppt
 
Bio305 genome analysis and annotation 2012
Bio305 genome analysis and annotation 2012Bio305 genome analysis and annotation 2012
Bio305 genome analysis and annotation 2012
 
Yeast Genome
Yeast Genome Yeast Genome
Yeast Genome
 
DNA Library
DNA LibraryDNA Library
DNA Library
 
Recombinant DNA
Recombinant DNARecombinant DNA
Recombinant DNA
 
genomeannotation-160822182432.pdf
genomeannotation-160822182432.pdfgenomeannotation-160822182432.pdf
genomeannotation-160822182432.pdf
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
C value
C value C value
C value
 
Asnmnt 4
Asnmnt 4Asnmnt 4
Asnmnt 4
 
Codon p referance and factor influences it
Codon p referance and factor influences itCodon p referance and factor influences it
Codon p referance and factor influences it
 
Central dogma
Central dogmaCentral dogma
Central dogma
 

Recently uploaded

Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 

LECTURE 7.pptx

  • 1. Lecture 7:Codon Usage analysis and plotting open reading frames Dr. Naulikha Kituyi Department of Biological Sciences University of Embu Biochemistry- 2021
  • 2. Codon Usage SELF-TEST QUESTIONS •Explain why some genes may not be fully expressed in certain host organisms during recombinant DNA technologies. • State the relevance of ORF finding in bioinformatics Codon usage refers to the frequency with which a particular organism uses the available CODONS in genes. The majority of AMINOACIDS are coded for by more than one codon (see GENETIC CODE and there are marked preferences for the use of the alternative codons amongst different species. For example, in bacteria CCG is the preferred codon for the amino acid proline, rather than CCU, CCC or CCA. Codon usage can affect the efficiency of translation, particularly when cloned heterologous genes are being expressed. If the cloned gene contains a large number of unfavoured codons the tRNA molecules of the host may not effectively recognize them, so reducing translation and hence the amount of protein synthesized. Codon usage bias refers to differences in the frequency of occurrence of synonymous codons in coding DNA. A codon is a series of three nucleotides (a triplet) that encodes a specific amino acid residue in a polypeptide chain or for the termination of translation (stop codons).
  • 3. Illustration of Codon Usage bias Although codon usage is not the only barrier to success in heterologous gene expression, this tool provides a very easy way to check the compatibility of your gene and expression system before you start In this diagram is an illustration of Codon usage bias in P. patens as compared to other selected organisms. Codon usage differs between all genes and highly expressed genes. Codons significantly over represented in highly expressed genes are colored dark green/marked by a (+) and significantly under represented codons are colored light blue/marked by a (–) (p-adjusted <0.05, Fisher’s exact test).
  • 4. Open Reading Frames An open reading frame is a portion of a DNA molecule that, when translated into amino acids, contains no stop codons. The genetic code reads DNA sequences in groups of three base pairs, which means that a double-stranded DNA molecule can read in any of six possible reading frames-- three in the forward direction and three in the reverse. A long open reading frame is likely part of a gene. Target genes can be identified by searching online databases for long stretches of DNA that could potentially code for protein  These sequences – called open reading frames (ORF) – will be preceded by a start codon and uninterrupted by stop codons  Open reading frames will typically consist of at least 100 codons (300 nucleotides)  Searches can be refined by looking at regions downstream of known promoter sequences and upstream of termination sites While open reading frames may predict potential coding regions, they do not automatically guarantee the presence of a gene  Some long and uninterrupted sequences DNA may not actually be translated, whilst other short sequences may code protein
  • 5. Identifying Open Reading Frames To identify an open reading frame:  Locate a sequence corresponding to a start codon in order to determine the reading frame – this will be ATG (sense strand)  Read this sequence in base triplets until a stop codon is reached (TGA, TAG or TAA)  The longer the sequence, the more significant the likelihood that the sequence corresponds to an open reading frame Certain bioinformatic programs can automatically identify potential ORFs when provided with a candidate sequence  Gene sequences are largely conserved – so if an ORF sequence is present in multiple genomes, it likely represents a gene Identification of an Open Reading Frame
  • 6. The Open Reading Frame Finder ORF finder searches for open reading frames (ORFs) in the DNA sequence you enter. The program returns the range of each ORF, along with its protein translation. Use ORF finder to search newly sequenced DNA for potential protein encoding segments, verify predicted protein using newly developed SMART BLAST or regular BLASTP. Examples (click to set values, then click Submit button) :  NC_011604 Salmonella enterica plasmid pWES-1; genetic code: 11; 'ATG' and alternative initiation codons; minimal ORF length: 300 nt  NM_000059; genetic code: 1; start codon: 'ATG only'; minimal ORF length: 150 nt Top of Form Enter Query Sequence Enter accession number, gi, or nucleotide sequence in FASTA format: From: To: Choose Search Parameters Use this link to attempt an illustration https://www.ncbi.nlm.nih.gov/orffinder/B ottom of Form