SlideShare a Scribd company logo
BY:-
BY:-
KARAMVEER
M.Sc. LIFE SCIENCES WITH SPECIALISATION BIOINFORMATICS
(2015-17)
WEL-COME
 From a genomic DNA sequence we want to predict the regions that
will encode for a protein: the genes.
• Gene finding is about detecting these coding regions and infer the
gene structure starting from genomic DNA sequences.
 an open reading frame (ORF) is the part of a reading frame that has
the potential to code for a protein or peptide. An ORF is a continuous
stretch of codons that do not contain a stop codon
• We need to distinguish coding from non-coding regions using
properties specific to each type of DNA region.
• Gene finding is not an easy task!
• DNA sequence signals have low information content.
• It is difficult to discriminate real signals from noise (degenerated and
highly unspecific signals);
 Simple 1st step in gene findings.
 Translate genomic sequence in six frames.
 Identify stop codon in each frame.
 Regions without stop codons are called “open reading frames”
or ORFs.
 Locate and tag all of the likely ORFs in a sequence.
 The longest ORF from a methionine codon is a good prediction
of a protein encoding sequence.
 The ORF finder is a graphical analysis tool which finds all
open reading of a selectable minimum size in a user’s sequence
or in a sequence already in the database.
 This tool identifies all open reading frames using the standard
genetic codes.
 The deduced amino acid sequence can be saved in various
format and searched against the sequence database using the
blast server.
 The orf finder should be helpful in preparing complete and
accurate sequence.
 Based on sequence similarity of query sequence with
annotated genes present in databases.
 Given a database of sequences of other organism.
 Search for query sequence in this database .
 Identify database sequence (known genes) that resemble the
query sequence.
 If the identified sequences are genes , the query sequence is
probably (putatively) a gene.
 Basic local alignment search tool.
 Well known search tool in this category.
Strengths:-
 able to identify biologically relevant genes.
 Accuracy
weakness:-
 Could not identify genes that code for protein , not present in
database.
 Only 50% genes can be found by homology to other known
genes or proteins.
 Uses HMMs to compare DNA sequences to protein
sequences at the level of its conceptual translation,
regardless of sequencing errors and introns.
• Principle:
• The exon model used in genewise is a HMM with 3
base states (match, insert, delete) with the addition of
more transitions between states to consider frame-
shifts.
• Intron states have been added to the base model.
• Genewise directly compare HMM-profiles of proteins
or domains to the gene structure HMM model.
• Genewise is a powerful tool, but time consuming.
• Requires strong similarities (>70% identity) to produce
good predictions.
• Genewise is part of the Wise2 package:
 Computational prediction that use most elementary
information.
 Can predict both eukaryotic and prokaryotic genes.
 Predict genes based on the given sequence alone.
 It works on two major features associated with genes:-
1. Gene signals
2. Gene content
• Hidden Markov Models (HMMs):-
• HMMs use a probabilistic framework to infer the probability
that a sequence correspond to a real signal.
• Neural Networks (NNs):
• NNs are trained with positive and negative examples. NNs
”discover” the features that distinguish the two sets.
• . The gene structure information is separated into several
classes of features such as hexamer frequencies, splice sites,
and GC composition.
 Example: NN for acceptor sites, the perceptron, (Horton and
Kanehisa, 1992)
 Neural network recognizing coding potential
• Incorporates genomic context information (splice junctions,
start and stop codons , poly-A signals)
• Not appropriate for sequences without genomic context
• http://compbio.ornl.gov
• Human, Mouse, Drosophila, Arabidopsis, and E. coli
 accuracy of a prediction program can be evaluated using
parameters such as sensitivity and specificity.
 To describe the concept of sensitivity and specificity accurately,
four features are used:-
 true positive (TP), which is a correctly predicted feature; false
positive (FP), which is an incorrectly predicted feature; false
negative (FN), which is a missed feature; and true negative
(TN), which is the correctly predicted absence of a feature.
Conclusion
 https://www.google.co.in/search?q=gene+components&biw=1366&bih=623
&source=lnms&tbm=isch&sa=X&sqi=2&ved=0CAYQ_AUoAWoVChMIld-
fy7_4yAIVwh-
UCh1dfwEb#tbm=isch&q=rbs+in+prokaryotic+gene&imgrc=p4VQkhXIIG
_DsM%3A.
 http://www.aun.edu.eg/molecular_biology/Procedure%20Bioinformatics22.
23-4-2015/Xiong%20-
%20Essential%20Bioinformatics%20send%20by%20Amira.pdf.
prediction methods for ORF

More Related Content

What's hot

DNA footprinting
DNA footprintingDNA footprinting
DNA footprinting
Saajida Sultaana
 
Microsatellite
MicrosatelliteMicrosatellite
Microsatellite
subramaniam sethupathy
 
Sts
StsSts
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
Vidya Kalaivani Rajkumar
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
Pranavathiyani G
 
Genetic and Physical map of Genome
Genetic and Physical map of GenomeGenetic and Physical map of Genome
Genetic and Physical map of Genome
KAUSHAL SAHU
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
Sabahat Ali
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
The Oxford College Engineering
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
talhakhat
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
ALLIENU
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
Athira RG
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
Karan Veer Singh
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
Swathi Prabakar
 
Scop database
Scop databaseScop database
Scop database
Sayantani Roy
 
BLAST
BLASTBLAST
Physical mapping
Physical mappingPhysical mapping
Physical mapping
Priya Trivedi
 
Map based cloning
Map based cloning Map based cloning
Map based cloning
PREETHYDAVID
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
ajay301
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
Dhananjay Desai
 

What's hot (20)

DNA footprinting
DNA footprintingDNA footprinting
DNA footprinting
 
Microsatellite
MicrosatelliteMicrosatellite
Microsatellite
 
Sts
StsSts
Sts
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Genetic and Physical map of Genome
Genetic and Physical map of GenomeGenetic and Physical map of Genome
Genetic and Physical map of Genome
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Scop database
Scop databaseScop database
Scop database
 
BLAST
BLASTBLAST
BLAST
 
Physical mapping
Physical mappingPhysical mapping
Physical mapping
 
Map based cloning
Map based cloning Map based cloning
Map based cloning
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
 

Viewers also liked

Gene Prediction Using Hidden Markov Model and Recurrent Neural Network
Gene Prediction Using Hidden Markov Model and Recurrent Neural NetworkGene Prediction Using Hidden Markov Model and Recurrent Neural Network
Gene Prediction Using Hidden Markov Model and Recurrent Neural NetworkAhmed Hani Ibrahim
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
Vijay Hemmadi
 
Rice Emboss Bosc2009
Rice Emboss Bosc2009Rice Emboss Bosc2009
Rice Emboss Bosc2009bosc
 
Free software and bioinformatics
Free software and bioinformaticsFree software and bioinformatics
Free software and bioinformatics
Alberto Labarga
 
Constraints and Global Optimization for Gene Prediction Overlap Resolution
Constraints and Global Optimization for Gene Prediction Overlap ResolutionConstraints and Global Optimization for Gene Prediction Overlap Resolution
Constraints and Global Optimization for Gene Prediction Overlap Resolution
Christian Have
 
Lit01316 emboss tools
Lit01316 emboss toolsLit01316 emboss tools
Lit01316 emboss tools
Mate Precision Tooling
 
Hmm
Hmm Hmm
Session ii g1 overview genomics and gene expression mmc-good
Session ii g1 overview genomics and gene expression mmc-goodSession ii g1 overview genomics and gene expression mmc-good
Session ii g1 overview genomics and gene expression mmc-goodUSD Bioinformatics
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
prateek kumar
 
Rnaseq forgenefinding
Rnaseq forgenefindingRnaseq forgenefinding
Rnaseq forgenefinding
Sucheta Tripathy
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNAUlises Urzua
 
Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarray
prateek kumar
 
Translating Cancer Genomes and Transcriptomes for Precision Oncology
Translating Cancer Genomes and Transcriptomes for Precision Oncology Translating Cancer Genomes and Transcriptomes for Precision Oncology
Translating Cancer Genomes and Transcriptomes for Precision Oncology
Wafaa Mowlabaccus
 
B.sc biochem i bobi u 4 gene prediction
B.sc biochem i bobi u 4 gene predictionB.sc biochem i bobi u 4 gene prediction
B.sc biochem i bobi u 4 gene prediction
Rai University
 
DNA Fingerprinting
DNA FingerprintingDNA Fingerprinting
DNA Fingerprinting
biomedicz
 

Viewers also liked (20)

Gene Prediction Using Hidden Markov Model and Recurrent Neural Network
Gene Prediction Using Hidden Markov Model and Recurrent Neural NetworkGene Prediction Using Hidden Markov Model and Recurrent Neural Network
Gene Prediction Using Hidden Markov Model and Recurrent Neural Network
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Rice Emboss Bosc2009
Rice Emboss Bosc2009Rice Emboss Bosc2009
Rice Emboss Bosc2009
 
Emboss
EmbossEmboss
Emboss
 
Emboss
EmbossEmboss
Emboss
 
Emboss
EmbossEmboss
Emboss
 
Free software and bioinformatics
Free software and bioinformaticsFree software and bioinformatics
Free software and bioinformatics
 
Constraints and Global Optimization for Gene Prediction Overlap Resolution
Constraints and Global Optimization for Gene Prediction Overlap ResolutionConstraints and Global Optimization for Gene Prediction Overlap Resolution
Constraints and Global Optimization for Gene Prediction Overlap Resolution
 
Lit01316 emboss tools
Lit01316 emboss toolsLit01316 emboss tools
Lit01316 emboss tools
 
Emboss
EmbossEmboss
Emboss
 
Hmm
Hmm Hmm
Hmm
 
Session ii g1 overview genomics and gene expression mmc-good
Session ii g1 overview genomics and gene expression mmc-goodSession ii g1 overview genomics and gene expression mmc-good
Session ii g1 overview genomics and gene expression mmc-good
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
 
Rnaseq forgenefinding
Rnaseq forgenefindingRnaseq forgenefinding
Rnaseq forgenefinding
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
 
Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarray
 
Dna microarray application in vp research mehran
Dna microarray application in vp research  mehranDna microarray application in vp research  mehran
Dna microarray application in vp research mehran
 
Translating Cancer Genomes and Transcriptomes for Precision Oncology
Translating Cancer Genomes and Transcriptomes for Precision Oncology Translating Cancer Genomes and Transcriptomes for Precision Oncology
Translating Cancer Genomes and Transcriptomes for Precision Oncology
 
B.sc biochem i bobi u 4 gene prediction
B.sc biochem i bobi u 4 gene predictionB.sc biochem i bobi u 4 gene prediction
B.sc biochem i bobi u 4 gene prediction
 
DNA Fingerprinting
DNA FingerprintingDNA Fingerprinting
DNA Fingerprinting
 

Similar to prediction methods for ORF

Genome analysis2
Genome analysis2Genome analysis2
Structural annotation................pptx
Structural annotation................pptxStructural annotation................pptx
Structural annotation................pptx
Cherry
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
ChijiokeNsofor
 
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Prof. Wim Van Criekinge
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
Shifa Ansari
 
genomeannotation-160822182432.pdf
genomeannotation-160822182432.pdfgenomeannotation-160822182432.pdf
genomeannotation-160822182432.pdf
VidyasriDharmalingam1
 
Gene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptxGene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptx
University of Petroleum and Energy studies
 
artificial neural network-gene prediction
artificial neural network-gene predictionartificial neural network-gene prediction
artificial neural network-gene prediction
alagappa university karaikudi
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07
Monica Munoz-Torres
 
Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014
Prof. Wim Van Criekinge
 
Introduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisIntroduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinis
Monica Munoz-Torres
 
Bioinformatics.Practical Notebook
Bioinformatics.Practical NotebookBioinformatics.Practical Notebook
Bioinformatics.Practical NotebookNaima Tahsin
 
Lecture 4.ppt
Lecture 4.pptLecture 4.ppt
Lecture 4.ppt
khadijarafique14
 
Finding genes
Finding genesFinding genes
Finding genes
Sabahat Ali
 
gene prediction programs
gene prediction programsgene prediction programs
gene prediction programs
MugdhaSharma11
 
Gene prediction strategies
Gene prediction strategies Gene prediction strategies
Gene prediction strategies
Amity university, Noida
 
Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomics
Jalormi Parekh
 
Introduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research CommunityIntroduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research Community
Monica Munoz-Torres
 
Micro RNA.ppt
Micro RNA.pptMicro RNA.ppt
Micro RNA.ppt
Pudhuvai Baveesh
 

Similar to prediction methods for ORF (20)

Genome analysis2
Genome analysis2Genome analysis2
Genome analysis2
 
Structural annotation................pptx
Structural annotation................pptxStructural annotation................pptx
Structural annotation................pptx
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
 
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
genomeannotation-160822182432.pdf
genomeannotation-160822182432.pdfgenomeannotation-160822182432.pdf
genomeannotation-160822182432.pdf
 
Gene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptxGene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptx
 
artificial neural network-gene prediction
artificial neural network-gene predictionartificial neural network-gene prediction
artificial neural network-gene prediction
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07
 
Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014
 
Bioinformatica t8-go-hmm
Bioinformatica t8-go-hmmBioinformatica t8-go-hmm
Bioinformatica t8-go-hmm
 
Introduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisIntroduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinis
 
Bioinformatics.Practical Notebook
Bioinformatics.Practical NotebookBioinformatics.Practical Notebook
Bioinformatics.Practical Notebook
 
Lecture 4.ppt
Lecture 4.pptLecture 4.ppt
Lecture 4.ppt
 
Finding genes
Finding genesFinding genes
Finding genes
 
gene prediction programs
gene prediction programsgene prediction programs
gene prediction programs
 
Gene prediction strategies
Gene prediction strategies Gene prediction strategies
Gene prediction strategies
 
Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomics
 
Introduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research CommunityIntroduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research Community
 
Micro RNA.ppt
Micro RNA.pptMicro RNA.ppt
Micro RNA.ppt
 

Recently uploaded

erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
ronaldlakony0
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Studia Poinsotiana
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
alishadewangan1
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 

Recently uploaded (20)

erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
S.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary levelS.1 chemistry scheme term 2 for ordinary level
S.1 chemistry scheme term 2 for ordinary level
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 

prediction methods for ORF

  • 1. BY:- BY:- KARAMVEER M.Sc. LIFE SCIENCES WITH SPECIALISATION BIOINFORMATICS (2015-17) WEL-COME
  • 2.  From a genomic DNA sequence we want to predict the regions that will encode for a protein: the genes. • Gene finding is about detecting these coding regions and infer the gene structure starting from genomic DNA sequences.  an open reading frame (ORF) is the part of a reading frame that has the potential to code for a protein or peptide. An ORF is a continuous stretch of codons that do not contain a stop codon • We need to distinguish coding from non-coding regions using properties specific to each type of DNA region. • Gene finding is not an easy task! • DNA sequence signals have low information content. • It is difficult to discriminate real signals from noise (degenerated and highly unspecific signals);
  • 3.
  • 4.  Simple 1st step in gene findings.  Translate genomic sequence in six frames.  Identify stop codon in each frame.  Regions without stop codons are called “open reading frames” or ORFs.  Locate and tag all of the likely ORFs in a sequence.  The longest ORF from a methionine codon is a good prediction of a protein encoding sequence.
  • 5.  The ORF finder is a graphical analysis tool which finds all open reading of a selectable minimum size in a user’s sequence or in a sequence already in the database.  This tool identifies all open reading frames using the standard genetic codes.  The deduced amino acid sequence can be saved in various format and searched against the sequence database using the blast server.  The orf finder should be helpful in preparing complete and accurate sequence.
  • 6.
  • 7.
  • 8.  Based on sequence similarity of query sequence with annotated genes present in databases.  Given a database of sequences of other organism.  Search for query sequence in this database .  Identify database sequence (known genes) that resemble the query sequence.  If the identified sequences are genes , the query sequence is probably (putatively) a gene.
  • 9.  Basic local alignment search tool.  Well known search tool in this category. Strengths:-  able to identify biologically relevant genes.  Accuracy weakness:-  Could not identify genes that code for protein , not present in database.  Only 50% genes can be found by homology to other known genes or proteins.
  • 10.  Uses HMMs to compare DNA sequences to protein sequences at the level of its conceptual translation, regardless of sequencing errors and introns. • Principle: • The exon model used in genewise is a HMM with 3 base states (match, insert, delete) with the addition of more transitions between states to consider frame- shifts. • Intron states have been added to the base model. • Genewise directly compare HMM-profiles of proteins or domains to the gene structure HMM model. • Genewise is a powerful tool, but time consuming. • Requires strong similarities (>70% identity) to produce good predictions. • Genewise is part of the Wise2 package:
  • 11.  Computational prediction that use most elementary information.  Can predict both eukaryotic and prokaryotic genes.  Predict genes based on the given sequence alone.  It works on two major features associated with genes:- 1. Gene signals 2. Gene content
  • 12.
  • 13.
  • 14. • Hidden Markov Models (HMMs):- • HMMs use a probabilistic framework to infer the probability that a sequence correspond to a real signal. • Neural Networks (NNs): • NNs are trained with positive and negative examples. NNs ”discover” the features that distinguish the two sets. • . The gene structure information is separated into several classes of features such as hexamer frequencies, splice sites, and GC composition.  Example: NN for acceptor sites, the perceptron, (Horton and Kanehisa, 1992)
  • 15.  Neural network recognizing coding potential • Incorporates genomic context information (splice junctions, start and stop codons , poly-A signals) • Not appropriate for sequences without genomic context • http://compbio.ornl.gov • Human, Mouse, Drosophila, Arabidopsis, and E. coli
  • 16.  accuracy of a prediction program can be evaluated using parameters such as sensitivity and specificity.  To describe the concept of sensitivity and specificity accurately, four features are used:-  true positive (TP), which is a correctly predicted feature; false positive (FP), which is an incorrectly predicted feature; false negative (FN), which is a missed feature; and true negative (TN), which is the correctly predicted absence of a feature.
  • 17.
  • 18.