SlideShare a Scribd company logo
1 of 38
Bioinformatics
Ranjan Jyoti Sarma
M.Sc, M.Phil.
Nucleotide Databases:
• Repository of all nucleic acid sequences of major
important Organisms.
• Major Public Nucleotide databases: GeneBank, DDBJ and
ENA (EMBL-EBI).
• Exchange of data through International Nucleotide
Sequence Database Collaboration (INSDC).
Other specialised Databases:
• EST
Expressed Sequence Tags are the cDNA sequence (Highly expressed mRNA of
portion of Gene) and less than 1000 bp: Used for gene discovery, construction
of gene models, alternative splicing prediction, genome annotation, expression
profiling, and comparative genomics.
In the helminth field, ESTs have extensive application in the discovery of new
genes and identification of novel vaccine candidates and drug targets.
• dbSNP:
dbSNP contains human single nucleotide variations, microsatellites, and small-
scale insertions and deletions along with publication, population frequency,
molecular consequence. Also genomic and RefSeq (Reference Sequence)
mapping information for both common variations and clinical mutations.
A dense catalog of SNPs is expected to facilitate large-scale studies in
association genetics, functional and pharmaco-genomics, population genetics
and evolutionary biology, and positional cloning and physical mapping.
Protein Databases
• Database of the protein sequences, and the 3D structural data produced by X-
ray crystallography and macromolecular NMR.
• PDB: A protein structure database contain structures of Proteins solved using
X-ray crystallography, NMR and electron Microscopy.
• SWISS-PROT: A well curated proteins sequence database also provides a high
level of annotation.
• Protein Information Resource (PIR) – Protein Sequence Database (PIR-PSD):
The sequence in PIR-PSD is also classified based on homology domain and
sequence motifs.
• TrEMBL: Computer-annotated protein sequence database that is released as a
supplement to SWISS-PROT. It contains the translation of all coding sequences
present in the EMBL Nucleotide database, which have not been fully
annotated.
• Pfam: Pfam is a database of protein families that includes their annotations
and multiple sequence alignments generated using hidden Markov models.
• ClinVar
 Stores variants with Clinical interpretation.
Defining Sequence Analysis:
• Sequence Analysis is the process of subjecting a DNA, RNA or
peptide sequence to any of a wide range of analytical methods to
understand its features and functions.
• It includes:
Sequencing and Sequence assembly.
Alignment and Database Searching.
7
Genome Browsers:
• What do you mean by complete genome? Answer will be complete DNA
sequences.
• If so, then How to view it interactively?
• Genome browser enables it to explore the complete genome or the region of the
genome of your interest.
• Genome browser provides a graphical interface for users to browse, search,
retrieve and analyze genomic sequence and annotation data.
• Some Genome Browsers: Web Based NCBI Genome Viewer, Ensemble, UCSC and
Standalone IGV (Integrative Genome Browser), IGB (Integrated Genome
Browser) and ArrayGene Genome Browser (Commercial Genome Browser
developed by ArrayGen Technologies Pvt. Ltd. , Pune, India)
Gene name or Chromosome Location
Current Genome Assembly
Organism List
Ensemble Genome Browser for Vertebrates:
Ensemble Genome Browser for Plants:
Ensemble Genome Browser for Prokaryotes:
• DNA sequencing is the process of determining the precise
order of nucleotides or order of the four A,T,G and C in DNA
Strand.
• Methods:
• Sanger Sequencing or Chain T
ermination Method.
• Next Generation Sequencing.
15
Fluorescence-Based Sequencing
I
N
D
I
V
I
D
U
A
L
SEP
ER
ATE
D
NU
CL
E
OTI
D
E
SE
QU
EN
CE
S
P
E
A
K
S
4
• Tools for Viewing Sanger Sequencing Data:
17
• QV10: 10% or 1/10 Chance that the base call is incorrect.
• QV20: 1% or 1/100 Chance that the base call is incorrect
• QV30: 0.1% or 1/1000 Chance that the base call is incorrect
• QV40: 0.01% or 1/10000 Chance that the base call is incorrect
• QV50: 0.001% or 1/100000 Chance that the base call is incorrect.
Quality Value: -10 log10 Pe
18
SIMILARITY SEARCH
Finch TV
19
BLAST
20
21
BLAST RESULT
ANNOTATION
• What do you mean by feature annotation and why do we need to
annotate the sequences?
• Genome annotation is the process of finding and designating locations
of individual genes and other features on raw DNA sequences, called
assemblies.
22
• Feature annotation is the addition of biological features such as
genes and associated coding regions, structural RNA, variation
information, exon, introns, etc. to your submitted sequence.
• The annotation should include the location of the feature (start and
stop) and a description of the feature.
• The addition of feature annotation to the sequence:
Improves the quality of your submission.
Increases the efficiency with which your submitted sequences
are processed by members of the GenBank staff.
Is of far greater use to the scientific community than sequence
data alone.
23
24
25
26
Sequence Alignment
• The sequence alignment is made between known sequence and unknown
sequence or between two unknown sequences.
• The known sequence is called reference sequence. The unknown sequence is
called query sequence.
• Sequence alignment is useful for discovering structural, functional and
evolutionary information.
• Different algorithms available for alignment are used depending on the
requirement.
27
o Types of Sequence Alignment
 Global Alignment (Needleman-Wunsch Algorithm)
 Local Alignment (Smith-Waterman Algorithm)
Global Alignment : is a matching the residues of two sequences
across their entire length.
o Global alignment matches the identical sequences .
Local Alignment : is a matching two sequence from regions which
have more similarity with each other.
o These methods are mostly defined by Dynamic programming approach
for aligning two different sequences. 28
29
• Pairwise Sequence Alignment is used to identify regions of
similarity that may indicate functional, structural and/or
evolutionary relationships between two biological sequences
(protein or nucleic acid).
• Multiple Sequence Alignment (MSA) is the alignment of three or
more biological sequences of similar length. From the output of
MSA applications, homology can be inferred and the evolutionary
relationship between the sequences studied.
30
31
32
FASTA
NCBI-BLAST
DYNAMIC PROGRAMMING
• It finds the alignment in a more quantitative way by giving particular scores for
matches and mismatches.
• The Dynamic Programming solves the original problem by dividing the
problem into smaller independent sub problems.
• Scoring matrices:
PAM Matrix
BLOSUM Matrix
Gap Penalty
33
• PAM matrices are calculated by observing the differences in closely
related proteins.
• One PAM unit (PAM1) specifies one accepted point mutation per 100
amino acid residues, i.e. 1% change and 99% remains as such.
• BLOcks SUbstitution Matrix, developed by Henikoff and Henikoff in 1992,
used conserved regions.
• These matrices are actual percentage identity values. Simply to say, they
depend on similarity. BLOSUM 62 means there is 62 % similarity.
• Gap Penalty - Dynamic programming algorithms use gap penalties to
maximize the biological meaning.
• Gap penalty is subtracted for each gap that has been introduced.
34
• Identity: Identical Sequences.
• E-Value: The E-value provides information about the likelihood
that a given sequence match is purely by chance.
• The lower the E-value, the less likely the database match is a
result of random chance and therefore the match is more
significant.
• E < 1e - 50 (or 1 × 10-50 ), there should be an extremely high
confidence that the database match is a result of homologous
relationships.
35
Multiple Sequence Alignment
36
Why we do multiple alignments?
• In order to characterize protein families, identify shared regions of
homology in a multiple sequence alignment; (this happens
generally when a sequence search revealed homologies to several
sequences)
• Determination of the consensus sequence of several aligned
sequences.
• Help prediction of the secondary and tertiary structures of new
sequences;
• Preliminary step in molecular evolution analysis using
Phylogenetic methods for constructing phylogenetic trees.
Multiple Alignment Method
• The steps are summarized as follows:
• Compare all sequences pairwise.
• Perform cluster analysis on the pairwise data to generate a hierarchy
for alignment. This may be in the form of a binary tree or a simple
ordering
• Build the multiple alignment by first aligning the most similar pair of
sequences, then the next most similar pair and so on. Once an
alignment of two sequences has been made, then this is fixed. Thus
for a set of sequences A, B, C, D having aligned A with C and B with D
the alignment of A, B, C, D is obtained by comparing the alignments of
A and C with that of B and D using averaged scores at each aligned
position.

More Related Content

Similar to Bioinformaatics for M.Sc. Biotecchnology.pptx

Nitant_Choksi_CAP6545_Presentation_Slides.pptx
Nitant_Choksi_CAP6545_Presentation_Slides.pptxNitant_Choksi_CAP6545_Presentation_Slides.pptx
Nitant_Choksi_CAP6545_Presentation_Slides.pptxNitantChoksi1
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment DesignYaoyu Wang
 
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedJonathan Eisen
 
sequence alignment
sequence alignmentsequence alignment
sequence alignmentammar kareem
 
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedJonathan Eisen
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformaticsAbhishek Vatsa
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GenomeInABottle
 
NCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesNCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesJackie Wirz, PhD
 
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...Christos Argyropoulos
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...VHIR Vall d’Hebron Institut de Recerca
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234alizain9604
 
GLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics WorkshopGLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics WorkshopMorgan Langille
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptxArupKhakhlari1
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein functionLars Juhl Jensen
 
Phylogenetic analysis in nutshell
Phylogenetic analysis in nutshellPhylogenetic analysis in nutshell
Phylogenetic analysis in nutshellAvinash Kumar
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 

Similar to Bioinformaatics for M.Sc. Biotecchnology.pptx (20)

Nitant_Choksi_CAP6545_Presentation_Slides.pptx
Nitant_Choksi_CAP6545_Presentation_Slides.pptxNitant_Choksi_CAP6545_Presentation_Slides.pptx
Nitant_Choksi_CAP6545_Presentation_Slides.pptx
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
 
Ncbi
NcbiNcbi
Ncbi
 
hts ...kafna
hts ...kafna hts ...kafna
hts ...kafna
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
 
Sequence Analysis
Sequence AnalysisSequence Analysis
Sequence Analysis
 
NCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesNCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners Slides
 
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
Correcting bias and variation in small RNA sequencing for optimal (microRNA) ...
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
 
GLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics WorkshopGLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics Workshop
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptx
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
Phylogenetic analysis in nutshell
Phylogenetic analysis in nutshellPhylogenetic analysis in nutshell
Phylogenetic analysis in nutshell
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 

Recently uploaded

Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiNehru place Escorts
 
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Miss joya
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...Garima Khatri
 
Call Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
Call Girls Chennai Megha 9907093804 Independent Call Girls Service ChennaiCall Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
Call Girls Chennai Megha 9907093804 Independent Call Girls Service ChennaiNehru place Escorts
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...narwatsonia7
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Aspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliAspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliRewAs ALI
 
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...Nehru place Escorts
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Miss joya
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...Miss joya
 
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Nehru place Escorts
 
Russian Call Girls in Pune Tanvi 9907093804 Short 1500 Night 6000 Best call g...
Russian Call Girls in Pune Tanvi 9907093804 Short 1500 Night 6000 Best call g...Russian Call Girls in Pune Tanvi 9907093804 Short 1500 Night 6000 Best call g...
Russian Call Girls in Pune Tanvi 9907093804 Short 1500 Night 6000 Best call g...Miss joya
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowRiya Pathan
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Serviceparulsinha
 

Recently uploaded (20)

Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
 
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
 
Call Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
Call Girls Chennai Megha 9907093804 Independent Call Girls Service ChennaiCall Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
Call Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
 
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Servicesauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
 
Aspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliAspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas Ali
 
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
Russian Call Girls in Chennai Pallavi 9907093804 Independent Call Girls Servi...
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
 
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
Call Girls Service Pune Vaishnavi 9907093804 Short 1500 Night 6000 Best call ...
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
 
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
 
Russian Call Girls in Pune Tanvi 9907093804 Short 1500 Night 6000 Best call g...
Russian Call Girls in Pune Tanvi 9907093804 Short 1500 Night 6000 Best call g...Russian Call Girls in Pune Tanvi 9907093804 Short 1500 Night 6000 Best call g...
Russian Call Girls in Pune Tanvi 9907093804 Short 1500 Night 6000 Best call g...
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
 
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
 

Bioinformaatics for M.Sc. Biotecchnology.pptx

  • 2. Nucleotide Databases: • Repository of all nucleic acid sequences of major important Organisms. • Major Public Nucleotide databases: GeneBank, DDBJ and ENA (EMBL-EBI). • Exchange of data through International Nucleotide Sequence Database Collaboration (INSDC).
  • 3.
  • 4. Other specialised Databases: • EST Expressed Sequence Tags are the cDNA sequence (Highly expressed mRNA of portion of Gene) and less than 1000 bp: Used for gene discovery, construction of gene models, alternative splicing prediction, genome annotation, expression profiling, and comparative genomics. In the helminth field, ESTs have extensive application in the discovery of new genes and identification of novel vaccine candidates and drug targets. • dbSNP: dbSNP contains human single nucleotide variations, microsatellites, and small- scale insertions and deletions along with publication, population frequency, molecular consequence. Also genomic and RefSeq (Reference Sequence) mapping information for both common variations and clinical mutations. A dense catalog of SNPs is expected to facilitate large-scale studies in association genetics, functional and pharmaco-genomics, population genetics and evolutionary biology, and positional cloning and physical mapping.
  • 5. Protein Databases • Database of the protein sequences, and the 3D structural data produced by X- ray crystallography and macromolecular NMR. • PDB: A protein structure database contain structures of Proteins solved using X-ray crystallography, NMR and electron Microscopy. • SWISS-PROT: A well curated proteins sequence database also provides a high level of annotation. • Protein Information Resource (PIR) – Protein Sequence Database (PIR-PSD): The sequence in PIR-PSD is also classified based on homology domain and sequence motifs. • TrEMBL: Computer-annotated protein sequence database that is released as a supplement to SWISS-PROT. It contains the translation of all coding sequences present in the EMBL Nucleotide database, which have not been fully annotated. • Pfam: Pfam is a database of protein families that includes their annotations and multiple sequence alignments generated using hidden Markov models.
  • 6. • ClinVar  Stores variants with Clinical interpretation.
  • 7. Defining Sequence Analysis: • Sequence Analysis is the process of subjecting a DNA, RNA or peptide sequence to any of a wide range of analytical methods to understand its features and functions. • It includes: Sequencing and Sequence assembly. Alignment and Database Searching. 7
  • 8. Genome Browsers: • What do you mean by complete genome? Answer will be complete DNA sequences. • If so, then How to view it interactively? • Genome browser enables it to explore the complete genome or the region of the genome of your interest. • Genome browser provides a graphical interface for users to browse, search, retrieve and analyze genomic sequence and annotation data. • Some Genome Browsers: Web Based NCBI Genome Viewer, Ensemble, UCSC and Standalone IGV (Integrative Genome Browser), IGB (Integrated Genome Browser) and ArrayGene Genome Browser (Commercial Genome Browser developed by ArrayGen Technologies Pvt. Ltd. , Pune, India)
  • 9. Gene name or Chromosome Location Current Genome Assembly Organism List
  • 10. Ensemble Genome Browser for Vertebrates:
  • 11.
  • 12.
  • 13. Ensemble Genome Browser for Plants:
  • 14. Ensemble Genome Browser for Prokaryotes:
  • 15. • DNA sequencing is the process of determining the precise order of nucleotides or order of the four A,T,G and C in DNA Strand. • Methods: • Sanger Sequencing or Chain T ermination Method. • Next Generation Sequencing. 15
  • 17. • Tools for Viewing Sanger Sequencing Data: 17
  • 18. • QV10: 10% or 1/10 Chance that the base call is incorrect. • QV20: 1% or 1/100 Chance that the base call is incorrect • QV30: 0.1% or 1/1000 Chance that the base call is incorrect • QV40: 0.01% or 1/10000 Chance that the base call is incorrect • QV50: 0.001% or 1/100000 Chance that the base call is incorrect. Quality Value: -10 log10 Pe 18
  • 22. ANNOTATION • What do you mean by feature annotation and why do we need to annotate the sequences? • Genome annotation is the process of finding and designating locations of individual genes and other features on raw DNA sequences, called assemblies. 22
  • 23. • Feature annotation is the addition of biological features such as genes and associated coding regions, structural RNA, variation information, exon, introns, etc. to your submitted sequence. • The annotation should include the location of the feature (start and stop) and a description of the feature. • The addition of feature annotation to the sequence: Improves the quality of your submission. Increases the efficiency with which your submitted sequences are processed by members of the GenBank staff. Is of far greater use to the scientific community than sequence data alone. 23
  • 24. 24
  • 25. 25
  • 26. 26
  • 27. Sequence Alignment • The sequence alignment is made between known sequence and unknown sequence or between two unknown sequences. • The known sequence is called reference sequence. The unknown sequence is called query sequence. • Sequence alignment is useful for discovering structural, functional and evolutionary information. • Different algorithms available for alignment are used depending on the requirement. 27
  • 28. o Types of Sequence Alignment  Global Alignment (Needleman-Wunsch Algorithm)  Local Alignment (Smith-Waterman Algorithm) Global Alignment : is a matching the residues of two sequences across their entire length. o Global alignment matches the identical sequences . Local Alignment : is a matching two sequence from regions which have more similarity with each other. o These methods are mostly defined by Dynamic programming approach for aligning two different sequences. 28
  • 29. 29
  • 30. • Pairwise Sequence Alignment is used to identify regions of similarity that may indicate functional, structural and/or evolutionary relationships between two biological sequences (protein or nucleic acid). • Multiple Sequence Alignment (MSA) is the alignment of three or more biological sequences of similar length. From the output of MSA applications, homology can be inferred and the evolutionary relationship between the sequences studied. 30
  • 31. 31
  • 33. DYNAMIC PROGRAMMING • It finds the alignment in a more quantitative way by giving particular scores for matches and mismatches. • The Dynamic Programming solves the original problem by dividing the problem into smaller independent sub problems. • Scoring matrices: PAM Matrix BLOSUM Matrix Gap Penalty 33
  • 34. • PAM matrices are calculated by observing the differences in closely related proteins. • One PAM unit (PAM1) specifies one accepted point mutation per 100 amino acid residues, i.e. 1% change and 99% remains as such. • BLOcks SUbstitution Matrix, developed by Henikoff and Henikoff in 1992, used conserved regions. • These matrices are actual percentage identity values. Simply to say, they depend on similarity. BLOSUM 62 means there is 62 % similarity. • Gap Penalty - Dynamic programming algorithms use gap penalties to maximize the biological meaning. • Gap penalty is subtracted for each gap that has been introduced. 34
  • 35. • Identity: Identical Sequences. • E-Value: The E-value provides information about the likelihood that a given sequence match is purely by chance. • The lower the E-value, the less likely the database match is a result of random chance and therefore the match is more significant. • E < 1e - 50 (or 1 × 10-50 ), there should be an extremely high confidence that the database match is a result of homologous relationships. 35
  • 37. Why we do multiple alignments? • In order to characterize protein families, identify shared regions of homology in a multiple sequence alignment; (this happens generally when a sequence search revealed homologies to several sequences) • Determination of the consensus sequence of several aligned sequences. • Help prediction of the secondary and tertiary structures of new sequences; • Preliminary step in molecular evolution analysis using Phylogenetic methods for constructing phylogenetic trees.
  • 38. Multiple Alignment Method • The steps are summarized as follows: • Compare all sequences pairwise. • Perform cluster analysis on the pairwise data to generate a hierarchy for alignment. This may be in the form of a binary tree or a simple ordering • Build the multiple alignment by first aligning the most similar pair of sequences, then the next most similar pair and so on. Once an alignment of two sequences has been made, then this is fixed. Thus for a set of sequences A, B, C, D having aligned A with C and B with D the alignment of A, B, C, D is obtained by comparing the alignments of A and C with that of B and D using averaged scores at each aligned position.