SlideShare a Scribd company logo
Bioinformatics
Bioinformatics is about finding and interpreting
biological data using informatic tools, with the goal of
enabling and accelerating biological research
Bioinformatics spans a wide range of activities
- Data capture
- Automated recording of experimental results
- Data storage
- Visualization of raw data and analytical results
- Access to data using a multitude of databases
and query tools
Workflow
Experimental
Design
Sample collection
and analysis
Data collection,
filtering, and input
Data analysis
Output results
Human based
Computer aided
Focus on
Bioinformatics strategies for
disease gene identification
Traditional Methods of Drug Discovery
natural
(plant-derived) treatment
for illness
↓
isolation of active
compound
(small, organic)
synthesis
of compound
↓
manipulation of
structure to get better
drug
(greater efficacy,
fewer side effects)
Modern Methods of Drug Discovery
What’s different?
• Drug discovery process begins
with a disease (rather than a treatment)
• Use disease model to pinpoint relevant
genetic/biological components (i.e. possible
drug targets)
Defining genetic disease
Genetic disorders are caused by abnormalities in the
genetic material
Abnormalities can range from a small mutation in a single gene to the
addition or subtraction of an entire chromosome or set of
chromosomes.
In general, four types of genetic disorders can be distinguished
Monogenetic
Monogenetic (also called Mendelian or single gene) disorders are caused by
a mutation in one particular pair of gene.
A mutated gene can result in a mutated protein, which can no longer carry
out its normal function.
Over 10,000 human diseases are known to be caused by defects in single
genes, affecting about 1% of the population as a whole.
Monogenetic disorders often have simple and predictable inheritance
patterns.
Thalassaemia
Sickle cell anemia
Haemophilia
Cystic Fibrosis
Tay sachs disease
Fragile X syndrome
Huntington's disease
Monogenetic disorders
Polygenic
Polygenic disorders are due to mutations in multiple genes in combination
with external factors, such as lifestyle and environment
Heritability presents the contritution of genetic factors in the formation of
multiple gene diseases. Higher heritability is generally interpreted as a larger
contribution of genes.
Examples of polygenic diseases include coronary heart disease, diabetes,
hypertension, and peptic ulcers.
At present, there are still many difficulties in prenatal diagnosis for multiple-
gene diseases, however, as technology develops, prenatal diagnosis for
common multiple-gene diseases will be available in the near future.
Type 1 diabetes
Multiple sclerosis
Autism
Asthma
Celiac disease
Polygenic disorders
Chromosomal
Abnormalities in the chromosomal number or structure, e.g. (partial)
deletion, extra copies, breakage, and (partial) rearrangements, can result in
disease.
Down syndrome
Klinefelter's syndrome
Prader–Willi syndrome
Turner syndrome
Chromosomal
Mitochondrial
Mitochondria, like the cell nucleus, contains DNA (mtDNA), which is the
biggest difference between mitochondria and other sub-units. mtDNA is only
inherited from the mother and exhibits higher mutation rate than that of
nuclear DNA as well as low repair capacity.
Mitochondrial diseases have threshold effects. That means mitochondrial
diseases could occur only if the abnormal mtDNA exceeds the threshold.
Although sometimes diseases would not happen in the female carriers, for
their underthreshold abnormal mtDNA or certain nuclear effects, mutant
mtDNA can also be passed from generation to generation.
Kearns-Sayre syndrome
Chronic progressive external ophthalmoplegia
Mitochondrial encephalomyopathy with lactic acidosis
Leigh syndrome
Mitochondrial disorders
DISEASE
Gene identification/finding of inherited disease
Every gene has a specific task
Identification of disease genes is similar to finding genes
responsible for normal functions
The mutation may be within a gene/protein or within a
regulatory part of the genome that, e.g., affects the amount of
protein being produced.
The mutation changes the protein, which alters the way the
task is usually performed
Gene identification/finding of inherited disease
Timeline
1983 – Invention of Polymerase Chain Reaction (PCR) technique by Kary Mullis
1989 – the National Center for Human Genome Research is created
1990 – the Human Genome Project (HGP) starts to map and sequence human DNA
1996 – the DNA sequence of the first eukaryotic genome (S. Cerevisiae) is completed
2003 – the human genome sequence is completed
Now – the genome sequences are still frequently updated with new and rearranged
sequences, and some parts are still missing.
2002 – the mouse genome sequence is completed
Gene identification/finding of inherited disease
We have a huge amount of genetic data in place.
And now?
Find a candidate gene!
Candidate gene definition
A candidate gene is a gene that is suspected
to be involved in a genetic disease
It is located in a chromosome region suspected of
being involved in the expression of a trait such as a
disease, whose protein product suggests that it could
be the gene in question.
Disease genes identification in complex disorders
Complex disorders are multifactorial and many such
diseases, like heart and vascular disease are quite
common.
Five steps are applicable to research of
a complex disease:
I. Establish that the disease is indeed (partially ) caused
by genetic factors
To prove that the candidate is in fact a gene, demonstration of a
genetic mutation is needed.
Mutation analysis can be done by direct sequencing. Changes in
the splicing process of the gene may be missed when screening
protein-coding DNA sequences only, but are detectable at the
RNA level using RT-PCR. With RT-PCR and related methods it is
possible to evaluate whether the spatio-temporal gene
expression pattern is compatible with the phenotype of
interest.
Final proof may require the examination of the effect of
induced mutation in model organisms.
Mutation analysis
II. Perform segregation analysis on individual pedigree to
determine the type of inheritance.
Inheritance can vary from Mendelian to polygenic,
depending on penetrance and environment.
The mode of inheritance determines the linkage analysis
methods applicable (next step).
Segregation analysis
E.g. Genetic Association Interaction Analysis Software (GAIA)
http://www.bbu.cf.ac.uk/html/research/biostats.htm
III. Perform linkage analysis to map susceptibility loci
Genetic linkage analysis is a statistical method that is used to
associate functionality of genes to their location on
chromosomes. The main idea is that markers which are found in
vicinity on the chromosome have a tendency to stick together
when passed on to offsprings. Thus, if some disease is often
passed to offsprings along with specific markers, then it can be
concluded that the gene(s) which are responsible for the disease
are located close on the chromosome to these markers.
Parametric, useful for Mendelian traits;
Non parametric, useful for complex diseases.
IV. Fine mapping of the susceptibility gene by
population-association studies
Association studies require the use of DNA from many
individuals. However, association studies do not use families.
Rather, they look at DNA from affected individuals compared
to DNA of controls (non-affected individuals who do not have
to be relatives.)
After using linkage to get an idea where disease genes may be
located, use association to try to better locate the gene.
Association allows to test candidate genes, or very small
genetic regions, to see if they are associated with the
phenotype in study. These tests can result in the location of a
risk gene.
V. Elucidation of the DNA sequences/genes
Confirm their molecular and biochemical action and involvement.
Relatively easy in case of Mendelian disorders, because the
disease is due to a single change.
In complex disorders the susceptibility is often modelled as a
quantitative trait locus (QTL)
In silico positional cloning
Once the critical region for a genetic disease has been
determined by linkage analysis, population-association, etc., the
human genome sequence can be used to identify positional
candidate disease genes.
Genome browsers, biological databases, and other bioinformatics
tools all contribute to the gene finding strategy.
Bioinformatics approach to disease gene identification
The release of genomic sequences, full-lenght cDNA sequences,
expressed sequence tags (ESTs), and large-scale expression micro-
array data of human and model organisms (e.g. Mus Musculus)
offer invaluable resources for studying genetic diseases.
This huge amount of data is stored in numerous different
databases, thus making the use of high performance computing an
essential tool for decoding the information contained in these
databases.
Pedigree Analysis Software
E.g. MERLIN
http://www.sph.umich.edu/csg/abecasis/Merlin/index.html
DNA Data Bank of Japan
http://www.ddbj.nig.ac.jp/index-e.html
European Molecular Biology Laboratory database
http://www.ensembl.org/
GenBank
http://www.ncbi.nlm.nih.gov/genbank/
First Step
To search for all genes between two genetic markers on the
chromosome under study
Essential is a proper description of the location of genes and
other annotations like regulatory elements
Databases and computational tools have been developed to
identify all genes on the human genome sequence. None is
perfect and genes may be missed, or false genes may be
annotated  manual evaluation is necessary or..
Multiple sequence analyses on different databases
should be performed
USCS Genome Browser
http://genome.ucsc.edu/
Second Step
Functional cloning and candidate gene selection
We identified all the genes between the genetic markers
In theory, every gene within the disease critical region can cause the disease.
When the critical region is large, or the gene density is high,
positional candidates are many.
Strategies:
•There may be already a suspicion on the biochemical/pathogenic background of the
disease
•If a genetic disorder affects e.g. the liver, select only genes expressed in liver
•For known genes, the knowledge in literature can be used to select the candidate genes
•Genes located within the critical disease region that have a functional similarity to genes
involved in related diseases are good candidates
The Gene Ontology project is a major bioinformatics initiative with the aim
of standardizing the representation of gene and gene product attributes
across species and databases. The project provides a controlled vocabulary
of terms for describing gene product characteristics and gene product
annotation data from GO Consortium members, as well as tools to access
and process this data.
Database for Annotation, Visualization and Integrated Discovery
http://david.abcc.ncifcrf.gov/home.jsp
Systematic and integrative analysis of large gene lists using DAVID Bioinformatics
Resources. (2009) Nat Protoc. 4(1):44 -57.
Further, knowledge of model organisms makes
comparative candidate selection possible
This situation applies when a gene is known, which
causes a similar phenotype in other species.
Transfer of knowledge by phenotype is strightforward
in Mus Musculus, being evolutionarily close to humans
This grid, called Oxford grid, shows the relationship between human and mouse
chromosomes. Chromosome location of either of the species often predicts the
chromosome location in the other species.
When none of the known genes has mutations, it is possible to
try to find new genes in the critical region.
Comparative genome analysis of related species
present us with a wealth of opportunities for studying
evolution and gene/protein function.
Homology-based function-prediction transfers information from
known genes/proteins to unknow sequences and remains the
primary method to determine the function of a new gene
Example of homology-based methods are
Basic Local Alignment Search Tool (BLAST)
Evolutionary annotation database (EVOLA)
BLAST
http://blast.ncbi.nlm.nih.gov/
EVOLA
http://www.h-invitational.jp/evola/search.html
EVOLA
http://www.h-invitational.jp/evola/search.html
Biological Networks
Over the last years, the wealth of information derived from
high-throughput interaction screening methods have been
used to map different biological interactions.
These maps provide a vision of the molecular networks in
biological systems.
C
B
A
D
F E
Protein-protein interaction networks represent the interaction
between proteins such as the building of protein complexes and the
activation of one protein by another protein.
Gene regulatory and signal transduction networks describe how genes
can be activated or repressed and therefore which proteins are
produced in a cell at a particular time.
Metabolic networks show how metabolites are transformed, for
example to produce energy or synthesize specific substances.
Biological Networks
Gene regulatory, protein-protein interaction and metabolic networks
interact with each other and build a complex network of interactions.
The study of biological network is essential to understand the
role of candidate genes in genetic diseases
Biological Networks
Finally they are very useful to identify the
genotypes that are associated with phenotypes,
a major goal in genetic research
Ingenuity Pathway Analysis Software
http://www.ingenuity.com/index.html
Confirming a candidate gene
Selected genes have to be tested individually to see if there is
evidence that mutations in them do cause the disease in question.
Mutation screening. Identifying mutations in several unrelated affected individuals
strongly suggests that the correct candidate gene has been chosen, but formal proof
requires additional evidence.
Restoration of normal phenotype in vitro. If a cell line that displays the mutant
phenotype can be cultured from the cells of a patient, transfection of a cloned normal
allele into the cultured disease cells may result in restoration of the normal phenotype
by complementing the genetic deficiency.
Production of a mouse model of the disease. Once a putative disease gene is
identified, a transgenic mouse model can be constructed. If the human phenotype is
known to result from loss of function, gene targeting can be used to generate a
germline knockout mutation in the mouse ortholog. The mutant mice are expected to
show some resemblance to humans with the disease.
Suggested readings
1. A. L. Barabási, N. Gulbahce, J. Loscalzo, Network medicine: a network-based
approach to human disease. Nature Reviews Genetics 12, 56 (2011).
2. J. K. DiStefano, Disease Gene Identification: Methods and Protocols. (Humana Press,
2011).
3. S. D. Mooney, V. G. Krishnan, U. S. Evani, Bioinformatic tools for identifying disease
gene and SNP candidates. Methods Mol. Biol 628, 307 (2010).
4. A. Schlicker, T. Lengauer, M. Albrecht, Improving disease gene prioritization using the
semantic similarity of Gene Ontology terms. Bioinformatics 26, i561 (2010).
5. N. Tiffin et al., Integration of text-and data-mining using ontologies successfully
selects disease gene candidates. Nucleic acids research 33, 1544 (2005).
6. Y. Zhang et al., Systematic analysis, comparison, and integration of disease based
human genetic association data and mouse genetic phenotypic information. BMC
medical genomics 3, 1 (2010).

More Related Content

What's hot

Application of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureApplication of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticulture
Dr.Hetalkumar Panchal
 
Rt131 lec 2
Rt131 lec 2Rt131 lec 2
Rt131 lec 2
MCsalvador
 
Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and Bioinformatics
Sharif Shuvo
 
Bioinformatics & It's Scope in Biotechnology
Bioinformatics & It's Scope in BiotechnologyBioinformatics & It's Scope in Biotechnology
Bioinformatics & It's Scope in Biotechnology
Tuhin Samanta
 
Applications of bioinformatics
Applications of bioinformaticsApplications of bioinformatics
Applications of bioinformatics
Sudha Rameshwari
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
JTADrexel
 
Bioinformatics-General_Intro
Bioinformatics-General_IntroBioinformatics-General_Intro
Bioinformatics-General_Intro
Abhiroop Ghatak
 
Bioinformatics in present and its future
Bioinformatics in present and its futureBioinformatics in present and its future
Bioinformatics in present and its future
হলুদ হিমু
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
KhanhNgoc LiLa
 
Bioinformatics
BioinformaticsBioinformatics
Applications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuApplications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahu
KAUSHAL SAHU
 
Current Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsCurrent Trends & Developments of Bioinformatics
Current Trends & Developments of Bioinformatics
Yousif A. Algabri
 
Bioinformatics Database Computer applications
Bioinformatics Database Computer applicationsBioinformatics Database Computer applications
Bioinformatics Database Computer applications
Yogi Raikwar
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
KAUSHAL SAHU
 
Industry Program In For Sci
Industry Program In For SciIndustry Program In For Sci
Industry Program In For Scibiinoida
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
SKUASTKashmir
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
Biotech Online
 

What's hot (19)

Application of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureApplication of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticulture
 
Rt131 lec 2
Rt131 lec 2Rt131 lec 2
Rt131 lec 2
 
Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and Bioinformatics
 
Bioinformatics & It's Scope in Biotechnology
Bioinformatics & It's Scope in BiotechnologyBioinformatics & It's Scope in Biotechnology
Bioinformatics & It's Scope in Biotechnology
 
Applications of bioinformatics
Applications of bioinformaticsApplications of bioinformatics
Applications of bioinformatics
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics-General_Intro
Bioinformatics-General_IntroBioinformatics-General_Intro
Bioinformatics-General_Intro
 
Bioinformatics in present and its future
Bioinformatics in present and its futureBioinformatics in present and its future
Bioinformatics in present and its future
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Applications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuApplications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahu
 
Current Trends & Developments of Bioinformatics
Current Trends & Developments of BioinformaticsCurrent Trends & Developments of Bioinformatics
Current Trends & Developments of Bioinformatics
 
Bioinformatics Database Computer applications
Bioinformatics Database Computer applicationsBioinformatics Database Computer applications
Bioinformatics Database Computer applications
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Industry Program In For Sci
Industry Program In For SciIndustry Program In For Sci
Industry Program In For Sci
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 

Similar to Bioinformatics

Thesis On Psoriasis
Thesis On PsoriasisThesis On Psoriasis
Thesis On Psoriasis
Amanda Burkett
 
Personalized medicine through wes and big data analytics
Personalized medicine through wes and big data analyticsPersonalized medicine through wes and big data analytics
Personalized medicine through wes and big data analytics
JunaidAKG
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The Clinic
Joaquin Dopazo
 
Genetic Mapping & its applications in different field of science
Genetic Mapping & its applications in different field of scienceGenetic Mapping & its applications in different field of science
Genetic Mapping & its applications in different field of science
habibullah181
 
Pharmacogenomics
PharmacogenomicsPharmacogenomics
Pharmacogenomics
ANAND SAGAR TIWARI
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformaticaMartín Arrieta
 
Application of Genetics in Nursing Practice
Application of Genetics in Nursing PracticeApplication of Genetics in Nursing Practice
Application of Genetics in Nursing Practice
Sherwood College of Nursing, Barabanki, Uttar Pradesh
 
Human genome project 2007
Human genome project 2007Human genome project 2007
Human genome project 2007Hesham Gaber
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And Challenges
Amos Watentena
 
Human Therapeutics
Human TherapeuticsHuman Therapeutics
Human Therapeuticsmgsonline
 
Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategies
Ashfaq Ahmad
 
Human genetic technologies
Human genetic technologiesHuman genetic technologies
Human genetic technologies
Aparna Chaudhary
 
POSTGENETICS MEDICINE
POSTGENETICS MEDICINEPOSTGENETICS MEDICINE
POSTGENETICS MEDICINE
inemet
 
Single-Gene Disorders: Understanding Genetics With Examples | The Lifescience...
Single-Gene Disorders: Understanding Genetics With Examples | The Lifescience...Single-Gene Disorders: Understanding Genetics With Examples | The Lifescience...
Single-Gene Disorders: Understanding Genetics With Examples | The Lifescience...
The Lifesciences Magazine
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2Eli Rosenthal
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2
teamchaotex
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?
Hyung Jin Choi
 
(Genomics in human health)
 (Genomics in human health) (Genomics in human health)
(Genomics in human health)
iqraakbar8
 
4. HGP.pptx
4. HGP.pptx4. HGP.pptx
4. HGP.pptx
ssuserfdb252
 

Similar to Bioinformatics (20)

Thesis On Psoriasis
Thesis On PsoriasisThesis On Psoriasis
Thesis On Psoriasis
 
Personalized medicine through wes and big data analytics
Personalized medicine through wes and big data analyticsPersonalized medicine through wes and big data analytics
Personalized medicine through wes and big data analytics
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The Clinic
 
Genetic screening & gene therapy
Genetic screening & gene therapyGenetic screening & gene therapy
Genetic screening & gene therapy
 
Genetic Mapping & its applications in different field of science
Genetic Mapping & its applications in different field of scienceGenetic Mapping & its applications in different field of science
Genetic Mapping & its applications in different field of science
 
Pharmacogenomics
PharmacogenomicsPharmacogenomics
Pharmacogenomics
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
 
Application of Genetics in Nursing Practice
Application of Genetics in Nursing PracticeApplication of Genetics in Nursing Practice
Application of Genetics in Nursing Practice
 
Human genome project 2007
Human genome project 2007Human genome project 2007
Human genome project 2007
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And Challenges
 
Human Therapeutics
Human TherapeuticsHuman Therapeutics
Human Therapeutics
 
Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategies
 
Human genetic technologies
Human genetic technologiesHuman genetic technologies
Human genetic technologies
 
POSTGENETICS MEDICINE
POSTGENETICS MEDICINEPOSTGENETICS MEDICINE
POSTGENETICS MEDICINE
 
Single-Gene Disorders: Understanding Genetics With Examples | The Lifescience...
Single-Gene Disorders: Understanding Genetics With Examples | The Lifescience...Single-Gene Disorders: Understanding Genetics With Examples | The Lifescience...
Single-Gene Disorders: Understanding Genetics With Examples | The Lifescience...
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?
 
(Genomics in human health)
 (Genomics in human health) (Genomics in human health)
(Genomics in human health)
 
4. HGP.pptx
4. HGP.pptx4. HGP.pptx
4. HGP.pptx
 

More from Hafeezarana

Translation
TranslationTranslation
Translation
Hafeezarana
 
Baisc biomolecules
Baisc biomoleculesBaisc biomolecules
Baisc biomolecules
Hafeezarana
 
Cytoplam
CytoplamCytoplam
Cytoplam
Hafeezarana
 
Er
ErEr
Carbohydrate
CarbohydrateCarbohydrate
Carbohydrate
Hafeezarana
 
Kaaryotype and abberation
Kaaryotype and abberationKaaryotype and abberation
Kaaryotype and abberation
Hafeezarana
 
Golgi complex
Golgi complexGolgi complex
Golgi complex
Hafeezarana
 
Cell cycle
Cell cycleCell cycle
Cell cycle
Hafeezarana
 
Cell membbrane
Cell membbraneCell membbrane
Cell membbrane
Hafeezarana
 
Ecosystem
EcosystemEcosystem
Ecosystem
Hafeezarana
 

More from Hafeezarana (10)

Translation
TranslationTranslation
Translation
 
Baisc biomolecules
Baisc biomoleculesBaisc biomolecules
Baisc biomolecules
 
Cytoplam
CytoplamCytoplam
Cytoplam
 
Er
ErEr
Er
 
Carbohydrate
CarbohydrateCarbohydrate
Carbohydrate
 
Kaaryotype and abberation
Kaaryotype and abberationKaaryotype and abberation
Kaaryotype and abberation
 
Golgi complex
Golgi complexGolgi complex
Golgi complex
 
Cell cycle
Cell cycleCell cycle
Cell cycle
 
Cell membbrane
Cell membbraneCell membbrane
Cell membbrane
 
Ecosystem
EcosystemEcosystem
Ecosystem
 

Recently uploaded

Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
AlguinaldoKong
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
sachin783648
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
ssuserbfdca9
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 

Recently uploaded (20)

Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 

Bioinformatics

  • 1. Bioinformatics Bioinformatics is about finding and interpreting biological data using informatic tools, with the goal of enabling and accelerating biological research
  • 2. Bioinformatics spans a wide range of activities - Data capture - Automated recording of experimental results - Data storage - Visualization of raw data and analytical results - Access to data using a multitude of databases and query tools
  • 3. Workflow Experimental Design Sample collection and analysis Data collection, filtering, and input Data analysis Output results Human based Computer aided
  • 4. Focus on Bioinformatics strategies for disease gene identification
  • 5. Traditional Methods of Drug Discovery natural (plant-derived) treatment for illness ↓ isolation of active compound (small, organic)
  • 6. synthesis of compound ↓ manipulation of structure to get better drug (greater efficacy, fewer side effects)
  • 7. Modern Methods of Drug Discovery What’s different? • Drug discovery process begins with a disease (rather than a treatment) • Use disease model to pinpoint relevant genetic/biological components (i.e. possible drug targets)
  • 8. Defining genetic disease Genetic disorders are caused by abnormalities in the genetic material Abnormalities can range from a small mutation in a single gene to the addition or subtraction of an entire chromosome or set of chromosomes. In general, four types of genetic disorders can be distinguished
  • 9. Monogenetic Monogenetic (also called Mendelian or single gene) disorders are caused by a mutation in one particular pair of gene. A mutated gene can result in a mutated protein, which can no longer carry out its normal function. Over 10,000 human diseases are known to be caused by defects in single genes, affecting about 1% of the population as a whole. Monogenetic disorders often have simple and predictable inheritance patterns.
  • 10. Thalassaemia Sickle cell anemia Haemophilia Cystic Fibrosis Tay sachs disease Fragile X syndrome Huntington's disease Monogenetic disorders
  • 11. Polygenic Polygenic disorders are due to mutations in multiple genes in combination with external factors, such as lifestyle and environment Heritability presents the contritution of genetic factors in the formation of multiple gene diseases. Higher heritability is generally interpreted as a larger contribution of genes. Examples of polygenic diseases include coronary heart disease, diabetes, hypertension, and peptic ulcers. At present, there are still many difficulties in prenatal diagnosis for multiple- gene diseases, however, as technology develops, prenatal diagnosis for common multiple-gene diseases will be available in the near future.
  • 12. Type 1 diabetes Multiple sclerosis Autism Asthma Celiac disease Polygenic disorders
  • 13. Chromosomal Abnormalities in the chromosomal number or structure, e.g. (partial) deletion, extra copies, breakage, and (partial) rearrangements, can result in disease.
  • 14. Down syndrome Klinefelter's syndrome Prader–Willi syndrome Turner syndrome Chromosomal
  • 15. Mitochondrial Mitochondria, like the cell nucleus, contains DNA (mtDNA), which is the biggest difference between mitochondria and other sub-units. mtDNA is only inherited from the mother and exhibits higher mutation rate than that of nuclear DNA as well as low repair capacity. Mitochondrial diseases have threshold effects. That means mitochondrial diseases could occur only if the abnormal mtDNA exceeds the threshold. Although sometimes diseases would not happen in the female carriers, for their underthreshold abnormal mtDNA or certain nuclear effects, mutant mtDNA can also be passed from generation to generation.
  • 16. Kearns-Sayre syndrome Chronic progressive external ophthalmoplegia Mitochondrial encephalomyopathy with lactic acidosis Leigh syndrome Mitochondrial disorders
  • 17. DISEASE Gene identification/finding of inherited disease Every gene has a specific task Identification of disease genes is similar to finding genes responsible for normal functions The mutation may be within a gene/protein or within a regulatory part of the genome that, e.g., affects the amount of protein being produced. The mutation changes the protein, which alters the way the task is usually performed
  • 18. Gene identification/finding of inherited disease Timeline 1983 – Invention of Polymerase Chain Reaction (PCR) technique by Kary Mullis 1989 – the National Center for Human Genome Research is created 1990 – the Human Genome Project (HGP) starts to map and sequence human DNA 1996 – the DNA sequence of the first eukaryotic genome (S. Cerevisiae) is completed 2003 – the human genome sequence is completed Now – the genome sequences are still frequently updated with new and rearranged sequences, and some parts are still missing. 2002 – the mouse genome sequence is completed
  • 19. Gene identification/finding of inherited disease We have a huge amount of genetic data in place. And now? Find a candidate gene!
  • 20. Candidate gene definition A candidate gene is a gene that is suspected to be involved in a genetic disease It is located in a chromosome region suspected of being involved in the expression of a trait such as a disease, whose protein product suggests that it could be the gene in question.
  • 21. Disease genes identification in complex disorders Complex disorders are multifactorial and many such diseases, like heart and vascular disease are quite common. Five steps are applicable to research of a complex disease:
  • 22. I. Establish that the disease is indeed (partially ) caused by genetic factors To prove that the candidate is in fact a gene, demonstration of a genetic mutation is needed. Mutation analysis can be done by direct sequencing. Changes in the splicing process of the gene may be missed when screening protein-coding DNA sequences only, but are detectable at the RNA level using RT-PCR. With RT-PCR and related methods it is possible to evaluate whether the spatio-temporal gene expression pattern is compatible with the phenotype of interest. Final proof may require the examination of the effect of induced mutation in model organisms.
  • 24. II. Perform segregation analysis on individual pedigree to determine the type of inheritance. Inheritance can vary from Mendelian to polygenic, depending on penetrance and environment. The mode of inheritance determines the linkage analysis methods applicable (next step).
  • 25. Segregation analysis E.g. Genetic Association Interaction Analysis Software (GAIA) http://www.bbu.cf.ac.uk/html/research/biostats.htm
  • 26. III. Perform linkage analysis to map susceptibility loci Genetic linkage analysis is a statistical method that is used to associate functionality of genes to their location on chromosomes. The main idea is that markers which are found in vicinity on the chromosome have a tendency to stick together when passed on to offsprings. Thus, if some disease is often passed to offsprings along with specific markers, then it can be concluded that the gene(s) which are responsible for the disease are located close on the chromosome to these markers. Parametric, useful for Mendelian traits; Non parametric, useful for complex diseases.
  • 27. IV. Fine mapping of the susceptibility gene by population-association studies Association studies require the use of DNA from many individuals. However, association studies do not use families. Rather, they look at DNA from affected individuals compared to DNA of controls (non-affected individuals who do not have to be relatives.) After using linkage to get an idea where disease genes may be located, use association to try to better locate the gene. Association allows to test candidate genes, or very small genetic regions, to see if they are associated with the phenotype in study. These tests can result in the location of a risk gene.
  • 28. V. Elucidation of the DNA sequences/genes Confirm their molecular and biochemical action and involvement. Relatively easy in case of Mendelian disorders, because the disease is due to a single change. In complex disorders the susceptibility is often modelled as a quantitative trait locus (QTL)
  • 29. In silico positional cloning Once the critical region for a genetic disease has been determined by linkage analysis, population-association, etc., the human genome sequence can be used to identify positional candidate disease genes. Genome browsers, biological databases, and other bioinformatics tools all contribute to the gene finding strategy.
  • 30. Bioinformatics approach to disease gene identification The release of genomic sequences, full-lenght cDNA sequences, expressed sequence tags (ESTs), and large-scale expression micro- array data of human and model organisms (e.g. Mus Musculus) offer invaluable resources for studying genetic diseases. This huge amount of data is stored in numerous different databases, thus making the use of high performance computing an essential tool for decoding the information contained in these databases.
  • 31. Pedigree Analysis Software E.g. MERLIN http://www.sph.umich.edu/csg/abecasis/Merlin/index.html
  • 32. DNA Data Bank of Japan http://www.ddbj.nig.ac.jp/index-e.html
  • 33. European Molecular Biology Laboratory database http://www.ensembl.org/
  • 35. First Step To search for all genes between two genetic markers on the chromosome under study Essential is a proper description of the location of genes and other annotations like regulatory elements Databases and computational tools have been developed to identify all genes on the human genome sequence. None is perfect and genes may be missed, or false genes may be annotated  manual evaluation is necessary or.. Multiple sequence analyses on different databases should be performed
  • 37. Second Step Functional cloning and candidate gene selection We identified all the genes between the genetic markers In theory, every gene within the disease critical region can cause the disease. When the critical region is large, or the gene density is high, positional candidates are many. Strategies: •There may be already a suspicion on the biochemical/pathogenic background of the disease •If a genetic disorder affects e.g. the liver, select only genes expressed in liver •For known genes, the knowledge in literature can be used to select the candidate genes •Genes located within the critical disease region that have a functional similarity to genes involved in related diseases are good candidates
  • 38. The Gene Ontology project is a major bioinformatics initiative with the aim of standardizing the representation of gene and gene product attributes across species and databases. The project provides a controlled vocabulary of terms for describing gene product characteristics and gene product annotation data from GO Consortium members, as well as tools to access and process this data.
  • 39. Database for Annotation, Visualization and Integrated Discovery http://david.abcc.ncifcrf.gov/home.jsp Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. (2009) Nat Protoc. 4(1):44 -57.
  • 40.
  • 41.
  • 42.
  • 43. Further, knowledge of model organisms makes comparative candidate selection possible This situation applies when a gene is known, which causes a similar phenotype in other species. Transfer of knowledge by phenotype is strightforward in Mus Musculus, being evolutionarily close to humans
  • 44. This grid, called Oxford grid, shows the relationship between human and mouse chromosomes. Chromosome location of either of the species often predicts the chromosome location in the other species.
  • 45. When none of the known genes has mutations, it is possible to try to find new genes in the critical region. Comparative genome analysis of related species present us with a wealth of opportunities for studying evolution and gene/protein function. Homology-based function-prediction transfers information from known genes/proteins to unknow sequences and remains the primary method to determine the function of a new gene Example of homology-based methods are Basic Local Alignment Search Tool (BLAST) Evolutionary annotation database (EVOLA)
  • 49. Biological Networks Over the last years, the wealth of information derived from high-throughput interaction screening methods have been used to map different biological interactions. These maps provide a vision of the molecular networks in biological systems. C B A D F E
  • 50. Protein-protein interaction networks represent the interaction between proteins such as the building of protein complexes and the activation of one protein by another protein. Gene regulatory and signal transduction networks describe how genes can be activated or repressed and therefore which proteins are produced in a cell at a particular time. Metabolic networks show how metabolites are transformed, for example to produce energy or synthesize specific substances. Biological Networks Gene regulatory, protein-protein interaction and metabolic networks interact with each other and build a complex network of interactions.
  • 51. The study of biological network is essential to understand the role of candidate genes in genetic diseases Biological Networks Finally they are very useful to identify the genotypes that are associated with phenotypes, a major goal in genetic research
  • 52. Ingenuity Pathway Analysis Software http://www.ingenuity.com/index.html
  • 53. Confirming a candidate gene Selected genes have to be tested individually to see if there is evidence that mutations in them do cause the disease in question. Mutation screening. Identifying mutations in several unrelated affected individuals strongly suggests that the correct candidate gene has been chosen, but formal proof requires additional evidence. Restoration of normal phenotype in vitro. If a cell line that displays the mutant phenotype can be cultured from the cells of a patient, transfection of a cloned normal allele into the cultured disease cells may result in restoration of the normal phenotype by complementing the genetic deficiency. Production of a mouse model of the disease. Once a putative disease gene is identified, a transgenic mouse model can be constructed. If the human phenotype is known to result from loss of function, gene targeting can be used to generate a germline knockout mutation in the mouse ortholog. The mutant mice are expected to show some resemblance to humans with the disease.
  • 54. Suggested readings 1. A. L. Barabási, N. Gulbahce, J. Loscalzo, Network medicine: a network-based approach to human disease. Nature Reviews Genetics 12, 56 (2011). 2. J. K. DiStefano, Disease Gene Identification: Methods and Protocols. (Humana Press, 2011). 3. S. D. Mooney, V. G. Krishnan, U. S. Evani, Bioinformatic tools for identifying disease gene and SNP candidates. Methods Mol. Biol 628, 307 (2010). 4. A. Schlicker, T. Lengauer, M. Albrecht, Improving disease gene prioritization using the semantic similarity of Gene Ontology terms. Bioinformatics 26, i561 (2010). 5. N. Tiffin et al., Integration of text-and data-mining using ontologies successfully selects disease gene candidates. Nucleic acids research 33, 1544 (2005). 6. Y. Zhang et al., Systematic analysis, comparison, and integration of disease based human genetic association data and mouse genetic phenotypic information. BMC medical genomics 3, 1 (2010).