SlideShare a Scribd company logo
Bioinformatics:
What, Why and
Where?
Mohamed El-Hadidi
Assistant Professor of Bioinformatics
Biomedical Informatics Program Director
School of Information Technology and Computer Science
Nile University
Where DNA is Located in our Body?
6/3/2020 Bioinformatics: What, Why and Where? 2
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
How many cells
in the Human
Body?
10 Trillion Cells!
6/3/2020 Bioinformatics: What, Why and Where? 3
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
How many
chromosomes in one
cell?
46
Chromosomes!
6/3/2020 4Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
What is the length of
all chromosomes in
one cell?
2 m in one cell!
1500 times from Earth to
moon (all cells)
6/3/2020 5Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
What are in
these files?
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
6/3/2020 6Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
What are in
these files?
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
6/3/2020 7Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
How many
nucleotides in the
Human body?
3 Billion
Nucleotides!
6/3/2020 8Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
What is the size
of data?
150 GB/person
6/3/2020 9Bioinformatics: What, Why and Where?
How These Files were Generated?
6/3/2020 Bioinformatics: What, Why and Where? 10
How These Files were Generated?
6/3/2020 Bioinformatics: What, Why and Where? 11
Bioinformatics Data is
Increasing Rapidly!
• Speed of sequencing?
 10,000 bp/day/machine ->
billions bp/day/machine.
• Computing cost and time?
 Sequencing cost is falling 5X
faster than computing
• Price / genome?
 Dropped to $1000!
• Storage cost?
 150 GB/genome
Bioinformatics: What, Why and Where? 12
How These Files were Generated?
6/3/2020 13
How These Files were Generated?
Bioinformatics: What, Why and Where?
6/3/2020 Bioinformatics: What, Why and Where? 14
What to Do with These Files?
Making sense of this BIG DATA!
How to Make Sense of This BIG DATA?
Through Bioinformatics!
What is Bioinformatics??!
6/3/2020 Bioinformatics: What, Why and Where? 15
What Do You Need to Learn Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 16
Statistics
Computer
Science
Biology
Bioinformatics
Data
Science
Biostatistics Computational
Biology
What is
Bioinformatics?
https://qph.fs.quoracdn.net/main-qimg-73f348d1d5ee87af6955de6c53a444cf
6/3/2020 Bioinformatics: What, Why and Where? 17
What is
Bioinformatics?
https://bioinformaticsonline.com/mod/file/thumbnail.php?file_guid=4482&size=large&icontime=1379016276
6/3/2020 Bioinformatics: What, Why and Where? 18
What is Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 19
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
What is Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 20
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
Use Existing tools to build
analysis workflows
• Linux
• Command Line
• Scripting
Develop your own tools
• Programming
• Algorithm Design
• Machine Learning
What is Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 21
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
Use Existing tools to build
analysis workflows
Develop your own tools
• Linux
• Command Line
• Scripting
• Programming
• Algorithm Design
• Machine Learning
A = 1765 G = 3561
C = 2677 T = 1121
What is Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 22
Use Existing tools to build
analysis workflows
Develop your own tools
• Linux
• Command Line
• Scripting
• Programming
• Algorithm Design
• Machine Learning
Biologist (Biology Background)
Use existing bioinformatics tools
Computer Scientist (CS Background)
Develops bioinformatics tools
Basic User
Windows OS
Web-based Tools
GUI Standalone tools
No Programming skills
Advanced User
Linux OS
Command line Standalone
tools
Basic Programming Skills
Developer
Basic Biology Knowledge
Advanced Programming Skills
Advanced Mathematics
Advanced Statistics
Who Can Be a Bioinformatician?
6/3/2020 Bioinformatics: What, Why and Where? 23
How can I Learn Bioinformatics?
Tons of free courses are available online!
More than 26 million
results when searching
without comma!
6/3/2020 Bioinformatics: What, Why and Where? 24
How can I Learn Bioinformatics?
Tons of free courses are available online!
More than 46 million
results when searching
without comma!
6/3/2020 Bioinformatics: What, Why and Where? 25
Examples of Free Online Bioinformatics MOOCs
Websites
6/3/2020 Bioinformatics: What, Why and Where? 26
6/3/2020 Bioinformatics: What, Why and Where? 27
Milestones of
Bioinformatics
28
• OMICS Sciences
• Programming and Data
Structure
•Algorithm Design
• LINUX
• Statistics
•Basic Mathematics
• AI and Data Science
•Data Visualization
• Results Interpretation
Milestones of
Bioinformatics
29
Shall I learn Everything?
Next Step?
30
Read Papers and Reproduce
Results!
Compare
Modify
Explain
Seek Internships Options!
Real Life Problems!
Advice…
31
Perceive Biology as CS and
Perceive CS as Biology!
The Link!
No Need for a Supercomputer!
6/3/2020 Bioinformatics: What, Why and Where? 32
Where to Find a Job (Egypt and Abroad)?
6/3/2020 Bioinformatics: What, Why and Where? 33
Research
Academia
Companies
Startup
Freelancer
Salaries
6/3/2020 Bioinformatics: What, Why and Where? 34
Salaries
6/3/2020 Bioinformatics: What, Why and Where? 35
Salaries
6/3/2020 Bioinformatics: What, Why and Where? 36
Salaries
6/3/2020 Bioinformatics: What, Why and Where? 37
6/3/2020 Bioinformatics: What, Why and Where? 38
Institute/Company Department Sequencer
American University in Cairo (AUC) Biology Ion S5
American University in Cairo (AUC)
Global Health and Human
Ecology MiSeq
National Research Center (NRC) Genetics MiSeq
Zewail City of Science and Technology Center for Genomics
MiSeq and
NextSeq 500
Kasr Alainy School of Medicine Clinical Oncology 3 MiSeq
CCHE 57357 Genomics program
MiSeq and
NextSeq 500
Ahram Canadian University Central Research Lab
Agilent
Bioanalyzer 2100
National Research Center (NRC) Genetics Ion torrent
National Research Center (NRC) Environmental department Ion torrent PGM
MASRI ain shams University Center
Ion S5 and Ion
shef
Air forces specialised hospital Labs Miseq
Maadi military hospital Labs Ion S5
Mansoura University Stem cells center Ion torrent
National Cancer Institute (NCI) Molecular biology Ion S5
Abo Alraish Hospital Microbiology Labs MiSeq
Alexandria Regional Center for Women's Health
and Development Ion S5
Tanta University - Faculty of Medicine - Center of
Exellence Genomic Signature Center MiSeq
Magdi Yacoub Foundation
MiSeq and
NextSeq
Generations Genetics Labs MiSeq
Sequencers in Egypt
(Sample)
Source: Prof. Ahmed Moustafa, AUC.
What Bioinformatics Can Do for Life Sciences?
6/3/2020 Bioinformatics: What, Why and Where? 39
Genome Assembly
6/3/2020 Bioinformatics: What, Why and Where? 40
Gene Prediction
• Gene structure
• Open Reading Frames (ORFs).
• Start and stop of the gene
• Locations of exons and introns
• Splice variants
• Gene prediction is one of the first and
most important steps in understanding
any genome after being sequenced.
6/3/2020 Bioinformatics: What, Why and Where? 41
Sequence Comparison
• Compare unknown gene or protein
sequences against known sequences to
identify their origin or function.
• Finding Signatures that can be used in
diagnostics
6/3/2020 Bioinformatics: What, Why and Where? 42
Phylogenetic Analyses
• Evolutionary relationship among a
group of related molecules or
organisms
• Track gene flow based on sequence
similarity
6/3/2020 Bioinformatics: What, Why and Where? 43
Understand the Functions of Genes (Pathway
Analysis)
6/3/2020 Bioinformatics: What, Why and Where? 44
Predicting Protein Structure and Function
• Protein’s 3D structure Prediction
• Understand how biomolecules
interact with other molecules
• Predict functions based on
interactions
6/3/2020 Bioinformatics: What, Why and Where? 45
Drug Design
• It is faster to analyze molecules on
computer as compared to
experimental approaches.
• Helps in identifying drug
targets easily
• Simulating drug effects on computers
6/3/2020 Bioinformatics: What, Why and Where? 46
Applications of
Bioinformatics
6/3/2020 Bioinformatics: What, Why and Where? 47
Applications of
Bioinformatics in Medicine
• The Human Genome Project (HGP) helps scientists to
search for genes directly associated with diseases and
understand the molecular basis of those identified
diseases.
• This new Information will help in better understanding
of the mechanisms of diseases and hence develop
better treatment and preventive methods.
6/3/2020 Bioinformatics: What, Why and Where? 48
Applications of
Bioinformatics in Pharmacy
• Identification and validating new drugs through
Computer Aided Drug Design (CADD).
• Helps to develop specific drugs with less side
effect
6/3/2020 Bioinformatics: What, Why and Where? 49
Applications of Bioinformatics
in Food Security
• Large amount of genomics data is available from plants and
animals
• Bioinformatic analysis of plant and animal genomes will
help scientists to improve crops
• Resistant to drought
• Resistant to insects and pests
• More nutritional value
• Animals with higher meat quality and productivity
6/3/2020 Bioinformatics: What, Why and Where? 50
Applications of Bioinformatics
in the Environment
• Sequencing and analysis of microbial genomes and search
for genes expressing enzymes for
• Bioremediation and biodegradation
• Climate change studies (Microbes that use CO2 as their
sole source of enegy)
• Alternative energy sources (energy from light)
• Microbes with industrial benefits
• Generation of Biogas
6/3/2020 Bioinformatics: What, Why and Where? 51
Bioinformatics Tools…
https://www.omicsonline.org/articles-images/data-mining-genomics-Application-bioinformatics-tools-5-158-g001.png
6/3/2020 Bioinformatics: What, Why and Where? 52
Take Home Messages
• Understand the biological background first (in details)!
• For writing a software
• For using a software
• Which tool/software to use?
• Understand the algorithm behind each software/tool
• Test different parameters
• Select the best tool
• Free software are everywhere
• Read about benchmarking studies first
• Before Writing your own software
• Check if it is exist (don’t work from scratch)
• Modify existing tools
6/3/2020 Bioinformatics: What, Why and Where? 53
Biologists and Computer Scientitst Should
Communicate!
6/3/2020 Bioinformatics: What, Why and Where? 54
6/3/2020 Bioinformatics: What, Why and Where? 55
6/3/2020 Bioinformatics: What, Why and Where? 56
6/3/2020 Bioinformatics: What, Why and Where? 57
Thank You for Your Attention!
Open Discussion…
melhadidi@nu.edu.eg
hadidi.bioinfo@gmail.com
6/3/2020 Bioinformatics: What, Why and Where? 58

More Related Content

What's hot

origin, history.pptx
origin, history.pptxorigin, history.pptx
origin, history.pptx
sworna kumari chithiraivelu
 
TrEMBL
TrEMBLTrEMBL
Gene Expression Omnibus (GEO)
Gene Expression Omnibus (GEO)Gene Expression Omnibus (GEO)
Gene Expression Omnibus (GEO)
Thi K. Tran-Nguyen, PhD
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphism
Simon Silvan
 
Protein Databases
Protein DatabasesProtein Databases
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Somdutt Sharma
 
Dna sequencing ppt
Dna sequencing pptDna sequencing ppt
Dna sequencing ppt
Siddaraj Basavaraj
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
KAUSHAL SAHU
 
Cath
CathCath
Cath
Ramya S
 
OMIM- Online Mendelian Inheritance in Man
OMIM- Online Mendelian Inheritance in Man OMIM- Online Mendelian Inheritance in Man
Bioinformatics
BioinformaticsBioinformatics
Illumina Sequencing
Illumina SequencingIllumina Sequencing
Illumina Sequencing
USD Bioinformatics
 
Applications of genomics and proteomics ppt
Applications of genomics and  proteomics pptApplications of genomics and  proteomics ppt
Applications of genomics and proteomics ppt
Ibad khan
 
A Brief Introduction to Metabolomics
A Brief Introduction to Metabolomics A Brief Introduction to Metabolomics
A Brief Introduction to Metabolomics
Ranjith Raj V
 
Scop database
Scop databaseScop database
Scop database
Sayantani Roy
 
Composite and Specialized databases
Composite and Specialized databasesComposite and Specialized databases
An Introduction to Genomics
An Introduction to GenomicsAn Introduction to Genomics
An Introduction to Genomics
Dr NEETHU ASOKAN
 
Third Generation Sequencing
Third Generation Sequencing Third Generation Sequencing
Third Generation Sequencing
priyanka raviraj
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
Dhananjay Desai
 
Human genome project
Human genome projectHuman genome project
Human genome project
Dilip jaipal
 

What's hot (20)

origin, history.pptx
origin, history.pptxorigin, history.pptx
origin, history.pptx
 
TrEMBL
TrEMBLTrEMBL
TrEMBL
 
Gene Expression Omnibus (GEO)
Gene Expression Omnibus (GEO)Gene Expression Omnibus (GEO)
Gene Expression Omnibus (GEO)
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphism
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Dna sequencing ppt
Dna sequencing pptDna sequencing ppt
Dna sequencing ppt
 
Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Cath
CathCath
Cath
 
OMIM- Online Mendelian Inheritance in Man
OMIM- Online Mendelian Inheritance in Man OMIM- Online Mendelian Inheritance in Man
OMIM- Online Mendelian Inheritance in Man
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Illumina Sequencing
Illumina SequencingIllumina Sequencing
Illumina Sequencing
 
Applications of genomics and proteomics ppt
Applications of genomics and  proteomics pptApplications of genomics and  proteomics ppt
Applications of genomics and proteomics ppt
 
A Brief Introduction to Metabolomics
A Brief Introduction to Metabolomics A Brief Introduction to Metabolomics
A Brief Introduction to Metabolomics
 
Scop database
Scop databaseScop database
Scop database
 
Composite and Specialized databases
Composite and Specialized databasesComposite and Specialized databases
Composite and Specialized databases
 
An Introduction to Genomics
An Introduction to GenomicsAn Introduction to Genomics
An Introduction to Genomics
 
Third Generation Sequencing
Third Generation Sequencing Third Generation Sequencing
Third Generation Sequencing
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
 
Human genome project
Human genome projectHuman genome project
Human genome project
 

Similar to Bioinformatics: What, Why and Where?

Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
Elena Sügis
 
Sequencing Genomics: The New Big Data Driver
Sequencing Genomics:The New Big Data DriverSequencing Genomics:The New Big Data Driver
Sequencing Genomics: The New Big Data Driver
Larry Smarr
 
Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciences
Guy Coates
 
01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx
HussainTaqi1
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
c.titus.brown
 
2015 03 13_puurs_v_public
2015 03 13_puurs_v_public2015 03 13_puurs_v_public
2015 03 13_puurs_v_public
Prof. Wim Van Criekinge
 
Life sciences big data use cases
Life sciences big data use casesLife sciences big data use cases
Life sciences big data use cases
Guy Coates
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
SKUASTKashmir
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
c.titus.brown
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
c.titus.brown
 
2012 hpcuserforum talk
2012 hpcuserforum talk2012 hpcuserforum talk
2012 hpcuserforum talk
c.titus.brown
 
Centralized Model Organism Database (Biocuration 2014 poster)
Centralized Model Organism Database (Biocuration 2014 poster)Centralized Model Organism Database (Biocuration 2014 poster)
Centralized Model Organism Database (Biocuration 2014 poster)
Andrew Su
 
Bauhina Genome slides for school visit
Bauhina Genome slides for school visitBauhina Genome slides for school visit
Bauhina Genome slides for school visit
Scott Edmunds
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformatics
Prof. Wim Van Criekinge
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomics
mikaelhuss
 
1. Introduction to Genetic Engineering.pptx
1. Introduction to Genetic Engineering.pptx1. Introduction to Genetic Engineering.pptx
1. Introduction to Genetic Engineering.pptx
samuelmerga3
 
Bioinformatics relevance with biotechnology
Bioinformatics relevance with biotechnologyBioinformatics relevance with biotechnology
Bioinformatics relevance with biotechnology
KAUSHAL SAHU
 
B.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 databaseB.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 database
Rai University
 
Deep learning for biomedicine
Deep learning for biomedicineDeep learning for biomedicine
Deep learning for biomedicine
Deakin University
 

Similar to Bioinformatics: What, Why and Where? (20)

Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
 
Sequencing Genomics: The New Big Data Driver
Sequencing Genomics:The New Big Data DriverSequencing Genomics:The New Big Data Driver
Sequencing Genomics: The New Big Data Driver
 
Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciences
 
01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
2015 03 13_puurs_v_public
2015 03 13_puurs_v_public2015 03 13_puurs_v_public
2015 03 13_puurs_v_public
 
Life sciences big data use cases
Life sciences big data use casesLife sciences big data use cases
Life sciences big data use cases
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
2012 hpcuserforum talk
2012 hpcuserforum talk2012 hpcuserforum talk
2012 hpcuserforum talk
 
Centralized Model Organism Database (Biocuration 2014 poster)
Centralized Model Organism Database (Biocuration 2014 poster)Centralized Model Organism Database (Biocuration 2014 poster)
Centralized Model Organism Database (Biocuration 2014 poster)
 
Bauhina Genome slides for school visit
Bauhina Genome slides for school visitBauhina Genome slides for school visit
Bauhina Genome slides for school visit
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformatics
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomics
 
1. Introduction to Genetic Engineering.pptx
1. Introduction to Genetic Engineering.pptx1. Introduction to Genetic Engineering.pptx
1. Introduction to Genetic Engineering.pptx
 
Big data nebraska
Big data nebraskaBig data nebraska
Big data nebraska
 
Bioinformatics relevance with biotechnology
Bioinformatics relevance with biotechnologyBioinformatics relevance with biotechnology
Bioinformatics relevance with biotechnology
 
B.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 databaseB.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 database
 
Deep learning for biomedicine
Deep learning for biomedicineDeep learning for biomedicine
Deep learning for biomedicine
 

Recently uploaded

Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
fafyfskhan251kmf
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiologyBLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
NoelManyise1
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
frank0071
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Studia Poinsotiana
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 

Recently uploaded (20)

Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiologyBLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 

Bioinformatics: What, Why and Where?

  • 1. Bioinformatics: What, Why and Where? Mohamed El-Hadidi Assistant Professor of Bioinformatics Biomedical Informatics Program Director School of Information Technology and Computer Science Nile University
  • 2. Where DNA is Located in our Body? 6/3/2020 Bioinformatics: What, Why and Where? 2
  • 3. From Human Body to DNA Sequences DNA Sequencers Sequence Files How many cells in the Human Body? 10 Trillion Cells! 6/3/2020 Bioinformatics: What, Why and Where? 3
  • 4. From Human Body to DNA Sequences DNA Sequencers Sequence Files How many chromosomes in one cell? 46 Chromosomes! 6/3/2020 4Bioinformatics: What, Why and Where?
  • 5. From Human Body to DNA Sequences DNA Sequencers Sequence Files What is the length of all chromosomes in one cell? 2 m in one cell! 1500 times from Earth to moon (all cells) 6/3/2020 5Bioinformatics: What, Why and Where?
  • 6. From Human Body to DNA Sequences DNA Sequencers Sequence Files What are in these files? GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA 6/3/2020 6Bioinformatics: What, Why and Where?
  • 7. From Human Body to DNA Sequences DNA Sequencers Sequence Files What are in these files? GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA 6/3/2020 7Bioinformatics: What, Why and Where?
  • 8. From Human Body to DNA Sequences DNA Sequencers Sequence Files GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA How many nucleotides in the Human body? 3 Billion Nucleotides! 6/3/2020 8Bioinformatics: What, Why and Where?
  • 9. From Human Body to DNA Sequences DNA Sequencers Sequence Files What is the size of data? 150 GB/person 6/3/2020 9Bioinformatics: What, Why and Where?
  • 10. How These Files were Generated? 6/3/2020 Bioinformatics: What, Why and Where? 10
  • 11. How These Files were Generated? 6/3/2020 Bioinformatics: What, Why and Where? 11
  • 12. Bioinformatics Data is Increasing Rapidly! • Speed of sequencing?  10,000 bp/day/machine -> billions bp/day/machine. • Computing cost and time?  Sequencing cost is falling 5X faster than computing • Price / genome?  Dropped to $1000! • Storage cost?  150 GB/genome Bioinformatics: What, Why and Where? 12 How These Files were Generated?
  • 13. 6/3/2020 13 How These Files were Generated? Bioinformatics: What, Why and Where?
  • 14. 6/3/2020 Bioinformatics: What, Why and Where? 14 What to Do with These Files? Making sense of this BIG DATA!
  • 15. How to Make Sense of This BIG DATA? Through Bioinformatics! What is Bioinformatics??! 6/3/2020 Bioinformatics: What, Why and Where? 15
  • 16. What Do You Need to Learn Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 16 Statistics Computer Science Biology Bioinformatics Data Science Biostatistics Computational Biology
  • 19. What is Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 19 GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
  • 20. What is Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 20 GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA Use Existing tools to build analysis workflows • Linux • Command Line • Scripting Develop your own tools • Programming • Algorithm Design • Machine Learning
  • 21. What is Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 21 GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA Use Existing tools to build analysis workflows Develop your own tools • Linux • Command Line • Scripting • Programming • Algorithm Design • Machine Learning A = 1765 G = 3561 C = 2677 T = 1121
  • 22. What is Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 22 Use Existing tools to build analysis workflows Develop your own tools • Linux • Command Line • Scripting • Programming • Algorithm Design • Machine Learning
  • 23. Biologist (Biology Background) Use existing bioinformatics tools Computer Scientist (CS Background) Develops bioinformatics tools Basic User Windows OS Web-based Tools GUI Standalone tools No Programming skills Advanced User Linux OS Command line Standalone tools Basic Programming Skills Developer Basic Biology Knowledge Advanced Programming Skills Advanced Mathematics Advanced Statistics Who Can Be a Bioinformatician? 6/3/2020 Bioinformatics: What, Why and Where? 23
  • 24. How can I Learn Bioinformatics? Tons of free courses are available online! More than 26 million results when searching without comma! 6/3/2020 Bioinformatics: What, Why and Where? 24
  • 25. How can I Learn Bioinformatics? Tons of free courses are available online! More than 46 million results when searching without comma! 6/3/2020 Bioinformatics: What, Why and Where? 25
  • 26. Examples of Free Online Bioinformatics MOOCs Websites 6/3/2020 Bioinformatics: What, Why and Where? 26
  • 27. 6/3/2020 Bioinformatics: What, Why and Where? 27
  • 28. Milestones of Bioinformatics 28 • OMICS Sciences • Programming and Data Structure •Algorithm Design • LINUX • Statistics •Basic Mathematics • AI and Data Science •Data Visualization • Results Interpretation
  • 30. Next Step? 30 Read Papers and Reproduce Results! Compare Modify Explain Seek Internships Options! Real Life Problems!
  • 31. Advice… 31 Perceive Biology as CS and Perceive CS as Biology! The Link!
  • 32. No Need for a Supercomputer! 6/3/2020 Bioinformatics: What, Why and Where? 32
  • 33. Where to Find a Job (Egypt and Abroad)? 6/3/2020 Bioinformatics: What, Why and Where? 33 Research Academia Companies Startup Freelancer
  • 38. 6/3/2020 Bioinformatics: What, Why and Where? 38 Institute/Company Department Sequencer American University in Cairo (AUC) Biology Ion S5 American University in Cairo (AUC) Global Health and Human Ecology MiSeq National Research Center (NRC) Genetics MiSeq Zewail City of Science and Technology Center for Genomics MiSeq and NextSeq 500 Kasr Alainy School of Medicine Clinical Oncology 3 MiSeq CCHE 57357 Genomics program MiSeq and NextSeq 500 Ahram Canadian University Central Research Lab Agilent Bioanalyzer 2100 National Research Center (NRC) Genetics Ion torrent National Research Center (NRC) Environmental department Ion torrent PGM MASRI ain shams University Center Ion S5 and Ion shef Air forces specialised hospital Labs Miseq Maadi military hospital Labs Ion S5 Mansoura University Stem cells center Ion torrent National Cancer Institute (NCI) Molecular biology Ion S5 Abo Alraish Hospital Microbiology Labs MiSeq Alexandria Regional Center for Women's Health and Development Ion S5 Tanta University - Faculty of Medicine - Center of Exellence Genomic Signature Center MiSeq Magdi Yacoub Foundation MiSeq and NextSeq Generations Genetics Labs MiSeq Sequencers in Egypt (Sample) Source: Prof. Ahmed Moustafa, AUC.
  • 39. What Bioinformatics Can Do for Life Sciences? 6/3/2020 Bioinformatics: What, Why and Where? 39
  • 40. Genome Assembly 6/3/2020 Bioinformatics: What, Why and Where? 40
  • 41. Gene Prediction • Gene structure • Open Reading Frames (ORFs). • Start and stop of the gene • Locations of exons and introns • Splice variants • Gene prediction is one of the first and most important steps in understanding any genome after being sequenced. 6/3/2020 Bioinformatics: What, Why and Where? 41
  • 42. Sequence Comparison • Compare unknown gene or protein sequences against known sequences to identify their origin or function. • Finding Signatures that can be used in diagnostics 6/3/2020 Bioinformatics: What, Why and Where? 42
  • 43. Phylogenetic Analyses • Evolutionary relationship among a group of related molecules or organisms • Track gene flow based on sequence similarity 6/3/2020 Bioinformatics: What, Why and Where? 43
  • 44. Understand the Functions of Genes (Pathway Analysis) 6/3/2020 Bioinformatics: What, Why and Where? 44
  • 45. Predicting Protein Structure and Function • Protein’s 3D structure Prediction • Understand how biomolecules interact with other molecules • Predict functions based on interactions 6/3/2020 Bioinformatics: What, Why and Where? 45
  • 46. Drug Design • It is faster to analyze molecules on computer as compared to experimental approaches. • Helps in identifying drug targets easily • Simulating drug effects on computers 6/3/2020 Bioinformatics: What, Why and Where? 46
  • 48. Applications of Bioinformatics in Medicine • The Human Genome Project (HGP) helps scientists to search for genes directly associated with diseases and understand the molecular basis of those identified diseases. • This new Information will help in better understanding of the mechanisms of diseases and hence develop better treatment and preventive methods. 6/3/2020 Bioinformatics: What, Why and Where? 48
  • 49. Applications of Bioinformatics in Pharmacy • Identification and validating new drugs through Computer Aided Drug Design (CADD). • Helps to develop specific drugs with less side effect 6/3/2020 Bioinformatics: What, Why and Where? 49
  • 50. Applications of Bioinformatics in Food Security • Large amount of genomics data is available from plants and animals • Bioinformatic analysis of plant and animal genomes will help scientists to improve crops • Resistant to drought • Resistant to insects and pests • More nutritional value • Animals with higher meat quality and productivity 6/3/2020 Bioinformatics: What, Why and Where? 50
  • 51. Applications of Bioinformatics in the Environment • Sequencing and analysis of microbial genomes and search for genes expressing enzymes for • Bioremediation and biodegradation • Climate change studies (Microbes that use CO2 as their sole source of enegy) • Alternative energy sources (energy from light) • Microbes with industrial benefits • Generation of Biogas 6/3/2020 Bioinformatics: What, Why and Where? 51
  • 53. Take Home Messages • Understand the biological background first (in details)! • For writing a software • For using a software • Which tool/software to use? • Understand the algorithm behind each software/tool • Test different parameters • Select the best tool • Free software are everywhere • Read about benchmarking studies first • Before Writing your own software • Check if it is exist (don’t work from scratch) • Modify existing tools 6/3/2020 Bioinformatics: What, Why and Where? 53
  • 54. Biologists and Computer Scientitst Should Communicate! 6/3/2020 Bioinformatics: What, Why and Where? 54
  • 55. 6/3/2020 Bioinformatics: What, Why and Where? 55
  • 56. 6/3/2020 Bioinformatics: What, Why and Where? 56
  • 57. 6/3/2020 Bioinformatics: What, Why and Where? 57 Thank You for Your Attention!

Editor's Notes

  1. Each letter of letter (A,G,C or T) are called nucleotide