Human ENCODE Project
A consortium of 440 scientists, 32 laboratories
Sucheta Tripathy, IICB, 17th Sept. 2012
   •  http://www.nature.com/encode/
   •  http://www.encodeproject.org/ENCODE/
   •  http://www.factorbook.org/
   •  http://encodeproject.org/ENCODE/dataStandards.html
   •  http://1000genomes.org
   •  http://genome.ucsc.edu/ENCODE/
   •  http://www.gencodegenes.org/data.html
Characterization of intergenic region and gene definition

http://homes.gersteinlab.org/people/rar62/subwaymap/SubwayMap8_16_12.pdf
http://homes.gersteinlab.org/people/rar62/subwaymap/SubwayMap
   •  October 1990: Human Genome Project started
   •  2000: first publication
   •  2003: finished paper
   •  NHGRI solicited pilot proposals for ENCODE
   •  2005: GWAS (90% of associations lie outside coding regions)
   •  RFAs were sought for full ENCODE
   •  2007: first report on ENCODE published
   •  2012: ENCODE published
http://www.nature.com/nature/journal/v489/n7414/full/489049a.html
Treasure Hunt?




"It is like Google Maps," says Eric Lander: a map of the Earth from outer space.
   •  95% of the genome is “junk”.
    ◦ 2.94% of the genome is coding.
   •  cis-regulatory elements occur within a limited genomic distance.
   •  Most of the genome consists of transposable elements of obscure origin that are decaying.
   •  Transcribed elements are more often translated than not.
   •  80% of the human genome is active!
    ◦ 70,000 promoters and 400,000 enhancers
   •  75% of the genome is transcribed in some tissue or other during a lifetime.
   •  The environment plays a great role in switching many genes on or off (epigenetics).
   •  Most diseases lie not with the genes but with the switches!
   •  The “dark matter” controlling the genes is physically close to the genes it controls.
   •  Genes and switches do not have a one-to-one relationship!
   •  4 million switches control 21,000 genes!
   •  Identical twins are NOT identical; they are greatly influenced by their environments.
   •  Astronomy and genetics look similar (about 95% of the universe is dark matter and dark energy, which we do not understand).
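The lack of a one-to-one relationship between switches and genes follows from simple arithmetic on the two figures quoted above. A minimal sketch, using only the slide's own numbers (the function name is illustrative, and an average says nothing about how switches are actually distributed across genes):

```python
# Figures as quoted on the slide; they are not re-derived here.
SWITCHES = 4_000_000
GENES = 21_000


def switches_per_gene(switches: int, genes: int) -> float:
    """Average number of regulatory switches per gene."""
    return switches / genes


ratio = switches_per_gene(SWITCHES, GENES)
print(f"about {ratio:.0f} switches per gene on average")
```

On average, then, each gene would be associated with well over a hundred switches.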
   •  “This explains why 6.5 billion people on Earth don't look alike.”
   •  Intelligent Design (Creationism) believers are excited that it is the handiwork of God.
   •  Natural selectionists (Darwinists) are excited that it is natural selection at its best.
    ◦ As usual, this has sparked a war between Democrats and Republicans.
   •  Junk DNA is an “oxymoron”.
   •  Some are still wondering about the remaining 20%.
   •  “I hope this information stirs the mind of those researchers that have ignored "trace minerals" in food as part of the nutritional package.”
   •  The more we think we are close to finding an answer, the farther from it we find ourselves. It reminds me of Aristotle, who once said, “The more you know, the more you know you don't know.”
   •  Most of the DNA was considered “garbage” but later upgraded to “junk”.
   •  Most people are actually happy because it is happening during their “lifetime”.
   •  Switches are software and genes are hardware.
   •  Ancient Egyptians considered the “torso” to have a divine role and discarded the grey matter in the head as “junk”.
   •  Sean Eddy: “At least 40% of the human genome is composed of the decaying DNA remains of transposable elements (TEs), different species of which have replicated in great waves during the evolution of our genome.”
   •  “I sure wish I'd gotten the memo, because this week a collaboration of labs led by myself, Arian Smit, and Jerzy Jurka just released a new data resource that annotates nearly 50% of the human genome as transposable element-derived, and transposon-derived repetitive sequence is the poster child for what we colloquially call “junk DNA”.”
   •  http://cryptogenomicon.org/
PLoS Biol. 2011 April; 9(4): e1001046.
The Cell Types

Cell Type          Tier   Description                                                       Source
GM12878            1      B-lymphoblastoid cell line                                        Coriell GM12878
K562               1      Chronic myelogenous/erythroleukemia cell line                     ATCC CCL-243
H1-hESC            1      Human embryonic stem cells, line H1                               Cellular Dynamics International
HepG2              2      Hepatoblastoma cell line                                          ATCC HB-8065
HeLa-S3            2      Cervical carcinoma cell line                                      ATCC CCL-2.2
HUVEC              2      Human umbilical vein endothelial cells                            Lonza CC-2517
Various (Tier 3)   3      Various cell lines, cultured primary cells, and primary tissues   Various

PLoS Biol. 2011 April; 9(4): e1001046.
   •  DNase I -> transcription factor binding sites (2.9 million sites, one third in a single cell type and the remainder in others)
   •  ChIP-seq -> sequencing of transcription factor and histone binding sites (HeLa and GM12878 – qualified to be called new species)
   •  5C technology -> finding proximity between regulatory and regulated regions
   •  High-density 5 bp tiling DNA microarrays
   •  Cap Analysis of Gene Expression (CAGE)
   •  Paired-End diTag (PET)
   •  Reduced Representation Bisulphite Sequencing (RRBS)
   33.45% exon and 66.55% intron.
   62% of the genome is transcribed
    reproducibly.
   231 MB of genome has protein binding sites.
    ◦ 80% of which are low affinity sites
      (http://www.factorbook.org/)
    ◦ Many are highly conserved cell selective type
   96% of the CpG exhibited differential
    methylation pattern.
   GWAS SNPs had overlaps with ENCODE
    elements.
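To put the 231 Mb figure in proportion, it can be expressed as a fraction of the whole genome. A rough sketch, assuming a genome length of about 3.1 billion bp (this denominator is an assumption and does not come from the slides, so the resulting percentage is only indicative):

```python
# 231 Mb is the figure quoted on the slide.
BINDING_SITE_BP = 231_000_000
# ASSUMPTION: approximate human genome length; not stated in the slides.
ASSUMED_GENOME_BP = 3_100_000_000


def fraction_of_genome(region_bp: int, genome_bp: int) -> float:
    """Fraction of the genome covered by a region of the given size."""
    return region_bp / genome_bp


binding_fraction = fraction_of_genome(BINDING_SITE_BP, ASSUMED_GENOME_BP)
print(f"{binding_fraction:.1%} of the assumed genome length")
```

A different choice of denominator (for example, only the uniquely mappable portion of the genome) would shift the percentage somewhat.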
   •  Chromosome conformation capture carbon copy (5C)
    ◦ 1% of the genome is distally regulated (>1000 bp)
    ◦ On average, 3.9 distal elements interacted with each TSS.
    ◦ Distances could range from several kb to Mb
   •  cis-regulatory elements: enhancers, promoters, insulators, silencers.
   •  2.9 million DNase I hypersensitive sites (DHS) encompassing 125 diverse cell and tissue types.
   •  DHS of 20-50 bp length mapped uniquely to 86.9% of the genome
    ◦   580,000 distal DHS with target promoters
    ◦   3% lie in TSS
    ◦   5% lie within 2.5 kb of TSS
    ◦   95% lie distally (introns and intergenic regions)
    ◦   Strongly enriched in LTRs
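The three location classes above (in a TSS, within 2.5 kb of a TSS, distal) can be sketched as a distance rule. This is an illustrative simplification, not the ENCODE pipeline: it treats each TSS and each DHS as a single coordinate on one chromosome, counts only an exact match as "in TSS", and all coordinates and function names are hypothetical.

```python
from bisect import bisect_left

PROXIMAL_WINDOW_BP = 2_500  # "within 2.5 kb of TSS", from the slide


def distance_to_nearest_tss(position: int, sorted_tss: list[int]) -> int:
    """Distance in bp from position to the closest TSS (list must be sorted, non-empty)."""
    i = bisect_left(sorted_tss, position)
    # Only the neighbours on either side of the insertion point can be closest.
    candidates = sorted_tss[max(i - 1, 0): i + 1]
    return min(abs(position - tss) for tss in candidates)


def classify_dhs(position: int, sorted_tss: list[int]) -> str:
    """Label a DHS midpoint as 'TSS', 'proximal' or 'distal'."""
    distance = distance_to_nearest_tss(position, sorted_tss)
    if distance == 0:
        return "TSS"
    if distance <= PROXIMAL_WINDOW_BP:
        return "proximal"
    return "distal"


# Hypothetical coordinates, for illustration only.
tss_positions = sorted([10_000, 50_000, 120_000])
dhs_midpoints = [10_000, 11_200, 30_000, 119_000, 200_000]
labels = [classify_dhs(p, tss_positions) for p in dhs_midpoints]
print(labels)
```

Sorting the TSS list once and using binary search keeps each lookup logarithmic, which matters when classifying millions of sites.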
   •  Three quarters of the genome is capable of being transcribed – redefine the concept of a gene?
    ◦ 62.1% and 74.7% are covered by processed and primary transcripts, respectively.
    ◦ 10-12 expressed isoforms per gene per cell.
    ◦ Coding and non-coding transcripts are localized in the cytoplasm and nucleus, respectively.
    ◦ 6% of the coding and non-coding transcripts overlap with small RNAs – precursors?
    ◦ Most of the novel transcripts lacked protein-coding ability.
   •  The mapping job is only half done.
   •  Characterizing everything a genome does is 10% done.
   •  Finding the network of switches for genes.
   •  A number of correlations…
   •  Where does gene therapy go from here?
   •  Is our fundamental understanding of genes as the functional units flawed?
   •  Epigenetics becomes the key player…
   •  Gives impetus to a holistic approach to treating disease.
   •  Do we still believe that the human genome is the most efficient?
