SlideShare a Scribd company logo
1 of 35
Topological Associated
Domains identification
using Hi-C
Speaker : Djekidel Mohamed Nadhir
Date : 03/03/2014
Outline
Background
β€’ Despite revealing the sequence of the genome, little is known about its 3D structure
β€’ high-throughput chromosome capture (Hi-C) is 3C-based technology
β€’ it can detect chromatin interactions between loci across the entire genome
Biological experiment:
Ming, H., et al. (2013). "Understanding spatial organizations of chromosomes via statistical analysis of Hi-C data." Quantitative Biology 1.
Background
β€’ Hi-C in the chromatin conformation study map
Smallwood, A. and B. Ren (2013). "Genome organization and long-range regulation of gene expression by enhancers." Current opinion in cell biology 25(3):
387-394.
Background- Processing pipeline
β€’ 4 main steps:
β€’ Read mapping : Each side (50 bp) is mapped independently to the reference genome
β€’ Read level filtering
β€’ Fragment filtering : Filter fragments with low mappability score
β€’ Creation of the Hi-C contact matrix
Ming, H., et al. (2013). "Understanding spatial organizations of chromosomes via statistical analysis of Hi-C data." Quantitative Biology 1.
Background- Processing pipeline
β€’ Read filtering step : The flowing types of reads should be removed :
β€’ Self-ligation reads:
β€’ Dangling reads : un-ligated reads
β€’ PCR amplification reads: many reads that map to the same location
β€’ Random breaking reads : reads located far from the enzyme cutting site (𝑑1 + 𝑑2 > 500𝑏𝑝 )
Background- Processing pipeline
β€’ Fragment filtering step : Remove fragments with low mappability score (< 0.5)
β€’ fragment near centromere or telomere regions tends to contain a large proportion of repetitive sequence and
leads to a low mappability score
β€’ Additional suggestions :
β€’ Remove fragments with <100bp or > 100 kb
β€’ Remove 0.5% of the fragments with the highest number of reads (can be source of PCR artifacts)
Background
β€’ Construction of the Hi-C interaction matrix:
β€’ The number of Enzyme cut-site is 1012
, however a typical Hi-C experiment generate 108
reads
β€’ Thus, we need to partition the genome into large scale bins.
Processing pipeline:
Hi-C vs FISH
Discussed paper
β€’ Aim :
β€’ Investigate the 3D dimensional organization of the human and mouse genome in ES
and differentiated cell.
β€’ Data :
β€’ Mouse :
β€’ Mouse embryonic stem cell (mESC)
β€’ Cortex cell (generated by another group)
β€’ Human :
β€’ Human embryonic stem cell (hESC)
β€’ IMR90
Data control (1)
β€’ Remove cut site bias
Raw data Normalized data
Data control (2)
Compare 5C generated data for the HoxA
locus (correlation > 0.73)
Compare with Phc1 locus 3C data
Compare with FISH data of 6 loci
Data control (3)
Pearson Correlation between replicates
Visualization of interactions
We can notice aTopological Associated Domain (TAD) structure at bins < 100kb
Identification of topological domains
Step1: Detection of the interaction bias
We notice that in aTAD that :
β€’ The upstream portion is highly biased to interact
downstream
β€’ The downstream portion is highly biased to interact
upstream
a directionality index (ID) was defined to calculate this bias:
β€’ 𝐷𝐼 > 0 οƒ  Upstream bias
β€’ 𝐷𝐼 < 0 οƒ  Downstream bias
β€’ 𝐷𝐼 the extent of the interaction
DI calculation
Steps:
β€’ The genome was split into bins of length 40 kb
β€’ Let :
β€’ A: # of reads that map in the 2M upstream of the bin
β€’ B: # of reads that map in the 2M downstream of the bin
β€’ E: expected number of reads 𝐄 =
𝑨+𝑩
𝟐
β€’ Then :
β€’ 𝐷𝐼 =
π΅βˆ’π΄
π΅βˆ’π΄
π΄βˆ’πΈ 2
𝐸
+
π΅βˆ’πΈ 2
𝐸
-2Mb +2Mb40kb
A B
Domain detection (1)
β€’ Each bin can have 3 states :
β€’ Upstream biased
β€’ Downstream biased
β€’ No bias
β€’ Use a HMM based on the DI to infer the biased state
β€’ We define :
β€’ 𝒀 = [𝒀 𝟏, 𝒀 𝟐, … , 𝒀 𝒏] :The observed DI
β€’ 𝑸 = [𝑸 𝟏, 𝑸 𝟐, … , 𝑸 𝒏] :The hidden bias 𝑄𝑖 ∈ {𝐷, π‘ˆ, 𝑁}
β€’ 𝑴 = 𝑴 𝟏, 𝑴 𝟐, … , 𝑴 π’Ž : π‘š ∈ [1,20]
β€’ The probabilities are calculated as follow:
β€’ 𝑷 𝒀𝒕 𝑸 𝒕 = π’Š, 𝑴𝒕 ) = 𝓝 𝐘𝐭; ππ’Šπ’Ž, πšΊπ’Šπ’Ž
β€’ 𝑷 𝑴𝒕 = π’Ž 𝑸 𝒕 = π’Š) = π‘ͺ(π’Š, π’Ž)
β€’ π‘ͺ(π’Š, π’Ž): the mixture weight
D D D D U U U N N N D D D U U
Domain Boundary Domain
` ` `
𝑀1 𝑀2
𝑀3
𝑸 𝒕
π’šπ’•
𝑴 𝒕
𝑸 𝒕+𝟏
π’šπ’•+𝟏
𝑴𝒕+𝟏
D
U
N
Domain detection (1)
β€’ The region between twoTAD is termed :
β€’ Topological boundary : if size < 400kb
β€’ Unrecognized chromatin : if size β‰₯ 400 kb
What separates twoTADs
β€’ Studied the HoxA locus known to be separated into two compartments
β€’ Found that the CS5 insulator resides in the boundary
β€’ Maybe insulators are enriched at the boundary ?
CTFC role in the boundary
β€’ Studied other known insulator CTCF
Heterochromatin and boundary
β€’ the H3K9me3 profile changed between cells hESC and IMR90 but the boundaries structure didn’t change
β€’ potential link between the topological domains and transcriptional control in the mammalian genome
Characteristics ofTAD
β€’ TAD are stable between cell lines
hESC
IMR90
Characteristics ofTAD
β€’ TAD are conserved between species
Cell type specific interactions
β€’ A binomial test is performed for each 20kb bin to determine is it is cell specific
β€’ Calculate 𝒏 = 𝑰 π’Žπ‘¬π‘Ίπ‘ͺ + 𝑰 𝒄𝒐𝒓𝒕𝒆𝒙 , the number of possible interactions at a distance 𝒅
β€’ Calculate the expected value 𝒑 =
𝑰 π’Žπ‘¬π‘Ίπ‘ͺ
𝒏
or 𝒑 =
𝑰 𝒄𝒐𝒓𝒕𝒆𝒙
𝒏
β€’ Then for each bin do a binomial-test to see if there is a deviation in the number cell specific
interactions
d d d d
𝒏 = πŸ‘ + 𝟐 + 𝟏 + 𝟏 + 𝟐 + 𝟏 + πŸ’ + 𝟏 = πŸπŸ“
mESC
Cortex
𝒑 =
πŸ•
πŸπŸ“
or 𝒑 =
πŸ–
𝟏𝟐
Cell type specific interactions
β€’ 20% of the genes that have a FCβ‰₯ 4 are found in dynamic interacting loci.
β€’ > 96% of the dynamic interactions occur in the same domain.
β€’ Model :
β€’ domain organization is stable between cell types
β€’ but the regions within each domain may be dynamic,
Factors forming the boundary (1)
β€’ Boundaries are enriched for active promoter signals and gene bodies
Factors forming the boundary (2)
TAD vs A/B compartments (1)
β€’ Loci found clustered in A compartments
are generally:
β€’ gene rich,
β€’ transcriptionally active,
β€’ and DNase I hypersensitive,
Lieberman-Aiden, E., et al. (2009), Science (New York, N.Y.) 326(5950): 289-293.
Compartment B
CompartmentA
β€’ Loci found clustered in B compartments
are generally:
β€’ gene poor,
β€’ transcriptionally silent
β€’ and DNase I insensitive
At a higher order the chromatin is organized into A and B compartments
TAD vs A/B compartments (2)
TAD are smaller than A/B compartments
TAD vs A/B compartments (3)
In summary :
Gibcus, J. and J. Dekker (2013). "The hierarchy of the 3D genome." Molecular cell 49(5): 773-782.
TAD vs A/B compartments (4)
In summary :
Gibcus, J. and J. Dekker (2013). "The hierarchy of the 3D genome." Molecular cell 49(5): 773-782.
TAD vs Lamina associated domains (LAD) (1)
TAD vs Lamina associated domains (LAD) (2)
Nora, E., et al. (2013). BioEssays : news and reviews in molecular, cellular and developmental biology 35(9): 818-828.
TAD vs LOCKs
β€’ LOCK: Large Organized Chromatin K9-modifications
β€’ Conserved regions exhibiting large H3K9Me2 difference between cell lines
Summary
β€’ The mammalian genome is segmented into a megabase-scale domains
β€’ Domain boundaries are stable between cell lines and species , suggesting that
they are a basic property of the chromosome architecture.
β€’ Domain boundaries are enricher for :
β€’ Transcriptionally active genes
β€’ Coincide with heterochromatin boundaries
β€’ Enriched with insulator proteins
β€’ Enriched with tRNA, SINE and housekeeping genes
β€’ Developed many data-analysis approaches
Summary
β€’ The mammalian genome is segmented into a megabase-scale domains
β€’ Domain boundaries are stable between cell lines and species , suggesting that
they are a basic property of the chromosome architecture.
β€’ Domain boundaries are enricher for :
β€’ Transcriptionally active genes
β€’ Coincide with heterochromatin boundaries
β€’ Enriched with insulator proteins
β€’ Enriched with tRNA, SINE and housekeeping genes
β€’ Developed many data-analysis approaches

More Related Content

What's hot

Linkage and crossing over
Linkage and crossing overLinkage and crossing over
Linkage and crossing oversurendran aduthila
Β 
Modern cytogenetic tools in crop improvement
Modern cytogenetic tools in crop improvementModern cytogenetic tools in crop improvement
Modern cytogenetic tools in crop improvementShreyas A
Β 
Mapping and QTL
Mapping and QTLMapping and QTL
Mapping and QTLFAO
Β 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)AnkitTiwari354
Β 
The Smith Waterman algorithm
The Smith Waterman algorithmThe Smith Waterman algorithm
The Smith Waterman algorithmavrilcoghlan
Β 
Qtl analysis and its mapping
Qtl analysis and its mappingQtl analysis and its mapping
Qtl analysis and its mappingVikas Verma
Β 
Phylogenetic analysis
Phylogenetic analysis Phylogenetic analysis
Phylogenetic analysis Nitin Naik
Β 
Multiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Multiple Alignment Sequence using Clustal Omega/ Shumaila RiazMultiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Multiple Alignment Sequence using Clustal Omega/ Shumaila RiazShumailaRiaz6
Β 
Phylogenetic tree and its construction and phylogeny of
Phylogenetic tree and its construction and phylogeny ofPhylogenetic tree and its construction and phylogeny of
Phylogenetic tree and its construction and phylogeny ofbhavnesthakur
Β 
transcriptional gene silencing
transcriptional gene silencingtranscriptional gene silencing
transcriptional gene silencingSheetal Mehla
Β 
Linkage mapping
Linkage mappingLinkage mapping
Linkage mappingSnehaSahu20
Β 
Epigenetic regulation in higher plants
Epigenetic regulation in higher plantsEpigenetic regulation in higher plants
Epigenetic regulation in higher plantsFOODCROPS
Β 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignmentKubuldinho
Β 
TILLING & ECO-TILLING
TILLING & ECO-TILLINGTILLING & ECO-TILLING
TILLING & ECO-TILLINGRachana Bagudam
Β 
Gene Mapping Methods:Linkage Maps & Mapping with Molecular Markers
Gene  Mapping  Methods:Linkage Maps & Mapping with Molecular MarkersGene  Mapping  Methods:Linkage Maps & Mapping with Molecular Markers
Gene Mapping Methods:Linkage Maps & Mapping with Molecular MarkersAssam University, Silchar, Assam, India
Β 
Genomic in-situ hybridization (GISH)
Genomic in-situ hybridization (GISH)Genomic in-situ hybridization (GISH)
Genomic in-situ hybridization (GISH)areebafatima20
Β 

What's hot (20)

Linkage and crossing over
Linkage and crossing overLinkage and crossing over
Linkage and crossing over
Β 
Modern cytogenetic tools in crop improvement
Modern cytogenetic tools in crop improvementModern cytogenetic tools in crop improvement
Modern cytogenetic tools in crop improvement
Β 
Mapping and QTL
Mapping and QTLMapping and QTL
Mapping and QTL
Β 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
Β 
The Smith Waterman algorithm
The Smith Waterman algorithmThe Smith Waterman algorithm
The Smith Waterman algorithm
Β 
Tree building 2
Tree building 2Tree building 2
Tree building 2
Β 
Molecular phylogenetics
Molecular phylogeneticsMolecular phylogenetics
Molecular phylogenetics
Β 
Qtl analysis and its mapping
Qtl analysis and its mappingQtl analysis and its mapping
Qtl analysis and its mapping
Β 
Phylogenetic analysis
Phylogenetic analysis Phylogenetic analysis
Phylogenetic analysis
Β 
Multiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Multiple Alignment Sequence using Clustal Omega/ Shumaila RiazMultiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Multiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Β 
NGS: Mapping and de novo assembly
NGS: Mapping and de novo assemblyNGS: Mapping and de novo assembly
NGS: Mapping and de novo assembly
Β 
Phylogenetic tree and its construction and phylogeny of
Phylogenetic tree and its construction and phylogeny ofPhylogenetic tree and its construction and phylogeny of
Phylogenetic tree and its construction and phylogeny of
Β 
Homology
HomologyHomology
Homology
Β 
transcriptional gene silencing
transcriptional gene silencingtranscriptional gene silencing
transcriptional gene silencing
Β 
Linkage mapping
Linkage mappingLinkage mapping
Linkage mapping
Β 
Epigenetic regulation in higher plants
Epigenetic regulation in higher plantsEpigenetic regulation in higher plants
Epigenetic regulation in higher plants
Β 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
Β 
TILLING & ECO-TILLING
TILLING & ECO-TILLINGTILLING & ECO-TILLING
TILLING & ECO-TILLING
Β 
Gene Mapping Methods:Linkage Maps & Mapping with Molecular Markers
Gene  Mapping  Methods:Linkage Maps & Mapping with Molecular MarkersGene  Mapping  Methods:Linkage Maps & Mapping with Molecular Markers
Gene Mapping Methods:Linkage Maps & Mapping with Molecular Markers
Β 
Genomic in-situ hybridization (GISH)
Genomic in-situ hybridization (GISH)Genomic in-situ hybridization (GISH)
Genomic in-situ hybridization (GISH)
Β 

Viewers also liked

BPIPE: a bioinformatics pipeline framework
BPIPE: a bioinformatics pipeline frameworkBPIPE: a bioinformatics pipeline framework
BPIPE: a bioinformatics pipeline frameworkMohamed Nadhir Djekidel
Β 
Topological data analysis
Topological data analysisTopological data analysis
Topological data analysisSunghyon Kyeong
Β 
Imbim igp 20151127-amb.part1b
Imbim igp 20151127-amb.part1bImbim igp 20151127-amb.part1b
Imbim igp 20151127-amb.part1bbarrioam
Β 
Kulakova sbb2014
Kulakova sbb2014Kulakova sbb2014
Kulakova sbb2014Ek_Kul
Β 
Recomb2016poster
Recomb2016posterRecomb2016poster
Recomb2016posterLong Pei
Β 
Helsinki genome project-20151210-amb
Helsinki genome project-20151210-ambHelsinki genome project-20151210-amb
Helsinki genome project-20151210-ambbarrioam
Β 
Different pcr techniques and their application
Different pcr techniques and their applicationDifferent pcr techniques and their application
Different pcr techniques and their applicationsaurabh Pandey.Saurabh784
Β 

Viewers also liked (7)

BPIPE: a bioinformatics pipeline framework
BPIPE: a bioinformatics pipeline frameworkBPIPE: a bioinformatics pipeline framework
BPIPE: a bioinformatics pipeline framework
Β 
Topological data analysis
Topological data analysisTopological data analysis
Topological data analysis
Β 
Imbim igp 20151127-amb.part1b
Imbim igp 20151127-amb.part1bImbim igp 20151127-amb.part1b
Imbim igp 20151127-amb.part1b
Β 
Kulakova sbb2014
Kulakova sbb2014Kulakova sbb2014
Kulakova sbb2014
Β 
Recomb2016poster
Recomb2016posterRecomb2016poster
Recomb2016poster
Β 
Helsinki genome project-20151210-amb
Helsinki genome project-20151210-ambHelsinki genome project-20151210-amb
Helsinki genome project-20151210-amb
Β 
Different pcr techniques and their application
Different pcr techniques and their applicationDifferent pcr techniques and their application
Different pcr techniques and their application
Β 

Similar to Topological associated domains- Hi-C

DNA cell cycle by flow cytometry
DNA cell cycle by flow cytometryDNA cell cycle by flow cytometry
DNA cell cycle by flow cytometryRichard Hastings
Β 
Telomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomesTelomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomesGenome Reference Consortium
Β 
Predictive Features of TCR Repertoire
Predictive Features of TCR RepertoirePredictive Features of TCR Repertoire
Predictive Features of TCR RepertoireThi K. Tran-Nguyen, PhD
Β 
Architecture and evolution of neochromosomes
Architecture and evolution of neochromosomesArchitecture and evolution of neochromosomes
Architecture and evolution of neochromosomesAnthony Papenfuss
Β 
40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?Adam Phillippy
Β 
Scalable Genome Analysis With ADAM
Scalable Genome Analysis With ADAMScalable Genome Analysis With ADAM
Scalable Genome Analysis With ADAMfnothaft
Β 
2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorialc.titus.brown
Β 
sequencing of genome
sequencing of genomesequencing of genome
sequencing of genomeNaveen Gupta
Β 
2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotesc.titus.brown
Β 
3UnitGeneMapping.pptx
3UnitGeneMapping.pptx3UnitGeneMapping.pptx
3UnitGeneMapping.pptxGounderKirthika2
Β 
Sequence Tagged Sites pdf.pdf
Sequence Tagged Sites pdf.pdfSequence Tagged Sites pdf.pdf
Sequence Tagged Sites pdf.pdfNetHelix
Β 
London Calling 2019: Karen Miga
London Calling 2019: Karen MigaLondon Calling 2019: Karen Miga
London Calling 2019: Karen MigaKaren Hayden Miga
Β 
AlgoAlignementGenomicSequences.ppt
AlgoAlignementGenomicSequences.pptAlgoAlignementGenomicSequences.ppt
AlgoAlignementGenomicSequences.pptSkanderBena
Β 
Telomere-to-telomere assembly of a complete human X chromosome
Telomere-to-telomere assembly of a complete human X chromosomeTelomere-to-telomere assembly of a complete human X chromosome
Telomere-to-telomere assembly of a complete human X chromosomeAdam Phillippy
Β 
Seminar Slides
Seminar SlidesSeminar Slides
Seminar Slidespannicle
Β 

Similar to Topological associated domains- Hi-C (20)

DNA cell cycle by flow cytometry
DNA cell cycle by flow cytometryDNA cell cycle by flow cytometry
DNA cell cycle by flow cytometry
Β 
Telomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomesTelomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomes
Β 
Predictive Features of TCR Repertoire
Predictive Features of TCR RepertoirePredictive Features of TCR Repertoire
Predictive Features of TCR Repertoire
Β 
Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014
Β 
Architecture and evolution of neochromosomes
Architecture and evolution of neochromosomesArchitecture and evolution of neochromosomes
Architecture and evolution of neochromosomes
Β 
40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?
Β 
Scalable Genome Analysis With ADAM
Scalable Genome Analysis With ADAMScalable Genome Analysis With ADAM
Scalable Genome Analysis With ADAM
Β 
2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial
Β 
sequencing of genome
sequencing of genomesequencing of genome
sequencing of genome
Β 
Basics of Genome Assembly
Basics of Genome Assembly Basics of Genome Assembly
Basics of Genome Assembly
Β 
2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes
Β 
Alignment Approaches II: Long Reads
Alignment Approaches II: Long ReadsAlignment Approaches II: Long Reads
Alignment Approaches II: Long Reads
Β 
3UnitGeneMapping.pptx
3UnitGeneMapping.pptx3UnitGeneMapping.pptx
3UnitGeneMapping.pptx
Β 
Sequence Tagged Sites pdf.pdf
Sequence Tagged Sites pdf.pdfSequence Tagged Sites pdf.pdf
Sequence Tagged Sites pdf.pdf
Β 
Machine Learning
Machine LearningMachine Learning
Machine Learning
Β 
London Calling 2019: Karen Miga
London Calling 2019: Karen MigaLondon Calling 2019: Karen Miga
London Calling 2019: Karen Miga
Β 
AlgoAlignementGenomicSequences.ppt
AlgoAlignementGenomicSequences.pptAlgoAlignementGenomicSequences.ppt
AlgoAlignementGenomicSequences.ppt
Β 
Telomere-to-telomere assembly of a complete human X chromosome
Telomere-to-telomere assembly of a complete human X chromosomeTelomere-to-telomere assembly of a complete human X chromosome
Telomere-to-telomere assembly of a complete human X chromosome
Β 
Seminar Slides
Seminar SlidesSeminar Slides
Seminar Slides
Β 
A tutorial in Connectome Analysis (3) - Marcus Kaiser
A tutorial in Connectome Analysis (3) - Marcus KaiserA tutorial in Connectome Analysis (3) - Marcus Kaiser
A tutorial in Connectome Analysis (3) - Marcus Kaiser
Β 

Recently uploaded

Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
Β 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
Β 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
Β 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
Β 
Lucknow πŸ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow πŸ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow πŸ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow πŸ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
Β 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
Β 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
Β 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
Β 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
Β 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
Β 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
Β 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
Β 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
Β 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
Β 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
Β 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSΓ©rgio Sacani
Β 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
Β 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
Β 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
Β 

Recently uploaded (20)

Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
Β 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
Β 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
Β 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
Β 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
Β 
Lucknow πŸ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow πŸ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow πŸ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow πŸ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Β 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
Β 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
Β 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Β 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
Β 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
Β 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
Β 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Β 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
Β 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Β 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Β 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
Β 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
Β 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
Β 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
Β 

Topological associated domains- Hi-C

  • 1. Topological Associated Domains identification using Hi-C Speaker : Djekidel Mohamed Nadhir Date : 03/03/2014
  • 3. Background β€’ Despite revealing the sequence of the genome, little is known about its 3D structure β€’ high-throughput chromosome capture (Hi-C) is 3C-based technology β€’ it can detect chromatin interactions between loci across the entire genome Biological experiment: Ming, H., et al. (2013). "Understanding spatial organizations of chromosomes via statistical analysis of Hi-C data." Quantitative Biology 1.
  • 4. Background β€’ Hi-C in the chromatin conformation study map Smallwood, A. and B. Ren (2013). "Genome organization and long-range regulation of gene expression by enhancers." Current opinion in cell biology 25(3): 387-394.
  • 5. Background- Processing pipeline β€’ 4 main steps: β€’ Read mapping : Each side (50 bp) is mapped independently to the reference genome β€’ Read level filtering β€’ Fragment filtering : Filter fragments with low mappability score β€’ Creation of the Hi-C contact matrix Ming, H., et al. (2013). "Understanding spatial organizations of chromosomes via statistical analysis of Hi-C data." Quantitative Biology 1.
  • 6. Background- Processing pipeline β€’ Read filtering step : The flowing types of reads should be removed : β€’ Self-ligation reads: β€’ Dangling reads : un-ligated reads β€’ PCR amplification reads: many reads that map to the same location β€’ Random breaking reads : reads located far from the enzyme cutting site (𝑑1 + 𝑑2 > 500𝑏𝑝 )
  • 7. Background- Processing pipeline β€’ Fragment filtering step : Remove fragments with low mappability score (< 0.5) β€’ fragment near centromere or telomere regions tends to contain a large proportion of repetitive sequence and leads to a low mappability score β€’ Additional suggestions : β€’ Remove fragments with <100bp or > 100 kb β€’ Remove 0.5% of the fragments with the highest number of reads (can be source of PCR artifacts)
  • 8. Background β€’ Construction of the Hi-C interaction matrix: β€’ The number of Enzyme cut-site is 1012 , however a typical Hi-C experiment generate 108 reads β€’ Thus, we need to partition the genome into large scale bins. Processing pipeline: Hi-C vs FISH
  • 9. Discussed paper β€’ Aim : β€’ Investigate the 3D dimensional organization of the human and mouse genome in ES and differentiated cell. β€’ Data : β€’ Mouse : β€’ Mouse embryonic stem cell (mESC) β€’ Cortex cell (generated by another group) β€’ Human : β€’ Human embryonic stem cell (hESC) β€’ IMR90
  • 10. Data control (1) β€’ Remove cut site bias Raw data Normalized data
  • 11. Data control (2) Compare 5C generated data for the HoxA locus (correlation > 0.73) Compare with Phc1 locus 3C data Compare with FISH data of 6 loci
  • 12. Data control (3) Pearson Correlation between replicates
  • 13. Visualization of interactions We can notice aTopological Associated Domain (TAD) structure at bins < 100kb
  • 14. Identification of topological domains Step1: Detection of the interaction bias We notice that in aTAD that : β€’ The upstream portion is highly biased to interact downstream β€’ The downstream portion is highly biased to interact upstream a directionality index (ID) was defined to calculate this bias: β€’ 𝐷𝐼 > 0 οƒ  Upstream bias β€’ 𝐷𝐼 < 0 οƒ  Downstream bias β€’ 𝐷𝐼 the extent of the interaction
  • 15. DI calculation Steps: β€’ The genome was split into bins of length 40 kb β€’ Let : β€’ A: # of reads that map in the 2M upstream of the bin β€’ B: # of reads that map in the 2M downstream of the bin β€’ E: expected number of reads 𝐄 = 𝑨+𝑩 𝟐 β€’ Then : β€’ 𝐷𝐼 = π΅βˆ’π΄ π΅βˆ’π΄ π΄βˆ’πΈ 2 𝐸 + π΅βˆ’πΈ 2 𝐸 -2Mb +2Mb40kb A B
  • 16. Domain detection (1) β€’ Each bin can have 3 states : β€’ Upstream biased β€’ Downstream biased β€’ No bias β€’ Use a HMM based on the DI to infer the biased state β€’ We define : β€’ 𝒀 = [𝒀 𝟏, 𝒀 𝟐, … , 𝒀 𝒏] :The observed DI β€’ 𝑸 = [𝑸 𝟏, 𝑸 𝟐, … , 𝑸 𝒏] :The hidden bias 𝑄𝑖 ∈ {𝐷, π‘ˆ, 𝑁} β€’ 𝑴 = 𝑴 𝟏, 𝑴 𝟐, … , 𝑴 π’Ž : π‘š ∈ [1,20] β€’ The probabilities are calculated as follow: β€’ 𝑷 𝒀𝒕 𝑸 𝒕 = π’Š, 𝑴𝒕 ) = 𝓝 𝐘𝐭; ππ’Šπ’Ž, πšΊπ’Šπ’Ž β€’ 𝑷 𝑴𝒕 = π’Ž 𝑸 𝒕 = π’Š) = π‘ͺ(π’Š, π’Ž) β€’ π‘ͺ(π’Š, π’Ž): the mixture weight D D D D U U U N N N D D D U U Domain Boundary Domain ` ` ` 𝑀1 𝑀2 𝑀3 𝑸 𝒕 π’šπ’• 𝑴 𝒕 𝑸 𝒕+𝟏 π’šπ’•+𝟏 𝑴𝒕+𝟏 D U N
  • 17. Domain detection (1) β€’ The region between twoTAD is termed : β€’ Topological boundary : if size < 400kb β€’ Unrecognized chromatin : if size β‰₯ 400 kb
  • 18. What separates twoTADs β€’ Studied the HoxA locus known to be separated into two compartments β€’ Found that the CS5 insulator resides in the boundary β€’ Maybe insulators are enriched at the boundary ?
  • 19. CTFC role in the boundary β€’ Studied other known insulator CTCF
  • 20. Heterochromatin and boundary β€’ the H3K9me3 profile changed between cells hESC and IMR90 but the boundaries structure didn’t change β€’ potential link between the topological domains and transcriptional control in the mammalian genome
  • 21. Characteristics ofTAD β€’ TAD are stable between cell lines hESC IMR90
  • 22. Characteristics ofTAD β€’ TAD are conserved between species
  • 23. Cell type specific interactions β€’ A binomial test is performed for each 20kb bin to determine is it is cell specific β€’ Calculate 𝒏 = 𝑰 π’Žπ‘¬π‘Ίπ‘ͺ + 𝑰 𝒄𝒐𝒓𝒕𝒆𝒙 , the number of possible interactions at a distance 𝒅 β€’ Calculate the expected value 𝒑 = 𝑰 π’Žπ‘¬π‘Ίπ‘ͺ 𝒏 or 𝒑 = 𝑰 𝒄𝒐𝒓𝒕𝒆𝒙 𝒏 β€’ Then for each bin do a binomial-test to see if there is a deviation in the number cell specific interactions d d d d 𝒏 = πŸ‘ + 𝟐 + 𝟏 + 𝟏 + 𝟐 + 𝟏 + πŸ’ + 𝟏 = πŸπŸ“ mESC Cortex 𝒑 = πŸ• πŸπŸ“ or 𝒑 = πŸ– 𝟏𝟐
  • 24. Cell type specific interactions β€’ 20% of the genes that have a FCβ‰₯ 4 are found in dynamic interacting loci. β€’ > 96% of the dynamic interactions occur in the same domain. β€’ Model : β€’ domain organization is stable between cell types β€’ but the regions within each domain may be dynamic,
  • 25. Factors forming the boundary (1) β€’ Boundaries are enriched for active promoter signals and gene bodies
  • 26. Factors forming the boundary (2)
  • 27. TAD vs A/B compartments (1) β€’ Loci found clustered in A compartments are generally: β€’ gene rich, β€’ transcriptionally active, β€’ and DNase I hypersensitive, Lieberman-Aiden, E., et al. (2009), Science (New York, N.Y.) 326(5950): 289-293. Compartment B CompartmentA β€’ Loci found clustered in B compartments are generally: β€’ gene poor, β€’ transcriptionally silent β€’ and DNase I insensitive At a higher order the chromatin is organized into A and B compartments
  • 28. TAD vs A/B compartments (2) TAD are smaller than A/B compartments
  • 29. TAD vs A/B compartments (3) In summary : Gibcus, J. and J. Dekker (2013). "The hierarchy of the 3D genome." Molecular cell 49(5): 773-782.
  • 30. TAD vs A/B compartments (4) In summary : Gibcus, J. and J. Dekker (2013). "The hierarchy of the 3D genome." Molecular cell 49(5): 773-782.
  • 31. TAD vs Lamina associated domains (LAD) (1)
  • 32. TAD vs Lamina associated domains (LAD) (2) Nora, E., et al. (2013). BioEssays : news and reviews in molecular, cellular and developmental biology 35(9): 818-828.
  • 33. TAD vs LOCKs β€’ LOCK: Large Organized Chromatin K9-modifications β€’ Conserved regions exhibiting large H3K9Me2 difference between cell lines
  • 34. Summary β€’ The mammalian genome is segmented into a megabase-scale domains β€’ Domain boundaries are stable between cell lines and species , suggesting that they are a basic property of the chromosome architecture. β€’ Domain boundaries are enricher for : β€’ Transcriptionally active genes β€’ Coincide with heterochromatin boundaries β€’ Enriched with insulator proteins β€’ Enriched with tRNA, SINE and housekeeping genes β€’ Developed many data-analysis approaches
  • 35. Summary β€’ The mammalian genome is segmented into a megabase-scale domains β€’ Domain boundaries are stable between cell lines and species , suggesting that they are a basic property of the chromosome architecture. β€’ Domain boundaries are enricher for : β€’ Transcriptionally active genes β€’ Coincide with heterochromatin boundaries β€’ Enriched with insulator proteins β€’ Enriched with tRNA, SINE and housekeeping genes β€’ Developed many data-analysis approaches