SlideShare a Scribd company logo
1 of 28
Mohamed Nadhir Djekidel (那弟尔)
2013/07/09
Outline
ChIA-PET principle (1)
ChIA-PET principle (2)
Head Tag : 20 bp Tail Tag : 20 bp
Half-linker: 16 bp
ChIA-PET principle (3)
Pipeline implementation challenges
• The designed pipeline we face two kind of problems:
• Analysis problems :
• how to accurately filter the linker sequences from the raw reads ?
• how to accurately and efficiently map the tag sequences to reference genomes?
• how to evaluate the noise level in the data?
• how to identify bona-fide (real) binding sites and chromatin interactions ?
• Visualization problems:
• how to organize the datasets ?
• how to effectively visualize the identified long-range chromatin interactions ?
ChIA-PET tool
ChIA-PET tool-fastq file format
• Each PET entry consists of four lines.
• Only the first 20 bp are mapped to the genome.
• The other 16 are assumed to belong to the linker.
Seq1.fasq
Seq2.fasq
ChIA-PET tool-Filtering
Non-Chimeric Chimeric (non considered)
ambiguous PET
ChIA-PET tool-Filtering
Using ER-𝛼 on MCF-7
(breast cancer)
ChIA-PET tool-Mapping
• Mapping was done using the Batman package.
• Only one mismatch was allowed.
• Only PETs with uniquely mapped tags where considered.
• Duplicate PETs were removed as they can be the result of PCR clonal
amplification.
ChIA-PET tool-Mapping
• Merge similarly mapped PETs (within ± 1bp) into one unique PET
• The aim is to reduce false positive calls due to PCR amplification.
ChIA-PET tool-Mapping-File format
Uniquely mapped
ChIA-PET tool – PET classification (1)
ChIA-PET tool – PET classification (2)
• How to determine the span cutoff between self-ligation and Inter-ligation PETs ?
• From the histogram we can see that the majority of the PETs do not span more than 2kb from each other.
• for IHH015M library data, the span cutoff called by this method is 4,595 bp
Self-ligation
Intra-ligation
ChIA-PET tool – PET classification (3)
To check whether the obtained self-ligation are not due to random
ligation, a random model was generated:
Let:
• 𝑁𝑖: The number of DNA fragment on 𝐶ℎ𝑟𝑖
• 𝐼𝑛𝑡𝑟𝑎𝑖: The number of Intra-ligation related fragments on 𝐶ℎ𝑟𝑖
If we suppose any DNA fragment can interact with any other
fragment we get:
𝑟𝑎𝑡𝑖𝑜_𝑖𝑛𝑡𝑟𝑎 𝑖 =
𝐼𝑛𝑡𝑟𝑎𝑖
𝑁𝑖
2
So the total ratio over all chromosomes will be
𝑟𝑎𝑡𝑖𝑜 𝑡𝑜𝑡𝑎𝑙 =
𝑖 𝐼𝑛𝑡𝑟𝑎𝑖
𝑖 𝑁𝑖
2
• In Chimeric intra-ligation (IHH015C):
• The expected ration : 0.0552
• the observed ration: 0.0558
• ⇒ P-value =5.2E-4 ⇒ Random
• In non-Chimeric intra-ligation (IHH015M) :
• The expected ratio is : 0.0546
• the observed ratio is : 0.0929
• ⇒ P-value < 2.97E-323 ⇒ non-Random
𝑁𝑖:5
𝐼𝑛𝑡𝑎𝑖 ∶ 3
𝑟𝑎𝑡𝑖𝑜_𝑖𝑛𝑡𝑟𝑎 𝑖 =
3
25
ChIA-PET tool – PET classification (3)
ChIA-PET tool – PET classification (3)
We can see from the heatmap that the chimeric ligations are randomly distributed.
ChIA-PET tool- Slef-ligation site analysis
• In this step the Chip-enriched regions (self-ligation) are clustered.
• Need to filter-out clusters with high FDR
• FDR was calculated as follow:
• For each cluster calculate the probability of obtaining 𝑘 ligation by chance using a Monte-Carlo simulation
• Monte-Carlo simulation consists of randomly picking PETs and see how many PETs are in that cluster.
• Repeat the simulation 100 times and get the mean.
• Clusters with an FDR > 0.01 were filtered-out
ChIA-PET tool-Chromatin interaction (1)
• The identification chromatin interaction is done in two steps:
• Step1:
• Identify ChIP enriched interaction anchor regions from inter-ligation PETs similar to ChIP-
Seq
• The tag length is extended from 5’ to 3’ by a “tag extension length” (in this data set it
was 200 bp)
• Note: Most of the Inter-ligation anchor regions overlap with the self-ligation regions
• Step2
• Determine the number of overlapping PETs.
• Filter-out random interactions.
Region A
Region B
ChIA-PET tool-Chromatin interaction (2)
• The null hypothesis assumes :
• Each chromatin fragment has a equal chance to ligate to any other fragment randomly.
• The interaction between each anchors are independent from each other.
• Let
• 𝑹 𝑨 and 𝑹 𝑩 be two regions
• 𝑪 𝑨 = Number of PETs in a 𝑅 𝐴
• 𝑪 𝑩 = Number of PETs in a 𝑅 𝐵
• 𝑰 𝑨,𝑩 = The number of interaction between two regions 𝐴 and 𝐵
• 𝑵 = The number of Inter-ligation (I think it should be only on the same chromosome)
𝑃 𝐼𝐴,𝐵 𝑁, 𝐶𝐴, 𝐶 𝐵) =
𝐶 𝐴
𝐼 𝐴,𝐵
2𝑁−𝐶 𝐴
𝐶 𝐵 −𝐼 𝐴,𝐵
2𝑁
𝐶 𝐵
ChIA-PET tool- Visualization (1)
ChIA-PET tool- Visualization (1)
Can be accessed at : http://cms1.gis.a-star.edu.sg/cgi-bin/gbrowse
ChIA-PET tool- Visualization (2)
• Can be visualized at this link: http://cms1.gis.a-star.edu.sg/index.php
• User name: guest
• Password: guest
ChIA-PET tool- Visualization (3)
Installing Chia-PET on our server
MySQL server:
• You need to install your own or install MySQL Cluster
Adding a new assembly:
• Download the reference genome and create BATMAN index using steps in Installation guid (appendix)
• Download for the UCSC table the following files and format them in the Chia-pet format:
• Chromosome sizes
• Genes
• Gaps
• Installing missing packages (Note: replace $HOME by the real path)
• fftw: create a custom directory (ex: $HOME/bin) install using the following commands
• rimage: Open R and install it as follow:
./configure –-prefix=$HOME/bin
make CFLAGS=“-fCIP”
Make install
install.packages("rimage_0.5-8.2.tar",repos=NULL,configure.args="--with-fftw-include=$HOME/bin/incluse --with-fftw-lib=$HOME/bin/lib")
Running Chia-PET tool
• We suppose that Chia-PET is installed in $HOME/ChiaPET
• Go to $HOME/ChiaPET/src/python/common and open the file config.py and change the linkers
• Go to $HOME/ChiaPET/src/python/main
• Just need to run the following commands:
• The results will be in $HOME/ChiaPET/work/<lib_name>
• Temporary files will be in $HOME/ChiaPET/prep/<lib_name>
Overview of ChIA-PET tools

More Related Content

What's hot

Previewing GRCm39: Assembly Updates from the GRC
Previewing GRCm39: Assembly Updates from the GRCPreviewing GRCm39: Assembly Updates from the GRC
Previewing GRCm39: Assembly Updates from the GRCGenome Reference Consortium
 
hg19 (GRCh37) vs. hg38 (GRCh38)
hg19 (GRCh37) vs. hg38 (GRCh38)hg19 (GRCh37) vs. hg38 (GRCh38)
hg19 (GRCh37) vs. hg38 (GRCh38)Shaojun Xie
 
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekingeProf. Wim Van Criekinge
 
Advancements in the human genome reference assembly (GRCh38)
Advancements in the human genome reference assembly (GRCh38)Advancements in the human genome reference assembly (GRCh38)
Advancements in the human genome reference assembly (GRCh38)Genome Reference Consortium
 
Telomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomesTelomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomesGenome Reference Consortium
 
2016 bioinformatics i_wim_vancriekinge_vupload
2016 bioinformatics i_wim_vancriekinge_vupload2016 bioinformatics i_wim_vancriekinge_vupload
2016 bioinformatics i_wim_vancriekinge_vuploadProf. Wim Van Criekinge
 
What's new and what's next for the human reference assembly?
What's new and what's next for the human reference assembly?What's new and what's next for the human reference assembly?
What's new and what's next for the human reference assembly?Genome Reference Consortium
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGenomeInABottle
 
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...Databricks
 
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3GenomeInABottle
 
Giab agbt small_var_2020
Giab agbt small_var_2020Giab agbt small_var_2020
Giab agbt small_var_2020GenomeInABottle
 
2015 bioinformatics databases_wim_vancriekinge
2015 bioinformatics databases_wim_vancriekinge2015 bioinformatics databases_wim_vancriekinge
2015 bioinformatics databases_wim_vancriekingeProf. Wim Van Criekinge
 
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGenomeInABottle
 

What's hot (20)

Previewing GRCm39: Assembly Updates from the GRC
Previewing GRCm39: Assembly Updates from the GRCPreviewing GRCm39: Assembly Updates from the GRC
Previewing GRCm39: Assembly Updates from the GRC
 
Lrg and mane 16 oct 2018
Lrg and mane   16 oct 2018Lrg and mane   16 oct 2018
Lrg and mane 16 oct 2018
 
hg19 (GRCh37) vs. hg38 (GRCh38)
hg19 (GRCh37) vs. hg38 (GRCh38)hg19 (GRCh37) vs. hg38 (GRCh38)
hg19 (GRCh37) vs. hg38 (GRCh38)
 
Ashg sedlazeck grc_share
Ashg sedlazeck grc_shareAshg sedlazeck grc_share
Ashg sedlazeck grc_share
 
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
 
Schneider grc workshop_final
Schneider grc workshop_finalSchneider grc workshop_final
Schneider grc workshop_final
 
Advancements in the human genome reference assembly (GRCh38)
Advancements in the human genome reference assembly (GRCh38)Advancements in the human genome reference assembly (GRCh38)
Advancements in the human genome reference assembly (GRCh38)
 
Variant analysis and whole exome sequencing
Variant analysis and whole exome sequencingVariant analysis and whole exome sequencing
Variant analysis and whole exome sequencing
 
Telomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomesTelomere-to-telomere assembly of a complete human chromosomes
Telomere-to-telomere assembly of a complete human chromosomes
 
2016 bioinformatics i_wim_vancriekinge_vupload
2016 bioinformatics i_wim_vancriekinge_vupload2016 bioinformatics i_wim_vancriekinge_vupload
2016 bioinformatics i_wim_vancriekinge_vupload
 
What's new and what's next for the human reference assembly?
What's new and what's next for the human reference assembly?What's new and what's next for the human reference assembly?
What's new and what's next for the human reference assembly?
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant poster
 
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
 
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
 
RNA-Seq with R-Bioconductor
RNA-Seq with R-BioconductorRNA-Seq with R-Bioconductor
RNA-Seq with R-Bioconductor
 
2016 02 23_biological_databases_part1
2016 02 23_biological_databases_part12016 02 23_biological_databases_part1
2016 02 23_biological_databases_part1
 
Giab agbt small_var_2020
Giab agbt small_var_2020Giab agbt small_var_2020
Giab agbt small_var_2020
 
2015 bioinformatics databases_wim_vancriekinge
2015 bioinformatics databases_wim_vancriekinge2015 bioinformatics databases_wim_vancriekinge
2015 bioinformatics databases_wim_vancriekinge
 
GoTermsAnalysisWithR
GoTermsAnalysisWithRGoTermsAnalysisWithR
GoTermsAnalysisWithR
 
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
 

Similar to Overview of ChIA-PET tools

Botnets behavioral patterns in the network. A Machine Learning study of botne...
Botnets behavioral patterns in the network. A Machine Learning study of botne...Botnets behavioral patterns in the network. A Machine Learning study of botne...
Botnets behavioral patterns in the network. A Machine Learning study of botne...Czech Technical University in Prague
 
PCR Array Data Analysis Tutorial: qPCR Technology Webinar Series Part 3
PCR Array Data Analysis Tutorial: qPCR Technology Webinar Series Part 3PCR Array Data Analysis Tutorial: qPCR Technology Webinar Series Part 3
PCR Array Data Analysis Tutorial: qPCR Technology Webinar Series Part 3QIAGEN
 
Cyto 2015 Forensic Flow Cytometry Tutorial
Cyto 2015 Forensic Flow Cytometry TutorialCyto 2015 Forensic Flow Cytometry Tutorial
Cyto 2015 Forensic Flow Cytometry TutorialPratip Chattopadhyay
 
Tautomers - Advanced Databases for in-silico Screening?
Tautomers - Advanced Databases for in-silico Screening?Tautomers - Advanced Databases for in-silico Screening?
Tautomers - Advanced Databases for in-silico Screening?Frank Oellien
 
Too good to be true? How validate your data
Too good to be true? How validate your dataToo good to be true? How validate your data
Too good to be true? How validate your dataAlex Henderson
 
Soft And Handling
Soft And HandlingSoft And Handling
Soft And Handlinghiratufail
 
Feature Selection Strategies for HTTP Botnet Traffic Detection
Feature Selection Strategies for HTTP Botnet Traffic DetectionFeature Selection Strategies for HTTP Botnet Traffic Detection
Feature Selection Strategies for HTTP Botnet Traffic DetectionIvan Letteri
 
On Improving the Performance of Data Leak Prevention using White-list Approach
On Improving the Performance of Data Leak Prevention using White-list ApproachOn Improving the Performance of Data Leak Prevention using White-list Approach
On Improving the Performance of Data Leak Prevention using White-list ApproachPatrick Nguyen
 
DNA cell cycle by flow cytometry
DNA cell cycle by flow cytometryDNA cell cycle by flow cytometry
DNA cell cycle by flow cytometryRichard Hastings
 
BITS - Comparative genomics: the Contra tool
BITS - Comparative genomics: the Contra toolBITS - Comparative genomics: the Contra tool
BITS - Comparative genomics: the Contra toolBITS
 
Test vector compression in Digital Testing
Test vector compression in Digital Testing Test vector compression in Digital Testing
Test vector compression in Digital Testing Amr Abd El Latief
 
Pete thorpe wp1 april 2018
Pete thorpe wp1 april 2018Pete thorpe wp1 april 2018
Pete thorpe wp1 april 2018Forest Research
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataPhilip Cheung
 

Similar to Overview of ChIA-PET tools (20)

Botnets behavioral patterns in the network. A Machine Learning study of botne...
Botnets behavioral patterns in the network. A Machine Learning study of botne...Botnets behavioral patterns in the network. A Machine Learning study of botne...
Botnets behavioral patterns in the network. A Machine Learning study of botne...
 
Lecture_3.pptx
Lecture_3.pptxLecture_3.pptx
Lecture_3.pptx
 
PCR Array Data Analysis Tutorial: qPCR Technology Webinar Series Part 3
PCR Array Data Analysis Tutorial: qPCR Technology Webinar Series Part 3PCR Array Data Analysis Tutorial: qPCR Technology Webinar Series Part 3
PCR Array Data Analysis Tutorial: qPCR Technology Webinar Series Part 3
 
Cyto 2015 Forensic Flow Cytometry Tutorial
Cyto 2015 Forensic Flow Cytometry TutorialCyto 2015 Forensic Flow Cytometry Tutorial
Cyto 2015 Forensic Flow Cytometry Tutorial
 
May workshop
May workshopMay workshop
May workshop
 
Tautomers - Advanced Databases for in-silico Screening?
Tautomers - Advanced Databases for in-silico Screening?Tautomers - Advanced Databases for in-silico Screening?
Tautomers - Advanced Databases for in-silico Screening?
 
May 15 workshop
May 15  workshopMay 15  workshop
May 15 workshop
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
Too good to be true? How validate your data
Too good to be true? How validate your dataToo good to be true? How validate your data
Too good to be true? How validate your data
 
Soft And Handling
Soft And HandlingSoft And Handling
Soft And Handling
 
Feature Selection Strategies for HTTP Botnet Traffic Detection
Feature Selection Strategies for HTTP Botnet Traffic DetectionFeature Selection Strategies for HTTP Botnet Traffic Detection
Feature Selection Strategies for HTTP Botnet Traffic Detection
 
On Improving the Performance of Data Leak Prevention using White-list Approach
On Improving the Performance of Data Leak Prevention using White-list ApproachOn Improving the Performance of Data Leak Prevention using White-list Approach
On Improving the Performance of Data Leak Prevention using White-list Approach
 
DNA cell cycle by flow cytometry
DNA cell cycle by flow cytometryDNA cell cycle by flow cytometry
DNA cell cycle by flow cytometry
 
BITS - Comparative genomics: the Contra tool
BITS - Comparative genomics: the Contra toolBITS - Comparative genomics: the Contra tool
BITS - Comparative genomics: the Contra tool
 
Test vector compression in Digital Testing
Test vector compression in Digital Testing Test vector compression in Digital Testing
Test vector compression in Digital Testing
 
Pete thorpe wp1 april 2018
Pete thorpe wp1 april 2018Pete thorpe wp1 april 2018
Pete thorpe wp1 april 2018
 
Test vector compression
Test vector compressionTest vector compression
Test vector compression
 
Ensembl annotation
Ensembl annotationEnsembl annotation
Ensembl annotation
 
cache memory
 cache memory cache memory
cache memory
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
 

Recently uploaded

Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxBhagirath Gogikar
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)AkefAfaneh2
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 

Recently uploaded (20)

Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 

Overview of ChIA-PET tools

  • 1. Mohamed Nadhir Djekidel (那弟尔) 2013/07/09
  • 4. ChIA-PET principle (2) Head Tag : 20 bp Tail Tag : 20 bp Half-linker: 16 bp
  • 6. Pipeline implementation challenges • The designed pipeline we face two kind of problems: • Analysis problems : • how to accurately filter the linker sequences from the raw reads ? • how to accurately and efficiently map the tag sequences to reference genomes? • how to evaluate the noise level in the data? • how to identify bona-fide (real) binding sites and chromatin interactions ? • Visualization problems: • how to organize the datasets ? • how to effectively visualize the identified long-range chromatin interactions ?
  • 8. ChIA-PET tool-fastq file format • Each PET entry consists of four lines. • Only the first 20 bp are mapped to the genome. • The other 16 are assumed to belong to the linker. Seq1.fasq Seq2.fasq
  • 9. ChIA-PET tool-Filtering Non-Chimeric Chimeric (non considered) ambiguous PET
  • 10. ChIA-PET tool-Filtering Using ER-𝛼 on MCF-7 (breast cancer)
  • 11. ChIA-PET tool-Mapping • Mapping was done using the Batman package. • Only one mismatch was allowed. • Only PETs with uniquely mapped tags where considered. • Duplicate PETs were removed as they can be the result of PCR clonal amplification.
  • 12. ChIA-PET tool-Mapping • Merge similarly mapped PETs (within ± 1bp) into one unique PET • The aim is to reduce false positive calls due to PCR amplification.
  • 14. ChIA-PET tool – PET classification (1)
  • 15. ChIA-PET tool – PET classification (2) • How to determine the span cutoff between self-ligation and Inter-ligation PETs ? • From the histogram we can see that the majority of the PETs do not span more than 2kb from each other. • for IHH015M library data, the span cutoff called by this method is 4,595 bp Self-ligation Intra-ligation
  • 16. ChIA-PET tool – PET classification (3) To check whether the obtained self-ligation are not due to random ligation, a random model was generated: Let: • 𝑁𝑖: The number of DNA fragment on 𝐶ℎ𝑟𝑖 • 𝐼𝑛𝑡𝑟𝑎𝑖: The number of Intra-ligation related fragments on 𝐶ℎ𝑟𝑖 If we suppose any DNA fragment can interact with any other fragment we get: 𝑟𝑎𝑡𝑖𝑜_𝑖𝑛𝑡𝑟𝑎 𝑖 = 𝐼𝑛𝑡𝑟𝑎𝑖 𝑁𝑖 2 So the total ratio over all chromosomes will be 𝑟𝑎𝑡𝑖𝑜 𝑡𝑜𝑡𝑎𝑙 = 𝑖 𝐼𝑛𝑡𝑟𝑎𝑖 𝑖 𝑁𝑖 2 • In Chimeric intra-ligation (IHH015C): • The expected ration : 0.0552 • the observed ration: 0.0558 • ⇒ P-value =5.2E-4 ⇒ Random • In non-Chimeric intra-ligation (IHH015M) : • The expected ratio is : 0.0546 • the observed ratio is : 0.0929 • ⇒ P-value < 2.97E-323 ⇒ non-Random 𝑁𝑖:5 𝐼𝑛𝑡𝑎𝑖 ∶ 3 𝑟𝑎𝑡𝑖𝑜_𝑖𝑛𝑡𝑟𝑎 𝑖 = 3 25
  • 17. ChIA-PET tool – PET classification (3)
  • 18. ChIA-PET tool – PET classification (3) We can see from the heatmap that the chimeric ligations are randomly distributed.
  • 19. ChIA-PET tool- Slef-ligation site analysis • In this step the Chip-enriched regions (self-ligation) are clustered. • Need to filter-out clusters with high FDR • FDR was calculated as follow: • For each cluster calculate the probability of obtaining 𝑘 ligation by chance using a Monte-Carlo simulation • Monte-Carlo simulation consists of randomly picking PETs and see how many PETs are in that cluster. • Repeat the simulation 100 times and get the mean. • Clusters with an FDR > 0.01 were filtered-out
  • 20. ChIA-PET tool-Chromatin interaction (1) • The identification chromatin interaction is done in two steps: • Step1: • Identify ChIP enriched interaction anchor regions from inter-ligation PETs similar to ChIP- Seq • The tag length is extended from 5’ to 3’ by a “tag extension length” (in this data set it was 200 bp) • Note: Most of the Inter-ligation anchor regions overlap with the self-ligation regions • Step2 • Determine the number of overlapping PETs. • Filter-out random interactions.
  • 21. Region A Region B ChIA-PET tool-Chromatin interaction (2) • The null hypothesis assumes : • Each chromatin fragment has a equal chance to ligate to any other fragment randomly. • The interaction between each anchors are independent from each other. • Let • 𝑹 𝑨 and 𝑹 𝑩 be two regions • 𝑪 𝑨 = Number of PETs in a 𝑅 𝐴 • 𝑪 𝑩 = Number of PETs in a 𝑅 𝐵 • 𝑰 𝑨,𝑩 = The number of interaction between two regions 𝐴 and 𝐵 • 𝑵 = The number of Inter-ligation (I think it should be only on the same chromosome) 𝑃 𝐼𝐴,𝐵 𝑁, 𝐶𝐴, 𝐶 𝐵) = 𝐶 𝐴 𝐼 𝐴,𝐵 2𝑁−𝐶 𝐴 𝐶 𝐵 −𝐼 𝐴,𝐵 2𝑁 𝐶 𝐵
  • 23. ChIA-PET tool- Visualization (1) Can be accessed at : http://cms1.gis.a-star.edu.sg/cgi-bin/gbrowse
  • 24. ChIA-PET tool- Visualization (2) • Can be visualized at this link: http://cms1.gis.a-star.edu.sg/index.php • User name: guest • Password: guest
  • 26. Installing Chia-PET on our server MySQL server: • You need to install your own or install MySQL Cluster Adding a new assembly: • Download the reference genome and create BATMAN index using steps in Installation guid (appendix) • Download for the UCSC table the following files and format them in the Chia-pet format: • Chromosome sizes • Genes • Gaps • Installing missing packages (Note: replace $HOME by the real path) • fftw: create a custom directory (ex: $HOME/bin) install using the following commands • rimage: Open R and install it as follow: ./configure –-prefix=$HOME/bin make CFLAGS=“-fCIP” Make install install.packages("rimage_0.5-8.2.tar",repos=NULL,configure.args="--with-fftw-include=$HOME/bin/incluse --with-fftw-lib=$HOME/bin/lib")
  • 27. Running Chia-PET tool • We suppose that Chia-PET is installed in $HOME/ChiaPET • Go to $HOME/ChiaPET/src/python/common and open the file config.py and change the linkers • Go to $HOME/ChiaPET/src/python/main • Just need to run the following commands: • The results will be in $HOME/ChiaPET/work/<lib_name> • Temporary files will be in $HOME/ChiaPET/prep/<lib_name>

Editor's Notes

  1. Intra-chromosomal inter-ligation PETs' are PETs with both tags mapped to the same chromosome with a long genomic span
  2. Intra-chromosomal inter-ligation PETs' are PETs with both tags mapped to the same chromosome with a long genomic span
  3. Intra-chromosomal inter-ligation PETs' are PETs with both tags mapped to the same chromosome with a long genomic span
  4. Intra-chromosomal inter-ligation PETs' are PETs with both tags mapped to the same chromosome with a long genomic span
  5. Intra-chromosomal inter-ligation PETs' are PETs with both tags mapped to the same chromosome with a long genomic span