SlideShare a Scribd company logo
methylKit,
DNA methylation analysis from
high-throughput bisulfite
sequencing data
Alexander Gosdschan
PhD Student
Akalin Group, BIMSB MDC
bioinformatics.mdc-berlin.de
Bioconductor Europe Meeting 2018
Bioconductor Europe Meeting 2018Alexander Gosdschan
Bisulfite Sequencing Workflow
5’
ACm
GTAATCGAG
3’
5’
ACm
GTAATUGAG
3’
5’
ACm
GTAATTGAG3
’
sodium
bisulfite
PCR
Sequence
Align to genome
Call methylation
Statistics on
samples
Comparative
analysis
Sample
Correlation
Sample
Clustering
Differential
Analysis
Annotation
Krueger &
Andrews (2011)
Akalin et.
al. (2012)
Segmentation
Bioconductor Europe Meeting 2018Alexander Gosdschan
Read in data
## chrBase chr base strand coverage freqC freqT
## 1 chr21.9764539 chr21 9764539 R 12 25.00 75.00
## 2 chr21.9764513 chr21 9764513 R 12 0.00 100.00
## 3 chr21.9820622 chr21 9820622 F 13 0.00 100.00
## 4 chr21.9837545 chr21 9837545 F 11 0.00 100.00
## 5 chr21.9849022 chr21 9849022 F 124 72.58 27.42
from pre-called txt files (e.g. cytosineReport or
coverage files from Bismark aligner ):
methRead()
from Bismark BAM reads
(supported through RHTSlib):
processBismarkAln()
flat-file database: dbtype = “tabix”
methyl*DB, supported through
Rsamtools
in-memory:
methylRaw / methylRawList
Bioconductor Europe Meeting 2018Alexander Gosdschan
Summarize statistics on samples
getCoverageStats(methylRaw)getMethylationStats(methylRaw)
methylation statistics per base
summary:
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.00 20.00 82.79 63.17 94.74 100.00
percentiles:
0% 10% 20% 30% 40% 50% 60% 70%
0.00000 0.00000 0.00000 48.38710 70.00000 82.78556 90.00000 93.33333
80% 90% 95% 99% 99.5% 99.9% 100%
96.42857 100.00000 100.00000 100.00000 100.00000 100.00000 100.00000
read coverage statistics per base
summary:
Min. 1st Qu. Median Mean 3rd Qu. Max.
10.00 16.00 26.00 34.45 39.00 630.00
percentiles:
0% 10% 20% 30% 40% 50% 60% 70% 80% 90%
10.000 11.000 14.000 17.000 20.000 26.000 30.000 36.000 42.000 60.000
95% 99% 99.5% 99.9% 100%
78.750 195.800 328.300 441.945 630.000
Bioconductor Europe Meeting 2018Alexander Gosdschan
Segmentation
Segmenting the methylome into sections of CpGs with similar
methylation profiles using change-point analysis, followed by
clustering using a mixture modeling approach.
Comparison of features identified
using methylKit change-point based
segmentation on Human IMR90
methylome with published PMDs
identified with MethPipe (Song et al.,
2013b),(Lister et al., 2009, Gaidatzis
et al., 2014)
Wreczycka & Gosdschan (2017)
(supported through fastseg)
methSeg(object) - segmentation of
GRanges, methylDiff,methylRaw
Bioconductor Europe Meeting 2018Alexander Gosdschan
Compare Samples
get the bases covered in all samples: merge all samples to one object for
base-pair locations that are covered in all samples:
unite(methylRawList) → methylBase
getCorrelation(methylBase) clusterSamples(methylBase) PCASamples(methylBase)
assocComp - Batch effect correction
tileMethylCounts - Tiling
windows analysis
Bioconductor Europe Meeting 2018Alexander Gosdschan
Differential Analysis
Testing for differential methylation using either Fisher’s exact test or
Chisq test for logistic regression model (depending on the sample
size per set) with p-value adjustment using SLIM method (Wang,
Tuominen, and Tsai 2011):
calculateDiffMeth(methylBase) → methylDiff
getMethylDiff - filtering
differential bases
calculateDiffMeth(...,
mc.cores=2) - use multiple cores:
Optional correction for overdispersion if more variability present in
the data than assumed by binomial distribution:
calculateDiffMeth(methylBase,overdispersion="MN")
Covariates can be included in the analysis to separate the influence of
the covariates from the treatment effect via the logistic regression model.
Testing if full model is better than the model with the covariates only.
covariates=data.frame(age=c(30,80,30,80))
calculateDiffMeth(methylBase,covariates=covariates)
Bioconductor Europe Meeting 2018Alexander Gosdschan
Annotation
Use genomation package to annotate differentially
methylated regions/bases based on gene annotation:
Presentation on Friday:
Session VI - Katarzyna Wreczycka
first read the gene BED file:
gene.obj=readTranscriptFeatures(system.file("extdata",
"refseq.hg18.bed.txt",package = "methylKit"))
then get all differentially methylated bases:
myDiff25p=getMethylDiff(methylDiff,difference=25,
qvalue=0.01)
now annotate differentially methylated CpGs with
promoter/exon/intron using annotation data
diffAnn=annotateWithGeneParts(as(myDiff25p,"GRanges"),
gene.obj)
finally visualize the annotation:
plotTargetAnnotation(diffAnn,precedence=TRUE,
main="differential methylation annotation")
Bioconductor Europe Meeting 2018Alexander Gosdschan
Acknowledgements
BIMSB: Altuna Akalin, Katarzyna Wreczycka
Bioconductor Team
Code:
- https://github.com/al2na/methylKit
Blog:
- http://zvfak.blogspot.com/search/label/methylKit
Support:
- https://groups.google.com/forum/#!forum/methylkit_discussion

More Related Content

What's hot

Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
Daniela Rotariu
 
NGS: Mapping and de novo assembly
NGS: Mapping and de novo assemblyNGS: Mapping and de novo assembly
NGS: Mapping and de novo assembly
Bioinformatics and Computational Biosciences Branch
 
A comparative study of Clustering for Gene expression data in Bioinformatics
A comparative study of Clustering for Gene expression data in BioinformaticsA comparative study of Clustering for Gene expression data in Bioinformatics
A comparative study of Clustering for Gene expression data in Bioinformatics
Begum Rokeya Universtiy, Rangpur
 
de Bruijn Graph Construction from Combination of Short and Long Reads
de Bruijn Graph Construction from Combination of Short and Long Readsde Bruijn Graph Construction from Combination of Short and Long Reads
de Bruijn Graph Construction from Combination of Short and Long Reads
Sikder Tahsin Al-Amin
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential Expression
Jatinder Singh
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
University of California, Davis
 
Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)
Ram Narasimhan
 
20141218 Methylation Sequencing Analysis
20141218  Methylation Sequencing Analysis20141218  Methylation Sequencing Analysis
20141218 Methylation Sequencing Analysis
Yi-Feng Chang
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
AGRF_Ltd
 
DNA Methylation Data Analysis
DNA Methylation Data AnalysisDNA Methylation Data Analysis
DNA Methylation Data Analysis
Yi-Feng Chang
 
Fasta
FastaFasta
An Overview to Protein bioinformatics
An Overview to Protein bioinformaticsAn Overview to Protein bioinformatics
An Overview to Protein bioinformatics
Joel Ricci-López
 
Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...
tuxette
 
Zinc finger technology
Zinc finger technologyZinc finger technology
Zinc finger technology
Munish Chhabra
 
SWISS-PROT
SWISS-PROTSWISS-PROT
SWISS-PROT
Sowmy Sowmy
 
BITS: Basics of sequence analysis
BITS: Basics of sequence analysisBITS: Basics of sequence analysis
BITS: Basics of sequence analysis
BITS
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
UpendraSharmaUS1
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
Bioinformatics and Computational Biosciences Branch
 
ChipSeq Data Analysis
ChipSeq Data AnalysisChipSeq Data Analysis
ChipSeq Data Analysis
COST action BM1006
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
philmaweb
 

What's hot (20)

Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
 
NGS: Mapping and de novo assembly
NGS: Mapping and de novo assemblyNGS: Mapping and de novo assembly
NGS: Mapping and de novo assembly
 
A comparative study of Clustering for Gene expression data in Bioinformatics
A comparative study of Clustering for Gene expression data in BioinformaticsA comparative study of Clustering for Gene expression data in Bioinformatics
A comparative study of Clustering for Gene expression data in Bioinformatics
 
de Bruijn Graph Construction from Combination of Short and Long Reads
de Bruijn Graph Construction from Combination of Short and Long Readsde Bruijn Graph Construction from Combination of Short and Long Reads
de Bruijn Graph Construction from Combination of Short and Long Reads
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential Expression
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
 
Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)
 
20141218 Methylation Sequencing Analysis
20141218  Methylation Sequencing Analysis20141218  Methylation Sequencing Analysis
20141218 Methylation Sequencing Analysis
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
DNA Methylation Data Analysis
DNA Methylation Data AnalysisDNA Methylation Data Analysis
DNA Methylation Data Analysis
 
Fasta
FastaFasta
Fasta
 
An Overview to Protein bioinformatics
An Overview to Protein bioinformaticsAn Overview to Protein bioinformatics
An Overview to Protein bioinformatics
 
Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...
 
Zinc finger technology
Zinc finger technologyZinc finger technology
Zinc finger technology
 
SWISS-PROT
SWISS-PROTSWISS-PROT
SWISS-PROT
 
BITS: Basics of sequence analysis
BITS: Basics of sequence analysisBITS: Basics of sequence analysis
BITS: Basics of sequence analysis
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
 
ChipSeq Data Analysis
ChipSeq Data AnalysisChipSeq Data Analysis
ChipSeq Data Analysis
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 

Similar to EuroBioc 2018 - metyhlKit overview

CIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptxCIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptx
Fatma Sayed Ibrahim
 
GPU-accelerated Virtual Screening
GPU-accelerated Virtual ScreeningGPU-accelerated Virtual Screening
GPU-accelerated Virtual Screening
Olexandr Isayev
 
4.15 w13 final presentation
4.15  w13 final presentation4.15  w13 final presentation
4.15 w13 final presentation
BYUiGem
 
BCSRCv1.3
BCSRCv1.3BCSRCv1.3
BCSRCv1.3
Kishan Bhut
 
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Merck Life Sciences
 
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
MilliporeSigma
 
Pjb Probes 2009
Pjb Probes 2009Pjb Probes 2009
Pjb Probes 2009
toluene
 
OECD Webinar | Assessing the dispersion stability and dissolution (rate) of n...
OECD Webinar | Assessing the dispersion stability and dissolution (rate) of n...OECD Webinar | Assessing the dispersion stability and dissolution (rate) of n...
OECD Webinar | Assessing the dispersion stability and dissolution (rate) of n...
OECD Environment
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
GenomeInABottle
 
SF and PE CTR-IN 2016 Poster_FInal
SF and PE CTR-IN 2016 Poster_FInalSF and PE CTR-IN 2016 Poster_FInal
SF and PE CTR-IN 2016 Poster_FInal
Steve Flynn
 
Gene Expression Analysis by Real Time PCR
Gene Expression Analysis by Real Time PCRGene Expression Analysis by Real Time PCR
Gene Expression Analysis by Real Time PCR
Suresh Antre
 
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
Kamel Mansouri
 
Gordon2003
Gordon2003Gordon2003
Gordon2003
toluene
 
Homology directed repair (HDR) Knock-in
Homology directed repair (HDR) Knock-inHomology directed repair (HDR) Knock-in
Homology directed repair (HDR) Knock-in
Anandrao Ashok Patil, Ph.D.
 
GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417
GenomeInABottle
 
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
Daniel Su
 
D012622129
D012622129D012622129
D012622129
IOSR Journals
 
Process Parameter Optimization of Bead geometry for AISI 446 in GMAW Process ...
Process Parameter Optimization of Bead geometry for AISI 446 in GMAW Process ...Process Parameter Optimization of Bead geometry for AISI 446 in GMAW Process ...
Process Parameter Optimization of Bead geometry for AISI 446 in GMAW Process ...
iosrjce
 
D012622129
D012622129D012622129
D012622129
IOSR Journals
 
Microarray biotechnologg ppy dna microarrays
Microarray biotechnologg ppy dna microarraysMicroarray biotechnologg ppy dna microarrays
Microarray biotechnologg ppy dna microarrays
ayeshasattarsandhu
 

Similar to EuroBioc 2018 - metyhlKit overview (20)

CIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptxCIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptx
 
GPU-accelerated Virtual Screening
GPU-accelerated Virtual ScreeningGPU-accelerated Virtual Screening
GPU-accelerated Virtual Screening
 
4.15 w13 final presentation
4.15  w13 final presentation4.15  w13 final presentation
4.15 w13 final presentation
 
BCSRCv1.3
BCSRCv1.3BCSRCv1.3
BCSRCv1.3
 
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
 
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
Characterization of monoclonal antibodies and Antibody drug conjugates by Sur...
 
Pjb Probes 2009
Pjb Probes 2009Pjb Probes 2009
Pjb Probes 2009
 
OECD Webinar | Assessing the dispersion stability and dissolution (rate) of n...
OECD Webinar | Assessing the dispersion stability and dissolution (rate) of n...OECD Webinar | Assessing the dispersion stability and dissolution (rate) of n...
OECD Webinar | Assessing the dispersion stability and dissolution (rate) of n...
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
 
SF and PE CTR-IN 2016 Poster_FInal
SF and PE CTR-IN 2016 Poster_FInalSF and PE CTR-IN 2016 Poster_FInal
SF and PE CTR-IN 2016 Poster_FInal
 
Gene Expression Analysis by Real Time PCR
Gene Expression Analysis by Real Time PCRGene Expression Analysis by Real Time PCR
Gene Expression Analysis by Real Time PCR
 
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
QSAR STUDY ON READY BIODEGRADABILITY OF CHEMICALS. Presented at the 3rd Chemo...
 
Gordon2003
Gordon2003Gordon2003
Gordon2003
 
Homology directed repair (HDR) Knock-in
Homology directed repair (HDR) Knock-inHomology directed repair (HDR) Knock-in
Homology directed repair (HDR) Knock-in
 
GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417GIAB and long reads for bio it world 190417
GIAB and long reads for bio it world 190417
 
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
 
D012622129
D012622129D012622129
D012622129
 
Process Parameter Optimization of Bead geometry for AISI 446 in GMAW Process ...
Process Parameter Optimization of Bead geometry for AISI 446 in GMAW Process ...Process Parameter Optimization of Bead geometry for AISI 446 in GMAW Process ...
Process Parameter Optimization of Bead geometry for AISI 446 in GMAW Process ...
 
D012622129
D012622129D012622129
D012622129
 
Microarray biotechnologg ppy dna microarrays
Microarray biotechnologg ppy dna microarraysMicroarray biotechnologg ppy dna microarrays
Microarray biotechnologg ppy dna microarrays
 

Recently uploaded

HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
Shashank Shekhar Pandey
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
Areesha Ahmad
 
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
eitps1506
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
PirithiRaju
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
Tissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptxTissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptx
muralinath2
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Selcen Ozturkcan
 
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills MN
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
hozt8xgk
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
Sciences of Europe
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
Scintica Instrumentation
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
PirithiRaju
 
AJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR NIET GreNo Guava Project File.pdfAJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
RDhivya6
 
Anti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark UniverseAnti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark Universe
Sérgio Sacani
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 

Recently uploaded (20)

HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
 
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
Tissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptxTissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptx
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
 
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
 
AJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR NIET GreNo Guava Project File.pdfAJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR NIET GreNo Guava Project File.pdf
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
 
Anti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark UniverseAnti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark Universe
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 

EuroBioc 2018 - metyhlKit overview

  • 1. methylKit, DNA methylation analysis from high-throughput bisulfite sequencing data Alexander Gosdschan PhD Student Akalin Group, BIMSB MDC bioinformatics.mdc-berlin.de Bioconductor Europe Meeting 2018
  • 2. Bioconductor Europe Meeting 2018Alexander Gosdschan Bisulfite Sequencing Workflow 5’ ACm GTAATCGAG 3’ 5’ ACm GTAATUGAG 3’ 5’ ACm GTAATTGAG3 ’ sodium bisulfite PCR Sequence Align to genome Call methylation Statistics on samples Comparative analysis Sample Correlation Sample Clustering Differential Analysis Annotation Krueger & Andrews (2011) Akalin et. al. (2012) Segmentation
  • 3. Bioconductor Europe Meeting 2018Alexander Gosdschan Read in data ## chrBase chr base strand coverage freqC freqT ## 1 chr21.9764539 chr21 9764539 R 12 25.00 75.00 ## 2 chr21.9764513 chr21 9764513 R 12 0.00 100.00 ## 3 chr21.9820622 chr21 9820622 F 13 0.00 100.00 ## 4 chr21.9837545 chr21 9837545 F 11 0.00 100.00 ## 5 chr21.9849022 chr21 9849022 F 124 72.58 27.42 from pre-called txt files (e.g. cytosineReport or coverage files from Bismark aligner ): methRead() from Bismark BAM reads (supported through RHTSlib): processBismarkAln() flat-file database: dbtype = “tabix” methyl*DB, supported through Rsamtools in-memory: methylRaw / methylRawList
  • 4. Bioconductor Europe Meeting 2018Alexander Gosdschan Summarize statistics on samples getCoverageStats(methylRaw)getMethylationStats(methylRaw) methylation statistics per base summary: Min. 1st Qu. Median Mean 3rd Qu. Max. 0.00 20.00 82.79 63.17 94.74 100.00 percentiles: 0% 10% 20% 30% 40% 50% 60% 70% 0.00000 0.00000 0.00000 48.38710 70.00000 82.78556 90.00000 93.33333 80% 90% 95% 99% 99.5% 99.9% 100% 96.42857 100.00000 100.00000 100.00000 100.00000 100.00000 100.00000 read coverage statistics per base summary: Min. 1st Qu. Median Mean 3rd Qu. Max. 10.00 16.00 26.00 34.45 39.00 630.00 percentiles: 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 10.000 11.000 14.000 17.000 20.000 26.000 30.000 36.000 42.000 60.000 95% 99% 99.5% 99.9% 100% 78.750 195.800 328.300 441.945 630.000
  • 5. Bioconductor Europe Meeting 2018Alexander Gosdschan Segmentation Segmenting the methylome into sections of CpGs with similar methylation profiles using change-point analysis, followed by clustering using a mixture modeling approach. Comparison of features identified using methylKit change-point based segmentation on Human IMR90 methylome with published PMDs identified with MethPipe (Song et al., 2013b),(Lister et al., 2009, Gaidatzis et al., 2014) Wreczycka & Gosdschan (2017) (supported through fastseg) methSeg(object) - segmentation of GRanges, methylDiff,methylRaw
  • 6. Bioconductor Europe Meeting 2018Alexander Gosdschan Compare Samples get the bases covered in all samples: merge all samples to one object for base-pair locations that are covered in all samples: unite(methylRawList) → methylBase getCorrelation(methylBase) clusterSamples(methylBase) PCASamples(methylBase) assocComp - Batch effect correction tileMethylCounts - Tiling windows analysis
  • 7. Bioconductor Europe Meeting 2018Alexander Gosdschan Differential Analysis Testing for differential methylation using either Fisher’s exact test or Chisq test for logistic regression model (depending on the sample size per set) with p-value adjustment using SLIM method (Wang, Tuominen, and Tsai 2011): calculateDiffMeth(methylBase) → methylDiff getMethylDiff - filtering differential bases calculateDiffMeth(..., mc.cores=2) - use multiple cores: Optional correction for overdispersion if more variability present in the data than assumed by binomial distribution: calculateDiffMeth(methylBase,overdispersion="MN") Covariates can be included in the analysis to separate the influence of the covariates from the treatment effect via the logistic regression model. Testing if full model is better than the model with the covariates only. covariates=data.frame(age=c(30,80,30,80)) calculateDiffMeth(methylBase,covariates=covariates)
  • 8. Bioconductor Europe Meeting 2018Alexander Gosdschan Annotation Use genomation package to annotate differentially methylated regions/bases based on gene annotation: Presentation on Friday: Session VI - Katarzyna Wreczycka first read the gene BED file: gene.obj=readTranscriptFeatures(system.file("extdata", "refseq.hg18.bed.txt",package = "methylKit")) then get all differentially methylated bases: myDiff25p=getMethylDiff(methylDiff,difference=25, qvalue=0.01) now annotate differentially methylated CpGs with promoter/exon/intron using annotation data diffAnn=annotateWithGeneParts(as(myDiff25p,"GRanges"), gene.obj) finally visualize the annotation: plotTargetAnnotation(diffAnn,precedence=TRUE, main="differential methylation annotation")
  • 9. Bioconductor Europe Meeting 2018Alexander Gosdschan Acknowledgements BIMSB: Altuna Akalin, Katarzyna Wreczycka Bioconductor Team Code: - https://github.com/al2na/methylKit Blog: - http://zvfak.blogspot.com/search/label/methylKit Support: - https://groups.google.com/forum/#!forum/methylkit_discussion