SlideShare a Scribd company logo
EpiMOLAS: An Intuitive Web-based Framework for
Genome-wide DNA Methylation Analysis
Presented By
Sheng-Yao Su
Bioinformatics Program, Taiwan International Graduate Program,
Institute of Information Science, Academia Sinica
Institute of Biomedical Informatics, National Yang-Ming University
TAIWAN
Sep 10, 2019
Outline
• Introduction
• Methods
• Implementations and Results
• EpiMOLAS consists of DocMethyl and EpiMOLAS_web
• Discussion
• Conclusion
Introduction
Epigenomics
• Epi- (upon, above, beyond) genomics (DNA sequence)
• Waddington proposed this term in 1940s.
• Epigenomics is the study of the complete set of epigenetic
modification on the genetic material of a cell (wiki)
Epigenomic Dynamics
DNA
methylation
Histone
Midification
Nucleosome
Remodelling
Non-coding
RNAs
DNA methylation – an epigenetic mark of
cellular memory
DNA methylation: an epigenetic mark of cellular memory
Experimental & Molecular Medicine volume 49, page e322 (2017)
5-mC 5-hmCC
Chemical Structure
Sodium Bisulfite treatment
Correct conversion : C -> U -> T
Correct conversion : mC -> mC -> C
incorrect conversion : mC -> U -> T
Bisulfite
treatment
PCR
amplification
Unmethylated DNA Methylated DNA
Original sequence CCGTCGACGT CmCGTmCGAmCGT
Bisulfite converted UUGTUGAUGT UmCGTmCGAmCGT
PCR product TTGTTGATGT TCGTCGACGT
Incomplete conversion
Detect DNA modification changes
• Bisulfite conversion treatment
• Reduced Representation Bisulfite Sequencing (RRBS)
• Whole Genome Bisulfite Sequencing (WGBS)
• Bisulfite-free
• Anti-methylcytosine Antibody
• Methyl-CpG binding domain (MBD)
• Chemical labeling (MeFISH)
• Methylation-sensitive restriction enzyme
• Electrochemical oxidation
• Third Generation (SMRT-seq, Nanopore)
Methods
Generic bioinformatic analysis workflow for
bisulfite sequencing data
Seq Reads
Quality
Control
Alignment
Methylation
Call
Visualization
Annotation
Diff. Methyl.
Region
Biomarker
Candidates
Flowchart of EpiMOLAS
Biomarker
CandidatesSeq Reads
Trim Galore!
FastQC
Bowtie
Visualization
Annotation
Bismark
Extract.
EpiMOLAS_webDocMethyl
Bismark
mtable
Metric for Methylation Profiling - mtable
Gene
Genome
C
C
C
C
at least four counts of
methylated and
unmethylated cytosine
at least five qualified
observed cytosines
1 16425704 + 0 8 CHH CTC
1 16425710 + 6 6 CHG CAG
1 16425714 + 10 5 CHH CAA
1 16425717 + 6 0 CHG CTG
1 16425719 + 4 0 CG CGC
Bismark genome-wide cytosine report
Sequence
depth
Input Output
EpiMolas.jar
CG
CHG
CHH
Su et al. TEA: the epigenome platform for Arabidopsis methylome study. BMC Genomics 17(Suppl 13): 1027 (2016)
An Example of mtable
Ensembl
Gene ID
Methylation level of gene body and
promoter regions according to three
cytosine methylation contexts
less than five
qualified
observed
cytosines
Implementation and Results
Overview of EpiMOLAS
Architecture of DocMethyl and EpiMOLAS_web
DocMethyl
• Docker
• Galaxy
Infrastructure
Operating System
Docker Daemon
Galaxy platform
TrimGalore
FastQC
Bismark
EpiMolas.jar
Workflow
mtable
Methylation
Report
Raw
data
Input DocMethyl
DocMethyl
output
QC Report
Trimmed
Data
Reference
Genome
Gene
Annotation
A Workflow In DocMethyl
Trim
Sequences
Check QC of
Trimmed reads
Map Reads
on Genome
Extract Methylated
Cytosines
Generate Output
of Submission to
EpiMOLAS_web
• Trim Galore
• FastQC
• Bismark
• EpiMolas.jar
Steps and Output Files of the Workflow
Full text Search
DMGs (select diff
methylation Genes)
mC Threshold
Import Genelist
KEGG Global View
Gene List Analysis
Generate New Gene
List for further
Analysis in Built-in
Approaches
Modules Inside EpiMOLAS_web
Find Genes of Interest
Gene Profile
Gene List Analysis
Visualization Modules
• Boxplot
• Circos plot
• Heatmap
• Potein network
Discussion
Discussion
• It is hard to find the significant DMG according to DMG approaches.
Long region of gene size in length amortize the effect of DNA
methylation.
• Approximately 80% of all CpGs are located in repetitive sequences
and centromeric repeat regions of chromosomes, and are heavily
methylated.
• We list the comparison among several platforms and tools for
genome-wide DNA methylation analysis.
Comparison of each platform
EpiMOLAS BAT ENCODE
-WGBS
snakePipe NGI-
MethylSeq
Mint RnBeads
2.0
MethylPipe MethylSig Methylkit
Environment Docker,
Galaxy,
Web server
Docker Shell
script
Bioconda
Snakemake
Docker
Nexflow
Galaxy R package R package R package R package
Sequence
context
CG, CHG,
CHH
CG CG, CHG,
CHH
CG, CHG,
CHH
CG, CHG,
CHH
CG CG CG, CHG, CHH CG, CHG,
CHH
CG, CHG,
CHH
Start with raw reads raw
reads
raw reads raw reads raw reads raw
reads
Methyl.
Call file
Methyl. Call
file
Methyl.
Call file
Methyl.
Call file
Docker
Container
+ + – – + – NA NA NA NA
Web
interface
+
(Galaxy)
– + – – +
(Galaxy)
NA NA NA NA
Adapter and
base quality
trimming
+ – + + + + – NA NA NA
QC report + + + + + + + NA NA NA
Read
mapping
+ + + + + + – NA NA NA
Methylation
sites calling
+ + + + + + – NA NA NA
EpiMOLAS BAT ENCODE
-WGBS
snakePipe NGI-
MethylSeq
Mint RnBeads
2.0
MethylPipe MethylSig Methylkit
Discriptive
statistics
+ + – + + + + + + +
Find DMRs +
(simple)
+
(metilene)
– +
(metilene)
– +
(DSS)
+ + + +
Clustering
analysis
+
(heatmap)
+
(heatmap)
– +
(heatmap)
– – +
(heatmap)
– +
(heatmap)
–
GO term
enrichment
+ – – – – + + + – –
KEGG
pathway
enrichment
+ – – – – – – – – –
TFBS
enrichment
– – – – – – – – + –
Genome-
wide
visualization
+
(circos plot)
+
(circos
plot)
– – – – + – – –
Interactive
Quantitative
analysis
+ – – – – – +
(R Shiny)
NA NA NA
Data
browing and
retrieving UI
+ – – – – – +
(R Shiny)
NA NA NA
EpiMOLAS BAT ENCODE
-WGBS
snakePipe NGI-
MethylSeq
Mint RnBeads
2.0
MethylPipe MethylSig Methylkit
Gene list
with
tracking logs
+ – – – – – NA NA NA NA
Venn
analysis on
gene lists
+ – – – – – NA – – –
Interplay
with other
high
throughput
data
protein
Interactome
transcript
ome
– RNA-seq,
ChIP-seq,
ATAC-seq,
Hi-C etc.
– 5-
hmc
– RNA-seq,
ChIP-seq,
Dnase-seq
– –
Conclusion
Conclusion
• We present an integrated two-phase web-based ‘gene-centric’
framework for WGBS data from raw data processing to downstream
analysis.
• EpiMOLAS helps users deal with their WGBS data and alleviates the
burden on conducting reproducible analysis of public datasets.
https://hub.docker.com/r/lsbnb/docmethyl/
http://symbiosis.iis.sinica.edu.tw/epimolas/
Thank you for your attention !
Photo by KageHuang/Getty Images

More Related Content

What's hot

AGBT2017 Reference Workshop: Lindsay
AGBT2017 Reference Workshop: LindsayAGBT2017 Reference Workshop: Lindsay
AGBT2017 Reference Workshop: Lindsay
Genome Reference Consortium
 
Creating Reference-Grade Human Genome Assemblies
Creating Reference-Grade Human Genome AssembliesCreating Reference-Grade Human Genome Assemblies
Creating Reference-Grade Human Genome Assemblies
Genome Reference Consortium
 
Understanding the reference assembly: CSHL Hackathon
Understanding the reference assembly: CSHL HackathonUnderstanding the reference assembly: CSHL Hackathon
Understanding the reference assembly: CSHL Hackathon
Genome Reference Consortium
 
Exploiting long read sequencing technology to build a substantially improved ...
Exploiting long read sequencing technology to build a substantially improved ...Exploiting long read sequencing technology to build a substantially improved ...
Exploiting long read sequencing technology to build a substantially improved ...
Genome Reference Consortium
 
Ashg2017 workshop tg
Ashg2017 workshop tgAshg2017 workshop tg
Ashg2017 workshop tg
Genome Reference Consortium
 
Variation graphs and population assisted genome inference copy
Variation graphs and population assisted genome inference copyVariation graphs and population assisted genome inference copy
Variation graphs and population assisted genome inference copy
Genome Reference Consortium
 
Generating high-quality reference human genomes using PromethION nanopore seq...
Generating high-quality reference human genomes using PromethION nanopore seq...Generating high-quality reference human genomes using PromethION nanopore seq...
Generating high-quality reference human genomes using PromethION nanopore seq...
Miten Jain
 
ABGT 2016 Workshop Schneider
ABGT 2016 Workshop SchneiderABGT 2016 Workshop Schneider
ABGT 2016 Workshop Schneider
Genome Reference Consortium
 
AGBT2017 Reference Workshop: Fulton
AGBT2017 Reference Workshop: FultonAGBT2017 Reference Workshop: Fulton
AGBT2017 Reference Workshop: Fulton
Genome Reference Consortium
 
Ashg2014 grc workshop_schneider
Ashg2014 grc workshop_schneiderAshg2014 grc workshop_schneider
Ashg2014 grc workshop_schneider
Genome Reference Consortium
 
Ashg2015 schneider final
Ashg2015 schneider finalAshg2015 schneider final
Ashg2015 schneider final
Genome Reference Consortium
 
Ashg2015 grc-pruitt
Ashg2015 grc-pruittAshg2015 grc-pruitt
Ashg2015 grc-pruitt
Genome Reference Consortium
 
Alignment Approaches II: Long Reads
Alignment Approaches II: Long ReadsAlignment Approaches II: Long Reads
Alignment Approaches II: Long Reads
Genome Reference Consortium
 
101717.kh miga ashg_grc
101717.kh miga ashg_grc101717.kh miga ashg_grc
101717.kh miga ashg_grc
Genome Reference Consortium
 
GRCWorkshop_geval_1KG_slides
GRCWorkshop_geval_1KG_slidesGRCWorkshop_geval_1KG_slides
GRCWorkshop_geval_1KG_slides
Genome Reference Consortium
 
New data from giab genomes pacbio ccs
New data from giab genomes   pacbio ccsNew data from giab genomes   pacbio ccs
New data from giab genomes pacbio ccs
GenomeInABottle
 
New data from giab genomes promethion
New data from giab genomes   promethionNew data from giab genomes   promethion
New data from giab genomes promethion
GenomeInABottle
 
TAGC2016 schneider
TAGC2016 schneiderTAGC2016 schneider
TAGC2016 schneider
Genome Reference Consortium
 

What's hot (18)

AGBT2017 Reference Workshop: Lindsay
AGBT2017 Reference Workshop: LindsayAGBT2017 Reference Workshop: Lindsay
AGBT2017 Reference Workshop: Lindsay
 
Creating Reference-Grade Human Genome Assemblies
Creating Reference-Grade Human Genome AssembliesCreating Reference-Grade Human Genome Assemblies
Creating Reference-Grade Human Genome Assemblies
 
Understanding the reference assembly: CSHL Hackathon
Understanding the reference assembly: CSHL HackathonUnderstanding the reference assembly: CSHL Hackathon
Understanding the reference assembly: CSHL Hackathon
 
Exploiting long read sequencing technology to build a substantially improved ...
Exploiting long read sequencing technology to build a substantially improved ...Exploiting long read sequencing technology to build a substantially improved ...
Exploiting long read sequencing technology to build a substantially improved ...
 
Ashg2017 workshop tg
Ashg2017 workshop tgAshg2017 workshop tg
Ashg2017 workshop tg
 
Variation graphs and population assisted genome inference copy
Variation graphs and population assisted genome inference copyVariation graphs and population assisted genome inference copy
Variation graphs and population assisted genome inference copy
 
Generating high-quality reference human genomes using PromethION nanopore seq...
Generating high-quality reference human genomes using PromethION nanopore seq...Generating high-quality reference human genomes using PromethION nanopore seq...
Generating high-quality reference human genomes using PromethION nanopore seq...
 
ABGT 2016 Workshop Schneider
ABGT 2016 Workshop SchneiderABGT 2016 Workshop Schneider
ABGT 2016 Workshop Schneider
 
AGBT2017 Reference Workshop: Fulton
AGBT2017 Reference Workshop: FultonAGBT2017 Reference Workshop: Fulton
AGBT2017 Reference Workshop: Fulton
 
Ashg2014 grc workshop_schneider
Ashg2014 grc workshop_schneiderAshg2014 grc workshop_schneider
Ashg2014 grc workshop_schneider
 
Ashg2015 schneider final
Ashg2015 schneider finalAshg2015 schneider final
Ashg2015 schneider final
 
Ashg2015 grc-pruitt
Ashg2015 grc-pruittAshg2015 grc-pruitt
Ashg2015 grc-pruitt
 
Alignment Approaches II: Long Reads
Alignment Approaches II: Long ReadsAlignment Approaches II: Long Reads
Alignment Approaches II: Long Reads
 
101717.kh miga ashg_grc
101717.kh miga ashg_grc101717.kh miga ashg_grc
101717.kh miga ashg_grc
 
GRCWorkshop_geval_1KG_slides
GRCWorkshop_geval_1KG_slidesGRCWorkshop_geval_1KG_slides
GRCWorkshop_geval_1KG_slides
 
New data from giab genomes pacbio ccs
New data from giab genomes   pacbio ccsNew data from giab genomes   pacbio ccs
New data from giab genomes pacbio ccs
 
New data from giab genomes promethion
New data from giab genomes   promethionNew data from giab genomes   promethion
New data from giab genomes promethion
 
TAGC2016 schneider
TAGC2016 schneiderTAGC2016 schneider
TAGC2016 schneider
 

Similar to EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation Analysis

The Transformation of Systems Biology Into A Large Data Science
The Transformation of Systems Biology Into A Large Data ScienceThe Transformation of Systems Biology Into A Large Data Science
The Transformation of Systems Biology Into A Large Data Science
Robert Grossman
 
2014 khmer protocols
2014 khmer protocols2014 khmer protocols
2014 khmer protocols
c.titus.brown
 
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics WorkshopLopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Nuria Lopez-Bigas
 
Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2
Li Shen
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
GigaScience, BGI Hong Kong
 
CDAC 2018 Pellegrini clustering ppi networks
CDAC 2018 Pellegrini clustering ppi networksCDAC 2018 Pellegrini clustering ppi networks
CDAC 2018 Pellegrini clustering ppi networks
Marco Antoniotti
 
Cloud bioinformatics 2
Cloud bioinformatics 2Cloud bioinformatics 2
Cloud bioinformatics 2
ARPUTHA SELVARAJ A
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
GenomeInABottle
 
Metabolic network mapping for metabolomics
Metabolic network mapping for metabolomicsMetabolic network mapping for metabolomics
Metabolic network mapping for metabolomics
Dinesh Barupal
 
Next-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plotNext-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plot
Li Shen
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
Silpa87
 
Mpp Rsv 2008 Public
Mpp Rsv 2008 PublicMpp Rsv 2008 Public
Mpp Rsv 2008 Public
lab13unisa
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
David Ruau
 
LogMap: Logic-based and Scalable Ontology Matching
LogMap: Logic-based and Scalable Ontology MatchingLogMap: Logic-based and Scalable Ontology Matching
LogMap: Logic-based and Scalable Ontology Matching
Ernesto Jimenez Ruiz
 
Generating high-quality human reference genomes using PromethION nanopore seq...
Generating high-quality human reference genomes using PromethION nanopore seq...Generating high-quality human reference genomes using PromethION nanopore seq...
Generating high-quality human reference genomes using PromethION nanopore seq...
Miten Jain
 
Knowledge Sharing - aCCCeso
Knowledge Sharing - aCCCesoKnowledge Sharing - aCCCeso
Knowledge Sharing - aCCCeso
Kaitlin Thaney
 
Thesis def
Thesis defThesis def
Thesis def
Jay Vyas
 
How we revealed genomes secrets?
How we revealed genomes secrets? How we revealed genomes secrets?
How we revealed genomes secrets?
ehsan sepahi
 
CIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptxCIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptx
Fatma Sayed Ibrahim
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and How
HorizonDiscovery
 

Similar to EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation Analysis (20)

The Transformation of Systems Biology Into A Large Data Science
The Transformation of Systems Biology Into A Large Data ScienceThe Transformation of Systems Biology Into A Large Data Science
The Transformation of Systems Biology Into A Large Data Science
 
2014 khmer protocols
2014 khmer protocols2014 khmer protocols
2014 khmer protocols
 
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics WorkshopLopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
 
Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
 
CDAC 2018 Pellegrini clustering ppi networks
CDAC 2018 Pellegrini clustering ppi networksCDAC 2018 Pellegrini clustering ppi networks
CDAC 2018 Pellegrini clustering ppi networks
 
Cloud bioinformatics 2
Cloud bioinformatics 2Cloud bioinformatics 2
Cloud bioinformatics 2
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
 
Metabolic network mapping for metabolomics
Metabolic network mapping for metabolomicsMetabolic network mapping for metabolomics
Metabolic network mapping for metabolomics
 
Next-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plotNext-generation sequencing format and visualization with ngs.plot
Next-generation sequencing format and visualization with ngs.plot
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
Mpp Rsv 2008 Public
Mpp Rsv 2008 PublicMpp Rsv 2008 Public
Mpp Rsv 2008 Public
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
LogMap: Logic-based and Scalable Ontology Matching
LogMap: Logic-based and Scalable Ontology MatchingLogMap: Logic-based and Scalable Ontology Matching
LogMap: Logic-based and Scalable Ontology Matching
 
Generating high-quality human reference genomes using PromethION nanopore seq...
Generating high-quality human reference genomes using PromethION nanopore seq...Generating high-quality human reference genomes using PromethION nanopore seq...
Generating high-quality human reference genomes using PromethION nanopore seq...
 
Knowledge Sharing - aCCCeso
Knowledge Sharing - aCCCesoKnowledge Sharing - aCCCeso
Knowledge Sharing - aCCCeso
 
Thesis def
Thesis defThesis def
Thesis def
 
How we revealed genomes secrets?
How we revealed genomes secrets? How we revealed genomes secrets?
How we revealed genomes secrets?
 
CIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptxCIBEC Presentation Fatma Sayed.pptx
CIBEC Presentation Fatma Sayed.pptx
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and How
 

Recently uploaded

GreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-JurisicGreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-Jurisic
Green Software Development
 
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
kalichargn70th171
 
SQL Accounting Software Brochure Malaysia
SQL Accounting Software Brochure MalaysiaSQL Accounting Software Brochure Malaysia
SQL Accounting Software Brochure Malaysia
GohKiangHock
 
WWDC 2024 Keynote Review: For CocoaCoders Austin
WWDC 2024 Keynote Review: For CocoaCoders AustinWWDC 2024 Keynote Review: For CocoaCoders Austin
WWDC 2024 Keynote Review: For CocoaCoders Austin
Patrick Weigel
 
Top 9 Trends in Cybersecurity for 2024.pptx
Top 9 Trends in Cybersecurity for 2024.pptxTop 9 Trends in Cybersecurity for 2024.pptx
Top 9 Trends in Cybersecurity for 2024.pptx
devvsandy
 
Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
Remote DBA Services
 
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
Bert Jan Schrijver
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Łukasz Chruściel
 
Energy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina JonuziEnergy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina Jonuzi
Green Software Development
 
Unveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdfUnveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdf
brainerhub1
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
Remote DBA Services
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
dakas1
 
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
mz5nrf0n
 
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CDKuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
rodomar2
 
Fundamentals of Programming and Language Processors
Fundamentals of Programming and Language ProcessorsFundamentals of Programming and Language Processors
Fundamentals of Programming and Language Processors
Rakesh Kumar R
 
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdfTop Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
VALiNTRY360
 
Webinar On-Demand: Using Flutter for Embedded
Webinar On-Demand: Using Flutter for EmbeddedWebinar On-Demand: Using Flutter for Embedded
Webinar On-Demand: Using Flutter for Embedded
ICS
 
Requirement Traceability in Xen Functional Safety
Requirement Traceability in Xen Functional SafetyRequirement Traceability in Xen Functional Safety
Requirement Traceability in Xen Functional Safety
Ayan Halder
 
ALGIT - Assembly Line for Green IT - Numbers, Data, Facts
ALGIT - Assembly Line for Green IT - Numbers, Data, FactsALGIT - Assembly Line for Green IT - Numbers, Data, Facts
ALGIT - Assembly Line for Green IT - Numbers, Data, Facts
Green Software Development
 
Odoo ERP Vs. Traditional ERP Systems – A Comparative Analysis
Odoo ERP Vs. Traditional ERP Systems – A Comparative AnalysisOdoo ERP Vs. Traditional ERP Systems – A Comparative Analysis
Odoo ERP Vs. Traditional ERP Systems – A Comparative Analysis
Envertis Software Solutions
 

Recently uploaded (20)

GreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-JurisicGreenCode-A-VSCode-Plugin--Dario-Jurisic
GreenCode-A-VSCode-Plugin--Dario-Jurisic
 
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf8 Best Automated Android App Testing Tool and Framework in 2024.pdf
8 Best Automated Android App Testing Tool and Framework in 2024.pdf
 
SQL Accounting Software Brochure Malaysia
SQL Accounting Software Brochure MalaysiaSQL Accounting Software Brochure Malaysia
SQL Accounting Software Brochure Malaysia
 
WWDC 2024 Keynote Review: For CocoaCoders Austin
WWDC 2024 Keynote Review: For CocoaCoders AustinWWDC 2024 Keynote Review: For CocoaCoders Austin
WWDC 2024 Keynote Review: For CocoaCoders Austin
 
Top 9 Trends in Cybersecurity for 2024.pptx
Top 9 Trends in Cybersecurity for 2024.pptxTop 9 Trends in Cybersecurity for 2024.pptx
Top 9 Trends in Cybersecurity for 2024.pptx
 
Oracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptxOracle 23c New Features For DBAs and Developers.pptx
Oracle 23c New Features For DBAs and Developers.pptx
 
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...
 
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
 
Energy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina JonuziEnergy consumption of Database Management - Florina Jonuzi
Energy consumption of Database Management - Florina Jonuzi
 
Unveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdfUnveiling the Advantages of Agile Software Development.pdf
Unveiling the Advantages of Agile Software Development.pdf
 
Oracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptxOracle Database 19c New Features for DBAs and Developers.pptx
Oracle Database 19c New Features for DBAs and Developers.pptx
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
 
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样
 
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CDKuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD
 
Fundamentals of Programming and Language Processors
Fundamentals of Programming and Language ProcessorsFundamentals of Programming and Language Processors
Fundamentals of Programming and Language Processors
 
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdfTop Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf
 
Webinar On-Demand: Using Flutter for Embedded
Webinar On-Demand: Using Flutter for EmbeddedWebinar On-Demand: Using Flutter for Embedded
Webinar On-Demand: Using Flutter for Embedded
 
Requirement Traceability in Xen Functional Safety
Requirement Traceability in Xen Functional SafetyRequirement Traceability in Xen Functional Safety
Requirement Traceability in Xen Functional Safety
 
ALGIT - Assembly Line for Green IT - Numbers, Data, Facts
ALGIT - Assembly Line for Green IT - Numbers, Data, FactsALGIT - Assembly Line for Green IT - Numbers, Data, Facts
ALGIT - Assembly Line for Green IT - Numbers, Data, Facts
 
Odoo ERP Vs. Traditional ERP Systems – A Comparative Analysis
Odoo ERP Vs. Traditional ERP Systems – A Comparative AnalysisOdoo ERP Vs. Traditional ERP Systems – A Comparative Analysis
Odoo ERP Vs. Traditional ERP Systems – A Comparative Analysis
 

EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation Analysis

  • 1. EpiMOLAS: An Intuitive Web-based Framework for Genome-wide DNA Methylation Analysis Presented By Sheng-Yao Su Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica Institute of Biomedical Informatics, National Yang-Ming University TAIWAN Sep 10, 2019
  • 2. Outline • Introduction • Methods • Implementations and Results • EpiMOLAS consists of DocMethyl and EpiMOLAS_web • Discussion • Conclusion
  • 4. Epigenomics • Epi- (upon, above, beyond) genomics (DNA sequence) • Waddington proposed this term in 1940s. • Epigenomics is the study of the complete set of epigenetic modification on the genetic material of a cell (wiki)
  • 6. DNA methylation – an epigenetic mark of cellular memory DNA methylation: an epigenetic mark of cellular memory Experimental & Molecular Medicine volume 49, page e322 (2017)
  • 8. Sodium Bisulfite treatment Correct conversion : C -> U -> T Correct conversion : mC -> mC -> C incorrect conversion : mC -> U -> T Bisulfite treatment PCR amplification Unmethylated DNA Methylated DNA Original sequence CCGTCGACGT CmCGTmCGAmCGT Bisulfite converted UUGTUGAUGT UmCGTmCGAmCGT PCR product TTGTTGATGT TCGTCGACGT Incomplete conversion
  • 9. Detect DNA modification changes • Bisulfite conversion treatment • Reduced Representation Bisulfite Sequencing (RRBS) • Whole Genome Bisulfite Sequencing (WGBS) • Bisulfite-free • Anti-methylcytosine Antibody • Methyl-CpG binding domain (MBD) • Chemical labeling (MeFISH) • Methylation-sensitive restriction enzyme • Electrochemical oxidation • Third Generation (SMRT-seq, Nanopore)
  • 11. Generic bioinformatic analysis workflow for bisulfite sequencing data Seq Reads Quality Control Alignment Methylation Call Visualization Annotation Diff. Methyl. Region Biomarker Candidates
  • 12. Flowchart of EpiMOLAS Biomarker CandidatesSeq Reads Trim Galore! FastQC Bowtie Visualization Annotation Bismark Extract. EpiMOLAS_webDocMethyl Bismark mtable
  • 13. Metric for Methylation Profiling - mtable Gene Genome C C C C at least four counts of methylated and unmethylated cytosine at least five qualified observed cytosines 1 16425704 + 0 8 CHH CTC 1 16425710 + 6 6 CHG CAG 1 16425714 + 10 5 CHH CAA 1 16425717 + 6 0 CHG CTG 1 16425719 + 4 0 CG CGC Bismark genome-wide cytosine report Sequence depth Input Output EpiMolas.jar CG CHG CHH Su et al. TEA: the epigenome platform for Arabidopsis methylome study. BMC Genomics 17(Suppl 13): 1027 (2016)
  • 14. An Example of mtable Ensembl Gene ID Methylation level of gene body and promoter regions according to three cytosine methylation contexts less than five qualified observed cytosines
  • 17. Architecture of DocMethyl and EpiMOLAS_web
  • 18. DocMethyl • Docker • Galaxy Infrastructure Operating System Docker Daemon Galaxy platform TrimGalore FastQC Bismark EpiMolas.jar Workflow mtable Methylation Report Raw data Input DocMethyl DocMethyl output QC Report Trimmed Data Reference Genome Gene Annotation
  • 19. A Workflow In DocMethyl Trim Sequences Check QC of Trimmed reads Map Reads on Genome Extract Methylated Cytosines Generate Output of Submission to EpiMOLAS_web • Trim Galore • FastQC • Bismark • EpiMolas.jar
  • 20. Steps and Output Files of the Workflow
  • 21. Full text Search DMGs (select diff methylation Genes) mC Threshold Import Genelist KEGG Global View Gene List Analysis Generate New Gene List for further Analysis in Built-in Approaches Modules Inside EpiMOLAS_web
  • 22. Find Genes of Interest
  • 25. Visualization Modules • Boxplot • Circos plot • Heatmap • Potein network
  • 27. Discussion • It is hard to find the significant DMG according to DMG approaches. Long region of gene size in length amortize the effect of DNA methylation. • Approximately 80% of all CpGs are located in repetitive sequences and centromeric repeat regions of chromosomes, and are heavily methylated. • We list the comparison among several platforms and tools for genome-wide DNA methylation analysis.
  • 28. Comparison of each platform EpiMOLAS BAT ENCODE -WGBS snakePipe NGI- MethylSeq Mint RnBeads 2.0 MethylPipe MethylSig Methylkit Environment Docker, Galaxy, Web server Docker Shell script Bioconda Snakemake Docker Nexflow Galaxy R package R package R package R package Sequence context CG, CHG, CHH CG CG, CHG, CHH CG, CHG, CHH CG, CHG, CHH CG CG CG, CHG, CHH CG, CHG, CHH CG, CHG, CHH Start with raw reads raw reads raw reads raw reads raw reads raw reads Methyl. Call file Methyl. Call file Methyl. Call file Methyl. Call file Docker Container + + – – + – NA NA NA NA Web interface + (Galaxy) – + – – + (Galaxy) NA NA NA NA Adapter and base quality trimming + – + + + + – NA NA NA QC report + + + + + + + NA NA NA Read mapping + + + + + + – NA NA NA Methylation sites calling + + + + + + – NA NA NA
  • 29. EpiMOLAS BAT ENCODE -WGBS snakePipe NGI- MethylSeq Mint RnBeads 2.0 MethylPipe MethylSig Methylkit Discriptive statistics + + – + + + + + + + Find DMRs + (simple) + (metilene) – + (metilene) – + (DSS) + + + + Clustering analysis + (heatmap) + (heatmap) – + (heatmap) – – + (heatmap) – + (heatmap) – GO term enrichment + – – – – + + + – – KEGG pathway enrichment + – – – – – – – – – TFBS enrichment – – – – – – – – + – Genome- wide visualization + (circos plot) + (circos plot) – – – – + – – – Interactive Quantitative analysis + – – – – – + (R Shiny) NA NA NA Data browing and retrieving UI + – – – – – + (R Shiny) NA NA NA
  • 30. EpiMOLAS BAT ENCODE -WGBS snakePipe NGI- MethylSeq Mint RnBeads 2.0 MethylPipe MethylSig Methylkit Gene list with tracking logs + – – – – – NA NA NA NA Venn analysis on gene lists + – – – – – NA – – – Interplay with other high throughput data protein Interactome transcript ome – RNA-seq, ChIP-seq, ATAC-seq, Hi-C etc. – 5- hmc – RNA-seq, ChIP-seq, Dnase-seq – –
  • 32. Conclusion • We present an integrated two-phase web-based ‘gene-centric’ framework for WGBS data from raw data processing to downstream analysis. • EpiMOLAS helps users deal with their WGBS data and alleviates the burden on conducting reproducible analysis of public datasets.
  • 35. Thank you for your attention ! Photo by KageHuang/Getty Images