Intervene to
visualize
genomic region
sets
D. Denisko
Introduction
Description
Installation and
general usage
Modules
Venn diagram
Upset
Pairwise
heatmap
ShinyApp
Examples
Plots in
publications
Conclusion
Intervene: a tool for intersection and
visualization of multiple gene or genomic
region sets
Danielle Denisko
Tech Talk
June 13, 2018
Template from: www.overleaf.com
Outline
Introduction
Description
Installation and general usage
Modules
Venn diagram
Upset
Pairwise heatmap
ShinyApp
Examples
Plots in publications
Conclusion
Introduction
Summary:
intersect and visualize multiple sets of genes or genomic regions
novel aspect: works directly with genomic regions
Modules:
venn, upset, and pairwise
Description
Use:
command line and Shiny web interface
Implementation:
Python 2.7 (also works with Python 3.4, 3.5, and 3.6)
R
Built upon:
pybedtools
Seaborn
Matplotlib
UpSetR
Corrplot
Vennerable
heatmap.2
Installation and general usage
To install:
pip install intervene
conda install intervene
source code on Bitbucket and GitHub
Input:
genomic regions in BED, GFF, or VCF format
gene/name lists in plain text format
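As a rough illustration of the two input types, the sketch below parses a few BED-style lines (chromosome, start, end) and a plain-text name list. The function names `parse_bed_lines` and `parse_name_list` and the sample records are hypothetical and only for illustration; Intervene itself reads region files through pybedtools.

```python
def parse_bed_lines(lines):
    """Parse BED-style lines into (chrom, start, end) tuples.

    Only the first three tab-separated columns are used; blank lines and
    '#', 'track' or 'browser' header lines are skipped.
    """
    regions = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith(("#", "track", "browser")):
            continue
        chrom, start, end = line.split("\t")[:3]
        regions.append((chrom, int(start), int(end)))
    return regions


def parse_name_list(lines):
    """Parse a plain-text list with one gene/name per line into a set."""
    return {line.strip() for line in lines if line.strip()}


# Hypothetical sample input, standing in for file contents.
bed_text = ["track name=demo", "chr1\t100\t200\tpeak1", "chr2\t50\t80\tpeak2", ""]
genes_text = ["TP53", "BRCA1", "", "TP53"]

regions = parse_bed_lines(bed_text)
genes = parse_name_list(genes_text)
print(regions)
print(sorted(genes))
```

Name lists collapse naturally into sets, while regions need interval logic before they can be compared, which is the part Intervene delegates to bedtools.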
Installation and general usage
Workflow:
Khan A and Mathelier A. 2017. BMC Bioinformatics. 18:287.
Installation and general usage
There are three types of output plots:
Khan A and Mathelier A. 2017. BMC Bioinformatics. 18:287.
Installation and general usage
Users can pass any bedtools intersect option
via --bedtools-options.
Figure 1: There are over 15 options for specifying overlaps in
bedtools intersect.
Image source: BEDTools suite web page.
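To give a feel for what an overlap criterion changes, here is a minimal pure-Python sketch of a minimum-overlap-fraction rule, in the spirit of the `-f` option of bedtools intersect (minimum overlap as a fraction of the first interval). The helper names are hypothetical and this is not how bedtools is implemented; it only illustrates the idea.

```python
def overlap_length(a, b):
    """Number of shared bases between two (chrom, start, end) intervals."""
    if a[0] != b[0]:
        return 0
    return max(0, min(a[2], b[2]) - max(a[1], b[1]))


def overlaps(a, b, min_fraction=None):
    """True if a and b overlap.

    With min_fraction set, the shared bases must cover at least that
    fraction of interval a.
    """
    shared = overlap_length(a, b)
    if shared == 0:
        return False
    if min_fraction is None:
        return True
    return shared / (a[2] - a[1]) >= min_fraction


a = ("chr1", 100, 200)
b = ("chr1", 180, 300)
print(overlaps(a, b))                    # any overlap counts
print(overlaps(a, b, min_fraction=0.5))  # 20 of 100 bases is below 0.5
```

The same pair of peaks is counted as shared or not depending on the threshold, so the chosen options directly affect every number in the resulting plots.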
Modules
venn
classical Venn diagram
up to 6 sets
input: gene lists or genomic region sets
Shiny web interface provides some more flexibility:
weighted and unweighted Venn and Euler diagrams
different types of diagrams (up to 9 sets)
Modules
venn
Figure 2: Venn (leftmost column) vs. Euler diagrams.
Venn: show all 2^n possible regions
Euler: only show relevant (non-empty) regions
Image source: Wikipedia
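The Venn versus Euler distinction can be made concrete with a small sketch on name lists. It enumerates every combination of n sets (2^n regions once the area outside all sets is counted) and keeps only the non-empty ones, which is what an Euler diagram would draw. The function `exclusive_regions` and the toy gene names are hypothetical.

```python
from itertools import combinations


def exclusive_regions(sets):
    """Map each non-empty combination of set names to the items found in
    exactly those sets and in no others."""
    names = list(sets)
    regions = {}
    for k in range(1, len(names) + 1):
        for combo in combinations(names, k):
            inside = set.intersection(*(sets[n] for n in combo))
            others = [sets[n] for n in names if n not in combo]
            outside = set().union(*others)
            regions[combo] = inside - outside
    return regions


sets = {"A": {"g1", "g2", "g3"}, "B": {"g2", "g3", "g4"}, "C": {"g5"}}
regions = exclusive_regions(sets)
non_empty = {combo: items for combo, items in regions.items() if items}

print(len(regions) + 1)   # Venn regions, counting the area outside all sets
print(sorted(non_empty))  # regions an Euler diagram would keep
```

Because the number of regions doubles with every added set, diagrams beyond a handful of sets quickly become hard to read.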
Modules
venn
intervene venn -i RC13-KO.narrowPeak RC13-WT.narrowPeak S12-KO.narrowPeak S12-WT.narrowPeak \
    --names=KO-rep1,WT-rep1,KO-rep2,WT-rep2 -o ~/intervene_plots/ --save-overlaps \
    --title="RNF169 ChIP-seq peaks" --project=RNF169_KO_WT --figtype=png \
    --figsize 12 12 --fontsize=24 --dpi=450
Figure 3: Intervene venn diagram.
Modules
venn
Figure 4: Chow-Ruskey
Figure 5: Edwards
Figure 6: Squares
Figure 7: Battle
Modules
venn
Figure 8: Intervene venn diagram with 6 sets.
Modules
upset
easier to interpret when there are more than 4 sets
can be used effectively for 20-30 sets
Modules
upset
Motivation:
Figure 9: Edwards-Venn diagram for banana gene clusters
comparison. D’Hont A et al. 2012. Nature. 488:7410.
Modules
upset
intervene upset -i RC13-KO.narrowPeak RC13-WT.narrowPeak S12-KO.narrowPeak \
    S12-WT.narrowPeak --names=KO-rep1,WT-rep1,KO-rep2,WT-rep2 -o ~/intervene_plots/ \
    --figtype=png --figsize 12 12 --showshiny
Figure 10: Intervene upset diagram.
Modules
upset
#!/usr/bin/env Rscript
library("UpSetR")
png("Intervene_upset.png", width=7200, height=3600,
    res=300)
expressionInput <- c('WT-rep2'=2606,
    'KO-rep2'=109,
    'KO-rep2&WT-rep2'=44,
    'WT-rep1'=39967,
    'WT-rep1&WT-rep2'=12136,
    'WT-rep1&KO-rep2'=39,
    'WT-rep1&KO-rep2&WT-rep2'=114,
    'KO-rep1'=77,
    'KO-rep1&WT-rep2'=5,
    'KO-rep1&KO-rep2'=21,
    'KO-rep1&KO-rep2&WT-rep2'=22,
    'KO-rep1&WT-rep1'=112,
    'KO-rep1&WT-rep1&WT-rep2'=72,
    'KO-rep1&WT-rep1&KO-rep2'=92,
    'KO-rep1&WT-rep1&KO-rep2&WT-rep2'=290)
upset(fromExpression(expressionInput), nsets=4,
    nintersects=30, show.numbers="yes",
    main.bar.color="#ea5d4e",
    sets.bar.color="#317eab",
    empty.intersections=NULL,
    order.by="freq", number.angles=0,
    mainbar.y.label="No. of Intersections",
    sets.x.label="Set size",
    text.scale=c(2, 2, 2, 2, 2, 3)) # added to adjust font sizes
invisible(dev.off())
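The named vector in the script holds one count per combination of sets, with set names joined by `&`. Assuming these are exclusive counts (regions found in exactly that combination), which is how `fromExpression` interprets its input, the per-set totals shown as set-size bars can be recovered by summing every entry that mentions a set. The sketch below does this for the counts on this slide; `set_totals` is a hypothetical helper, not part of Intervene or UpSetR.

```python
def set_totals(exclusive_counts):
    """Sum exclusive intersection counts into a total size per set.

    Keys are '&'-joined set names, as in the UpSetR expression input.
    """
    totals = {}
    for label, count in exclusive_counts.items():
        for name in label.split("&"):
            totals[name] = totals.get(name, 0) + count
    return totals


# Counts copied from the generated R script on this slide.
expression_input = {
    "WT-rep2": 2606,
    "KO-rep2": 109,
    "KO-rep2&WT-rep2": 44,
    "WT-rep1": 39967,
    "WT-rep1&WT-rep2": 12136,
    "WT-rep1&KO-rep2": 39,
    "WT-rep1&KO-rep2&WT-rep2": 114,
    "KO-rep1": 77,
    "KO-rep1&WT-rep2": 5,
    "KO-rep1&KO-rep2": 21,
    "KO-rep1&KO-rep2&WT-rep2": 22,
    "KO-rep1&WT-rep1": 112,
    "KO-rep1&WT-rep1&WT-rep2": 72,
    "KO-rep1&WT-rep1&KO-rep2": 92,
    "KO-rep1&WT-rep1&KO-rep2&WT-rep2": 290,
}

totals = set_totals(expression_input)
for name, size in sorted(totals.items(), key=lambda kv: kv[1], reverse=True):
    print(name, size)
```

With four sets there are 15 non-empty combinations, which matches the 15 entries in the script.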
Modules
upset
Figure 11: UpSet diagram from web application.
Modules
pairwise
clustered heat map of pairwise associations
very large sets
metrics: number of overlaps, fraction of overlap,
Jaccard statistics, Fisher’s exact test, and
distribution of relative distances
heat map styles: tribar, dendrogram, pie, circle,
square, ellipse, etc.
clustering methods: various agglomerative options
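As a minimal sketch of two of the metrics above, the code below builds pairwise matrices of overlap counts and Jaccard indices for name sets. It assumes simple set membership; for genomic regions the corresponding statistics are computed on intervals by bedtools, so this is an illustration of the idea only. The function names and toy data are hypothetical.

```python
def jaccard(a, b):
    """Jaccard index: size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def pairwise_matrix(sets, metric):
    """Return (names, matrix) with metric applied to every ordered pair."""
    names = list(sets)
    matrix = [[metric(sets[r], sets[c]) for c in names] for r in names]
    return names, matrix


sets = {
    "A": {"g1", "g2", "g3", "g4"},
    "B": {"g3", "g4", "g5"},
    "C": {"g6"},
}
names, counts = pairwise_matrix(sets, lambda a, b: len(a & b))
_, jac = pairwise_matrix(sets, jaccard)

for name, row in zip(names, jac):
    print(name, [round(v, 2) for v in row])
```

A matrix like this is what the clustered heat map visualizes, one cell per pair of sets.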
Modules
pairwise
Figure 12: Intervene pairwise plot.
ShinyApp
Input:
does not accept genomic regions
venn: lists of names/genes/SNPs
upset: lists of names/genes/SNPs, binary data,
Intervene command line output listing all possible
combinations of sets
pairwise: lists of names/genes/SNPs, pairwise
matrix of number/fraction of overlap (can be
generated through Intervene on command line)
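The "binary data" input can be pictured as a membership table with one row per item and one 0/1 column per set. The sketch below derives such a table from name lists; the exact file layout the ShinyApp expects is not specified here, so the header and column order are assumptions, and `binary_membership` is a hypothetical helper.

```python
def binary_membership(sets):
    """Return (header, rows): one row per item with a 0/1 flag per set."""
    names = list(sets)
    items = sorted(set().union(*sets.values()))
    rows = [[item] + [int(item in sets[n]) for n in names] for item in items]
    return ["item"] + names, rows


sets = {"A": {"g1", "g2"}, "B": {"g2", "g3"}}
header, rows = binary_membership(sets)

print("\t".join(header))
for row in rows:
    print("\t".join(str(value) for value in row))
```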
ShinyApp
Figure 13: Screenshot from Intervene’s upset module
ShinyApp.
Plots in publications
Figure 14: Coregulated and antiregulated genes (with lncRNA)
in various yeast colonies.
Wilkinson D et al. 2018. Oxid Med Cell Longev.
Plots in publications
Figure 15: Differentially expressed genes over time after influenza
treatment. Black: up- or down-regulated genes, red: upregulated genes,
blue: downregulated genes. Top: PBMCs, bottom: B cells.
Jensen TL et al. 2018. F1000Research. 6:2162.
Conclusion
Pros:
simple command line tool for generating quick plots
convenient for visualizing genomic region sets
some customization (via output scripts and/or
ShinyApp)
Cons:
limited ability to customize, even in ShinyApp
command line offers fewer plot types than ShinyApp
