Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4/11/2014)
1. Keiichiro Ono
UC, San Diego
Trey Ideker Lab
Bioinformatics
Summit 2014
4/11/2014
Cytoscape
An Open Source Platform for
Biological Network Analysis and Visualization
2. -
Thanks for Attending!
- Keiichiro Ono
- Cytoscape Core Developer since 2005
- Core module design & implementation
- Area of Interest: Data Integration & Visualization
- University of California, San Diego Trey Ideker Lab
6. An Open Source Platform for Biological Network Data
Integration, Analysis and Visualization
Cytoscape
7. Cytoscape
- Open Source (LGPL)
- Free for both commercial and academic use
- Developed and maintained by universities,
companies, and research institutions
- De-facto standard software in biological network
research community
- Expandable by Apps
- This is why Cytoscape is a Platform, not a
simple desktop application
15. Nodes and Edges in Biology
- Protein - Protein
- Protein - DNA
- Genetic (Epistasis)
- Synthetic lethality
- Biochemical
Reactions
- Compound - Enzyme - Compound
Cartoon representation of a complex between DNA and the protein p53
Node
Node
Edge
33. is NOT a...
- Cytoscape is a powerful tool, but cannot do
everything
- Simulator
- Fully-featured Pathway diagram editor
- Statistical network analysis tool suite
- Still, you can implement these as Apps though
50. NCBI Gene ID 672
On Chromosome 17
GO Terms
DNA Repair
Cell Cycle
DNA Binding
Ensemble ID
ENSG00000012048
BRCA1
51. Data Tables for Cytoscape
- Example:
- Numeric
- Gene expression profiles
- Network statistics calculated in other
applications, such as R
- Confidence scores for edges
- Text (or categorical)
- GO annotation for genes
- List of genes related to disease X
- Targets for FDA approved drugs
- Genes on KEGG Pathway Y
- Clusters / group / community calculated
in external programs
- …
52. Your Data Sets
- Anything saved as a table can be
loaded into Cytoscape
- Excel
- Tab Delimited Document
- CSV
- As long as proper mapping key is
available, Cytoscape can map them to
your networks.
58. Network Analysis
- Filtering
- Calculate network statistics by
Network Analyzer
- Degree distribution,centrality, etc.
- Advanced analysis by Apps
59. Filtering (Selection)
- Simple, but powerful feature to extract sub
networks from large data sets
- Select nodes and edges with specific
conditions
- Pick nodes with degree > 5
- Select edges extracted from publication
X
- Find nodes on KEGG Pathway X
65. Summary: Selection
- Create filter from your biological question:
- Select all nodes annotated with GO
term “DNA repair” and have two or
more know interaction
- Select all genes directly interact with
brca1, brca2, etc.
- Extract subnetwork by CTR+N
68. - Goal: Help others to
understand your data
- Emphasize what you
want to tell
- Use color, shape,
size of objects
effectively!
- Excellent resource for
data visualization
- Tamara Munzner’s Web Site:
http://www.cs.ubc.ca/~tmm/
Data Visualization
73. Visual Style
- Example Mappings:
- Expression profiles to node color
- Object type (protein, compounds,
complex, etc.) to node shape
- Interaction type (Protein-Protein,
Protein-DNA, etc.) to edge line
style
84. Cytoscape Apps
- Extension programs to
add new features to
Cytoscape (were called
Plugins)
- Large App developer/
user community
- This is why Cytoscape
is so successful in life
science community!
86. Quick Overview of Apps
A travel guide to Cytoscape plugins
!
Rintaro Saito, Michael E Smoot, Keiichiro Ono, Johannes Ruscheinski, Peng-
Liang Wang, Samad Lotia, Alexander R Pico, Gary D Bader, Trey Ideker (2012)
Nature Methods 9 (11) p. 1069-1076
87. A travel guide to
Cytoscape plugins
!
Rintaro Saito, Michael E Smoot, Keiichiro Ono, Johannes
Ruscheinski, Peng-Liang Wang, Samad Lotia, Alexander R Pico,
Gary D Bader, Trey Ideker (2012)
Nature Methods 9 (11) p. 1069-1076
88. A travel guide to Cytoscape plugins
!
Rintaro Saito, Michael E Smoot, Keiichiro Ono, Johannes Ruscheinski, Peng-Liang Wang,
Samad Lotia, Alexander R Pico, Gary D Bader, Trey Ideker (2012)
Nature Methods 9 (11) p. 1069-1076
89. Agenda
- What is Cytoscape?
- Data Integration, Analysis, and
Visualization with Cytoscape
- Cytoscape Ecosystem
- Cytoscape Future Plan
91. Cytoscape Family
- Version 2.x - Legacy version
- Last release for 2.x is 2.8.3.
- Version 3.x - Current production version
- Latest version: 3.1.0
- Important Note: Apps for 2.x is not
compatible with 3.x
(We have similar problem like Python…)
97. What is cytoscape.js?
A Javascript Library for network visualization,
not a web application!
Need to write some code to use it on the web browsers…
98. Complete desktop
application for network
analysis and visualization
!
Written in Java
!
Expandable by Apps
!
For Users
A Javascript Library for
network visualization,
not a web application!
!
Written in JavaScript
!
Expandable by Extensions
!
For Developers
JS
102. Export to HTML5 Session
Feature for users: you can view networks on web browsers
with Cytoscape.js
Under development… We need early adapters for testing!
JS
105. Future Plan
- Integration to external
tools
- Access from
Python, R, Perl, etc.
- More integration to
Cytoscape.js
- Cytoscape
Cyberinfrastructure
(Cytoscape CI)
106. Cytoscape CI: Background
- The size of data biologists have
to analyze is still growing
- Desktop machines are
powerful, but not enough for
large scale data analysis
- Clusters / Clouds
- Using multiple computing
resources as external service is
normal for biological data
analysis…
- Scalability
Service 1
Service 2
112. Collaboration
- Once you are ready to use Cytoscape for real-world
problems, National Resources for Network Biology
(NRNB) is always open for collaboration!
- NRNB Provides support for both of
- Scientific Research
- Application / Tool Development
- nrnb.org
113. -
- Two Google Groups
- cytoscape-
discuss@googlegroups.com
- cytoscape-
helpdesk@googlegroups.com
- ANY question is OK!
Getting Help