SlideShare a Scribd company logo
Barry Demchak, PhD
Ideker Lab, UC San Diego
bdemchak@cytoscape.org
November 6, 2016
RSG with DREAM & Cytoscape Workshop
Phoenix, AZ
2
Outline
• Biological Networks
– Why Networks?
– Biological Network Taxonomy
– Analytical Approaches
– Visualization
• Introduction to Cytoscape
• Break
• Hands on Tutorial
– Data import
– Layout and apps
3
Introductions
• Barry Demchak
– 2012-Current
• Chief Software Architect, Project Manager for National Resource for
Network Biology (NRNB, Ideker Lab)
– 2005-2012
• PhD Computer Science and Engineering, UC San Diego
– 1987-current
• President, Torrey Pines Software, Inc
– Cytoscape core team since 2012
– Architect of Cytoscape Cyberinfrastructure
4
Introductions
• Keiichiro Ono
– 2005-Current
• Research Associate / Software Engineer (UC, San Diego Trey Ideker
Lab / NRNB)
– Cytoscape core team since 2005
– Area of Interest
• Biological Data Integration
• Data Visualization
5
• Alex Pico, Ph.D.
– Executive Director, National Resource for Network Biology
and VP of Cytoscape Consortium
– Associate Director of Bioinformatics, Gladstone Institutes,
UCSF
– Cytoscape core developer since 2007
– Developer of several Cytoscape apps:
• WikiPathways, PathExplorer, GenMAPP-CS,
enhancedGraphics, Mosaic, NOA, Synapse Client, Variation
Acknowledgment
6
• You?
– Clinician
– Biologist
– Bioinformatician
– Computer Scientist
– Chemist
– Other?
Introductions
7
Introducing Cytoscape
0
500
1000
1500
2000
2500
3000
3500
4000
2/9/2014 8/9/2014 2/9/2015 8/9/2015 2/9/2016 8/9/2016
DailyExecutions
Date
Daily Cytoscape Executions
(2,032,889 - 2014 through 2016)
0
2000
4000
6000
8000
10000
12000
14000
16000
18000
20000
Jul-04 Jul-06 Jul-08 Jul-10 Jul-12 Jul-14 Jul-16
Cytoscape Downloads
United States
27%
India
8%
China
7%
United Kingdom
5%
Germany
5%
Japan
5%
France
4%
Canada
3%
Italy
3%
Spain
2%
Other
31%
Cytoscape.org Sessions (1,799,876 since 2012)
0
200
400
600
800
1000
1200
1400
1600
1800
Count
Year
Google Scholar
Cytoscape Citations 11/1/2016
8
Why Networks?
• Networks are…
• Commonly understood
• Structured to reduce complexity
• More efficient than tables
• Network tools allow…
Analysis
• Characterize network properties
• Identify hubs and subnets
• Classify, quantify and correlate, e.g.,
cluster nodes by associated data
Visualization
• Explore data overlays
• Interpret mechanisms, e.g., how a process
is modulated or attenuated by a stimulus
9
Applications of Network Biology
jActiveModules, UCSD
(based on Microarray Data)
PathBlast, UCSD
mCode, University of Toronto
(data agnostic)
DomainGraph, Max Planck Institute
• Gene Function Prediction
shows connections to sets of
genes/proteins involved in same
biological process
• Detection of protein
complexes/subnetworks
discover modularity & higher
order organization (motifs,
feedback loops)
• Network evolution
biological process(s)
conservation across species
• Prediction of interactions &
functional associations
statistically significant domain-
domain correlations in protein
interaction network to predict
protein-protein or genetic
interaction
10
Applications in Disease
• Identification of disease
subnetworks – identification
of disease network subnetworks
that are transcriptionally active
in disease.
• Subnetwork-based
diagnosis – source of
biomarkers for disease
classification, identify
interconnected genes whose
aggregate expression levels are
predictive of disease state
• Subnetwork-based gene
association – map common
pathway mechanisms affected
by collection of genotypes
(SNP, CNV)
Agilent Literature Search
Mondrian, MSKCC
PinnacleZ, UCSD
11
The Challenge
http://cytoscape-publications.tumblr.com/archive
12
Biological Network Taxonomy
• Pathways
– Signaling, Metabolic, Regulatory, etc
13
Biological Network Taxonomy
• Interactions
– Protein-Protein
– Protein-Ligand
– Domain-Domain
– Others
• Residue or atomic
• Cell-cell
• Epidemiology
• Social networks
14
Biological Network Taxonomy
• Similarity
– Protein-Protein
– Chemical similarity
– Ligand similarity (SEA)
– Others
• Tag clouds
• Topic maps
15
Biological Network Taxonomy
Where do I get the network?
There is no such thing!
550 different interaction databases!
… in 2013
It depends on your biological question
and your analysis plan.
16
The levels of organization of
complex networks:
 Node degree provides
information about single nodes
 Three or more nodes represent
a motif
 Larger groups of nodes are
called modules or
communities
 Hierarchy describes how the
various structural elements are
combined
Analytical Approaches
19
• Concepts
– Graph/Network correspondence
– Node/Vertex correspondence
– Edge directedness
• Usually a network property
– Node degree
– Multigraph
• Allow multiple edges between nodes
– Hypergraph
• Allow edges to connect more than 2 nodes
Analytical Approaches
21
• Scale-free networks
– Degree distribution follows power law:
P(k) ~ k-γ, where γ is a constant.
– Result is that there are distinctive “hubs”
(essential proteins?)
– Overall, though, network is resilient to
perturbation
– Biological (and social) networks tend to be
scale-free
Analytical Approaches
22
Analytical Approaches
• Small-world networks
– any two arbitrary nodes are connected
by a small number of intermediate
edges
– the network has an average shortest
path length much smaller than the
number of nodes in the network
(Watts, Nature, 1998).
– Interaction networks have been shown
to be small-world networks (Barabási,
Nature Reviews in Genetics, 2004)
27
• Motif finding
– Search directed networks for network motifs
(feed-forward loops, feedback loops, etc.)
Analytical Approaches
28
• Overrepresentation analysis
– Find terms (GO) that are statistically
overrepresented in a network
– Not really a network analysis technique
– Very useful for visualization
Analytical Approaches
29
• Clustering (find hubs, complexes)
– Goal: group related items together
• Clustering types:
– Hierarchical clustering
• Divide network into pair-wise hierarchy
– K-Means clustering
• Divide network into k groups
– MCL
• Uses a flow simulation to find groups
– Community Clustering
• Maximize intra-cluster edges vs. inter-cluster edges
Analytical Approaches
30
Visualization of Biological Networks
• Data Mapping
• Layouts
• Animation
32
Data Mapping
• Mapping of data values associated with
graph elements onto graph visuals
• Visual attributes
– Node fill color, border color, border width, size,
shape, opacity, label
– Edge type, color, width, ending type, ending
size, ending color
• Mapping types
– Passthrough (labels)
– Continuous (numeric values)
– Discrete (categories)
33
• Avoid cluttering your visualization with too
much data
– Highlight meaningful differences
– Avoid confusing the viewer
– Consider creating multiple network images
Data Mapping
34
• Layouts determine the location of nodes and
(sometimes) the paths of edges
• Types:
– Simple
• Grid
• Partitions
– Hierarchical
• layout data as a tree or hierarchy
• Works best when there are no loops
– Circular (Radial)
• arrange nodes around a circle
• could use node attributes to govern position
– e.g. degree sorted
Layouts
35
• Types:
– Force-Directed
• simulate edges as springs
• may be weighted or unweighted
– Combining layouts
• Use a general layout (force directed) for the entire
graph, but use hierarchical or radial to focus on a
particular portion
– Multi-layer layouts
• Partition graph, layout each partition then layout
partitions
– Many, many others
Layouts
36
• Use layouts to convey the relationships
between the nodes.
• Layout algorithms may need to be “tuned”
to fit your network.
• There is not one correct layout. Try different
things.
Layouts
37
Animation
• Animation is useful to show changes in a
network:
– Over a time series
– Over different conditions
– Between species
38
Introduction to Cytoscape
• Overview
• Core Concepts
– Networks and Tables
– Visual Properties
– Cytoscape Apps
• Working with Data
– Loading networks from files and online databases
– Loading data tables from CSV or Excel files
– The Table Panel
39
 Open source
 Cross platform
 Consortium
Cytoscape
University of Toronto
40
Core Concepts
• Networks and Tables
Tables
e.g., data or annotations
Networks
e.g., PPIs or pathways
41
Core Concepts
• Networks and Tables
TablesNetworks
Visual Styles
42
Core Concepts
• Cytoscape Apps!
apps.cytoscape.org
43
Loading Networks
• Use import network from file
– Excel file
– Comma or tab delimited text
44
Loading Networks
• But what if I don’t have a network?!
45
Loading Tables
• Nodes and edges can have
data associated with them
– Gene expression data
– Mass spectrometry data
– Protein structure information
– Gene Ontology terms, etc.
• Cytoscape supports multiple
data types: Numbers, Text,
Boolean, Lists...
46
• Use import table from file
– Excel file
– Comma or tab delimited text
Loading Tables
47
Visual Style Manager
• Click on the Styles tab
48
• Click on Select tab
Selection Filters
49
• Sessions save everything as .cys files:
Networks, Tables, Styles, Screen sizes, etc
• Export networks in different formats:
SIF, GML, XGMML, BioPAX, PSI-MI 1 & 2.5
• Publication quality graphics in several formats:
PDF, EPS, SVG, PNG, JPEG, and BMP
Saving and Exporting
50
Getting Help
57
Hands-on Tutorial
Introduction to Cytoscape:
Networks, Data, Styles, Layouts and App Manager
tutorials.cytoscape.org

More Related Content

What's hot

encode project
encode project encode project
encode project
Priti Pal
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
Hafiz Muhammad Zeeshan Raza
 
Gene Ontology Network Enrichment Analysis
Gene Ontology Network Enrichment AnalysisGene Ontology Network Enrichment Analysis
Gene Ontology Network Enrichment Analysis
UC Davis
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
University of California, Davis
 
MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)
Athar Mutahari
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
Nikesh Narayanan
 
THIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptxTHIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptx
RITHIKA R S
 
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
Sayma Zerin
 
Protein function and bioinformatics
Protein function and bioinformaticsProtein function and bioinformatics
Protein function and bioinformatics
Neil Saunders
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
Meghaj Mallick
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
Ali Kishk
 
NCBI
NCBINCBI
2015 Cytoscape 3.2 Tutorial
2015 Cytoscape 3.2 Tutorial2015 Cytoscape 3.2 Tutorial
2015 Cytoscape 3.2 Tutorial
Alexander Pico
 
Structural Genomics
Structural GenomicsStructural Genomics
Structural Genomics
Aqsa Javed
 
Protein Sequence Databases
Protein Sequence Databases Protein Sequence Databases
Protein Sequence Databases
Hemant Bothe
 
The Smith Waterman algorithm
The Smith Waterman algorithmThe Smith Waterman algorithm
The Smith Waterman algorithm
avrilcoghlan
 
Yeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction StudiesYeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction Studies
ajithnandanam
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
nadeem akhter
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
Senthil Natesan
 
Yeast hybrid system
Yeast hybrid systemYeast hybrid system
Yeast hybrid system
LekshmiJohnson
 

What's hot (20)

encode project
encode project encode project
encode project
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Gene Ontology Network Enrichment Analysis
Gene Ontology Network Enrichment AnalysisGene Ontology Network Enrichment Analysis
Gene Ontology Network Enrichment Analysis
 
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression AnalysisSo you want to do a: RNAseq experiment, Differential Gene Expression Analysis
So you want to do a: RNAseq experiment, Differential Gene Expression Analysis
 
MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
THIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptxTHIRD GEN SEQUENCING.pptx
THIRD GEN SEQUENCING.pptx
 
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
 
Protein function and bioinformatics
Protein function and bioinformaticsProtein function and bioinformatics
Protein function and bioinformatics
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
 
NCBI
NCBINCBI
NCBI
 
2015 Cytoscape 3.2 Tutorial
2015 Cytoscape 3.2 Tutorial2015 Cytoscape 3.2 Tutorial
2015 Cytoscape 3.2 Tutorial
 
Structural Genomics
Structural GenomicsStructural Genomics
Structural Genomics
 
Protein Sequence Databases
Protein Sequence Databases Protein Sequence Databases
Protein Sequence Databases
 
The Smith Waterman algorithm
The Smith Waterman algorithmThe Smith Waterman algorithm
The Smith Waterman algorithm
 
Yeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction StudiesYeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction Studies
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
 
Yeast hybrid system
Yeast hybrid systemYeast hybrid system
Yeast hybrid system
 

Similar to Cytoscape Network Visualization and Analysis

Biological networks
Biological networksBiological networks
Pathway and network analysis
Pathway and network analysisPathway and network analysis
Pathway and network analysis
Manar Al-Eslam Mattar
 
Semantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including AstrophysicsSemantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including Astrophysics
Artificial Intelligence Institute at UofSC
 
System biology and its tools
System biology and its toolsSystem biology and its tools
System biology and its tools
Gaurav Diwakar
 
Biological networks - building and visualizing
Biological networks - building and visualizingBiological networks - building and visualizing
Biological networks - building and visualizing
Bioinformatics and Computational Biosciences Branch
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
Maryann Martone
 
Neuroscience as networked science
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked science
Neuroscience Information Framework
 
Keynote at AImWD
Keynote at AImWDKeynote at AImWD
Keynote at AImWD
Stefan Schlobach
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
AKSHAY BHAGAT
 
CINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network ScienceCINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network Science
Biocomplexity Institute of Virginia Tech
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
Alexander Pico
 
Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...
mestato
 
Hmp 201512
Hmp 201512Hmp 201512
Metabolomic data analysis and visualization tools
Metabolomic data analysis and visualization toolsMetabolomic data analysis and visualization tools
Metabolomic data analysis and visualization tools
Dmitry Grapov
 
2016 Bio-IT World Cell Line Coordination 2016-04-06v1
2016 Bio-IT World Cell Line Coordination 2016-04-06v12016 Bio-IT World Cell Line Coordination 2016-04-06v1
2016 Bio-IT World Cell Line Coordination 2016-04-06v1
Bruce Kozuma
 
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and ApplicationsSemantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Artificial Intelligence Institute at UofSC
 
NetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno SchwikowskiNetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno Schwikowski
Alexander Pico
 
(ATS4-DEV02) Accelrys Query Service: Technology and Tools
(ATS4-DEV02) Accelrys Query Service: Technology and Tools(ATS4-DEV02) Accelrys Query Service: Technology and Tools
(ATS4-DEV02) Accelrys Query Service: Technology and Tools
BIOVIA
 
Metadata Mapping & Crosswalks
Metadata Mapping & CrosswalksMetadata Mapping & Crosswalks
Metadata Mapping & Crosswalks
Nikos Palavitsinis, PhD
 
Distributed End-to-End Drug Similarity Analytics and Visualization Workflow w...
Distributed End-to-End Drug Similarity Analytics and Visualization Workflow w...Distributed End-to-End Drug Similarity Analytics and Visualization Workflow w...
Distributed End-to-End Drug Similarity Analytics and Visualization Workflow w...
Databricks
 

Similar to Cytoscape Network Visualization and Analysis (20)

Biological networks
Biological networksBiological networks
Biological networks
 
Pathway and network analysis
Pathway and network analysisPathway and network analysis
Pathway and network analysis
 
Semantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including AstrophysicsSemantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including Astrophysics
 
System biology and its tools
System biology and its toolsSystem biology and its tools
System biology and its tools
 
Biological networks - building and visualizing
Biological networks - building and visualizingBiological networks - building and visualizing
Biological networks - building and visualizing
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
 
Neuroscience as networked science
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked science
 
Keynote at AImWD
Keynote at AImWDKeynote at AImWD
Keynote at AImWD
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
CINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network ScienceCINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network Science
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
 
Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...
 
Hmp 201512
Hmp 201512Hmp 201512
Hmp 201512
 
Metabolomic data analysis and visualization tools
Metabolomic data analysis and visualization toolsMetabolomic data analysis and visualization tools
Metabolomic data analysis and visualization tools
 
2016 Bio-IT World Cell Line Coordination 2016-04-06v1
2016 Bio-IT World Cell Line Coordination 2016-04-06v12016 Bio-IT World Cell Line Coordination 2016-04-06v1
2016 Bio-IT World Cell Line Coordination 2016-04-06v1
 
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and ApplicationsSemantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
 
NetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno SchwikowskiNetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno Schwikowski
 
(ATS4-DEV02) Accelrys Query Service: Technology and Tools
(ATS4-DEV02) Accelrys Query Service: Technology and Tools(ATS4-DEV02) Accelrys Query Service: Technology and Tools
(ATS4-DEV02) Accelrys Query Service: Technology and Tools
 
Metadata Mapping & Crosswalks
Metadata Mapping & CrosswalksMetadata Mapping & Crosswalks
Metadata Mapping & Crosswalks
 
Distributed End-to-End Drug Similarity Analytics and Visualization Workflow w...
Distributed End-to-End Drug Similarity Analytics and Visualization Workflow w...Distributed End-to-End Drug Similarity Analytics and Visualization Workflow w...
Distributed End-to-End Drug Similarity Analytics and Visualization Workflow w...
 

More from bdemchak

The New CyREST: Economical Delivery of Complex, Reproducible Network Biology ...
The New CyREST: Economical Delivery of Complex, Reproducible Network Biology ...The New CyREST: Economical Delivery of Complex, Reproducible Network Biology ...
The New CyREST: Economical Delivery of Complex, Reproducible Network Biology ...
bdemchak
 
Cytoscape Cyberinfrastructure
Cytoscape CyberinfrastructureCytoscape Cyberinfrastructure
Cytoscape Cyberinfrastructure
bdemchak
 
No More Silos! Cytoscape CI Enables Interoperability
No More Silos! Cytoscape CI Enables InteroperabilityNo More Silos! Cytoscape CI Enables Interoperability
No More Silos! Cytoscape CI Enables Interoperability
bdemchak
 
Cytoscape CI Chapter 2
Cytoscape CI Chapter 2Cytoscape CI Chapter 2
Cytoscape CI Chapter 2
bdemchak
 
Composable Chat Introduction
Composable Chat IntroductionComposable Chat Introduction
Composable Chat Introduction
bdemchak
 
Rich Services: Composable chat
Rich Services: Composable chatRich Services: Composable chat
Rich Services: Composable chat
bdemchak
 
Ucsd tum workshop bd
Ucsd tum workshop bdUcsd tum workshop bd
Ucsd tum workshop bd
bdemchak
 
Rich Feeds for RESCUE and PALMS
Rich Feeds for RESCUE and PALMSRich Feeds for RESCUE and PALMS
Rich Feeds for RESCUE and PALMS
bdemchak
 
Iscram 2008 presentation
Iscram 2008 presentationIscram 2008 presentation
Iscram 2008 presentation
bdemchak
 
Rich feeds policy, the cloud, and CAP
Rich feeds   policy, the cloud, and CAPRich feeds   policy, the cloud, and CAP
Rich feeds policy, the cloud, and CAP
bdemchak
 
Rich services to the Rescue
Rich services to the RescueRich services to the Rescue
Rich services to the Rescue
bdemchak
 
Hicss 2012 presentation
Hicss 2012 presentationHicss 2012 presentation
Hicss 2012 presentation
bdemchak
 
Policy 2012 presentation
Policy 2012 presentationPolicy 2012 presentation
Policy 2012 presentation
bdemchak
 
Rich feeds for rescue an integration story
Rich feeds for rescue   an integration storyRich feeds for rescue   an integration story
Rich feeds for rescue an integration story
bdemchak
 
Background scenario drivers and critical issues with a focus on technology ...
Background   scenario drivers and critical issues with a focus on technology ...Background   scenario drivers and critical issues with a focus on technology ...
Background scenario drivers and critical issues with a focus on technology ...
bdemchak
 
Rich feeds for rescue, palms cyberinfrastructure integration stories
Rich feeds for rescue, palms cyberinfrastructure   integration storiesRich feeds for rescue, palms cyberinfrastructure   integration stories
Rich feeds for rescue, palms cyberinfrastructure integration stories
bdemchak
 
Data quality and uncertainty visualization
Data quality and uncertainty visualizationData quality and uncertainty visualization
Data quality and uncertainty visualization
bdemchak
 
Web programming in clojure
Web programming in clojureWeb programming in clojure
Web programming in clojure
bdemchak
 
Structure and interpretation of computer programs modularity, objects, and ...
Structure and interpretation of computer programs   modularity, objects, and ...Structure and interpretation of computer programs   modularity, objects, and ...
Structure and interpretation of computer programs modularity, objects, and ...
bdemchak
 
Requirements engineering from system goals to uml models to software specif...
Requirements engineering   from system goals to uml models to software specif...Requirements engineering   from system goals to uml models to software specif...
Requirements engineering from system goals to uml models to software specif...
bdemchak
 

More from bdemchak (20)

The New CyREST: Economical Delivery of Complex, Reproducible Network Biology ...
The New CyREST: Economical Delivery of Complex, Reproducible Network Biology ...The New CyREST: Economical Delivery of Complex, Reproducible Network Biology ...
The New CyREST: Economical Delivery of Complex, Reproducible Network Biology ...
 
Cytoscape Cyberinfrastructure
Cytoscape CyberinfrastructureCytoscape Cyberinfrastructure
Cytoscape Cyberinfrastructure
 
No More Silos! Cytoscape CI Enables Interoperability
No More Silos! Cytoscape CI Enables InteroperabilityNo More Silos! Cytoscape CI Enables Interoperability
No More Silos! Cytoscape CI Enables Interoperability
 
Cytoscape CI Chapter 2
Cytoscape CI Chapter 2Cytoscape CI Chapter 2
Cytoscape CI Chapter 2
 
Composable Chat Introduction
Composable Chat IntroductionComposable Chat Introduction
Composable Chat Introduction
 
Rich Services: Composable chat
Rich Services: Composable chatRich Services: Composable chat
Rich Services: Composable chat
 
Ucsd tum workshop bd
Ucsd tum workshop bdUcsd tum workshop bd
Ucsd tum workshop bd
 
Rich Feeds for RESCUE and PALMS
Rich Feeds for RESCUE and PALMSRich Feeds for RESCUE and PALMS
Rich Feeds for RESCUE and PALMS
 
Iscram 2008 presentation
Iscram 2008 presentationIscram 2008 presentation
Iscram 2008 presentation
 
Rich feeds policy, the cloud, and CAP
Rich feeds   policy, the cloud, and CAPRich feeds   policy, the cloud, and CAP
Rich feeds policy, the cloud, and CAP
 
Rich services to the Rescue
Rich services to the RescueRich services to the Rescue
Rich services to the Rescue
 
Hicss 2012 presentation
Hicss 2012 presentationHicss 2012 presentation
Hicss 2012 presentation
 
Policy 2012 presentation
Policy 2012 presentationPolicy 2012 presentation
Policy 2012 presentation
 
Rich feeds for rescue an integration story
Rich feeds for rescue   an integration storyRich feeds for rescue   an integration story
Rich feeds for rescue an integration story
 
Background scenario drivers and critical issues with a focus on technology ...
Background   scenario drivers and critical issues with a focus on technology ...Background   scenario drivers and critical issues with a focus on technology ...
Background scenario drivers and critical issues with a focus on technology ...
 
Rich feeds for rescue, palms cyberinfrastructure integration stories
Rich feeds for rescue, palms cyberinfrastructure   integration storiesRich feeds for rescue, palms cyberinfrastructure   integration stories
Rich feeds for rescue, palms cyberinfrastructure integration stories
 
Data quality and uncertainty visualization
Data quality and uncertainty visualizationData quality and uncertainty visualization
Data quality and uncertainty visualization
 
Web programming in clojure
Web programming in clojureWeb programming in clojure
Web programming in clojure
 
Structure and interpretation of computer programs modularity, objects, and ...
Structure and interpretation of computer programs   modularity, objects, and ...Structure and interpretation of computer programs   modularity, objects, and ...
Structure and interpretation of computer programs modularity, objects, and ...
 
Requirements engineering from system goals to uml models to software specif...
Requirements engineering   from system goals to uml models to software specif...Requirements engineering   from system goals to uml models to software specif...
Requirements engineering from system goals to uml models to software specif...
 

Recently uploaded

What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
Renu Jangid
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
fafyfskhan251kmf
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 

Recently uploaded (20)

What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 

Cytoscape Network Visualization and Analysis

  • 1. Barry Demchak, PhD Ideker Lab, UC San Diego bdemchak@cytoscape.org November 6, 2016 RSG with DREAM & Cytoscape Workshop Phoenix, AZ
  • 2. 2 Outline • Biological Networks – Why Networks? – Biological Network Taxonomy – Analytical Approaches – Visualization • Introduction to Cytoscape • Break • Hands on Tutorial – Data import – Layout and apps
  • 3. 3 Introductions • Barry Demchak – 2012-Current • Chief Software Architect, Project Manager for National Resource for Network Biology (NRNB, Ideker Lab) – 2005-2012 • PhD Computer Science and Engineering, UC San Diego – 1987-current • President, Torrey Pines Software, Inc – Cytoscape core team since 2012 – Architect of Cytoscape Cyberinfrastructure
  • 4. 4 Introductions • Keiichiro Ono – 2005-Current • Research Associate / Software Engineer (UC, San Diego Trey Ideker Lab / NRNB) – Cytoscape core team since 2005 – Area of Interest • Biological Data Integration • Data Visualization
  • 5. 5 • Alex Pico, Ph.D. – Executive Director, National Resource for Network Biology and VP of Cytoscape Consortium – Associate Director of Bioinformatics, Gladstone Institutes, UCSF – Cytoscape core developer since 2007 – Developer of several Cytoscape apps: • WikiPathways, PathExplorer, GenMAPP-CS, enhancedGraphics, Mosaic, NOA, Synapse Client, Variation Acknowledgment
  • 6. 6 • You? – Clinician – Biologist – Bioinformatician – Computer Scientist – Chemist – Other? Introductions
  • 7. 7 Introducing Cytoscape 0 500 1000 1500 2000 2500 3000 3500 4000 2/9/2014 8/9/2014 2/9/2015 8/9/2015 2/9/2016 8/9/2016 DailyExecutions Date Daily Cytoscape Executions (2,032,889 - 2014 through 2016) 0 2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 Jul-04 Jul-06 Jul-08 Jul-10 Jul-12 Jul-14 Jul-16 Cytoscape Downloads United States 27% India 8% China 7% United Kingdom 5% Germany 5% Japan 5% France 4% Canada 3% Italy 3% Spain 2% Other 31% Cytoscape.org Sessions (1,799,876 since 2012) 0 200 400 600 800 1000 1200 1400 1600 1800 Count Year Google Scholar Cytoscape Citations 11/1/2016
  • 8. 8 Why Networks? • Networks are… • Commonly understood • Structured to reduce complexity • More efficient than tables • Network tools allow… Analysis • Characterize network properties • Identify hubs and subnets • Classify, quantify and correlate, e.g., cluster nodes by associated data Visualization • Explore data overlays • Interpret mechanisms, e.g., how a process is modulated or attenuated by a stimulus
  • 9. 9 Applications of Network Biology jActiveModules, UCSD (based on Microarray Data) PathBlast, UCSD mCode, University of Toronto (data agnostic) DomainGraph, Max Planck Institute • Gene Function Prediction shows connections to sets of genes/proteins involved in same biological process • Detection of protein complexes/subnetworks discover modularity & higher order organization (motifs, feedback loops) • Network evolution biological process(s) conservation across species • Prediction of interactions & functional associations statistically significant domain- domain correlations in protein interaction network to predict protein-protein or genetic interaction
  • 10. 10 Applications in Disease • Identification of disease subnetworks – identification of disease network subnetworks that are transcriptionally active in disease. • Subnetwork-based diagnosis – source of biomarkers for disease classification, identify interconnected genes whose aggregate expression levels are predictive of disease state • Subnetwork-based gene association – map common pathway mechanisms affected by collection of genotypes (SNP, CNV) Agilent Literature Search Mondrian, MSKCC PinnacleZ, UCSD
  • 12. 12 Biological Network Taxonomy • Pathways – Signaling, Metabolic, Regulatory, etc
  • 13. 13 Biological Network Taxonomy • Interactions – Protein-Protein – Protein-Ligand – Domain-Domain – Others • Residue or atomic • Cell-cell • Epidemiology • Social networks
  • 14. 14 Biological Network Taxonomy • Similarity – Protein-Protein – Chemical similarity – Ligand similarity (SEA) – Others • Tag clouds • Topic maps
  • 15. 15 Biological Network Taxonomy Where do I get the network? There is no such thing! 550 different interaction databases! … in 2013 It depends on your biological question and your analysis plan.
  • 16. 16 The levels of organization of complex networks:  Node degree provides information about single nodes  Three or more nodes represent a motif  Larger groups of nodes are called modules or communities  Hierarchy describes how the various structural elements are combined Analytical Approaches
  • 17. 19 • Concepts – Graph/Network correspondence – Node/Vertex correspondence – Edge directedness • Usually a network property – Node degree – Multigraph • Allow multiple edges between nodes – Hypergraph • Allow edges to connect more than 2 nodes Analytical Approaches
  • 18. 21 • Scale-free networks – Degree distribution follows power law: P(k) ~ k-γ, where γ is a constant. – Result is that there are distinctive “hubs” (essential proteins?) – Overall, though, network is resilient to perturbation – Biological (and social) networks tend to be scale-free Analytical Approaches
  • 19. 22 Analytical Approaches • Small-world networks – any two arbitrary nodes are connected by a small number of intermediate edges – the network has an average shortest path length much smaller than the number of nodes in the network (Watts, Nature, 1998). – Interaction networks have been shown to be small-world networks (Barabási, Nature Reviews in Genetics, 2004)
  • 20. 27 • Motif finding – Search directed networks for network motifs (feed-forward loops, feedback loops, etc.) Analytical Approaches
  • 21. 28 • Overrepresentation analysis – Find terms (GO) that are statistically overrepresented in a network – Not really a network analysis technique – Very useful for visualization Analytical Approaches
  • 22. 29 • Clustering (find hubs, complexes) – Goal: group related items together • Clustering types: – Hierarchical clustering • Divide network into pair-wise hierarchy – K-Means clustering • Divide network into k groups – MCL • Uses a flow simulation to find groups – Community Clustering • Maximize intra-cluster edges vs. inter-cluster edges Analytical Approaches
  • 23. 30 Visualization of Biological Networks • Data Mapping • Layouts • Animation
  • 24. 32 Data Mapping • Mapping of data values associated with graph elements onto graph visuals • Visual attributes – Node fill color, border color, border width, size, shape, opacity, label – Edge type, color, width, ending type, ending size, ending color • Mapping types – Passthrough (labels) – Continuous (numeric values) – Discrete (categories)
  • 25. 33 • Avoid cluttering your visualization with too much data – Highlight meaningful differences – Avoid confusing the viewer – Consider creating multiple network images Data Mapping
  • 26. 34 • Layouts determine the location of nodes and (sometimes) the paths of edges • Types: – Simple • Grid • Partitions – Hierarchical • layout data as a tree or hierarchy • Works best when there are no loops – Circular (Radial) • arrange nodes around a circle • could use node attributes to govern position – e.g. degree sorted Layouts
  • 27. 35 • Types: – Force-Directed • simulate edges as springs • may be weighted or unweighted – Combining layouts • Use a general layout (force directed) for the entire graph, but use hierarchical or radial to focus on a particular portion – Multi-layer layouts • Partition graph, layout each partition then layout partitions – Many, many others Layouts
  • 28. 36 • Use layouts to convey the relationships between the nodes. • Layout algorithms may need to be “tuned” to fit your network. • There is not one correct layout. Try different things. Layouts
  • 29. 37 Animation • Animation is useful to show changes in a network: – Over a time series – Over different conditions – Between species
  • 30. 38 Introduction to Cytoscape • Overview • Core Concepts – Networks and Tables – Visual Properties – Cytoscape Apps • Working with Data – Loading networks from files and online databases – Loading data tables from CSV or Excel files – The Table Panel
  • 31. 39  Open source  Cross platform  Consortium Cytoscape University of Toronto
  • 32. 40 Core Concepts • Networks and Tables Tables e.g., data or annotations Networks e.g., PPIs or pathways
  • 33. 41 Core Concepts • Networks and Tables TablesNetworks Visual Styles
  • 34. 42 Core Concepts • Cytoscape Apps! apps.cytoscape.org
  • 35. 43 Loading Networks • Use import network from file – Excel file – Comma or tab delimited text
  • 36. 44 Loading Networks • But what if I don’t have a network?!
  • 37. 45 Loading Tables • Nodes and edges can have data associated with them – Gene expression data – Mass spectrometry data – Protein structure information – Gene Ontology terms, etc. • Cytoscape supports multiple data types: Numbers, Text, Boolean, Lists...
  • 38. 46 • Use import table from file – Excel file – Comma or tab delimited text Loading Tables
  • 39. 47 Visual Style Manager • Click on the Styles tab
  • 40. 48 • Click on Select tab Selection Filters
  • 41. 49 • Sessions save everything as .cys files: Networks, Tables, Styles, Screen sizes, etc • Export networks in different formats: SIF, GML, XGMML, BioPAX, PSI-MI 1 & 2.5 • Publication quality graphics in several formats: PDF, EPS, SVG, PNG, JPEG, and BMP Saving and Exporting
  • 43. 57 Hands-on Tutorial Introduction to Cytoscape: Networks, Data, Styles, Layouts and App Manager tutorials.cytoscape.org

Editor's Notes

  1. EDIT: schedule
  2. EDIT: introductions per presenter
  3. EDIT: introductions per presenter
  4. EDIT: introductions per presenter
  5. Gene Function Prediction – basic but important, examining genes (proteins) in a network context shows connections to sets of genes/proteins involved in same biological process that are likely to function in that process Detection of protein complexes/other modular structures – although interaction networks are based on pair-wise interactions, there is clear evidence for modularity & higher order organization (motifs, feedback loops) Network evolution – biological process(s) conservation across species (PathBLAST, NetworkBLAST to align p-p interaction networks & clusters) Prediction of new interactions and functional associations – Statistically significant domain-domain correlations in protein interaction network suggest that certain domain (and domain pairs) mediate protein binding. Machine learning extends this to suggest to predict protein-protein or genetic interaction through integration of diverse types of evidence for interaction
  6. Identification of disease subnetworks – identification of disease network subnetworks that are transcriptionally active in disease. These suggest key pathway components in disease progression and provide leads for further study and potential therapeutic targets Subnetwork-based diagnosis – subnetworks also provide a rich source of biomarkers for disease classification, based on mRNA profiling integrated with protein networks to identify subnetwork biomarkers (interconnected genes whose aggregate expression levels are predictive of disease state) Subnetwork-based gene association – molecular networks will provide a powerful framework for mapping common pathway mechanisms affected by collection of genotypes (SNP, CNV)
  7. Some good examples An example with perhaps too many visual properties Cool cell illustration Mapping brain regions (not proteins!) Cytoscape is often just one part of an analysis pipeline
  8. Signaling pathway: Interleukin-2 Metabolic pathway: Krebs (TCA) Cycle
  9. “Cancer genes are central not only in terms of number of neighbors, but also in terms of the clusters they connect. Remarkably, we found that high Participation roles played by cancer genes are not explained by their higher number of interactions, but by their unique inter-modular connectivity patterns. We also found that these connectivity patterns differ for disease genes with contrasting inheritance modes: while autosomal dominant genes play high participation roles, autosomal recessive genes are more confined to their own modules. “ Replicates over 6 different interaction networks (HIPPIE18, BIANA40, BioGRID41, IntAct42, IrefIndex43 and from Human Interactome Project (HBI)19,44). Looked at ~2000 Mendelian disease genes and 781 “cancer driver genes.”
  10. Note: all Cytoscape ORA tools provide a network visualization of terms and a ranked table with statistics
  11. Last two: - NetGestalt (used to align data in fashion of genome browsers) - BioFrabric (used to organize/explore massive hairballs)
  12. http://git.dhimmel.com/SIDER2/similarity/
  13. EDIT: according to schedule
  14. NOTE: Pre-acknowledgements slide
  15. Notes: it is a Cytoscape goal that both the graph model and the data are free from Biological semantics
  16. Screenshots: App Store front page, example app, and wall of apps Note: mention increasing number of apps (and ports to 3.x) Alternative: visit App Store live 272 apps – April 2016
  17. Welcome! - web services: a couple provided by default (more in App Store) - preset networks from BioGRID per species (mouse over for details)
  18. New interface! - def, map, byp - example continuous mapping panel - menu of provided visual styles
  19. Note: these used to be major issues in 2.x
  20. TODO: Update
  21. clusterMaker: - simple: galFiltered -> BiNGO - complex: collins PPI litSearch: - CDC25, human, cancer WikiPathways - human: TCA, Statin or 5-FU
  22. EDIT: update Timing section just before presenting
  23. The example file, galFiltered.cys is included with all Cytoscape releases. It is taken from Ideker, et.al. 2001, although it has gained significantly more annotations over the years. It shows a protein-protein interaction network that focuses on the galactose (gal) utilization in the yeast Saccharomyces cerevisiae. The network was perturbed by gene deletion of the major GAL genes, then the cells were grown in the presence and absence of galactose.