This document outlines a presentation on biological networks and the software Cytoscape. It begins with an introduction to biological networks and their taxonomy, as well as analytical approaches and visualization techniques. It then provides an overview of Cytoscape, covering core concepts like networks and tables, visual properties, and apps. The document demonstrates how to load networks and data, use visual style managers, and save and export networks. It concludes with tips and tricks for using Cytoscape and a link to a hands-on tutorial.
cytoscape is open source network analyses tools, in this slides we define the basic features of this tool, and a brief tutorial of how can you use this tool in innovative way.
cytoscape is open source network analyses tools, in this slides we define the basic features of this tool, and a brief tutorial of how can you use this tool in innovative way.
DRUG DESIGN BASED ON BIOINFORMATICS TOOLSNIPER MOHALI
Drug design is a very complex process it takes many more times but using the these specific tools we can reduce complex process and save the time and produce a effective new drug that will be helpful in heath environment.
Owing to the fragmented nature of EST reads, it is worthwhile to attempt to organize the reads into assemblies that provide a consensus view of the sampled transcripts.
Such fragmented, EST data or gene sequence data, placed in correct context, and indexed by gene such that all expressed data concerning a single gene is in a single index class, and each index class contains the information for only one gene is an EST cluster.
Functional proteomics, methods and toolsKAUSHAL SAHU
INTRODUCTION
HISTORY
DEFINITION
PROTEOMICS
FUNCTIONAL PROTEOMICS
PROTEOMICS SOFTWARE
PROTEOMICS ANALYSIS
TOOLS FOR PROTEOM ANALYSIS
DIFFERENTS METHODS FOR STUDY OF FUNCTIONAL PROTEOMICS
APLLICATIONS
LIMITATIONS
CONCLUSION
Systems biology & Approaches of genomics and proteomicssonam786
This presentation provides the basic understanding of varous genomics and proteomics techniques.Systems biology studies life as a system .It includes the study of living system using various omic technologies .
it will help you to understand how the protein microarrays are made, what are the different types and what all purposes they are used for. its very useful ppt
DRUG DESIGN BASED ON BIOINFORMATICS TOOLSNIPER MOHALI
Drug design is a very complex process it takes many more times but using the these specific tools we can reduce complex process and save the time and produce a effective new drug that will be helpful in heath environment.
Owing to the fragmented nature of EST reads, it is worthwhile to attempt to organize the reads into assemblies that provide a consensus view of the sampled transcripts.
Such fragmented, EST data or gene sequence data, placed in correct context, and indexed by gene such that all expressed data concerning a single gene is in a single index class, and each index class contains the information for only one gene is an EST cluster.
Functional proteomics, methods and toolsKAUSHAL SAHU
INTRODUCTION
HISTORY
DEFINITION
PROTEOMICS
FUNCTIONAL PROTEOMICS
PROTEOMICS SOFTWARE
PROTEOMICS ANALYSIS
TOOLS FOR PROTEOM ANALYSIS
DIFFERENTS METHODS FOR STUDY OF FUNCTIONAL PROTEOMICS
APLLICATIONS
LIMITATIONS
CONCLUSION
Systems biology & Approaches of genomics and proteomicssonam786
This presentation provides the basic understanding of varous genomics and proteomics techniques.Systems biology studies life as a system .It includes the study of living system using various omic technologies .
it will help you to understand how the protein microarrays are made, what are the different types and what all purposes they are used for. its very useful ppt
Keynote given at the workshop for Artificial Intelligence meets the Web of Data on Pragmatic Semantics.
In this keynote I argue that the Web of Data is a Complex System or Marketplace of Ideas rather than a classical Database, and that the model theory on which classical semantics are based is not appropriate in all situations, and propose an alternative "Pragmatic Semantics" based on optimisation of possible interpretations. .
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Denis Parra Santander
- First version was a guest lecture about Network Visualization in the class "Data Visualization" taught by Dr. Sharon Hsiao in the QMSS program at Columbia University http://www.columbia.edu/~ih2240/dataviz/index.htm
- This updated version was delivered in our class on SNA at PUC Chile in the MPGI master program.
Database Models, Client-Server Architecture, Distributed Database and Classif...Rubal Sagwal
Introduction to Data Models
-Hierarchical Model
-Network Model
-Relational Model
-Client/Server Architecture
Introduction to Distributed Database
Classification of DBMS
National Resource for Networks Biology's TR&D Theme 3: Although networks have been very useful for representing molecular interactions and mechanisms, network diagrams do not visually resemble the contents of cells. Rather, the cell involves a multi-scale hierarchy of components – proteins are subunits of protein complexes which, in turn, are parts of pathways, biological processes, organelles, cells, tissues, and so on. In this technology research project, we will pursue methods that move Network Biology towards such hierarchical, multi-scale views of cell structure and function.
Technology R&D Theme 2: From Descriptive to Predictive NetworksAlexander Pico
National Resource for Networks Biology's TR&D Theme 2: Genomics is mapping complex data about human biology and promises major medical advances. However, the routine use of genomics data in medical research is in its infancy, due mainly to the challenges of working with highly complex “big data”. In this theme, we will use network information to help organize, analyze and integrate these data into models that can be used to make clinically relevant diagnoses and predictions about an individual.
National Resource for Networks Biology's TR&D Theme 1: In this theme, we will develop a series of tools and methodologies for conducting differential analyses of biological networks perturbed under multiple conditions. The novel algorithmic methodologies enable us to make use of high-throughput proteomic level data to recover biological networks under specific biological perturbations. The software tools developed in this project enable researchers to further predict, analyze, and visualize the effects of these perturbations and alterations, while enabling researchers to aggregate additional information regarding the known roles of the involved interactions and their participants.
The NRNB has been funded as an NIGMS Biomedical Technology Research Resource since 2010. During the previous five-year period, NRNB investigators introduced a series of innovative methods for network biology including network-based biomarkers, network-based stratification of genomes, and automated inference of gene ontologies using network data. Over the next five years, we will seek to catalyze major phase transitions in how biological networks are represented and used, working across three broad themes: (1) From static to differential networks, (2) From descriptive to predictive networks, and (3) From flat to hierarchical networks bridging across scales. All of these efforts leverage and further support our growing stable of network technologies, including the popular Cytoscape network analysis infrastructure.
Visualization and Analysis of Dynamic Networks Alexander Pico
DynNetwork development was taken up initially by Sabina Sara Pfister back in GSoC 2012. She laid out a strong foundation for dynamic network visualization in Cytoscape and my job was to extend the plugin’s functionality to help users analyse time changing networks. The two of us were mentored by Jason Montojo. We had developed a decent tool over the course of two GSoC programs to aid dynamic network analysis and our efforts culminated in DynNetwork getting accepted for an oral presentation at the International Network for Social Network Analysis (INSNA), Sunbelt 2014 which was held in St. Petersburg, FL in February.
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
Richard's entangled aventures in wonderlandRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
Multi-source connectivity as the driver of solar wind variability in the heli...Sérgio Sacani
The ambient solar wind that flls the heliosphere originates from multiple
sources in the solar corona and is highly structured. It is often described
as high-speed, relatively homogeneous, plasma streams from coronal
holes and slow-speed, highly variable, streams whose source regions are
under debate. A key goal of ESA/NASA’s Solar Orbiter mission is to identify
solar wind sources and understand what drives the complexity seen in the
heliosphere. By combining magnetic feld modelling and spectroscopic
techniques with high-resolution observations and measurements, we show
that the solar wind variability detected in situ by Solar Orbiter in March
2022 is driven by spatio-temporal changes in the magnetic connectivity to
multiple sources in the solar atmosphere. The magnetic feld footpoints
connected to the spacecraft moved from the boundaries of a coronal hole
to one active region (12961) and then across to another region (12957). This
is refected in the in situ measurements, which show the transition from fast
to highly Alfvénic then to slow solar wind that is disrupted by the arrival of
a coronal mass ejection. Our results describe solar wind variability at 0.5 au
but are applicable to near-Earth observatories.
2. 2
Outline
• Biological Networks
– Why Networks?
– Biological Network Taxonomy
– Analytical Approaches
– Visualization
• Break
• Introduction to Cytoscape
• Hands on Tutorial
– Data import
– Layout and apps
3. 3
Why Networks?
• Networks are…
• Commonly understood
• Structured to reduce complexity
• More efficient than tables
• Network tools allow…
Analysis
• Characterize network properties
• Identify hubs and subnets
• Classify, quantify and correlate, e.g.,
cluster nodes by associated data
Visualization
• Explore data overlays
• Interpret mechanisms, e.g., how a process
is modulated or attenuated by a stimulus
4. 4
Applications of Network Biology
jActiveModules, UCSD
PathBlast, UCSD
mCode, University of Toronto
DomainGraph, Max Planck Institute
• Gene Function Prediction
shows connections to sets of
genes/proteins involved in same
biological process
• Detection of protein
complexes/subnetworks
discover modularity & higher
order organization (motifs,
feedback loops)
• Network evolution
biological process(s)
conservation across species
• Prediction of interactions &
functional associations
statistically significant domain-
domain correlations in protein
interaction network to predict
protein-protein or genetic
interaction
5. 5
Applications in Disease
• Identification of disease
subnetworks – identification
of disease network subnetworks
that are transcriptionally active
in disease.
• Subnetwork-based
diagnosis – source of
biomarkers for disease
classification, identify
interconnected genes whose
aggregate expression levels are
predictive of disease state
• Subnetwork-based gene
association – map common
pathway mechanisms affected
by collection of genotypes
(SNP, CNV)
Agilent Literature Search
Mondrian, MSKCC
PinnacleZ, UCSD
10. 10
Biological Network Taxonomy
Where do I get the network?
There is no such thing!
550 different interaction databases!
… in 2013
It depends on your biological question
and your analysis plan.
11. 11
The levels of organization of
complex networks:
Node degree provides
information about single nodes
Three or more nodes represent
a motif
Larger groups of nodes are
called modules or
communities
Hierarchy describes how the
various structural elements are
combined
Analytical Approaches
13. 13
Network topology statistics
such as node degree, degree
distribution, centralitiy,
clustering coefficient, shortest
paths, and robustness of the
network to the random removal
of single nodes are important
network characteristics.
Motif analysis is the
identification of small network
patterns that are over-
represented when compared
with a randomized version of
the same network. Regulatory
elements are often composed
of such motifs.
Analytical Approaches
Modularity refers to the
identification of sub-networks of
interconnected nodes that might
represent molecules physically
or functionally linked that work
coordinately to achieve a
specific function.
Network alignment and
comparison tools can identify
similarities between networks
and have been used to study
evolutionary relationships
between protein networks of
organisms.
14. 14
• Concepts
– Graph/Network correspondence
– Node/Vertex correspondence
– Edge directedness
• Usually a network property
– Node degree
– Multigraph
• Allow multiple edges between nodes
– Hypergraph
• Allow edges to connect more than 2 nodes
Analytical Approaches
15. 15
• Network measures
– Node degree
• Indegree: # of edges for which this node is a target
• Outdegree: # of edges for which this node is a
source
– Hub
• node with an exceptionally high degree
– Shortest path length
• Shortest traversal distance between two nodes
• Can be weighted if edges have weights or hops if not
Analytical Approaches
16. 16
• Scale-free networks
– Degree distribution follows power law:
P(k) ~ k-γ, where γ is a constant.
– Result is that there are distinctive “hubs”
(essential proteins?)
– Overall, though, network is resilient to
perturbation
– Biological (and social) networks tend to be
scale-free
Analytical Approaches
17. 17
Analytical Approaches
• Small-world networks
– any two arbitrary nodes are connected
by a small number of intermediate
edges
– the network has an average shortest
path length much smaller than the
number of nodes in the network
(Watts, Nature, 1998).
– Interaction networks have been shown
to be small-world networks (Barabási,
Nature Reviews in Genetics, 2004)
18. 18
• Random networks
– homogeneous, nodes have similar degrees,
and not robust to arbitrary node failure
– Algorithms exist to create random networks
• Flat random network: Erdos-Renyi
• Scale-free: Barabasi-Albert
• Small-world (high clustering): Watts-Strogatz
– Useful to compare your network vs. a random
network
Analytical Approaches
19. 19
• Network measures
– Clustering coefficient
• Measures how close the neighbors of a node are to
being a clique (fully connected group)
• # of edges connecting a node’s neighbors/the node’s
degree
0.5
1.0
0
0.3
Analytical Approaches
20. 20
• Centrality - measures of node importance
– Degree centrality (find hubs)
• Degree of this node / (# of nodes – 1)
– Betweenness centrality (bottleneck)
• The average number of shortest paths that go
through this node / (# of pairs)
– Closeness centrality
• The sum of all shortest paths between this node
and all other nodes / (# of nodes – 1)
Analytical Approaches
22. 22
• Overrepresentation analysis
– Find terms (GO) that are statistically
overrepresented in a network
– Not really a network analysis technique
– Very useful for visualization
Analytical Approaches
23. 23
• Clustering (find hubs, complexes)
– Goal: group related items together
• Clustering types:
– Hierarchical clustering
• Divide network into pair-wise hierarchy
– K-Means clustering
• Divide network into k groups
– MCL
• Uses a flow simulation to find groups
– Community Clustering
• Maximize intra-cluster edges vs. inter-cluster edges
Analytical Approaches
25. 25
Depiction
• Various ways to depict biological networks:
– Node-Link (graph) representation
– Partitioned Node-Link representation
– Matrix representation
• Can be useful for very dense networks
• Can also map information into cells of matrix
– e.g. degree, color scale (heat map)
– Hierarchical reduction to 1D
– Orthogonal 1D representation of nodes/edges
26. 26
Data Mapping
• Mapping of data values associated with
graph elements onto graph visuals
• Visual attributes
– Node fill color, border color, border width, size,
shape, opacity, label
– Edge type, color, width, ending type, ending
size, ending color
• Mapping types
– Passthrough (labels)
– Continuous (numeric values)
– Discrete (categories)
27. 27
• Avoid cluttering your visualization with too
much data
– Highlight meaningful differences
– Avoid confusing the viewer
– Consider creating multiple network images
Data Mapping
28. 28
• Layouts determine the location of nodes and
(sometimes) the paths of edges
• Types:
– Simple
• Grid
• Partitions
– Hierarchical
• layout data as a tree or hierarchy
• Works best when there are no loops
– Circular (Radial)
• arrange nodes around a circle
• could use node attributes to govern position
– e.g. degree sorted
Layouts
29. 29
• Types:
– Force-Directed
• simulate edges as springs
• may be weighted or unweighted
– Combining layouts
• Use a general layout (force directed) for the entire
graph, but use hierarchical or radial to focus on a
particular portion
– Multi-layer layouts
• Partition graph, layout each partition then layout
partitions
– Many, many others
Layouts
30. 30
• Use layouts to convey the relationships
between the nodes.
• Layout algorithms may need to be “tuned”
to fit your network.
• There is not one correct layout. Try different
things.
Layouts
31. 31
Animation
• Animation is useful to show changes in a
network:
– Over a time series
– Over different conditions
– Between species
32. 32
Introduction to Cytoscape
• Overview
• Core Concepts
– Networks and Tables
– Visual Properties
– Cytoscape Apps
• Working with Data
– Loading networks from files and online databases
– Loading data tables from CSV or Excel files
– The Table Panel
33. 33
Open source
Cross platform
Consortium
Cytoscape
University of Toronto
39. 39
Loading Tables
• Nodes and edges can have
data associated with them
– Gene expression data
– Mass spectrometry data
– Protein structure information
– Gene Ontology terms, etc.
• Cytoscape supports multiple
data types: Numbers, Text,
Boolean, Lists...
40. 40
• Use import table from file
– Excel file
– Comma or tab delimited text
Loading Tables
45. 45
Tips & Tricks
• Network Collections
– Each collection has a “root” network
– Changing the attribute for a node in one
network will also change that attribute for a
node with the same SUID in all other networks
within the collection
– You can clone a network into a new collection
to “decouple” it and start a new root
46. 46
• Network views
– When you open a large network, you will not
get a view by default
– To improve interactive performance, Cytoscape
has the concept of “Levels of Detail”
• Some visual attributes will only be apparent when
you zoom in
• The level of detail for various attributes can be
changed in the preferences
• To see what things will look like at full detail:
– ViewShow Graphics Details
Tips & Tricks
47. 47
• Sessions
– Sessions save pretty much everything:
• Networks
• Properties
• Visual styles
• Screen sizes
– Saving a session on a large screen may require
some resizing when opened on your laptop
Tips & Tricks
48. 48
• Task monitor
– Current task displayed in status bar (lower left)
– Icon opens complete task history
• Memory
– Current status (lower right)
– Toggle open for details and “Free Unused
Memory” button
Tips & Tricks
49. 49
• CytoscapeConfiguration directory
– Your defaults and any apps downloaded from the
App Store will go here
• App Manager
– This is where you search/install/update/uninstall
apps
– You now have the option of disabling vs.
uninstalling…
– Can also install and update apps directly from the
App Store website, if you have Cytoscape 3 up and
running
Tips & Tricks
50. 50
Cytoscape: Platform
• Cytoscape as a platform
– App architecture
• http://apps.cytoscape.org
– Use cases
• Expression data analysis
• Protein complexes
• Agilent literature search
Gene Function Prediction –basic but important, examining genes (proteins) in a network context shows connections to sets of genes/proteins involved in same biological process that are likely to function in that process
Detection of protein complexes/other modular structures – although interaction networks are based on pair-wise interactions, there is clear evidence for modularity & higher order organization (motifs, feedback loops)
Network evolution – biological process(s) conservation across species (PathBLAST, NetworkBLAST to align p-p interaction networks & clusters)
Prediction of new interactions and functional associations – Statistically significant domain-domain correlations in protein interaction network suggest that certain domain (and domain pairs) mediate protein binding. Machine learning extends this to suggest to predict protein-protein or genetic interaction through integration of diverse types of evidence for interaction
Identification of disease subnetworks – identification of disease network subnetworks that are transcriptionally active in disease. These suggest key pathway components in disease progression and provide leads for further study and potential therapeutic targets
Subnetwork-based diagnosis – subnetworks also provide a rich source of biomarkers for disease classification, based on mRNA profiling integrated with protein networks to identify subnetwork biomarkers (interconnected genes whose aggregate expression levels are predictive of disease state)
Subnetwork-based gene association – molecular networks will provide a powerful framework for mapping common pathway mechanisms affected by collection of genotypes (SNP, CNV)
Some good examples
An example with perhaps too many visual properties
Cool cell illustration
Mapping brain regions (not proteins!)
Cytoscape is often just one part of an analysis pipeline
“Cancer genes are central not only in terms of number of neighbors, but also in terms of the clusters they connect. Remarkably, we found that high Participation roles played by cancer genes are not explained by their higher number of interactions, but by their unique inter-modular connectivity patterns. We also found that these connectivity patterns differ for disease genes with contrasting inheritance modes: while autosomal dominant genes play high participation roles, autosomal recessive genes are more confined to their own modules. “
Replicates over 6 different interaction networks (HIPPIE18, BIANA40, BioGRID41, IntAct42, IrefIndex43 and from Human Interactome Project (HBI)19,44). Looked at ~2000 Mendelian disease genes and 781 “cancer driver genes.”
Note: all Cytoscape ORA tools provide a network visualization of terms and a ranked table with statistics
Last two:
- NetGestalt (used to align data in fashion of genome browsers)
- BioFrabric (used to organize/explore massive hairballs)
http://git.dhimmel.com/SIDER2/similarity/
EDIT: according to schedule
NOTE: Pre-acknowledgements slide
Notes: it is a Cytoscape goal that both the graph model and the data are free from Biological semantics
Screenshots: App Store front page, example app, and wall of apps
Note: mention increasing number of apps (and ports to 3.x)
Alternative: visit App Store live
272 apps – April 2016
Welcome!
- web services: a couple provided by default (more in App Store)
- preset networks from BioGRID per species (mouse over for details)
New interface!
- def, map, byp
- example continuous mapping panel
- menu of provided visual styles