SlideShare a Scribd company logo
1 of 45
Download to read offline
1
In modern systems biology we have three main data domains.
1) Experimental data from genomics types of experiments like in the example,
   (bottom right) microarrays. Note that this type requires intensive
   precalculations (quality control, filtering, clustering, annotation) but that is
   not enough to really understand the data. You see patterns in the data, but
   you do not really know what they mean. Large scale genomics data has
   been available over the pas 15 years or so, and although technologies
   used are now being replaced that doesn’t really change this field.
2) Existing knowledge (see next slide), that can be used to better understand
   the two other types of data
3) Genetics (sequence based) data that rapidly becomes more important with
   the decrease of sequencing cost. The addition of the leftmost corner to the
   triangle is relatively new, and I will only discuss it in the last few slides




                                                                                      2
Huge amounts of existing knowledge can be found hidden in the literature or in
the heads of people. The hard task is to collect it from there and to make it
available for analysis. (People on the slide are Ben van Ommen - NuGO
director, Hannelore Daniel – nutrigenomics chair from Munich and a Thai
Princess and institute director.


Note that a lot of information is also available in curated databases, but that
was left out of the talk for brevity reasons. You could say that structuring of the
other knowledge is needed to provide these databases that can then be used
for analysis.




                                                                                      3
An historical example of a microarray result. Again note the intensive
preprocessing done. (clustering to the left, annotation to the right).
Nevertheless the data is very hard to understand. Especially if you take into
account that there are about 20,000 genes on a typical array. About as much
as there are words in a dictionary.
But if you are willing to make the effort you can actually see meaningful groups
of genes within specific coexpression clusters. Like the fatty acid degradation
genes shown here. But it is hard to find (or easy to miss) all relevant pathways.
Probably not an iPAD, those microarrays were at least 10 years old.




                                                                      6
The problem is not only the long list of resulting genes, but also the
oversampling that occurs. In genomics experiments you typically get large
numbers of false positives at useful levels of significance. Of course false
discovery rate corrections exist but they will usually also loose information.


Pathway or function group (ontology) analysis helps since it is not likely that a
larger set of genes occur as false positives within a smaller functional group.


On the other hand the meaning of pathway statistics should not be
overestimated There are many aspects in real biology and in the way the
groups are build that influence the statistical outcome.


For instance when you have two metabolic reactions where one is catalyzed
by a single enzyme and the other by 4. Are all enzymes of the same
importance? Or are the four together as important as the single one? Or are 3
of the 4 not important in reality and the other one is? All these situations can
occur and the statistics just doesn’t know.


Also suppose you 10 non-regulated genes to a pathway. That will change
significance of your result, but it doesn’t change the biology behind it.



                                                                                    7
Example of a pathway that can be used for the purposes described.
A closer look at the same pathway.


Note that this uses MIM notation from the MIM PathVisio plugin.

In general the connections between different genes and metabolites describe
the network underlying the pathway. Note that this is already quite complex
since there are different ways to show what interacts with what.


Graphical methods to capture this like MIM and SBGN definitely help. The
result can be captures in descriptive relationships in BioPax,




                                                                              9
10
PathVisio can do a combined visualization of different omics results. Here
proteomics and transcriptomics both shown on the same gene product boxes.
It can also show effects from metabolomics.
Examples of pathways like we have them on wikipathways.org




                                                             12
This talk is not really about WikiPathways. Check out the information in the
paper or the information on the wiki itself. (www.wikipathways.org) developer
information is mainly on the www.pathvisio.org website.




                                                                                13
You obtain microarray data (e.g. affymetrix)
You can visualize micorarray data
Each color corresponds to a measured datapoint
For example, green is up, red is down, grey is constant


And now? How do you make sure the Affymetrix probeset IDs related to the
measurements can be mapped to the gene products in the pathway?




                                                                           14
On WikiPathways (or in pathvisio) you can attach identifiers to each gene. A
click opens up the corresponding page on (this specific case) the worm
database.
You can download the corresponding transcript sequence in two clicks
This makes it for instance really easy to design primers




                                                                               15
As soon as you have entered one (and only one) identifier to describe what
gene product or metabolite you really mean this information is linked to many
other identifiers from other databases and links to these respective pages are
shown in the so called “backpage” (actually one of the pages under the tabs at
the righthand side of the pathway).




                                                                                 16
BridgeDB (see www.bridgedb.org and the paper mentioned on the slide)
provides the mechanism needed for that identifier mapping.




                                                                       17
Pathways can be downloaded to be used in different tools.


There is also a wikipathway webservice. See:
http://www.wikipathways.org/index.php/Help:WikiPathways_Webservice
Thomas Kelder, Alexander R Pico, Kristina Hanspers, Chris Evelo & Bruce R
Conklin. Mining biological pathways using WikiPathways web services.
PLoS One (2009) 4: 7 e644. http://dx.doi.org/10.1371/journal.pone.0006447


We also have semantic output in RDF which can be queried through a
SPARQL endpoint described at semantics.bigcat.unimaas.nl.
Introducing a problem




                        19
And a solution that isn’t really a solution. There are just too many things you
could add.




                                                                                  20
The PathVisio Regulatory Interaction plugin (author Stefan van Helden) has a
new approach where information is not really added to a pathway, but shown
in a separate page upon request.




                                                                               21
The plugin can be found here:
http://chianti.ucsd.edu/cyto_web/plugins/displayplugininfo.php?name=GPML-Plugin


It can be used to read and write gpml pathway files used by WikiPathways and
PathVisio in Cytoscape




                                                                                  22
Example showing some more advanced usage of the GPML plugin.


Data from the NuGO proof of principle study with dietary challenged mice.
Three tissues were sampled and in the other two tissues relatively many
genes showed expression changes on Affymetrix arrays but not many
pathways were found.

For liver the number of genes affected was lower but the number of pathways
found to be affected was found to be higher (how come)?


The pathway based network analysis showed that there was a set of stronger
affected pathway (more reguated genes, large blue circles) that share
regulated genes (the red diamonds). When looking at the highlighted group of
pathways it became clear that these all belong to the same superste of
biologically relevant pathways (fatty acid metabolism and inflammation).




                                                                               23
A paper that we published with a more extensive pathway relationship
approach. It takes into account relations between pathways through affected
genes not necessarily showing up in either pathway.




                                                                              24
25
The approach takes into account all data use (pathways, interactions and
experimentally determined weight). Check out the original paper for details.




                                                                               26
Example result. Pathways with stronger interaction based on gene snot
present in them.




                                                                        27
And you can do the same for relatively large sets of pathways “driving” a
process like apoptosis.




                                                                            28
CyTargetLinker is a Cytoscape plugin that can be used to extend one network
with information about things targeting entities in that network from databases
that are created as a network. It already provides a number of target relation
databases as mentioned on the slide.




                                                                                  29
Example of a target network. (You will normally see this, it contains the
information that is used to extend your source network).




                                                                            30
And a more detailed view.




                            31
You can drive it from a gene set, that isn’t even a network at the start. But
when miRNAs are found to target more than one gene in the ggroup the
network is created on the fly.




                                                                                32
Or you can bootstrap the approach from an existing network. Which can be a
pathway based one imported with the GPML plugin like shown here.




                                                                             33
An overview of the Open Phacts project that pulls in lots of information in a
semantic web triple store (including information from WikiPathways RDF) and
then provides that for use in other tools. In WikiPathways we use that to
suggest possible pathway extensions to curators




                                                                                34
This show the PathVisio Loom plugin in action. A gene or metabolite in a
pathway under development (left side) is right clicked and the LOOM is
activated to pull related genes or metabolites from another resource
(database, text mining result or Open Phacts API). The suggested interactions
are shown in the window on the right and the entities are added to the pathway
(two already shown on the left).
Talk so far focused on the genomics-knowledge relationship shown on the
right, So what about genetics?




                                                                          36
37
This is the image was to us by Jim Kaput (at that time NTCR, now
Nestle).”Look people group those SNPs in gene groups, made sense of the
directions and showed them in a pathway. Can you do something like that?”




                                                                            38
In principle? Yes.




                     39
There are just too many SNPs for any given gene.




                                                   40
So it would really look like a bunch of jellies if we show these all on the genes
in a pathway, and you would not know what they mean.




                                                                                    41
There are loads of bioinformatics tools out there (like Sift and Polyphen) that
allow us to estimate functional effects of SNPs on coded protein (activity or
protein-protein interactions), binding site for transcription factors in the DNA, or
miRNA in RNA. Doing that we can decide what edges SNPs would affect (and
how much in what direction). Now as soon as you do that you can use the
result to strengthen SNP statistics (ie create groups that can be used for
supervised types of group based GWAS analysis) or to build predictive models
to estimate that specific (personal or tissue/tumor based) sets of variations
would do. That provides a need to use the pathways to link experimental
(genomics) data not only to the genetic variations occurring in there, but also
to modeling results




                                                                                       42
Showing the concept. Integrating flux predictions from modelling (of course
that could also be real fluxomics data)




                                                                              43
And showing “real” results from the new flux data representation plugin.
The plugin is functional but we still need better mapping databases for reaction
identifiers




                                                                                   44
Many people involved in this work. (Really many if you count associated
groups like the plugin developers, pathway curators etc).


Most important


SF group (Kristina Hanspers, Bruce Conklin and Alex Pico) collaborating on
many things but primarily WikiPatwhays
Martijn van Iersel top left (PathVisio, BridgeDB). Thomas Kelder (top middle)
(WikiPathways including webservices, pathway integration networks for
nutrigenomics), Martina Kutmon (top right) (CyTargetLinker, PathVisio further
development), Andra Waagmeester (second row, right) (WikiPathways RDF),
Anwesha Dutta (bottom, 2nd from the left) (flux visualization), Stefan van
Helden (not on the picture) for the RI PathVisio plugin




                                                                                45

More Related Content

What's hot

NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonAlexander Pico
 
NetBioSIG2014-Talk by Hyunghoon Cho
NetBioSIG2014-Talk by Hyunghoon ChoNetBioSIG2014-Talk by Hyunghoon Cho
NetBioSIG2014-Talk by Hyunghoon ChoAlexander Pico
 
NetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicNetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicAlexander Pico
 
NRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallNRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallAlexander Pico
 
NetBioSIG2013-Talk Vuk Janjic
NetBioSIG2013-Talk Vuk JanjicNetBioSIG2013-Talk Vuk Janjic
NetBioSIG2013-Talk Vuk JanjicAlexander Pico
 
NRNB Annual Report 2011
NRNB Annual Report 2011NRNB Annual Report 2011
NRNB Annual Report 2011Alexander Pico
 
NRNB Annual Report 2018
NRNB Annual Report 2018NRNB Annual Report 2018
NRNB Annual Report 2018Alexander Pico
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksAlexander Pico
 
NetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbioNetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbioAlexander Pico
 
NRNB Annual Report 2012
NRNB Annual Report 2012NRNB Annual Report 2012
NRNB Annual Report 2012Alexander Pico
 
NetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderNetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderAlexander Pico
 
NetBioSIG2013-Talk David Amar
NetBioSIG2013-Talk David AmarNetBioSIG2013-Talk David Amar
NetBioSIG2013-Talk David AmarAlexander Pico
 
NRNB Annual Report 2013
NRNB Annual Report 2013NRNB Annual Report 2013
NRNB Annual Report 2013Alexander Pico
 
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Anubis Hosein
 
NRNB Annual Report 2017
NRNB Annual Report 2017NRNB Annual Report 2017
NRNB Annual Report 2017Alexander Pico
 
Community Finding with Applications on Phylogenetic Networks [Extended Abstract]
Community Finding with Applications on Phylogenetic Networks [Extended Abstract]Community Finding with Applications on Phylogenetic Networks [Extended Abstract]
Community Finding with Applications on Phylogenetic Networks [Extended Abstract]Luís Rita
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarAlexander Pico
 
NetBioSIG2013-KEYNOTE Natasa Przulj
NetBioSIG2013-KEYNOTE Natasa PrzuljNetBioSIG2013-KEYNOTE Natasa Przulj
NetBioSIG2013-KEYNOTE Natasa PrzuljAlexander Pico
 

What's hot (20)

NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina Kutmon
 
NetBioSIG2014-Talk by Hyunghoon Cho
NetBioSIG2014-Talk by Hyunghoon ChoNetBioSIG2014-Talk by Hyunghoon Cho
NetBioSIG2014-Talk by Hyunghoon Cho
 
NetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana MilenkovicNetBioSIG2013-Talk Tijana Milenkovic
NetBioSIG2013-Talk Tijana Milenkovic
 
NRNB Annual Report 2016: Overall
NRNB Annual Report 2016: OverallNRNB Annual Report 2016: Overall
NRNB Annual Report 2016: Overall
 
NetBioSIG2013-Talk Vuk Janjic
NetBioSIG2013-Talk Vuk JanjicNetBioSIG2013-Talk Vuk Janjic
NetBioSIG2013-Talk Vuk Janjic
 
NRNB Annual Report 2011
NRNB Annual Report 2011NRNB Annual Report 2011
NRNB Annual Report 2011
 
NRNB Annual Report 2018
NRNB Annual Report 2018NRNB Annual Report 2018
NRNB Annual Report 2018
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive Networks
 
NetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbioNetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbio
 
NRNB Annual Report 2012
NRNB Annual Report 2012NRNB Annual Report 2012
NRNB Annual Report 2012
 
NetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderNetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas Kelder
 
NRNB EAC Meeting 2012
NRNB EAC Meeting 2012NRNB EAC Meeting 2012
NRNB EAC Meeting 2012
 
NetBioSIG2013-Talk David Amar
NetBioSIG2013-Talk David AmarNetBioSIG2013-Talk David Amar
NetBioSIG2013-Talk David Amar
 
NRNB Annual Report 2013
NRNB Annual Report 2013NRNB Annual Report 2013
NRNB Annual Report 2013
 
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
 
NRNB Annual Report 2017
NRNB Annual Report 2017NRNB Annual Report 2017
NRNB Annual Report 2017
 
Community Finding with Applications on Phylogenetic Networks [Extended Abstract]
Community Finding with Applications on Phylogenetic Networks [Extended Abstract]Community Finding with Applications on Phylogenetic Networks [Extended Abstract]
Community Finding with Applications on Phylogenetic Networks [Extended Abstract]
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
 
NetBioSIG2013-KEYNOTE Natasa Przulj
NetBioSIG2013-KEYNOTE Natasa PrzuljNetBioSIG2013-KEYNOTE Natasa Przulj
NetBioSIG2013-KEYNOTE Natasa Przulj
 

Similar to NetBioSIG2012 chrisevelo

Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkLisa Muthukumar
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysisDr. Naveen Gaurav srivastava
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsmikaelhuss
 
2013 nas-ehs-data-integration-dc
2013 nas-ehs-data-integration-dc2013 nas-ehs-data-integration-dc
2013 nas-ehs-data-integration-dcc.titus.brown
 
2014 marine-microbes-grc
2014 marine-microbes-grc2014 marine-microbes-grc
2014 marine-microbes-grcc.titus.brown
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsmikaelhuss
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...rahulmonikasharma
 
A consistent and efficient graphical User Interface Design and Querying Organ...
A consistent and efficient graphical User Interface Design and Querying Organ...A consistent and efficient graphical User Interface Design and Querying Organ...
A consistent and efficient graphical User Interface Design and Querying Organ...CSCJournals
 
sunny field project.pptx
sunny field project.pptxsunny field project.pptx
sunny field project.pptxgauravShubh
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONcsandit
 
Collaboration with GeneGo provides seamless access to compound databases, pat...
Collaboration with GeneGo provides seamless access to compound databases, pat...Collaboration with GeneGo provides seamless access to compound databases, pat...
Collaboration with GeneGo provides seamless access to compound databases, pat...Craig Morgan NZCS, MBA (Hons), PMP
 

Similar to NetBioSIG2012 chrisevelo (20)

B.3.5
B.3.5B.3.5
B.3.5
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration Framework
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
FAIR data management in biomedicine
FAIR data management  in biomedicineFAIR data management  in biomedicine
FAIR data management in biomedicine
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research  Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research
 
2013 nas-ehs-data-integration-dc
2013 nas-ehs-data-integration-dc2013 nas-ehs-data-integration-dc
2013 nas-ehs-data-integration-dc
 
2014 marine-microbes-grc
2014 marine-microbes-grc2014 marine-microbes-grc
2014 marine-microbes-grc
 
Bioinformatics-2009-Moura-1096-8
Bioinformatics-2009-Moura-1096-8Bioinformatics-2009-Moura-1096-8
Bioinformatics-2009-Moura-1096-8
 
String.pptx
String.pptxString.pptx
String.pptx
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomics
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
 
A consistent and efficient graphical User Interface Design and Querying Organ...
A consistent and efficient graphical User Interface Design and Querying Organ...A consistent and efficient graphical User Interface Design and Querying Organ...
A consistent and efficient graphical User Interface Design and Querying Organ...
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
sunny field project.pptx
sunny field project.pptxsunny field project.pptx
sunny field project.pptx
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSIONCOMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
 
Phylogenetics
PhylogeneticsPhylogenetics
Phylogenetics
 
Collaboration with GeneGo provides seamless access to compound databases, pat...
Collaboration with GeneGo provides seamless access to compound databases, pat...Collaboration with GeneGo provides seamless access to compound databases, pat...
Collaboration with GeneGo provides seamless access to compound databases, pat...
 

More from Alexander Pico

2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 TutorialAlexander Pico
 
2015 Cytoscape 3.2 Tutorial
2015 Cytoscape 3.2 Tutorial2015 Cytoscape 3.2 Tutorial
2015 Cytoscape 3.2 TutorialAlexander Pico
 
NetBioSIG2014-FlashJournalClub by Frank Kramer
NetBioSIG2014-FlashJournalClub by Frank KramerNetBioSIG2014-FlashJournalClub by Frank Kramer
NetBioSIG2014-FlashJournalClub by Frank KramerAlexander Pico
 
NetBioSIG2014-Talk by Salvatore Loguercio
NetBioSIG2014-Talk by Salvatore LoguercioNetBioSIG2014-Talk by Salvatore Loguercio
NetBioSIG2014-Talk by Salvatore LoguercioAlexander Pico
 
NetBioSIG2014-Intro by Alex Pico
NetBioSIG2014-Intro by Alex PicoNetBioSIG2014-Intro by Alex Pico
NetBioSIG2014-Intro by Alex PicoAlexander Pico
 
NetBioSIG2014-Talk by Traver Hart
NetBioSIG2014-Talk by Traver HartNetBioSIG2014-Talk by Traver Hart
NetBioSIG2014-Talk by Traver HartAlexander Pico
 
NetBioSIG2014-Talk by Yu Xia
NetBioSIG2014-Talk by Yu XiaNetBioSIG2014-Talk by Yu Xia
NetBioSIG2014-Talk by Yu XiaAlexander Pico
 
NetBioSIG2014-Keynote by Marian Walhout
NetBioSIG2014-Keynote by Marian WalhoutNetBioSIG2014-Keynote by Marian Walhout
NetBioSIG2014-Keynote by Marian WalhoutAlexander Pico
 
NetBioSIG2014-Talk by Ashwini Patil
NetBioSIG2014-Talk by Ashwini PatilNetBioSIG2014-Talk by Ashwini Patil
NetBioSIG2014-Talk by Ashwini PatilAlexander Pico
 
NetBioSIG2014-Talk by Gerald Quon
NetBioSIG2014-Talk by Gerald QuonNetBioSIG2014-Talk by Gerald Quon
NetBioSIG2014-Talk by Gerald QuonAlexander Pico
 
Visualization and Analysis of Dynamic Networks
Visualization and Analysis of Dynamic Networks Visualization and Analysis of Dynamic Networks
Visualization and Analysis of Dynamic Networks Alexander Pico
 
Introduction to WikiPathways
Introduction to WikiPathwaysIntroduction to WikiPathways
Introduction to WikiPathwaysAlexander Pico
 
Network Visualization and Analysis with Cytoscape
Network Visualization and Analysis with CytoscapeNetwork Visualization and Analysis with Cytoscape
Network Visualization and Analysis with CytoscapeAlexander Pico
 
NetBioSIG2013-KEYNOTE Michael Schroeder
NetBioSIG2013-KEYNOTE Michael SchroederNetBioSIG2013-KEYNOTE Michael Schroeder
NetBioSIG2013-KEYNOTE Michael SchroederAlexander Pico
 
NetBioSIG2013-KEYNOTE Stefan Schuster
NetBioSIG2013-KEYNOTE Stefan SchusterNetBioSIG2013-KEYNOTE Stefan Schuster
NetBioSIG2013-KEYNOTE Stefan SchusterAlexander Pico
 
NetBioSIG2013-KEYNOTE Esti Yeger-Lotem
NetBioSIG2013-KEYNOTE Esti Yeger-LotemNetBioSIG2013-KEYNOTE Esti Yeger-Lotem
NetBioSIG2013-KEYNOTE Esti Yeger-LotemAlexander Pico
 

More from Alexander Pico (16)

2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial
 
2015 Cytoscape 3.2 Tutorial
2015 Cytoscape 3.2 Tutorial2015 Cytoscape 3.2 Tutorial
2015 Cytoscape 3.2 Tutorial
 
NetBioSIG2014-FlashJournalClub by Frank Kramer
NetBioSIG2014-FlashJournalClub by Frank KramerNetBioSIG2014-FlashJournalClub by Frank Kramer
NetBioSIG2014-FlashJournalClub by Frank Kramer
 
NetBioSIG2014-Talk by Salvatore Loguercio
NetBioSIG2014-Talk by Salvatore LoguercioNetBioSIG2014-Talk by Salvatore Loguercio
NetBioSIG2014-Talk by Salvatore Loguercio
 
NetBioSIG2014-Intro by Alex Pico
NetBioSIG2014-Intro by Alex PicoNetBioSIG2014-Intro by Alex Pico
NetBioSIG2014-Intro by Alex Pico
 
NetBioSIG2014-Talk by Traver Hart
NetBioSIG2014-Talk by Traver HartNetBioSIG2014-Talk by Traver Hart
NetBioSIG2014-Talk by Traver Hart
 
NetBioSIG2014-Talk by Yu Xia
NetBioSIG2014-Talk by Yu XiaNetBioSIG2014-Talk by Yu Xia
NetBioSIG2014-Talk by Yu Xia
 
NetBioSIG2014-Keynote by Marian Walhout
NetBioSIG2014-Keynote by Marian WalhoutNetBioSIG2014-Keynote by Marian Walhout
NetBioSIG2014-Keynote by Marian Walhout
 
NetBioSIG2014-Talk by Ashwini Patil
NetBioSIG2014-Talk by Ashwini PatilNetBioSIG2014-Talk by Ashwini Patil
NetBioSIG2014-Talk by Ashwini Patil
 
NetBioSIG2014-Talk by Gerald Quon
NetBioSIG2014-Talk by Gerald QuonNetBioSIG2014-Talk by Gerald Quon
NetBioSIG2014-Talk by Gerald Quon
 
Visualization and Analysis of Dynamic Networks
Visualization and Analysis of Dynamic Networks Visualization and Analysis of Dynamic Networks
Visualization and Analysis of Dynamic Networks
 
Introduction to WikiPathways
Introduction to WikiPathwaysIntroduction to WikiPathways
Introduction to WikiPathways
 
Network Visualization and Analysis with Cytoscape
Network Visualization and Analysis with CytoscapeNetwork Visualization and Analysis with Cytoscape
Network Visualization and Analysis with Cytoscape
 
NetBioSIG2013-KEYNOTE Michael Schroeder
NetBioSIG2013-KEYNOTE Michael SchroederNetBioSIG2013-KEYNOTE Michael Schroeder
NetBioSIG2013-KEYNOTE Michael Schroeder
 
NetBioSIG2013-KEYNOTE Stefan Schuster
NetBioSIG2013-KEYNOTE Stefan SchusterNetBioSIG2013-KEYNOTE Stefan Schuster
NetBioSIG2013-KEYNOTE Stefan Schuster
 
NetBioSIG2013-KEYNOTE Esti Yeger-Lotem
NetBioSIG2013-KEYNOTE Esti Yeger-LotemNetBioSIG2013-KEYNOTE Esti Yeger-Lotem
NetBioSIG2013-KEYNOTE Esti Yeger-Lotem
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 

NetBioSIG2012 chrisevelo

  • 1. 1
  • 2. In modern systems biology we have three main data domains. 1) Experimental data from genomics types of experiments like in the example, (bottom right) microarrays. Note that this type requires intensive precalculations (quality control, filtering, clustering, annotation) but that is not enough to really understand the data. You see patterns in the data, but you do not really know what they mean. Large scale genomics data has been available over the pas 15 years or so, and although technologies used are now being replaced that doesn’t really change this field. 2) Existing knowledge (see next slide), that can be used to better understand the two other types of data 3) Genetics (sequence based) data that rapidly becomes more important with the decrease of sequencing cost. The addition of the leftmost corner to the triangle is relatively new, and I will only discuss it in the last few slides 2
  • 3. Huge amounts of existing knowledge can be found hidden in the literature or in the heads of people. The hard task is to collect it from there and to make it available for analysis. (People on the slide are Ben van Ommen - NuGO director, Hannelore Daniel – nutrigenomics chair from Munich and a Thai Princess and institute director. Note that a lot of information is also available in curated databases, but that was left out of the talk for brevity reasons. You could say that structuring of the other knowledge is needed to provide these databases that can then be used for analysis. 3
  • 4. An historical example of a microarray result. Again note the intensive preprocessing done. (clustering to the left, annotation to the right). Nevertheless the data is very hard to understand. Especially if you take into account that there are about 20,000 genes on a typical array. About as much as there are words in a dictionary.
  • 5. But if you are willing to make the effort you can actually see meaningful groups of genes within specific coexpression clusters. Like the fatty acid degradation genes shown here. But it is hard to find (or easy to miss) all relevant pathways.
  • 6. Probably not an iPAD, those microarrays were at least 10 years old. 6
  • 7. The problem is not only the long list of resulting genes, but also the oversampling that occurs. In genomics experiments you typically get large numbers of false positives at useful levels of significance. Of course false discovery rate corrections exist but they will usually also loose information. Pathway or function group (ontology) analysis helps since it is not likely that a larger set of genes occur as false positives within a smaller functional group. On the other hand the meaning of pathway statistics should not be overestimated There are many aspects in real biology and in the way the groups are build that influence the statistical outcome. For instance when you have two metabolic reactions where one is catalyzed by a single enzyme and the other by 4. Are all enzymes of the same importance? Or are the four together as important as the single one? Or are 3 of the 4 not important in reality and the other one is? All these situations can occur and the statistics just doesn’t know. Also suppose you 10 non-regulated genes to a pathway. That will change significance of your result, but it doesn’t change the biology behind it. 7
  • 8. Example of a pathway that can be used for the purposes described.
  • 9. A closer look at the same pathway. Note that this uses MIM notation from the MIM PathVisio plugin. In general the connections between different genes and metabolites describe the network underlying the pathway. Note that this is already quite complex since there are different ways to show what interacts with what. Graphical methods to capture this like MIM and SBGN definitely help. The result can be captures in descriptive relationships in BioPax, 9
  • 10. 10
  • 11. PathVisio can do a combined visualization of different omics results. Here proteomics and transcriptomics both shown on the same gene product boxes. It can also show effects from metabolomics.
  • 12. Examples of pathways like we have them on wikipathways.org 12
  • 13. This talk is not really about WikiPathways. Check out the information in the paper or the information on the wiki itself. (www.wikipathways.org) developer information is mainly on the www.pathvisio.org website. 13
  • 14. You obtain microarray data (e.g. affymetrix) You can visualize micorarray data Each color corresponds to a measured datapoint For example, green is up, red is down, grey is constant And now? How do you make sure the Affymetrix probeset IDs related to the measurements can be mapped to the gene products in the pathway? 14
  • 15. On WikiPathways (or in pathvisio) you can attach identifiers to each gene. A click opens up the corresponding page on (this specific case) the worm database. You can download the corresponding transcript sequence in two clicks This makes it for instance really easy to design primers 15
  • 16. As soon as you have entered one (and only one) identifier to describe what gene product or metabolite you really mean this information is linked to many other identifiers from other databases and links to these respective pages are shown in the so called “backpage” (actually one of the pages under the tabs at the righthand side of the pathway). 16
  • 17. BridgeDB (see www.bridgedb.org and the paper mentioned on the slide) provides the mechanism needed for that identifier mapping. 17
  • 18. Pathways can be downloaded to be used in different tools. There is also a wikipathway webservice. See: http://www.wikipathways.org/index.php/Help:WikiPathways_Webservice Thomas Kelder, Alexander R Pico, Kristina Hanspers, Chris Evelo & Bruce R Conklin. Mining biological pathways using WikiPathways web services. PLoS One (2009) 4: 7 e644. http://dx.doi.org/10.1371/journal.pone.0006447 We also have semantic output in RDF which can be queried through a SPARQL endpoint described at semantics.bigcat.unimaas.nl.
  • 20. And a solution that isn’t really a solution. There are just too many things you could add. 20
  • 21. The PathVisio Regulatory Interaction plugin (author Stefan van Helden) has a new approach where information is not really added to a pathway, but shown in a separate page upon request. 21
  • 22. The plugin can be found here: http://chianti.ucsd.edu/cyto_web/plugins/displayplugininfo.php?name=GPML-Plugin It can be used to read and write gpml pathway files used by WikiPathways and PathVisio in Cytoscape 22
  • 23. Example showing some more advanced usage of the GPML plugin. Data from the NuGO proof of principle study with dietary challenged mice. Three tissues were sampled and in the other two tissues relatively many genes showed expression changes on Affymetrix arrays but not many pathways were found. For liver the number of genes affected was lower but the number of pathways found to be affected was found to be higher (how come)? The pathway based network analysis showed that there was a set of stronger affected pathway (more reguated genes, large blue circles) that share regulated genes (the red diamonds). When looking at the highlighted group of pathways it became clear that these all belong to the same superste of biologically relevant pathways (fatty acid metabolism and inflammation). 23
  • 24. A paper that we published with a more extensive pathway relationship approach. It takes into account relations between pathways through affected genes not necessarily showing up in either pathway. 24
  • 25. 25
  • 26. The approach takes into account all data use (pathways, interactions and experimentally determined weight). Check out the original paper for details. 26
  • 27. Example result. Pathways with stronger interaction based on gene snot present in them. 27
  • 28. And you can do the same for relatively large sets of pathways “driving” a process like apoptosis. 28
  • 29. CyTargetLinker is a Cytoscape plugin that can be used to extend one network with information about things targeting entities in that network from databases that are created as a network. It already provides a number of target relation databases as mentioned on the slide. 29
  • 30. Example of a target network. (You will normally see this, it contains the information that is used to extend your source network). 30
  • 31. And a more detailed view. 31
  • 32. You can drive it from a gene set, that isn’t even a network at the start. But when miRNAs are found to target more than one gene in the ggroup the network is created on the fly. 32
  • 33. Or you can bootstrap the approach from an existing network. Which can be a pathway based one imported with the GPML plugin like shown here. 33
  • 34. An overview of the Open Phacts project that pulls in lots of information in a semantic web triple store (including information from WikiPathways RDF) and then provides that for use in other tools. In WikiPathways we use that to suggest possible pathway extensions to curators 34
  • 35. This show the PathVisio Loom plugin in action. A gene or metabolite in a pathway under development (left side) is right clicked and the LOOM is activated to pull related genes or metabolites from another resource (database, text mining result or Open Phacts API). The suggested interactions are shown in the window on the right and the entities are added to the pathway (two already shown on the left).
  • 36. Talk so far focused on the genomics-knowledge relationship shown on the right, So what about genetics? 36
  • 37. 37
  • 38. This is the image was to us by Jim Kaput (at that time NTCR, now Nestle).”Look people group those SNPs in gene groups, made sense of the directions and showed them in a pathway. Can you do something like that?” 38
  • 40. There are just too many SNPs for any given gene. 40
  • 41. So it would really look like a bunch of jellies if we show these all on the genes in a pathway, and you would not know what they mean. 41
  • 42. There are loads of bioinformatics tools out there (like Sift and Polyphen) that allow us to estimate functional effects of SNPs on coded protein (activity or protein-protein interactions), binding site for transcription factors in the DNA, or miRNA in RNA. Doing that we can decide what edges SNPs would affect (and how much in what direction). Now as soon as you do that you can use the result to strengthen SNP statistics (ie create groups that can be used for supervised types of group based GWAS analysis) or to build predictive models to estimate that specific (personal or tissue/tumor based) sets of variations would do. That provides a need to use the pathways to link experimental (genomics) data not only to the genetic variations occurring in there, but also to modeling results 42
  • 43. Showing the concept. Integrating flux predictions from modelling (of course that could also be real fluxomics data) 43
  • 44. And showing “real” results from the new flux data representation plugin. The plugin is functional but we still need better mapping databases for reaction identifiers 44
  • 45. Many people involved in this work. (Really many if you count associated groups like the plugin developers, pathway curators etc). Most important SF group (Kristina Hanspers, Bruce Conklin and Alex Pico) collaborating on many things but primarily WikiPatwhays Martijn van Iersel top left (PathVisio, BridgeDB). Thomas Kelder (top middle) (WikiPathways including webservices, pathway integration networks for nutrigenomics), Martina Kutmon (top right) (CyTargetLinker, PathVisio further development), Andra Waagmeester (second row, right) (WikiPathways RDF), Anwesha Dutta (bottom, 2nd from the left) (flux visualization), Stefan van Helden (not on the picture) for the RI PathVisio plugin 45