SlideShare a Scribd company logo
1 of 30
SEEK for Science:
A Data Management Platform
to support Open and
Reproducible Science
Professor Carole Goble
The University of Manchester
UK
BOSC 2014, 12th July 2014
Hypothesis
Generation
Public Data
Acquisition
Experiment
and Data
Generation
Public Data
Acquisition
Model
Analysis
Biological insight
Biological insight
Experiment
Analysis
Model
Validation
Model
Construction
Public Data
Acquisition
Public Data
Acquisition
Modelling
Experimental
Systems
Biology
Sponsors and Motivation
• BMBF “Großprojekt“
• ~45 organisations
• ~70 groups
• multiscale rep of the liver
• multiscale data, models
• imaging data
• EU ERANet programme
• 122 organisations
• 16 multi-inst. consortia
• independent projects in a
two-round funding initiative
Funders
• Preserve results beyond projects.
• Organise & link data, models,
processes.
• Exchange & search initiative‘s
assets.
• Share & disseminate results
• Improve standard curation
practice.
• Pool capacities.
• Handle home-brewed solutions
with mixed resourcing and no
access
People
• Dynamic distributed groups of
experimentalists and modellers
• Cherished own home-grown
and unstable data solutions
– wikis, CMS, databases,
spreadsheets, files.
• Access & visibility control over
shared content
Content
• Locally hosted private repositories
• Public archives
• From single-cell to human
• Samples, Specimens, Standard Op
Procedures
• Small Data: Reactome…: files,
spreadsheets
• Big Data: NGS, Mass Spec…:
Specialist repositories, files
• Models: ODE, SBML, Native Matlab,
PDE, Multi-scale
• In progress: versioning, track provenance and parameters
• Published: citation, links to publications
Cataloguing
7
Find my peers
Creating and sharing
SOPs across projects
Track my
specimens
yellow pages, manage SOPs and link them to investigations, studies, assays,
specimens and samples
Browse experimental data without
downloading them
How data, models and SOPs fit
together
Which data belong to which
publication
Data viewing functionality
ISA: Link Studies to their data, models,
SOPs, samples, publications
Track different
versions of my
model
The Web-based SEEK Platform
Ruby on RAILS 3.2, BSD,
https://bitbucket.org/seek4science/seek
https://seek.sysmo-db.org/models/114
http://www.seek4science.org
Data
Models
Articles
External
Databases
http://www.seek4science.org
Metadata
http://www.isatools.org
Aggregated Asset Infrastructure….
share and interlinking multi-stewarded, mixed,
methods, models, data, samples…
A Commons….
simulate models
project mgt,
access control
reporting, citation
governance &
policies
yellow pages
of peers
projects,
experts
catalogue, link and
index data, models,
samples, specimens,
sops, experiments,
publications using
standards
curate &
annotate data
and models using
standards with
compliance tools
incorporate public
data and model
repositories & tools
deposition
manage, store and
exchange different
types and scales of
data
Reproducibility Score Card
integrate local and
project tools and
data systems
scaled-out
collection &
analytics using third
party platforms
differentiate
construction,
validation &
predicted data
Yellow Pages InstitutionsProjectsPeople
ISA
Investigation
Study
Assay
Asset Catalogue
Models
Datafiles
SOPs
Publications
TagsVersions
Access Privileges
PresentationsEvents
Datafiles Models SOPs
JERMExtract,Harvest,Index
APIsandLinks
BioModels
CheBI
BioPortal
PubMed
JWS Online
GEO
SABIO-RK
Web Interface REST API
Local SEEK
Wikis
CMS
Own DB
Direct Upload
Project DM External
SEEK
OpenBIS
• Gateway plugin framework
– Tight and loose coupling
– RAILS plugin or bundled GEM
• Metadata framework
– JERM and ISA
• Different instances
– Single query across all model
repositories
– One click deposition
BioModels
Plug-in, Play nice, Don’t reinvent
Data….
• Public and new data
• Factors studied
– Linked -> SABIO-RK and ChEBI
• Samples and Specimens
– Extends EBI/NCBI BioSamples
• Treatment Extraction
• Tagging with vocabularies
• Spreadsheet-based data-view
• Big Data
– Upload and by email, Share by
trusted link, Link to external
repository
• Access
– DOIs and Temp links for reviews
Cytoscape
Repositories
• Biomodels, JWS Online,
local SEEK
JWS Online Simulator
• SBML support
• Auto generation of SBGN
schemas for user models
• SED-ML export
DataFuse
• Link and compare
construction and validation
data with models
• Run models with
parameter values from
spreadsheets
Models….
Models
Exchange
Experiment Data
Exchange
Exchange
Exchange
Verification
Comparison
Just
Enough
Results
Model
ISA-TAB
SBML
MIRIAM
SBGN
SemanticSBML
CellML
Construction
Prediction
MIBBI Standards
OBO Controlled
Vocabularies
SED-ML
Simulation Experiment Description Markup Language
Standard Formats and Vocabularies
Standards, Structure, Interlink
Construction Validation
Metabolomics
Metabolomics
Mass
SpecTranscriptomics
Proteomics
Fluxomics
Investigations
Studies
Assays
Towards Interoperable
Bioscience Data, Nature
Genetics, 2012
Assays
Just Enough
Results Model
Describes and
enriches the
relationships
between things
produced and
used in
experiments.
http://bioportal.bioontology.org/ontologies/JERM
reuse
community
ontologies,
markups, mim,
identifiers
metadata sheets
sample sheets
data sheets
indexes
http://rightfield.org.uk/
Just Enough
Results Model
Describes and
enriches the
relationships
between things
produced and
used in
experiments.
http://bioportal.bioontology.org/ontologies/JERM
reuse
community
ontologies,
markups, mim,
identifiers
Different types of data
Plugins to registered
data repositories
Extract and
auto-
catalogue
metadata
Define
relationships,
cross-link,
aggregate, query
standard based
templates
non-standard
templates
Open Modelling
Exchange Format
archive
Sys Bio Research Objects
portable packaged research
Adobe UCF
Research Object
Bundle
ORE PROVODF
• Aggregation
• Annotations/provenance
• Ad-hoc domain-specific
specification
OMEX archive
Systems Biology:
A common archive
format for reuse across
tools
http://www.researchobject.org
Reproducible (Open?) Research
Data sharing,
openness and
careers
incentive
See Titus and Phil talks
Open Research: Research Groups & Lifecycles
• Sharing policy
• Visibility, Downloadability
• Fine grained permissions
• Protocols for
– Management transfer
– Visibility feedback and sharing
workflows
– Publication data deposition in
external public stores
– Batch publishing
Within Project
Versions
Retractions
Across Projects
Versions
Public
Final version
No Retraction
Manager
Owner
Gatekeeper
Open Source
Customisable Platform
https://bitbucket.org/seek4science/seek
Vrije Universiteit, Amsterdam
Systems
Science for
Health (SSfH)
MACS
Yeast
Glycolysis
Open Source
Customisable Platform
https://bitbucket.org/seek4science/seek
Open Facility for European Systems
Biology data & model management
seeded by EU programmes
• Platform
– SEEK + openBIS + new features & styling
• Resource
– EuroSEEK + pool of community resources
(including established SEEKs).
– Independent researchers. Secure data.
• Facility
– Curation & support services, training
http://fair-dom.org/
Open Facility for European Systems
Biology data & model management
seeded by EU programmes
• Community
– workshops, user and developer forums,
knowledge network, standards & policy,
training, FAIRDOM Foundation, Model
Carpentry.
• Sys Bio Developers Foundry workshop
6-7 October Heidelberg
http://fair-dom.org/wiki/Foundry_workshop
• RI
– working with other EU RIs, an EU network of
national facilities, funding models.
http://fair-dom.org/
Carole Goble
Stuart
Owen
Jacky Snoep
Wolfgang
Mueller
Olga Krebs Quyen Nguyen
Natalie
Stanfor
d
Katy WolstencrofPeter Kunszt Bernd Rinn
also contributing
VLN SEEK team
also contributing
UK SEEK team

More Related Content

What's hot

Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
Reproducible research: First steps.
Reproducible research: First steps. Reproducible research: First steps.
Reproducible research: First steps. Richard Layton
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 
Research Objects for FAIRer Science
Research Objects for FAIRer Science Research Objects for FAIRer Science
Research Objects for FAIRer Science Carole Goble
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theoryC. Tobin Magle
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)Carole Goble
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceRaul Palma
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceCarole Goble
 
The "social" side of digital science
The "social" side of digital scienceThe "social" side of digital science
The "social" side of digital scienceKaitlin Thaney
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Sciencedgarijo
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...Carole Goble
 
Reproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An OverviewReproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An Overviewdgarijo
 

What's hot (20)

Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
Reproducible research: First steps.
Reproducible research: First steps. Reproducible research: First steps.
Reproducible research: First steps.
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
Research Objects for FAIRer Science
Research Objects for FAIRer Science Research Objects for FAIRer Science
Research Objects for FAIRer Science
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theory
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth Science
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
 
The "social" side of digital science
The "social" side of digital scienceThe "social" side of digital science
The "social" side of digital science
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
 
Beyond the PDF 2, 2013
Beyond the PDF 2, 2013Beyond the PDF 2, 2013
Beyond the PDF 2, 2013
 
NETTAB 2013
NETTAB 2013NETTAB 2013
NETTAB 2013
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 
NETTAB 2012
NETTAB 2012NETTAB 2012
NETTAB 2012
 
CSHALS 2013
CSHALS 2013CSHALS 2013
CSHALS 2013
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
 
Reproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An OverviewReproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An Overview
 

Similar to SEEK for Science: A Data and Model Management Platform to support Open and Reproducible Science in Systems Biology

Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOMCarole Goble
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...Carole Goble
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseAnita de Waard
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData ManagementUlrike Wittig
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataSusanna-Assunta Sansone
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIRDOM
 
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011Lee Dirks
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...Sarah Anna Stewart
 
Effective research data management
Effective research data managementEffective research data management
Effective research data managementCatherine Gold
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objectsseanb
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...Open Science Fair
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIRDOM
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)Carole Goble
 
Research Shared: researchobject.org
Research Shared: researchobject.orgResearch Shared: researchobject.org
Research Shared: researchobject.orgNorman Morrison
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...Alejandra Gonzalez-Beltran
 

Similar to SEEK for Science: A Data and Model Management Platform to support Open and Reproducible Science in Systems Biology (20)

Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
 
FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData Management
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.
 
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
Enhance your rese​arch impact through open science
Enhance your rese​arch impact through open scienceEnhance your rese​arch impact through open science
Enhance your rese​arch impact through open science
 
Model management for systems biology projects
Model management for systems biology projectsModel management for systems biology projects
Model management for systems biology projects
 
Effective research data management
Effective research data managementEffective research data management
Effective research data management
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objects
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
Research Shared: researchobject.org
Research Shared: researchobject.orgResearch Shared: researchobject.org
Research Shared: researchobject.org
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
 

More from Carole Goble

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...Carole Goble
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...Carole Goble
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsCarole Goble
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a VillageCarole Goble
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learningCarole Goble
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects Carole Goble
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)Carole Goble
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpCarole Goble
 

More from Carole Goble (20)

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital Objects
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a Village
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learning
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can help
 

Recently uploaded

What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfkalichargn70th171
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsChristian Birchler
 
cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptrcbcrtm
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 

Recently uploaded (20)

Odoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting ServiceOdoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting Service
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
 
cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.ppt
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 

SEEK for Science: A Data and Model Management Platform to support Open and Reproducible Science in Systems Biology

  • 1. SEEK for Science: A Data Management Platform to support Open and Reproducible Science Professor Carole Goble The University of Manchester UK BOSC 2014, 12th July 2014
  • 2. Hypothesis Generation Public Data Acquisition Experiment and Data Generation Public Data Acquisition Model Analysis Biological insight Biological insight Experiment Analysis Model Validation Model Construction Public Data Acquisition Public Data Acquisition Modelling Experimental Systems Biology
  • 3. Sponsors and Motivation • BMBF “Großprojekt“ • ~45 organisations • ~70 groups • multiscale rep of the liver • multiscale data, models • imaging data • EU ERANet programme • 122 organisations • 16 multi-inst. consortia • independent projects in a two-round funding initiative
  • 4. Funders • Preserve results beyond projects. • Organise & link data, models, processes. • Exchange & search initiative‘s assets. • Share & disseminate results • Improve standard curation practice. • Pool capacities. • Handle home-brewed solutions with mixed resourcing and no access
  • 5. People • Dynamic distributed groups of experimentalists and modellers • Cherished own home-grown and unstable data solutions – wikis, CMS, databases, spreadsheets, files. • Access & visibility control over shared content
  • 6. Content • Locally hosted private repositories • Public archives • From single-cell to human • Samples, Specimens, Standard Op Procedures • Small Data: Reactome…: files, spreadsheets • Big Data: NGS, Mass Spec…: Specialist repositories, files • Models: ODE, SBML, Native Matlab, PDE, Multi-scale • In progress: versioning, track provenance and parameters • Published: citation, links to publications
  • 7. Cataloguing 7 Find my peers Creating and sharing SOPs across projects Track my specimens yellow pages, manage SOPs and link them to investigations, studies, assays, specimens and samples Browse experimental data without downloading them How data, models and SOPs fit together Which data belong to which publication Data viewing functionality ISA: Link Studies to their data, models, SOPs, samples, publications Track different versions of my model
  • 8. The Web-based SEEK Platform Ruby on RAILS 3.2, BSD, https://bitbucket.org/seek4science/seek https://seek.sysmo-db.org/models/114 http://www.seek4science.org
  • 10. simulate models project mgt, access control reporting, citation governance & policies yellow pages of peers projects, experts catalogue, link and index data, models, samples, specimens, sops, experiments, publications using standards curate & annotate data and models using standards with compliance tools incorporate public data and model repositories & tools deposition manage, store and exchange different types and scales of data Reproducibility Score Card integrate local and project tools and data systems scaled-out collection & analytics using third party platforms differentiate construction, validation & predicted data
  • 11. Yellow Pages InstitutionsProjectsPeople ISA Investigation Study Assay Asset Catalogue Models Datafiles SOPs Publications TagsVersions Access Privileges PresentationsEvents Datafiles Models SOPs JERMExtract,Harvest,Index APIsandLinks BioModels CheBI BioPortal PubMed JWS Online GEO SABIO-RK Web Interface REST API Local SEEK Wikis CMS Own DB Direct Upload Project DM External SEEK OpenBIS
  • 12. • Gateway plugin framework – Tight and loose coupling – RAILS plugin or bundled GEM • Metadata framework – JERM and ISA • Different instances – Single query across all model repositories – One click deposition BioModels Plug-in, Play nice, Don’t reinvent
  • 13. Data…. • Public and new data • Factors studied – Linked -> SABIO-RK and ChEBI • Samples and Specimens – Extends EBI/NCBI BioSamples • Treatment Extraction • Tagging with vocabularies • Spreadsheet-based data-view • Big Data – Upload and by email, Share by trusted link, Link to external repository • Access – DOIs and Temp links for reviews
  • 14. Cytoscape Repositories • Biomodels, JWS Online, local SEEK JWS Online Simulator • SBML support • Auto generation of SBGN schemas for user models • SED-ML export DataFuse • Link and compare construction and validation data with models • Run models with parameter values from spreadsheets Models….
  • 16. Standards, Structure, Interlink Construction Validation Metabolomics Metabolomics Mass SpecTranscriptomics Proteomics Fluxomics Investigations Studies Assays Towards Interoperable Bioscience Data, Nature Genetics, 2012 Assays
  • 17.
  • 18.
  • 19.
  • 20. Just Enough Results Model Describes and enriches the relationships between things produced and used in experiments. http://bioportal.bioontology.org/ontologies/JERM reuse community ontologies, markups, mim, identifiers
  • 21. metadata sheets sample sheets data sheets indexes http://rightfield.org.uk/ Just Enough Results Model Describes and enriches the relationships between things produced and used in experiments. http://bioportal.bioontology.org/ontologies/JERM reuse community ontologies, markups, mim, identifiers
  • 22. Different types of data Plugins to registered data repositories Extract and auto- catalogue metadata Define relationships, cross-link, aggregate, query standard based templates non-standard templates Open Modelling Exchange Format archive
  • 23. Sys Bio Research Objects portable packaged research Adobe UCF Research Object Bundle ORE PROVODF • Aggregation • Annotations/provenance • Ad-hoc domain-specific specification OMEX archive Systems Biology: A common archive format for reuse across tools http://www.researchobject.org
  • 24. Reproducible (Open?) Research Data sharing, openness and careers incentive See Titus and Phil talks
  • 25. Open Research: Research Groups & Lifecycles • Sharing policy • Visibility, Downloadability • Fine grained permissions • Protocols for – Management transfer – Visibility feedback and sharing workflows – Publication data deposition in external public stores – Batch publishing Within Project Versions Retractions Across Projects Versions Public Final version No Retraction Manager Owner Gatekeeper
  • 26. Open Source Customisable Platform https://bitbucket.org/seek4science/seek Vrije Universiteit, Amsterdam Systems Science for Health (SSfH) MACS Yeast Glycolysis
  • 28. Open Facility for European Systems Biology data & model management seeded by EU programmes • Platform – SEEK + openBIS + new features & styling • Resource – EuroSEEK + pool of community resources (including established SEEKs). – Independent researchers. Secure data. • Facility – Curation & support services, training http://fair-dom.org/
  • 29. Open Facility for European Systems Biology data & model management seeded by EU programmes • Community – workshops, user and developer forums, knowledge network, standards & policy, training, FAIRDOM Foundation, Model Carpentry. • Sys Bio Developers Foundry workshop 6-7 October Heidelberg http://fair-dom.org/wiki/Foundry_workshop • RI – working with other EU RIs, an EU network of national facilities, funding models. http://fair-dom.org/
  • 30. Carole Goble Stuart Owen Jacky Snoep Wolfgang Mueller Olga Krebs Quyen Nguyen Natalie Stanfor d Katy WolstencrofPeter Kunszt Bernd Rinn also contributing VLN SEEK team also contributing UK SEEK team