Workflows, Provenance & Reporting
A Lifecycle Perspective
Professor Carole Goble FREng FBCS
The University of Manchester, UK
carole.goble@manchester.ac.uk
3rd – 6th September 2013, Rome, Italy
The Scientific and Technical Ecosystem
Mobilising Big and Broad Data
• Streaming
• Sweeps through models
• Integrative analysis
• Results synthesis
• Heavy compute
Interoperability, plugging together
• Multi step chains, Multi software / data
• Mixed resources / platforms
• Incompatibility smoothing
• Trans-disciplinary, Alien processes
[DataONE]
BioSTIF
inputs: data, parameters, configurations
outputs
Workflow nutshell
• A series of automated / interactive data analysis steps
• Process data at scale
• Import data / codes from one’s own research and/or from existing libraries
• Pipelines & analytic and synthesis procedures
• Chains of components
• Bridges between resources
• Shield from change and operational complexity
• Releasing capacity
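The "series of analysis steps" idea above can be sketched in a few lines of Python. This is a minimal illustration only, not how any particular workflow system is implemented; the step functions (`drop_missing`, `scale`, `summarise`) and the `factor` parameter are hypothetical names invented for the example.

```python
from typing import Any, Callable, Dict, List

# A step takes the current data plus the shared parameters and returns new data.
Step = Callable[[Any, Dict[str, Any]], Any]


def run_workflow(steps: List[Step], data: Any, params: Dict[str, Any]) -> Any:
    """Pass data through each step in order; every step sees the same parameters."""
    for step in steps:
        data = step(data, params)
    return data


# Hypothetical steps, for illustration only.
def drop_missing(records, params):
    return [r for r in records if r is not None]


def scale(records, params):
    return [r * params["factor"] for r in records]


def summarise(records, params):
    return {"count": len(records), "total": sum(records)}


# inputs: data and parameters; output: the result of the last step
result = run_workflow([drop_missing, scale, summarise], [1, None, 3], {"factor": 10})
print(result)
```

Because each step only depends on the data handed to it, steps can be swapped or re-ordered without touching the others, which is the "chains of components" point on the slide.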
[Diagram labels: Services, Resources, Provisioning, Workflows, Appln Service, Users; relations: Composition, Incorporation, Invocation]
Applications
• Applications as components of workflows
• Compose applications into workflows
• Incorporate workflows into applications
Infrastructure
• Provision physical resources to support application workflows
• Coordinate resources through workflows
• Optimise and adapt to change
[Foster 2005]
Workflows
Wfms
Assembly of Components
Interoperability
Covering up incompatibility
Flexible variation
Stabilising
Optimising
Workflows: maturing approach
Underpin integrative platforms.
Established in many disciplines, notably chemistry and biology, esp. ‘omics: assembly, synthesis, annotation, analytics.
Overlaps with metagenomics, phylogenetics and genetic ecology
Powering service based science and science as a service
http://www.globus.org/genomics/solution
Sandve, Nekrutenko, Taylor, Hovig, Ten simple rules for reproducible in silico research, PLoS Comp Bio, submitted
Ecological Niche modelling, population modelling, Metagenomics and Phylogenetics
‘omics pipelines and analytic workflows
http://www.biovel.eu
Community Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis http://camera.calit2.net/index.shtm
Combine species occurrence data with global climate, terrain and land cover information, to identify environmental correlates of species ranges. http://www.lifemapper.org/species
BioDiversity
Taxonomic Data Refinement
www.biovel.eu
• Synonym expansion
• Taxonomic name resolution
• Occurrence retrieval
• Spell checking
• Geographic and taxonomic cleaning
• Temporal refinement
• Data processing log
[Matthias Obst, INTECOL 2013]
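A few of the refinement steps listed above (synonym expansion, name resolution, and the data processing log) can be sketched as follows. This is a toy illustration, not the BioVeL workflow: the lookup tables are hypothetical stand-ins for the taxonomic services a real workflow would call.

```python
def refine_names(names, synonyms, known_names):
    """Resolve synonyms, drop unrecognised names, and log every decision."""
    log = []
    accepted = []
    for raw in names:
        name = " ".join(raw.split())  # normalise stray whitespace
        if name in synonyms:
            log.append(f"synonym: {name} -> {synonyms[name]}")
            name = synonyms[name]
        if name not in known_names:
            log.append(f"rejected: {name}")
            continue
        if name not in accepted:
            accepted.append(name)
    return accepted, log


# Hypothetical lookup tables; a real workflow would query name services instead.
synonyms = {"Bupalus piniaria": "Bupalus piniarius"}
known = {"Bupalus piniarius", "Tortrix viridana"}

accepted, log = refine_names(
    ["Bupalus  piniaria", "Tortrix viridana", "Tortrix viridanna"],
    synonyms,
    known,
)
print(accepted)
print(log)
```

Keeping the log alongside the cleaned list means every dropped or renamed record can be explained later, which is what makes the refinement reportable.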
Data Operations in Workflows in the Wild
Analysis of 260 publicly available workflows in Taverna, WINGS, Galaxy and Vistrails
Garijo et al Common Motifs in Scientific Workflows: An Empirical Analysis, in press, FGCS
Large Scale Ecological Niche Modeling Workflow
Step 1: Explorative modeling
- Use unfiltered data
- Use fixed parameters: Mahalanobis distance (Farber and Kadmon 2003)
- Native projections
- Test the model, distribution of points, number of points
Step 2: Deep modeling
- Filtering environmentally unique points with BioClim algorithm (Nix 1986)
- ENM with Support Vector Machine (Cristianini & Shawe-Taylor 2000) and Maximum Entropy (Phillips 2004)
- Parameter optimization (if necessary) on the model test results
- 2 masks (model generate, model project)
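For readers unfamiliar with the distance measure named in Step 1, the sketch below computes a Mahalanobis distance for two variables using only the standard library. It assumes, purely for illustration, that each sample is a pair of environmental values recorded at an occurrence point; it is not the implementation used in the workflow shown on the slide.

```python
from math import sqrt


def mahalanobis_2d(point, samples):
    """Mahalanobis distance of `point` from the mean of 2-D `samples`."""
    n = len(samples)
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    # Sample covariance matrix [[sxx, sxy], [sxy, syy]] with an n - 1 denominator.
    sxx = sum((x - mean_x) ** 2 for x, _ in samples) / (n - 1)
    syy = sum((y - mean_y) ** 2 for _, y in samples) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in samples) / (n - 1)
    det = sxx * syy - sxy ** 2
    if det == 0:
        raise ValueError("covariance matrix is singular")
    dx, dy = point[0] - mean_x, point[1] - mean_y
    # d^2 = [dx dy] * inverse(cov) * [dx dy]^T, written out for the 2x2 case.
    d2 = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
    return sqrt(d2)


# Hypothetical environmental values at four occurrence points.
samples = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
at_mean = mahalanobis_2d((1.0, 1.0), samples)
off_mean = mahalanobis_2d((3.0, 1.0), samples)
print(at_mean, off_mean)
```

A small distance means the candidate location's conditions resemble those where the species has been recorded; the covariance term accounts for variables that vary together.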
Data discovery
Data assembly, cleaning, and refinement
Ecological Niche Modeling
Statistical analysis
Analytical cycle
Pilumnus hirtellus
Enclosed sea problem (Ready et al., 2010)
[Matthias Obst, INTECOL 2013]
Workflow-enabled science
• Common Templates
• Prepared components
• Systematic assembly
• (Steered) automation
• Hybrid combinations
• Variations
• Extensibility
• Customisation
• Parameterisation
• Repeats
• Cross-run synthesis
• Routine, pooled methods
• Tracking
Repeated model sweeps
Ten insect species were modelled:
European spruce bark beetle - Ips typographus L.
Bordered white moth (syn. pine looper) - Bupalus piniarius L. (syn. B. piniaria L.)
Pine-tree lappet - Dendrolimus pini L.
Mottled umber - Erannis defoliaria Clerck
Nun moth - Lymantria monacha L.
Winter moth - Operophtera brumata L.
Pine beauty moth - Panolis flammea Den. & Schiff
Green oak tortrix - Tortrix viridana L.
European pine sawfly - Neodiprion sertifer Geoffr.
Common pine sawfly - Diprion pini L.
[Images: Tortrix viridana; Lymantria monacha. Image by Kimmo & Seppo Silvonen]
[Diagram labels: data, configuration, parameters, steps]
[Päivi Lyytikäinen-Saarenmaa presentation, INTECOL 2013]
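A repeated model sweep of this kind amounts to running the same workflow over every combination of species and settings. The sketch below shows that pattern under stated assumptions: `fake_model` is a stand-in for the real modelling call, and the algorithm labels are placeholders chosen for the example.

```python
from itertools import product


def sweep(species, algorithms, run_model):
    """Run every species/algorithm combination and collect results by key."""
    return {(s, a): run_model(s, a) for s, a in product(species, algorithms)}


# Stand-in model: a real sweep would invoke the niche-modelling workflow here.
def fake_model(species_name, algorithm):
    return f"{algorithm} model for {species_name}"


species = ["Ips typographus", "Lymantria monacha", "Tortrix viridana"]
algorithms = ["SVM", "MaxEnt"]

results = sweep(species, algorithms, fake_model)
for key, value in results.items():
    print(key, "->", value)
```

Keying each result by its inputs keeps the sweep traceable: any output can be matched back to the species and settings that produced it, which supports cross-run synthesis.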
http://www.jisc.ac.uk/whatwedo/campaigns/res3/jischelp.aspx
Workflows
[Diagram labels: workflows; results; provenance: process (log), results (origin)]
Reporting
Record of science
Reproducibility
Transparent process
Integrate with reporting systems
Know how
Training
See Penev presentation
Provenance
the link between computation and results
W3C PROV model standard
record for reporting
compare diffs/discrepancies
provenance analytics
track changes, adapt
partial repeat/reproduce
carry attributions
compute credits
compute data quality/trust
select data to keep/release
optimisation and debugging
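To make "the link between computation and results" concrete, the sketch below records provenance statements using relation names from the W3C PROV data model (`used`, `wasGeneratedBy`, `wasDerivedFrom`, `wasAssociatedWith`). It is a plain-dictionary illustration rather than a PROV serialisation or a library API; the file names, step names and the agent "alice" are hypothetical.

```python
def new_record():
    """Create an empty container keyed by PROV term names."""
    return {
        "entity": set(), "activity": set(), "agent": set(),
        "used": [], "wasGeneratedBy": [],
        "wasDerivedFrom": [], "wasAssociatedWith": [],
    }


def record_step(prov, step, inputs, output, agent):
    """Append PROV-style statements for one workflow step to `prov`."""
    prov["activity"].add(step)
    prov["agent"].add(agent)
    prov["wasAssociatedWith"].append((step, agent))
    for item in inputs:
        prov["entity"].add(item)
        prov["used"].append((step, item))
        prov["wasDerivedFrom"].append((output, item))
    prov["entity"].add(output)
    prov["wasGeneratedBy"].append((output, step))


def lineage(prov, entity):
    """Return every entity that `entity` was derived from, directly or not."""
    found = set()
    stack = [entity]
    while stack:
        current = stack.pop()
        for derived, source in prov["wasDerivedFrom"]:
            if derived == current and source not in found:
                found.add(source)
                stack.append(source)
    return found


prov = new_record()
record_step(prov, "clean", ["occurrences.csv"], "cleaned.csv", "alice")
record_step(prov, "model", ["cleaned.csv", "climate.tif"], "niche_map.png", "alice")
sources = lineage(prov, "niche_map.png")
print(sorted(sources))
```

Walking the derivation links backwards is the basis for several uses listed above, such as carrying attributions to the original data sources or deciding which intermediate data to keep.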
[Figure: two provenance traces. (i) Trace A: data d1, d2, w, y, df; steps S0, S1, S2, S4. (ii) Trace B: data d1', d2, z, w, y', df'; steps S0, S1, S'2, S4.]
PDIFF: comparing provenance traces to diagnose divergence across experimental results [Woodman et al, 2011]
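The idea of comparing two traces to find where runs diverge can be illustrated with a much simpler step-by-step comparison. This is not the PDIFF algorithm of Woodman et al.; it is a minimal sketch that assumes each trace is a dictionary mapping a step name to its recorded inputs and output, with wiring invented for the example.

```python
def diff_traces(trace_a, trace_b):
    """List the steps whose recorded inputs or outputs differ between two runs."""
    differences = []
    for step in sorted(set(trace_a) | set(trace_b)):
        a, b = trace_a.get(step), trace_b.get(step)
        if a is None or b is None:
            differences.append((step, "missing in one trace"))
        elif a["inputs"] != b["inputs"]:
            differences.append((step, "inputs differ"))
        elif a["output"] != b["output"]:
            differences.append((step, "output differs"))
    return differences


# Hypothetical traces: the labels echo the figure but the wiring is assumed.
trace_a = {
    "S0": {"inputs": ["d1"], "output": "w"},
    "S1": {"inputs": ["d2"], "output": "x"},
    "S2": {"inputs": ["w", "x"], "output": "y"},
}
trace_b = {
    "S0": {"inputs": ["d1'"], "output": "w"},
    "S1": {"inputs": ["d2"], "output": "x"},
    "S2": {"inputs": ["w", "x"], "output": "y'"},
}

divergence = diff_traces(trace_a, trace_b)
print(divergence)
```

Here S0 is flagged because it consumed different data, and S2 because it produced a different result from the same inputs, which would point to a change in the step itself rather than in the data.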
[Freire] http://www.aosabook.org/en/vistrails.html
Collecting -> Using Provenance
Instrumenting, cross-tool interoperability
Reporting at different scales
Publishing with Provenance
Summary: Infrastructure Productivity
Environment: Legacy, others and your own software, datasets, services, codes, and platforms. Optimise and manage use of computing infrastructure, HPC, clouds and platforms.
WFMS middleware: Support the design, config. and execution of workflows. Manage utility actions for data, logging, security, compute, errors… Shield incompatibilities / complexity / change.
Workflow: Parameterised, integrative, multi-step (data) pipelines, analytics, computational protocols. That can be repetitively reused. Dependency-rich interoperability.
Apps: Domain/task specific apps that incorporate (an ecosystem of) workflows.
Integrate
[Diagram labels: Customise, Process]
Summary: User Productivity: Capability Raising
Access: Framework to access and leverage heterogeneous legacy applications, services, datasets and codes and combine with yours. Shielding from complexity.
Process: Integration, reusable workflows/components. Automated plumbing + Interaction. Systematic, repetitive and unbiased analysis and processing and error handling. Ensembles, comparisons, “what ifs”.
Record: Process reporting. Citation tracking. Reproducibility, Provenance, Audit. Quality Control. Standard Operating Procedures.
Customise: Rapid development: Flexibility, Extensibility, Adaptability, Reuse. Reusable Workflow Components.
Workflow Commodities
building cohorts, capturing traits,
explicit reporting, clear instructions
• Workflow templates
• Workflow sets
• Libraries of sub workflow parts
• Design practices for mix, match and reuse
• Future proofed design predicting need to adapt
• Discovery and exchange
• Workflow engineers
• Workflow custodians
Seeding a workflow library
Workflow Commodities
exchanging, curating, preserving,
packaging, life cycle management
http://www.researchobject.org
http://www.dcc.ac.uk
Katy’s student’s 200 hours
Tracking where data went
Workflow Commodities
getting credit, capability,
engineers and custodians
Application Building
user variety, outcome focused
• Right apps, right users.
• Commodity apps:
– Web. Spreadsheets. R.
• Customisation
• Mixed workflow / scripting
• Deployment / Portability
– Web based / desktop
– Virtualised deployments
– Cloud hosted service
– A cloud-enabled local host
• Local ownership
• Capability building
[Figure labels: Workflow visibility (low to high); Versatility (low to high); Knowledge: BioDiversity concept, Technology/Infrastructure; Users: Policy makers, Domain scientist, Computational scientist, Technical specialists; Tools: Custom specific apps, General toolkits]
Who are the users?
• Policy makers?
• Biodiversity researcher?
• Computational scientist?
• Tool developer?
• Service provider?
• Infrastructure provider?
• Digital custodian?
Workflow management systems
• Integrated into community frameworks, coupled into tools
• Virtualised (Web) Services
• Scaling, Optimisation
• Interoperability, Using provenance
• No one workflow language/system
• Specialisation & its cost
• Plug-ins for common community platforms and resources
• Mitigating and adapting to changes in infrastructures and resources.
• Sustainability and engineering
Generic
Specific
http://www.erflow.eu/
Population dynamics
The life cycle of infrastructures
• Dynamics: Mitigate, Adapt, Disperse, Die
• Standard and maintained programmatic interfaces (APIs)
• Standard formats and ids
• Stability, reliability, repair
• Interoperability
• Semantic descriptions
• Sustainability of services and infrastructure
• Instrument resources for citation & microattribution
• Coupled services and infrastructure.
Impact of dependencies
[Zhao et al. Why workflows break e-Science 2012]
Summary
Scale.
Standard data formats, programmatic interfaces.
Governance.
Workflow commodities
Design practices
Credit
A seamless, pluggable service.
Scale. Adaptability. Specific-Generic tension.
Putting provenance to use for data credit.
Embedding workflows in common applications
Integration into reporting and publishing lifecycles
BioDiversity Virtual e-Laboratory
www.biovel.eu
Wf4Ever
www.wf4ever-project.org
SysMO
www.sysmo-db.org
SCAPE: SCAlable Preservation Environments
http://www.scape-project.eu

Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome

UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceLizLyon
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science Central
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science CentralCloud e-Genome: NGS Workflows on the Cloud Using e-Science Central
Cloud e-Genome: NGS Workflows on the Cloud Using e-Science CentralPaolo Missier
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Blue BRIDGE
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIRDOM
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)Carole Goble
 
Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Carole Goble
 
The BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative researchThe BlueBRIDGE approach to collaborative research
The BlueBRIDGE approach to collaborative researchBlue BRIDGE
 
Approach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) project
Approach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) projectApproach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) project
Approach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) projectAlex Hardisty
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...Robert Grossman
 
An Overview of VIEW
An Overview of VIEWAn Overview of VIEW
An Overview of VIEWShiyong Lu
 
Curation-Friendly Tools for the Scientific Researcher
Curation-Friendly Tools for the Scientific ResearcherCuration-Friendly Tools for the Scientific Researcher
Curation-Friendly Tools for the Scientific Researcherbwestra
 
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...OSTHUS
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and SharingJisc
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objectsseanb
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsGaignard Alban
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Dmitry Grapov
 
Conservation of Scientific Workflow Infrastructures by Using Semantics - 2012
Conservation of Scientific Workflow Infrastructures by Using Semantics - 2012Conservation of Scientific Workflow Infrastructures by Using Semantics - 2012
Conservation of Scientific Workflow Infrastructures by Using Semantics - 2012Idafen Santana Pérez
 
Integrating scientific laboratories into the cloud
Integrating scientific laboratories into the cloudIntegrating scientific laboratories into the cloud
Integrating scientific laboratories into the cloudData Finder
 
Eclipse Meets Systems Biology
Eclipse Meets Systems BiologyEclipse Meets Systems Biology
Eclipse Meets Systems BiologyRichard Adams
 

Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome

  • 1. Workflows, Provenance & Reporting A Lifecycle Perspective Professor Carole Goble FREng FBCS The University of Manchester, UK carole.goble@manchester.ac.uk 3rd – 6th September 2013, Rome, Italy
  • 2. The Scientific and Technical Ecosystem Mobilising Big and Broad Data • Streaming • Sweeps through models • Integrative analysis • Results synthesis • Heavy compute Interoperability, plugging together • Multi step chains, Multi software / data • Mixed resources / platforms • Incompatibility smoothing • Trans-disciplinary, Alien processes [DataONE]
  • 3. BioSTIF inputs: data, parameters, configurations outputs Workflow nutshell • A series of automated / interactive data analysis steps • Process data at scale • Import data / codes from one’s own research and/or from existing libraries • Pipelines & analytic and synthesis procedures • Chains of components • Bridges between resources • Shield from change and operational complexity • Releasing capacity Services Resources
  • 4. Provisioning Workflows Appln Service Appln Service Users Workflows Composition Incorporation Invocation Applications • Applications components of workflows • Compose applications into workflows • Incorporate workflows into applications Infrastructure • Provision physical resources to support application workflows • Coordinate resources through workflows • Optimise and adapt to change [Foster 2005] Workflows Wfms
  • 7. Workflows: maturing approach Underpin integrative platforms. Established in many disciplines, notably chemistry and biology, esp. ‘omics: assembly, synthesis, annotation, analytics. Overlaps with metagenomics, phylogenetics and genetic ecology Powering service based science and science as a service http://www.globus.org/genomics/solution Sandve, Nekrutenko, Taylor, Hovig Ten simple rules for reproducible in silico research, PLoS Comp Bio submitted
  • 8. Ecological Niche modelling, population modelling, Metagenomics and Phylogenetics ‘omics pipelines and analytic workflows http://www.biovel.eu Community Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis http://camera.calit2.net/index.shtm Combine species occurrence data with global climate, terrain and land cover information, to identify environmental correlates of species ranges. http://www.lifemapper.org/species BioDiversity
  • 9. Taxonomic Data Refinement www.biovel.eu • Synonym expansion • Taxonomic name resolution • Occurrence retrieval • Spell checking • Geographic and taxonomic cleaning • Temporal refinement • Data processing log [Matthias Obst, INTECOL 2013]
  • 10. Data Operations in Workflows in the Wild Analysis of 260 publicly available workflows in Taverna, WINGS, Galaxy and Vistrails Garijo et al Common Motifs in Scientific Workflows: An Empirical Analysis, in press, FGCS
  • 11. Large Scale Ecological Niche Modeling Workflow. Step 1: Explorative modeling - Use unfiltered data - Use fixed parameters: Mahalanobis distance (Farber and Kadmon 2003) - Native projections - Test the model, distribution of points, number of points Step 2: Deep modeling - Filtering environmentally unique points with BioClim algorithm (Nix 1986) - ENM with Support Vector Machine (Cristianini & Shawe-Taylor 2000) and Maximum Entropy (Phillips 2004) - Parameter optimization (if necessary) on the model test results - 2 masks (model generate, model project) Data discovery Data assembly, cleaning, and refinement Ecological Niche Modeling Statistical analysis Analytical cycle Pilumnus hirtellus Enclosed sea problem (Ready et al., 2010) [Matthias Obst, INTECOL 2013]
  • 12. Workflow-enabled science • Common Templates • Prepared components • Systematic assembly • (Steered) automation • Hybrid combinations • Variations • Extensibility • Customisation • Parameterisation • Repeats • Cross-run synthesis • Routine, pooled methods • Tracking
  • 13. Repeated model sweeps Ten insect species were modelled: European spruce bark beetle – Ips typographus L. Bordered white moth (syn. pine looper) – Bupalus piniarius L. (syn. B. piniaria L.) Pine-tree lappet – Dendrolimus pini L. Mottled umber – Erannis defoliaria Clerck Nun moth – Lymantria monacha L. Winter moth – Operophtera brumata L. Pine beauty moth – Panolis flammea Den. & Schiff Green oak tortrix – Tortrix viridana L. European pine sawfly – Neodiprion sertifer Geoffr. Common pine sawfly – Diprion pini L. Tortrix viridana Image by Kimmo & Seppo Silvonen Lymantria monacha data configuration parameters steps Päivi Lyytikäinen-Saarenmaa presentation, INTECOL 2013
  • 14. http://www.jisc.ac.uk/whatwedo/campaigns/res3/jischelp.aspx Workflows workflows results provenance process (log) results (origin) Reporting Record of science Reproducibility Transparent process Integrate with reporting systems Know how Training See Penev presentation
  • 15. Provenance the link between computation and results W3C PROV model standard record for reporting compare diffs/discrepancies provenance analytics track changes, adapt partial repeat/reproduce carry attributions compute credits compute data quality/trust select data to keep/release optimisation and debugging [Figure: (i) Trace A: d1 S0 d2 S1 w S2 y S4 df; (ii) Trace B: d1' S0 d2 S1 z w S'2 y' S4 df'] PDIFF: comparing provenance traces to diagnose divergence across experimental results [Woodman et al, 2011]
  • 16. [Freire] http://www.aosabook.org/en/vistrails.html Collecting -> Using Provenance Instrumenting, cross-tool interoperability Reporting at different scales
  • 18. Summary: Infrastructure Productivity Customise Process Environment Legacy, others and your own software, datasets, services, codes, and platforms. optimise and manage use of computing infrastructure, HPC, clouds and platforms WFMS middleware Support the design, config. and execution of workflows. manage utility actions for data, logging, security, compute, errors… shield incompatibilities / complexity / change Parameterised, integrative, multi-step (data) pipelines, analytics, computational protocols. That can be repetitively reused. dependency-rich interoperability. Workflow Apps Domain/task specific apps that incorporate (an ecosystem of) workflows Integrate
  • 19. Summary: User Productivity: Capability Raising Access: Framework to access and leverage heterogeneous legacy applications, services, datasets and codes and combine with yours. Shielding from complexity. Customise: Rapid development: Flexibility, Extensibility, Adaptability, Reuse. Reusable Workflow Components. Process: Integration, Reusable workflows/components. Automated plumbing + Interaction. Systematic, repetitive and unbiased analysis and processing and error handling. Ensembles, comparisons, “what ifs”. Record: Process reporting. Citation tracking. Reproducibility, Provenance, Audit. Quality Control. Standard Operating Procedures.
  • 20. Workflow Commodities building cohorts, capturing traits, explicit reporting, clear instructions • Workflow templates • Workflow sets • Libraries of sub workflow parts • Design practices for mix, match and reuse • Future proofed design predicting need to adapt • Discovery and exchange • Workflow engineers • Workflow custodians
  • 22. Workflow Commodities exchanging, curating, preserving, packaging, life cycle management http://www.researchobject.org http://www.dcc.ac.uk
  • 23. Katy’s student’s 200 hours Tracking where data went Workflow Commodities getting credit, capability, engineers and custodians
  • 24. Application Building user variety, outcome focused • Right apps, right users. • Commodity apps: – Web. Spreadsheets. R. • Customisation • Mixed workflow / scripting • Deployment / Portability – Web based / desktop – Virtualised deployments – Cloud hosted service – A cloud-enabled local host • Local ownership • Capability building Workflow Visibility BioDiversity Low Concept Knowledge High Technology/Infrastructure Domain Scientist Technical specialists Computational Scientist Custom Specific Apps General Toolkits Policy makers Low High Versatility
  • 25. Who are the users? • Policy makers? • Biodiversity researcher? • Computational scientist? • Tool developer? • Service provider? • Infrastructure provider? • Digital custodian?
  • 26. Workflow management systems • Integrated into community frameworks, coupled into tools • Virtualised (Web) Services • Scaling, Optimisation • Interoperability, Using provenance • No one workflow language/system • Specialisation & its cost • Plug-ins for common community platforms and resources • Mitigating and adapting to changes in infrastructures and resources. • Sustainability and engineering Generic Specific http://www.erflow.eu/
  • 27. Population dynamics The life cycle of infrastructures • Dynamics: Mitigate, Adapt, Disperse, Die • Standard and maintained prog. interfaces (APIs) • Standard formats and ids • Stability, reliability, repair • Interoperability • Semantic descriptions • Sustainability of services and infrastructure • Instrument resources for citation & microattribution • Coupled services and infrastructure.
  • 28. Impact of dependencies [Zhao et al. Why workflows break e-Science 2012]
  • 29. Summary Scale. Standards data formats, programmatic interfaces. Governance. Workflow commodities Design practices Credit A seamless, pluggable service. Scale. Adaptability. Specific-Generic tension. Putting provenance to use for data credit. Embedding workflows in common applications Integration into reporting and publishing lifecycles
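
Slides 3 and 15 describe a workflow as a chain of automated steps and provenance as the link between computation and results, which can be compared across runs to diagnose divergence. The following is a minimal Python sketch of that idea, not the PDIFF algorithm of Woodman et al. and not the W3C PROV data model. All function names, the step names and the sample data are hypothetical and chosen only for illustration.

```python
import hashlib
import json


def fingerprint(value):
    """Short, stable hash of a JSON-serialisable value."""
    blob = json.dumps(value, sort_keys=True).encode("utf-8")
    return hashlib.sha256(blob).hexdigest()[:12]


def run_workflow(steps, data):
    """Run the steps in order and record one provenance entry per step."""
    trace = []
    for name, func in steps:
        result = func(data)
        trace.append({
            "step": name,
            "input": fingerprint(data),
            "output": fingerprint(result),
        })
        data = result
    return data, trace


def first_divergence(trace_a, trace_b):
    """Name of the first step whose record differs between two traces, else None."""
    for rec_a, rec_b in zip(trace_a, trace_b):
        if rec_a != rec_b:
            return rec_a["step"]
    return None


# Hypothetical steps, loosely modelled on a name-cleaning pipeline.
def normalise(names):
    return [n.strip().lower() for n in names]


def drop_empty(names):
    return [n for n in names if n]


def drop_short(names):
    # Assumed variant of the filter step used in the second run.
    return [n for n in names if len(n) >= 13]


def deduplicate(names):
    return sorted(set(names))


raw = ["Ips typographus ", "ips typographus", "", "Diprion pini"]
workflow_a = [("normalise", normalise), ("filter", drop_empty), ("deduplicate", deduplicate)]
workflow_b = [("normalise", normalise), ("filter", drop_short), ("deduplicate", deduplicate)]

out_a, trace_a = run_workflow(workflow_a, raw)
out_b, trace_b = run_workflow(workflow_b, raw)
print(out_a)
print(out_b)
print(first_divergence(trace_a, trace_b))
```

Because each record stores fingerprints of a step's input and output, the comparison can point to the step where two runs start to differ (here the filter step) without storing the intermediate data itself. A real system would also record parameters, software versions and attribution.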

Editor's Notes

  1. The Technical Environment: Challenging Areas and Promising Technologies Workflows, provenance and reporting: a lifecycle perspective Workflow systems support the design, configuration and execution of repetitive, multi-step pipelines and analytics, well established in many disciplines, notably biology and chemistry, but less so in biodiversity and ecology. From an experimental perspective workflows are a means to handle the work of accessing an ecosystem of software and platforms, manage data and security, and handle errors. From a reporting perspective they are a means to accurately document methodology for reproducibility, comparison, exchange and reuse, and to trace the provenance of results for review, credit, workflow interoperability and impact analysis. Workflows operate in an evolving ecosystem and are assemblages of components in that ecosystem; their provenance trails are snapshots of intermediate and final results. Taking a lifecycle perspective, what are the challenges in workflow design and use with different stakeholders? What needs to be tackled in evolution, resilience, and preservation? And what are the “mitigate or adapt” strategies adopted by workflow systems in the face of changes in the ecosystem/environment, for example when tools are deprecated or datasets become inaccessible in the face of funding shortfalls? Bio: Carole Goble is a full professor in Computer Science at the University of Manchester, UK, and a partner of the Software Sustainability Institute UK. She has an international reputation in Semantic technologies, Distributed computing and Social Computing for scientific collaboration through eLabs.
She directs the myGrid project, which produces the widely-used open source Taverna workflow management system; myExperiment, a social web site for sharing scientific workflows; the BioDiversityCatalogue of web services; and the SEEK for storing, sharing and preserving Systems Biology outcomes, which is part of the ERANet e-infrastructure for EU-based Systems Biology. Her technical infrastructure underpins the EU BioVeL Project e-Laboratory. In 2008 Carole was awarded the Microsoft Jim Gray award for outstanding contributions to e-Science. In 2010 she was elected a Fellow of the Royal Academy of Engineering. In 2012 she was nominated for the Benjamin Franklin award for open science in Biology. She serves on the UK BBSRC funding agency governance Council and is the Deputy Director of the UK's Node of the ESFRI ELIXIR programme.
  2. Katy Willis talk on Wednesday shows the value of automation of data integration standardised pipelines auto record of experiment and set-up report & variant reuse Systematically capture, coordinate, run and record the steps buffered infrastructure platform libraries, plugins Infrastructure components, services infrastructure
  3. Aimed at different layers of the software stack. “The Many Faces of IT as Service”, Foster, Tuecke, 2005. “Provisioning” – reservation to configuration to … … make sure resource will do what I want it to do, with the right qualities of service. Virtualization = separation of concerns between provider & consumer of “content”: client and service; service provider and resource provider. Provisioning = assemble & configure resources to meet user needs. Management = sustain desired qualities of service despite dynamic environment.
  4. Just in time interoperability by papering over the cracks.
  5. Scale of data – from Matthias talk. Geographic: we can build models in China and project them into Europe. Taxonomic: we can build models for plants (phytoplankton), animals (birds), and in one year hopefully even microbial communities. Environmental: sea, land, still very difficult for lakes and rivers.
  6. Analysis factories Typical variations in workflows Local and Global workflow population variations Micro and Macro level
  7. Came up in policy session. Reporting perspective: accurately document methodology for reproducibility, comparison, exchange and reuse; trace the provenance of results for review, credit, workflow interoperability and impact analysis.
  8. Simplify Track Versions and retractions Error propagation Contributions and credits Fix Workflow repair, alternate component discovery, Black box annotation Rerun and Replay Partial reproducibility: Replay some of the workflow A verifiable, reviewable trace in people terms Analyse Calculate data quality & trust, Decide what data to keep or release Compare to find differences and discrepancies S. Woodman, H. Hiden, P. Watson, P. Missier. Achieving Reproducibility by Combining Provenance with Service and Workflow Versioning. In: The 6th Workshop on Workflows in Support of Large-Scale Science. 2011, Seattle
  9. http://www.aosabook.org/en/vistrails.html http://biodiversity.ku.edu/blog/lab-notes/lifemapper-vistrails-better-science
  10. Galaxy pages (30K users, 1K new users/month)
  11. Environment: services, codes, datasets, platforms
  12. Workflow templates Workflow sets Libraries of sub workflow parts Design practices for mix, match and reuse Future proofed design: mitigate or adapt Discovery and exchange Life cycle management Curation Packaging. Credit and publishing. Workflow engineers Workflow custodians
  13. Workflow templates Workflow sets Libraries of sub workflow parts Design practices for mix, match and reuse Future proofed design: mitigate or adapt Discovery and exchange Life cycle management Curation Packaging. Credit and publishing. Workflow engineers Workflow custodians
  14. Future proofed design: mitigate or adapt Discovery and exchange Life cycle management Curation Packaging. Credit and publishing. Workflow engineers Workflow custodians
  15. Local level or eu hosted
  16. Reducing sensitivity, robustness to loss SHIWA and ER Flow Factories
  17. Reducing Mortality, Invasion, Predatory Black boxes Poor metadata Incompatibility of data formats and identifiers. Poor awareness or adherence to standards. Poor methodology Unrepeatable or unknown experimental method. Black boxes. Incorrect interpretations and poor quality. Poor service / tool / resource ethic Service decay, service palpability & complexity, service reliability & stability, poor diagnostics. GEO, GEOSS, Ecosystems, earth observations NextData c2012.org Encyclopedia of Life Global BioDiversity Informatics Conference www.gbic2012.org Dawn and Cynthia Parr (EOL)
  18. A virtual machine (VM) is a software implementation of a machine (i.e. a computer) that executes programs like a physical machine. Virtual machines are separated into two major classifications, based on their use and degree of correspondence to any real machine: System Zhao, Gomez-Perez, Belhajjame, Klyne, Garcia-Cuesta, Garrido, Hettne, Roos, De Roure and Goble. Why workflows break - Understanding and combating decay in Taverna workflows, 8th Intl Conf e-Science 2012. “Reproducibility success is proportional to the number of dependent components and your control over them” Many reasons why. Change / Availability: updates to public datasets, changes to services / codes. Availability / access to components / execution environment. Platform differences on simulations, code ports. Volatile third-party resources (50%): not available, available but inaccessible, changed. Prevent, Detect, Repair.
  19. Logbook data Capacity, services, collaboration Variation, diversity and change at all levels Modularity Plugins Separate Services from underlying infrastructure Ensure Service Networks are built using standard Web 2.0 technologies Separation of applications, workflows and VREs from the services