SlideShare a Scribd company logo
1 of 17
Taverna workflows: provenance
and reproducibility
Aleksandra Pawlik
The University of Manchester
Workflow approaches to investigation of biological complexity
STFC/NERC Workshop 15-16 October 2013
Workflows for improvement
Workflows are more than just
pipelines…
Scaling

up automated execution
Bringing together distributed and
continually changing resources
Dealing with different standards,
interfaces and implementation
Support for repeatable analysis
Taverna Engine Execution
Workflows in Scufl2
 Functional dataflow, simple control flows, implicit


iteration

Linking services and tools
 Different data resources and formats
 “In Workflow Programming” (eg. Beanshell scripting)
 Provenance collection: W3C PROV-O, OPM
 Plug-in Framework







Infrastructures: Web Services (SOAP, REST), Grid, HPC
Common Tools: Excel Spreadsheets, Google Refine, R

OAuth security plug-in
Taverna Workbench
• Customizable for domains (eg. expose services only for
biodiversity)
• Desktop application
• Intermediate results views
• Plug-in framework
List of services

Workflow engine
to run workflows

Construct and
visualise workflows
Taverna User Spectrum
Taverna

Concept Knowledge

Workflow
Engineer

Workbench

Computational
Scientist

Workbench
Components

Lite

Domain
Domain
Scientist
(Workflow User)

Domain-Specific
Website / Tool / Portal

Player

High

Workflow Visibility

Low
reuse



Right apps, right users
Commodity apps:













Web. Spreadsheets. R.

Customisation
Mixed workflow / scripting
Deployment / Portability


Apps
Apps

Web based / desktop
Virtualised deployments
Cloud hosted service
A cloud-enabled local host

Local ownership
Capability building

Workflow
Workflow
WFMS
WFMS
middleware
middleware

Infrastructure
Infrastructure

Domain/task specific apps that
incorporate (an ecosystem of)
workflows. Integrate

Parameterised, integrative, multi-step
(data) pipelines, analytics,
computational protocols. Can be
repetitively reused.
Support design, config. and
execution of workflows. manage
utility actions for data, logging,
security, compute, error. Shield
incompatibilities & complexity.
Legacy, others and your own software,
datasets, services, codes, and
platforms.
Optimise and manage use of
computing infrastructure.
Reuse and Reproducibility
~6,000 members
over 300 groups,
over 3,000 workflows
Taverna Components

Workflow Blocks made of a
workflow
 Well

described
 Well behaved
 Well looked after
 Agreed fail
 Agreed formats in and out
 Agreed provenance
Deposited in myExperiment
Grouped into families
Provenance: how did you do it?
 The

link between
computation and
results

d1

 Reporting

at
different scales/
levels

d2

S1

S0

S1

w

-> Using
Provenance

d1'

S0

 Collecting

d2

z

w

S2

S'2

y

y'

S4

S4

df

df'

(i) Trace A

(ii) Trace B

PDIFF: comparing provenance traces to
diagnose divergence across experimental
results [Woodman et al, 2011]
Research Objects
http://www.researchobject.org/

http://www.w3.org/community/rosc/

bundles and relates digital resources of a
scientific experiment or investigation using
standard mechanisms
Taverna in Galaxy
Wrap as Tool
Tools

Workflow in

Upload

Galaxy execution

Taverna server
The Taverna Suite of Tools
Workflow
Repository

User Interfaces

Workbench

Service Catalogue

Workflow Engine

Workflow
Provenance

Activity and
Service Plug-in
Manager

Taverna
Lite

Workflow
Server

Web Portals / Gateways
Client User Interfaces

Third Party Tools

Player
Virtual
Machine

Workflow
Components

Command

Interaction
Line
Server

Prog
APIs
Sustainability and user support
Freely available
Open source
Current version 2.4
80,000+ downloads
across version
Windows/Mac OS X/
Linux/Unix

Tutorials and Workshops

Active user forum
& support

www.taverna.org.uk
Taverna in other projects
BioDiversity Virtual e-Laboratory
www.biovel.eu

SCAPE
www.scape-project.eu

Wf4Ever
www.wf4ever-project.org

VPH-Share
www.vph-share.eu

HELIO
www.helio-vo.eu

iPlant Collaborative
www.iplantcollaborative.or
g

HELIO
www.helio-vo.eu

Pacific Northwest
National Laboratory
www.pnnl.gov

KBase
www.kbase.us

Scientific Workflows and
Provenance Working Group
www.dataone.org

SHIWA
www.shiwa-workflow.eu
Products

Methods
Data-centric Computation
Scientific workflows over
Distributed Cyber-Infrastructure.

Data sharing
libraries and catalogues for all
types of scientific artefacts and all
types of scientists.

Knowledge Management
Metadata, semantics digital
exchange, preservation,
publishing

Software Engineering
Software sustainability,
software and data policy,
training
For more information


Taverna




myExperiment




http://www.taverna.org.uk

http://www.myexperiment.org

myGrid


http://www.mygrid.org.uk

More Related Content

Similar to Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013

The Taverna Software Suite
The Taverna Software SuiteThe Taverna Software Suite
The Taverna Software SuitemyGrid team
 
2014 Taverna tutorial introduction to Taverna workflows
2014 Taverna tutorial introduction to Taverna workflows2014 Taverna tutorial introduction to Taverna workflows
2014 Taverna tutorial introduction to Taverna workflowsmyGrid team
 
eResearch workflows for studying free and open source software development
eResearch workflows for studying free and open source software developmenteResearch workflows for studying free and open source software development
eResearch workflows for studying free and open source software developmentAndrea Wiggins
 
Advances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsAdvances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsCarole Goble
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suitesmarru
 
1.5 weka an intoduction
1.5 weka an intoduction1.5 weka an intoduction
1.5 weka an intoductionKrish_ver2
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Wolstencroft K - Workflows on the Cloud: scaling for national service
Wolstencroft K - Workflows on the Cloud: scaling for national serviceWolstencroft K - Workflows on the Cloud: scaling for national service
Wolstencroft K - Workflows on the Cloud: scaling for national serviceJan Aerts
 
XSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata TutorialXSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata Tutorialmarpierc
 
LarKC Tutorial at ISWC 2009 - Introduction
LarKC Tutorial at ISWC 2009 - IntroductionLarKC Tutorial at ISWC 2009 - Introduction
LarKC Tutorial at ISWC 2009 - IntroductionLarKC
 
DustinVannoy_DataPipelines_AzureDataConf_Dec22.pdf
DustinVannoy_DataPipelines_AzureDataConf_Dec22.pdfDustinVannoy_DataPipelines_AzureDataConf_Dec22.pdf
DustinVannoy_DataPipelines_AzureDataConf_Dec22.pdfDustin Vannoy
 
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...Timothy McPhillips
 
The IMPACT Interoperability Framework - Workflows for OCR and beyond
The IMPACT Interoperability Framework - Workflows for OCR and beyondThe IMPACT Interoperability Framework - Workflows for OCR and beyond
The IMPACT Interoperability Framework - Workflows for OCR and beyondcneudecker
 
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeWorkflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Proactive ops for container orchestration environments
Proactive ops for container orchestration environmentsProactive ops for container orchestration environments
Proactive ops for container orchestration environmentsDocker, Inc.
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 

Similar to Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013 (20)

The Taverna Software Suite
The Taverna Software SuiteThe Taverna Software Suite
The Taverna Software Suite
 
2014 Taverna tutorial introduction to Taverna workflows
2014 Taverna tutorial introduction to Taverna workflows2014 Taverna tutorial introduction to Taverna workflows
2014 Taverna tutorial introduction to Taverna workflows
 
IMPACT/myGrid Hackathon - Taverna Roadmap
IMPACT/myGrid Hackathon - Taverna RoadmapIMPACT/myGrid Hackathon - Taverna Roadmap
IMPACT/myGrid Hackathon - Taverna Roadmap
 
eResearch workflows for studying free and open source software development
eResearch workflows for studying free and open source software developmenteResearch workflows for studying free and open source software development
eResearch workflows for studying free and open source software development
 
Advances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsAdvances in Scientific Workflow Environments
Advances in Scientific Workflow Environments
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
 
1.5 weka an intoduction
1.5 weka an intoduction1.5 weka an intoduction
1.5 weka an intoduction
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
IMPACT Final Conference - Clemens Neudecker
IMPACT Final Conference - Clemens NeudeckerIMPACT Final Conference - Clemens Neudecker
IMPACT Final Conference - Clemens Neudecker
 
Wolstencroft K - Workflows on the Cloud: scaling for national service
Wolstencroft K - Workflows on the Cloud: scaling for national serviceWolstencroft K - Workflows on the Cloud: scaling for national service
Wolstencroft K - Workflows on the Cloud: scaling for national service
 
XSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata TutorialXSEDE14 SciGaP-Apache Airavata Tutorial
XSEDE14 SciGaP-Apache Airavata Tutorial
 
LarKC Tutorial at ISWC 2009 - Introduction
LarKC Tutorial at ISWC 2009 - IntroductionLarKC Tutorial at ISWC 2009 - Introduction
LarKC Tutorial at ISWC 2009 - Introduction
 
DustinVannoy_DataPipelines_AzureDataConf_Dec22.pdf
DustinVannoy_DataPipelines_AzureDataConf_Dec22.pdfDustinVannoy_DataPipelines_AzureDataConf_Dec22.pdf
DustinVannoy_DataPipelines_AzureDataConf_Dec22.pdf
 
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
Data cleaning with the Kurator toolkit: Bridging the gap between conventional...
 
The IMPACT Interoperability Framework - Workflows for OCR and beyond
The IMPACT Interoperability Framework - Workflows for OCR and beyondThe IMPACT Interoperability Framework - Workflows for OCR and beyond
The IMPACT Interoperability Framework - Workflows for OCR and beyond
 
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeWorkflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Pavani_Rao
Pavani_RaoPavani_Rao
Pavani_Rao
 
Proactive ops for container orchestration environments
Proactive ops for container orchestration environmentsProactive ops for container orchestration environments
Proactive ops for container orchestration environments
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 

Recently uploaded

PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.MateoGardella
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 

Recently uploaded (20)

PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 

Taverna workflows: provenance and reproducibility - STFC/NERC workshop 2013

  • 1. Taverna workflows: provenance and reproducibility Aleksandra Pawlik The University of Manchester Workflow approaches to investigation of biological complexity STFC/NERC Workshop 15-16 October 2013
  • 2. Workflows for improvement Workflows are more than just pipelines… Scaling up automated execution Bringing together distributed and continually changing resources Dealing with different standards, interfaces and implementation Support for repeatable analysis
  • 3. Taverna Engine Execution Workflows in Scufl2  Functional dataflow, simple control flows, implicit  iteration Linking services and tools  Different data resources and formats  “In Workflow Programming” (eg. Beanshell scripting)  Provenance collection: W3C PROV-O, OPM  Plug-in Framework     Infrastructures: Web Services (SOAP, REST), Grid, HPC Common Tools: Excel Spreadsheets, Google Refine, R OAuth security plug-in
  • 4. Taverna Workbench • Customizable for domains (eg. expose services only for biodiversity) • Desktop application • Intermediate results views • Plug-in framework List of services Workflow engine to run workflows Construct and visualise workflows
  • 5. Taverna User Spectrum Taverna Concept Knowledge Workflow Engineer Workbench Computational Scientist Workbench Components Lite Domain Domain Scientist (Workflow User) Domain-Specific Website / Tool / Portal Player High Workflow Visibility Low
  • 6. reuse   Right apps, right users Commodity apps:          Web. Spreadsheets. R. Customisation Mixed workflow / scripting Deployment / Portability  Apps Apps Web based / desktop Virtualised deployments Cloud hosted service A cloud-enabled local host Local ownership Capability building Workflow Workflow WFMS WFMS middleware middleware Infrastructure Infrastructure Domain/task specific apps that incorporate (an ecosystem of) workflows. Integrate Parameterised, integrative, multi-step (data) pipelines, analytics, computational protocols. Can be repetitively reused. Support design, config. and execution of workflows. manage utility actions for data, logging, security, compute, error. Shield incompatibilities & complexity. Legacy, others and your own software, datasets, services, codes, and platforms. Optimise and manage use of computing infrastructure.
  • 8. ~6,000 members over 300 groups, over 3,000 workflows
  • 9. Taverna Components Workflow Blocks made of a workflow  Well described  Well behaved  Well looked after  Agreed fail  Agreed formats in and out  Agreed provenance Deposited in myExperiment Grouped into families
  • 10. Provenance: how did you do it?  The link between computation and results d1  Reporting at different scales/ levels d2 S1 S0 S1 w -> Using Provenance d1' S0  Collecting d2 z w S2 S'2 y y' S4 S4 df df' (i) Trace A (ii) Trace B PDIFF: comparing provenance traces to diagnose divergence across experimental results [Woodman et al, 2011]
  • 11. Research Objects http://www.researchobject.org/ http://www.w3.org/community/rosc/ bundles and relates digital resources of a scientific experiment or investigation using standard mechanisms
  • 12. Taverna in Galaxy Wrap as Tool Tools Workflow in Upload Galaxy execution Taverna server
  • 13. The Taverna Suite of Tools Workflow Repository User Interfaces Workbench Service Catalogue Workflow Engine Workflow Provenance Activity and Service Plug-in Manager Taverna Lite Workflow Server Web Portals / Gateways Client User Interfaces Third Party Tools Player Virtual Machine Workflow Components Command Interaction Line Server Prog APIs
  • 14. Sustainability and user support Freely available Open source Current version 2.4 80,000+ downloads across version Windows/Mac OS X/ Linux/Unix Tutorials and Workshops Active user forum & support www.taverna.org.uk
  • 15. Taverna in other projects BioDiversity Virtual e-Laboratory www.biovel.eu SCAPE www.scape-project.eu Wf4Ever www.wf4ever-project.org VPH-Share www.vph-share.eu HELIO www.helio-vo.eu iPlant Collaborative www.iplantcollaborative.or g HELIO www.helio-vo.eu Pacific Northwest National Laboratory www.pnnl.gov KBase www.kbase.us Scientific Workflows and Provenance Working Group www.dataone.org SHIWA www.shiwa-workflow.eu
  • 16. Products Methods Data-centric Computation Scientific workflows over Distributed Cyber-Infrastructure. Data sharing libraries and catalogues for all types of scientific artefacts and all types of scientists. Knowledge Management Metadata, semantics digital exchange, preservation, publishing Software Engineering Software sustainability, software and data policy, training

Editor's Notes

  1. {"11":"http://purl.org/wf4ever/model\nResearch Objects (RO) aggregate related resources, their provenance and annotations\nConveys “everything you need to know” about a study/experiment/analysis/dataset/workflow\nShareable, evolvable, contributable, citable ROs have their own provenance and lifecycles\n","12":"Hosted resource – no installation tears\nSelf-hosting distribution – locality fears \nServices and/or workflow engine hosted locally or remotely\nHPC/cloud installations avoid cost of local installations on local infrastructures. Some like the comfort of local ownership.\nDeployment Infrastructure of BioVeL\n","1":"Title: Time well spent: Workflows for Environmental Omic Analysis.\nThe contextual analysis of Environmental Omics data is computationally intensive (involving the processing and management of large quantities of data), highly integrative (spanning data from many different disciplines) and rapidly evolving (involving the continuous development of novel methods and technologies). This poses a number of challenges for researchers in the field, including access to appropriate infrastructure, taking advantage of recent advancements and communicating research activities.\nScientific Workflow Management Systems, such as the Taverna Workflow Suite, are a particular class of computer application that manage the design, configuration and execution of repetitive, multi-step analysis processes that are particularly prevalent in Environmental Omics. The system handles the awkward work of accessing the different software and platforms, managing the data and security, handling errors and documenting the process.\nUtilising HPC or cloud installations of Taverna also means that there is no requirement to install tools and data sources locally, which reduces local infrastructure and maintenance costs and enables rapid workflow development and testing. Consequently, large-scale analyses can be performed regardless of local infrastructure.\n The Taverna Workflow Suite is currently powering the Biodiversity Virtual eLaboratory project (www.biovel.eu), the project is beginning to release a number of useful Environmental Omic workflows in collaboration with Genomic Observatories (http://genomicobservatories.blogspot.co.uk/) and MicroB3 (Ocean Sampling Day. http://www.microb3.eu/news/new-axis-collaboration-biovel-workflows-micro-b3-ocean-sampling-day).\nThis talk will discuss aspects of workflows and the benefits that adopting workflows as an integral part of Environmental Omics analysis can offer to the community including, reproducibility, knowledge exchange and easier access to high performance infrastructure.\n","14":"2001, run by manchester and oxford\n"}