WORKS 11 Presentation

dgarijo
A new Approach for Publishing
   Workflows: Abstractions,
  Standards and Linked Data

                       Daniel Garijo
 Ontology Engineering Group, Departamento de Inteligencia
        Artificial. Universidad Politécnica de Madrid

                         Yolanda Gil
             Information Sciences and Institute
      University of Southern California, Marina del Rey
                                                          Date: 14/11/2011
Index of contents

Index:
1.   Background

2.   Limitations of existing approaches to workflow publication

3.   Features of our approach

     •   Publishing abstract workflows and specific workflows

     •   OPMW Ontology

     •   Linked Data Publication

4.   Workflow querying and Linked Data consumption

5.   Conclusions




                                                                                1
Background


Typical Published Article          Reproducible Article:
                                   Weaver, GenePattern GRRD, etc.

            Text:                              Text:
    Narrative of method,               Narrative of method,
   software packages used             software packages used

            Data:                             Data:
Key datasets and figures/plots    Key datasets and figures/plots

                                             Workflow:
NOT published,                     Workflow/scripts describing
loosely recorded:                 dataflow, codes, and parameters

            Software:
scripted codes + manual steps +
          notes/emails

                                                                    2
Current issues with existing publication approaches


                                      Only executable workflow is published:
 Reproducible Article:                1. Must have the same codes to re-execute
 Weaver, GenePattern GRRD, etc.          the workflow, but:
                                          –   Codes become unavailable
                                               •   Eg: eHits was proprietary and replaced by
             Text:                                 AutodockVina
     Narrative of method,                 –   Different labs prefer different codes
    software packages used                     •   Eg: R vs Matlab
                                               •   Eg: viz in Citoscape vs yEd
            Data:                     2. Must have the same workflow framework
Key datasets and figures/plots           to re-execute the workflow
                                          –   Must have R for Weaver
           Workflow:                  3. Must import files to local file system and
 Workflow/scripts describing             workflow framework
dataflow, codes, and parameters
                                          –   Must import bundle of workflow/data/code
                                              files to reproduce



                                                                                               3
Key Features of our approach


•   Publish an abstract workflow in addition to executable workflow
     – Description of workflow that is independent of the codes executed
     – Maps to the codes executed (the “executable workflow”)

•   Publish both abstract and executable workflow using the OPM standard
     – OPM (Open Provenance Model) is independent of workflow framework and is
        widely implemented
     – Other groups can import to their own workflow framework

•   Publish data and workflows as Linked Data on the Web
     – All workflows and related files are web-accessible
     – Simple mechanism to share across local file systems




                                                                                 4
What is Linked Data


1. Use URIs as names for things.

2. Use HTTP URIs so that people can look up those names.

3. When someone looks up a URI, provide useful information.

4. Include links to other URIs.




              “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”

                                                                                                                  5
High level architecture


                                                                                    Other
                                                                                    workflow
                  WINGS on local laptop                                             environments
                         Workflow
          Core           Template        OPM
         Portal          Workflow       export
                         Instance
                                                               Programatic access
                                                                 (external apps)
                  WINGS on shared host
                         Workflow                  Linked
          Core           Template        OPM
         Portal                         export      Data
                         Workflow
                         Instance                Publication       Interactive
                  WINGS on web server
                                                                    Browsing
                         Workflow                               (Pubby frontend)
          Core           Template      OPM
         Portal                       export                                            Users
                        Workflow
                        Instance



Wings workflow                OPM
                                                 Publication      Share               Reuse
  generation                conversion




                                                                                                6
Publishing the abstract workflow




                Comparison of
               Dissimilar protein
                   structures
                   workflow




                                    7
OPMW Ontology


                              opmv:Artifact                                           opmv:Artifact
                                          opmw:                                                            opmw:                       opmv:
                                                                   opmw: hasArtifactTemplate
                                    ArtifactTemplate                                                  ArtifactInstance                 Agent
                                         artifact1                                                      execInput1                     user1
                                                                                                                                                     opmo:account


opmo:          opmo:hasArtifact             opmv:used                                                        opmv:used                      opmo:account       opmo:
                                                                                                                         opmv:wasControlledBy
OPMGraph                                               opmv:Process                                                                                            Account
                                                                                         opmv:Process
      opmw:                                                                                                                                                opmw:
                                  opmw:ProcessTemplate                                                           opmw:ProcessInstance     opmo:
 WorkflowTemplate                                                                                                                                     ExecutionAccount
                      opmo:          templateNode1                 opmw:hasProcessTemplate                         executionNode1         account
    template1                                                                                                                                             account1
                      hasProcess
                                             opmw:hasTemplateComponent                         opmw:hasSpecificComponent
                opmo:          opmv:
                hasArtifact    wasGeneratedBy            ac:AbstractComponent         ac:SpecificComponent         opmv:wasGeneratedBy         opmo: account
                                                               absComp1                    specComp1

                                         opmw:                                                                           opmw:
                                                                      opmw:hasArtifactTemplate                       ArtifactInstace
                                    ArtifactTemplate
                                     outputArtifact1                                                               executionOutput1
                                                opmv:Artifact                             opmv:Artifact
                                                                    opmw:hasWorkflowTemplate

                        Abstract Workflow                                                                     Executable Workflow




                                                                                                                                                                     8
Publication of Workflows as Linked Data



                                  Linked Data publication
               Abstract
               Workflow                            RDF Upload
   Wings        (OPM)                               Interface        OPM
                                                                   conversion
  OPM          Executable                                        Other workflow
conversion                                                       frameworks
               Workflow
                                                       RDF
                 (OPM)                                               OPM
                            Permanent                 Triple
                                                      store         import
                               web-
                            accessible
              Workflow          file
                Data,          store                 SPARQL      Web
             Components,                            Endpoint     accessible
                 etc.
                                                                Web
                                                                browser



                                                                                  9
Searching/Browsing Workflows as Linked Data

             Types of
             search




                              Resource URI
                              (Process instance)

                        Autocomplete search bar



                                  Specific component for this
                                  process instance
Properties




                                                                10
Searching/Browsing Workflows as Linked Data

           Component Name

            Component Inputs

           Component Outputs



            Code Implementations



           Template additional metadata




            Record of the different
            executions of this workflow
                                          11
Conclusions



1. Publication of an abstract workflow that represents the computational method in an
   execution-independent manner.




2. Publication of the abstract workflow and the executed workflow using the OPM
   standard that is independent of the execution environment used.




3. Publication of the workflows, components, codes and datasets as Linked Data on the
   web.




                                                                                    12
Future work



•   Extensions to abstract workflow publication
     – Be able to provide abstractions on several steps.
     – Incomplete provenance.



•   Create an OPMV/W3C PROV-O profile for common workflow representation.
     – Increase interoperability with other workflow representation systems.



•   Workflow reuse in different workflow systems.
     – Import and execute workflows in other workflow frameworks.




                                                                                13
References


• WINGS workflow system: http://seagull.isi.edu/marbles/

•The Open Provenance Model Specification: http://openprovenance.org/

• OPMO: http://openprovenance.org/model/opmo

•OPMV: http://open-biomed.sourceforge.net/opmv/ns.html

• TB Drugome Wiki (Evolution of this work):
    http://seagull.isi.edu/wings-drugome/index.php/Main_Page

•W3C PROV-O current ontology (draft):
   http://www.w3.org/2011/prov/wiki/PIL_OWL_Ontology

•Principles of Linked Data:
     http://www.w3.org/DesignIssues/LinkedData.html




                                                                              14
Acknowledgements


•UCSD people:

     •Li Xie

     •Lei Xie

     •Sarah Kinnings

     •Phil Bourne

•ISI people:

     •Varun Ratnakaar

•OEG people:

     •Oscar Corcho



                                     15
A new Approach for Publishing
   Workflows: Abstractions,
  Standards and Linked Data

                       Daniel Garijo
 Ontology Engineering Group, Departamento de Inteligencia
        Artificial. Universidad Politécnica de Madrid

                         Yolanda Gil
             Information Sciences and Institute
      University of Southern California, Marina del Rey
                                                          Date: 14/11/2011
1 of 17

Recommended

(ATS4-DEV04) Protocols as RESTful Services and RESTful URL Routing by
(ATS4-DEV04) Protocols as RESTful Services and RESTful URL Routing(ATS4-DEV04) Protocols as RESTful Services and RESTful URL Routing
(ATS4-DEV04) Protocols as RESTful Services and RESTful URL RoutingBIOVIA
1.3K views23 slides
Modularity with OSGi by
Modularity with OSGiModularity with OSGi
Modularity with OSGiPeter Kriens
829 views50 slides
Modularity with OSGi by
Modularity with OSGiModularity with OSGi
Modularity with OSGiPeter Kriens
627 views50 slides
Intel open mp by
Intel open mpIntel open mp
Intel open mpPiyush Mittal
681 views82 slides
AD for i in modern world by
AD for i in modern worldAD for i in modern world
AD for i in modern worldCOMMON Europe
674 views16 slides
Abap course chapter 1 introduction and first program by
Abap course   chapter 1 introduction and first programAbap course   chapter 1 introduction and first program
Abap course chapter 1 introduction and first programMilind Patil
3.4K views18 slides

More Related Content

Similar to WORKS 11 Presentation

Status update OEG - Nov 2012 by
Status update OEG - Nov 2012Status update OEG - Nov 2012
Status update OEG - Nov 2012dgarijo
455 views26 slides
Overview Of .Net 4.0 Sanjay Vyas by
Overview Of .Net 4.0   Sanjay VyasOverview Of .Net 4.0   Sanjay Vyas
Overview Of .Net 4.0 Sanjay Vyasrsnarayanan
1.1K views32 slides
ISI work by
ISI workISI work
ISI workdgarijo
981 views16 slides
MEW22 22nd Machine Evaluation Workshop Microsoft by
MEW22 22nd Machine Evaluation Workshop MicrosoftMEW22 22nd Machine Evaluation Workshop Microsoft
MEW22 22nd Machine Evaluation Workshop MicrosoftLee Stott
2.3K views16 slides
OpenStack Preso: DevOps on Hybrid Infrastructure by
OpenStack Preso: DevOps on Hybrid InfrastructureOpenStack Preso: DevOps on Hybrid Infrastructure
OpenStack Preso: DevOps on Hybrid Infrastructurerhirschfeld
972 views21 slides
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure by
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructuredevopsdaysaustin
426 views21 slides

Similar to WORKS 11 Presentation(20)

Status update OEG - Nov 2012 by dgarijo
Status update OEG - Nov 2012Status update OEG - Nov 2012
Status update OEG - Nov 2012
dgarijo455 views
Overview Of .Net 4.0 Sanjay Vyas by rsnarayanan
Overview Of .Net 4.0   Sanjay VyasOverview Of .Net 4.0   Sanjay Vyas
Overview Of .Net 4.0 Sanjay Vyas
rsnarayanan1.1K views
ISI work by dgarijo
ISI workISI work
ISI work
dgarijo981 views
MEW22 22nd Machine Evaluation Workshop Microsoft by Lee Stott
MEW22 22nd Machine Evaluation Workshop MicrosoftMEW22 22nd Machine Evaluation Workshop Microsoft
MEW22 22nd Machine Evaluation Workshop Microsoft
Lee Stott2.3K views
OpenStack Preso: DevOps on Hybrid Infrastructure by rhirschfeld
OpenStack Preso: DevOps on Hybrid InfrastructureOpenStack Preso: DevOps on Hybrid Infrastructure
OpenStack Preso: DevOps on Hybrid Infrastructure
rhirschfeld972 views
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure by devopsdaysaustin
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
devopsdaysaustin426 views
Introduction to the Semantic Web by Nuxeo
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
Nuxeo2.1K views
What's new in Nuxeo 5.2? - Solutions Linux 2009 by Stefane Fermigier
What's new in Nuxeo 5.2? - Solutions Linux 2009What's new in Nuxeo 5.2? - Solutions Linux 2009
What's new in Nuxeo 5.2? - Solutions Linux 2009
Stefane Fermigier827 views
Integrating Splunk into your Spring Applications by Damien Dallimore
Integrating Splunk into your Spring ApplicationsIntegrating Splunk into your Spring Applications
Integrating Splunk into your Spring Applications
Damien Dallimore11.2K views
(ATS3-DEV05) Coding up Pipeline Pilot Components by BIOVIA
(ATS3-DEV05) Coding up Pipeline Pilot Components(ATS3-DEV05) Coding up Pipeline Pilot Components
(ATS3-DEV05) Coding up Pipeline Pilot Components
BIOVIA2.5K views
SharePoint 2010 as a Development Platform by Ayman El-Hattab
SharePoint 2010 as a Development PlatformSharePoint 2010 as a Development Platform
SharePoint 2010 as a Development Platform
Ayman El-Hattab1.2K views
Spring MVC framework by Mohit Gupta
Spring MVC frameworkSpring MVC framework
Spring MVC framework
Mohit Gupta919 views
Splunk as a_big_data_platform_for_developers_spring_one2gx by Damien Dallimore
Splunk as a_big_data_platform_for_developers_spring_one2gxSplunk as a_big_data_platform_for_developers_spring_one2gx
Splunk as a_big_data_platform_for_developers_spring_one2gx
Damien Dallimore5.3K views
Hadoop World 2011: Proven Tools to Manage Hadoop Environments - Joey Jablonsk... by Cloudera, Inc.
Hadoop World 2011: Proven Tools to Manage Hadoop Environments - Joey Jablonsk...Hadoop World 2011: Proven Tools to Manage Hadoop Environments - Joey Jablonsk...
Hadoop World 2011: Proven Tools to Manage Hadoop Environments - Joey Jablonsk...
Cloudera, Inc.986 views
Java EE 7 with Apache Spark for the World’s Largest Credit Card Core Systems ... by Hirofumi Iwasaki
Java EE 7 with Apache Spark for the World’s Largest Credit Card Core Systems ...Java EE 7 with Apache Spark for the World’s Largest Credit Card Core Systems ...
Java EE 7 with Apache Spark for the World’s Largest Credit Card Core Systems ...
Hirofumi Iwasaki1.1K views
An Integrated Framework for Parameter-based Optimization of Scientific Workflows by vijayskumar
An Integrated Framework for Parameter-based Optimization of Scientific WorkflowsAn Integrated Framework for Parameter-based Optimization of Scientific Workflows
An Integrated Framework for Parameter-based Optimization of Scientific Workflows
vijayskumar320 views
A Java Implementer's Guide to Boosting Apache Spark Performance by Tim Ellison. by J On The Beach
A Java Implementer's Guide to Boosting Apache Spark Performance by Tim Ellison.A Java Implementer's Guide to Boosting Apache Spark Performance by Tim Ellison.
A Java Implementer's Guide to Boosting Apache Spark Performance by Tim Ellison.
J On The Beach1.6K views
Building and deploying LLM applications with Apache Airflow by Kaxil Naik
Building and deploying LLM applications with Apache AirflowBuilding and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache Airflow
Kaxil Naik94 views
HPC Web overview - Mobyle Workshop - September 28, 2012 by Hervé Ménager
HPC Web overview - Mobyle Workshop - September 28, 2012HPC Web overview - Mobyle Workshop - September 28, 2012
HPC Web overview - Mobyle Workshop - September 28, 2012
Hervé Ménager1.6K views

More from dgarijo

FOOPS!: An Ontology Pitfall Scanner for the FAIR principles by
FOOPS!: An Ontology Pitfall Scanner for the FAIR principlesFOOPS!: An Ontology Pitfall Scanner for the FAIR principles
FOOPS!: An Ontology Pitfall Scanner for the FAIR principlesdgarijo
519 views8 slides
FAIR Workflows: A step closer to the Scientific Paper of the Future by
FAIR Workflows: A step closer to the Scientific Paper of the FutureFAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the Futuredgarijo
618 views36 slides
Towards Reusable Research Software by
Towards Reusable Research SoftwareTowards Reusable Research Software
Towards Reusable Research Softwaredgarijo
171 views9 slides
SOMEF: a metadata extraction framework from software documentation by
SOMEF: a metadata extraction framework from software documentationSOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentationdgarijo
121 views7 slides
A Template-Based Approach for Annotating Long-Tailed Datasets by
A Template-Based Approach for Annotating Long-Tailed DatasetsA Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed Datasetsdgarijo
144 views12 slides
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs by
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge GraphsOBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphsdgarijo
423 views21 slides

More from dgarijo(20)

FOOPS!: An Ontology Pitfall Scanner for the FAIR principles by dgarijo
FOOPS!: An Ontology Pitfall Scanner for the FAIR principlesFOOPS!: An Ontology Pitfall Scanner for the FAIR principles
FOOPS!: An Ontology Pitfall Scanner for the FAIR principles
dgarijo519 views
FAIR Workflows: A step closer to the Scientific Paper of the Future by dgarijo
FAIR Workflows: A step closer to the Scientific Paper of the FutureFAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the Future
dgarijo618 views
Towards Reusable Research Software by dgarijo
Towards Reusable Research SoftwareTowards Reusable Research Software
Towards Reusable Research Software
dgarijo171 views
SOMEF: a metadata extraction framework from software documentation by dgarijo
SOMEF: a metadata extraction framework from software documentationSOMEF: a metadata extraction framework from software documentation
SOMEF: a metadata extraction framework from software documentation
dgarijo121 views
A Template-Based Approach for Annotating Long-Tailed Datasets by dgarijo
A Template-Based Approach for Annotating Long-Tailed DatasetsA Template-Based Approach for Annotating Long-Tailed Datasets
A Template-Based Approach for Annotating Long-Tailed Datasets
dgarijo144 views
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs by dgarijo
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge GraphsOBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs
OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs
dgarijo423 views
Towards Knowledge Graphs of Reusable Research Software Metadata by dgarijo
Towards Knowledge Graphs of Reusable Research Software MetadataTowards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software Metadata
dgarijo624 views
Scientific Software Registry Collaboration Workshop: From Software Metadata r... by dgarijo
Scientific Software Registry Collaboration Workshop: From Software Metadata r...Scientific Software Registry Collaboration Workshop: From Software Metadata r...
Scientific Software Registry Collaboration Workshop: From Software Metadata r...
dgarijo460 views
WDPlus: Leveraging Wikidata to Link and Extend Tabular Data by dgarijo
WDPlus: Leveraging Wikidata to Link and Extend Tabular DataWDPlus: Leveraging Wikidata to Link and Extend Tabular Data
WDPlus: Leveraging Wikidata to Link and Extend Tabular Data
dgarijo584 views
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M... by dgarijo
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
OKG-Soft: An Open Knowledge Graph With Mathine Readable Scientific Software M...
dgarijo1.8K views
Towards Human-Guided Machine Learning - IUI 2019 by dgarijo
Towards Human-Guided Machine Learning - IUI 2019Towards Human-Guided Machine Learning - IUI 2019
Towards Human-Guided Machine Learning - IUI 2019
dgarijo545 views
Capturing Context in Scientific Experiments: Towards Computer-Driven Science by dgarijo
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
dgarijo551 views
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met... by dgarijo
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...
A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...
dgarijo583 views
WIDOCO: A Wizard for Documenting Ontologies by dgarijo
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologies
dgarijo1.2K views
Towards Automating Data Narratives by dgarijo
Towards Automating Data NarrativesTowards Automating Data Narratives
Towards Automating Data Narratives
dgarijo918 views
Automated Hypothesis Testing with Large Scale Scientific Workflows by dgarijo
Automated Hypothesis Testing with Large Scale Scientific WorkflowsAutomated Hypothesis Testing with Large Scale Scientific Workflows
Automated Hypothesis Testing with Large Scale Scientific Workflows
dgarijo586 views
OntoSoft: A Distributed Semantic Registry for Scientific Software by dgarijo
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Software
dgarijo919 views
OEG tools for supporting Ontology Engineering by dgarijo
OEG tools for supporting Ontology EngineeringOEG tools for supporting Ontology Engineering
OEG tools for supporting Ontology Engineering
dgarijo289 views
Software Metadata: Describing "dark software" in GeoSciences by dgarijo
Software Metadata: Describing "dark software" in GeoSciencesSoftware Metadata: Describing "dark software" in GeoSciences
Software Metadata: Describing "dark software" in GeoSciences
dgarijo901 views
Reproducibility Using Semantics: An Overview by dgarijo
Reproducibility Using Semantics: An OverviewReproducibility Using Semantics: An Overview
Reproducibility Using Semantics: An Overview
dgarijo890 views

Recently uploaded

HTTP headers that make your website go faster - devs.gent November 2023 by
HTTP headers that make your website go faster - devs.gent November 2023HTTP headers that make your website go faster - devs.gent November 2023
HTTP headers that make your website go faster - devs.gent November 2023Thijs Feryn
21 views151 slides
Report 2030 Digital Decade by
Report 2030 Digital DecadeReport 2030 Digital Decade
Report 2030 Digital DecadeMassimo Talia
15 views41 slides
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensorssugiuralab
19 views15 slides
SAP Automation Using Bar Code and FIORI.pdf by
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdfVirendra Rai, PMP
22 views38 slides
6g - REPORT.pdf by
6g - REPORT.pdf6g - REPORT.pdf
6g - REPORT.pdfLiveplex
10 views23 slides
Empathic Computing: Delivering the Potential of the Metaverse by
Empathic Computing: Delivering  the Potential of the MetaverseEmpathic Computing: Delivering  the Potential of the Metaverse
Empathic Computing: Delivering the Potential of the MetaverseMark Billinghurst
476 views80 slides

Recently uploaded(20)

HTTP headers that make your website go faster - devs.gent November 2023 by Thijs Feryn
HTTP headers that make your website go faster - devs.gent November 2023HTTP headers that make your website go faster - devs.gent November 2023
HTTP headers that make your website go faster - devs.gent November 2023
Thijs Feryn21 views
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by sugiuralab
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors
sugiuralab19 views
SAP Automation Using Bar Code and FIORI.pdf by Virendra Rai, PMP
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdf
6g - REPORT.pdf by Liveplex
6g - REPORT.pdf6g - REPORT.pdf
6g - REPORT.pdf
Liveplex10 views
Empathic Computing: Delivering the Potential of the Metaverse by Mark Billinghurst
Empathic Computing: Delivering  the Potential of the MetaverseEmpathic Computing: Delivering  the Potential of the Metaverse
Empathic Computing: Delivering the Potential of the Metaverse
Mark Billinghurst476 views
Data-centric AI and the convergence of data and model engineering: opportunit... by Paolo Missier
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
Paolo Missier39 views
AMAZON PRODUCT RESEARCH.pdf by JerikkLaureta
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdf
JerikkLaureta19 views
Five Things You SHOULD Know About Postman by Postman
Five Things You SHOULD Know About PostmanFive Things You SHOULD Know About Postman
Five Things You SHOULD Know About Postman
Postman30 views
Business Analyst Series 2023 - Week 3 Session 5 by DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10237 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
From chaos to control: Managing migrations and Microsoft 365 with ShareGate! by sammart93
From chaos to control: Managing migrations and Microsoft 365 with ShareGate!From chaos to control: Managing migrations and Microsoft 365 with ShareGate!
From chaos to control: Managing migrations and Microsoft 365 with ShareGate!
sammart939 views
1st parposal presentation.pptx by i238212
1st parposal presentation.pptx1st parposal presentation.pptx
1st parposal presentation.pptx
i2382129 views
PharoJS - Zürich Smalltalk Group Meetup November 2023 by Noury Bouraqadi
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023
Noury Bouraqadi126 views
Case Study Copenhagen Energy and Business Central.pdf by Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana16 views
Piloting & Scaling Successfully With Microsoft Viva by Richard Harbridge
Piloting & Scaling Successfully With Microsoft VivaPiloting & Scaling Successfully With Microsoft Viva
Piloting & Scaling Successfully With Microsoft Viva

WORKS 11 Presentation

  • 1. A new Approach for Publishing Workflows: Abstractions, Standards and Linked Data Daniel Garijo Ontology Engineering Group, Departamento de Inteligencia Artificial. Universidad Politécnica de Madrid Yolanda Gil Information Sciences and Institute University of Southern California, Marina del Rey Date: 14/11/2011
  • 2. Index of contents Index: 1. Background 2. Limitations of existing approaches to workflow publication 3. Features of our approach • Publishing abstract workflows and specific workflows • OPMW Ontology • Linked Data Publication 4. Workflow querying and Linked Data consumption 5. Conclusions 1
  • 3. Background Typical Published Article Reproducible Article: Weaver, GenePattern GRRD, etc. Text: Text: Narrative of method, Narrative of method, software packages used software packages used Data: Data: Key datasets and figures/plots Key datasets and figures/plots Workflow: NOT published, Workflow/scripts describing loosely recorded: dataflow, codes, and parameters Software: scripted codes + manual steps + notes/emails 2
  • 4. Current issues with existing publication approaches Only executable workflow is published: Reproducible Article: 1. Must have the same codes to re-execute Weaver, GenePattern GRRD, etc. the workflow, but: – Codes become unavailable • Eg: eHits was proprietary and replaced by Text: AutodockVina Narrative of method, – Different labs prefer different codes software packages used • Eg: R vs Matlab • Eg: viz in Citoscape vs yEd Data: 2. Must have the same workflow framework Key datasets and figures/plots to re-execute the workflow – Must have R for Weaver Workflow: 3. Must import files to local file system and Workflow/scripts describing workflow framework dataflow, codes, and parameters – Must import bundle of workflow/data/code files to reproduce 3
  • 5. Key Features of our approach • Publish an abstract workflow in addition to executable workflow – Description of workflow that is independent of the codes executed – Maps to the codes executed (the “executable workflow”) • Publish both abstract and executable workflow using the OPM standard – OPM (Open Provenance Model) is independent of workflow framework and is widely implemented – Other groups can import to their own workflow framework • Publish data and workflows as Linked Data on the Web – All workflows and related files are web-accessible – Simple mechanism to share across local file systems 4
  • 6. What is Linked Data 1. Use URIs as names for things. 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information. 4. Include links to other URIs. “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/” 5
  • 7. High level architecture Other workflow WINGS on local laptop environments Workflow Core Template OPM Portal Workflow export Instance Programatic access (external apps) WINGS on shared host Workflow Linked Core Template OPM Portal export Data Workflow Instance Publication Interactive WINGS on web server Browsing Workflow (Pubby frontend) Core Template OPM Portal export Users Workflow Instance Wings workflow OPM Publication Share Reuse generation conversion 6
  • 8. Publishing the abstract workflow Comparison of Dissimilar protein structures workflow 7
  • 9. OPMW Ontology opmv:Artifact opmv:Artifact opmw: opmw: opmv: opmw: hasArtifactTemplate ArtifactTemplate ArtifactInstance Agent artifact1 execInput1 user1 opmo:account opmo: opmo:hasArtifact opmv:used opmv:used opmo:account opmo: opmv:wasControlledBy OPMGraph opmv:Process Account opmv:Process opmw: opmw: opmw:ProcessTemplate opmw:ProcessInstance opmo: WorkflowTemplate ExecutionAccount opmo: templateNode1 opmw:hasProcessTemplate executionNode1 account template1 account1 hasProcess opmw:hasTemplateComponent opmw:hasSpecificComponent opmo: opmv: hasArtifact wasGeneratedBy ac:AbstractComponent ac:SpecificComponent opmv:wasGeneratedBy opmo: account absComp1 specComp1 opmw: opmw: opmw:hasArtifactTemplate ArtifactInstace ArtifactTemplate outputArtifact1 executionOutput1 opmv:Artifact opmv:Artifact opmw:hasWorkflowTemplate Abstract Workflow Executable Workflow 8
  • 10. Publication of Workflows as Linked Data Linked Data publication Abstract Workflow RDF Upload Wings (OPM) Interface OPM conversion OPM Executable Other workflow conversion frameworks Workflow RDF (OPM) OPM Permanent Triple store import web- accessible Workflow file Data, store SPARQL Web Components, Endpoint accessible etc. Web browser 9
  • 11. Searching/Browsing Workflows as Linked Data Types of search Resource URI (Process instance) Autocomplete search bar Specific component for this process instance Properties 10
  • 12. Searching/Browsing Workflows as Linked Data Component Name Component Inputs Component Outputs Code Implementations Template additional metadata Record of the different executions of this workflow 11
  • 13. Conclusions 1. Publication of an abstract workflow that represents the computational method in an execution-independent manner. 2. Publication of the abstract workflow and the executed workflow using the OPM standard that is independent of the execution environment used. 3. Publication of the workflows, components, codes and datasets as Linked Data on the web. 12
  • 14. Future work • Extensions to abstract workflow publication – Be able to provide abstractions on several steps. – Incomplete provenance. • Create an OPMV/W3C PROV-O profile for common workflow representation. – Increase interoperability with other workflow representation systems. • Workflow reuse in different workflow systems. – Import and execute workflows in other workflow frameworks. 13
  • 15. References • WINGS workflow system: http://seagull.isi.edu/marbles/ •The Open Provenance Model Specification: http://openprovenance.org/ • OPMO: http://openprovenance.org/model/opmo •OPMV: http://open-biomed.sourceforge.net/opmv/ns.html • TB Drugome Wiki (Evolution of this work): http://seagull.isi.edu/wings-drugome/index.php/Main_Page •W3C PROV-O current ontology (draft): http://www.w3.org/2011/prov/wiki/PIL_OWL_Ontology •Principles of Linked Data: http://www.w3.org/DesignIssues/LinkedData.html 14
  • 16. Acknowledgements •UCSD people: •Li Xie •Lei Xie •Sarah Kinnings •Phil Bourne •ISI people: •Varun Ratnakaar •OEG people: •Oscar Corcho 15
  • 17. A new Approach for Publishing Workflows: Abstractions, Standards and Linked Data Daniel Garijo Ontology Engineering Group, Departamento de Inteligencia Artificial. Universidad Politécnica de Madrid Yolanda Gil Information Sciences and Institute University of Southern California, Marina del Rey Date: 14/11/2011