- The document discusses astronomy research and proposes making the research process more digital, collaborative, and reproducible through the use of research objects and scientific workflows.
- It describes the astronomy research lifecycle and proposes capturing all the components of an experiment digitally in research objects to make experiments repeatable, reusable, and shareable.
- Scientific workflows are presented as a way to combine data and processes into a structured set of steps to automate computational solutions and make science more reproducible.
- The Wf4Ever project aims to build an infrastructure to manage research objects and scientific workflows over time to facilitate collaborative work and the development of new scientific knowledge in astronomy.
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
This is a derivative of a talk I gave at the Linnean society on 20th Sept. 2012. This version was given at the i4Life Environmental Genomics workshop on 25th Sept. and refocused to look at the dark taxa problem and developing published descriptions of molecular sequence clusters.
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
This is a derivative of a talk I gave at the Linnean society on 20th Sept. 2012. This version was given at the i4Life Environmental Genomics workshop on 25th Sept. and refocused to look at the dark taxa problem and developing published descriptions of molecular sequence clusters.
Description of the Virtual Observatory architecture, and IVOA documents that make it explicit. Part of the virtual observatory course by Juan de Dios Santander Vela, as imparted for the MTAF (Métodos y Técnicas Avanzadas en Física, Advanced Methods and Techniques in Physics) Master at the University of Granada (UGR).
The selection of guide stars for giant telescopes using Virtual Observatory t...Hector Quintero Casanova
Presentation for the preliminary report on how to go about searching suitable guide stars for giant telescopes such as the E-ELT (at the time the E-ELT project still had not been given the go-ahead) using the star catalogues available on the Virtual Observatory.
Los IPython Notebooks nos han proporcionado una sustancial mejora en la documentación del scripts, así como su inspección y una mayor re-utilización. Los IPython Notebooks también permiten acceder a distintos lenguajes de programación (Fortran, IDL, R, Shell,..) en un mismo script, lo que unido a su modo de acceso Web les hace ser un elemento ideal para el trabajo colaborativo (multi-lenguaje, multi-usuario, multi-plataforma, etc..) Os contaré qué tipo de cosas pueden hacerse con IPython Notebooks, desde desarrollo colaborativo de código multi-lenguaje, pasando por la reutilización de tutoriales, visualización interactiva de resultados, hasta la distribución de código más modular, y la publicación final de un experimento digital verificable y reproducible: el preámbulo de los papers ejecutables.
Presentation given at CERN Workshop on Innovations in Scholarly Communication (OAI7) on 22nd June 2011
http://indico.cern.ch/conferenceDisplay.py?confId=103325
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...Susanna-Assunta Sansone
Part of the SciDataCon14 workshop on "Data Papers and their applications" run by myself and Brian Hole to help attendees understand current data-publishing journals and trends and help them understand the editorial processes on NPG's Scientific Data and Ubiquity's Open Health Data.
Where are we going and how are we going to get there?David De Roure
Keynote from JISC Projects start-up meeting
Information Environment 2009-11 & Virtual Research Environment http://www.jisc.ac.uk/whatwedo/programmes/inf11/inf11startup.aspx
ORCID identifiers in repositories
The ORCID identifier has been incorporated into numerous repository platforms. This session will offer a discussion of integration points, policy issues, data flow between systems, researcher participation, discovered opportunities, and demonstrations by universities, research organizations, and vendors.
Moderator: Salvatore Mele, Head of Open Access at CERN
Presenters:
Robin Haw, Scientific Associate and Reactome Outreach Coordinator, Department of Informatics and Bio-computing, OICR
Rick Johnson, Co-Program Director, Digital Library Initiatives and Scholarship E-Research and Digital Initiatives, Notre Dame University
Ann Campion Riley, Associate Director for Access, Collections and Technical Services, University of Missouri Library
Sarah Shreeves, Coordinator, Illinois Digital Environment for Access to Learning and Scholarship (IDEALS), University Library. University of Illinois at Urbana-Champaign
Michael Witt, Head, Distributed Data Curation Center, Purdue University
"Stuff and data: challenges for research data management in the visual arts" ...ARLGSW
During the workshop we will explore the work of the Jisc funded KAPTUR and AHRC funded VADS4R projects over the last three years. This has focused on seeking to enhance research data management practice within in the visual arts. In particular we will focus on the specific disciplinary challenges, how these have been addressed, and reflect upon the lessons learned and work still needed to be undertaken. The workshop will be interactive, enabling participants to investigate the nature of research data and the curatorial challenges it presents in the visual arts.
Research Objects for improved sharing and reproducibilityOscar Corcho
Presentation about the usage of Research Objects to improve scientific experiment sharing and reproducibility, given at the Dagstuhl Perspective Workshop on the intersection between Computer Sciences and Psychology (July 2015)
Description of the Virtual Observatory architecture, and IVOA documents that make it explicit. Part of the virtual observatory course by Juan de Dios Santander Vela, as imparted for the MTAF (Métodos y Técnicas Avanzadas en Física, Advanced Methods and Techniques in Physics) Master at the University of Granada (UGR).
The selection of guide stars for giant telescopes using Virtual Observatory t...Hector Quintero Casanova
Presentation for the preliminary report on how to go about searching suitable guide stars for giant telescopes such as the E-ELT (at the time the E-ELT project still had not been given the go-ahead) using the star catalogues available on the Virtual Observatory.
Los IPython Notebooks nos han proporcionado una sustancial mejora en la documentación del scripts, así como su inspección y una mayor re-utilización. Los IPython Notebooks también permiten acceder a distintos lenguajes de programación (Fortran, IDL, R, Shell,..) en un mismo script, lo que unido a su modo de acceso Web les hace ser un elemento ideal para el trabajo colaborativo (multi-lenguaje, multi-usuario, multi-plataforma, etc..) Os contaré qué tipo de cosas pueden hacerse con IPython Notebooks, desde desarrollo colaborativo de código multi-lenguaje, pasando por la reutilización de tutoriales, visualización interactiva de resultados, hasta la distribución de código más modular, y la publicación final de un experimento digital verificable y reproducible: el preámbulo de los papers ejecutables.
Presentation given at CERN Workshop on Innovations in Scholarly Communication (OAI7) on 22nd June 2011
http://indico.cern.ch/conferenceDisplay.py?confId=103325
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...Susanna-Assunta Sansone
Part of the SciDataCon14 workshop on "Data Papers and their applications" run by myself and Brian Hole to help attendees understand current data-publishing journals and trends and help them understand the editorial processes on NPG's Scientific Data and Ubiquity's Open Health Data.
Where are we going and how are we going to get there?David De Roure
Keynote from JISC Projects start-up meeting
Information Environment 2009-11 & Virtual Research Environment http://www.jisc.ac.uk/whatwedo/programmes/inf11/inf11startup.aspx
ORCID identifiers in repositories
The ORCID identifier has been incorporated into numerous repository platforms. This session will offer a discussion of integration points, policy issues, data flow between systems, researcher participation, discovered opportunities, and demonstrations by universities, research organizations, and vendors.
Moderator: Salvatore Mele, Head of Open Access at CERN
Presenters:
Robin Haw, Scientific Associate and Reactome Outreach Coordinator, Department of Informatics and Bio-computing, OICR
Rick Johnson, Co-Program Director, Digital Library Initiatives and Scholarship E-Research and Digital Initiatives, Notre Dame University
Ann Campion Riley, Associate Director for Access, Collections and Technical Services, University of Missouri Library
Sarah Shreeves, Coordinator, Illinois Digital Environment for Access to Learning and Scholarship (IDEALS), University Library. University of Illinois at Urbana-Champaign
Michael Witt, Head, Distributed Data Curation Center, Purdue University
"Stuff and data: challenges for research data management in the visual arts" ...ARLGSW
During the workshop we will explore the work of the Jisc funded KAPTUR and AHRC funded VADS4R projects over the last three years. This has focused on seeking to enhance research data management practice within in the visual arts. In particular we will focus on the specific disciplinary challenges, how these have been addressed, and reflect upon the lessons learned and work still needed to be undertaken. The workshop will be interactive, enabling participants to investigate the nature of research data and the curatorial challenges it presents in the visual arts.
Research Objects for improved sharing and reproducibilityOscar Corcho
Presentation about the usage of Research Objects to improve scientific experiment sharing and reproducibility, given at the Dagstuhl Perspective Workshop on the intersection between Computer Sciences and Psychology (July 2015)
An short introduction to the PRIME (Publisher, Repository and Institutional Metadata Exchange) project, by Brian Hole, at the JISC Managing Research Data programme launch workshop in Nottingham, UK, October 25th 2012.
Reproducibility in human cognitive neuroimaging: a community-driven data sha...Nolan Nichols
Access to primary data and the provenance of derived data are increasingly recognized as an essential aspect of reproducibility in biomedical research. While productive data sharing has become the norm in some biomedical communities, human brain imaging has lagged in open data and descriptions of provenance. The overarching goal of my dissertation was to identify barriers to neuroimaging data sharing and to develop a fundamentally new, granular data exchange standard that incorporates provenance as a primitive to document cognitive neuroimaging workflow.
For my dissertation research, I led the development of the Neuroimaging Data Model (NIDM), an extension to the W3C PROV standard for the domain of human brain imaging. NIDM provides a language to communicate provenance by representing primary data, computational workflow, and derived data as bundles of linked Agents, Activities, and Entities. Similar to the way a sentence conveys a standalone thought, a bundle contains provenance statements that parsimoniously express the way a given piece of data was produced. To demonstrate a system that implements NIDM, I developed a modern, semantic Web application platform that provides neuroimaging workflow as a service and captures provenance statements as NIDM bundles. The course of this work necessitated interaction with an international community, which adopted and extended central elements of this work into prevailing brain imaging software. My dissertation contributes neuroinformatics standards to advance the current state of computational infrastructure available to the cognitive neuroimaging community.
Slides from the first meeting of the project group PUSHPIN at the University of Paderborn. I focus on the general focus of the project group and the topics for the seminar phase.
This presentation was given at the CLIR/DLF Postdoctoral Fellowship Summer Seminar at Bryn Mawr college in Pennsylvania on July 29th 2014. The intention was to communicate what we are doing in the fields of text and data mining in the domain of chemistry and specifically around mining the RSC archive publication and chemistry dissertations and theses. How would these experiences map over to the humanities?
Jupyter notebooks have arrived to stay as a means to document the scientific analysis protocol, as well as to provide executable recipes shared seamlessly among the community. This has triggered the rise of a plethora of complementary tools and services associated to them. This talk will cover different possibilities to use Jupyter notebooks and JupyterLab interface. We will start with the description of their basic functionalities, as well as functionality extensions not widely known by the community. We will describe how to take advantage of their cross-language capabilities to enhance collaborative work, and also use them as complementary assets in the paper publication process to provide reproducibility of the results. Other aspects on how to deal with modularity and scalability of long complex notebooks will be covered, and we will see several platforms for rendering and execution others then the browser and the local desktop. We will finish on how they are actually being used together with Docker and Binder as part of the versioned executable documentation of a project like Gammapy.
Astronomy is a collaborative science, but it has also become highly specialized, as many other disciplines. Improvement of sharing, discovery and access to resources will enable astronomers to greatly benefit from each other’s highly specialized knowhow. Some initiatives led by scientists and publishers, complement traditional paper publishing with assets published in more interactive digital formats. Among the main goals of these efforts are improving the reproducibility and clarity of the scientific outcome, going beyond the static PDF file, and fostering re-use, which turns into a more efficient exploitation of available digital resources.
The science performed in Astronomy is digital science, from observing proposals to final publication, including data and software used: each of the elements and actions involved in the scientific output could be recorded in electronic form.
This fact does not prevent the final outcome of an experiment is still difficult to reproduce. An exhaustive process of documentation can be long, tedious, where access to all the resources must be granted, and after all, the repeatability of results is not even guaranteed. At the same time, we have access to a wealth of files, observational data and publications that could be used more efficiently with a better visibility of the scientific production, avoiding duplication of effort and reinvention.
Digital Science: Reproducibility and Visibility in AstronomyJose Enrique Ruiz
The science done in Astronomy is digital science, from observing proposals to final publication, to data and software used: each of the elements and actions involved in scientific output could be recorded in electronic form. This fact does not prevent the final outcome of an experiment is still difficult to reproduce. This procedure can be long, tedious, not easily accessible or understandable, even to the author. At the same time, we have a rich infrastructure of files, observational data and publications. This could be used more efficiently if we reach greater visibility of the scientific production, which avoids duplication of effort and reinvention.
Reproducibility is a cornerstone in scientific method, and extraction of relevant information in the current and future data flood is key in Astronomy. The AMIGA group (Analysis of the interstellar Medium of Isolated GAlaxies, IAA-CSIC, http://amiga.iaa.es) faces these two challenges in the European project "Wf4Ever: Advanced technologies for enhanced preservation workflow Science" to enable the preservation of the methodology in scalable semantic repositories to facilitate their discovery, access, inspection, exploitation and distribution. These repositories store the experiments on "Research Objects" whose main constituents are digital scientific workflows. These provide a comprehensive view and clear scientific interpretation of the experiment as well as the automation of the method, going beyond the usual pipelines that normally end up in data processing.
The quantitative leap in volume and complexity of the next generation of archives will need analysis and data mining tasks to live closer to the data, in computing and distributed storage environments, but they should also be modular enough to allow customization from scientists and be easily accessible to foster their dissemination among the community. Astronomy is a collaborative science, but it has also become highly specialized, as many other disciplines. Sharing, preservation, discovery and a much simplified access to resources in the composition of scientific workflows will enable astronomers to greatly benefit from each other’s highly specialized knowhow, they constitute a way to push Astronomy to share and publish not only results and data, but also processes and methodologies.
We will show how the use of scientific workflows can help to improve the reproducibility of the experiment and a more efficient exploitation of astronomical archives, as well as the visibility of the scientific methodology and its reuse.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Generating a custom Ruby SDK for your web service or Rails API using Smithyg2nightmarescribd
Have you ever wanted a Ruby client API to communicate with your web service? Smithy is a protocol-agnostic language for defining services and SDKs. Smithy Ruby is an implementation of Smithy that generates a Ruby SDK using a Smithy model. In this talk, we will explore Smithy and Smithy Ruby to learn how to generate custom feature-rich SDKs that can communicate with any web service, such as a Rails JSON API.
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Collaborative Digital Experiments
1. Grant agreement no.: 27092
Mashing-up Science !
Collaborative Digital Experiments!
Jose Enrique Ruiz - @bultako!
IAA-CSIC!
!
April 4th 2011!
2011 .Astronomy Conference - Oxford!
2. Collaborative digital experiments!
Summary!
• Astronomy research lifecycle!
• Research objects: the ingredients!
• Scientific workflows: the cooking recipes!
• Wf4Ever: preserving knowledge!
2
3. Astronomy research lifecycle!
Proposals Submission!
• Funding!
• Observing time !
• Description of the experiment!
• Precedent related studies!
• Problems to solve and strategy to follow!
• Requirements and expected results!
• The digital sky vs. Observing time proposals
3
4. Astronomy research lifecycle!
Observation and data reduction!
• Ancillary data!
• Air masses!
• Exposure time!
• Meteorological conditions!
• Instrumental signatures!
• Observed raw datasets!
• Science-ready datasets!
• Observational programs !
• Pipelines for automated data reduction
4
5. Astronomy research lifecycle!
Analysis of the data!
• Specific computing environments!
• Specific local interactive software!
• Commercial packages (IDL)!
• Grid and clusters for simulations!
• High level programming languages!
• Fortran, C++!
• Python recipes community
• Use of web archives for well known properties of objects !
• Vizier, Simbad, NED!
• SaaS approach slowly coming with the Virtual Observatory!
5
6. Astronomy research lifecycle!
Publishing!
• Electronic PDF files!
• Provenance concerning the Analysis!
• Data Results are hidden behind the plots!
• Most of them are public!
• Interlinking provided!
• Astronomical objects!
• Catalogs and missions!
• Related publications!
• Citations, datasets, proposals..!
6
7. Astronomy research lifecycle!
Astronomy research is entirely digital !
Time has come to go “Beyond the PDF”!
• Methodology “in action”!
• All data exposed!
• Indexed experiments!
• Reproducible!
• Repeatable!
• Participatory!
• Formative!
• Collaborative!
• Cross-boundary!
7
8. Research Objects: the ingredients!
Research Object!
A digital entity capturing all the components needed for the
execution of a digital experiment and also the results produced,
describing and characterizing the overall experiment, every
single one of its components and the links existing among
them. !
8
9. Research Objects: the ingredients!
The R’s Dimensions !
• Repeatable!
• Reproducible!
• Replayable!
• Refreshable!
• Reusable!
• Reliable!
• Retrievable!
• Roll-backable!
• Referenceable!
• Research cross-boundary!
!
Towards exchange and reuse of digital knowledge!
Bechhofer, S., De Roure, D., Gamble, M., Goble, C. and Buchan, I. (2010)! 9
10. Research Objects: the ingredients!
Research Objects in Astronomy!
• Metadata characterization!
• Description of the experiment!
• Related bibliography!
• Ancillary and raw data!
• Reduced science-ready data!
• Digital environment needed !
• Scripting and software used!
• Links to web archives!
• Final data products!
• Scientific discussion !
10
11. Scientific workflows: the cooking recipes!
Scientific Workflows!
!
The combination of data and processes into a configurable,
structured set of steps that implement semi-automated
computational solutions in scientific problem-solving.!
!
A digital recipe which can itself be cooked in order to produce
repeatable results.!
!
!
11
12. Scientific workflows: the cooking recipes!
Scientific Workflows!
!
• Enable automation!
• Make science reproducible!
• Sometimes repeatable !
• Encourage best practices!
• Modular nature allows reuse!
• Exposes the scientific method!
• Formative!
• Scientist friendly!
!
!
!
12
!
13. Scientific workflows: the cooking recipes!
The oven!
A workflow Management System!
!
• Scalable suite of tools!
• Workflow design!
• Workflow execution!
• Workbench!
• Server implemented!
• Domain independent!
• Access to remote resources!
• Open source!
!
!
! 13
14. Scientific workflows: the cooking recipes!
The recipes store!
!
• Find workflows!
• Share workflows and files!
• Find people!
• Build communities!
• Publish packages!
• Tag workflows!
• Score and rate workflows!
• Comment on workflows!
• Write reviews!
!
!
14
22. Wf4Ever: preserving knowledge!
Wf4Ever goals!
!
Preservation of Research Objects!
• Data!
• Resources!
• Workflows!
!
Workflow preservation is complex!
• Interpreted through their execution!
• Complex models are required to describe them!
• Provenance is a complex issue in a cloud of services!
• Need of Web Semantics, Ontologies, Linked Data, RDF, etc..!
• Resources are often beyond control of scientists!
22
23. Wf4Ever: preserving knowledge!
Wf4Ever goals!
!
Creation of workflows and research objects for astronomical
digital experiments!
!
• Data archives!
• Web services!
• Virtual Observatory standards!
• Ubiquitous storing and computing!
• Python based community!
• Interlinked digital libraries!
!
23
24. Wf4Ever: preserving knowledge!
Wf4Ever goals!
Build an infrastructure for Research Objects management which
stimulates the development of new scientific knowledge via
collaborative work!
• Creation !
• Archival!
• Classification!
• Indexing!
• Retrieval!
• Community reuse!
• Rating, scoring and annotations!
• Scalable in semantic repositories!
!
24
!
25. Wf4Ever: preserving knowledge!
Users roles!
!
Reader!
Skims titles and abstracts of published research objects !
Comparator!
Looking for similar research objects to those he’s working with at present!
Re-user!
Extract and replace modules from the workflow and use it for his own purpose!
Publisher!
Wants to the community to check his digital experiment!
Evaluator!
He is allowed to evaluate, comment and rate a specific research object !
!
25
27. Wf4Ever: preserving knowledge!
The team!
1. Intelligent Software Components (ISOCO, Spain)!
2. University of Manchester (UNIMAN, UK)!
2 7 3. Universidad Politécnica de Madrid (UPM, Spain)!
5! 4! 4. Poznan Supercomputing and Networking Centre
(PSNC, Poland)!
5. Universisty of Oxford (OXF, UK)!
6. Instituto de Astrofísica de Andalucía (IAA, Spain)!
1! 3!
7. Leiden University Medical Centre (LUMC, NL)!
6!
27
28. Thanks for your attention !!
http://www.wf4ever-project.org!
28