Galaxy is a scientific workflow, data integration, and data and analysis persistence and publishing platform that aims to make computational biology accessible to research scientists that do not have computer programming or systems administration experience. AGINFRA+ VREs include access to a Galaxy deployment where researchers can make use of the usual Galaxy features and also interact with the DataMiner platform to run the available algorithms.
The webinar introduced Galaxy, the available deployment for AGINFRA+ VREs and the DataMiner integration.
Τhe webinar was described by Enol Fernández from EGI Advanced Computing Services for Research.
Post Quantum Cryptography – The Impact on Identity
Webinar on Galaxy & Galaxy integration with the Dataminer Service in the context of AGINFRA+ Project
1. WWW.PLUS.AGINFRA.EU
AGINFRA PLUS - Accelerating user-driven e-infrastructure innovation
in Food & Agriculture has received funding from the European Union’s
Horizon 2020 research and innovation programme under grant
agreement No 731001.
Galaxy for AGINFRA+ VREs
Online webinar | January 2019
Enol Fernández
EGI Foundation
3. WWW.PLUS.AGINFRA.EU
Demonstrate how scientific communities working
on agriculture and food topics may carry out rapid
and intuitive development and deployment of
innovative applications and workflows, powered
by open e-infrastructures.
Strengthen and illustrate the value and potential
of AGINFRA+ as a virtual research environment
for the domain of agriculture and food.
The AGINFRA+ Vision
Galaxy for AGINFRA+ VREs 3
4. WWW.PLUS.AGINFRA.EU
Enol Fernández
Cloud Architect @ EGI Foundation
Working since 2003 on computing e-infrastructures
(grid, cloud)
Support to user communities
EGI Cloud Federation Coordinator
Development of new services for EGI
Galaxy for AGINFRA+ VREs 4
/usr/bin/whoami
7. WWW.PLUS.AGINFRA.EU
Web-based platform for computational
biomedical research (analysis and data
integration)
Open source, community driven software that
makes integrating your biomed tools simple
Galaxy for AGINFRA+ VREs 7
8. WWW.PLUS.AGINFRA.EU
Galaxy is "an open, web-based platform for performing
accessible, reproducible, and transparent genomic
science.”
Accessibility
Users without programming experience can easily upload/retrieve
data, run complex tools and workflows, and visualize data
Reproducibility
Galaxy captures information so that any user can understand and
repeat a complete computational analysis
Transparency
Users can share or publish their analyses (histories, workflows,
visualizations)
Galaxy for AGINFRA+ VREs 8
Galaxy: core values
10. WWW.PLUS.AGINFRA.EU
Tools manipulate and transform your data
Each tool is defined by:
input datasets, parameters, commands, and outputs
help, tests, citations, dependency requirements
New versions can be installed without removing
old ones to ensure reproducibility
Galaxy Tool Shed: application store with +1000
tolos ready to be used
Galaxy for AGINFRA+ VREs 10
Tools
12. WWW.PLUS.AGINFRA.EU
Every action and dataset is tracked on the history
Datasets produced by tools
Operations performed on data
Includes:
name, format, size, creation time, datatype-specific
metadata
tool id, version, inputs, parameters
standard output (stdout) and error (stderr)
state (waiting, running, success, failed)
hidden, deleted, purged
Galaxy for AGINFRA+ VREs 12
History
14. WWW.PLUS.AGINFRA.EU
Series of tools and dataset actions that run in
sequence as a batch operation
From history, from scratch with visual editor or import
Allow to re-run the same analysis on different input
data sets
Change parameters before re-running a similar analysis
Workflows can be annotated, viewed, shared,
published, and imported - just like other Galaxy
objects.
Galaxy for AGINFRA+ VREs 14
Workflows
17. WWW.PLUS.AGINFRA.EU
Workflow design and execution technologies are
key for the research activities of the use cases
Two platforms selected:
KNIME
Galaxy
Integration with AGINFRA+:
Access the workflow platforms within the VRE
Access other VRE tools form the workflow platforms
Galaxy for AGINFRA+ VREs 17
Workflows on AGINFRA+
18. WWW.PLUS.AGINFRA.EU
Custom wsgi filter for Galaxy
Leverages support for external authentication in
Galaxy
Exposes REMOTE_USER with information from
D4Science
Stores the D4Science token to further interaction with
the VRE
Galaxy for AGINFRA+ VREs 18
Authentication
20. WWW.PLUS.AGINFRA.EUGalaxy for AGINFRA+ VREs 20
DataMiner and Galaxy
Periodic synchronisation of
DataMiner as tools in Galaxy
Execution using WPS calls to
Dataminer
With user token
Use DataMiner algorithm as
any other Galaxy tool
22. WWW.PLUS.AGINFRA.EUGalaxy for AGINFRA+ VREs 22
Some limitations
Input files must be provided as URLs
No easy way to discover the files from the Galaxy
interface
Output is stored as HTML with pointers to the
actual outputs
Can still be easily converted to other more friendly
format as needed
A csv extractor provided to facilitate that case, more
can be created.
25. WWW.PLUS.AGINFRA.EU
Galaxy available now for AGINFRA+ VREs
Go and test!
Start collecting your feedback:
Add new tools as needed by use cases
Improve the integration with DataMiner (input and
output files)
Expand deployment as usage grows
Galaxy for AGINFRA+ VREs 25
Next steps