www.elixir-europe.org
BioContainers: From tools to services
Challenges and opportunities
BioContainers Community
github: github.com/Biocontainers
twitter: @BioContainers
http://biocontainers.pro | @biocontainers | github.com/biocontainers
ELIXIR All Hands June, 2019
Outline
I. Introduction to BioContainers
II. Results of the ELIXIR Implementation Study (2018-2019)
III. Bioinformatics Services to Cloud
Questions: https://www.sli.do (code: Y726)
What is BioContainers
Implementation of Guidelines:
• Versioning
• Testing
• Dependency management
• Documentation
[Figure labels: Software, Publication, Research, My Data, Container Publication, Results; container types: mulled, dockerfile]
BioContainers / BioConda Architecture
Leprevost, F., Grüning, ... & Perez-Riverol Y. (2017). Bioinformatics, 33(16), 2580-2582.
Grüning B, Dale R, ... BioConda Team. Nat Methods. 2018 Jul;15(7):475-476.
Findability
# Base Image
FROM biocontainers/biocontainers:latest
# Metadata
LABEL base.image="biocontainers:latest"
LABEL version="3"
LABEL software="Comet"
LABEL software.version="2016012"
LABEL description="an open source tandem mass spectrometry sequence database search tool"
LABEL website="http://comet-ms.sourceforge.net/"
LABEL documentation="http://comet-ms.sourceforge.net/parameters/parameters_2016010/"
LABEL license="http://comet-ms.sourceforge.net/"
LABEL tags="Proteomics"
# Maintainer
MAINTAINER Felipe da Veiga Leprevost <felipe@leprevost.com.br>
USER biodocker
…..
• Metadata should link to other relevant software registries, such as bio.tools, PubMed or OMICtools.
• Versions should be defined in a standard format, both for the container and for the software inside it.
• Duplicated software should be removed between mulled and Dockerfile containers.
• Metadata of BioConda packages and Dockerfile recipes should be the same.
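One way to check recipe metadata automatically is to read the LABEL lines out of a Dockerfile. The following is a minimal sketch, not the project's own tooling: the function names and the set of "required" keys are assumptions made for illustration, and only simple single-pair `LABEL key="value"` lines are handled.

```python
import re

# Matches one `LABEL key="value"` instruction; multi-pair LABEL lines
# and line continuations are deliberately out of scope for this sketch.
LABEL_RE = re.compile(r'^LABEL\s+([\w.\-]+)="(.*)"\s*$')

# Keys taken from the recipe on this slide; treating them as required
# is an assumption made for illustration.
REQUIRED_KEYS = ("software", "software.version", "version", "description", "license")


def parse_labels(dockerfile_text):
    """Return a dict of label name -> value for simple LABEL lines."""
    labels = {}
    for line in dockerfile_text.splitlines():
        match = LABEL_RE.match(line.strip())
        if match:
            labels[match.group(1)] = match.group(2)
    return labels


def missing_keys(labels, required=REQUIRED_KEYS):
    """List the required metadata keys that the recipe does not define."""
    return [key for key in required if key not in labels]


# Shortened copy of the Comet recipe, used only as sample input.
recipe = '''\
FROM biocontainers/biocontainers:latest
LABEL version="3"
LABEL software="Comet"
LABEL software.version="2016012"
LABEL tags="Proteomics"
'''

labels = parse_labels(recipe)
print(labels["software"], labels["software.version"])
print(missing_keys(labels))
```

A check like this could run in continuous integration so that recipes lacking agreed metadata are flagged before a container is published.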
Implementation Study (2018-2019)
Development of an architecture for software containers at ELIXIR and its use by the EXCELERATE use-case communities.
1. Develop the Bioinformatics Containers Central Service (BCCS) to support the use of software containers in ELIXIR.
2. BioContainers integration with bio.tools and OpenEBench.
3. BioContainers Registry 2.0 and command-line tool.
4. BioContainers for training and support.
5. Demonstration of the use of software containers in selected ELIXIR use cases:
a. Rare Diseases use case: proof-of-concept implementation of a human genomics variant calling pipeline using BioContainers and Galaxy.
b. EGA integration.
Architecture BioContainers 2.0
1. Continuous integration for Dockerfile containers:
https://jenkins.biocontainers.pro/jenkins/
2. Backup of binaries and recipes on ELIXIR Compute:
https://containers.biocontainers.pro/s3/
3. Internal ELIXIR registry:
http://containers.biocontainers.pro/containers
4. BioContainers TRS API:
https://api.biocontainers.pro/api/ga4gh/v2/ui/
5. BioContainers Web: https://biocontainers.pro
Planemo - A Scientific Workflow SDK
> planemo run --docker …
> planemo test --docker … workflow[xml.yml]
> planemo mull build-and-test …
> mulled-build-channel --channel bioconda --namespace biocontainers
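Images built from BioConda packages are commonly referenced as `registry/namespace/package:version--build`. The sketch below composes and splits such a reference; the helper names are hypothetical, the `quay.io` registry is an assumption (the slide names only the namespace), and the samtools values are placeholder inputs rather than a real published tag.

```python
# Registry is an assumption; the namespace mirrors the --namespace
# value shown on the slide.
REGISTRY = "quay.io"
NAMESPACE = "biocontainers"


def mulled_image(package, version, build):
    """Compose an image reference: registry/namespace/package:version--build."""
    return f"{REGISTRY}/{NAMESPACE}/{package}:{version}--{build}"


def split_tag(tag):
    """Split a 'version--build' tag into (software version, build string)."""
    version, _, build = tag.partition("--")
    return version, build


# Placeholder package, version and build values for illustration.
image = mulled_image("samtools", "1.9", "0")
print(image)
print(split_tag(image.rsplit(":", 1)[1]))
```

Keeping the software version and the build string separate in the tag is one way to version the container independently of the software inside it.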
Training containers based on Galaxy VRE with:
● tools (versioned)
● workflows (versioned)
● interactive tours
● example data
● linked to and from hands-on material
● reference data
BioContainers TRS API
Code (Flask, MongoDB, Python): https://github.com/BioContainers/biocontainers-backend
https://api.biocontainers.pro/api/ga4gh/v2/ui/
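A client would typically address the registry through the GA4GH Tool Registry Service (TRS) v2 paths `/tools`, `/tools/{id}` and `/tools/{id}/versions`. The sketch below only builds those URLs and makes no network call. The API root is an assumption, derived by dropping `/ui/` from the address above, and the tool id `samtools` is a placeholder.

```python
from urllib.parse import quote

# Assumed API root: the Swagger UI address on the slide with the trailing
# "/ui/" removed. Verify against the live service before relying on it.
TRS_BASE = "https://api.biocontainers.pro/api/ga4gh/v2"


def tools_url(base=TRS_BASE):
    """URL that lists the tools in the registry."""
    return f"{base}/tools"


def tool_url(tool_id, base=TRS_BASE):
    """URL for one tool; the id is percent-encoded so it stays one path segment."""
    return f"{base}/tools/{quote(tool_id, safe='')}"


def tool_versions_url(tool_id, base=TRS_BASE):
    """URL that lists the versions of one tool."""
    return f"{tool_url(tool_id, base)}/versions"


# "samtools" is a placeholder tool id for illustration.
print(tool_versions_url("samtools"))
```

The resulting strings can be passed to any HTTP client; responses are JSON documents as defined by the TRS specification.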
BioContainers Statistics and Usage
Pipelines and tools using BioContainers:
● Galaxy: ~1500
● CWL: ~300
● Shell: ~200
● WDL: 60
● Python: 50
● Nextflow: 50
Selected Publications (2018-2019)
https://scholar.google.co.uk/citations?user=O6xG2-EAAAAJ&hl=en
1. da Veiga Leprevost F, Grüning B, et al. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192.
2. Grüning B, Dale R, et al. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7.
3. Grüning B, Sallou O, Moreno P, et al. F1000Research. 2019;7. doi: 10.12688/f1000research.15140.2.
4. Moreno P, et al. bioRxiv. 2019. doi: 10.1101/488643v2.
5. Perez-Riverol Y, et al. bioRxiv. 2019. doi: 10.1101/604413v1.
Moving services to Cloud
1. Services are not tools
• Services need to be redefined or reimplemented to be cloud-compatible.
• Services need a high level of orchestration: most comprise a database, APIs, a web front end, etc.
2. Data should be open and transferable from the source to the deployed cloud instances.
3. A public list of important or interesting services for the community and industry should be collected.
4. Funding from industry and public sources is needed to help service teams move to the Cloud.
https://elixir-europe.org/services/tag/data-resources
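The orchestration point can be illustrated with a start-order calculation: unlike a single tool, a service has components that must come up in dependency order. This is a minimal sketch using Python's standard `graphlib` module (Python 3.9+); the three-component layout is hypothetical and does not describe any specific ELIXIR service.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def start_order(depends_on):
    """Return component names ordered so that dependencies start first."""
    return list(TopologicalSorter(depends_on).static_order())


# Hypothetical service layout: the web front end needs the API,
# and the API needs the database.
service = {"db": set(), "api": {"db"}, "web": {"api"}}
print(start_order(service))
```

An orchestrator applies the same idea at larger scale, adding health checks, restarts and scaling on top of the ordering.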
First efforts
1. The BioContainers architecture is fully deployable in the Cloud using Kubernetes Helm charts
(https://github.com/BioContainers/biocontainers-backend).
2. Other projects moving in the same direction:
- Omics Discovery Index (www.omicsdi.org)
- Identifiers.org
- Reactome (https://reactome.org/)
Thanks!!!
and 700+ contributors

More Related Content

Similar to Biocontainers 2019: Presentation for the ELIXIR All Hands

AGINFRA+ on EOSCHubWeek
AGINFRA+ on EOSCHubWeekAGINFRA+ on EOSCHubWeek
AGINFRA+ on EOSCHubWeek
AGINFRA
 
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
bioejjournal
 
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
bioejjournal
 
5th International Conference on Bioscience & Engineering (BIO 2023)
5th International Conference on Bioscience & Engineering (BIO 2023)5th International Conference on Bioscience & Engineering (BIO 2023)
5th International Conference on Bioscience & Engineering (BIO 2023)
hiij
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...
GigaScience, BGI Hong Kong
 
FYP report
FYP reportFYP report
FYP report
Chong Yee Gan
 
ENE3 - Overview of nutrient related Horizon 2020 and Life projects in Europe ...
ENE3 - Overview of nutrient related Horizon 2020 and Life projects in Europe ...ENE3 - Overview of nutrient related Horizon 2020 and Life projects in Europe ...
ENE3 - Overview of nutrient related Horizon 2020 and Life projects in Europe ...
European Sustainable Phosphorus Platform
 
What Bio.Kitchen
What Bio.KitchenWhat Bio.Kitchen
What Bio.Kitchen
ssuser432659
 
openBIO
openBIOopenBIO
openBIO
ssuser432659
 
4th International Conference on Bioscience & Engineering (BIEN 2020)
4th International Conference on Bioscience & Engineering (BIEN 2020) 4th International Conference on Bioscience & Engineering (BIEN 2020)
4th International Conference on Bioscience & Engineering (BIEN 2020)
ecij
 
Scott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteScott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByte
GigaScience, BGI Hong Kong
 
Use of IoT technologies for irrigation and plant protection: the case for Cyp...
Use of IoT technologies for irrigation and plant protection: the case for Cyp...Use of IoT technologies for irrigation and plant protection: the case for Cyp...
Use of IoT technologies for irrigation and plant protection: the case for Cyp...
Agriculural Research Institute
 
The BlueBRIDGE Project - Pasquale Pagano
The BlueBRIDGE Project - Pasquale PaganoThe BlueBRIDGE Project - Pasquale Pagano
The BlueBRIDGE Project - Pasquale Pagano
Blue BRIDGE
 
Integrating Covid-19 Bioassays in the Open Research Knowledge Graph
Integrating Covid-19 Bioassays in the Open Research Knowledge GraphIntegrating Covid-19 Bioassays in the Open Research Knowledge Graph
Integrating Covid-19 Bioassays in the Open Research Knowledge Graph
Jennifer D'Souza
 
Bioscience & Engineering: An International Journal (BIOEJ)
Bioscience & Engineering: An International Journal (BIOEJ)Bioscience & Engineering: An International Journal (BIOEJ)
Bioscience & Engineering: An International Journal (BIOEJ)
bioejjournal
 
Biology Organisms Level 2 - Edukite
Biology Organisms Level 2 - EdukiteBiology Organisms Level 2 - Edukite
Biology Organisms Level 2 - Edukite
EduKite
 
Do Developers Update
 Third-Party Libraries in Mobile Apps?
Do Developers Update
 Third-Party Libraries in Mobile Apps?Do Developers Update
 Third-Party Libraries in Mobile Apps?
Do Developers Update
 Third-Party Libraries in Mobile Apps?
Pasquale Salza
 
Techchallenge Covid-19 & Bio.Game
Techchallenge Covid-19 & Bio.GameTechchallenge Covid-19 & Bio.Game
Techchallenge Covid-19 & Bio.Game
ssuser432659
 
Coopbox experience in bio-based packaging: From PLA to the GLOPACK project
Coopbox experience in bio-based packaging: From PLA to the GLOPACK projectCoopbox experience in bio-based packaging: From PLA to the GLOPACK project
Coopbox experience in bio-based packaging: From PLA to the GLOPACK project
GLOPACK project
 
CALL FOR PAPERS - 5th International Conference on Bioscience & Engineering (B...
CALL FOR PAPERS - 5th International Conference on Bioscience & Engineering (B...CALL FOR PAPERS - 5th International Conference on Bioscience & Engineering (B...
CALL FOR PAPERS - 5th International Conference on Bioscience & Engineering (B...
bioejjournal
 

Similar to Biocontainers 2019: Presentation for the ELIXIR All Hands (20)

AGINFRA+ on EOSCHubWeek
AGINFRA+ on EOSCHubWeekAGINFRA+ on EOSCHubWeek
AGINFRA+ on EOSCHubWeek
 
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
 
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
6thInternational Conference on Bioscience & Engineering(BIOENG 2022)
 
5th International Conference on Bioscience & Engineering (BIO 2023)
5th International Conference on Bioscience & Engineering (BIO 2023)5th International Conference on Bioscience & Engineering (BIO 2023)
5th International Conference on Bioscience & Engineering (BIO 2023)
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...
 
FYP report
FYP reportFYP report
FYP report
 
ENE3 - Overview of nutrient related Horizon 2020 and Life projects in Europe ...
ENE3 - Overview of nutrient related Horizon 2020 and Life projects in Europe ...ENE3 - Overview of nutrient related Horizon 2020 and Life projects in Europe ...
ENE3 - Overview of nutrient related Horizon 2020 and Life projects in Europe ...
 
What Bio.Kitchen
What Bio.KitchenWhat Bio.Kitchen
What Bio.Kitchen
 
openBIO
openBIOopenBIO
openBIO
 
4th International Conference on Bioscience & Engineering (BIEN 2020)
4th International Conference on Bioscience & Engineering (BIEN 2020) 4th International Conference on Bioscience & Engineering (BIEN 2020)
4th International Conference on Bioscience & Engineering (BIEN 2020)
 
Scott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteScott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByte
 
Use of IoT technologies for irrigation and plant protection: the case for Cyp...
Use of IoT technologies for irrigation and plant protection: the case for Cyp...Use of IoT technologies for irrigation and plant protection: the case for Cyp...
Use of IoT technologies for irrigation and plant protection: the case for Cyp...
 
The BlueBRIDGE Project - Pasquale Pagano
The BlueBRIDGE Project - Pasquale PaganoThe BlueBRIDGE Project - Pasquale Pagano
The BlueBRIDGE Project - Pasquale Pagano
 
Integrating Covid-19 Bioassays in the Open Research Knowledge Graph
Integrating Covid-19 Bioassays in the Open Research Knowledge GraphIntegrating Covid-19 Bioassays in the Open Research Knowledge Graph
Integrating Covid-19 Bioassays in the Open Research Knowledge Graph
 
Bioscience & Engineering: An International Journal (BIOEJ)
Bioscience & Engineering: An International Journal (BIOEJ)Bioscience & Engineering: An International Journal (BIOEJ)
Bioscience & Engineering: An International Journal (BIOEJ)
 
Biology Organisms Level 2 - Edukite
Biology Organisms Level 2 - EdukiteBiology Organisms Level 2 - Edukite
Biology Organisms Level 2 - Edukite
 
Do Developers Update
 Third-Party Libraries in Mobile Apps?
Do Developers Update
 Third-Party Libraries in Mobile Apps?Do Developers Update
 Third-Party Libraries in Mobile Apps?
Do Developers Update
 Third-Party Libraries in Mobile Apps?
 
Techchallenge Covid-19 & Bio.Game
Techchallenge Covid-19 & Bio.GameTechchallenge Covid-19 & Bio.Game
Techchallenge Covid-19 & Bio.Game
 
Coopbox experience in bio-based packaging: From PLA to the GLOPACK project
Coopbox experience in bio-based packaging: From PLA to the GLOPACK projectCoopbox experience in bio-based packaging: From PLA to the GLOPACK project
Coopbox experience in bio-based packaging: From PLA to the GLOPACK project
 
CALL FOR PAPERS - 5th International Conference on Bioscience & Engineering (B...
CALL FOR PAPERS - 5th International Conference on Bioscience & Engineering (B...CALL FOR PAPERS - 5th International Conference on Bioscience & Engineering (B...
CALL FOR PAPERS - 5th International Conference on Bioscience & Engineering (B...
 

More from Yasset Perez-Riverol

Introduction to Proteogenomics
Introduction to Proteogenomics Introduction to Proteogenomics
Introduction to Proteogenomics
Yasset Perez-Riverol
 
Mapping millions of peptidoforms to Genome Coordinates
Mapping millions of peptidoforms to Genome CoordinatesMapping millions of peptidoforms to Genome Coordinates
Mapping millions of peptidoforms to Genome Coordinates
Yasset Perez-Riverol
 
Systematic integration of millions of peptidoform evidences into Ensembl and ...
Systematic integration of millions of peptidoform evidences into Ensembl and ...Systematic integration of millions of peptidoform evidences into Ensembl and ...
Systematic integration of millions of peptidoform evidences into Ensembl and ...
Yasset Perez-Riverol
 
BioContainers on ELIXIR All Hands 2017
BioContainers on ELIXIR All Hands 2017BioContainers on ELIXIR All Hands 2017
BioContainers on ELIXIR All Hands 2017
Yasset Perez-Riverol
 
OpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scaleOpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scale
Yasset Perez-Riverol
 
Do we need to make public our proteomics data?
Do we need to make public our proteomics data?Do we need to make public our proteomics data?
Do we need to make public our proteomics data?
Yasset Perez-Riverol
 
Design of an hexapeptide database for proteomics studies
Design of an hexapeptide database for proteomics studiesDesign of an hexapeptide database for proteomics studies
Design of an hexapeptide database for proteomics studies
Yasset Perez-Riverol
 
Parallel conformational search of small molecules
Parallel conformational search of small moleculesParallel conformational search of small molecules
Parallel conformational search of small molecules
Yasset Perez-Riverol
 
PBS Web (Spanish)
PBS Web (Spanish)PBS Web (Spanish)
PBS Web (Spanish)
Yasset Perez-Riverol
 
Standarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata filesStandarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata files
Yasset Perez-Riverol
 
PRIDE and ProteomeXchange – Making proteomics data accessible and reusable
PRIDE and ProteomeXchange – Making proteomics data accessible and reusablePRIDE and ProteomeXchange – Making proteomics data accessible and reusable
PRIDE and ProteomeXchange – Making proteomics data accessible and reusable
Yasset Perez-Riverol
 
Yasset perezriverol csi2011
Yasset perezriverol csi2011Yasset perezriverol csi2011
Yasset perezriverol csi2011
Yasset Perez-Riverol
 
Yasset iso point-cigb-2012
Yasset iso point-cigb-2012Yasset iso point-cigb-2012
Yasset iso point-cigb-2012
Yasset Perez-Riverol
 
SintCompound: A Small Compound Database for Virtual Screening
SintCompound: A Small Compound Database for Virtual ScreeningSintCompound: A Small Compound Database for Virtual Screening
SintCompound: A Small Compound Database for Virtual Screening
Yasset Perez-Riverol
 

More from Yasset Perez-Riverol (14)

Introduction to Proteogenomics
Introduction to Proteogenomics Introduction to Proteogenomics
Introduction to Proteogenomics
 
Mapping millions of peptidoforms to Genome Coordinates
Mapping millions of peptidoforms to Genome CoordinatesMapping millions of peptidoforms to Genome Coordinates
Mapping millions of peptidoforms to Genome Coordinates
 
Systematic integration of millions of peptidoform evidences into Ensembl and ...
Systematic integration of millions of peptidoform evidences into Ensembl and ...Systematic integration of millions of peptidoform evidences into Ensembl and ...
Systematic integration of millions of peptidoform evidences into Ensembl and ...
 
BioContainers on ELIXIR All Hands 2017
BioContainers on ELIXIR All Hands 2017BioContainers on ELIXIR All Hands 2017
BioContainers on ELIXIR All Hands 2017
 
OpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scaleOpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scale
 
Do we need to make public our proteomics data?
Do we need to make public our proteomics data?Do we need to make public our proteomics data?
Do we need to make public our proteomics data?
 
Design of an hexapeptide database for proteomics studies
Design of an hexapeptide database for proteomics studiesDesign of an hexapeptide database for proteomics studies
Design of an hexapeptide database for proteomics studies
 
Parallel conformational search of small molecules
Parallel conformational search of small moleculesParallel conformational search of small molecules
Parallel conformational search of small molecules
 
PBS Web (Spanish)
PBS Web (Spanish)PBS Web (Spanish)
PBS Web (Spanish)
 
Standarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata filesStandarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata files
 
PRIDE and ProteomeXchange – Making proteomics data accessible and reusable
PRIDE and ProteomeXchange – Making proteomics data accessible and reusablePRIDE and ProteomeXchange – Making proteomics data accessible and reusable
PRIDE and ProteomeXchange – Making proteomics data accessible and reusable
 
Yasset perezriverol csi2011
Yasset perezriverol csi2011Yasset perezriverol csi2011
Yasset perezriverol csi2011
 
Yasset iso point-cigb-2012
Yasset iso point-cigb-2012Yasset iso point-cigb-2012
Yasset iso point-cigb-2012
 
SintCompound: A Small Compound Database for Virtual Screening
SintCompound: A Small Compound Database for Virtual ScreeningSintCompound: A Small Compound Database for Virtual Screening
SintCompound: A Small Compound Database for Virtual Screening
 

Recently uploaded

Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
MaheshaNanjegowda
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Texas Alliance of Groundwater Districts
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
Leonel Morgado
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
European Sustainable Phosphorus Platform
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
Hitesh Sikarwar
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
Aditi Bajpai
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 

Recently uploaded (20)

Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 

Biocontainers 2019: Presentation for the ELIXIR All Hands

  • 1. www.elixir-europe.org BioContainers: From tools to services Challenges and opportunities BioContainers Community github: github.com/Biocontainers twitter: @BioContainers
  • 2. http://biocontainers.pro | @biocontainers | github.com/biocontainers ELIXIR All Hands June, 2019 BioContainers Community
  • 3. Outline
    I. Introduction to BioContainers
    II. Results of the ELIXIR Implementation Study (2018-2019)
    III. Bioinformatics Services to Cloud
    Questions: https://www.sli.do Code: Y726
  • 4. What is BioContainers
    Implementation of guidelines:
    • Versioning
    • Testing
    • Dependency management
    • Documentation
    [Diagram labels: Software, Publication, Research; My Data, Container, Publication, Results; container types: mulled, dockerfile]
  • 5. BioContainers / BioConda Architecture
    Leprevost, F., Grüning, ... & Perez-Riverol Y. (2017). Bioinformatics, 33(16), 2580-2582.
    Grüning B, Dale R, ... BioConda Team. Nat Methods. 2018 Jul;15(7):475-476.
  • 6. Findability
    # Base Image
    FROM biocontainers/biocontainers:latest
    # Metadata
    LABEL base.image="biocontainers:latest"
    LABEL version="3"
    LABEL software="Comet"
    LABEL software.version="2016012"
    LABEL description="an open source tandem mass spectrometry sequence database search tool"
    LABEL website="http://comet-ms.sourceforge.net/"
    LABEL documentation="http://comet-ms.sourceforge.net/parameters/parameters_2016010/"
    LABEL license="http://comet-ms.sourceforge.net/"
    LABEL tags="Proteomics"
    # Maintainer
    MAINTAINER Felipe da Veiga Leprevost <felipe@leprevost.com.br>
    USER biodocker
    …..
    Metadata should be able to connect to other relevant software registries such as bio.tools, PubMed, or OMICtools. Versions should be defined in a standard format for both the container and the software inside. Software duplicated between mulled and Dockerfile containers should be removed. The metadata of Bioconda packages and Dockerfile recipes should be the same.
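The metadata labels on the Findability slide lend themselves to an automated check. The following is a minimal Python sketch, not BioContainers' own tooling: it extracts single-line `LABEL key="value"` entries from Dockerfile text and reports which expected keys are absent. The `REQUIRED` tuple is a hypothetical list assembled from the labels shown on the slide, and the parser assumes straight double quotes and one label per line.

```python
import re

# Matches single-line entries of the form: LABEL some.key="some value"
LABEL_RE = re.compile(r'^LABEL\s+([\w.\-]+)\s*=\s*"([^"]*)"\s*$')

# Hypothetical set of required keys, taken from the labels on the slide.
REQUIRED = ("software", "software.version", "version",
            "description", "website", "license", "tags")


def parse_labels(dockerfile_text):
    """Return a dict of LABEL key/value pairs found in the Dockerfile text."""
    labels = {}
    for line in dockerfile_text.splitlines():
        match = LABEL_RE.match(line.strip())
        if match:
            labels[match.group(1)] = match.group(2)
    return labels


def missing_labels(labels, required=REQUIRED):
    """Return the required keys that are absent, in the order of `required`."""
    return [key for key in required if key not in labels]


EXAMPLE = '''FROM biocontainers/biocontainers:latest
LABEL base.image="biocontainers:latest"
LABEL version="3"
LABEL software="Comet"
LABEL software.version="2016012"
LABEL tags="Proteomics"
'''

labels = parse_labels(EXAMPLE)
print(missing_labels(labels))
```

A check like this could run in continuous integration so that recipes with incomplete metadata are flagged before a container is published; multi-line labels would need a more complete Dockerfile parser.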
  • 7. Implementation Study (2018-2019)
    Development of Architecture for Software Containers at ELIXIR and its use by EXCELERATE use-case communities.
    1. Develop the Bioinformatics Containers Central Service (BCCS) to support the use of software containers in ELIXIR.
    2. BioContainers integration with bio.tools and OpenEBench.
    3. BioContainers Registry 2.0 and command-line tool.
    4. BioContainers for Training and Support.
    5. Demonstration of the use of software containers in selected ELIXIR use cases:
       a. Rare Diseases use case: proof-of-concept implementation of a human genomics variant calling pipeline using BioContainers and Galaxy.
       b. EGA integration.
  • 9. Architecture BioContainers 2.0
    1. Continuous integration for Dockerfile containers: https://jenkins.biocontainers.pro/jenkins/
    2. Backup of binaries and recipes on ELIXIR Compute: https://containers.biocontainers.pro/s3/
    3. Internal ELIXIR registry: http://containers.biocontainers.pro/containers
    4. BioContainers TRS API: https://api.biocontainers.pro/api/ga4gh/v2/ui/
    5. BioContainers Web: https://biocontainers.pro
  • 10. Planemo - A Scientific Workflow SDK
    > planemo run --docker …
    > planemo test --docker … workflow[xml.yml]
    > planemo mull build-and-test …
    > mulled-build-channel --channel bioconda --namespace biocontainers
  • 11. Training containers based on Galaxy VRE with:
    ● tools (versioned)
    ● workflows (versioned)
    ● interactive tours
    ● example data
    ● linked to and from hands-on material
    ● reference data
  • 13. BioContainers TRS API
    Code (Flask, MongoDB, Python): https://github.com/BioContainers/biocontainers-backend
    https://api.biocontainers.pro/api/ga4gh/v2/ui/
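As an illustration of how a client might address the TRS API, here is a small Python sketch that only builds request URLs and makes no network calls. It assumes the API root is the slide's UI address with the trailing `/ui/` removed, and that the service exposes the `/tools` and `/tools/{id}/versions` paths with `toolname` and `limit` query parameters as described in the GA4GH Tool Registry Service specification; none of this is confirmed by the slide, and the helper names are hypothetical.

```python
from urllib.parse import quote, urlencode

# Assumption: API root derived from the UI address on the slide.
BASE = "https://api.biocontainers.pro/api/ga4gh/v2"


def tools_url(base=BASE, **filters):
    """Build a URL for listing tools, with optional query filters."""
    query = urlencode(sorted(filters.items()))  # sorted for a stable URL
    return f"{base}/tools" + (f"?{query}" if query else "")


def versions_url(tool_id, base=BASE):
    """Build a URL for listing the versions of one tool."""
    return f"{base}/tools/{quote(tool_id, safe='')}/versions"


print(tools_url(toolname="comet", limit=5))
print(versions_url("comet"))
```

The resulting strings can be passed to any HTTP client; the live service's documentation should be checked for the actual base path and supported parameters before relying on them.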
  • 15. BioContainers Statistics and Usage
    Pipelines and tools using BioContainers:
    ● Galaxy: ~1500
    ● CWL: ~300
    ● Shell: ~200
    ● WDL: 60
    ● Python: 50
    ● Nextflow: 50
  • 16. Selected Publications (2018-2019)
    https://scholar.google.co.uk/citations?user=O6xG2-EAAAAJ&hl=en
    1. da Veiga Leprevost, F., Grüning, B. et al. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192.
    2. Grüning B, Dale R et al. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7.
    3. Grüning B., Sallou O., Moreno, P., et al. 2019. F1000Research, 7. doi: 10.12688/f1000research.15140.2
    4. Moreno P. et al. 2019. bioRxiv. doi: 10.1101/488643v2
    5. Perez-Riverol Y, et al. 2019. bioRxiv. doi: 10.1101/604413v1
  • 17. Moving services to Cloud
    1. Services are not tools.
       • Services need to be redefined or reimplemented to be cloud-compatible.
       • Services need a high level of orchestration. Most of them include databases, APIs, web front ends, etc.
    2. Data should be open and transferable from the source to the deployed cloud instances.
    3. A public list of important or interesting services for the community and industry should be collected.
    4. Funding from industry and public sources is needed to help service teams move to the Cloud.
    https://elixir-europe.org/services/tag/data-resources
  • 18. First efforts
    1. The BioContainers architecture is fully deployable to the Cloud using Kubernetes Helm charts (https://github.com/BioContainers/biocontainers-backend).
    2. Other projects moving in the same direction:
       - Omics Discovery Index (www.omicsdi.org)
       - Identifiers.org
       - Reactome (https://reactome.org/)
  • 19. Thanks!!! and 700+ contributors