BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelines. BioContainers is based on popular open-source projects Docker and rkt frameworks, that allow software to be installed and executed under an isolated and controlled environment. Also, it provides infrastructure and basic guidelines to create, manage and distribute bioinformatics containers with a special focus on omics technologies. These containers can be integrated into more comprehensive bioinformatics pipelines and different architectures (local desktop, cloud environments or HPC clusters).
2. http://biocontainers.pro | @biocontainers | github.com/biocontainers
ELIXIR All Hands June, 2019
http://biocontainers.pro | @biocontainers | github.com/biocontainers
BioContainers Community
3. http://biocontainers.pro | @biocontainers | github.com/biocontainershttp://biocontainers.pro | @biocontainers | github.com/biocontainers
Outline
I. Introduction to BioContainers
II. Results of the ELIXIR Implementation Study (2018-2019)
III. Bioinformatics Services to Cloud
Question : https://www.sli.do Code: Y726
ELIXIR All Hands June, 2019
4. http://biocontainers.pro | @biocontainers | github.com/biocontainershttp://biocontainers.pro | @biocontainers | github.com/biocontainers
Software
Publication
Research
What is BioContainers
Implementation of Guidelines:
• Versioning
• Testing
• Dependency management
• Documentation
My Data
Container Publication
Results
Results
mulled
dockerfile
ELIXIR All Hands June, 2019
6. http://biocontainers.pro | @biocontainers | github.com/biocontainershttp://biocontainers.pro | @biocontainers | github.com/biocontainers
Findability
# Base Image FROM biocontainers/biocontainers:latest
# Metadata
LABEL base.image="biocontainers:latest”
LABEL version="3"
LABEL software="Comet”
LABEL software.version="2016012”
LABEL description="an open source tandem mass spectrometry
sequence database search tool"
LABEL website="http://comet-ms.sourceforge.net/"
LABEL
documentation="http://comet-ms.sourceforge.net/parameters/param
eters_2016010/"
LABEL license="http://comet-ms.sourceforge.net/"
LABEL tags="Proteomics"
# Maintainer
MAINTAINER Felipe da Veiga Leprevost
<felipe@leprevost.com.br>
USER biodocker
…..
Metadata should be able to connect
to other relevant software registries
such as: bio.tools , pubmed or
omicstools
Versions should be defined in an
standard format for the container and
the software inside.
Software duplications should be
removed between mulled and
dockerfile containers.
Metadata of bioconda packages and
Dockerfile recipes should be the
same.
ELIXIR All Hands June, 2019
7. http://biocontainers.pro | @biocontainers | github.com/biocontainers
Implementation Study (2018-2019)
ELIXIR All Hands June, 2019
Development of Architecture for Software Containers at ELIXIR and its use by
EXCELERATE use-cases communities.
1. Develop the Bioinformatics Containers Central Service (BCCS) to support the use of
software containers in ELIXIR.
2. BioContainers integration with bio.tools and OpenEBench.
3. BioContainers Registry 2.0 and Command Line tool.
4. BioContainers for Training and Support
5. Demonstration of the use of software containers in selected ELIXIR’s Use Cases
a. Rare Diseases use-case. Proof-of-concept implementation of a human genomics
variant calling pipeline using BioContainers and Galaxy.
b. EGA Integration.
11. http://biocontainers.pro | @biocontainers | github.com/biocontainershttp://biocontainers.pro | @biocontainers | github.com/biocontainers
ELIXIR All Hands June, 2019
Training containers based on Galaxy VRE with:
● tools (versioned)
● workflows (versioned)
● Interactive tours
● example data
● linked to and from hands-on material
● reference data
17. http://biocontainers.pro | @biocontainers | github.com/biocontainershttp://biocontainers.pro | @biocontainers | github.com/biocontainers
Moving services to Cloud
1. Services are not tools
• Services needs to be re-define/re-implemented to be cloud compatible.
• Services needs high-level of orchestration. Most of these services contains DB,
APIs, Web, etc.
2. Data should be open, transferable from the source into the deployed cloud instances.
3. A public list of important/interesting services for the community and industry should be
collected.
4. Funding from industry / public to help service teams to move into Cloud.
https://elixir-europe.org/services/tag/data-resources
ELIXIR All Hands June, 2019
18. http://biocontainers.pro | @biocontainers | github.com/biocontainershttp://biocontainers.pro | @biocontainers | github.com/biocontainers
First efforts
1. BioContainers Architecture is fully deployable into Cloud using Kubenetes helm
charts. (https://github.com/BioContainers/biocontainers-backend)
2. Some other projects moving within this efforts:
- Omics Discovery Index (www.omicsdi.org)
- Identifiers.org
- Reactome (https://reactome.org/)
ELIXIR All Hands June, 2019
19. http://biocontainers.pro | @biocontainers | github.com/biocontainershttp://biocontainers.pro | @biocontainers | github.com/biocontainers
ELIXIR All Hands June, 2019
Thanks!!!
and 700+ contributors