SlideShare a Scribd company logo
Open PHACTS
Architecture and
Docker install
Stian Soiland-Reyes, University of Manchester
http://orcid.org/0000-0001-9842-9718
@soilandreyes
This work is licensed under a
.Creative Commons Attribution 4.0 International License
Big Data Europe Webinar, 2016-07-06
This work has been done as part of the BioExcel CoE ( ),
a project funded by the EC H2020 program, contract number
www.bioexcel.eu
EINFRA-5-2015 675728
https://slides.com/soilandreyes/2016-07-06-openphacts
1
http://www.openphacts.org/
Bringing together pharmacological data resources
in an integrated, interoperable infrastructure
Data sources integrated and linked together
so that you can easily see the relationships
between compounds, targets, pathways,
diseases and tissues.
, , , ,
, , , ,
, ,
ChEBI ChEMBL ChemSpider ConceptWiki
DisGeNET DrugBank FAERS Gene Ontology
neXtProt SureChEMBL, UniProt WikiPathways
2 . 1
Data integration
https://www.openphacts.org/2/sci/data.html
2 . 2
https://dev.openphacts.org/docs/2.1
Re-exposed as
public API
2 . 3
{
"format": "linked-data-api",
"version": "1.5",
"result": {
"_about": "https://beta.openphacts.org/1.5/compound?uri=http%3A%2F%2Fwww.conceptwiki.org%2Fconce
"definition": "https://beta.openphacts.org/api-config",
"extendedMetadataVersion": "https://beta.openphacts.org/1.5/compound?uri=http%3A%2F%2Fwww.concep
"linkPredicate": "http://www.w3.org/2004/02/skos/core#exactMatch",
"activeLens": "Default",
"primaryTopic": {
"_about": "http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7bdf9d5",
"inDataset": "http://www.conceptwiki.org",
"exactMatch": [
{
"_about": "http://bio2rdf.org/drugbank:DB00398",
"description_en": "Sorafenib (rINN), marketed as Nexavar by Bayer, is a drug approved for th
"description": "Sorafenib (rINN), marketed as Nexavar by Bayer, is a drug approved for the t
"drugType_en": [
"investigational",
"approved"
],
"drugType": [
"investigational",
"approved"
],
"genericName_en": "Sorafenib",
"genericName": "Sorafenib",
"metabolism_en": "Sorafenib is metabolized primarily in the liver, undergoing oxidative meta
"metabolism": "Sorafenib is metabolized primarily in the liver, undergoing oxidative metabol
"proteinBinding_en": "99.5% bound to plasma proteins.",
"proteinBinding": "99.5% bound to plasma proteins.",
"toxicity_en": "The highest dose of sorafenib studied clinically is 800 mg twice daily. The
"toxicity": "The highest dose of sorafenib studied clinically is 800 mg twice daily. The adv
"inDataset": "http://www.openphacts.org/bio2rdf/drugbank", 2 . 4
<?xml version="1.0" encoding="utf-8"?>
<result format="linked-data-api" version="1.5" href="https://beta.openphacts.org/1.5/compound?uri=
<primaryTopic href="http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7bdf9d5">
<prefLabel xml:lang="en">Sorafenib</prefLabel>
<exactMatch>
<item href="http://rdf.ebi.ac.uk/resource/chembl/molecule/CHEMBL1336">
<type href="http://rdf.ebi.ac.uk/terms/chembl#SmallMolecule"/>
<inDataset href="http://www.ebi.ac.uk/chembl"/>
<mw_freebase datatype="double">464.82</mw_freebase>
</item>
<item href="http://ops.rsc.org/OPS379634">
<smiles>CNC(=O)C1=NC=CC(=C1)OC2=CC=C(C=C2)NC(=O)NC3=CC(=C(C=C3)Cl)C(F)(F)F</smiles>
<rtb datatype="double">5.0</rtb>
<ro5_violations datatype="double">1.0</ro5_violations>
<psa datatype="double">92.35</psa>
<molweight datatype="double">464.825</molweight>
<molformula>C21H16ClF3N4O3</molformula>
<logp datatype="double">5.158</logp>
<inchikey>MLDQJTXFUGDVEO-UHFFFAOYSA-N</inchikey>
<inchi>InChI=1S/C21H16ClF3N4O3/c1-26-19(30)18-11-15(8-9-27-18)32-14-5-2-12(3-6-14)28-20(31
<hbd datatype="double">3.0</hbd>
<hba datatype="double">7.0</hba>
<inDataset href="http://ops.rsc.org"/>
</item>
<item href="http://aers.data2semantics.org/resource/drug/NEXAVAR">
<prefLabel>NEXAVAR</prefLabel>
<reportedAdverseEvent>
<item href="http://aers.data2semantics.org/resource/diagnosis/HEAD_INJURY">
<prefLabel>HEAD INJURY</prefLabel>
<inDataset href="http://aers.data2semantics.org/"/>
</item>
<item href="http://aers.data2semantics.org/resource/diagnosis/SUPRAVENTRICULAR_TACHYCARD
<prefLabel>SUPRAVENTRICULAR TACHYCARDIA</prefLabel>
<inDataset href="http://aers.data2semantics.org/"/> 2 . 5
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
@prefix void: <http://rdfs.org/ns/void#> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix ns0: <http://www.openphacts.org/api#> .
@prefix ns1: <http://bio2rdf.org/> .
@prefix ns2: <http://rdf.ebi.ac.uk/terms/chembl#> .
@prefix chembl1336: <http://rdf.ebi.ac.uk/resource/chembl/molecule/CHEMBL1336#> .
@prefix linked-data: <http://purl.org/linked-data/api/vocab#> .
@prefix msg0: <http://www.openphacts.org/api/> .
<http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7bdf9d5>
skos:exactMatch <http://aers.data2semantics.org/resource/drug/NEXAVAR> ;
skos:exactMatch <http://aers.data2semantics.org/resource/drug/SORAFENIB> ;
skos:exactMatch <http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7b
skos:exactMatch <http://bio2rdf.org/drugbank:DB00398> ;
skos:exactMatch <http://rdf.ebi.ac.uk/resource/chembl/molecule/CHEMBL1336> ;
skos:exactMatch <http://ops.rsc.org/OPS379634> ;
skos:prefLabel "Sorafenib"@en ;
void:inDataset <http://www.conceptwiki.org> ;
foaf:isPrimaryTopicOf <https://beta.openphacts.org/1.5/compound?uri=http%3A%2F%2Fwww.conc
<https://beta.openphacts.org/1.5/compound?uri=http%3A%2F%2Fwww.conceptwiki.org%2Fconcept%2F3893255
foaf:primaryTopic <http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7
linked-data:definition <https://beta.openphacts.org/api-config> ;
msg0:activeLens "Default" ;
void:linkPredicate skos:exactMatch ;
linked-data:extendedMetadataVersion <https://beta.openphacts.org/1.5/compound?uri=http%3A
<http://ops.rsc.org/OPS379634>
void:inDataset <http://ops.rsc.org> ;
ns0:smiles "CNC(=O)C1=NC=CC(=C1)OC2=CC=C(C=C2)NC(=O)NC3=CC(=C(C=C3)Cl)C(F)(F)F" ;
ns0:inchi "InChI=1S/C21H16ClF3N4O3/c1-26-19(30)18-11-15(8-9-27-18)32-14-5-2-12(3
ns0:inchikey "MLDQJTXFUGDVEO-UHFFFAOYSA-N" ; 2 . 6
explorer.openphacts.org
3
Architecture
4 . 1
API architecture
4 . 2
Chemical Structure Search
RDF/SPARQL
(Virtuoso)
Identity Mapping Service
Identity Resolution Service
(ConceptWiki)
Chembl, Uniprot, ...
Data loading
4 . 3
Linux Container technology
..light-weight "virtual" virtual machine
A container is started from a image
Images downloaded from Docker Hub
Dockerfile: Layer-based recipe
Philosophy: One service, one image → microservices
Cloud's best friend: scalable, reproducible, customizable
https://www.docker.com/
5 . 1
https://hub.docker.com/r/openphacts/
5 . 2
ops-ims
ops-mysql
ops-virtuoso
ops-apiops-memcached
ops-virtuosodata
ops-mysqldata
ops-virtuosostaging
ops-mysqlstaging
https://data.openphacts.org/
ops-explorer
:3001
:3002
:3004:3003
https://hub.docker.com/
ops-docker
https://github.com/openphacts/ops-docker/
5 . 3
Docker Compose
https://www.docker.com/products/docker-compose
Which images to download
Which data volumes to use
Which network ports are exposed
How are containers linked
How to start/stop the containers
$ docker-compose up -d
5 . 4
docker-compose.yml
# Open PHACTS platform
# Docker Compose configuration
explorer:
image: openphacts/explorer2
ports:
- "3001:3000"
links:
- api
environment:
- API_URL=http://localhost:3002
#restart: always
api:
image: openphacts/ops-linkeddataapi
ports:
- "3002:80"
links:
- ims
- memcached
- virtuoso:sparql
# SPARQL server
virtuoso:
build: virtuoso-ops
ports:
- "3003:8890"
volumes_from:
- virtuosodata
virtuosodata:
image: busybox
volumes:
- /virtuoso 5 . 5
Data staging
6 . 1
Docker and data?
Docker Hub maximum image size: 10 GB
Open PHACTS data (compressed): ~30 GB
Open PHACTS data (installed): ~200 GB
Solution: Added staging Docker containers
Download from
Verify consistency
Import into Virtuso and mySQL
https://data.openphacts.org/
6 . 2
https://data.openphacts.org/
6 . 3
https://data.openphacts.org/
data.openphacts.org
RDF datasets
RDF linksets
VoID metadata/provenance
mySQL-imported linksets
Virtuoso-imported datasets
→ Maven repository
release data as software
→Research Objects
propagate metadata
6 . 4
Try it!
7 . 1
https://github.com/openphacts/ops-docker
Hardware requirements:
150 GB of disk space (ideal: 250 GB)
16 GB of RAM (ideal: 128 GB)
4 CPU core (ideal: 8 cores)
Prerequisites:
Recent x64 Linux (Ubuntu 14.04 LTS, Centos 7)
Fast Internet connection
Docker
Docker Compose
What do I need?
7 . 2
https://github.com/openphacts/ops-docker
Follow the GitHub tutorial exactly, customize later
Install latest Docker and Docker Compose
Just testing on Windows or OS X?
.. modify Docker's Linux VM to have enough disk and memory
Firewall? Different settings depend on your firewall details.
Don't worry - Docker is containerized!
..you won't break your machine
Don't jump ahead..
7 . 3
https://github.com/openphacts/ops-docker
Get the software
curl -L https://github.com/openphacts/ops-docker/archive/master.tar.gz | tar xzv
cd ops-docker-master
sudo docker-compose pull
7 . 4
https://github.com/openphacts/ops-docker
Get the data
$ sudo docker-compose up --no-recreate -d mysqlstaging virtuosostaging
$ sudo docker-compose logs mysqlstaging virtuosostaging
ops-mysqlstaging | mySQL staging finished
ops-mysqlstaging exited with code 0
ops-virtuosostaging | 09:13:35 --> Backup file # 675 [0x3F02-0x74-0x8A]
ops-virtuosostaging | 09:13:36 --> Backup file # 676 [0x3F02-0x74-0x8A]
ops-virtuosostaging | 09:13:37 End of restoring from backup, 6751701 pages
ops-virtuosostaging | 09:13:37 Server exiting
ops-virtuosostaging | Loading completed
ops-virtuosostaging exited with code 0
7 . 5
https://github.com/openphacts/ops-docker
Start the services
$ sudo docker-compose up --no-recreate -d
$ sudo docker-compose logs --tail=5
7 . 6
Using the services
8 . 1
http://localhost:3001/ Explorer
8 . 2
http://localhost:3002/ API
8 . 3
http://localhost:3003/ SPARQL
8 . 4
http://localhost:3004/QueryExpander Identity Mapping
8 . 5
What's next?
9 . 1
Custom data staging
Different Open PHACTS 2.1 licensing options:
Non-Commercial users: Everything
Commercial users: No DrugBank, partial SureChembl
Open PHACTS members: Full SureChembl
9 . 2
Microservices pr dataset
Most queries have separate fragments per dataset
..which could be executed on separate microservices
Better cloud scalability
Easier to test upgrades of individual datasets
But still need "API" layer to do Identity Mapping
and selecting datasets to query
9 . 3
BioExcel Workflow blocks
BioExcel approach: Spin up virtual machine when an
Open PHACTS workflow is started
Workflow bound dynamically to VM instance(s)
Scalability (exclusive access)
Reproducibility (independent/fixed OPS install)
Tool descriptions - exposed in bio.tools
9 . 4
Customization
Make it easier to add third-party data:
datasets, linksets, queries, API calls
..so pharma industry can mix in their in-house data
.. so academics can upgrade and expand datasets
More tooling,
more documentation,
or more training?
9 . 5
Feedback
https://github.com/openphacts/ops-docker/issues
http://support.openphacts.org/
http://ask.bioexcel.eu/
https://data.openphacts.org/
10

More Related Content

Viewers also liked

Resolucion 013 de 2016 modifica cronograma
Resolucion 013 de 2016 modifica cronogramaResolucion 013 de 2016 modifica cronograma
Resolucion 013 de 2016 modifica cronograma
Remedios Antioquia
 
Alhuda CIBE - Risk Management in Islamic Banking by Tariqullah Khan
Alhuda CIBE - Risk Management in Islamic Banking by Tariqullah KhanAlhuda CIBE - Risk Management in Islamic Banking by Tariqullah Khan
Alhuda CIBE - Risk Management in Islamic Banking by Tariqullah Khan
Alhuda Centre of Islamic Banking & Economics
 
QA Fest 2015. Игорь Хрол. Автоматизация тестирования: отбрасываем лишнее и пр...
QA Fest 2015. Игорь Хрол. Автоматизация тестирования: отбрасываем лишнее и пр...QA Fest 2015. Игорь Хрол. Автоматизация тестирования: отбрасываем лишнее и пр...
QA Fest 2015. Игорь Хрол. Автоматизация тестирования: отбрасываем лишнее и пр...
QAFest
 
Docker architecture(version modified)
Docker architecture(version modified)Docker architecture(version modified)
Docker architecture(version modified)
Mohammadreza Amini
 
Uso de las redes sociales en la educación
Uso de las redes sociales en la educaciónUso de las redes sociales en la educación
Uso de las redes sociales en la educación
Vale Hdz
 
Manual de organización
Manual de organización Manual de organización
Manual de organización
DORIANGEL21
 
La sociedad Multietnica y Pluricultural.
La sociedad Multietnica y Pluricultural.La sociedad Multietnica y Pluricultural.
La sociedad Multietnica y Pluricultural.
Jorge Luis Guerra
 
Bank Alfalah Limited Internship Report
Bank Alfalah Limited Internship ReportBank Alfalah Limited Internship Report
Bank Alfalah Limited Internship ReportFatima Tafakhar
 

Viewers also liked (10)

Presentation
PresentationPresentation
Presentation
 
Resolucion 013 de 2016 modifica cronograma
Resolucion 013 de 2016 modifica cronogramaResolucion 013 de 2016 modifica cronograma
Resolucion 013 de 2016 modifica cronograma
 
Alhuda CIBE - Risk Management in Islamic Banking by Tariqullah Khan
Alhuda CIBE - Risk Management in Islamic Banking by Tariqullah KhanAlhuda CIBE - Risk Management in Islamic Banking by Tariqullah Khan
Alhuda CIBE - Risk Management in Islamic Banking by Tariqullah Khan
 
QA Fest 2015. Игорь Хрол. Автоматизация тестирования: отбрасываем лишнее и пр...
QA Fest 2015. Игорь Хрол. Автоматизация тестирования: отбрасываем лишнее и пр...QA Fest 2015. Игорь Хрол. Автоматизация тестирования: отбрасываем лишнее и пр...
QA Fest 2015. Игорь Хрол. Автоматизация тестирования: отбрасываем лишнее и пр...
 
Docker architecture(version modified)
Docker architecture(version modified)Docker architecture(version modified)
Docker architecture(version modified)
 
Presentación3
Presentación3Presentación3
Presentación3
 
Uso de las redes sociales en la educación
Uso de las redes sociales en la educaciónUso de las redes sociales en la educación
Uso de las redes sociales en la educación
 
Manual de organización
Manual de organización Manual de organización
Manual de organización
 
La sociedad Multietnica y Pluricultural.
La sociedad Multietnica y Pluricultural.La sociedad Multietnica y Pluricultural.
La sociedad Multietnica y Pluricultural.
 
Bank Alfalah Limited Internship Report
Bank Alfalah Limited Internship ReportBank Alfalah Limited Internship Report
Bank Alfalah Limited Internship Report
 

Similar to 2016-07-06-openphacts-docker

SC1 - Hangout 2: The Open PHACTS pilot
SC1 - Hangout 2: The Open PHACTS pilotSC1 - Hangout 2: The Open PHACTS pilot
SC1 - Hangout 2: The Open PHACTS pilot
BigData_Europe
 
PyConX - Python & MySQL 8.0 Document Store
PyConX - Python & MySQL 8.0 Document StorePyConX - Python & MySQL 8.0 Document Store
PyConX - Python & MySQL 8.0 Document Store
Frederic Descamps
 
Azure Container Services​
Azure Container Services​Azure Container Services​
Azure Container Services​
Pedro Sousa
 
Azure Container Services​
Azure Container Services​Azure Container Services​
Azure Container Services​
Pedro Sousa
 
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Alluxio, Inc.
 
Bio2RDF@BH2010
Bio2RDF@BH2010Bio2RDF@BH2010
Bio2RDF@BH2010
François Belleau
 
2017-07-22 Common Workflow Language Viewer
2017-07-22 Common Workflow Language Viewer2017-07-22 Common Workflow Language Viewer
2017-07-22 Common Workflow Language Viewer
Stian Soiland-Reyes
 
MySQL Tech Café #8: MySQL 8.0 for Python Developers
MySQL Tech Café #8: MySQL 8.0 for Python DevelopersMySQL Tech Café #8: MySQL 8.0 for Python Developers
MySQL Tech Café #8: MySQL 8.0 for Python Developers
Frederic Descamps
 
Visualize open data with Plone - eea.daviz PLOG 2013
Visualize open data with Plone - eea.daviz PLOG 2013Visualize open data with Plone - eea.daviz PLOG 2013
Visualize open data with Plone - eea.daviz PLOG 2013
Antonio De Marinis
 
Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)
Dag Endresen
 
Whatever it takes - Fixing SQLIA and XSS in the process
Whatever it takes - Fixing SQLIA and XSS in the processWhatever it takes - Fixing SQLIA and XSS in the process
Whatever it takes - Fixing SQLIA and XSS in the process
guest3379bd
 
2016-04-21 BioExcel Usecase Open PHACTS
2016-04-21 BioExcel Usecase Open PHACTS2016-04-21 BioExcel Usecase Open PHACTS
2016-04-21 BioExcel Usecase Open PHACTS
Stian Soiland-Reyes
 
ICWE2017 BigDataEurope
ICWE2017 BigDataEuropeICWE2017 BigDataEurope
ICWE2017 BigDataEurope
BigData_Europe
 
Offline strategies for HTML5 web applications - IPC12
Offline strategies for HTML5 web applications - IPC12Offline strategies for HTML5 web applications - IPC12
Offline strategies for HTML5 web applications - IPC12Stephan Hochdörfer
 
Practical OData
Practical ODataPractical OData
Practical OData
Vagif Abilov
 
Research software and Dataverse
Research software and DataverseResearch software and Dataverse
Research software and Dataverse
philipdurbin
 
Restfs internals
Restfs internalsRestfs internals
Restfs internals
Manfred Furuholmen
 
your browser, my storage
your browser, my storageyour browser, my storage
your browser, my storage
Francesco Fullone
 

Similar to 2016-07-06-openphacts-docker (20)

SC1 - Hangout 2: The Open PHACTS pilot
SC1 - Hangout 2: The Open PHACTS pilotSC1 - Hangout 2: The Open PHACTS pilot
SC1 - Hangout 2: The Open PHACTS pilot
 
PyConX - Python & MySQL 8.0 Document Store
PyConX - Python & MySQL 8.0 Document StorePyConX - Python & MySQL 8.0 Document Store
PyConX - Python & MySQL 8.0 Document Store
 
Bio2RDF @ W3C HCLS2009
Bio2RDF @ W3C HCLS2009Bio2RDF @ W3C HCLS2009
Bio2RDF @ W3C HCLS2009
 
NYC_2016_slides
NYC_2016_slidesNYC_2016_slides
NYC_2016_slides
 
Azure Container Services​
Azure Container Services​Azure Container Services​
Azure Container Services​
 
Azure Container Services​
Azure Container Services​Azure Container Services​
Azure Container Services​
 
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio
 
Bio2RDF@BH2010
Bio2RDF@BH2010Bio2RDF@BH2010
Bio2RDF@BH2010
 
2017-07-22 Common Workflow Language Viewer
2017-07-22 Common Workflow Language Viewer2017-07-22 Common Workflow Language Viewer
2017-07-22 Common Workflow Language Viewer
 
MySQL Tech Café #8: MySQL 8.0 for Python Developers
MySQL Tech Café #8: MySQL 8.0 for Python DevelopersMySQL Tech Café #8: MySQL 8.0 for Python Developers
MySQL Tech Café #8: MySQL 8.0 for Python Developers
 
Visualize open data with Plone - eea.daviz PLOG 2013
Visualize open data with Plone - eea.daviz PLOG 2013Visualize open data with Plone - eea.daviz PLOG 2013
Visualize open data with Plone - eea.daviz PLOG 2013
 
Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)Data exchange alternatives, GIGA TAG (2009)
Data exchange alternatives, GIGA TAG (2009)
 
Whatever it takes - Fixing SQLIA and XSS in the process
Whatever it takes - Fixing SQLIA and XSS in the processWhatever it takes - Fixing SQLIA and XSS in the process
Whatever it takes - Fixing SQLIA and XSS in the process
 
2016-04-21 BioExcel Usecase Open PHACTS
2016-04-21 BioExcel Usecase Open PHACTS2016-04-21 BioExcel Usecase Open PHACTS
2016-04-21 BioExcel Usecase Open PHACTS
 
ICWE2017 BigDataEurope
ICWE2017 BigDataEuropeICWE2017 BigDataEurope
ICWE2017 BigDataEurope
 
Offline strategies for HTML5 web applications - IPC12
Offline strategies for HTML5 web applications - IPC12Offline strategies for HTML5 web applications - IPC12
Offline strategies for HTML5 web applications - IPC12
 
Practical OData
Practical ODataPractical OData
Practical OData
 
Research software and Dataverse
Research software and DataverseResearch software and Dataverse
Research software and Dataverse
 
Restfs internals
Restfs internalsRestfs internals
Restfs internals
 
your browser, my storage
your browser, my storageyour browser, my storage
your browser, my storage
 

More from Stian Soiland-Reyes

2017-11-03 Provenance and Research Object
2017-11-03 Provenance and Research Object2017-11-03 Provenance and Research Object
2017-11-03 Provenance and Research Object
Stian Soiland-Reyes
 
2017-11-03 Scientific Workflow systems
2017-11-03 Scientific Workflow systems2017-11-03 Scientific Workflow systems
2017-11-03 Scientific Workflow systems
Stian Soiland-Reyes
 
2017-09-27-scholarly-html-ro
2017-09-27-scholarly-html-ro2017-09-27-scholarly-html-ro
2017-09-27-scholarly-html-ro
Stian Soiland-Reyes
 
2016-11-21 BioExcel Workflows and Pipelines Interest Group
2016-11-21 BioExcel Workflows and Pipelines Interest Group2016-11-21 BioExcel Workflows and Pipelines Interest Group
2016-11-21 BioExcel Workflows and Pipelines Interest Group
Stian Soiland-Reyes
 
2016-10-20 BioExcel: Building Workflows with Apache Taverna
2016-10-20 BioExcel: Building Workflows with Apache Taverna2016-10-20 BioExcel: Building Workflows with Apache Taverna
2016-10-20 BioExcel: Building Workflows with Apache Taverna
Stian Soiland-Reyes
 
2016-10-20 BioExcel: Advances in Scientific Workflow Environments
2016-10-20 BioExcel: Advances in Scientific Workflow Environments2016-10-20 BioExcel: Advances in Scientific Workflow Environments
2016-10-20 BioExcel: Advances in Scientific Workflow Environments
Stian Soiland-Reyes
 

More from Stian Soiland-Reyes (7)

2017-11-03 Provenance and Research Object
2017-11-03 Provenance and Research Object2017-11-03 Provenance and Research Object
2017-11-03 Provenance and Research Object
 
2017-11-03 Scientific Workflow systems
2017-11-03 Scientific Workflow systems2017-11-03 Scientific Workflow systems
2017-11-03 Scientific Workflow systems
 
2017-09-27-scholarly-html-ro
2017-09-27-scholarly-html-ro2017-09-27-scholarly-html-ro
2017-09-27-scholarly-html-ro
 
basal-ganglia
basal-gangliabasal-ganglia
basal-ganglia
 
2016-11-21 BioExcel Workflows and Pipelines Interest Group
2016-11-21 BioExcel Workflows and Pipelines Interest Group2016-11-21 BioExcel Workflows and Pipelines Interest Group
2016-11-21 BioExcel Workflows and Pipelines Interest Group
 
2016-10-20 BioExcel: Building Workflows with Apache Taverna
2016-10-20 BioExcel: Building Workflows with Apache Taverna2016-10-20 BioExcel: Building Workflows with Apache Taverna
2016-10-20 BioExcel: Building Workflows with Apache Taverna
 
2016-10-20 BioExcel: Advances in Scientific Workflow Environments
2016-10-20 BioExcel: Advances in Scientific Workflow Environments2016-10-20 BioExcel: Advances in Scientific Workflow Environments
2016-10-20 BioExcel: Advances in Scientific Workflow Environments
 

Recently uploaded

heat stroke and heat exhaustion in children
heat stroke and heat exhaustion in childrenheat stroke and heat exhaustion in children
heat stroke and heat exhaustion in children
SumeraAhmad5
 
Pictures of Superficial & Deep Fascia.ppt.pdf
Pictures of Superficial & Deep Fascia.ppt.pdfPictures of Superficial & Deep Fascia.ppt.pdf
Pictures of Superficial & Deep Fascia.ppt.pdf
Dr. Rabia Inam Gandapore
 
Colonic and anorectal physiology with surgical implications
Colonic and anorectal physiology with surgical implicationsColonic and anorectal physiology with surgical implications
Colonic and anorectal physiology with surgical implications
Dr Maria Tamanna
 
Physiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of TastePhysiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of Taste
MedicoseAcademics
 
Non-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdfNon-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdf
MedicoseAcademics
 
Superficial & Deep Fascia of the NECK.pptx
Superficial & Deep Fascia of the NECK.pptxSuperficial & Deep Fascia of the NECK.pptx
Superficial & Deep Fascia of the NECK.pptx
Dr. Rabia Inam Gandapore
 
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdf
ARTIFICIAL INTELLIGENCE IN  HEALTHCARE.pdfARTIFICIAL INTELLIGENCE IN  HEALTHCARE.pdf
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdf
Anujkumaranit
 
BRACHYTHERAPY OVERVIEW AND APPLICATORS
BRACHYTHERAPY OVERVIEW  AND  APPLICATORSBRACHYTHERAPY OVERVIEW  AND  APPLICATORS
BRACHYTHERAPY OVERVIEW AND APPLICATORS
Krishan Murari
 
Ocular injury ppt Upendra pal optometrist upums saifai etawah
Ocular injury  ppt  Upendra pal  optometrist upums saifai etawahOcular injury  ppt  Upendra pal  optometrist upums saifai etawah
Ocular injury ppt Upendra pal optometrist upums saifai etawah
pal078100
 
Evaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animalsEvaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animals
Shweta
 
Novas diretrizes da OMS para os cuidados perinatais de mais qualidade
Novas diretrizes da OMS para os cuidados perinatais de mais qualidadeNovas diretrizes da OMS para os cuidados perinatais de mais qualidade
Novas diretrizes da OMS para os cuidados perinatais de mais qualidade
Prof. Marcus Renato de Carvalho
 
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptxTriangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Dr. Rabia Inam Gandapore
 
A Classical Text Review on Basavarajeeyam
A Classical Text Review on BasavarajeeyamA Classical Text Review on Basavarajeeyam
A Classical Text Review on Basavarajeeyam
Dr. Jyothirmai Paindla
 
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.GawadHemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
NephroTube - Dr.Gawad
 
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptxMaxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Dr. Rabia Inam Gandapore
 
263778731218 Abortion Clinic /Pills In Harare ,
263778731218 Abortion Clinic /Pills In Harare ,263778731218 Abortion Clinic /Pills In Harare ,
263778731218 Abortion Clinic /Pills In Harare ,
sisternakatoto
 
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness JourneyTom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
greendigital
 
Thyroid Gland- Gross Anatomy by Dr. Rabia Inam Gandapore.pptx
Thyroid Gland- Gross Anatomy by Dr. Rabia Inam Gandapore.pptxThyroid Gland- Gross Anatomy by Dr. Rabia Inam Gandapore.pptx
Thyroid Gland- Gross Anatomy by Dr. Rabia Inam Gandapore.pptx
Dr. Rabia Inam Gandapore
 
Vision-1.pptx, Eye structure, basics of optics
Vision-1.pptx, Eye structure, basics of opticsVision-1.pptx, Eye structure, basics of optics
Vision-1.pptx, Eye structure, basics of optics
Sai Sailesh Kumar Goothy
 
Adv. biopharm. APPLICATION OF PHARMACOKINETICS : TARGETED DRUG DELIVERY SYSTEMS
Adv. biopharm. APPLICATION OF PHARMACOKINETICS : TARGETED DRUG DELIVERY SYSTEMSAdv. biopharm. APPLICATION OF PHARMACOKINETICS : TARGETED DRUG DELIVERY SYSTEMS
Adv. biopharm. APPLICATION OF PHARMACOKINETICS : TARGETED DRUG DELIVERY SYSTEMS
AkankshaAshtankar
 

Recently uploaded (20)

heat stroke and heat exhaustion in children
heat stroke and heat exhaustion in childrenheat stroke and heat exhaustion in children
heat stroke and heat exhaustion in children
 
Pictures of Superficial & Deep Fascia.ppt.pdf
Pictures of Superficial & Deep Fascia.ppt.pdfPictures of Superficial & Deep Fascia.ppt.pdf
Pictures of Superficial & Deep Fascia.ppt.pdf
 
Colonic and anorectal physiology with surgical implications
Colonic and anorectal physiology with surgical implicationsColonic and anorectal physiology with surgical implications
Colonic and anorectal physiology with surgical implications
 
Physiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of TastePhysiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of Taste
 
Non-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdfNon-respiratory Functions of the Lungs.pdf
Non-respiratory Functions of the Lungs.pdf
 
Superficial & Deep Fascia of the NECK.pptx
Superficial & Deep Fascia of the NECK.pptxSuperficial & Deep Fascia of the NECK.pptx
Superficial & Deep Fascia of the NECK.pptx
 
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdf
ARTIFICIAL INTELLIGENCE IN  HEALTHCARE.pdfARTIFICIAL INTELLIGENCE IN  HEALTHCARE.pdf
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdf
 
BRACHYTHERAPY OVERVIEW AND APPLICATORS
BRACHYTHERAPY OVERVIEW  AND  APPLICATORSBRACHYTHERAPY OVERVIEW  AND  APPLICATORS
BRACHYTHERAPY OVERVIEW AND APPLICATORS
 
Ocular injury ppt Upendra pal optometrist upums saifai etawah
Ocular injury  ppt  Upendra pal  optometrist upums saifai etawahOcular injury  ppt  Upendra pal  optometrist upums saifai etawah
Ocular injury ppt Upendra pal optometrist upums saifai etawah
 
Evaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animalsEvaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animals
 
Novas diretrizes da OMS para os cuidados perinatais de mais qualidade
Novas diretrizes da OMS para os cuidados perinatais de mais qualidadeNovas diretrizes da OMS para os cuidados perinatais de mais qualidade
Novas diretrizes da OMS para os cuidados perinatais de mais qualidade
 
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptxTriangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
 
A Classical Text Review on Basavarajeeyam
A Classical Text Review on BasavarajeeyamA Classical Text Review on Basavarajeeyam
A Classical Text Review on Basavarajeeyam
 
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.GawadHemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
 
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptxMaxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
Maxilla, Mandible & Hyoid Bone & Clinical Correlations by Dr. RIG.pptx
 
263778731218 Abortion Clinic /Pills In Harare ,
263778731218 Abortion Clinic /Pills In Harare ,263778731218 Abortion Clinic /Pills In Harare ,
263778731218 Abortion Clinic /Pills In Harare ,
 
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness JourneyTom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
 
Thyroid Gland- Gross Anatomy by Dr. Rabia Inam Gandapore.pptx
Thyroid Gland- Gross Anatomy by Dr. Rabia Inam Gandapore.pptxThyroid Gland- Gross Anatomy by Dr. Rabia Inam Gandapore.pptx
Thyroid Gland- Gross Anatomy by Dr. Rabia Inam Gandapore.pptx
 
Vision-1.pptx, Eye structure, basics of optics
Vision-1.pptx, Eye structure, basics of opticsVision-1.pptx, Eye structure, basics of optics
Vision-1.pptx, Eye structure, basics of optics
 
Adv. biopharm. APPLICATION OF PHARMACOKINETICS : TARGETED DRUG DELIVERY SYSTEMS
Adv. biopharm. APPLICATION OF PHARMACOKINETICS : TARGETED DRUG DELIVERY SYSTEMSAdv. biopharm. APPLICATION OF PHARMACOKINETICS : TARGETED DRUG DELIVERY SYSTEMS
Adv. biopharm. APPLICATION OF PHARMACOKINETICS : TARGETED DRUG DELIVERY SYSTEMS
 

2016-07-06-openphacts-docker

  • 1. Open PHACTS Architecture and Docker install Stian Soiland-Reyes, University of Manchester http://orcid.org/0000-0001-9842-9718 @soilandreyes This work is licensed under a .Creative Commons Attribution 4.0 International License Big Data Europe Webinar, 2016-07-06 This work has been done as part of the BioExcel CoE ( ), a project funded by the EC H2020 program, contract number www.bioexcel.eu EINFRA-5-2015 675728 https://slides.com/soilandreyes/2016-07-06-openphacts 1
  • 2. http://www.openphacts.org/ Bringing together pharmacological data resources in an integrated, interoperable infrastructure Data sources integrated and linked together so that you can easily see the relationships between compounds, targets, pathways, diseases and tissues. , , , , , , , , , , ChEBI ChEMBL ChemSpider ConceptWiki DisGeNET DrugBank FAERS Gene Ontology neXtProt SureChEMBL, UniProt WikiPathways 2 . 1
  • 5. { "format": "linked-data-api", "version": "1.5", "result": { "_about": "https://beta.openphacts.org/1.5/compound?uri=http%3A%2F%2Fwww.conceptwiki.org%2Fconce "definition": "https://beta.openphacts.org/api-config", "extendedMetadataVersion": "https://beta.openphacts.org/1.5/compound?uri=http%3A%2F%2Fwww.concep "linkPredicate": "http://www.w3.org/2004/02/skos/core#exactMatch", "activeLens": "Default", "primaryTopic": { "_about": "http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7bdf9d5", "inDataset": "http://www.conceptwiki.org", "exactMatch": [ { "_about": "http://bio2rdf.org/drugbank:DB00398", "description_en": "Sorafenib (rINN), marketed as Nexavar by Bayer, is a drug approved for th "description": "Sorafenib (rINN), marketed as Nexavar by Bayer, is a drug approved for the t "drugType_en": [ "investigational", "approved" ], "drugType": [ "investigational", "approved" ], "genericName_en": "Sorafenib", "genericName": "Sorafenib", "metabolism_en": "Sorafenib is metabolized primarily in the liver, undergoing oxidative meta "metabolism": "Sorafenib is metabolized primarily in the liver, undergoing oxidative metabol "proteinBinding_en": "99.5% bound to plasma proteins.", "proteinBinding": "99.5% bound to plasma proteins.", "toxicity_en": "The highest dose of sorafenib studied clinically is 800 mg twice daily. The "toxicity": "The highest dose of sorafenib studied clinically is 800 mg twice daily. The adv "inDataset": "http://www.openphacts.org/bio2rdf/drugbank", 2 . 4
  • 6. <?xml version="1.0" encoding="utf-8"?> <result format="linked-data-api" version="1.5" href="https://beta.openphacts.org/1.5/compound?uri= <primaryTopic href="http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7bdf9d5"> <prefLabel xml:lang="en">Sorafenib</prefLabel> <exactMatch> <item href="http://rdf.ebi.ac.uk/resource/chembl/molecule/CHEMBL1336"> <type href="http://rdf.ebi.ac.uk/terms/chembl#SmallMolecule"/> <inDataset href="http://www.ebi.ac.uk/chembl"/> <mw_freebase datatype="double">464.82</mw_freebase> </item> <item href="http://ops.rsc.org/OPS379634"> <smiles>CNC(=O)C1=NC=CC(=C1)OC2=CC=C(C=C2)NC(=O)NC3=CC(=C(C=C3)Cl)C(F)(F)F</smiles> <rtb datatype="double">5.0</rtb> <ro5_violations datatype="double">1.0</ro5_violations> <psa datatype="double">92.35</psa> <molweight datatype="double">464.825</molweight> <molformula>C21H16ClF3N4O3</molformula> <logp datatype="double">5.158</logp> <inchikey>MLDQJTXFUGDVEO-UHFFFAOYSA-N</inchikey> <inchi>InChI=1S/C21H16ClF3N4O3/c1-26-19(30)18-11-15(8-9-27-18)32-14-5-2-12(3-6-14)28-20(31 <hbd datatype="double">3.0</hbd> <hba datatype="double">7.0</hba> <inDataset href="http://ops.rsc.org"/> </item> <item href="http://aers.data2semantics.org/resource/drug/NEXAVAR"> <prefLabel>NEXAVAR</prefLabel> <reportedAdverseEvent> <item href="http://aers.data2semantics.org/resource/diagnosis/HEAD_INJURY"> <prefLabel>HEAD INJURY</prefLabel> <inDataset href="http://aers.data2semantics.org/"/> </item> <item href="http://aers.data2semantics.org/resource/diagnosis/SUPRAVENTRICULAR_TACHYCARD <prefLabel>SUPRAVENTRICULAR TACHYCARDIA</prefLabel> <inDataset href="http://aers.data2semantics.org/"/> 2 . 5
  • 7. @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix skos: <http://www.w3.org/2004/02/skos/core#> . @prefix void: <http://rdfs.org/ns/void#> . @prefix foaf: <http://xmlns.com/foaf/0.1/> . @prefix ns0: <http://www.openphacts.org/api#> . @prefix ns1: <http://bio2rdf.org/> . @prefix ns2: <http://rdf.ebi.ac.uk/terms/chembl#> . @prefix chembl1336: <http://rdf.ebi.ac.uk/resource/chembl/molecule/CHEMBL1336#> . @prefix linked-data: <http://purl.org/linked-data/api/vocab#> . @prefix msg0: <http://www.openphacts.org/api/> . <http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7bdf9d5> skos:exactMatch <http://aers.data2semantics.org/resource/drug/NEXAVAR> ; skos:exactMatch <http://aers.data2semantics.org/resource/drug/SORAFENIB> ; skos:exactMatch <http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7b skos:exactMatch <http://bio2rdf.org/drugbank:DB00398> ; skos:exactMatch <http://rdf.ebi.ac.uk/resource/chembl/molecule/CHEMBL1336> ; skos:exactMatch <http://ops.rsc.org/OPS379634> ; skos:prefLabel "Sorafenib"@en ; void:inDataset <http://www.conceptwiki.org> ; foaf:isPrimaryTopicOf <https://beta.openphacts.org/1.5/compound?uri=http%3A%2F%2Fwww.conc <https://beta.openphacts.org/1.5/compound?uri=http%3A%2F%2Fwww.conceptwiki.org%2Fconcept%2F3893255 foaf:primaryTopic <http://www.conceptwiki.org/concept/38932552-111f-4a4e-a46a-4ed1d7 linked-data:definition <https://beta.openphacts.org/api-config> ; msg0:activeLens "Default" ; void:linkPredicate skos:exactMatch ; linked-data:extendedMetadataVersion <https://beta.openphacts.org/1.5/compound?uri=http%3A <http://ops.rsc.org/OPS379634> void:inDataset <http://ops.rsc.org> ; ns0:smiles "CNC(=O)C1=NC=CC(=C1)OC2=CC=C(C=C2)NC(=O)NC3=CC(=C(C=C3)Cl)C(F)(F)F" ; ns0:inchi "InChI=1S/C21H16ClF3N4O3/c1-26-19(30)18-11-15(8-9-27-18)32-14-5-2-12(3 ns0:inchikey "MLDQJTXFUGDVEO-UHFFFAOYSA-N" ; 2 . 6
  • 11. Chemical Structure Search RDF/SPARQL (Virtuoso) Identity Mapping Service Identity Resolution Service (ConceptWiki) Chembl, Uniprot, ... Data loading 4 . 3
  • 12. Linux Container technology ..light-weight "virtual" virtual machine A container is started from a image Images downloaded from Docker Hub Dockerfile: Layer-based recipe Philosophy: One service, one image → microservices Cloud's best friend: scalable, reproducible, customizable https://www.docker.com/ 5 . 1
  • 15. Docker Compose https://www.docker.com/products/docker-compose Which images to download Which data volumes to use Which network ports are exposed How are containers linked How to start/stop the containers $ docker-compose up -d 5 . 4
  • 16. docker-compose.yml # Open PHACTS platform # Docker Compose configuration explorer: image: openphacts/explorer2 ports: - "3001:3000" links: - api environment: - API_URL=http://localhost:3002 #restart: always api: image: openphacts/ops-linkeddataapi ports: - "3002:80" links: - ims - memcached - virtuoso:sparql # SPARQL server virtuoso: build: virtuoso-ops ports: - "3003:8890" volumes_from: - virtuosodata virtuosodata: image: busybox volumes: - /virtuoso 5 . 5
  • 18. Docker and data? Docker Hub maximum image size: 10 GB Open PHACTS data (compressed): ~30 GB Open PHACTS data (installed): ~200 GB Solution: Added staging Docker containers Download from Verify consistency Import into Virtuso and mySQL https://data.openphacts.org/ 6 . 2
  • 20. https://data.openphacts.org/ data.openphacts.org RDF datasets RDF linksets VoID metadata/provenance mySQL-imported linksets Virtuoso-imported datasets → Maven repository release data as software →Research Objects propagate metadata 6 . 4
  • 22. https://github.com/openphacts/ops-docker Hardware requirements: 150 GB of disk space (ideal: 250 GB) 16 GB of RAM (ideal: 128 GB) 4 CPU core (ideal: 8 cores) Prerequisites: Recent x64 Linux (Ubuntu 14.04 LTS, Centos 7) Fast Internet connection Docker Docker Compose What do I need? 7 . 2
  • 23. https://github.com/openphacts/ops-docker Follow the GitHub tutorial exactly, customize later Install latest Docker and Docker Compose Just testing on Windows or OS X? .. modify Docker's Linux VM to have enough disk and memory Firewall? Different settings depend on your firewall details. Don't worry - Docker is containerized! ..you won't break your machine Don't jump ahead.. 7 . 3
  • 24. https://github.com/openphacts/ops-docker Get the software curl -L https://github.com/openphacts/ops-docker/archive/master.tar.gz | tar xzv cd ops-docker-master sudo docker-compose pull 7 . 4
  • 25. https://github.com/openphacts/ops-docker Get the data $ sudo docker-compose up --no-recreate -d mysqlstaging virtuosostaging $ sudo docker-compose logs mysqlstaging virtuosostaging ops-mysqlstaging | mySQL staging finished ops-mysqlstaging exited with code 0 ops-virtuosostaging | 09:13:35 --> Backup file # 675 [0x3F02-0x74-0x8A] ops-virtuosostaging | 09:13:36 --> Backup file # 676 [0x3F02-0x74-0x8A] ops-virtuosostaging | 09:13:37 End of restoring from backup, 6751701 pages ops-virtuosostaging | 09:13:37 Server exiting ops-virtuosostaging | Loading completed ops-virtuosostaging exited with code 0 7 . 5
  • 26. https://github.com/openphacts/ops-docker Start the services $ sudo docker-compose up --no-recreate -d $ sudo docker-compose logs --tail=5 7 . 6
  • 33. Custom data staging Different Open PHACTS 2.1 licensing options: Non-Commercial users: Everything Commercial users: No DrugBank, partial SureChembl Open PHACTS members: Full SureChembl 9 . 2
  • 34. Microservices pr dataset Most queries have separate fragments per dataset ..which could be executed on separate microservices Better cloud scalability Easier to test upgrades of individual datasets But still need "API" layer to do Identity Mapping and selecting datasets to query 9 . 3
  • 35. BioExcel Workflow blocks BioExcel approach: Spin up virtual machine when an Open PHACTS workflow is started Workflow bound dynamically to VM instance(s) Scalability (exclusive access) Reproducibility (independent/fixed OPS install) Tool descriptions - exposed in bio.tools 9 . 4
  • 36. Customization Make it easier to add third-party data: datasets, linksets, queries, API calls ..so pharma industry can mix in their in-house data .. so academics can upgrade and expand datasets More tooling, more documentation, or more training? 9 . 5