ChEMBL and KNIME provide an ideal match of open data with open tools. This is a quick overview of how to access ChEMBL data resources and web services (ChEMBL, UniChem, Beaker, myChEMBL, SureChEMBL) via the KNIME platform.
5. KNIME at the EBI
• Access ChEBI and ChEMBL databases via KNIME nodes
• Trusted community nodes
• Algorithms development
• Document classification
• Share example workflows and use cases
• Provide KNIME training to scientists and researchers
• Wellcome Trust drug discovery courses, EMBL courses
• CDK community nodes development
h"p://tech.knime.org/book/embl-‐ebi-‐nodes
13. EMBL-EBI chemistry resources
RDF
and
REST
API
interfaces
REST
API
Interface
-‐
h"ps://www.ebi.ac.uk/unichem/
Atlas
Ligand
induced
transcript
response
750
PDBe
Ligand
structures
from
structurally
defined
protein
complexes
15K
ChEBI
Nomenclature
of
primary
and
secondary
metabolites.
Chemical
Ontology
24K
SureChEMBL
Chemical
structures
from
patent
literature
~17M
ChEMBL
Bioac9vity
data
from
literature
and
deposi9ons
1.5M
UniChem
–
InChI-‐based
chemical
resolver
(full
+
relaxed
‘lenses’)
>90M
3rd
Party
Data
ZINC,
PubChem,
ThomsonPharma
DOTF,
IUPHAR,
DrugBank,
KEGG,
NIH
NCC,
eMolecules,
FDA
SRS,
PharmGKB,
Selleck,
….
~70M
22. Using KNIME to connect to myChEMBL
SELECT mr.*, md.chembl_id,
cp.full_mwt, cp.alogp
from mols_rdkit mr,
molecule_dictionary md,
compound_properties cp
where
mr.m @> '$${SMolecule}$$'::qmol
and
mr.molregno = md.molregno
and
md.molregno = cp.molregno;
24. SureChEMBL and Open PHACTS
SureChEMBL
SciBite
Termite
Open
PHACTS
API
https://dev.openphacts.org/docs/develop
https://github.com/openphacts/OPS-Knime/
26. Most relevant diseases
Most relevant targets
Patent publication date histogra
http://rdf.ebi.ac.uk/resource/surechembl/molecule/SCHEMBL371804
Foretinib, a kinase inhibitor in clinical phase II
Found in 89 EP,
WO and US
patents
27. Summary
• KNIME: democratizes access to data and tools
• Access public domain structure and bioactivity data and
services with KNIME
• ChEMBL KNIME Nodes
• UniChem
• Cheminformatics services
• myChEMBL
• SureChEMBL
29. Acknowledgements
• Francis Atkinson
• Louisa Bellis
• Jon Chambers
• Michał Nowotka
• Anne Hersey
• Stefan Beisken
• Edmund Duesbury
• Daniela Digles
• Thorsten Meinl
• KNIME
• KNIME community
All
workflow
examples
are
available
on
request.