SlideShare a Scribd company logo
Claudia Bobach, K. Kruse, S. Boyer, T. Böhme, L. Weber
Using chemical ontologies to create molecular
prediction systems for any molecular property
1Nice, 06.10.2020 11:45
our customers
OntoChem was started in 2005 in Halle / Saale (Germany).
We are approx. 15 people with diverse scientific and technical backgrounds: chemists, biologists,
computer linguists, chemo- and bioinformatics, computer scientists.
Who we are ...
3
How do we support our customers ?
4
1
Data Analytics & Predictions:
indexing your enterprise data; combining internal &
external sources, documents, and DBs; linked data;
document classification; alerts; compound reports.
3 Semantic Software Solutions:
standalone customizable annotation pipelines with
or without ontologies and source documents
(installable or as SaaS).
5 Data Streams:
normalized patents, scientific articles (open source
and behind paywalls), clinical trials, news & more.
2Smart Data:
compound properties from text, images & tables;
mixture compositions; relations (disease X is caused by
protein Y which is inhibited by Z), etc.
4Ontologies:
we understand pharma, chemistry, life and material
sciences, and more …!
We have created structure based chemical ontologies that are used to classify chemical
compounds automatically. These classifications can be used with success in semantic search engines
to find all representatives of a chemical class. In the present paper we would like to demonstrate use
cases when utilizing these chemical classes as features in typical machine learning approaches.
Thus, we have used the co-occurrence of chemical compounds with biological properties in
scientific articles and data sets to train models that predict properties of novel compounds that did
not occur in those training sets. One example is the prediction of toxicity as well as antimicrobial
activity. In principle, one can use any property that is found in the textual vicinity of compounds to
build such predictive models.
Abstract
5
Goal
● Prediction of any molecular property, e.g. toxicity, by using ontologies and text mining
● Application example: likelihood of drug approval based on toxicity
Inputs to predict any property?
● ACS 2018: Compound classifications (“Boyer molecular labels”) -
chemistry compound classes (OntoChem & ClassyFire) and ChEMBL bioactivities
● Now
Millions of full text scientific documents
Co-occurrences - compound and property (e.g. toxicities)
Scoring ontology classes
Prediction of compound properties
Introduction
6
The Workflow
7
Chemical class score
for property
molecular
assignment/
labeling
compound property score
(likeliness to have the property)
Chem. Class Score
secondary amines 1.2052
benzenes 1.1388
propoxybenzenes 0.8142
... ...
Co-Occurrences
Chemical Ontology
[2]
Property Ontology
[2]
Ø
[1]
Chemical Ontology Classes & Hierarchy
8
“An ontology encompasses a representation, formal naming and definition of the categories, properties and
relations between the concepts”
(https://en.wikipedia.org/wiki/Ontology_(information_science)).
http://www.cse.buffalo.edu/~rapaport/676/F01/
icecreamontology.jpg
all amine compounds
all secondary amine compounds
Property Ontology e.g. cytotoxicity as a subclass of toxicity
9
toxicity = parent of cytotoxicity
class cytotoxicity
Ontologies
10
● Ontology to represent chemical features:
● Chemistry ontology including chemical classes & chemical compounds
● Ontologies to represent properties:
● Toxicology ontology: toxic properties
● Effects & processes:
● pharmacologic properties
● antimicrobial,antibacterial, antiviral
● anti-infective, anti-depressant, anti-alzheimer
● Protein interactions
● protein binding
● enzyme inhibition
11
For example - property prediction:
toxicity
11
Annotator output showing [compound property] co-occurrence
12
The occurrence of two named entities (words) in a defined range of text (usually a sentence).
For example PMC3663206 [4]:
The PM genotype has furthermore been associated with an increased risk of toxicity after metoprolol administration [22, 58] …
=> Gives one co-occurrence between metoprolol and toxicity.
Co-Occurrence
13
[3]
Compound - Toxicity Co-Occurrences in Documents
14
Co-Occurrences
[2] [2]
[1]
2,817,902 - PMC Documents1,333,362 - unique compounds
6,385 - compound classes
83,511,588 - compound annotations
12,842,016 - compound class annotations
9,784,556 - toxicity annotations
816 - toxicity concepts
3,311 - unique compound classes co-occur
with a toxicity concept
1,801,709 - co-occurrences between
compounds/classes and toxicity
Chemical Ontology
15
6,500 chemical classes
millions of chemical
compounds are assigned to
its structural chemical
classes by SODIAC
Each compound is
characterized regarding
their chemical features by
the corresponding chemical
classes
molecular assignment/labeling
● given a molecules structure (e.g. smiles).
● given a chemical classes ontology.
● assign molecule substructures to chemical
classes.
● SODIAC (Structure based Ontology Development
and Individual Assignment Center)
Assignment of Compounds to Ontology Classes
16
secondary aminesbenzenes
propoxybenzenes
alkylarylether
Aggregated molecular information
1. Establish co-occurrence between a chemical compound and a property (e.g.: toxicity)
2. Assign ontological information, substructure assignments = chemical compound classes
3. Generalize: co-occurrences are generalized (inherited) to all ancestors of the respective
chemical class
Hypothesis
The more often a chemical class co-occurs with a property, the more likely this class is related to this
property.
Compound Class - Property Co-occurrence Score
17
Let:
coocij
:= number of co-occurrences between class i and property j plus the number of unique
co-occurrences of any child of class i respective property j
Example:
Ni
:= the total number of occurrences of class i plus the total number of unique occurences of any
child of class i
Example:
Then the property score pscrij
of a class i for property j is:
Example:
This defines a score for each of the ~6000 compound classes.
Cooc-Property-Score - Definition
18
coocij
= [all secondary amines]i
- [toxicity]j
cooccurrences = 115,245
Ni
= [all secondary amines]i
total occurrences = 4,581,235
pscrij
= 115,245/4,581,235*100 = 2.5155
To get a normalized cooc-property score, where a score > 1 corresponds to classes that enforce the
property and a score < 1 weakens the property we normalize all scores by the root compound class
“compounds” because the score of this class describes the average property likeliness of all
compounds.
Let: c := the root “compounds” class. Then the normalize score npscr_ij is:
Example:
Cooc-Property-Score - Normalization
19
npscrij
= 2.5155/2.1017 = 1.9968
For a given compound k, the compound property score for a property j is simply the average
compound class property score pscrlj
of all assigned classes l ∈ Ak
normalized to the root compound
class:
Aggregated Compound - Property Score
20
secondary amines
benzenes
propoxybenzenes
1.9968
1.1388
0.8142
Chemical classes
k
npscr Tox Score
j for k
... ...
Metoprolol
score kj =
Mean 1.1751
Chemical Compounds
some with
classes (fragments)
in common
Desired
property j
source
compound
Metoprolol
source fragments
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
coocij
= # of co-occurrences
between class i and property j
property (toxicity)
score for the
compound
Metoprolol
score kj =
Mean 1.1751
Metoprolol
Chemical
classes I
in common = N
(chem fragments)
i
chemical class (fragment)
present in the source molecule
chemical compound that has a
chemical class (fragment) in common
with the source molecule but no
association with the desired property
The desired property
e.g. toxicity all 6,385 chemical
classes (fragments)
all properties
(in this case 816
toxicity classes)
all compounds
In this case 1,333,362
compounds
chemical compound
line thickness
indicates strong or
week association
chemical compound that has a
chemical class (fragment) in common
with the source molecule and has an
association with the desired property
Other properties
e.g. adhesion, LD50, etc
chemical class (fragment) not
in the source molecule
Ranking of Compound Classes by Tox Score
22
Compound class High Tox Score
1,2,3-trithianes 42.3077
campest-24(28)-enes 40.0000
aconitine derivatives 34.3284
trichothecane derivatives 32.0149
trichothecene derivatives 32.0149
3,5-dihydroxybenzoic acid der. 31.5789
2,6-diaminotoluenes 30.4965
eriodictyol derivatives 30.3030
1-azaspiro[5.5]undecanes 26.5306
Compound class Low Tox Score
delta-cadinene derivatives 0.0747
5α-campestanes 0.0690
campestanes 0.0688
alpha-farnesene derivatives 0.0663
sedoheptulose derivatives 0.0469
ketoheptose derivatives 0.0326
oxirenes 0
azirines 0
indolsulfonamides 0
Compound Classes
23
Metoprolol
secondary amines
benzenes
propoxybenzenes
secondary aliphatic
alcohols
1,2-aminoalcohols
alkyl aryl ethers
dialkylethers
methyl ethers
phenol ethers
Example compound Chemical classes
Compound Classes and Toxicity Scores - Example Drug
24
Metoprolol
secondary amines
benzenes
propoxybenzenes
secondary aliphatic
alcohols
1,2-aminoalcohols
alkyl aryl ethers
dialkylethers
methyl ethers
phenol ethers
Example compound Chemical classes Tox score
1.9968
1.1388
0.8142
0.8905
1.1290
1.4162
1.1182
1.6551
1.2091
Tox score
Mean 1.1751
Compound Classes and Toxicity Scores - toxic example
25
batrachotoxin
pyrroles
methylamines
tertiary amines
carboxylic acid esters
cycloalkenes
alkene derivatives
natural product
derivatives
batraochotoxin
derivatives
secondary aliphatic
alcohols
Example compound Chemical classes Tox score
1.7126
1.1190
1.0410
1.2318
1.1955
1.1530
0.9978
4.2887
0.8905
Tox score
Mean 1.3635
diols
dialkylethers
hemiacetals
oxacyclic compounds
0.9349
1.1182
0.8417
1.2011
Compound Classes and Toxicity Scores - non-toxic example
26
L-ascorbic acid
furanones
cycloalkenes
ketoalkenes
aliphatic compounds
primary alcohols
secondary aliphatic
alcohols
1,2-diols
cyclic ketones
dialkylethers
Example compound Chemical classes Tox score
0.9644
1.1955
1.0205
0.8412
0.8067
0.8905
0.6333
1.4434
1.1182
Tox score
Mean 0.9755
hemiacetals 0.8417
27
Application to other properties:
antimicrobial
27
● Similar to the ToxScore calculation
● Used “anti-microbial” concept from “effect or process” ontology as property.
● Co-occurrences between “anti-microbial” child concepts and all chemical classes respective
compounds.
● Five example compounds and their antimicrobial score:
Antimicrobial Property
28
Compound Antimicrobial Score
Halicin 1.44
Trimethoprim 2.34
(+)-(4R)-Limonene 1.84
3-ethenyl-3,4-dihydrodithiine 0.55
2-Vinyl-4H-1,3-dithiine 1.28
29
Evaluation
29
To evaluate the toxicity score, we used a FDA drug approval and clinical toxicity dataset (ClinTox):
Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS et al (2017)
MoleculeNet: a benchmark for molecular machine learning.
arXiv preprint arXiv:170300564 http://moleculenet.ai/
1,478 molecules/smiles were assigned to classes
For each compound, two outcomes where given:
1. Clinical trial toxic (CT_TOX):
112 - toxic
1,366 - not_toxic
2. FDA approved (FDA_APPROVED):
1,384 - approved
94 - not_approved
ToxScore Evaluation - FDA ClinTox and FDA Drug Approval
30
ROC curve for the ToxScore - CT_TOX outcome.
ToxScore Evaluation - FDA Clinical Trial Tox Prediction
31
ToxScore distributions of not toxic vs. toxic
ClinTox compounds.
Using our toxicity score (ToxScore), we tried to predict the ClinTox CT_TOX outcome.
ROC curve for the ToxScore - FDA approval
outcome.
ToxScore Evaluation - FDA Approval Prediction
32
ToxScore distributions of not approved vs.
approved ClinTox compounds.
Using our toxicity score (ToxScore), we tried to predict the ClinTox approval outcome.
Aim: Evaluation of chemical class assignment to provide suitable features for property prediction
The Tox21 Challenge [5] provided more than 10k compounds. Outcomes for 12 different pathway
assays, where 1 means the compound affects this pathway and 0 not.
We made an assignment with this >10k compounds and used this assigned chem classes as features
to train a Support Vector Machine (SVM), on the training data, to predict the outcomes.
The final classifiers were used to predict the outcome of the remaining test set compounds.
Result, see next slide.
Chemical Feature Evaluation - Tox21 Challenge
33
Tox21 test set results:
AUC under the ROC curve (0.5 = no enrichment; 1 - 100% enrichment)
NR.AR androgen receptor
SR.MMP matrix metalloprotease (NR=nuclear receptor effect, SR=stress response effect)
Chemical Feature Evaluation - Tox21 Data Set
34
Tox21 test set results:
different performance for compound prediction of the 3 example clusters:
● NR.AR
● NR.AR.LBD
● NR.AhR
Chemical Feature Evaluation - Tox21 Data Set
35
● Structural chemistry ontology classes can be used as a feature space for machine learning
● Data from Tox21 reveals most relevant chemical ontology classes
● Scientific publications annotated with toxicity terms and small molecule compounds can be
used to automatically generate a toxicity prediction system solely based on chemical structure,
even for non-published compounds
● Beside toxicity, any other annotated compound property can also be used to predict the same
property, e.g. as shown for antimicrobial activity
Summary
36
[1] http://madan.org.il/en/news/languages-still-major-barrier-global-science
[2] https://idslabblog.wordpress.com/research-topic-ontology/
[3] https://commons.wikimedia.org/wiki/File:Hazard_T.svg
[4] Samer CF, Lorenzini KI, Rollason V, Daali Y, Desmeules JA. Applications of CYP450 testing in the clinical setting.
Mol Diagn Ther. 2013 Jun;17(3):165-84. doi: 10.1007/s40291-013-0028-5. PMID: 23588782; PMCID:
PMC3663206.
[5] Andersen, M. E., and Krewski, D. (2009). Toxicity testing in the 21st century: bringing the vision to life. Toxicol.
Sci. 107, 324–330. doi: 10.1093/toxsci/kfn255
[6] Bobach C, Böhme T, Laube U, Püschel A, Weber L (2012) Automated compound classification using a chemical
ontology. J Cheminformatics 4(12):40
References
37
Thank you
contact:
www.ontochem.com
phone: +49 345 4780470
e-mail: claudia.bobach@ontochem.com

More Related Content

Similar to AI-SDV 2020: Using chemical ontologies to create molecular prediction systems for any molecular property Claudia Bobach (OntoChem, Germany)

Computer-aided prediction of xenobiotics toxicity
Computer-aided prediction of xenobiotics toxicityComputer-aided prediction of xenobiotics toxicity
Computer-aided prediction of xenobiotics toxicity
Mobiliuz
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Kamel Mansouri
 
Medicinal chemistry i
Medicinal chemistry  iMedicinal chemistry  i
Medicinal chemistry i
priya yadav
 
CADD
CADDCADD
Hazard Estimation and Method Comparison with OWL-Encoded Toxicity Decision Trees
Hazard Estimation and Method Comparison with OWL-Encoded Toxicity Decision TreesHazard Estimation and Method Comparison with OWL-Encoded Toxicity Decision Trees
Hazard Estimation and Method Comparison with OWL-Encoded Toxicity Decision Trees
Michel Dumontier
 
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Chanin Nantasenamat
 
Ontology work at the Royal Society of Chemistry
Ontology work at the Royal Society of ChemistryOntology work at the Royal Society of Chemistry
Ontology - and Reloaded and Revolutions
Ontology - and Reloaded and RevolutionsOntology - and Reloaded and Revolutions
Ontology - and Reloaded and Revolutions
Jie Bao
 
Evolutionary Symbolic Discovery for Bioinformatics, Systems and Synthetic Bi...
Evolutionary Symbolic Discovery for Bioinformatics,  Systems and Synthetic Bi...Evolutionary Symbolic Discovery for Bioinformatics,  Systems and Synthetic Bi...
Evolutionary Symbolic Discovery for Bioinformatics, Systems and Synthetic Bi...
Natalio Krasnogor
 
80 exam-questions-general-toxicology-for-practising
80 exam-questions-general-toxicology-for-practising80 exam-questions-general-toxicology-for-practising
80 exam-questions-general-toxicology-for-practising
tahamakhlouf
 
Deep reinforcement learning for de novo drug design
Deep reinforcement learning for de novo drug designDeep reinforcement learning for de novo drug design
Deep reinforcement learning for de novo drug design
Nimmi Weeraddana
 
Pharmacophore Modeling and Docking Techniques.ppt
Pharmacophore Modeling and Docking Techniques.pptPharmacophore Modeling and Docking Techniques.ppt
Pharmacophore Modeling and Docking Techniques.ppt
DrVivekChauhan1
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
 
Qsar ppt
Qsar pptQsar ppt
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of ActionA Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
Gerald Lushington
 
cadd-191129134050 (1).pptx
cadd-191129134050 (1).pptxcadd-191129134050 (1).pptx
cadd-191129134050 (1).pptx
Noorelhuda2
 
Structural Bioinformatics.pdf
Structural Bioinformatics.pdfStructural Bioinformatics.pdf
Structural Bioinformatics.pdf
RahmatEkoSanjaya1
 
Molecular and data visualization in drug discovery
Molecular and data visualization in drug discoveryMolecular and data visualization in drug discovery
Molecular and data visualization in drug discovery
Deepak Bandyopadhyay
 
Overview of cheminformatics
Overview of cheminformaticsOverview of cheminformatics
Overview of cheminformatics
Benjamin Bucior
 
Drug design
Drug designDrug design
Drug design
subhasis banerjee
 

Similar to AI-SDV 2020: Using chemical ontologies to create molecular prediction systems for any molecular property Claudia Bobach (OntoChem, Germany) (20)

Computer-aided prediction of xenobiotics toxicity
Computer-aided prediction of xenobiotics toxicityComputer-aided prediction of xenobiotics toxicity
Computer-aided prediction of xenobiotics toxicity
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
 
Medicinal chemistry i
Medicinal chemistry  iMedicinal chemistry  i
Medicinal chemistry i
 
CADD
CADDCADD
CADD
 
Hazard Estimation and Method Comparison with OWL-Encoded Toxicity Decision Trees
Hazard Estimation and Method Comparison with OWL-Encoded Toxicity Decision TreesHazard Estimation and Method Comparison with OWL-Encoded Toxicity Decision Trees
Hazard Estimation and Method Comparison with OWL-Encoded Toxicity Decision Trees
 
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
 
Ontology work at the Royal Society of Chemistry
Ontology work at the Royal Society of ChemistryOntology work at the Royal Society of Chemistry
Ontology work at the Royal Society of Chemistry
 
Ontology - and Reloaded and Revolutions
Ontology - and Reloaded and RevolutionsOntology - and Reloaded and Revolutions
Ontology - and Reloaded and Revolutions
 
Evolutionary Symbolic Discovery for Bioinformatics, Systems and Synthetic Bi...
Evolutionary Symbolic Discovery for Bioinformatics,  Systems and Synthetic Bi...Evolutionary Symbolic Discovery for Bioinformatics,  Systems and Synthetic Bi...
Evolutionary Symbolic Discovery for Bioinformatics, Systems and Synthetic Bi...
 
80 exam-questions-general-toxicology-for-practising
80 exam-questions-general-toxicology-for-practising80 exam-questions-general-toxicology-for-practising
80 exam-questions-general-toxicology-for-practising
 
Deep reinforcement learning for de novo drug design
Deep reinforcement learning for de novo drug designDeep reinforcement learning for de novo drug design
Deep reinforcement learning for de novo drug design
 
Pharmacophore Modeling and Docking Techniques.ppt
Pharmacophore Modeling and Docking Techniques.pptPharmacophore Modeling and Docking Techniques.ppt
Pharmacophore Modeling and Docking Techniques.ppt
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Qsar ppt
Qsar pptQsar ppt
Qsar ppt
 
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of ActionA Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
 
cadd-191129134050 (1).pptx
cadd-191129134050 (1).pptxcadd-191129134050 (1).pptx
cadd-191129134050 (1).pptx
 
Structural Bioinformatics.pdf
Structural Bioinformatics.pdfStructural Bioinformatics.pdf
Structural Bioinformatics.pdf
 
Molecular and data visualization in drug discovery
Molecular and data visualization in drug discoveryMolecular and data visualization in drug discovery
Molecular and data visualization in drug discovery
 
Overview of cheminformatics
Overview of cheminformaticsOverview of cheminformatics
Overview of cheminformatics
 
Drug design
Drug designDrug design
Drug design
 

More from Dr. Haxel Consult

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
Dr. Haxel Consult
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
Dr. Haxel Consult
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
Dr. Haxel Consult
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
Dr. Haxel Consult
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
Dr. Haxel Consult
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
Dr. Haxel Consult
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
Dr. Haxel Consult
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
Dr. Haxel Consult
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
Dr. Haxel Consult
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
Dr. Haxel Consult
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
Dr. Haxel Consult
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
Dr. Haxel Consult
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
Dr. Haxel Consult
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
Dr. Haxel Consult
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
Dr. Haxel Consult
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
Dr. Haxel Consult
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
Dr. Haxel Consult
 

More from Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

Recently uploaded

Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
CIOWomenMagazine
 
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
ysasp1
 
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
cuobya
 
Search Result Showing My Post is Now Buried
Search Result Showing My Post is Now BuriedSearch Result Showing My Post is Now Buried
Search Result Showing My Post is Now Buried
Trish Parr
 
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
cuobya
 
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaalmanuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
wolfsoftcompanyco
 
[HUN][hackersuli] Red Teaming alapok 2024
[HUN][hackersuli] Red Teaming alapok 2024[HUN][hackersuli] Red Teaming alapok 2024
[HUN][hackersuli] Red Teaming alapok 2024
hackersuli
 
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
zyfovom
 
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdfMeet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Florence Consulting
 
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
vmemo1
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Brad Spiegel Macon GA
 
7 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 20247 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 2024
Danica Gill
 
Discover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to IndiaDiscover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to India
davidjhones387
 
Gen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needsGen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needs
Laura Szabó
 
Explore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories SecretlyExplore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories Secretly
Trending Blogers
 
Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!
Toptal Tech
 
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
cuobya
 
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
bseovas
 
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
uehowe
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
ufdana
 

Recently uploaded (20)

Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
 
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
 
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
 
Search Result Showing My Post is Now Buried
Search Result Showing My Post is Now BuriedSearch Result Showing My Post is Now Buried
Search Result Showing My Post is Now Buried
 
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
 
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaalmanuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
 
[HUN][hackersuli] Red Teaming alapok 2024
[HUN][hackersuli] Red Teaming alapok 2024[HUN][hackersuli] Red Teaming alapok 2024
[HUN][hackersuli] Red Teaming alapok 2024
 
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
 
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdfMeet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
 
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
 
7 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 20247 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 2024
 
Discover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to IndiaDiscover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to India
 
Gen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needsGen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needs
 
Explore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories SecretlyExplore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories Secretly
 
Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!
 
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
 
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
 
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
 

AI-SDV 2020: Using chemical ontologies to create molecular prediction systems for any molecular property Claudia Bobach (OntoChem, Germany)

  • 1. Claudia Bobach, K. Kruse, S. Boyer, T. Böhme, L. Weber Using chemical ontologies to create molecular prediction systems for any molecular property 1Nice, 06.10.2020 11:45
  • 3. OntoChem was started in 2005 in Halle / Saale (Germany). We are approx. 15 people with diverse scientific and technical backgrounds: chemists, biologists, computer linguists, chemo- and bioinformatics, computer scientists. Who we are ... 3
  • 4. How do we support our customers ? 4 1 Data Analytics & Predictions: indexing your enterprise data; combining internal & external sources, documents, and DBs; linked data; document classification; alerts; compound reports. 3 Semantic Software Solutions: standalone customizable annotation pipelines with or without ontologies and source documents (installable or as SaaS). 5 Data Streams: normalized patents, scientific articles (open source and behind paywalls), clinical trials, news & more. 2Smart Data: compound properties from text, images & tables; mixture compositions; relations (disease X is caused by protein Y which is inhibited by Z), etc. 4Ontologies: we understand pharma, chemistry, life and material sciences, and more …!
  • 5. We have created structure based chemical ontologies that are used to classify chemical compounds automatically. These classifications can be used with success in semantic search engines to find all representatives of a chemical class. In the present paper we would like to demonstrate use cases when utilizing these chemical classes as features in typical machine learning approaches. Thus, we have used the co-occurrence of chemical compounds with biological properties in scientific articles and data sets to train models that predict properties of novel compounds that did not occur in those training sets. One example is the prediction of toxicity as well as antimicrobial activity. In principle, one can use any property that is found in the textual vicinity of compounds to build such predictive models. Abstract 5
  • 6. Goal ● Prediction of any molecular property, e.g. toxicity, by using ontologies and text mining ● Application example: likelihood of drug approval based on toxicity Inputs to predict any property? ● ACS 2018: Compound classifications (“Boyer molecular labels”) - chemistry compound classes (OntoChem & ClassyFire) and ChEMBL bioactivities ● Now Millions of full text scientific documents Co-occurrences - compound and property (e.g. toxicities) Scoring ontology classes Prediction of compound properties Introduction 6
  • 7. The Workflow 7 Chemical class score for property molecular assignment/ labeling compound property score (likeliness to have the property) Chem. Class Score secondary amines 1.2052 benzenes 1.1388 propoxybenzenes 0.8142 ... ... Co-Occurrences Chemical Ontology [2] Property Ontology [2] Ø [1]
  • 8. Chemical Ontology Classes & Hierarchy 8 “An ontology encompasses a representation, formal naming and definition of the categories, properties and relations between the concepts” (https://en.wikipedia.org/wiki/Ontology_(information_science)). http://www.cse.buffalo.edu/~rapaport/676/F01/ icecreamontology.jpg all amine compounds all secondary amine compounds
  • 9. Property Ontology e.g. cytotoxicity as a subclass of toxicity 9 toxicity = parent of cytotoxicity class cytotoxicity
  • 10. Ontologies 10 ● Ontology to represent chemical features: ● Chemistry ontology including chemical classes & chemical compounds ● Ontologies to represent properties: ● Toxicology ontology: toxic properties ● Effects & processes: ● pharmacologic properties ● antimicrobial,antibacterial, antiviral ● anti-infective, anti-depressant, anti-alzheimer ● Protein interactions ● protein binding ● enzyme inhibition
  • 11. 11 For example - property prediction: toxicity 11
  • 12. Annotator output showing [compound property] co-occurrence 12
  • 13. The occurrence of two named entities (words) in a defined range of text (usually a sentence). For example PMC3663206 [4]: The PM genotype has furthermore been associated with an increased risk of toxicity after metoprolol administration [22, 58] … => Gives one co-occurrence between metoprolol and toxicity. Co-Occurrence 13 [3]
  • 14. Compound - Toxicity Co-Occurrences in Documents 14 Co-Occurrences [2] [2] [1] 2,817,902 - PMC Documents1,333,362 - unique compounds 6,385 - compound classes 83,511,588 - compound annotations 12,842,016 - compound class annotations 9,784,556 - toxicity annotations 816 - toxicity concepts 3,311 - unique compound classes co-occur with a toxicity concept 1,801,709 - co-occurrences between compounds/classes and toxicity
  • 15. Chemical Ontology 15 6,500 chemical classes millions of chemical compounds are assigned to its structural chemical classes by SODIAC Each compound is characterized regarding their chemical features by the corresponding chemical classes
  • 16. molecular assignment/labeling ● given a molecules structure (e.g. smiles). ● given a chemical classes ontology. ● assign molecule substructures to chemical classes. ● SODIAC (Structure based Ontology Development and Individual Assignment Center) Assignment of Compounds to Ontology Classes 16 secondary aminesbenzenes propoxybenzenes alkylarylether
  • 17. Aggregated molecular information 1. Establish co-occurrence between a chemical compound and a property (e.g.: toxicity) 2. Assign ontological information, substructure assignments = chemical compound classes 3. Generalize: co-occurrences are generalized (inherited) to all ancestors of the respective chemical class Hypothesis The more often a chemical class co-occurs with a property, the more likely this class is related to this property. Compound Class - Property Co-occurrence Score 17
  • 18. Let: coocij := number of co-occurrences between class i and property j plus the number of unique co-occurrences of any child of class i respective property j Example: Ni := the total number of occurrences of class i plus the total number of unique occurences of any child of class i Example: Then the property score pscrij of a class i for property j is: Example: This defines a score for each of the ~6000 compound classes. Cooc-Property-Score - Definition 18 coocij = [all secondary amines]i - [toxicity]j cooccurrences = 115,245 Ni = [all secondary amines]i total occurrences = 4,581,235 pscrij = 115,245/4,581,235*100 = 2.5155
  • 19. To get a normalized cooc-property score, where a score > 1 corresponds to classes that enforce the property and a score < 1 weakens the property we normalize all scores by the root compound class “compounds” because the score of this class describes the average property likeliness of all compounds. Let: c := the root “compounds” class. Then the normalize score npscr_ij is: Example: Cooc-Property-Score - Normalization 19 npscrij = 2.5155/2.1017 = 1.9968
  • 20. For a given compound k, the compound property score for a property j is simply the average compound class property score pscrlj of all assigned classes l ∈ Ak normalized to the root compound class: Aggregated Compound - Property Score 20 secondary amines benzenes propoxybenzenes 1.9968 1.1388 0.8142 Chemical classes k npscr Tox Score j for k ... ... Metoprolol score kj = Mean 1.1751
  • 21. Chemical Compounds some with classes (fragments) in common Desired property j source compound Metoprolol source fragments . . . . . . . . . . . . . . . . . . coocij = # of co-occurrences between class i and property j property (toxicity) score for the compound Metoprolol score kj = Mean 1.1751 Metoprolol Chemical classes I in common = N (chem fragments) i chemical class (fragment) present in the source molecule chemical compound that has a chemical class (fragment) in common with the source molecule but no association with the desired property The desired property e.g. toxicity all 6,385 chemical classes (fragments) all properties (in this case 816 toxicity classes) all compounds In this case 1,333,362 compounds chemical compound line thickness indicates strong or week association chemical compound that has a chemical class (fragment) in common with the source molecule and has an association with the desired property Other properties e.g. adhesion, LD50, etc chemical class (fragment) not in the source molecule
  • 22. Ranking of Compound Classes by Tox Score 22 Compound class High Tox Score 1,2,3-trithianes 42.3077 campest-24(28)-enes 40.0000 aconitine derivatives 34.3284 trichothecane derivatives 32.0149 trichothecene derivatives 32.0149 3,5-dihydroxybenzoic acid der. 31.5789 2,6-diaminotoluenes 30.4965 eriodictyol derivatives 30.3030 1-azaspiro[5.5]undecanes 26.5306 Compound class Low Tox Score delta-cadinene derivatives 0.0747 5α-campestanes 0.0690 campestanes 0.0688 alpha-farnesene derivatives 0.0663 sedoheptulose derivatives 0.0469 ketoheptose derivatives 0.0326 oxirenes 0 azirines 0 indolsulfonamides 0
  • 23. Compound Classes 23 Metoprolol secondary amines benzenes propoxybenzenes secondary aliphatic alcohols 1,2-aminoalcohols alkyl aryl ethers dialkylethers methyl ethers phenol ethers Example compound Chemical classes
  • 24. Compound Classes and Toxicity Scores - Example Drug 24 Metoprolol secondary amines benzenes propoxybenzenes secondary aliphatic alcohols 1,2-aminoalcohols alkyl aryl ethers dialkylethers methyl ethers phenol ethers Example compound Chemical classes Tox score 1.9968 1.1388 0.8142 0.8905 1.1290 1.4162 1.1182 1.6551 1.2091 Tox score Mean 1.1751
  • 25. Compound Classes and Toxicity Scores - toxic example 25 batrachotoxin pyrroles methylamines tertiary amines carboxylic acid esters cycloalkenes alkene derivatives natural product derivatives batraochotoxin derivatives secondary aliphatic alcohols Example compound Chemical classes Tox score 1.7126 1.1190 1.0410 1.2318 1.1955 1.1530 0.9978 4.2887 0.8905 Tox score Mean 1.3635 diols dialkylethers hemiacetals oxacyclic compounds 0.9349 1.1182 0.8417 1.2011
  • 26. Compound Classes and Toxicity Scores - non-toxic example 26 L-ascorbic acid furanones cycloalkenes ketoalkenes aliphatic compounds primary alcohols secondary aliphatic alcohols 1,2-diols cyclic ketones dialkylethers Example compound Chemical classes Tox score 0.9644 1.1955 1.0205 0.8412 0.8067 0.8905 0.6333 1.4434 1.1182 Tox score Mean 0.9755 hemiacetals 0.8417
  • 27. 27 Application to other properties: antimicrobial 27
  • 28. ● Similar to the ToxScore calculation ● Used “anti-microbial” concept from “effect or process” ontology as property. ● Co-occurrences between “anti-microbial” child concepts and all chemical classes respective compounds. ● Five example compounds and their antimicrobial score: Antimicrobial Property 28 Compound Antimicrobial Score Halicin 1.44 Trimethoprim 2.34 (+)-(4R)-Limonene 1.84 3-ethenyl-3,4-dihydrodithiine 0.55 2-Vinyl-4H-1,3-dithiine 1.28
  • 30. To evaluate the toxicity score, we used a FDA drug approval and clinical toxicity dataset (ClinTox): Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS et al (2017) MoleculeNet: a benchmark for molecular machine learning. arXiv preprint arXiv:170300564 http://moleculenet.ai/ 1,478 molecules/smiles were assigned to classes For each compound, two outcomes where given: 1. Clinical trial toxic (CT_TOX): 112 - toxic 1,366 - not_toxic 2. FDA approved (FDA_APPROVED): 1,384 - approved 94 - not_approved ToxScore Evaluation - FDA ClinTox and FDA Drug Approval 30
  • 31. ROC curve for the ToxScore - CT_TOX outcome. ToxScore Evaluation - FDA Clinical Trial Tox Prediction 31 ToxScore distributions of not toxic vs. toxic ClinTox compounds. Using our toxicity score (ToxScore), we tried to predict the ClinTox CT_TOX outcome.
  • 32. ROC curve for the ToxScore - FDA approval outcome. ToxScore Evaluation - FDA Approval Prediction 32 ToxScore distributions of not approved vs. approved ClinTox compounds. Using our toxicity score (ToxScore), we tried to predict the ClinTox approval outcome.
  • 33. Aim: Evaluation of chemical class assignment to provide suitable features for property prediction The Tox21 Challenge [5] provided more than 10k compounds. Outcomes for 12 different pathway assays, where 1 means the compound affects this pathway and 0 not. We made an assignment with this >10k compounds and used this assigned chem classes as features to train a Support Vector Machine (SVM), on the training data, to predict the outcomes. The final classifiers were used to predict the outcome of the remaining test set compounds. Result, see next slide. Chemical Feature Evaluation - Tox21 Challenge 33
  • 34. Tox21 test set results: AUC under the ROC curve (0.5 = no enrichment; 1 - 100% enrichment) NR.AR androgen receptor SR.MMP matrix metalloprotease (NR=nuclear receptor effect, SR=stress response effect) Chemical Feature Evaluation - Tox21 Data Set 34
  • 35. Tox21 test set results: different performance for compound prediction of the 3 example clusters: ● NR.AR ● NR.AR.LBD ● NR.AhR Chemical Feature Evaluation - Tox21 Data Set 35
  • 36. ● Structural chemistry ontology classes can be used as a feature space for machine learning ● Data from Tox21 reveals most relevant chemical ontology classes ● Scientific publications annotated with toxicity terms and small molecule compounds can be used to automatically generate a toxicity prediction system solely based on chemical structure, even for non-published compounds ● Beside toxicity, any other annotated compound property can also be used to predict the same property, e.g. as shown for antimicrobial activity Summary 36
  • 37. [1] http://madan.org.il/en/news/languages-still-major-barrier-global-science [2] https://idslabblog.wordpress.com/research-topic-ontology/ [3] https://commons.wikimedia.org/wiki/File:Hazard_T.svg [4] Samer CF, Lorenzini KI, Rollason V, Daali Y, Desmeules JA. Applications of CYP450 testing in the clinical setting. Mol Diagn Ther. 2013 Jun;17(3):165-84. doi: 10.1007/s40291-013-0028-5. PMID: 23588782; PMCID: PMC3663206. [5] Andersen, M. E., and Krewski, D. (2009). Toxicity testing in the 21st century: bringing the vision to life. Toxicol. Sci. 107, 324–330. doi: 10.1093/toxsci/kfn255 [6] Bobach C, Böhme T, Laube U, Püschel A, Weber L (2012) Automated compound classification using a chemical ontology. J Cheminformatics 4(12):40 References 37
  • 38. Thank you contact: www.ontochem.com phone: +49 345 4780470 e-mail: claudia.bobach@ontochem.com