DL-FOIL: Class Expression Learning Revisited
Nicola Fanizzi, Giuseppe Rizzo, Claudia d’Amato, Floriana Esposito
LACAM - Dipartimento di Informatica, Università degli Studi di Bari Aldo Moro
EKAW 2018, Nancy, France – 15th November 2018
Outline
1 Introduction
2 The problem
3 DL-Foil
4 Evaluation
5 Conclusions, Ongoing & Future Work
Giuseppe Rizzo (LACAM-Dip.Informatica, Bari) DL-Foil EKAW 2018, Nancy, France – 15th November 2018
Introduction
Motivations
Goal
Eliciting candidate concept descriptions for semi-automatic knowledge base
completion
TBox: candidate (equivalence) axioms
ABox: candidate (class) assertions by classifying individuals
Solutions
(Supervised) Machine learning methods:
E.g. concept learning: symbolic methods for producing a concept
description from a set of positive, negative, and unlabeled examples
Motivations: previous solutions and current limits
DL-Foil: produces a concept description in disjunctive form providing a
consistent classification of the examples
ternary problem (positive, negative, and unlabeled examples) under the OWA
partial descriptions generated on the fly to cover as many positive
examples as possible
selection among a set of candidates according to a heuristic
Problems:
generated descriptions may cover no positive examples
unlabeled individuals contribute equally to the score used for candidate evaluation
Contribution: improving both the specialization procedure and the
heuristic, which now considers the actual number of unlabeled individuals
The problem
The learning problem
Let K = ⟨T, A⟩ be a DL knowledge base.
Given
a concept name C
a training set Tr = (Ps, Ns, Us)
Find a concept description D such that, letting K′ = ⟨T ∪ {C ≡ D}, A⟩,
the following entailments hold:
∀a ∈ Ps: K′ |= C(a)
∀b ∈ Ns: K′ |= ¬C(b)
i.e. D is correct w.r.t. the examples and general enough for predictive purposes
The learning problem: an example
Let K = { Man ⊑ Person, Man ⊑ ¬Woman, Woman ⊑ Person,
Man(a), Man(b), Man(c), hasChild(a,d), hasChild(b,e), Woman(d), Woman(f),
Artist(e), Dog(z) }
Target concept: Father, i.e. a man with at least one child
Ps = {a, b}
Ns = {d, f} (due to Man ⊑ ¬Woman)
Us = {c, e, z}
Induced concept: Father ≡ Man ⊓ ∃hasChild.⊤
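As a rough illustration of how the three example sets arise under the open-world assumption, the sketch below hand-codes the consequences of the toy knowledge base and classifies each individual with respect to Man ⊓ ∃hasChild.⊤. No reasoner is involved: the dictionaries and the names `father_membership` and `split` are assumptions made for this example, not part of DL-Foil.

```python
# Consequences of the toy KB, written out by hand (assumption: no reasoner is called).
KNOWN_MEMBERS = {"Man": {"a", "b", "c"}, "Woman": {"d", "f"}}
# Man ⊑ ¬Woman makes every known Woman a known non-Man.
KNOWN_NON_MEMBERS = {"Man": {"d", "f"}}
HAS_CHILD = {"a": {"d"}, "b": {"e"}}
INDIVIDUALS = ["a", "b", "c", "d", "e", "f", "z"]


def father_membership(x):
    """Ternary membership in Man ⊓ ∃hasChild.⊤: +1, -1, or 0 (unknown)."""
    if x in KNOWN_NON_MEMBERS["Man"]:
        return -1  # one conjunct is known to be false
    if x in KNOWN_MEMBERS["Man"] and HAS_CHILD.get(x):
        return +1  # both conjuncts are known to hold
    return 0  # open world: missing facts are not negative evidence


def split(individuals):
    """Partition individuals into (Ps, Ns, Us) by ternary membership."""
    ps = [x for x in individuals if father_membership(x) == +1]
    ns = [x for x in individuals if father_membership(x) == -1]
    us = [x for x in individuals if father_membership(x) == 0]
    return ps, ns, us


ps, ns, us = split(INDIVIDUALS)
```

Individual c is a Man with no recorded child, so it stays unlabeled rather than negative; this is the ternary setting the slides refer to.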
DL-Foil
The algorithm
Given Tr and a partial description C in a disjunctive form (initialized C = ⊥):
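Start a new partial description C′ = ⊤
Refine C′: generate the candidate specializations {Di | Di ⊑ C′}
Find the best Di according to the heuristic
If Di still covers negative or unlabeled examples: set C′ = Di and refine again
If Di covers no negative examples: set C = C ⊔ Di and remove the covered positive examples from Ps
Repeat while Ps is not empty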
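The covering loop above can be sketched in a few lines of Python. This is a simplified illustration under several assumptions: concepts are conjunctions of named literals, coverage is looked up in hand-written fact sets instead of being computed by a reasoner, candidates are produced by adding one literal at a time, and the score is a plain stand-in (positives covered minus negatives and unlabeled covered), not the heuristic used by DL-Foil. All names (`covers`, `learn`, `FACTS`, `LITERALS`) are hypothetical.

```python
from typing import Dict, FrozenSet, List, Set

Concept = FrozenSet[str]  # conjunction of literals; the empty set plays the role of ⊤


def covers(concept: Concept, facts: Dict[str, Set[str]]) -> Set[str]:
    """Individuals whose known facts include every conjunct of the concept."""
    return {ind for ind, known in facts.items() if concept <= known}


def learn(pos: Set[str], neg: Set[str], unl: Set[str],
          facts: Dict[str, Set[str]], literals: List[str],
          max_steps: int = 10) -> List[Concept]:
    """Greedy covering loop; the result is read as a disjunction ([] means ⊥)."""

    def score(candidate: Concept) -> int:
        # Stand-in score (assumption): positives covered minus everything else.
        cov = covers(candidate, facts)
        return len(cov & pos) - len(cov & (neg | unl))

    disjuncts: List[Concept] = []
    remaining = set(pos)
    while remaining:
        partial: Concept = frozenset()  # start each partial description at ⊤
        for _ in range(max_steps):
            if not covers(partial, facts) & (neg | unl):
                break  # only positives are covered: stop specializing
            candidates = [partial | {lit} for lit in literals if lit not in partial]
            # Keep only candidates that still cover a remaining positive example.
            candidates = [c for c in candidates if covers(c, facts) & remaining]
            if not candidates:
                break
            partial = max(candidates, key=score)
        newly_covered = covers(partial, facts) & remaining
        if not newly_covered or covers(partial, facts) & neg:
            break  # no acceptable disjunct was found: give up
        disjuncts.append(partial)
        remaining -= newly_covered
    return disjuncts


# Hand-coded facts for the Father example (assumption: entailments written out by hand).
FACTS = {
    "a": {"Man", "notWoman", "Person", "hasChild"},
    "b": {"Man", "notWoman", "Person", "hasChild"},
    "c": {"Man", "notWoman", "Person"},
    "d": {"Woman", "notMan", "Person"},
    "f": {"Woman", "notMan", "Person"},
    "e": {"Artist"},
    "z": {"Dog"},
}
LITERALS = ["Man", "notWoman", "Person", "hasChild", "Woman", "notMan"]

result = learn({"a", "b"}, {"d", "f"}, {"c", "e", "z"}, FACTS, LITERALS)
```

With this stand-in score the toy run selects the single literal `hasChild` in one step, because it already excludes every negative and unlabeled individual. A different score or candidate generator would take a different path through the search space.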
DL-Foil: Covering Procedure Example
Input:
Tr = {a, b, d, f, c, e, z}
C = ⊥
Trace of the algorithm
1st ref. step: C = ⊤ (covered examples: Tr)
D∗ = ¬Woman (covered examples: a, b, c) (c ∈ Us, so further specialization
is required)
2nd ref. step: C = ¬Woman
D∗ = ¬Woman ⊓ ∃hasChild.Person (covered examples: a, b, i.e. all i ∈ Ps)
C = ⊥ ⊔ (¬Woman ⊓ ∃hasChild.Person)
Ps = Ps \ {a, b} = ∅
Specializing a concept
Generation of n concepts D ⊑ C by performing a sort of random sampling in the DL
concept space
Add a conjunct (randomly selected from the signature):
ρ1 D = C ⊓ A
ρ2 D = C ⊓ ¬A
ρ3 D = C ⊓ ∀R.⊤
ρ4 D = C ⊓ ∃R.⊤
Refine an existing sub-description (randomly selected):
ρ5 D = C1 ⊓ · · · ⊓ B ⊓ · · · ⊓ Cn
if C = C1 ⊓ · · · ⊓ A ⊓ · · · ⊓ Cn and B ⊑ A
ρ6 D = C1 ⊓ · · · ⊓ ¬B ⊓ · · · ⊓ Cn
if C = C1 ⊓ · · · ⊓ ¬A ⊓ · · · ⊓ Cn and A ⊑ B
ρ7 D = C1 ⊓ · · · ⊓ ∃R.F ⊓ · · · ⊓ Cn
if C = C1 ⊓ · · · ⊓ ∃R.E ⊓ · · · ⊓ Cn and F ∈ ρ(E)
ρ8 D = C1 ⊓ · · · ⊓ ∀R.F ⊓ · · · ⊓ Cn
if C = C1 ⊓ · · · ⊓ ∀R.E ⊓ · · · ⊓ Cn and F ∈ ρ(E)
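A minimal sketch of the random sampling behind these rules, assuming a concept is stored as a tuple of conjuncts and the empty tuple stands for ⊤. It covers ρ1 to ρ4 (adding a conjunct) and ρ7 and ρ8 (refining the filler of a quantified conjunct, here by a single application of ρ1 to ρ4). Rules ρ5 and ρ6 are left out because they need the class hierarchy. The representation and the names `add_conjunct`, `refine_filler`, `specializations`, and `show` are hypothetical, and no satisfiability check is performed.

```python
import random

TOP = ()  # the empty conjunction stands for ⊤


def add_conjunct(concept, atoms, roles, rng):
    """ρ1-ρ4: append one randomly chosen conjunct to the conjunction."""
    kind = rng.choice(["atom", "not", "all", "some"])
    if kind in ("atom", "not"):
        new = (kind, rng.choice(atoms))
    else:
        new = (kind, rng.choice(roles), TOP)
    return concept + (new,)


def refine_filler(concept, atoms, roles, rng):
    """ρ7-ρ8: refine the filler of one randomly chosen quantified conjunct."""
    positions = [i for i, c in enumerate(concept) if c[0] in ("all", "some")]
    if not positions:
        # Nothing to refine: fall back to adding a conjunct.
        return add_conjunct(concept, atoms, roles, rng)
    i = rng.choice(positions)
    kind, role, filler = concept[i]
    new_filler = add_conjunct(filler, atoms, roles, rng)
    return concept[:i] + ((kind, role, new_filler),) + concept[i + 1:]


def specializations(concept, atoms, roles, n, seed=0):
    """Draw n random specializations of the given concept (duplicates possible)."""
    rng = random.Random(seed)
    operators = [add_conjunct, refine_filler]
    return [rng.choice(operators)(concept, atoms, roles, rng) for _ in range(n)]


def show(concept):
    """Render a concept in the notation used on the slides."""
    if not concept:
        return "⊤"
    parts = []
    for c in concept:
        if c[0] == "atom":
            parts.append(c[1])
        elif c[0] == "not":
            parts.append("¬" + c[1])
        else:
            quantifier = "∀" if c[0] == "all" else "∃"
            filler = show(c[2])
            if len(c[2]) > 1:
                filler = "(" + filler + ")"
            parts.append(f"{quantifier}{c[1]}.{filler}")
    return " ⊓ ".join(parts)


person = (("atom", "Person"),)
candidates = specializations(person, ["Man", "Woman", "Dog"], ["hasChild"], n=5)
```

Because the draw is random, the sketch can return the same candidate twice or add a conjunct that is already present; a real implementation would also discard unsatisfiable results.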
Specializing a concept: examples
Let C = Person be the concept to be refined
D = Person ⊓ Man (using ρ1)
D = Person ⊓ ¬Woman (using ρ2)
D = Person ⊓ ∃hasChild.⊤ (using ρ4)
D = Person ⊓ ∀hasChild.⊤ (using ρ3)
Let C = Person ⊓ ∃hasChild.⊤
D = Person ⊓ ∃hasChild.Man
D = Person ⊓ ∃hasChild.Dog ← satisfiable w.r.t. the KB, but covers no positive examples!
Improving concept specialization
Further constraints are used in DL-FOIL to avoid "uninformative"
concepts:
the specialization procedure implementing ρ generates only concepts that
cover at least one positive example
the score of each specialization must exceed a threshold
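The two constraints amount to a filter over the generated candidates. The sketch below is one possible reading, with coverage and score supplied as callables; the function name `keep_informative`, the coverage sets, the scores, and the threshold value are all made up for illustration.

```python
from typing import Callable, Iterable, List, Set


def keep_informative(
    candidates: Iterable[str],
    coverage: Callable[[str], Set[str]],
    positives: Set[str],
    score: Callable[[str], float],
    threshold: float,
) -> List[str]:
    """Keep candidates that cover a positive example and score above the threshold."""
    kept = []
    for candidate in candidates:
        if not coverage(candidate) & positives:
            continue  # covers no positive example
        if score(candidate) <= threshold:
            continue  # score does not exceed the threshold
        kept.append(candidate)
    return kept


# Hypothetical coverage sets and scores for three candidate specializations.
COVERAGE = {
    "Person ⊓ ∃hasChild.⊤": {"a", "b"},
    "Person ⊓ ∃hasChild.Dog": set(),
    "Person ⊓ Man": {"a", "b", "c"},
}
SCORES = {
    "Person ⊓ ∃hasChild.⊤": 2.0,
    "Person ⊓ ∃hasChild.Dog": 0.5,
    "Person ⊓ Man": 0.1,
}

kept = keep_informative(
    COVERAGE, COVERAGE.__getitem__, {"a", "b"}, SCORES.__getitem__, threshold=0.2
)
```

Here the ∃hasChild.Dog candidate is dropped because it covers no positive example, and Person ⊓ Man is dropped because its (invented) score falls below the threshold.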
The score
DL-Foil selects the concept maximizing an information-theoretic heuristic:
$$g(D_0, D_1) = p_1 \cdot \left[ \log \frac{p_1}{p_1 + n_1 + u_1} - \log \frac{p_0}{p_0 + n_0 + u_0} \right]$$
D_0: the former partial definition
D_1: the specialization
p_1, n_1, u_1: the actual numbers of positive, negative, and unlabeled examples covered by D_1
p_0, n_0, u_0: the actual numbers of positive, negative, and unlabeled examples covered by D_0
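A small sketch of the score as a Python function. Two points are assumptions rather than facts from the slides: the logarithm is taken in base 2, as in the classic FOIL gain, and p_1 is read as multiplying the difference of the two logarithms. The function name `gain` and the error handling are illustrative choices.

```python
import math


def gain(p0, n0, u0, p1, n1, u1):
    """Score of specialization D1 relative to the former definition D0.

    Assumptions: base-2 logarithm, and p1 multiplies the whole difference.
    """
    if p0 <= 0 or p1 <= 0:
        # The logarithm is undefined when no positive example is covered.
        raise ValueError("both concepts must cover at least one positive example")
    ratio1 = p1 / (p1 + n1 + u1)
    ratio0 = p0 / (p0 + n0 + u0)
    return p1 * (math.log2(ratio1) - math.log2(ratio0))


# First refinement step of the covering example:
# D0 = ⊤ covers 2 positive, 2 negative, and 3 unlabeled individuals;
# D1 = ¬Woman covers 2 positive, 0 negative, and 1 unlabeled individual.
first_step = gain(2, 2, 3, 2, 0, 1)
```

The requirement that every specialization covers at least one positive example keeps p_1 ≥ 1, so the error branch is only a guard against misuse.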
Evaluation
Preliminary Experiments
Concept membership prediction
5 publicly available ontologies
15 artificially generated datasets:
random target concept generation
ground truth: individuals labeled according to the membership w.r.t. target
Competitor: CELOE
Experimental design: 0.632 bootstrap
Indices: membership w.r.t. the induced concept compared against membership
w.r.t. the target concept

Actual value    Prediction outcome
                pos.              neg.              unl.
pos.            match (M)         commission (C)    omission (O)
neg.            commission (C)    match (M)         omission (O)
unl.            induction (I)     induction (I)     match (M)
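The four indices follow directly from the table. The sketch below assumes memberships are encoded as +1 (positive), -1 (negative), and 0 (unlabeled); the encoding and the names `outcome` and `rates` are choices made for this example.

```python
from collections import Counter


def outcome(actual, predicted):
    """Map one (actual, predicted) pair to M, C, O, or I as in the table."""
    if actual == predicted:
        return "M"  # match
    if actual == 0:
        return "I"  # induction: a definite prediction for an unlabeled individual
    if predicted == 0:
        return "O"  # omission: no prediction for a labeled individual
    return "C"  # commission: opposite definite labels


def rates(actual, predicted):
    """Percentage of each outcome over all test individuals."""
    counts = Counter(outcome(a, p) for a, p in zip(actual, predicted))
    total = len(actual)
    return {index: 100.0 * counts[index] / total for index in "MCOI"}


actual = [1, 1, -1, 0, 0]
predicted = [1, -1, 0, 1, 0]
example = rates(actual, predicted)
```

The four percentages always sum to 100, which matches how each dataset block in the results is reported.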
Outcomes
Dataset      Index   DL-FOIL          CELOE
BIOPAX       M%      95.73 ± 03.74    94.53 ± 01.17
             C%      00.13 ± 00.20    03.24 ± 00.85
             O%      01.90 ± 03.31    01.62 ± 00.38
             I%      02.23 ± 00.40    00.61 ± 00.18
NTN          M%      97.78 ± 05.05    97.41 ± 00.15
             C%      00.05 ± 00.07    00.00 ± 00.00
             O%      02.17 ± 05.00    00.00 ± 00.00
             I%      00.01 ± 00.01    02.59 ± 00.15
HDISEASE     M%      88.75 ± 01.09    88.08 ± 01.09
             C%      00.04 ± 00.10    00.00 ± 00.00
             O%      03.64 ± 01.30    07.69 ± 00.90
             I%      07.57 ± 01.42    04.23 ± 00.24
FINANCIAL    M%      93.52 ± 01.02    87.40 ± 04.74
             C%      00.22 ± 00.21    06.33 ± 04.33
             O%      00.00 ± 00.00    00.00 ± 00.01
             I%      06.26 ± 00.88    06.26 ± 00.52
GEOSKILLS    M%      82.60 ± 04.69    50.20 ± 02.31
             C%      00.00 ± 00.00    23.66 ± 02.61
             O%      13.33 ± 04.43    01.34 ± 00.12
             I%      04.07 ± 04.09    24.80 ± 00.89
Conclusions, Ongoing & Future Work
Conclusions & Extensions
Modified version of DL-Foil with a different specialization procedure and
heuristic
In the evaluation, DL-Foil's average match rate ranges from 82.60% to 97.78% and is higher than CELOE's on all five ontologies
Ongoing & Future Work
New evaluations on larger knowledge bases
New specialization procedures
New heuristics
Scalability
Parallel computation
Distributed computation (Spark, Flink...)
Thank You!
Giuseppe Rizzo (LACAM-Dip.Informatica, Bari) DL-Foil EKAW 2018, Nancy, France – 15th November 2018

More Related Content

Similar to DL-Foil:Class Expression Learning Revisited

Asynchronous Stochastic Optimization, New Analysis and Algorithms
Asynchronous Stochastic Optimization, New Analysis and AlgorithmsAsynchronous Stochastic Optimization, New Analysis and Algorithms
Asynchronous Stochastic Optimization, New Analysis and AlgorithmsFabian Pedregosa
 
Towards new solutions for scientific computing: the case of Julia
Towards new solutions for scientific computing: the case of JuliaTowards new solutions for scientific computing: the case of Julia
Towards new solutions for scientific computing: the case of JuliaMaurizio Tomasi
 
The Lazy Traveling Salesman Memory Management for Large-Scale Link Discovery
The Lazy Traveling Salesman Memory Management for Large-Scale Link DiscoveryThe Lazy Traveling Salesman Memory Management for Large-Scale Link Discovery
The Lazy Traveling Salesman Memory Management for Large-Scale Link DiscoveryHolistic Benchmarking of Big Linked Data
 
WISS 2015 - Machine Learning lecture by Ludovic Samper
WISS 2015 - Machine Learning lecture by Ludovic Samper WISS 2015 - Machine Learning lecture by Ludovic Samper
WISS 2015 - Machine Learning lecture by Ludovic Samper Antidot
 
A Short Course in Data Stream Mining
A Short Course in Data Stream MiningA Short Course in Data Stream Mining
A Short Course in Data Stream MiningAlbert Bifet
 
Sessione II - Estimation methods and accuracy - P.D. Falorsi F. Petrarca, P...
Sessione II - Estimation methods and accuracy  -  P.D. Falorsi F. Petrarca, P...Sessione II - Estimation methods and accuracy  -  P.D. Falorsi F. Petrarca, P...
Sessione II - Estimation methods and accuracy - P.D. Falorsi F. Petrarca, P...Istituto nazionale di statistica
 
The probability of non-confluent systems
The probability of non-confluent systemsThe probability of non-confluent systems
The probability of non-confluent systemsAlejandro Díaz-Caro
 
Computational complexity
Computational complexityComputational complexity
Computational complexityFulvio Corno
 
Inducing Predictive Clustering Trees for Datatype properties Values
Inducing Predictive Clustering Trees for Datatype properties ValuesInducing Predictive Clustering Trees for Datatype properties Values
Inducing Predictive Clustering Trees for Datatype properties ValuesGiuseppe Rizzo
 
Scikit-Learn in Particle Physics
Scikit-Learn in Particle PhysicsScikit-Learn in Particle Physics
Scikit-Learn in Particle PhysicsGilles Louppe
 
DSD-INT 2018 Work with iMOD MODFLOW models in Python - Visser Bootsma
DSD-INT 2018 Work with iMOD MODFLOW models in Python - Visser BootsmaDSD-INT 2018 Work with iMOD MODFLOW models in Python - Visser Bootsma
DSD-INT 2018 Work with iMOD MODFLOW models in Python - Visser BootsmaDeltares
 
Recommendation Systems in banking and Financial Services
Recommendation Systems in banking and Financial ServicesRecommendation Systems in banking and Financial Services
Recommendation Systems in banking and Financial ServicesAndrea Gigli
 
Generative Adversarial Networks (D2L5 Deep Learning for Speech and Language U...
Generative Adversarial Networks (D2L5 Deep Learning for Speech and Language U...Generative Adversarial Networks (D2L5 Deep Learning for Speech and Language U...
Generative Adversarial Networks (D2L5 Deep Learning for Speech and Language U...Universitat Politècnica de Catalunya
 
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...Tomoyuki Suzuki
 
Lucas Theis - Compressing Images with Neural Networks - Creative AI meetup
Lucas Theis - Compressing Images with Neural Networks - Creative AI meetupLucas Theis - Compressing Images with Neural Networks - Creative AI meetup
Lucas Theis - Compressing Images with Neural Networks - Creative AI meetupLuba Elliott
 
Introduction to R
Introduction to RIntroduction to R
Introduction to RAli Ghods
 
Simple APIs and innovative documentation
Simple APIs and innovative documentationSimple APIs and innovative documentation
Simple APIs and innovative documentationPyDataParis
 

Similar to DL-Foil:Class Expression Learning Revisited (20)

Asynchronous Stochastic Optimization, New Analysis and Algorithms
Asynchronous Stochastic Optimization, New Analysis and AlgorithmsAsynchronous Stochastic Optimization, New Analysis and Algorithms
Asynchronous Stochastic Optimization, New Analysis and Algorithms
 
Towards new solutions for scientific computing: the case of Julia
Towards new solutions for scientific computing: the case of JuliaTowards new solutions for scientific computing: the case of Julia
Towards new solutions for scientific computing: the case of Julia
 
The Lazy Traveling Salesman Memory Management for Large-Scale Link Discovery
The Lazy Traveling Salesman Memory Management for Large-Scale Link DiscoveryThe Lazy Traveling Salesman Memory Management for Large-Scale Link Discovery
The Lazy Traveling Salesman Memory Management for Large-Scale Link Discovery
 
An Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link DiscoveryAn Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link Discovery
 
Link Discovery Tutorial Part I: Efficiency
Link Discovery Tutorial Part I: EfficiencyLink Discovery Tutorial Part I: Efficiency
Link Discovery Tutorial Part I: Efficiency
 
WISS 2015 - Machine Learning lecture by Ludovic Samper
WISS 2015 - Machine Learning lecture by Ludovic Samper WISS 2015 - Machine Learning lecture by Ludovic Samper
WISS 2015 - Machine Learning lecture by Ludovic Samper
 
A Short Course in Data Stream Mining
A Short Course in Data Stream MiningA Short Course in Data Stream Mining
A Short Course in Data Stream Mining
 
SASA 2016
SASA 2016SASA 2016
SASA 2016
 
Sessione II - Estimation methods and accuracy - P.D. Falorsi F. Petrarca, P...
Sessione II - Estimation methods and accuracy  -  P.D. Falorsi F. Petrarca, P...Sessione II - Estimation methods and accuracy  -  P.D. Falorsi F. Petrarca, P...
Sessione II - Estimation methods and accuracy - P.D. Falorsi F. Petrarca, P...
 
The probability of non-confluent systems
The probability of non-confluent systemsThe probability of non-confluent systems
The probability of non-confluent systems
 
Computational complexity
Computational complexityComputational complexity
Computational complexity
 
Inducing Predictive Clustering Trees for Datatype properties Values
Inducing Predictive Clustering Trees for Datatype properties ValuesInducing Predictive Clustering Trees for Datatype properties Values
Inducing Predictive Clustering Trees for Datatype properties Values
 
Scikit-Learn in Particle Physics
Scikit-Learn in Particle PhysicsScikit-Learn in Particle Physics
Scikit-Learn in Particle Physics
 
DSD-INT 2018 Work with iMOD MODFLOW models in Python - Visser Bootsma
DSD-INT 2018 Work with iMOD MODFLOW models in Python - Visser BootsmaDSD-INT 2018 Work with iMOD MODFLOW models in Python - Visser Bootsma
DSD-INT 2018 Work with iMOD MODFLOW models in Python - Visser Bootsma
 
Recommendation Systems in banking and Financial Services
Recommendation Systems in banking and Financial ServicesRecommendation Systems in banking and Financial Services
Recommendation Systems in banking and Financial Services
 
Generative Adversarial Networks (D2L5 Deep Learning for Speech and Language U...
Generative Adversarial Networks (D2L5 Deep Learning for Speech and Language U...Generative Adversarial Networks (D2L5 Deep Learning for Speech and Language U...
Generative Adversarial Networks (D2L5 Deep Learning for Speech and Language U...
 
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transforma...
 
Lucas Theis - Compressing Images with Neural Networks - Creative AI meetup
Lucas Theis - Compressing Images with Neural Networks - Creative AI meetupLucas Theis - Compressing Images with Neural Networks - Creative AI meetup
Lucas Theis - Compressing Images with Neural Networks - Creative AI meetup
 
Introduction to R
Introduction to RIntroduction to R
Introduction to R
 
Simple APIs and innovative documentation
Simple APIs and innovative documentationSimple APIs and innovative documentation
Simple APIs and innovative documentation
 

More from Giuseppe Rizzo

Boosting dl concept learners
Boosting dl concept learners Boosting dl concept learners
Boosting dl concept learners Giuseppe Rizzo
 
A framework for Tackling myopia in concept learning on the Web of Data
A framework for Tackling myopia in concept learning on the Web of DataA framework for Tackling myopia in concept learning on the Web of Data
A framework for Tackling myopia in concept learning on the Web of DataGiuseppe Rizzo
 
Terminological cluster trees for Disjointness Axiom Discovery
Terminological cluster trees for Disjointness Axiom DiscoveryTerminological cluster trees for Disjointness Axiom Discovery
Terminological cluster trees for Disjointness Axiom DiscoveryGiuseppe Rizzo
 
Approximating Numeric Role Fillers via Predictive Clustering Trees for Know...
Approximating Numeric Role Fillers via Predictive Clustering Trees  for  Know...Approximating Numeric Role Fillers via Predictive Clustering Trees  for  Know...
Approximating Numeric Role Fillers via Predictive Clustering Trees for Know...Giuseppe Rizzo
 
On the Effectiveness of Evidence-based Terminological Decision Trees
On the Effectiveness of Evidence-based Terminological Decision TreesOn the Effectiveness of Evidence-based Terminological Decision Trees
On the Effectiveness of Evidence-based Terminological Decision TreesGiuseppe Rizzo
 
Inductive Classification through Evidence-based Models and Their Ensemble
Inductive Classification through Evidence-based Models and Their EnsembleInductive Classification through Evidence-based Models and Their Ensemble
Inductive Classification through Evidence-based Models and Their EnsembleGiuseppe Rizzo
 
Tackling the Class Imbalance Learning Problem in Semantic Web Knowledge bases
Tackling the Class Imbalance Learning Problem in Semantic Web Knowledge basesTackling the Class Imbalance Learning Problem in Semantic Web Knowledge bases
Tackling the Class Imbalance Learning Problem in Semantic Web Knowledge basesGiuseppe Rizzo
 
Towards Evidence Terminological Decision Tree
Towards Evidence Terminological Decision TreeTowards Evidence Terminological Decision Tree
Towards Evidence Terminological Decision TreeGiuseppe Rizzo
 

More from Giuseppe Rizzo (8)

Boosting dl concept learners
Boosting dl concept learners Boosting dl concept learners
Boosting dl concept learners
 
A framework for Tackling myopia in concept learning on the Web of Data
A framework for Tackling myopia in concept learning on the Web of DataA framework for Tackling myopia in concept learning on the Web of Data
A framework for Tackling myopia in concept learning on the Web of Data
 
Terminological cluster trees for Disjointness Axiom Discovery
Terminological cluster trees for Disjointness Axiom DiscoveryTerminological cluster trees for Disjointness Axiom Discovery
Terminological cluster trees for Disjointness Axiom Discovery
 
Approximating Numeric Role Fillers via Predictive Clustering Trees for Know...
Approximating Numeric Role Fillers via Predictive Clustering Trees  for  Know...Approximating Numeric Role Fillers via Predictive Clustering Trees  for  Know...
Approximating Numeric Role Fillers via Predictive Clustering Trees for Know...
 
On the Effectiveness of Evidence-based Terminological Decision Trees
On the Effectiveness of Evidence-based Terminological Decision TreesOn the Effectiveness of Evidence-based Terminological Decision Trees
On the Effectiveness of Evidence-based Terminological Decision Trees
 
Inductive Classification through Evidence-based Models and Their Ensemble
Inductive Classification through Evidence-based Models and Their EnsembleInductive Classification through Evidence-based Models and Their Ensemble
Inductive Classification through Evidence-based Models and Their Ensemble
 
Tackling the Class Imbalance Learning Problem in Semantic Web Knowledge bases
Tackling the Class Imbalance Learning Problem in Semantic Web Knowledge basesTackling the Class Imbalance Learning Problem in Semantic Web Knowledge bases
Tackling the Class Imbalance Learning Problem in Semantic Web Knowledge bases
 
Towards Evidence Terminological Decision Tree
Towards Evidence Terminological Decision TreeTowards Evidence Terminological Decision Tree
Towards Evidence Terminological Decision Tree
 

Recently uploaded

2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Calllward7
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Valters Lauzums
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictJack Cole
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancingmohamed Elzalabany
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Jon Hansen
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理pyhepag
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfMichaelSenkow
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfscitechtalktv
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...Amil baba
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxStephen266013
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyRafigAliyev2
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理pyhepag
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理cyebo
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp onlinebalibahu1313
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理cyebo
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdfvyankatesh1
 
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...ssuserf63bd7
 

Recently uploaded (20)

2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancing
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp online
 
Machine Learning for Accident Severity Prediction
Machine Learning for Accident Severity PredictionMachine Learning for Accident Severity Prediction
Machine Learning for Accident Severity Prediction
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
 

DL-Foil:Class Expression Learning Revisited

  • 1. DL-FOIL: Class Expression Learning Revisited Nicola Fanizzi, Giuseppe Rizzo, Claudia d’Amato, Floriana Esposito LACAM - Dipartimento di Informatica, Universit`a degli Studi di Bari Aldo Moro EKAW 2018, Nancy, France – 15th November 2018
  • 2. Outline 1 Introduction 2 The problem 3 DL-Foil 4 Evaluation 5 Conclusions, Ongoing & Future Work Giuseppe Rizzo (LACAM-Dip.Informatica, Bari) DL-Foil EKAW 2018, Nancy, France – 15th November 2018
  • 3. Introduction Introduction Giuseppe Rizzo (LACAM-Dip.Informatica, Bari) DL-Foil EKAW 2018, Nancy, France – 15th November 2018
  • 4. Introduction Motivations Goal Eliciting candidate concept descriptions for semi-automatic knowledge base completion TBox: candidate (equivalence) axioms ABox: candidate (class) assertions by classifying individuals Solutions (Supervised) Machine learning methods: E.g. concept learning: symbolic methods for producing a concept description using a set of pos./neg./unlabeled. examples Giuseppe Rizzo (LACAM-Dip.Informatica, Bari) DL-Foil EKAW 2018, Nancy, France – 15th November 2018
  • 5. Introduction: Motivations, previous solutions and current limits
    DL-Foil produces a concept description in disjunctive form that provides a consistent classification of the examples:
      - ternary problem (positive, negative and unlabeled examples) under the open-world assumption (OWA)
      - a partial description is generated on the fly to cover as many positive examples as possible
      - the partial description is selected among a set of candidates according to a heuristic
    Problems:
      - generated descriptions may cover no positive examples
      - unlabeled individuals contribute equally to the score used for candidate evaluation
    Contribution: improving both the specialization procedure and the heuristic, which now considers the actual number of unlabeled individuals
  • 6. The problem
  • 7. The problem: The learning problem
    Let K = ⟨T, A⟩ be a DL knowledge base.
    Given: a concept name C and a training set Tr = (Ps, Ns, Us)
    Find: a concept description D such that, letting K′ = ⟨T ∪ {C ≡ D}, A⟩, the following entailments hold:
      - ∀a ∈ Ps: K′ ⊨ C(a)
      - ∀b ∈ Ns: K′ ⊨ ¬C(b)
    i.e. D is correct w.r.t. the examples and general enough for predictive purposes
  • 8. The problem: The learning problem, an example
    Let K = { Man ⊑ Person, Man ⊑ ¬Woman, Woman ⊑ Person, Man(a), Man(b), Man(c), hasChild(a,d), hasChild(b,e), Woman(d), Woman(f), Artist(e), Dog(z) }
    Target concept: Father, i.e. a man with at least one child
      - Ps = {a, b}
      - Ns = {d, f} (due to Man ⊑ ¬Woman)
      - Us = {c, e, z}
    Induced concept: Father ≡ Man ⊓ ∃hasChild.⊤
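The split of the example individuals into Ps, Ns and Us can be reproduced with a small sketch. This is only a toy: the functions `is_man`, `has_child`, `father` and `split` are hypothetical names, and the open-world reasoning is hand-coded for this one knowledge base instead of being delegated to a DL reasoner.

```python
# Toy illustration only: a hand-coded stand-in for a DL reasoner.
# Class and role assertions copied from the example knowledge base.
TYPES = {
    "a": {"Man"}, "b": {"Man"}, "c": {"Man"},
    "d": {"Woman"}, "f": {"Woman"},
    "e": {"Artist"}, "z": {"Dog"},
}
HAS_CHILD = {("a", "d"), ("b", "e")}
DISJOINT = {("Man", "Woman"), ("Woman", "Man")}  # from Man ⊑ ¬Woman


def is_man(x):
    """Ternary membership in Man: True, False, or None (unknown)."""
    if "Man" in TYPES[x]:
        return True
    if any((t, "Man") in DISJOINT for t in TYPES[x]):
        return False
    return None


def has_child(x):
    """True if a child is asserted; otherwise unknown under the OWA."""
    return True if any(parent == x for parent, _ in HAS_CHILD) else None


def father(x):
    """Membership in Man ⊓ ∃hasChild.⊤ using a three-valued conjunction."""
    m, h = is_man(x), has_child(x)
    if m is False or h is False:
        return False
    if m is True and h is True:
        return True
    return None


def split(individuals):
    """Partition individuals into positive, negative and unlabeled examples."""
    ps = {x for x in individuals if father(x) is True}
    ns = {x for x in individuals if father(x) is False}
    us = set(individuals) - ps - ns
    return ps, ns, us
```

Note that c stays unlabeled: he is a Man, but the absence of a hasChild assertion does not entail that he has no child.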
  • 9. DL-Foil
  • 14. DL-Foil: The algorithm
    Given Tr and a partial description C in disjunctive form (initialized to C = ⊥):
      1. Set C′ = ⊤.
      2. Refine C′, generating the candidates {Di | Di ⊑ C′}.
      3. Find the best Di.
      4. If a negative or unlabeled example is still covered, refine again (back to step 2).
      5. If no negative examples are covered, set C = C ⊔ Di and remove the covered positive examples.
  • 15. DL-Foil: Covering procedure, example
    Input: Tr = {a, b, d, f, c, e, z}, C = ⊥
    Trace of the algorithm:
      - 1st refinement step: C′ = ⊤ (covered examples: Tr)
        D* = ¬Woman (covered examples: a, b, c); c ∈ Us, so further specialization is required
      - 2nd refinement step: C′ = ¬Woman
        D* = ¬Woman ⊓ ∃hasChild.Person (covered examples: a, b, i.e. all i ∈ Ps)
      - C = ⊥ ⊔ (¬Woman ⊓ ∃hasChild.Person)
      - Ps = Ps \ {a, b} = ∅
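The covering loop and the trace above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the extensions in `EXT` are copied from the slide trace instead of being computed by a reasoner, the table `REFINEMENTS` is a hypothetical canned stand-in for the randomized refinement operator, and the gain uses base-2 logarithms (the slides do not state the base).

```python
import math

PS, NS, US = {"a", "b"}, {"d", "f"}, {"c", "e", "z"}

# Extension of each conjunct, copied from the slide trace (assumption:
# not derived by a reasoner).
EXT = {
    "¬Woman": {"a", "b", "c"},
    "Person": {"a", "b", "c", "d", "f"},
    "∃hasChild.Person": {"a", "b"},
}
# Hypothetical canned refinements standing in for the randomized operator ρ.
# A concept is a tuple of conjuncts; the empty tuple plays the role of ⊤.
REFINEMENTS = {
    (): [("¬Woman",), ("Person",)],
    ("¬Woman",): [("¬Woman", "∃hasChild.Person")],
}


def covers(concept, x):
    """An individual is covered if it belongs to every conjunct."""
    return all(x in EXT[c] for c in concept)


def counts(concept, ps):
    """Number of positive, negative and unlabeled examples covered."""
    p = sum(covers(concept, x) for x in ps)
    n = sum(covers(concept, x) for x in NS)
    u = sum(covers(concept, x) for x in US)
    return p, n, u


def gain(d0, d1, ps):
    """Gain of specializing d0 into d1 (log base 2 is an assumption)."""
    p0, n0, u0 = counts(d0, ps)
    p1, n1, u1 = counts(d1, ps)
    return p1 * (math.log2(p1 / (p1 + n1 + u1)) - math.log2(p0 / (p0 + n0 + u0)))


def cover(ps, max_steps=10):
    """Greedy covering: build disjuncts until all positives are covered."""
    disjuncts, remaining = [], set(ps)
    while remaining:
        current = ()  # start each disjunct from ⊤
        for _ in range(max_steps):
            if not any(covers(current, x) for x in NS | US):
                break  # nothing negative or unlabeled is covered any more
            candidates = [d for d in REFINEMENTS.get(current, [])
                          if any(covers(d, x) for x in remaining)]
            if not candidates:
                break
            current = max(candidates, key=lambda d: gain(current, d, remaining))
        newly = {x for x in remaining if covers(current, x)}
        if not newly or any(covers(current, x) for x in NS):
            break  # give up rather than add an inconsistent disjunct
        disjuncts.append(current)
        remaining -= newly
    return disjuncts
```

On this toy input, `cover(PS)` selects ¬Woman first and then specializes it to ¬Woman ⊓ ∃hasChild.Person, mirroring the two refinement steps of the trace.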
  • 17. DL-Foil: Specializing a concept
    Generation of n concepts D ⊑ C by performing a sort of random sampling in the DL concept space.
    Add a conjunct (randomly selected from the signature):
      - ρ1: D = C ⊓ A
      - ρ2: D = C ⊓ ¬A
      - ρ3: D = C ⊓ ∀R.⊤
      - ρ4: D = C ⊓ ∃R.⊤
  • 18. DL-Foil: Specializing a concept
    Generation of n concepts D ⊑ C by performing a sort of random sampling in the DL concept space.
    Refine an existing sub-description (randomly selected):
      - ρ5: D = C1 ⊓ ··· ⊓ B ⊓ ··· ⊓ Cn if C = C1 ⊓ ··· ⊓ A ⊓ ··· ⊓ Cn and B ⊑ A
      - ρ6: D = C1 ⊓ ··· ⊓ ¬B ⊓ ··· ⊓ Cn if C = C1 ⊓ ··· ⊓ ¬A ⊓ ··· ⊓ Cn and A ⊑ B
      - ρ7: D = C1 ⊓ ··· ⊓ ∃R.F ⊓ ··· ⊓ Cn if C = C1 ⊓ ··· ⊓ ∃R.E ⊓ ··· ⊓ Cn and F ∈ ρ(E)
      - ρ8: D = C1 ⊓ ··· ⊓ ∀R.F ⊓ ··· ⊓ Cn if C = C1 ⊓ ··· ⊓ ∀R.E ⊓ ··· ⊓ Cn and F ∈ ρ(E)
  • 21. DL-Foil: Specializing a concept, examples
    Let C = Person be the concept to be refined:
      - D = Person ⊓ Man (using ρ1)
      - D = Person ⊓ ¬Woman (using ρ2)
      - D = Person ⊓ ∀hasChild.⊤ (using ρ3)
      - D = Person ⊓ ∃hasChild.⊤ (using ρ4)
    Let C = Person ⊓ ∃hasChild.⊤:
      - D = Person ⊓ ∃hasChild.Man
      - D = Person ⊓ ∃hasChild.Dog (satisfiable w.r.t. the KB, but covers no positive examples!)
  • 22. DL-Foil: Improving concept specialization
    Further constraints are used in DL-FOIL to avoid "uninformative" concepts:
      - the specialization procedure implementing ρ only generates concepts that cover at least one positive example
      - the score of each specialization must exceed a threshold
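A rough sketch of the randomized specialization, restricted to the four conjunct-adding rules ρ1 to ρ4 and to the constraint that a refinement must cover at least one positive example. Everything here is illustrative: the signature lists, the function names and the `covers_positive` callback (which a real system would implement with a reasoner) are assumptions, and rules ρ5 to ρ8 and the score threshold are omitted for brevity.

```python
import random

# Hypothetical signature taken from the running example.
ATOMIC = ["Person", "Man", "Woman", "Artist", "Dog"]
ROLES = ["hasChild"]


def specialize(concept, rng):
    """Apply one of ρ1..ρ4: add one randomly chosen conjunct.

    A concept is a tuple of conjunct strings (read as their conjunction).
    """
    rule = rng.choice(["rho1", "rho2", "rho3", "rho4"])
    if rule == "rho1":
        new = rng.choice(ATOMIC)                # C ⊓ A
    elif rule == "rho2":
        new = "¬" + rng.choice(ATOMIC)          # C ⊓ ¬A
    elif rule == "rho3":
        new = "∀" + rng.choice(ROLES) + ".⊤"    # C ⊓ ∀R.⊤
    else:
        new = "∃" + rng.choice(ROLES) + ".⊤"    # C ⊓ ∃R.⊤
    return concept if new in concept else concept + (new,)


def sample_refinements(concept, n, covers_positive, rng, max_tries=1000):
    """Collect up to n distinct specializations covering a positive example."""
    found = []
    for _ in range(max_tries):
        if len(found) == n:
            break
        d = specialize(concept, rng)
        if d != concept and d not in found and covers_positive(d):
            found.append(d)
    return found
```

For example, `sample_refinements(("Person",), 3, lambda d: True, random.Random(0))` returns three distinct two-conjunct specializations of Person; passing a stricter `covers_positive` filters out candidates such as the ∃hasChild.Dog case.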
  • 23. DL-Foil: The score
    DL-Foil selects the concept maximizing an information-theoretic heuristic:
      $g(D_0, D_1) = p_1 \cdot \left[ \log \frac{p_1}{p_1 + n_1 + u_1} - \log \frac{p_0}{p_0 + n_0 + u_0} \right]$
      - D0: the former partial definition
      - D1: the specialization
      - p1, n1, u1: the actual number of positive, negative and unlabeled examples covered by D1
      - p0, n0, u0: the actual number of positive, negative and unlabeled examples covered by D0
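As a worked illustration, the score can be evaluated on the two refinement steps of the covering example (Ps = {a, b}, Ns = {d, f}, Us = {c, e, z}). The base of the logarithm is not given on the slides; base 2 is assumed here.

```latex
\begin{align*}
g(\top,\ \lnot\mathit{Woman})
  &= 2 \cdot \left[\log_2 \tfrac{2}{2+0+1} - \log_2 \tfrac{2}{2+2+3}\right]
   = 2 \log_2 \tfrac{7}{3} \approx 2.44 \\
g(\lnot\mathit{Woman},\ \lnot\mathit{Woman} \sqcap \exists\mathit{hasChild}.\mathit{Person})
  &= 2 \cdot \left[\log_2 \tfrac{2}{2+0+0} - \log_2 \tfrac{2}{2+0+1}\right]
   = 2 \log_2 \tfrac{3}{2} \approx 1.17
\end{align*}
```

Both gains are positive because each step keeps the two positive examples while discarding negative or unlabeled ones, which raises the fraction of covered examples that are positive.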
  • 24. Evaluation
  • 25. Evaluation: Preliminary experiments
    Concept membership prediction:
      - 5 publicly available ontologies
      - 15 artificially generated datasets: random target concept generation
      - ground truth: individuals labeled according to their membership w.r.t. the target
    Competitor: CELOE
    Experimental design: 0.632 bootstrap
    Indices: membership w.r.t. the induced concept against membership w.r.t. the target
                      Prediction outcome
      Actual value    pos.             neg.             unl.
      pos.            match (M)        commission (C)   omission (O)
      neg.            commission       match            omission
      unl.            induction (I)    induction        match
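The four indices can be computed from paired labels along these lines. This is a sketch that assumes the reading of the matrix in which rows are actual values and columns are predictions; the encoding of labels as +1, -1 and 0 and the function name `rates` are hypothetical.

```python
from collections import Counter


def rates(predicted, actual):
    """Return match, commission, omission and induction rates.

    Labels: +1 (positive), -1 (negative), 0 (unlabeled/unknown).
    """
    tally = Counter()
    for p, a in zip(predicted, actual):
        if p == a:
            tally["match"] += 1
        elif p == 0:
            tally["omission"] += 1    # definite actual label, unknown prediction
        elif a == 0:
            tally["induction"] += 1   # unknown actual label, definite prediction
        else:
            tally["commission"] += 1  # opposite definite labels
    total = len(actual)
    return {k: tally[k] / total
            for k in ("match", "commission", "omission", "induction")}
```

For instance, `rates([1, -1, 0, 1, 0, -1], [1, 1, 0, 0, 1, -1])` yields a match rate of 0.5 and one sixth each for commission, omission and induction.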
  • 26. Evaluation: Outcomes
      Dataset     Index   DL-FOIL          CELOE
      BIOPAX      M%      95.73 ± 03.74    94.53 ± 01.17
                  C%      00.13 ± 00.20    03.24 ± 00.85
                  O%      01.90 ± 03.31    01.62 ± 00.38
                  I%      02.23 ± 00.40    00.61 ± 00.18
      NTN         M%      97.78 ± 05.05    97.41 ± 00.15
                  C%      00.05 ± 00.07    00.00 ± 00.00
                  O%      02.17 ± 05.00    00.00 ± 00.00
                  I%      00.01 ± 00.01    02.59 ± 00.15
      HDISEASE    M%      88.75 ± 01.09    88.08 ± 01.09
                  C%      00.04 ± 00.10    00.00 ± 00.00
                  O%      03.64 ± 01.30    07.69 ± 00.90
                  I%      07.57 ± 01.42    04.23 ± 00.24
      FINANCIAL   M%      93.52 ± 01.02    87.40 ± 04.74
                  C%      00.22 ± 00.21    06.33 ± 04.33
                  O%      00.00 ± 00.00    00.00 ± 00.01
                  I%      06.26 ± 00.88    06.26 ± 00.52
      GEOSKILLS   M%      82.60 ± 04.69    50.20 ± 02.31
                  C%      00.00 ± 00.00    23.66 ± 02.61
                  O%      13.33 ± 04.43    01.34 ± 00.12
                  I%      04.07 ± 04.09    24.80 ± 00.89
  • 27. Conclusions, Ongoing & Future Work
  • 28. Conclusions, Ongoing & Future Work: Conclusions & Extensions
    Conclusions:
      - Modified version of DL-Foil with a different specialization procedure and heuristic
      - The evaluation shows good results in terms of match rate
    Ongoing & Future Work:
      - New evaluations on larger knowledge bases
      - New specialization procedures
      - New heuristics
      - Scalability: parallel computation, distributed computation (Spark, Flink...)
  • 29. Conclusions, Ongoing & Future Work: Thank You!