Fanizzi Ilp2008 Kernel

ILP 2008 – Prague

Learning with Kernels
in Description Logics
Nicola Fanizzi
Claudia d’Amato
Floriana Esposito

LACAM - Dipartimento di Informatica
Università degli studi di Bari

Motivation
In the Semantic Web context,
uncertainty due to
Incoherence


heterogeneous / distributed knowledge


sources
Inherent incompleteness


Open-World Semantics


need for alternative methods wrt purely logic
(deductive) reasoning
09/11/08 ILP 2008 - N. Fanizzi 2

Wine Example
Considering the well-known WINE ontology,
some non logically derivable assertions:
KathrynKennedyLateral


known as a Meritage wine,
but not as a CaliforniaWine and an
AmericanWine
CotturiZinfandel


known as a Zinfandel
it is not a CabernetSauvignon
(a non-disjoint sibling class)

09/11/08 ILP 2008 - N. Fanizzi 3

Our Proposal
Inductive Inference
Non-parametric statistical learning methods


applied to standard ontology languages
based on epistemic inference: underlying


semantic similarity between individuals as
elicited from KB
Distance & Kernel functions


A learning framework for based on them


Inductive classifiers for inductive instance


check
09/11/08 ILP 2008 - N. Fanizzi 4

Application
Approximate
concept retrieval & query-answering
using inductive classifiers:
Kernel Machines (SVMs)


based on kernel functions


for individuals in KBs

09/11/08 ILP 2008 - N. Fanizzi 5

Description Logics
Building blocks:


NC = {C, D, ... } primitive concept names
{C


NR = {R, Q, ... } primitive role names
{R


NI = {a, b, ... } individual names
{a


Complex descriptions built using the language


constructors (next slide)
(∆
set-theoretic Interpretation I = (∆I, ·I)


Int. domain: ∆I of the discourse


Int. mapping: ·I from NI to ∆I


09/11/08 ILP 2008 - N. Fanizzi 6

ALCN Logic

CI extension of concept C in the interpretation I


RI extension of role R in the interpretation I


09/11/08 ILP 2008 - N. Fanizzi 7

DL Knowledge Bases
KB: K = <T , A >
<T


TBox T : set of axioms regarding the concepts


C≡ D


equivalent to CI ⊆ DI ∧ DI ⊆ CI in the


interpretations satisfying the KB
ABox A : set of assertions on the individuals


C (a )


 R (a , b )

Open-world assumption


09/11/08 ILP 2008 - N. Fanizzi 8

Royal Family Example

09/11/08 ILP 2008 - N. Fanizzi 9

Normal Form
Negation normal form employed by the tableau-


reasoners [Baader et al., 2003]
Formulae as AND-OR trees


C: └┘ D: └┘

┌┐ { P , P } ┌┐ { }
┌┐ { } { P3 }
1 2
∀R
∃R ∃R ∃R

{ P1, ¬ P2 } { ¬ P1 }
{ P3 } └┘

┌┐ { }
∀R

{ P2 }
09/11/08 ILP 2008 - N. Fanizzi 10

Normal Form: a definition

09/11/08 ILP 2008 - N. Fanizzi 11

Kernels for ALCN

09/11/08 ILP 2008 - N. Fanizzi 12

Kernels for ALCN / 2

09/11/08 ILP 2008 - N. Fanizzi 13

Validity

Easily proven


Limited recursion


Closure wrt std operations


09/11/08 ILP 2008 - N. Fanizzi 14

Problems with DL KBs
From concepts to indivuals
concepts


Multiple-class classification
Multiple-class


Not necessarily disjoint classes


Open-world semantics


Use MSC approximation


Decompose in multiple learning problems


Training instances may be unlabeled


Possible values for estimator V = {-1, 0, +1}
{-1, +1}


09/11/08 ILP 2008 - N. Fanizzi 15

SVMs
Kernels implemented and integrated in LIBSVM


Train 2 classifiers


for the positive classification


for the negative classification


Decision procedure used for instance checking


09/11/08 ILP 2008 - N. Fanizzi 16

Experimental Setup
For each KB


classification of all their concepts


 randomly generated query concepts (30)

 composition of primitive or defined

concepts (2 through 8)
SVMs: default parametres values


10-fold Cross Validation test procedure
10-fold


09/11/08 ILP 2008 - N. Fanizzi 17

Evaluation Indices
Deductive / Inductive classifier
match rate:
rate:
-1/-1 0/0 +1/+1
omission error rate:
rate:
-1/0 +1/0
commission error rate:
rate:
-1/+1 +1/-1
induction rate:
rate:
0/-1 0/+1
09/11/08 ILP 2008 - N. Fanizzi 18

Ontologies

From Protégé's Ontology library


and other benchmarks


09/11/08 ILP 2008 - N. Fanizzi 19

Results / 1

09/11/08 ILP 2008 - N. Fanizzi 20

Results / 2

09/11/08 ILP 2008 - N. Fanizzi 21

Results / 3

09/11/08 ILP 2008 - N. Fanizzi 22

Results / 4

09/11/08 ILP 2008 - N. Fanizzi 23

Concluding Remarks
Non-parametric What's next:


statistical learning to Better eval. indices

work on DL KBs
New kernels for


Application to

OWL @ ISWC2008
approximate
unsupervised

classification query
learning @
answering
ESWC08
Encouraging

Uncertainty models

experimental results
 Probabilistic m.

 Rough DL

09/11/08 ILP 2008 - N. Fanizzi 24

The End

Questions ?
Offline:
Nicola Fanizzi fanizzi@di.uniba.it


Claudia d'Amato claudia.damato@di.uniba.it


Floriana Esposito esposito@di.uniba.it


LACAM - Dipartimento di Informatica
Università degli studi di Bari
Via E. Orabona, 4 - 70125 Bari, Italy
http://lacam.di.uniba.it:8000/
09/11/08 ILP 2008 - N. Fanizzi 25

Fanizzi Ilp2008 Kernel

Recommended

Recommended

More Related Content

Featured

Featured (20)

Fanizzi Ilp2008 Kernel