Verifying Integrity Constraints of a RDF-based
WordNet
Fabricio Chalub
Alexandre Rademaker
IBM Research, Brazil
January 30, 2016
Rademaker, Chalub (IBM Research) ICC on OpenWordnet-PT January 30, 2016 1 / 21
OpenWordnet-PT
http://wnpt.brlcloud.com/wn/
Goal: not a simple translation of Princeton WordNet (PWN), but a wordnet based on the PWN architecture.
Originally created from a Portuguese (PT) projection of the Universal WordNet
(Gerard de Melo).
Three language strategies in its lexical enrichment process: (i)
translation; (ii) corpus extraction; (iii) dictionaries.
Corpora: AC/DC project, DHBB CPDOC/FGV etc.
LR: morphosemantic links, nominalizations from NomLex, Nomage and
Wiktionary etc.
Freely available since Dec 2011. Download as RDF files, query via
SPARQL or browse via web interface (above).
Used by “Google Translate”, FreeLing, OMW, BabelNet and Onto.PT.
Why RDF
<definition gloss="This is my definition">
<meta creator="me"/></definition>
<definition><meta creator="me"/>
This is my definition</definition>
<definition><meta creator="me"/>
<text>This is my definition</text>
</definition>
<lmf:definition meta:creator="me">
This is my definition</lmf:definition>
“Why RDF model is different from the XML model” by Tim Berners-Lee
(1998). http://www.w3.org/DesignIssues/RDF-XML.html
Why RDF (cont.)
There is a mapping from XML documents to semantic graphs.
The element names were a big hint for a human reader.
Without the schema (DTD, XML Schema), you know things about
the doc structure, but nothing else. You can’t tell what to deduce.
You can’t even really tell what real questions can be asked.
(1) the mapping is many to one; (2) you need a schema to know what
the mapping is (and there is no inference language); (3) the expression
you need for querying something in terms of the XML tree is
necessarily more complicated than the expression you need for
querying something in terms of the RDF tree.
Example query: “give me the properties with the same metadata as this one”.
Why RDF (cont.)
Giving a machine a knowledge tree vs. giving a person a document.
A document for a person is generally serialized so that, when read
serially by a human being, the result will be to build up a graph of
associations in that person’s head. The order is important.
For a graph of knowledge, order is not important, so long as the nodes
in common between different statements are identified consistently.
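A minimal sketch of this point, assuming a graph is modelled as a plain Python set of (subject, predicate, object) triples. The ex: names are made up for illustration and are not OpenWordnet-PT identifiers.

```python
# Illustrative triples; the ex: identifiers are hypothetical.
doc_a = [
    ("ex:def1", "ex:gloss", "This is my definition"),
    ("ex:def1", "ex:creator", "me"),
]
# The same statements, serialized in the opposite order.
doc_b = list(reversed(doc_a))

# As documents (sequences) the two differ; as graphs (sets) they are equal.
same_as_documents = doc_a == doc_b
same_as_graphs = set(doc_a) == set(doc_b)
print(same_as_documents, same_as_graphs)
```

The comparison only works because both lists name the shared node ex:def1 identically, which is the "identified consistently" condition.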
Linked Data
Use URIs as names for things
Use HTTP URIs so that people can look up those names
When someone looks up a URI, provide useful information, using the
standards (RDF, SPARQL)
Include links to other URIs, so that they can discover more things
(ILI?)
A number of Linked Data projects for lexical resources.
“Linked Data” by Tim Berners-Lee (2006).
Some words about vocabularies
To encode data, we need to decide which classes and properties to
use!
There are different vocabularies for encoding wordnets in RDF!
Adopting already defined vocabularies helps data interoperability,
since it makes the data easier to integrate with other resources.
We use http://www.w3.org/TR/wordnet-rdf/ from 2006.
Scripts available at http://github.com/own-pt/.
Namespaces
1 https://w3id.org/own-pt/wn30/schema/
2 https://w3id.org/own-pt/wn30-pt/instances/
3 https://w3id.org/own-pt/wn30-en/instances/
4 https://w3id.org/own-pt/nomlex/schema/
5 https://w3id.org/own-pt/nomlex/instances/
This paper
Our first attempt at verifying integrity constraints of OpenWordnet-PT
against the ontology for encoding wordnets.
Correcting and improving linguistic data is a hard task.
So far, there are no clear criteria for the semantic evaluation of wordnets, nor ways of
comparing their relative quality or accuracy. Thus the qualitative assessment
of a new wordnet seems, at present, a matter of judgment and art.
OWL and RDF
Consistency check of OWL and Integrity Constraints in RDF
OWL Lite and OWL DL semantics are based on Description logics.
DL are a family of logics that are decidable fragments of first-order
logic with attractive and well-understood computational properties.
A DL knowledge base comprises two components, the TBox and the
ABox. The TBox contains intensional knowledge (terminology).
The ABox contains extensional knowledge (assertions).
Intensional knowledge is usually thought not to change and
extensional knowledge is usually thought to be contingent.
Reasoning
Given an ontology encoded in OWL (Lite or DL) one can use DL reasoners
for different tasks such as concept consistency checking, query
answering, classification, etc.
The basic reasoning task in an ABox is instance checking, which verifies
whether a given individual is an instance of (or belongs to) a specified
concept.
In some use cases, we want a method for validating the RDF data
against a given model. In this case, OWL users intend OWL axioms to
be interpreted as constraints on RDF data.
CWA vs. OWA
OWL default semantics adopts the Open World Assumption (OWA) and
does not adopt the Unique Name Assumption (UNA).
Due to OWA, a statement must not be inferred to be false on the basis of
failures to prove it; the fact that a piece of information has not been
specified does not mean that such information does not exist.
On the other hand, the absence of UNA allows two different constants to
refer to the same individual.
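A minimal sketch of the difference in how a missing statement is answered, assuming a knowledge base that is just a set of asserted triples with no entailment and no negative axioms. The ex: identifiers and both function names are hypothetical.

```python
# Hypothetical facts; the names are illustrative only.
facts = {("ex:synset1", "ex:hyponymOf", "ex:synset2")}


def holds_cwa(triple, kb):
    """Closed world: anything not stated is taken to be false."""
    return triple in kb


def holds_owa(triple, kb):
    """Open world: a missing statement is unknown (None), not false."""
    return True if triple in kb else None


missing = ("ex:synset2", "ex:hyponymOf", "ex:synset3")
print(holds_cwa(missing, facts))
print(holds_owa(missing, facts))
```

A validator needs the closed-world answer: a missing required statement has to be reported as an error, not left open.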
Queries in SPARQL
One more motivation for RDF:
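# Words that have senses in two synsets linked by a chain of hyponymOf.
PREFIX wn30: <https://w3id.org/own-pt/wn30/schema/>

SELECT ?w ?ws1 ?ws2
WHERE {
  ?ss1 wn30:containsWordSense ?ws1 .
  ?ws1 wn30:word ?w .
  ?ss2 wn30:containsWordSense ?ws2 .
  ?ws2 wn30:word ?w .
  # The * path also matches zero steps, so ?ss1 = ?ss2 is included.
  ?ss1 wn30:hyponymOf* ?ss2 .
}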
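The graph pattern of the query above can be mimicked over in-memory data. This is a sketch under assumptions: the three dictionaries stand in for wn30:containsWordSense, wn30:word and wn30:hyponymOf, and every identifier in them is hypothetical toy data, not taken from OpenWordnet-PT.

```python
# Toy data; all identifiers below are hypothetical.
contains = {"ss-dog": ["ws-dog-1"], "ss-animal": ["ws-animal-1", "ws-dog-2"]}
word_of = {"ws-dog-1": "dog", "ws-animal-1": "animal", "ws-dog-2": "dog"}
hyponym_of = {"ss-dog": ["ss-animal"]}


def ancestors_or_self(synset):
    """Synsets reachable by zero or more hyponymOf steps (the * path)."""
    seen, stack = set(), [synset]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(hyponym_of.get(current, []))
    return seen


def shared_words():
    """Rows (word, sense1, sense2) where sense2 sits at or above sense1."""
    rows = set()
    for ss1, senses1 in contains.items():
        for ss2 in ancestors_or_self(ss1):
            for ws1 in senses1:
                for ws2 in contains.get(ss2, []):
                    if word_of[ws1] == word_of[ws2]:
                        rows.add((word_of[ws1], ws1, ws2))
    return rows


rows = shared_words()
print(sorted(rows))
```

Because a * path matches zero steps, every word sense is also paired with itself. Filtering out rows where the two senses are equal, or using a one-or-more path, would keep only the pairs that cross synsets.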
Tools
Protégé is an ontology editor that, among other features, interfaces with
well-known DL reasoners such as FaCT++ and HermiT.
Starting in version 4, Protégé also gives us an interface to search for
explanations of what caused an inconsistency. Racer and Pellet are reasoners
that have this feature built in.
RDFpro for combining, splitting and syntax-checking RDF files.
Stardog and AllegroGraph (AG) triplestores. Stardog includes integrity constraint validation (ICV); AG has
RDFS++ semantics.
Errors
The errors found fall into three classes: datatype errors,
domain and range errors, and structural errors.
Missing class and property definitions. We improved the OWL file.
wn30:AdjectiveWordSense rdfs:subClassOf
wn30:WordSense .
Literal values with types.
Literal value "00113726" does not
belong to datatype nonNegativeInteger
Errors
current account is the label of wordsense-13363970-n-3 and Britain
the label of wordsense-08860123-n-4.
Explanation for:
Thing SubClassOf Nothing
classifiedByRegion Domain Synset
current_account classifiedByRegion Britain
current_account Type WordSense
Synset DisjointWith WordSense
wordsense-13363970-n-3 classifiedByRegion
wordsense-08860123-n-4
“The following pointer types are usually used to indicate lexical
relations: Antonym, Pertainym, Participle, Also See, Derivationally
Related. The remaining pointer types are generally used to represent
semantic relations.”
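The explanation above can be replayed as a small sketch: apply the rdfs:domain rule, then look for an individual that ends up in two disjoint classes. The schema and data are transcribed from the slide with prefixes dropped; the function names are hypothetical, and this is not how a DL reasoner is implemented.

```python
# Schema and data transcribed from the explanation, prefixes dropped.
domain = {"classifiedByRegion": "Synset"}
disjoint = {frozenset({"Synset", "WordSense"})}
types = {"current_account": {"WordSense"}}
triples = [("current_account", "classifiedByRegion", "Britain")]


def infer_types(triples, types, domain):
    """Add the rdfs:domain class to every subject of a property."""
    inferred = {node: set(classes) for node, classes in types.items()}
    for subject, predicate, _ in triples:
        if predicate in domain:
            inferred.setdefault(subject, set()).add(domain[predicate])
    return inferred


def clashes(inferred, disjoint):
    """Individuals that belong to two classes declared disjoint."""
    return sorted(
        node
        for node, classes in inferred.items()
        if any(pair <= classes for pair in disjoint)
    )


print(clashes(infer_types(triples, types, domain), disjoint))
```

Without the domain inference the asserted types alone show no clash. The inconsistency only appears once the data is combined with the schema.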
Fixing Errors
wn30:classifiedByRegion
a rdf:Property, owl:ObjectProperty ;
rdfs:domain wn30:Synset ;
rdfs:range wn30:NounSynset ;
rdfs:subPropertyOf wn30:classifiedBy .
Updated to:
wn30:classifiedByRegion
a rdf:Property, owl:ObjectProperty ;
rdfs:subPropertyOf wn30:classifiedBy ;
rdfs:range [ a owl:Class ;
owl:unionOf (wn30:NounWordSense
wn30:NounSynset)] ;
rdfs:domain [ a owl:Class ;
owl:unionOf (wn30:WordSense wn30:Synset)] .
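A sketch of what the updated definition accepts, read as a closed-world check: the subject needs a type in the domain union and the object a type in the range union. The type assertions are assumptions made for illustration (the slides only state that current_account is a WordSense), and conforms is a hypothetical helper.

```python
# Allowed classes after the update, prefixes dropped.
allowed_domain = {"WordSense", "Synset"}
allowed_range = {"NounWordSense", "NounSynset"}

# Assumed type assertions for the two word senses discussed earlier.
types = {
    "wordsense-13363970-n-3": {"NounWordSense", "WordSense"},
    "wordsense-08860123-n-4": {"NounWordSense", "WordSense"},
}


def conforms(subject, obj, types):
    """True when subject and object each have a type in the allowed union."""
    subject_ok = bool(types.get(subject, set()) & allowed_domain)
    object_ok = bool(types.get(obj, set()) & allowed_range)
    return subject_ok and object_ok


ok = conforms("wordsense-13363970-n-3", "wordsense-08860123-n-4", types)
print(ok)
```

With the original definition the same triple forced the subject into wn30:Synset, which clashed with its WordSense type.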
Proof Explanations
In formal verification, the complexity of the proofs/explanations is a concern. This is
the explanation found for the issue:
synset-01345109-v hypernymOf
synset-01220528-v
VerbWordSense subClassOf WordSense
frame domain VerbWordSense
synset-01220528-v frame
"Somebody ----s something"
hypernymOf range Synset
Synset disjointWith WordSense
synset-01220528-v is found to be of type 'Synset' because it is the object of a
triple with predicate wn30:hypernymOf, and the range of this predicate
is the set of all synsets.
synset-01220528-v is a verb synset, and verb synsets are a subset of
synsets.
More
Two invalid situations: (a) two or more words associated with a single word
sense subject; (b) two or more lexical forms associated with a single word
subject.
wordsense-01860795-v-2 type WordSense
word-deixar lexicalForm "deixar"@pt
word-parar lexicalForm "parar"@pt
wordsense-01860795-v-2 word word-deixar
Word subClassOf lexicalForm exactly 1
wordsense-01860795-v-2 word word-parar
word-deixar type Word
word-parar type Word
WordSense subClassOf word exactly 1 Word
More
Stardog is the only reasoner and database system that supports ICV.
Under the ICV semantics, the axioms below from the wn30:WordSense
class are taken as constraints rather than terminology definitions.
On finding an instance of the class wn30:WordSense connected to more than
one instance of wn30:Word, it raises an exception instead of inferring that
the two different wn30:Word instances should be the same.
wn30:WordSense
a rdfs:Class, owl:Class ;
rdfs:subClassOf [
a owl:Restriction ;
owl:onProperty wn30:inSynset ;
owl:qualifiedCardinality
"1"^^xsd:nonNegativeInteger ;
owl:onClass wn30:Synset ], [
a owl:Restriction ;
owl:onProperty wn30:word;
owl:qualifiedCardinality
"1"^^xsd:nonNegativeInteger ;
owl:onClass wn30:Word ] .
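A minimal sketch of reading the "exactly 1" restriction on wn30:word as a constraint. It assumes closed-world counting with distinct names treated as distinct individuals, and it is not Stardog's API; cardinality_violations is a hypothetical helper. The triples come from the 'deixar'/'parar' example, prefixes dropped.

```python
from collections import defaultdict

# Triples from the 'deixar'/'parar' example, prefixes dropped.
triples = [
    ("wordsense-01860795-v-2", "word", "word-deixar"),
    ("wordsense-01860795-v-2", "word", "word-parar"),
]


def cardinality_violations(triples, prop, exactly=1):
    """Subjects whose number of distinct values for prop differs from exactly.

    Only subjects that appear with prop are seen, so a count of zero
    is not detected by this sketch.
    """
    values = defaultdict(set)
    for subject, predicate, obj in triples:
        if predicate == prop:
            values[subject].add(obj)
    return {s: sorted(v) for s, v in values.items() if len(v) != exactly}


violations = cardinality_violations(triples, "word")
print(violations)
```

Under the default OWL semantics the same data would not be an error: the reasoner would conclude that word-deixar and word-parar denote the same individual.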
Conclusions
Linguistic resources are very easy to start, hard to improve and
extremely difficult to maintain.
The size of lexical resources is easy to compare; quality is hard.
A lot of (old? already used/defined?) verifications can be encoded in
OWL axioms. Some of them may require more expressivity (SUMO?).

Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 

Recently uploaded (20)

Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
ISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
ISI 2024: Application Form (Extended), Exam Date (Out), EligibilityISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
ISI 2024: Application Form (Extended), Exam Date (Out), Eligibility
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdfDMARDs Pharmacolgy Pharm D 5th Semester.pdf
DMARDs Pharmacolgy Pharm D 5th Semester.pdf
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdfMudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
Mudde & Rovira Kaltwasser. - Populism - a very short introduction [2017].pdf
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 

Verifying Integrity Constraints of a RDF-based WordNet

  • 1. Verifying Integrity Constraints of a RDF-based WordNet. Fabricio Chalub, Alexandre Rademaker. IBM Research, Brazil. January 30, 2016. Rademaker, Chalub (IBM Research) ICC on OpenWordnet-PT January 30, 2016 1 / 21
  • 2. OpenWordnet-PT (http://wnpt.brlcloud.com/wn/). Goal: not a simple translation of the Princeton WordNet (PWN), but a wordnet based on the PWN architecture. Originally created from a Portuguese (PT) projection of the Universal WordNet (Gerard de Melo). Three language strategies in its lexical enrichment process: (i) translation; (ii) corpus extraction; (iii) dictionaries. Corpora: AC/DC project, DHBB CPDOC/FGV, etc. LR: morphosemantic links, nominalizations from NomLex, Nomage and Wiktionary, etc. Freely available since Dec 2011: download as RDF files, query via SPARQL, or browse via the web interface (above). Used by “Google Translate”, FreeLing, OMW, BabelNet and Onto.PT.
  • 3. Why RDF. Four XML encodings of the same information:
      <definition gloss="This is my definition">
        <meta creator="me"/></definition>
      <definition><meta creator="me"/>
        This is my definition</definition>
      <definition><meta creator="me"/>
        <text>This is my definition</text>
      </definition>
      <lmf:definition meta:creator="me">
        This is my definition</lmf:definition>
    “Why RDF model is different from the XML model” by Tim Berners-Lee (1998). http://www.w3.org/DesignIssues/RDF-XML.html
  • 4. Why RDF (cont.) There is a mapping from XML documents to semantic graphs. The element names are a big hint for a human reader. Without the schema (DTD, XML Schema), you know things about the document structure, but nothing else. You can’t tell what to deduce. You can’t even really tell what real questions can be asked. (1) The mapping is many to one; (2) you need a schema to know what the mapping is (there is no inference language); (3) the expression you need for querying something in terms of the XML tree is necessarily more complicated than the expression you need for querying something in terms of the RDF tree, for example “give me the properties with the same metadata as this one”.
  • 5. Why RDF (cont.) Giving a machine a knowledge tree vs. giving a person a document. A document for a person is generally serialized so that, when read serially by a human being, the result will be to build up a graph of associations in that person’s head. The order is important. For a graph of knowledge, order is not important, so long as the nodes in common between different statements are identified consistently.
  • 6. Linked Data. (1) Use URIs as names for things. (2) Use HTTP URIs so that people can look up those names. (3) When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL). (4) Include links to other URIs, so that they can discover more things (ILI?). There are a number of Linked Data projects for lexical resources. “Linked Data” by Tim Berners-Lee (2006).
  • 7. Some words about vocabularies. To encode data, we need to decide which classes and properties to use! There are different vocabularies for encoding wordnets in RDF! Adopting already defined vocabularies helps data interoperability, since it makes the data easier to integrate with other resources. We use http://www.w3.org/TR/wordnet-rdf/ from 2006. Scripts are available at http://github.com/own-pt/.
  • 8. Namespaces
      1. https://w3id.org/own-pt/wn30/schema/
      2. https://w3id.org/own-pt/wn30-pt/instances/
      3. https://w3id.org/own-pt/wn30-en/instances/
      4. https://w3id.org/own-pt/nomlex/schema/
      5. https://w3id.org/own-pt/nomlex/instances/
  • 9. This paper. Our first attempt at verifying integrity constraints of our OpenWordnet-PT against the ontology for encoding wordnets. Correcting and improving linguistic data is a hard task. So far, there are no clear criteria for the semantic evaluation of wordnets, nor ways of comparing their relative quality or accuracy. Thus the qualitative assessment of a new wordnet seems, at present, a matter of judgment and art.
  • 10. OWL and RDF. Consistency checking in OWL and integrity constraints in RDF. The semantics of OWL Lite and OWL DL are based on description logics (DL). DLs are a family of logics that are decidable fragments of first-order logic with attractive and well-understood computational properties. A DL knowledge base comprises two components, the TBox and the ABox. The TBox contains intensional knowledge (terminology). The ABox contains extensional knowledge (assertions). Intensional knowledge is usually thought not to change, and extensional knowledge is usually thought to be contingent.
  • 11. Reasoning. Given an ontology encoded in OWL (Lite or DL), one can use DL reasoners for different tasks, such as concept consistency checking, query answering, classification, etc. The basic reasoning task in an ABox is instance checking, which verifies whether a given individual is an instance of (or belongs to) a specified concept. In some use cases, we want a method for validating the RDF data against a given model. In this case, OWL users intend OWL axioms to be interpreted as constraints on RDF data.
  • 12. CWA vs. OWA. OWL’s default semantics adopts the Open World Assumption (OWA), rather than the Closed World Assumption (CWA), and does not adopt the Unique Name Assumption (UNA). Due to the OWA, a statement must not be inferred to be false on the basis of a failure to prove it; the fact that a piece of information has not been specified does not mean that such information does not exist. On the other hand, the absence of the UNA allows two different constants to refer to the same individual.
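The difference between the two assumptions can be made concrete with a toy sketch. It is an illustration only, not how an OWL reasoner is implemented; the function names and the fact used in the usage note are hypothetical.

```python
def holds(facts, statement, closed_world):
    """Truth value of a statement against a set of asserted facts.

    Asserted statements are True. A missing statement is False under the
    closed world assumption and None (unknown) under the open world one.
    """
    if statement in facts:
        return True
    return False if closed_world else None


def at_most_one(values, unique_names):
    """Reading of an 'at most one value' restriction over a set of constants.

    With the unique name assumption, two distinct names are a violation.
    Without it, the names are inferred to denote the same individual.
    """
    if len(values) <= 1:
        return "ok"
    return "violation" if unique_names else "inferred-same"
```

For instance, with `facts = {("word-deixar", "type", "Word")}`, asking for `("word-parar", "type", "Word")` gives `False` under CWA and `None` under OWA, and `at_most_one({"word-deixar", "word-parar"}, unique_names=False)` returns `"inferred-same"` rather than reporting an error.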
  • 13. Queries in SPARQL. One more motivation for RDF:
      select ?w ?ws1 ?ws2 {
        ?ss1 wn30:containsWordSense ?ws1 .
        ?ws1 wn30:word ?w .
        ?ss2 wn30:containsWordSense ?ws2 .
        ?ws2 wn30:word ?w .
        ?ss1 wn30:hyponymOf* ?ss2 .
      }
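The query on slide 13 asks for words that have senses in two synsets connected by a chain of zero or more hyponymOf links. A minimal Python sketch of the same matching over an in-memory list of triples may help to read it. The toy identifiers (`ss-dog`, `ws-dog-1`, and so on) are made up for illustration and the prefixes are dropped; a real deployment would run the SPARQL query against the triplestore instead.

```python
from collections import defaultdict

# Toy triples; every identifier below is hypothetical.
TRIPLES = [
    ("ss-dog", "containsWordSense", "ws-dog-1"),
    ("ws-dog-1", "word", "w-dog"),
    ("ss-animal", "containsWordSense", "ws-animal-1"),
    ("ws-animal-1", "word", "w-animal"),
    ("ss-cur", "containsWordSense", "ws-dog-2"),
    ("ws-dog-2", "word", "w-dog"),
    ("ss-cur", "hyponymOf", "ss-dog"),
    ("ss-dog", "hyponymOf", "ss-animal"),
]


def index(triples, predicate):
    """Map each subject to the set of its objects for one predicate."""
    out = defaultdict(set)
    for s, p, o in triples:
        if p == predicate:
            out[s].add(o)
    return out


def reachable(start, edges):
    """Nodes reachable through zero or more edges (start included, like `*`)."""
    seen, stack = {start}, [start]
    while stack:
        node = stack.pop()
        for nxt in edges.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def same_word_in_hyponym_chain(triples):
    """Rows (w, ws1, ws2) matching the graph pattern of the slide's query."""
    contains = index(triples, "containsWordSense")
    word = index(triples, "word")
    hyponym = index(triples, "hyponymOf")
    rows = set()
    for ss1, senses1 in contains.items():
        for ss2 in reachable(ss1, hyponym):
            for ws1 in senses1:
                for ws2 in contains.get(ss2, ()):
                    for w in word.get(ws1, set()) & word.get(ws2, set()):
                        rows.add((w, ws1, ws2))
    return rows
```

Because `*` also matches a path of length zero, every word sense is paired with itself. Keeping only rows with `ws1 != ws2` (in SPARQL, a `FILTER(?ws1 != ?ws2)`) leaves the interesting cases, here the word `w-dog` appearing in both `ss-cur` and its hypernym `ss-dog`.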
  • 14. Tools. Protégé is an ontology editor that, among other features, interfaces with well-known DL reasoners such as FaCT++ and HermiT. Starting in version 4, Protégé also gives us an interface to search for explanations of what caused an inconsistency. Racer and Pellet are reasoners that have this feature built in. RDFpro is used to combine, split and syntax-check RDF files, etc. Stardog and AllegroGraph (AG) are triplestores: Stardog includes integrity constraint validation (ICV), and AG has RDFS++ semantics.
  • 15. Errors. The errors found can be categorized in three different classes: datatype errors, domain and range errors, and structural errors.
    Missing class and property definitions. We improved the OWL file, for example:
      wn30:AdjectiveWordSense rdfs:subClassOf wn30:WordSense .
    Literal values with types:
      Literal value "00113726" does not belong to datatype nonNegativeInteger
  • 16. Errors. “current account” is the label of wordsense-13363970-n-3 and “Britain” is the label of wordsense-08860123-n-4.
    Explanation for: Thing SubClassOf Nothing
      classifiedByRegion Domain Synset
      current_account classifiedByRegion Britain
      current_account Type WordSense
      Synset DisjointWith WordSense
    The triple behind it:
      wordsense-13363970-n-3 classifiedByRegion wordsense-08860123-n-4
    “The following pointer types are usually used to indicate lexical relations: Antonym, Pertainym, Participle, Also See, Derivationally Related. The remaining pointer types are generally used to represent semantic relations.”
  • 17. Fixing Errors
      wn30:classifiedByRegion a rdf:Property, owl:ObjectProperty ;
        rdfs:domain wn30:Synset ;
        rdfs:range wn30:NounSynset ;
        rdfs:subPropertyOf wn30:classifiedBy .
    Updated to:
      wn30:classifiedByRegion a rdf:Property, owl:ObjectProperty ;
        rdfs:subPropertyOf wn30:classifiedBy ;
        rdfs:range [ a owl:Class ;
          owl:unionOf (wn30:NounWordSense wn30:NounSynset) ] ;
        rdfs:domain [ a owl:Class ;
          owl:unionOf (wn30:WordSense wn30:Synset) ] .
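The effect of the fix on slides 16 and 17 can be sketched with a toy check. This is a deliberate simplification and not a DL reasoner: it ignores the subclass hierarchy and treats a domain as a plain list of alternative classes. The function names are hypothetical; the class names come from the slides.

```python
# Pairs of classes declared disjoint (from the explanation on slide 16).
DISJOINT = {frozenset({"Synset", "WordSense"})}


def disjoint(a, b):
    """True if the two class names are declared disjoint."""
    return frozenset({a, b}) in DISJOINT


def domain_clash(asserted_types, domain_union):
    """True if every class allowed by the domain is disjoint with some
    type already asserted for the subject, so no alternative is left."""
    return all(
        any(disjoint(candidate, t) for t in asserted_types)
        for candidate in domain_union
    )


# The subject of classifiedByRegion is asserted to be a WordSense.
subject_types = {"WordSense"}

clash_before_fix = domain_clash(subject_types, ["Synset"])
clash_after_fix = domain_clash(subject_types, ["WordSense", "Synset"])
```

With the original domain (`Synset` only) the subject is forced into a class disjoint with its asserted type, so `clash_before_fix` is `True`. With the union domain the `WordSense` alternative remains available and `clash_after_fix` is `False`.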
  • 18. Proofs and explanations. In formal verification, the complexity of the proofs/explanations matters. This is the explanation found for one issue:
      synset-01345109-v hypernymOf synset-01220528-v
      VerbWordSense subClassOf WordSense
      frame domain VerbWordSense
      synset-01220528-v frame "Somebody ----s something"
      hypernymOf range Synset
      Synset disjointWith WordSense
    synset-01220528-v is found to be of type Synset because it is the object of a triple with predicate wn30:hypernymOf, and the range of this predicate is the set of all synsets (synset-01220528-v is a verb synset, and verb synsets are a subset of synsets). It is also inferred to be a WordSense, because it is the subject of a frame triple and the domain of frame is VerbWordSense. Since Synset and WordSense are disjoint, this is a contradiction.
  • 19. More. Two invalid situations: (a) two or more words associated with a single word sense subject; (b) two or more lexical forms associated with a single word subject.
      wordsense-01860795-v-2 type WordSense
      word-deixar lexicalForm "deixar"@pt
      word-parar lexicalForm "parar"@pt
      wordsense-01860795-v-2 word word-deixar
      Word subClassOf lexicalForm exactly 1
      wordsense-01860795-v-2 word word-parar
      word-deixar type Word
      word-parar type Word
      WordSense subClassOf word exactly 1 Word
  • 20. More. Stardog is the only reasoner and database system that supports ICV. Under the ICV semantics, the axioms below from the wn30:WordSense class are taken as constraints rather than terminology definitions. When it finds an instance of the class wn30:WordSense connected to more than one instance of wn30:Word, it raises an exception instead of inferring that the two different wn30:Word instances should be the same.
      wn30:WordSense a rdfs:Class, owl:Class ;
        rdfs:subClassOf
          [ a owl:Restriction ;
            owl:onProperty wn30:inSynset ;
            owl:qualifiedCardinality "1"^^xsd:nonNegativeInteger ;
            owl:onClass wn30:Synset ],
          [ a owl:Restriction ;
            owl:onProperty wn30:word ;
            owl:qualifiedCardinality "1"^^xsd:nonNegativeInteger ;
            owl:onClass wn30:Word ] .
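Reading the "exactly 1" restriction as a constraint amounts to counting values per subject and reporting every subject whose count is not one. The sketch below does this over plain tuples. It is not Stardog's API; the function name is hypothetical, and the two `wordsense-hypothetical-*` identifiers are invented to show the passing and the missing-value cases.

```python
from collections import defaultdict

# The first two triples come from slide 19; the third is hypothetical.
TRIPLES = [
    ("wordsense-01860795-v-2", "word", "word-deixar"),
    ("wordsense-01860795-v-2", "word", "word-parar"),
    ("wordsense-hypothetical-1", "word", "word-deixar"),
]


def exactly_one_violations(subjects, triples, prop):
    """Subjects that do not have exactly one distinct value for `prop`.

    Distinct names count as distinct individuals, and a missing value is
    a violation too, which is the constraint reading of the restriction.
    """
    values = defaultdict(set)
    for s, p, o in triples:
        if p == prop:
            values[s].add(o)
    return {s: sorted(values[s]) for s in subjects if len(values[s]) != 1}


word_senses = [
    "wordsense-01860795-v-2",
    "wordsense-hypothetical-1",
    "wordsense-hypothetical-2",  # has no `word` triple at all
]
violations = exactly_one_violations(word_senses, TRIPLES, "word")
```

Here `violations` lists `wordsense-01860795-v-2` with both `word-deixar` and `word-parar`, and `wordsense-hypothetical-2` with no value, while the sense with a single word passes.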
  • 21. Conclusions. Linguistic resources are very easy to start, hard to improve and extremely difficult to maintain. The size of lexical resources is easy to compare; their quality is hard. A lot of (old? already used/defined?) verifications can be encoded in OWL axioms. Some of them may require more expressivity (SUMO?).