SlideShare a Scribd company logo
www.isocat.org

ISOcat and RELcat:
2 cooperating Semantic Registries

Menzo Windhouwer
menzo.windhouwer@dans.knaw.nl
The Language Archive – DANS

Ineke Schuurman
ineke@ccl.kuleuven.be
KU Leuven, CLARIN-NL – Utrecht University
17 January 2014

CLIN 24

1
www.isocat.org

Outline

• The need for explicit semantics
– ISOcat

• Mapping issues
– Languages, theoretical frameworks
– Granularity levels
– RELcat

• CGN case study
• Conclusions and future work
17 January 2014

CLIN 24

2
www.isocat.org

Typological Database Nijmegen
TOP NOTION tds:Noun GROUPS{
NOTION tdn:GrammaticalDistinctions
LABEL "Grammatical distinctions for nouns."
GROUPS {
NOTION tdn:AgentNouns
LABEL "Agent nouns."
DESCRIPTION "Nouns can function as the agent of a clause."
LINK TO CONCEPT agentRole
GROUPS {
NOTION tdn:v098_plusAffix
LABEL "Agent nouns formed by verb stem plus affix."
LINK TO CONCEPTS (agentRole, verbalMorphology, boundAffix)
DESCRIPTION
<p>Agent nouns are formed by a verb stem plus an affix, e.g. English <qv>walk-er</qv>.</p>
NOTE AUTHOR IS "TDS" TYPE IS "original TDN label" "AGENT NOUNS ARE VERB STEM PLUS AFFIX"
IS FIELD v098;
...

Notes: TDN is not in archived in TLA, but curated in TDS, a previous project Menzo worked on, and now archived at DANS;
17 January 2014
CLIN 24
3
also this not a TDN punchcard
www.isocat.org

17 January 2014

DOBES corpora

CLIN 24

4
www.isocat.org

ISOcat

• An open Data Category/Concept Registry where
everyone can
– find and select data categories/concepts
– create new data categories/concepts
– share data categories/concepts

• Each data category/concept has a Persistent
Identifier which can be embedded in a resource
(schema) to make the intended semantics (more)
explicit
17 January 2014

CLIN 24

5
www.isocat.org

Mapping issues

• Interesting resources for a specific research
question might
– use very different theoretical frameworks, which
might share few/none data categories/concepts
– use more coarse or finer grained data
categories/concepts

• How to overcome these differences by
mapping data categories/concepts to each
other?
17 January 2014

CLIN 24

6
www.isocat.org

Some examples

• definite article (PoS)
– EN: 1 (-)
– FR: 2 (masc, fem)
– NL: 2 (neuter, non-neuter)
– DE: 3 (masc, fem, neuter)

Dutch ‘non-neuter’ , for example, should be
related to ‘masc’ and ‘fem’
17 January 2014

CLIN 24

7
www.isocat.org

Some examples

• Indirect object (syntax)
– EN: indirect object
– NL:
• meewerkend voorwerp (1), or
• meewerkend voorwerp (2) plus belanghebbend
voorwerp
– All translated as ‘indirect object’

=> 3 definitions of ‘indirect object’, relations are
to be shown !
17 January 2014

CLIN 24

8
www.isocat.org

Some examples

• Event (semantics)
– ISO-TimeML: event and state, where ‘state’ is a
type of event
– Other theories (Kamp & Reyle etc): eventuality,
two subtypes: ‘event’ and ‘state’

Concepts ‘eventuality’, ‘event’ and ‘state’ are to
be related
17 January 2014

CLIN 24

9
www.isocat.org

ISOcat internal issues

Data categories that are almost the same,
apart from type, profile, language, …
Currently we insert a new DC. But note that the
original one and the new one should be
marked as having a same-as relation

17 January 2014

CLIN 24

10
www.isocat.org

RELcat

• A Relation Registry (under construction) to store
–
–
–
–

(almost) same-as relationships
subsumption relationships (isSuperClassOf, isSubClassOf)
mereology relationships (isPartOf, hasPart)
…

between data categories/concepts
• The focus is on informal and possibly partial
ontologies to be used for resource discovery
• Based on RDF triples
17 January 2014

CLIN 24

11
www.isocat.org

CGN case study

• Atomic building blocks of CGN tags are
defined in ISOcat (still private)
• The EBNF schema of a CGN tag is stored in
SCHEMAcat
• The subsumption relations in the value
domains are stored in RELcat
• (almost) same-as relationships with other data
categories/concepts are also stored in RELcat
17 January 2014

CLIN 24

12
www.isocat.org

CGN granularity mappings

• How to deal with (almost) same-as
relationships that involve more then one
atomic CGN data category/concept?
– Example: N(SOORT) = Common Noun

• Based on the CGN EBNF this involves the
following slots of the /CGN tag/
– /PoS/ = /N/
– /NTYPE/ = /SOORT/

• How to express this in RDF?
17 January 2014

CLIN 24

13
www.isocat.org

RELcat RDF mapping

• Data categories/concepts can function as
subjects and objects in an RDF triple
• The predicate of an RDF triple is a RELcat
relationship type
• Alternative: complex data categories as
properties

17 January 2014

CLIN 24

14
www.isocat.org

N(SOORT) = Common Noun
CGN tag

isA

sameAs

Common Noun
17 January 2014

CLIN 24

15
www.isocat.org

N(SOORT) = Common Noun
CGN tag

isA
hasPart

hasPart

PoS

has more parts
NTYPE

has more
potential
values

has more
potential
values
sameAs

hasPotentialValue

N
17 January 2014

Common Noun
CLIN 24

hasPotentialValue

SOORT
16
www.isocat.org

N(SOORT) = Common Noun
CGN tag

isA
hasPart
hasPart

PoS
has more
potential
values

hasPart
hasPart

isA
hasValue

hasPotentialValue

17 January 2014

NTYPE

has more
potential
values

isA
sameAs

isA
N

has more parts

hasValue

hasPotentialValue

isA
Common Noun
CLIN 24

SOORT
17
www.isocat.org

N(SOORT) = Common Noun
CGN tag

isA
hasPart
hasPart

PoS
has more
potential
values

hasPart
hasPart

isA
hasValue

hasPotentialValue

17 January 2014

NTYPE

has more
potential
values

isA
sameAs

isA
N

has more parts

hasValue

hasPotentialValue

isA
Common Noun
CLIN 24

SOORT
18
www.isocat.org

Cooperation between
ISOcat and RELcat

• ISOcat: value domains of closed data
categories
– RELcat: hasPotentialValue (new relationship type)

• ISOcat: is-a relations between simple data
categories
– RELcat: subsumption relations

• SCHEMAcat: part-of relationships
– RELcat: mereology relationships
17 January 2014

CLIN 24

19
www.isocat.org

Conclusions and future work

• Simple mappings are easy
• Complex mapping get easily fairly complex
– UI support?
– DSL support?
– Alternative RDF mapping?

• User front-end for RELcat
– Integration of RELcat and ISOcat?
17 January 2014

CLIN 24

20
www.isocat.org

Other examples

• “JJR” -> “POS=adjective & degree=comparative”
• “Transitive” -> “thetavp=vp120 & synvps=[synNP] &
caseAssigner=True”
• “VVIMP” -> “POS= verb & main verb & mood=imperative”

17 January 2014

CLIN 24

21

More Related Content

Similar to ISOcat and RELcat, two cooperating semantic registries

Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
Guus Schreiber
 
Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?
Julien PLU
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdf
Heiko Paulheim
 
How to model digital objects within the semantic web
How to model digital objects within the semantic webHow to model digital objects within the semantic web
How to model digital objects within the semantic web
Angelica Lo Duca
 
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNetGathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
Andrea Nuzzolese
 
What do cats have to do with explicit semantics?
What do cats have to do with explicit semantics?What do cats have to do with explicit semantics?
What do cats have to do with explicit semantics?
Menzo Windhouwer
 
Semantic Web - Ontology 101
Semantic Web - Ontology 101Semantic Web - Ontology 101
Semantic Web - Ontology 101
Luigi De Russis
 
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept AnalysisExtracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Mathieu d'Aquin
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for Discovery
OCLC
 
2012 04-26-ifip-wg.pptx
2012 04-26-ifip-wg.pptx2012 04-26-ifip-wg.pptx
2012 04-26-ifip-wg.pptx
Christophe Debruyne
 
Expressing Dublin Core Metadata.ppt
Expressing Dublin Core Metadata.pptExpressing Dublin Core Metadata.ppt
Expressing Dublin Core Metadata.ppt
Bharath Abbareddy
 
Machine Learning Methods for Analysing and Linking RDF Data
Machine Learning Methods for Analysing and Linking RDF DataMachine Learning Methods for Analysing and Linking RDF Data
Machine Learning Methods for Analysing and Linking RDF Data
Jens Lehmann
 
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
rchbeir
 
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
Eva Blomqvist
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
Tony Hammond
 
OpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for allOpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for all
Alexandre Rademaker
 
Lean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural LogicLean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural Logic
Valeria de Paiva
 
Logics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese UnderstandingLogics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese Understanding
Valeria de Paiva
 
Discovering Alignments in Ontologies of Linked Data
Discovering Alignments in Ontologies of Linked DataDiscovering Alignments in Ontologies of Linked Data
Discovering Alignments in Ontologies of Linked Data
Craig Knoblock
 
20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs
andrea huang
 

Similar to ISOcat and RELcat, two cooperating semantic registries (20)

Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
 
Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdf
 
How to model digital objects within the semantic web
How to model digital objects within the semantic webHow to model digital objects within the semantic web
How to model digital objects within the semantic web
 
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNetGathering Lexical Linked Data and Knowledge Patterns from FrameNet
Gathering Lexical Linked Data and Knowledge Patterns from FrameNet
 
What do cats have to do with explicit semantics?
What do cats have to do with explicit semantics?What do cats have to do with explicit semantics?
What do cats have to do with explicit semantics?
 
Semantic Web - Ontology 101
Semantic Web - Ontology 101Semantic Web - Ontology 101
Semantic Web - Ontology 101
 
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept AnalysisExtracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for Discovery
 
2012 04-26-ifip-wg.pptx
2012 04-26-ifip-wg.pptx2012 04-26-ifip-wg.pptx
2012 04-26-ifip-wg.pptx
 
Expressing Dublin Core Metadata.ppt
Expressing Dublin Core Metadata.pptExpressing Dublin Core Metadata.ppt
Expressing Dublin Core Metadata.ppt
 
Machine Learning Methods for Analysing and Linking RDF Data
Machine Learning Methods for Analysing and Linking RDF DataMachine Learning Methods for Analysing and Linking RDF Data
Machine Learning Methods for Analysing and Linking RDF Data
 
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
LSI latent (par HATOUM Saria et DONGO ESCALANTE Irvin Franco)
 
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
Understanding the world through ontology patterns - Eva Blomqvist, ESSENCE co...
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
OpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for allOpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for all
 
Lean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural LogicLean Logic for Lean Times: Varieties of Natural Logic
Lean Logic for Lean Times: Varieties of Natural Logic
 
Logics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese UnderstandingLogics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese Understanding
 
Discovering Alignments in Ontologies of Linked Data
Discovering Alignments in Ontologies of Linked DataDiscovering Alignments in Ontologies of Linked Data
Discovering Alignments in Ontologies of Linked Data
 
20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs
 

More from Menzo Windhouwer

CMD2RDF
CMD2RDFCMD2RDF
Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.
Menzo Windhouwer
 
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Menzo Windhouwer
 
A CMD Core Model for CLARIN Web Services
A CMD Core Model for CLARIN Web ServicesA CMD Core Model for CLARIN Web Services
A CMD Core Model for CLARIN Web Services
Menzo Windhouwer
 
LDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data CategoriesLDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data Categories
Menzo Windhouwer
 
ISOcat to LMF to TEI
ISOcat to LMF to TEIISOcat to LMF to TEI
ISOcat to LMF to TEI
Menzo Windhouwer
 
On the way to a Relation Registry for ISOcat data categories
On the way to a Relation Registry for ISOcat data categoriesOn the way to a Relation Registry for ISOcat data categories
On the way to a Relation Registry for ISOcat data categories
Menzo Windhouwer
 
The ISO-DCR
The ISO-DCRThe ISO-DCR
The ISO-DCR
Menzo Windhouwer
 
Use of ISOcat within CMDI
Use of ISOcat within CMDIUse of ISOcat within CMDI
Use of ISOcat within CMDI
Menzo Windhouwer
 
ISOcat: a short introduction
ISOcat: a short introductionISOcat: a short introduction
ISOcat: a short introduction
Menzo Windhouwer
 
Sustainable operability: Keeping complex linguistic resources alive.
Sustainable operability: Keeping complex linguistic resources alive.Sustainable operability: Keeping complex linguistic resources alive.
Sustainable operability: Keeping complex linguistic resources alive.
Menzo Windhouwer
 

More from Menzo Windhouwer (11)

CMD2RDF
CMD2RDFCMD2RDF
CMD2RDF
 
Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.Semantic Mapping in CLARIN Component Metadata.
Semantic Mapping in CLARIN Component Metadata.
 
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
 
A CMD Core Model for CLARIN Web Services
A CMD Core Model for CLARIN Web ServicesA CMD Core Model for CLARIN Web Services
A CMD Core Model for CLARIN Web Services
 
LDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data CategoriesLDL 2012 - Linking to ISOcat Data Categories
LDL 2012 - Linking to ISOcat Data Categories
 
ISOcat to LMF to TEI
ISOcat to LMF to TEIISOcat to LMF to TEI
ISOcat to LMF to TEI
 
On the way to a Relation Registry for ISOcat data categories
On the way to a Relation Registry for ISOcat data categoriesOn the way to a Relation Registry for ISOcat data categories
On the way to a Relation Registry for ISOcat data categories
 
The ISO-DCR
The ISO-DCRThe ISO-DCR
The ISO-DCR
 
Use of ISOcat within CMDI
Use of ISOcat within CMDIUse of ISOcat within CMDI
Use of ISOcat within CMDI
 
ISOcat: a short introduction
ISOcat: a short introductionISOcat: a short introduction
ISOcat: a short introduction
 
Sustainable operability: Keeping complex linguistic resources alive.
Sustainable operability: Keeping complex linguistic resources alive.Sustainable operability: Keeping complex linguistic resources alive.
Sustainable operability: Keeping complex linguistic resources alive.
 

Recently uploaded

JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptxAI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
Sunil Jagani
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
AlexanderRichford
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
christinelarrosa
 
AWS Certified Solutions Architect Associate (SAA-C03)
AWS Certified Solutions Architect Associate (SAA-C03)AWS Certified Solutions Architect Associate (SAA-C03)
AWS Certified Solutions Architect Associate (SAA-C03)
HarpalGohil4
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
Fwdays
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
UiPathCommunity
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
zjhamm304
 
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Ukraine
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
Sease
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 

Recently uploaded (20)

JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptxAI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
 
AWS Certified Solutions Architect Associate (SAA-C03)
AWS Certified Solutions Architect Associate (SAA-C03)AWS Certified Solutions Architect Associate (SAA-C03)
AWS Certified Solutions Architect Associate (SAA-C03)
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
 
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 

ISOcat and RELcat, two cooperating semantic registries

  • 1. www.isocat.org ISOcat and RELcat: 2 cooperating Semantic Registries Menzo Windhouwer menzo.windhouwer@dans.knaw.nl The Language Archive – DANS Ineke Schuurman ineke@ccl.kuleuven.be KU Leuven, CLARIN-NL – Utrecht University 17 January 2014 CLIN 24 1
  • 2. www.isocat.org Outline • The need for explicit semantics – ISOcat • Mapping issues – Languages, theoretical frameworks – Granularity levels – RELcat • CGN case study • Conclusions and future work 17 January 2014 CLIN 24 2
  • 3. www.isocat.org Typological Database Nijmegen TOP NOTION tds:Noun GROUPS{ NOTION tdn:GrammaticalDistinctions LABEL "Grammatical distinctions for nouns." GROUPS { NOTION tdn:AgentNouns LABEL "Agent nouns." DESCRIPTION "Nouns can function as the agent of a clause." LINK TO CONCEPT agentRole GROUPS { NOTION tdn:v098_plusAffix LABEL "Agent nouns formed by verb stem plus affix." LINK TO CONCEPTS (agentRole, verbalMorphology, boundAffix) DESCRIPTION <p>Agent nouns are formed by a verb stem plus an affix, e.g. English <qv>walk-er</qv>.</p> NOTE AUTHOR IS "TDS" TYPE IS "original TDN label" "AGENT NOUNS ARE VERB STEM PLUS AFFIX" IS FIELD v098; ... Notes: TDN is not in archived in TLA, but curated in TDS, a previous project Menzo worked on, and now archived at DANS; 17 January 2014 CLIN 24 3 also this not a TDN punchcard
  • 5. www.isocat.org ISOcat • An open Data Category/Concept Registry where everyone can – find and select data categories/concepts – create new data categories/concepts – share data categories/concepts • Each data category/concept has a Persistent Identifier which can be embedded in a resource (schema) to make the intended semantics (more) explicit 17 January 2014 CLIN 24 5
  • 6. www.isocat.org Mapping issues • Interesting resources for a specific research question might – use very different theoretical frameworks, which might share few/none data categories/concepts – use more coarse or finer grained data categories/concepts • How to overcome these differences by mapping data categories/concepts to each other? 17 January 2014 CLIN 24 6
  • 7. www.isocat.org Some examples • definite article (PoS) – EN: 1 (-) – FR: 2 (masc, fem) – NL: 2 (neuter, non-neuter) – DE: 3 (masc, fem, neuter) Dutch ‘non-neuter’ , for example, should be related to ‘masc’ and ‘fem’ 17 January 2014 CLIN 24 7
  • 8. www.isocat.org Some examples • Indirect object (syntax) – EN: indirect object – NL: • meewerkend voorwerp (1), or • meewerkend voorwerp (2) plus belanghebbend voorwerp – All translated as ‘indirect object’ => 3 definitions of ‘indirect object’, relations are to be shown ! 17 January 2014 CLIN 24 8
  • 9. www.isocat.org Some examples • Event (semantics) – ISO-TimeML: event and state, where ‘state’ is a type of event – Other theories (Kamp & Reyle etc): eventuality, two subtypes: ‘event’ and ‘state’ Concepts ‘eventuality’, ‘event’ and ‘state’ are to be related 17 January 2014 CLIN 24 9
  • 10. www.isocat.org ISOcat internal issues Data categories that are almost the same, apart from type, profile, language, … Currently we insert a new DC. But note that the original one and the new one should be marked as having a same-as relation 17 January 2014 CLIN 24 10
  • 11. www.isocat.org RELcat • A Relation Registry (under construction) to store – – – – (almost) same-as relationships subsumption relationships (isSuperClassOf, isSubClassOf) mereology relationships (isPartOf, hasPart) … between data categories/concepts • The focus is on informal and possibly partial ontologies to be used for resource discovery • Based on RDF triples 17 January 2014 CLIN 24 11
  • 12. www.isocat.org CGN case study • Atomic building blocks of CGN tags are defined in ISOcat (still private) • The EBNF schema of a CGN tag is stored in SCHEMAcat • The subsumption relations in the value domains are stored in RELcat • (almost) same-as relationships with other data categories/concepts are also stored in RELcat 17 January 2014 CLIN 24 12
  • 13. www.isocat.org CGN granularity mappings • How to deal with (almost) same-as relationships that involve more then one atomic CGN data category/concept? – Example: N(SOORT) = Common Noun • Based on the CGN EBNF this involves the following slots of the /CGN tag/ – /PoS/ = /N/ – /NTYPE/ = /SOORT/ • How to express this in RDF? 17 January 2014 CLIN 24 13
  • 14. www.isocat.org RELcat RDF mapping • Data categories/concepts can function as subjects and objects in an RDF triple • The predicate of an RDF triple is a RELcat relationship type • Alternative: complex data categories as properties 17 January 2014 CLIN 24 14
  • 15. www.isocat.org N(SOORT) = Common Noun CGN tag isA sameAs Common Noun 17 January 2014 CLIN 24 15
  • 16. www.isocat.org N(SOORT) = Common Noun CGN tag isA hasPart hasPart PoS has more parts NTYPE has more potential values has more potential values sameAs hasPotentialValue N 17 January 2014 Common Noun CLIN 24 hasPotentialValue SOORT 16
  • 17. www.isocat.org N(SOORT) = Common Noun CGN tag isA hasPart hasPart PoS has more potential values hasPart hasPart isA hasValue hasPotentialValue 17 January 2014 NTYPE has more potential values isA sameAs isA N has more parts hasValue hasPotentialValue isA Common Noun CLIN 24 SOORT 17
  • 18. www.isocat.org N(SOORT) = Common Noun CGN tag isA hasPart hasPart PoS has more potential values hasPart hasPart isA hasValue hasPotentialValue 17 January 2014 NTYPE has more potential values isA sameAs isA N has more parts hasValue hasPotentialValue isA Common Noun CLIN 24 SOORT 18
  • 19. www.isocat.org Cooperation between ISOcat and RELcat • ISOcat: value domains of closed data categories – RELcat: hasPotentialValue (new relationship type) • ISOcat: is-a relations between simple data categories – RELcat: subsumption relations • SCHEMAcat: part-of relationships – RELcat: mereology relationships 17 January 2014 CLIN 24 19
  • 20. www.isocat.org Conclusions and future work • Simple mappings are easy • Complex mapping get easily fairly complex – UI support? – DSL support? – Alternative RDF mapping? • User front-end for RELcat – Integration of RELcat and ISOcat? 17 January 2014 CLIN 24 20
  • 21. www.isocat.org Other examples • “JJR” -> “POS=adjective & degree=comparative” • “Transitive” -> “thetavp=vp120 & synvps=[synNP] & caseAssigner=True” • “VVIMP” -> “POS= verb & main verb & mood=imperative” 17 January 2014 CLIN 24 21