CMDI2RDF

CLARIAH
1
Daan Broeder
Menzo Windhouwer
Meertens Instituut
2
• Create an interoperable domain of Language Resources (LR)
– Interoperable formats for LR content
– Persistent identification (and citation) of LRs
– Use of SAML-based AAI for access to LRs
– Use of the Component Metadata Infrastructure (CMDI) for describing LRs
3
• Created as a response to a fragmented situation of LR metadata
• Flexible
– Not a single schema, but support for different metadata schemas
– Different schemas for different situations
– Semantic interoperability via linking to semantic registries
• Community driven
– Communities can model their own metadata schemas
– They know their data and can create the right schema
– They know the right terminology
• Sharing
– Concepts, terminology, vocabularies
  • CLARIN Concept Registry for linguistic concepts
  • ISO 639 and other relevant vocabularies
  • CLAVAS for organisation names
– Components & profiles via the CLARIN metadata component registry
4
• A Component groups together metadata Elements that naturally belong together to describe a property of the resources
– The Location where a SpeechRecording took place
– The Location of an Actor
– A Location is described by an address and/or region and/or country and/or continent
• Components can be nested
– The Language a specific Actor speaks
– An Actor who takes part in a SpeechRecording for a specific Project
• A Profile is a specific collection of Components for a specific type of resource, e.g., speech recordings
[Figure: example profile SpeechRecording (P) built from the components (C) Actor, Location, Project, Language and TechnicalMetadata; the Location component contains the elements (E) address, region, country and continent.]
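The component model above can be sketched as a small data structure. This is only an illustration, not the CMDI specification: the class names, the exact nesting of Actor, Location and Language, and the element names "name" and "title" are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Element:
    name: str


@dataclass
class Component:
    name: str
    elements: List[Element] = field(default_factory=list)
    components: List["Component"] = field(default_factory=list)  # nested components


@dataclass
class Profile(Component):
    """Top-level collection of components for one type of resource."""


def paths(node: Component, prefix: str = "") -> List[str]:
    """List every element as a slash-separated path through the nesting."""
    here = f"{prefix}/{node.name}" if prefix else node.name
    found = [f"{here}/{e.name}" for e in node.elements]
    for child in node.components:
        found.extend(paths(child, here))
    return found


def location() -> Component:
    # One reusable definition serves both the recording and the actor.
    names = ("address", "region", "country", "continent")
    return Component("Location", [Element(n) for n in names])


# Hypothetical nesting, chosen to mirror the bullets above.
speech_recording = Profile(
    "SpeechRecording",
    components=[
        location(),
        Component("Actor", components=[location(),
                                       Component("Language", [Element("name")])]),
        Component("Project", [Element("title")]),
    ],
)

for p in paths(speech_recording):
    print(p)
```

The same Location definition appears twice in the output, once under the recording and once under the actor, which is the reuse the component approach is aiming for.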
5
[Figure: CMDI infrastructure. Labels: metadata modeler, metadata creator, metadata curator, metadata user; component registry & editor, metadata editor, local metadata repository, OAI-PMH provider, OAI-PMH harvester, joint metadata repository, metadata catalogue, search & semantic mapping, Concept Registry, Relation Registry, resources.]
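The provider/harvester link in the figure uses OAI-PMH, where a harvester pages through records with ListRecords requests. A minimal sketch of building such request URLs follows; the base URL is a placeholder and the metadata prefix "cmdi" is an assumption, since each provider advertises its own prefixes.

```python
from typing import Optional
from urllib.parse import urlencode


def list_records_url(base_url: str,
                     metadata_prefix: str = "cmdi",  # assumed prefix name
                     resumption_token: Optional[str] = None) -> str:
    """Build an OAI-PMH ListRecords request URL.

    In OAI-PMH a resumptionToken is an exclusive argument: when it is
    present, no other argument besides the verb may be sent.
    """
    if resumption_token is not None:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


# Hypothetical provider address, for illustration only.
first_page = list_records_url("http://example.org/oai")
next_page = list_records_url("http://example.org/oai", resumption_token="abc")
print(first_page)
print(next_page)
```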
6
• Started in 2010; version 1.2, released in 2016, supports remote vocabularies
• Actively supported by CLARIN ERIC and several national CLARIN consortia
• Many supporting tools:
– VLO, COMEDI, ARBIL, CMDI Maker, Virtual Collection Registry …
• Link to the Linked (Open) Data world: CMDI2RDF
[Figure: CMDI → CMDI2RDF → LOD]
7
• Started as a 2014 CLARIN NL project by TLA/MPI and DANS
• Now a service supported by CLARIAH WP2 (X11.400)
• Also links to other ‘linguistic’ LOD information sources:
– WALS for linguistic typology information
– CLAVAS for organization names
– DBpedia (currently only used as glue)
• Automatic synchronization with the CMDI metadata
• Simplification of the RDFS model for CMDI
8
• CMD is classic XML, constrained by W3C XML Schema
• To map a CMD record to RDF we need
– A mapping of the basic component model to RDFS
  • Basic classes and properties to represent profiles, components, elements, attributes and their relationships and values
– A mapping of a specific profile or component to RDFS
  • A specific subclass or subproperty of the basic component model
– A mapping of specific metadata records to RDF instances of the RDFS
  • Instances of a profile or component
– Additionally, there is a generic CMD envelope that is mapped using common LOD vocabularies
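The three mapping levels can be illustrated with a few generated triples. This is a sketch under stated assumptions, not the actual CMD2RDF vocabulary: the three example.org namespaces and the record identifier are hypothetical, and only the rdf: and rdfs: IRIs are the real W3C ones.

```python
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
CMDM = "http://example.org/cmdm#"     # hypothetical: basic component model
PROF = "http://example.org/profile/"  # hypothetical: specific profiles/components
REC = "http://example.org/record/"    # hypothetical: metadata records


def triple(s: str, p: str, o: str) -> str:
    """Serialize one triple of IRIs as an N-Triples line."""
    return f"<{s}> <{p}> <{o}> ."


def model_level() -> list:
    # Level 1: basic classes of the component model.
    return [triple(CMDM + name, RDF + "type", RDFS + "Class")
            for name in ("Profile", "Component", "Element")]


def profile_level(profile: str, components: list) -> list:
    # Level 2: a specific profile or component subclasses the basic model.
    out = [triple(PROF + profile, RDFS + "subClassOf", CMDM + "Profile")]
    out += [triple(PROF + c, RDFS + "subClassOf", CMDM + "Component")
            for c in components]
    return out


def record_level(record_id: str, profile: str) -> list:
    # Level 3: a metadata record is an instance of its profile class.
    return [triple(REC + record_id, RDF + "type", PROF + profile)]


lines = (model_level()
         + profile_level("SpeechRecording", ["Actor", "Location"])
         + record_level("rec-001", "SpeechRecording"))
print("\n".join(lines))
```

Keeping the levels in separate functions mirrors the slide: the first is written once, the second is generated per profile, the third per record.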
9
• The basic CMD model is described by ISO/DIS 24622-1
– First part of the ISO TC 37/SC 4 family of three CMD standards
• A natural mapping to RDF would be:
– Profiles/components to RDF classes
– Elements to RDF properties
• Complication
– CLARIN’s CMDI allows attributes on both components and elements
– So elements have to be RDF classes as well
10
• Nevertheless, this introduces extra hierarchy
• CMDI is already a hierarchical metadata schema
• Human readability decreases
• Other solutions welcome!
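Simplified example
Without attributes: resource R has the property Age with value 14
<Description URI= …. >
  <Age>14</Age>
  …
</Description>
With an attribute: resource R points to an Age node that carries the value 14 and the status U
<Description…. >
  <Age status='U'>14</Age>
  …
</Description>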
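The difference between the two mappings in the simplified example can be sketched in code. This is an illustration only: the property names hasElement and hasValue and the blank-node labels are hypothetical and are not taken from the CMD2RDF vocabulary.

```python
import xml.etree.ElementTree as ET


def natural(record: str, xml: str) -> list:
    """Elements as properties: one triple per element, attributes are lost."""
    root = ET.fromstring(xml)
    return [(record, child.tag, child.text) for child in root]


def with_attributes(record: str, xml: str) -> list:
    """Elements as class instances: an extra node carries value and attributes."""
    root = ET.fromstring(xml)
    triples = []
    for i, child in enumerate(root):
        node = f"_:e{i}"                                # blank node for the element
        triples.append((record, "hasElement", node))    # hypothetical property
        triples.append((node, "rdf:type", child.tag))
        triples.append((node, "hasValue", child.text))  # hypothetical property
        for name, value in child.attrib.items():
            triples.append((node, name, value))
    return triples


xml = '<Description><Age status="U">14</Age></Description>'
print(natural("R", xml))
print(with_attributes("R", xml))
```

The second form needs four triples where the first needs one, which is the extra hierarchy and loss of readability noted above.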
11
[Figure: CMD2RDF architecture. Labels: OAI harvester, CLARIN joint metadata domain, CMD2RDF (conversion, enrichment), caching, Component Registry, CLAVAS, WALS, Virtuoso, CMD-RDF (SPARQL, REST, browse), (L)L(O)D cloud.]
Technology:
• Virtuoso RDF store
• Elda as browser
• Tomcat as application server
• Conversion pipeline in Java
• Core transforms in XSLT
• All source code on GitHub
• Docker build file & images available
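Since the converted metadata is offered through SPARQL, a client can send a query over HTTP GET as defined by the SPARQL 1.1 Protocol. The sketch below only builds the request URL; the endpoint address is a placeholder and the query is a generic class count, not one taken from the service.

```python
from urllib.parse import urlencode

ENDPOINT = "http://example.org/sparql"  # hypothetical endpoint URL

# Generic query: the ten most frequently used classes in the store.
QUERY = """\
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
SELECT ?class (COUNT(?s) AS ?n)
WHERE { ?s rdf:type ?class }
GROUP BY ?class
ORDER BY DESC(?n)
LIMIT 10
"""


def sparql_get_url(endpoint: str, query: str) -> str:
    """Build a 'query via GET' URL: the query goes in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


url = sparql_get_url(ENDPOINT, QUERY)
print(url)
```

A client would typically send this request with an Accept header such as application/sparql-results+json to get machine-readable results.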
12
13
• Offers LOD for different LR metadata infrastructures
– LRE Map (LREC)
– META-SHARE
– CLARIN
– DataHub (linguistic part)
• However
– With respect to CLARIN, only data with DC profiles
  • Just a small part of CLARIN
– Seems partly based on old, static data dumps
14
• Goals:
– Find metadata-like information about LRs in LD format
– Translate that into a ‘suitable’ CMDI profile-based metadata record
• Is there such LD that is not already directly available in another format (OLAC, CLARIN, DC, META-SHARE)?
– If so, it is useful to have this metadata in the CLARIN VLO metadata catalogue
– Humanities data archives will mostly have DC (inventories are available from different projects, e.g. DASISH) and frequently offer LD
– Easier ways exist to translate DC into CMDI (e.g. the CMDI DC profile)
– But LD can be a pivot for many such translations
• Still in an exploratory phase
– We would like to use a general strategy
– It is very labor intensive to craft specific transformations for every LD set
15
• Useful for CLARIN?
– Enriching existing CMDI metadata and recycling it
– Relations to already known sources such as:
  • WALS, DBpedia, CLAVAS, Glottolog, …
  • Relations to CLARIAH LD sources?
– Enable the VLO (or an alternative browser) to visualize this information
– Increasing metadata quality:
  • Use CLAVAS to repair errors
  • Include preferred labels
– Some CMDI adaptations required
  • Foreign namespace support in the CMDI payload
[Figure: enrichment workflow with steps A, B and C. Labels: CLARIN centres, CLARIAH?, CMDI, RDF store, DBpedia, Glottolog, RDF2CMD, enriched CMDI, VLO.]
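The quality step of repairing organisation names and including preferred labels can be sketched as a vocabulary lookup. This is a hypothetical illustration: the vocabulary entries and their variant spellings are invented for the example and do not come from CLAVAS.

```python
# Hypothetical vocabulary: preferred label -> known variant spellings.
VOCABULARY = {
    "Meertens Instituut": ["Meertens Institute", "Meertens Instituut (KNAW)"],
    "DANS": ["Data Archiving and Networked Services"],
}


def normalize(name: str) -> str:
    """Collapse whitespace and ignore case when comparing names."""
    return " ".join(name.split()).casefold()


def build_index(vocabulary: dict) -> dict:
    """Map every normalized label or variant to its preferred label."""
    index = {}
    for preferred, variants in vocabulary.items():
        for label in [preferred, *variants]:
            index[normalize(label)] = preferred
    return index


def repair(name: str, index: dict) -> str:
    """Return the preferred label, or the input unchanged if it is unknown."""
    return index.get(normalize(name), name)


index = build_index(VOCABULARY)
print(repair("meertens  institute", index))
print(repair("Unknown Org", index))
```

Unknown names pass through untouched, so a curator can review them instead of having them silently rewritten.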
16
http://cmdi2rdf.meertens.knaw.nl/cmd2rdf/


Editor's Notes

  1. Virtuoso as a triple store, Tomcat as application server, Elda as browser. Conversion pipeline in Java, core transforms in XSLT, all in a Docker package. Code all on GitHub: