SlideShare a Scribd company logo
This project has received funding from the European Union’s Horizon 2020 research and innovation programme
under grant agreement No 731015.
John P. McCrae1, Thierry Declerck2
Linked Open Data
Cloud
1Insight Centre for Data Analytics, National University of Ireland Galway
2Austrian Centre for Digital Humanities
This project has received funding from the European Union’s Horizon 2020 research and innovation
Hitchhiker’s guide to the Linked
Open Data Cloud
DBpedia
• Largest node in the linked open data cloud
• “Nucleus for a web of open data”
• Most data is derived by parsing Wikipedia
• E.g., https://en.wikipedia.org/wiki/C++
• => http://dbpedia.org/resource/C++
• Uses transparent content negotiation
This project has received funding from the European Union’s Horizon 2020 research and innovation
Transparent content negotiationI want to know about “C++” and I understand RDF and
HTML
curl -H "Accept: application/rdf+xml;text/html" -I 
http://dbpedia.org/resource/C++
HTTP/1.1 303 See Other
...
Location: http://dbpedia.org/data/C++.xml
...
Go to this location for the RDF/XML version
This project has received funding from the European Union’s Horizon 2020 research and innovation
Transparent content negotiationI want to know about “C++” and I only know HTML
curl -H "Accept: text/html" -I 
http://dbpedia.org/resource/C++
HTTP/1.1 303 See Other
...
Location: http://dbpedia.org/page/C++
...
Go to this location for the HTML version
Use
/resource/
URL to refer
to concept
This project has received funding from the European Union’s Horizon 2020 research and innovation
DBpedia Pages
Links to other
resources
Ontology
properties
This project has received funding from the European Union’s Horizon 2020 research and innovation
DBpedia ontologyAxioms
Labels
External
Links
This project has received funding from the European Union’s Horizon 2020 research and innovation
WikiData RDF Version:
https://www.wikidata.org/entity/Q2407
BabelNet
Dictionary
compiled from
• Wikipedia
• (Open
Mulitlingual)
WordNet
• Wiktionary
• OmegaWiki
• WikiData
LexVo
• Assigns URIs to
words (strings in a
language)
• Contains links to
WordNet,
FrameNet etc
• Definitions of ISO
Language Codes
Domain datasets
100’s of domain
specific datasets
This project has received funding from the European Union’s Horizon 2020 research and innovation
LexInfo
LexVo
BabelNet
DBpedia
Reusing URIs
Why reuse URIs
• Data interoperability
• Queries work over multiple datasets
• Semantic definitions allows alignments to be reasoned
• (Often) the creators of the URIs have good idea on how data
should be structured
Challenges of interoperability
Susanne Penn
The AT
DT
Fulton NP1s
NNP
County NNL1cb
NNP
Grand JJ
NNP
Jury NN1c
Differences in
Granularity
Linguistic
Differences
Language codes
fr
en
de
thProblem: 7,000+
languages and more
dialects, but only
262=676 codes br?
br =
Breton
ISO Language Codes
fr-CA
Québécois
ang
Anglo-Saxon
pms
Piedmontese
3-Letter codes with
region cover
minority, historical
languages, right?
Variability
How to tag this talk?
• en?
• en-Latn? (As it is not written in Cyrillic)
• en-LV or en-428 (As it is presented in Latvia)
• en-GB or en-826 (As is is composed in British
English)
• en-Latn-GB?Region subtags are used to indicate linguistic variations associated
with or appropriate to a specific country, territory, or region. Typically,
a region subtag is used to indicate variations such as regional
dialects or usage, or region-specific spelling conventions. It can also
be used to indicate that content is expressed in a way that is
appropriate for use throughout a region, for instance, Spanish content
tailored to be useful throughout Latin America.
-- RFC 5646
Glottolog
• Identifies languoids (language
varieties)
• Uses URLs
http://glottolog.org/resource/languoid/id/queb1247
• More information can be found
by following the link
Linked Open Vocabularies
http://lov.linkeddata.es/
ISOcat
• Effort to standardize linguistic vocabulary from ISO Technical
Committee
• Standardized “Data Categories” in a “Registry”
• Discontinued in December 2014
Problems with ISOcat
According to Schuurman et al.
• Too easy to get a login
• “Out-of-control”
• Entries were copies of other entries
• “People sometimes copied an entry, just in order to
make sure the original owner would not change the
entry without them knowing it”
• Complexity - Too many obligatory and overly
technical fields
As an alternative the CLARIN concept registry is (still)
being introduced.
I. Schuurman, M. Windhouwer, O. Ohren, D. Zeman, CLARIN Concept Registry: The new semantic registry, in
CLARIN 2015 Selected Papers (2015), pp. 62–70
LexInfo
• Ontology for “associat[ing] linguistic information
with respect to any level of linguistic description
and expressivity to elements in an ontology”
• Expands OntoLex-Lemon with a set of general
categories
LexInfo - Properties and Values
Properties and
open-world
(non-exhaustive)
list of values
LexInfo - Verb Frames
Verb frames
with formal
definitions
LexInfo - Arguments
Hierarchies of
arguments to be
used in the
frames
OLiA
Ontologies of Linguistic Annotation
• Modular architecture for describing
annotation schemes:
• Reference Model: Common
terminology (similar to LexInfo)
• Annotation Model: Describes a
particular annotation scheme
• Linking Model: Describes the
linking between the reference and
annotation
GOLD - General Ontology Linguistic
Description
• Quite popular
• Defines many terms
• ‘Loose’ semantics
• Sometimes has range and domains on properties
• Not clear how this fits together
Submitting to the LOD Cloud
This project has received funding from the European Union’s Horizon 2020 research and innovation
Go to lod-cloud.net
This project has received funding from the European Union’s Horizon 2020 research and innovation
Fill in the form
Fields
• Identifier
• unique alphanumeric string
• Title
• Full name in English
• Description
• 2-10 sentence description in English
• Full Download
• A link to the complete dataset, ideally as compressed N-
Triples
• SPARQL Endpoint
• If available
• Other Download
• Other formats for download or partial downloads
Fields (2)
• Example
• A single resource that resolves
• Keywords
• Domain
• Defines the colour in the diagram
• Website
• Contact Point
• Links
• Number of triples linking to another dataset in the cloud
• Size
• Number of triples in this dataset
• Namespace, DOI, Image (if desired)
Stars for metadata
quality
Availability of
resource
Services using linked data
Service-oriented architectures
• It is implemented a self-contained operation unit.
• It is a black box for its consumers, which only need
to know the interface, not the implementation.
• It may consist of other underlying services.
Interoperability is a significant challenge here
Service chains
Translation
DE => EN
Parser (EN)
Sentiment
Analysis (EN)
Often tricky to do in
practice!
Issues with service chains
• Services are often components of pipelines without
clear usage to the end user
• The technology readiness level of services is often
quite low, with little documentation or graphical
user interface,
• Services are hard to install often requiring
compiling from source or specialized libraries not
found in major software repositories.
Teanga
• RDF and Linked Data to provide
service interoperability
• Docker to enable easy install
and usage
• Attractive Web Front-End
(Bootstrap, AngularJS, NodeJS)
• Graceful control of errors
This project has received funding from the European Union’s Horizon 2020 research and innovation
This project has received funding from the European Union’s Horizon 2020 research and innovation
This project has received funding from the European Union’s Horizon 2020 research and innovation
LAPPS Grid
Defines key vocabularies for service interoperability
• LAPPS Interchange Format (JSON-LD)
• Web Service Exchange Vocabulary
• Human-in-the-loop workflow construction using
Galaxy
This project has received funding from the European Union’s Horizon 2020 research and innovation
Summary
Summary
• Linked Open Data Cloud
• Big
• Many relevant tools
• Fragmented
• Interoperability is less terrible than other systems
Thanks.
This publication has emanated from research supported in part by a research grant from Science
Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289, co-funded by the European Regional
Development Fund

More Related Content

What's hot

LOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: 3rd relase of the StackLOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: 3rd relase of the Stack
LOD2 Creating Knowledge out of Interlinked Data
 
Freme at feisgiltt 2015 freme & linked data & localisers
Freme at feisgiltt 2015   freme & linked data & localisersFreme at feisgiltt 2015   freme & linked data & localisers
Freme at feisgiltt 2015 freme & linked data & localisers
Felix Sasaki
 
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
FREMEProjectH2020
 
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and RepairLOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Creating Knowledge out of Interlinked Data
 
Lod2 review meeting
Lod2 review meetingLod2 review meeting
Lod2 review meeting
Andreea Bonea
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012
Sebastian Hellmann
 
XML, XML Databases and MPEG-7
XML, XML Databases and MPEG-7XML, XML Databases and MPEG-7
XML, XML Databases and MPEG-7
Deniz Kılınç
 
RTÉ Content Discovery Project - Christophe Debruyne
RTÉ Content Discovery Project - Christophe DebruyneRTÉ Content Discovery Project - Christophe Debruyne
RTÉ Content Discovery Project - Christophe Debruyne
dri_ireland
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate report
Sebastian Hellmann
 

What's hot (9)

LOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: 3rd relase of the StackLOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: 3rd relase of the Stack
 
Freme at feisgiltt 2015 freme & linked data & localisers
Freme at feisgiltt 2015   freme & linked data & localisersFreme at feisgiltt 2015   freme & linked data & localisers
Freme at feisgiltt 2015 freme & linked data & localisers
 
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
 
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and RepairLOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
 
Lod2 review meeting
Lod2 review meetingLod2 review meeting
Lod2 review meeting
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012
 
XML, XML Databases and MPEG-7
XML, XML Databases and MPEG-7XML, XML Databases and MPEG-7
XML, XML Databases and MPEG-7
 
RTÉ Content Discovery Project - Christophe Debruyne
RTÉ Content Discovery Project - Christophe DebruyneRTÉ Content Discovery Project - Christophe Debruyne
RTÉ Content Discovery Project - Christophe Debruyne
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate report
 

Similar to Linked Open Data Cloud

2014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-20142014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-2014
Christophe Debruyne
 
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
WARCnet
 
Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...
vty
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
EUCLID project
 
CLARIN presentation
CLARIN presentationCLARIN presentation
CLARIN presentation
EOSC-hub project
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
Antoine Isaac
 
Innovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPInnovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLP
ariadnenetwork
 
The Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open DataThe Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open Data
David Haskiya
 
From Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and CollaborationsFrom Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and Collaborations
Simeon Warner
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
Sebastian Hellmann
 
CLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage informationCLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage information
Enno Meijers
 
Lynx Webinar #4: Lynx Services Platform (LySP) - Part 2 - The Services
Lynx Webinar #4: Lynx Services Platform (LySP) - Part 2 - The ServicesLynx Webinar #4: Lynx Services Platform (LySP) - Part 2 - The Services
Lynx Webinar #4: Lynx Services Platform (LySP) - Part 2 - The Services
Lynx Project
 
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
COST Action TD1210
 
LoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloudLoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloud
Europeana
 
Web 3 final(1)
Web 3 final(1)Web 3 final(1)
Web 3 final(1)Venky Dood
 
RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)
Vladimir Alexiev, PhD, PMP
 
SFScon21 - Sander Van Dooren - Joinup: Maintaining an Open catalogue of reusa...
SFScon21 - Sander Van Dooren - Joinup: Maintaining an Open catalogue of reusa...SFScon21 - Sander Van Dooren - Joinup: Maintaining an Open catalogue of reusa...
SFScon21 - Sander Van Dooren - Joinup: Maintaining an Open catalogue of reusa...
South Tyrol Free Software Conference
 
Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data Support
Pascal-Nicolas Becker
 
Innovation for Europeana - Europeana v2.0 WP7
Innovation for Europeana - Europeana v2.0 WP7Innovation for Europeana - Europeana v2.0 WP7
Innovation for Europeana - Europeana v2.0 WP7
Max Kaiser
 
Multilingual Data Value Chain for CEF Automated Translation: Interoperability...
Multilingual Data Value Chain for CEF Automated Translation:Interoperability...Multilingual Data Value Chain for CEF Automated Translation:Interoperability...
Multilingual Data Value Chain for CEF Automated Translation: Interoperability...
Dave Lewis
 

Similar to Linked Open Data Cloud (20)

2014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-20142014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-2014
 
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
 
Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
CLARIN presentation
CLARIN presentationCLARIN presentation
CLARIN presentation
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
Innovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPInnovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLP
 
The Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open DataThe Europeana Strategy and Linked Open Data
The Europeana Strategy and Linked Open Data
 
From Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and CollaborationsFrom Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and Collaborations
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
 
CLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage informationCLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage information
 
Lynx Webinar #4: Lynx Services Platform (LySP) - Part 2 - The Services
Lynx Webinar #4: Lynx Services Platform (LySP) - Part 2 - The ServicesLynx Webinar #4: Lynx Services Platform (LySP) - Part 2 - The Services
Lynx Webinar #4: Lynx Services Platform (LySP) - Part 2 - The Services
 
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example Valentine Charles: Linking cultural heritage with KOS: the Europeana example
Valentine Charles: Linking cultural heritage with KOS: the Europeana example
 
LoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloudLoCloud - Local content in a Europeana cloud
LoCloud - Local content in a Europeana cloud
 
Web 3 final(1)
Web 3 final(1)Web 3 final(1)
Web 3 final(1)
 
RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)
 
SFScon21 - Sander Van Dooren - Joinup: Maintaining an Open catalogue of reusa...
SFScon21 - Sander Van Dooren - Joinup: Maintaining an Open catalogue of reusa...SFScon21 - Sander Van Dooren - Joinup: Maintaining an Open catalogue of reusa...
SFScon21 - Sander Van Dooren - Joinup: Maintaining an Open catalogue of reusa...
 
Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data Support
 
Innovation for Europeana - Europeana v2.0 WP7
Innovation for Europeana - Europeana v2.0 WP7Innovation for Europeana - Europeana v2.0 WP7
Innovation for Europeana - Europeana v2.0 WP7
 
Multilingual Data Value Chain for CEF Automated Translation: Interoperability...
Multilingual Data Value Chain for CEF Automated Translation:Interoperability...Multilingual Data Value Chain for CEF Automated Translation:Interoperability...
Multilingual Data Value Chain for CEF Automated Translation: Interoperability...
 

More from PretaLLOD

Dfki Linghub presentation
Dfki Linghub presentationDfki Linghub presentation
Dfki Linghub presentation
PretaLLOD
 
Towards the Detection and Formal Representation of Semantic Shifts in Inflect...
Towards the Detection and Formal Representation of Semantic Shifts in Inflect...Towards the Detection and Formal Representation of Semantic Shifts in Inflect...
Towards the Detection and Formal Representation of Semantic Shifts in Inflect...
PretaLLOD
 
Wordnets and TEI-LEX
Wordnets and TEI-LEXWordnets and TEI-LEX
Wordnets and TEI-LEX
PretaLLOD
 
OntoLex-Lemon as a Possible Bridge between WordNets and Full Lexical Descript...
OntoLex-Lemon as a Possible Bridge between WordNets and Full Lexical Descript...OntoLex-Lemon as a Possible Bridge between WordNets and Full Lexical Descript...
OntoLex-Lemon as a Possible Bridge between WordNets and Full Lexical Descript...
PretaLLOD
 
OntoLex-TEI: Inspiration from Global WordNet
OntoLex-TEI: Inspiration from Global WordNetOntoLex-TEI: Inspiration from Global WordNet
OntoLex-TEI: Inspiration from Global WordNet
PretaLLOD
 
Overview of the Sustainability Plans of the ICT-29b) Projects
Overview of the Sustainability Plans of the ICT-29b) ProjectsOverview of the Sustainability Plans of the ICT-29b) Projects
Overview of the Sustainability Plans of the ICT-29b) Projects
PretaLLOD
 
Validating the OntoLex-lemon lexicography module with K Dictionaries’ multili...
Validating the OntoLex-lemon lexicography module with K Dictionaries’ multili...Validating the OntoLex-lemon lexicography module with K Dictionaries’ multili...
Validating the OntoLex-lemon lexicography module with K Dictionaries’ multili...
PretaLLOD
 
lexicog: Overview of the New Module for Lexicography of OntoLex-lemon
lexicog: Overview of the New Module for Lexicography of OntoLex-lemonlexicog: Overview of the New Module for Lexicography of OntoLex-lemon
lexicog: Overview of the New Module for Lexicography of OntoLex-lemon
PretaLLOD
 
ELSE IF 2019: Language Technology Market: State-of-the-Art, Trends and Value ...
ELSE IF 2019: Language Technology Market: State-of-the-Art, Trends and Value ...ELSE IF 2019: Language Technology Market: State-of-the-Art, Trends and Value ...
ELSE IF 2019: Language Technology Market: State-of-the-Art, Trends and Value ...
PretaLLOD
 
ELSE IF 2019: Porting the xEBR Taxonomy to a Linked Open Data compliant Format
ELSE IF 2019: Porting the xEBR Taxonomy to a Linked Open Data compliant FormatELSE IF 2019: Porting the xEBR Taxonomy to a Linked Open Data compliant Format
ELSE IF 2019: Porting the xEBR Taxonomy to a Linked Open Data compliant Format
PretaLLOD
 
ELSE IF 2019: What’s next for Multilingual Europe?
ELSE IF 2019: What’s next for Multilingual Europe?ELSE IF 2019: What’s next for Multilingual Europe?
ELSE IF 2019: What’s next for Multilingual Europe?
PretaLLOD
 
ELSE IF 2019: Multilingual Text Analytics for Extracting Pharma Real-World Ev...
ELSE IF 2019: Multilingual Text Analytics for Extracting Pharma Real-World Ev...ELSE IF 2019: Multilingual Text Analytics for Extracting Pharma Real-World Ev...
ELSE IF 2019: Multilingual Text Analytics for Extracting Pharma Real-World Ev...
PretaLLOD
 
Language technology market and components taxonomy
Language technology market and components taxonomyLanguage technology market and components taxonomy
Language technology market and components taxonomy
PretaLLOD
 

More from PretaLLOD (13)

Dfki Linghub presentation
Dfki Linghub presentationDfki Linghub presentation
Dfki Linghub presentation
 
Towards the Detection and Formal Representation of Semantic Shifts in Inflect...
Towards the Detection and Formal Representation of Semantic Shifts in Inflect...Towards the Detection and Formal Representation of Semantic Shifts in Inflect...
Towards the Detection and Formal Representation of Semantic Shifts in Inflect...
 
Wordnets and TEI-LEX
Wordnets and TEI-LEXWordnets and TEI-LEX
Wordnets and TEI-LEX
 
OntoLex-Lemon as a Possible Bridge between WordNets and Full Lexical Descript...
OntoLex-Lemon as a Possible Bridge between WordNets and Full Lexical Descript...OntoLex-Lemon as a Possible Bridge between WordNets and Full Lexical Descript...
OntoLex-Lemon as a Possible Bridge between WordNets and Full Lexical Descript...
 
OntoLex-TEI: Inspiration from Global WordNet
OntoLex-TEI: Inspiration from Global WordNetOntoLex-TEI: Inspiration from Global WordNet
OntoLex-TEI: Inspiration from Global WordNet
 
Overview of the Sustainability Plans of the ICT-29b) Projects
Overview of the Sustainability Plans of the ICT-29b) ProjectsOverview of the Sustainability Plans of the ICT-29b) Projects
Overview of the Sustainability Plans of the ICT-29b) Projects
 
Validating the OntoLex-lemon lexicography module with K Dictionaries’ multili...
Validating the OntoLex-lemon lexicography module with K Dictionaries’ multili...Validating the OntoLex-lemon lexicography module with K Dictionaries’ multili...
Validating the OntoLex-lemon lexicography module with K Dictionaries’ multili...
 
lexicog: Overview of the New Module for Lexicography of OntoLex-lemon
lexicog: Overview of the New Module for Lexicography of OntoLex-lemonlexicog: Overview of the New Module for Lexicography of OntoLex-lemon
lexicog: Overview of the New Module for Lexicography of OntoLex-lemon
 
ELSE IF 2019: Language Technology Market: State-of-the-Art, Trends and Value ...
ELSE IF 2019: Language Technology Market: State-of-the-Art, Trends and Value ...ELSE IF 2019: Language Technology Market: State-of-the-Art, Trends and Value ...
ELSE IF 2019: Language Technology Market: State-of-the-Art, Trends and Value ...
 
ELSE IF 2019: Porting the xEBR Taxonomy to a Linked Open Data compliant Format
ELSE IF 2019: Porting the xEBR Taxonomy to a Linked Open Data compliant FormatELSE IF 2019: Porting the xEBR Taxonomy to a Linked Open Data compliant Format
ELSE IF 2019: Porting the xEBR Taxonomy to a Linked Open Data compliant Format
 
ELSE IF 2019: What’s next for Multilingual Europe?
ELSE IF 2019: What’s next for Multilingual Europe?ELSE IF 2019: What’s next for Multilingual Europe?
ELSE IF 2019: What’s next for Multilingual Europe?
 
ELSE IF 2019: Multilingual Text Analytics for Extracting Pharma Real-World Ev...
ELSE IF 2019: Multilingual Text Analytics for Extracting Pharma Real-World Ev...ELSE IF 2019: Multilingual Text Analytics for Extracting Pharma Real-World Ev...
ELSE IF 2019: Multilingual Text Analytics for Extracting Pharma Real-World Ev...
 
Language technology market and components taxonomy
Language technology market and components taxonomyLanguage technology market and components taxonomy
Language technology market and components taxonomy
 

Recently uploaded

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 

Linked Open Data Cloud

  • 1. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 731015. John P. McCrae1, Thierry Declerck2 Linked Open Data Cloud 1Insight Centre for Data Analytics, National University of Ireland Galway 2Austrian Centre for Digital Humanities
  • 2. This project has received funding from the European Union’s Horizon 2020 research and innovation
  • 3. Hitchhiker’s guide to the Linked Open Data Cloud
  • 4. DBpedia • Largest node in the linked open data cloud • “Nucleus for a web of open data” • Most data is derived by parsing Wikipedia • E.g., https://en.wikipedia.org/wiki/C++ • => http://dbpedia.org/resource/C++ • Uses transparent content negotiation
  • 5. This project has received funding from the European Union’s Horizon 2020 research and innovation Transparent content negotiationI want to know about “C++” and I understand RDF and HTML curl -H "Accept: application/rdf+xml;text/html" -I http://dbpedia.org/resource/C++ HTTP/1.1 303 See Other ... Location: http://dbpedia.org/data/C++.xml ... Go to this location for the RDF/XML version
  • 6. This project has received funding from the European Union’s Horizon 2020 research and innovation Transparent content negotiationI want to know about “C++” and I only know HTML curl -H "Accept: text/html" -I http://dbpedia.org/resource/C++ HTTP/1.1 303 See Other ... Location: http://dbpedia.org/page/C++ ... Go to this location for the HTML version Use /resource/ URL to refer to concept
  • 7. This project has received funding from the European Union’s Horizon 2020 research and innovation DBpedia Pages Links to other resources Ontology properties
  • 8. This project has received funding from the European Union’s Horizon 2020 research and innovation DBpedia ontologyAxioms Labels External Links
  • 9. This project has received funding from the European Union’s Horizon 2020 research and innovation WikiData RDF Version: https://www.wikidata.org/entity/Q2407
  • 10. BabelNet Dictionary compiled from • Wikipedia • (Open Mulitlingual) WordNet • Wiktionary • OmegaWiki • WikiData
  • 11. LexVo • Assigns URIs to words (strings in a language) • Contains links to WordNet, FrameNet etc • Definitions of ISO Language Codes
  • 12. Domain datasets 100’s of domain specific datasets
  • 13. This project has received funding from the European Union’s Horizon 2020 research and innovation LexInfo LexVo BabelNet DBpedia
  • 15. Why reuse URIs • Data interoperability • Queries work over multiple datasets • Semantic definitions allows alignments to be reasoned • (Often) the creators of the URIs have good idea on how data should be structured
  • 16. Challenges of interoperability Susanne Penn The AT DT Fulton NP1s NNP County NNL1cb NNP Grand JJ NNP Jury NN1c Differences in Granularity Linguistic Differences
  • 17. Language codes fr en de thProblem: 7,000+ languages and more dialects, but only 262=676 codes br? br = Breton
  • 18. ISO Language Codes fr-CA Québécois ang Anglo-Saxon pms Piedmontese 3-Letter codes with region cover minority, historical languages, right?
  • 19. Variability How to tag this talk? • en? • en-Latn? (As it is not written in Cyrillic) • en-LV or en-428 (As it is presented in Latvia) • en-GB or en-826 (As is is composed in British English) • en-Latn-GB?Region subtags are used to indicate linguistic variations associated with or appropriate to a specific country, territory, or region. Typically, a region subtag is used to indicate variations such as regional dialects or usage, or region-specific spelling conventions. It can also be used to indicate that content is expressed in a way that is appropriate for use throughout a region, for instance, Spanish content tailored to be useful throughout Latin America. -- RFC 5646
  • 20. Glottolog • Identifies languoids (language varieties) • Uses URLs http://glottolog.org/resource/languoid/id/queb1247 • More information can be found by following the link
  • 22. ISOcat • Effort to standardize linguistic vocabulary from ISO Technical Committee • Standardized “Data Categories” in a “Registry” • Discontinued in December 2014
  • 23. Problems with ISOcat According to Schuurman et al. • Too easy to get a login • “Out-of-control” • Entries were copies of other entries • “People sometimes copied an entry, just in order to make sure the original owner would not change the entry without them knowing it” • Complexity - Too many obligatory and overly technical fields As an alternative the CLARIN concept registry is (still) being introduced. I. Schuurman, M. Windhouwer, O. Ohren, D. Zeman, CLARIN Concept Registry: The new semantic registry, in CLARIN 2015 Selected Papers (2015), pp. 62–70
  • 24. LexInfo • Ontology for “associat[ing] linguistic information with respect to any level of linguistic description and expressivity to elements in an ontology” • Expands OntoLex-Lemon with a set of general categories
  • 25. LexInfo - Properties and Values Properties and open-world (non-exhaustive) list of values
  • 26. LexInfo - Verb Frames Verb frames with formal definitions
  • 27. LexInfo - Arguments Hierarchies of arguments to be used in the frames
  • 28. OLiA Ontologies of Linguistic Annotation • Modular architecture for describing annotation schemes: • Reference Model: Common terminology (similar to LexInfo) • Annotation Model: Describes a particular annotation scheme • Linking Model: Describes the linking between the reference and annotation
  • 29. GOLD - General Ontology Linguistic Description • Quite popular • Defines many terms • ‘Loose’ semantics • Sometimes has range and domains on properties • Not clear how this fits together
  • 30. Submitting to the LOD Cloud
  • 31. This project has received funding from the European Union’s Horizon 2020 research and innovation Go to lod-cloud.net
  • 32. This project has received funding from the European Union’s Horizon 2020 research and innovation Fill in the form
  • 33. Fields • Identifier • unique alphanumeric string • Title • Full name in English • Description • 2-10 sentence description in English • Full Download • A link to the complete dataset, ideally as compressed N- Triples • SPARQL Endpoint • If available • Other Download • Other formats for download or partial downloads
  • 34. Fields (2) • Example • A single resource that resolves • Keywords • Domain • Defines the colour in the diagram • Website • Contact Point • Links • Number of triples linking to another dataset in the cloud • Size • Number of triples in this dataset • Namespace, DOI, Image (if desired)
  • 37. Service-oriented architectures • It is implemented a self-contained operation unit. • It is a black box for its consumers, which only need to know the interface, not the implementation. • It may consist of other underlying services. Interoperability is a significant challenge here
  • 38. Service chains Translation DE => EN Parser (EN) Sentiment Analysis (EN) Often tricky to do in practice!
  • 39. Issues with service chains • Services are often components of pipelines without clear usage to the end user • The technology readiness level of services is often quite low, with little documentation or graphical user interface, • Services are hard to install often requiring compiling from source or specialized libraries not found in major software repositories.
  • 40. Teanga • RDF and Linked Data to provide service interoperability • Docker to enable easy install and usage • Attractive Web Front-End (Bootstrap, AngularJS, NodeJS) • Graceful control of errors
  • 41. This project has received funding from the European Union’s Horizon 2020 research and innovation
  • 42. This project has received funding from the European Union’s Horizon 2020 research and innovation
  • 43. This project has received funding from the European Union’s Horizon 2020 research and innovation
  • 44. LAPPS Grid Defines key vocabularies for service interoperability • LAPPS Interchange Format (JSON-LD) • Web Service Exchange Vocabulary • Human-in-the-loop workflow construction using Galaxy
  • 45. This project has received funding from the European Union’s Horizon 2020 research and innovation
  • 47. Summary • Linked Open Data Cloud • Big • Many relevant tools • Fragmented • Interoperability is less terrible than other systems
  • 48. Thanks. This publication has emanated from research supported in part by a research grant from Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289, co-funded by the European Regional Development Fund