Web open standards for linked data and knowledge graphs as enablers of EU digital sovereignty
ENDORSE Keynote by Fabien GANDON, 19/03/2021
https://op.europa.eu/en/web/endorse
This course is a quick overview of the fundamentals of graph databases and graph queries, with a focus on RDF and SPARQL. It includes both simple and challenging hands-on exercises to practice and test your understanding.
The material for this course can be downloaded from the following link: https://github.com/paolo7/Introduction-to-Graph-Databases
One Web of pages, One Web of peoples, One Web of Services, One Web of Data, One Web of Things… - Fabien Gandon
Keynote Fabien GANDON, at WIM2016: One Web of pages, One Web of peoples, One Web of Services, One Web of Data, One Web of Things…and with the Semantic Web bind them.
JURIX talk on representing and reasoning on the deontic aspects of normative rules relying only on standard Semantic Web languages.
The corresponding paper is at https://hal.inria.fr/hal-01643769v1
Wimmics Research Team 2015 Activity Report - Fabien Gandon
Extract of the activity report of the Wimmics joint research team between Inria Sophia Antipolis - Méditerranée and I3S (CNRS and Université Nice Sophia Antipolis). Wimmics stands for web-instrumented man-machine interactions, communities and semantics. The team focuses on bridging social semantics and formal semantics on the web.
On the ontological necessity of the multidisciplinary development of the web - Fabien Gandon
Talk on the ontological necessity of the multidisciplinary development of the web at the panel CLOSER/WEBIST 2014 on "social, political and economic implications of cloud and web"
How to Build Linked Data Sites with Drupal 7 and RDFa - scorlosquet
Slides of the tutorial Stéphane Corlosquet, Lin Clark and Alexandre Passant presented at SemTech 2010 in San Francisco http://semtech2010.semanticuniverse.com/sessionPop.cfm?confid=42&proposalid=2889
Talk about Exploring the Semantic Web, and particularly Linked Data, and the Rhizomer approach. Presented August 14th 2012 at the SRI AIC Seminar Series, Menlo Park, CA
DISIT Lab overview: smart city, big data, semantic computing, cloud - Paolo Nesi
Smart City
• Projects: http://www.disit.org/5501
– Sii-Mobility, http://www.sii-mobility.org
– Service Map: http://servicemap.disit.org
– Social Innovation: Coll@bora http://www.disit.org/5479
– Navigation Indoor/outdoor: Mobile Emergency http://www.disit.org/5404
– Mobility and Transport: TRACE-IT, RAISSS, TESYSRAIL
• Tools: http://www.disit.org/5489
– Data gathering, data mining and reconciliation
– Data reasoning, deduction, prediction
– Smart city ontology and reasoning tools
– Service analysis and recommendations
– Autonomous train operator, train signaling
– Risk analysis, decision support systems
– Mobile Applications
Data Analytics - Big data
• Projects: http://www.disit.org/5501
– Linked Open Graph: http://LOG.disit.org
– Sii-Mobility, http://www.sii-mobility.org
– Service on a number of projects
• Tools: http://www.disit.org/5489
– Open data and Linked Open Data
– LOG LOD service and tools
– Data mining and reconciliation
– Data reasoning, deduction, prediction, decision support
– SN Analysis and recommendations
– User behavior monitoring and analysis
Smart Cloud - Computing
• Projects: http://www.disit.org/5501
– ICARO: http://www.disit.org/5482
– Cloud ontology: http://www.disit.org/5604
– Cloud simulator:
– Smart Cloud: http://www.disit.org/6544
• Tools: http://www.disit.org/5489
– Cloud Monitoring
– Smart Cloud Engine and reasoner,
– Service Level Analyzer and control
– Configuration analysis and checker
– Cloud Simulation
Text and Web Mining
• Projects: http://www.disit.org/5501
– OSIM: http://www.disit.org/5482
– SACVAR: http://www.disit.org/5604
– Blog/Twitter Vigilance
• Tools: http://www.disit.org/5489
– Text and web mining, Natural Language Processing
– Service localization
– Web Crawling
– Competence analysis
– Blog Vigiliance, sentiment analysis
Social Media and e-Learning
• Projects: http://www.disit.org/5501
– ECLAP, http://www.eclap.eu
– ApreToscana: http://www.apretoscana.org
– Others: AXMEDIS, VARIAZIONI, SMNET, etc.
– Samsung Smart TV: http://www.disit.org/6534
• Tools: http://www.disit.org/5489
– XLMS, Cross Media Learning System
– IPR and content protection and distribution
– Mobile and SmartTv Applications
– Suggestions and recommendations
– Matchmaking solutions
– Media Tools for cross media content
Mobile Computing
• Projects:
– ECLAP: http://www.eclap.eu
– Mobile Medicine: http://mobmed.axmedis.org
– Mobile Emergency: http://www.disit.org/5500
– Smart City, FODD 2015: http://www.disit.org/6593
– Resolute: Mobiles as sensors
• Tools and support:
– Content distribution: e-learning
– Integrated Indoor/outdoor navigation
– User networking and collaboration
– Service localization
– Smart city and services
– OS: iOS, Android, Windows Phone, etc.
– Tech: IoT, iBeacons, NFC, QR, …
presented at WORKS 2021
https://works-workshop.org/
16th Workshop on Workflows in Support of Large-Scale Science
November 15, 2021
Held in conjunction with SC21: The International Conference for High Performance Computing, Networking, Storage and Analysis
RDMkit, a Research Data Management Toolkit. Built by the Community for the ... - Carole Goble
https://datascience.nih.gov/news/march-data-sharing-and-reuse-seminar 11 March 2022
Starting in 2023, the US National Institutes of Health (NIH) will require institutes and researchers receiving funding to include a Data Management Plan (DMP) in their grant applications, including making their data publicly available. Similar mandates are already in place in Europe; for example, a DMP is mandatory in Horizon Europe projects involving data.
Policy is one thing; practice is quite another. How do we provide the necessary information, guidance and advice for our bioscientists, researchers, data stewards and project managers? There are numerous repositories and standards. Which is best? What are the challenges at each step of the data lifecycle? How should different types of data be handled? What tools are available? Research Data Management advice is often too general to be useful, and specific information is fragmented and hard to find.
ELIXIR, the pan-national European Research Infrastructure for Life Science data, aims to enable research projects to operate “FAIR data first”. ELIXIR supports researchers across their whole RDM lifecycle, navigating the complexity of a data ecosystem that bridges from local cyberinfrastructures to pan-national archives and across bio-domains.
The ELIXIR RDMkit (https://rdmkit.elixir-europe.org) is a toolkit built by the biosciences community, for the biosciences community, to provide the RDM information they need. It is a framework for advice and best practice for RDM and acts as a hub of RDM information, with links to tool registries, training materials, standards, and databases, and to services that offer deeper knowledge for DMP planning and FAIR-ification practices.
Since its launch in March 2021, over 120 contributors have provided nearly 100 pages of content and links to more than 300 tools. Content covers the data lifecycle and specialized domains in biology, national considerations and examples of "tool assemblies" developed to support RDM. It has been accessed from over 123 countries, and at the top of the access list is … the United States.
The RDMkit is already a recommended resource of the European Commission. The platform, editorial, and contributor methods helped build a specialized sister toolkit for infectious diseases as part of the recently launched BY-COVID project. The toolkit’s platform is the simplest we could manage - built on plain GitHub - and the whole development and contribution approach tailored to be as lightweight and sustainable as possible.
In this talk, Carole and Frederik will present the RDMkit; aims and context, content, community management, how folks can contribute, and our future plans and potential prospects for trans-Atlantic cooperation.
Data policy must be partnered with data practice. Our researchers need to be the best informed in order to meet these new data management and data sharing mandates.
COMBINE 2019, EU-STANDS4PM, Heidelberg, Germany 18 July 2019
FAIR: Findable Accessable Interoperable Reusable. The “FAIR Principles” for research data, software, computational workflows, scripts, or any other kind of Research Object one can think of, is now a mantra; a method; a meme; a myth; a mystery. FAIR is about supporting and tracking the flow and availability of data across research organisations and the portability and sustainability of processing methods to enable transparent and reproducible results. All this is within the context of a bottom up society of collaborating (or burdened?) scientists, a top down collective of compliance-focused funders and policy makers and an in-the-middle posse of e-infrastructure providers.
Making the FAIR principles a reality is tricky. They are aspirations not standards. They are multi-dimensional and dependent on context such as the sensitivity and availability of the data and methods. We already see a jungle of projects, initiatives and programmes wrestling with the challenges. FAIR efforts have particularly focused on the “last mile” – “FAIRifying” destination community archive repositories and measuring their “compliance” to FAIR metrics (or less controversially “indicators”). But what about FAIR at the first mile, at source and how do we help Alice and Bob with their (secure) data management? If we tackle the FAIR first and last mile, what about the FAIR middle? What about FAIR beyond just data – like exchanging and reusing pipelines for precision medicine?
Since 2008 the FAIRDOM collaboration [1] has worked on FAIR asset management and the development of a FAIR asset Commons for multi-partner researcher projects [2], initially in the Systems Biology field. Since 2016 we have been working with the BioCompute Object Partnership [3] on standardising computational records of HTS precision medicine pipelines.
So, using our FAIRDOM and BioCompute Object binoculars let’s go on a FAIR safari! Let’s peruse the ecosystem, observe the different herds and reflect what where we are for FAIR personalised medicine.
References
[1] http://www.fair-dom.org
[2] http://www.fairdomhub.org
[3] http://www.biocomputeobject.org
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
https://ucsb.zoom.us/meeting/register/tZYod-ippz4pHtaJ0d3ERPIFy2QIvKqjwpXR
FAIRy stories: the FAIR Data principles in theory and in practice
The ‘FAIR Guiding Principles for scientific data management and stewardship’ [1] launched a global dialogue within research and policy communities and started a journey to wider accessibility and reusability of data and preparedness for automation-readiness (I am one of the army of authors). Over the past 5 years FAIR has become a movement, a mantra and a methodology for scientific research and increasingly in the commercial and public sector. FAIR is now part of NIH, European Commission and OECD policy. But just figuring out what the FAIR principles really mean and how we implement them has proved more challenging than one might have guessed. To quote the novelist Rick Riordan “Fairness does not mean everyone gets the same. Fairness means everyone gets what they need”.
As a data infrastructure wrangler I lead and participate in projects implementing forms of FAIR in pan-national European biomedical Research Infrastructures. We apply web-based industry-lead approaches like Schema.org; work with big pharma on specialised FAIRification pipelines for legacy data; promote FAIR by Design methodologies and platforms into the researcher lab; and expand the principles of FAIR beyond data to computational workflows and digital objects. Many use Linked Data approaches.
In this talk I’ll use some of these projects to shine some light on the FAIR movement. Spoiler alert: although there are technical issues, the greatest challenges are social. FAIR is a team sport. Knowledge Graphs play a role – not just as consumers of FAIR data but as active contributors. To paraphrase another novelist, “It is a truth universally acknowledged that a Knowledge Graph must be in want of FAIR data.”
[1] Wilkinson, M., Dumontier, M., Aalbersberg, I. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016). https://doi.org/10.1038/sdata.2016.18
Data management plans – EUDAT Best practices and case study | www.eudat.euEUDAT
| www.eudat.eu | Presentation given by Stéphane Coutin during the PRACE 2017 Spring School joint training event with the EU H2020 VI-SEEM project (https://vi-seem.eu/) organised by CaSToRC at The Cyprus Institute. Science and more specifically projects using HPC is facing a digital data explosion. Instruments and simulations are producing more and more volume; data can be shared, mined, cited, preserved… They are a great asset, but they are facing risks: we can miss storage, we can lose them, they can be misused,… To start this session, we will review why it is important to manage research data and how to do this by maintaining a Data Management Plan. This will be based on the best practices from EUDAT H2020 project and European Commission recommendation. During the second part we will interactively draft a DMP for a given use case.
Ontology Building vs Data Harvesting and Cleaning for Smart-city ServicesPaolo Nesi
Presently, a very large number of public and private data sets are available around the local governments. In most cases, they are not semantically interoperable and a huge human effort is needed to create integrated ontologies and knowledge base for smart city. Smart City ontology is not yet standardized, and a lot of research work is needed to identify models that can easily support the data reconciliation, the management of the complexity and reasoning. In this paper, a system for data ingestion and reconciliation of smart cities related aspects as road graph, services available on the roads, traffic sensors etc., is proposed. The system allows managing a big volume of data coming from a variety of sources considering both static and dynamic data. These data are mapped to smart-city ontology and stored into an RDF-Store where they are available for applications via SPARQL queries to provide new services to the users. The paper presents the process adopted to produce the ontology and the knowledge base and the mechanisms adopted for the verification, reconciliation and validation. Some examples about the possible usage of the coherent knowledge base produced are also offered and are accessible from the RDF-Store and related services. The article also presented the work performed about reconciliation algorithms and their comparative assessment and selection. Keywords Smart city, knowledge base construction, reconciliation, validation and verification of knowledge base, smart city ontology, linked open graph.
FAIR Data
Principles
FAIR vs Open Data
Implementing FAIR & FAIRmetrics
FAIRness of ASIO-HERCULES
Research Objects
Definition
The RO-Crate standard
Usage examples
Make our Scientific Datasets Accessible and Interoperable on the Web - Franck Michel
The presentation investigates the challenges that we must face to share scientific datasets on the Web following the Linked Open Data principles. We present the standards of the Semantic Web and investigate how they can help address those challenges. We give tips as to how to choose vocabularies to describe data and metadata, link datasets to other related datasets by making appropriate alignments, translate existing data sources to RDF and publish it on the Web as linked data.
This is one of a series of presentations I gave during a recent trip to the United States. I will make them all public, but the content does not vary much between some of them.
Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK - NeISS Project
Infrastructures Supporting Inter-disciplinary Research - Exemplars from the UK . Talk given by Richard Sinnott at Urban Research Infrastructure Network Workshops, Melbourne, Brisbane, Sydney, September 2010.
Walking Our Way to the Web - Fabien Gandon
The Web: Scientific Creativity, Technological Innovation and Society
XXVIII Conference on Contemporary Philosophy and Methodology of Science
9 and 10 March 2023
University of A Coruña
The prospect of Walking our Way to the Web may sound strange to contemporary readers of this article, for whom the Web is omnipresent. However, the slogan of the World Wide Web Consortium (W3C) has been, for years, and remains today, to lead "the Web to its full potential", meaning we haven't reached that potential yet, whatever it is. The first architect of the Web himself, Tim Berners-Lee, said in an interview in 2009: "The Web as I envisaged it, we have not seen it yet. The future is still so much bigger than the past". And he is still very active, together with the W3C members and Web experts worldwide, in proposing evolutions of the Web architecture to improve its growing usages and applications. In this article we will review the path that led us to the Web of today, the shape it is taking now and the possible evolutions, good and bad, we can identify. This will lead us to consider the distance between the initial vision and the reality of the Web today, and to reflect on the possible divergence between the potential we see in the Web and the directions it could take. Our goal in this article is to reflect on how we could walk the delicate path to the full potential of the Web, finding the missing links and avoiding the one too many links.
A shift in our research focus: from knowledge acquisition to knowledge augmentation - Fabien Gandon
EKAW 2022 keynote by Fabien GANDON: "a shift in our research focus: from knowledge acquisition to knowledge augmentation"
While EKAW started in 1987 as the European Knowledge Acquisition Workshop, in 2000 it transformed into a conference where we advance knowledge engineering and modelling in general. At the time, this transition also echoed shifts of focus such as moving from the paradigm of expert systems to the more encompassing one of knowledge-based systems. Nowadays, with the current strong interest for knowledge graphs, it is important again to reaffirm that our ultimate goal is not the acquisition of bigger siloed knowledge bases but to support knowledge requisition by and for all kinds of intelligence. Knowledge without intelligence is a highly perishable resource. Intelligence without knowledge is doomed to stagnation. We will defend that intelligence and knowledge, and their evolutions, have to be considered jointly and that the Web is providing a social hypermedia to link them in all their forms. Using examples from several projects, we will suggest that, just like intelligence augmentation and amplification insist on putting humans at the center of the design of artificial intelligence methods, we should think in terms of knowledge augmentation and amplification and we should design a knowledge web to be an enabler of the futures we want.
A Never-Ending Project for Humanity Called "the Web"
Fabien Gandon, Wendy Hall
https://hal.inria.fr/WIMMICS/hal-03633526
In this paper we summarize the main historical steps in the making of the Web, its foundational principles and its evolution. First we mention some of the influences and streams of thought that interacted to bring the Web about. Then we recall that its birthplace, CERN, had a need for a global hypertext system and at the same time was the perfect microcosm to provide a cradle for the Web. We stress how this invention required striking a balance between integrating and departing from the existing and emerging paradigms of the day. We then review the pillars of the Web architecture and the features that made the Web so viral compared to its competitors. Finally we survey the multiple mutations the Web underwent as soon as it was born, evolving in multiple directions. We conclude that the Web is now an architecture, an artefact, a science object, and a research and development object, and that we haven't seen its full potential yet.
CovidOnTheWeb : covid19 linked data published on the WebFabien Gandon
The Covid-on-the-Web project aims to allow biomedical researchers to access, query and make sense of COVID-19 related literature. To do so, it adapts, combines and extends tools to process, analyze and enrich the "COVID-19 Open Research Dataset" (CORD-19) that gathers 50,000+ full-text scientific articles related to the coronaviruses. We report on the RDF dataset and software resources produced in this project by leveraging skills in knowledge representation, text, data and argument mining, as well as data visualization and exploration. The dataset comprises two main knowledge graphs describing (1) named entities mentioned in the CORD-19 corpus and linked to DBpedia, Wikidata and other BioPortal vocabularies, and (2) arguments extracted using ACTA, a tool automating the extraction and visualization of argumentative graphs, meant to help clinicians analyze clinical trials and make decisions. On top of this dataset, we provide several visualization and exploration tools based on the Corese Semantic Web platform, MGExplorer visualization library, as well as the Jupyter Notebook technology. All along this initiative, we have been engaged in discussions with healthcare and medical research institutes to align our approach with the actual needs of the biomedical community, and we have paid particular attention to comply with the open and reproducible science goals, and the FAIR principles.
from linked data & knowledge graphs to linked intelligence & intelligence graphsFabien Gandon
ISWC Vision track talk "from linked data & knowledge graphs to linked intelligence & intelligence graphs or the potential of the semantic Web to break the walls between semantic networks and computational networks"
Feedback on the MOOC "Web Sémantique et Web de données" - Fabien Gandon
Presentation of the characteristics and results of the first session, in 2015, of the MOOC "Web Sémantique et Web de données" by Inria, Université de Nice, FUN and UNIT.
We regularly read that the Web is revolutionizing our world and driving change in every dimension of our society. But the Web itself, its uses and our understanding of it have not stopped evolving since the proposal that led to its creation in 1989. It is a space in perpetual re-creation that constantly demands new explorations and reconsiderations. We will look together at some of these past, present and future changes of the Web, emphasizing the complexity of this artefact, which makes it an object of multidisciplinary research.
On Youtube: https://youtu.be/jNjHdqS-1Ko
Données de la culture et culture des données - Fabien Gandon
Presentation "Données de la culture et culture des données" ("Culture data and data culture"), on the Semantic Web and linked data on the web in the cultural domain, given at the conference "Transmettre la culture à l'ère du numérique" as part of the Automne Numérique program of the French Ministry of Culture and Communication.
The video of the talk is here:
http://www.dailymotion.com/video/x17i1g6_conference-transmettre-la-culture-a-l-age-du-numerique-fabien-gandon_tech
Techniques to optimize the PageRank algorithm usually fall into two categories. One is to reduce the work per iteration, and the other is to reduce the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged can save iteration time. Skipping in-identical vertices, those with the same in-links, helps avoid duplicate computations and can thus also reduce iteration time. Road networks often have chains which can be short-circuited before the PageRank computation to improve performance, since the final ranks of chain nodes can be easily calculated; this can reduce both the iteration time and the number of iterations. If a graph has no dangling nodes, the PageRank of each strongly connected component can be computed in topological order. This can reduce the iteration time and the number of iterations, and also enables multi-iteration concurrency in the PageRank computation. The combination of all of the above methods is the STICD algorithm [sticd]. For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
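The first of these techniques, skipping vertices whose rank has already converged, can be sketched as follows. This is an illustrative toy, not the STICD implementation; it assumes the graph has no dangling nodes, and it uses the naive per-vertex convergence test (strictly, a vertex's rank is only final once its in-neighbours have also stabilized, which STICD guarantees by processing strongly connected components in topological order).

```python
# Toy power-iteration PageRank that stops recomputing vertices whose
# rank change has dropped below the tolerance. Assumes every node is a
# key of `graph` and has at least one out-link (no dangling nodes).

def pagerank(graph, d=0.85, tol=1e-10, max_iter=100):
    n = len(graph)
    rank = {v: 1.0 / n for v in graph}
    incoming = {v: [] for v in graph}          # reverse adjacency lists
    for u, outs in graph.items():
        for v in outs:
            incoming[v].append(u)
    converged = set()
    for _ in range(max_iter):
        new_rank = {}
        for v in graph:
            if v in converged:                 # skip work for stabilized vertices
                new_rank[v] = rank[v]
                continue
            share = sum(rank[u] / len(graph[u]) for u in incoming[v])
            new_rank[v] = (1.0 - d) / n + d * share
        for v in graph:
            if abs(new_rank[v] - rank[v]) < tol:
                converged.add(v)
        rank = new_rank
        if len(converged) == n:                # every vertex has stabilized
            break
    return rank

# A 3-cycle: by symmetry every node should end up with rank 1/3.
ranks = pagerank({"a": ["b"], "b": ["c"], "c": ["a"]})
```

The skip test here is the simple heuristic the text describes; a production implementation would combine it with the SCC ordering so that a vertex is only frozen once its upstream components are final.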
Opendatabay - Open Data Marketplace.pptx - Opendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
The first open hub for data enthusiasts to collaborate and innovate: a platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, Opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. It leverages cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay's AI-driven features streamline the data workflow. Finding the data you need shouldn't be complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with dedicated, AI-generated synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits, Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. The marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23... - John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos with a distributed data ownership model that assigns clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Web open standards for linked data and knowledge graphs as enablers of EU digital sovereignty
1. Web open standards for linked data and knowledge graphs as enablers of EU digital sovereignty
Fabien Gandon, http://fabien.info
2. PROFILE
Graduated Engineer INSA Applied Math, DEA/Master Image & Vision
PhD & HDR (Habilitation) in computer science
Research Director / Senior researcher, INRIA
Leader Wimmics (UCA, Inria, CNRS, I3S) on Campus Sophia Antipolis
Advisory Committee of W3C
Responsible for the research convention between the French Ministry of Culture and Inria
Vice-head of Science for Inria Sophia Antipolis
3. WIMMICS TEAM
DR/Professors:
Fabien GANDON, Inria, AI, KRR, Semantic Web, Social Web, K. Graphs
Nhan LE THANH, UCA, Logics, KR, Emotions, Workflows, K. Graphs
Peter SANDER, UCA, Web, Emotions
Andrea TETTAMANZI, UCA, AI, Logics, Evo, Learning, Agents, K. Graphs
Marco WINCKLER, UCA, Human-Computer Interaction, Web, K. Graphs
CR/Assistant Professors:
Michel BUFFA, UCA, Web, Social Media, Web Audio, K. Graphs
Elena CABRIO, UCA, NLP, KR, Linguistics, Q&A, Text Mining, K. Graphs
Olivier CORBY, Inria, KR, AI, Sem. Web, Programming, K. Graphs
Catherine FARON-ZUCKER, UCA, KR, AI, Semantic Web, K. Graphs
Damien GRAUX, Inria, Linked Data, Sem. Web, Querying, K. Graphs
Serena VILLATA, CNRS, AI, Argumentation, Licenses, Rights, K. Graphs
Research engineer: Franck MICHEL, CNRS, Linked Data, Integration, DB, K. Graphs
External:
Andrei Ciortea (University of St. Gallen) Agents, WoT, Sem. Web, K. Graphs
Nicolas DELAFORGE (Mnemotix) Sem. Web, KM, Integration, K. Graphs
Alain GIBOIN, (Retired CR Inria), Interaction Design, KE, User & Task, K. Graphs
Freddy LECUE (Thales, Montreal) AI, Logics, Mining, Big Data, S. Web, K. Graphs
4. URI, IRI, URL, HTTP URI
STANDARDS FOR DATA & KNOWLEDGE GRAPHS ON THE WEB
JSON, RDF, JSON-LD, N-Triples, N-Quads, Turtle/N3, TriG, RDFS, OWL, SPARQL, XML, HTML, RDF/XML, HTTP, Linked Data, CSV-LD, R2RML, GRDDL, RDFa, SHACL, LDP
6. World Wide Web Consortium
an international community leading the Web to its full potential since 1994
i.e. building an open, interoperable Web that works for everyone,
by developing freely available and open standards for it.
In 2016, Tim Berners-Lee received the
Turing Award for his invention of the Web
7. World Wide Web Consortium
Over 430 member organizations around the world
The not-for-profit organization's staff of 50 is supported by membership dues
Over 12,000 developers worldwide
38 working groups + 10 interest groups
+ 350 business groups and community groups
Hundreds of open technologies that power browsers, smart phones, ebook readers, set-top boxes, automobiles, search engines, social media, trillions of dollars of online commerce, and more than a billion Web sites
8. for instance…
examples of former or current members: [member logos]
examples of standards: HTML, HTTP, URL, URI, IRI, ATAG, UAAG, WCAG, ARIA, MWBP, EARL, CC/PP, CSS, EXI, Geolocation API, DOM, XForms, GRDDL, InkML, ITS, Ruby Annotation, XHTML, RDFa, EMMA, P3P, MathML, PICS, RIF, SAWSDL, PNG, POWDER, SOAP, WSDL, SVG, TTML, SMIL, RDF, OWL, RDFS, SPARQL, WOFF, WebCGM, XBL, XKMS, XLink, WS-CDL, SKOS, XProc, XML, XML Base, XML Schema, xml:id, XPath, XPointer, XQuery, XML Signature, XSLT, XSL-FO, …
9. (2/8) Web open standards for…
distributed, interoperable hypermedia
11. three components of the Web architecture
1. identification (URI) & address (URL)
ex. http://www.inria.fr
URL
12. three components of the Web architecture
1. identification (URI) & address (URL)
ex. http://www.inria.fr
2. communication / protocol (HTTP)
GET /centre/sophia HTTP/1.1
Host: www.inria.fr
HTTP
URL
address
13. three components of the Web architecture
1. identification (URI) & address (URL)
ex. http://www.inria.fr
2. communication / protocol (HTTP)
GET /centre/sophia HTTP/1.1
Host: www.inria.fr
3. representation language (HTML)
Fabien works at
<a href="http://inria.fr">Inria</a>
WEB = URL (address/reference) + HTTP (communication) + HTML (representation)
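The three components above can be seen directly in how a Web address is taken apart. As a quick illustration (the URL is the one from the slide; the code is just a sketch using Python's standard library):

```python
from urllib.parse import urlsplit

# Take apart the example address from the slide: the scheme selects
# the protocol (HTTP), the network location is the host contacted,
# and the path names the resource asked for in the
# "GET /centre/sophia HTTP/1.1" request line.
parts = urlsplit("http://www.inria.fr/centre/sophia")
print(parts.scheme)  # http
print(parts.netloc)  # www.inria.fr
print(parts.path)    # /centre/sophia
```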
15. (3/8) Web open standards for…
distributed, interoperable identifiers
16. Uniform Resource Locator / Identifier
WEB = URL (address) + HTTP (communication) + HTML (representation)
generalized: WEB = URI (reference) + HTTP (communication) + HTML (representation)
17. identify what exists on the web: http://my-site.fr
identify, on the web, what exists: http://animals.org/this-zebra
18. URIs for everything
• URI for Paris in DBpedia:
http://dbpedia.org/resource/Paris
• URI for name of Victor Hugo in the Library of Congress:
http://id.loc.gov/authorities/names/n79091479
• The MUC18 protein at UniProt
http://www.uniprot.org/uniprot/P43121
• Xavier Dolan in Wikidata
https://www.wikidata.org/wiki/Special:EntityData/Q551861
• The book with doi:10.1007/3-540-45741-0_18
http://dx.doi.org/10.1007/3-540-45741-0_18
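URIs like these become the nodes of knowledge graphs, and a SPARQL basic graph pattern is answered by matching triple patterns containing variables against the stored triples. A minimal sketch of that matching in Python (the mini-graph, prefixes and helper names are illustrative; only the Paris URI comes from the list above):

```python
# Toy triple store: a set of (subject, predicate, object) triples.
# Terms starting with "?" in a pattern are variables, as in SPARQL.
triples = {
    ("http://dbpedia.org/resource/Paris", "dbo:country", "dbr:France"),
    ("http://dbpedia.org/resource/Nice", "dbo:country", "dbr:France"),
    ("dbr:France", "rdf:type", "dbo:Country"),
}

def match(pattern, store):
    """Yield one variable binding per triple matching the pattern."""
    for triple in store:
        binding = {}
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                binding[term] = value
            elif term != value:
                break
        else:
            yield binding

# Analogue of: SELECT ?city WHERE { ?city dbo:country dbr:France }
cities = {b["?city"]
          for b in match(("?city", "dbo:country", "dbr:France"), triples)}
print(sorted(cities))
```

A real SPARQL engine joins several such patterns and adds filters, but the unification step is the same idea.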
28. linked open data(sets) cloud on the Web
[chart: number of linked open datasets on the Web, from the first cloud diagram in May 2007 to well over 1,000 datasets by January 2017]
29. Smarter Cities’ knowledge graphs
IBM Dublin [Lécué et al., 2015] (also for private KGs behind firewalls)
30. (5/8) Web open standards for…
distributed interoperable access
32. DBPEDIA.FR
180 000 000 arcs in an encyclopedic knowledge graph
number of queries per day: 70 000 on average, 2.5 million max
185 377 686 RDF triples extracted and mapped
public dumps, endpoints, interfaces, APIs…
33. COVID LINKED DATA
integrate multiple datasets in heterogeneous formats
perform information extraction, inferences, validation
provide a public end-point and visualization services
[Gandon, Michel, Gazzotti, Mayer, Cabrio, Corby, Menin, Winckler, Villata et al. 2020]
34. (6/8) Web open standards for…
distributed interoperable validation
35. SHACL is a language for describing and validating pieces (shapes) of RDF knowledge graphs
e.g. every Person must have one and only one name
used for validation, description, interaction, integration, code generation, …
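The example constraint ("every Person must have one and only one name") can be paraphrased in plain Python to show what a SHACL engine checks; in actual SHACL it would be a NodeShape targeting Person with sh:minCount 1 and sh:maxCount 1 on the name property. The tiny graph and the ex:… names below are illustrative:

```python
# Illustrative RDF graph as (subject, predicate, object) triples;
# ex:bob violates the constraint because he has no ex:name.
graph = [
    ("ex:alice", "rdf:type", "ex:Person"),
    ("ex:alice", "ex:name", "Alice"),
    ("ex:bob", "rdf:type", "ex:Person"),
]

def check_person_name(triples):
    """Return the Persons that do not have exactly one name (1..1)."""
    persons = [s for s, p, o in triples
               if p == "rdf:type" and o == "ex:Person"]
    return [person for person in persons
            if len([o for s, p, o in triples
                    if s == person and p == "ex:name"]) != 1]

print(check_person_name(graph))  # ['ex:bob']
```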
36. This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement 825619.
ONTOLOGY FOR AI ITSELF
ontology and metadata of AI resources
SHACL to validate these AI4EU RDF graphs
online endpoint http://corese.inria.fr
predefined SPARQL queries, SHACL shapes, display
[Corby et al., 2019]
37. (7/8) Web open standards for…
distributed, interoperable vocabularies
38. RDFS to declare the classes of resources and properties of your knowledge graph and organize their hierarchy
[diagram: Report rdfs:subClassOf Document; author rdfs:subPropertyOf creator; creator links a Document to a Person]
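The hierarchy on this slide (Report as a subclass of Document, author as a subproperty of creator) licenses simple entailments; here is a sketch of that inference step in Python (the ex:… names are illustrative):

```python
# One round of RDFS-style inference: an instance of a subclass is
# also an instance of the superclass, and a subproperty arc implies
# the corresponding superproperty arc.
subclass_of = {"ex:Report": "ex:Document"}
subproperty_of = {"ex:author": "ex:creator"}

def rdfs_infer(triples):
    """Return the graph extended with subclass/subproperty entailments."""
    inferred = set(triples)
    for s, p, o in triples:
        if p == "rdf:type" and o in subclass_of:
            inferred.add((s, "rdf:type", subclass_of[o]))
        if p in subproperty_of:
            inferred.add((s, subproperty_of[p], o))
    return inferred

graph = {("ex:r1", "rdf:type", "ex:Report"),
         ("ex:r1", "ex:author", "ex:alice")}
closure = rdfs_infer(graph)
```

A full RDFS reasoner iterates this to a fixpoint over whole subclass/subproperty chains; one round is enough for the two-level hierarchy of the slide.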
39. OWL in one…
algebraic properties, disjoint properties, qualified cardinality (1..1), individual property negation, chained properties, enumeration, intersection, union, complement, disjunction, restriction, cardinality (1..1), equivalence, value restriction ([>18]), disjoint union, keys, …
40. PREDICT HOSPITALIZATION
Predict hospitalization from physician's records classification
[Gazzotti, Faron et al. 2020]
Example PRIMEGE record:
Sexe: H | Date: 25/04/2012 | Cause: vaccin-antitétanique | CISP2: A44 | ... | History: Appendicite EN CP | Observations: Bon état général - auscult pulm libre; bdc rég sans souffle - tympans ok
PRIMEGE contents (Element: Number):
Patients: 55 823
Consultations: 364 684
Past medical history: 187 290
Biometric data: 293 908
Semiotics: 250 669
Diagnosis: 117 442
Rows of prescribed drugs: 847 422
Symptoms: 23 488
Health care procedures: 11 850
Additional examinations: 871 590
Paramedical prescriptions: 17 222
Observations/notes: 56 143
41. PREDICT HOSPITALIZATION
Predict hospitalization from physician's records classification
Augment records data with Web knowledge graphs (1)
[Gazzotti, Faron et al. 2020]
(same PRIMEGE example record and content table as slide 40)
42. PREDICT HOSPITALIZATION
Predict hospitalization from physician's records classification
Augment records data with Web knowledge graphs (1)
Study impact on prediction (2)
[Gazzotti, Faron et al. 2020]
(same PRIMEGE example record and content table as slide 40)
44. MonaLIA
reason & query on RDF to build training sets (1)
350 000 images of artworks, RDF metadata based on external thesauri
Joconde database from French museums
[Bobasheva et al. 2020]
45. MonaLIA
reason & query on RDF to build training sets (1)
transfer learning & CNN classifiers on targeted categories (topics, techniques, etc.) (2)
350 000 images of artworks, RDF metadata based on external thesauri
Joconde database from French museums
[Bobasheva et al. 2020]
46. MonaLIA
reason & query on RDF to build training sets (1)
transfer learning & CNN classifiers on targeted categories (topics, techniques, etc.) (2)
reason & query RDF of results to address silence, noise and explain (3)
350 000 images of artworks, RDF metadata based on external thesauri
Joconde database from French museums
Examples (Image | Metadata | Score):
C:/Joconde/joconde0138/m503501_d0012455-000_p.jpg | portrait (50350012455) | cheval: 0.999
C:/Joconde/joconde0355/m079806_bsa0030101_p.jpg | figure (saint Eloi de Noyon, évêque, en pied, bénédiction, vêtement liturgique, mitre, attribut, cheval, marteau, outil : ferronnerie) (000SC022652) | cheval: 0.006
[Bobasheva et al. 2020]
47. Web open standards as enablers of
interoperable platforms e.g.
“Solid (…) is a proposed set of conventions and tools for building decentralized
Web applications based on Linked Data principles. (…)
It relies as much as possible on existing W3C standards and protocols. (…)
RDF 1.1 (…) The WebID 1.0 (…) The FOAF vocabulary (…)
WebID-TLS protocol (…) HTML5 (…) Linked Data Platform (LDP) standard”
https://github.com/solid/solid#standards-used
48. (8/8) Web open standards for…
distributed, interoperable Europe
“I’m right there in the room, but
no one even acknowledges me.”
50. W3C = strategic place to survey and shape Web standards
Personal opinion:
Important to have a neutral place to build open standards (1 member = 1 vote)
Important to have both public and private members at W3C
Important to have a large European participation in W3C
51. Web open standards & world-wide interoperability
are key enablers of EU digital sovereignty
Interoperability is strategic to federate actors/actions. (cf. members)
Web standards are transversal to domains/tasks/… (cf. applications examples)
Importance of knowledge graphs and danger of knowledge silos. (cf. data)
Establishing open standards between European actors (public and private) is a strategic stake in setting up European data spaces.
52. Web open standards & world-wide interoperability
are key enablers of EU digital sovereignty
Interoperability is strategic to federate actors/actions. (cf. members)
Web standards are transversal to domains/tasks/… (cf. applications examples)
Importance of knowledge graphs and danger of knowledge silos. (cf. data)
Establishing open standards between European actors (public and private) is a strategic stake in setting up European data spaces.
• Active participation in W3C is a key to building EU digital sovereignty.
53. WIMMICS
Web-Instrumented Man-Machine Interactions, Communities and Semantics
Fabien Gandon - @fabien_gandon - http://fabien.info
he who controls metadata, controls the web
and through the world-wide web many things in our world.
Site: http://wimmics.inria.fr
Overview: http://bit.ly/wimmics-slides
Technical details: http://bit.ly/wimmics-papers
Editor's Notes
Introduction to week 2/2
Presentation of the week's overall outline: list items appear progressively, then the portion covered by the video is highlighted
These codes are symptomatic of an evolution of a central component of the Web: the Web address.
We have moved from addresses essentially used to identify the pages and resources of the Web
to addresses that make it possible to identify, on the Web, everything that exists around us and to talk about it on the Web.
This evolution of the use of identifiers on the Web, together with the ability to change the languages used to exchange representations, opens a new perspective where we can use the Web to identify and exchange any kind of data about everything around us.