SlideShare a Scribd company logo
1 of 49
Download to read offline
Semi-Automatic Example-Driven
Linked Data Mapping Creation
Pieter Heyvaert
pheyvaer.heyvaert@ugent.be
1
Semantic Web technologies rely on Linked Data,
but not all data is accessible as Linked Data.
databases
XML files
Solutions to provide access exist,
but results are not always as desired because limited knowledge is used:
data schema
ontology
2
Goal: improve access to data as Linked Data
3
Overview
problem
current solutions
approach
discussion
4
Overview
problem
current solutions
approach
discussion
5
Input data
id title author
0 Harry Potter and The Sorcerer’s Stone J.K. Rowling
1 Homo Deus Yuval Noah Harari
{
"authors": [{
"id": "jkr",
"name": "J.K. Rowling",
"country": "UK"
"birthdate": "1965-07-31"
},{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}]
}
6
Desired Linked Data
book:0 a schema:Book;
schema:title "Harry Potter and The Sorcerer’s Stone"@en;
schema:author author:jkr.
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:jkr a foaf:Person;
foaf:name "J.K. Rowling";
foaf:country "UK";
schema:birthdate "1965-07-21"^^xsd:date.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "UK";
schema:birthdate "1976-04-24"^^xsd:date.
7
Apply rules to generate Linked Data
original
data
Linked
Data
rules
rules state how to generate RDF terms and triples using data and ontologies
8
Linked Data example available
9
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
Use example to create rules
sample example
rules
10
original
data
Linked
Data
Linked Data example aligns
with sample of original data
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
11
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
Alignment with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
id
12
rule: IRI is “book” + value from column “id”
Alignment with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
title
13
rule: literal uses value from column “title”
Alignment with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
titleproperty
14
rule: predicate is schema:title
Alignment with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
type
15
rule: type of a book is schema:Book
Alignment with original data and create rules
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
id title author
1 Homo Deus Yuval Noah Harari
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
other entity
16
rule: a book is related to its author
All rules
IRI is “book” + value from column “id”
Literal uses value from column “title”
Predicate is schema:title
Type of a book is schema:Book
A book is related to its author
17
Apply rules to generate all Linked Data
sample example
rules
18
original
data
Linked
Data
Linked Data might not be as desired
Rules are prone to errors when created manually
Wrong use of ontology classes, properties, and datatypes
Wrong alignments with original data
Especially when dealing
with large and complex data sources
multiple data sources at the same time
19
Overview
problem
current solutions
approach
discussion
20
Overview
problem
current solutions
approach
discussion
21
Solutions to reduce manual effort
when creating rules
Semi-automatic: users provide feedback
Automatic: no user interaction required
22
Current solutions use limited knowledge
Only work with
data schemas
data values
ontologies
Do not consider knowledge embedded in
query workload of Linked Data
Linked Data examples
23
Overview
problem
current solutions
approach
discussion
24
Overview
problem
current solutions
approach
discussion
25
We propose a semi-automatic
example-driven approach to create rules.
26
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
27
book:1 a schema:Book;
schema:title "Homo Deus"@en;
schema:author author:ynh.
Visualize Linked Data example
author:ynh a foaf:Person;
foaf:name "Yuval Noah Harari";
foaf:country "Israel";
schema:birthdate "1976-04-24"^^xsd:date.
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
28
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
29
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
30
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
Align with data sources
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
id title author
1 Homo Deus Yuval Noah Harari
CSV
JSON
31
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
Align with data sources
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
id title author
1 Homo Deus Yuval Noah Harari
CSV
CSV
JSON
32
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
Align with data sources
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
id title author
1 Homo Deus Yuval Noah Harari
CSV
CSV
CSV
JSON
33
Align with data sources
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
JSONCSV
CSV CSV JSON JSON JSON
{
"id": "ynh",
"name": "Yuval Noah Harari",
"country": "Israel",
"birthdate": "1976-04-24"
}
id title author
1 Homo Deus Yuval Noah Harari
CSV
JSON
34
Select best data source
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
JSONCSV
CSV CSV JSON JSON JSON
for each subgraph with an entity
35
Select best data source
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
JSONCSV
CSV CSV JSON JSON JSON
only CSV data source
CSV
36
Select best data source
schema:Book foaf:Person
Homo Deus
@en
Yuval Noah Harari Israel 1976-04-24
xsd:date
book:1 author:ynh
schema:author
schema:title
foaf:name schema:birthdate
foaf:country
JSONCSV
CSV CSV JSON JSON JSON
CSV data source match with 1 node
JSON data source match with all nodes
JSON
37
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
38
Rule creation
For every entity
For every attribute
For all interlinked entities
39
Create rules for entity
schema:Book
book:1
CSV
IRI is “book” + id
type is schema:Book
40
Create rules for attribute
schema:Book
book:1
CSVuse predicate schema:title
literal uses value from column title
language of the title is English
Homo Deus
@en
schema:title
CSV
41
Create rules for interlinked entities
schema:Book
book:1
CSV
use predicate schema:author
join condition: names match
foaf:Person
author:ynh
schema:author
JSON
42
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
43
Optional adjustments
Fix errors if needed
Add additional rules to deal with exceptions
44
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
45
Apply rules to generate all Linked Data
sample example
46
original
data
Linked
Data
rules
Overview
problem
current solutions
approach
data source alignment
rule creation
optional adjustments
Linked Data generation
discussion
47
Discussion
Advantages
Use knowledge embedded in Linked Data examples
Minimize errors and user interaction
Approach can be combined with other approaches
Disadvantages
Linked Data example is required
User action might still be required for special cases
48
Recap
Use cases can have Linked Data example available.
Example contains knowledge to create rules.
We introduced approach that uses this knowledge.
This approach can be combined with other approaches.
49

More Related Content

What's hot

Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparqlDhavalkumar Thakker
 
Semantic Web and Schema.org
Semantic Web and Schema.orgSemantic Web and Schema.org
Semantic Web and Schema.orgrvguha
 
LD4L OCLC Data Strategy
LD4L OCLC Data StrategyLD4L OCLC Data Strategy
LD4L OCLC Data StrategyRichard Wallis
 
WorldCat, Works, and Schema.org
WorldCat, Works, and Schema.orgWorldCat, Works, and Schema.org
WorldCat, Works, and Schema.orgRichard Wallis
 
Creating Web APIs with JSON-LD and RDF
Creating Web APIs with JSON-LD and RDFCreating Web APIs with JSON-LD and RDF
Creating Web APIs with JSON-LD and RDFdonaldlsmithjr
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library DataRichard Wallis
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our OpportunityRichard Wallis
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesRichard Wallis
 
Challenges and applications of RDF shapes
Challenges and applications of RDF shapesChallenges and applications of RDF shapes
Challenges and applications of RDF shapesJose Emilio Labra Gayo
 
Entification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataEntification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataRichard Wallis
 
Graph and RDF databases
Graph and RDF databasesGraph and RDF databases
Graph and RDF databasesNassim Bahri
 
Microdata for Dummies
Microdata for DummiesMicrodata for Dummies
Microdata for Dummiesgiurca
 
Linked data for Ebook discovery
Linked data for Ebook discoveryLinked data for Ebook discovery
Linked data for Ebook discoveryRichard Wallis
 
SPARQL in the Semantic Web
SPARQL in the Semantic WebSPARQL in the Semantic Web
SPARQL in the Semantic WebJan Beeck
 
Schema Design
Schema DesignSchema Design
Schema DesignMongoDB
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationRichard Wallis
 

What's hot (19)

Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparql
 
Semantic Web and Schema.org
Semantic Web and Schema.orgSemantic Web and Schema.org
Semantic Web and Schema.org
 
LD4L OCLC Data Strategy
LD4L OCLC Data StrategyLD4L OCLC Data Strategy
LD4L OCLC Data Strategy
 
WorldCat, Works, and Schema.org
WorldCat, Works, and Schema.orgWorldCat, Works, and Schema.org
WorldCat, Works, and Schema.org
 
Presentation shexer
Presentation shexerPresentation shexer
Presentation shexer
 
Creating Web APIs with JSON-LD and RDF
Creating Web APIs with JSON-LD and RDFCreating Web APIs with JSON-LD and RDF
Creating Web APIs with JSON-LD and RDF
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library Data
 
Linked Data and OCLC
Linked Data and OCLCLinked Data and OCLC
Linked Data and OCLC
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
Challenges and applications of RDF shapes
Challenges and applications of RDF shapesChallenges and applications of RDF shapes
Challenges and applications of RDF shapes
 
RDF Data Model
RDF Data ModelRDF Data Model
RDF Data Model
 
Entification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataEntification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library Data
 
Graph and RDF databases
Graph and RDF databasesGraph and RDF databases
Graph and RDF databases
 
Microdata for Dummies
Microdata for DummiesMicrodata for Dummies
Microdata for Dummies
 
Linked data for Ebook discovery
Linked data for Ebook discoveryLinked data for Ebook discovery
Linked data for Ebook discovery
 
SPARQL in the Semantic Web
SPARQL in the Semantic WebSPARQL in the Semantic Web
SPARQL in the Semantic Web
 
Schema Design
Schema DesignSchema Design
Schema Design
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data Foundation
 

More from Pieter Heyvaert

Ontology-Based Data Access Mapping Generation using Data, Schema, Query, and ...
Ontology-Based Data Access Mapping Generation using Data, Schema, Query, and ...Ontology-Based Data Access Mapping Generation using Data, Schema, Query, and ...
Ontology-Based Data Access Mapping Generation using Data, Schema, Query, and ...Pieter Heyvaert
 
Towards a Uniform User Interface for Editing Mapping Definitions
Towards a Uniform User Interface for Editing Mapping DefinitionsTowards a Uniform User Interface for Editing Mapping Definitions
Towards a Uniform User Interface for Editing Mapping DefinitionsPieter Heyvaert
 
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...Pieter Heyvaert
 
RMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
RMLEditor: A Graph-based Mapping Editor for Linked Data MappingsRMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
RMLEditor: A Graph-based Mapping Editor for Linked Data MappingsPieter Heyvaert
 
Graph-Based Editing of Linked Data Mappings using the RMLEditor | ESWC2016 De...
Graph-Based Editing of Linked Data Mappings using the RMLEditor | ESWC2016 De...Graph-Based Editing of Linked Data Mappings using the RMLEditor | ESWC2016 De...
Graph-Based Editing of Linked Data Mappings using the RMLEditor | ESWC2016 De...Pieter Heyvaert
 
FREME (EU Project Networking Session ESWC 2015)
FREME (EU Project Networking Session ESWC 2015)FREME (EU Project Networking Session ESWC 2015)
FREME (EU Project Networking Session ESWC 2015)Pieter Heyvaert
 
Buliding a DCAT Merger (SemDev 2015)
Buliding a DCAT Merger (SemDev 2015)Buliding a DCAT Merger (SemDev 2015)
Buliding a DCAT Merger (SemDev 2015)Pieter Heyvaert
 

More from Pieter Heyvaert (7)

Ontology-Based Data Access Mapping Generation using Data, Schema, Query, and ...
Ontology-Based Data Access Mapping Generation using Data, Schema, Query, and ...Ontology-Based Data Access Mapping Generation using Data, Schema, Query, and ...
Ontology-Based Data Access Mapping Generation using Data, Schema, Query, and ...
 
Towards a Uniform User Interface for Editing Mapping Definitions
Towards a Uniform User Interface for Editing Mapping DefinitionsTowards a Uniform User Interface for Editing Mapping Definitions
Towards a Uniform User Interface for Editing Mapping Definitions
 
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
 
RMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
RMLEditor: A Graph-based Mapping Editor for Linked Data MappingsRMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
RMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
 
Graph-Based Editing of Linked Data Mappings using the RMLEditor | ESWC2016 De...
Graph-Based Editing of Linked Data Mappings using the RMLEditor | ESWC2016 De...Graph-Based Editing of Linked Data Mappings using the RMLEditor | ESWC2016 De...
Graph-Based Editing of Linked Data Mappings using the RMLEditor | ESWC2016 De...
 
FREME (EU Project Networking Session ESWC 2015)
FREME (EU Project Networking Session ESWC 2015)FREME (EU Project Networking Session ESWC 2015)
FREME (EU Project Networking Session ESWC 2015)
 
Buliding a DCAT Merger (SemDev 2015)
Buliding a DCAT Merger (SemDev 2015)Buliding a DCAT Merger (SemDev 2015)
Buliding a DCAT Merger (SemDev 2015)
 

Recently uploaded

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 

Recently uploaded (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 

Semi-Automatic Example-Driven Linked Data Mapping Creation

  • 1. Semi-Automatic Example-Driven Linked Data Mapping Creation Pieter Heyvaert pheyvaer.heyvaert@ugent.be 1
  • 2. Semantic Web technologies rely on Linked Data, but not all data is accessible as Linked Data. databases XML files Solutions to provide access exist, but results are not always as desired because limited knowledge is used: data schema ontology 2
  • 3. Goal: improve access to data as Linked Data 3
  • 6. Input data id title author 0 Harry Potter and The Sorcerer’s Stone J.K. Rowling 1 Homo Deus Yuval Noah Harari { "authors": [{ "id": "jkr", "name": "J.K. Rowling", "country": "UK" "birthdate": "1965-07-31" },{ "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" }] } 6
  • 7. Desired Linked Data book:0 a schema:Book; schema:title "Harry Potter and The Sorcerer’s Stone"@en; schema:author author:jkr. book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:jkr a foaf:Person; foaf:name "J.K. Rowling"; foaf:country "UK"; schema:birthdate "1965-07-21"^^xsd:date. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "UK"; schema:birthdate "1976-04-24"^^xsd:date. 7
  • 8. Apply rules to generate Linked Data original data Linked Data rules rules state how to generate RDF terms and triples using data and ontologies 8
  • 9. Linked Data example available 9 book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date.
  • 10. Use example to create rules sample example rules 10 original data Linked Data
  • 11. Linked Data example aligns with sample of original data id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } 11 book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date.
  • 12. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } id 12 rule: IRI is “book” + value from column “id”
  • 13. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } title 13 rule: literal uses value from column “title”
  • 14. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } titleproperty 14 rule: predicate is schema:title
  • 15. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } type 15 rule: type of a book is schema:Book
  • 16. Alignment with original data and create rules book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. id title author 1 Homo Deus Yuval Noah Harari { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } other entity 16 rule: a book is related to its author
  • 17. All rules IRI is “book” + value from column “id” Literal uses value from column “title” Predicate is schema:title Type of a book is schema:Book A book is related to its author 17
  • 18. Apply rules to generate all Linked Data sample example rules 18 original data Linked Data
  • 19. Linked Data might not be as desired Rules are prone to errors when created manually Wrong use of ontology classes, properties, and datatypes Wrong alignments with original data Especially when dealing with large and complex data sources multiple data sources at the same time 19
  • 22. Solutions to reduce manual effort when creating rules Semi-automatic: users provide feedback Automatic: no user interaction required 22
  • 23. Current solutions use limited knowledge Only work with data schemas data values ontologies Do not consider knowledge embedded in query workload of Linked Data Linked Data examples 23
  • 26. We propose a semi-automatic example-driven approach to create rules. 26
  • 27. Overview problem current solutions approach data source alignment rule creation optional adjustments Linked Data generation discussion 27
  • 28. book:1 a schema:Book; schema:title "Homo Deus"@en; schema:author author:ynh. Visualize Linked Data example author:ynh a foaf:Person; foaf:name "Yuval Noah Harari"; foaf:country "Israel"; schema:birthdate "1976-04-24"^^xsd:date. schema:Book foaf:Person Homo Deus @en Yuval Noah Harari Israel 1976-04-24 xsd:date book:1 author:ynh schema:author schema:title foaf:name schema:birthdate foaf:country 28
  • 29. Overview problem current solutions approach data source alignment rule creation optional adjustments Linked Data generation discussion 29
  • 30. Overview problem current solutions approach data source alignment rule creation optional adjustments Linked Data generation discussion 30
  • 31. { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } Align with data sources schema:Book foaf:Person Homo Deus @en Yuval Noah Harari Israel 1976-04-24 xsd:date book:1 author:ynh schema:author schema:title foaf:name schema:birthdate foaf:country id title author 1 Homo Deus Yuval Noah Harari CSV JSON 31
  • 32. { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } Align with data sources schema:Book foaf:Person Homo Deus @en Yuval Noah Harari Israel 1976-04-24 xsd:date book:1 author:ynh schema:author schema:title foaf:name schema:birthdate foaf:country id title author 1 Homo Deus Yuval Noah Harari CSV CSV JSON 32
  • 33. { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } Align with data sources schema:Book foaf:Person Homo Deus @en Yuval Noah Harari Israel 1976-04-24 xsd:date book:1 author:ynh schema:author schema:title foaf:name schema:birthdate foaf:country id title author 1 Homo Deus Yuval Noah Harari CSV CSV CSV JSON 33
  • 34. Align with data sources schema:Book foaf:Person Homo Deus @en Yuval Noah Harari Israel 1976-04-24 xsd:date book:1 author:ynh schema:author schema:title foaf:name schema:birthdate foaf:country JSONCSV CSV CSV JSON JSON JSON { "id": "ynh", "name": "Yuval Noah Harari", "country": "Israel", "birthdate": "1976-04-24" } id title author 1 Homo Deus Yuval Noah Harari CSV JSON 34
  • 35. Select best data source schema:Book foaf:Person Homo Deus @en Yuval Noah Harari Israel 1976-04-24 xsd:date book:1 author:ynh schema:author schema:title foaf:name schema:birthdate foaf:country JSONCSV CSV CSV JSON JSON JSON for each subgraph with an entity 35
  • 36. Select best data source schema:Book foaf:Person Homo Deus @en Yuval Noah Harari Israel 1976-04-24 xsd:date book:1 author:ynh schema:author schema:title foaf:name schema:birthdate foaf:country JSONCSV CSV CSV JSON JSON JSON only CSV data source CSV 36
  • 37. Select best data source schema:Book foaf:Person Homo Deus @en Yuval Noah Harari Israel 1976-04-24 xsd:date book:1 author:ynh schema:author schema:title foaf:name schema:birthdate foaf:country JSONCSV CSV CSV JSON JSON JSON CSV data source match with 1 node JSON data source match with all nodes JSON 37
  • 38. Overview problem current solutions approach data source alignment rule creation optional adjustments Linked Data generation discussion 38
  • 39. Rule creation For every entity For every attribute For all interlinked entities 39
  • 40. Create rules for entity schema:Book book:1 CSV IRI is “book” + id type is schema:Book 40
  • 41. Create rules for attribute schema:Book book:1 CSVuse predicate schema:title literal uses value from column title language of the title is English Homo Deus @en schema:title CSV 41
  • 42. Create rules for interlinked entities schema:Book book:1 CSV use predicate schema:author join condition: names match foaf:Person author:ynh schema:author JSON 42
  • 43. Overview problem current solutions approach data source alignment rule creation optional adjustments Linked Data generation discussion 43
  • 44. Optional adjustments Fix errors if needed Add additional rules to deal with exceptions 44
  • 45. Overview problem current solutions approach data source alignment rule creation optional adjustments Linked Data generation discussion 45
  • 46. Apply rules to generate all Linked Data sample example 46 original data Linked Data rules
  • 47. Overview problem current solutions approach data source alignment rule creation optional adjustments Linked Data generation discussion 47
  • 48. Discussion Advantages Use knowledge embedded in Linked Data examples Minimize errors and user interaction Approach can be combined with other approaches Disadvantages Linked Data example is required User action might still be required for special cases 48
  • 49. Recap Use cases can have Linked Data example available. Example contains knowledge to create rules. We introduced approach that uses this knowledge. This approach can be combined with other approaches. 49