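Figure (running example; parts numbered 1 to 4 in the figure)

Source data:
  id   date            …
  1    April 1st '19   …
  …    …               …

Mapping rule: ex:{id} dcterms:date {*date*}^^xsd:date

Constraint: ?s dcterms:date ?date
            ?date :type xsd:date

Figure labels: Data transformation, Schema transformation, Validation + entailment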
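As a rough illustration of the running example, the sketch below applies the mapping rule to the single source row: a data transformation normalizes the date string, and a schema transformation fills the triple template. The function names `to_xsd_date` and `generate_triple` are hypothetical and are not part of FnO or RML; this is a plain-Python approximation under those assumptions, not the tooling behind the poster.

```python
import re
from datetime import datetime


def to_xsd_date(raw: str) -> str:
    """Data transformation (hypothetical helper): "April 1st '19" -> "2019-04-01"."""
    # Drop ordinal suffixes that directly follow a day number: "1st" -> "1".
    cleaned = re.sub(r"(\d+)(st|nd|rd|th)", r"\1", raw)
    # Drop the apostrophe (straight or curly) in front of the two-digit year.
    cleaned = re.sub(r"[‘’']", "", cleaned)
    # Assumes English month names, i.e. the default C locale.
    return datetime.strptime(cleaned, "%B %d %y").date().isoformat()


def generate_triple(row: dict) -> str:
    """Schema transformation (hypothetical helper): fill ex:{id} dcterms:date {date}^^xsd:date."""
    return f'ex:{row["id"]} dcterms:date "{to_xsd_date(row["date"])}"^^xsd:date .'


row = {"id": "1", "date": "April 1st ‘19"}
print(generate_triple(row))
```

Keeping the two steps in separate functions mirrors the split between data transformation and schema transformation in the figure: the date parser can be reused by other rules without touching the template.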
High Quality Schema and Data Transformations for Linked Data Generation
Ben De Meester ben.de-meester.org/#me
ben.demeester@ugent.be
@Ben__DM
Validating generated Linked Data is computationally intensive and makes it hard to determine root error causes.
Reusable data transformation descriptions (fno.io): for 90% of common data transformations.
Complete and reusable generation processes (rml.io): can generate the same DBpedia dataset as the original process.
Validation approach handling custom entailment regimes (git.io/fxJ2j): faster than RDFUnit and SHACL for the same constraint types.
Automatic validation of the declarative generation process: more efficient, and finds root error causes up front for 90% of SHACL unit tests.
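To make the last point more concrete, the sketch below contrasts validating the generated triples with validating the mapping rule itself. Everything in it is an assumption made for illustration: `Rule`, `CONSTRAINTS`, `validate_data`, and `validate_rules` are hypothetical names, and the single datatype constraint stands in for a much richer constraint language. It is not the validator linked above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical stand-in for one declarative mapping rule."""
    name: str
    predicate: str
    datatype: str


# Hypothetical constraint set: predicate -> required literal datatype.
CONSTRAINTS = {"dcterms:date": "xsd:date"}


def generate(rule: Rule, rows: list) -> list:
    """Produce one (subject, predicate, value, datatype) tuple per source row."""
    return [(f"ex:{r['id']}", rule.predicate, r["date"], rule.datatype) for r in rows]


def validate_data(triples: list) -> list:
    """Check every generated triple; cost grows with the size of the dataset."""
    return [t for t in triples if CONSTRAINTS.get(t[1], t[3]) != t[3]]


def validate_rules(rules: list) -> list:
    """Check each rule once; a violation names the rule that causes it."""
    return [r.name for r in rules if CONSTRAINTS.get(r.predicate, r.datatype) != r.datatype]


rows = [{"id": i, "date": "2019-04-01"} for i in range(1, 4)]
faulty = Rule(name="date_rule", predicate="dcterms:date", datatype="xsd:string")

print(len(validate_data(generate(faulty, rows))))  # one violation per generated triple
print(validate_rules([faulty]))                    # one violation, pointing at the rule
```

In this toy setting the data-level check reports the same mistake once per row, while the rule-level check reports it once and identifies the rule responsible, which is the intuition behind finding root error causes up front.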
https://ben.de-meester.org/presentations/2018/consortium/#

ISWC2018 PhD Consortium: High Quality Schema and Data Transformations for Linked Data Generation
