SlideShare a Scribd company logo
Ilaria Tiddi, Mathieu d’Aquin, Enrico Motta
Learning to Assess
Linked Data Relationships
Using Genetic Programming
@IlaTiddi
20.10.2016
15th International Semantic Web Conference (ISWC 2016)
Research Problem
Automatically discover what makes a strong relationship
between two entities in (the Web of) Linked Data.
• relationship : a semantic path between two entities
ASongOfIceAnd
Fire(novel)
GoTASongOfIce
AndFire(topic)
dc:subject dc:subject
Research Problem
Automatically discover what makes a strong relationship
between two entities in (the Web of) Linked Data.
• relationship : a semantic path between two entities
• automatically : through graph search techniques
ASongOfIceAnd
Fire(novel)
UnitedStates
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author
:born
:airedIn
dc:subjectdc:subject
Fantasy
dc:subject dc:subject
Research Problem
Problem
• Entities/properties in a path might come from a number
of different, unknown data sources
Solution (the easy one)
• indexing & preprocessing of a portion of Linked Data
• a priori knowledge, computational resources
ASongOfIceAnd
Fire(novel)
UnitedStates
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author
:born
:airedIn
dc:subjectdc:subject
Fantasy
dc:subject dc:subject
Research Problem
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
ASongOfIceAnd
Fire(novel)
GoT
Research Problem
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
ASongOfIceAnd
Fire(novel)
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author
dc:subject
Fantasy
dc:subject
Research Problem
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
ASongOfIceAnd
Fire(novel)
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author
dc:subject
Fantasy
dc:subject
Research Problem
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
ASongOfIceAnd
Fire(novel)
GoTASongOfIce
AndFire(topic)
dc:subject
Fantasy
dc:subject
UnitedStates:bornGeorgeRRMartin
:author
Research Problem
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
ASongOfIceAnd
Fire(novel)
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author
dc:subject
Fantasy
dc:subject
UnitedStates:born
Research Problem
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
ASongOfIceAnd
Fire(novel)
UnitedStates
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author
dc:subjectdc:subject
Fantasy
dc:subject
:born
Research Problem
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
ASongOfIceAnd
Fire(novel)
UnitedStates
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author :airedIn
dc:subjectdc:subject
Fantasy
dc:subject dc:subject
:born
Research Problem
ASongOfIceAnd
Fire(novel)
UnitedStates
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author :airedIn
dc:subjectdc:subject
Fantasy
dc:subject dc:subject
Solution
• Find paths between entities through Link Traversal
• Incremental and agnostic graph exploration
• Perform uninformed (or blind) search over Linked Data
:born
Research Hypothesis
Problem
Uninformed searches require a cost-function to explore the
graph following the most promising paths
Hypo
Linked Data information can drive a cost-function that
detects strong relationships between entities
ASongOfIceAnd
Fire(novel)
UnitedStates
GoT
GeorgeRRMartin
ASongOfIce
AndFire(topic)
:author :airedIn
dc:subjectdc:subject
Fantasy
dc:subject dc:subject
:born
Research Questions
What makes a path strong?
• Which topological or semantic features of nodes/edges?
✗ e.g. length of a path?
 entities of different datasets are connected by many paths
of similar length
How can we use Linked Data to assess strong relationships?
• Which information do we need?
• Can we use structural features of the graph?
Challenges
• find topological/semantic features to detect strong relationships
• combine these features in a cost-function
• perform an effective blind search
Proposed Approach
• A set of topological/semantic characteristics of
the Linked Data graph
• a benchmark of human-evaluated relationship
paths
Identify the cost-function for a blind search that
best performs in ranking sets of alternative
relationship paths
Automatically learn a cost-function to detect strong
relationships between Linked Data entities using a
supervised method (Genetic Programming)
Proposed Approach
Genetic Programming: why?
• Flexible learning process
• Suitable for wide search spaces (such as Linked Data)
• Results assessed with a fitness (scores vs. functions)
• Human-understandable results
• Easy to integrate in a graph search
Automatically learn a cost-function to detect strong
relationships between Linked Data entities using a
supervised method (Genetic Programming)
VS
Genetic Programming
Programs (solutions for a problem)
• trees of primitives
• functions : internal nodes (mathematical or logical
operations)
• terminals : leaf nodes (constants or variables)
Fitness function (evaluation)
• how well the program solves the problem
Genetic operations (evolution)
• reproduction
• crossover from two parents
• mutation from one parent
Termination condition
• maximum number of evolutions
• a desired fitness
Genetic Programming
Procedure
• Create random population of programs based on the primitives
• Evolve population until an ideal situation is met
✗✗
✗
✔✔✗✗ ✔
canned spaghetti meatballs spaghetti tomato sauced penne tomato sauced spaghetti
Genetic Programming
Given
• a starting population of randomly generated cost-functions
• sets of alternative paths between two Linked Data entities,
ranked by humans
Determine how good each cost-function is in ranking paths
compared to the human evaluators
✗✗
✗
✔✔✗✗ ✔
canned spaghetti meatballs spaghetti tomato sauced penne tomato sauced spaghetti
Genetic Programming
Primitives
Constant terminals
• Z= {0, 1000}
Aggregated terminals
• Topological edge weighs
indegree, outdegree, constant weight
• Semantic edge weighs
usage of namespaces, taxonomies, vocabularies
• Aggregators along the path
sum, avg, min, max
Functions (combining different information)
• Math operations
addition, multiplication, division, log
Genetic Programming
Fitness
Normalised Discounted Cumulative Gain (nDCG)
• (IR) quality of rankings provided by search engines based on
the graded relevance of the returned documents
• how good is a program in ranking paths based on human ranks
• avg(nDCG) across the dataset
• length penalty
Genetic operations
• Reproduction
• Crossover
• Mutation
Learning
• Training set + test set
• Keep fittest program for each runs on training set
• Test them (discard inconsistent)
Experiments
Dataset
Entities (random types from different sources)
• 12,630 events from Yago
• 8,185 people from the VIAF dataset
• 999 movies from the LMDB
• 1,174 countries/capitals from Geonames/ the UNESCO dataset
Paths (a set of possible paths between them)
• select a random pair
• bidirectional breadth-first search
Assessment
• 100 pairs (~10 possible paths per pair)
• 8 judges
• from (2) highly relevant to (0) not relevant
db:Dina-
Korzun
viaf:Dina-
Korzun
gn:Europe
gn:United-
Kingdom
lmdb:The
SkinGame
owl:sameAsdbo:citizenship
gno:parent
Feature
foaf:based
_near
Experiments
Results
Different runs (fitness on training set/test set)
(T) Topological primitives only
(S) Topological + semantic primitives
(N) Topological + namespaces primitives
Runs Best program Fitness TR Fitness TS
T1 log(log(min.cd × min.cd))/max.cd 0.79 0.79
T2 log(min.cd)/(avg.cd + 87) 0.77 0.78
T3 min.cd × (min.cd/max.cd) 0.78 0.72
N1 (log((max.ns/max.cd))/avg.ns) + min.ns 0.82 0.81
N2 (min.dg/sum.cd)/sum.ou) + min.ns 0.79 0.77
N3 min.ns/(log(max.cd)/avg.ns) 0.83 0.75
S1 min.ns + (sum.ns/log(log(sum.si))) 0.88 0.83
S2 min.ns + (min.cd/log(log(sum.si))) 0.88 0.86
S3 min.ns + (log(max.in)/log(log(sum.si))) 0.87 0.86
Experiments
Results
Lower performance for T-runs and N-runs
Recurrent terminals
• conditional degree (node degree depending on the RDF triple)
• namespace variety
• number of topic properties (dc:subject/skos:broader/foaf:primaryTopic)
Runs Best program Fitness TR Fitness TS
T1 log(log(min.cd × min.cd))/max.cd 0.79 0.79
T2 log(min.cd)/(avg.cd + 87) 0.77 0.78
T3 min.cd × (min.cd/max.cd) 0.78 0.72
N1 (log((max.ns/max.cd))/avg.ns) + min.ns 0.82 0.81
N2 (min.dg/sum.cd)/sum.ou) + min.ns 0.79 0.77
N3 min.ns/(log(max.cd)/avg.ns) 0.83 0.75
S1 min.ns + (sum.ns/log(log(sum.si))) 0.88 0.83
S2 min.ns + (min.cd/log(log(sum.si))) 0.88 0.86
S3 min.ns + (log(max.in)/log(log(sum.si))) 0.87 0.86
Experiments
Comparative evaluation
Best programs
• automatically learnt
vs. literature functions
• RECAP,RelFinder,Everything Is Connected Engine, Moore et al.
• ad-hoc / handcrafted information theoretical measures
Experiments
Which cost-function?
Interpretation
• pass through nodes with rich node descriptions
higher min_namespaces = higher path score
• not high level entities / few topic categories
few incoming topic categories = higher path score
• more specific entities (not hubs) for path with few topic categories
ratio conditional_degree / inTopicCategories
 specific paths are privileged over general paths
min_namespaces+
min_conditionalDegree
log(log(sum_inTopicCategories))
Conclusions
Contributions
A measure to detect strong relationships in Linked Data
 can be integrated in uninformed searches over Linked Data
vs. indexing/pre-processing techniques
 derived empirically through Genetic Programming
vs. domain-specific / handcrafted measures
 what is important in Linked Data
topological features + little knowledge about the edge vocabulary
Future work
• Integrate the measure in the blind-search process
• Explore more characteristics
• Improve the measure
THANK YOU VERY MUCH
(AND DO NOT MESS UP WITH ITALIAN FOOD)
Questions?
IlaTiddi ilaria.tiddi@open.ac.uk

More Related Content

What's hot

The Network Data Structure in Computing
The Network Data Structure in ComputingThe Network Data Structure in Computing
The Network Data Structure in Computing
Marko Rodriguez
 
Named Entity Recognition from Online News
Named Entity Recognition from Online NewsNamed Entity Recognition from Online News
Named Entity Recognition from Online News
Bernardo Najlis
 
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Jimmy Lai
 
DB-IR-ranking
DB-IR-rankingDB-IR-ranking
DB-IR-ranking
FELIX75
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
Enrico Palumbo
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
andimou
 
Entity2rec recsys
Entity2rec recsysEntity2rec recsys
Entity2rec recsys
Enrico Palumbo
 
Assessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset QualityAssessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset Quality
andimou
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
Tony Fast
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Anubhav Jain
 
DB and IR Integration
DB and IR IntegrationDB and IR Integration
DB and IR Integration
Marco A Torres
 
ML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive AnalyticsML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive Analytics
Erik Bernhardsson
 
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Sease
 
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Agnieszka Ławrynowicz
 
Modular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyModular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxology
Frank van Harmelen
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
Chris Johnson
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
PyData
 
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous DataA Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
andimou
 
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph MiningDetection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Mikel Emaldi Manrique
 

What's hot (19)

The Network Data Structure in Computing
The Network Data Structure in ComputingThe Network Data Structure in Computing
The Network Data Structure in Computing
 
Named Entity Recognition from Online News
Named Entity Recognition from Online NewsNamed Entity Recognition from Online News
Named Entity Recognition from Online News
 
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
Text Classification in Python – using Pandas, scikit-learn, IPython Notebook ...
 
DB-IR-ranking
DB-IR-rankingDB-IR-ranking
DB-IR-ranking
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
 
Entity2rec recsys
Entity2rec recsysEntity2rec recsys
Entity2rec recsys
 
Assessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset QualityAssessing and Refining Mappings to RDF to Improve Dataset Quality
Assessing and Refining Mappings to RDF to Improve Dataset Quality
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 
DB and IR Integration
DB and IR IntegrationDB and IR Integration
DB and IR Integration
 
ML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive AnalyticsML+Hadoop at NYC Predictive Analytics
ML+Hadoop at NYC Predictive Analytics
 
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
Improving Top-K Retrieval Algorithms Using Dynamic Programming and Longer Ski...
 
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
Using Substitutive Itemset Mining Framework for Finding Synonymous Properties...
 
Modular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyModular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxology
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
 
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous DataA Generic Language for Integrated RDF Mappings of Heterogeneous Data
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
 
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph MiningDetection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
 

Viewers also liked

Publishing and Using Linked Data
Publishing and Using Linked DataPublishing and Using Linked Data
Publishing and Using Linked Data
ostephens
 
Linked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable developmentLinked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable development
Martin Kaltenböck
 
An Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsAn Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF Graphs
Nikolaos Konstantinou
 
Technical Background
Technical BackgroundTechnical Background
Technical Background
Nikolaos Konstantinou
 
Conclusions: Summary and Outlook
Conclusions: Summary and OutlookConclusions: Summary and Outlook
Conclusions: Summary and Outlook
Nikolaos Konstantinou
 
Materializing the Web of Linked Data
Materializing the Web of Linked DataMaterializing the Web of Linked Data
Materializing the Web of Linked Data
Nikolaos Konstantinou
 
Incremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsIncremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF Graphs
Nikolaos Konstantinou
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
Nikolaos Konstantinou
 
Deploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software ToolsDeploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software Tools
Nikolaos Konstantinou
 
Introduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic WebIntroduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic Web
Nikolaos Konstantinou
 
Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]
Marcia Zeng
 
Publishing Linked Data from RDB
Publishing Linked Data from RDBPublishing Linked Data from RDB
Publishing Linked Data from RDB
Boris Villazón-Terrazas
 
Entity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationEntity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and Evaluation
Faegheh Hasibi
 
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
Ig Bittencourt
 
Linked Data tutorial at Semtech 2012
Linked Data tutorial at Semtech 2012Linked Data tutorial at Semtech 2012
Linked Data tutorial at Semtech 2012
Juan Sequeda
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5
Juan Sequeda
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011
Juan Sequeda
 
Méthodes et outils pour interrelier le web des données
Méthodes et outils pour interrelier le web des donnéesMéthodes et outils pour interrelier le web des données
Méthodes et outils pour interrelier le web des données
François Scharffe
 
RDF Tutorial - SPARQL 20091031
RDF Tutorial - SPARQL 20091031RDF Tutorial - SPARQL 20091031
RDF Tutorial - SPARQL 20091031
kwangsub kim
 
Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010
Juan Sequeda
 

Viewers also liked (20)

Publishing and Using Linked Data
Publishing and Using Linked DataPublishing and Using Linked Data
Publishing and Using Linked Data
 
Linked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable developmentLinked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable development
 
An Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsAn Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF Graphs
 
Technical Background
Technical BackgroundTechnical Background
Technical Background
 
Conclusions: Summary and Outlook
Conclusions: Summary and OutlookConclusions: Summary and Outlook
Conclusions: Summary and Outlook
 
Materializing the Web of Linked Data
Materializing the Web of Linked DataMaterializing the Web of Linked Data
Materializing the Web of Linked Data
 
Incremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsIncremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF Graphs
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
 
Deploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software ToolsDeploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software Tools
 
Introduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic WebIntroduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic Web
 
Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]
 
Publishing Linked Data from RDB
Publishing Linked Data from RDBPublishing Linked Data from RDB
Publishing Linked Data from RDB
 
Entity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and EvaluationEntity Linking in Queries: Tasks and Evaluation
Entity Linking in Queries: Tasks and Evaluation
 
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
 
Linked Data tutorial at Semtech 2012
Linked Data tutorial at Semtech 2012Linked Data tutorial at Semtech 2012
Linked Data tutorial at Semtech 2012
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011
 
Méthodes et outils pour interrelier le web des données
Méthodes et outils pour interrelier le web des donnéesMéthodes et outils pour interrelier le web des données
Méthodes et outils pour interrelier le web des données
 
RDF Tutorial - SPARQL 20091031
RDF Tutorial - SPARQL 20091031RDF Tutorial - SPARQL 20091031
RDF Tutorial - SPARQL 20091031
 
Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010Consuming Linked Data SemTech2010
Consuming Linked Data SemTech2010
 

Similar to Learning to assess Linked Data relationships using Genetic Programming

The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?
Frank van Harmelen
 
Neo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExpNeo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExp
Adrian Ziegler
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
butest
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
butest
 
Synthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modelingSynthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modeling
Graph-TA
 
Genetic Algorithms
Genetic AlgorithmsGenetic Algorithms
Genetic Algorithms
Karthik Sankar
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdf
HODIT12
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
MostafaHazemMostafaa
 
DeepLearning
DeepLearningDeepLearning
DeepLearning
ShahzadAsgharArain
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
butest
 
R, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science CompetitionsR, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science Competitions
Krishna Sankar
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphs
Stanka Dalekova
 
04-Data-Analysis-Overview.pptx
04-Data-Analysis-Overview.pptx04-Data-Analysis-Overview.pptx
04-Data-Analysis-Overview.pptx
Shree Shree
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics Environment
Ian Foster
 
Explainable AI
Explainable AIExplainable AI
Explainable AI
Arithmer Inc.
 
Multiscale Modeling
Multiscale ModelingMultiscale Modeling
Multiscale Modeling
Daniel S. Katz
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdf
hemangppatel
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Jonathan Stray
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scale
Andy Petrella
 
Geneticalgorithms 100403002207-phpapp02
Geneticalgorithms 100403002207-phpapp02Geneticalgorithms 100403002207-phpapp02
Geneticalgorithms 100403002207-phpapp02
Amna Saeed
 

Similar to Learning to assess Linked Data relationships using Genetic Programming (20)

The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?
 
Neo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExpNeo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExp
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
Synthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modelingSynthetic Data Generation using exponential random Graph modeling
Synthetic Data Generation using exponential random Graph modeling
 
Genetic Algorithms
Genetic AlgorithmsGenetic Algorithms
Genetic Algorithms
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdf
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
 
DeepLearning
DeepLearningDeepLearning
DeepLearning
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 
R, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science CompetitionsR, Data Wrangling & Kaggle Data Science Competitions
R, Data Wrangling & Kaggle Data Science Competitions
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphs
 
04-Data-Analysis-Overview.pptx
04-Data-Analysis-Overview.pptx04-Data-Analysis-Overview.pptx
04-Data-Analysis-Overview.pptx
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics Environment
 
Explainable AI
Explainable AIExplainable AI
Explainable AI
 
Multiscale Modeling
Multiscale ModelingMultiscale Modeling
Multiscale Modeling
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdf
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scale
 
Geneticalgorithms 100403002207-phpapp02
Geneticalgorithms 100403002207-phpapp02Geneticalgorithms 100403002207-phpapp02
Geneticalgorithms 100403002207-phpapp02
 

More from Vrije Universiteit Amsterdam

Building intelligent systems (that can explain)
Building intelligent systems (that can explain)Building intelligent systems (that can explain)
Building intelligent systems (that can explain)
Vrije Universiteit Amsterdam
 
Building intelligent systems (that can explain)
Building intelligent systems (that can explain)Building intelligent systems (that can explain)
Building intelligent systems (that can explain)
Vrije Universiteit Amsterdam
 
Building intelligent systems with FAIR data
Building intelligent systems with FAIR dataBuilding intelligent systems with FAIR data
Building intelligent systems with FAIR data
Vrije Universiteit Amsterdam
 
Building intelligent systems (that can explain)
Building intelligent systems (that can explain)Building intelligent systems (that can explain)
Building intelligent systems (that can explain)
Vrije Universiteit Amsterdam
 
An ontology-based approach to improve the accessibility of ROS-based robotic ...
An ontology-based approach to improve the accessibility of ROS-based robotic ...An ontology-based approach to improve the accessibility of ROS-based robotic ...
An ontology-based approach to improve the accessibility of ROS-based robotic ...
Vrije Universiteit Amsterdam
 
Answer Worskshop @ESWC2017 - Introduction
Answer Worskshop @ESWC2017 - IntroductionAnswer Worskshop @ESWC2017 - Introduction
Answer Worskshop @ESWC2017 - Introduction
Vrije Universiteit Amsterdam
 
Update of time-invalid information in knowledge bases through mobile agents
Update of time-invalid information in knowledge bases through mobile agentsUpdate of time-invalid information in knowledge bases through mobile agents
Update of time-invalid information in knowledge bases through mobile agents
Vrije Universiteit Amsterdam
 
An Ontology Design Pattern to Define Explanations
An Ontology Design Pattern to Define ExplanationsAn Ontology Design Pattern to Define Explanations
An Ontology Design Pattern to Define Explanations
Vrije Universiteit Amsterdam
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
Vrije Universiteit Amsterdam
 
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Vrije Universiteit Amsterdam
 
Quantifying the bias in data links
Quantifying the bias in data linksQuantifying the bias in data links
Quantifying the bias in data links
Vrije Universiteit Amsterdam
 
Using Neural Networks to aggregate Linked Data rules
Using Neural Networks to aggregate Linked Data rulesUsing Neural Networks to aggregate Linked Data rules
Using Neural Networks to aggregate Linked Data rules
Vrije Universiteit Amsterdam
 
Walking Linked Data: a graph traversal approach to explain clusters
Walking Linked Data: a graph traversal approach to explain clustersWalking Linked Data: a graph traversal approach to explain clusters
Walking Linked Data: a graph traversal approach to explain clusters
Vrije Universiteit Amsterdam
 
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked DataDedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Vrije Universiteit Amsterdam
 

More from Vrije Universiteit Amsterdam (14)

Building intelligent systems (that can explain)
Building intelligent systems (that can explain)Building intelligent systems (that can explain)
Building intelligent systems (that can explain)
 
Building intelligent systems (that can explain)
Building intelligent systems (that can explain)Building intelligent systems (that can explain)
Building intelligent systems (that can explain)
 
Building intelligent systems with FAIR data
Building intelligent systems with FAIR dataBuilding intelligent systems with FAIR data
Building intelligent systems with FAIR data
 
Building intelligent systems (that can explain)
Building intelligent systems (that can explain)Building intelligent systems (that can explain)
Building intelligent systems (that can explain)
 
An ontology-based approach to improve the accessibility of ROS-based robotic ...
An ontology-based approach to improve the accessibility of ROS-based robotic ...An ontology-based approach to improve the accessibility of ROS-based robotic ...
An ontology-based approach to improve the accessibility of ROS-based robotic ...
 
Answer Worskshop @ESWC2017 - Introduction
Answer Worskshop @ESWC2017 - IntroductionAnswer Worskshop @ESWC2017 - Introduction
Answer Worskshop @ESWC2017 - Introduction
 
Update of time-invalid information in knowledge bases through mobile agents
Update of time-invalid information in knowledge bases through mobile agentsUpdate of time-invalid information in knowledge bases through mobile agents
Update of time-invalid information in knowledge bases through mobile agents
 
An Ontology Design Pattern to Define Explanations
An Ontology Design Pattern to Define ExplanationsAn Ontology Design Pattern to Define Explanations
An Ontology Design Pattern to Define Explanations
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
 
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
 
Quantifying the bias in data links
Quantifying the bias in data linksQuantifying the bias in data links
Quantifying the bias in data links
 
Using Neural Networks to aggregate Linked Data rules
Using Neural Networks to aggregate Linked Data rulesUsing Neural Networks to aggregate Linked Data rules
Using Neural Networks to aggregate Linked Data rules
 
Walking Linked Data: a graph traversal approach to explain clusters
Walking Linked Data: a graph traversal approach to explain clustersWalking Linked Data: a graph traversal approach to explain clusters
Walking Linked Data: a graph traversal approach to explain clusters
 
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked DataDedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
 

Recently uploaded

Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
OECD Directorate for Financial and Enterprise Affairs
 
2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf
Frederic Leger
 
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Dutch Power
 
ASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdfASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdf
ToshihiroIto4
 
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij
 
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussionPro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
OECD Directorate for Financial and Enterprise Affairs
 
Gregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptxGregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptx
gharris9
 
Artificial Intelligence, Data and Competition – OECD – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – OECD – June 2024 OECD discussionArtificial Intelligence, Data and Competition – OECD – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – OECD – June 2024 OECD discussion
OECD Directorate for Financial and Enterprise Affairs
 
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie WellsCollapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Rosie Wells
 
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Dutch Power
 
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
OECD Directorate for Financial and Enterprise Affairs
 
Carrer goals.pptx and their importance in real life
Carrer goals.pptx  and their importance in real lifeCarrer goals.pptx  and their importance in real life
Carrer goals.pptx and their importance in real life
artemacademy2
 
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussionArtificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
OECD Directorate for Financial and Enterprise Affairs
 
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
gpww3sf4
 
XP 2024 presentation: A New Look to Leadership
XP 2024 presentation: A New Look to LeadershipXP 2024 presentation: A New Look to Leadership
XP 2024 presentation: A New Look to Leadership
samililja
 
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
OECD Directorate for Financial and Enterprise Affairs
 
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
SkillCertProExams
 
Updated diagnosis. Cause and treatment of hypothyroidism
Updated diagnosis. Cause and treatment of hypothyroidismUpdated diagnosis. Cause and treatment of hypothyroidism
Updated diagnosis. Cause and treatment of hypothyroidism
Faculty of Medicine And Health Sciences
 
Tom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issueTom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issue
amekonnen
 
Mẫu PPT kế hoạch làm việc sáng tạo cho nửa cuối năm PowerPoint
Mẫu PPT kế hoạch làm việc sáng tạo cho nửa cuối năm PowerPointMẫu PPT kế hoạch làm việc sáng tạo cho nửa cuối năm PowerPoint
Mẫu PPT kế hoạch làm việc sáng tạo cho nửa cuối năm PowerPoint
1990 Media
 

Recently uploaded (20)

Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
 
2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf
 
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
 
ASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdfASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdf
 
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
 
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussionPro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
 
Gregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptxGregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptx
 
Artificial Intelligence, Data and Competition – OECD – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – OECD – June 2024 OECD discussionArtificial Intelligence, Data and Competition – OECD – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – OECD – June 2024 OECD discussion
 
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie WellsCollapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
 
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
 
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
 
Carrer goals.pptx and their importance in real life
Carrer goals.pptx  and their importance in real lifeCarrer goals.pptx  and their importance in real life
Carrer goals.pptx and their importance in real life
 
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussionArtificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
 
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
 
XP 2024 presentation: A New Look to Leadership
XP 2024 presentation: A New Look to LeadershipXP 2024 presentation: A New Look to Leadership
XP 2024 presentation: A New Look to Leadership
 
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
 
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
 
Updated diagnosis. Cause and treatment of hypothyroidism
Updated diagnosis. Cause and treatment of hypothyroidismUpdated diagnosis. Cause and treatment of hypothyroidism
Updated diagnosis. Cause and treatment of hypothyroidism
 
Tom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issueTom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issue
 
Mẫu PPT kế hoạch làm việc sáng tạo cho nửa cuối năm PowerPoint
Mẫu PPT kế hoạch làm việc sáng tạo cho nửa cuối năm PowerPointMẫu PPT kế hoạch làm việc sáng tạo cho nửa cuối năm PowerPoint
Mẫu PPT kế hoạch làm việc sáng tạo cho nửa cuối năm PowerPoint
 

Learning to assess Linked Data relationships using Genetic Programming

  • 1. Ilaria Tiddi, Mathieu d’Aquin, Enrico Motta Learning to Assess Linked Data Relationships Using Genetic Programming @IlaTiddi 20.10.2016 15th International Semantic Web Conference (ISWC 2016)
  • 2. Research Problem Automatically discover what makes a strong relationship between two entities in (the Web of) Linked Data. • relationship : a semantic path between two entities ASongOfIceAnd Fire(novel) GoTASongOfIce AndFire(topic) dc:subject dc:subject
  • 3. Research Problem Automatically discover what makes a strong relationship between two entities in (the Web of) Linked Data. • relationship : a semantic path between two entities • automatically : through graph search techniques ASongOfIceAnd Fire(novel) UnitedStates GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author :born :airedIn dc:subjectdc:subject Fantasy dc:subject dc:subject
  • 4. Research Problem Problem • Entities/properties in a path might come from a number of different, unknown data sources Solution (the easy one) • indexing & preprocessing of a portion of Linked Data • a priori knowledge, computational resources ASongOfIceAnd Fire(novel) UnitedStates GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author :born :airedIn dc:subjectdc:subject Fantasy dc:subject dc:subject
  • 5. Research Problem Solution • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data ASongOfIceAnd Fire(novel) GoT
  • 6. Research Problem Solution • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data ASongOfIceAnd Fire(novel) GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author dc:subject Fantasy dc:subject
  • 7. Research Problem Solution • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data ASongOfIceAnd Fire(novel) GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author dc:subject Fantasy dc:subject
  • 8. Research Problem Solution • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data ASongOfIceAnd Fire(novel) GoTASongOfIce AndFire(topic) dc:subject Fantasy dc:subject UnitedStates:bornGeorgeRRMartin :author
  • 9. Research Problem Solution • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data ASongOfIceAnd Fire(novel) GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author dc:subject Fantasy dc:subject UnitedStates:born
  • 10. Research Problem Solution • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data ASongOfIceAnd Fire(novel) UnitedStates GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author dc:subjectdc:subject Fantasy dc:subject :born
  • 11. Research Problem Solution • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data ASongOfIceAnd Fire(novel) UnitedStates GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author :airedIn dc:subjectdc:subject Fantasy dc:subject dc:subject :born
  • 12. Research Problem ASongOfIceAnd Fire(novel) UnitedStates GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author :airedIn dc:subjectdc:subject Fantasy dc:subject dc:subject Solution • Find paths between entities through Link Traversal • Incremental and agnostic graph exploration • Perform uninformed (or blind) search over Linked Data :born
  • 13. Research Hypothesis Problem Uninformed searches require a cost-function to explore the graph following the most promising paths Hypo Linked Data information can drive a cost-function that detects strong relationships between entities ASongOfIceAnd Fire(novel) UnitedStates GoT GeorgeRRMartin ASongOfIce AndFire(topic) :author :airedIn dc:subjectdc:subject Fantasy dc:subject dc:subject :born
  • 14. Research Questions What makes a path strong? • Which topological or semantic features of nodes/edges? ✗ e.g. length of a path?  entities of different datasets are connected by many paths of similar length How can we use Linked Data to assess strong relationships? • Which information do we need? • Can we use structural features of the graph? Challenges • find topological/semantic features to detect strong relationships • combine these features in a cost-function • perform an effective blind search
  • 15. Proposed Approach • A set of topological/semantic characteristics of the Linked Data graph • a benchmark of human-evaluated relationship paths Identify the cost-function for a blind search that best performs in ranking sets of alternative relationship paths Automatically learn a cost-function to detect strong relationships between Linked Data entities using a supervised method (Genetic Programming)
  • 16. Proposed Approach Genetic Programming: why? • Flexible learning process • Suitable for wide search spaces (such as Linked Data) • Results assessed with a fitness (scores vs. functions) • Human-understandable results • Easy to integrate in a graph search Automatically learn a cost-function to detect strong relationships between Linked Data entities using a supervised method (Genetic Programming) VS
  • 17. Genetic Programming Programs (solutions for a problem) • trees of primitives • functions : internal nodes (mathematical or logical operations) • terminals : leaf nodes (constants or variables) Fitness function (evaluation) • how well the program solves the problem Genetic operations (evolution) • reproduction • crossover from two parents • mutation from one parent Termination condition • maximum number of evolutions • a desired fitness
  • 18. Genetic Programming Procedure • Create random population of programs based on the primitives • Evolve population until an ideal situation is met ✗✗ ✗ ✔✔✗✗ ✔ canned spaghetti meatballs spaghetti tomato sauced penne tomato sauced spaghetti
  • 19. Genetic Programming Given • a starting population of randomly generated cost-functions • sets of alternative paths between two Linked Data entities, ranked by humans Determine how good each cost-function is in ranking paths compared to the human evaluators ✗✗ ✗ ✔✔✗✗ ✔ canned spaghetti meatballs spaghetti tomato sauced penne tomato sauced spaghetti
  • 20. Genetic Programming Primitives Constant terminals • Z= {0, 1000} Aggregated terminals • Topological edge weighs indegree, outdegree, constant weight • Semantic edge weighs usage of namespaces, taxonomies, vocabularies • Aggregators along the path sum, avg, min, max Functions (combining different information) • Math operations addition, multiplication, division, log
  • 21. Genetic Programming Fitness Normalised Discounted Cumulative Gain (nDCG) • (IR) quality of rankings provided by search engines based on the graded relevance of the returned documents • how good is a program in ranking paths based on human ranks • avg(nDCG) across the dataset • length penalty Genetic operations • Reproduction • Crossover • Mutation Learning • Training set + test set • Keep fittest program for each runs on training set • Test them (discard inconsistent)
  • 22. Experiments Dataset Entities (random types from different sources) • 12,630 events from Yago • 8,185 people from the VIAF dataset • 999 movies from the LMDB • 1,174 countries/capitals from Geonames/ the UNESCO dataset Paths (a set of possible paths between them) • select a random pair • bidirectional breadth-first search Assessment • 100 pairs (~10 possible paths per pair) • 8 judges • from (2) highly relevant to (0) not relevant db:Dina- Korzun viaf:Dina- Korzun gn:Europe gn:United- Kingdom lmdb:The SkinGame owl:sameAsdbo:citizenship gno:parent Feature foaf:based _near
  • 23. Experiments Results Different runs (fitness on training set/test set) (T) Topological primitives only (S) Topological + semantic primitives (N) Topological + namespaces primitives Runs Best program Fitness TR Fitness TS T1 log(log(min.cd × min.cd))/max.cd 0.79 0.79 T2 log(min.cd)/(avg.cd + 87) 0.77 0.78 T3 min.cd × (min.cd/max.cd) 0.78 0.72 N1 (log((max.ns/max.cd))/avg.ns) + min.ns 0.82 0.81 N2 (min.dg/sum.cd)/sum.ou) + min.ns 0.79 0.77 N3 min.ns/(log(max.cd)/avg.ns) 0.83 0.75 S1 min.ns + (sum.ns/log(log(sum.si))) 0.88 0.83 S2 min.ns + (min.cd/log(log(sum.si))) 0.88 0.86 S3 min.ns + (log(max.in)/log(log(sum.si))) 0.87 0.86
  • 24. Experiments Results Lower performance for T-runs and N-runs Recurrent terminals • conditional degree (node degree depending on the RDF triple) • namespace variety • number of topic properties (dc:subject/skos:broader/foaf:primaryTopic) Runs Best program Fitness TR Fitness TS T1 log(log(min.cd × min.cd))/max.cd 0.79 0.79 T2 log(min.cd)/(avg.cd + 87) 0.77 0.78 T3 min.cd × (min.cd/max.cd) 0.78 0.72 N1 (log((max.ns/max.cd))/avg.ns) + min.ns 0.82 0.81 N2 (min.dg/sum.cd)/sum.ou) + min.ns 0.79 0.77 N3 min.ns/(log(max.cd)/avg.ns) 0.83 0.75 S1 min.ns + (sum.ns/log(log(sum.si))) 0.88 0.83 S2 min.ns + (min.cd/log(log(sum.si))) 0.88 0.86 S3 min.ns + (log(max.in)/log(log(sum.si))) 0.87 0.86
  • 25. Experiments Comparative evaluation Best programs • automatically learnt vs. literature functions • RECAP,RelFinder,Everything Is Connected Engine, Moore et al. • ad-hoc / handcrafted information theoretical measures
  • 26. Experiments Which cost-function? Interpretation • pass through nodes with rich node descriptions higher min_namespaces = higher path score • not high level entities / few topic categories few incoming topic categories = higher path score • more specific entities (not hubs) for path with few topic categories ratio conditional_degree / inTopicCategories  specific paths are privileged over general paths min_namespaces+ min_conditionalDegree log(log(sum_inTopicCategories))
  • 27. Conclusions Contributions A measure to detect strong relationships in Linked Data  can be integrated in uninformed searches over Linked Data vs. indexing/pre-processing techniques  derived empirically through Genetic Programming vs. domain-specific / handcrafted measures  what is important in Linked Data topological features + little knowledge about the edge vocabulary Future work • Integrate the measure in the blind-search process • Explore more characteristics • Improve the measure
  • 28. THANK YOU VERY MUCH (AND DO NOT MESS UP WITH ITALIAN FOOD) Questions? IlaTiddi ilaria.tiddi@open.ac.uk

Editor's Notes

  1. you need to know these datasets computational efforts that are not necessarily required
  2. LT which allows this is equivalent to performing
  3. to avoid inconclusive searches
  4. there a series of qs to be answered bablabla and if so the challenges are effective = com
  5. a a set of possible topological or semantic features of the nodes and edges in LD
  6. a a set of possible topological or semantic features of the nodes and edges in LD
  7. a a set of possible topological or semantic features of the nodes and edges in LD
  8. a a set of possible topological or semantic features of the nodes and edges in LD
  9. fitting GP to our problem
  10. combination of edge weighting functions
  11. given this dataset unwieghted fitness on trainset/testset
  12. unwieghted fitness on trainset/testset
  13. RelFinder, Recap, Everything is connected Engine, Moore et al.
  14. paths representing the strongest relationships in very simple words prioritises specific paths (e.g. a movie and a person are based in the same region) to more general paths (e.g. a movie and a person are based in the same country). only specific entities (not hubs) for paths with a small number of topic categories. (the ratio between min.cd and log(log(sum.si)) is negative if sum.si is lower than 10) Dataset stability Removal of entities from one data source at a time S-runs programs remain consistent