Focused Exploration of Geospatial 
Context on Linked Open Data 
Thomas Gottron, Johannes Schmitz, Stuart E. Middleton 
20 October 2014 
IESD workshop, Riva del Garda 
Thomas Gottron, Institute for Web Science and Technologies · University of Koblenz-Landau, Germany
Challenge: Focused Exploration of LOD
• Linked Data entities
• (Semantic) link structure
• "Relevant" entities
• Seed entity
• Classification: Which links lead to relevant entities?
• Ranking: How probable is it that a link leads to a relevant entity?
Use Cases:
• Guided exploration
• Focused LOD crawler
Focused Exploration of Geospatial Context
• Relevant entities: Locations semantically related to seed entities
• Figure labels: Rovereto; Bensheim (Germany)
Focused Exploration: Formalisation
• E: set of entities (URIs)
• R: set of RDF triples (s,p,o)
– Restricted to s,o ∈ E
• L ⊆ E: relevant entities
– For us: Locations with coordinates
• Task: for given s' and all (s',p,o) ∈ R
– Classification: Predict which o are in L
– Ranking: Sort object entities o starting from the one presumed most probable to be relevant
• Figure: an entity s ∈ L with wgs84:lat 50.897 and wgs84:long -1.404
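The formalisation can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the toy triples, the `ex:` entity names, the `COORDS` lookup and the helpers `candidates` and `true_labels` are all hypothetical.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    s: str
    p: str
    o: str


# Hypothetical toy data. R only holds entity-to-entity triples (s, o in E).
R = [
    Triple("ex:townA", "ex:twinCity", "ex:townB"),
    Triple("ex:townA", "ex:mayor", "ex:personX"),
    Triple("ex:townB", "ex:locatedIn", "ex:regionC"),
]

# Coordinates are literal-valued statements, so they are kept outside R.
# Values are (wgs84:lat, wgs84:long); all but the first pair are made up.
COORDS = {
    "ex:townA": (50.897, -1.404),
    "ex:townB": (49.0, 8.0),
    "ex:regionC": (49.5, 8.5),
}

# E: all entities that occur as subject or object of a triple in R.
E = {t.s for t in R} | {t.o for t in R}

# L: relevant entities, here the entities that have coordinates.
L = {e for e in E if e in COORDS}


def candidates(seed, triples):
    """All (p, o) pairs with (seed, p, o) in R, i.e. one hop from the seed."""
    return [(t.p, t.o) for t in triples if t.s == seed]


def true_labels(seed, triples, relevant):
    """Ground truth for the classification task: which objects o are in L."""
    return {o: (o in relevant) for _, o in candidates(seed, triples)}


labels = true_labels("ex:townA", R, L)
```

For the seed `ex:townA`, `labels` marks `ex:townB` as relevant and `ex:personX` as irrelevant; the approaches on the following slides try to predict these labels from the links alone.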
5 Approaches 
• Based on 3 paradigms: 
– Schema semantics (1 approach) 
– Supervised machine learning (2 approaches) 
– Information Retrieval inspired (2 approaches) 
Exploration based on Schema Semantics
• Exploit rdfs:range definitions of link predicates
– Figure: dbponto:twinCity rdfs:range dbpedia:City; dbpedia:City rdfs:subClassOf dbpedia:Place
• Follow links which lead to locations
Exploration based on Schema Semantics
Classification
• Range of any p_i is a location? → Label = relevant
Ranking
• Re-use classification:
– Relevant before irrelevant
• Figure: seed s linked to object o via predicates p1, p2, ..., pm; is o a location?
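A minimal sketch of the schema-based approach, assuming the schema is available as two plain dictionaries. The `ex:` predicates and classes are hypothetical, and `SUBCLASS_OF` keeps a single superclass per class for brevity, which real ontologies do not guarantee.

```python
# Hypothetical schema excerpt. Prefixes follow the slide; ex: terms are made up.
RANGE = {
    "dbponto:twinCity": "dbpedia:City",
    "ex:mayor": "ex:Person",
}
SUBCLASS_OF = {
    "dbpedia:City": "dbpedia:Place",
}
LOCATION_CLASS = "dbpedia:Place"


def is_location_class(cls):
    """Walk rdfs:subClassOf upwards until the location class or a root is hit."""
    seen = set()
    while cls is not None and cls not in seen:
        if cls == LOCATION_CLASS:
            return True
        seen.add(cls)  # guards against cycles in the class hierarchy
        cls = SUBCLASS_OF.get(cls)
    return False


def classify(predicates):
    """Label o as relevant if the range of any linking predicate is a location."""
    return any(is_location_class(RANGE.get(p)) for p in predicates)


def rank(links):
    """links maps each object o to the predicates linking s to o.

    Re-uses the classification: relevant objects are placed before
    irrelevant ones; the order within each group is left unchanged.
    """
    return sorted(links, key=lambda o: not classify(links[o]))


links = {
    "ex:personX": ["ex:mayor"],
    "ex:townB": ["dbponto:twinCity", "ex:undeclared"],
}
ranking = rank(links)
```

Predicates without a declared range (`ex:undeclared` above) contribute nothing, so an approach of this kind can only find locations that the schema declares.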
Supervised Machine Learning
• Use incoming link predicates as features
– Learn predicates which typically lead to locations
• Train a classifier (e.g. Naive Bayes)
• 2 variations: use all predicates or only the observed ones
• Figure: objects o and o' with incoming predicates (p2, p3, p4, p6); one object carries wgs84:lat and wgs84:long values
Supervised Machine Learning
Classification
• $P(o \in L) > P(o \notin L)$? → Label = relevant
Ranking
• Rank by odds: $O(o \in L) = \frac{P(o \in L)}{P(o \notin L)}$
• Figure: seed s linked to object o via predicates p1, p2, ..., pm; is o a location?
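The classification rule and the odds ranking can be sketched with a small Bernoulli Naive Bayes over incoming predicates. This is a sketch under assumptions: the Laplace smoothing, the training data and the reading of the two variations (`use_all=True` also scores absent predicates, `use_all=False` scores only the predicates present on the entity) are not taken from the slides.

```python
import math
from collections import Counter


def train(examples):
    """examples: list of (set of incoming predicates, is_relevant) pairs."""
    n = Counter()                              # number of examples per class
    df = {True: Counter(), False: Counter()}   # per class: entities having p
    vocab = set()
    for preds, label in examples:
        n[label] += 1
        df[label].update(preds)
        vocab |= preds
    return n, df, vocab


def log_odds(preds, model, use_all=True):
    """log O(o in L) = log P(o in L) - log P(o not in L) under Naive Bayes."""
    n, df, vocab = model
    total = n[True] + n[False]
    score = {}
    for label in (True, False):
        lp = math.log((n[label] + 1) / (total + 2))       # smoothed prior
        for p in (vocab if use_all else preds & vocab):
            theta = (df[label][p] + 1) / (n[label] + 2)   # smoothed P(p | class)
            lp += math.log(theta if p in preds else 1.0 - theta)
        score[label] = lp
    return score[True] - score[False]


def classify(preds, model, use_all=True):
    """Relevant iff P(o in L) > P(o not in L), i.e. positive log-odds."""
    return log_odds(preds, model, use_all) > 0


def rank(candidates, model, use_all=True):
    """candidates maps each object o to its set of incoming predicates."""
    return sorted(candidates,
                  key=lambda o: log_odds(candidates[o], model, use_all),
                  reverse=True)


# Hypothetical training data.
model = train([
    ({"twinCity", "locatedIn"}, True),
    ({"twinCity"}, True),
    ({"mayor"}, False),
    ({"author"}, False),
])
ranking = rank({"x": {"mayor"}, "b": {"twinCity"}}, model)
```

Working in log space avoids underflow when many predicates are multiplied, and ranking by log-odds gives the same order as ranking by the odds themselves.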
IR Inspired Approaches
• Discriminativeness of predicates (inspired by tf-idf)
• Property relevance frequency: $\mathrm{prf} = c(p, L)$
• Inverse property frequency: $\mathrm{ipf} = \log\left(\frac{c(*,*)}{c(p,*)}\right)$
• Combine into prf-ipf and prr-ipf
– 2nd variation: prr is the normalised prf
• Total score ρ: aggregate over all predicates
• Figure: object o with an incoming predicate p3
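A sketch of the two weighting schemes. The slide only says that prr is a normalised prf and that the total score aggregates over all predicates, so two choices here are assumptions: prr is computed as c(p, L) / c(p, ∗), and the aggregate is a plain sum. The link data is hypothetical.

```python
import math
from collections import Counter


def count_links(links, relevant):
    """links: list of (p, o) pairs. Returns c(p, L), c(p, *) and c(*, *)."""
    c_pL, c_p = Counter(), Counter()
    total = 0
    for p, o in links:
        c_p[p] += 1
        total += 1
        if o in relevant:
            c_pL[p] += 1
    return c_pL, c_p, total


def predicate_weights(links, relevant, normalised=False):
    """prf-ipf weights per predicate, or prr-ipf if normalised is True."""
    c_pL, c_p, total = count_links(links, relevant)
    weights = {}
    for p in c_p:
        prf = c_pL[p]
        if normalised:
            prf = prf / c_p[p]          # assumed normalisation for prr
        ipf = math.log(total / c_p[p])
        weights[p] = prf * ipf
    return weights


def score(predicates, weights):
    """Assumed aggregation: sum of the weights of all predicates linking to o."""
    return sum(weights.get(p, 0.0) for p in predicates)


# Hypothetical observed links and relevant entities.
links = [
    ("twinCity", "b"), ("twinCity", "c"),
    ("mayor", "x"), ("mayor", "y"), ("mayor", "b"),
    ("author", "z"),
]
relevant = {"b", "c"}
prf_ipf = predicate_weights(links, relevant)
prr_ipf = predicate_weights(links, relevant, normalised=True)
```

As with tf-idf, a predicate scores high when it often leads to relevant entities (prf or prr) and is not so frequent overall that it stops being discriminative (ipf).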
IR Inspired Approaches
Classification
• Determine threshold
– Nearest centroid
Ranking
• Rank by score, e.g. ρ_prr-ipf(o)
• Figure: seed s linked to object o via predicates p1, p2, ..., pm; is o a location?
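One way to read the threshold step, sketched under assumptions: with one-dimensional scores, assigning an object to the nearer of the two class centroids is the same as thresholding at the midpoint between the mean score of relevant and of irrelevant training objects. The scores below are hypothetical, and the sketch assumes relevant objects score higher on average.

```python
from statistics import mean


def centroid_threshold(scores, labels):
    """Midpoint between the centroid of relevant and of irrelevant scores."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    return (mean(pos) + mean(neg)) / 2


def classify(score, threshold):
    """Relevant iff the score lies on the relevant centroid's side."""
    return score > threshold


def rank(scored):
    """scored maps each object o to its score; highest score first."""
    return sorted(scored, key=scored.get, reverse=True)


# Hypothetical training scores with known labels.
threshold = centroid_threshold([2.0, 1.8, 0.2, 0.0], [True, True, False, False])
ranking = rank({"x": 0.3, "b": 1.7, "c": 1.1})
```

Ranking needs no threshold at all, which is why the ranking and classification results of the same scoring scheme can differ in quality.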
Evaluation
• Metrics:
– Ranking: ROC curves, AUC
– Classification: Precision, Recall, F1, Accuracy
• Cross validation:
– 10 times 10-fold
– Averages
• Data set (figure labels: Seed, Exploration, owl:sameAs):
– 99,951 entities
– 1,728,633 links
– 425,338 entities, 128,171 relevant
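For reference, the listed metrics can be computed as follows. This is a plain-Python sketch with hypothetical labels and scores; it is not the evaluation code behind the reported numbers. AUC is computed as the fraction of (relevant, irrelevant) pairs that are ordered correctly, with ties counted as half.

```python
def classification_metrics(y_true, y_pred):
    """Precision, recall, F1 and accuracy from boolean labels and predictions."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t and p)
    fp = sum(1 for t, p in pairs if not t and p)
    fn = sum(1 for t, p in pairs if t and not p)
    tn = sum(1 for t, p in pairs if not t and not p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    accuracy = (tp + tn) / len(pairs)
    return {"precision": precision, "recall": recall,
            "f1": f1, "accuracy": accuracy}


def auc(y_true, scores):
    """Area under the ROC curve via pairwise comparison (quadratic time)."""
    pos = [s for t, s in zip(y_true, scores) if t]
    neg = [s for t, s in zip(y_true, scores) if not t]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical labels, predictions and ranking scores.
y_true = [True, True, False, False]
metrics = classification_metrics(y_true, [True, False, True, False])
area = auc(y_true, [0.9, 0.4, 0.6, 0.1])
```

The pairwise AUC is quadratic in the number of objects; for data of the size reported on this slide a sort-based computation would be the practical choice.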
Performance (Ranking)
• Figure: ROC curves, with an inset zoomed to the range 0 to 0.05 on the x-axis and 0.95 to 1 on the y-axis, for: random, Schema Semantics, NB (all predicates), NB (present predicates), prf-ipf, prr-ipf
Performance (Classification & Ranking)
Table 2: Average performance of approaches († indicates significant improvements at confidence level ρ = 0.01)
Method                    Recall   Precision  F1       Accuracy  AUC
Schema Semantics          0.1188   0.8119     0.2073   0.7262    0.5552
NB (all predicates)       0.9906   0.9491     0.9694†  0.9812†   0.9970
NB (observed predicates)  0.9943   0.9436     0.9683   0.9804    0.9968
prf-ipf                   0.8512   0.9754†    0.9091   0.9487    0.9958
prr-ipf                   0.9973†  0.9240     0.9592   0.9745    0.9769
Paper excerpt (cropped on the slide): "... performance in bold. Furthermore, we marked the results where we had a significant [improvement] over the second best method at confidence level of ρ = 0.01. The aggregated ... basically confirm the observations made above. In general, when considering ..."
Summary 
• Focused exploration feasible 
• ML approach performing best 
• Future work: 
– Other data sets 
– Generalise scenario (more than locations) 
– Better approaches using more features 
Questions? 
Thomas Gottron 
Institute for Web Science and Technologies 
Universität Koblenz-Landau 
gottron@uni-koblenz.de 

More Related Content

Viewers also liked

DTADA: Distributed Trusted Agent Based Detection Approach For Doline And Sen...
DTADA: Distributed Trusted Agent Based Detection Approach  For Doline And Sen...DTADA: Distributed Trusted Agent Based Detection Approach  For Doline And Sen...
DTADA: Distributed Trusted Agent Based Detection Approach For Doline And Sen...IOSR Journals
 
Modeling of generation and propagation of cardiac action potential using frac...
Modeling of generation and propagation of cardiac action potential using frac...Modeling of generation and propagation of cardiac action potential using frac...
Modeling of generation and propagation of cardiac action potential using frac...IOSR Journals
 
ISMAR 2012: Tailoring the Adaptive Augmented Reality (A²R) Museum Visit: Iden...
ISMAR 2012: Tailoring the Adaptive Augmented Reality (A²R) Museum Visit: Iden...ISMAR 2012: Tailoring the Adaptive Augmented Reality (A²R) Museum Visit: Iden...
ISMAR 2012: Tailoring the Adaptive Augmented Reality (A²R) Museum Visit: Iden...ARtSENSE_EU
 
Efficient Doubletree: An Algorithm for Large-Scale Topology Discovery
Efficient Doubletree: An Algorithm for Large-Scale Topology DiscoveryEfficient Doubletree: An Algorithm for Large-Scale Topology Discovery
Efficient Doubletree: An Algorithm for Large-Scale Topology DiscoveryIOSR Journals
 
Исследование качества инвестиционного климата и уровня инвестиционной активно...
Исследование качества инвестиционного климата и уровня инвестиционной активно...Исследование качества инвестиционного климата и уровня инвестиционной активно...
Исследование качества инвестиционного климата и уровня инвестиционной активно...Тарас Москаленко
 
How to Create an Effective Imagefest Slideshow
How to Create an Effective Imagefest SlideshowHow to Create an Effective Imagefest Slideshow
How to Create an Effective Imagefest Slideshowcomoxvalleycameraclub
 
егэ виды грамматических ошибок
егэ виды грамматических ошибокегэ виды грамматических ошибок
егэ виды грамматических ошибокNatalya Dyrda
 
The juggler's brain final
The juggler's brain finalThe juggler's brain final
The juggler's brain finalGeorge Haydock
 
царскосельский лицей
царскосельский лицейцарскосельский лицей
царскосельский лицейNatalya Dyrda
 

Viewers also liked (18)

DTADA: Distributed Trusted Agent Based Detection Approach For Doline And Sen...
DTADA: Distributed Trusted Agent Based Detection Approach  For Doline And Sen...DTADA: Distributed Trusted Agent Based Detection Approach  For Doline And Sen...
DTADA: Distributed Trusted Agent Based Detection Approach For Doline And Sen...
 
Modeling of generation and propagation of cardiac action potential using frac...
Modeling of generation and propagation of cardiac action potential using frac...Modeling of generation and propagation of cardiac action potential using frac...
Modeling of generation and propagation of cardiac action potential using frac...
 
Dewis cv
Dewis cvDewis cv
Dewis cv
 
G0933443
G0933443G0933443
G0933443
 
D0931621
D0931621D0931621
D0931621
 
ISMAR 2012: Tailoring the Adaptive Augmented Reality (A²R) Museum Visit: Iden...
ISMAR 2012: Tailoring the Adaptive Augmented Reality (A²R) Museum Visit: Iden...ISMAR 2012: Tailoring the Adaptive Augmented Reality (A²R) Museum Visit: Iden...
ISMAR 2012: Tailoring the Adaptive Augmented Reality (A²R) Museum Visit: Iden...
 
Efficient Doubletree: An Algorithm for Large-Scale Topology Discovery
Efficient Doubletree: An Algorithm for Large-Scale Topology DiscoveryEfficient Doubletree: An Algorithm for Large-Scale Topology Discovery
Efficient Doubletree: An Algorithm for Large-Scale Topology Discovery
 
Исследование качества инвестиционного климата и уровня инвестиционной активно...
Исследование качества инвестиционного климата и уровня инвестиционной активно...Исследование качества инвестиционного климата и уровня инвестиционной активно...
Исследование качества инвестиционного климата и уровня инвестиционной активно...
 
10____
  10____  10____
10____
 
Sel
SelSel
Sel
 
Abecedario
AbecedarioAbecedario
Abecedario
 
F0313239
F0313239F0313239
F0313239
 
Bangkok
BangkokBangkok
Bangkok
 
How to Create an Effective Imagefest Slideshow
How to Create an Effective Imagefest SlideshowHow to Create an Effective Imagefest Slideshow
How to Create an Effective Imagefest Slideshow
 
егэ виды грамматических ошибок
егэ виды грамматических ошибокегэ виды грамматических ошибок
егэ виды грамматических ошибок
 
G0345259
G0345259G0345259
G0345259
 
The juggler's brain final
The juggler's brain finalThe juggler's brain final
The juggler's brain final
 
царскосельский лицей
царскосельский лицейцарскосельский лицей
царскосельский лицей
 

Similar to Focused Exploration of Geospatial Context on Linked Open Data

The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning Jeff Z. Pan
 
Data structures and algorithms
Data structures and algorithmsData structures and algorithms
Data structures and algorithmsJulie Iskander
 
Towards advanced data retrieval from learning objects repositories
Towards advanced data retrieval from learning objects repositoriesTowards advanced data retrieval from learning objects repositories
Towards advanced data retrieval from learning objects repositoriesValentina Paunovic
 
Personalised Search for the Social Semantic Web
Personalised Search for the Social Semantic WebPersonalised Search for the Social Semantic Web
Personalised Search for the Social Semantic WebOana Tifrea-Marciuska
 
The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?Frank van Harmelen
 
Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaMachine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaYohanes Nuwara
 
GDSC SSN - solution Challenge : Fundamentals of Decision Making
GDSC SSN - solution Challenge : Fundamentals of Decision MakingGDSC SSN - solution Challenge : Fundamentals of Decision Making
GDSC SSN - solution Challenge : Fundamentals of Decision MakingGDSCSSN
 
Knowledge engg using & in fol
Knowledge engg using & in folKnowledge engg using & in fol
Knowledge engg using & in folchandsek666
 
machine-learning-with-large-networks-of-people-and-places
machine-learning-with-large-networks-of-people-and-placesmachine-learning-with-large-networks-of-people-and-places
machine-learning-with-large-networks-of-people-and-placesTony Frame
 
10. Getting Spatial
10. Getting Spatial10. Getting Spatial
10. Getting SpatialFAO
 
A Comparison of Propositionalization Strategies for Creating Features from Li...
A Comparison of Propositionalization Strategies for Creating Features from Li...A Comparison of Propositionalization Strategies for Creating Features from Li...
A Comparison of Propositionalization Strategies for Creating Features from Li...Petar Ristoski
 
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLModeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLKostis Kyzirakos
 
Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Charles Martin
 
Artificial intelligence for Social Good
Artificial intelligence for Social GoodArtificial intelligence for Social Good
Artificial intelligence for Social GoodOana Tifrea-Marciuska
 
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...Asai Masataro
 
Spatial data mining
Spatial data miningSpatial data mining
Spatial data miningMITS Gwalior
 
Perplexity of Index Models over Evolving Linked Data
Perplexity of Index Models over Evolving Linked Data Perplexity of Index Models over Evolving Linked Data
Perplexity of Index Models over Evolving Linked Data Thomas Gottron
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesOntotext
 

Similar to Focused Exploration of Geospatial Context on Linked Open Data (20)

The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning
 
Data structures and algorithms
Data structures and algorithmsData structures and algorithms
Data structures and algorithms
 
Towards advanced data retrieval from learning objects repositories
Towards advanced data retrieval from learning objects repositoriesTowards advanced data retrieval from learning objects repositories
Towards advanced data retrieval from learning objects repositories
 
Personalised Search for the Social Semantic Web
Personalised Search for the Social Semantic WebPersonalised Search for the Social Semantic Web
Personalised Search for the Social Semantic Web
 
The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?
 
Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North SeaMachine Learning Applications in Subsurface Analysis: Case Study in North Sea
Machine Learning Applications in Subsurface Analysis: Case Study in North Sea
 
GDSC SSN - solution Challenge : Fundamentals of Decision Making
GDSC SSN - solution Challenge : Fundamentals of Decision MakingGDSC SSN - solution Challenge : Fundamentals of Decision Making
GDSC SSN - solution Challenge : Fundamentals of Decision Making
 
Knowledge engg using & in fol
Knowledge engg using & in folKnowledge engg using & in fol
Knowledge engg using & in fol
 
machine-learning-with-large-networks-of-people-and-places
machine-learning-with-large-networks-of-people-and-placesmachine-learning-with-large-networks-of-people-and-places
machine-learning-with-large-networks-of-people-and-places
 
10. Getting Spatial
10. Getting Spatial10. Getting Spatial
10. Getting Spatial
 
A Comparison of Propositionalization Strategies for Creating Features from Li...
A Comparison of Propositionalization Strategies for Creating Features from Li...A Comparison of Propositionalization Strategies for Creating Features from Li...
A Comparison of Propositionalization Strategies for Creating Features from Li...
 
A Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF GraphsA Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF Graphs
 
Labreport
LabreportLabreport
Labreport
 
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQLModeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
Modeling and Querying Metadata in the Semantic Sensor Web: stRDF and stSPARQL
 
Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3
 
Artificial intelligence for Social Good
Artificial intelligence for Social GoodArtificial intelligence for Social Good
Artificial intelligence for Social Good
 
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
 
Spatial data mining
Spatial data miningSpatial data mining
Spatial data mining
 
Perplexity of Index Models over Evolving Linked Data
Perplexity of Index Models over Evolving Linked Data Perplexity of Index Models over Evolving Linked Data
Perplexity of Index Models over Evolving Linked Data
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
 

More from Thomas Gottron

Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...
Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...
Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...Thomas Gottron
 
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources Thomas Gottron
 
Making Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresMaking Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresThomas Gottron
 
 Challenges in Managing Online Business Communities
 Challenges in Managing Online Business Communities Challenges in Managing Online Business Communities
 Challenges in Managing Online Business CommunitiesThomas Gottron
 
ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...
ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...
ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...Thomas Gottron
 
Challenging Retrieval Scenarios: Social Media and Linked Open Data
Challenging Retrieval Scenarios: Social Media and Linked Open DataChallenging Retrieval Scenarios: Social Media and Linked Open Data
Challenging Retrieval Scenarios: Social Media and Linked Open DataThomas Gottron
 
Get the Google Feeling! Supporting Users in Finding Relevant Sources
Get the Google Feeling! Supporting Users in Finding Relevant SourcesGet the Google Feeling! Supporting Users in Finding Relevant Sources
Get the Google Feeling! Supporting Users in Finding Relevant SourcesThomas Gottron
 
Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...
Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...
Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...Thomas Gottron
 

More from Thomas Gottron (8)

Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...
Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...
Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...
 
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
 
Making Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index StructuresMaking Use of the Linked Data Cloud: The Role of Index Structures
Making Use of the Linked Data Cloud: The Role of Index Structures
 
 Challenges in Managing Online Business Communities
 Challenges in Managing Online Business Communities Challenges in Managing Online Business Communities
 Challenges in Managing Online Business Communities
 
ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...
ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...
ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...
 
Challenging Retrieval Scenarios: Social Media and Linked Open Data
Challenging Retrieval Scenarios: Social Media and Linked Open DataChallenging Retrieval Scenarios: Social Media and Linked Open Data
Challenging Retrieval Scenarios: Social Media and Linked Open Data
 
Get the Google Feeling! Supporting Users in Finding Relevant Sources
Get the Google Feeling! Supporting Users in Finding Relevant SourcesGet the Google Feeling! Supporting Users in Finding Relevant Sources
Get the Google Feeling! Supporting Users in Finding Relevant Sources
 
Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...
Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...
Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...
 

Recently uploaded

Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Tamer Koksalan, PhD
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPirithiRaju
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 

Recently uploaded (20)

Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 

Focused Exploration of Geospatial Context on Linked Open Data

  • 1. Focused Exploration of Geospatial Context on Linked Open Data. Thomas Gottron, Johannes Schmitz, Stuart E. Middleton. 20 October 2014, IESD workshop, Riva del Garda. Institute for Web Science and Technologies · University of Koblenz-Landau, Germany
  • 2. Challenge: Focused Exploration of LOD • Linked Data entities
  • 3. Challenge: Focused Exploration of LOD • Linked Data entities • (Semantic) link structure
  • 4. Challenge: Focused Exploration of LOD • Linked Data entities • (Semantic) link structure • „Relevant“ entities
  • 5. Challenge: Focused Exploration of LOD • Linked Data entities • (Semantic) link structure • „Relevant“ entities • Seed entity
  • 6. Challenge: Focused Exploration of LOD • Linked Data entities • (Semantic) link structure • „Relevant“ entities • Seed entity • Classification: Which links lead to relevant entities? • Ranking: How probable is it that a link leads to a relevant entity? • Use cases: Guided exploration, focused LOD crawler
  • 7. Focused Exploration of Geospatial Context • Relevant entities: Locations semantically related to seed entities [Figure labels: Rovereto, Bensheim (Germany)]
  • 8. Focused Exploration: Formalisation • E: set of entities (URIs) • R: set of RDF triples (s,p,o) – Restricted to s,o ∈ E • L ⊆ E: relevant entities – For us: Locations with coordinates • Task: for a given s′ and all (s′,p,o) ∈ R – Classification: Predict which o are in L – Ranking: Sort object entities o starting from the one presumed most probable to be relevant [Figure: an entity s ∈ L with wgs84:lat 50.897 and wgs84:long -1.404]
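
The formalisation on slide 8 can be made concrete with a small sketch. The `Triple` type, the helper names and the `ex:` URIs below are illustrative assumptions, not part of the talk. The labelling function is an oracle that looks up membership in L directly, so it only shows what the approaches on the later slides have to predict.

```python
from typing import Iterable, NamedTuple


class Triple(NamedTuple):
    s: str  # subject URI
    p: str  # predicate URI
    o: str  # object URI


def candidate_links(seed: str, triples: Iterable[Triple]) -> list[Triple]:
    """Return all triples (s', p, o) in R whose subject is the seed s'."""
    return [t for t in triples if t.s == seed]


def oracle_labels(seed: str, triples: Iterable[Triple], relevant: set[str]) -> dict[str, bool]:
    """Ground-truth labels: an object o is relevant iff it is in L."""
    return {t.o: (t.o in relevant) for t in candidate_links(seed, triples)}


# Toy data with hypothetical URIs (not taken from the talk's data set).
R = [
    Triple("ex:Rovereto", "ex:twinCity", "ex:Bensheim"),
    Triple("ex:Rovereto", "ex:mayor", "ex:SomePerson"),
    Triple("ex:Bensheim", "ex:country", "ex:Germany"),
]
L = {"ex:Rovereto", "ex:Bensheim", "ex:Germany"}  # entities with coordinates

labels = oracle_labels("ex:Rovereto", R, L)
```

A classifier replaces the lookup in `oracle_labels` with a prediction that uses only information available before the link is followed.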
  • 9. 5 Approaches • Based on 3 paradigms: – Schema semantics (1 approach) – Supervised machine learning (2 approaches) – Information Retrieval inspired (2 approaches)
  • 10. Exploration based on Schema Semantics • Exploit rdfs:range definitions of link predicates • Follow links which lead to locations [Figure labels: dbponto:twinCity, rdfs:range, dbpedia:City, rdfs:subClassOf, dbpedia:Place]
  • 11. Exploration based on Schema Semantics • Classification: Range of any p_i is a location? → Label = relevant • Ranking: Re-use classification – Relevant before irrelevant [Figure labels: s, p1, p2, ..., pm, o, Location?]
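
A minimal sketch of the schema-semantics approach from slides 10 and 11. It assumes the figure on slide 10 says that dbponto:twinCity has rdfs:range dbpedia:City and that dbpedia:City is an rdfs:subClassOf dbpedia:Place. The dictionaries, function names and `ex:` URIs are hypothetical. Predicates with no declared range are treated as not leading to a location.

```python
def superclasses(cls, subclass_of):
    """Return cls plus every class reachable from it via rdfs:subClassOf."""
    seen, stack = set(), [cls]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(subclass_of.get(current, ()))
    return seen


def leads_to_location(predicate, ranges, subclass_of, location_class="dbpedia:Place"):
    """True if any declared rdfs:range of the predicate is a (sub)class of the location class."""
    return any(
        location_class in superclasses(rng, subclass_of)
        for rng in ranges.get(predicate, ())
    )


def rank_links(links, ranges, subclass_of):
    """Re-use the classification for ranking: relevant links before irrelevant ones."""
    # sorted() is stable, so links keep their original order within each group.
    return sorted(links, key=lambda link: not leads_to_location(link[0], ranges, subclass_of))


# Toy schema under the assumed reading of the slide's figure.
ranges = {"dbponto:twinCity": ["dbpedia:City"]}
subclass_of = {"dbpedia:City": ["dbpedia:Place"]}

links = [("ex:mayor", "ex:SomePerson"), ("dbponto:twinCity", "ex:Bensheim")]
ranked = rank_links(links, ranges, subclass_of)
```

Because the classifier is binary, the ranking has only two tiers.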
  • 12. Supervised Machine Learning • Use incoming link predicates as features – Learn predicates which typically lead to locations • Train a classifier (e.g. Naive Bayes) • 2 variations: Use all or only observed predicates [Figure labels: p2, p3, p4, p6, o, o′, wgs84:lat xxx, wgs84:long yyy]
  • 13. Supervised Machine Learning • Classification: P(o ∈ L) > P(o ∉ L)? → Label = relevant • Ranking: Rank by odds O(o ∈ L) = P(o ∈ L) / P(o ∉ L) [Figure labels: s, p1, p2, ..., pm, o, Location?]
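
The idea on slides 12 and 13 can be sketched as a small Naive Bayes model over sets of incoming predicates. The slides do not give implementation details, so several choices here are assumptions: a Bernoulli model with Laplace smoothing, and a reading of the two variations in which "all predicates" also scores the absence of known predicates while "observed predicates" scores only those present. The class name and toy predicates are hypothetical. Working with log odds keeps the classification rule (log odds > 0) and the ranking criterion in one function.

```python
import math
from collections import Counter


class PredicateNaiveBayes:
    def __init__(self, use_all_predicates=True):
        self.use_all = use_all_predicates

    def fit(self, examples):
        """examples: iterable of (set_of_incoming_predicates, is_relevant)."""
        self.n = {True: 0, False: 0}
        self.count = {True: Counter(), False: Counter()}
        for preds, label in examples:
            self.n[label] += 1
            self.count[label].update(set(preds))
        self.vocab = set(self.count[True]) | set(self.count[False])
        return self

    def _p(self, pred, label):
        # Laplace-smoothed estimate of P(pred present | label).
        return (self.count[label][pred] + 1) / (self.n[label] + 2)

    def log_odds(self, preds):
        """log O(o in L) = log P(o in L) - log P(o not in L)."""
        preds = set(preds)
        # Smoothed class prior; the shared denominator cancels in the ratio.
        score = math.log(self.n[True] + 1) - math.log(self.n[False] + 1)
        features = self.vocab if self.use_all else (preds & self.vocab)
        for p in features:
            if p in preds:
                score += math.log(self._p(p, True)) - math.log(self._p(p, False))
            else:
                score += math.log(1 - self._p(p, True)) - math.log(1 - self._p(p, False))
        return score

    def is_relevant(self, preds):
        return self.log_odds(preds) > 0


# Toy training data with hypothetical predicate names.
train = [
    ({"twinCity"}, True),
    ({"birthPlace"}, True),
    ({"twinCity", "birthPlace"}, True),
    ({"author"}, False),
    ({"mayor"}, False),
    ({"author", "mayor"}, False),
]
nb_all = PredicateNaiveBayes(use_all_predicates=True).fit(train)
nb_observed = PredicateNaiveBayes(use_all_predicates=False).fit(train)
```

Sorting candidate objects by `log_odds` in descending order gives the ranking, since the logarithm is monotone in the odds.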
  • 14. IR Inspired Approaches • Discriminativeness of predicates (inspired by tf-idf) • Property relevance frequency: prf = c(p, L) • Inverse property frequency: ipf = log( c(∗,∗) / c(p,∗) ) • 2nd variation: prr, a normalised prf • Combine into prf-ipf and prr-ipf • Total score ρ: aggregate over all predicates
  • 15. IR Inspired Approaches • Classification: Determine threshold – Nearest centroid • Ranking: Rank by score ρ_{prr-ipf}(o) [Figure labels: s, p1, p2, ..., pm, o, Location?]
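
A hedged sketch of the IR-inspired scores from slides 14 and 15. The slides define prf = c(p, L) and ipf = log(c(∗,∗) / c(p,∗)) but not how prr is normalised, how the two factors are combined, or how ρ aggregates. The code assumes prr = c(p, L) / c(p,∗), a product of the two factors as in tf-idf, and a sum over an object's incoming predicates. The nearest-centroid threshold is taken as the midpoint between the mean scores of relevant and irrelevant training objects. All function names and toy data are hypothetical.

```python
import math
from collections import Counter
from statistics import mean


def fit_weights(links, normalise=False):
    """links: iterable of (predicate, object_is_relevant) pairs from training data.

    Returns a weight per predicate: prf-ipf, or prr-ipf if normalise is True.
    """
    c_p = Counter()    # c(p, *): links using predicate p
    c_p_L = Counter()  # c(p, L): links using p that lead into L
    for p, relevant in links:
        c_p[p] += 1
        if relevant:
            c_p_L[p] += 1
    total = sum(c_p.values())  # c(*, *): all links
    weights = {}
    for p in c_p:
        prf = c_p_L[p]
        if normalise:
            prf = prf / c_p[p]  # assumed definition of prr
        ipf = math.log(total / c_p[p])
        weights[p] = prf * ipf
    return weights


def score(preds, weights):
    """Total score rho: sum of weights over the incoming predicates (assumed aggregation)."""
    return sum(weights.get(p, 0.0) for p in preds)


def centroid_threshold(scores_relevant, scores_irrelevant):
    """Midpoint between the two class centroids of the one-dimensional score."""
    return (mean(scores_relevant) + mean(scores_irrelevant)) / 2


# Toy training links with hypothetical predicate names.
links = (
    [("twinCity", True)] * 3
    + [("author", False)] * 4
    + [("related", True), ("related", False)]
)
w_prr = fit_weights(links, normalise=True)
w_prf = fit_weights(links, normalise=False)
threshold = centroid_threshold(
    [score({"twinCity"}, w_prr), score({"twinCity", "related"}, w_prr)],
    [score({"author"}, w_prr)],
)
```

An object is then labelled relevant when its score exceeds `threshold`, and the same score orders the candidates for ranking.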
  • 16. Evaluation • Metrics: – Ranking: ROC curves, AUC – Classification: Precision, Recall, F1, Accuracy • Cross validation: – 10-times / 10-fold – Averages [Figure: data set. Labels: Seed, Exploration, owl:sameAs; 99,951 entities; 1,728,633 links; 425,338 entities; 128,171 relevant]
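
For reference, the metrics on slide 16 can be computed as in the sketch below. It uses the standard definitions, with AUC computed as the probability that a randomly chosen relevant object scores higher than a randomly chosen irrelevant one (ties counted as one half). The function names and toy inputs are illustrative. The quadratic AUC loop is fine for a demonstration but too slow for data of the size used in the talk.

```python
def classification_metrics(y_true, y_pred):
    """Precision, recall, F1 and accuracy for boolean labels and predictions."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t and p)
    fp = sum(1 for t, p in pairs if not t and p)
    fn = sum(1 for t, p in pairs if t and not p)
    tn = sum(1 for t, p in pairs if not t and not p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = (tp + tn) / len(pairs)
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


def auc(y_true, scores):
    """Area under the ROC curve via pairwise comparison of relevant and irrelevant scores."""
    pos = [s for t, s in zip(y_true, scores) if t]
    neg = [s for t, s in zip(y_true, scores) if not t]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example: two relevant and two irrelevant objects.
y_true = [True, True, False, False]
y_pred = [True, False, True, False]
scores = [0.9, 0.4, 0.6, 0.1]

metrics = classification_metrics(y_true, y_pred)
area = auc(y_true, scores)
```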
  • 17. Performance (Ranking) [Figure: ROC curves for random, Schema Semantics, NB (all predicates), NB (present predicates), prf-ipf and prr-ipf, with an inset zoomed to the range 0 to 0.05 on one axis and 0.95 to 1 on the other]
  • 18. Performance (Classification & Ranking) • Table 2: Average performance of approaches († indicates significant improvements at confidence level ρ = 0.01)

      Method                     Recall    Precision  F1        Accuracy  AUC
      Schema Semantics           0.1188    0.8119     0.2073    0.7262    0.5552
      NB (all predicates)        0.9906    0.9491     †0.9694   †0.9812   0.9970
      NB (observed predicates)   0.9943    0.9436     0.9683    0.9804    0.9968
      prf-ipf                    0.8512    †0.9754    0.9091    0.9487    0.9958
      prr-ipf                    †0.9973   0.9240     0.9592    0.9745    0.9769

    "... performance in bold. Furthermore, we marked the results where we had a significant [...] over the second best method at confidence level of ρ = 0.01. The aggregated [...] basically confirm the observations made above. In general, when considering ..."
  • 19. Summary • Focused exploration feasible • ML approach performing best • Future work: – Other data sets – Generalise scenario (more than locations) – Better approaches using more features
  • 20. Questions? Thomas Gottron, Institute for Web Science and Technologies, Universität Koblenz-Landau, gottron@uni-koblenz.de