SlideShare a Scribd company logo
Benchmarking Link Discovery
Systems for Geo-Spatial Data
Tzanina Saveta
Giorgos Flouris, Irini Fundulaki
Institute of Computer Science-FORTH, Greece
Axel-Cyrille Ngonga Ngomo
Paderborn University,Germany
Benchmarking Link Discovery Systems for Geo-Spatial Data 1
Why Benchmarking Link Discovery Systems for
Geospatial Data?
• Large amount of available geospatial datasets and link
discovery systems
• Limited number of benchmarks for link discovery systems
geospatial data
• Propose: Spatial Benchmark Generator
– differs from the classical, mostly string-based approaches
– test the performance of link discovery systems that deal with
the DE-9IM (Dimensionally Extended nine-Intersection Model)
model that is used to represent topological relations
– Choke-point based design
Benchmarking Link Discovery Systems for Geo-Spatial Data 2
Essential to determine the effectiveness and the
efficiency of the proposed techniques and tools
Spatial Benchmark Ontology & Datasets
• TomTom traffic data archive contains approximately 15 trillion
(1:5 1013) speed measurements from hundreds of millions of
road segments all over the world, mostly based on
anonymized GPS positioning data
Benchmarking Link Discovery Systems for Geo-Spatial Data 3
Trace
Point
has
Point
Velocity
Motion
Property
Vectorwgs84_pos:
Point
xsd:TimeStamp
hasTimeStamp
has
Speed
km_per_hour, Float,
miles_per_hour
velocityMetric velocityValue
xsd:Float
Choke Points & Key Performance Indicators (KPIs)
Benchmarking Link Discovery Systems for Geo-Spatial Data 4
Choke Points Key Performance
IndicatorsCP1: Scalability
• Large datasets
• Large number of
properties for each
instance
CP2: Output Quality Metric
CP3: Performance
Dataset size
Precision, Recall,
F-measure
Link DiscoveryTime
Spatial Benchmark (Overview)
• Source Dataset
– Consists of traces, a trace represented as a LineString in the
Well-known text (WKT) format
• Target Dataset
– Consists of traces (LineString) represented in theWKT format
Given aWKT Geometry s as source and a DE-9IM Relation r we to
generate aWKT target Geometry t such as their Intersection Matrix
follows the definition of Relation r.
• Gold Standard
– Stores pairs of matched (source, target) traces
– Generated using RADON [SDS+16] to ensure completeness and
correctness
Benchmarking Link Discovery Systems for Geo-Spatial Data 5
DE-9IMTopological Relations between LineStrings
Benchmarking Link Discovery Systems for Geo-Spatial Data 6
EQUALS DISJOINT TOUCHES
CONTAINS & COVERS INTERSECTS CROSSES
OVERLAPS
JTSTopology Suite
• The JTSTopology Suite is a Java API that implements a core
set of spatial data operations using an explicit precision model
and robust geometric algorithms
• Spatial benchmark implements an extension of the JTS
Topology Suite that allows us to generate
– a target geometry given a source geometry and a DE-9IM
relation
Benchmarking Link Discovery Systems for Geo-Spatial Data 7
--------- SOURCE ---------
t1 aTrace .
t1 hasGeometry "LINESTRING (8.79232 48.92266, 8.79186 48.92246, 8.79135
48.92229, 8.79079 48.92219, 8.79012 48.92216, 8.78957 48.92207, 8.78903 48.92185,
8.78848 48.92162, 8.78807 48.92133, 8.78782 48.92097, 8.78771 48.92068, 8.78737
48.92031, 8.78695 48.92004)"
^^<http://strdf.di.uoa.gr/ontology#WKT> .
---------TARGET ---------
t1' aTrace .
t1' hasGeometry "LINESTRING (8.78903 48.92185, 8.78848 48.92162, 8.78807
48.92133, 8.78782 48.92097, 8.78771 48.92068, 8.78737 48.92031)"
^^<http://strdf.di.uoa.gr/ontology#WKT> .
--------- GOLD STANDARD (e.g. for COVERS) ---------
t1 t1'.
Datasets example
Architecture: Data Generation
Benchmarking Link Discovery Systems for Geo-Spatial Data 9
Input Dataset
(Traces)
Initialization
Module
Resource
Generation
Resource
Transformation
(JTS Extension)
Test Case
Generation
Parameters
Source
Data
Target
Data
RADON
Gold
Standard
Experiments for Spatial Benchmark (1)
• Objective: Provide a comparative analysis to assess the ability of
RADON [SDS+16] and Silk [SK16] to manage DE-9IM relations
• Experiment set up: HOBBIT Platform time limit = 75 minutes
• Issues encountered
– Scalability
• Due to a problem encountered for SILK, we had to select small
traces (no larger than 64 Kbytes)
– Output Quality Metric
• Precision, recall and f-measure were equal to 1.0
– Time Performance
• Show the rate between the time performance and trace size (10,
100, 1K and 10K)
• Show the time needed by link discovery systems for the different
topological relations
Benchmarking Link Discovery Systems for Geo-Spatial Data 10
Experiments for Spatial Benchmark (2)
27-Oct-17 Benchmarking Link Discovery Systems for Geo-Spatial Data 11
RELATION System #10 #100 #1K #10K
EQUALS RADON 1139 1290 3891 23509
Silk 2424 3804 37585 704821
CROSSES RADON 1209 1996 5139 88799
Silk 3126 3927 55691 1035194
COVERED BY RADON 1124 1334 4793 28095
Silk Not Supported – same as Within for LineStrings
COVERS RADON 1147 1235 4158 28673
Silk Not Supported – same as Contains for LineStrings
DISJOINT RADON 567 820 7269 18031
Silk 2686 5843 206313 Platform time limit *
OVERLAPS RADON 1209 3047 21829 1452969
Silk 3693 10790 221734 Platform time limit *
WITHIN RADON 1116 1311 3899 30233
Silk 2859 3151 27933 550819
CONTAINS RADON 1163 1294 3155 30560
Silk 3170 3079 26094 538441
INTERSECTS RADON 1400 6396 94733 Platform time limit *
Silk 3628 13175 381310 Platform time limit *
TOUCHES RADON 1436 4958 118104 Platform time limit *
Silk 3072 19786 601024 Platform time limit
Time rate (in ms) while increasing population for each topological relation for RADON and Silk.
Experiments for Spatial Benchmark (3)
Benchmarking Link Discovery Systems for Geo-Spatial Data 12
Experiments for Spatial Benchmark (4)
Benchmarking Link Discovery Systems for Geo-Spatial Data 13
Overview of Experimental Results (1)
• DE-9IM relations
– DISJOINT needs the shortest time for the systems for small
number of traces but stops due to the platform time limit when
increasing the dataset size
– EQUALS, CROSSES, COVERS/COVERED BY and
CONTAINS/WITHIN need approximately the same time
– INTERSECTS, TOUCHES and OVERLAPS seem to be the
hardest relations for the systems
Benchmarking Link Discovery Systems for Geo-Spatial Data 14
Overview of Experimental Results (2)
• Link Discovery Systems
– RADON handles the growth of the dataset size smoother than
SILK
– For small number of traces, RADON needs approximately half
time than SILK
– When the size increases, RADON outperforms SILK by more
than one order of magnitude
Benchmarking Link Discovery Systems for Geo-Spatial Data 15
[SDS+16] Mohamed Ahmed Sherif, Kevin Dreßler, Panayiotis Smeros,
and Axel- Cyrille Ngonga Ngomo. RADON - Rapid Discovery of
Topological Relations. In Proceedings ofTheThirty-First AAAI
Conference onArtificial Intelligence (AAAI-17), 2017.
[SK16] P. Smeros and M. Koubarakis. Discovering Spatial andTemporal
Links Among RDF Data. InWWW2016 Workshop: Linked Data on the
Web (LDOW2016), Montreál, Canada, 2016.
References
17
This work was supported by grands from the EU H2020 Framework Programme
provided for the project HOBBIT (GA no. 688227).

More Related Content

What's hot

Network analysis for shortest optimum path
Network analysis for shortest optimum pathNetwork analysis for shortest optimum path
Network analysis for shortest optimum path
Sourabh Jain
 
Beyond good enough? Spatial Data Quality and OpenStreetMap data
Beyond good enough? Spatial Data Quality and OpenStreetMap dataBeyond good enough? Spatial Data Quality and OpenStreetMap data
Beyond good enough? Spatial Data Quality and OpenStreetMap data
Muki Haklay
 
SmartGeo - G. Satta
SmartGeo - G. SattaSmartGeo - G. Satta
SmartGeo - G. Satta
Sardegna Ricerche
 
Popescu.ppt
Popescu.pptPopescu.ppt
Popescu.ppt
grssieee
 
Network Analysis in ArcGIS
Network Analysis in ArcGISNetwork Analysis in ArcGIS
Network Analysis in ArcGIS
John Reiser
 
PANSAT COM AB05-CD06 Final Report
PANSAT COM AB05-CD06 Final ReportPANSAT COM AB05-CD06 Final Report
PANSAT COM AB05-CD06 Final Report
Brian Martiniello
 
Introduction to spatial data mining
Introduction to spatial data miningIntroduction to spatial data mining
Introduction to spatial data mining
Hoang Nguyen
 
Watershed analysis using GIS
Watershed analysis using GISWatershed analysis using GIS
Watershed analysis using GIS
Aya Osman
 
Network Flow Pattern Extraction by Clustering Eugine Kang
Network Flow Pattern Extraction by Clustering Eugine KangNetwork Flow Pattern Extraction by Clustering Eugine Kang
Network Flow Pattern Extraction by Clustering Eugine Kang
Eugine Kang
 
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Beniamino Murgante
 
Analyzing the performance of the dynamic
Analyzing the performance of the dynamicAnalyzing the performance of the dynamic
Analyzing the performance of the dynamic
IJCNCJournal
 
PPT TRAINING
PPT TRAININGPPT TRAINING
PPT TRAINING
Aakash Sharma
 
AI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open WorkshopAI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open Workshop
ExtremeEarth
 
SLAM Constructor Framework for ROS
SLAM Constructor Framework for ROSSLAM Constructor Framework for ROS
SLAM Constructor Framework for ROS
OSLL
 
PIC Tier-1 (LHCP Conference / Barcelona)
PIC Tier-1 (LHCP Conference / Barcelona)PIC Tier-1 (LHCP Conference / Barcelona)
PIC Tier-1 (LHCP Conference / Barcelona)
Josep Flix
 
ruSMART 2013 presentation
ruSMART 2013 presentationruSMART 2013 presentation
ruSMART 2013 presentation
Oscar Rodríguez Rocha
 
Active learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classificationActive learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classification
Pioneer Natural Resources
 
Contribution Of Real Time Network (NRTK) for improvement of accuracy
Contribution Of Real Time Network (NRTK) for improvement of accuracyContribution Of Real Time Network (NRTK) for improvement of accuracy
Contribution Of Real Time Network (NRTK) for improvement of accuracy
Nzar Braim
 
FUZZY CLUSTERING FOR IMPROVED POSITIONING
FUZZY CLUSTERING FOR IMPROVED POSITIONINGFUZZY CLUSTERING FOR IMPROVED POSITIONING
FUZZY CLUSTERING FOR IMPROVED POSITIONING
ijitjournal
 
Data dissemination based on complex networks
Data dissemination based on complex networksData dissemination based on complex networks
Data dissemination based on complex networks
Shahid Beheshti University
 

What's hot (20)

Network analysis for shortest optimum path
Network analysis for shortest optimum pathNetwork analysis for shortest optimum path
Network analysis for shortest optimum path
 
Beyond good enough? Spatial Data Quality and OpenStreetMap data
Beyond good enough? Spatial Data Quality and OpenStreetMap dataBeyond good enough? Spatial Data Quality and OpenStreetMap data
Beyond good enough? Spatial Data Quality and OpenStreetMap data
 
SmartGeo - G. Satta
SmartGeo - G. SattaSmartGeo - G. Satta
SmartGeo - G. Satta
 
Popescu.ppt
Popescu.pptPopescu.ppt
Popescu.ppt
 
Network Analysis in ArcGIS
Network Analysis in ArcGISNetwork Analysis in ArcGIS
Network Analysis in ArcGIS
 
PANSAT COM AB05-CD06 Final Report
PANSAT COM AB05-CD06 Final ReportPANSAT COM AB05-CD06 Final Report
PANSAT COM AB05-CD06 Final Report
 
Introduction to spatial data mining
Introduction to spatial data miningIntroduction to spatial data mining
Introduction to spatial data mining
 
Watershed analysis using GIS
Watershed analysis using GISWatershed analysis using GIS
Watershed analysis using GIS
 
Network Flow Pattern Extraction by Clustering Eugine Kang
Network Flow Pattern Extraction by Clustering Eugine KangNetwork Flow Pattern Extraction by Clustering Eugine Kang
Network Flow Pattern Extraction by Clustering Eugine Kang
 
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
 
Analyzing the performance of the dynamic
Analyzing the performance of the dynamicAnalyzing the performance of the dynamic
Analyzing the performance of the dynamic
 
PPT TRAINING
PPT TRAININGPPT TRAINING
PPT TRAINING
 
AI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open WorkshopAI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open Workshop
 
SLAM Constructor Framework for ROS
SLAM Constructor Framework for ROSSLAM Constructor Framework for ROS
SLAM Constructor Framework for ROS
 
PIC Tier-1 (LHCP Conference / Barcelona)
PIC Tier-1 (LHCP Conference / Barcelona)PIC Tier-1 (LHCP Conference / Barcelona)
PIC Tier-1 (LHCP Conference / Barcelona)
 
ruSMART 2013 presentation
ruSMART 2013 presentationruSMART 2013 presentation
ruSMART 2013 presentation
 
Active learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classificationActive learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classification
 
Contribution Of Real Time Network (NRTK) for improvement of accuracy
Contribution Of Real Time Network (NRTK) for improvement of accuracyContribution Of Real Time Network (NRTK) for improvement of accuracy
Contribution Of Real Time Network (NRTK) for improvement of accuracy
 
FUZZY CLUSTERING FOR IMPROVED POSITIONING
FUZZY CLUSTERING FOR IMPROVED POSITIONINGFUZZY CLUSTERING FOR IMPROVED POSITIONING
FUZZY CLUSTERING FOR IMPROVED POSITIONING
 
Data dissemination based on complex networks
Data dissemination based on complex networksData dissemination based on complex networks
Data dissemination based on complex networks
 

Similar to Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.

Data mining projects topics for java and dot net
Data mining projects topics for java and dot netData mining projects topics for java and dot net
Data mining projects topics for java and dot net
redpel dot com
 
ADAPTER
ADAPTERADAPTER
Big Linked Data Federation - ExtremeEarth Open Workshop
Big Linked Data Federation - ExtremeEarth Open WorkshopBig Linked Data Federation - ExtremeEarth Open Workshop
Big Linked Data Federation - ExtremeEarth Open Workshop
ExtremeEarth
 
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming dataA fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming data
Alexander Decker
 
5G-USA-Telemetry
5G-USA-Telemetry5G-USA-Telemetry
5G-USA-Telemetry
snrism
 
Achieving horizontal scalability in density-based clustering for urls
Achieving horizontal scalability in density-based clustering for urlsAchieving horizontal scalability in density-based clustering for urls
Achieving horizontal scalability in density-based clustering for urls
Andrea Morichetta
 
Delivering Application-Layer​ Traffic Optimization​ (ALTO) Services based on ...
Delivering Application-Layer​ Traffic Optimization​ (ALTO) Services based on ...Delivering Application-Layer​ Traffic Optimization​ (ALTO) Services based on ...
Delivering Application-Layer​ Traffic Optimization​ (ALTO) Services based on ...
Danny Alex Lachos Perez
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
Ian Foster
 
Grid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsGrid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applications
Tal Lavian Ph.D.
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
theijes
 
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and ApplicationsSemantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Artificial Intelligence Institute at UofSC
 
RAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme ScalesRAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme Scales
Ian Foster
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentation
TERN Australia
 
Cognitive Technique for Software Defined Optical Network (SDON)
Cognitive Technique for Software Defined Optical Network (SDON)Cognitive Technique for Software Defined Optical Network (SDON)
Cognitive Technique for Software Defined Optical Network (SDON)
CPqD
 
RESPONSE SURFACE METHODOLOGY FOR PERFORMANCE ANALYSIS AND MODELING OF MANET R...
RESPONSE SURFACE METHODOLOGY FOR PERFORMANCE ANALYSIS AND MODELING OF MANET R...RESPONSE SURFACE METHODOLOGY FOR PERFORMANCE ANALYSIS AND MODELING OF MANET R...
RESPONSE SURFACE METHODOLOGY FOR PERFORMANCE ANALYSIS AND MODELING OF MANET R...
IJCNCJournal
 
cug2011-praveen
cug2011-praveencug2011-praveen
cug2011-praveen
Praveen Narayanan
 
Graph Analytics: Graph Algorithms Inside Neo4j
Graph Analytics: Graph Algorithms Inside Neo4jGraph Analytics: Graph Algorithms Inside Neo4j
Graph Analytics: Graph Algorithms Inside Neo4j
Neo4j
 
03 interlinking-dass
03 interlinking-dass03 interlinking-dass
03 interlinking-dass
Diego Pessoa
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
Giorgos Santipantakis
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphs
Stanka Dalekova
 

Similar to Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017. (20)

Data mining projects topics for java and dot net
Data mining projects topics for java and dot netData mining projects topics for java and dot net
Data mining projects topics for java and dot net
 
ADAPTER
ADAPTERADAPTER
ADAPTER
 
Big Linked Data Federation - ExtremeEarth Open Workshop
Big Linked Data Federation - ExtremeEarth Open WorkshopBig Linked Data Federation - ExtremeEarth Open Workshop
Big Linked Data Federation - ExtremeEarth Open Workshop
 
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming dataA fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming data
 
5G-USA-Telemetry
5G-USA-Telemetry5G-USA-Telemetry
5G-USA-Telemetry
 
Achieving horizontal scalability in density-based clustering for urls
Achieving horizontal scalability in density-based clustering for urlsAchieving horizontal scalability in density-based clustering for urls
Achieving horizontal scalability in density-based clustering for urls
 
Delivering Application-Layer​ Traffic Optimization​ (ALTO) Services based on ...
Delivering Application-Layer​ Traffic Optimization​ (ALTO) Services based on ...Delivering Application-Layer​ Traffic Optimization​ (ALTO) Services based on ...
Delivering Application-Layer​ Traffic Optimization​ (ALTO) Services based on ...
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
 
Grid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsGrid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applications
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and ApplicationsSemantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
 
RAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme ScalesRAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme Scales
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentation
 
Cognitive Technique for Software Defined Optical Network (SDON)
Cognitive Technique for Software Defined Optical Network (SDON)Cognitive Technique for Software Defined Optical Network (SDON)
Cognitive Technique for Software Defined Optical Network (SDON)
 
RESPONSE SURFACE METHODOLOGY FOR PERFORMANCE ANALYSIS AND MODELING OF MANET R...
RESPONSE SURFACE METHODOLOGY FOR PERFORMANCE ANALYSIS AND MODELING OF MANET R...RESPONSE SURFACE METHODOLOGY FOR PERFORMANCE ANALYSIS AND MODELING OF MANET R...
RESPONSE SURFACE METHODOLOGY FOR PERFORMANCE ANALYSIS AND MODELING OF MANET R...
 
cug2011-praveen
cug2011-praveencug2011-praveen
cug2011-praveen
 
Graph Analytics: Graph Algorithms Inside Neo4j
Graph Analytics: Graph Algorithms Inside Neo4jGraph Analytics: Graph Algorithms Inside Neo4j
Graph Analytics: Graph Algorithms Inside Neo4j
 
03 interlinking-dass
03 interlinking-dass03 interlinking-dass
03 interlinking-dass
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphs
 

More from Holistic Benchmarking of Big Linked Data

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
Holistic Benchmarking of Big Linked Data
 
Benchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT ProjectBenchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT Project
Holistic Benchmarking of Big Linked Data
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Holistic Benchmarking of Big Linked Data
 
The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018
Holistic Benchmarking of Big Linked Data
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
Holistic Benchmarking of Big Linked Data
 
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation FrameworkSQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
Holistic Benchmarking of Big Linked Data
 
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federationLargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
Holistic Benchmarking of Big Linked Data
 
The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017
Holistic Benchmarking of Big Linked Data
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
Holistic Benchmarking of Big Linked Data
 
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Holistic Benchmarking of Big Linked Data
 
An Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link DiscoveryAn Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link Discovery
Holistic Benchmarking of Big Linked Data
 
Scalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven ApplicationsScalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven Applications
Holistic Benchmarking of Big Linked Data
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Holistic Benchmarking of Big Linked Data
 
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery ToolsSPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
Holistic Benchmarking of Big Linked Data
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Holistic Benchmarking of Big Linked Data
 
OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018
Holistic Benchmarking of Big Linked Data
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
Holistic Benchmarking of Big Linked Data
 
Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018
Holistic Benchmarking of Big Linked Data
 
Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017
Holistic Benchmarking of Big Linked Data
 
Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)
Holistic Benchmarking of Big Linked Data
 

More from Holistic Benchmarking of Big Linked Data (20)

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
 
Benchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT ProjectBenchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT Project
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
 
The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
 
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation FrameworkSQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
 
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federationLargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
 
The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
 
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)
 
An Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link DiscoveryAn Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link Discovery
 
Scalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven ApplicationsScalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven Applications
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery ToolsSPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
 
OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
 
Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018
 
Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017
 
Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)
 

Recently uploaded

Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
Sciences of Europe
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
Frédéric Baudron
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
Aditi Bajpai
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
frank0071
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
Areesha Ahmad
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
Carl Bergstrom
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
LengamoLAppostilic
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
PirithiRaju
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
Leonel Morgado
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
MaheshaNanjegowda
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
Advanced-Concepts-Team
 

Recently uploaded (20)

Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
 

Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.

  • 1. Benchmarking Link Discovery Systems for Geo-Spatial Data Tzanina Saveta Giorgos Flouris, Irini Fundulaki Institute of Computer Science-FORTH, Greece Axel-Cyrille Ngonga Ngomo Paderborn University,Germany Benchmarking Link Discovery Systems for Geo-Spatial Data 1
  • 2. Why Benchmarking Link Discovery Systems for Geospatial Data? • Large amount of available geospatial datasets and link discovery systems • Limited number of benchmarks for link discovery systems geospatial data • Propose: Spatial Benchmark Generator – differs from the classical, mostly string-based approaches – test the performance of link discovery systems that deal with the DE-9IM (Dimensionally Extended nine-Intersection Model) model that is used to represent topological relations – Choke-point based design Benchmarking Link Discovery Systems for Geo-Spatial Data 2 Essential to determine the effectiveness and the efficiency of the proposed techniques and tools
  • 3. Spatial Benchmark Ontology & Datasets • TomTom traffic data archive contains approximately 15 trillion (1:5 1013) speed measurements from hundreds of millions of road segments all over the world, mostly based on anonymized GPS positioning data Benchmarking Link Discovery Systems for Geo-Spatial Data 3 Trace Point has Point Velocity Motion Property Vectorwgs84_pos: Point xsd:TimeStamp hasTimeStamp has Speed km_per_hour, Float, miles_per_hour velocityMetric velocityValue xsd:Float
  • 4. Choke Points & Key Performance Indicators (KPIs) Benchmarking Link Discovery Systems for Geo-Spatial Data 4 Choke Points Key Performance IndicatorsCP1: Scalability • Large datasets • Large number of properties for each instance CP2: Output Quality Metric CP3: Performance Dataset size Precision, Recall, F-measure Link DiscoveryTime
  • 5. Spatial Benchmark (Overview) • Source Dataset – Consists of traces, a trace represented as a LineString in the Well-known text (WKT) format • Target Dataset – Consists of traces (LineString) represented in theWKT format Given aWKT Geometry s as source and a DE-9IM Relation r we to generate aWKT target Geometry t such as their Intersection Matrix follows the definition of Relation r. • Gold Standard – Stores pairs of matched (source, target) traces – Generated using RADON [SDS+16] to ensure completeness and correctness Benchmarking Link Discovery Systems for Geo-Spatial Data 5
  • 6. DE-9IMTopological Relations between LineStrings Benchmarking Link Discovery Systems for Geo-Spatial Data 6 EQUALS DISJOINT TOUCHES CONTAINS & COVERS INTERSECTS CROSSES OVERLAPS
  • 7. JTSTopology Suite • The JTSTopology Suite is a Java API that implements a core set of spatial data operations using an explicit precision model and robust geometric algorithms • Spatial benchmark implements an extension of the JTS Topology Suite that allows us to generate – a target geometry given a source geometry and a DE-9IM relation Benchmarking Link Discovery Systems for Geo-Spatial Data 7
  • 8. --------- SOURCE --------- t1 aTrace . t1 hasGeometry "LINESTRING (8.79232 48.92266, 8.79186 48.92246, 8.79135 48.92229, 8.79079 48.92219, 8.79012 48.92216, 8.78957 48.92207, 8.78903 48.92185, 8.78848 48.92162, 8.78807 48.92133, 8.78782 48.92097, 8.78771 48.92068, 8.78737 48.92031, 8.78695 48.92004)" ^^<http://strdf.di.uoa.gr/ontology#WKT> . ---------TARGET --------- t1' aTrace . t1' hasGeometry "LINESTRING (8.78903 48.92185, 8.78848 48.92162, 8.78807 48.92133, 8.78782 48.92097, 8.78771 48.92068, 8.78737 48.92031)" ^^<http://strdf.di.uoa.gr/ontology#WKT> . --------- GOLD STANDARD (e.g. for COVERS) --------- t1 t1'. Datasets example
  • 9. Architecture: Data Generation Benchmarking Link Discovery Systems for Geo-Spatial Data 9 Input Dataset (Traces) Initialization Module Resource Generation Resource Transformation (JTS Extension) Test Case Generation Parameters Source Data Target Data RADON Gold Standard
  • 10. Experiments for Spatial Benchmark (1) • Objective: Provide a comparative analysis to assess the ability of RADON [SDS+16] and Silk [SK16] to manage DE-9IM relations • Experiment set up: HOBBIT Platform time limit = 75 minutes • Issues encountered – Scalability • Due to a problem encountered for SILK, we had to select small traces (no larger than 64 Kbytes) – Output Quality Metric • Precision, recall and f-measure were equal to 1.0 – Time Performance • Show the rate between the time performance and trace size (10, 100, 1K and 10K) • Show the time needed by link discovery systems for the different topological relations Benchmarking Link Discovery Systems for Geo-Spatial Data 10
  • 11. Experiments for Spatial Benchmark (2) 27-Oct-17 Benchmarking Link Discovery Systems for Geo-Spatial Data 11 RELATION System #10 #100 #1K #10K EQUALS RADON 1139 1290 3891 23509 Silk 2424 3804 37585 704821 CROSSES RADON 1209 1996 5139 88799 Silk 3126 3927 55691 1035194 COVERED BY RADON 1124 1334 4793 28095 Silk Not Supported – same as Within for LineStrings COVERS RADON 1147 1235 4158 28673 Silk Not Supported – same as Contains for LineStrings DISJOINT RADON 567 820 7269 18031 Silk 2686 5843 206313 Platform time limit * OVERLAPS RADON 1209 3047 21829 1452969 Silk 3693 10790 221734 Platform time limit * WITHIN RADON 1116 1311 3899 30233 Silk 2859 3151 27933 550819 CONTAINS RADON 1163 1294 3155 30560 Silk 3170 3079 26094 538441 INTERSECTS RADON 1400 6396 94733 Platform time limit * Silk 3628 13175 381310 Platform time limit * TOUCHES RADON 1436 4958 118104 Platform time limit * Silk 3072 19786 601024 Platform time limit Time rate (in ms) while increasing population for each topological relation for RADON and Silk.
  • 12. Experiments for Spatial Benchmark (3) Benchmarking Link Discovery Systems for Geo-Spatial Data 12
  • 13. Experiments for Spatial Benchmark (4) Benchmarking Link Discovery Systems for Geo-Spatial Data 13
  • 14. Overview of Experimental Results (1) • DE-9IM relations – DISJOINT needs the shortest time for the systems for small number of traces but stops due to the platform time limit when increasing the dataset size – EQUALS, CROSSES, COVERS/COVERED BY and CONTAINS/WITHIN need approximately the same time – INTERSECTS, TOUCHES and OVERLAPS seem to be the hardest relations for the systems Benchmarking Link Discovery Systems for Geo-Spatial Data 14
  • 15. Overview of Experimental Results (2) • Link Discovery Systems – RADON handles the growth of the dataset size smoother than SILK – For small number of traces, RADON needs approximately half time than SILK – When the size increases, RADON outperforms SILK by more than one order of magnitude Benchmarking Link Discovery Systems for Geo-Spatial Data 15
  • 16. [SDS+16] Mohamed Ahmed Sherif, Kevin Dreßler, Panayiotis Smeros, and Axel- Cyrille Ngonga Ngomo. RADON - Rapid Discovery of Topological Relations. In Proceedings ofTheThirty-First AAAI Conference onArtificial Intelligence (AAAI-17), 2017. [SK16] P. Smeros and M. Koubarakis. Discovering Spatial andTemporal Links Among RDF Data. InWWW2016 Workshop: Linked Data on the Web (LDOW2016), Montreál, Canada, 2016. References
  • 17. 17 This work was supported by grands from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).