SlideShare a Scribd company logo
SAVETA SPgen ISWC 2018 – Monterey CA
SPgen: A Benchmark Generator for Spatial
Link Discovery Tools
T. Saveta1, I. Fundulaki1, G. Flouris1, and A.-C. Ngonga-Ngomo2
1 Institute of Computer Science - FORTH, Greece
2 University of Paderborn, Germany
ISWC 2018 – Monterey CA
ISWC 2018 – Monterey CA
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
LOD Cloud and Geospatial Data
UK Postcodes
LinkedGeoData
GeoWordNet
Ocean Drilling - Forams
GeoNames Semantic Web
Pleiades
Ocean Drilling –
Janus Age Models
Weather Stations
OpenMobileNetwork
GeoLinkedData
EARTh
Linked Crowdsourced Data
greek-administrative-
geography
The Getty
Thesaurus of
Geographic
Names
Linked Logainm
FAO geopolitical ontology
NUTS (GeoVocab)
NUTS (GeoVocab)
AgreementMaker
Strabon
OntoIdea
Silk
RADON
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Essential to determine the effectiveness and the efficiency of the
proposed techniques and tools
Compare them
Assess their performance
«Push» the systems to get better
Benchmarking Link Discovery Systems for Geospatial Data
Develop benchmarks for geo-spatial link discovery
systems
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Organized into test cases each addressing different kind of
requirements
Datasets
The raw material of the benchmarks.These are the source and
the target dataset that will be matched together to find the links
Gold Standard
The “correct answer sheet” used to judge the completeness and
soundness of the link discovery algorithms
Metrics
The performance metric(s) that determine the systems behavior
and performance
Link Discovery Benchmark Ingredients
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Only a limited number of benchmarks target the problem of
linking geo-spatial entities
Propose: Spatial Benchmark Generator (SPgen)
Differs from the classical, mostly string-based
approaches
Generic, schema agnostic and choke-point based
Test the performance of link discovery systems that
deal with the DE-9IM (Dimensionally Extended nine-
Intersection Model) - standard to describe topological
relations between two geometries.
Is integrated into HOBBIT platform
Spatial Benchmark Generator (SPgen)
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Source Dataset
Traces (lists of (longitude, latitude)) represented as LineStrings in
the Well-known text (WKT) format
Target Dataset
Traces represented as LineStrings or Polygons in the WKT format
Gold Standard
Stores pairs of matched source and target traces
Generated using RADON to ensure completeness and
correctness
SPgen Overview
Given aWKT Geometry s as source and a DE-9IM Relation r, we generate a
WKT Geometry t as target such as their Intersection Matrix follows the
definition of Relation r.
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Choke Points
CP1: Scalability
• Large datasets
• Large number of points for each trace
CP2: Output Quality Metric
• Precision
• Recall
• F-measure
CP3:Time Performance
SPgen Choke Points & Key Performance Indicators
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
SPgen: Source Dataset Generation
Input Dataset
(Traces)
Source Data
Generation
Target Data
Generation
(JTS Extension)
Test Case
Generation
Parameters
Source
Data
Target
Data
Gold
Standard
Generation
Gold
Standard
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
TomTom Data Generator
GPSTrace Generator
• Trace: a list of (longitude, latitude) pairs recorded by one
device (phone, car, etc.)
Spaten (Spatio-temporal and textual Big Data Generator)
Did not use their data generator but the 163.845 already
generated traces that came from 9.460 users and 38.800.019
GPSTraces
The generator can produce 14 users per day using the
free version of Google Maps API –2500 requests per day
Data Generators
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
SPgen: Target Dataset Generation
Input Dataset
(Traces)
Source Data
Generation
Target Data
Generation
(JTS Extension)
Test Case
Generation
Parameters
Source
Data
Target
Data
Gold
Standard
Generation
Gold
Standard
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
DE-9IM Topological Relations between LineStrings
Equals Disjoint Touches
Contains (Within)
& Covers (Covered By)
Intersects Crosses
Overlaps
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
DE-9IM Topological Relations between a LineString and a Polygon
Disjoint
TouchesIntersects
• Equals and Overlaps cannot hold between a LineString and a Polygon
Crosses Contains (Within)
& Covers (Covered By)
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Disjoint example (LineString/LineString)
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
SPgen: Gold Standard Generation
Input Dataset
(Traces)
Source Data
Generation
Target Data
Generation
(JTS Extension)
Test Case
Generation
Parameters
Source
Data
Target
Data
Gold
Standard
Generation
Gold
Standard
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Systems to test
Spatial Link Discovery systems: RADON, Silk, AML, OntoIdea
Tasks: For each data generator (TomTom/Spaten) we divided the
tasks into 4 subtasks, systems were asked to match the geometries
considering a given relation
SLL & LLL: LineStrings/LineStrings, Small dataset of 200
instances and Large dataset of 2K instances
SLP & LLP: LineStrings/Polygons, Small dataset of 200
instances Large dataset of 2K instances
Experiments for SPgen
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Aims
Scalability
• Due to a limitation of Silk, we had to select traces no larger than 64
KB
Output Quality Metric
• Precision, Recall and F-measure (all were equal to 1.0)
Time Performance to show:
• The rate between the time performance and the size of the datasets
• The rate between the time performance and the different data
generators
• The time difference between the topological relations
• The time difference between the geometries
Experimental set up: Platform time limit = 75 minutes
Experiments for SPgen
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Experiments for SPgen (LineStrings/LineStrings) [SLL]
Time rate (in ms) while increasing population for each topological relation for RADON, Silk, AML and OntoIdea.
*Platform time limit = 75 mins
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Experiments for SPgen (LineStrings/LineStrings) [LLL]
Time rate (in ms) while increasing population for each topological relation for RADON, Silk, AML and OntoIdea.
*Platform time limit = 75 mins
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Experiments for SPgen (LineStrings/Polygons) [SLP]
Time rate (in ms) while increasing population for each topological relation for RADON, Silk, AML.
*Platform time limit = 75 mins
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Experiments for SPgen (LineStrings/Polygons) [LLP]
Time rate (in ms) while increasing population for each topological relation for RADON, Silk, AML.
*Platform time limit = 75 mins
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
RADON
Addressed all the tasks
Best performance for the LineString/PolygonTasks
Improvements in Touches and Intersects relations for the LineString/LineStringTasks
AML
Performs extremely well in most cases
Improvements in LineString/PolygonTasks and in Disjoint relations
Silk
Improvements in Touches, Intersects and Overlaps relations and for the
LineString/LineString and in Disjoint relation for the LineString/PolygonTasks
OntoIdea
Handles small datasets efficiently, this does not hold when it comes to larger datasets
Did not support LineString/PolygonTasks in this version
Experiments General Remarks
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
All the systems needed
More time forTomTom than Spaten, due to the smaller number
of points per trace in the latter
More time the LineString/LineString Tasks for Touches, Intersects
and Crosses relations
Approximately the same time for both LineString/LineString and
LineString/PolygonTasks for Disjoint relation
Less time for the LineString/LineString Tasks for Contains, Within,
Covers and Covered By relations
Depending on the test case we can choose the appropriate
system
Experiments General Remarks
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Conclusions
SPgen: checks whether spatial link discovery systems can
identify DE-9IM topological relations between LineStrings and
Polygons
Experiments with RADON and Silk, AML and OntoIdea
Choke points: Scalability, Quality measure output andTime
performance
Next
Implement DE-9IM relations for all possible combinations of
different geometries (e.g. Polygon/Polygon, combination with
Points, LineStrings and Polygons, etc.)
More data generators – already added DEBS dataset
More system – already added Strabon
Concluding Remarks & Future Work
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
Thank you!
Questions?
SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA
This work was supported by grands from the EU H2020 Framework Programme
provided for the project HOBBIT (GA no. 688227).

More Related Content

Similar to SPgen: A Benchmark Generator for Spatial Link Discovery Tools

RDO_01_2016_Journal_P_Web
RDO_01_2016_Journal_P_WebRDO_01_2016_Journal_P_Web
RDO_01_2016_Journal_P_Web
Sahl Martin
 
Building a scalable data science platform with R
Building a scalable data science platform with RBuilding a scalable data science platform with R
Building a scalable data science platform with R
Revolution Analytics
 
Smart E-Logistics for SCM Spend Analysis
Smart E-Logistics for SCM Spend AnalysisSmart E-Logistics for SCM Spend Analysis
Smart E-Logistics for SCM Spend Analysis
IRJET Journal
 
Free Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence TaggingFree Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence Tagging
IRJET Journal
 
Your Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic DatabaseYour Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic Database
javier ramirez
 
IRJET- Univariate Time Series Prediction of Reservoir Inflow using Artifi...
IRJET-  	  Univariate Time Series Prediction of Reservoir Inflow using Artifi...IRJET-  	  Univariate Time Series Prediction of Reservoir Inflow using Artifi...
IRJET- Univariate Time Series Prediction of Reservoir Inflow using Artifi...
IRJET Journal
 
Contextualizing the Visualization of Climate Data
Contextualizing the Visualization of Climate DataContextualizing the Visualization of Climate Data
Contextualizing the Visualization of Climate Data
Raquel Alegre
 
2003-11-02 Combined Aerosol Trajectory Tool, CATT
2003-11-02 Combined Aerosol Trajectory Tool, CATT2003-11-02 Combined Aerosol Trajectory Tool, CATT
2003-11-02 Combined Aerosol Trajectory Tool, CATT
Rudolf Husar
 
HW3_Introduction_Mik
HW3_Introduction_MikHW3_Introduction_Mik
HW3_Introduction_Mik
Michele (Mik) Isernia
 
TGS GPS Overview
TGS GPS OverviewTGS GPS Overview
TGS GPS Overview
TGS
 
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017. Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Holistic Benchmarking of Big Linked Data
 
2005-01-18 Combined Aerosol Trajectory Tool, CATT Illustrated Instruction Manual
2005-01-18 Combined Aerosol Trajectory Tool, CATT Illustrated Instruction Manual2005-01-18 Combined Aerosol Trajectory Tool, CATT Illustrated Instruction Manual
2005-01-18 Combined Aerosol Trajectory Tool, CATT Illustrated Instruction Manual
Rudolf Husar
 
Catt Manual
Catt ManualCatt Manual
Catt Manual
Rudolf Husar
 
time series modeling-Decision Science Central Oct 26
time series modeling-Decision Science Central Oct 26time series modeling-Decision Science Central Oct 26
time series modeling-Decision Science Central Oct 26
Vinay Mehendiratta, PhD
 
Ifgi presentation
Ifgi presentationIfgi presentation
Ifgi presentation
noho
 
IRJET- Novel based Stock Value Prediction Method
IRJET- Novel based Stock Value Prediction MethodIRJET- Novel based Stock Value Prediction Method
IRJET- Novel based Stock Value Prediction Method
IRJET Journal
 
Challenge@RuleML2015 Modeling Object-Relational Geolocation Knowledge in PSOA...
Challenge@RuleML2015 Modeling Object-Relational Geolocation Knowledge in PSOA...Challenge@RuleML2015 Modeling Object-Relational Geolocation Knowledge in PSOA...
Challenge@RuleML2015 Modeling Object-Relational Geolocation Knowledge in PSOA...
RuleML
 
Kitzmiller Openhelisphereproject Bosc2008
Kitzmiller Openhelisphereproject Bosc2008Kitzmiller Openhelisphereproject Bosc2008
Kitzmiller Openhelisphereproject Bosc2008
bosc_2008
 
ESWC 2009 In-Use Track: SCOVO
ESWC 2009 In-Use Track: SCOVOESWC 2009 In-Use Track: SCOVO
ESWC 2009 In-Use Track: SCOVO
Michael Hausenblas
 
Study of Internet Traffic to Analyze and Predict Traffic
Study of Internet Traffic to Analyze and Predict TrafficStudy of Internet Traffic to Analyze and Predict Traffic
Study of Internet Traffic to Analyze and Predict Traffic
Amit Arora
 

Similar to SPgen: A Benchmark Generator for Spatial Link Discovery Tools (20)

RDO_01_2016_Journal_P_Web
RDO_01_2016_Journal_P_WebRDO_01_2016_Journal_P_Web
RDO_01_2016_Journal_P_Web
 
Building a scalable data science platform with R
Building a scalable data science platform with RBuilding a scalable data science platform with R
Building a scalable data science platform with R
 
Smart E-Logistics for SCM Spend Analysis
Smart E-Logistics for SCM Spend AnalysisSmart E-Logistics for SCM Spend Analysis
Smart E-Logistics for SCM Spend Analysis
 
Free Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence TaggingFree Writing - Grammatical Error Correction System: Sequence Tagging
Free Writing - Grammatical Error Correction System: Sequence Tagging
 
Your Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic DatabaseYour Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic Database
 
IRJET- Univariate Time Series Prediction of Reservoir Inflow using Artifi...
IRJET-  	  Univariate Time Series Prediction of Reservoir Inflow using Artifi...IRJET-  	  Univariate Time Series Prediction of Reservoir Inflow using Artifi...
IRJET- Univariate Time Series Prediction of Reservoir Inflow using Artifi...
 
Contextualizing the Visualization of Climate Data
Contextualizing the Visualization of Climate DataContextualizing the Visualization of Climate Data
Contextualizing the Visualization of Climate Data
 
2003-11-02 Combined Aerosol Trajectory Tool, CATT
2003-11-02 Combined Aerosol Trajectory Tool, CATT2003-11-02 Combined Aerosol Trajectory Tool, CATT
2003-11-02 Combined Aerosol Trajectory Tool, CATT
 
HW3_Introduction_Mik
HW3_Introduction_MikHW3_Introduction_Mik
HW3_Introduction_Mik
 
TGS GPS Overview
TGS GPS OverviewTGS GPS Overview
TGS GPS Overview
 
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017. Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
 
2005-01-18 Combined Aerosol Trajectory Tool, CATT Illustrated Instruction Manual
2005-01-18 Combined Aerosol Trajectory Tool, CATT Illustrated Instruction Manual2005-01-18 Combined Aerosol Trajectory Tool, CATT Illustrated Instruction Manual
2005-01-18 Combined Aerosol Trajectory Tool, CATT Illustrated Instruction Manual
 
Catt Manual
Catt ManualCatt Manual
Catt Manual
 
time series modeling-Decision Science Central Oct 26
time series modeling-Decision Science Central Oct 26time series modeling-Decision Science Central Oct 26
time series modeling-Decision Science Central Oct 26
 
Ifgi presentation
Ifgi presentationIfgi presentation
Ifgi presentation
 
IRJET- Novel based Stock Value Prediction Method
IRJET- Novel based Stock Value Prediction MethodIRJET- Novel based Stock Value Prediction Method
IRJET- Novel based Stock Value Prediction Method
 
Challenge@RuleML2015 Modeling Object-Relational Geolocation Knowledge in PSOA...
Challenge@RuleML2015 Modeling Object-Relational Geolocation Knowledge in PSOA...Challenge@RuleML2015 Modeling Object-Relational Geolocation Knowledge in PSOA...
Challenge@RuleML2015 Modeling Object-Relational Geolocation Knowledge in PSOA...
 
Kitzmiller Openhelisphereproject Bosc2008
Kitzmiller Openhelisphereproject Bosc2008Kitzmiller Openhelisphereproject Bosc2008
Kitzmiller Openhelisphereproject Bosc2008
 
ESWC 2009 In-Use Track: SCOVO
ESWC 2009 In-Use Track: SCOVOESWC 2009 In-Use Track: SCOVO
ESWC 2009 In-Use Track: SCOVO
 
Study of Internet Traffic to Analyze and Predict Traffic
Study of Internet Traffic to Analyze and Predict TrafficStudy of Internet Traffic to Analyze and Predict Traffic
Study of Internet Traffic to Analyze and Predict Traffic
 

More from Holistic Benchmarking of Big Linked Data

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
Holistic Benchmarking of Big Linked Data
 
Benchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT ProjectBenchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT Project
Holistic Benchmarking of Big Linked Data
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Holistic Benchmarking of Big Linked Data
 
The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018
Holistic Benchmarking of Big Linked Data
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
Holistic Benchmarking of Big Linked Data
 
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation FrameworkSQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
Holistic Benchmarking of Big Linked Data
 
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federationLargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
Holistic Benchmarking of Big Linked Data
 
The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017
Holistic Benchmarking of Big Linked Data
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
Holistic Benchmarking of Big Linked Data
 
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Holistic Benchmarking of Big Linked Data
 
An Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link DiscoveryAn Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link Discovery
Holistic Benchmarking of Big Linked Data
 
Scalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven ApplicationsScalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven Applications
Holistic Benchmarking of Big Linked Data
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Holistic Benchmarking of Big Linked Data
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Holistic Benchmarking of Big Linked Data
 
OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018
Holistic Benchmarking of Big Linked Data
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
Holistic Benchmarking of Big Linked Data
 
Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018
Holistic Benchmarking of Big Linked Data
 
Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017
Holistic Benchmarking of Big Linked Data
 
Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)
Holistic Benchmarking of Big Linked Data
 
Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017
Holistic Benchmarking of Big Linked Data
 

More from Holistic Benchmarking of Big Linked Data (20)

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
 
Benchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT ProjectBenchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT Project
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
 
The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
 
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation FrameworkSQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
 
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federationLargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
 
The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
 
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)
 
An Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link DiscoveryAn Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link Discovery
 
Scalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven ApplicationsScalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven Applications
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
 
OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
 
Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018
 
Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017
 
Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)
 
Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017
 

Recently uploaded

GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
FODUU
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 

Recently uploaded (20)

GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 

SPgen: A Benchmark Generator for Spatial Link Discovery Tools

  • 1. SAVETA SPgen ISWC 2018 – Monterey CA SPgen: A Benchmark Generator for Spatial Link Discovery Tools T. Saveta1, I. Fundulaki1, G. Flouris1, and A.-C. Ngonga-Ngomo2 1 Institute of Computer Science - FORTH, Greece 2 University of Paderborn, Germany ISWC 2018 – Monterey CA ISWC 2018 – Monterey CA
  • 2. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA LOD Cloud and Geospatial Data UK Postcodes LinkedGeoData GeoWordNet Ocean Drilling - Forams GeoNames Semantic Web Pleiades Ocean Drilling – Janus Age Models Weather Stations OpenMobileNetwork GeoLinkedData EARTh Linked Crowdsourced Data greek-administrative- geography The Getty Thesaurus of Geographic Names Linked Logainm FAO geopolitical ontology NUTS (GeoVocab) NUTS (GeoVocab) AgreementMaker Strabon OntoIdea Silk RADON
  • 3. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Essential to determine the effectiveness and the efficiency of the proposed techniques and tools Compare them Assess their performance «Push» the systems to get better Benchmarking Link Discovery Systems for Geospatial Data Develop benchmarks for geo-spatial link discovery systems
  • 4. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Organized into test cases each addressing different kind of requirements Datasets The raw material of the benchmarks.These are the source and the target dataset that will be matched together to find the links Gold Standard The “correct answer sheet” used to judge the completeness and soundness of the link discovery algorithms Metrics The performance metric(s) that determine the systems behavior and performance Link Discovery Benchmark Ingredients
  • 5. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Only a limited number of benchmarks target the problem of linking geo-spatial entities Propose: Spatial Benchmark Generator (SPgen) Differs from the classical, mostly string-based approaches Generic, schema agnostic and choke-point based Test the performance of link discovery systems that deal with the DE-9IM (Dimensionally Extended nine- Intersection Model) - standard to describe topological relations between two geometries. Is integrated into HOBBIT platform Spatial Benchmark Generator (SPgen)
  • 6. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Source Dataset Traces (lists of (longitude, latitude)) represented as LineStrings in the Well-known text (WKT) format Target Dataset Traces represented as LineStrings or Polygons in the WKT format Gold Standard Stores pairs of matched source and target traces Generated using RADON to ensure completeness and correctness SPgen Overview Given aWKT Geometry s as source and a DE-9IM Relation r, we generate a WKT Geometry t as target such as their Intersection Matrix follows the definition of Relation r.
  • 7. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Choke Points CP1: Scalability • Large datasets • Large number of points for each trace CP2: Output Quality Metric • Precision • Recall • F-measure CP3:Time Performance SPgen Choke Points & Key Performance Indicators
  • 8. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA SPgen: Source Dataset Generation Input Dataset (Traces) Source Data Generation Target Data Generation (JTS Extension) Test Case Generation Parameters Source Data Target Data Gold Standard Generation Gold Standard
  • 9. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA TomTom Data Generator GPSTrace Generator • Trace: a list of (longitude, latitude) pairs recorded by one device (phone, car, etc.) Spaten (Spatio-temporal and textual Big Data Generator) Did not use their data generator but the 163.845 already generated traces that came from 9.460 users and 38.800.019 GPSTraces The generator can produce 14 users per day using the free version of Google Maps API –2500 requests per day Data Generators
  • 10. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA SPgen: Target Dataset Generation Input Dataset (Traces) Source Data Generation Target Data Generation (JTS Extension) Test Case Generation Parameters Source Data Target Data Gold Standard Generation Gold Standard
  • 11. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA DE-9IM Topological Relations between LineStrings Equals Disjoint Touches Contains (Within) & Covers (Covered By) Intersects Crosses Overlaps
  • 12. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA DE-9IM Topological Relations between a LineString and a Polygon Disjoint TouchesIntersects • Equals and Overlaps cannot hold between a LineString and a Polygon Crosses Contains (Within) & Covers (Covered By)
  • 13. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Disjoint example (LineString/LineString)
  • 14. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA SPgen: Gold Standard Generation Input Dataset (Traces) Source Data Generation Target Data Generation (JTS Extension) Test Case Generation Parameters Source Data Target Data Gold Standard Generation Gold Standard
  • 15. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Systems to test Spatial Link Discovery systems: RADON, Silk, AML, OntoIdea Tasks: For each data generator (TomTom/Spaten) we divided the tasks into 4 subtasks, systems were asked to match the geometries considering a given relation SLL & LLL: LineStrings/LineStrings, Small dataset of 200 instances and Large dataset of 2K instances SLP & LLP: LineStrings/Polygons, Small dataset of 200 instances Large dataset of 2K instances Experiments for SPgen
  • 16. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Aims Scalability • Due to a limitation of Silk, we had to select traces no larger than 64 KB Output Quality Metric • Precision, Recall and F-measure (all were equal to 1.0) Time Performance to show: • The rate between the time performance and the size of the datasets • The rate between the time performance and the different data generators • The time difference between the topological relations • The time difference between the geometries Experimental set up: Platform time limit = 75 minutes Experiments for SPgen
  • 17. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Experiments for SPgen (LineStrings/LineStrings) [SLL] Time rate (in ms) while increasing population for each topological relation for RADON, Silk, AML and OntoIdea. *Platform time limit = 75 mins
  • 18. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Experiments for SPgen (LineStrings/LineStrings) [LLL] Time rate (in ms) while increasing population for each topological relation for RADON, Silk, AML and OntoIdea. *Platform time limit = 75 mins
  • 19. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Experiments for SPgen (LineStrings/Polygons) [SLP] Time rate (in ms) while increasing population for each topological relation for RADON, Silk, AML. *Platform time limit = 75 mins
  • 20. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Experiments for SPgen (LineStrings/Polygons) [LLP] Time rate (in ms) while increasing population for each topological relation for RADON, Silk, AML. *Platform time limit = 75 mins
  • 21. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA RADON Addressed all the tasks Best performance for the LineString/PolygonTasks Improvements in Touches and Intersects relations for the LineString/LineStringTasks AML Performs extremely well in most cases Improvements in LineString/PolygonTasks and in Disjoint relations Silk Improvements in Touches, Intersects and Overlaps relations and for the LineString/LineString and in Disjoint relation for the LineString/PolygonTasks OntoIdea Handles small datasets efficiently, this does not hold when it comes to larger datasets Did not support LineString/PolygonTasks in this version Experiments General Remarks
  • 22. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA All the systems needed More time forTomTom than Spaten, due to the smaller number of points per trace in the latter More time the LineString/LineString Tasks for Touches, Intersects and Crosses relations Approximately the same time for both LineString/LineString and LineString/PolygonTasks for Disjoint relation Less time for the LineString/LineString Tasks for Contains, Within, Covers and Covered By relations Depending on the test case we can choose the appropriate system Experiments General Remarks
  • 23. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Conclusions SPgen: checks whether spatial link discovery systems can identify DE-9IM topological relations between LineStrings and Polygons Experiments with RADON and Silk, AML and OntoIdea Choke points: Scalability, Quality measure output andTime performance Next Implement DE-9IM relations for all possible combinations of different geometries (e.g. Polygon/Polygon, combination with Points, LineStrings and Polygons, etc.) More data generators – already added DEBS dataset More system – already added Strabon Concluding Remarks & Future Work
  • 24. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA Thank you! Questions?
  • 25. SAVETA SPgen ISWC 2018 – Monterey CASAVETA SPgen ISWC 2018 – Monterey CA This work was supported by grands from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).