SlideShare a Scribd company logo
1 of 24
Muhammad Saleem, Yasar Khan, Ali Hasnain, Ivan
Ermilov, Axel-Cyrille Ngonga Ngomo
ISWC 2016, Kobe, Japan, 20/10/2016
10/20/2016
1
Agile Knowledge Engineering and Semantic Web (AKSW), University of
Leipzig,
Germany
 SPARQL query federation
 Survey of federated SPARQL query processing systems
 Performance variables
 Evaluation of SPARQL endpoint federation systems
10/20/2016 2
 SPARQL Endpoint Federation (SEF)
 Linked Data Federation (LDF)
 Hybrid of SEF+LDF
10/20/2016 3
10/20/2016 4
SPARQL Endpoint Federation
S1 S2 S3 S4
RDF RDF RDF RDF
Parsing/Rewriting
Source Selection
Federator Optimzer
Integrator
Rewrite query
and get Individual
Triple Patterns
Identify capable
source against
Individual Triple
Patterns
Generate
optimized sub-
query Exe. Plan
Integrate sub-
queries results
Execute sub-
queries
 Index-free
 Using SPARQL ASK queries
 Potentially ensures result set completeness
 SPARQL ASK queries can be expensive
 SPARQL ASK cache is beneficial
 E.g., FedX
 Index-only
 Only make use of Index/data summaries
 Less efficient but fast source selection
 Result set completeness is not ensured
 E.g., DARQ, LHD
10/20/2016 5
 Hybrid
 Make use of index+SPARQL ASK
 Most efficient
 Result set completeness is not ensured
 SPARQL ASK cache is beneficial
 E.g., HiBISCuS, ANAPSID, SPLENDID
10/20/2016 6
 System Information
 Title and/or URL of federation engine
 Code available?
 Implementation and license
 Type of source selection
 Type of join(s) used for data integration
 Use of cache?
 Support for catalog/index update
10/20/2016 7
10/20/2016 8
10/20/2016 9
Yes
59%
No
12%
Not yet
29%
Code available?
Index-free
6%
Index-
only
29%
Hybrid
65%
Type of source selection
Yes
47%No
53%
Use of cache?
Yes
29%
No
59%
NA
12%
Index update support?
 Requirements
 Result completeness
 Policy-based query planning
 Support for partial results retrieval
 Support for no-blocking operator/ adaptive query processing
 Support for provenance information
 Query runtime estimation
 Duplicate Detection
 Top-K query processing
 Supported SPARQL types/ clauses
10/20/2016 10
10/20/2016 11
10/20/2016 12
Yes
18%
No
82%
Result completeness?
Yes
6%
No
88%
Partial
6%
Policy-based planning?
Yes
24%
No
76%
Partial results retrieval?
Yes
59%
No
41%
Adaptive processing?
10/20/2016 13
Yes
0%
No
94%
Partial
6%
Provenance info?
Yes
0%
No
100%
Runtime estimation?
Yes
12%
No
76%
Partial
12%
Duplicate detection?
No
100%
Top-k querying?
10/20/2016 14
10/20/2016 15
 Benchmarks
 FedBench
 Sliced FedBench
 SP2Bench
 SPARQL endpoint federation engines
 FedX
 SPLENDID
 LHD
 DAW
 ADERIS
 ANAPSID
10/20/2016 16
10/20/2016 17
FedBench
Query Category FedX SPLENDID LHD DARQ ANAPSID ADERIS
CD 252 44 0 0 43 0
LS 297 33 0 0 63 0
LD 369 19 0 0 37 0
Net 918 96 0 0 143 0
Query Category Sliced FedBench
CD 280 110 0 0 228 0
LS 330 100 0 0 143 0
LD 410 140 0 0 270 0
SP2Bench 660 180 0 0 265 0
Net 1680 530 0 0 906 0
10/20/2016 18
FedBench
Query Category FedX SPLENDID LHD DARQ ANAPSID ADERIS
CD 78 78 112 84 36 84
LS 56 56 105 77 44 70
LD 108 108 119 112 54 119
Net 242 242 336 283 134 273
Query Category Sliced FedBench
CD 182 182 225 195 163 195
LS 125 125 163 133 102 117
LD 243 243 324 324 183 324
SP2Bench 521 521 593 420 244 173
Net 1071 1071 1305 1072 692 809
10/20/2016 19
FedBench
Query Category FedX(cold) FedX(warm) SPLENDID LHD DARQ ANAPSID ADERIS
CD 151 7 256 6 7 186 6
LS 147 8 236 6 12 477 5
LD 139 8 221 6 7 804 5
Average 146 8 237 6 9 489 5
Query Category Sliced FedBench
CD 284 9 470 7 8 3183 7
LS 207 7 308 7 8 2897 7
LD 323 9 557 8 7 2908 8
SP2Bench 212 9 738 8 9RE 6
Average 256 8 518 7 8 2996 7
10/20/2016 20
21
FedBench: FedX(warm)  FedX(cold)  LHD  SPLENDID  ANAPSID  DARQ
25/25 17/22 13/22 15/24 16/22
Sliced FedBench: FedX(warm)  FedX(cold)  LHD  SPLENDID  ANAPSID  DARQ
29/36 17/24 17/24 17/26 12/20
22
0
50
100
150
200
250
300
350
400
450
500
CD LS LD Average
Averageexecutiontime(msec)
FedX (first run) FedBench
FedX (first run) SlicedBench
0
50
100
150
200
250
300
CD LS LD Average
Averageexecutiontime(msec)
FedX (cached) FedBench
FedX (cached) SlicedBench
0
200
400
600
800
1000
1200
1400
1600
CD LS LD Average
Averageexecutiontime(msec)
SPLENDID FedBench
SPLENDID SlicedBench
0
10000
20000
30000
40000
50000
60000
70000
80000
CD LS LD Average
Averageexecutiontime(msec)
ANAPSID FedBench
ANAPSID SlicedBench
0
50000
100000
150000
200000
250000
300000
350000
CD LS LD Average
Averageexecutiontime(msec)
DARQ FedBench
DARQ SlicedBench
0
2000
4000
6000
8000
10000
12000
14000
CD LS LD Average
Averageexecutiontime(msec)
LHD FedBench
LHD SlicedBench
23
0
50
100
150
200
250
300
350
400
450
500
CD LS LD Average
Averageexecutiontime(msec)
FedX (first run) FedBench
FedX (first run) SlicedBench
Decreased
by 214%
0
50
100
150
200
250
300
CD LS LD Average
Averageexecutiontime(msec)
FedX (cached) FedBench
FedX (cached) SlicedBench
Decreased
by 199%
0
200
400
600
800
1000
1200
1400
1600
CD LS LD Average
Averageexecutiontime(msec)
SPLENDID FedBench
SPLENDID SlicedBench
Decreased
by 227%
0
10000
20000
30000
40000
50000
60000
70000
80000
CD LS LD Average
Averageexecutiontime(msec)
ANAPSID FedBench
ANAPSID SlicedBench
Decreased
by 392%
0
50000
100000
150000
200000
250000
300000
350000
CD LS LD Average
Averageexecutiontime(msec)
DARQ FedBench
DARQ SlicedBench
Increased
by 36%
0
2000
4000
6000
8000
10000
12000
14000
CD LS LD Average
Averageexecutiontime(msec)
LHD FedBench
LHD SlicedBench
Decreased
by 293%
10/20/2016 24
{lastname}@informatik.uni-leipzig.de
AKSW, University of Leipzig, Germany

More Related Content

What's hot

Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Olaf Hartig
 
Querying Linked Data with SPARQL
Querying Linked Data with SPARQLQuerying Linked Data with SPARQL
Querying Linked Data with SPARQLOlaf Hartig
 
SPARQL 1.1 Update (2013-03-05)
SPARQL 1.1 Update (2013-03-05)SPARQL 1.1 Update (2013-03-05)
SPARQL 1.1 Update (2013-03-05)andyseaborne
 
SPARQL-DL - Theory & Practice
SPARQL-DL - Theory & PracticeSPARQL-DL - Theory & Practice
SPARQL-DL - Theory & PracticeAdriel Café
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Olaf Hartig
 
Linking the world with Python and Semantics
Linking the world with Python and SemanticsLinking the world with Python and Semantics
Linking the world with Python and SemanticsTatiana Al-Chueyr
 
NoSQL and Triple Stores
NoSQL and Triple StoresNoSQL and Triple Stores
NoSQL and Triple Storesandyseaborne
 
An Introduction to SPARQL
An Introduction to SPARQLAn Introduction to SPARQL
An Introduction to SPARQLOlaf Hartig
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02eswcsummerschool
 
Semantic web meetup – sparql tutorial
Semantic web meetup – sparql tutorialSemantic web meetup – sparql tutorial
Semantic web meetup – sparql tutorialAdonisDamian
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Olaf Hartig
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And VisualizationIvan Ermilov
 
Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)EUCLID project
 
Mapping Relational Databases to Linked Data
Mapping Relational Databases to Linked DataMapping Relational Databases to Linked Data
Mapping Relational Databases to Linked DataEUCLID project
 
Ks2008 Semanticweb In Action
Ks2008 Semanticweb In ActionKs2008 Semanticweb In Action
Ks2008 Semanticweb In ActionRinke Hoekstra
 
Linked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceLinked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceBarry Norton
 

What's hot (20)

Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
 
Querying Linked Data with SPARQL
Querying Linked Data with SPARQLQuerying Linked Data with SPARQL
Querying Linked Data with SPARQL
 
SPARQL 1.1 Update (2013-03-05)
SPARQL 1.1 Update (2013-03-05)SPARQL 1.1 Update (2013-03-05)
SPARQL 1.1 Update (2013-03-05)
 
SPARQL-DL - Theory & Practice
SPARQL-DL - Theory & PracticeSPARQL-DL - Theory & Practice
SPARQL-DL - Theory & Practice
 
Sparql
SparqlSparql
Sparql
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
 
Linking the world with Python and Semantics
Linking the world with Python and SemanticsLinking the world with Python and Semantics
Linking the world with Python and Semantics
 
NoSQL and Triple Stores
NoSQL and Triple StoresNoSQL and Triple Stores
NoSQL and Triple Stores
 
An Introduction to SPARQL
An Introduction to SPARQLAn Introduction to SPARQL
An Introduction to SPARQL
 
SPARQL Cheat Sheet
SPARQL Cheat SheetSPARQL Cheat Sheet
SPARQL Cheat Sheet
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
 
Semantic web meetup – sparql tutorial
Semantic web meetup – sparql tutorialSemantic web meetup – sparql tutorial
Semantic web meetup – sparql tutorial
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And Visualization
 
Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)
 
Mapping Relational Databases to Linked Data
Mapping Relational Databases to Linked DataMapping Relational Databases to Linked Data
Mapping Relational Databases to Linked Data
 
Ks2008 Semanticweb In Action
Ks2008 Semanticweb In ActionKs2008 Semanticweb In Action
Ks2008 Semanticweb In Action
 
Triple Stores
Triple StoresTriple Stores
Triple Stores
 
Linked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceLinked Data, Ontologies and Inference
Linked Data, Ontologies and Inference
 
HadoopFileFormats_2016
HadoopFileFormats_2016HadoopFileFormats_2016
HadoopFileFormats_2016
 

Similar to Fine-grained Evaluation of SPARQL Endpoint Federation Systems

Is IPv6 Really Faster?
Is IPv6 Really Faster?Is IPv6 Really Faster?
Is IPv6 Really Faster?APNIC
 
More Complete Resultset Retrieval from Large Heterogeneous RDF Sources
More Complete Resultset Retrieval from Large Heterogeneous RDF SourcesMore Complete Resultset Retrieval from Large Heterogeneous RDF Sources
More Complete Resultset Retrieval from Large Heterogeneous RDF SourcesAndré Valdestilhas
 
IPv6 Performance
IPv6 PerformanceIPv6 Performance
IPv6 PerformanceAPNIC
 
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...Semantic Web Company
 
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...OSTHUS
 
Open source programming
Open source programmingOpen source programming
Open source programmingRizwan Ahmed
 
Creating APIs over RDF
Creating APIs over RDFCreating APIs over RDF
Creating APIs over RDFLeigh Dodds
 
Creating APIs over RDF
Creating APIs over RDFCreating APIs over RDF
Creating APIs over RDFLeigh Dodds
 
Compile ahead of time. It's fine?
Compile ahead of time. It's fine?Compile ahead of time. It's fine?
Compile ahead of time. It's fine?Dmitry Chuyko
 
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Informationballoon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference InformationKai Schlegel
 
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...Lixi Conrads
 
Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2Sematext Group, Inc.
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod GmodJun Zhao
 
Retour d'expérience d'un environnement base de données multitenant
Retour d'expérience d'un environnement base de données multitenantRetour d'expérience d'un environnement base de données multitenant
Retour d'expérience d'un environnement base de données multitenantSwiss Data Forum Swiss Data Forum
 
Review of some successes
Review of some successesReview of some successes
Review of some successesAndrea Zaliani
 
Introduction to metadata cleansing using SPARQL update queries
Introduction to metadata cleansing using SPARQL update queriesIntroduction to metadata cleansing using SPARQL update queries
Introduction to metadata cleansing using SPARQL update queriesEuropean Commission
 
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...Muhammad Saleem
 
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)Dimitris Kontokostas
 
Spark Overview - Oleg Mürk
Spark Overview - Oleg MürkSpark Overview - Oleg Mürk
Spark Overview - Oleg MürkPlanet OS
 

Similar to Fine-grained Evaluation of SPARQL Endpoint Federation Systems (20)

Is IPv6 Really Faster?
Is IPv6 Really Faster?Is IPv6 Really Faster?
Is IPv6 Really Faster?
 
Net app
Net appNet app
Net app
 
More Complete Resultset Retrieval from Large Heterogeneous RDF Sources
More Complete Resultset Retrieval from Large Heterogeneous RDF SourcesMore Complete Resultset Retrieval from Large Heterogeneous RDF Sources
More Complete Resultset Retrieval from Large Heterogeneous RDF Sources
 
IPv6 Performance
IPv6 PerformanceIPv6 Performance
IPv6 Performance
 
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...
 
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
 
Open source programming
Open source programmingOpen source programming
Open source programming
 
Creating APIs over RDF
Creating APIs over RDFCreating APIs over RDF
Creating APIs over RDF
 
Creating APIs over RDF
Creating APIs over RDFCreating APIs over RDF
Creating APIs over RDF
 
Compile ahead of time. It's fine?
Compile ahead of time. It's fine?Compile ahead of time. It's fine?
Compile ahead of time. It's fine?
 
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Informationballoon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
 
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...
 
Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Retour d'expérience d'un environnement base de données multitenant
Retour d'expérience d'un environnement base de données multitenantRetour d'expérience d'un environnement base de données multitenant
Retour d'expérience d'un environnement base de données multitenant
 
Review of some successes
Review of some successesReview of some successes
Review of some successes
 
Introduction to metadata cleansing using SPARQL update queries
Introduction to metadata cleansing using SPARQL update queriesIntroduction to metadata cleansing using SPARQL update queries
Introduction to metadata cleansing using SPARQL update queries
 
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
 
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
 
Spark Overview - Oleg Mürk
Spark Overview - Oleg MürkSpark Overview - Oleg Mürk
Spark Overview - Oleg Mürk
 

More from Muhammad Saleem

QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...Muhammad Saleem
 
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationCostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationMuhammad Saleem
 
SQCFramework: SPARQL Query containment Benchmark Generation Framework
SQCFramework: SPARQL Query containment  Benchmark Generation Framework SQCFramework: SPARQL Query containment  Benchmark Generation Framework
SQCFramework: SPARQL Query containment Benchmark Generation Framework Muhammad Saleem
 
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...Muhammad Saleem
 
LSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries DatasetLSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries DatasetMuhammad Saleem
 
SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data CubesSAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data CubesMuhammad Saleem
 
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataDAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataMuhammad Saleem
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataMuhammad Saleem
 
Linked Cancer Genome Atlas Database
Linked Cancer Genome Atlas DatabaseLinked Cancer Genome Atlas Database
Linked Cancer Genome Atlas DatabaseMuhammad Saleem
 

More from Muhammad Saleem (11)

QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
 
LargeRDFBench
LargeRDFBenchLargeRDFBench
LargeRDFBench
 
Extended LargeRDFBench
Extended LargeRDFBenchExtended LargeRDFBench
Extended LargeRDFBench
 
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationCostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
 
SQCFramework: SPARQL Query containment Benchmark Generation Framework
SQCFramework: SPARQL Query containment  Benchmark Generation Framework SQCFramework: SPARQL Query containment  Benchmark Generation Framework
SQCFramework: SPARQL Query containment Benchmark Generation Framework
 
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
 
LSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries DatasetLSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries Dataset
 
SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data CubesSAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
 
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataDAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked Data
 
Linked Cancer Genome Atlas Database
Linked Cancer Genome Atlas DatabaseLinked Cancer Genome Atlas Database
Linked Cancer Genome Atlas Database
 

Recently uploaded

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 

Recently uploaded (20)

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 

Fine-grained Evaluation of SPARQL Endpoint Federation Systems

  • 1. Muhammad Saleem, Yasar Khan, Ali Hasnain, Ivan Ermilov, Axel-Cyrille Ngonga Ngomo ISWC 2016, Kobe, Japan, 20/10/2016 10/20/2016 1 Agile Knowledge Engineering and Semantic Web (AKSW), University of Leipzig, Germany
  • 2.  SPARQL query federation  Survey of federated SPARQL query processing systems  Performance variables  Evaluation of SPARQL endpoint federation systems 10/20/2016 2
  • 3.  SPARQL Endpoint Federation (SEF)  Linked Data Federation (LDF)  Hybrid of SEF+LDF 10/20/2016 3
  • 4. 10/20/2016 4 SPARQL Endpoint Federation S1 S2 S3 S4 RDF RDF RDF RDF Parsing/Rewriting Source Selection Federator Optimzer Integrator Rewrite query and get Individual Triple Patterns Identify capable source against Individual Triple Patterns Generate optimized sub- query Exe. Plan Integrate sub- queries results Execute sub- queries
  • 5.  Index-free  Using SPARQL ASK queries  Potentially ensures result set completeness  SPARQL ASK queries can be expensive  SPARQL ASK cache is beneficial  E.g., FedX  Index-only  Only make use of Index/data summaries  Less efficient but fast source selection  Result set completeness is not ensured  E.g., DARQ, LHD 10/20/2016 5
  • 6.  Hybrid  Make use of index+SPARQL ASK  Most efficient  Result set completeness is not ensured  SPARQL ASK cache is beneficial  E.g., HiBISCuS, ANAPSID, SPLENDID 10/20/2016 6
  • 7.  System Information  Title and/or URL of federation engine  Code available?  Implementation and license  Type of source selection  Type of join(s) used for data integration  Use of cache?  Support for catalog/index update 10/20/2016 7
  • 9. 10/20/2016 9 Yes 59% No 12% Not yet 29% Code available? Index-free 6% Index- only 29% Hybrid 65% Type of source selection Yes 47%No 53% Use of cache? Yes 29% No 59% NA 12% Index update support?
  • 10.  Requirements  Result completeness  Policy-based query planning  Support for partial results retrieval  Support for no-blocking operator/ adaptive query processing  Support for provenance information  Query runtime estimation  Duplicate Detection  Top-K query processing  Supported SPARQL types/ clauses 10/20/2016 10
  • 12. 10/20/2016 12 Yes 18% No 82% Result completeness? Yes 6% No 88% Partial 6% Policy-based planning? Yes 24% No 76% Partial results retrieval? Yes 59% No 41% Adaptive processing?
  • 13. 10/20/2016 13 Yes 0% No 94% Partial 6% Provenance info? Yes 0% No 100% Runtime estimation? Yes 12% No 76% Partial 12% Duplicate detection? No 100% Top-k querying?
  • 16.  Benchmarks  FedBench  Sliced FedBench  SP2Bench  SPARQL endpoint federation engines  FedX  SPLENDID  LHD  DAW  ADERIS  ANAPSID 10/20/2016 16
  • 17. 10/20/2016 17 FedBench Query Category FedX SPLENDID LHD DARQ ANAPSID ADERIS CD 252 44 0 0 43 0 LS 297 33 0 0 63 0 LD 369 19 0 0 37 0 Net 918 96 0 0 143 0 Query Category Sliced FedBench CD 280 110 0 0 228 0 LS 330 100 0 0 143 0 LD 410 140 0 0 270 0 SP2Bench 660 180 0 0 265 0 Net 1680 530 0 0 906 0
  • 18. 10/20/2016 18 FedBench Query Category FedX SPLENDID LHD DARQ ANAPSID ADERIS CD 78 78 112 84 36 84 LS 56 56 105 77 44 70 LD 108 108 119 112 54 119 Net 242 242 336 283 134 273 Query Category Sliced FedBench CD 182 182 225 195 163 195 LS 125 125 163 133 102 117 LD 243 243 324 324 183 324 SP2Bench 521 521 593 420 244 173 Net 1071 1071 1305 1072 692 809
  • 19. 10/20/2016 19 FedBench Query Category FedX(cold) FedX(warm) SPLENDID LHD DARQ ANAPSID ADERIS CD 151 7 256 6 7 186 6 LS 147 8 236 6 12 477 5 LD 139 8 221 6 7 804 5 Average 146 8 237 6 9 489 5 Query Category Sliced FedBench CD 284 9 470 7 8 3183 7 LS 207 7 308 7 8 2897 7 LD 323 9 557 8 7 2908 8 SP2Bench 212 9 738 8 9RE 6 Average 256 8 518 7 8 2996 7
  • 21. 21 FedBench: FedX(warm)  FedX(cold)  LHD  SPLENDID  ANAPSID  DARQ 25/25 17/22 13/22 15/24 16/22 Sliced FedBench: FedX(warm)  FedX(cold)  LHD  SPLENDID  ANAPSID  DARQ 29/36 17/24 17/24 17/26 12/20
  • 22. 22 0 50 100 150 200 250 300 350 400 450 500 CD LS LD Average Averageexecutiontime(msec) FedX (first run) FedBench FedX (first run) SlicedBench 0 50 100 150 200 250 300 CD LS LD Average Averageexecutiontime(msec) FedX (cached) FedBench FedX (cached) SlicedBench 0 200 400 600 800 1000 1200 1400 1600 CD LS LD Average Averageexecutiontime(msec) SPLENDID FedBench SPLENDID SlicedBench 0 10000 20000 30000 40000 50000 60000 70000 80000 CD LS LD Average Averageexecutiontime(msec) ANAPSID FedBench ANAPSID SlicedBench 0 50000 100000 150000 200000 250000 300000 350000 CD LS LD Average Averageexecutiontime(msec) DARQ FedBench DARQ SlicedBench 0 2000 4000 6000 8000 10000 12000 14000 CD LS LD Average Averageexecutiontime(msec) LHD FedBench LHD SlicedBench
  • 23. 23 0 50 100 150 200 250 300 350 400 450 500 CD LS LD Average Averageexecutiontime(msec) FedX (first run) FedBench FedX (first run) SlicedBench Decreased by 214% 0 50 100 150 200 250 300 CD LS LD Average Averageexecutiontime(msec) FedX (cached) FedBench FedX (cached) SlicedBench Decreased by 199% 0 200 400 600 800 1000 1200 1400 1600 CD LS LD Average Averageexecutiontime(msec) SPLENDID FedBench SPLENDID SlicedBench Decreased by 227% 0 10000 20000 30000 40000 50000 60000 70000 80000 CD LS LD Average Averageexecutiontime(msec) ANAPSID FedBench ANAPSID SlicedBench Decreased by 392% 0 50000 100000 150000 200000 250000 300000 350000 CD LS LD Average Averageexecutiontime(msec) DARQ FedBench DARQ SlicedBench Increased by 36% 0 2000 4000 6000 8000 10000 12000 14000 CD LS LD Average Averageexecutiontime(msec) LHD FedBench LHD SlicedBench Decreased by 293%