SlideShare a Scribd company logo
1 of 26
Download to read offline
Topic-based Federated
Query Engine
Ester Giallonardo, Ciro Sorrentino ,Eugenio Zimeo
ICWI - BUDAPEST - 2018
CONTEXT AND PROBLEM STATEMENT
CONTEXT AND PROBLEM STATEMENT
RELATED WORK
Federated Service Endpoint Query Engine - Code Avalable
FedX SPLENDID DARQ CostFed LHD ANAPSID ADERIS
Code Avalability
YES (java- jar)
YES (java,
scala) YES (java) YES (python) YES (java) YES (python) YES (java)
LAST UPDATE 2016 2011 2006 2018 2013 2013 N.D.
SERVICE Sparql Clause
support yes NO No yes No yes No
SOURCE SELECTION
query (ASK)
query (ASK),
index, Sparql
service
descriptor, VoID
index (sparql
service
descriptor)
index, query cost
estimation based
on selectivity
query (ASK),
index, Sparql
service
descriptor, VoID
query (ASK),
index index
JOIN TYPE
nested loop,
bind hash, bind
nested loop,
bind
bound,
symmetric hash
join hash, bind adaptive
Index based
nested loop
CACHE
YES (ASK
history) YES NO NO NO NO NO
None of existing federated engines exploits data mining based strategies to select
services and to implement SPARQL queries executions on SPARQL endpoints
RELATED WORK (2)
- Topic model has been widely used for document identification
- Bhattacharya & Sil, 2016 for first used LDA and sparse representation
based classifier for information retrieval
- Wang et al., 2016 and Wei & Croft, 2006 consider a query as a distribution
of terms over topics through LDA for information retrieval
- Röder et al., 2016 used LDA to identify only the topics of RDF datasets
through the extracted English labels
- None of these research contributions:
- exploits LDA to determine the similarity between queries and services
- builds the corpus by IRIs, i.e. structured and semantic data
TOPIC FEDERATED QUERY ENGINE
- RDF dataset are intelligible documents i.e. semantic based
- LDA is used to get the dataset topic model and to infer query topics
- Topics are datasets summary
- Topic similarity between a query and a dataset reveals a possible
pertinence
- The Topic SPARQL Federated Query Engine
- learns through datasets hosted on services
- infers query routing information
- executes the federated query on the distributed architecture
SOLUTION: ARCHITECTURAL VIEW
ARCHITECTURE DETAILS
QUERY REWRITING RULES
LDA BASED INDEX
DTD1
DTD2
DTD3
DTD4
IRIs IRIs IRIs
SOURCE SELECTION STRATEGIES
…. ….
DTDDi
…. ….
DTDDi
…. ….
DTDDi
…. ….
DTDDi
* QTD
Tx
Ty
Tz
…. ….
QTD
…. ….
QTD
…. ….
QTD
BEST STRATEGY
ALL STRATEGY
ALL FILTERED STRATEGY
K-MEANS STRATEGY K=2
threshold
threshold
Centroid 1
Centroid 2
For some
pattern
Tx
Ty
Tz
Tx
Ty
Tz
Tx
Ty
Tz
Tx
Ty
Tz
Tx
Ty
Tz
Dm
Dn
Do
Dp Di
Cluster delimitation
THE BEST STRATEGY
SELECT *WHERE {
}
BEST ALL ALL-FILT. K-MEANS
Triple-pattern1 Ty
Triple-pattern2 Ty
Triple-pattern3 Ty
Triple-pattern4 Ty
…. ….
DTDDi
Tx
Ty
Tz
…. ….
QTD
BEST STRATEGY
Tx
Ty
Tz
THE ALL STRATEGY
SELECT *WHERE {
}
BEST ALL ALL-FILT. K-MEANS
Triple-pattern1 Ty
Ty
Tz
Triple-pattern2 Ty
Ty
Tz
Triple-pattern3 Ty
Ty
Tz
Triple-pattern4 Ty
Ty
Tz
….
…. ….
DTDDi
…. ….
QTD
ALL STRATEGY
threshold
Tx
Ty
Tz
Tx
Ty
Tz
THE ALL-FILTERED STRATEGY
SELECT *WHERE {
}
BEST ALL ALL-FILT. K-MEANS
Triple-pattern1 Ty
Ty
Tz
Ty
Tz
Triple-pattern2 Ty
Ty
Tz
Ty
Tz
Triple-pattern3 Ty
Ty
Tz
Ty
Triple-pattern4 Ty
Ty
Tz
Ty
…. ….
DTDDi
…. ….
QTD
ALL FILTERED STRATEGY
threshold
For some
pattern
Tx
Ty
Tz
Tx
Ty
Tz
THE K-MEANS STRATEGY
SELECT *WHERE {
}
BEST ALL ALL-FILT. K-MEANS
Triple-pattern1 Ty
Ty
Tz
Ty
Tz
Dn
Dp
Triple-pattern2 Ty
Ty
Tz
Ty
Tz
Dn
Dp
Triple-pattern3 Ty
Ty
Tz
Ty
Dn
Dp
Triple-pattern4 Ty
Ty
Tz
Ty
Dn
Dp
…. ….
DTDDi
* QTD
K-MEANS STRATEGY K=2
Centroid 1
Centroid 2
Dm
Dn
Do
Dp Di
Cluster delimitation
DATASET-QUERY TOPIC MATCHING
SELECT *WHERE {
}
BEST ALL ALL-FIL. K-MEANS
Triple-pattern1 Dn
Ty
Tz
Ty
Tz
Dn
Dp
Triple-pattern2 Dn
Ty
Tz
Ty
Tz
Dn
Dp
Triple-pattern3 Dn
Ty
Tz
Ty
Dn
Dp
Triple-pattern4 Dn
Ty
Tz
Ty
Dn
Dp
BEST STRATEGY
DATASET-QUERY TOPIC MATCHING (2)
SELECT *WHERE {
}
BEST ALL ALL-FIL. K-MEANS
Triple-pattern1 Dn
Dn
Do
Dp
Ty
Tz
Dn
Dp
Triple-pattern2 Dn
Dn
Do
Dp
Ty
Tz
Dn
Dp
Triple-pattern3 Dn
Dn
Do
Dp
Ty
Dn
Dp
Triple-pattern4 Dn
Dn
Do
Dp
Ty
Dn
Dp
ALL STRATEGY
DATASET-QUERY TOPIC MATCHING (3)
SELECT *WHERE {
}
BEST ALL ALL-FIL. K-MEANS
Triple-pattern1 Dn
Dn
Do
Dp
Dn
Do
Dp
Dn
Dp
Triple-pattern2 Dn
Dn
Do
Dp
Dn
Do
Dp
Dn
Dp
Triple-pattern3 Dn
Dn
Do
Dp
Dn
Dn
Dp
Triple-pattern4 Dn
Dn
Do
Dp
Dn
Dn
Dp
ALL FILTERED STRATEGY
SELECT *
WHERE {
}
BEST ALL ALL-FILTERED K-MEANS
Triple-pattern1
Sn
Sn
So
Sp
Sn
So
Sp
Sn
Sp
Triple-pattern2 Sn
So
Sp
Sn
So
Sp
Sn
Sp
Triple-pattern3 Sn
So
Sp
Sn
Sn
Sp
Triple-pattern4 Sn
So
Sp
Sn
Sp
SERVICE SUBSTITUTION - AGGREGATION
QUERY EXAMPLE
THE BENCHMARK
RESULTS
RESULTS (2)
RESULTS (3)
CONCLUSION
- RDF-dataset once treated as documents are exploited by LDA to extract
datasets latent semantics .
- This latent semantic is represented by topics that are datasets summaries.
- The Topic SPARQL Federated Query Engine learns through datasets hosted
on services how to split, route and execute service-less Sparql queries in a
federated way.
- It is a middleware oriented to transparently querying the Open Data world
- Work in progress:
- Benchmarking with other engines
- Evaluating index stability
- Improving performance and recall of the strategies
THANK YOU FOR YOUR ATTENTION!
ANY QUESTION?
Topic-based Federated Query Engine
Ester Giallonardo, Ciro Sorrentino, Eugenio Zimeo
ICWI - BUDAPEST - 2018

More Related Content

What's hot

ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.Tatiana Tarasova
 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlBen Healey
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of DataRinke Hoekstra
 
Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ...
 Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ... Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ...
Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ...Vladimir Alexiev, PhD, PMP
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
 
A Workshop on R
A Workshop on RA Workshop on R
A Workshop on RAjay Ohri
 
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and HyperlinkingDCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and Hyperlinkingmultimediaeval
 
The Semantics of SPARQL
The Semantics of SPARQLThe Semantics of SPARQL
The Semantics of SPARQLOlaf Hartig
 
Text Mining Infrastructure in R
Text Mining Infrastructure in RText Mining Infrastructure in R
Text Mining Infrastructure in RAshraf Uddin
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)Ankit Rathi
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationOlaf Hartig
 
Applications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and ClassificationApplications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and Classificationshakimov
 
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Salah Amean
 
hands on: Text Mining With R
hands on: Text Mining With Rhands on: Text Mining With R
hands on: Text Mining With RJahnab Kumar Deka
 
An Introduction to Spark with Scala
An Introduction to Spark with ScalaAn Introduction to Spark with Scala
An Introduction to Spark with ScalaChetan Khatri
 

What's hot (20)

ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.
 
Profile of NPOESS HDF5 Files
Profile of NPOESS HDF5 FilesProfile of NPOESS HDF5 Files
Profile of NPOESS HDF5 Files
 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco Control
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
 
Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ...
 Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ... Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ...
Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ...
 
SWT Lecture Session 8 - Rules
SWT Lecture Session 8 - RulesSWT Lecture Session 8 - Rules
SWT Lecture Session 8 - Rules
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
 
A Workshop on R
A Workshop on RA Workshop on R
A Workshop on R
 
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and HyperlinkingDCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
 
The Semantics of SPARQL
The Semantics of SPARQLThe Semantics of SPARQL
The Semantics of SPARQL
 
Text Mining Infrastructure in R
Text Mining Infrastructure in RText Mining Infrastructure in R
Text Mining Infrastructure in R
 
5 rdfs
5 rdfs5 rdfs
5 rdfs
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
 
SWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDFSWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDF
 
Applications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and ClassificationApplications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and Classification
 
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
 
COMPUTER LABORATORY-4 LAB MANUAL BE COMPUTER ENGINEERING
COMPUTER LABORATORY-4 LAB MANUAL BE COMPUTER ENGINEERINGCOMPUTER LABORATORY-4 LAB MANUAL BE COMPUTER ENGINEERING
COMPUTER LABORATORY-4 LAB MANUAL BE COMPUTER ENGINEERING
 
hands on: Text Mining With R
hands on: Text Mining With Rhands on: Text Mining With R
hands on: Text Mining With R
 
An Introduction to Spark with Scala
An Introduction to Spark with ScalaAn Introduction to Spark with Scala
An Introduction to Spark with Scala
 

Similar to Topic-based Federator Query Engine - Presented at ICWI Budapest 2018

Querying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern FragmentsQuerying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern FragmentsRuben Verborgh
 
Big Data Processing using Apache Spark and Clojure
Big Data Processing using Apache Spark and ClojureBig Data Processing using Apache Spark and Clojure
Big Data Processing using Apache Spark and ClojureDr. Christian Betz
 
Koalas: How Well Does Koalas Work?
Koalas: How Well Does Koalas Work?Koalas: How Well Does Koalas Work?
Koalas: How Well Does Koalas Work?Databricks
 
Sparql service-description
Sparql service-descriptionSparql service-description
Sparql service-descriptionSTIinnsbruck
 
No more struggles with Apache Spark workloads in production
No more struggles with Apache Spark workloads in productionNo more struggles with Apache Spark workloads in production
No more struggles with Apache Spark workloads in productionChetan Khatri
 
Bringing OpenClinica Data into SAS
Bringing OpenClinica Data into SASBringing OpenClinica Data into SAS
Bringing OpenClinica Data into SASRick Watts
 
Hadoop Summit 2014: Query Optimization and JIT-based Vectorized Execution in ...
Hadoop Summit 2014: Query Optimization and JIT-based Vectorized Execution in ...Hadoop Summit 2014: Query Optimization and JIT-based Vectorized Execution in ...
Hadoop Summit 2014: Query Optimization and JIT-based Vectorized Execution in ...Gruter
 
Big Data Analytics with Apache Spark
Big Data Analytics with Apache SparkBig Data Analytics with Apache Spark
Big Data Analytics with Apache SparkMarcoYuriFujiiMelo
 
Spark Summit EU talk by Ross Lawley
Spark Summit EU talk by Ross LawleySpark Summit EU talk by Ross Lawley
Spark Summit EU talk by Ross LawleySpark Summit
 
How To Connect Spark To Your Own Datasource
How To Connect Spark To Your Own DatasourceHow To Connect Spark To Your Own Datasource
How To Connect Spark To Your Own DatasourceMongoDB
 
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...Databricks
 
Spark ETL Techniques - Creating An Optimal Fantasy Baseball Roster
Spark ETL Techniques - Creating An Optimal Fantasy Baseball RosterSpark ETL Techniques - Creating An Optimal Fantasy Baseball Roster
Spark ETL Techniques - Creating An Optimal Fantasy Baseball RosterDon Drake
 
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_dataA candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_dataSTIinnsbruck
 
An Empirical Evaluation of RDF Graph Partitioning Techniques
An Empirical Evaluation of RDF Graph Partitioning TechniquesAn Empirical Evaluation of RDF Graph Partitioning Techniques
An Empirical Evaluation of RDF Graph Partitioning TechniquesAdnan Akhter
 
GraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesGraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesPaco Nathan
 
Building a modern Application with DataFrames
Building a modern Application with DataFramesBuilding a modern Application with DataFrames
Building a modern Application with DataFramesDatabricks
 
Building a modern Application with DataFrames
Building a modern Application with DataFramesBuilding a modern Application with DataFrames
Building a modern Application with DataFramesSpark Summit
 
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationCostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationMuhammad Saleem
 
DataStax | Data Science with DataStax Enterprise (Brian Hess) | Cassandra Sum...
DataStax | Data Science with DataStax Enterprise (Brian Hess) | Cassandra Sum...DataStax | Data Science with DataStax Enterprise (Brian Hess) | Cassandra Sum...
DataStax | Data Science with DataStax Enterprise (Brian Hess) | Cassandra Sum...DataStax
 

Similar to Topic-based Federator Query Engine - Presented at ICWI Budapest 2018 (20)

Querying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern FragmentsQuerying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern Fragments
 
Big Data Processing using Apache Spark and Clojure
Big Data Processing using Apache Spark and ClojureBig Data Processing using Apache Spark and Clojure
Big Data Processing using Apache Spark and Clojure
 
Koalas: How Well Does Koalas Work?
Koalas: How Well Does Koalas Work?Koalas: How Well Does Koalas Work?
Koalas: How Well Does Koalas Work?
 
Sparql service-description
Sparql service-descriptionSparql service-description
Sparql service-description
 
No more struggles with Apache Spark workloads in production
No more struggles with Apache Spark workloads in productionNo more struggles with Apache Spark workloads in production
No more struggles with Apache Spark workloads in production
 
Bringing OpenClinica Data into SAS
Bringing OpenClinica Data into SASBringing OpenClinica Data into SAS
Bringing OpenClinica Data into SAS
 
Spark learning
Spark learningSpark learning
Spark learning
 
Hadoop Summit 2014: Query Optimization and JIT-based Vectorized Execution in ...
Hadoop Summit 2014: Query Optimization and JIT-based Vectorized Execution in ...Hadoop Summit 2014: Query Optimization and JIT-based Vectorized Execution in ...
Hadoop Summit 2014: Query Optimization and JIT-based Vectorized Execution in ...
 
Big Data Analytics with Apache Spark
Big Data Analytics with Apache SparkBig Data Analytics with Apache Spark
Big Data Analytics with Apache Spark
 
Spark Summit EU talk by Ross Lawley
Spark Summit EU talk by Ross LawleySpark Summit EU talk by Ross Lawley
Spark Summit EU talk by Ross Lawley
 
How To Connect Spark To Your Own Datasource
How To Connect Spark To Your Own DatasourceHow To Connect Spark To Your Own Datasource
How To Connect Spark To Your Own Datasource
 
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
 
Spark ETL Techniques - Creating An Optimal Fantasy Baseball Roster
Spark ETL Techniques - Creating An Optimal Fantasy Baseball RosterSpark ETL Techniques - Creating An Optimal Fantasy Baseball Roster
Spark ETL Techniques - Creating An Optimal Fantasy Baseball Roster
 
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_dataA candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
A candidate dataset_discovery_and_linkage_recommendation_system_for_linked_data
 
An Empirical Evaluation of RDF Graph Partitioning Techniques
An Empirical Evaluation of RDF Graph Partitioning TechniquesAn Empirical Evaluation of RDF Graph Partitioning Techniques
An Empirical Evaluation of RDF Graph Partitioning Techniques
 
GraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesGraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communities
 
Building a modern Application with DataFrames
Building a modern Application with DataFramesBuilding a modern Application with DataFrames
Building a modern Application with DataFrames
 
Building a modern Application with DataFrames
Building a modern Application with DataFramesBuilding a modern Application with DataFrames
Building a modern Application with DataFrames
 
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationCostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
 
DataStax | Data Science with DataStax Enterprise (Brian Hess) | Cassandra Sum...
DataStax | Data Science with DataStax Enterprise (Brian Hess) | Cassandra Sum...DataStax | Data Science with DataStax Enterprise (Brian Hess) | Cassandra Sum...
DataStax | Data Science with DataStax Enterprise (Brian Hess) | Cassandra Sum...
 

Recently uploaded

Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya Shirtrahman018755
 
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Servicegwenoracqe6
 
VIP Kolkata Call Girl Kestopur 👉 8250192130 Available With Room
VIP Kolkata Call Girl Kestopur 👉 8250192130  Available With RoomVIP Kolkata Call Girl Kestopur 👉 8250192130  Available With Room
VIP Kolkata Call Girl Kestopur 👉 8250192130 Available With Roomdivyansh0kumar0
 
Gram Darshan PPT cyber rural in villages of india
Gram Darshan PPT cyber rural  in villages of indiaGram Darshan PPT cyber rural  in villages of india
Gram Darshan PPT cyber rural in villages of indiaimessage0108
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Delhi Call girls
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024APNIC
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607dollysharma2066
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...APNIC
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$kojalkojal131
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024APNIC
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...Diya Sharma
 
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...SofiyaSharma5
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝soniya singh
 

Recently uploaded (20)

Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
 
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
 
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
 
VIP Kolkata Call Girl Kestopur 👉 8250192130 Available With Room
VIP Kolkata Call Girl Kestopur 👉 8250192130  Available With RoomVIP Kolkata Call Girl Kestopur 👉 8250192130  Available With Room
VIP Kolkata Call Girl Kestopur 👉 8250192130 Available With Room
 
Gram Darshan PPT cyber rural in villages of india
Gram Darshan PPT cyber rural  in villages of indiaGram Darshan PPT cyber rural  in villages of india
Gram Darshan PPT cyber rural in villages of india
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
 
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 26 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
 
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
 
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
 

Topic-based Federator Query Engine - Presented at ICWI Budapest 2018

  • 1. Topic-based Federated Query Engine Ester Giallonardo, Ciro Sorrentino ,Eugenio Zimeo ICWI - BUDAPEST - 2018
  • 4. RELATED WORK Federated Service Endpoint Query Engine - Code Avalable FedX SPLENDID DARQ CostFed LHD ANAPSID ADERIS Code Avalability YES (java- jar) YES (java, scala) YES (java) YES (python) YES (java) YES (python) YES (java) LAST UPDATE 2016 2011 2006 2018 2013 2013 N.D. SERVICE Sparql Clause support yes NO No yes No yes No SOURCE SELECTION query (ASK) query (ASK), index, Sparql service descriptor, VoID index (sparql service descriptor) index, query cost estimation based on selectivity query (ASK), index, Sparql service descriptor, VoID query (ASK), index index JOIN TYPE nested loop, bind hash, bind nested loop, bind bound, symmetric hash join hash, bind adaptive Index based nested loop CACHE YES (ASK history) YES NO NO NO NO NO None of existing federated engines exploits data mining based strategies to select services and to implement SPARQL queries executions on SPARQL endpoints
  • 5. RELATED WORK (2) - Topic model has been widely used for document identification - Bhattacharya & Sil, 2016 for first used LDA and sparse representation based classifier for information retrieval - Wang et al., 2016 and Wei & Croft, 2006 consider a query as a distribution of terms over topics through LDA for information retrieval - Röder et al., 2016 used LDA to identify only the topics of RDF datasets through the extracted English labels - None of these research contributions: - exploits LDA to determine the similarity between queries and services - builds the corpus by IRIs, i.e. structured and semantic data
  • 6. TOPIC FEDERATED QUERY ENGINE - RDF dataset are intelligible documents i.e. semantic based - LDA is used to get the dataset topic model and to infer query topics - Topics are datasets summary - Topic similarity between a query and a dataset reveals a possible pertinence - The Topic SPARQL Federated Query Engine - learns through datasets hosted on services - infers query routing information - executes the federated query on the distributed architecture
  • 11. SOURCE SELECTION STRATEGIES …. …. DTDDi …. …. DTDDi …. …. DTDDi …. …. DTDDi * QTD Tx Ty Tz …. …. QTD …. …. QTD …. …. QTD BEST STRATEGY ALL STRATEGY ALL FILTERED STRATEGY K-MEANS STRATEGY K=2 threshold threshold Centroid 1 Centroid 2 For some pattern Tx Ty Tz Tx Ty Tz Tx Ty Tz Tx Ty Tz Tx Ty Tz Dm Dn Do Dp Di Cluster delimitation
  • 12. THE BEST STRATEGY SELECT *WHERE { } BEST ALL ALL-FILT. K-MEANS Triple-pattern1 Ty Triple-pattern2 Ty Triple-pattern3 Ty Triple-pattern4 Ty …. …. DTDDi Tx Ty Tz …. …. QTD BEST STRATEGY Tx Ty Tz
  • 13. THE ALL STRATEGY SELECT *WHERE { } BEST ALL ALL-FILT. K-MEANS Triple-pattern1 Ty Ty Tz Triple-pattern2 Ty Ty Tz Triple-pattern3 Ty Ty Tz Triple-pattern4 Ty Ty Tz …. …. …. DTDDi …. …. QTD ALL STRATEGY threshold Tx Ty Tz Tx Ty Tz
  • 14. THE ALL-FILTERED STRATEGY SELECT *WHERE { } BEST ALL ALL-FILT. K-MEANS Triple-pattern1 Ty Ty Tz Ty Tz Triple-pattern2 Ty Ty Tz Ty Tz Triple-pattern3 Ty Ty Tz Ty Triple-pattern4 Ty Ty Tz Ty …. …. DTDDi …. …. QTD ALL FILTERED STRATEGY threshold For some pattern Tx Ty Tz Tx Ty Tz
  • 15. THE K-MEANS STRATEGY SELECT *WHERE { } BEST ALL ALL-FILT. K-MEANS Triple-pattern1 Ty Ty Tz Ty Tz Dn Dp Triple-pattern2 Ty Ty Tz Ty Tz Dn Dp Triple-pattern3 Ty Ty Tz Ty Dn Dp Triple-pattern4 Ty Ty Tz Ty Dn Dp …. …. DTDDi * QTD K-MEANS STRATEGY K=2 Centroid 1 Centroid 2 Dm Dn Do Dp Di Cluster delimitation
  • 16. DATASET-QUERY TOPIC MATCHING SELECT *WHERE { } BEST ALL ALL-FIL. K-MEANS Triple-pattern1 Dn Ty Tz Ty Tz Dn Dp Triple-pattern2 Dn Ty Tz Ty Tz Dn Dp Triple-pattern3 Dn Ty Tz Ty Dn Dp Triple-pattern4 Dn Ty Tz Ty Dn Dp BEST STRATEGY
  • 17. DATASET-QUERY TOPIC MATCHING (2) SELECT *WHERE { } BEST ALL ALL-FIL. K-MEANS Triple-pattern1 Dn Dn Do Dp Ty Tz Dn Dp Triple-pattern2 Dn Dn Do Dp Ty Tz Dn Dp Triple-pattern3 Dn Dn Do Dp Ty Dn Dp Triple-pattern4 Dn Dn Do Dp Ty Dn Dp ALL STRATEGY
  • 18. DATASET-QUERY TOPIC MATCHING (3) SELECT *WHERE { } BEST ALL ALL-FIL. K-MEANS Triple-pattern1 Dn Dn Do Dp Dn Do Dp Dn Dp Triple-pattern2 Dn Dn Do Dp Dn Do Dp Dn Dp Triple-pattern3 Dn Dn Do Dp Dn Dn Dp Triple-pattern4 Dn Dn Do Dp Dn Dn Dp ALL FILTERED STRATEGY
  • 19. SELECT * WHERE { } BEST ALL ALL-FILTERED K-MEANS Triple-pattern1 Sn Sn So Sp Sn So Sp Sn Sp Triple-pattern2 Sn So Sp Sn So Sp Sn Sp Triple-pattern3 Sn So Sp Sn Sn Sp Triple-pattern4 Sn So Sp Sn Sp SERVICE SUBSTITUTION - AGGREGATION
  • 25. CONCLUSION - RDF-dataset once treated as documents are exploited by LDA to extract datasets latent semantics . - This latent semantic is represented by topics that are datasets summaries. - The Topic SPARQL Federated Query Engine learns through datasets hosted on services how to split, route and execute service-less Sparql queries in a federated way. - It is a middleware oriented to transparently querying the Open Data world - Work in progress: - Benchmarking with other engines - Evaluating index stability - Improving performance and recall of the strategies
  • 26. THANK YOU FOR YOUR ATTENTION! ANY QUESTION? Topic-based Federated Query Engine Ester Giallonardo, Ciro Sorrentino, Eugenio Zimeo ICWI - BUDAPEST - 2018