SlideShare a Scribd company logo
Opportunistic Linked Data
Querying through Approximate
Membership Metadata
Miel Vander Sande
“Solve a query for a client, 

and it will be happy for a day.

Teach a client to SPARQL, 

and it’ll query happily ever after.”
!
— Confucius, 431 BC
Linked Data Fragments: a uniform view

on publishing Linked Data
Exploring the axis: selector and metadata
Approximate Membership Metadata
Querying through Approximate
Membership Metadata
Opportunistic Querying
Linked Data Fragments: a uniform view

on publishing Linked Data
Exploring the axis: selector and metadata
Approximate Membership Metadata
Querying through Approximate
Membership Metadata
Opportunistic Querying
Interaction between client & server.

The hunt for trade-offs: What can we learn?
high server costlow server cost
data

dump
SPARQL

endpoint
interface offered by the server
high availability low availability
high bandwidth low bandwidth
out-of-date data live data
low client costhigh client cost
Linked Data Fragments are

a uniform view on Linked Data interfaces.
data

dump
SPARQL

endpoint
interface offered by the server
Every Linked Data interface

offers specific fragments

of a Linked Data set.
data
metadata
controls
What triples does it contain?
What do we know about it?
How to access more data?
Each type of Linked Data Fragment

is defined by three characteristics.
all dataset triples
(none)
data dump
number of triples, file size
data
metadata
controls
Each type of Linked Data Fragment

is defined by three characteristics.
triples matching the query
(none)
(none)
SPARQL query result
data
metadata
controls
Each type of Linked Data Fragment

is defined by three characteristics.
Linked Data Fragments: a uniform view

on publishing Linked Data
Exploring the axis: selector and metadata
Approximate Membership Metadata
Querying through Approximate
Membership Metadata
Opportunistic Querying
low server cost
data

dump
SPARQL

query results
high availability
live data
Linked Data

documents
triple pattern

fragments
You have to start somewhere: 

Triple Pattern Fragments.
Verborgh, R., Hartig, O.,…: Querying datasets on the Web
with high availability. ISWC2014
high bandwidth
data (first 100)
controls (other fragments)
metadata (total count)
controls
Triple pattern fragment servers

enable clients to be intelligent.
<http://fragments.dbpedia.org/2014/en#dataset> hydra:search [
hydra:template "http://fragments.dbpedia.org/2014/en
{?subject,predicate,object}";
hydra:mapping
[ hydra:variable "subject"; hydra:property rdf:subject ],
[ hydra:variable "predicate"; hydra:property rdf:predicate ],
[ hydra:variable "object"; hydra:property rdf:object ]
].
The RDF representation explains:

“you can query by triple pattern”.
The RDF representation explains:

“this is the number of matches”.
metadata
Triple pattern fragment servers

enable clients to be intelligent.
<#fragment> void:triples 8141.
Give them a SPARQL query.

Give them a URL of any dataset fragment.
How can intelligent clients

solve SPARQL queries over fragments?
They look inside the fragment

to see how to access the dataset
and use the metadata

to decide how to plan the query.
The client splits the query

into the available fragments.
SELECT ?artist ?name WHERE {
?artist a dbpedia-owl:Artist;
rdfs:label ?name;
dbpedia-owl:birthPlace dbpedia:Padua.
FILTER LANGMATCHES(LANG(?name), "EN")
}
The client gets the fragments

and inspects their metadata.
?artist a dbpedia-owl:Artist.
first 100 triples
96,000
?artist rdfs:label ?name.
first 100 triples
12,000,000
?artist dbont:birthPlace dbpedia:Padua.
first 100 triples
135
?artist a dbpedia-owl:Artist. 96.000
?artist rdfs:label ?name. 12.000.000
?artist dbont:birthPlace dbpedia:Padua.
dbpedia:Alberto_Benettin dbont:birthPlace dbpedia:Padua.
135
dbpedia:Alberto_Bigon dbont:birthPlace dbpedia:Padua.
The metadata enables the client

to choose the right starting point.
dbp:Alberto_Benettin a dbont:Artist.
dbp:Alberto_Benettin rdfs:label ?name.
For some patterns, many requests are
of type “is this triple in the dataset?”
Fractionofmembershipqueries
0%
25%
50%
75%
100%
L1 L2 L3 L4 L5 S1 S2 S3 S4 S5 S6 S7 F1 F2 F3 F4 F5 C1 C2 C3
20 WatDiv queries

linear (L), star (S), snowflake-shaped (F) and complex (C)
Advancing in selector and/or metadata
dimensions.
metadata
selector
Triple Pattern Fragments
low server cost
high availability
live data
high bandwidth
Simple

Questions
Complex 

Questions
No information 

for the client
Extensive useful

information for the client
Advancing in selector and/or metadata
dimensions.
metadata
selector
Triple Pattern Fragments
Substring search
J Van Herwegen et. al.:
Substring Filtering for Low-Cost
Linked Data Interfaces

Last talk of this session!
Advancing in selector and/or metadata
dimensions.
metadata
selector
Triple Pattern Fragments
Substring search
Approximate Membership

Function (AMF)
Linked Data Fragments: a uniform view

on publishing Linked Data
Exploring the axis: selector and metadata
Approximate Membership Metadata
Querying through Approximate
Membership Metadata
Opportunistic Querying
Append TPF response with a compact
representation of all possible mappings.
metadata
Triple Pattern Fragments
Approximate Membership Function (AMF)
Approximate set membership assessment
with a predefined false positive probability.
Bloom filter / Golomb-coded set
+
“Can we reduce the number of HTTP requests?”
“Can we reduce the total execution time?”
“What is the overhead on server CPU load?”
Bloom Filter
Golomb-coded set (GCS)
0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 … 0 1 0
!
!
n0 dbpedia:Alberto_Benettin
n1 dbpedia:Alberto_Bigon
nx …
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 … 0 0 0
m
0 1 0 0 1 0 0 0 1 0 0 1 0 0 1 0 … 0 1 0
k0 k1 kx
k0 k1 kx
0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 … 0 0 0
!
n0 dbpedia:Alberto_Benettin
n1 dbpedia:Alberto_Bigon
k
0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 … 0 0 0
k
0 1 0 1 1 0 1Golomb coded
Geometric distribution
“this BloomFilter with false positive
probability X and hash function Y
represents the presence of all bindings for ?s”.
metadata
Server enables clients to avoid 

membership requests.
<#fragment> void:triples 96300. # existing count metadata
_:membershipFunction a ms:BloomFilter; # AMF metadata
ms:hashSize 524288;
ms:hashFunction <MyMurmur1>, <MyMurmur2>;
ms:memberCollection [
ms:sourceCollection <#fragment>;
ms:projectedProperty rdf:subject
];
ms:falsePositiveRate 0.05;
ms:falseNegativeRate 0.0;
ms:binaryRepresentation "QmF...ZTY"^^xsd:base64Binary.
GET ?artist dbont:birthPlace dbpedia:Padua.
dbpedia:Alberto_Benettin dbont:birthPlace dbpedia:Padua.
135
…
Client filters non-members locally 

with one extra (cached) request
GET dbpedia:Alberto_Benettin a dbont:Artist. 0
GET dbpedia:Alberto_Bigon a dbont:Artist. 1
GET dbpedia:Alberto_Da_Zara a dbont:Artist. 1
GET dbpedia:Alberto_Gallo a dbont:Artist. 0
GET dbpedia:Alberto_Bigon a dbont:Artist. 1
GET ?artist a dbont:Artist.
Approx.MembershipFilt.
GET …
We evaluated for request count, server
cost and speedup in a Web setting.
BloomFilter: MurMurHash3, GCS: FNV-1
1 HTTP Cache with 1 Mbps
p = 1/1024 (0.1%) , 1/128 (1%), 1/64 (1.6%)
250 queries from 125 diverse WatDiv
templates on Amazon EC2 machine
WatDiv 100M triples dataset
Timeout: 3min
We evaluated for request count, server
cost and speedup in a Web setting.
vs. vanilla TPF server & client
Original “greedy” algorithm

Optimized join-tree algorithm*
250 queries from 125 diverse WatDiv
templates on Amazon EC2 machine
* Van Herwegen, et. al.: Query Execution Optimization for
Clients of Triple Patterns Fragments. ESWC2015
2 client algorithms:
> 50% of the queries has fewer requests,

< 20% has more requests.
Greedy Bloom
Greedy GCS
Optimized Bloom
Optimized GCS
Percentage of queries (p = 1/1024)
0% 25% 50% 75% 100%
6%
5%
18%
17%
59%
62%
49%
50%
35%
33%
33%
32%
Equal Fewer Requests More Requests
Queries with relatively many HTTP req.
(45,000+ / query) benefit greatly
Differencein#Requests
0
4,000
8,000
12,000
16,000
Fewer Requests More Requests
Greedy Bloom Greedy GCS Optimized Bloom Optimized GCS
< 35
No queries have reduction in execution
time, a third even has increase.
Greedy Bloom
Greedy GCS
Optimized Bloom
Optimized GCS
Percentage of queries (p = 1/1024)
0% 25% 50% 75% 100%
16%
31%
33%
38%0%
84%
69%
67%
62%
Equal Lower Execution time Higher Execution time
Server remains low-cost, as impact is 

very acceptable (< 6%).
CPU(%)
0
7.5
15
22.5
30
O
riginal
Bloom
(1/1024)
Bloom
(1/128)
Bloom
(1/64)
G
CS
(1/1024)
G
CS
(1/128)
G
CS
(1/64)
11.110.810.2
14.9
11.210.8
9.2
Linked Data Fragments: a uniform view

on publishing Linked Data
Exploring the axis: selector and metadata
Approximate Membership Metadata
Querying through Approximate
Membership Metadata
Opportunistic Querying
During execution, a result candidate
could already be correct (1 - p).
Can we be opportunistic here, and
temporarily allow imprecise results?
“Can we reduce the time to 100% recall?”
Opportunistic Linked Data Querying 13
only allow
certain results
temporarily allow
uncertain results
start
execution
start
execution
1st result
computed
1st result
computed
n < r results
computed
n < r results
computed
r results
computed
r results
computed
r + f results
computed
0% recall 100% recall 100% recall
100% precision
Fig. 2. This SPARQL query execution timeline compares regular and opportunistic
query execution, assuming r total query results and f false positives. Note how
both approaches achieve 100% recall and precision at a shared point in the end, but
there exists a period during which only opportunistic execution reaches 100% recall
(shaded).
need to be discarded. The user thus sees the photos faster than if they
had only been retrieved after full precision was achieved. This example
Temporarily allowing <100% precision 

can reduce 100% recall time with 1/3.
Executiontime(s)
0
35
70
105
140
Greedy + Bloom (p = 1/1024)
100% Recall 100% Precision
Number of revoked results was 0 or 1.
Linked Data Fragments: a uniform view

on publishing Linked Data
Exploring the axis: selector and metadata
Approximate Membership Metadata
Querying through Approximate
Membership Metadata
Opportunistic Querying
For some queries types, bandwidth highly
decreases for TPF query execution.
Approximate Membership Metadata 

is a nuanced debate
For larger fragments, realtime computation
hurts execution time. We expect gain with 

pre-caching and out-of-band delivery.
Opportunistic querying is a promising direction
for further exploration.
TRIPLE PATTERN
fragments
data
APPR. MEM. FILT.
No one size fits all, explore the axis.

Find metrics that fit your use-case.
Client & Server load

Request & Response size

Protocol (HTTP) impact

…
Try you own trade-off
server at our demo (and
get a nice cup of coffee).
Start serving Linked Data like a barista
Opportunistic Linked Data
Querying through Approximate
Membership Metadata
Miel Vander Sande

More Related Content

What's hot

Asp.net server control
Asp.net  server controlAsp.net  server control
Asp.net server control
Sireesh K
 
Vespa, A Tour
Vespa, A TourVespa, A Tour
Vespa, A Tour
MatthewOverstreet2
 
MongoDB World 2019: Don't Break the Camel's Back: Running MongoDB as Hard as ...
MongoDB World 2019: Don't Break the Camel's Back: Running MongoDB as Hard as ...MongoDB World 2019: Don't Break the Camel's Back: Running MongoDB as Hard as ...
MongoDB World 2019: Don't Break the Camel's Back: Running MongoDB as Hard as ...
MongoDB
 
Document Validation in MongoDB 3.2
Document Validation in MongoDB 3.2Document Validation in MongoDB 3.2
Document Validation in MongoDB 3.2
MongoDB
 
Managing your black friday logs - Code Europe
Managing your black friday logs - Code EuropeManaging your black friday logs - Code Europe
Managing your black friday logs - Code Europe
David Pilato
 
Open Policy Agent (OPA) と Kubernetes Policy
Open Policy Agent (OPA) と Kubernetes PolicyOpen Policy Agent (OPA) と Kubernetes Policy
Open Policy Agent (OPA) と Kubernetes Policy
Motonori Shindo
 
5 Pitfalls to Avoid with MongoDB
5 Pitfalls to Avoid with MongoDB5 Pitfalls to Avoid with MongoDB
5 Pitfalls to Avoid with MongoDB
Tim Callaghan
 
Kafka Summit 2018: A Journey Building Kafka Connectors - Pegerto Fernandez
Kafka Summit 2018: A Journey Building Kafka Connectors - Pegerto FernandezKafka Summit 2018: A Journey Building Kafka Connectors - Pegerto Fernandez
Kafka Summit 2018: A Journey Building Kafka Connectors - Pegerto Fernandez
OpenCredo
 
Data mining for_java_and_dot_net 2016-17
Data mining for_java_and_dot_net 2016-17Data mining for_java_and_dot_net 2016-17
Data mining for_java_and_dot_net 2016-17
redpel dot com
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB
 
PROVABLE DATA PROCESSING (PDP) A MODEL FOR CLIENT'S SECURED DATA ON CLOUD
PROVABLE DATA PROCESSING (PDP) A MODEL FOR CLIENT'S SECURED DATA ON CLOUDPROVABLE DATA PROCESSING (PDP) A MODEL FOR CLIENT'S SECURED DATA ON CLOUD
PROVABLE DATA PROCESSING (PDP) A MODEL FOR CLIENT'S SECURED DATA ON CLOUD
Journal For Research
 
Getting Started With Elasticsearch In .NET
Getting Started With Elasticsearch In .NETGetting Started With Elasticsearch In .NET
Getting Started With Elasticsearch In .NET
Ahmed Abd Ellatif
 
Visualizing Mobile Broadband with MongoDB
Visualizing Mobile Broadband with MongoDBVisualizing Mobile Broadband with MongoDB
Visualizing Mobile Broadband with MongoDBMongoDB
 
21.04.2016 Meetup: Spark vs. Flink
21.04.2016 Meetup: Spark vs. Flink21.04.2016 Meetup: Spark vs. Flink
21.04.2016 Meetup: Spark vs. Flink
Comsysto Reply GmbH
 
MongoDB .local Munich 2019: Managing a Heterogeneous Stack with MongoDB & SQL
MongoDB .local Munich 2019: Managing a Heterogeneous Stack with MongoDB & SQLMongoDB .local Munich 2019: Managing a Heterogeneous Stack with MongoDB & SQL
MongoDB .local Munich 2019: Managing a Heterogeneous Stack with MongoDB & SQL
MongoDB
 
Managing your Black Friday Logs NDC Oslo
Managing your  Black Friday Logs NDC OsloManaging your  Black Friday Logs NDC Oslo
Managing your Black Friday Logs NDC Oslo
David Pilato
 
Haystacks slides
Haystacks slidesHaystacks slides
Haystacks slides
Ted Sullivan
 
Managing your black friday logs Voxxed Luxembourg
Managing your black friday logs Voxxed LuxembourgManaging your black friday logs Voxxed Luxembourg
Managing your black friday logs Voxxed Luxembourg
David Pilato
 
Efficient Similarity Search over Encrypted Data
Efficient Similarity Search over Encrypted DataEfficient Similarity Search over Encrypted Data
Efficient Similarity Search over Encrypted Data
IRJET Journal
 

What's hot (20)

Asp.net server control
Asp.net  server controlAsp.net  server control
Asp.net server control
 
Vespa, A Tour
Vespa, A TourVespa, A Tour
Vespa, A Tour
 
MongoDB World 2019: Don't Break the Camel's Back: Running MongoDB as Hard as ...
MongoDB World 2019: Don't Break the Camel's Back: Running MongoDB as Hard as ...MongoDB World 2019: Don't Break the Camel's Back: Running MongoDB as Hard as ...
MongoDB World 2019: Don't Break the Camel's Back: Running MongoDB as Hard as ...
 
Document Validation in MongoDB 3.2
Document Validation in MongoDB 3.2Document Validation in MongoDB 3.2
Document Validation in MongoDB 3.2
 
Managing your black friday logs - Code Europe
Managing your black friday logs - Code EuropeManaging your black friday logs - Code Europe
Managing your black friday logs - Code Europe
 
Open Policy Agent (OPA) と Kubernetes Policy
Open Policy Agent (OPA) と Kubernetes PolicyOpen Policy Agent (OPA) と Kubernetes Policy
Open Policy Agent (OPA) と Kubernetes Policy
 
5 Pitfalls to Avoid with MongoDB
5 Pitfalls to Avoid with MongoDB5 Pitfalls to Avoid with MongoDB
5 Pitfalls to Avoid with MongoDB
 
Kafka Summit 2018: A Journey Building Kafka Connectors - Pegerto Fernandez
Kafka Summit 2018: A Journey Building Kafka Connectors - Pegerto FernandezKafka Summit 2018: A Journey Building Kafka Connectors - Pegerto Fernandez
Kafka Summit 2018: A Journey Building Kafka Connectors - Pegerto Fernandez
 
Data mining for_java_and_dot_net 2016-17
Data mining for_java_and_dot_net 2016-17Data mining for_java_and_dot_net 2016-17
Data mining for_java_and_dot_net 2016-17
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
 
My Master's Thesis
My Master's ThesisMy Master's Thesis
My Master's Thesis
 
PROVABLE DATA PROCESSING (PDP) A MODEL FOR CLIENT'S SECURED DATA ON CLOUD
PROVABLE DATA PROCESSING (PDP) A MODEL FOR CLIENT'S SECURED DATA ON CLOUDPROVABLE DATA PROCESSING (PDP) A MODEL FOR CLIENT'S SECURED DATA ON CLOUD
PROVABLE DATA PROCESSING (PDP) A MODEL FOR CLIENT'S SECURED DATA ON CLOUD
 
Getting Started With Elasticsearch In .NET
Getting Started With Elasticsearch In .NETGetting Started With Elasticsearch In .NET
Getting Started With Elasticsearch In .NET
 
Visualizing Mobile Broadband with MongoDB
Visualizing Mobile Broadband with MongoDBVisualizing Mobile Broadband with MongoDB
Visualizing Mobile Broadband with MongoDB
 
21.04.2016 Meetup: Spark vs. Flink
21.04.2016 Meetup: Spark vs. Flink21.04.2016 Meetup: Spark vs. Flink
21.04.2016 Meetup: Spark vs. Flink
 
MongoDB .local Munich 2019: Managing a Heterogeneous Stack with MongoDB & SQL
MongoDB .local Munich 2019: Managing a Heterogeneous Stack with MongoDB & SQLMongoDB .local Munich 2019: Managing a Heterogeneous Stack with MongoDB & SQL
MongoDB .local Munich 2019: Managing a Heterogeneous Stack with MongoDB & SQL
 
Managing your Black Friday Logs NDC Oslo
Managing your  Black Friday Logs NDC OsloManaging your  Black Friday Logs NDC Oslo
Managing your Black Friday Logs NDC Oslo
 
Haystacks slides
Haystacks slidesHaystacks slides
Haystacks slides
 
Managing your black friday logs Voxxed Luxembourg
Managing your black friday logs Voxxed LuxembourgManaging your black friday logs Voxxed Luxembourg
Managing your black friday logs Voxxed Luxembourg
 
Efficient Similarity Search over Encrypted Data
Efficient Similarity Search over Encrypted DataEfficient Similarity Search over Encrypted Data
Efficient Similarity Search over Encrypted Data
 

Viewers also liked

iRail: History & current issues
iRail: History & current issuesiRail: History & current issues
iRail: History & current issues
Pieter Colpaert
 
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Pieter Heyvaert
 
Situation of open data in Flanders
Situation of open data in FlandersSituation of open data in Flanders
Situation of open data in Flanders
Pieter Colpaert
 
Towards an Interface for User-Friendly Linked Data Generation Administration
Towards an Interface for User-Friendly Linked Data Generation AdministrationTowards an Interface for User-Friendly Linked Data Generation Administration
Towards an Interface for User-Friendly Linked Data Generation Administration
andimou
 
LDOW2013 r&wbase: git for triples
LDOW2013 r&wbase: git for triplesLDOW2013 r&wbase: git for triples
LDOW2013 r&wbase: git for triplesMiel Vander Sande
 
Machines are the new Digital Natives
Machines are the new Digital NativesMachines are the new Digital Natives
Machines are the new Digital Natives
Miel Vander Sande
 
Querying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern FragmentsQuerying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern Fragments
Ruben Verborgh
 
Querying Heterogeneous Linked Date Interfaces through Reasoning
Querying Heterogeneous Linked Date Interfaces through ReasoningQuerying Heterogeneous Linked Date Interfaces through Reasoning
Querying Heterogeneous Linked Date Interfaces through Reasoning
Joachim Van Herwegen
 
Time travelling through DBpedia
Time travelling through DBpediaTime travelling through DBpedia
Time travelling through DBpedia
Miel Vander Sande
 
ESWC2015 - Query Optimization for Clients of Linked Data Fragments
ESWC2015 - Query Optimization for Clients of Linked Data FragmentsESWC2015 - Query Optimization for Clients of Linked Data Fragments
ESWC2015 - Query Optimization for Clients of Linked Data Fragments
Joachim Van Herwegen
 
Presentation Data Science Challenge
Presentation Data Science ChallengePresentation Data Science Challenge
Presentation Data Science Challenge
Dieter De Witte
 
Towards a Uniform User Interface for Editing Mapping Definitions
Towards a Uniform User Interface for Editing Mapping DefinitionsTowards a Uniform User Interface for Editing Mapping Definitions
Towards a Uniform User Interface for Editing Mapping Definitions
Pieter Heyvaert
 
DBpedia Mappings Quality Assessment
DBpedia Mappings Quality AssessmentDBpedia Mappings Quality Assessment
DBpedia Mappings Quality Assessment
andimou
 
Scaling out federated queries for Life Sciences Data In Production
Scaling out federated queries for Life Sciences Data In ProductionScaling out federated queries for Life Sciences Data In Production
Scaling out federated queries for Life Sciences Data In Production
Dieter De Witte
 
ComparativeMotifFinding
ComparativeMotifFindingComparativeMotifFinding
ComparativeMotifFinding
Dieter De Witte
 
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked DataEffect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Laurens De Vocht
 
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Laurens De Vocht
 
RMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
RMLEditor: A Graph-based Mapping Editor for Linked Data MappingsRMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
RMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
Pieter Heyvaert
 
OSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked OrganizationsOSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked Organizations
Laurens De Vocht
 
The Future is Federated
The Future is FederatedThe Future is Federated
The Future is Federated
Ruben Verborgh
 

Viewers also liked (20)

iRail: History & current issues
iRail: History & current issuesiRail: History & current issues
iRail: History & current issues
 
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-...
 
Situation of open data in Flanders
Situation of open data in FlandersSituation of open data in Flanders
Situation of open data in Flanders
 
Towards an Interface for User-Friendly Linked Data Generation Administration
Towards an Interface for User-Friendly Linked Data Generation AdministrationTowards an Interface for User-Friendly Linked Data Generation Administration
Towards an Interface for User-Friendly Linked Data Generation Administration
 
LDOW2013 r&wbase: git for triples
LDOW2013 r&wbase: git for triplesLDOW2013 r&wbase: git for triples
LDOW2013 r&wbase: git for triples
 
Machines are the new Digital Natives
Machines are the new Digital NativesMachines are the new Digital Natives
Machines are the new Digital Natives
 
Querying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern FragmentsQuerying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern Fragments
 
Querying Heterogeneous Linked Date Interfaces through Reasoning
Querying Heterogeneous Linked Date Interfaces through ReasoningQuerying Heterogeneous Linked Date Interfaces through Reasoning
Querying Heterogeneous Linked Date Interfaces through Reasoning
 
Time travelling through DBpedia
Time travelling through DBpediaTime travelling through DBpedia
Time travelling through DBpedia
 
ESWC2015 - Query Optimization for Clients of Linked Data Fragments
ESWC2015 - Query Optimization for Clients of Linked Data FragmentsESWC2015 - Query Optimization for Clients of Linked Data Fragments
ESWC2015 - Query Optimization for Clients of Linked Data Fragments
 
Presentation Data Science Challenge
Presentation Data Science ChallengePresentation Data Science Challenge
Presentation Data Science Challenge
 
Towards a Uniform User Interface for Editing Mapping Definitions
Towards a Uniform User Interface for Editing Mapping DefinitionsTowards a Uniform User Interface for Editing Mapping Definitions
Towards a Uniform User Interface for Editing Mapping Definitions
 
DBpedia Mappings Quality Assessment
DBpedia Mappings Quality AssessmentDBpedia Mappings Quality Assessment
DBpedia Mappings Quality Assessment
 
Scaling out federated queries for Life Sciences Data In Production
Scaling out federated queries for Life Sciences Data In ProductionScaling out federated queries for Life Sciences Data In Production
Scaling out federated queries for Life Sciences Data In Production
 
ComparativeMotifFinding
ComparativeMotifFindingComparativeMotifFinding
ComparativeMotifFinding
 
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked DataEffect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
 
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
Benchmarking the Effectiveness of Associating Chains of Links for Exploratory...
 
RMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
RMLEditor: A Graph-based Mapping Editor for Linked Data MappingsRMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
RMLEditor: A Graph-based Mapping Editor for Linked Data Mappings
 
OSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked OrganizationsOSLO: Open Standards for Linked Organizations
OSLO: Open Standards for Linked Organizations
 
The Future is Federated
The Future is FederatedThe Future is Federated
The Future is Federated
 

Similar to Opportunistic Linked Data Querying through Approximate Membership Metadata

Querying datasets on the Web with high availability
Querying datasets on the Web with high availabilityQuerying datasets on the Web with high availability
Querying datasets on the Web with high availability
Ruben Verborgh
 
Sustainable queryable access to Linked Data
Sustainable queryable access to Linked DataSustainable queryable access to Linked Data
Sustainable queryable access to Linked Data
Ruben Verborgh
 
Querying data on the Web – client or server?
Querying data on the Web – client or server?Querying data on the Web – client or server?
Querying data on the Web – client or server?
Ruben Verborgh
 
OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...
OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin &  Leanne La...OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin &  Leanne La...
OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...
NETWAYS
 
How sitecore depends on mongo db for scalability and performance, and what it...
How sitecore depends on mongo db for scalability and performance, and what it...How sitecore depends on mongo db for scalability and performance, and what it...
How sitecore depends on mongo db for scalability and performance, and what it...
Antonios Giannopoulos
 
Making sense of your data jug
Making sense of your data   jugMaking sense of your data   jug
Making sense of your data jug
Gerald Muecke
 
How to use Impala query plan and profile to fix performance issues
How to use Impala query plan and profile to fix performance issuesHow to use Impala query plan and profile to fix performance issues
How to use Impala query plan and profile to fix performance issues
Cloudera, Inc.
 
Elastic{ON} 2017 Recap
Elastic{ON} 2017 RecapElastic{ON} 2017 Recap
Elastic{ON} 2017 Recap
Matias Cascallares
 
BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogue
BioCatalogue
 
Realtime Analytics on AWS
Realtime Analytics on AWSRealtime Analytics on AWS
Realtime Analytics on AWS
Sungmin Kim
 
Introduction to GraphQL Presentation.pptx
Introduction to GraphQL Presentation.pptxIntroduction to GraphQL Presentation.pptx
Introduction to GraphQL Presentation.pptx
Knoldus Inc.
 
Computing Outside The Box June 2009
Computing Outside The Box June 2009Computing Outside The Box June 2009
Computing Outside The Box June 2009
Ian Foster
 
Sharing is Caring: Toward Creating Self-tuning Multi-tenant Kafka (Anna Povzn...
Sharing is Caring: Toward Creating Self-tuning Multi-tenant Kafka (Anna Povzn...Sharing is Caring: Toward Creating Self-tuning Multi-tenant Kafka (Anna Povzn...
Sharing is Caring: Toward Creating Self-tuning Multi-tenant Kafka (Anna Povzn...
HostedbyConfluent
 
Website fingerprinting on TOR
Website fingerprinting on TORWebsite fingerprinting on TOR
Website fingerprinting on TOR
E Hacking
 
Scale and Throughput @ Clicktale with Akka
Scale and Throughput @ Clicktale with AkkaScale and Throughput @ Clicktale with Akka
Scale and Throughput @ Clicktale with Akka
Yuval Itzchakov
 
Presto at Tivo, Boston Hadoop Meetup
Presto at Tivo, Boston Hadoop MeetupPresto at Tivo, Boston Hadoop Meetup
Presto at Tivo, Boston Hadoop Meetup
Justin Borgman
 
Data Architectures for Robust Decision Making
Data Architectures for Robust Decision MakingData Architectures for Robust Decision Making
Data Architectures for Robust Decision Making
Gwen (Chen) Shapira
 
Apache Cassandra - Drivers deep dive
Apache Cassandra - Drivers deep diveApache Cassandra - Drivers deep dive
Apache Cassandra - Drivers deep dive
Alex Thompson
 
gLite Information System
gLite Information SystemgLite Information System
gLite Information SystemLeandro Ciuffo
 

Similar to Opportunistic Linked Data Querying through Approximate Membership Metadata (20)

Querying datasets on the Web with high availability
Querying datasets on the Web with high availabilityQuerying datasets on the Web with high availability
Querying datasets on the Web with high availability
 
Sustainable queryable access to Linked Data
Sustainable queryable access to Linked DataSustainable queryable access to Linked Data
Sustainable queryable access to Linked Data
 
Querying data on the Web – client or server?
Querying data on the Web – client or server?Querying data on the Web – client or server?
Querying data on the Web – client or server?
 
OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...
OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin &  Leanne La...OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin &  Leanne La...
OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...
 
How sitecore depends on mongo db for scalability and performance, and what it...
How sitecore depends on mongo db for scalability and performance, and what it...How sitecore depends on mongo db for scalability and performance, and what it...
How sitecore depends on mongo db for scalability and performance, and what it...
 
Making sense of your data jug
Making sense of your data   jugMaking sense of your data   jug
Making sense of your data jug
 
How to use Impala query plan and profile to fix performance issues
How to use Impala query plan and profile to fix performance issuesHow to use Impala query plan and profile to fix performance issues
How to use Impala query plan and profile to fix performance issues
 
Elastic{ON} 2017 Recap
Elastic{ON} 2017 RecapElastic{ON} 2017 Recap
Elastic{ON} 2017 Recap
 
BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogue
 
Realtime Analytics on AWS
Realtime Analytics on AWSRealtime Analytics on AWS
Realtime Analytics on AWS
 
Introduction to GraphQL Presentation.pptx
Introduction to GraphQL Presentation.pptxIntroduction to GraphQL Presentation.pptx
Introduction to GraphQL Presentation.pptx
 
Linked Data Fragments
Linked Data FragmentsLinked Data Fragments
Linked Data Fragments
 
Computing Outside The Box June 2009
Computing Outside The Box June 2009Computing Outside The Box June 2009
Computing Outside The Box June 2009
 
Sharing is Caring: Toward Creating Self-tuning Multi-tenant Kafka (Anna Povzn...
Sharing is Caring: Toward Creating Self-tuning Multi-tenant Kafka (Anna Povzn...Sharing is Caring: Toward Creating Self-tuning Multi-tenant Kafka (Anna Povzn...
Sharing is Caring: Toward Creating Self-tuning Multi-tenant Kafka (Anna Povzn...
 
Website fingerprinting on TOR
Website fingerprinting on TORWebsite fingerprinting on TOR
Website fingerprinting on TOR
 
Scale and Throughput @ Clicktale with Akka
Scale and Throughput @ Clicktale with AkkaScale and Throughput @ Clicktale with Akka
Scale and Throughput @ Clicktale with Akka
 
Presto at Tivo, Boston Hadoop Meetup
Presto at Tivo, Boston Hadoop MeetupPresto at Tivo, Boston Hadoop Meetup
Presto at Tivo, Boston Hadoop Meetup
 
Data Architectures for Robust Decision Making
Data Architectures for Robust Decision MakingData Architectures for Robust Decision Making
Data Architectures for Robust Decision Making
 
Apache Cassandra - Drivers deep dive
Apache Cassandra - Drivers deep diveApache Cassandra - Drivers deep dive
Apache Cassandra - Drivers deep dive
 
gLite Information System
gLite Information SystemgLite Information System
gLite Information System
 

More from Miel Vander Sande

20230525_mmc_seminar.pdf
20230525_mmc_seminar.pdf20230525_mmc_seminar.pdf
20230525_mmc_seminar.pdf
Miel Vander Sande
 
The Memento protocol
The Memento protocolThe Memento protocol
The Memento protocol
Miel Vander Sande
 
Slight change of plans!
Slight change of plans!Slight change of plans!
Slight change of plans!
Miel Vander Sande
 
Preserving a Web of Linked Data: Lessons and challenges from a fading web
Preserving a Web of Linked Data: Lessons and challenges from a fading webPreserving a Web of Linked Data: Lessons and challenges from a fading web
Preserving a Web of Linked Data: Lessons and challenges from a fading web
Miel Vander Sande
 
PhD Defense: Metadata and Control Features for Low-Cost Linked Data Publishin...
PhD Defense: Metadata and Control Features for Low-Cost Linked Data Publishin...PhD Defense: Metadata and Control Features for Low-Cost Linked Data Publishin...
PhD Defense: Metadata and Control Features for Low-Cost Linked Data Publishin...
Miel Vander Sande
 
Reproducibility with 
the 99 cents Linked Data archive
Reproducibility with 
the 99 cents Linked Data archiveReproducibility with 
the 99 cents Linked Data archive
Reproducibility with 
the 99 cents Linked Data archive
Miel Vander Sande
 
Innovatiemarkt 2017: Machines are the new digital natives
Innovatiemarkt 2017: Machines are the new digital nativesInnovatiemarkt 2017: Machines are the new digital natives
Innovatiemarkt 2017: Machines are the new digital natives
Miel Vander Sande
 
A sweet affordable combo for Linked Data Archives
A sweet affordable combo for Linked Data ArchivesA sweet affordable combo for Linked Data Archives
A sweet affordable combo for Linked Data Archives
Miel Vander Sande
 
Publish data as Time Consistent Web API based on Provenance (WS-REST 2014)
Publish data as Time Consistent Web API based on Provenance (WS-REST 2014)Publish data as Time Consistent Web API based on Provenance (WS-REST 2014)
Publish data as Time Consistent Web API based on Provenance (WS-REST 2014)Miel Vander Sande
 
The Story behind Everything Is Connected: Multimedia narration of automatical...
The Story behind Everything Is Connected: Multimedia narration of automatical...The Story behind Everything Is Connected: Multimedia narration of automatical...
The Story behind Everything Is Connected: Multimedia narration of automatical...
Miel Vander Sande
 
The Terminator's origins or how the Semantic Web could endanger Humanity.
The Terminator's origins or how the Semantic Web could endanger Humanity.The Terminator's origins or how the Semantic Web could endanger Humanity.
The Terminator's origins or how the Semantic Web could endanger Humanity.Miel Vander Sande
 
PMOD Challenges for Open Data Usage: Open derivatives and challenges
PMOD Challenges for Open Data Usage: Open derivatives and challengesPMOD Challenges for Open Data Usage: Open derivatives and challenges
PMOD Challenges for Open Data Usage: Open derivatives and challengesMiel Vander Sande
 
Aan de slag met Linked Open Data
Aan de slag met Linked Open DataAan de slag met Linked Open Data
Aan de slag met Linked Open DataMiel Vander Sande
 
The DataTank: an Open Data adapter with semantic output
The DataTank: an Open Data adapter with semantic outputThe DataTank: an Open Data adapter with semantic output
The DataTank: an Open Data adapter with semantic outputMiel Vander Sande
 
Follow the stars 25/11/2011
Follow the stars 25/11/2011Follow the stars 25/11/2011
Follow the stars 25/11/2011
Miel Vander Sande
 

More from Miel Vander Sande (15)

20230525_mmc_seminar.pdf
20230525_mmc_seminar.pdf20230525_mmc_seminar.pdf
20230525_mmc_seminar.pdf
 
The Memento protocol
The Memento protocolThe Memento protocol
The Memento protocol
 
Slight change of plans!
Slight change of plans!Slight change of plans!
Slight change of plans!
 
Preserving a Web of Linked Data: Lessons and challenges from a fading web
Preserving a Web of Linked Data: Lessons and challenges from a fading webPreserving a Web of Linked Data: Lessons and challenges from a fading web
Preserving a Web of Linked Data: Lessons and challenges from a fading web
 
PhD Defense: Metadata and Control Features for Low-Cost Linked Data Publishin...
PhD Defense: Metadata and Control Features for Low-Cost Linked Data Publishin...PhD Defense: Metadata and Control Features for Low-Cost Linked Data Publishin...
PhD Defense: Metadata and Control Features for Low-Cost Linked Data Publishin...
 
Reproducibility with 
the 99 cents Linked Data archive
Reproducibility with 
the 99 cents Linked Data archiveReproducibility with 
the 99 cents Linked Data archive
Reproducibility with 
the 99 cents Linked Data archive
 
Innovatiemarkt 2017: Machines are the new digital natives
Innovatiemarkt 2017: Machines are the new digital nativesInnovatiemarkt 2017: Machines are the new digital natives
Innovatiemarkt 2017: Machines are the new digital natives
 
A sweet affordable combo for Linked Data Archives
A sweet affordable combo for Linked Data ArchivesA sweet affordable combo for Linked Data Archives
A sweet affordable combo for Linked Data Archives
 
Publish data as Time Consistent Web API based on Provenance (WS-REST 2014)
Publish data as Time Consistent Web API based on Provenance (WS-REST 2014)Publish data as Time Consistent Web API based on Provenance (WS-REST 2014)
Publish data as Time Consistent Web API based on Provenance (WS-REST 2014)
 
The Story behind Everything Is Connected: Multimedia narration of automatical...
The Story behind Everything Is Connected: Multimedia narration of automatical...The Story behind Everything Is Connected: Multimedia narration of automatical...
The Story behind Everything Is Connected: Multimedia narration of automatical...
 
The Terminator's origins or how the Semantic Web could endanger Humanity.
The Terminator's origins or how the Semantic Web could endanger Humanity.The Terminator's origins or how the Semantic Web could endanger Humanity.
The Terminator's origins or how the Semantic Web could endanger Humanity.
 
PMOD Challenges for Open Data Usage: Open derivatives and challenges
PMOD Challenges for Open Data Usage: Open derivatives and challengesPMOD Challenges for Open Data Usage: Open derivatives and challenges
PMOD Challenges for Open Data Usage: Open derivatives and challenges
 
Aan de slag met Linked Open Data
Aan de slag met Linked Open DataAan de slag met Linked Open Data
Aan de slag met Linked Open Data
 
The DataTank: an Open Data adapter with semantic output
The DataTank: an Open Data adapter with semantic outputThe DataTank: an Open Data adapter with semantic output
The DataTank: an Open Data adapter with semantic output
 
Follow the stars 25/11/2011
Follow the stars 25/11/2011Follow the stars 25/11/2011
Follow the stars 25/11/2011
 

Recently uploaded

Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Christina Lin
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
SyedAbiiAzazi1
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
anoopmanoharan2
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
ydteq
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
zwunae
 
Online aptitude test management system project report.pdf
Online aptitude test management system project report.pdfOnline aptitude test management system project report.pdf
Online aptitude test management system project report.pdf
Kamal Acharya
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
dxobcob
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 

Recently uploaded (20)

Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
 
Online aptitude test management system project report.pdf
Online aptitude test management system project report.pdfOnline aptitude test management system project report.pdf
Online aptitude test management system project report.pdf
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 

Opportunistic Linked Data Querying through Approximate Membership Metadata

  • 1. Opportunistic Linked Data Querying through Approximate Membership Metadata Miel Vander Sande
  • 2. “Solve a query for a client, 
 and it will be happy for a day.
 Teach a client to SPARQL, 
 and it’ll query happily ever after.” ! — Confucius, 431 BC
  • 3. Linked Data Fragments: a uniform view
 on publishing Linked Data Exploring the axis: selector and metadata Approximate Membership Metadata Querying through Approximate Membership Metadata Opportunistic Querying
  • 4. Linked Data Fragments: a uniform view
 on publishing Linked Data Exploring the axis: selector and metadata Approximate Membership Metadata Querying through Approximate Membership Metadata Opportunistic Querying
  • 5. Interaction between client & server.
 The hunt for trade-offs: What can we learn? high server costlow server cost data
 dump SPARQL
 endpoint interface offered by the server high availability low availability high bandwidth low bandwidth out-of-date data live data low client costhigh client cost
  • 6. Linked Data Fragments are
 a uniform view on Linked Data interfaces. data
 dump SPARQL
 endpoint interface offered by the server Every Linked Data interface
 offers specific fragments
 of a Linked Data set.
  • 7. data metadata controls What triples does it contain? What do we know about it? How to access more data? Each type of Linked Data Fragment
 is defined by three characteristics.
  • 8. all dataset triples (none) data dump number of triples, file size data metadata controls Each type of Linked Data Fragment
 is defined by three characteristics.
  • 9. triples matching the query (none) (none) SPARQL query result data metadata controls Each type of Linked Data Fragment
 is defined by three characteristics.
  • 10. Linked Data Fragments: a uniform view
 on publishing Linked Data Exploring the axis: selector and metadata Approximate Membership Metadata Querying through Approximate Membership Metadata Opportunistic Querying
  • 11. low server cost data
 dump SPARQL
 query results high availability live data Linked Data
 documents triple pattern
 fragments You have to start somewhere: 
 Triple Pattern Fragments. Verborgh, R., Hartig, O.,…: Querying datasets on the Web with high availability. ISWC2014 high bandwidth
  • 12. data (first 100) controls (other fragments) metadata (total count)
  • 13. controls Triple pattern fragment servers
 enable clients to be intelligent. <http://fragments.dbpedia.org/2014/en#dataset> hydra:search [ hydra:template "http://fragments.dbpedia.org/2014/en {?subject,predicate,object}"; hydra:mapping [ hydra:variable "subject"; hydra:property rdf:subject ], [ hydra:variable "predicate"; hydra:property rdf:predicate ], [ hydra:variable "object"; hydra:property rdf:object ] ]. The RDF representation explains:
 “you can query by triple pattern”.
  • 14. The RDF representation explains:
 “this is the number of matches”. metadata Triple pattern fragment servers
 enable clients to be intelligent. <#fragment> void:triples 8141.
  • 15. Give them a SPARQL query.
 Give them a URL of any dataset fragment. How can intelligent clients
 solve SPARQL queries over fragments? They look inside the fragment
 to see how to access the dataset and use the metadata
 to decide how to plan the query.
  • 16. The client splits the query
 into the available fragments. SELECT ?artist ?name WHERE { ?artist a dbpedia-owl:Artist; rdfs:label ?name; dbpedia-owl:birthPlace dbpedia:Padua. FILTER LANGMATCHES(LANG(?name), "EN") }
  • 17. The client gets the fragments
 and inspects their metadata. ?artist a dbpedia-owl:Artist. first 100 triples 96,000 ?artist rdfs:label ?name. first 100 triples 12,000,000 ?artist dbont:birthPlace dbpedia:Padua. first 100 triples 135
  • 18. ?artist a dbpedia-owl:Artist. 96.000 ?artist rdfs:label ?name. 12.000.000 ?artist dbont:birthPlace dbpedia:Padua. dbpedia:Alberto_Benettin dbont:birthPlace dbpedia:Padua. 135 dbpedia:Alberto_Bigon dbont:birthPlace dbpedia:Padua. The metadata enables the client
 to choose the right starting point. dbp:Alberto_Benettin a dbont:Artist. dbp:Alberto_Benettin rdfs:label ?name.
  • 19. For some patterns, many requests are of type “is this triple in the dataset?” Fractionofmembershipqueries 0% 25% 50% 75% 100% L1 L2 L3 L4 L5 S1 S2 S3 S4 S5 S6 S7 F1 F2 F3 F4 F5 C1 C2 C3 20 WatDiv queries
 linear (L), star (S), snowflake-shaped (F) and complex (C)
  • 20. Advancing in selector and/or metadata dimensions. metadata selector Triple Pattern Fragments low server cost high availability live data high bandwidth Simple
 Questions Complex 
 Questions No information 
 for the client Extensive useful
 information for the client
  • 21. Advancing in selector and/or metadata dimensions. metadata selector Triple Pattern Fragments Substring search J Van Herwegen et. al.: Substring Filtering for Low-Cost Linked Data Interfaces
 Last talk of this session!
  • 22. Advancing in selector and/or metadata dimensions. metadata selector Triple Pattern Fragments Substring search Approximate Membership
 Function (AMF)
  • 23. Linked Data Fragments: a uniform view
 on publishing Linked Data Exploring the axis: selector and metadata Approximate Membership Metadata Querying through Approximate Membership Metadata Opportunistic Querying
  • 24. Append TPF response with a compact representation of all possible mappings. metadata Triple Pattern Fragments Approximate Membership Function (AMF) Approximate set membership assessment with a predefined false positive probability. Bloom filter / Golomb-coded set +
  • 25. “Can we reduce the number of HTTP requests?” “Can we reduce the total execution time?” “What is the overhead on server CPU load?”
  • 26. Bloom Filter Golomb-coded set (GCS) 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 … 0 1 0 ! ! n0 dbpedia:Alberto_Benettin n1 dbpedia:Alberto_Bigon nx … 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 … 0 0 0 m 0 1 0 0 1 0 0 0 1 0 0 1 0 0 1 0 … 0 1 0 k0 k1 kx k0 k1 kx 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 … 0 0 0 ! n0 dbpedia:Alberto_Benettin n1 dbpedia:Alberto_Bigon k 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 … 0 0 0 k 0 1 0 1 1 0 1Golomb coded Geometric distribution
  • 27. “this BloomFilter with false positive probability X and hash function Y represents the presence of all bindings for ?s”. metadata Server enables clients to avoid 
 membership requests. <#fragment> void:triples 96300. # existing count metadata _:membershipFunction a ms:BloomFilter; # AMF metadata ms:hashSize 524288; ms:hashFunction <MyMurmur1>, <MyMurmur2>; ms:memberCollection [ ms:sourceCollection <#fragment>; ms:projectedProperty rdf:subject ]; ms:falsePositiveRate 0.05; ms:falseNegativeRate 0.0; ms:binaryRepresentation "QmF...ZTY"^^xsd:base64Binary.
  • 28. GET ?artist dbont:birthPlace dbpedia:Padua. dbpedia:Alberto_Benettin dbont:birthPlace dbpedia:Padua. 135 … Client filters non-members locally 
 with one extra (cached) request GET dbpedia:Alberto_Benettin a dbont:Artist. 0 GET dbpedia:Alberto_Bigon a dbont:Artist. 1 GET dbpedia:Alberto_Da_Zara a dbont:Artist. 1 GET dbpedia:Alberto_Gallo a dbont:Artist. 0 GET dbpedia:Alberto_Bigon a dbont:Artist. 1 GET ?artist a dbont:Artist. Approx.MembershipFilt. GET …
  • 29. We evaluated for request count, server cost and speedup in a Web setting. BloomFilter: MurMurHash3, GCS: FNV-1 1 HTTP Cache with 1 Mbps p = 1/1024 (0.1%) , 1/128 (1%), 1/64 (1.6%) 250 queries from 125 diverse WatDiv templates on Amazon EC2 machine WatDiv 100M triples dataset Timeout: 3min
  • 30. We evaluated for request count, server cost and speedup in a Web setting. vs. vanilla TPF server & client Original “greedy” algorithm
 Optimized join-tree algorithm* 250 queries from 125 diverse WatDiv templates on Amazon EC2 machine * Van Herwegen, et. al.: Query Execution Optimization for Clients of Triple Patterns Fragments. ESWC2015 2 client algorithms:
  • 31. > 50% of the queries has fewer requests,
 < 20% has more requests. Greedy Bloom Greedy GCS Optimized Bloom Optimized GCS Percentage of queries (p = 1/1024) 0% 25% 50% 75% 100% 6% 5% 18% 17% 59% 62% 49% 50% 35% 33% 33% 32% Equal Fewer Requests More Requests
  • 32. Queries with relatively many HTTP req. (45,000+ / query) benefit greatly Differencein#Requests 0 4,000 8,000 12,000 16,000 Fewer Requests More Requests Greedy Bloom Greedy GCS Optimized Bloom Optimized GCS < 35
  • 33. No queries have reduction in execution time, a third even has increase. Greedy Bloom Greedy GCS Optimized Bloom Optimized GCS Percentage of queries (p = 1/1024) 0% 25% 50% 75% 100% 16% 31% 33% 38%0% 84% 69% 67% 62% Equal Lower Execution time Higher Execution time
  • 34. Server remains low-cost, as impact is 
 very acceptable (< 6%). CPU(%) 0 7.5 15 22.5 30 O riginal Bloom (1/1024) Bloom (1/128) Bloom (1/64) G CS (1/1024) G CS (1/128) G CS (1/64) 11.110.810.2 14.9 11.210.8 9.2
  • 35. Linked Data Fragments: a uniform view
 on publishing Linked Data Exploring the axis: selector and metadata Approximate Membership Metadata Querying through Approximate Membership Metadata Opportunistic Querying
  • 36. During execution, a result candidate could already be correct (1 - p). Can we be opportunistic here, and temporarily allow imprecise results?
  • 37. “Can we reduce the time to 100% recall?” Opportunistic Linked Data Querying 13 only allow certain results temporarily allow uncertain results start execution start execution 1st result computed 1st result computed n < r results computed n < r results computed r results computed r results computed r + f results computed 0% recall 100% recall 100% recall 100% precision Fig. 2. This SPARQL query execution timeline compares regular and opportunistic query execution, assuming r total query results and f false positives. Note how both approaches achieve 100% recall and precision at a shared point in the end, but there exists a period during which only opportunistic execution reaches 100% recall (shaded). need to be discarded. The user thus sees the photos faster than if they had only been retrieved after full precision was achieved. This example
  • 38. Temporarily allowing <100% precision 
 can reduce 100% recall time with 1/3. Executiontime(s) 0 35 70 105 140 Greedy + Bloom (p = 1/1024) 100% Recall 100% Precision Number of revoked results was 0 or 1.
  • 39. Linked Data Fragments: a uniform view
 on publishing Linked Data Exploring the axis: selector and metadata Approximate Membership Metadata Querying through Approximate Membership Metadata Opportunistic Querying
  • 40. For some queries types, bandwidth highly decreases for TPF query execution. Approximate Membership Metadata 
 is a nuanced debate For larger fragments, realtime computation hurts execution time. We expect gain with 
 pre-caching and out-of-band delivery. Opportunistic querying is a promising direction for further exploration.
  • 41. TRIPLE PATTERN fragments data APPR. MEM. FILT. No one size fits all, explore the axis.
 Find metrics that fit your use-case. Client & Server load
 Request & Response size
 Protocol (HTTP) impact
 … Try you own trade-off server at our demo (and get a nice cup of coffee). Start serving Linked Data like a barista
  • 42. Opportunistic Linked Data Querying through Approximate Membership Metadata Miel Vander Sande