SlideShare a Scribd company logo
1 of 15
ACQUA:
APPROXIMATE CONTINUOUS
QUERY ANSWERING
OVER STREAMS AND
DYNAMIC LINKED DATA SETS
Emanuele Della Valle
DEIB - Politecnico of Milano
http://emanueledellavalle.org
@manudellavalle
Schloss Dagstuhl, Germany - 26 June 2017
Stream Processing in Nutshell
Stream Processing Engine
ResultsWindows
Stream data Register query
once and execute it
continuously
2Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
Web Stream Processing
Web
Results
Join
WindowsWeb Streams Linked Data
 High Latency
 Rate Limits
 Loosing Reactiveness
3
Stream Processing Engine
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
RDF Stream Processing (RSP) EngineRSPengine
Web
Results
Join
WindowsRDF Streams SPARQL endpoint
4Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
An example
The cloth brand ACME wants to persuade influential Social
Networks users to post commercial endorsements.
Every minute give me the ID of the users that are mentioned on
Social Network in the last 10 minutes whose number of followers is
greater than 100,000.
5
REGISTER STREAM <:InfluencersToContact> AS
CONSTRUCT {?user a :influentialUser}
FROM NAMED WINDOW W ON S [RANGE 10m STEP 1m]
WHERE {
WINDOW W {?user :hasMentions ?mentionsNumber}
SERVICE BKG {?user :hasFollowers ?followerCount }
FILTER (?followerCount > 100,000)
}
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
Problem DefinitionRSPengine
Web
Results
Join
WindowsRDF Streams
Define Refresh
Budget to
control
reactiveness
6
Data become stale
if not refreshed
Correct vs
approximate
answer
SPARQL endpoint
Local
Replica
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
Problem DefinitionRSPengine
Web
Results
Join
WindowsRDF Streams SPARQL endpoint
7
Local
Replica
Maintenance
Policy
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
ACQUA approach
8
WINDOW clause
Stream data
JOIN Proposer Ranker
Maintainer
2
3
1
SERVICE clause
AQCUA: without FILTER
AQCUA.F: with FILTER Clause
E
C
ACQUA [2]
RND
LRU
WBM
ACQUA.F [3]
Filter Update Policy
RND.F
LRU.F
WBM.F
ACQUA.F+/* [5]
LRU.F+
WBM.F+
WBM.F*
Candidate set
Elected set: top γ mappings
of Candidate set
Local Replica
WSJ: Filter out mappings
that are not involved in
current evaluation
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
Where to read about ACQUA
1. Soheila Dehghanzadeh, Alessandra Mileo, Daniele Dell'Aglio,
Emanuele Della Valle, Shen Gao, Abraham Bernstein: Online View
Maintenance for Continuous Query Evaluation. WWW (Companion
Volume) 2015: 25-26
2. Soheila Dehghanzadeh, Daniele Dell'Aglio, Shen Gao, Emanuele Della
Valle, Alessandra Mileo, Abraham Bernstein: Approximate
Continuous Query Answering over Streams and Dynamic Linked
Data Sets. ICWE 2015: 307-325
3. Shima Zahmatkesh, Emanuele Della Valle, Daniele Dell'Aglio:
When a FILTER Makes the Difference in Continuously Answering
SPARQL Queries on Streaming and Quasi-Static Linked Data.
ICWE 2016: 299-316
4. Shen Gao, Daniele Dell'Aglio, Soheila Dehghanzadeh, Abraham
Bernstein, Emanuele Della Valle, Alessandra Mileo: Planning Ahead:
Stream-Driven Linked-Data Access Under Update-Budget
Constraints. International Semantic Web Conference (1) 2016: 252-
270
5. Shima Zahmatkesh, Emanuele Della Valle, Daniele Dell'Aglio: Using
Rank Aggregation in Continuously Answering SPARQL Queries on
Streaming and Quasi-static Linked Data. DEBS 2017: 170-179
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 9
Experimental Results
10
WorstBest
Performance
Experiment Dimension
For high selectivity
Filter Update Policy
is better than WBM
For low selectivity
WBM is better than
Filter Update Policy
Comparable to
Best Result
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
Future works
• Broaden the class of queries
• Multiple filtering
• Filtering condition formulated as a ranking clause
• Pushing the FILTER clause into the SERVICE clause and
considering caching instead of local replica
• Study the effect of different trends in the data
11Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
ACQUA IN THE
STREAM REASONING
CONTEXT
Annex
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 12
Stream Reasoning in a nutshell
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 13
Tame data Variety and Velocity simultaneously
Traditional StreamReasoning
Tame data Variety and Velocity simultaneously
Traditional StreamReasoning
Stream Reasoning in a nutshell
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 14
Tame data Variety and Velocity simultaneously without forgetting volume
Traditional StreamReasoning
What if the analysis
includes also
data "at rest"?
What if the data "at rest"
are massive and
slowly evolving?
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 15
Stream Reasoning in a nutshell
ACQUA

More Related Content

Similar to ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked Data Sets

Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming ...
Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming ...Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming ...
Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming ...Shima Zahmatkesh
 
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...Emanuele Della Valle
 
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache SparkData-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache SparkDatabricks
 
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...Hong-Linh Truong
 
Optimized Couchbase Data Management
Optimized Couchbase Data ManagementOptimized Couchbase Data Management
Optimized Couchbase Data ManagementImanis Data
 
Oracle Coherence: in-memory datagrid
Oracle Coherence: in-memory datagridOracle Coherence: in-memory datagrid
Oracle Coherence: in-memory datagridEmiliano Pecis
 
Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath
Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, OathBig Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath
Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, OathYahoo Developer Network
 
AIM NIAC PNNL-SA-116502
AIM NIAC PNNL-SA-116502AIM NIAC PNNL-SA-116502
AIM NIAC PNNL-SA-116502Mark Greaves
 
070416 Egu Vienna Husar
070416 Egu Vienna Husar070416 Egu Vienna Husar
070416 Egu Vienna HusarRudolf Husar
 
Issues in AI product development and practices in audio applications
Issues in AI product development and practices in audio applicationsIssues in AI product development and practices in audio applications
Issues in AI product development and practices in audio applicationsTaesu Kim
 
A Recommendation Engine For Predicting Movie Ratings Using A Big Data Approach
A Recommendation Engine For Predicting Movie Ratings Using A Big Data ApproachA Recommendation Engine For Predicting Movie Ratings Using A Big Data Approach
A Recommendation Engine For Predicting Movie Ratings Using A Big Data ApproachFelicia Clark
 
SAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSUSE Italy
 
Digital Transformation Journey
Digital Transformation JourneyDigital Transformation Journey
Digital Transformation JourneyClayton Pyne
 
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...Amazon Web Services
 
Relevant Query Answering on Dynamic and Distributed Datasets
Relevant Query Answering on Dynamic and Distributed DatasetsRelevant Query Answering on Dynamic and Distributed Datasets
Relevant Query Answering on Dynamic and Distributed DatasetsShima Zahmatkesh
 
ESWC2015 - Query Optimization for Clients of Linked Data Fragments
ESWC2015 - Query Optimization for Clients of Linked Data FragmentsESWC2015 - Query Optimization for Clients of Linked Data Fragments
ESWC2015 - Query Optimization for Clients of Linked Data FragmentsJoachim Van Herwegen
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs3 Round Stones
 
Stream Reasoning: a summary of ten years of research and a vision for the nex...
Stream Reasoning: a summary of ten years of research and a vision for the nex...Stream Reasoning: a summary of ten years of research and a vision for the nex...
Stream Reasoning: a summary of ten years of research and a vision for the nex...Emanuele Della Valle
 
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...Amazon Web Services
 

Similar to ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked Data Sets (20)

Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming ...
Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming ...Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming ...
Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming ...
 
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
 
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache SparkData-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
 
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
 
Optimized Couchbase Data Management
Optimized Couchbase Data ManagementOptimized Couchbase Data Management
Optimized Couchbase Data Management
 
Oracle Coherence: in-memory datagrid
Oracle Coherence: in-memory datagridOracle Coherence: in-memory datagrid
Oracle Coherence: in-memory datagrid
 
Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath
Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, OathBig Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath
Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath
 
AIM NIAC PNNL-SA-116502
AIM NIAC PNNL-SA-116502AIM NIAC PNNL-SA-116502
AIM NIAC PNNL-SA-116502
 
070416 Egu Vienna Husar
070416 Egu Vienna Husar070416 Egu Vienna Husar
070416 Egu Vienna Husar
 
Issues in AI product development and practices in audio applications
Issues in AI product development and practices in audio applicationsIssues in AI product development and practices in audio applications
Issues in AI product development and practices in audio applications
 
A Recommendation Engine For Predicting Movie Ratings Using A Big Data Approach
A Recommendation Engine For Predicting Movie Ratings Using A Big Data ApproachA Recommendation Engine For Predicting Movie Ratings Using A Big Data Approach
A Recommendation Engine For Predicting Movie Ratings Using A Big Data Approach
 
SAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service Platform
 
Digital Transformation Journey
Digital Transformation JourneyDigital Transformation Journey
Digital Transformation Journey
 
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
 
Relevant Query Answering on Dynamic and Distributed Datasets
Relevant Query Answering on Dynamic and Distributed DatasetsRelevant Query Answering on Dynamic and Distributed Datasets
Relevant Query Answering on Dynamic and Distributed Datasets
 
ESWC2015 - Query Optimization for Clients of Linked Data Fragments
ESWC2015 - Query Optimization for Clients of Linked Data FragmentsESWC2015 - Query Optimization for Clients of Linked Data Fragments
ESWC2015 - Query Optimization for Clients of Linked Data Fragments
 
Smac
SmacSmac
Smac
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs
 
Stream Reasoning: a summary of ten years of research and a vision for the nex...
Stream Reasoning: a summary of ten years of research and a vision for the nex...Stream Reasoning: a summary of ten years of research and a vision for the nex...
Stream Reasoning: a summary of ten years of research and a vision for the nex...
 
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
 

More from Emanuele Della Valle

Taming velocity - a tale of four streams
Taming velocity - a tale of four streamsTaming velocity - a tale of four streams
Taming velocity - a tale of four streamsEmanuele Della Valle
 
Work in progress on Inductive Stream Reasoning
Work in progress on Inductive Stream ReasoningWork in progress on Inductive Stream Reasoning
Work in progress on Inductive Stream ReasoningEmanuele Della Valle
 
Knowledge graphs in search engines
Knowledge graphs in search enginesKnowledge graphs in search engines
Knowledge graphs in search enginesEmanuele Della Valle
 
La città dei balocchi 2017 in numeri - Fluxedo
La città dei balocchi 2017 in numeri - FluxedoLa città dei balocchi 2017 in numeri - Fluxedo
La città dei balocchi 2017 in numeri - FluxedoEmanuele Della Valle
 
Big Data: how to use it to create value
Big Data: how to use it to create valueBig Data: how to use it to create value
Big Data: how to use it to create valueEmanuele Della Valle
 
IST16-01 - Introduction to Interoperability and Semantic Technologies
IST16-01 - Introduction to Interoperability and Semantic TechnologiesIST16-01 - Introduction to Interoperability and Semantic Technologies
IST16-01 - Introduction to Interoperability and Semantic TechnologiesEmanuele Della Valle
 
Social listener-brera-design-district-2015-03
Social listener-brera-design-district-2015-03Social listener-brera-design-district-2015-03
Social listener-brera-design-district-2015-03Emanuele Della Valle
 
City Data Fusion for Event Management (in Italiano)
City Data Fusion for Event Management (in Italiano)City Data Fusion for Event Management (in Italiano)
City Data Fusion for Event Management (in Italiano)Emanuele Della Valle
 
Semantic technologies and Interoperability
Semantic technologies and InteroperabilitySemantic technologies and Interoperability
Semantic technologies and InteroperabilityEmanuele Della Valle
 
Big data: why, what, paradigm shifts enabled , tools and market landscape
Big data: why, what, paradigm shifts enabled , tools and market landscapeBig data: why, what, paradigm shifts enabled , tools and market landscape
Big data: why, what, paradigm shifts enabled , tools and market landscapeEmanuele Della Valle
 
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015Emanuele Della Valle
 
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...Emanuele Della Valle
 
City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...
City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...
City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...Emanuele Della Valle
 
On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks Emanuele Della Valle
 
twindex.fuorisalone.it - Social Listening of FUORISALONE 2013
twindex.fuorisalone.it  - Social Listening of FUORISALONE 2013twindex.fuorisalone.it  - Social Listening of FUORISALONE 2013
twindex.fuorisalone.it - Social Listening of FUORISALONE 2013Emanuele Della Valle
 
Order Matters! Harnessing a World of Orderings for Reasoning over Massive Data
Order Matters! Harnessing a World of Orderings for Reasoning over Massive DataOrder Matters! Harnessing a World of Orderings for Reasoning over Massive Data
Order Matters! Harnessing a World of Orderings for Reasoning over Massive DataEmanuele Della Valle
 
Stream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondStream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondEmanuele Della Valle
 
People Dimension in Software Projects
People Dimension in Software ProjectsPeople Dimension in Software Projects
People Dimension in Software ProjectsEmanuele Della Valle
 

More from Emanuele Della Valle (20)

Taming velocity - a tale of four streams
Taming velocity - a tale of four streamsTaming velocity - a tale of four streams
Taming velocity - a tale of four streams
 
Work in progress on Inductive Stream Reasoning
Work in progress on Inductive Stream ReasoningWork in progress on Inductive Stream Reasoning
Work in progress on Inductive Stream Reasoning
 
Big Data and Data Science W's
Big Data and Data Science W'sBig Data and Data Science W's
Big Data and Data Science W's
 
Knowledge graphs in search engines
Knowledge graphs in search enginesKnowledge graphs in search engines
Knowledge graphs in search engines
 
La città dei balocchi 2017 in numeri - Fluxedo
La città dei balocchi 2017 in numeri - FluxedoLa città dei balocchi 2017 in numeri - Fluxedo
La città dei balocchi 2017 in numeri - Fluxedo
 
Big Data: how to use it to create value
Big Data: how to use it to create valueBig Data: how to use it to create value
Big Data: how to use it to create value
 
Ist16-04 An introduction to RDF
Ist16-04 An introduction to RDF Ist16-04 An introduction to RDF
Ist16-04 An introduction to RDF
 
IST16-01 - Introduction to Interoperability and Semantic Technologies
IST16-01 - Introduction to Interoperability and Semantic TechnologiesIST16-01 - Introduction to Interoperability and Semantic Technologies
IST16-01 - Introduction to Interoperability and Semantic Technologies
 
Social listener-brera-design-district-2015-03
Social listener-brera-design-district-2015-03Social listener-brera-design-district-2015-03
Social listener-brera-design-district-2015-03
 
City Data Fusion for Event Management (in Italiano)
City Data Fusion for Event Management (in Italiano)City Data Fusion for Event Management (in Italiano)
City Data Fusion for Event Management (in Italiano)
 
Semantic technologies and Interoperability
Semantic technologies and InteroperabilitySemantic technologies and Interoperability
Semantic technologies and Interoperability
 
Big data: why, what, paradigm shifts enabled , tools and market landscape
Big data: why, what, paradigm shifts enabled , tools and market landscapeBig data: why, what, paradigm shifts enabled , tools and market landscape
Big data: why, what, paradigm shifts enabled , tools and market landscape
 
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
 
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
 
City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...
City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...
City Data Fusion: A Big Data Infrastructure to sense the pulse of the city in...
 
On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks
 
twindex.fuorisalone.it - Social Listening of FUORISALONE 2013
twindex.fuorisalone.it  - Social Listening of FUORISALONE 2013twindex.fuorisalone.it  - Social Listening of FUORISALONE 2013
twindex.fuorisalone.it - Social Listening of FUORISALONE 2013
 
Order Matters! Harnessing a World of Orderings for Reasoning over Massive Data
Order Matters! Harnessing a World of Orderings for Reasoning over Massive DataOrder Matters! Harnessing a World of Orderings for Reasoning over Massive Data
Order Matters! Harnessing a World of Orderings for Reasoning over Massive Data
 
Stream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondStream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and Beyond
 
People Dimension in Software Projects
People Dimension in Software ProjectsPeople Dimension in Software Projects
People Dimension in Software Projects
 

Recently uploaded

20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 

Recently uploaded (20)

20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 

ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked Data Sets

  • 1. ACQUA: APPROXIMATE CONTINUOUS QUERY ANSWERING OVER STREAMS AND DYNAMIC LINKED DATA SETS Emanuele Della Valle DEIB - Politecnico of Milano http://emanueledellavalle.org @manudellavalle Schloss Dagstuhl, Germany - 26 June 2017
  • 2. Stream Processing in Nutshell Stream Processing Engine ResultsWindows Stream data Register query once and execute it continuously 2Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 3. Web Stream Processing Web Results Join WindowsWeb Streams Linked Data  High Latency  Rate Limits  Loosing Reactiveness 3 Stream Processing Engine Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 4. RDF Stream Processing (RSP) EngineRSPengine Web Results Join WindowsRDF Streams SPARQL endpoint 4Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 5. An example The cloth brand ACME wants to persuade influential Social Networks users to post commercial endorsements. Every minute give me the ID of the users that are mentioned on Social Network in the last 10 minutes whose number of followers is greater than 100,000. 5 REGISTER STREAM <:InfluencersToContact> AS CONSTRUCT {?user a :influentialUser} FROM NAMED WINDOW W ON S [RANGE 10m STEP 1m] WHERE { WINDOW W {?user :hasMentions ?mentionsNumber} SERVICE BKG {?user :hasFollowers ?followerCount } FILTER (?followerCount > 100,000) } Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 6. Problem DefinitionRSPengine Web Results Join WindowsRDF Streams Define Refresh Budget to control reactiveness 6 Data become stale if not refreshed Correct vs approximate answer SPARQL endpoint Local Replica Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 7. Problem DefinitionRSPengine Web Results Join WindowsRDF Streams SPARQL endpoint 7 Local Replica Maintenance Policy Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 8. ACQUA approach 8 WINDOW clause Stream data JOIN Proposer Ranker Maintainer 2 3 1 SERVICE clause AQCUA: without FILTER AQCUA.F: with FILTER Clause E C ACQUA [2] RND LRU WBM ACQUA.F [3] Filter Update Policy RND.F LRU.F WBM.F ACQUA.F+/* [5] LRU.F+ WBM.F+ WBM.F* Candidate set Elected set: top γ mappings of Candidate set Local Replica WSJ: Filter out mappings that are not involved in current evaluation Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 9. Where to read about ACQUA 1. Soheila Dehghanzadeh, Alessandra Mileo, Daniele Dell'Aglio, Emanuele Della Valle, Shen Gao, Abraham Bernstein: Online View Maintenance for Continuous Query Evaluation. WWW (Companion Volume) 2015: 25-26 2. Soheila Dehghanzadeh, Daniele Dell'Aglio, Shen Gao, Emanuele Della Valle, Alessandra Mileo, Abraham Bernstein: Approximate Continuous Query Answering over Streams and Dynamic Linked Data Sets. ICWE 2015: 307-325 3. Shima Zahmatkesh, Emanuele Della Valle, Daniele Dell'Aglio: When a FILTER Makes the Difference in Continuously Answering SPARQL Queries on Streaming and Quasi-Static Linked Data. ICWE 2016: 299-316 4. Shen Gao, Daniele Dell'Aglio, Soheila Dehghanzadeh, Abraham Bernstein, Emanuele Della Valle, Alessandra Mileo: Planning Ahead: Stream-Driven Linked-Data Access Under Update-Budget Constraints. International Semantic Web Conference (1) 2016: 252- 270 5. Shima Zahmatkesh, Emanuele Della Valle, Daniele Dell'Aglio: Using Rank Aggregation in Continuously Answering SPARQL Queries on Streaming and Quasi-static Linked Data. DEBS 2017: 170-179 Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 9
  • 10. Experimental Results 10 WorstBest Performance Experiment Dimension For high selectivity Filter Update Policy is better than WBM For low selectivity WBM is better than Filter Update Policy Comparable to Best Result Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 11. Future works • Broaden the class of queries • Multiple filtering • Filtering condition formulated as a ranking clause • Pushing the FILTER clause into the SERVICE clause and considering caching instead of local replica • Study the effect of different trends in the data 11Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
  • 12. ACQUA IN THE STREAM REASONING CONTEXT Annex Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 12
  • 13. Stream Reasoning in a nutshell Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 13 Tame data Variety and Velocity simultaneously Traditional StreamReasoning
  • 14. Tame data Variety and Velocity simultaneously Traditional StreamReasoning Stream Reasoning in a nutshell Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 14
  • 15. Tame data Variety and Velocity simultaneously without forgetting volume Traditional StreamReasoning What if the analysis includes also data "at rest"? What if the data "at rest" are massive and slowly evolving? Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle 15 Stream Reasoning in a nutshell ACQUA