SlideShare a Scribd company logo
1 of 28
Download to read offline
YABench:
A Comprehensive Framework for RDF
Stream Processor Correctness and
Performance Assessment
Maxim Kolchin, Peter Wetz, Elmar Kiesling, A Min Tjoa
ITMO University, Russia | TU Wien, Austria
The 16th International Conference on Web Engineering 2016, Lugano, Switzerland
RDF Stream Processing (RSP)
RDF Stream - a potentially infinite sequence of time-varying
data elements encoded in RDF
Continuous query - a query registered over streams that in
most cases are observed through windows
Query results - similarly to SPARQL they can be tuples, RDF
dataset or a new RDF Stream
2
State of the art
■ LSBench (2012)
■ SRBench (2012)
■ CSRBench (2013)
■ CityBench (2015)
Details can be found at W3C RSP Community Group’s Wiki: https://www.w3.
org/community/rsp/wiki/RSP_Benchmarking
3
Our contribution
■ We propose a benchmarking framework for RDF Stream Processing
engines that focuses on correctness and performance
○ Stream generator (generates configurable RDF stream)
○ Oracle (validates correctness of the results)
○ Runner (measures performance of an RSP engine)
■ We run a benchmark with the window-based RDF stream processing
engines:
○ C-SPARQL
○ CQELS
4
Requirements
■ Scalable and configurable input
■ Comprehensive correctness checking
■ Flexible queries
■ Reproducibility
5
Architecture
1. Define tests,
2. Generate data streams,
3. Run the tests with a given
engine,
a. Performance metrics
are collected in a
separate process,
4. At the end validate the
results with the oracle.
6
Architecture: Reporting tool
7
Validation against CSRBench
We validated the correctness checking functionality of YABench by
reproducing the CSRBench* benchmark.
CSRBench defines 7 queries for C-SPARQL, CQELS and SPARQLstream
engines.
Datasets, test configurations and results are available online: github.
com/YABench/csrbench-validation
*Daniele Dell’Aglio, et al. “On Correctness in RDF Stream Processor Benchmarking”, 2013
8
Validation against CSRBench (C-SPARQL)
Query
C-SPARQL
CSRBench YABench
Q1 ✓ ✓
Q2 ✓ ✓
Q3 ✓ ✓*
Q4 ✓ ✓
Q5 ✗ ✗
Q6 ✓ ✓*
Q7 ✓ ✓*
* - the results are the same, but
because of timing discrepancies
some results sometimes present in
the subsequent window
9
Validation against CSRBench (CQELS)
Query
CQELS
CSRBench YABench
Q1 ✓ ✓
Q2 ✓ ✓
Q3 ✓ ✓
Q4 ✗ ✗
Q5 ✓ ✓
Q6 ✗ ✗
Q7 ✗ ✗**
** - indicates that the query did not
execute successfully on the CQELS
engine. The engine crashed before
returning the query results
10
Benchmark
We reuse queries introduced by CSRBench, but we’re able to parametrize them, e.
g. window size, window slide, filter values, etc.
Measure:
- Precision and recall,
- Window and result size, and delay,
- Memory and CPU usage, # of threads
We run each test 10 times, to compute the distribution of precision/recall.
Detailed results are available online: github.com/YABench/yabench-one
11
Benchmark: Data Stream Model
A data stream is generated
based on:
■ Number of weather
stations,
■ Time interval between
two observations of a
single station,
■ Duration of the stream,
■ A seed for the
randomize function
12
Benchmark: Queries
Experiment 1: SELECT + FILTER
Experiment 2: SELECT + AVG + FILTER
Experiment 3: joining of triples from different timestamps
Experiment 4: demonstrates the use of gracious mode which implemented by the
oracle to eliminate the timing discrepancy issues of the engines
13
Experiment 1 (precision/recall): 50 stations
14
Experiment 1 (precision/recall): 1000 stations
15
Experiment 1 (precision/recall): 10000 stations
16
Experiment 1 (memory usage): 50 stations
17
Experiment 1 (memory usage): 1000 stations
18
Experiment 1 (memory usage): 10000 stations
19
Experiment 1 (delay): 50 stations
20
Experiment 1 (delay): 1000 stations
21
Experiment 1 (delay): 10000 stations
22
Experiment 1 (C-SPARQL): delay vs result size
23
Architecture: Gracious mode
In this mode the oracle tries to adjusts its window scope to match the scope of an
actual window, by moving the left and right borders to back and/or forth while the
precision and recall grows.
It allows to:
(a) confirm our assumption on why precision and recall are low,
(b) reconstruct and visualize the actual window borders
24
Experiment 4: gracious vs non-gracious modes
(a) In non-gracious (default) mode (b) In gracious mode
C-SPARQL
25
Experiment 4: gracious vs non-gracious modes
(a) In non-gracious (default) mode (b) In gracious mode
CQELS
26
Conclusion
■ We build a framework for benchmarking RSP engines which allows to assess
their correctness and performance
■ We run a benchmark which revealed some insides:
○ CQELS shows better precision/recall for simple queries,
○ C-SPARQL is slightly more memory efficient than CQELS,
○ C-SPARQL outperformes CQELS in terms of delay for more complex queries, which is mainly
caused by a different reporting strategy
■ By introducing gracious mode we’re able to estimate the extent of the timing
discrepancy
27
Thank you!
github.com/YABench

More Related Content

What's hot

State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...Paris Carbone
 
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...babak hosseini
 
Christian Moscardi Presentation
Christian Moscardi PresentationChristian Moscardi Presentation
Christian Moscardi PresentationJoseph Chow
 
Poster Version 1.7
Poster Version 1.7Poster Version 1.7
Poster Version 1.7Dewey Potts
 
Apache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
Apache Flink Training Workshop @ HadoopCon2016 - #1 System OverviewApache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
Apache Flink Training Workshop @ HadoopCon2016 - #1 System OverviewApache Flink Taiwan User Group
 

What's hot (6)

State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
State Management in Apache Flink : Consistent Stateful Distributed Stream Pro...
 
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
Large-Margin Multiple Kernel Learning for Discriminative Features Selection a...
 
Christian Moscardi Presentation
Christian Moscardi PresentationChristian Moscardi Presentation
Christian Moscardi Presentation
 
0xdata H2O Podcast
0xdata H2O Podcast0xdata H2O Podcast
0xdata H2O Podcast
 
Poster Version 1.7
Poster Version 1.7Poster Version 1.7
Poster Version 1.7
 
Apache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
Apache Flink Training Workshop @ HadoopCon2016 - #1 System OverviewApache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
Apache Flink Training Workshop @ HadoopCon2016 - #1 System Overview
 

Viewers also liked

NICE Master Class final presentation 25 11 14 (including workshops)
NICE Master Class final presentation 25 11 14 (including workshops)NICE Master Class final presentation 25 11 14 (including workshops)
NICE Master Class final presentation 25 11 14 (including workshops)NEQOS
 
semantic and social (intra)webs
semantic and social (intra)webssemantic and social (intra)webs
semantic and social (intra)websFabien Gandon
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...Oscar Corcho
 
Nice Tag Ontology : modeling tags as RDF named graphs
Nice Tag Ontology : modeling tags as RDF named graphsNice Tag Ontology : modeling tags as RDF named graphs
Nice Tag Ontology : modeling tags as RDF named graphsFabien Gandon
 
Speech Technology and Big Data
Speech Technology and Big DataSpeech Technology and Big Data
Speech Technology and Big DataEUCLID project
 
L avenir du web au prisme de la resource
L avenir du web au prisme de la resourceL avenir du web au prisme de la resource
L avenir du web au prisme de la resourceFabien Gandon
 
Data Science Curriculum for Professionals
Data Science Curriculum for ProfessionalsData Science Curriculum for Professionals
Data Science Curriculum for ProfessionalsEUCLID project
 

Viewers also liked (8)

NICE Master Class final presentation 25 11 14 (including workshops)
NICE Master Class final presentation 25 11 14 (including workshops)NICE Master Class final presentation 25 11 14 (including workshops)
NICE Master Class final presentation 25 11 14 (including workshops)
 
semantic and social (intra)webs
semantic and social (intra)webssemantic and social (intra)webs
semantic and social (intra)webs
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
 
Nice Tag Ontology : modeling tags as RDF named graphs
Nice Tag Ontology : modeling tags as RDF named graphsNice Tag Ontology : modeling tags as RDF named graphs
Nice Tag Ontology : modeling tags as RDF named graphs
 
Speech Technology and Big Data
Speech Technology and Big DataSpeech Technology and Big Data
Speech Technology and Big Data
 
Essai Takahashi
Essai TakahashiEssai Takahashi
Essai Takahashi
 
L avenir du web au prisme de la resource
L avenir du web au prisme de la resourceL avenir du web au prisme de la resource
L avenir du web au prisme de la resource
 
Data Science Curriculum for Professionals
Data Science Curriculum for ProfessionalsData Science Curriculum for Professionals
Data Science Curriculum for Professionals
 

Similar to YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment

On the need for applications aware adaptive middleware in real-time RDF data ...
On the need for applications aware adaptive middleware in real-time RDF data ...On the need for applications aware adaptive middleware in real-time RDF data ...
On the need for applications aware adaptive middleware in real-time RDF data ...Zia Ush Shamszaman
 
Clipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving SystemClipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving SystemDatabricks
 
On correctness in RDF stream processor benchmarking
On correctness in RDF stream processor benchmarkingOn correctness in RDF stream processor benchmarking
On correctness in RDF stream processor benchmarkingDaniele Dell'Aglio
 
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Databricks
 
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5Jeff Larkin
 
FlinkML: Large Scale Machine Learning with Apache Flink
FlinkML: Large Scale Machine Learning with Apache FlinkFlinkML: Large Scale Machine Learning with Apache Flink
FlinkML: Large Scale Machine Learning with Apache FlinkTheodoros Vasiloudis
 
AHF_IDETC_2011_Jie
AHF_IDETC_2011_JieAHF_IDETC_2011_Jie
AHF_IDETC_2011_JieMDO_Lab
 
Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017Dan Crankshaw
 
Interface for Performance Environment Autoconfiguration Framework
Interface for Performance Environment Autoconfiguration FrameworkInterface for Performance Environment Autoconfiguration Framework
Interface for Performance Environment Autoconfiguration FrameworkLiang Men
 
OpenACC Monthly Highlights: September 2021
OpenACC Monthly Highlights: September 2021OpenACC Monthly Highlights: September 2021
OpenACC Monthly Highlights: September 2021OpenACC
 
Big Data, Bigger Analytics
Big Data, Bigger AnalyticsBig Data, Bigger Analytics
Big Data, Bigger AnalyticsItzhak Kameli
 
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?Markus Michalewicz
 
Analytics DB Benchmark
Analytics DB BenchmarkAnalytics DB Benchmark
Analytics DB BenchmarkDenny Marton
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splines
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splinesOptimize Single Particle Orbital (SPO) Evaluations Based on B-splines
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splinesIntel® Software
 
Technical_Report_on_ML_Library
Technical_Report_on_ML_LibraryTechnical_Report_on_ML_Library
Technical_Report_on_ML_LibrarySaurabh Chauhan
 

Similar to YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment (20)

On the need for applications aware adaptive middleware in real-time RDF data ...
On the need for applications aware adaptive middleware in real-time RDF data ...On the need for applications aware adaptive middleware in real-time RDF data ...
On the need for applications aware adaptive middleware in real-time RDF data ...
 
Clipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving SystemClipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving System
 
On correctness in RDF stream processor benchmarking
On correctness in RDF stream processor benchmarkingOn correctness in RDF stream processor benchmarking
On correctness in RDF stream processor benchmarking
 
System mldl meetup
System mldl meetupSystem mldl meetup
System mldl meetup
 
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
 
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
GTC16 - S6410 - Comparing OpenACC 2.5 and OpenMP 4.5
 
FlinkML: Large Scale Machine Learning with Apache Flink
FlinkML: Large Scale Machine Learning with Apache FlinkFlinkML: Large Scale Machine Learning with Apache Flink
FlinkML: Large Scale Machine Learning with Apache Flink
 
AHF_IDETC_2011_Jie
AHF_IDETC_2011_JieAHF_IDETC_2011_Jie
AHF_IDETC_2011_Jie
 
Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017
 
Redux vs GraphQL
Redux vs GraphQLRedux vs GraphQL
Redux vs GraphQL
 
Interface for Performance Environment Autoconfiguration Framework
Interface for Performance Environment Autoconfiguration FrameworkInterface for Performance Environment Autoconfiguration Framework
Interface for Performance Environment Autoconfiguration Framework
 
OpenACC Monthly Highlights: September 2021
OpenACC Monthly Highlights: September 2021OpenACC Monthly Highlights: September 2021
OpenACC Monthly Highlights: September 2021
 
Big Data, Bigger Analytics
Big Data, Bigger AnalyticsBig Data, Bigger Analytics
Big Data, Bigger Analytics
 
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
AskTom: How to Make and Test Your Application "Oracle RAC Ready"?
 
Analytics DB Benchmark
Analytics DB BenchmarkAnalytics DB Benchmark
Analytics DB Benchmark
 
Ikc 2015
Ikc 2015Ikc 2015
Ikc 2015
 
Smallsat 2021
Smallsat 2021Smallsat 2021
Smallsat 2021
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splines
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splinesOptimize Single Particle Orbital (SPO) Evaluations Based on B-splines
Optimize Single Particle Orbital (SPO) Evaluations Based on B-splines
 
Technical_Report_on_ML_Library
Technical_Report_on_ML_LibraryTechnical_Report_on_ML_Library
Technical_Report_on_ML_Library
 

Recently uploaded

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 

Recently uploaded (20)

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 

YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment

  • 1. YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment Maxim Kolchin, Peter Wetz, Elmar Kiesling, A Min Tjoa ITMO University, Russia | TU Wien, Austria The 16th International Conference on Web Engineering 2016, Lugano, Switzerland
  • 2. RDF Stream Processing (RSP) RDF Stream - a potentially infinite sequence of time-varying data elements encoded in RDF Continuous query - a query registered over streams that in most cases are observed through windows Query results - similarly to SPARQL they can be tuples, RDF dataset or a new RDF Stream 2
  • 3. State of the art ■ LSBench (2012) ■ SRBench (2012) ■ CSRBench (2013) ■ CityBench (2015) Details can be found at W3C RSP Community Group’s Wiki: https://www.w3. org/community/rsp/wiki/RSP_Benchmarking 3
  • 4. Our contribution ■ We propose a benchmarking framework for RDF Stream Processing engines that focuses on correctness and performance ○ Stream generator (generates configurable RDF stream) ○ Oracle (validates correctness of the results) ○ Runner (measures performance of an RSP engine) ■ We run a benchmark with the window-based RDF stream processing engines: ○ C-SPARQL ○ CQELS 4
  • 5. Requirements ■ Scalable and configurable input ■ Comprehensive correctness checking ■ Flexible queries ■ Reproducibility 5
  • 6. Architecture 1. Define tests, 2. Generate data streams, 3. Run the tests with a given engine, a. Performance metrics are collected in a separate process, 4. At the end validate the results with the oracle. 6
  • 8. Validation against CSRBench We validated the correctness checking functionality of YABench by reproducing the CSRBench* benchmark. CSRBench defines 7 queries for C-SPARQL, CQELS and SPARQLstream engines. Datasets, test configurations and results are available online: github. com/YABench/csrbench-validation *Daniele Dell’Aglio, et al. “On Correctness in RDF Stream Processor Benchmarking”, 2013 8
  • 9. Validation against CSRBench (C-SPARQL) Query C-SPARQL CSRBench YABench Q1 ✓ ✓ Q2 ✓ ✓ Q3 ✓ ✓* Q4 ✓ ✓ Q5 ✗ ✗ Q6 ✓ ✓* Q7 ✓ ✓* * - the results are the same, but because of timing discrepancies some results sometimes present in the subsequent window 9
  • 10. Validation against CSRBench (CQELS) Query CQELS CSRBench YABench Q1 ✓ ✓ Q2 ✓ ✓ Q3 ✓ ✓ Q4 ✗ ✗ Q5 ✓ ✓ Q6 ✗ ✗ Q7 ✗ ✗** ** - indicates that the query did not execute successfully on the CQELS engine. The engine crashed before returning the query results 10
  • 11. Benchmark We reuse queries introduced by CSRBench, but we’re able to parametrize them, e. g. window size, window slide, filter values, etc. Measure: - Precision and recall, - Window and result size, and delay, - Memory and CPU usage, # of threads We run each test 10 times, to compute the distribution of precision/recall. Detailed results are available online: github.com/YABench/yabench-one 11
  • 12. Benchmark: Data Stream Model A data stream is generated based on: ■ Number of weather stations, ■ Time interval between two observations of a single station, ■ Duration of the stream, ■ A seed for the randomize function 12
  • 13. Benchmark: Queries Experiment 1: SELECT + FILTER Experiment 2: SELECT + AVG + FILTER Experiment 3: joining of triples from different timestamps Experiment 4: demonstrates the use of gracious mode which implemented by the oracle to eliminate the timing discrepancy issues of the engines 13
  • 16. Experiment 1 (precision/recall): 10000 stations 16
  • 17. Experiment 1 (memory usage): 50 stations 17
  • 18. Experiment 1 (memory usage): 1000 stations 18
  • 19. Experiment 1 (memory usage): 10000 stations 19
  • 20. Experiment 1 (delay): 50 stations 20
  • 21. Experiment 1 (delay): 1000 stations 21
  • 22. Experiment 1 (delay): 10000 stations 22
  • 23. Experiment 1 (C-SPARQL): delay vs result size 23
  • 24. Architecture: Gracious mode In this mode the oracle tries to adjusts its window scope to match the scope of an actual window, by moving the left and right borders to back and/or forth while the precision and recall grows. It allows to: (a) confirm our assumption on why precision and recall are low, (b) reconstruct and visualize the actual window borders 24
  • 25. Experiment 4: gracious vs non-gracious modes (a) In non-gracious (default) mode (b) In gracious mode C-SPARQL 25
  • 26. Experiment 4: gracious vs non-gracious modes (a) In non-gracious (default) mode (b) In gracious mode CQELS 26
  • 27. Conclusion ■ We build a framework for benchmarking RSP engines which allows to assess their correctness and performance ■ We run a benchmark which revealed some insides: ○ CQELS shows better precision/recall for simple queries, ○ C-SPARQL is slightly more memory efficient than CQELS, ○ C-SPARQL outperformes CQELS in terms of delay for more complex queries, which is mainly caused by a different reporting strategy ■ By introducing gracious mode we’re able to estimate the extent of the timing discrepancy 27