Second RSP workshop co-located with the 17th Extended Seamntic Web Conference.
This is a joint work from Marco Balduini, Emanuele Della Valle at Riccardo Tommasini DEIB, Politecnico of Milano, Milano, Italy.
A Hierarchical approach towards Efficient and Expressive Stream ReasoningRiccardo Tommasini
Abstract. Many approaches have been proposed for Stream Reasoning (SR). Some of them combine information flow processing (IFP) tech- niques and semantic technologies to make sense in real-time of noisy, vast and heterogeneous data streams that come from complex domains. More recent works shown the presence of a trade-off between through- put and reasoning expressiveness. Indeed, systems with IFP-like perfor- mance are not really expressive (e.g. up to an RDFS subset) and vice versa. For static data, Information Integration (II) systems approached the problem already. The idea consists in spreading the reasoning com- plexity over different layers of an hierarchical architecture and treating it where it is easier to do. Is it possible realize an expressive and efficient stream reasoning (E2SR), by defining a hierarchical approach that adapts II techniques to the streaming scenario? In this paper, I discuss my plan towards E2SR, the intuition of adapting Information Integration tech- niques to the streaming scenario and the need of Stream Reasoning of comparative analysis to support its technological progress.
Heaven: A Framework for Systematic Comparative Research Approach for RSP EnginesRiccardo Tommasini
Benchmarks like LSBench, SRBench, CSRBench and, more recently, CityBench satisfy the growing need of shared datasets, ontologies and queries to evaluate window-based RDF Stream Processing (RSP) engines. However, no clear winner emerges out of the evaluation. In this paper, we claim that the RSP community needs to adopt a Systematic Comparative Research Approach (SCRA) if it wants to move a step forward. To this end, we propose a framework that enables SCRA for window based RSP engines. The contributions of this paper are: (i) the requirements to satisfy for tools that aim at enabling SCRA; (ii) the architecture of a facility to design and execute experiment guaranteeing repeatability, reproducibility and comparability; (iii) Heaven – a proof of concept implementation of such architecture that we released as open source –; (iv) two RSP engine implementations, also open source, that we propose as baselines for the comparative research (i.e., they can serve as terms of comparison in future works). We prove Heaven effectiveness using the baselines by: (i) showing that top-down hypothesis verification is not straight forward even in controlled conditions and (ii) providing examples of bottom-up comparative analysis.
With the advancement of new ICT platform coupled with the lower cost of digital infrastructure from connectivity to cloud-based infrastructure and applications software, technology such as Mobile, IoT, Robotics, AI and foreseeable Quantum computing will impact all industries, disrupting many traditional businesses and hence redefine the way we work, live and commute.
There will be many opportunities for the entrepreneurs and research communities to develop new products and services to meet the demand of the business and consumers.
The impact on human capital is inevitable as many traditional jobs will be automated and the capability to learn, unlearn and relearn to stay relevant in the future economy is the key to sustainability.
Singapore, being a small and young economy, needs to move fast and stay ahead of the rest to remain viable. The latest policy and national initiatives based on the recommendation of Committee of Future Economy (CFE) will provide the platform to transform the economy in the next few decades.
A Hierarchical approach towards Efficient and Expressive Stream ReasoningRiccardo Tommasini
Abstract. Many approaches have been proposed for Stream Reasoning (SR). Some of them combine information flow processing (IFP) tech- niques and semantic technologies to make sense in real-time of noisy, vast and heterogeneous data streams that come from complex domains. More recent works shown the presence of a trade-off between through- put and reasoning expressiveness. Indeed, systems with IFP-like perfor- mance are not really expressive (e.g. up to an RDFS subset) and vice versa. For static data, Information Integration (II) systems approached the problem already. The idea consists in spreading the reasoning com- plexity over different layers of an hierarchical architecture and treating it where it is easier to do. Is it possible realize an expressive and efficient stream reasoning (E2SR), by defining a hierarchical approach that adapts II techniques to the streaming scenario? In this paper, I discuss my plan towards E2SR, the intuition of adapting Information Integration tech- niques to the streaming scenario and the need of Stream Reasoning of comparative analysis to support its technological progress.
Heaven: A Framework for Systematic Comparative Research Approach for RSP EnginesRiccardo Tommasini
Benchmarks like LSBench, SRBench, CSRBench and, more recently, CityBench satisfy the growing need of shared datasets, ontologies and queries to evaluate window-based RDF Stream Processing (RSP) engines. However, no clear winner emerges out of the evaluation. In this paper, we claim that the RSP community needs to adopt a Systematic Comparative Research Approach (SCRA) if it wants to move a step forward. To this end, we propose a framework that enables SCRA for window based RSP engines. The contributions of this paper are: (i) the requirements to satisfy for tools that aim at enabling SCRA; (ii) the architecture of a facility to design and execute experiment guaranteeing repeatability, reproducibility and comparability; (iii) Heaven – a proof of concept implementation of such architecture that we released as open source –; (iv) two RSP engine implementations, also open source, that we propose as baselines for the comparative research (i.e., they can serve as terms of comparison in future works). We prove Heaven effectiveness using the baselines by: (i) showing that top-down hypothesis verification is not straight forward even in controlled conditions and (ii) providing examples of bottom-up comparative analysis.
With the advancement of new ICT platform coupled with the lower cost of digital infrastructure from connectivity to cloud-based infrastructure and applications software, technology such as Mobile, IoT, Robotics, AI and foreseeable Quantum computing will impact all industries, disrupting many traditional businesses and hence redefine the way we work, live and commute.
There will be many opportunities for the entrepreneurs and research communities to develop new products and services to meet the demand of the business and consumers.
The impact on human capital is inevitable as many traditional jobs will be automated and the capability to learn, unlearn and relearn to stay relevant in the future economy is the key to sustainability.
Singapore, being a small and young economy, needs to move fast and stay ahead of the rest to remain viable. The latest policy and national initiatives based on the recommendation of Committee of Future Economy (CFE) will provide the platform to transform the economy in the next few decades.
Open-sourcing JavaScript at the City of Amsterdam - All Things Open 2019Roy Derks
Did you knew the City of Amsterdam has developed over 125 open-source projects? Also, many of those are build using JavaScript and are using other open-source packages and tools included in the JavaScript ecosystem. The City of Amsterdam is pioneering the field of open-source development by local governments worldwide. But how do we incorporate Open Standards into our projects, choose which packages to use, maintain our repositories and keep our codebase reliable?
Stream Reasoning: a summary of ten years of research and a vision for the nex...Emanuele Della Valle
Stream reasoning studies the application of inference techniques to data characterised by being highly dynamic. It can find application in several settings, from Smart Cities to Industry 4.0, from Internet of Things to Social Media analytics. This year stream reasoning turns ten, and this talk analyses its growth. In the first part, it traces the main results obtained so far, by presenting the most prominent studies. It starts by an overview of the most relevant studies developed in the context of semantic web, and then it extends the analysis to include contributions from adjacent areas, such as database and artificial intelligence. Looking at the past is useful to prepare for the future: the second part presents a set of open challenges and issues that stream reasoning will face in the next future.
From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...OW2
This talk presents OW2's efforts, experience, vision and methodology to assess the market readiness of open source software. If ''open source software has won'' it remains that many conventional managers are not comfortable with it : rating open source market readiness can be a powerful tool to help decision makers.
This presentation will cover:
- the value chain of open source software;
- the OW2 Open-source Sofware Capability Assessment Radar (OSCAR) platform;
- the first version of OW2 OSS Market Readiness Level (MRL, derived from NASA's TRL or Technology Readiness Levels).
HPC in the cloud comes of age - Red Oak HPC SeminarMartin Hamilton
What does HPC in the cloud look like in 2017, and how did we get there? In this talk for Red Oak's HPC Seminar, I look at the origins of cloud HPC, and how it has become mainstream through technologies like Amazon Web Services and OpenStack. I also offer a sneak preview of the 2017 UK national e-Infrastructure survey results, and some thoughts about what's next in cloud HPC, from hyperscale providers to the momentum behind container technologies from Docker and the Open Container Initiative.
Compilation of slides from all presentations at FORMAT Final Conference at Politecnico di Milano, May 14-15, 2015. See agenda for topics of presentations and highlighted slides from each presentation. Come and download a full set of presentations from http://handbook.format-project.eu/?page_id=354
FITCE Congress 2017, Madrid - Raf Meersman (CEO, Comsof)Comsof
Automated and Optimised FTTx Planning
How increased availability of GIS data and PC Calculation power allows to save millions of Euro's in an FTTx network Rollout .
Presentation from FITCE Congress 2017 in Madrid
More info: http://www.fiberplanit.com
Data is being generated all around us – from our smart phones tracking our movement through a city to the city itself sensing various properties and reacting to various conditions. However, to maximise the potential from all this data, it needs to be combined and coerced into models that enable analysis and interpretation. In this talk I will give an overview of the techniques that I have developed for data integration: integrating streams of sensor data with background contextual data and supporting multiple interpretations of linking data together. At the end of the talk I will overview the work I will be conducting in the Administrative Data Research Centre for Scotland.
Real Time Analytics with Apache Cassandra - Cassandra Day BerlinGuido Schmutz
Time series data is everywhere: IoT, sensor data or financial transactions. The industry has moved to databases like Cassandra to handle the high velocity and high volume of data that is now common place. In this talk I will present how we have used Cassandra to store time series data. I will highlight both the Cassandra data model as well as the architecture we put in place for collecting and ingesting data into Cassandra, using Apache Kafka and Apache Storm.
ThyssenKrupp's Procurement Transformation: Minimizing Direct Materials Contra...SAP Ariba
Direct materials sourcing, contracting, and supplier management processes are often more complex than indirect, and spend value is significantly higher, especially in large global, diversified companies. Manufacturers must harness the buying power of their global business units and ensure that negotiated savings are fully realized in their execution processes. Join this session to learn how a diversified global manufacturer is achieving all of this and more.
Personal Research Overview presented at the KU-NAIST Research MeetingChawanat Nakasan
This is the overview of my research as I finish the doctoral degree. This presentation was made on 2018-02-15 as part of the Kasetsart University and Nara Institute of Science and Technology Research Meeting. The content concerns my research and possible future contributions that I can make towards KU-NAIST joint research effort.
** This document has been edited from the time of presentation to remove sensitive and confidential material.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Open-sourcing JavaScript at the City of Amsterdam - All Things Open 2019Roy Derks
Did you knew the City of Amsterdam has developed over 125 open-source projects? Also, many of those are build using JavaScript and are using other open-source packages and tools included in the JavaScript ecosystem. The City of Amsterdam is pioneering the field of open-source development by local governments worldwide. But how do we incorporate Open Standards into our projects, choose which packages to use, maintain our repositories and keep our codebase reliable?
Stream Reasoning: a summary of ten years of research and a vision for the nex...Emanuele Della Valle
Stream reasoning studies the application of inference techniques to data characterised by being highly dynamic. It can find application in several settings, from Smart Cities to Industry 4.0, from Internet of Things to Social Media analytics. This year stream reasoning turns ten, and this talk analyses its growth. In the first part, it traces the main results obtained so far, by presenting the most prominent studies. It starts by an overview of the most relevant studies developed in the context of semantic web, and then it extends the analysis to include contributions from adjacent areas, such as database and artificial intelligence. Looking at the past is useful to prepare for the future: the second part presents a set of open challenges and issues that stream reasoning will face in the next future.
From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...OW2
This talk presents OW2's efforts, experience, vision and methodology to assess the market readiness of open source software. If ''open source software has won'' it remains that many conventional managers are not comfortable with it : rating open source market readiness can be a powerful tool to help decision makers.
This presentation will cover:
- the value chain of open source software;
- the OW2 Open-source Sofware Capability Assessment Radar (OSCAR) platform;
- the first version of OW2 OSS Market Readiness Level (MRL, derived from NASA's TRL or Technology Readiness Levels).
HPC in the cloud comes of age - Red Oak HPC SeminarMartin Hamilton
What does HPC in the cloud look like in 2017, and how did we get there? In this talk for Red Oak's HPC Seminar, I look at the origins of cloud HPC, and how it has become mainstream through technologies like Amazon Web Services and OpenStack. I also offer a sneak preview of the 2017 UK national e-Infrastructure survey results, and some thoughts about what's next in cloud HPC, from hyperscale providers to the momentum behind container technologies from Docker and the Open Container Initiative.
Compilation of slides from all presentations at FORMAT Final Conference at Politecnico di Milano, May 14-15, 2015. See agenda for topics of presentations and highlighted slides from each presentation. Come and download a full set of presentations from http://handbook.format-project.eu/?page_id=354
FITCE Congress 2017, Madrid - Raf Meersman (CEO, Comsof)Comsof
Automated and Optimised FTTx Planning
How increased availability of GIS data and PC Calculation power allows to save millions of Euro's in an FTTx network Rollout .
Presentation from FITCE Congress 2017 in Madrid
More info: http://www.fiberplanit.com
Data is being generated all around us – from our smart phones tracking our movement through a city to the city itself sensing various properties and reacting to various conditions. However, to maximise the potential from all this data, it needs to be combined and coerced into models that enable analysis and interpretation. In this talk I will give an overview of the techniques that I have developed for data integration: integrating streams of sensor data with background contextual data and supporting multiple interpretations of linking data together. At the end of the talk I will overview the work I will be conducting in the Administrative Data Research Centre for Scotland.
Real Time Analytics with Apache Cassandra - Cassandra Day BerlinGuido Schmutz
Time series data is everywhere: IoT, sensor data or financial transactions. The industry has moved to databases like Cassandra to handle the high velocity and high volume of data that is now common place. In this talk I will present how we have used Cassandra to store time series data. I will highlight both the Cassandra data model as well as the architecture we put in place for collecting and ingesting data into Cassandra, using Apache Kafka and Apache Storm.
ThyssenKrupp's Procurement Transformation: Minimizing Direct Materials Contra...SAP Ariba
Direct materials sourcing, contracting, and supplier management processes are often more complex than indirect, and spend value is significantly higher, especially in large global, diversified companies. Manufacturers must harness the buying power of their global business units and ensure that negotiated savings are fully realized in their execution processes. Join this session to learn how a diversified global manufacturer is achieving all of this and more.
Personal Research Overview presented at the KU-NAIST Research MeetingChawanat Nakasan
This is the overview of my research as I finish the doctoral degree. This presentation was made on 2018-02-15 as part of the Kasetsart University and Nara Institute of Science and Technology Research Meeting. The content concerns my research and possible future contributions that I can make towards KU-NAIST joint research effort.
** This document has been edited from the time of presentation to remove sensitive and confidential material.
Similar to SLD Revolution: A Cheaper, Faster yet more Accurate Streaming Linked Data Framework (20)
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofsAlex Pruden
This paper presents Reef, a system for generating publicly verifiable succinct non-interactive zero-knowledge proofs that a committed document matches or does not match a regular expression. We describe applications such as proving the strength of passwords, the provenance of email despite redactions, the validity of oblivious DNS queries, and the existence of mutations in DNA. Reef supports the Perl Compatible Regular Expression syntax, including wildcards, alternation, ranges, capture groups, Kleene star, negations, and lookarounds. Reef introduces a new type of automata, Skipping Alternating Finite Automata (SAFA), that skips irrelevant parts of a document when producing proofs without undermining soundness, and instantiates SAFA with a lookup argument. Our experimental evaluation confirms that Reef can generate proofs for documents with 32M characters; the proofs are small and cheap to verify (under a second).
Paper: https://eprint.iacr.org/2023/1886
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
UiPath Test Automation using UiPath Test Suite series, part 6DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI.
UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and Open AI.
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
SLD Revolution: A Cheaper, Faster yet more Accurate Streaming Linked Data Framework
1. Politecnico di Milano, DEIB
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Marco Balduini, Riccardo Tommasini, Emanuele Della Valle
A Cheaper, Faster yet more Accurate
Streaming Linked Data Framework
1
2. ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
RSP is Great!
2
3. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
Why RSP?
3
- offers a generic overview over streams and static data
- enables query answering across heterogeneous sources
- consents to create/publish new streams or graphs
4. ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
The RSP Idea
4
in short
5. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
CQL Model
5
Streams
Relations
Streams-to-Relations
Relations-to-Streams
Relationsto-Relations
The CQL continuous query language
- Arvind Arasu · Shivnath Babu · Jennifer Widom, 2006, VLDBJ
6. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
RSP-QL Model
6
RDF Streams Solution
Mappings
S2R operators
R2S operators
R2R operators
The CQL continuous query language
- E. Della Valle, S. Ceri, D. Barbieri, D. Braga, A. Campi, 2008, FIS
7. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
RSP in a Nutshell
7
RDF Stream-to-RDF
RDF-to-RDF (solution mappings)
RDF-to-RDF Stream
on RDF Streams
8. ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
RSP in Practice
8
With SLD
9. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
An Social Media Example
9
How many micro-posts do occur over time?
How often does a hashtag appears in the micro-post
stream?
Two Information Needs
10. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
Streaming Linked Data Server
Sources
Raw Stream
10
Adapter RDF Stream Bus Publisher
Visualizer
Recorder Re-player Analiser Decorator
11. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
11
12. Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
An Important
Optimisation
REGISTER STREAM sstr AS
CONSTRUCT {
?id sma:twitterCount ?tc }
FROM STREAM <social> [RANGE 1m STEP 1m]
WHERE {
SELECT (uuid() AS ?id) ?tc
WHERE {
SELECT
(COUNT (DISTINCT ?mp) AS ?tc)
WHERE {
?mp a sma:Tweet } } }
12
13. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
13
14. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
14
15. Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Using
C-SPARQL
REGISTER STREAM countT AS
CONSTRUCT {
?uid sma:twitterCount ?tot .}
FROM STREAM <sstr> [RANGE 15m
STEP 1m]
WHERE {
SELECT
(uuid() AS ?uid)
(SUM(?tc) AS ?tot)
WHERE {
?id sma:twitterCount ?tc }}
15
16. ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Is RSP always great?
16
17. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
Observations
17
It is flexible. :)
It forces RDF when query results are often relational :(
It is not optimal, i.e. RSP-QL vs SQL vs Path Queries
on SLD
18. ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Revolutionising SLD
18
19. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
A “Lazy” Processing Model
19
Stream operators can be applied on generic data items.
QL-specific operators requires a particular data type.
Postpone the data transformation as late as possible.
on streams
20. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
Generic Programming
20
Generic programming is a style of computer
programming in which algorithms are written in terms of
types to-be-specified-later that are then instantiated
when needed for specific types provided as parameters.
an old idea
21. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
A new Processing Model
21
Generic
Streams<T>
Generic
Instantaneous<T>
S2I<T>
I2S<T>
I2I<T>
22. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
Lazy Transformation by Generic Programming
22
stream-to-instantaneous<T>
instantaneous-to-instantaneous<T>
instantaneous-to-stream<T>
on streams<T>
23. ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
RSP in Practice
23
with SLD Revolution
24. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
Sources
Stream
Sink
Streaming Linked Data Revolution Server
24
Receiver Generic Stream Bus Translator
Stream
Recorder Re-player Processor Decorator
25. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
25
26. ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
SLD vs SLD Revolution
26
Let’s be quantitative
27. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
It is faster, cheaper yet more accurate than SLD.
27
R² = 0,96413R² = 0,99891
30
300
3000
1 10 100
Median Engine Memory (MB)
Median CPU Load (%)
SLD
SLD Revolution
Expon. (SLD)
Linear (SLD Revolution)
28. ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Discussion & Conclusion
28
29. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
Observations
29
It is faster, cheaper yet more accurate than SLD. :)
It requires to know EPL, SPARQL, JSON path queries. :(
It is optimised and, thus, not flexible. :(
on SLD Revolution
30. EyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESWC
Open Problems
30
RSP-QL is not always the best solution in terms of cost/performance
Can we identify an optimum?
Can we define a cost model for RSP-QL ?
31. Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Questions?
Email: riccardo.tommasini@polimi.it
Twitter: @rictomm
31
Email: marco.balduini@polimi.it
Twitter: @ balducci85
Pablo Picasso, Les Demoiselles d'Avignon, 1907.
Museum of Modern Art (MoMA), New York City, NY, US