The document discusses finding commonalities between RDF graphs by computing their least general generalization (lgg). It defines the lgg of RDF graphs as a generalization that entails all input graphs based on RDF entailment rules, and is entailed by any other generalization. The document focuses on computing the lgg of two RDF graphs, which can be used to iteratively find the lgg of multiple graphs. An example is provided to illustrate defining the lgg of two sample RDF graphs.
The document discusses defining and computing the least general generalization (lgg) of RDF graphs and SPARQL queries. It introduces the concepts of RDF graphs, entailment between graphs, and materializing implicit triples using RDFS and RDF entailment rules. The document outlines contributions in defining and computing the lgg in RDF and SPARQL, and reporting on experiments using datasets like DBpedia and LUBM.
Full version of http://www.slideshare.net/valexiev1/gvp-lodcidocshort. Same is available on http://vladimiralexiev.github.io/pres/20140905-CIDOC-GVP/index.html
CIDOC Congress, Dresden, Germany
2014-09-05: International Terminology Working Group: full version.
2014-09-09: Getty special session: short version
Linked geospatial data has recently received attention, as researchers and practitioners have started tapping the wealth of geospatial information available on the Web. Incomplete geospatial information, although appearing often in the applications captured by such datasets, is not represented and queried properly due to the lack of appropriate data models and query languages. We discuss our recent work on the model RDFi, an extension of RDF with the ability to represent property values that exist, but are unknown or partially known, using constraints, and an extension of the query language SPARQL with qualitative and quantitative geospatial querying capabilities. We demonstrate the usefulness of RDFi in geospatial Semantic Web applications by giving examples and comparing the modeling capabilities of RDFi with the ones of related Semantic Web systems.
A Context-Based Semantics for SPARQL Property Paths over the WebOlaf Hartig
- The document proposes a formal context-based semantics for evaluating SPARQL property path queries over the Web of Linked Data.
- This semantics defines how to compute the results of such queries in a well-defined manner and ensures the "web-safeness" of queries, meaning they can be executed directly over the Web without prior knowledge of all data.
- The paper presents a decidable syntactic condition for identifying SPARQL property path queries that are web-safe based on their sets of conditionally bounded variables.
Parallel Datalog Reasoning in RDFox PresentationDBOnto
Abstract:
We present a novel approach to parallel materialisation (i.e.,
fixpoint computation) of datalog programs in centralised,
main-memory, multi-core RDF systems. Our approach comprises an algorithm that evenly distributes the workload to cores, and an RDF indexing data structure that supports efficient, ‘mostly’ lock-free parallel updates. Our empirical evaluation shows that our approach parallelises computation very well: with 16 physical cores, materialisation can be up to 13.9 times faster than with just one core.
RSP-QL*: Querying Data-Level Annotations in RDF Streamskeski
This document proposes an extension to RSP-QL called RSP-QL* that allows querying of statement-level annotations in RDF streams. RSP-QL* uses the RDF* model, which allows embedding RDF triples as the subject or object of other triples. This provides an efficient way to represent statement-level metadata in RDF. The semantics of RSP-QL are extended to support RSP-QL* patterns, which can include basic graph patterns, named graphs, windows and other operators. Future work includes adding more functionality to the RDF* model, prototyping an implementation, and evaluating performance.
This document discusses demos and tools for linking knowledge discovery (KDD) and linked data. It summarizes several tools that integrate linked data and KDD processes like data preprocessing, mining, and postprocessing. OpenRefine, RapidMiner, R, Matlab, ProLOD++, DL-Learner, Spark, KNIME, and Gephi were highlighted as tools that support tasks like enriching data, running SPARQL queries, loading RDF data, and visualizing linked data. The document concludes by asking about gaps and how to increase adoption, noting linked data could benefit KDD with validation, enrichment, and reasoning over semantic web data.
This document discusses computing commonalities between SPARQL conjunctive queries. It defines the concept of a least general generalization (lgg) of queries, which is a most general query that entails each of the input queries. The document presents definitions for lgg of basic graph pattern queries in SPARQL with respect to a set of RDF entailment rules and RDFS constraints. It focuses on computing the lgg of two queries by iteratively taking the lgg of query pairs. The goal is to study computing lgg in the conjunctive fragment of SPARQL to applications like query optimization and recommendation.
The document discusses defining and computing the least general generalization (lgg) of RDF graphs and SPARQL queries. It introduces the concepts of RDF graphs, entailment between graphs, and materializing implicit triples using RDFS and RDF entailment rules. The document outlines contributions in defining and computing the lgg in RDF and SPARQL, and reporting on experiments using datasets like DBpedia and LUBM.
Full version of http://www.slideshare.net/valexiev1/gvp-lodcidocshort. Same is available on http://vladimiralexiev.github.io/pres/20140905-CIDOC-GVP/index.html
CIDOC Congress, Dresden, Germany
2014-09-05: International Terminology Working Group: full version.
2014-09-09: Getty special session: short version
Linked geospatial data has recently received attention, as researchers and practitioners have started tapping the wealth of geospatial information available on the Web. Incomplete geospatial information, although appearing often in the applications captured by such datasets, is not represented and queried properly due to the lack of appropriate data models and query languages. We discuss our recent work on the model RDFi, an extension of RDF with the ability to represent property values that exist, but are unknown or partially known, using constraints, and an extension of the query language SPARQL with qualitative and quantitative geospatial querying capabilities. We demonstrate the usefulness of RDFi in geospatial Semantic Web applications by giving examples and comparing the modeling capabilities of RDFi with the ones of related Semantic Web systems.
A Context-Based Semantics for SPARQL Property Paths over the WebOlaf Hartig
- The document proposes a formal context-based semantics for evaluating SPARQL property path queries over the Web of Linked Data.
- This semantics defines how to compute the results of such queries in a well-defined manner and ensures the "web-safeness" of queries, meaning they can be executed directly over the Web without prior knowledge of all data.
- The paper presents a decidable syntactic condition for identifying SPARQL property path queries that are web-safe based on their sets of conditionally bounded variables.
Parallel Datalog Reasoning in RDFox PresentationDBOnto
Abstract:
We present a novel approach to parallel materialisation (i.e.,
fixpoint computation) of datalog programs in centralised,
main-memory, multi-core RDF systems. Our approach comprises an algorithm that evenly distributes the workload to cores, and an RDF indexing data structure that supports efficient, ‘mostly’ lock-free parallel updates. Our empirical evaluation shows that our approach parallelises computation very well: with 16 physical cores, materialisation can be up to 13.9 times faster than with just one core.
RSP-QL*: Querying Data-Level Annotations in RDF Streamskeski
This document proposes an extension to RSP-QL called RSP-QL* that allows querying of statement-level annotations in RDF streams. RSP-QL* uses the RDF* model, which allows embedding RDF triples as the subject or object of other triples. This provides an efficient way to represent statement-level metadata in RDF. The semantics of RSP-QL are extended to support RSP-QL* patterns, which can include basic graph patterns, named graphs, windows and other operators. Future work includes adding more functionality to the RDF* model, prototyping an implementation, and evaluating performance.
This document discusses demos and tools for linking knowledge discovery (KDD) and linked data. It summarizes several tools that integrate linked data and KDD processes like data preprocessing, mining, and postprocessing. OpenRefine, RapidMiner, R, Matlab, ProLOD++, DL-Learner, Spark, KNIME, and Gephi were highlighted as tools that support tasks like enriching data, running SPARQL queries, loading RDF data, and visualizing linked data. The document concludes by asking about gaps and how to increase adoption, noting linked data could benefit KDD with validation, enrichment, and reasoning over semantic web data.
This document discusses computing commonalities between SPARQL conjunctive queries. It defines the concept of a least general generalization (lgg) of queries, which is a most general query that entails each of the input queries. The document presents definitions for lgg of basic graph pattern queries in SPARQL with respect to a set of RDF entailment rules and RDFS constraints. It focuses on computing the lgg of two queries by iteratively taking the lgg of query pairs. The goal is to study computing lgg in the conjunctive fragment of SPARQL to applications like query optimization and recommendation.
Federation and Navigation in SPARQL 1.1net2-project
This document discusses new features in SPARQL 1.1, including federation using the SERVICE operator and navigation using property paths. It provides an overview of the basics of SPARQL and the syntax and semantics of SPARQL 1.0 queries before explaining federation, which allows querying multiple datasets, and navigation, which allows navigating RDF graphs using regular expressions to match properties. It also discusses the evaluation procedures and complexity of these new features.
The document discusses the Semantic Web and Linked Data. It provides an overview of RDF syntaxes, storage and querying technologies for the Semantic Web. It also discusses issues around scalability and reasoning over large amounts of semantic data. Examples are provided to illustrate SPARQL querying of RDF data, including graph patterns, conjunctions, optional patterns and value testing.
The aim of the EU FP 7 Large-Scale Integrating Project LarKC is to develop the Large Knowledge Collider (LarKC, for short, pronounced “lark”), a platform for massive distributed incomplete reasoning that will remove the scalability barriers of currently existing reasoning systems for the Semantic Web. The LarKC platform is available at larkc.sourceforge.net. This talk, is part of a tutorial for early users of the LarKC platform, and describes the data model used within LarKC.
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
I used this slideset to present our research paper at the 14th Int. Semantic Web Conference (ISWC 2015). Find a preprint of the paper here:
http://olafhartig.de/files/HartigPerez_ISWC2015_Preprint.pdf
In this paper, we propose the problem of implementing an efficient query processing system for incomplete temporal and geospatial information in RDFi as a challenge to the SSTD community.
ParlBench: a SPARQL-benchmark for electronic publishing applications.Tatiana Tarasova
Slides from the workshop on Benchmarking RDF Systems co-located with the Extended Semantic Web Conference 2013. The presentation is about an on-going work on building the benchmark for electronic publishing applications. The benchmark provides real-world data sets, the Dutch parliamentary proceedings and a set of analytical SPARQL queries that were built on top of these data sets. The queries were grouped into micro-benchmarks according to their analytical aims. This allows one to perform better analysis of RDF stores behaviors with respect to a certain SPARQL feature used in a micro-benchmark/query.
Preliminary results of running the benchmark on the Virtuoso native RDF store are presented, as well as references to the on-line material including the data sets, queries and the scripts that were used to obtain the results.
Machine Learning Methods for Analysing and Linking RDF DataJens Lehmann
Invited Talk at the 8th International Conference on Scalable Uncertainty Management (SUM)
The talk outlines applications of supervised structured machine learning and presents a specific refinement operator based approach for RDF/OWL. It also outlines how similar ideas can be used in other (formal) languages, in particular link specifications.
We extend RDF with the ability to represent property values that exist, but are unknown or partially known, using constraints. Following ideas from the incomplete information literature, we develop a semantics for this extension of RDF, called RDFi, and study SPARQL query evaluation in this framework.
This document summarizes Jean-Paul Calbimonte's presentation on connecting stream reasoners on the web. It discusses representing data streams as RDF and using RDF stream processing systems. Key points include:
- RDF streams can be represented as sequences of timestamped RDF graphs.
- The W3C RSP community group is working to standardize RDF stream models and query languages.
- Producing RDF streams involves mapping live data sources to RDF and adding timestamps.
- Consuming RDF streams involves discovering stream metadata and endpoints to access the streams.
- Systems like TripleWave demonstrate approaches for spreading RDF streams on the web.
Triplewave: a step towards RDF Stream Processing on the WebDaniele Dell'Aglio
The slides of my talk at INSIGHT Centre for Data Analytics (in NUI Galway) where I presented TripleWave (http://streamreasoning.github.io/TripleWave/), an open-source framework to create and publish streams of RDF data.
The document discusses requirements and approaches for RDF stream processing (RSP). It covers the following key points in 3 sentences:
RSP aims to process continuous RDF streams to address scenarios like sensor data and social media. It involves querying streaming data, integrating streams with static data, and handling issues like imperfections. The document reviews existing RSP systems and languages, actor-based approaches, and the 8 requirements for real-time stream processing including keeping data moving, generating predictable outcomes, and responding instantaneously.
Presentation done* at the 13th International Semantic Web Conference (ISWC) in which we approach a compressed format to represent RDF Data Streams. See the original article at: http://dataweb.infor.uva.es/wp-content/uploads/2014/07/iswc14.pdf
* Presented by Alejandro Llaves (http://www.slideshare.net/allaves)
The document discusses scaling web data at low cost. It begins by presenting Javier D. Fernández and providing context about his work in semantic web, open data, big data management, and databases. It then discusses techniques for compressing and querying large RDF datasets at low cost using binary RDF formats like HDT. Examples of applications using these techniques include compressing and sharing datasets, fast SPARQL querying, and embedding systems. It also discusses efforts to enable web-scale querying through projects like LOD-a-lot that integrate billions of triples for federated querying.
This document provides an overview of RDF stream processing and existing RDF stream processing engines. It discusses RDF streams and how sensor data can be represented as RDF streams. It also summarizes some existing RDF stream processing query languages and systems, including C-SPARQL, and the features they support like continuous execution, operators, and time-based windows. The document is intended as a tutorial for developers on working with RDF stream processing.
Property graph vs. RDF Triplestore comparison in 2020Ontotext
This presentation goes all the way from intro "what graph databases are" to table comparing the RDF vs. PG plus two different diagrams presenting the market circa 2020
1) The Semantic Web technologies OWL 2 and Rule Interchange Format (RIF) have recently been finalized, while technical work is ongoing for SPARQL 1.1, RDFa 1.1, and connecting relational databases to RDF.
2) A workshop will discuss a possible revision to RDF to address issues like deprecation of features and addition of new constructs like named graphs.
3) The standards organization W3C is working on finalizing current technologies while exploring new areas like provenance and revisions to the core RDF standard based on discussion at the workshop.
CIDOC Congress, Dresden, Germany
2014-09-05: International Terminology Working Group: full version (http://vladimiralexiev.github.io/pres/20140905-CIDOC-GVP/index.html)
2014-09-09: Getty special session: short version (http://VladimirAlexiev.github.io/pres/20140905-CIDOC-GVP/GVP-LOD-CIDOC-short.pdf)
TripleWave: Spreading RDF Streams on the WebAndrea Mauri
TripleWave is an open-source framework for creating and publishing RDF streams over the Web. It converts various data sources like temporal RDF datasets and web streams into RDF streams. TripleWave makes these streams available via standard protocols and allows consuming applications to access the streams through pull via Linked Data principles or push using RSP services. The framework is implemented in NodeJS and available on GitHub to help spread the use of RDF streams on the semantic web.
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationOlaf Hartig
These are the slides of my invited talk at the 5th Int. Workshop on Usage Analysis and the Web of Data (USEWOD 2015): http://usewod.org/usewod2015.html
The abstract of this talks is given as follows:
To reduce user-perceived response time many interactive Web applications visualize information in a dynamic, incremental manner. Such an incremental presentation can be particularly effective for cases in which the underlying data processing systems are not capable of completely answering the users' information needs instantaneously. An example of such systems are systems that support live querying of the Web of Data, in which case query execution times of several seconds, or even minutes, are an inherent consequence of these systems' ability to guarantee up-to-date results. However, support for an incremental result visualization has not received much attention in existing work on such systems. Therefore, the goal of this talk is to discuss approaches that enable query systems for the Web of Data to return query results incrementally.
Efficient Query Answering against Dynamic RDF DatabasesAlexandra Roatiș
The document describes efficient query answering against dynamic RDF databases. It discusses RDF as a graph-based data model and standard, blank nodes, RDF Schema (RDFS) for semantic constraints, the open-world assumption and RDF entailment through implicit triples and saturation. It also covers basic graph pattern (BGP) queries in SPARQL and the need to decouple RDF entailment from query evaluation through data saturation or query reformulation to obtain complete query answers.
Wi2015 - Clustering of Linked Open Data - the LODeX toolLaura Po
Presentation of the tool LODeX (http://www.dbgroup.unimore.it/lodex2/testCluster) at the 2015 IEEE/WIC/ACM International Conference on Web Intelligence, Singapore, December 6-8, 2015
Federation and Navigation in SPARQL 1.1net2-project
This document discusses new features in SPARQL 1.1, including federation using the SERVICE operator and navigation using property paths. It provides an overview of the basics of SPARQL and the syntax and semantics of SPARQL 1.0 queries before explaining federation, which allows querying multiple datasets, and navigation, which allows navigating RDF graphs using regular expressions to match properties. It also discusses the evaluation procedures and complexity of these new features.
The document discusses the Semantic Web and Linked Data. It provides an overview of RDF syntaxes, storage and querying technologies for the Semantic Web. It also discusses issues around scalability and reasoning over large amounts of semantic data. Examples are provided to illustrate SPARQL querying of RDF data, including graph patterns, conjunctions, optional patterns and value testing.
The aim of the EU FP 7 Large-Scale Integrating Project LarKC is to develop the Large Knowledge Collider (LarKC, for short, pronounced “lark”), a platform for massive distributed incomplete reasoning that will remove the scalability barriers of currently existing reasoning systems for the Semantic Web. The LarKC platform is available at larkc.sourceforge.net. This talk, is part of a tutorial for early users of the LarKC platform, and describes the data model used within LarKC.
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
I used this slideset to present our research paper at the 14th Int. Semantic Web Conference (ISWC 2015). Find a preprint of the paper here:
http://olafhartig.de/files/HartigPerez_ISWC2015_Preprint.pdf
In this paper, we propose the problem of implementing an efficient query processing system for incomplete temporal and geospatial information in RDFi as a challenge to the SSTD community.
ParlBench: a SPARQL-benchmark for electronic publishing applications.Tatiana Tarasova
Slides from the workshop on Benchmarking RDF Systems co-located with the Extended Semantic Web Conference 2013. The presentation is about an on-going work on building the benchmark for electronic publishing applications. The benchmark provides real-world data sets, the Dutch parliamentary proceedings and a set of analytical SPARQL queries that were built on top of these data sets. The queries were grouped into micro-benchmarks according to their analytical aims. This allows one to perform better analysis of RDF stores behaviors with respect to a certain SPARQL feature used in a micro-benchmark/query.
Preliminary results of running the benchmark on the Virtuoso native RDF store are presented, as well as references to the on-line material including the data sets, queries and the scripts that were used to obtain the results.
Machine Learning Methods for Analysing and Linking RDF DataJens Lehmann
Invited Talk at the 8th International Conference on Scalable Uncertainty Management (SUM)
The talk outlines applications of supervised structured machine learning and presents a specific refinement operator based approach for RDF/OWL. It also outlines how similar ideas can be used in other (formal) languages, in particular link specifications.
We extend RDF with the ability to represent property values that exist, but are unknown or partially known, using constraints. Following ideas from the incomplete information literature, we develop a semantics for this extension of RDF, called RDFi, and study SPARQL query evaluation in this framework.
This document summarizes Jean-Paul Calbimonte's presentation on connecting stream reasoners on the web. It discusses representing data streams as RDF and using RDF stream processing systems. Key points include:
- RDF streams can be represented as sequences of timestamped RDF graphs.
- The W3C RSP community group is working to standardize RDF stream models and query languages.
- Producing RDF streams involves mapping live data sources to RDF and adding timestamps.
- Consuming RDF streams involves discovering stream metadata and endpoints to access the streams.
- Systems like TripleWave demonstrate approaches for spreading RDF streams on the web.
Triplewave: a step towards RDF Stream Processing on the WebDaniele Dell'Aglio
The slides of my talk at INSIGHT Centre for Data Analytics (in NUI Galway) where I presented TripleWave (http://streamreasoning.github.io/TripleWave/), an open-source framework to create and publish streams of RDF data.
The document discusses requirements and approaches for RDF stream processing (RSP). It covers the following key points in 3 sentences:
RSP aims to process continuous RDF streams to address scenarios like sensor data and social media. It involves querying streaming data, integrating streams with static data, and handling issues like imperfections. The document reviews existing RSP systems and languages, actor-based approaches, and the 8 requirements for real-time stream processing including keeping data moving, generating predictable outcomes, and responding instantaneously.
Presentation done* at the 13th International Semantic Web Conference (ISWC) in which we approach a compressed format to represent RDF Data Streams. See the original article at: http://dataweb.infor.uva.es/wp-content/uploads/2014/07/iswc14.pdf
* Presented by Alejandro Llaves (http://www.slideshare.net/allaves)
The document discusses scaling web data at low cost. It begins by presenting Javier D. Fernández and providing context about his work in semantic web, open data, big data management, and databases. It then discusses techniques for compressing and querying large RDF datasets at low cost using binary RDF formats like HDT. Examples of applications using these techniques include compressing and sharing datasets, fast SPARQL querying, and embedding systems. It also discusses efforts to enable web-scale querying through projects like LOD-a-lot that integrate billions of triples for federated querying.
This document provides an overview of RDF stream processing and existing RDF stream processing engines. It discusses RDF streams and how sensor data can be represented as RDF streams. It also summarizes some existing RDF stream processing query languages and systems, including C-SPARQL, and the features they support like continuous execution, operators, and time-based windows. The document is intended as a tutorial for developers on working with RDF stream processing.
Property graph vs. RDF Triplestore comparison in 2020Ontotext
This presentation goes all the way from intro "what graph databases are" to table comparing the RDF vs. PG plus two different diagrams presenting the market circa 2020
1) The Semantic Web technologies OWL 2 and Rule Interchange Format (RIF) have recently been finalized, while technical work is ongoing for SPARQL 1.1, RDFa 1.1, and connecting relational databases to RDF.
2) A workshop will discuss a possible revision to RDF to address issues like deprecation of features and addition of new constructs like named graphs.
3) The standards organization W3C is working on finalizing current technologies while exploring new areas like provenance and revisions to the core RDF standard based on discussion at the workshop.
CIDOC Congress, Dresden, Germany
2014-09-05: International Terminology Working Group: full version (http://vladimiralexiev.github.io/pres/20140905-CIDOC-GVP/index.html)
2014-09-09: Getty special session: short version (http://VladimirAlexiev.github.io/pres/20140905-CIDOC-GVP/GVP-LOD-CIDOC-short.pdf)
TripleWave: Spreading RDF Streams on the WebAndrea Mauri
TripleWave is an open-source framework for creating and publishing RDF streams over the Web. It converts various data sources like temporal RDF datasets and web streams into RDF streams. TripleWave makes these streams available via standard protocols and allows consuming applications to access the streams through pull via Linked Data principles or push using RSP services. The framework is implemented in NodeJS and available on GitHub to help spread the use of RDF streams on the semantic web.
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationOlaf Hartig
These are the slides of my invited talk at the 5th Int. Workshop on Usage Analysis and the Web of Data (USEWOD 2015): http://usewod.org/usewod2015.html
The abstract of this talks is given as follows:
To reduce user-perceived response time many interactive Web applications visualize information in a dynamic, incremental manner. Such an incremental presentation can be particularly effective for cases in which the underlying data processing systems are not capable of completely answering the users' information needs instantaneously. An example of such systems are systems that support live querying of the Web of Data, in which case query execution times of several seconds, or even minutes, are an inherent consequence of these systems' ability to guarantee up-to-date results. However, support for an incremental result visualization has not received much attention in existing work on such systems. Therefore, the goal of this talk is to discuss approaches that enable query systems for the Web of Data to return query results incrementally.
Efficient Query Answering against Dynamic RDF DatabasesAlexandra Roatiș
The document describes efficient query answering against dynamic RDF databases. It discusses RDF as a graph-based data model and standard, blank nodes, RDF Schema (RDFS) for semantic constraints, the open-world assumption and RDF entailment through implicit triples and saturation. It also covers basic graph pattern (BGP) queries in SPARQL and the need to decouple RDF entailment from query evaluation through data saturation or query reformulation to obtain complete query answers.
Wi2015 - Clustering of Linked Open Data - the LODeX toolLaura Po
Presentation of the tool LODeX (http://www.dbgroup.unimore.it/lodex2/testCluster) at the 2015 IEEE/WIC/ACM International Conference on Web Intelligence, Singapore, December 6-8, 2015
Framester: A Wide Coverage Linguistic Linked Data HubMehwish Alam
Framester is a linguistic linked data hub that aims to improve coverage of FrameNet by extending mappings between FrameNet and other resources like WordNet and BabelNet. Framester represents over 40 million triples linking linguistic and factual resources and aligning frames, roles, and types to foundational ontologies. It provides a word frame disambiguation service and was evaluated on annotated corpora, showing improved performance over previous approaches.
Il seminario presenta il tema emergente del Web of Data, nell'ambito del Semantic Web. Vengono esaminate le criticità incontrate nell'accedere all'enorme quantità di informazione presente attualmente nel Web e i vantaggi di un approccio basato sulla creazione interattiva di interrogazioni.
Presented in : JIST2015, Yichang, China
Prototype: http://rc.lodac.nii.ac.jp/rdf4u/
Video: https://www.youtube.com/watch?v=z3roA9-Cp8g
Abstract: It is known that Semantic Web and Linked Open Data (LOD) are powerful technologies for knowledge management, and explicit knowledge is expected to be presented by RDF format (Resource Description Framework), but normal users are far from RDF due to technical skills required. As we learn, a concept-map or a node-link diagram can enhance the learning ability of learners from beginner to advanced user level, so RDF graph visualization can be a suitable tool for making users be familiar with Semantic technology. However, an RDF graph generated from the whole query result is not suitable for reading, because it is highly connected like a hairball and less organized. To make a graph presenting knowledge be more proper to read, this research introduces an approach to sparsify a graph using the combination of three main functions: graph simplification, triple ranking, and property selection. These functions are mostly initiated based on the interpretation of RDF data as knowledge units together with statistical analysis in order to deliver an easily-readable graph to users. A prototype is implemented to demonstrate the suitability and feasibility of the approach. It shows that the simple and flexible graph visualization is easy to read, and it creates the impression of users. In addition, the attractive tool helps to inspire users to realize the advantageous role of linked data in knowledge management.
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRathachai Chawuthai
Presented in : JIST2015, Yichang, China
Prototype: http://rc.lodac.nii.ac.jp/rdf4u/
Video: https://www.youtube.com/watch?v=z3roA9-Cp8g
Abstract: It is known that Semantic Web and Linked Open Data (LOD) are powerful technologies for knowledge management, and explicit knowledge is expected to be presented by RDF format (Resource Description Framework), but normal users are far from RDF due to technical skills required. As we learn, a concept-map or a node-link diagram can enhance the learning ability of learners from beginner to advanced user level, so RDF graph visualization can be a suitable tool for making users be familiar with Semantic technology. However, an RDF graph generated from the whole query result is not suitable for reading, because it is highly connected like a hairball and less organized. To make a graph presenting knowledge be more proper to read, this research introduces an approach to sparsify a graph using the combination of three main functions: graph simplification, triple ranking, and property selection. These functions are mostly initiated based on the interpretation of RDF data as knowledge units together with statistical analysis in order to deliver an easily-readable graph to users. A prototype is implemented to demonstrate the suitability and feasibility of the approach. It shows that the simple and flexible graph visualization is easy to read, and it creates the impression of users. In addition, the attractive tool helps to inspire users to realize the advantageous role of linked data in knowledge management.
Interactive Knowledge Discovery over Web of Data.Mehwish Alam
This document describes research on classifying and exploring data from the Web of Data. It discusses building a classification structure over RDF data by classifying triples based on RDF Schema and creating views through SPARQL queries. This structure can then be used for data completion and interactive knowledge discovery through data analysis and visualization. Formal concept analysis and pattern structures are introduced as techniques for dealing with complex data types from the Web of Data like graphs and linked data. Range minimum queries are also proposed as a way to compute the lowest common ancestor for structured attribute sets in the pattern structures.
This document discusses clustering of RDF data across the Semantic Web. It begins by describing the Linking Open Data project and the growing amount of RDF data available. It then discusses the motivations for clustering RDF data, such as improving data access and query response times over distributed machines. Current approaches to RDF clustering are also summarized, including extracting instance subgraphs and computing distances between instances. The document outlines different techniques for instance extraction and distance computation in RDF clustering.
THoSP: an Algorithm for Nesting Property GraphsGiacomo Bergami
The document describes an algorithm called THoSP for nesting property graphs. THoSP allows combining graph pattern matching and grouping to efficiently nest subgraphs within a larger graph. The algorithm represents the nested graph as an adjacency list enriched with a nesting index. Experimental results show THoSP outperforms querying nested graphs in other databases like SQL, SPARQL, AQL and Cypher.
SPARQL is a standardized query language for retrieving and manipulating data stored in RDF format. It was created by the RDF Data Access Working Group to provide querying of RDF stores. SPARQL supports four query forms: SELECT, CONSTRUCT, DESCRIBE, and ASK. It also defines a protocol for executing queries over HTTP. SPARQL has become a key technology for working with semantic data on the web.
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018Ontotext
These are slides from a live webinar taken place January 2018.
GraphDB™ Fundamentals builds the basis for working with graph databases that utilize the W3C standards, and particularly GraphDB™. In this webinar, we demonstrated how to install and set-up GraphDB™ 8.4 and how you can generate your first RDF dataset. We also showed how to quickly integrate complex and highly interconnected data using RDF and SPARQL and much more.
With the help of GraphDB™, you can start smartly managing your data assets, visually represent your data model and get insights from them.
The talk was given at the 15th International Conference on Extending Database Technology (EDBT 2012) on March 29, 2012 in Berlin, Germany.
Abstract:
Query optimization in RDF Stores is a challenging problem as SPARQL queries typically contain many more joins than equivalent relational plans, and hence lead to a large join order search space. In such cases, cost-based query optimization often is not possible. One practical reason for this is that statistics typically are missing in web scale setting such as the Linked Open Datasets (LOD). The more profound reason is that due to the absence of schematic structure in RDF, join-hit ratio estimation requires complicated forms of correlated join statistics; and currently there are no methods to identify the relevant correlations beforehand. For this reason, the use of good heuristics is essential in SPARQL query optimization, even in the case that are partially used with cost-based statistics (i.e., hybrid query optimization). In this paper we describe a set of useful heuristics for SPARQL query optimizers. We present these in the context of a new Heuristic SPARQL Planner (HSP) that is capable of exploiting the syntactic and the structural variations of the triple patterns in a SPARQL query in order to choose an execution plan without the need of any cost model. For this, we define the variable graph and we show a reduction of the SPARQL query optimization problem to the maximum weight independent set problem. We implemented our planner on top of the MonetDB open source column-store and evaluated its effectiveness against the state-of-the-art RDF-3X engine as well as comparing the plan quality with a relational (SQL) equivalent of the benchmarks.
The web of interlinked data and knowledge strippedSören Auer
Linked Data approaches can help solve enterprise information integration (EII) challenges by complementing text on web pages with structured, linked open data from different sources. This allows for intelligently combining, integrating, and joining structured information across heterogeneous systems. A distributed, iterative, bottom-up integration approach using Linked Data may help solve the EII problem in large companies by taking a pay-as-you-go approach.
aRangodb, un package per l'utilizzo di ArangoDB con RGraphRM
Lingua talk: Italiano.
Descrizione:
In questo talk parleremo di come integrare e utilizzare ArangoDB, un database multi-modello con supporto nativo ai grafi, con R. Presenteremo quindi aRangodb, il package che abbiamo sviluppato per interfacciarsi in modo più semplice e intuitivo al database. Nel corso del talk mostreremo come il package possa essere utilizzato in ambito data science usando alcuni case studies concreti.
Speaker:
Gabriele Galatolo - Data Scientist - Kode srl
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesOntotext
This presentation will provide a brief introduction to logical reasoning and overview of the most popular semantic schema and ontology languages: RDFS and the profiles of OWL 2.
While automatic reasoning has always inspired the imagination, numerous projects have failed to deliver to the promises. The typical pitfalls related to ontologies and symbolic reasoning fall into two categories:
- Over-engineered ontologies. The selected ontology language and modeling patterns can be too expressive. This can make the results of inference hard to understand and verify, which in its turn makes KG hard to evolve and maintain. It can also impose performance penalties far greater than the benefits.
- Inappropriate reasoning support. There are many inference algorithms and implementation approaches, which work well with taxonomies and conceptual models of few thousands of concepts, but cannot cope with KG of millions of entities.
- Inappropriate data layer architecture. One such example is reasoning with virtual KG, which is often infeasible.
The document discusses the W3C stack for representing metadata, with XML providing syntax but no semantics, RDF and RDF Schema defining a data model for relations between resources and a vocabulary definition language, and OWL adding more expressivity with concepts such as classes, properties, and cardinality restrictions. It also covers RDF syntaxes like Turtle and XML, and how RDF can represent implied claims from XML and facilitate interoperability between systems through its abstract model.
Information access over linked data requires to determine
subgraph(s), in linked data's underlying graph, that correspond to the required information need. Usually, an information access framework is able to retrieve richer information by checking of a large number of possible subgraphs. However, on the ecking of a large number of possible subgraphs increases information access complexity. This makes information access frameworks less eective. A large number of contemporary linked data information access frameworks reduce the complexity by introducing dierent heuristics but they suer on retrieving richer information. Or, some frameworks do not care about the complexity. However, a practically usable framework should retrieve richer information with lower complexity. In linked data information access, we hypothesize that pre-processed data statistics of linked data can be used to eciently check a large number of possible subgraphs. This will help to retrieve comparatively richer information with lower data access complexity. Preliminary evaluation of our proposed hypothesis shows promising performance.
This presentation by Nathaniel Lane, Associate Professor in Economics at Oxford University, was made during the discussion “Pro-competitive Industrial Policy” held at the 143rd meeting of the OECD Competition Committee on 12 June 2024. More papers and presentations on the topic can be found at oe.cd/pcip.
This presentation was uploaded with the author’s consent.
This presentation by Professor Giuseppe Colangelo, Jean Monnet Professor of European Innovation Policy, was made during the discussion “The Intersection between Competition and Data Privacy” held at the 143rd meeting of the OECD Competition Committee on 13 June 2024. More papers and presentations on the topic can be found at oe.cd/ibcdp.
This presentation was uploaded with the author’s consent.
This presentation by Yong Lim, Professor of Economic Law at Seoul National University School of Law, was made during the discussion “Artificial Intelligence, Data and Competition” held at the 143rd meeting of the OECD Competition Committee on 12 June 2024. More papers and presentations on the topic can be found at oe.cd/aicomp.
This presentation was uploaded with the author’s consent.
This presentation by Katharine Kemp, Associate Professor at the Faculty of Law & Justice at UNSW Sydney, was made during the discussion “The Intersection between Competition and Data Privacy” held at the 143rd meeting of the OECD Competition Committee on 13 June 2024. More papers and presentations on the topic can be found at oe.cd/ibcdp.
This presentation was uploaded with the author’s consent.
This presentation by Tim Capel, Director of the UK Information Commissioner’s Office Legal Service, was made during the discussion “The Intersection between Competition and Data Privacy” held at the 143rd meeting of the OECD Competition Committee on 13 June 2024. More papers and presentations on the topic can be found at oe.cd/ibcdp.
This presentation was uploaded with the author’s consent.
This presentation by OECD, OECD Secretariat, was made during the discussion “Artificial Intelligence, Data and Competition” held at the 143rd meeting of the OECD Competition Committee on 12 June 2024. More papers and presentations on the topic can be found at oe.cd/aicomp.
This presentation was uploaded with the author’s consent.
XP 2024 presentation: A New Look to Leadershipsamililja
Presentation slides from XP2024 conference, Bolzano IT. The slides describe a new view to leadership and combines it with anthro-complexity (aka cynefin).
This presentation by OECD, OECD Secretariat, was made during the discussion “Competition and Regulation in Professions and Occupations” held at the 77th meeting of the OECD Working Party No. 2 on Competition and Regulation on 10 June 2024. More papers and presentations on the topic can be found at oe.cd/crps.
This presentation was uploaded with the author’s consent.
Why Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdfBen Linders
Psychological safety in teams is important; team members must feel safe and able to communicate and collaborate effectively to deliver value. It’s also necessary to build long-lasting teams since things will happen and relationships will be strained.
But, how safe is a team? How can we determine if there are any factors that make the team unsafe or have an impact on the team’s culture?
In this mini-workshop, we’ll play games for psychological safety and team culture utilizing a deck of coaching cards, The Psychological Safety Cards. We will learn how to use gamification to gain a better understanding of what’s going on in teams. Individuals share what they have learned from working in teams, what has impacted the team’s safety and culture, and what has led to positive change.
Different game formats will be played in groups in parallel. Examples are an ice-breaker to get people talking about psychological safety, a constellation where people take positions about aspects of psychological safety in their team or organization, and collaborative card games where people work together to create an environment that fosters psychological safety.
The importance of sustainable and efficient computational practices in artificial intelligence (AI) and deep learning has become increasingly critical. This webinar focuses on the intersection of sustainability and AI, highlighting the significance of energy-efficient deep learning, innovative randomization techniques in neural networks, the potential of reservoir computing, and the cutting-edge realm of neuromorphic computing. This webinar aims to connect theoretical knowledge with practical applications and provide insights into how these innovative approaches can lead to more robust, efficient, and environmentally conscious AI systems.
Webinar Speaker: Prof. Claudio Gallicchio, Assistant Professor, University of Pisa
Claudio Gallicchio is an Assistant Professor at the Department of Computer Science of the University of Pisa, Italy. His research involves merging concepts from Deep Learning, Dynamical Systems, and Randomized Neural Systems, and he has co-authored over 100 scientific publications on the subject. He is the founder of the IEEE CIS Task Force on Reservoir Computing, and the co-founder and chair of the IEEE Task Force on Randomization-based Neural Networks and Learning Systems. He is an associate editor of IEEE Transactions on Neural Networks and Learning Systems (TNNLS).
Carrer goals.pptx and their importance in real lifeartemacademy2
Career goals serve as a roadmap for individuals, guiding them toward achieving long-term professional aspirations and personal fulfillment. Establishing clear career goals enables professionals to focus their efforts on developing specific skills, gaining relevant experience, and making strategic decisions that align with their desired career trajectory. By setting both short-term and long-term objectives, individuals can systematically track their progress, make necessary adjustments, and stay motivated. Short-term goals often include acquiring new qualifications, mastering particular competencies, or securing a specific role, while long-term goals might encompass reaching executive positions, becoming industry experts, or launching entrepreneurial ventures.
Moreover, having well-defined career goals fosters a sense of purpose and direction, enhancing job satisfaction and overall productivity. It encourages continuous learning and adaptation, as professionals remain attuned to industry trends and evolving job market demands. Career goals also facilitate better time management and resource allocation, as individuals prioritize tasks and opportunities that advance their professional growth. In addition, articulating career goals can aid in networking and mentorship, as it allows individuals to communicate their aspirations clearly to potential mentors, colleagues, and employers, thereby opening doors to valuable guidance and support. Ultimately, career goals are integral to personal and professional development, driving individuals toward sustained success and fulfillment in their chosen fields.
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...Suzanne Lagerweij
This is a workshop about communication and collaboration. We will experience how we can analyze the reasons for resistance to change (exercise 1) and practice how to improve our conversation style and be more in control and effective in the way we communicate (exercise 2).
This session will use Dave Gray’s Empathy Mapping, Argyris’ Ladder of Inference and The Four Rs from Agile Conversations (Squirrel and Fredrick).
Abstract:
Let’s talk about powerful conversations! We all know how to lead a constructive conversation, right? Then why is it so difficult to have those conversations with people at work, especially those in powerful positions that show resistance to change?
Learning to control and direct conversations takes understanding and practice.
We can combine our innate empathy with our analytical skills to gain a deeper understanding of complex situations at work. Join this session to learn how to prepare for difficult conversations and how to improve our agile conversations in order to be more influential without power. We will use Dave Gray’s Empathy Mapping, Argyris’ Ladder of Inference and The Four Rs from Agile Conversations (Squirrel and Fredrick).
In the session you will experience how preparing and reflecting on your conversation can help you be more influential at work. You will learn how to communicate more effectively with the people needed to achieve positive change. You will leave with a self-revised version of a difficult conversation and a practical model to use when you get back to work.
Come learn more on how to become a real influencer!
This presentation by OECD, OECD Secretariat, was made during the discussion “The Intersection between Competition and Data Privacy” held at the 143rd meeting of the OECD Competition Committee on 13 June 2024. More papers and presentations on the topic can be found at oe.cd/ibcdp.
This presentation was uploaded with the author’s consent.
The Intersection between Competition and Data Privacy – OECD – June 2024 OECD...
Learning Commonalities in RDF
1. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Learning Commonalities in RDF
Sara El Hassad François Goasdoué Hélène Jaudoin
IRISA, Univ. Rennes 1, Lannion, France
ESWC 2017 28th May - 1st June 2017
1/24
2. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Introduction
Least general generalization (lgg)
• Machine Learning in the early 70’s by Gordon Plotkin
• Knowledge representation domain in the early 90’s
• Recently in semantic web
2/24
3. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Introduction
Least general generalization (lgg)
• Machine Learning in the early 70’s by Gordon Plotkin
• Knowledge representation domain in the early 90’s
• Recently in semantic web
Applications of lgg
• Social context : lgg of users descriptions (profiles)
• Research common graph patterns between of datasets
• Linked Data Cloud : links between datasets
2/24
4. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Introduction
Least general generalization (lgg)
• Machine Learning in the early 70’s by Gordon Plotkin
• Knowledge representation domain in the early 90’s
• Recently in semantic web
Applications of lgg
• Social context : lgg of users descriptions (profiles)
• Research common graph patterns between of datasets
• Linked Data Cloud : links between datasets
Goal
To study the problem in the setting of the entire RDF standard
2/24
5. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Outline
Introduction
The Resource Description Framework
Finding commonalities between RDF graphs
Related work
Conclusion
3/24
6. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
RDF graphs
• Specification of RDF graphs with triples :
(s, p, o) ∈ (U ∪ B) × U × (U ∪ L ∪ B) s op
• Built-in property URIs to state RDF statements
RDF statement Triple
Class assertion (s, rdf:type, o)
Property assertion (s, p, o) with
p = rdf:type
4/24
7. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
RDF graphs
• Specification of RDF graphs with triples :
(s, p, o) ∈ (U ∪ B) × U × (U ∪ L ∪ B) s op
• Built-in property URIs to state RDF statements
RDF statement Triple
Class assertion (s, rdf:type, o)
Property assertion (s, p, o) with
p = rdf:type
b "LGG in RDF"
ConfPaper b1
hasTitle
τ hasContactAuthor
4/24
8. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Adding ontological knowledge to RDF graphs
• Built-in property URIs to state RDF Schema statements, i.e.,
ontological constraints.
RDFS statement Triple
Subclass (s, sc, o)
Subproperty (s, sp, o)
Domain typing (s, ←d , o)
Range typing (s, →r , o)
5/24
9. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Adding ontological knowledge to RDF graphs
• Built-in property URIs to state RDF Schema statements, i.e.,
ontological constraints.
RDFS statement Triple
Subclass (s, sc, o)
Subproperty (s, sp, o)
Domain typing (s, ←d , o)
Range typing (s, →r , o)
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthor
5/24
10. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Deriving the implicit triples
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthor
Figure: RDF graph G
11. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Deriving the implicit triples
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
Figure: RDF graph G
12. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Deriving the implicit triples
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
hasAuthor
Figure: RDF graph G
13. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Deriving the implicit triples
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
hasAuthor
τ
Figure: RDF graph G
14. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Deriving the implicit triples
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
hasAuthor
τ→r
Figure: RDF graph G
15. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Deriving the implicit triples
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
hasAuthor
τ→r←d
Figure: RDF graph G
6/24
16. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Deriving the implicit triples
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
hasAuthor
τ→r←d
Figure: RDF graph G
How to derive implicit triples of an RDF graph ?
6/24
18. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Semantics of RDF graphs
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthor
Figure: RDF graph G
19. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Semantics of RDF graphs
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
rdfs9 : (s, sc, o), (s1, τ, s) → (s1, τ, o)
Figure: RDF graph G
20. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Semantics of RDF graphs
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
rdfs7 : (p1, sp, p2), (s, p1, o) → (s, p2, o)
hasAuthor
τ
Figure: RDF graph G
21. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Semantics of RDF graphs
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
rdfs3 : (p, →r , o), (s1, p, o1) → (o1, τ, o)
hasAuthor
τ
hasAuthor
τ
Figure: RDF graph G
22. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Semantics of RDF graphs
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
ext4 : (p, sp, p1), (p1, →r , o) → (p, →r , o)
hasAuthor
τ
hasAuthor
τ→r τ
Figure: RDF graph G
23. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Semantics of RDF graphs
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
ext3 : (p, sp, p1), (p1, ←d , o) → (p, ←d , o)
hasAuthor
τ
hasAuthor
τ→r τ←d →r
Figure: RDF graph G
8/24
24. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Semantics of RDF graphs
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthorτ
hasAuthor
τ→r←d
Figure: Saturated RDF graph G∞
9/24
25. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Entailment between RDF graphs
Let G and G be two graphs RDF and R a set of RDF entailment rules.
There exists relationship to compare G and G called entailment between
graphs
G is more specific than G :
• G |=R G ⇐⇒ G∞
|= G
There must exist an embedding of G in G∞
.
10/24
26. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Entailment between RDF graphs
G
?
|=R G
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor
G
Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthor
b
b2
G
Publication
hasTitle
τ
27. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Entailment between RDF graphs
G∞
?
|= G
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor
G∞
Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthor
b
b2
G
Publication
hasTitle
ττ
hasAuthor
τ→r←d
28. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Entailment between RDF graphs
G∞
?
|= G
b "LGG in RDF"
ConfPaper hasContactAuthor b1
Publication hasAuthor
G∞
Researcher
hasTitle
τ
sc sp
→r←d
hasContactAuthor
b
b2
G
Publication
hasTitle
ττ
hasAuthor
τ→r←d
b
Publication
"LGG in RDF"hasTitle
τ
RDF graph G is more specific than RDF graph G
11/24
29. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Outline
Introduction
The Resource Description Framework
Finding commonalities between RDF graphs
Related work
Conclusion
12/24
30. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Towards defining lgg in RDF
A least general generalization (lgg) of n descriptions d1, . . . , dn is a most
specific description d generalizing every d1≤i≤n for some
generalization/specialization relation between descriptions (G.Plotkin).
lgg in RDF
• descriptions are RDF graphs
• relation generalization/specialization is entailment between RDF
graphs
13/24
31. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Defining the lgg of RDF graphs
Definition (lgg of RDF graphs)
Let G1, . . . , Gn be RDF graphs and R a set of RDF entailment rules.
• A generalization of G1, . . . , Gn is an RDF graph Gg such that
Gi |=R Gg holds for 1 ≤ i ≤ n.
• A least general generalization (lgg) of G1, . . . , Gn is a generalization
Glgg of G1, . . . , Gn such that for any other generalization Gg of
G1, . . . , Gn, Glgg |=R Gg holds.
Result : lgg of n RDF graphs vs lgg of two RDF graphs
3(G1, G2, G3) ≡R 2( 2(G1, G2), G3)
· · · · · ·
n(G1, . . . , Gn) ≡R 2( n−1(G1, . . . , Gn−1), Gn)
≡R 2( 2(· · · 2( 2(G1, G2), G3) · · · , Gn−1), Gn)
14/24
32. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Defining the lgg of RDF graphs
Definition (lgg of RDF graphs)
Let G1, . . . , Gn be RDF graphs and R a set of RDF entailment rules.
• A generalization of G1, . . . , Gn is an RDF graph Gg such that
Gi |=R Gg holds for 1 ≤ i ≤ n.
• A least general generalization (lgg) of G1, . . . , Gn is a generalization
Glgg of G1, . . . , Gn such that for any other generalization Gg of
G1, . . . , Gn, Glgg |=R Gg holds.
Result : lgg of n RDF graphs vs lgg of two RDF graphs
3(G1, G2, G3) ≡R 2( 2(G1, G2), G3)
· · · · · ·
n(G1, . . . , Gn) ≡R 2( n−1(G1, . . . , Gn−1), Gn)
≡R 2( 2(· · · 2( 2(G1, G2), G3) · · · , Gn−1), Gn)
We focus on computing lgg of two RDF graphs
14/24
33. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Defining the lgg of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
G1 G2
34. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Defining the lgg of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
G1 G2
bi1i2
SA Researcher
bDC bCPJP
Publication
title τ
τ
hasAuthor
τ
sc
Glgg
15/24
35. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Defining the lgg of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
G1 G2
bi1i2
SA Researcher
bDC bCPJP
Publication
title τ
τ
hasAuthor
τ
sc
Glgg
How to compute this graph ?
15/24
36. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
The cover graph of RDF graphs
Definition (Cover graph)
The cover graph G of two RDF graph G1 and G2 is the RDF graph such
that for every property p in both G1 and G2 :
(t1, p, t2) ∈ G1 and (t3, p, t4) ∈ G2 iff (t5, p, t6) ∈ G
with t5 = t1 if t1 = t3 and t1 ∈ U ∪ L, else t5 is the blank node bt1t3
, and,
similarly t6 = t2 if t2 = t4 and t2 ∈ U ∪ L, else t6 is the blank node bt2t4
.
16/24
37. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
The cover graph of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
G1 G2
38. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
The cover graph of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
i2
ResearcherVV
Publication
JourPaper"CwFOL"
hasAuthor
τ τ
τ
τ
sctitle
G1 G2
bi1i2 bSAVVhasAuthor
39. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
The cover graph of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
i2
ResearcherVV
Publication
JourPaper"CwFOL"
hasAuthor
τ τ
τ
τ
sctitle
SA
hasAuthor
G1 G2
bi1i2 bSAVVhasAuthor bSAVVhasAuthor
SA
hasAuthor
40. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
The cover graph of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
i2
ResearcherVV
Publication
JourPaper"CwFOL"
hasAuthor
τ τ
τ
τ
sctitle
G1 G2
bi1i2 bSAVVhasAuthor bSAVVhasAuthor
SA
hasAuthor
bi1i2
SA
hasAuthor
bSAVV
Researcher
τ
41. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
The cover graph of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
i2
ResearcherVV
Publication
JourPaper"CwFOL"
hasAuthor
τ τ
τ
τ
sctitle
G1 G2
bi1i2 bSAVVhasAuthor bSAVVhasAuthor
SA
hasAuthor
bi1i2
SA
hasAuthor
bSAVV
Researcher
τ
bSAVV
Researcher
τ
bDC
title
bi1i2
42. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
The cover graph of RDF graphs
i1
SA Researcher
Publication
ConfPaper"DiD"
hasAuthor
τ
τ
τ
sctitle
hasAuthor
hasContactAuthor
i2
SAResearcherVV
Publication
JourPaper"CwFOL"
sp
hasAuthorhasContactAuthorhasAuthor
τ τ
τ
τ
sctitle
G1 G2
bi1i2 bSAVVhasAuthor bSAVVhasAuthor
SA
hasAuthor
bi1i2
SA
hasAuthor
bSAVV
Researcher
τ
bSAVV
Researcher
τ
bDC
title
bi1i2
bi1i2 bSAVV bPR
SA Researcherτ bi1SA bCPR
bi1VV
bDC bPJP bCPP bCPJP Publication
title τττ τ
hasAuthor
hasAuthor
τ
sc
τ
τ τ
τ
bRP bSAi2 bRJPτ τ
17/24
43. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Cover graph vs lgg
Theorem (R = ∅)
The cover graph G of the RDF graphs G1 and G2 is an lgg of them for
the empty set R of RDF entailment rules (i.e., R = ∅).
18/24
44. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Cover graph vs lgg
Theorem (R = ∅)
The cover graph G of the RDF graphs G1 and G2 is an lgg of them for
the empty set R of RDF entailment rules (i.e., R = ∅).
Proposition (R = ∅)
The cover graph of two RDF graphs G1 and G2 can be computed in
O(|G1| × |G2|) ; its size is bounded by |G1| × |G2|.
18/24
45. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Cover graph vs lgg
Theorem (R = ∅)
The cover graph G of the RDF graphs G1 and G2 is an lgg of them for
the empty set R of RDF entailment rules (i.e., R = ∅).
Proposition (R = ∅)
The cover graph of two RDF graphs G1 and G2 can be computed in
O(|G1| × |G2|) ; its size is bounded by |G1| × |G2|.
Theorem (R = ∅)
Let G1 and G2 be two RDF graphs, and R a set of RDF entailment rules.
The cover graph G of G∞
1 and G∞
2 is an lgg of G1 and G2.
18/24
46. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Cover graph vs lgg
Theorem (R = ∅)
The cover graph G of the RDF graphs G1 and G2 is an lgg of them for
the empty set R of RDF entailment rules (i.e., R = ∅).
Proposition (R = ∅)
The cover graph of two RDF graphs G1 and G2 can be computed in
O(|G1| × |G2|) ; its size is bounded by |G1| × |G2|.
Theorem (R = ∅)
Let G1 and G2 be two RDF graphs, and R a set of RDF entailment rules.
The cover graph G of G∞
1 and G∞
2 is an lgg of G1 and G2.
Corollary (R = ∅)
An lgg of two RDF graphs G1 and G2 can be computed in
O(|G∞
1 | × |G∞
2 |) and its size is bounded by |G∞
1 | × |G∞
2 |.
18/24
47. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Cover graph vs lgg
bi1i2 bSAVV
SA Researcherτ bi1SA bCPR
bi1VV
bDC bCPJP Publication
title τ
hasAuthor
τ
sc
τ
τ
bSAi2 bRJPτ
bi1i2 bSAVV bPR
SA Researcherτ bi1SA bCPR
bi1VV
bDC bPJP bCPP bCPJP Publication
title τττ τ
hasAuthor
hasAuthor
τ
sc
τ
τ τ
τ
bRP bSAi2 bRJPτ τ
19/24
48. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Outline
Introduction
The Resource Description Framework
Finding commonalities between RDF graphs
Related work
Conclusion
20/24
49. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Related work
Structural based approach
• Description Logics EL
- F. Baader and al. :Computing least common subsumers in
description logics with existential restrictions.In IJCAI, 1999.
- B. ZarrieB and al. :Most specific generalizations w.r.t. general
EL-TBoxes.In IJCAI, 2013.
• RDF
• SPARQL : tree queries
- J. Lehmann and L. Buhmann. Autosparql : Let users query your
knowledge base. In ESWC, 2011.
• Rooted graphs, ignore RDF entailment :
- S. Colucci and al. :Defining and computing least common subsumers
in RDF.J. Web Semantics, 39(0), 2016.
Independent structure approach
• Conceptual Graphs
- M. Chein and M. Mugnier.Graph-based Knowledge Representation -
Computational Foundations of Conceptual Graphs.Springer, 2009.
21/24
50. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Conclusion
• Revisit the problem of computing a least general generalization in
the entire setting of RDF.
• Algorithms to compute lggs of small-to-huge RDF graphs.
• Memory
• Data management system
• MapReduce
• Perspective : Heuristics in order to compute lgg without redundants
triples.
22/24
51. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
Thank you !
Questions ?
23/24
52. Introduction The Resource Description Framework Finding commonalities between RDF graphs Related work Conclusion
References I
[1] F. Baader, R. Kiisters, and R. Molitor.
Computing least common subsumers in description logics with existential restrictions.
In IJCAI, 1999.
[2] F. Baader, B. Sertkaya, and A.-Y. Turhan.
Computing the least common subsumer w.r.t. a background terminology.
Journal of Applied Logic, 5(3), 2007.
[3] M. Chein and M. Mugnier.
Graph-based Knowledge Representation - Computational Foundations of Conceptual Graphs.
Springer, 2009.
[4] S. Colucci, F. Donini, S. Giannini, and E. D. Sciascio.
Defining and computing least common subsumers in RDF.
J. Web Semantics, 39(0), 2016.
[5] S. Colucci, F. M. Donini, and E. D. Sciascio.
Common subsumbers in RDF.
In AI*IA, 2013.
[6] J. Lehmann and L. Bühmann.
Autosparql : Let users query your knowledge base.
In ESWC, 2011.
[7] RDF 1.1 semantics.
https://www.w3.org/TR/rdf11-mt/.
[8] B. Zarrieß and A. Turhan.
Most specific generalizations w.r.t. general EL-TBoxes.
In IJCAI, 2013.
24/24