The document discusses year 2 deliverables for work packages 9 and 10 of the LOD2 project. It summarizes reports on improvements made to the Publicdata.eu portal including upgrades to CKAN and new features. Next steps include further technical enhancements to Publicdata.eu and engaging communities of data publishers and users. Deliverables from the Serbian CKAN team established their data portal and infrastructure. The Polish Ministry of Economy requirements analysis identified needs for publishing their data as linked open data.
http://lod2.eu/BlogPost/webinar-series
This webinar in the course of the LOD2 webinar series will present release 3.0 of the LOD2 stack, which contains updates to:
*) Virtuoso 7 [Openlink]: the original row store of the Virtuoso 6 universal server has now been replaced by a column store, which significantly increases the performance of SPARQL queries; the store is now up to three times as fast as the previous major version.
*) Linked Open Data Manager Suite [SWC]: the 'lodms' application allows the user to quickly set up pipelines for transforming linked data through the use of its many extensions. It also supports operations for extracting RDF from other types of data.
*) dbpedia-spotlight-ui [ULEI]: a graphical user interface component that allows the user to use a remote DBpedia spotlight instance to annotate a text with DBpedia concepts.
*) sparqlify [ULEI]: a scalable SPARQL-SQL rewriter, allowing you to query an SQL database as if it were a triple store.
*) SIREn [DERI]: a Lucene plugin that allows you to efficiently index and query RDF, as well as any textual document with an arbitrary number of metadata fields.
*) CubeViz [ULEI]: CubeViz allows visualization of the Data Cube linked data representation of statistical data. It has support for the more advanced DataCube features, such as slices. It also allows the selection of a remote SPARQL endpoint and export of a modified cube.
*) R2R [UMA]: the R2R mapping API is now included directly in the LOD2 demonstrator application, allowing users to experience the full effect of the R2R semantic mapping language through a graphical user interface.
*) ontowiki-csvimport [ULEI]: an OntoWiki extension that transforms CSV files to RDF. The extension can create Data Cubes that can be visualized by CubeViz.
If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services and concrete use cases that can be realised using LOD then join us in the free LOD2 webinar series!
UnifiedViews is a joint project currently maintained by Semantic Web Company (SWC) and Semantica.cz. It was mainly developed at Charles University in Prague as a student project called ODCleanStore (version 2). It builds on the experience SWC gained with the LOD Management Suite (LODMS) used in WP7 and on ODCleanStore (version 1), developed by Charles University in Prague for the WP9a use case of the LOD2 FP7 project. In the next release of the LOD2 stack, UnifiedViews will replace LODMS as the ETL tool, and it has already been adopted in other projects.
In the webinar we will give a brief overview of the UnifiedViews project (Helmut Nagy). The main part will be a presentation of the tool and its capabilities (Tomas Knap).
In this webinar Lorenz Bühmann presents the ontology repair and enrichment tool ORE and the DL-Learner, a machine learning tool that solves supervised learning tasks and supports knowledge engineers in constructing knowledge. These two neighboring tools in the LOD2 Stack are used for classification and the subsequent quality analysis of Linked Data.
In this webinar Michael Martin presents CubeViz, a faceted browser for statistical data that uses the RDF Data Cube vocabulary, the state of the art in representing statistical data in RDF. This vocabulary is compatible with SDMX and is increasingly being adopted. Based on the vocabulary and the encoded Data Cube, CubeViz generates a faceted browsing widget that can be used to interactively filter the observations to be visualized in charts. Based on the selected structure, CubeViz offers suitable chart types and options from which users can choose.
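The faceted filtering described above can be illustrated with a minimal sketch. It does not use CubeViz or any RDF library; the observations, the dimension names `year` and `country`, and both helper functions are hypothetical and only mirror the idea of a Data Cube whose observations are narrowed down by selecting dimension values.

```python
from collections import defaultdict

# Hypothetical observations: two dimensions (year, country) and one measure.
observations = [
    {"year": 2010, "country": "DE", "population": 81.8},
    {"year": 2010, "country": "AT", "population": 8.4},
    {"year": 2011, "country": "DE", "population": 80.3},
    {"year": 2011, "country": "AT", "population": 8.4},
]
DIMENSIONS = ("year", "country")


def facet_values(obs, dimensions):
    """Collect the distinct values of each dimension (the facet options)."""
    values = defaultdict(set)
    for o in obs:
        for d in dimensions:
            values[d].add(o[d])
    return {d: sorted(v) for d, v in values.items()}


def filter_observations(obs, selection):
    """Keep observations whose value is allowed for every selected dimension."""
    return [
        o for o in obs
        if all(o[d] in allowed for d, allowed in selection.items())
    ]


facets = facet_values(observations, DIMENSIONS)
selected = filter_observations(observations, {"country": {"DE"}})
for o in selected:
    print(o["year"], o["country"], o["population"])
```

Dimensions left out of the selection are unconstrained, which is how a facet widget usually behaves before the user picks a value.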
LOD2 is a 4-year European Commission project comprising Linked Data researchers and companies from 12 countries. The project aims to integrate Linked Data into existing large-scale applications in media, publishing, corporate intranets, and eGovernment. The webinar series offers monthly free webinars on tools and services for acquiring, editing, composing, connecting, and publishing Linked Data.
The document discusses a webinar presented by LOD2 on creating knowledge from interlinked data. It describes LOD2 as an EU-funded project involving leading linked open data organizations. The webinar agenda includes discussing SIREn, a plugin for Elasticsearch that allows indexing and searching of JSON documents. It provides an overview of Elasticsearch and describes how to install SIREn, create an index, index documents, and perform searches on nested JSON data.
This webinar in the course of the LOD2 webinar series will present use cases and live demos of D2R (Free University Berlin) and Sparqlify (University of Leipzig).
D2R Server is a tool for publishing relational databases on the Semantic Web. It enables RDF and HTML browsers to navigate the content of the database, and allows applications to query the database using the SPARQL query language.
Sparqlify is a tool enabling one to define expressive RDF views on relational databases and query them with a subset of the SPARQL query language. By featuring a novel RDF view definition syntax, it aims at simplifying the RDB-RDF mapping process.
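As a rough illustration of what an RDF view on a relational table means, the following sketch maps rows of a hypothetical `person` table to triples. It is not Sparqlify's view definition syntax, and unlike Sparqlify it does not rewrite SPARQL to SQL; it simply materializes the triples, which is a simpler technique chosen for brevity. The namespace `http://example.org/` and all table and column names are assumptions.

```python
import sqlite3

BASE = "http://example.org/"  # hypothetical namespace for illustration


def rows_to_triples(conn):
    """Map each row of the hypothetical 'person' table to two triples."""
    triples = []
    query = "SELECT id, name, city FROM person ORDER BY id"
    for pid, name, city in conn.execute(query):
        subject = f"<{BASE}person/{pid}>"  # URI built from the primary key
        triples.append((subject, f"<{BASE}name>", f'"{name}"'))
        triples.append((subject, f"<{BASE}city>", f'"{city}"'))
    return triples


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO person VALUES (?, ?, ?)",
    [(1, "Ada", "Leipzig"), (2, "Bob", "Berlin")],
)

triples = rows_to_triples(conn)
for s, p, o in triples:
    print(s, p, o, ".")
```

A rewriting approach avoids this materialization step by translating each query at request time, so the relational database remains the single source of the data.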
More can be found at: http://lod2.eu/BlogPost/webinar-series
Soren Auer - LOD2 - creating knowledge out of Interlinked Data (Open City Foundation)
The document discusses the LOD2 project which aims to create knowledge from interlinked open data. It focuses on very large RDF data management, knowledge enrichment through interlinking data from different sources, and developing semantic user interfaces. The project uses use cases in media, enterprise, open government data, and public sector contracts. The goal is to develop an integrated Linked Data lifecycle management stack.
This document summarizes the EU-funded LOD2 project which aims to create knowledge from interlinked open data. The 4 year project has a budget of €8.58 million and involves 10 partners from 7 European countries. The project seeks to address problems of accessing structured data on the current web by complementing text on web pages with structured linked open data from different sources. It also describes use cases for applying linked data technologies in media, publishing, enterprise applications and open government data.
1) Sebastian Hellmann presented on Linked Open Data and Natural Language Processing.
2) He discussed DBpedia and using it for NLP tasks like named entity recognition.
3) Hellmann proposed ways for the ULI and AKSW to collaborate on projects like adding CLDR data to the Linguistic Linked Open Data cloud and creating open benchmarks.
This webinar in the course of the LOD2 webinar series will present Zemanta and its LODRefine, a LOD-enabled version of OpenRefine (previously Google Refine), which is part of the LOD2 stack. LODRefine extends the cleansing and linking functionality of OpenRefine by providing means to reconcile and augment your data with DBpedia or any other SPARQL endpoint, extract named entities using the Zemanta API, export data in one of the RDF formats, and, more recently, exploit available crowdsourcing services. In the webinar we will walk through several tasks that demonstrate the ease of use and versatility of LODRefine.
If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services and concrete use cases that can be realised using LOD then join us in the free LOD2 webinar series: http://lod2.eu/BlogPost/webinar-series
LOD2 plenary meeting in Paris: presentation of WP5: State of Play: Linked Data Visualization, Browsing and Authoring, by Renaud Delbru (National University of Ireland, Galway).
The LOD2 project aims to improve tools for publishing Linked Open Data. It comprises researchers and companies from 12 countries who are developing an integrated stack of Linked Data tools. The stack demonstrates the benefits of Linked Data in media/publishing, corporate intranets, and eGovernment. It provides monthly webinars on tools for acquiring, editing, composing, and publishing Linked Data.
Slides of the presentation by Hugh Williams of OpenLink Software in the course of the LOD2 webinar: Virtuoso Universal Server on 20.12.2011. For more information please see: http://lod2.eu/BlogPost/webinar-series
This document discusses standardizing data on the web. It notes that data exists in many formats, from informal to curated, and machine to human readable. W3C has focused on integrating data at web scale using standards like RDF, SPARQL, and Linked Data principles. However, converting all data to RDF has challenges. Much data exists as CSV, JSON, XML and does not need full integration. The reality is data on the web is messy with many formats. Developers see converting data as too complex. The document discusses providing tools to publish Linked Data easily, or focusing on raw data without RDF. It notes different approaches can coexist and discusses a workshop on open data formats.
This document discusses publishing public contract data as Linked Data. It begins by introducing Linked Data and its key principles of using URIs to identify things, providing useful information about those URIs, and including links to other related URIs. This allows data to be interconnected in a global data space on the Web. The document then discusses benefits of publishing the TED public contracts database as Linked Data, such as enabling a unified view of related data and easy linking to external datasets. It also addresses challenges, such as how to identify contracting authorities consistently across notices. Finally, it outlines steps needed to adopt Linked Data principles for TED, such as extracting, storing and interlinking the data.
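The three principles named above, and the challenge of identifying contracting authorities consistently, can be sketched in a few lines. Everything here is hypothetical: the URIs under `http://example.org/` are not real TED identifiers, and the in-memory dictionary merely stands in for resolving a URI over HTTP.

```python
# Hypothetical in-memory "web of data": each URI identifies one thing and
# maps to a description that may link to other URIs.
DATA = {
    "http://example.org/notice/1": {
        "type": "ContractNotice",
        "title": "Road repair",
        "authority": "http://example.org/authority/42",
    },
    "http://example.org/notice/2": {
        "type": "ContractNotice",
        "title": "School catering",
        "authority": "http://example.org/authority/42",
    },
    "http://example.org/authority/42": {
        "type": "ContractingAuthority",
        "name": "City of Example",
    },
}


def dereference(uri):
    """Return the description published for a URI (empty if unknown)."""
    return DATA.get(uri, {})


def notices_by_authority(authority_uri):
    """Find all notices that link to the same contracting authority URI."""
    return sorted(
        uri for uri, desc in DATA.items()
        if desc.get("authority") == authority_uri
    )


notice = dereference("http://example.org/notice/1")
authority = dereference(notice["authority"])  # follow the link
print(authority["name"])
print(notices_by_authority("http://example.org/authority/42"))
```

Because both notices point to one shared authority URI rather than repeating a free-text name, a unified view across notices becomes a simple lookup.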
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data (Sören Auer)
Over the past 4 years, the Semantic Web activity has gained momentum with the widespread publishing of structured data as RDF. The Linked Data paradigm has therefore evolved from a practical research idea into a very promising candidate for addressing one of the biggest challenges of computer science: the exploitation of the Web as a platform for data and information integration. To translate this initial success into a world-scale reality, a number of research challenges need to be addressed: the performance gap between relational and RDF data management has to be closed, coherence and quality of data published on the Web have to be improved, provenance and trust on the Linked Data Web must be established and generally the entrance barrier for data publishers and users has to be lowered. This tutorial will discuss approaches for tackling these challenges. As an example of a successful Linked Data project we will present DBpedia, which leverages Wikipedia by extracting structured information and by making this information freely accessible on the Web. The tutorial will also outline some recent advances in DBpedia, such as the mappings Wiki, DBpedia Live as well as the recently launched DBpedia benchmark.
These slides were originally a tutorial presented for the SIG preceding the May 2009 meeting of the PRISM Forum.
They attempt to give a survey of the technologies, tools, and state of the world with respect to the Semantic Web as of the first half of 2009.
Automated interpretability of linked data ontologies: an evaluation within th... (Nuno Freire)
The publication and use of linked data have been actively pursued by cultural heritage institutions and service providers in this domain. Much research and cooperation is taking place in adapting and improving cultural heritage data models for linked data, in defining ontologies and vocabularies, and in setting up services based on linked data. This article presents an evaluation of ontologies and vocabularies published as linked data, which originate from the cultural heritage domain or are frequently used and linked to in this domain. Our study aims to evaluate their usability by crawlers operating on the web of data, according to the specifications and practices of linked data, the Semantic Web and ontology reasoning. The evaluation has in mind the use case of general data consumption applications based on RDF, RDF Schema, OWL, SKOS and the linked data guidelines. We evaluated twelve ontologies and vocabularies and found that four were not fully compliant, and that alignments between ontologies are not included in the definitions of the ontologies. This study contributes to the research on novel services consuming linked data. It also allows a better assessment of the automation that can be achieved in handling the variety and large volume of linked data when assessing the viability of new services based on linked data in cultural heritage.
A summary of DBpedia's History and a detailed analysis of challenges and solutions.
We show how the Linked Data Cloud evolved around DBpedia and also what problems we and other data projects encountered. We included a section on the new solutions that will lead DBpedia into a bright future.
Geoportal is an interface that enables search, portrayal, evaluation and sharing of spatial and non-spatial data based on interoperable standards. It helps create a distributed network of information and knowledge with spatial positions. The GeoPortal4Everybody solution provides open source components like a metadata catalog, map viewer, and content management system to build a geoportal that interconnects data from public, private, and social sources in compliance with INSPIRE and open standards. It aims to offer free access to spatial information for all based on open principles.
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric... (semanticsconference)
The document discusses NIF and its role in the FREME project. It summarizes that NIF is used as a pivot format for enrichment workflows in FREME. It allows for incremental enrichment of documents with annotations for entities, terminology, and machine translations. FREME uses standards like NIF, ITS 2.0, OntoLex lemon, and Okapi to make linked data and language technologies work together and enable round-tripping of content between tools and formats.
The document discusses Linked Open Data (LOD) and its applications in e-government and commercial publishing. It describes a LOD2 project demo application that allows searching legislative documents from the CELLAR database using SPARQL queries. Metadata about the documents, including licenses, can be retrieved in different formats. The documents and metadata can be integrated with other vocabularies like EUROVOC. This allows the content to be reused, with references to the original sources, in applications and products from commercial publishers.
The demo application shows how LOD allows more direct access to primary sources of content and metadata, and helps publishers enhance their offerings by linking to and reusing this open data.
The document describes the LOD2 project, an EU-funded collaborative project that aims to utilize the web as an integration platform for data and information by leveraging Linked Data technologies. The LOD2 project focuses on very large RDF data management, knowledge enrichment and interlinking, fusion and information quality, and adaptive semantic user interfaces. The project brings together various partners to develop an integrated LOD2 stack for the lifecycle management of Linked Data.
State of Play presentation at the LOD2 Plenary Vienna 2012: WP9A - LOD for a Distributed Marketplace for Public Sector Contracts by Vojtěch Svátek (UEP)
LOD2 is a large-scale European project that brings together Linked Open Data researchers and companies. The project aims to integrate Linked Data into existing applications and show benefits in media, publishing, corporate intranets, and eGovernment. 3DS Exalead is a partner in LOD2 and presented their CloudView platform, which enables search and analytics across structured and unstructured data through semantic techniques.
Project Description of the Linked Open Data (LOD) PILOT Austria - presented at the PiLOD event at VU Amsterdam (Netherlands) on 29.01. 2014 (see: http://www.pilod.nl/) by Martin Kaltenböck of Semantic Web Company.
This webinar in the course of the LOD2 webinar series will present the implications of Linked Open Data and Semantic Web Technologies in the information and publishing industry.
The publishing industry is struggling with too much information on the one hand and too few resources to bring meaning to this information on the other. As an industrial use case partner in LOD2, Wolters Kluwer Deutschland GmbH investigates in detail how LOD and the Semantic Web could solve this critical issue for its business. The presentation will show which parts of the LOD2 stack are used within the use case and which challenges had to be addressed in the last two years. Promising future areas such as natural language processing will also be mentioned. The topics covered are relevant for any industry that deals with a lot of data and documents, not only publishing.
This series will provide a monthly webinar about Linked (Open) Data tools and services around the LOD2 project, the LOD2 Stack and the Linked Open Data Life Cycle, also in the form of 3rd party tools. Please find continuously updated information here: http://lod2.eu/BlogPost/webinar-series
Open Data Conference - Sören Auer - Linked Open DataOpening-up.eu
The document summarizes the EU-FP7 LOD2 project, which aims to utilize the web as an integration platform for data and information. The project focuses on very large RDF data management, knowledge enrichment and interlinking, fusion and information quality, and adaptive semantic user interfaces. It develops a coherent LOD2 stack for linked data lifecycle management and applies linked data technologies across various use cases including media/publishing, enterprise data, open government data, and public sector contracts. The project runs from 2010-2014 with partners from various universities and companies.
The document describes the LOD2 project, an EU-funded collaborative project that aims to utilize the Web as an integration platform for data and information using Linked Data technologies. The project focuses on very large RDF data management, knowledge enrichment and interlinking, fusion and information quality, and adaptive semantic user interfaces. It brings together various partners to develop an integrated LOD2 Stack for the full Linked Data lifecycle and apply it across several use cases.
The document provides updates on Edina National Data Centre services and projects. Key points include:
- Digimap services added new map styles, formats and MasterMap data. Go-Geo! saw increased usage and new content categories.
- Projects like AddressingHistory and CHALICE aim to link historical maps and directories to create open, linked data gazetteers. A mobile scoping study evaluated delivering Digimap via mobile.
- Other activities included work on the Scottish Spatial Data Infrastructure and the ESDIN best practices network for INSPIRE compliance. The OpenStream service provides access to OS OpenData.
This document provides an introduction to linked data and open data. It discusses the evolution of the web from documents to interconnected data. The four principles of linked data are explained: using URIs to identify things, making URIs accessible, providing useful information about the URI, and including links to other URIs. The differences between open data and linked data are outlined. Key milestones in linked government data are presented. Formats for publishing linked data like RDF and SPARQL are introduced. Finally, the 5 star scheme for publishing open data as linked data is described.
LOD2 WP9a Overview Presentation: LOD2 for a Distributed Marketplace for Public Sector Contracts by Vojtěch Svátek, Jindřich Mynarz, Martin Nečaský(UEP) in the course of the LOD2 Plenary Meeting in Leuven, Belgium in September 2011.
Putting the L in front: from Open Data to Linked Open DataMartin Kaltenböck
Keynote presentation of Martin Kaltenböck (LOD2 project, Semantic Web Company) at the Government Linked Data Workshop in the course of the OGD Camp 2011 in Warsaw, Poland: Putting the L in front: from Open Data to Linked Open Data
Applying Linked Open Data to Public ProcurementJindřich Mynarz
This document summarizes the experiences of the LOD2 project in applying linked open data to public procurement. The LOD2 project developed tools like Strigil and ODCleanStore to extract, clean, and link procurement data. A mockup public contracts filing application was created to publish and matchmake procurement data. Visualizations were also created, like a heat map of Czech public spending and a network graph of German procurement flows. Challenges included unclear licensing, improving machine readability and entity reconciliation across countries. Next steps could involve working with the European Commission's Open Data Portal to further develop the vision of a distributed marketplace for public contracts.
State of Play presentation at the LOD2 Plenary Vienna 2012: WP9 publicdata.eu – Publishing Governmental Information as Linked Data by Irina Bolychevsky, OKFN
Web samia mehlem open data and wb main presentationGlobalForum
The document discusses open data and its benefits. Open data refers to data that is publicly available, machine-readable, and can be used, reused and redistributed without restrictions. Open data benefits governments and citizens by increasing transparency, accountability and engagement. It also enables innovation and economic growth. The document provides examples of how open data has been used to create business opportunities and jobs, improve public services, and develop apps for citizens. It emphasizes that successful open data initiatives require connecting data suppliers to users and engaging stakeholders across sectors through ongoing collaboration.
Slides of the presentation by Michael Martin (ULEI, INFAI) and Martin Kaltenböck (Semantic Web Company) at the OKCon2011 in Berlin on 30th of June 2011: The LOD2 Open Government Data Stakeholder Survey
This deliverable presents the data management plan for the ARCADIA project. This data management plan describes what kind of data is generated or collected in the ARCADIA project and how this data is published openly. A simple decision process is defined that classifies a result as either public or non-public. The publishing platforms used are the project website, the OwnCloud platform and GitHub for open-sourced code. All these platforms can be accessed openly.
D7.2. Dissemination and Standardisation PlanLinkedTV
This document presents the dissemination and standardization plan for the LinkedTV project. The plan outlines using the project website, social media, PR materials, conferences, and publications to disseminate project activities and results to target audiences including potential customers, industry collaborators, content owners, developers, and the research community. It also discusses participating in standards bodies like W3C, EBU, hbbTV, and ISO to standardize data models, APIs, and specifications developed by the project. The plan provides an initial set of dissemination actions and engagement with standards organizations over the project duration.
Lod2 review meeting
1. Creating Knowledge out of Interlinked Data
WP9: Use Case 3: LOD2 for Citizens
Luxembourg, Sep 14, 2012
Irina.Bolychevsky@okfn.org
Andreea.bonea@okfn.org
Krzysztof.wecel@i2g.pl
1
LOD2 Presentation . 02.09.2010 . Page http://lod2
2. Creating Knowledge out of Interlinked Data
Agenda
Year 2 Deliverables (OKFN)
D 9.1.1. Report on first release of the Publicdata.eu website
Improvements to Publicdata.eu during the past year
D 9.3.1. Presentation on publishing Linked Data
D 9.3.2. Guide to publishing Linked Data
D 9.4 Report on publication of eGovernment Linked Open Data
Addressing Y1 Review Comments
Next steps
D 9.2.1. Further technical improvements to Publicdata.eu (personalization features)
Community engagement with Publicdata.eu
Year 2 Deliverables (Serbian CKAN, Instytut Informatyki Gospodarczej)
D 9.5.1. Establishment of the Serbian CKAN
D 9.6. Requirements and Resources used by the Polish Ministry of Economy
Next steps
D 9.7.1. Adaptation of the LOD2 stack for Polish Ministry of Economy
LOD2 Event . 06.09.2010 2 Page
. http://lod2.eu
3. Creating Knowledge out of Interlinked Data
WP9 Objectives
“The purpose of this PublicData.eu use case is to increase public
access to high-value, machine-readable datasets generated by the
European, national as well as regional governments and public
administrations.”
4. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
(OKFN)
5. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D 9.1.1. First release of Publicdata.eu
Submitted a thorough report summarizing our work on Publicdata.eu (its existing features,
previous launches and plans for future improvements): http://svn.aksw.org/lod2/D9.1.1/
6. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
Publicdata.eu Overview
PublicData.eu is a pan-European data catalogue and federation mechanism, developed by
OKF as part of WP9. Based on the CKAN open-source data portal software, the site is a use
case for the citizen, aiming to make data as accessible and re-usable as possible. It is a
read-only aggregation of both official and community data portals across the EU.
7. Creating Knowledge out of Interlinked Data
Key Stats Year 2 Deliverables
Publicdata.eu provides robust search, filtering and previewing tools
It currently houses 17027 data sets, harvested from 18 data catalogues, and it
provides the option to browse data sets by top level categories
8. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
Technical improvements to Publicdata.eu during the past year
In March 2012 we upgraded PublicData.eu to CKAN version 1.6, adding the data preview
functionality (powered by Recline), improvements to search, interface improvements to
dataset pages, newly added resource (file) pages and group pages.
We also re-ran all the harvesters to obtain the most up-to-date set of datasets. Some
catalogues have been migrated to groups on thedatahub.org and therefore can't currently
be harvested without also including non-EU datasets. In the future we may resolve this by
extending the harvester to allow us to specify which groups or tags should be harvested.
This would allow us to import relevant datasets from thedatahub.org without importing
non-EU datasets.
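The group or tag restriction proposed above can be illustrated with a minimal sketch. It is not the CKAN harvester's actual interface: the record shape (plain dictionaries with `groups` and `tags` as lists of strings), the function name `select_for_harvest` and all dataset, group and tag names are assumptions made for illustration only.

```python
def select_for_harvest(datasets, groups=frozenset(), tags=frozenset()):
    """Keep only records that belong to a wanted group or carry a wanted tag.

    `datasets` is an iterable of dicts; the "groups" and "tags" keys are an
    assumed, simplified record shape, not a documented CKAN structure.
    """
    wanted_groups = set(groups)
    wanted_tags = set(tags)
    selected = []
    for record in datasets:
        record_groups = set(record.get("groups", []))
        record_tags = set(record.get("tags", []))
        # A record qualifies if it shares at least one group or one tag
        # with the requested filters.
        if record_groups & wanted_groups or record_tags & wanted_tags:
            selected.append(record)
    return selected


# Hypothetical catalogue listing; every name below is made up.
catalogue = [
    {"name": "example-eu-budget", "groups": ["country-uk"], "tags": ["economy"]},
    {"name": "example-non-eu-census", "groups": ["country-us"], "tags": ["census"]},
    {"name": "example-eu-transport", "groups": [], "tags": ["eu"]},
]

eu_only = select_for_harvest(catalogue, groups={"country-uk"}, tags={"eu"})
print([d["name"] for d in eu_only])
```

With both filters empty the function selects nothing, so a harvest run has to name the groups or tags it wants explicitly; whether that default suits a real harvester is a design decision this sketch does not settle.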
Many new CKAN instances have recently been launched by various countries, which we
plan to include in publicdata.eu for the intermediate launch (August 2013)
9. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
Addressing Year 1 Review meeting comments
Amount of RDF data in publicdata.eu
● We added the Serbian CKAN to Publicdata.eu, bringing in its RDF data
● Since Publicdata.eu is currently a read-only portal, we have focused on encouraging
the source catalogs to increase their RDF data
● Deliverables D 9.3.1 and D 9.3.2 facilitate this (presentation and guide on publishing
Linked Open Data)
● We worked with consortium partners to produce more RDF data for the eGovernment
report (covered in the next slides)
● In future launches we will allow users to add data, so that converted datasets can be
made accessible through publicdata.eu
● Additionally, we plan to improve our harvesting so that individual groups can be
harvested, widening the range of datasets that can be added to publicdata.eu (including
groups on thedatahub.org)
10. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D 9.3.1. Presentation on publishing Linked Open Data
Overview
In Feb 2012, OKFN put together a best-practices presentation on publishing,
linking and using Open Data. The presentation is easily accessible to a non-technical
audience and details the economic, transparency, policy, and efficiency benefits to
governments of publishing open data. Other aspects such as licensing, registering and
getting the data online are also covered.
It is intended as a detailed resource that anybody can use when referring to Linked
Open Data.
Current state
The presentation can be found here:
[http://svn.aksw.org/lod2/D9.3/D9.3.1/D9.3.1-presentation.pdf]
11. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D 9.3.2. Guide to publishing Linked Open Data
Overview
The Guide [http://svn.aksw.org/lod2/D9.3/D9.3.2/] is written in clear, non-technical
language and introduces the reader to the concepts, rationale and tools of Linked Open
Data, as well as providing a high-level overview of the publishing process. It has a
particular focus on public-sector data, and aims to arm decision-makers with an
understanding of Linked Data and the steps necessary to start publishing it.
Expected Impact
The guide will be published on the OKFN website [http://lod2.okfn.org/], where it is hoped
that it will become a standard reference document, helping organizations that need to
make decisions about whether and how to publish Linked Open Data.
12. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D 9.4. Report on publication of eGovernment Linked Open Data
Overview
The report [http://svn.aksw.org/lod2/D9.4/] summarizes our assessment of the
current state of linked data publishing by European governments and organizations.
Additionally, it highlights some of the work that LOD2 partners have been doing to
publish more linked data (Publink initiative, Guides and Documentation).
The report details some of the benefits of publishing linked open data as well as the
current technical and legal barriers preventing the publishing of more linked data and
our proposed approach to increasing the amount of high quality linked data
published during the next phase of the LOD2 project.
The report contains two Appendices:
Appendix A - a collection of 9 use cases, showcasing the benefits of LOD
Appendix B - presenting the LODStats system developed for high-performance
statistical analysis
13. Creating Knowledge out of Interlinked Data
Appendix A – Open Data releases
Dataset | Project URL | Triples
WHO’s Global Health Observatory | http://gho.aksw.org/ | 273k
European Digital Agenda Scoreboard | http://data.lod2.eu/scoreboard/ | 127k
National Accounts Linked Data for the UK and Serbia | http://ukstatistics.lod2.eu/ ; http://rs.ckan.net/dataset/rzs-national-accounts | 645k (UK); 10 million (Serbia)
World Bank Data as Linked Data | http://csarven.ca/statistical-linked-dataspaces | 165 million
German Labour Law & Courts Thesauri | http://vocabulary.wolterskluwer.de/ | 150k
German Federal Ministry of Finance | http://data.lod2.eu/gfmf/ | 2 million
UK public data sets | http://thedatahub.org/en/dataset/uk-gdp-since-1948 ; http://thedatahub.org/en/dataset/epims-lod2 ; http://thedatahub.org/en/dataset/uk-criminal-justice | 12 million (total)
LinkedGeoData | http://linkedgeodata.org | 20 billion
Wiktionary | http://wiktionary.dbpedia.org | 100 million
Czech tender data | http://ld.opendata.cz:8900/sparql | 1071859
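A triple count like those in the Triples column can in principle be requested from a SPARQL endpoint such as the one listed for the Czech tender data. The sketch below only builds the request URL and makes no network call. It assumes the endpoint accepts the standard SPARQL protocol `query` parameter over HTTP GET and supports SPARQL 1.1 aggregates; whether the endpoint is still online, and how the counts on this slide were actually obtained, is not stated in the deck. The helper name `count_query_url` is hypothetical.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# SPARQL 1.1 aggregate query that counts every triple the endpoint exposes.
COUNT_QUERY = "SELECT (COUNT(*) AS ?triples) WHERE { ?s ?p ?o }"


def count_query_url(endpoint):
    """Build a GET URL that sends COUNT_QUERY in the `query` parameter."""
    return endpoint + "?" + urlencode({"query": COUNT_QUERY})


# Endpoint URL taken from the table above; availability is not guaranteed.
url = count_query_url("http://ld.opendata.cz:8900/sparql")
print(url)

# Decoding the URL again recovers the original query text unchanged.
round_trip = parse_qs(urlsplit(url).query)["query"][0]
print(round_trip == COUNT_QUERY)
```

On large stores a full count can be expensive, so a real client would likely restrict the query to a named graph or rely on published statistics instead.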
EU-FP 7 LOD2 Project Overview . . http://lod2.
14. Creating Knowledge out of Interlinked Data
Next Steps (OKFN)
15. Creating Knowledge out of Interlinked Data
Next Steps
D 9.2.1. Further technical improvements to Publicdata.eu
Improvements scheduled for the Dec 2012 release (Further personalization features)
Dataset ratings
Allow users to add/revise their own data sets
User tools to enable mash-ups and visualization of data
App marketplace for users to upload their own visualizations, stories and apps
Allow user commenting on datasets
Activity streams and follow support (i.e. allowing users to subscribe to activity updates)
Social / sharing buttons
16. Creating Knowledge out of Interlinked Data
Next Steps
D 9.2.1. Further technical improvements to Publicdata.eu
Improvements scheduled for the Aug 2013 interim release
CKAN core technology improvements (Harvesting)
Optimize & automate the harvesting process
Add further harvesters (to increase the number of datasets and coverage)
Ability to only harvest changed data
Ability to harvest part of a site (e.g. a particular group vs whole catalog)
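The "only harvest changed data" item can be sketched as a comparison between the current remote listing and the state recorded at the previous run. This is an illustration under stated assumptions, not the planned CKAN implementation: it assumes each source catalogue exposes a per-dataset modification timestamp (CKAN records carry a `metadata_modified` field that could play this role), and the function name and all dataset names and timestamps are hypothetical.

```python
def changed_since_last_run(remote, last_seen):
    """Return the names of datasets that are new or modified.

    Both arguments map a dataset name to its modification timestamp as
    reported by the source catalogue (assumed to be an ISO 8601 string).
    """
    changed = []
    for name, modified in remote.items():
        # Unknown datasets are new; a differing stamp means an update.
        if last_seen.get(name) != modified:
            changed.append(name)
    return sorted(changed)


# Hypothetical state stored after the previous harvest run.
previous_run = {
    "dataset-a": "2012-03-01T10:00:00",
    "dataset-b": "2012-03-02T09:30:00",
}

# Hypothetical listing fetched from the source catalogue now.
current_listing = {
    "dataset-a": "2012-03-01T10:00:00",
    "dataset-b": "2012-08-15T14:00:00",
    "dataset-c": "2012-08-20T08:00:00",
}

to_fetch = changed_since_last_run(current_listing, previous_run)
print(to_fetch)
```

The sketch does not detect datasets deleted at the source; handling those would need a second pass over the names that appear only in the stored state.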
Additional features
Adding more advanced multilingual capabilities to the portal to support its Europe-wide
coverage
Add upgraded triple store and SPARQL endpoint
17. Creating Knowledge out of Interlinked Data
Next Steps
Community Engagement for Publicdata.eu
We can divide our community building and engagement strategy around PublicData.eu into
two main clusters: supply & demand
Objectives on the supply side:
Engage more with data publishers by building
(a) a stronger community of official representatives and data catalogue maintainers
around PublicData.eu and
(b) consensus around key legal and technical standards (e.g. making metadata
explicitly open, enabling data catalog interoperability)
Establish datacatalogs.org as the de facto place to go to find out about data catalogs
around the world - and encourage data catalog maintainers and other official contact
points to maintain up to date information about national, regional and local catalogs, and
lists of catalogs
18. Creating Knowledge out of Interlinked Data
Next Steps
Community Engagement for Publicdata.eu
Objectives on the demand side:
To build a stronger and better connected community of open data re-users from across
EU27 around PublicData.eu
Continue to identify and pursue opportunities to engage with the Linked Data community
and to use the LOD2 Stack to publish Linked Data derived from PublicData.eu.
We are hoping to achieve this by performing the following activities:
-Organize 2-4 OKFN Labs sprints per year on a variety of different topics and disseminate
results via press releases, media contacts and partners
-Promote PublicData.eu at events, workshops and hackdays across EU27
-Dissemination via blogs, guest posts and articles on third party sites, and press releases
19. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
(Serbian CKAN, Instytut Informatyki Gospodarczej)
20. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D 9.5.1 Establishing Serbian CKAN - Infrastructure for Public Sector Information
[Figure: Serbian CKAN infrastructure. Labels: Publicdata.eu; Serbian CKAN (http://rs.ckan.net); LOD2 RDF (http://elpo.stat.gov.rs/lod2/RS-DATA, http://elpo.stat.gov.rs/lod2/RS-DIC); code lists; XSLT; SORS online dissemination DB; Server 1, Server 2, Server 3. Arrows: search, import, publishing.]
EU-FP 7 LOD2 WP 10 – 22.-23.9.2011. P a ge
6 – 13.-14.09.2012. http://lod2.
21. Creating Knowledge out of Interlinked Data
- National accounts
- Prices
- Usage of ICT
- Science, Technology and Innovations
22. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D9.6 Requirements and resources used by the Polish Ministry of Economy
Goal
Identify the requirements of the Polish Ministry of Economy for publication of its data
Analyze changes of data over time, temporal and topical scope
Prepare for adoption of the LOD2 Stack for publication of the Ministry's data
Status
Delivered on time for M20
Work continues on Task 9.7
23. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
http://data.gov.pl – Current State
24. Creating Knowledge out of Interlinked Data
Need for data
25. Creating Knowledge out of Interlinked Data
Requirements – Querying
26. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D9.6 Key Points
INSIGOS
● Internet System for Business Information
● Access to statistical data concerning economy and foreign trade
– POLGOS - presentation of comparative data concerning the Polish economy
– HZ - information about Polish foreign trade
– ENERGY – mission: energy security
Challenges
- Multidimensional database (data warehouse)
- Possible linking to source for drilling down
- Not up to date – probably needs a supplementing process
- ENERGY – data spread across many poorly formatted Excel files
27. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D9.6 Key Points
CEIDG
– Central Register and Information on Economic Activity
• access to data concerning natural persons' businesses
• references to other registries
• ca. 2.9 million records
Challenges
- data is not clean
- available via API
- dynamic data set: ~1000 registration and ~1000 de-registration applications daily
- snapshots – evolution phase of LOD2 Lifecycle
Public procurement data
- Published in XML, volume of data in 2011 alone: 828 MB
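The snapshot approach for a dynamic register such as CEIDG can be illustrated by comparing two consecutive snapshots and deriving which records were registered, deregistered, or changed. The sketch below is an assumption about how such a comparison might look; the record ids and the `status` field are hypothetical and do not reflect the actual CEIDG API or schema.

```python
def diff_snapshots(previous, current):
    """Compare two snapshots keyed by record id.

    Returns three sets of ids: (added, removed, changed).
    """
    prev_ids, curr_ids = set(previous), set(current)
    added = curr_ids - prev_ids
    removed = prev_ids - curr_ids
    changed = {
        rid for rid in prev_ids & curr_ids
        if previous[rid] != current[rid]
    }
    return added, removed, changed


# Hypothetical register snapshots taken on two consecutive days.
monday = {
    "A1": {"status": "active"},
    "A2": {"status": "active"},
    "A3": {"status": "suspended"},
}
tuesday = {
    "A1": {"status": "active"},
    "A3": {"status": "active"},
    "A4": {"status": "active"},
}

added, removed, changed = diff_snapshots(monday, tuesday)
print("registered:", sorted(added))
print("deregistered:", sorted(removed))
print("changed:", sorted(changed))
```

With roughly a thousand registrations and deregistrations a day against about 2.9 million records, publishing only such a delta per snapshot would be far smaller than republishing the full register.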
28. Creating Knowledge out of Interlinked Data
Year 2 Deliverables
D9.6 Requirements - Summary
• No sophisticated tools used at MoE
• Groups of requirements
  » Data Acquisition – 7 requirements
  » Data Processing/Transformation – 2 requirements
  » Publication – 3 requirements
  » Data Analysis – 2 requirements
• Alignment with LOD2 Life Cycle
  » all 8 phases seem to be important but
• Alignment with LOD2 Stack
  » crucial components identified
  » D2R/Triplify, Virtuoso, CKAN, PoolParty, Ontowiki, Silk, Sigma
29. Creating Knowledge out of Interlinked Data
Next Steps (Instytut Informatyki Gospodarczej)
30. Creating Knowledge out of Interlinked Data
Next Steps
Task 9.7 Adoption of the LOD2 Stack for Polish economy data (I2G)
Goal
- adaptation of the LOD2 Stack to the requirements of the Polish Ministry of Economy
- identification of crucial components and how to configure and link them
Status
- first deliverable D9.7.1 scheduled for M30
- identification of existing functionalities in the working infrastructure
- first vocabularies linked using Silk Workbench
Next steps
- finishing and cleaning vocabulary
- design of data model using SDMX vocabulary
- filling in the model
- establishing the Polish CKAN
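To make the planned data model more concrete, the sketch below serialises one statistical observation as N-Triples using the RDF Data Cube vocabulary together with the SDMX dimension and measure vocabularies. It is an illustration under assumptions, not the model designed in Task 9.7: the base URI, dataset id, area code and value are hypothetical, and typing the reference period as a plain year literal is a simplification.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
QB = "http://purl.org/linked-data/cube#"
SDMX_DIM = "http://purl.org/linked-data/sdmx/2009/dimension#"
SDMX_MEASURE = "http://purl.org/linked-data/sdmx/2009/measure#"
XSD = "http://www.w3.org/2001/XMLSchema#"
BASE = "http://example.org/moe/"  # hypothetical namespace, not a real MoE URI


def observation_triples(dataset_id, area, period, value):
    """Build N-Triples lines describing a single qb:Observation."""
    obs = f"<{BASE}obs/{dataset_id}/{area}/{period}>"
    return [
        f"{obs} <{RDF_TYPE}> <{QB}Observation> .",
        f"{obs} <{QB}dataSet> <{BASE}dataset/{dataset_id}> .",
        f"{obs} <{SDMX_DIM}refArea> <{BASE}area/{area}> .",
        f'{obs} <{SDMX_DIM}refPeriod> "{period}"^^<{XSD}gYear> .',
        f'{obs} <{SDMX_MEASURE}obsValue> "{value}"^^<{XSD}decimal> .',
    ]


# Hypothetical observation: a foreign trade figure for Poland in 2011.
triples = observation_triples("foreign-trade", "PL", 2011, "135.8")
print("\n".join(triples))
```

Filling in the model then amounts to emitting one such observation per cell of the source tables and loading the result into the triple store.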
31. Creating Knowledge out of Interlinked Data
Thank you for your attention!
Editor's Notes
Hello everyone, my name is …. and I am the new PM overseeing some sections of WP9 from OKFN's side. This presentation covers the work that was spearheaded by OKFN, as well as Serbian CKAN (9.5) and the Requirements and Resources for the Polish Ministry of Economy (9.6). The slides were drafted to highlight some of the work that was done for WP9 during the course of this year. Slide 2 showcases the deliverables that are part of this WP.
Section 1 rehearses the arguments for Open Data (how governments are moving towards making their data freely available), whereas Section 2 provides a full non-technical explanation of Linked Data (concepts such as the 5 stars of LOD are presented). Section 3 covers the LOD life cycle (explaining high-level concepts such as RDF, schemas, triple stores and so on). Section 4 describes a step-by-step way to publish LOD and describes the tools in the LOD2 Stack. Section 5 presents some of the LOD2 case studies. The best known are the EC Financial Transparency System (all grants from the EC since 2007), the Global Health Observatory data set (statistics for monitoring public health), the Digital Agenda Scoreboard (shows the progress of countries in relation to the DAE), and the Legal Thesauri by Wolters Kluwer (a commercial publisher of legal information).
Several public authorities (such as the UK Government, in its White Paper, and EC Commissioner Neelie Kroes) acknowledge the benefits of LOD: organizations meet transparency requirements, and more meaning is given to data sets by placing them in context with other data sets. In this respect, the EC also funded the LATC project, which converted approximately 20 data sets over the past years.