Drupal Day 2011 - Thinking spatially with your open data | DrupalDay
Talk by Juan Arevalo & Marco Giacomassi | Drupal Day Roma 2011
The Open Data movement is now moving a step forward: many governments, institutions and businesses have recently started making information available to citizens and customers. Data is now seen as a powerful instrument to increase transparency in public administration and business policies. About 80% of this information has a spatial component that is not yet fully exploited. A range of open source solutions is now available to address this challenge; in this session we will explore their potential and possible applications. The so-called “data deluge” is here, but we can build good umbrellas. Please come to learn more about it!
Presentation by Chris Atherton (GEANT) and Andres Steijaert (OCRE) about the Open Clouds for Research Environments (OCRE) project and GÉANT's National Research and Education Networks and their infrastructure support for global cloud computing, given at the 4th GEO Data Technology Workshop.
Vienna, Austria
25th of April 2019
The programming language R is becoming increasingly popular among scientists in different research fields, in industry, and in journalism. Having started as a purely statistically oriented environment, current open-source development in R attracts researchers who need a wide variety of tools to tidy and understand the datasets they are using and to communicate their findings. Recently, we noticed an increasing interest in R as a language for data science in the province of Bolzano/Bozen and started a community - BolzanoR (https://www.bolzanor.eu/) - with the goal of informing about recent developments in R, openly sharing knowledge, creating synergies between researchers and openly disseminating news and activities. In short, to build a local community of R users.
What a data scientist inherently needs are reliable, accessible, and well-curated - preferably open - data sources. Here the link to the OpenDataHub Südtirol becomes evident. Researchers and data analysts will welcome the datasets hosted by OpenDataHub Südtirol as valuable assets. First, this talk presents the BolzanoR community. Then it imagines future links and activities between BolzanoR and the OpenDataHub Südtirol to make the most of possible synergies and build on the existing potential.
The Elastic blog [1] recently featured webLyzard’s Visual Exploration of Sustainability Communication with Elasticsearch, a project [2] to track global information flows. Customized for the United Nations Environment Programme, the resulting platform identifies opinion leaders and analyzes the public debate surrounding the UN’s Sustainable Development Goals (SDGs). Its custom-built dashboard [3] synchronizes multiple views in real time and uses aggregations to convey context information through a portfolio of visual tools.
Two of the webLyzard [4] co-founders will present a live demo of the platform and similar applications in other domains. They will discuss some of the underlying aggregations and their experience of recently migrating to Elasticsearch 6.5. The concluding outlook will show how predictive capabilities might help to anticipate mobility bottlenecks, support digital newsrooms, or maximize the impact of published content across social media channels.
[1] https://www.elastic.co/blog/weblyzards-visual-exploration-of-sustainability-communication-with-elasticsearch
[2] https://www.weblyzard.com/unep-live
[3] https://unep.ecoresearch.net
[4] https://www.weblyzard.com
Cloudflow – A Framework for MapReduce Pipeline Development in Biomedical Rese... | Lukas Forer
Cloudflow is a MapReduce pipeline framework based on a concept similar to FlumeJava or Apache Crunch. In contrast to these existing approaches, Cloudflow was developed to simplify pipeline creation in biomedical research, especially in the field of genetics. For that purpose, Cloudflow supports a variety of NGS data formats and contains a rich collection of built-in operations for analyzing such datasets (e.g. quality checks, mapping reads or variation calling).
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart... | BigData_Europe
Presentation at the Big Data Europe SC6 workshop #3 on 11.9.2017 in Amsterdam, co-located with the SEMANTiCS2017 conference: BDE Pilot Societal Challenge 6: CITIZEN BUDGET ON MUNICIPAL LEVEL by Martin Kaltenboeck (Semantic Web Company, SWC).
Field Data Collecting, Processing and Sharing: Using web Service Technologies | Niroshan Sanjaya
Collecting, distributing and analyzing field data is a crucial part of any geospatial study. Field data collection tools and methods have developed significantly thanks to advances in technologies such as Global Navigation Satellite Systems (GNSS) and smartphones. Accurate field data collection is also necessary for broad spatial data analysis and proper decision making. The development of Web technologies has made it possible to share data and information effectively. This study tries to develop a framework based on Geospatial Semantic Web technologies for disseminating and processing field data. Experimental results from an implemented prototype show that the proposed framework allows field data to be visualized and processed in any context. The system developed in this study is capable of distributing and processing field data through a web application. Moreover, the study demonstrates the importance and the capabilities of web services for spatial data gathering and processing. The system has been developed based on Free and Open Source Software (FOSS) packages such as ZOO-Project, Open Data Kit, etc. It enables users to further improve or deploy the system for a variety of studies.
PaNOSC Overview - ExPaNDS kick-off meeting - September 2019 | PaNOSC
This presentation gives an overview of the H2020 INFRAEOSC PaNOSC project, showcasing its activities and expected results, as well as its vision, i.e., to create a PaN scientific commons.
Data management plans – EUDAT Best practices and case study | www.eudat.eu | EUDAT
Presentation given by Stéphane Coutin during the PRACE 2017 Spring School joint training event with the EU H2020 VI-SEEM project (https://vi-seem.eu/) organised by CaSToRC at The Cyprus Institute. Science, and more specifically projects using HPC, is facing a digital data explosion. Instruments and simulations are producing ever larger volumes; data can be shared, mined, cited, preserved… Data are a great asset, but they face risks: we can run out of storage, we can lose them, they can be misused… To start this session, we will review why it is important to manage research data and how to do this by maintaining a Data Management Plan (DMP). This will be based on best practices from the EUDAT H2020 project and the European Commission recommendation. During the second part we will interactively draft a DMP for a given use case.
Big Data, Beyond the Data Center
Increasingly, the next scientific discoveries and the next innovative industrial breakthroughs will depend on the capacity to extract knowledge and sense from gigantic amounts of information. Examples range from processing data provided by scientific instruments such as CERN’s LHC; collecting data from large-scale sensor networks; grabbing, indexing and nearly instantaneously mining and searching the Web; building and traversing billion-edge social network graphs; to anticipating market and customer trends through multiple channels of information. Collecting information from various sources, recognizing patterns and distilling insights constitutes what is called the Big Data challenge. However, as the volume of data grows exponentially, the management of these data becomes proportionally more complex. A key challenge is to handle the complexity of data management on hybrid distributed infrastructures, i.e. assemblages of Clouds, Grids or Desktop Grids. In this talk, I will give an overview of our work in this research area, starting with BitDew, a middleware for large-scale data management on Clouds and Desktop Grids. Then I will present our approach to enabling MapReduce on Desktop Grids. Finally, I will present our latest results around Active Data, a programming model for managing the data life cycle on heterogeneous systems and infrastructures.
Putting the L in front: from Open Data to Linked Open Data | Martin Kaltenböck
Keynote presentation by Martin Kaltenböck (LOD2 project, Semantic Web Company) at the Government Linked Data Workshop during the OGD Camp 2011 in Warsaw, Poland: Putting the L in front: from Open Data to Linked Open Data
Big Data Europe at eHealth Week 2017: Linking Big Data in Health | BigData_Europe
Of the four V's of big data – Volume, Velocity, Variety and Veracity – the most challenging for the health sector is Variety. Health data comes from many sources, formats and standards – how can we bring these together to reap the benefits of big data technologies?
Big Data Europe is tackling this challenge head-on, building a big data infrastructure flexible enough to tackle all seven Societal Challenges identified by Horizon 2020. Here we demonstrate our pilot implementation of Open PHACTS, which integrates life science data for drug discovery.
12 May 2017
20140902 LinDa Workshop Semantics2014 - LinDA Project Overview | LinDa_FP7
LinDa Project presentation - Challenges, tools, workplan and objectives
Presentation at LinDA Workshop on 2nd September 2014 at Semantics2014 by Spiros Mouzakitis
Memory Management in BigData: A Perpective View | ijtsrd
The need for institutions in engineering, scientific research, health care, commerce, banking and computer research to perform complicated statistical analysis of big data is immense. However, widely used desktop software such as R, Excel, Minitab and SPSS limits a researcher's ability to deal with big data, and big data analytic tools such as IBM BigInsight, HP Vertica, SAP HANA and Pentaho come with overpriced licenses. Apache Hadoop is an open source distributed computing framework that uses commodity hardware. With this project, I intend to combine Apache Hadoop and R to develop an analytic platform that stores big data (using open source Apache Hadoop) and performs statistical analysis (using open source R software). Because a single machine can only be scaled vertically so far, data storage is handled by several machines, and so analysis becomes distributed over all these machines. This is where Apache Hadoop comes in handy. To store the massive quantities of data required by researchers, we could use commodity hardware and perform analysis in a distributed environment. Bhavna Bharti | Prof. Avinash Sharma, "Memory Management in BigData: A Perpective View", published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-4, June 2018, URL: http://www.ijtsrd.com/papers/ijtsrd14436.pdf http://www.ijtsrd.com/engineering/computer-engineering/14436/memory-management-in-bigdata-a-perpective-view/bhavna-bharti
Easy SPARQLing for the Building Performance Professional | Martin Kaltenböck
Slides of Martin Kaltenböck's (SWC) presentation at the SEMANTiCS2014 conference in Leipzig on 5th of September 2014 about the 'Tool for Building Energy Performance Scenarios' of GBPN (Global Buildings Performance Network, http://gbpn.org), a tool for predicting building performance worldwide by making use of Linked Open Data (LOD).
Webinar Industrial Data Space Association: Introduction and Architecture | Thorsten Huelsmann
The Industrial Data Space Association is an industry- and user-driven initiative to develop a global Industrial Data Space standard and reference architecture which provides data sovereignty. The work is based on use cases and supports certifiable software solutions and business models for the data economy. The webinar by Lars Nagel and Sebastian Steinbuss gives an overview of the Industrial Data Space initiative and explains the Reference Architecture and its main components.
Gergely Sipos (EGI): Exploiting scientific data in the international context ... | Gergely Sipos
Keynote presentation given at "The Emerging Technology Forum – Data Creates Universe - Scientific Data Innovation Conference" of the "Pujiang Innovation Forum 2021" event.
Hackathon for RELIANCE research communities.
Note: The hackathon was conducted using the old version of ROHub (http://www.rohub.org). The new portal is to be released at the end of 2021 (http://reliance.rohub.org).
Publication of INSPIRE-based agricultural linked data | Raul Palma
Results of the publication of linked data from the agriculture sector within the DataBio project, based on the agriculture data model developed in the FOODIE project
Wielkopolska activities with potential to cluster to cluster collaboration EU... | Raul Palma
We introduce the experiences and lessons learned in developing a smart agriculture infrastructure in the Wielkopolska region, and comment on potential gaps and opportunities for clustering collaborations
An INSPIRE-based vocabulary for the publication of Agricultural Linked Data | Raul Palma
The FOODIE project aims at building an open and interoperable specialized agricultural platform on the cloud for the management, discovery and large-scale integration of data relevant for farming production. In particular, the integration focuses on existing open datasets as well as their publication in Linked Data format in order to maximize their reusability and enable the exploitation of the extra knowledge derived from the generated links. Based on such data, for instance, the FOODIE platform aims at providing high-value applications and services supporting the planning and decision-making processes of different stakeholders in the agricultural domain. The keystone for data integration is the FOODIE data model, which has been defined by reusing and extending current standards and best practices, including data specifications from the INSPIRE directive, which are in turn based on the ISO/OGC standards for geographical information. However, as these data specifications are available as XML documents, the first step to publish Linked Data required transforming or lifting the FOODIE data model into a semantic format. In this paper, we describe this process, which was conducted semi-automatically by reusing existing tools and adhering to the mapping rules for transforming geographic information UML models to OWL ontologies defined by the ISO 19150-2 standard. We describe the challenges associated with this transformation and, finally, the generated ontology, which provides an INSPIRE-based vocabulary for the publication of Agricultural Linked Data.
Elevating Tactical DDD Patterns Through Object Calisthenics | Dorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Epistemic Interaction - tuning interfaces to provide information for AI support | Alan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -... | DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
JMeter webinar - integration with InfluxDB and Grafana | RTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
UiPath Test Automation using UiPath Test Suite series, part 3 | DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova... | Ramesh Iyer
In today's fast-changing business world, companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and a willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf | 91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do... | UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Neuro-symbolic is not enough, we need neuro-*semantic* | Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this is illustrated with link prediction over knowledge graphs, but the argument is general.
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Inspire hack 2017-linked-data
1. TEAM 2.5: Publication of linked data from pan-European geospatial datasets
INSPIRE Conference, Kehl, 5th Sept 2017
2. TEAM MEMBERS
• Dr. Raul Palma: coordinator of Semantic Technologies in the Network Services division at Poznan Supercomputing and Networking Center (PSNC), Poland.
• Soumya Brahma: data analyst in the Network Services division at PSNC. Technical support, use case analysis and monitoring, data analysis and integration.
• Dmitrij Kozuch
• Raitis Berzins
• Acknowledgments to Dr. Peter Haase and Dr. Johannes Trame (Metaphacts)
3. SUPPORTING PROJECTS
• DataBio - Data-Driven Bioeconomy
The main goal of the DataBio project is to show the benefits of Big Data technologies in the raw material production from agriculture, forestry and fishery for the bioeconomy industry to produce food, energy and biomaterials responsibly and sustainably.
DataBio proposes to deploy a state-of-the-art big data platform on top of the existing partners’ infrastructure and solutions.
• One of the main tasks of DataBio relates to Big Data Variety Management, Storage, Linked Data and Queries
H2020 - ICT-15-2016-2017; Big Data PPP: Large Scale Pilot actions in sectors best benefitting from data-driven innovation.
4. INPUT DATA
• Open Land Use (open dataset): a composite dataset intended to create detailed land-use maps of various regions, based on certain pan-European datasets such as CORINE Landcover and UrbanAtlas, enriched by available regional data (http://sdi4apps.eu/open_land_use/)
• Open Transport Map (open dataset): allows routing and visualization of traffic volumes for the whole EU. The underlying data come from OpenStreetMap and are accessible in a schema compatible with the INSPIRE Transport Network (http://opentransportmap.info/)
• Smart Point of Interest (open dataset): an open and seamless SPOI dataset, based on Linked Data principles, that contains over 27 million Points of Interest important for tourism from around the world (http://sdi4apps.eu/spoi/)
• Other datasets include: Urban atlas, Corine, Hilucs
6. USED SOFTWARE/TOOLS
• D2RQ for exposing relational databases as virtual RDF graphs
• RDF for the representation of data
• Ontologies providing the underlying vocabulary and relations
• Virtuoso for storing the semantic datasets
• SPARQL for querying semantic data
• Silk for discovery of links
• HSLayers NG for visualisation of data
• Metaphactory for visualisation of data
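As a rough illustration of how the SPARQL layer in this tool chain is typically addressed, the sketch below builds a SPARQL Protocol GET request that counts the triples in one named graph. The endpoint URL is a placeholder (the slides do not give the address of the Virtuoso endpoint); only the graph IRI http://w3id.org/foodie/olu# is taken from the slides. Nothing is sent over the network.

```python
from urllib.parse import urlencode

# Placeholder endpoint: the real endpoint address is not given in the slides.
ENDPOINT = "https://example.org/sparql"

# Count all triples in one named graph (graph IRI as listed in the slides).
QUERY = """
SELECT (COUNT(*) AS ?triples)
WHERE { GRAPH <http://w3id.org/foodie/olu#> { ?s ?p ?o } }
""".strip()


def build_sparql_get_url(endpoint: str, query: str) -> str:
    """Return a URL that passes the query in the standard 'query' parameter."""
    return f"{endpoint}?{urlencode({'query': query})}"


url = build_sparql_get_url(ENDPOINT, QUERY)
print(url)
```

The resulting URL could be fetched with any HTTP client; asking for the `application/sparql-results+json` media type in the Accept header is the usual way to get JSON results back.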
7. PROJECT IDEA AND RESULTS
• The project idea was to integrate relevant datasets and publish them as linked data. The following tasks were performed:
  • Massive transformation of data into semantic format (RDF) and collection of existing ones
  • Loading datasets in Virtuoso
  • Linking of datasets
  • Query building
  • Visualisation of data
• Results summary:
  • Creation of ontologies (open)
  • Virtuoso instance with over 700 million triples (open)
  • SPARQL endpoint (open)
  • Three different interfaces for navigating and visualising the data
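The first task, transforming relational data into RDF, can be pictured with a small sketch. This is not the D2RQ mapping the team used; it is a hand-written illustration of the general idea, assuming a hypothetical table row and hypothetical class and property names under the http://w3id.org/foodie/olu# namespace, and serializing to N-Triples.

```python
from typing import Dict, List

OLU_NS = "http://w3id.org/foodie/olu#"  # namespace listed in the slides
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def escape_literal(value: str) -> str:
    """Escape characters that must not appear raw in an N-Triples literal."""
    return (value.replace("\\", "\\\\")
                 .replace('"', '\\"')
                 .replace("\n", "\\n")
                 .replace("\r", "\\r"))


def row_to_ntriples(row: Dict[str, object], id_column: str, class_name: str) -> List[str]:
    """Turn one relational row into N-Triples lines (one subject per row)."""
    subject = f"<{OLU_NS}{class_name}/{row[id_column]}>"
    triples = [f"{subject} <{RDF_TYPE}> <{OLU_NS}{class_name}> ."]
    for column, value in row.items():
        if column == id_column or value is None:
            continue  # the key is already in the subject IRI; NULLs yield no triple
        triples.append(f'{subject} <{OLU_NS}{column}> "{escape_literal(str(value))}" .')
    return triples


# Hypothetical row; the column names and values are made up for illustration.
row = {"id": 42, "municipality": "Kehl", "landuse": None}
lines = row_to_ntriples(row, id_column="id", class_name="Parcel")
print("\n".join(lines))
```

A file of such lines is the kind of input a triple store's bulk loader accepts, which corresponds to the "loading datasets" step in the list above.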
8. NEW DATASETS
Dataset Name | Graph in FOODIE endpoint | Source | Triples
OLU** | http://w3id.org/foodie/olu# | Transformed from PostgreSQL | 127,925,971
SPOI | http://www.sdi4apps.eu/poi.rdf | Source provided by WRLS, modified and fixed before loading | 381,393,555
NUTS | http://nuts.geovocab.org/ | Open Source | 316,238
OTM*** | http://w3id.org/foodie/otm# | Transformed from PostgreSQL | 154,340,611
Hilucs classification | http://w3id.org/foodie/hilucs# | Transformed from PostgreSQL | 397
Urban Atlas* | http://w3id.org/foodie/atlas# | Transformed from PostgreSQL | 19,606,025
Corine* | http://w3id.org/foodie/corine# | Transformed from PostgreSQL | 16,777,533
Eurovoc | http://foodie-cloud.org/eurovoc | Open Source | 425,667
Emergel | http://foodie-cloud.org/emergel | CTIC | 256,239
* Selected subsets
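The per-graph triple counts listed above can be checked against the "over 700 million triples" figure in the results summary. The short sketch below simply adds up the nine counts as they appear in the slides.

```python
# Triple counts per dataset, copied from the slide table.
triple_counts = {
    "OLU": 127_925_971,
    "SPOI": 381_393_555,
    "NUTS": 316_238,
    "OTM": 154_340_611,
    "Hilucs classification": 397,
    "Urban Atlas": 19_606_025,
    "Corine": 16_777_533,
    "Eurovoc": 425_667,
    "Emergel": 256_239,
}

total = sum(triple_counts.values())
print(f"{total:,}")  # prints 701,042,236
```

The total of 701,042,236 triples is consistent with the figure quoted in the results summary.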
The generated ontologies are available from https://github.com/FOODIE-cloud/ontology
15. INSPIRE/GEOSS/COPERNICUS RELEVANCE
• This work demonstrates the potential uses and benefits of linked data with a geospatial dimension
• The work exploits results from Copernicus and INSPIRE
• The datasets generated are compliant with INSPIRE
16. REUSE
• The results of the work will be exploited in the DataBio project, and will serve as a showcase for pilots on the potential usage of linked data, and how it could be integrated and used with their pilot data
• The approach and results of this work will be leveraged and extended for tasks related to Big Data Variety Management, Storage, Linked Data and Queries
• The generated datasets could be reused in other projects dealing with geospatial data and semantic technologies
17. BUSINESS
• In the future, applications built on top of the linked datasets can become commercial services for different stakeholders. For instance:
  • Real estate agencies could use the datasets to show land parcels that are for sale, lie near major highways and have a school nearby
  • Tourist agencies can show hotels that lie near some point of interest and have a direct connection to airports or train stations
  • Farmers can see the densest land parcels nearby, where they could offer their products
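The "hotels near a point of interest" use case comes down to a proximity filter over point locations. On the platform itself this would presumably be expressed as a query against the linked datasets; the sketch below only illustrates the underlying idea in plain Python, using the haversine great-circle distance and made-up coordinates and names.

```python
import math
from typing import List, Tuple

Place = Tuple[str, float, float]  # (name, latitude, longitude) in decimal degrees


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres, using a mean Earth radius of 6371 km."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def places_within(center: Tuple[float, float], places: List[Place], radius_km: float) -> List[str]:
    """Return the names of places within radius_km of center, nearest first."""
    lat0, lon0 = center
    hits = []
    for name, lat, lon in places:
        distance = haversine_km(lat0, lon0, lat, lon)
        if distance <= radius_km:
            hits.append((distance, name))
    return [name for _, name in sorted(hits)]


# Made-up sample data: one point of interest and two hypothetical hotels.
poi = (48.5734, 7.8110)
hotels = [("Hotel A", 48.5750, 7.8150), ("Hotel B", 48.7000, 8.0000)]
nearby = places_within(poi, hotels, radius_km=1.0)
print(nearby)  # prints ['Hotel A']
```

The same filter, combined with a second condition such as a transport connection, would cover the other use cases listed above.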
18. FOLLOW-UP
• We are ready to continue with follow-up actions.
• People from the Institute for Applied Informatics in Leipzig expressed interest.
• We will transform and link datasets from pilots of the FOODIE project, to demonstrate how farming data compliant with the FOODIE data model (which in turn is compliant with INSPIRE) can also be linked with these datasets. This work will be presented at the Linked Data Workshop in Agriculture in Berlin.