1-5 stars: Metadata on the Openness
Level of Open Data Sets in Europe
Sébastien Martin, Muriel Foulonneau, Slim Turki
Context & Objectives
• Level of reuse of open data is still disappointing.
• Development of open data requires better reusability of data.
• Degree of openness is a key success factor.
• Catalogues listing data have a crucial role.

Analyse the PublicData.eu catalogue to:
(i) assess the quality of a sample of metadata properties that are critical to enabling data reuse;
(ii) study the stated level of data openness.

21/11/2013

PublicData.eu
• Many local and national portals provide access to public sector open datasets: 114 EU catalogues are listed on datacatalogs.org.
• Gather datasets across geographic and institutional boundaries.

PublicData.eu
• Pan-European catalogue launched under the FP7 LOD2 project.
• Aggregates data from CKAN open data catalogues all over Europe.
• Collects data from 26 sources.
• First to be published in Europe, in 2011.
• Includes data from beyond the European Union, e.g., Serbian datasets.
• Not exhaustive, but it represents a unique aggregation of European datasets.

• 17,027 datasets
• UK: largest provider

Methodology
Descriptions of datasets collected in May 2013.
236 distinct dataset properties identified, partially due to:
• linguistic diversity: some providers adapt property names to their language;
• inconsistent naming (upper / lower case, spaces / underscores for a single field).

Major challenge: understanding the content of the PublicData.eu catalogue.
Data collected and analysed to identify the information made available on data openness and reusability, in particular the licensing conditions and the data formats.
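The naming inconsistencies noted in the methodology (upper / lower case, spaces / underscores) could be reduced with a normalization step before distinct properties are counted. The following is a minimal sketch under stated assumptions, not the procedure used in the study: the function names and the sample property names are hypothetical, and the sketch does nothing about property names that providers translated into their own language.

```python
import re
from collections import defaultdict


def normalize_property_name(name: str) -> str:
    """Fold case and collapse runs of spaces/underscores into one underscore."""
    collapsed = re.sub(r"[\s_]+", "_", name.strip())
    return collapsed.strip("_").lower()


def group_property_names(names):
    """Map each normalized key to the sorted raw spellings observed for it."""
    groups = defaultdict(set)
    for raw in names:
        groups[normalize_property_name(raw)].add(raw)
    return {key: sorted(variants) for key, variants in groups.items()}


# Hypothetical raw property names, invented purely for illustration.
sample = ["Content_Type", "content type", "content_type", "License", "license "]
grouped = group_property_names(sample)

for key, variants in grouped.items():
    print(key, "<-", variants)
```

Grouping the raw spellings, rather than overwriting them, keeps the original values available for checking that two merged names really denote the same field.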

Tim Berners-Lee’s evaluation scale

★ Available on the web (whatever format) but with an open license, to be Open Data
★★ Available as machine-readable structured data
★★★ 2 + non-proprietary format
★★★★ 3 + Use open standards from W3C (RDF and SPARQL) to identify things
★★★★★ 4 + Link your data to other people’s data to provide context
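Because each level of the scale builds on the one below it, a rating can be read as the number of consecutive criteria a dataset meets, starting with the open license. The sketch below illustrates that cumulative reading only; the function and parameter names are illustrative assumptions and do not come from the presentation.

```python
def star_level(open_license: bool,
               machine_readable: bool = False,
               non_proprietary: bool = False,
               w3c_standards: bool = False,
               linked: bool = False) -> int:
    """Return 0-5: the count of consecutive criteria met from the bottom up."""
    criteria = [open_license, machine_readable, non_proprietary,
                w3c_standards, linked]
    stars = 0
    for met in criteria:
        if not met:
            break  # a missing lower level caps the rating
        stars += 1
    return stars


# An openly licensed spreadsheet in a proprietary format stops at two stars.
print(star_level(open_license=True, machine_readable=True))

# Without an open license, no star is earned whatever the format.
print(star_level(open_license=False, machine_readable=True, non_proprietary=True))
```

Stopping at the first unmet criterion reflects the "2 +", "3 +", "4 +" wording of the scale: a higher level cannot be claimed while a lower one is missing.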

★ Data Licences
13,535 / 17,027 datasets have at least one license indication.
12,470 datasets can be considered as having some form of open license → 73.24%.
769 datasets have a Creative Commons license.
A significant number of datasets have a national license:
• apie v2, used to publish information created by French public authorities;
• UK-crown, which “covers material created by civil servants, ministers and government departments and agencies” in the UK;
• UK Open Government License.

128 datasets have an explicitly closed license.
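The shares behind these counts can be checked with a few lines of arithmetic. The sketch below uses only the figures reported on this slide; the variable and function names are illustrative, and the "no license indication" figure is simply the remainder of the total, not a number stated in the presentation.

```python
TOTAL_DATASETS = 17027  # size of the catalogue sample reported above

# Counts as reported on the slide.
license_counts = {
    "at least one license indication": 13535,
    "some form of open license": 12470,
    "Creative Commons license": 769,
    "explicitly closed license": 128,
}


def share(count: int, total: int = TOTAL_DATASETS) -> float:
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100 * count / total, 2)


shares = {label: share(count) for label, count in license_counts.items()}

# Datasets without any license indication are the remainder of the total.
unlicensed = TOTAL_DATASETS - license_counts["at least one license indication"]

for label, pct in shares.items():
    print(f"{label}: {pct}%")
print(f"no license indication: {unlicensed} datasets ({share(unlicensed)}%)")
```

The open-license share comes out at 73.24%, matching the figure on the slide.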

★★ Machine readable format
• Facilitates data reusability
• 4,051 / 17,027 datasets with a content_TYPE property
• 11,285 with at least one indication about format
• 56 datasets in RDF
• Spreadsheet-type formats dominate

Figure: Distribution of formats

40% not in a machine readable format
34% of datasets available in a machine readable format
→ machine readability is a condition for openness levels of 2★ and above
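Format values in catalogue metadata are free text, so checking the 2★ condition means mapping each value onto a recognizable format first. The sketch below shows one rough way to do that. The two format sets are assumptions chosen for illustration, not the classification used in the study, and any value with no recognized token is reported as unknown rather than guessed.

```python
import re

# Illustrative assumptions only: a real classification needs a fuller list.
MACHINE_READABLE = {"csv", "json", "xml", "rdf", "xls", "xlsx", "kml"}
NOT_MACHINE_READABLE = {"pdf", "doc", "docx", "jpg", "png"}


def classify_format(value: str) -> str:
    """Classify a free-text format value by the tokens it contains."""
    tokens = set(re.findall(r"[a-z0-9]+", value.lower()))
    if tokens & MACHINE_READABLE:
        return "machine-readable"
    if tokens & NOT_MACHINE_READABLE:
        return "not machine-readable"
    return "unknown"


# The first value is the format string quoted in the presentation;
# the others are hypothetical examples.
for value in ["linked data api, rdf json", "text/csv", "PDF", "various"]:
    print(repr(value), "->", classify_format(value))
```

Keeping an explicit "unknown" outcome matters here, since the presentation reports that many property values were too vague to classify.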

★★★ Use of non-proprietary formats
Creates ambiguities, as the openness of formats can be debated in some cases:
• Certain formats are proprietary but their specifications are open.
• Some formats were open at a certain point in time, but additions and further evolutions remain proprietary.

In many cases, the value of the property was too vague to determine whether or not the format was proprietary.
It was possible to identify:
• a non-proprietary format for 49% of the datasets;
• a proprietary format for 21%.

Use of proprietary formats is a critical issue for improving the level of openness of datasets.

★★★★ Use of open standards from W3C
Including HTML, XML, and RDF in particular.
• XML-based formats may be entirely independent from W3C (e.g. KML).

Availability in W3C standards: 9.5% of datasets
Availability in XML-based formats: 10%

Information remains unknown in most cases.

★★★★★ Linked data
Linked data is mentioned in the description of only a single dataset (Brandweer Amsterdam-Amstelland Uitrukberichten), whose format is described as “linked data api, rdf json”.
58 datasets mention RDF (or RDFa) as a format or content type, i.e., 0.34%.

Level of openness (1/2)
6,891 / 17,027 datasets show at least one piece of information about their degree of openness.
All come from Data.gov.uk (8,689 datasets).
For a majority of datasets, the level of openness is unknown.
• Consistent with the lack of licensing information, without which it is impossible to conclude on even a ★ openness level.

Figure: Distribution of openness levels in UK datasets

Level of openness (2/2)
Approximate level of openness derived from licensing and format properties:
• 73.24% of the datasets should have ★ or above.
• Assigning 5★ should take linkages into consideration, which cannot be inferred from dataset metadata.

Figure: Level of openness according to format- and license-related properties

Data openness is mainly related to the first level of compliance: the licensing issue.
• Data providers have clearly not focused on publishing data in reusable formats.
Conclusion
• Limited openness of datasets advertised as open data
• Heterogeneity of associated metadata
→ Difficulty for reusers to (i) discover datasets, despite the creation of large catalogues of datasets, and (ii) effectively reuse machine readable and contextualized data.
★ may be sufficient to ensure transparency of government action, but facilitating reuse of data through services is not served below 2★.
Confirmed risks regarding major challenges that data providers have to face: (i) the language barrier and (ii) the lack of consistency of metadata.
Harmonization of practices, training and tools are necessary to ensure that datasets are available in relevant formats.

Contact: muriel.foulonneau@tudor.lu

More Related Content

What's hot

On chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsOn chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurements
Nina Jeliazkova
 
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
Pedro Príncipe
 
Euro lipids 2014_graz
Euro lipids 2014_grazEuro lipids 2014_graz
Euro lipids 2014_graz
Juan Antonio Vizcaino
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
Chiara Del Vescovo
 
Information Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampInformation Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative Camp
Wim Peters
 
Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...
Aksw Group
 
Krakow2010
Krakow2010Krakow2010
Krakow2010
Karel Charvat
 
Tonex's Link 16 Operational Overview Training
Tonex's Link 16 Operational Overview TrainingTonex's Link 16 Operational Overview Training
Tonex's Link 16 Operational Overview Training
Tonex
 
Automatics and Remote Control
Automatics and Remote ControlAutomatics and Remote Control
Automatics and Remote ControlVisionary_
 
Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
 Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
Lviv Data Science Summer School
 

What's hot (10)

On chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsOn chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurements
 
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
OpenAIRE infrastructure presentation at the Semantic Services in EOSC worksho...
 
Euro lipids 2014_graz
Euro lipids 2014_grazEuro lipids 2014_graz
Euro lipids 2014_graz
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
 
Information Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative CampInformation Extraction in the TalkOfEurope Creative Camp
Information Extraction in the TalkOfEurope Creative Camp
 
Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...
 
Krakow2010
Krakow2010Krakow2010
Krakow2010
 
Tonex's Link 16 Operational Overview Training
Tonex's Link 16 Operational Overview TrainingTonex's Link 16 Operational Overview Training
Tonex's Link 16 Operational Overview Training
 
Automatics and Remote Control
Automatics and Remote ControlAutomatics and Remote Control
Automatics and Remote Control
 
Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
 Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
Master defence 2020 - Kateryna Liubonko - Matching Red Links to Wikidata Items
 

Viewers also liked

The star system
The star systemThe star system
The star system
Naamah Hill
 
How to resize facebook photos using pic monkey
How to resize facebook photos using pic monkeyHow to resize facebook photos using pic monkey
How to resize facebook photos using pic monkeysweetaunzo
 
COURRIER CAB 31 MD
COURRIER CAB 31 MDCOURRIER CAB 31 MD
COURRIER CAB 31 MDComPol
 
Google glass
Google glassGoogle glass
Google glasscolegioyo
 
Геомаркетинг Геомаркетинговые исследования
Геомаркетинг Геомаркетинговые исследованияГеомаркетинг Геомаркетинговые исследования
Геомаркетинг Геомаркетинговые исследования
geo-marketing
 
Asat book0-fresh blood
Asat book0-fresh bloodAsat book0-fresh blood
Asat book0-fresh blood
Ashraf Ali
 
Less is More
Less is MoreLess is More
Less is More
Robertas Jucaitis
 
Carta de Oneida Pinto A El Espectador
Carta de Oneida Pinto A El EspectadorCarta de Oneida Pinto A El Espectador
Carta de Oneida Pinto A El Espectador
PrensaOneidaPinto
 
Aulbrey Meade - Surgical Tech RESUME
Aulbrey Meade - Surgical Tech RESUMEAulbrey Meade - Surgical Tech RESUME
Aulbrey Meade - Surgical Tech RESUMEAulbrey Meade
 
Buruketak 3.1.
Buruketak 3.1.Buruketak 3.1.
Buruketak 3.1.
auldreikie
 
New Barco ClickShare CMS-1
New Barco ClickShare CMS-1New Barco ClickShare CMS-1
New Barco ClickShare CMS-1
Paul Richards
 
Making the cut - Roberta Lucca, Bossa
Making the cut - Roberta Lucca, BossaMaking the cut - Roberta Lucca, Bossa
Making the cut - Roberta Lucca, Bossa
LondonGamesConference
 
Sistemas de equações de 1º grau - Como fazer + exercicios
Sistemas de equações de 1º grau - Como fazer + exerciciosSistemas de equações de 1º grau - Como fazer + exercicios
Sistemas de equações de 1º grau - Como fazer + exercicios
Ana Tapadinhas
 
Гаражи, Чернигов , ул. Пушкина
Гаражи, Чернигов , ул. ПушкинаГаражи, Чернигов , ул. Пушкина
Гаражи, Чернигов , ул. Пушкина
Alexander Gashpar
 
Megacoderit
MegacoderitMegacoderit
Megacoderit
PEDRO LEGORRETA
 

Viewers also liked (20)

The star system
The star systemThe star system
The star system
 
How to resize facebook photos using pic monkey
How to resize facebook photos using pic monkeyHow to resize facebook photos using pic monkey
How to resize facebook photos using pic monkey
 
COURRIER CAB 31 MD
COURRIER CAB 31 MDCOURRIER CAB 31 MD
COURRIER CAB 31 MD
 
Google glass
Google glassGoogle glass
Google glass
 
VTSP 5.5
VTSP 5.5VTSP 5.5
VTSP 5.5
 
Геомаркетинг Геомаркетинговые исследования
Геомаркетинг Геомаркетинговые исследованияГеомаркетинг Геомаркетинговые исследования
Геомаркетинг Геомаркетинговые исследования
 
Asat book0-fresh blood
Asat book0-fresh bloodAsat book0-fresh blood
Asat book0-fresh blood
 
Less is More
Less is MoreLess is More
Less is More
 
Carta de Oneida Pinto A El Espectador
Carta de Oneida Pinto A El EspectadorCarta de Oneida Pinto A El Espectador
Carta de Oneida Pinto A El Espectador
 
Presentación1
Presentación1Presentación1
Presentación1
 
Calendario escolar
Calendario escolarCalendario escolar
Calendario escolar
 
Aulbrey Meade - Surgical Tech RESUME
Aulbrey Meade - Surgical Tech RESUMEAulbrey Meade - Surgical Tech RESUME
Aulbrey Meade - Surgical Tech RESUME
 
Buruketak 3.1.
Buruketak 3.1.Buruketak 3.1.
Buruketak 3.1.
 
New Barco ClickShare CMS-1
New Barco ClickShare CMS-1New Barco ClickShare CMS-1
New Barco ClickShare CMS-1
 
Making the cut - Roberta Lucca, Bossa
Making the cut - Roberta Lucca, BossaMaking the cut - Roberta Lucca, Bossa
Making the cut - Roberta Lucca, Bossa
 
Sistemas de equações de 1º grau - Como fazer + exercicios
Sistemas de equações de 1º grau - Como fazer + exerciciosSistemas de equações de 1º grau - Como fazer + exercicios
Sistemas de equações de 1º grau - Como fazer + exercicios
 
Гаражи, Чернигов , ул. Пушкина
Гаражи, Чернигов , ул. ПушкинаГаражи, Чернигов , ул. Пушкина
Гаражи, Чернигов , ул. Пушкина
 
Gravitation
GravitationGravitation
Gravitation
 
Mechanics 2
Mechanics 2Mechanics 2
Mechanics 2
 
Megacoderit
MegacoderitMegacoderit
Megacoderit
 

Similar to 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe

OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
Open Science Fair
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutions
Open Data Support
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataOpen City Foundation
 
Webinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management PlanningWebinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management Planning
AIMS (Agricultural Information Management Standards)
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
IMC Technologies
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE
 
Industry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraftIndustry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraft
RuleML
 
How we can understand the world through open data
How we can understand the world through open dataHow we can understand the world through open data
How we can understand the world through open dataMarie Gustafsson Friberger
 
DatalEt-Ecosystem Provider - The DEEP project
DatalEt-Ecosystem Provider - The DEEP projectDatalEt-Ecosystem Provider - The DEEP project
DatalEt-Ecosystem Provider - The DEEP project
Danube University Krems, Centre for E-Governance
 
Fair data vs 5 star open data final
Fair data vs 5 star open data finalFair data vs 5 star open data final
Fair data vs 5 star open data final
Syed Muhammad Ali Hasnain
 
CARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practice
CARARE
 
OSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data setsOSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data sets
Open Science Fair
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the Netherlands
Jisc RDM
 
Can new technologies and digitalization improve infrastructure governance? - ...
Can new technologies and digitalization improve infrastructure governance? - ...Can new technologies and digitalization improve infrastructure governance? - ...
Can new technologies and digitalization improve infrastructure governance? - ...
OECD Governance
 
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
Exposing EO Linked (meta-)Data from OpenSearch CatalogueExposing EO Linked (meta-)Data from OpenSearch Catalogue
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
Raul Palma
 
Exploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sourcesExploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sources
Laura Po
 
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
OpenAIRE
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
Research Data Alliance
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
Research Data Alliance
 
Data-as-a-Service: DataGraft
Data-as-a-Service: DataGraftData-as-a-Service: DataGraft
Data-as-a-Service: DataGraft
dapaasproject
 

Similar to 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe (20)

OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutions
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
 
Webinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management PlanningWebinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management Planning
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
Industry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraftIndustry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraft
 
How we can understand the world through open data
How we can understand the world through open dataHow we can understand the world through open data
How we can understand the world through open data
 
DatalEt-Ecosystem Provider - The DEEP project
DatalEt-Ecosystem Provider - The DEEP projectDatalEt-Ecosystem Provider - The DEEP project
DatalEt-Ecosystem Provider - The DEEP project
 
Fair data vs 5 star open data final
Fair data vs 5 star open data finalFair data vs 5 star open data final
Fair data vs 5 star open data final
 
CARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practice
 
OSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data setsOSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data sets
 
Data sharing in the Netherlands
Data sharing in the NetherlandsData sharing in the Netherlands
Data sharing in the Netherlands
 
Can new technologies and digitalization improve infrastructure governance? - ...
Can new technologies and digitalization improve infrastructure governance? - ...Can new technologies and digitalization improve infrastructure governance? - ...
Can new technologies and digitalization improve infrastructure governance? - ...
 
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
Exposing EO Linked (meta-)Data from OpenSearch CatalogueExposing EO Linked (meta-)Data from OpenSearch Catalogue
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
 
Exploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sourcesExploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sources
 
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
Data-as-a-Service: DataGraft
Data-as-a-Service: DataGraftData-as-a-Service: DataGraft
Data-as-a-Service: DataGraft
 

More from Slim Turki, Dr.

Local Digital Twins Conversations: Framing the Green + Digital Transition
Local Digital Twins Conversations:  Framing the Green + Digital TransitionLocal Digital Twins Conversations:  Framing the Green + Digital Transition
Local Digital Twins Conversations: Framing the Green + Digital Transition
Slim Turki, Dr.
 
Data ecosystems: turning data into public value
Data ecosystems:  turning data into public valueData ecosystems:  turning data into public value
Data ecosystems: turning data into public value
Slim Turki, Dr.
 
#opendata Back to the future
#opendata Back to the future#opendata Back to the future
#opendata Back to the future
Slim Turki, Dr.
 
Data Ecosystems for Geospatial Data
Data Ecosystems for Geospatial DataData Ecosystems for Geospatial Data
Data Ecosystems for Geospatial Data
Slim Turki, Dr.
 
Open Data in Disaster Management
Open Data in Disaster ManagementOpen Data in Disaster Management
Open Data in Disaster Management
Slim Turki, Dr.
 
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open DataBE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
Slim Turki, Dr.
 
How open data ecosystems are stimulated?
How open data ecosystems are stimulated?How open data ecosystems are stimulated?
How open data ecosystems are stimulated?
Slim Turki, Dr.
 
BE-GOOD Challenges - factsheet 2017-06
BE-GOOD Challenges - factsheet 2017-06BE-GOOD Challenges - factsheet 2017-06
BE-GOOD Challenges - factsheet 2017-06
Slim Turki, Dr.
 
Service innovation: the hidden value of open data
Service innovation: the hidden value of open dataService innovation: the hidden value of open data
Service innovation: the hidden value of open data
Slim Turki, Dr.
 
From open data to data-driven services
From open data to data-driven servicesFrom open data to data-driven services
From open data to data-driven services
Slim Turki, Dr.
 
How open data are turned into services?
How open data are turned into services?How open data are turned into services?
How open data are turned into services?
Slim Turki, Dr.
 
SPOCS: A semantic interoperability layer to support the implementation of the...
SPOCS: A semantic interoperability layer to support the implementation of the...SPOCS: A semantic interoperability layer to support the implementation of the...
SPOCS: A semantic interoperability layer to support the implementation of the...
Slim Turki, Dr.
 
Open Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and OpportunitiesOpen Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and Opportunities
Slim Turki, Dr.
 
Luxembourg Service Jam 2013 - Guide book
Luxembourg Service Jam 2013 - Guide bookLuxembourg Service Jam 2013 - Guide book
Luxembourg Service Jam 2013 - Guide book
Slim Turki, Dr.
 
Luxembourg Service Jam 2012 - Guide book
Luxembourg Service Jam 2012 - Guide bookLuxembourg Service Jam 2012 - Guide book
Luxembourg Service Jam 2012 - Guide book
Slim Turki, Dr.
 
Global Service Jam - Luxembourg spot
Global Service Jam - Luxembourg spotGlobal Service Jam - Luxembourg spot
Global Service Jam - Luxembourg spot
Slim Turki, Dr.
 
Legora@IESS1.0
Legora@IESS1.0Legora@IESS1.0
Legora@IESS1.0
Slim Turki, Dr.
 
Compliance In e-government Service Engineering
Compliance In e-government Service EngineeringCompliance In e-government Service Engineering
Compliance In e-government Service Engineering
Slim Turki, Dr.
 

More from Slim Turki, Dr. (18)

Local Digital Twins Conversations: Framing the Green + Digital Transition
Local Digital Twins Conversations:  Framing the Green + Digital TransitionLocal Digital Twins Conversations:  Framing the Green + Digital Transition
Local Digital Twins Conversations: Framing the Green + Digital Transition
 
Data ecosystems: turning data into public value
Data ecosystems:  turning data into public valueData ecosystems:  turning data into public value
Data ecosystems: turning data into public value
 
#opendata Back to the future
#opendata Back to the future#opendata Back to the future
#opendata Back to the future
 
Data Ecosystems for Geospatial Data
Data Ecosystems for Geospatial DataData Ecosystems for Geospatial Data
Data Ecosystems for Geospatial Data
 
Open Data in Disaster Management
Open Data in Disaster ManagementOpen Data in Disaster Management
Open Data in Disaster Management
 
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open DataBE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
BE-GOOD: Building an Ecosystem to Generate Opportunities in Open Data
 
How open data ecosystems are stimulated?
How open data ecosystems are stimulated?How open data ecosystems are stimulated?
How open data ecosystems are stimulated?
 
BE-GOOD Challenges - factsheet 2017-06
BE-GOOD Challenges - factsheet 2017-06BE-GOOD Challenges - factsheet 2017-06
BE-GOOD Challenges - factsheet 2017-06
 
Service innovation: the hidden value of open data
Service innovation: the hidden value of open dataService innovation: the hidden value of open data
Service innovation: the hidden value of open data
 
From open data to data-driven services
From open data to data-driven servicesFrom open data to data-driven services
From open data to data-driven services
 
How open data are turned into services?
How open data are turned into services?How open data are turned into services?
How open data are turned into services?
 
SPOCS: A semantic interoperability layer to support the implementation of the...
SPOCS: A semantic interoperability layer to support the implementation of the...SPOCS: A semantic interoperability layer to support the implementation of the...
SPOCS: A semantic interoperability layer to support the implementation of the...
 
Open Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and OpportunitiesOpen Data: Barriers, Risks, and Opportunities
Open Data: Barriers, Risks, and Opportunities
 
Luxembourg Service Jam 2013 - Guide book
Luxembourg Service Jam 2013 - Guide bookLuxembourg Service Jam 2013 - Guide book
Luxembourg Service Jam 2013 - Guide book
 
Luxembourg Service Jam 2012 - Guide book
Luxembourg Service Jam 2012 - Guide bookLuxembourg Service Jam 2012 - Guide book
Luxembourg Service Jam 2012 - Guide book
 
Global Service Jam - Luxembourg spot
Global Service Jam - Luxembourg spotGlobal Service Jam - Luxembourg spot
Global Service Jam - Luxembourg spot
 
Legora@IESS1.0
Legora@IESS1.0Legora@IESS1.0
Legora@IESS1.0
 
Compliance In e-government Service Engineering
Compliance In e-government Service EngineeringCompliance In e-government Service Engineering
Compliance In e-government Service Engineering
 

1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe

  • 1. 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe Sébastien Martin, Muriel Foulonneau, Slim Turki
  • 2. Context & Objectives: The level of reuse of open data is still disappointing. Developing open data requires better reusability of data. The degree of openness is a key success factor. Catalogues listing data have a crucial role. Objective: analyse the PublicData.eu catalogue to (i) identify the quality of a sample of metadata properties that are critical to enabling data reuse, and (ii) study the stated level of data openness. Slide footer: 21/11/2013.
  • 3. PublicData.eu: Many local and national portals provide access to public sector open datasets (114 EU catalogues on datacatalogs.org); the goal is to gather datasets across geographic and institutional boundaries. PublicData.eu is a pan-European catalogue launched under the FP7 LOD2 project. It aggregates data from CKAN open data catalogues all over Europe; collects data from 26 sources; was the first to be published in Europe, in 2011; includes data from beyond the European Union, e.g., Serbian datasets; and, although not exhaustive, represents a unique aggregation of European datasets. It lists 17,027 datasets, with the UK as the largest provider.
  • 4. Methodology: Descriptions of datasets were collected in May 2013. 236 distinct dataset properties were identified, partially due to (a) linguistic diversity, as some providers adapt property names to their language, and (b) inconsistent naming (upper / lower case, spaces / underscores for a single field). Understanding the content of PublicData.eu is therefore a major challenge. The data were collected and analysed to identify the information made available on data openness and reusability, in particular the licensing conditions and the data formats.
  • 5. Tim Berners-Lee’s evaluation scale: ★ Available on the web (whatever format) but with an open license, to be Open Data; ★★ Available as machine-readable structured data; ★★★ as ★★, plus a non-proprietary format; ★★★★ as ★★★, plus use of open standards from W3C (RDF and SPARQL) to identify things; ★★★★★ as ★★★★, plus linking your data to other people’s data to provide context.
  • 6. ★ Data Licences: 13,535 / 17,027 datasets have at least one license indication. 12,470 datasets can be considered to have some form of open license, i.e., 73.24%. 769 datasets have a Creative Commons license. A significant number of datasets have a national license: apie v2, used to publish information created by French public authorities; UK-crown, which “covers material created by civil servants, ministers and government departments and agencies” in the UK; and the UK Open Government License. 128 datasets have an explicitly closed license.
  • 7. ★★ Machine readable format: Facilitates data reusability. 4,051 / 17,027 datasets have a content_TYPE property, and 11,285 have at least one indication about format. 56 datasets are in RDF. Spreadsheet-type formats dominate. Distribution of formats: 40% of datasets are not in a machine readable format; 34% are available in a machine readable format. Machine readability is a condition for openness levels of 2★ and above.
  • 8. ★★★ Use of non-proprietary formats: This criterion creates ambiguities, as the openness of a format can be debated in some cases: certain formats are proprietary but their specifications are open; some formats were open at a certain point in time but additions and further evolutions remain proprietary. In many cases, the value of the property was too vague to determine whether or not the format was proprietary. It was possible to identify a non-proprietary format for 49% of the datasets and a proprietary format for 21%. Use of proprietary formats is a critical issue for improving the level of openness of datasets.
  • 9. ★★★★ Use of open standards from W3C: Including HTML, XML, and RDF in particular. XML-based formats may be entirely independent from W3C (e.g. KML). Availability in W3C standards: 9.5% of datasets. Availability in XML-based formats: 10%. The information remains unknown in most cases.
  • 10. ★★★★★ Linked data: Linked data are mentioned in the description of only a single dataset (Brandweer Amsterdam-Amstelland Uitrukberichten), for which the format is described as “linked data api, rdf json”. 58 datasets mention RDF (or RDFa) as a format or content type, i.e., 0.34%.
  • 11. Level of openness (1/2): 6,891 / 17,027 datasets show at least one piece of information about their degree of openness. All of them come from Data.gov.uk (8,689 datasets). For a majority of datasets, the level of openness is unknown. This is coherent with the lack of licensing information, without which it is impossible to conclude on even the ★ openness level. [Chart: Distribution of openness levels in UK datasets]
  • 12. Level of openness (2/2): An approximate level of openness was derived from licensing and format properties. 73.24% of the datasets should have ★ or above. A reference to 5★ should take linkages into consideration, which cannot be inferred from dataset metadata. Data openness is mainly related to the first level of compliance: the licensing issue. Data providers have clearly not focused on publishing data in reusable formats. [Chart: Level of openness according to Format and License related properties]
  • 13. Conclusion: Limited openness of datasets advertised as open data, and heterogeneity of the associated metadata. As a result, reusers have difficulty (i) discovering datasets, despite the creation of large catalogues of datasets, and (ii) effectively reusing machine readable and contextualized data. ★ may be sufficient to ensure transparency of government action, but facilitating the reuse of data through services is not served below 2★. The study confirmed risks regarding major challenges that data providers have to face: (i) the language barrier and (ii) the lack of consistency of metadata. Harmonization of practices, training and tools are necessary to ensure that datasets are available in relevant formats.
  • 14. 1-5 stars: Metadata on the Openness Level of Open Data Sets in Europe Sébastien Martin, Muriel Foulonneau, Slim Turki Contact: muriel.foulonneau@tudor.lu
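
The approximation described on slides 5 and 12 (a star level derived from license and format metadata) could be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: the function name `approximate_stars` and the three format sets are assumptions made here, and the study itself notes that many format values were too vague to classify. The result is capped at four stars because, as slide 12 states, linkage cannot be inferred from dataset metadata.

```python
from typing import Optional

# Illustrative format sets (assumptions, not the classification used in the study).
MACHINE_READABLE = {"csv", "xls", "xlsx", "xml", "json", "rdf", "kml"}
NON_PROPRIETARY = {"csv", "xml", "json", "rdf", "kml"}
W3C_FORMATS = {"xml", "rdf"}


def approximate_stars(has_open_license: bool, data_format: Optional[str]) -> int:
    """Return an approximate openness level between 0 and 4.

    Each level requires the previous one, following the cumulative
    definition of the five-star scale. Five stars is never returned,
    since links to other data are not visible in catalogue metadata.
    """
    if not has_open_license:
        return 0  # without an open license, even one star cannot be claimed
    stars = 1
    fmt = (data_format or "").strip().lower()
    if fmt in MACHINE_READABLE:
        stars = 2
        if fmt in NON_PROPRIETARY:
            stars = 3
            if fmt in W3C_FORMATS:
                stars = 4
    return stars
```

For instance, under these assumed sets an openly licensed spreadsheet in `xls` stops at two stars, while the same data in `csv` reaches three.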

Editor's Notes

  1. The study uses Tim Berners-Lee’s five-star evaluation scale.
  2. The one star openness level depends upon data licenses. Licensing information can be found in 10 distinct metadata properties, i.e., licence, License, licence_url, License_details, License_ID, License_summary, License_title, License_uri, License_url, and mandate.
  3. The two star level depends upon the format in which the data is made available.
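
Since licensing information is spread over ten property names that differ in case and spelling, a reuser would probably need to normalize property names before looking for a license. The sketch below shows one possible way to do that, assuming each dataset description is available as a plain dictionary. The helper names `normalize_key` and `license_values` are hypothetical, and the normalization rule (trim, lower-case, whitespace to underscores) is an assumption based on the naming problems listed on slide 4.

```python
# The ten license-related properties from note 2, lower-cased.
LICENSE_PROPERTIES = {
    "licence", "license", "licence_url", "license_details", "license_id",
    "license_summary", "license_title", "license_uri", "license_url", "mandate",
}


def normalize_key(key: str) -> str:
    """Trim, lower-case, and turn runs of whitespace into single underscores."""
    return "_".join(key.strip().lower().split())


def license_values(record: dict) -> dict:
    """Collect the non-empty license-related properties of one dataset record."""
    found = {}
    for key, value in record.items():
        norm = normalize_key(key)
        if norm in LICENSE_PROPERTIES and value not in (None, ""):
            found[norm] = value
    return found
```

A record such as `{"License ID": "uk-ogl", "Title": "x"}` would then yield `{"license_id": "uk-ogl"}`; a dataset for which the result is empty has no license indication at all.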