Open statistics Belgium

Open Knowledge Belgium
Open Statistics
Open Belgium 6 March 2017
Statistics Belgium
Lucia Decuyper
Youri Baeyens
Open Statistics – Agenda
 Statistics Belgium => Open Data
• Statistics Belgium
• Open Data Start
• Statbel Open Data Portal
• Statistics Belgium in the EU
 Open Data => Linked Open Data
• 5*****?
• RDF
• LOD
• Semantic Web
• Ontologies for statisticians
• LOD in the NSIs
• RDF@statbel
 Questions
 Contact
Open Statistics – Statistics Belgium 1
 Statistics Belgium ?
– National Statistical Institute (formerly NIS)
• largest producer of official statistics in Belgium
 What do we do?
– Collect data: administrative sources (registers) or surveys
– Process and analyse data:
• common methodology, definitions (national, European)
– Publish data
• => +/- 400 releases per year on Statbel
Open Statistics – Statistics Belgium 2
 One of the core tasks consists in making all produced statistics
available to everyone (European Statistics Code of Practice)
– Website Statbel since 1997
– Free (re-)use, provided the source is cited
– ‘open by default’
 +/-100 statistics
– The main fields covered are population, society, work,
economy, real estate, construction, mobility and transport.
– Census
Open Statistics – Open Data Start?
 Why?
• 2nd PSI Directive
• Belgian Federal Open Data strategy 2015
• Digital agenda (EU)
• Eurostat => EU Open Data Portal
• Crossroads Bank for Enterprises (KBO) company register
• Users
 Benefits
Open Statistics – Statbel Open Data Portal 1
 Open Data Portal on the Statbel website since Q4 2015 : www.statbel.fgov.be/opendata
– Population & Census
– Labour market & living conditions
• Fiscal statistics on income
– Environment
– Prices
• CPI
– Tools
• Geography
• Codes and Classifications
Open Statistics – Statbel Open Data Portal 2
 +-/ 110 datasets
 Formats
• XLSX  Excel  Pivot tables
• CSV, TXT  R, SAS, …, PostgreSQL,
• GML, SHP  QGIS, ArcGIS, … ,
• Json, XML, CSV, XLSX be.STAT=> dynamic databank of Statbel
 Special care
– Privacy
– Continuity
 Goal : 1 new dataset/month
– Next : population, households, real estate
Open Statistics – Statistics Belgium in the EU
 European Statistical System = Eurostat + NSIs
– Key provider of public open data
– Draft Open Data Strategy (Feb 2017)
Statistics Belgium
• Statbel.fgov.be/opendata
Eurostat
• Key contributor to the open data portals
EU Open Data Portal
• Data.europa.eu/euodp
Belgium
• Data.gov.be
European Data Portal
• www.europeandataportal.eu
[Diagram: the portals are connected through metadata harvesting]
Open Statistics – 5***** ?
 Statistics Belgium => Open Data
Statbel: current situation
Statbel: Ambition
Open Statistics – RDF
Resource description framework (RDF)
Open Statistics – RDF - Uniform resource identifier URI
 Use URIs to identify things, so that people can point at your
stuff
– A URI identifies a concept.
– Example of a URI for the municipality of Rixensart:
http://vocab.belgif.be/refnis/25091#id
– In general, a URI is associated with a web page that documents the
concept. For Rixensart:
http://vocab.belgif.be/refnis/25091
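The relation between the two addresses above can be sketched in a few lines of Python. The concept URI is the documentation address plus a `#id` fragment, and HTTP clients strip the fragment before making a request. This is only an illustration with the standard library; it does not check whether the server still answers.

```python
from urllib.parse import urldefrag

# URI of the concept "Rixensart", as given on the slide
concept_uri = "http://vocab.belgif.be/refnis/25091#id"

# urldefrag separates the part before '#' from the fragment after it
document_url, fragment = urldefrag(concept_uri)

print(document_url)  # address of the page that documents the concept
print(fragment)      # local identifier of the concept within that page
```

The fragment keeps the identifier of the thing (the municipality) distinct from the address of the page that describes it.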
Open Statistics – Resource description framework (RDF)
 RDF files store triples of the form “subject-predicate-object”
 In RDF files,
– subjects are URIs.
– predicates are URIs.
– objects are URIs or literals.
 Example (nomenclature):
<http://vocab.belgif.be/refnis/25091#id>
<http://www.w3.org/2004/02/skos/core#prefLabel> "Rixensart"@fr .
 There are "standard vocabularies" (rules for forming triples). SKOS is one
of them.
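As a rough sketch of the triple model, the example statement can be held as a three-field record in Python and written out as one N-Triples line. The `Triple` class and the `to_ntriples` helper are names invented for this illustration; they are not part of any RDF library, and the literal handling is deliberately naive.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str    # always a URI
    predicate: str  # always a URI
    obj: str        # a URI, or a literal already written as "text"@lang


def to_ntriples(t: Triple) -> str:
    """Write one triple as an N-Triples line."""
    # Literals start with a double quote; anything else is treated as a URI.
    obj = t.obj if t.obj.startswith('"') else f"<{t.obj}>"
    return f"<{t.subject}> <{t.predicate}> {obj} ."


label = Triple(
    "http://vocab.belgif.be/refnis/25091#id",
    "http://www.w3.org/2004/02/skos/core#prefLabel",
    '"Rixensart"@fr',
)
line = to_ntriples(label)
print(line)
```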
Open Statistics – Resource description framework (RDF)
 It’s possible to use "prefixes" to "abbreviate" URIs in RDF files
 Example:
@prefix refnis: <http://vocab.belgif.be/refnis/> .
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
refnis:25091\#id skos:prefLabel "Rixensart"@fr .
refnis:25091\#id skos:broader refnis:25000\#id .
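What a prefix does can be sketched as plain string concatenation: the part before the colon is looked up in a table and the rest is appended. The `expand` function below is a hypothetical helper for illustration only; a real Turtle parser also handles escaping and validation, which this sketch ignores.

```python
# Prefix table taken from the example above
PREFIXES = {
    "refnis": "http://vocab.belgif.be/refnis/",
    "skos": "http://www.w3.org/2004/02/skos/core#",
}


def expand(name: str, prefixes: dict[str, str] = PREFIXES) -> str:
    """Turn an abbreviated name such as 'skos:prefLabel' into a full URI."""
    prefix, local = name.split(":", 1)  # split at the first colon only
    return prefixes[prefix] + local


print(expand("refnis:25091#id"))
print(expand("skos:prefLabel"))
```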
Open Statistics – Resource description framework (RDF)
 Sample RDF file to describe a study (metadata):
– ddi:Study_1 a disco:Study .
– ddi:Study_1 dcterms:title "National Population and Housing Census, 1980"@en .
– ddi:Study_1 dcterms:identifier "ARG_1980_PHC_v01_A_IPUMS" .
 This description uses the DDI-RDF vocabulary (Disco):
– DDI-RDF is “A vocabulary for publishing metadata about data sets
(research and survey data) into the Web of Linked Data”
– Described here : http://rdf-vocabulary.ddialliance.org/discovery.html
Open Statistics – Resource description framework (RDF)
 RDF = forming triples
 There are several syntaxes to write them
– Turtle,
– N-Triples,
– RDF/XML,
– …
Open Statistics – Linked Open Data (LOD)
Linked open-data (LOD)
Open Statistics – Linked Open Data (LOD)
 It’s possible to link several RDF sources. This is referred to as Linked
Open Data (LOD).
 Examples of LOD sites to link to:
– DBpedia
– Wikidata
– GeoNames
 A simple way to link to another DB is to re-use its URIs
Open Statistics – Linked Open Data (LOD)
 Example of LOD (nomenclature):
@prefix refnis: <http://vocab.belgif.be/refnis/> .
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
refnis:25091\#id skos:prefLabel "Rixensart"@fr .
refnis:25091\#id skos:broader refnis:25000\#id .
refnis:25091\#id skos:exactMatch <http://sws.geonames.org/2787990> .
refnis:25091\#id skos:exactMatch <http://www.wikidata.org/entity/Q630478> .
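To show how such links can be used, the sketch below keeps the statements of the example as Python tuples and collects the external URIs declared as exact matches. The `external_matches` function is a hypothetical helper written for this illustration; it works on an in-memory list and does not fetch anything from GeoNames or Wikidata.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
RIXENSART = "http://vocab.belgif.be/refnis/25091#id"

# The four statements of the example, as (subject, predicate, object) tuples
triples = [
    (RIXENSART, SKOS + "prefLabel", '"Rixensart"@fr'),
    (RIXENSART, SKOS + "broader", "http://vocab.belgif.be/refnis/25000#id"),
    (RIXENSART, SKOS + "exactMatch", "http://sws.geonames.org/2787990"),
    (RIXENSART, SKOS + "exactMatch", "http://www.wikidata.org/entity/Q630478"),
]


def external_matches(subject, statements):
    """Return the URIs that the subject is linked to with skos:exactMatch."""
    return [o for s, p, o in statements
            if s == subject and p == SKOS + "exactMatch"]


matches = external_matches(RIXENSART, triples)
for uri in matches:
    print(uri)
```

A client that knows the REFNIS URI can thus discover the same municipality in other databases without any name matching.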
Open Statistics – Semantic web
Semantic web
Open Statistics – Semantic web
 All the "subject-predicate-object" statements of the different LOD sources
form a giant "knowledge graph" whose size increases rapidly
Open Statistics – Semantic web
Open Statistics – Ontologies for statisticians
Standard vocabularies
Open Statistics – Standard vocabularies
 Classifications
– SKOS: Classifications (nomenclatures)
– XKOS: SKOS extension (for NACE, …)
 Document a list of files (catalog)
– DCAT
– StatDCAT-AP
– GeoDCAT-AP
Open Statistics – Standard vocabularies
 Metadata:
– Dublin Core
– DDI-RDF
 Data:
– RDF Data Cube Vocabulary
Open Statistics – Standard vocabularies
 Other interesting vocabularies recommended by Eurostat
– The Organization Ontology
– The PROV ontology
– Time Ontology in OWL
– Dublin Core
– ISA Core Vocabularies in RDF (Person, Public Organisation,
Business, Public Service, Location)
– Vocabulary of Interlinked Datasets (VoID)
Open Statistics – Nomenclatures
 Some nomenclatures, "controlled vocabularies" & thesauri
recommended by Eurostat:
– INSPIRE code lists
– EuroVoc thesaurus
– Named Authority Lists (NAL)
Open Statistics – LOD IN THE NSIs
 Some NSIs already have LOD:
– Insee: Some code tables + legal population
– Istat
– ONS + Geoportal UK
– Census 2011 in Ireland
Open Statistics – RDF@Statbel
 What to publish as LOD?
 Priorities for publication as LOD:
– Nomenclatures (create URIs for NACEBEL, REFNIS, … +
create files that expose hierarchies, …)
– Catalog of the data (to let the ‘machines’ all over the world
know that our datasets are available in csv, …)
– Metadata
– A selection of datasets (For example: legal population of
municipalities)
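The "catalog of the data" priority could look roughly like the sketch below, which builds a minimal DCAT description of one dataset with a CSV distribution and prints it as N-Triples. The two `example.org` URIs and the dataset title are placeholders assumed for this illustration, not identifiers published by Statbel; only the DCAT, Dublin Core and RDF terms are real vocabulary terms.

```python
DCAT = "http://www.w3.org/ns/dcat#"
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical identifiers, chosen only for this illustration
dataset = "http://example.org/dataset/legal-population"
distribution = "http://example.org/dataset/legal-population/csv"


def uri(value: str) -> str:
    """Write a URI in N-Triples form."""
    return f"<{value}>"


statements = [
    (uri(dataset), uri(RDF_TYPE), uri(DCAT + "Dataset")),
    (uri(dataset), uri(DCT + "title"), '"Legal population of municipalities"@en'),
    (uri(dataset), uri(DCAT + "distribution"), uri(distribution)),
    (uri(distribution), uri(RDF_TYPE), uri(DCAT + "Distribution")),
    # The media type tells a machine that this distribution is a CSV file
    (uri(distribution), uri(DCAT + "mediaType"),
     uri("https://www.iana.org/assignments/media-types/text/csv")),
]

catalog = "\n".join(f"{s} {p} {o} ." for s, p, o in statements)
print(catalog)
```

A harvester such as a national or European data portal would read statements of this kind to learn which datasets exist and in which formats they can be downloaded.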
Open Statistics – Questions
Open Statistics – Contact
 Check out our websites
 Explore our datasets
 Re-use our data
 and
 Contact us!
 For questions please contact :
statbel.opendata@economie.fgov.be
Lucia.Decuyper@economie.fgov.be
Youri.Baeyens@economie.fgov.be
 To find out more check:
http://statbel.fgov.be
https://bestat.statbel.fgov.be
http://statbel.fgov.be/opendata/
http://statbel.fgov.be/en/statistics/opendata/licence/
 Follow Statbel on Twitter
Open statistics Belgium

  • 1. Open Statistics Open Belgium 6 March 2017 Statistics Belgium Lucia Decuyper Youri Baeyesn
  • 2. Open Statistics – Agenda  Statistics Belgium => Open Data • Statistics Belgium • Open Data Start • Statbel Open Data Portal • Statistics Belgium in the EU  Open Data => Linked Open Data • 5*****? • RDF • LOD • Semantic Web • Ontologies for statisticians • LOD in the NSIs • RDF@statbel  Questions  Contact
  • 3. Open Statistics – Statistics Belgium 1  Statistics Belgium ? – National Statistical Institute (before NIS) • largest producer of official statistics in Belgium  What do we do? – Collect data: administrative sources (registers) or surveys – Process and analyse data: • common methodology, definitions (national, European) – Publish data • => +/- 400y releases on Statbel
  • 4. Open Statistics – Statistics Belgium 2  One of the core tasks consists in making all produced statistics available to everyone (European Statistics Code of Practice) – Website Statbel since 1997 – Free re-(use) => source – ‘open by default’  +/-100 statistics – The main fields covered are population, society, work, economy, real estate, construction, mobility and transport. – Census
  • 5. Open Statistics – Open Data Start?  Why? • 2nd PSI – directive • Belgian Federal Open Data strategy 2015 • Digital agenda (EU) • Eurostat => EU Open Data Portal • Crossroad Bank Enterprises (KBO) company register • Users  Benefits
  • 6. Open Statistics – Statbel Open Data Portal 1  Open Data Portal on the Statbel website since Q4 2015 : www.statbel.fgov.be/opendata – Population & Census – Labour market & living conditions • Fiscal statistics on income – Environment – Prices • CPI – Tools • Geography • Codes and Classifications
  • 7. Open Statistics – Statbel Open Data Portal 2  +-/ 110 datasets  Formats • XLSX  Excel  Pivot tables • CSV, TXT  R, SAS, …, PostgreSQL, • GML, SHP  QGIS, ArcGIS, … , • Json, XML, CSV, XLSX be.STAT=> dynamic databank of Statbel  Special care – Privacy – Continuity  Goal : 1 new dataset/month – Next : population, households, real estate
  • 8. Open Statistics – Statistics Belgium in the EU  European Statistical System = Eurostat + NSI’s – Key provider of public open data – Draft Open Data Strategy (feb 2017) Statistics Belgium • Statbel.fgov.be/opendata Eurostat • Key contributor to the open data portals EU Open Data Portal • Data.europa.eu/euodp Belgium • Data.gov.be Metadata harvesting European Data Portal • www.europeandataportal.eu metadata Metadata harvesting
  • 9. Open Statistics – 5***** ?  Statistics Belgium => Open Data Statbel: Situation actuelle Statbel: Ambition
  • 10. Open Statistics – RDF Resource description framework (RDF)
  • 11. Open Statistics – RDF - Uniform resource identifier URI  Use URIs to identify things, so that people can point at your stuff – A URI identifies a concept. – Example of a URI for the Rixensart commune:http://vocab.belgif.be/refnis/25091#id – In general, a URI is associated with a web page that documents the concept. For Rixensart: http://vocab.belgif.be/refnis/25091
  • 12. Open Statistics – Resource description framework (RDF)  In the RDF files, triplets of the type “subject-predicate-object” are stored  In RDF files, – subjects are URIs. – predicats are URIs. – objects are URIs ou des litéraux  Example (nomenclature): <http://vocab.belgif.be/refnis/25091#id> <http://www.w3.org/2004/02/skos/core#prefLabel> "Rixensart"@fr .  There are "standard vocabularies" (rules for forming triplets). Skos is one of them.
  • 13. Open Statistics – Resource description framework (RDF)
      ▪ "Prefixes" can be used to abbreviate URIs in RDF files.
      ▪ Example (Turtle syntax; a # inside a prefixed name is escaped as \#):
          @prefix refnis: <http://vocab.belgif.be/refnis/> .
          @prefix skos: <http://www.w3.org/2004/02/skos/core#> .
          refnis:25091\#id skos:prefLabel "Rixensart"@fr .
          refnis:25091\#id skos:broader refnis:25000\#id .
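Prefix abbreviation is plain string substitution: the prefix is replaced by the namespace URI it stands for. A minimal sketch of the expansion, using the two prefixes from the slide; the function name `expand` is hypothetical, and the input is the abbreviated form as it appears on the slide.

```python
from typing import Mapping

# Prefix declarations taken from the slide.
PREFIXES = {
    "refnis": "http://vocab.belgif.be/refnis/",
    "skos": "http://www.w3.org/2004/02/skos/core#",
}


def expand(name: str, prefixes: Mapping[str, str] = PREFIXES) -> str:
    """Expand 'prefix:local' into a full URI using the declared prefixes."""
    prefix, sep, local = name.partition(":")
    if not sep or prefix not in prefixes:
        raise KeyError(f"unknown or missing prefix in {name!r}")
    return prefixes[prefix] + local


subject = expand("refnis:25091#id")
predicate = expand("skos:prefLabel")
print(subject)
print(predicate)
```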
  • 14. Open Statistics – Resource description framework (RDF)
      ▪ Sample RDF file describing a study (metadata):
          – ddi:Study_1 a disco:Study .
          – ddi:Study_1 dcterms:title "National Population and Housing Census, 1980"@en .
          – ddi:Study_1 dcterms:identifier "ARG_1980_PHC_v01_A_IPUMS" .
      ▪ This description uses the "DDI-RDF" vocabulary (disco):
          – DDI-RDF is "A vocabulary for publishing metadata about data sets (research and survey data) into the Web of Linked Data"
          – Described here: http://rdf-vocabulary.ddialliance.org/discovery.html
  • 15. Open Statistics – Resource description framework (RDF)
      ▪ RDF = forming triples
      ▪ There are several syntaxes for writing them:
          – Turtle,
          – N-Triples,
          – RDF/XML,
          – …
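The same triple can be written in more than one syntax without changing its meaning. A minimal sketch, assuming a subject and predicate given as full URIs and a language-tagged literal as object: it emits the Rixensart label once as N-Triples and once as Turtle. The helper names are hypothetical, literal escaping is not handled, and `compact` escapes only `#` in local names, which is enough for this example but not for arbitrary URIs.

```python
from typing import Mapping

PREFIXES = {
    "refnis": "http://vocab.belgif.be/refnis/",
    "skos": "http://www.w3.org/2004/02/skos/core#",
}


def to_ntriples(subject: str, predicate: str, text: str, lang: str) -> str:
    """N-Triples: full URIs in angle brackets, one statement per line."""
    return f'<{subject}> <{predicate}> "{text}"@{lang} .'


def compact(iri: str, prefixes: Mapping[str, str]) -> str:
    """Abbreviate a URI to prefix:local when a declared prefix matches."""
    for prefix, base in prefixes.items():
        if iri.startswith(base):
            # In Turtle a '#' in a local name must be written as '\#'.
            local = iri[len(base):].replace("#", "\\#")
            return f"{prefix}:{local}"
    return f"<{iri}>"


def to_turtle(subject: str, predicate: str, text: str, lang: str,
              prefixes: Mapping[str, str] = PREFIXES) -> str:
    """Turtle: prefix declarations followed by the abbreviated statement."""
    header = [f"@prefix {name}: <{base}> ." for name, base in prefixes.items()]
    statement = (
        f"{compact(subject, prefixes)} {compact(predicate, prefixes)} "
        f'"{text}"@{lang} .'
    )
    return "\n".join(header + [statement])


SUBJECT = "http://vocab.belgif.be/refnis/25091#id"
PREDICATE = "http://www.w3.org/2004/02/skos/core#prefLabel"

nt = to_ntriples(SUBJECT, PREDICATE, "Rixensart", "fr")
ttl = to_turtle(SUBJECT, PREDICATE, "Rixensart", "fr")
print(nt)
print(ttl)
```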
  • 16. Open Statistics – Linked Open Data (LOD) Linked open-data (LOD)
  • 17. Open Statistics – Linked Open Data (LOD)
      ▪ Several RDF sources can be linked to each other. This is referred to as Linked Open Data (LOD).
      ▪ Examples of LOD sites to link to:
          – DBpedia
          – Wikidata
          – GeoNames
      ▪ A simple way to link to another database is to re-use its URIs
  • 18. Open Statistics – Linked Open Data (LOD)
      ▪ Example of LOD (nomenclature):
          @prefix refnis: <http://vocab.belgif.be/refnis/> .
          @prefix skos: <http://www.w3.org/2004/02/skos/core#> .
          refnis:25091\#id skos:prefLabel "Rixensart"@fr .
          refnis:25091\#id skos:broader refnis:25000\#id .
          refnis:25091\#id skos:exactMatch <http://sws.geonames.org/2787990> .
          refnis:25091\#id skos:exactMatch <http://www.wikidata.org/entity/Q630478> .
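Once the statements are loaded, following the links to other databases is a simple filter on the predicate. A minimal sketch that holds the four statements from the slide as plain tuples and lists the `skos:exactMatch` targets for Rixensart; the in-memory set and the helper `objects` are illustrative stand-ins for a real RDF store.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
RIXENSART = "http://vocab.belgif.be/refnis/25091#id"

# The four statements from the slide, as (subject, predicate, object) tuples.
triples = {
    (RIXENSART, SKOS + "prefLabel", '"Rixensart"@fr'),
    (RIXENSART, SKOS + "broader", "http://vocab.belgif.be/refnis/25000#id"),
    (RIXENSART, SKOS + "exactMatch", "http://sws.geonames.org/2787990"),
    (RIXENSART, SKOS + "exactMatch", "http://www.wikidata.org/entity/Q630478"),
}


def objects(store, subject, predicate):
    """Return, sorted, every object stated for the given subject and predicate."""
    return sorted(o for s, p, o in store if s == subject and p == predicate)


links = objects(triples, RIXENSART, SKOS + "exactMatch")
for uri in links:
    print(uri)
```

Because the targets are URIs owned by GeoNames and Wikidata, a consumer can dereference them to pull in whatever those sources state about the same municipality.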
  • 19. Open Statistics – Semantic web Semantic web
  • 20. Open Statistics – Semantic web
      ▪ All the "subject-predicate-object" statements of the different LOD sources form a giant "knowledge graph" whose size is increasing rapidly
  • 21. Open Statistics – Semantic web
  • 22. Open Statistics – Ontologies for statisticians Standard vocabularies
  • 23. Open Statistics – Standard vocabularies
      ▪ Classifications
          – SKOS: Classifications (nomenclatures)
          – XKOS: SKOS extension (for NACE, …)
      ▪ Document a list of files (catalog)
          – DCAT
          – StatDCAT-AP
          – GeoDCAT-AP
  • 24. Open Statistics – Standard vocabularies
      ▪ Metadata:
          – Dublin Core
          – DDI-RDF
      ▪ Data:
          – RDF Data Cube Vocabulary
  • 25. Open Statistics – Standard vocabularies
      ▪ Other interesting vocabularies recommended by Eurostat
          – The Organization Ontology
          – The PROV Ontology
          – Time Ontology in OWL
          – Dublin Core
          – ISA Core Vocabularies in RDF (Person, Public Organisation, Business, Public Service, Location)
          – Vocabulary of Interlinked Datasets (VoID)
  • 26. Open Statistics – Nomenclatures
      ▪ Some nomenclatures, "controlled vocabularies" & thesauri recommended by Eurostat:
          – INSPIRE code lists
          – EuroVoc thesaurus
          – Named Authority Lists (NAL)
  • 27. Open Statistics – LOD in the NSIs
      ▪ Some NSIs already have LOD:
          – Insee: some code tables + legal population
          – Istat
          – ONS + Geoportal UK
          – Census 2011 in Ireland
  • 28. Open Statistics – RDF@Statbel
      ▪ What to publish as LOD?
      ▪ Priorities for publication as LOD:
          – Nomenclatures (create URIs for NACEBEL, REFNIS, … + create files that expose hierarchies, …)
          – Catalog of the data (to let the "machines" all over the world know that our datasets are available in CSV, …)
          – Metadata
          – A selection of datasets (for example: legal population of municipalities)
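As an illustration of the "catalog of the data" priority, here is a minimal sketch of how one dataset and its CSV distribution could be announced with the DCAT vocabulary. The class and property names (`dcat:Dataset`, `dcat:distribution`, `dcat:Distribution`, `dcat:downloadURL`, `dcat:mediaType`, `dcterms:title`) come from DCAT and Dublin Core; the dataset URI, title, and download URL are hypothetical placeholders on example.org, not actual Statbel identifiers, and this is not Statbel's published catalog.

```python
DCAT = "http://www.w3.org/ns/dcat#"
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
CSV_MEDIA_TYPE = "https://www.iana.org/assignments/media-types/text/csv"


def iri(value: str) -> str:
    """Write a URI in N-Triples form."""
    return f"<{value}>"


def literal(text: str, lang: str) -> str:
    """Write a language-tagged literal, escaping backslashes and quotes."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"@{lang}'


def dataset_lines(dataset_uri: str, title: str, csv_url: str) -> list:
    """Describe one dataset with one CSV distribution as N-Triples lines."""
    dist_uri = dataset_uri + "/csv"  # hypothetical naming scheme
    triples = [
        (iri(dataset_uri), iri(RDF_TYPE), iri(DCAT + "Dataset")),
        (iri(dataset_uri), iri(DCT + "title"), literal(title, "en")),
        (iri(dataset_uri), iri(DCAT + "distribution"), iri(dist_uri)),
        (iri(dist_uri), iri(RDF_TYPE), iri(DCAT + "Distribution")),
        (iri(dist_uri), iri(DCAT + "downloadURL"), iri(csv_url)),
        (iri(dist_uri), iri(DCAT + "mediaType"), iri(CSV_MEDIA_TYPE)),
    ]
    return [" ".join(t) + " ." for t in triples]


# Hypothetical identifiers, for illustration only.
lines = dataset_lines(
    "https://example.org/dataset/legal-population",
    "Legal population of municipalities",
    "https://example.org/files/legal-population.csv",
)
print("\n".join(lines))
```

A harvester that understands DCAT can read such statements to discover that the dataset exists and where its CSV file can be downloaded.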
  • 29. Open Statistics – Questions
  • 30. Open Statistics – Contact
      ▪ Check out our websites
      ▪ Explore our datasets
      ▪ Re-use our data
      ▪ and contact us!
      ▪ For questions, please contact:
          statbel.opendata@economie.fgov.be
          Lucia.Decuyper@economie.fgov.be
          Youri.Baeyens@economie.fgov.be
      ▪ To find out more, check:
          http://statbel.fgov.be
          https://bestat.statbel.fgov.be
          http://statbel.fgov.be/opendata/
          http://statbel.fgov.be/en/statistics/opendata/licence/
      ▪ Follow Statbel on Twitter