SlideShare a Scribd company logo
1 of 22
Oscar Corcho1, Idafen Santana-Pérez1,
Hugo Lafuente2, David Portolés3,
César Cano4, Alfredo Peris4 and José María Subero4
1 Ontology Engineering Group, Universidad Politécnica de Madrid
2 Localidata
3 Idearium Consultores
4 Gobierno de Aragón
Publishing Linked
Statistical Data:
Aragón, a case study
ocorcho@fi.upm.es
@ocorcho
22/10/2017
SemStats 2017 @ ISWC
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Context
2
 IAEst: Instituto Aragonés de Estadística
o http://www.aragon.es/iaest
o The statistical office from Aragón
o Offering open data through
• Open Data portal in Aragón (http://opendata.aragon.es/)
• Their own portal (our interest is on the database of
“estadística local”)
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Context: Existing IAEst data infrastructure
3
 Existing data infrastructure
o Data warehouse infrastructure based on an Oracle BI
o Exports into different formats, including CSVs
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Context: Existing IAEst data infrastructure
4
 Existing data infrastructure
o Data warehouse infrastructure based on an Oracle BI
o Exports into different formats, including CSVs
o http://www.aragon.es/DepartamentosOrganismosPublicos/Institu
tos/InstitutoAragonesEstadistica/AreasGenericas/ci.EstadisticaL
ocal.detalleDepartamento
 Data retrieval and browsing
o Taxonomy-based
o Fixed filters coded in the app
o User selects
• Administrative division
• The concrete municipality
• Browses the folder structure
o Data retrieved in HTML, PDF
or CSV
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Predesigned reports
offered from Oracle BI
Web app for
Estadística Local
Context: Existing IAEst web app
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Context: Existing IAEst data sharing
 En la Web del IAEst
o http://www.aragon.es/DepartamentosOrganismosPublicos/Institu
tos/InstitutoAragonesEstadistica/AreasGenericas/ci.EstadisticaL
ocal.detalleDepartamento
 En OpenDataAragón
o http://opendata.aragon.es/catalogo/edificios-superficie-y-
vivienda-comarcas
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Goals
7
Extract those statistical reports, transform them into
RDF according to W3C standards, curate them, link
them to the existing Linked Data from Aragón (mostly
URIs from municipalities and regions) and provide an
API and a new user interface to make use of them
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Results
8
 An easier-to-maintain data transformation process
o Enriching existing Linked Data APIs from Aragón
o Using GitHub for
• Version control and archival
• Continuous updates: detecting new data and data structures
on a daily basis
• https://github.com/aragonopendata/local-data-aragopedia/
 Developer-friendly API
 Additional user interface
o Improving data retrieval and browsing capabilities
 Side effect: data curation
o Many errors and improvements detected in pre-existing CSV
exports, which have been corrected throughout the process
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Transformation and publication process
9
Initial characterisation
•Identify sources
•Identify dimensions
and measurements
Transformation
•Daily data download
•Processing (UTF8)
•Upload into GitHub
•New dimensions/measures
annotation
•RDF transformation
Publication and use
•Linked Data APIs
https://github.com/aragonopendata/local-data-aragopedia/
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Initial characterisation
10
 Identify and download data sources to be published
(~1000)
o https://github.com/aragonopendata/local-data-
aragopedia/tree/master/data/resource/DatosDescarga-UTF8
 Pre-process data (UTF-8 encoding, download error
verification and retrials)
 Identify potential dimensions and measurements
o Analysis of column header names (e.g., municipio, comarca),
and data content (how many different values)
• https://github.com/aragonopendata/local-data-
aragopedia/blob/master/data/resource/heads.txt
o From 700+ dimensions to ~500
• Curated by IAEst experts (e.g., Male, M, Males, Female, F,
Females, Women, Men)
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Initial characterisation
11
 SKOS concept schemes for each dimension
o https://github.com/aragonopendata/local-data-
aragopedia/tree/master/data/dump/DatosTTL/codelists
o Mapping files available in GitHub (e.g.,
https://github.com/aragonopendata/local-data-
aragopedia/blob/master/data/metadata/mapping-tipo-edificio-
detalle.xlsx)
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Initial characterisation
12
 Measurement properties
o https://github.com/aragonopendata/local-data-
aragopedia/blob/master/data/dump/DatosTTL/codelists/propertie
s.ttl
 DSDs
o https://github.com/aragonopendata/local-data-
aragopedia/tree/master/data/dump/DatosTTL/dataStructures
 Errors were identified during this phase
o Same concept, different names (e.g. sexo and género)
o Typos in header names
o Columns with no values
o Data belonging to wrong municipalities and districts
o https://github.com/aragonopendata/local-data-
aragopedia/blob/master/data/dump/errorReport.txt
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Continuous Transformation
13
 Continuous production cycle
o Update RDF as reports are generated, modified or removed
 Executed every night
o Retrieves all the reports from the list (generated before)
o Checks whether the reports have been already transformed
or if the contain new data
o Hash signatures for each generated Data Cube
• https://github.com/aragonopendata/local-data-
aragopedia/blob/master/data/resource/hashcode.csv
• Used to compare data versions
• If hashes do not match, the Data Cube is marked to be
regenerated
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Continuous Transformation
14
 Each iteration generates a GitHub issue, listing the
cubes that have must be created, modified, etc.
o https://github.com/aragonopendata/local-data-
aragopedia/issues
• https://github.com/aragonopendata/local-data-
aragopedia/issues/93 (new data)
• https://github.com/aragonopendata/local-data-
aragopedia/issues/457 (datacube to delete, new
configurations needed)
o When user interaction is needed, this is reflected in the issue
text, and the IAEst responsible needs to update it
 RDF transformation is done according to the
configuration file
o https://github.com/aragonopendata/local-data-
aragopedia/blob/master/data/metadata/Informe-01-010001-
A-TC-TM-TP.xlsx
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Continuous Transformation
15
 RDF data is stored in GitHub (new version)
o https://github.com/aragonopendata/local-data-
aragopedia/tree/master/data/dump/DatosTTL/informes
 RDF data is stored in the Open Data Aragón
SPARQL endpoint
o http://opendata.aragon.es/sparql
o Reusing the 3cixty KB deployment utilities
o Each cube is stored on its own graph
o Graphs updated for Data Structure Definition (DSD),
properties and SKOS information
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Data transformation. In summary…
bi.aragon.es
Google
Drive
Dataset and
configuration download
New dataset?
GitHub
Sí
For each
dataset
Generate new
configuration and
create an issue
New structure?
No
Create
issue
Sí
New data?
Regenerate
data and
create issue
No
Sí
SPARQL
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Data publication and use
17
 Data can be accessed
o API (using ELDA)
• http://opendata.aragon.es/herramientas/apis?#aragodbpedia
o GitHub (CSVs, RDF)
o SPARQL endpoint
SPARQL
Elda
Linked Data
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Data API
http://opendata.aragon.es/herramientas/apis?#aragodb
pedia
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Data publication and use
19
 Aragopedia
o http://opendata.aragon.es/apps/aragopedia/datos
o Where, when and what (dónde, cuándo y qué)
o Data can be downloaded in
• CSV
• JSON
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Aragopedia
20
 Aragopedia
o JSON result of querying about
• Maestrazgo region (where)
• population (what)
• in 1999 (when)
Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017
Conclusions (Results)
21
 An easier-to-maintain data transformation process
o Enriching existing Linked Data APIs from Aragón
o Using GitHub for
• Version control and archival
• Continuous updates: detecting new data and data structures
on a daily basis
• https://github.com/aragonopendata/local-data-aragopedia/
 Developer-friendly API
 Additional user interface
o Improving data retrieval and browsing capabilities
 Side effect: data curation
o Many errors and improvements detected in pre-existing CSV
exports, which have been corrected throughout the process
Oscar Corcho1, Idafen Santana-Pérez1,
Hugo Lafuente2, David Portolés3,
César Cano4, Alfredo Peris4 and José María Subero4
1 Ontology Engineering Group, Universidad Politécnica de Madrid
2 Localidata
3 Idearium Consultores
4 Gobierno de Aragón
Publishing Linked
Statistical Data:
Aragón, a case study
ocorcho@fi.upm.es
@ocorcho
22/10/2017
SemStats 2017 @ ISWC

More Related Content

What's hot

iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...andimou
 
High quality Linked Data generation for librarians
High quality Linked Data generation for librariansHigh quality Linked Data generation for librarians
High quality Linked Data generation for librariansandimou
 
What Factors Influence the Design of a Linked Data Generation Algorithm?
What Factors Influence the Design of a Linked Data Generation Algorithm?What Factors Influence the Design of a Linked Data Generation Algorithm?
What Factors Influence the Design of a Linked Data Generation Algorithm?andimou
 
Boosting big data with apache spark
Boosting big data with apache sparkBoosting big data with apache spark
Boosting big data with apache sparkInfoFarm
 
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...Statistisk sentralbyrå
 
Provenance as a building block for an open science infrastructure
Provenance as a building block for an open science infrastructureProvenance as a building block for an open science infrastructure
Provenance as a building block for an open science infrastructureAndreas Schreiber
 
Semantic E-Commerce - Use Cases in Enterprise Web Applications
Semantic E-Commerce - Use Cases in Enterprise Web ApplicationsSemantic E-Commerce - Use Cases in Enterprise Web Applications
Semantic E-Commerce - Use Cases in Enterprise Web ApplicationsLinked Enterprise Date Services
 
Stream processing: The Matrix Revolutions
Stream processing: The Matrix RevolutionsStream processing: The Matrix Revolutions
Stream processing: The Matrix RevolutionsRomanaPernischov
 
MOAR RDA For Systems Folks
MOAR RDA For Systems FolksMOAR RDA For Systems Folks
MOAR RDA For Systems FolksAlison Hitchens
 
The reuse of open data: an opportunity for Spain
The reuse of open data: an opportunity for SpainThe reuse of open data: an opportunity for Spain
The reuse of open data: an opportunity for SpainAlberto Abella
 
Finding Insights In Connected Data: Using Graph Databases In Journalism
Finding Insights In Connected Data: Using Graph Databases In JournalismFinding Insights In Connected Data: Using Graph Databases In Journalism
Finding Insights In Connected Data: Using Graph Databases In JournalismWilliam Lyon
 
Data Collection Industry Insights: Hotels
Data Collection Industry Insights: HotelsData Collection Industry Insights: Hotels
Data Collection Industry Insights: HotelsOxylabs
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data21Style
 
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesKESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesLinked Enterprise Date Services
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis
 
Getting started with JUSP
Getting started with JUSPGetting started with JUSP
Getting started with JUSPJUSPSTATS
 
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral ConsortiumEnabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral ConsortiumHenrique O. Santos
 
Introducing the IRUSdataUK pilot webinar
Introducing the IRUSdataUK pilot webinarIntroducing the IRUSdataUK pilot webinar
Introducing the IRUSdataUK pilot webinarJisc
 

What's hot (20)

LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
 
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
 
High quality Linked Data generation for librarians
High quality Linked Data generation for librariansHigh quality Linked Data generation for librarians
High quality Linked Data generation for librarians
 
What Factors Influence the Design of a Linked Data Generation Algorithm?
What Factors Influence the Design of a Linked Data Generation Algorithm?What Factors Influence the Design of a Linked Data Generation Algorithm?
What Factors Influence the Design of a Linked Data Generation Algorithm?
 
Boosting big data with apache spark
Boosting big data with apache sparkBoosting big data with apache spark
Boosting big data with apache spark
 
IMDb Data Integration
IMDb Data IntegrationIMDb Data Integration
IMDb Data Integration
 
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
 
Provenance as a building block for an open science infrastructure
Provenance as a building block for an open science infrastructureProvenance as a building block for an open science infrastructure
Provenance as a building block for an open science infrastructure
 
Semantic E-Commerce - Use Cases in Enterprise Web Applications
Semantic E-Commerce - Use Cases in Enterprise Web ApplicationsSemantic E-Commerce - Use Cases in Enterprise Web Applications
Semantic E-Commerce - Use Cases in Enterprise Web Applications
 
Stream processing: The Matrix Revolutions
Stream processing: The Matrix RevolutionsStream processing: The Matrix Revolutions
Stream processing: The Matrix Revolutions
 
MOAR RDA For Systems Folks
MOAR RDA For Systems FolksMOAR RDA For Systems Folks
MOAR RDA For Systems Folks
 
The reuse of open data: an opportunity for Spain
The reuse of open data: an opportunity for SpainThe reuse of open data: an opportunity for Spain
The reuse of open data: an opportunity for Spain
 
Finding Insights In Connected Data: Using Graph Databases In Journalism
Finding Insights In Connected Data: Using Graph Databases In JournalismFinding Insights In Connected Data: Using Graph Databases In Journalism
Finding Insights In Connected Data: Using Graph Databases In Journalism
 
Data Collection Industry Insights: Hotels
Data Collection Industry Insights: HotelsData Collection Industry Insights: Hotels
Data Collection Industry Insights: Hotels
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
 
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesKESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
Getting started with JUSP
Getting started with JUSPGetting started with JUSP
Getting started with JUSP
 
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral ConsortiumEnabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
 
Introducing the IRUSdataUK pilot webinar
Introducing the IRUSdataUK pilot webinarIntroducing the IRUSdataUK pilot webinar
Introducing the IRUSdataUK pilot webinar
 

Similar to Publishing Linked Statistical Data: Aragón, a case study

BDE SC6-ws-05/12/2016 technology part - SWC
BDE SC6-ws-05/12/2016 technology part - SWCBDE SC6-ws-05/12/2016 technology part - SWC
BDE SC6-ws-05/12/2016 technology part - SWCBigData_Europe
 
Boost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentBoost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentOntotext
 
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...BigData_Europe
 
Big data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilotsBig data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilotsBigData_Europe
 
eNanoMapper database, search tools and templates
eNanoMapper database, search tools and templateseNanoMapper database, search tools and templates
eNanoMapper database, search tools and templatesNina Jeliazkova
 
StatDCAT-Application Profile: presentation
StatDCAT-Application Profile: presentationStatDCAT-Application Profile: presentation
StatDCAT-Application Profile: presentationSemic.eu
 
The Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open DataThe Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open DataOntotext
 
Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?Oscar Corcho
 
Meetup070416 Presentations
Meetup070416 PresentationsMeetup070416 Presentations
Meetup070416 PresentationsAna Rebelo
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformSanjay Padhi, Ph.D
 
Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesPistoia Alliance
 
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...BigData_Europe
 
BDE SC6 workshop - introduction 2016
BDE SC6 workshop - introduction 2016BDE SC6 workshop - introduction 2016
BDE SC6 workshop - introduction 2016BigData_Europe
 
TEAMS 6, 7 and 8
TEAMS 6, 7 and 8TEAMS 6, 7 and 8
TEAMS 6, 7 and 8plan4all
 
WSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product OverviewWSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product OverviewWSO2
 
BDE SC6-hang out - technology part-SWC - Martin
BDE SC6-hang out - technology part-SWC - MartinBDE SC6-hang out - technology part-SWC - Martin
BDE SC6-hang out - technology part-SWC - MartinBigData_Europe
 
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSABetter Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSAPRBETTER
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)Piet J.H. Daas
 

Similar to Publishing Linked Statistical Data: Aragón, a case study (20)

BDE SC6-ws-05/12/2016 technology part - SWC
BDE SC6-ws-05/12/2016 technology part - SWCBDE SC6-ws-05/12/2016 technology part - SWC
BDE SC6-ws-05/12/2016 technology part - SWC
 
Boost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentBoost your data analytics with open data and public news content
Boost your data analytics with open data and public news content
 
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
 
Big data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilotsBig data Europe: concept, platform and pilots
Big data Europe: concept, platform and pilots
 
GeoLinkedData
GeoLinkedDataGeoLinkedData
GeoLinkedData
 
eNanoMapper database, search tools and templates
eNanoMapper database, search tools and templateseNanoMapper database, search tools and templates
eNanoMapper database, search tools and templates
 
StatDCAT-Application Profile: presentation
StatDCAT-Application Profile: presentationStatDCAT-Application Profile: presentation
StatDCAT-Application Profile: presentation
 
The Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open DataThe Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open Data
 
Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?
 
Meetup070416 Presentations
Meetup070416 PresentationsMeetup070416 Presentations
Meetup070416 Presentations
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
 
BDE SC6 workshop - introduction 2016
BDE SC6 workshop - introduction 2016BDE SC6 workshop - introduction 2016
BDE SC6 workshop - introduction 2016
 
TEAMS 6, 7 and 8
TEAMS 6, 7 and 8TEAMS 6, 7 and 8
TEAMS 6, 7 and 8
 
WSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product OverviewWSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product Overview
 
BDE SC6-hang out - technology part-SWC - Martin
BDE SC6-hang out - technology part-SWC - MartinBDE SC6-hang out - technology part-SWC - Martin
BDE SC6-hang out - technology part-SWC - Martin
 
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSABetter Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
 
Bde euro proworkshop
Bde euro proworkshopBde euro proworkshop
Bde euro proworkshop
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
 

More from Oscar Corcho

Organisational Interoperability in Practice at Universidad Politécnica de Madrid
Organisational Interoperability in Practice at Universidad Politécnica de MadridOrganisational Interoperability in Practice at Universidad Politécnica de Madrid
Organisational Interoperability in Practice at Universidad Politécnica de MadridOscar Corcho
 
Introducción a los Datos Abiertos - Open Data Day 2020
Introducción a los Datos Abiertos - Open Data Day 2020Introducción a los Datos Abiertos - Open Data Day 2020
Introducción a los Datos Abiertos - Open Data Day 2020Oscar Corcho
 
Open Data (and Software, and other Research Artefacts) - A proper management
Open Data (and Software, and other Research Artefacts) -A proper managementOpen Data (and Software, and other Research Artefacts) -A proper management
Open Data (and Software, and other Research Artefacts) - A proper management Oscar Corcho
 
Adiós a los ficheros, hola a los grafos de conocimientos estadísticos
Adiós a los ficheros, hola a los grafos de conocimientos estadísticosAdiós a los ficheros, hola a los grafos de conocimientos estadísticos
Adiós a los ficheros, hola a los grafos de conocimientos estadísticosOscar Corcho
 
Ontology Engineering at Scale for Open City Data Sharing
Ontology Engineering at Scale for Open City Data SharingOntology Engineering at Scale for Open City Data Sharing
Ontology Engineering at Scale for Open City Data SharingOscar Corcho
 
Situación de las iniciativas de Open Data internacionales (y algunas recomen...
Situación de las iniciativas de Open Data internacionales (y algunas recomen...Situación de las iniciativas de Open Data internacionales (y algunas recomen...
Situación de las iniciativas de Open Data internacionales (y algunas recomen...Oscar Corcho
 
STARS4ALL - Contaminación Lumínica
STARS4ALL - Contaminación LumínicaSTARS4ALL - Contaminación Lumínica
STARS4ALL - Contaminación LumínicaOscar Corcho
 
Towards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceTowards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceOscar Corcho
 
An initial analysis of topic-based similarity among scientific documents base...
An initial analysis of topic-based similarity among scientific documents base...An initial analysis of topic-based similarity among scientific documents base...
An initial analysis of topic-based similarity among scientific documents base...Oscar Corcho
 
Linked Statistical Data 101
Linked Statistical Data 101Linked Statistical Data 101
Linked Statistical Data 101Oscar Corcho
 
Aplicando los principios de Linked Data en AEMET
Aplicando los principios de Linked Data en AEMETAplicando los principios de Linked Data en AEMET
Aplicando los principios de Linked Data en AEMET Oscar Corcho
 
Ojo Al Data 100 - Call for sharing session at IODC 2016
Ojo Al Data 100 - Call for sharing session at IODC 2016Ojo Al Data 100 - Call for sharing session at IODC 2016
Ojo Al Data 100 - Call for sharing session at IODC 2016Oscar Corcho
 
Educando sobre datos abiertos: desde el colegio a la universidad
Educando sobre datos abiertos: desde el colegio a la universidadEducando sobre datos abiertos: desde el colegio a la universidad
Educando sobre datos abiertos: desde el colegio a la universidadOscar Corcho
 
STARS4ALL general presentation at ALAN2016
STARS4ALL general presentation at ALAN2016STARS4ALL general presentation at ALAN2016
STARS4ALL general presentation at ALAN2016Oscar Corcho
 
Generación de datos estadísticos enlazados del Instituto Aragonés de Estadística
Generación de datos estadísticos enlazados del Instituto Aragonés de EstadísticaGeneración de datos estadísticos enlazados del Instituto Aragonés de Estadística
Generación de datos estadísticos enlazados del Instituto Aragonés de EstadísticaOscar Corcho
 
Presentación de la red de excelencia de Open Data y Smart Cities
Presentación de la red de excelencia de Open Data y Smart CitiesPresentación de la red de excelencia de Open Data y Smart Cities
Presentación de la red de excelencia de Open Data y Smart CitiesOscar Corcho
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Oscar Corcho
 
Slow-cooked data and APIs in the world of Big Data: the view from a city per...
Slow-cooked data and APIs in the world of Big Data: the view from a city per...Slow-cooked data and APIs in the world of Big Data: the view from a city per...
Slow-cooked data and APIs in the world of Big Data: the view from a city per...Oscar Corcho
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityOscar Corcho
 
(Big) Data (Science) Skills
(Big) Data (Science) Skills(Big) Data (Science) Skills
(Big) Data (Science) SkillsOscar Corcho
 

More from Oscar Corcho (20)

Organisational Interoperability in Practice at Universidad Politécnica de Madrid
Organisational Interoperability in Practice at Universidad Politécnica de MadridOrganisational Interoperability in Practice at Universidad Politécnica de Madrid
Organisational Interoperability in Practice at Universidad Politécnica de Madrid
 
Introducción a los Datos Abiertos - Open Data Day 2020
Introducción a los Datos Abiertos - Open Data Day 2020Introducción a los Datos Abiertos - Open Data Day 2020
Introducción a los Datos Abiertos - Open Data Day 2020
 
Open Data (and Software, and other Research Artefacts) - A proper management
Open Data (and Software, and other Research Artefacts) -A proper managementOpen Data (and Software, and other Research Artefacts) -A proper management
Open Data (and Software, and other Research Artefacts) - A proper management
 
Adiós a los ficheros, hola a los grafos de conocimientos estadísticos
Adiós a los ficheros, hola a los grafos de conocimientos estadísticosAdiós a los ficheros, hola a los grafos de conocimientos estadísticos
Adiós a los ficheros, hola a los grafos de conocimientos estadísticos
 
Ontology Engineering at Scale for Open City Data Sharing
Ontology Engineering at Scale for Open City Data SharingOntology Engineering at Scale for Open City Data Sharing
Ontology Engineering at Scale for Open City Data Sharing
 
Situación de las iniciativas de Open Data internacionales (y algunas recomen...
Situación de las iniciativas de Open Data internacionales (y algunas recomen...Situación de las iniciativas de Open Data internacionales (y algunas recomen...
Situación de las iniciativas de Open Data internacionales (y algunas recomen...
 
STARS4ALL - Contaminación Lumínica
STARS4ALL - Contaminación LumínicaSTARS4ALL - Contaminación Lumínica
STARS4ALL - Contaminación Lumínica
 
Towards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceTowards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experience
 
An initial analysis of topic-based similarity among scientific documents base...
An initial analysis of topic-based similarity among scientific documents base...An initial analysis of topic-based similarity among scientific documents base...
An initial analysis of topic-based similarity among scientific documents base...
 
Linked Statistical Data 101
Linked Statistical Data 101Linked Statistical Data 101
Linked Statistical Data 101
 
Aplicando los principios de Linked Data en AEMET
Aplicando los principios de Linked Data en AEMETAplicando los principios de Linked Data en AEMET
Aplicando los principios de Linked Data en AEMET
 
Ojo Al Data 100 - Call for sharing session at IODC 2016
Ojo Al Data 100 - Call for sharing session at IODC 2016Ojo Al Data 100 - Call for sharing session at IODC 2016
Ojo Al Data 100 - Call for sharing session at IODC 2016
 
Educando sobre datos abiertos: desde el colegio a la universidad
Educando sobre datos abiertos: desde el colegio a la universidadEducando sobre datos abiertos: desde el colegio a la universidad
Educando sobre datos abiertos: desde el colegio a la universidad
 
STARS4ALL general presentation at ALAN2016
STARS4ALL general presentation at ALAN2016STARS4ALL general presentation at ALAN2016
STARS4ALL general presentation at ALAN2016
 
Generación de datos estadísticos enlazados del Instituto Aragonés de Estadística
Generación de datos estadísticos enlazados del Instituto Aragonés de EstadísticaGeneración de datos estadísticos enlazados del Instituto Aragonés de Estadística
Generación de datos estadísticos enlazados del Instituto Aragonés de Estadística
 
Presentación de la red de excelencia de Open Data y Smart Cities
Presentación de la red de excelencia de Open Data y Smart CitiesPresentación de la red de excelencia de Open Data y Smart Cities
Presentación de la red de excelencia de Open Data y Smart Cities
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?
 
Slow-cooked data and APIs in the world of Big Data: the view from a city per...
Slow-cooked data and APIs in the world of Big Data: the view from a city per...Slow-cooked data and APIs in the world of Big Data: the view from a city per...
Slow-cooked data and APIs in the world of Big Data: the view from a city per...
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibility
 
(Big) Data (Science) Skills
(Big) Data (Science) Skills(Big) Data (Science) Skills
(Big) Data (Science) Skills
 

Recently uploaded

Building a better Britain: How cities like Bradford can help to end economic ...
Building a better Britain: How cities like Bradford can help to end economic ...Building a better Britain: How cities like Bradford can help to end economic ...
Building a better Britain: How cities like Bradford can help to end economic ...ResolutionFoundation
 
Inflation scarring: How has the cost-of-living crisis changed Britain?
Inflation scarring: How has the cost-of-living crisis changed Britain?Inflation scarring: How has the cost-of-living crisis changed Britain?
Inflation scarring: How has the cost-of-living crisis changed Britain?ResolutionFoundation
 
History of DAVAO DE ORO Municipality of Maragusan
History of DAVAO DE ORO Municipality of MaragusanHistory of DAVAO DE ORO Municipality of Maragusan
History of DAVAO DE ORO Municipality of Maragusannarzilgulmatico
 
Program Kickoff- Cohort......_4 (1).pptx
Program Kickoff- Cohort......_4 (1).pptxProgram Kickoff- Cohort......_4 (1).pptx
Program Kickoff- Cohort......_4 (1).pptxScottMeyers35
 
Effective governance in the modern charity
Effective governance in the modern charityEffective governance in the modern charity
Effective governance in the modern charityFelixPerez547899
 
OilChange: Big Oil Reality Check May 2024
OilChange: Big Oil Reality Check May 2024OilChange: Big Oil Reality Check May 2024
OilChange: Big Oil Reality Check May 2024Energy for One World
 
Vip ℂall Girls Laxmi Nagar Phone No 9999965857 High Profile ℂall Girl Delhi N...
Vip ℂall Girls Laxmi Nagar Phone No 9999965857 High Profile ℂall Girl Delhi N...Vip ℂall Girls Laxmi Nagar Phone No 9999965857 High Profile ℂall Girl Delhi N...
Vip ℂall Girls Laxmi Nagar Phone No 9999965857 High Profile ℂall Girl Delhi N...zoyat9250
 
Rocky Mount Greenville Willson Regional Transit Plan
Rocky Mount Greenville Willson Regional Transit PlanRocky Mount Greenville Willson Regional Transit Plan
Rocky Mount Greenville Willson Regional Transit PlanRobert Hiett
 
Vip ℂall Girls New Delhi Phone No 9999965857 High Profile ℂall Girl Delhi No ...
Vip ℂall Girls New Delhi Phone No 9999965857 High Profile ℂall Girl Delhi No ...Vip ℂall Girls New Delhi Phone No 9999965857 High Profile ℂall Girl Delhi No ...
Vip ℂall Girls New Delhi Phone No 9999965857 High Profile ℂall Girl Delhi No ...vanyaj843
 
Vip ℂall Girls Akshardham Phone No 9999965857 High Profile ℂall Girl Delhi No...
Vip ℂall Girls Akshardham Phone No 9999965857 High Profile ℂall Girl Delhi No...Vip ℂall Girls Akshardham Phone No 9999965857 High Profile ℂall Girl Delhi No...
Vip ℂall Girls Akshardham Phone No 9999965857 High Profile ℂall Girl Delhi No...jiyav969
 
2024: The FAR - Federal Acquisition Regulations, Part 33
2024: The FAR - Federal Acquisition Regulations, Part 332024: The FAR - Federal Acquisition Regulations, Part 33
2024: The FAR - Federal Acquisition Regulations, Part 33JSchaus & Associates
 
Vip ℂall Girls AK Market Phone No 9999965857 High Profile ℂall Girl Delhi No ...
Vip ℂall Girls AK Market Phone No 9999965857 High Profile ℂall Girl Delhi No ...Vip ℂall Girls AK Market Phone No 9999965857 High Profile ℂall Girl Delhi No ...
Vip ℂall Girls AK Market Phone No 9999965857 High Profile ℂall Girl Delhi No ...jiyav969
 
Daftar Rumpun, Pohon, dan Cabang Ilmu (2024) - Daftar Rumpun, Pohon, dan Caba...
Daftar Rumpun, Pohon, dan Cabang Ilmu (2024) - Daftar Rumpun, Pohon, dan Caba...Daftar Rumpun, Pohon, dan Cabang Ilmu (2024) - Daftar Rumpun, Pohon, dan Caba...
Daftar Rumpun, Pohon, dan Cabang Ilmu (2024) - Daftar Rumpun, Pohon, dan Caba...suryaaamsyah
 
Dawson City Rolling Ads for May 15 2024 Yukon
Dawson City Rolling Ads for May 15 2024 YukonDawson City Rolling Ads for May 15 2024 Yukon
Dawson City Rolling Ads for May 15 2024 Yukonpmenzies
 
16 may, International Day of Living together in peace 2024
16 may, International Day of Living together in peace 202416 may, International Day of Living together in peace 2024
16 may, International Day of Living together in peace 2024Christina Parmionova
 
加急制作英国朴次茅斯大学毕业证(uop学位证书)学位证书范本雅思成绩单原版一模一样
加急制作英国朴次茅斯大学毕业证(uop学位证书)学位证书范本雅思成绩单原版一模一样加急制作英国朴次茅斯大学毕业证(uop学位证书)学位证书范本雅思成绩单原版一模一样
加急制作英国朴次茅斯大学毕业证(uop学位证书)学位证书范本雅思成绩单原版一模一样ucnutfa
 
Effective Financial Reporting - May 2024
Effective Financial Reporting - May 2024Effective Financial Reporting - May 2024
Effective Financial Reporting - May 2024FelixPerez547899
 

Recently uploaded (20)

Building a better Britain: How cities like Bradford can help to end economic ...
Building a better Britain: How cities like Bradford can help to end economic ...Building a better Britain: How cities like Bradford can help to end economic ...
Building a better Britain: How cities like Bradford can help to end economic ...
 
Inflation scarring: How has the cost-of-living crisis changed Britain?
Inflation scarring: How has the cost-of-living crisis changed Britain?Inflation scarring: How has the cost-of-living crisis changed Britain?
Inflation scarring: How has the cost-of-living crisis changed Britain?
 
History of DAVAO DE ORO Municipality of Maragusan
History of DAVAO DE ORO Municipality of MaragusanHistory of DAVAO DE ORO Municipality of Maragusan
History of DAVAO DE ORO Municipality of Maragusan
 
Program Kickoff- Cohort......_4 (1).pptx
Program Kickoff- Cohort......_4 (1).pptxProgram Kickoff- Cohort......_4 (1).pptx
Program Kickoff- Cohort......_4 (1).pptx
 
Effective governance in the modern charity
Effective governance in the modern charityEffective governance in the modern charity
Effective governance in the modern charity
 
OilChange: Big Oil Reality Check May 2024
OilChange: Big Oil Reality Check May 2024OilChange: Big Oil Reality Check May 2024
OilChange: Big Oil Reality Check May 2024
 
07/03/2024 Publiekdomeindag - voormiddag
07/03/2024 Publiekdomeindag - voormiddag07/03/2024 Publiekdomeindag - voormiddag
07/03/2024 Publiekdomeindag - voormiddag
 
Vip ℂall Girls Laxmi Nagar Phone No 9999965857 High Profile ℂall Girl Delhi N...
Vip ℂall Girls Laxmi Nagar Phone No 9999965857 High Profile ℂall Girl Delhi N...Vip ℂall Girls Laxmi Nagar Phone No 9999965857 High Profile ℂall Girl Delhi N...
Vip ℂall Girls Laxmi Nagar Phone No 9999965857 High Profile ℂall Girl Delhi N...
 
Rocky Mount Greenville Willson Regional Transit Plan
Rocky Mount Greenville Willson Regional Transit PlanRocky Mount Greenville Willson Regional Transit Plan
Rocky Mount Greenville Willson Regional Transit Plan
 
Vip ℂall Girls New Delhi Phone No 9999965857 High Profile ℂall Girl Delhi No ...
Vip ℂall Girls New Delhi Phone No 9999965857 High Profile ℂall Girl Delhi No ...Vip ℂall Girls New Delhi Phone No 9999965857 High Profile ℂall Girl Delhi No ...
Vip ℂall Girls New Delhi Phone No 9999965857 High Profile ℂall Girl Delhi No ...
 
Vip ℂall Girls Akshardham Phone No 9999965857 High Profile ℂall Girl Delhi No...
Vip ℂall Girls Akshardham Phone No 9999965857 High Profile ℂall Girl Delhi No...Vip ℂall Girls Akshardham Phone No 9999965857 High Profile ℂall Girl Delhi No...
Vip ℂall Girls Akshardham Phone No 9999965857 High Profile ℂall Girl Delhi No...
 
2024: The FAR - Federal Acquisition Regulations, Part 33
2024: The FAR - Federal Acquisition Regulations, Part 332024: The FAR - Federal Acquisition Regulations, Part 33
2024: The FAR - Federal Acquisition Regulations, Part 33
 
07/03/2024 Publiekdomeindag - namiddag
07/03/2024 Publiekdomeindag - namiddag07/03/2024 Publiekdomeindag - namiddag
07/03/2024 Publiekdomeindag - namiddag
 
How to Save a Place: Get the Word Out Far And Wide
How to Save a Place: Get the Word Out Far And WideHow to Save a Place: Get the Word Out Far And Wide
How to Save a Place: Get the Word Out Far And Wide
 
Vip ℂall Girls AK Market Phone No 9999965857 High Profile ℂall Girl Delhi No ...
Vip ℂall Girls AK Market Phone No 9999965857 High Profile ℂall Girl Delhi No ...Vip ℂall Girls AK Market Phone No 9999965857 High Profile ℂall Girl Delhi No ...
Vip ℂall Girls AK Market Phone No 9999965857 High Profile ℂall Girl Delhi No ...
 
Daftar Rumpun, Pohon, dan Cabang Ilmu (2024) - Daftar Rumpun, Pohon, dan Caba...
Daftar Rumpun, Pohon, dan Cabang Ilmu (2024) - Daftar Rumpun, Pohon, dan Caba...Daftar Rumpun, Pohon, dan Cabang Ilmu (2024) - Daftar Rumpun, Pohon, dan Caba...
Daftar Rumpun, Pohon, dan Cabang Ilmu (2024) - Daftar Rumpun, Pohon, dan Caba...
 
Dawson City Rolling Ads for May 15 2024 Yukon
Dawson City Rolling Ads for May 15 2024 YukonDawson City Rolling Ads for May 15 2024 Yukon
Dawson City Rolling Ads for May 15 2024 Yukon
 
16 may, International Day of Living together in peace 2024
16 may, International Day of Living together in peace 202416 may, International Day of Living together in peace 2024
16 may, International Day of Living together in peace 2024
 
加急制作英国朴次茅斯大学毕业证(uop学位证书)学位证书范本雅思成绩单原版一模一样
加急制作英国朴次茅斯大学毕业证(uop学位证书)学位证书范本雅思成绩单原版一模一样加急制作英国朴次茅斯大学毕业证(uop学位证书)学位证书范本雅思成绩单原版一模一样
加急制作英国朴次茅斯大学毕业证(uop学位证书)学位证书范本雅思成绩单原版一模一样
 
Effective Financial Reporting - May 2024
Effective Financial Reporting - May 2024Effective Financial Reporting - May 2024
Effective Financial Reporting - May 2024
 

Publishing Linked Statistical Data: Aragón, a case study

  • 1. Oscar Corcho1, Idafen Santana-Pérez1, Hugo Lafuente2, David Portolés3, César Cano4, Alfredo Peris4 and José María Subero4 1 Ontology Engineering Group, Universidad Politécnica de Madrid 2 Localidata 3 Idearium Consultores 4 Gobierno de Aragón Publishing Linked Statistical Data: Aragón, a case study ocorcho@fi.upm.es @ocorcho 22/10/2017 SemStats 2017 @ ISWC
  • 2. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Context 2  IAEst: Instituto Aragonés de Estadística o http://www.aragon.es/iaest o The statistical office from Aragón o Offering open data through • Open Data portal in Aragón (http://opendata.aragon.es/) • Their own portal (our interest is on the database of “estadística local”)
  • 3. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Context: Existing IAEst data infrastructure 3  Existing data infrastructure o Data warehouse infrastructure based on an Oracle BI o Exports into different formats, including CSVs
  • 4. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Context: Existing IAEst data infrastructure 4  Existing data infrastructure o Data warehouse infrastructure based on an Oracle BI o Exports into different formats, including CSVs o http://www.aragon.es/DepartamentosOrganismosPublicos/Institu tos/InstitutoAragonesEstadistica/AreasGenericas/ci.EstadisticaL ocal.detalleDepartamento  Data retrieval and browsing o Taxonomy-based o Fixed filters coded in the app o User selects • Administrative division • The concrete municipality • Browses the folder structure o Data retrieved in HTML, PDF or CSV
  • 5. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Predesigned reports offered from Oracle BI Web app for Estadística Local Context: Existing IAEst web app
  • 6. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Context: Existing IAEst data sharing  En la Web del IAEst o http://www.aragon.es/DepartamentosOrganismosPublicos/Institu tos/InstitutoAragonesEstadistica/AreasGenericas/ci.EstadisticaL ocal.detalleDepartamento  En OpenDataAragón o http://opendata.aragon.es/catalogo/edificios-superficie-y- vivienda-comarcas
  • 7. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Goals 7 Extract those statistical reports, transform them into RDF according to W3C standards, curate them, link them to the existing Linked Data from Aragón (mostly URIs from municipalities and regions) and provide an API and a new user interface to make use of them
  • 8. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Results 8  An easier-to-maintain data transformation process o Enriching existing Linked Data APIs from Aragón o Using GitHub for • Version control and archival • Continuous updates: detecting new data and data structures on a daily basis • https://github.com/aragonopendata/local-data-aragopedia/  Developer-friendly API  Additional user interface o Improving data retrieval and browsing capabilities  Side effect: data curation o Many errors and improvements detected in pre-existing CSV exports, which have been corrected throughout the process
  • 9. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Transformation and publication process 9 Initial characterisation •Identify sources •Identify dimensions and measurements Transformation •Daily data download •Processing (UTF8) •Upload into GitHub •New dimensions/measures annotation •RDF transformation Publication and use •Linked Data APIs https://github.com/aragonopendata/local-data-aragopedia/
  • 10. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Initial characterisation 10  Identify and download data sources to be published (~1000) o https://github.com/aragonopendata/local-data- aragopedia/tree/master/data/resource/DatosDescarga-UTF8  Pre-process data (UTF-8 encoding, download error verification and retrials)  Identify potential dimensions and measurements o Analysis of column header names (e.g., municipio, comarca), and data content (how many different values) • https://github.com/aragonopendata/local-data- aragopedia/blob/master/data/resource/heads.txt o From 700+ dimensions to ~500 • Curated by IAEst experts (e.g., Male, M, Males, Female, F, Females, Women, Men)
  • 11. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Initial characterisation 11  SKOS concept schemes for each dimension o https://github.com/aragonopendata/local-data- aragopedia/tree/master/data/dump/DatosTTL/codelists o Mapping files available in GitHub (e.g., https://github.com/aragonopendata/local-data- aragopedia/blob/master/data/metadata/mapping-tipo-edificio- detalle.xlsx)
  • 12. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Initial characterisation 12  Measurement properties o https://github.com/aragonopendata/local-data- aragopedia/blob/master/data/dump/DatosTTL/codelists/propertie s.ttl  DSDs o https://github.com/aragonopendata/local-data- aragopedia/tree/master/data/dump/DatosTTL/dataStructures  Errors were identified during this phase o Same concept, different names (e.g. sexo and género) o Typos in header names o Columns with no values o Data belonging to wrong municipalities and districts o https://github.com/aragonopendata/local-data- aragopedia/blob/master/data/dump/errorReport.txt
  • 13. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Continuous Transformation 13  Continuous production cycle o Update RDF as reports are generated, modified or removed  Executed every night o Retrieves all the reports from the list (generated before) o Checks whether the reports have been already transformed or if the contain new data o Hash signatures for each generated Data Cube • https://github.com/aragonopendata/local-data- aragopedia/blob/master/data/resource/hashcode.csv • Used to compare data versions • If hashes do not match, the Data Cube is marked to be regenerated
  • 14. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Continuous Transformation 14  Each iteration generates a GitHub issue, listing the cubes that have must be created, modified, etc. o https://github.com/aragonopendata/local-data- aragopedia/issues • https://github.com/aragonopendata/local-data- aragopedia/issues/93 (new data) • https://github.com/aragonopendata/local-data- aragopedia/issues/457 (datacube to delete, new configurations needed) o When user interaction is needed, this is reflected in the issue text, and the IAEst responsible needs to update it  RDF transformation is done according to the configuration file o https://github.com/aragonopendata/local-data- aragopedia/blob/master/data/metadata/Informe-01-010001- A-TC-TM-TP.xlsx
  • 15. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Continuous Transformation 15  RDF data is stored in GitHub (new version) o https://github.com/aragonopendata/local-data- aragopedia/tree/master/data/dump/DatosTTL/informes  RDF data is stored in the Open Data Aragón SPARQL endpoint o http://opendata.aragon.es/sparql o Reusing the 3cixty KB deployment utilities o Each cube is stored on its own graph o Graphs updated for Data Structure Definition (DSD), properties and SKOS information
  • 16. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Data transformation. In summary… bi.aragon.es Google Drive Dataset and configuration download New dataset? GitHub Sí For each dataset Generate new configuration and create an issue New structure? No Create issue Sí New data? Regenerate data and create issue No Sí SPARQL
  • 17. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Data publication and use 17  Data can be accessed o API (using ELDA) • http://opendata.aragon.es/herramientas/apis?#aragodbpedia o GitHub (CSVs, RDF) o SPARQL endpoint SPARQL Elda Linked Data
  • 18. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Data API http://opendata.aragon.es/herramientas/apis?#aragodb pedia
  • 19. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Data publication and use 19  Aragopedia o http://opendata.aragon.es/apps/aragopedia/datos o Where, when and what (dónde, cuándo y qué) o Data can be downloaded in • CSV • JSON
  • 20. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Aragopedia 20  Aragopedia o JSON result of querying about • Maestrazgo region (where) • population (what) • in 1999 (when)
  • 21. Publishing Linked Statistical Data: Aragón, a case study. – SemStats 2017 Conclusions (Results) 21  An easier-to-maintain data transformation process o Enriching existing Linked Data APIs from Aragón o Using GitHub for • Version control and archival • Continuous updates: detecting new data and data structures on a daily basis • https://github.com/aragonopendata/local-data-aragopedia/  Developer-friendly API  Additional user interface o Improving data retrieval and browsing capabilities  Side effect: data curation o Many errors and improvements detected in pre-existing CSV exports, which have been corrected throughout the process
  • 22. Oscar Corcho1, Idafen Santana-Pérez1, Hugo Lafuente2, David Portolés3, César Cano4, Alfredo Peris4 and José María Subero4 1 Ontology Engineering Group, Universidad Politécnica de Madrid 2 Localidata 3 Idearium Consultores 4 Gobierno de Aragón Publishing Linked Statistical Data: Aragón, a case study ocorcho@fi.upm.es @ocorcho 22/10/2017 SemStats 2017 @ ISWC

Editor's Notes

  1. Cambiar la licencia por la que aplique.
  2. Cambiar la licencia por la que aplique.