SlideShare a Scribd company logo
1 of 58
Workshop at Dual EGOV 2015 & ePart 2015 conference
2 September 2015, Thessaloniki Greece
Create, Expand, and Exploit Linked Open
Statistical Data
E. Tambouris, E. Kalampokis, K. Tarabanis
 Data Cubes on the Web
 The RDF Data Cube Vocabulary
 OpenCube OLAP Browser
 OpenCube Mapview
 OpenCube Expander
2
Table of Contents
Dual EGOV2015 & ePart2015 conference
3
Data Cube
Measures
Dimension
Dimension
Dimension
Dual EGOV2015 & ePart2015 conference
 http://ec.europa.eu/eurostat
4
Eurostat
Dual EGOV2015 & ePart2015 conference
 http://ec.europa.eu/eurostat/tgm/table.do?tab=table&plugin=1&l
anguage=en&pcode=t2020_50
5
People at risk of poverty
Dual EGOV2015 & ePart2015 conference
6
Graph view
Graph types
Data filter
Sort and Labels
Dual EGOV2015 & ePart2015 conference
7
Map view
Map types
Data filter
Layers
Dual EGOV2015 & ePart2015 conference
8
Combining variables from Eurostat & Digital Agenda
Dual EGOV2015 & ePart2015 conference
 We need generic tools that can be reused across different datasets
and sources.
9
The vision of exploiting Expanded Linked Data Cubes
Dual EGOV2015 & ePart2015 conference
 Data Cubes on the Web
 The RDF Data Cube Vocabulary
 OpenCube OLAP Browser
 OpenCube Mapview
 OpenCube Expander
10
Table of Contents
Dual EGOV2015 & ePart2015 conference
11
RDF Data Cube Vocabulary
http://www.w3.org/TR/vocab-data-cube/
Dual EGOV2015 & ePart2015 conference
 A qb:DataSet is a collection of statistical data that corresponds
to a defined structure
12
Data Set
qb:DataSet
a
Dual EGOV2015 & ePart2015 conference
 A qb:DataStructureDefinition defines the structure of one or
more datasets. In particular, it defines the dimensions, attributes
and measures used in the dataset
13
Data Structure Definition
qb:DataStructureDefinition
qb:structure
Dual EGOV2015 & ePart2015 conference
 The Data Cube vocabulary represents the dimensions, attributes and measures
as RDF properties.
 Each is an instance of the abstract qb:ComponentProperty class, which in
turn has sub-classes qb:DimensionProperty, qb:AttributeProperty and
qb:MeasureProperty.
14
Data Structure Definition
qb:DimensionProperty
Dual EGOV2015 & ePart2015 conference
 Each observation is represented as an instance of type
qb:Observation.
15
Observation
qb:Observation
Dual EGOV2015 & ePart2015 conference
 Slices allow us to group subsets of observations together.
 This is not intended to represent arbitrary selections from the
observations but uniform slices through the cube in which one
or more of the dimension values are fixed.
16
Slice
Dual EGOV2015 & ePart2015 conference
Eurostat – Linked Data
 This is NOT an official endeavor of Eurostat
 http://eurostat.linked-statistics.org
 Every data set from Eurostat has been transformed to an RDF
data cube described in a RDF file in the following links
 <http://eurostat.linked-statistics.org/data/EUROSTAT_CODE.rdf>
 <http://eurostat.linked-statistics.org/dsd/EUROSTAT_CODE.ttl>
 For example:
 DATA: <http://eurostat.linked-statistics.org/data/t2020_50.rdf>
 DSD: <http://eurostat.linked-statistics.org/dsd/t2020_50.ttl>
17
Data (Observations)
Data Structure Definition
Dual EGOV2015 & ePart2015 conference
18
Observation
Dimensions Measure
Dual EGOV2015 & ePart2015 conference
Browse observations
19
Dual EGOV2015 & ePart2015 conference
SPARQL federated query
 Get from Eurostat the percentage of people at risk of poverty and from Digital Agenda the eGov indicator
per country in 2013
20
PREFIX …
SELECT DISTINCT (str(?label) as ?labelStr) ?poverty (str(?ucs) as ?userCentricityScore)
FROM <http://eurostat.linked-statistics.org/data/t2020_50.rdf>
FROM <http://eurostat.linked-statistics.org/dsd/t2020_50.ttl>
WHERE {
?obs a qb:Observation;
sdmx-dimension:timePeriod "2013-01-01"^^xsd:date;
property:unit <http://eurostat.linked-statistics.org/dic/unit#PC_POP>;
property:geo ?country;
sdmx-measure:obsValue ?poverty.
?country skos:prefLabel ?label.
FILTER(LANGMATCHES(LANG(?label), "EN"))
SERVICE <http://digital-agenda-data.eu/data/sparql> {
?observation a qb:Observation;
dad-prop:indicator <http://semantic.digital-agenda-data.eu/codelist/indicator/user_centric_egov>;
dad-prop:time-period <http://reference.data.gov.uk/id/gregorian-year/2013>;
dad-prop:breakdown <http://semantic.digital-agenda-data.eu/codelist/breakdown/all_egov_le>;
dad-prop:ref-area ?country;
sdmx-measure:obsValue ?ucs.
}}
Dual EGOV2015 & ePart2015 conference
SPARQL federated query
 Get from Eurostat the percentage of people at risk of poverty and from Digital Agenda the eGov indicator
per country in 2013
21
PREFIX …
SELECT DISTINCT (str(?label) as ?labelStr) ?poverty (str(?ucs) as ?userCentricityScore)
FROM <http://eurostat.linked-statistics.org/data/t2020_50.rdf>
FROM <http://eurostat.linked-statistics.org/dsd/t2020_50.ttl>
WHERE {
?obs a qb:Observation;
sdmx-dimension:timePeriod "2013-01-01"^^xsd:date;
property:unit <http://eurostat.linked-statistics.org/dic/unit#PC_POP>;
property:geo ?country;
sdmx-measure:obsValue ?poverty.
?country skos:prefLabel ?label.
FILTER(LANGMATCHES(LANG(?label), "EN"))
SERVICE <http://digital-agenda-data.eu/data/sparql> {
?observation a qb:Observation;
dad-prop:indicator <http://semantic.digital-agenda-data.eu/codelist/indicator/user_centric_egov>;
dad-prop:time-period <http://reference.data.gov.uk/id/gregorian-year/2013>;
dad-prop:breakdown <http://semantic.digital-agenda-data.eu/codelist/breakdown/all_egov_le>;
dad-prop:ref-area ?country;
sdmx-measure:obsValue ?ucs.
}}
Dual EGOV2015 & ePart2015 conference
 Data cubes on the Web
 The RDF Data Cube Vocabulary
 OpenCube OLAP Browser
 OpenCube Mapview
22
Table of Contents
Dual EGOV2015 & ePart2015 conference
 It is a proof of concept of the linked data analytics vision.
 It enables performing OLAP operations on top of integrated views of
multiple linked data cubes.
23
The OpenCube OLAP browser
Dual EGOV2015 & ePart2015 conference
Dual EGOV2015 & ePart2015 conference 24
Architecture
Dual EGOV2015 & ePart2015 conference 25
Architecture (Aggregator)
The Aggregator computes
aggregations of cells across
dimensions or hierarchies
 The Aggregator creates 2n-1 sub-cubes from a cube of n dimensions.
26
Compute aggregations across dimension
Time
Geo
Sex
Time Time
Geo Sex
Geo
Sex
Time GeoSex
Total
Three dimensions
Two dimensions
One dimension
No dimensions
Dual EGOV2015 & ePart2015 conference
 It enriches an existing cube with new observations by using a
hierarchy.
Dual EGOV2015 & ePart2015 conference 27
Compute aggregations across hierarchies
Time
Geo
Sex
city1
city2
city3
+
city4
country1
region1
region2
city1
city2
city3
city4
Time
Geo
city1
city2
city3
city4
= region1
region2
country1
Sex
Dual EGOV2015 & ePart2015 conference 28
Architecture (Compatibility Explorer)
Given a cube in the local store,
the Compatibility Explorer
(a) Searches into the Linked
Data Web and identifies
cubes that are compatible to
expand the initial cube and
(b) Establishes typed links
between the local and the
compatible cubes
 Binary relations that link two cubes that are compatible to integrate.
 Operators that map from these two cubes to a new expanded one.
 The framework assumes that a cube can be expanded by increasing
the size of one of the sets that define a cube i.e.:
 The set of measures
 The set of objects of an attribute of a dimension
 The set of attributes of a dimension
 The set of dimensions
Dual EGOV2015 & ePart2015 conference 29
Theoretical Framework
Dual EGOV2015 & ePart2015 conference 30
Dual EGOV2015 & ePart2015 conference 31
Dual EGOV2015 & ePart2015 conference 32
Architecture (Expander)
The Expander creates a new
expanded cube by merging two
compatible ones.
The Expander implements the
theoretical framework
 In our case the Expander is integrated with the OLAP browser enabling
this way the performance of OLAP operations on top of integrated views of
compatible cubes
Dual EGOV2015 & ePart2015 conference 33
Expander
Dual EGOV2015 & ePart2015 conference 34
Architecture (OLAP Browser)
The linked data OLAP browser
exploits the others components
of the platform in order to
enable performing OLAP
operation on top of expanded
cubes.
These may include measures,
dimensions, objects, and/or
attributes from multiple cubes
that reside on disparate sources
on the Web.
Dual EGOV2015 & ePart2015 conference 35
OLAP Browser
 An instance of the developed platform have been
deployed at the premises of the Flemish government.
 Flemish government had already opened up statistics by
means of linked data cubes.
 11 cubes had been transformed to linked data according
to the QB vocabulary and stored in a Virtuoso RDF store.
 Using the Aggregator a total of 230 sub-cubes have been
created.
 250 links have been established from 73 cubes or
(sub)cubes to other compatibles (sub)cubes
Dual EGOV2015 & ePart2015 conference 36
The Flemish Government
 http://188.166.126.11:50080/resource/OpenCubeBrowser
Dual EGOV2015 & ePart2015 conference 37
Flemish pilot
Login to the platform:
username: admin
password: iwb
 The user selects one of the cubes
Dual EGOV2015 & ePart2015 conference 38
OpenCube Browser
 The browser starts with an empty canvas
Dual EGOV2015 & ePart2015 conference 39
OpenCube Browser
 The user can change the language
Dual EGOV2015 & ePart2015 conference 40
OpenCube Browser
 The user can see the dimensions of the cube
Dual EGOV2015 & ePart2015 conference 41
OpenCube Browser
 The user can see the measures of the cube
Dual EGOV2015 & ePart2015 conference 42
OpenCube Browser
 When the user selects at least one measure and one dimension…
Dual EGOV2015 & ePart2015 conference 43
OpenCube Browser
The geo
dimension has 4
levels
 When the user selects a second level in a dimension…
Dual EGOV2015 & ePart2015 conference 44
OpenCube Browser (Drill-down & roll-up)
2 levels have
been selected
 Keep in mind that you can select at most 2 levels
Dual EGOV2015 & ePart2015 conference 45
OpenCube Browser (Drill-down & roll-up)
Dual EGOV2015 & ePart2015 conference 46
OpenCube Browser (Selecting more measure & dimensions)
We set a fixed
value in the
other
dimensions
Different colors
for multiple
measures
 All this time you see a green message
 The user is able to select to expand the cube that sees in the table
using data from other cubes
Dual EGOV2015 & ePart2015 conference 47
OpenCube Browser (Expander)
Dual EGOV2015 & ePart2015 conference 48
OpenCube Browser (Expander)
Dual EGOV2015 & ePart2015 conference 49
OpenCube Browser (Expanded cube)
A new measure has
been added in the initial
cube
Dual EGOV2015 & ePart2015 conference 50
OpenCube Browser (Browsing Multiple Cubes)
 The RDF Data Cube Vocabulary
 Linked Data Cubes on the Web
 The OpenCube Toolkit
 OpenCube OLAP Browser
 OpenCube Mapview
51
Table of Contents
17-18 June 2015 Foster School of Data
 http://83.212.122.81:8888
Dual EGOV2015 & ePart2015 conference 52
OpenCube MapView
Login to the platform:
username: admin
password: iwb
Dual EGOV2015 & ePart2015 conference 53
OpenCube MapView
Dual EGOV2015 & ePart2015 conference 54
OpenCube MapView
Dual EGOV2015 & ePart2015 conference 55
OpenCube MapView
 The QB vocabulary is expressive enough and fulfils the requirements
for combining two cubes and performing OLAP operations.
 Difficulties to combine cubes from different sources
 The QB vocabulary allows data publishers to choose application practices that
are best suited to their particular situation
 As a result, different practices are followed by different publishers making it
difficult to produce generically applicable tools that combine data.
 The second reason that prevents disparate cubes integration is the
standardisation of concept schemes and code lists.
 Extend both the theoretical framework (join) and the platform
(statistical analyses)
Dual EGOV2015 & ePart2015 conference 56
Conclusions & Challenges
 The work presented in the paper is partly funded by
Dual EGOV2015 & ePart2015 conference 57
Acknowledgments
http://opencube-project.eu
@OpenCubeProject
Dual EGOV2015 & ePart2015 conference 58
Questions

More Related Content

What's hot

Spatial Data Integrator - Software Presentation and Use Cases
Spatial Data Integrator - Software Presentation and Use CasesSpatial Data Integrator - Software Presentation and Use Cases
Spatial Data Integrator - Software Presentation and Use Cases
mathieuraj
 

What's hot (14)

Yet another population cartogram: Creating gridded cartograms using ArcGIS an...
Yet another population cartogram: Creating gridded cartograms using ArcGIS an...Yet another population cartogram: Creating gridded cartograms using ArcGIS an...
Yet another population cartogram: Creating gridded cartograms using ArcGIS an...
 
Dr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISDr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GIS
 
Spatial Data Integrator - Software Presentation and Use Cases
Spatial Data Integrator - Software Presentation and Use CasesSpatial Data Integrator - Software Presentation and Use Cases
Spatial Data Integrator - Software Presentation and Use Cases
 
Maps with leafletR
Maps with leafletRMaps with leafletR
Maps with leafletR
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
 
Team 5: Open Land Use Metadata Harvesting on NextGEOSS
Team 5: Open Land Use Metadata Harvesting on NextGEOSSTeam 5: Open Land Use Metadata Harvesting on NextGEOSS
Team 5: Open Land Use Metadata Harvesting on NextGEOSS
 
What's new in Spark 2.0?
What's new in Spark 2.0?What's new in Spark 2.0?
What's new in Spark 2.0?
 
Graph database
Graph databaseGraph database
Graph database
 
From Simple Features to Moving Features and Beyond? at OGC Member Meeting, Se...
From Simple Features to Moving Features and Beyond? at OGC Member Meeting, Se...From Simple Features to Moving Features and Beyond? at OGC Member Meeting, Se...
From Simple Features to Moving Features and Beyond? at OGC Member Meeting, Se...
 
European Data Portal - ePSI platform webinar 8 February 2016
European Data Portal - ePSI platform webinar 8 February 2016European Data Portal - ePSI platform webinar 8 February 2016
European Data Portal - ePSI platform webinar 8 February 2016
 
Supervised Papers Classification on Large-Scale High-Dimensional Data with Ap...
Supervised Papers Classification on Large-Scale High-Dimensional Data with Ap...Supervised Papers Classification on Large-Scale High-Dimensional Data with Ap...
Supervised Papers Classification on Large-Scale High-Dimensional Data with Ap...
 
Reporting Summary Information of Spatial Datasets and Non-Compliance Issues U...
Reporting Summary Information of Spatial Datasets and Non-Compliance Issues U...Reporting Summary Information of Spatial Datasets and Non-Compliance Issues U...
Reporting Summary Information of Spatial Datasets and Non-Compliance Issues U...
 
Flink Forward Berlin 2018: Piotr Wawrzyniak & Jarosław Legierski - "Using Apa...
Flink Forward Berlin 2018: Piotr Wawrzyniak & Jarosław Legierski - "Using Apa...Flink Forward Berlin 2018: Piotr Wawrzyniak & Jarosław Legierski - "Using Apa...
Flink Forward Berlin 2018: Piotr Wawrzyniak & Jarosław Legierski - "Using Apa...
 
nCOVID-19 pivot-and-fan map
nCOVID-19 pivot-and-fan mapnCOVID-19 pivot-and-fan map
nCOVID-19 pivot-and-fan map
 

Similar to OpenCube Workshop at eGov2015 & ePart2015 dual conference

Axelink Tools_BO2C10_0415_EN
Axelink Tools_BO2C10_0415_ENAxelink Tools_BO2C10_0415_EN
Axelink Tools_BO2C10_0415_EN
Eric Soyer
 
The future of charting in .Stat
The future of charting in .StatThe future of charting in .Stat
The future of charting in .Stat
Jonathan Challener
 
Representing verifiable statistical index computations as linked data
Representing verifiable statistical index computations as linked dataRepresenting verifiable statistical index computations as linked data
Representing verifiable statistical index computations as linked data
Jose Emilio Labra Gayo
 

Similar to OpenCube Workshop at eGov2015 & ePart2015 dual conference (20)

Ogi conf delft_v1_evangelos_kalampokis
Ogi conf delft_v1_evangelos_kalampokisOgi conf delft_v1_evangelos_kalampokis
Ogi conf delft_v1_evangelos_kalampokis
 
Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?
 
UKSG Conference 2015 - E-resources: ezPAARSE helps you discover who is readin...
UKSG Conference 2015 - E-resources: ezPAARSE helps you discover who is readin...UKSG Conference 2015 - E-resources: ezPAARSE helps you discover who is readin...
UKSG Conference 2015 - E-resources: ezPAARSE helps you discover who is readin...
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
 
Smart application on Azure at Vattenfall - Rens Weijers & Peter van 't Hof
Smart application on Azure at Vattenfall - Rens Weijers & Peter van 't HofSmart application on Azure at Vattenfall - Rens Weijers & Peter van 't Hof
Smart application on Azure at Vattenfall - Rens Weijers & Peter van 't Hof
 
Axelink Tools_BO2C10_0415_EN
Axelink Tools_BO2C10_0415_ENAxelink Tools_BO2C10_0415_EN
Axelink Tools_BO2C10_0415_EN
 
BDE SC6-hang out - technology part-SWC - Martin
BDE SC6-hang out - technology part-SWC - MartinBDE SC6-hang out - technology part-SWC - Martin
BDE SC6-hang out - technology part-SWC - Martin
 
EUDAT
EUDATEUDAT
EUDAT
 
From measurement to knowledge with sofia2 Platform
From measurement to knowledge with sofia2 PlatformFrom measurement to knowledge with sofia2 Platform
From measurement to knowledge with sofia2 Platform
 
WSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product OverviewWSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product Overview
 
Discover How Volvo Cars Uses a Time Series Database to Become Data-Driven
Discover How Volvo Cars Uses a Time Series Database to Become Data-DrivenDiscover How Volvo Cars Uses a Time Series Database to Become Data-Driven
Discover How Volvo Cars Uses a Time Series Database to Become Data-Driven
 
Digital twins
Digital twinsDigital twins
Digital twins
 
VoltDB and Flytxt Present: Building a Single Technology Platform for Real-Tim...
VoltDB and Flytxt Present: Building a Single Technology Platform for Real-Tim...VoltDB and Flytxt Present: Building a Single Technology Platform for Real-Tim...
VoltDB and Flytxt Present: Building a Single Technology Platform for Real-Tim...
 
Benchmarking Commercial RDF Stores with Publications Office Dataset
Benchmarking Commercial RDF Stores with Publications Office DatasetBenchmarking Commercial RDF Stores with Publications Office Dataset
Benchmarking Commercial RDF Stores with Publications Office Dataset
 
The future of charting in .Stat
The future of charting in .StatThe future of charting in .Stat
The future of charting in .Stat
 
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
Validating statistical Index Data represented in RDF using SPARQL Queries: Co...
 
Master in Big Data Analytics and Social Mining 20015
Master in Big Data Analytics and Social Mining 20015Master in Big Data Analytics and Social Mining 20015
Master in Big Data Analytics and Social Mining 20015
 
Representing verifiable statistical index computations as linked data
Representing verifiable statistical index computations as linked dataRepresenting verifiable statistical index computations as linked data
Representing verifiable statistical index computations as linked data
 
OLAP Basics and Fundamentals by Bharat Kalia
OLAP Basics and Fundamentals by Bharat Kalia OLAP Basics and Fundamentals by Bharat Kalia
OLAP Basics and Fundamentals by Bharat Kalia
 
LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz
 

Recently uploaded

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Recently uploaded (20)

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 

OpenCube Workshop at eGov2015 & ePart2015 dual conference

  • 1. Workshop at Dual EGOV 2015 & ePart 2015 conference 2 September 2015, Thessaloniki Greece Create, Expand, and Exploit Linked Open Statistical Data E. Tambouris, E. Kalampokis, K. Tarabanis
  • 2.  Data Cubes on the Web  The RDF Data Cube Vocabulary  OpenCube OLAP Browser  OpenCube Mapview  OpenCube Expander 2 Table of Contents Dual EGOV2015 & ePart2015 conference
  • 6. 6 Graph view Graph types Data filter Sort and Labels Dual EGOV2015 & ePart2015 conference
  • 7. 7 Map view Map types Data filter Layers Dual EGOV2015 & ePart2015 conference
  • 8. 8 Combining variables from Eurostat & Digital Agenda Dual EGOV2015 & ePart2015 conference
  • 9.  We need generic tools that can be reused across different datasets and sources. 9 The vision of exploiting Expanded Linked Data Cubes Dual EGOV2015 & ePart2015 conference
  • 10.  Data Cubes on the Web  The RDF Data Cube Vocabulary  OpenCube OLAP Browser  OpenCube Mapview  OpenCube Expander 10 Table of Contents Dual EGOV2015 & ePart2015 conference
  • 11. 11 RDF Data Cube Vocabulary http://www.w3.org/TR/vocab-data-cube/ Dual EGOV2015 & ePart2015 conference
  • 12.  A qb:DataSet is a collection of statistical data that corresponds to a defined structure 12 Data Set qb:DataSet a Dual EGOV2015 & ePart2015 conference
  • 13.  A qb:DataStructureDefinition defines the structure of one or more datasets. In particular, it defines the dimensions, attributes and measures used in the dataset 13 Data Structure Definition qb:DataStructureDefinition qb:structure Dual EGOV2015 & ePart2015 conference
  • 14.  The Data Cube vocabulary represents the dimensions, attributes and measures as RDF properties.  Each is an instance of the abstract qb:ComponentProperty class, which in turn has sub-classes qb:DimensionProperty, qb:AttributeProperty and qb:MeasureProperty. 14 Data Structure Definition qb:DimensionProperty Dual EGOV2015 & ePart2015 conference
  • 15.  Each observation is represented as an instance of type qb:Observation. 15 Observation qb:Observation Dual EGOV2015 & ePart2015 conference
  • 16.  Slices allow us to group subsets of observations together.  This is not intended to represent arbitrary selections from the observations but uniform slices through the cube in which one or more of the dimension values are fixed. 16 Slice Dual EGOV2015 & ePart2015 conference
  • 17. Eurostat – Linked Data  This is NOT an official endeavor of Eurostat  http://eurostat.linked-statistics.org  Every data set from Eurostat has been transformed to an RDF data cube described in a RDF file in the following links  <http://eurostat.linked-statistics.org/data/EUROSTAT_CODE.rdf>  <http://eurostat.linked-statistics.org/dsd/EUROSTAT_CODE.ttl>  For example:  DATA: <http://eurostat.linked-statistics.org/data/t2020_50.rdf>  DSD: <http://eurostat.linked-statistics.org/dsd/t2020_50.ttl> 17 Data (Observations) Data Structure Definition Dual EGOV2015 & ePart2015 conference
  • 19. Browse observations 19 Dual EGOV2015 & ePart2015 conference
  • 20. SPARQL federated query  Get from Eurostat the percentage of people at risk of poverty and from Digital Agenda the eGov indicator per country in 2013 20 PREFIX … SELECT DISTINCT (str(?label) as ?labelStr) ?poverty (str(?ucs) as ?userCentricityScore) FROM <http://eurostat.linked-statistics.org/data/t2020_50.rdf> FROM <http://eurostat.linked-statistics.org/dsd/t2020_50.ttl> WHERE { ?obs a qb:Observation; sdmx-dimension:timePeriod "2013-01-01"^^xsd:date; property:unit <http://eurostat.linked-statistics.org/dic/unit#PC_POP>; property:geo ?country; sdmx-measure:obsValue ?poverty. ?country skos:prefLabel ?label. FILTER(LANGMATCHES(LANG(?label), "EN")) SERVICE <http://digital-agenda-data.eu/data/sparql> { ?observation a qb:Observation; dad-prop:indicator <http://semantic.digital-agenda-data.eu/codelist/indicator/user_centric_egov>; dad-prop:time-period <http://reference.data.gov.uk/id/gregorian-year/2013>; dad-prop:breakdown <http://semantic.digital-agenda-data.eu/codelist/breakdown/all_egov_le>; dad-prop:ref-area ?country; sdmx-measure:obsValue ?ucs. }} Dual EGOV2015 & ePart2015 conference
  • 21. SPARQL federated query  Get from Eurostat the percentage of people at risk of poverty and from Digital Agenda the eGov indicator per country in 2013 21 PREFIX … SELECT DISTINCT (str(?label) as ?labelStr) ?poverty (str(?ucs) as ?userCentricityScore) FROM <http://eurostat.linked-statistics.org/data/t2020_50.rdf> FROM <http://eurostat.linked-statistics.org/dsd/t2020_50.ttl> WHERE { ?obs a qb:Observation; sdmx-dimension:timePeriod "2013-01-01"^^xsd:date; property:unit <http://eurostat.linked-statistics.org/dic/unit#PC_POP>; property:geo ?country; sdmx-measure:obsValue ?poverty. ?country skos:prefLabel ?label. FILTER(LANGMATCHES(LANG(?label), "EN")) SERVICE <http://digital-agenda-data.eu/data/sparql> { ?observation a qb:Observation; dad-prop:indicator <http://semantic.digital-agenda-data.eu/codelist/indicator/user_centric_egov>; dad-prop:time-period <http://reference.data.gov.uk/id/gregorian-year/2013>; dad-prop:breakdown <http://semantic.digital-agenda-data.eu/codelist/breakdown/all_egov_le>; dad-prop:ref-area ?country; sdmx-measure:obsValue ?ucs. }} Dual EGOV2015 & ePart2015 conference
  • 22.  Data cubes on the Web  The RDF Data Cube Vocabulary  OpenCube OLAP Browser  OpenCube Mapview 22 Table of Contents Dual EGOV2015 & ePart2015 conference
  • 23.  It is a proof of concept of the linked data analytics vision.  It enables performing OLAP operations on top of integrated views of multiple linked data cubes. 23 The OpenCube OLAP browser Dual EGOV2015 & ePart2015 conference
  • 24. Dual EGOV2015 & ePart2015 conference 24 Architecture
  • 25. Dual EGOV2015 & ePart2015 conference 25 Architecture (Aggregator) The Aggregator computes aggregations of cells across dimensions or hierarchies
  • 26.  The Aggregator creates 2n-1 sub-cubes from a cube of n dimensions. 26 Compute aggregations across dimension Time Geo Sex Time Time Geo Sex Geo Sex Time GeoSex Total Three dimensions Two dimensions One dimension No dimensions Dual EGOV2015 & ePart2015 conference
  • 27.  It enriches an existing cube with new observations by using a hierarchy. Dual EGOV2015 & ePart2015 conference 27 Compute aggregations across hierarchies Time Geo Sex city1 city2 city3 + city4 country1 region1 region2 city1 city2 city3 city4 Time Geo city1 city2 city3 city4 = region1 region2 country1 Sex
  • 28. Dual EGOV2015 & ePart2015 conference 28 Architecture (Compatibility Explorer) Given a cube in the local store, the Compatibility Explorer (a) Searches into the Linked Data Web and identifies cubes that are compatible to expand the initial cube and (b) Establishes typed links between the local and the compatible cubes
  • 29.  Binary relations that link two cubes that are compatible to integrate.  Operators that map from these two cubes to a new expanded one.  The framework assumes that a cube can be expanded by increasing the size of one of the sets that define a cube i.e.:  The set of measures  The set of objects of an attribute of a dimension  The set of attributes of a dimension  The set of dimensions Dual EGOV2015 & ePart2015 conference 29 Theoretical Framework
  • 30. Dual EGOV2015 & ePart2015 conference 30
  • 31. Dual EGOV2015 & ePart2015 conference 31
  • 32. Dual EGOV2015 & ePart2015 conference 32 Architecture (Expander) The Expander creates a new expanded cube by merging two compatible ones. The Expander implements the theoretical framework
  • 33.  In our case the Expander is integrated with the OLAP browser enabling this way the performance of OLAP operations on top of integrated views of compatible cubes Dual EGOV2015 & ePart2015 conference 33 Expander
  • 34. Dual EGOV2015 & ePart2015 conference 34 Architecture (OLAP Browser) The linked data OLAP browser exploits the others components of the platform in order to enable performing OLAP operation on top of expanded cubes. These may include measures, dimensions, objects, and/or attributes from multiple cubes that reside on disparate sources on the Web.
  • 35. Dual EGOV2015 & ePart2015 conference 35 OLAP Browser
  • 36.  An instance of the developed platform have been deployed at the premises of the Flemish government.  Flemish government had already opened up statistics by means of linked data cubes.  11 cubes had been transformed to linked data according to the QB vocabulary and stored in a Virtuoso RDF store.  Using the Aggregator a total of 230 sub-cubes have been created.  250 links have been established from 73 cubes or (sub)cubes to other compatibles (sub)cubes Dual EGOV2015 & ePart2015 conference 36 The Flemish Government
  • 37.  http://188.166.126.11:50080/resource/OpenCubeBrowser Dual EGOV2015 & ePart2015 conference 37 Flemish pilot Login to the platform: username: admin password: iwb
  • 38.  The user selects one of the cubes Dual EGOV2015 & ePart2015 conference 38 OpenCube Browser
  • 39.  The browser starts with an empty canvas Dual EGOV2015 & ePart2015 conference 39 OpenCube Browser
  • 40.  The user can change the language Dual EGOV2015 & ePart2015 conference 40 OpenCube Browser
  • 41.  The user can see the dimensions of the cube Dual EGOV2015 & ePart2015 conference 41 OpenCube Browser
  • 42.  The user can see the measures of the cube Dual EGOV2015 & ePart2015 conference 42 OpenCube Browser
  • 43.  When the user selects at least one measure and one dimension… Dual EGOV2015 & ePart2015 conference 43 OpenCube Browser The geo dimension has 4 levels
  • 44.  When the user selects a second level in a dimension… Dual EGOV2015 & ePart2015 conference 44 OpenCube Browser (Drill-down & roll-up) 2 levels have been selected
  • 45.  Keep in mind that you can select at most 2 levels Dual EGOV2015 & ePart2015 conference 45 OpenCube Browser (Drill-down & roll-up)
  • 46. Dual EGOV2015 & ePart2015 conference 46 OpenCube Browser (Selecting more measure & dimensions) We set a fixed value in the other dimensions Different colors for multiple measures
  • 47.  All this time you see a green message  The user is able to select to expand the cube that sees in the table using data from other cubes Dual EGOV2015 & ePart2015 conference 47 OpenCube Browser (Expander)
  • 48. Dual EGOV2015 & ePart2015 conference 48 OpenCube Browser (Expander)
  • 49. Dual EGOV2015 & ePart2015 conference 49 OpenCube Browser (Expanded cube) A new measure has been added in the initial cube
  • 50. Dual EGOV2015 & ePart2015 conference 50 OpenCube Browser (Browsing Multiple Cubes)
  • 51.  The RDF Data Cube Vocabulary  Linked Data Cubes on the Web  The OpenCube Toolkit  OpenCube OLAP Browser  OpenCube Mapview 51 Table of Contents 17-18 June 2015 Foster School of Data
  • 52.  http://83.212.122.81:8888 Dual EGOV2015 & ePart2015 conference 52 OpenCube MapView Login to the platform: username: admin password: iwb
  • 53. Dual EGOV2015 & ePart2015 conference 53 OpenCube MapView
  • 54. Dual EGOV2015 & ePart2015 conference 54 OpenCube MapView
  • 55. Dual EGOV2015 & ePart2015 conference 55 OpenCube MapView
  • 56.  The QB vocabulary is expressive enough and fulfils the requirements for combining two cubes and performing OLAP operations.  Difficulties to combine cubes from different sources  The QB vocabulary allows data publishers to choose application practices that are best suited to their particular situation  As a result, different practices are followed by different publishers making it difficult to produce generically applicable tools that combine data.  The second reason that prevents disparate cubes integration is the standardisation of concept schemes and code lists.  Extend both the theoretical framework (join) and the platform (statistical analyses) Dual EGOV2015 & ePart2015 conference 56 Conclusions & Challenges
  • 57.  The work presented in the paper is partly funded by Dual EGOV2015 & ePart2015 conference 57 Acknowledgments http://opencube-project.eu @OpenCubeProject
  • 58. Dual EGOV2015 & ePart2015 conference 58 Questions