Exploring a Semantic Framework for Integrating DPM, XBRL and SDMX Data

Roberto García
Roberto GarcíaAssociate Professor at Universitat de Lleida
EXPLORINGA SEMANTIC
FRAMEWORKFOR
INTEGRATINGDPM,XBRL AND
SDMX DATA
Roberto García
Associate Professor
Universitat deLleida
INTRODUCTION
• Proliferation financial data and available formats
• Increased need for ways to integrate it
• Semantic Technologies:
• facilitate integration bymoving effort to the level of meanings
• instead of trying to deal with syntax subtleties
• Explore this alternative through a practical experiment
INTEGRATION SOURCES
• Data sources:
• XBRL,
• DataPointModel (DPM)
• SDMX
• Schema sources:
• XBRLTaxonomies,
• DPMDataDictionaries
• SMX DataStructureDefinitions
(DSD)
CONCEPTUAL FRAMEWORK
• Consider the multidimensional nature of the data (e.g. DPM)
• Far beyond 2D data available from spreadsheets
• Avoid having to encode “hidden dimensions” into footnotes, attachments, etc.
• Dimensions might be hierarchically organised (like geographical administrative divisions)
• Proposal: RDF Data Cube Vocabulary (based on semantic technologies, RDF & Web Ontologies)
• Supports multidimensional data
• Based on SDMX and the Semantic Web vocabulary for statistical data
• Web standard (W3C Recommendation)
• Approach:
• Map DPM and XBRL to the RDF Data Cube Vocabulary (example next)
• SDMX trivially becomes RDF based on the Data Cube Vocabulary
DATA CUBE
Dataset a collection of observations
Dimensions identify an observation
e.g. observation time or a geographic region
Measures represent observed phenomenon
Attributes qualify / help interpret observations
e.g. units of measure, scaling factors or observation
status (estimated, provisional,…)
Slice subsets observations by fixing all
but one dimension (or a few)
https://www.slideshare.net/140er/lets-talkaboutstatisticaldatainrdf
RDFDATA CUBE
Dataset a collection of observations
Dimensions identify an observation
e.g. observation time or a geographic region
Measures represent observed phenomenon
Attributes qualify / help interpret observations
e.g. units of measure, scaling factors or observation
status (estimated, provisional,…)
Slice subsets observations by fixing all
but one dimension (or a few)
https://www.w3.org/TR/vocab-data-cube/#outline
MODELLINGEXAMPLE
• Data Point example based on the taxonomy "FINancial REPorting 2016-A Individual
(2.1.5)", authored by EBA using DPM 2.5 and based on table "Balance Sheet Statement:
Assets (F_01.01)", row "Total assets" and column "Carrying amount”
• Metric: eba_mi53 - Carrying amount  Value: 1000 EUR
• Dimension 1: BAS – Base  Dimension 1 Value: x6 - Assets
• Dimension 2: MCY - Main Category  Dimension 2 Value: x25 - All assets
• Plus entity with LEI 549300N33JQ7EG2VD447 and time 2017-07-01
MODELLINGEXAMPLE
• XBRL representation of the Data Point
<xbrli:context id="c1">
<xbrli:entity>
<xbrli:identifier scheme="http://standards.iso.org/iso/17442">
549300N33JQ7EG2VD447</xbrli:identifier>
</xbrli:entity>
<xbrli:period>
<xbrli:instant>2017-07-01</xbrli:instant>
</xbrli:period>
<xbrli:scenario>
<xbrldi:explicitMember dimension="eba_dim:BAS">eba_BA:x6</xbrldi:explicitMember>
<xbrldi:explicitMember dimension="eba_dim:MCY">eba_MC:x25</xbrldi:explicitMember>
</xbrli:scenario>
</xbrli:context>
<eba_met:mi53 unitRef="EUR" decimals="-3" contextRef="c1">1</eba_met:mi53>
MODELLINGEXAMPLE
• RDF Data Cube Vocabulary representation of the Data Point and XBRL instance
ex:dst-1/obs-1 a qb:Observation;
qb:dataSet ex:dtst-1 ;
xbrli:entity lei:549300N33JQ7EG2VD447 ;
sdmx-dim:refTime "2017-07-01"^^xsd:date ;
eba_dim:BAS eba_BA:x6 ;
eba_dim:MCY eba_MC:x25 ;
eba_met:mi53 "1"^^xsd:int ;
sdmx-att:decimals "-3"^^xsd:int ;
sdmx-att:currency currency:EUR .
MODELLINGEXAMPLE
• RDF Data Cube Vocabulary terms to model:
Observations linked to their dataset
Dimensions, including entities and time
Measures, including data type
Attributes, decimals and currency
MODELLINGFINANCIAL DATA
SCHEMAS
• RDF Data Cube Vocabulary
also to model how the
dimensions, metrics and
attributes are structured
• Capture
• DPM Data Dictionaries
• XBRL Taxonomies
in a Data Structure
Definition (DSD) linked to
each dataset
MODELLINGFINANCIAL DATA
SCHEMAS
• DSD also defines the types of the
values of measures, dimensions and
attributes (their ranges):
• Data types
(date, integer,…)
• Taxonomy terms
MODELLINGFINANCIAL DATA
SCHEMAS
• Example: the range of the property eba_dim:BAS is eba:BA
• eba:BA is defined as a SDMX Code List (and a semantic SKOS Concept Scheme)
with members:
• eba_BA:x6
• eba_BA:x2
• eba_BA:x3
(members can be
hierarchically organised)
CONCLUSIONS
• Possible to use the RDF Data Cube Vocabulary to
semantically model and integrate:
• Data Point / XBRL Instance
• Data Dictionary / XBRL Taxonomy
• Per design, also SDMX / DSD
• Semantic technologies facilitate the integration by
operating at the level of dictionaries and
taxonomies
• Facilitates multidimensional data management
and multiple views on the same data
FUTURE WORK
• More systematic analysis of how the different constructs in the DPM Dictionaries and XBRL
Taxonomies can be mapped to the RDF Data Cube DSDs (automation?)
• Formalisation of the semantic relationships among the concepts and relationships defined in the
DPM Dictionaries, XBRL Taxonomies and SDMX DSDs
• For instance, formalise the equivalence between the concepts related to currency values in all
them so they can be queried transparently using semantic requests
• Additionally, possible to benefit from existing efforts to unify these dictionaries and taxonomies
• ECB Single Data Dictionary (SDD) can also be formalised using semantic technologies and
become the hub for integration using semantic relationships
THANK YOU
1 of 16

Recommended

Visualization Proess by
Visualization ProessVisualization Proess
Visualization ProessPawandeep Kaur
425 views6 slides
Spatial Databases by
Spatial DatabasesSpatial Databases
Spatial DatabasesPratibha Chaudhary
1.6K views27 slides
Vector data model by
Vector data modelVector data model
Vector data modelNaresh Kumar
250 views24 slides
090925 Data Transformation by
090925 Data Transformation090925 Data Transformation
090925 Data TransformationeXchange For Travel (XFT)
369 views8 slides
Spatial databases by
Spatial databasesSpatial databases
Spatial databasesNeha Kulkarni
7.1K views28 slides
Prague Hacks 2015 by
Prague Hacks 2015Prague Hacks 2015
Prague Hacks 2015Ondřej Profant
281 views14 slides

More Related Content

What's hot

Vectors and Rasters by
Vectors and RastersVectors and Rasters
Vectors and RastersJessica Meyer
4.7K views17 slides
Spatial Database Systems by
Spatial Database SystemsSpatial Database Systems
Spatial Database SystemsAsifuzzaman Hridoy
2.6K views13 slides
Spatial databases by
Spatial databasesSpatial databases
Spatial databasesSeraphic Nazir
15.8K views18 slides
Archival galaxies by
Archival galaxiesArchival galaxies
Archival galaxiesSteffi Keran Rani J
106 views39 slides
Spatial databases by
Spatial databasesSpatial databases
Spatial databasesDabbal Singh Mahara
8.4K views57 slides
Structures by
StructuresStructures
StructuresRadhe Syam
9 views10 slides

Similar to Exploring a Semantic Framework for Integrating DPM, XBRL and SDMX Data

Putting Historical Data in Context: how to use DSpace-GLAM by
Putting Historical Data in Context: how to use DSpace-GLAMPutting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAM4Science
3.9K views113 slides
Arches Getty Brownbag Talk by
Arches Getty Brownbag TalkArches Getty Brownbag Talk
Arches Getty Brownbag Talkbenosteen
156 views35 slides
MAD skills for analysis and big data Machine Learning by
MAD skills for analysis and big data Machine LearningMAD skills for analysis and big data Machine Learning
MAD skills for analysis and big data Machine LearningGianvito Siciliano
1.3K views24 slides
Database system concepts by
Database system conceptsDatabase system concepts
Database system conceptsKumar
11.6K views34 slides
Data integration with a façade. The case of knowledge graph construction. by
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
332 views40 slides
Expressing Concept Schemes & Competency Frameworks in CTDL by
Expressing Concept Schemes & Competency Frameworks in CTDLExpressing Concept Schemes & Competency Frameworks in CTDL
Expressing Concept Schemes & Competency Frameworks in CTDLCredential Engine
927 views32 slides

Similar to Exploring a Semantic Framework for Integrating DPM, XBRL and SDMX Data(20)

Putting Historical Data in Context: how to use DSpace-GLAM by 4Science
Putting Historical Data in Context: how to use DSpace-GLAMPutting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAM
4Science3.9K views
Arches Getty Brownbag Talk by benosteen
Arches Getty Brownbag TalkArches Getty Brownbag Talk
Arches Getty Brownbag Talk
benosteen156 views
MAD skills for analysis and big data Machine Learning by Gianvito Siciliano
MAD skills for analysis and big data Machine LearningMAD skills for analysis and big data Machine Learning
MAD skills for analysis and big data Machine Learning
Gianvito Siciliano1.3K views
Database system concepts by Kumar
Database system conceptsDatabase system concepts
Database system concepts
Kumar 11.6K views
Data integration with a façade. The case of knowledge graph construction. by Enrico Daga
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
Enrico Daga332 views
Expressing Concept Schemes & Competency Frameworks in CTDL by Credential Engine
Expressing Concept Schemes & Competency Frameworks in CTDLExpressing Concept Schemes & Competency Frameworks in CTDL
Expressing Concept Schemes & Competency Frameworks in CTDL
Credential Engine927 views
DITA's New Thang: Going Mapless! by dclsocialmedia
DITA's New Thang: Going Mapless!DITA's New Thang: Going Mapless!
DITA's New Thang: Going Mapless!
dclsocialmedia607 views
Data mining query language by GowriLatha1
Data mining query languageData mining query language
Data mining query language
GowriLatha14.4K views
2016 SDMX Experts meeting, Opening of SDMX Capacity Building - Introduction ... by StatsCommunications
2016 SDMX Experts meeting, Opening of SDMX Capacity Building  - Introduction ...2016 SDMX Experts meeting, Opening of SDMX Capacity Building  - Introduction ...
2016 SDMX Experts meeting, Opening of SDMX Capacity Building - Introduction ...
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ... by Safe Software
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...
Safe Software2.1K views
Multi-Model Data Query Languages and Processing Paradigms by Jiaheng Lu
Multi-Model Data Query Languages and Processing ParadigmsMulti-Model Data Query Languages and Processing Paradigms
Multi-Model Data Query Languages and Processing Paradigms
Jiaheng Lu140 views
Etosha - Data Asset Manager : Status and road map by Dr. Mirko Kämpf
Etosha - Data Asset Manager : Status and road mapEtosha - Data Asset Manager : Status and road map
Etosha - Data Asset Manager : Status and road map
Dr. Mirko Kämpf506 views
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW) by Andreas Buckenhofer
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Data(base) taxonomy by Dejan Radic
Data(base) taxonomyData(base) taxonomy
Data(base) taxonomy
Dejan Radic276 views

More from Roberto García

CopyrightLY: Blockchain and Semantic Web for Decentralised Copyright Management by
CopyrightLY: Blockchain and Semantic Web for Decentralised Copyright ManagementCopyrightLY: Blockchain and Semantic Web for Decentralised Copyright Management
CopyrightLY: Blockchain and Semantic Web for Decentralised Copyright ManagementRoberto García
151 views9 slides
Facilitating an agricultural data ecosystem - The EU Code of conduct on agric... by
Facilitating an agricultural data ecosystem- The EU Code of conduct on agric...Facilitating an agricultural data ecosystem- The EU Code of conduct on agric...
Facilitating an agricultural data ecosystem - The EU Code of conduct on agric...Roberto García
67 views14 slides
A pragmatic view on Semantic Technologies by
A pragmatic view on Semantic TechnologiesA pragmatic view on Semantic Technologies
A pragmatic view on Semantic TechnologiesRoberto García
131 views38 slides
Facilitant un ecosistema de dades agràries: El codi de conducta de la Unió Eu... by
Facilitant un ecosistema de dades agràries:El codi de conducta de la Unió Eu...Facilitant un ecosistema de dades agràries:El codi de conducta de la Unió Eu...
Facilitant un ecosistema de dades agràries: El codi de conducta de la Unió Eu...Roberto García
219 views14 slides
ETHICOMP 2020: Exploring Value Sensitive Design for Blockchain Development by
ETHICOMP 2020: Exploring Value Sensitive Design for Blockchain DevelopmentETHICOMP 2020: Exploring Value Sensitive Design for Blockchain Development
ETHICOMP 2020: Exploring Value Sensitive Design for Blockchain DevelopmentRoberto García
135 views19 slides
Social Media Copyright Management using Semantic Web and Blockchain by
Social Media Copyright Management  using Semantic Web and BlockchainSocial Media Copyright Management  using Semantic Web and Blockchain
Social Media Copyright Management using Semantic Web and BlockchainRoberto García
350 views18 slides

More from Roberto García(20)

CopyrightLY: Blockchain and Semantic Web for Decentralised Copyright Management by Roberto García
CopyrightLY: Blockchain and Semantic Web for Decentralised Copyright ManagementCopyrightLY: Blockchain and Semantic Web for Decentralised Copyright Management
CopyrightLY: Blockchain and Semantic Web for Decentralised Copyright Management
Roberto García151 views
Facilitating an agricultural data ecosystem - The EU Code of conduct on agric... by Roberto García
Facilitating an agricultural data ecosystem- The EU Code of conduct on agric...Facilitating an agricultural data ecosystem- The EU Code of conduct on agric...
Facilitating an agricultural data ecosystem - The EU Code of conduct on agric...
Roberto García67 views
A pragmatic view on Semantic Technologies by Roberto García
A pragmatic view on Semantic TechnologiesA pragmatic view on Semantic Technologies
A pragmatic view on Semantic Technologies
Roberto García131 views
Facilitant un ecosistema de dades agràries: El codi de conducta de la Unió Eu... by Roberto García
Facilitant un ecosistema de dades agràries:El codi de conducta de la Unió Eu...Facilitant un ecosistema de dades agràries:El codi de conducta de la Unió Eu...
Facilitant un ecosistema de dades agràries: El codi de conducta de la Unió Eu...
Roberto García219 views
ETHICOMP 2020: Exploring Value Sensitive Design for Blockchain Development by Roberto García
ETHICOMP 2020: Exploring Value Sensitive Design for Blockchain DevelopmentETHICOMP 2020: Exploring Value Sensitive Design for Blockchain Development
ETHICOMP 2020: Exploring Value Sensitive Design for Blockchain Development
Roberto García135 views
Social Media Copyright Management using Semantic Web and Blockchain by Roberto García
Social Media Copyright Management  using Semantic Web and BlockchainSocial Media Copyright Management  using Semantic Web and Blockchain
Social Media Copyright Management using Semantic Web and Blockchain
Roberto García350 views
Copyright Management in the Web 3 by Roberto García
Copyright Management in the Web 3Copyright Management in the Web 3
Copyright Management in the Web 3
Roberto García1.2K views
Integration and Exploration of Financial Data using Semantics and Ontologies by Roberto García
Integration and Exploration of Financial Data using Semantics and OntologiesIntegration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and Ontologies
Roberto García529 views
Multilingual Ontology for Plant Health Threats Media Monitoring by Roberto García
Multilingual Ontology for Plant Health Threats Media MonitoringMultilingual Ontology for Plant Health Threats Media Monitoring
Multilingual Ontology for Plant Health Threats Media Monitoring
Roberto García304 views
BESDUI: Benchmark for End-User Structured Data User Interfaces by Roberto García
BESDUI: Benchmark for End-User Structured Data User InterfacesBESDUI: Benchmark for End-User Structured Data User Interfaces
BESDUI: Benchmark for End-User Structured Data User Interfaces
Roberto García407 views
Semantic Management of your Media Fragments Rights by Roberto García
Semantic Management of your Media Fragments RightsSemantic Management of your Media Fragments Rights
Semantic Management of your Media Fragments Rights
Roberto García3.1K views
Semantic Technologies for Copyright Management by Roberto García
Semantic Technologies for Copyright ManagementSemantic Technologies for Copyright Management
Semantic Technologies for Copyright Management
Roberto García1.2K views
Linked Data: the Entry Point for Worldwide Media Fragments Re-use and Copyrig... by Roberto García
Linked Data: the Entry Point for Worldwide Media Fragments Re-use and Copyrig...Linked Data: the Entry Point for Worldwide Media Fragments Re-use and Copyrig...
Linked Data: the Entry Point for Worldwide Media Fragments Re-use and Copyrig...
Roberto García1.8K views
Semantic Copyright Management of Media Fragments by Roberto García
Semantic Copyright Management of Media FragmentsSemantic Copyright Management of Media Fragments
Semantic Copyright Management of Media Fragments
Roberto García1.1K views
MediaMixer: facilitating media fragments mixing and its rights management usi... by Roberto García
MediaMixer: facilitating media fragments mixing and its rights management usi...MediaMixer: facilitating media fragments mixing and its rights management usi...
MediaMixer: facilitating media fragments mixing and its rights management usi...
Roberto García990 views
Facets and Pivoting for Flexible and Usable Linked Data Exploration by Roberto García
Facets and Pivoting for Flexible and Usable Linked Data ExplorationFacets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data Exploration
Roberto García1K views
Using Semantic Web Technologies to Facilitate XBRL-based Financial Data Compa... by Roberto García
Using Semantic Web Technologies to Facilitate XBRL-based Financial Data Compa...Using Semantic Web Technologies to Facilitate XBRL-based Financial Data Compa...
Using Semantic Web Technologies to Facilitate XBRL-based Financial Data Compa...
Roberto García919 views

Recently uploaded

Mistakes Young Entrepreneurs Make When Approaching Problem Solving by
Mistakes Young Entrepreneurs Make When Approaching Problem SolvingMistakes Young Entrepreneurs Make When Approaching Problem Solving
Mistakes Young Entrepreneurs Make When Approaching Problem SolvingYasser Takie Eddine Abdesselam
60 views31 slides
CORPORATE COMMUNICATION.pdf by
CORPORATE COMMUNICATION.pdfCORPORATE COMMUNICATION.pdf
CORPORATE COMMUNICATION.pdfAKarthikeyan8
14 views71 slides
terms_2.pdf by
terms_2.pdfterms_2.pdf
terms_2.pdfJAWADIQBAL40
52 views8 slides
davood_keshavarz_david_keshavarz_criminal_conviction_prison_sentence_judgemen... by
davood_keshavarz_david_keshavarz_criminal_conviction_prison_sentence_judgemen...davood_keshavarz_david_keshavarz_criminal_conviction_prison_sentence_judgemen...
davood_keshavarz_david_keshavarz_criminal_conviction_prison_sentence_judgemen...morshedislam3
14 views5 slides
Coomes Consulting Business Profile by
Coomes Consulting Business ProfileCoomes Consulting Business Profile
Coomes Consulting Business ProfileChris Coomes
50 views10 slides
Integrating Talent Management Practices by
Integrating Talent Management PracticesIntegrating Talent Management Practices
Integrating Talent Management PracticesSeta Wicaksana
29 views29 slides

Recently uploaded(20)

CORPORATE COMMUNICATION.pdf by AKarthikeyan8
CORPORATE COMMUNICATION.pdfCORPORATE COMMUNICATION.pdf
CORPORATE COMMUNICATION.pdf
AKarthikeyan814 views
davood_keshavarz_david_keshavarz_criminal_conviction_prison_sentence_judgemen... by morshedislam3
davood_keshavarz_david_keshavarz_criminal_conviction_prison_sentence_judgemen...davood_keshavarz_david_keshavarz_criminal_conviction_prison_sentence_judgemen...
davood_keshavarz_david_keshavarz_criminal_conviction_prison_sentence_judgemen...
morshedislam314 views
Coomes Consulting Business Profile by Chris Coomes
Coomes Consulting Business ProfileCoomes Consulting Business Profile
Coomes Consulting Business Profile
Chris Coomes50 views
Integrating Talent Management Practices by Seta Wicaksana
Integrating Talent Management PracticesIntegrating Talent Management Practices
Integrating Talent Management Practices
Seta Wicaksana29 views
Bloomerang_Forecasting Your Fundraising Revenue 2024.pptx.pdf by Bloomerang
Bloomerang_Forecasting Your Fundraising Revenue 2024.pptx.pdfBloomerang_Forecasting Your Fundraising Revenue 2024.pptx.pdf
Bloomerang_Forecasting Your Fundraising Revenue 2024.pptx.pdf
Bloomerang134 views
Imports Next Level.pdf by Bloomerang
Imports Next Level.pdfImports Next Level.pdf
Imports Next Level.pdf
Bloomerang101 views
PMU Launch - Guaranteed Slides by pmulaunch
PMU Launch - Guaranteed SlidesPMU Launch - Guaranteed Slides
PMU Launch - Guaranteed Slides
pmulaunch16 views
2023 Photo Contest.pptx by culhama
2023 Photo Contest.pptx2023 Photo Contest.pptx
2023 Photo Contest.pptx
culhama30 views
The Truth About Customer Journey Mapping by Aggregage
The Truth About Customer Journey MappingThe Truth About Customer Journey Mapping
The Truth About Customer Journey Mapping
Aggregage82 views
Amazing Opportunities: PCD Pharma Franchise in Kerala.pptx by SaphnixMedicure1
Amazing Opportunities: PCD Pharma Franchise in Kerala.pptxAmazing Opportunities: PCD Pharma Franchise in Kerala.pptx
Amazing Opportunities: PCD Pharma Franchise in Kerala.pptx
SaphnixMedicure120 views
Pitch Deck Teardown: Scalestack's $1M AI sales tech Seed deck by HajeJanKamps
Pitch Deck Teardown: Scalestack's $1M AI sales tech Seed deckPitch Deck Teardown: Scalestack's $1M AI sales tech Seed deck
Pitch Deck Teardown: Scalestack's $1M AI sales tech Seed deck
HajeJanKamps502 views
Learning from Failure_ Lessons from Failed Startups.pptx by Codeventures
Learning from Failure_ Lessons from Failed Startups.pptxLearning from Failure_ Lessons from Failed Startups.pptx
Learning from Failure_ Lessons from Failed Startups.pptx
Codeventures11 views
Nevigating Sucess.pdf by TEWMAGAZINE
Nevigating Sucess.pdfNevigating Sucess.pdf
Nevigating Sucess.pdf
TEWMAGAZINE24 views
Assignment 4: Reporting to Management.pptx by BethanyAline
Assignment 4: Reporting to Management.pptxAssignment 4: Reporting to Management.pptx
Assignment 4: Reporting to Management.pptx
BethanyAline18 views

Exploring a Semantic Framework for Integrating DPM, XBRL and SDMX Data

  • 1. EXPLORINGA SEMANTIC FRAMEWORKFOR INTEGRATINGDPM,XBRL AND SDMX DATA Roberto García Associate Professor Universitat deLleida
  • 2. INTRODUCTION • Proliferation financial data and available formats • Increased need for ways to integrate it • Semantic Technologies: • facilitate integration bymoving effort to the level of meanings • instead of trying to deal with syntax subtleties • Explore this alternative through a practical experiment
  • 3. INTEGRATION SOURCES • Data sources: • XBRL, • DataPointModel (DPM) • SDMX • Schema sources: • XBRLTaxonomies, • DPMDataDictionaries • SMX DataStructureDefinitions (DSD)
  • 4. CONCEPTUAL FRAMEWORK • Consider the multidimensional nature of the data (e.g. DPM) • Far beyond 2D data available from spreadsheets • Avoid having to encode “hidden dimensions” into footnotes, attachments, etc. • Dimensions might be hierarchically organised (like geographical administrative divisions) • Proposal: RDF Data Cube Vocabulary (based on semantic technologies, RDF & Web Ontologies) • Supports multidimensional data • Based on SDMX and the Semantic Web vocabulary for statistical data • Web standard (W3C Recommendation) • Approach: • Map DPM and XBRL to the RDF Data Cube Vocabulary (example next) • SDMX trivially becomes RDF based on the Data Cube Vocabulary
  • 5. DATA CUBE Dataset a collection of observations Dimensions identify an observation e.g. observation time or a geographic region Measures represent observed phenomenon Attributes qualify / help interpret observations e.g. units of measure, scaling factors or observation status (estimated, provisional,…) Slice subsets observations by fixing all but one dimension (or a few) https://www.slideshare.net/140er/lets-talkaboutstatisticaldatainrdf
  • 6. RDFDATA CUBE Dataset a collection of observations Dimensions identify an observation e.g. observation time or a geographic region Measures represent observed phenomenon Attributes qualify / help interpret observations e.g. units of measure, scaling factors or observation status (estimated, provisional,…) Slice subsets observations by fixing all but one dimension (or a few) https://www.w3.org/TR/vocab-data-cube/#outline
  • 7. MODELLINGEXAMPLE • Data Point example based on the taxonomy "FINancial REPorting 2016-A Individual (2.1.5)", authored by EBA using DPM 2.5 and based on table "Balance Sheet Statement: Assets (F_01.01)", row "Total assets" and column "Carrying amount” • Metric: eba_mi53 - Carrying amount  Value: 1000 EUR • Dimension 1: BAS – Base  Dimension 1 Value: x6 - Assets • Dimension 2: MCY - Main Category  Dimension 2 Value: x25 - All assets • Plus entity with LEI 549300N33JQ7EG2VD447 and time 2017-07-01
  • 8. MODELLINGEXAMPLE • XBRL representation of the Data Point <xbrli:context id="c1"> <xbrli:entity> <xbrli:identifier scheme="http://standards.iso.org/iso/17442"> 549300N33JQ7EG2VD447</xbrli:identifier> </xbrli:entity> <xbrli:period> <xbrli:instant>2017-07-01</xbrli:instant> </xbrli:period> <xbrli:scenario> <xbrldi:explicitMember dimension="eba_dim:BAS">eba_BA:x6</xbrldi:explicitMember> <xbrldi:explicitMember dimension="eba_dim:MCY">eba_MC:x25</xbrldi:explicitMember> </xbrli:scenario> </xbrli:context> <eba_met:mi53 unitRef="EUR" decimals="-3" contextRef="c1">1</eba_met:mi53>
  • 9. MODELLINGEXAMPLE • RDF Data Cube Vocabulary representation of the Data Point and XBRL instance ex:dst-1/obs-1 a qb:Observation; qb:dataSet ex:dtst-1 ; xbrli:entity lei:549300N33JQ7EG2VD447 ; sdmx-dim:refTime "2017-07-01"^^xsd:date ; eba_dim:BAS eba_BA:x6 ; eba_dim:MCY eba_MC:x25 ; eba_met:mi53 "1"^^xsd:int ; sdmx-att:decimals "-3"^^xsd:int ; sdmx-att:currency currency:EUR .
  • 10. MODELLINGEXAMPLE • RDF Data Cube Vocabulary terms to model: Observations linked to their dataset Dimensions, including entities and time Measures, including data type Attributes, decimals and currency
  • 11. MODELLINGFINANCIAL DATA SCHEMAS • RDF Data Cube Vocabulary also to model how the dimensions, metrics and attributes are structured • Capture • DPM Data Dictionaries • XBRL Taxonomies in a Data Structure Definition (DSD) linked to each dataset
  • 12. MODELLINGFINANCIAL DATA SCHEMAS • DSD also defines the types of the values of measures, dimensions and attributes (their ranges): • Data types (date, integer,…) • Taxonomy terms
  • 13. MODELLINGFINANCIAL DATA SCHEMAS • Example: the range of the property eba_dim:BAS is eba:BA • eba:BA is defined as a SDMX Code List (and a semantic SKOS Concept Scheme) with members: • eba_BA:x6 • eba_BA:x2 • eba_BA:x3 (members can be hierarchically organised)
  • 14. CONCLUSIONS • Possible to use the RDF Data Cube Vocabulary to semantically model and integrate: • Data Point / XBRL Instance • Data Dictionary / XBRL Taxonomy • Per design, also SDMX / DSD • Semantic technologies facilitate the integration by operating at the level of dictionaries and taxonomies • Facilitates multidimensional data management and multiple views on the same data
  • 15. FUTURE WORK • More systematic analysis of how the different constructs in the DPM Dictionaries and XBRL Taxonomies can be mapped to the RDF Data Cube DSDs (automation?) • Formalisation of the semantic relationships among the concepts and relationships defined in the DPM Dictionaries, XBRL Taxonomies and SDMX DSDs • For instance, formalise the equivalence between the concepts related to currency values in all them so they can be queried transparently using semantic requests • Additionally, possible to benefit from existing efforts to unify these dictionaries and taxonomies • ECB Single Data Dictionary (SDD) can also be formalised using semantic technologies and become the hub for integration using semantic relationships