Integration and Exploration of Financial Data using Semantics and Ontologies

Roberto García
Roberto GarcíaAssociate Professor at Universitat de Lleida
Integration and Exploration of Financial
Data using Semantics and Ontologies
Roberto García
HCI and Data Integration Research Group
Universitat de Lleida, Spain
8
Academic Track
Introduction
• Report experience first attempt to map “significant”
amount of XBRL to RDF
– 2008 +100M triples… and as far as we know
• Test Semantic Web and Linked Data for financial
data integration and exploration
• Convert it, map XBRL to RDF
– Apply generic mapping: ReDeFer
• XSD2OWL, XML2RDF
• Once as RDF+OWL, improve comparability using
semantic links
Semantic XBRL
• Dataset size (from US SEC)
– May 2012: 125,55 million triples from 29342 XBRL filings
– https://datahub.io/dataset/semantic-xbrl
– Linked Open Data Cloud
0
20
40
60
80
100
120
140
0
5000
10000
15000
20000
25000
30000
35000
Filings
Triples
(millions)
Publishing XBRL as Linked Open Data
LDOW’09 @ WWW’09
Position Paper @ 2009 W3C Workshop on
Improving Access to Financial Data on the Web
Linking XBRL Financial Data (chapter)
Linking Enterprise Data Book, 2010
Using Semantic Web Technologies to Facilitate
XBRL-based Financial Data Comparability
FEOSW'12 @ ESWC’12
Improving the comparability of financial statements in
a changeable context: a Semantic Web-based approach
PhD Thesis, Economics. Héctor Carretié, 2012
Approach
XML2RDF
XSD2OWL
XBRL Schemas
.xml
XBRL filings
XBRL Ontologies
ReDeFer
Rhizomer
+1M triples
LOD
Linking
RDF2Form
XHTML+
RDFa
RDF2HTML
SPARQL
edit
post
Rhizomer
AJAX
get
.xsd
EDGAR
RSS Feed
download
EDGAR
Standard
Taxonomies
.US
+100M triples
XBRL XML to RDF
• ReDeFer XML2RDF,
model XML tree using triples
– xsd:element and xsd:atribute  rdf:Property
– xbrli:id and xbrli:identifier  rdf:Resource ID
– Other resources, anonymous
Rendered by RDF2SVG
XBRL XSD to OWL
• XBRL Schemas: XBRL 2.1, US GAAP1,
Spanish PGC…
• ReDeFer XSD2OWL
XML Schema OWL
element | attribute
rdf:Property
owl:DatatypeProperty
owl:ObjectProperty
element@substitutionGroup rdfs:subPropertyOf
element@type rdfs:range
complexType owl:Class
complexType//element owl:Restriction
extension@base |
restriction@base
rdfs:subClassOf
@maxOccurs,
@minOccurs
owl:maxCardinality,
owl:minCardinality
sequence,
choice
owl:intersectionOf,
owl:unionOf
Core classes and properties for XBRL Instance
XSD2OWL mappings
1 Ontologies available from http://rhizomik.net/ontologies/bizontos
XML2RDF plus XSD2OWL
• Enrich RDF with links to classes for
corresponding XSD complexTypes:
Comparability
• Generate semantic links across accounting
standards (XBRL taxonomies)
– Automatic ontology alignment tools
– Experts input
– Numeric analysis
• Comparability experiment:
– Telefonica S.A. Balance Sheet 2008 and 2009
• XBRL for Spanish CNMV
• XBRL for US SEC
Semantic Links US SEC - Spanish CNMV
Integration and Exploration of Financial Data using Semantics and Ontologies
DESCRIBE ?r
WHERE {
?r a ifrs:NoncurrentAssets }
DESCRIBE ?r
WHERE {
?r a
ifrs:OtherNoncurrentFinancialAssets }
Conclusions
• XBRL tools: for financial information edition and
maintenance (formulae,…)
• Semantic Web tools: data integration, cross
querying,… COMPARABILITY
– Across filings, companies, accounting principles,…
• However, lack of impact of Semantic XBRL
– Project discontinued 2013
– Too early?
– Too much semantics… or too little?
XSD2OWLXML2RDF
Financial Industry Business Ontology
• FIBO™ open industry standard for defining financial
contracts terms, facts and relationships
• 11 core domains, 49 modules and 418 ontology files:
– Foundations
– Business Entities
– Indices and Indicators
– Securities Reference Terms
– Derivatives
– Loans
– Market Data
– Collective Investment Vehicles
– Corporate Actions
– Payments
– Portfolios and Holdings
– …
Derivatives / Swaps
Experience with US Banks
• Consulting for Cambridge Semantics
– http://www.cambridgesemantics.com
• Banks interest in heavyweight semantics, data
integration for regulators reports automation
• Proof of Concept with Interest Rate Swaps
– Modelling
– Exploration
– Automatic classification
(reasoning)
• Cross-currency swaps
• Single-currency swaps
• Float-Fixed swaps
• …
Experience with US Banks
Thank you for your attention
Questions?
Roberto García
@rogargon
roberto.garcia@udl.cat
http://rhizomik.net/~roberto/
HCI and Data Integration Research Group
Universitat de Lleida, Spain
1 of 17

More Related Content

Similar to Integration and Exploration of Financial Data using Semantics and Ontologies(20)

Omitola birmingham cityunivOmitola birmingham cityuniv
Omitola birmingham cityuniv
Tope Omitola663 views
Standardizing for Open DataStandardizing for Open Data
Standardizing for Open Data
Ivan Herman731 views
IASSIST 2012 - DDI-RDF - Trouble with TriplesIASSIST 2012 - DDI-RDF - Trouble with Triples
IASSIST 2012 - DDI-RDF - Trouble with Triples
Dr.-Ing. Thomas Hartmann693 views
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
Sebastian Hellmann392 views
XBRL in Oracle 11i and R12XBRL in Oracle 11i and R12
XBRL in Oracle 11i and R12
Mahesh Vallampati1.8K views
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us?
Andrea Volpini2.3K views
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web Tutorial
LeeFeigenbaum2.5K views
Michael Lang Sr. PresentationMichael Lang Sr. Presentation
Michael Lang Sr. Presentation
Mediabistro1K views

More from Roberto García(20)

Copyright Management in the Web 3Copyright Management in the Web 3
Copyright Management in the Web 3
Roberto García1.2K views
Damny media mixerDamny media mixer
Damny media mixer
Roberto García910 views
Exploring Linked DataExploring Linked Data
Exploring Linked Data
Roberto García459 views
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic Web
Roberto García2.1K views

Recently uploaded(20)

MOSORE_BRESCIAMOSORE_BRESCIA
MOSORE_BRESCIA
Federico Karagulian5 views
ColonyOSColonyOS
ColonyOS
JohanKristiansson69 views
RuleBookForTheFairDataEconomy.pptxRuleBookForTheFairDataEconomy.pptx
RuleBookForTheFairDataEconomy.pptx
noraelstela166 views
Journey of Generative AIJourney of Generative AI
Journey of Generative AI
thomasjvarghese4918 views
Data structure and algorithm. Data structure and algorithm.
Data structure and algorithm.
Abdul salam 12 views
PTicketInput.pdfPTicketInput.pdf
PTicketInput.pdf
stuartmcphersonflipm314 views
3196 The Case of The East River3196 The Case of The East River
3196 The Case of The East River
ErickANDRADE9011 views
Microsoft Fabric.pptxMicrosoft Fabric.pptx
Microsoft Fabric.pptx
Shruti Chaurasia19 views
RIO GRANDE SUPPLY COMPANY INC, JAYSON.docxRIO GRANDE SUPPLY COMPANY INC, JAYSON.docx
RIO GRANDE SUPPLY COMPANY INC, JAYSON.docx
JaysonGarabilesEspej6 views
Survey on Factuality in LLM's.pptxSurvey on Factuality in LLM's.pptx
Survey on Factuality in LLM's.pptx
NeethaSherra15 views

Integration and Exploration of Financial Data using Semantics and Ontologies

  • 1. Integration and Exploration of Financial Data using Semantics and Ontologies Roberto García HCI and Data Integration Research Group Universitat de Lleida, Spain 8 Academic Track
  • 2. Introduction • Report experience first attempt to map “significant” amount of XBRL to RDF – 2008 +100M triples… and as far as we know • Test Semantic Web and Linked Data for financial data integration and exploration • Convert it, map XBRL to RDF – Apply generic mapping: ReDeFer • XSD2OWL, XML2RDF • Once as RDF+OWL, improve comparability using semantic links
  • 3. Semantic XBRL • Dataset size (from US SEC) – May 2012: 125,55 million triples from 29342 XBRL filings – https://datahub.io/dataset/semantic-xbrl – Linked Open Data Cloud 0 20 40 60 80 100 120 140 0 5000 10000 15000 20000 25000 30000 35000 Filings Triples (millions) Publishing XBRL as Linked Open Data LDOW’09 @ WWW’09 Position Paper @ 2009 W3C Workshop on Improving Access to Financial Data on the Web Linking XBRL Financial Data (chapter) Linking Enterprise Data Book, 2010 Using Semantic Web Technologies to Facilitate XBRL-based Financial Data Comparability FEOSW'12 @ ESWC’12 Improving the comparability of financial statements in a changeable context: a Semantic Web-based approach PhD Thesis, Economics. Héctor Carretié, 2012
  • 4. Approach XML2RDF XSD2OWL XBRL Schemas .xml XBRL filings XBRL Ontologies ReDeFer Rhizomer +1M triples LOD Linking RDF2Form XHTML+ RDFa RDF2HTML SPARQL edit post Rhizomer AJAX get .xsd EDGAR RSS Feed download EDGAR Standard Taxonomies .US +100M triples
  • 5. XBRL XML to RDF • ReDeFer XML2RDF, model XML tree using triples – xsd:element and xsd:atribute  rdf:Property – xbrli:id and xbrli:identifier  rdf:Resource ID – Other resources, anonymous Rendered by RDF2SVG
  • 6. XBRL XSD to OWL • XBRL Schemas: XBRL 2.1, US GAAP1, Spanish PGC… • ReDeFer XSD2OWL XML Schema OWL element | attribute rdf:Property owl:DatatypeProperty owl:ObjectProperty element@substitutionGroup rdfs:subPropertyOf element@type rdfs:range complexType owl:Class complexType//element owl:Restriction extension@base | restriction@base rdfs:subClassOf @maxOccurs, @minOccurs owl:maxCardinality, owl:minCardinality sequence, choice owl:intersectionOf, owl:unionOf Core classes and properties for XBRL Instance XSD2OWL mappings 1 Ontologies available from http://rhizomik.net/ontologies/bizontos
  • 7. XML2RDF plus XSD2OWL • Enrich RDF with links to classes for corresponding XSD complexTypes:
  • 8. Comparability • Generate semantic links across accounting standards (XBRL taxonomies) – Automatic ontology alignment tools – Experts input – Numeric analysis • Comparability experiment: – Telefonica S.A. Balance Sheet 2008 and 2009 • XBRL for Spanish CNMV • XBRL for US SEC
  • 9. Semantic Links US SEC - Spanish CNMV
  • 11. DESCRIBE ?r WHERE { ?r a ifrs:NoncurrentAssets } DESCRIBE ?r WHERE { ?r a ifrs:OtherNoncurrentFinancialAssets }
  • 12. Conclusions • XBRL tools: for financial information edition and maintenance (formulae,…) • Semantic Web tools: data integration, cross querying,… COMPARABILITY – Across filings, companies, accounting principles,… • However, lack of impact of Semantic XBRL – Project discontinued 2013 – Too early? – Too much semantics… or too little? XSD2OWLXML2RDF
  • 13. Financial Industry Business Ontology • FIBO™ open industry standard for defining financial contracts terms, facts and relationships • 11 core domains, 49 modules and 418 ontology files: – Foundations – Business Entities – Indices and Indicators – Securities Reference Terms – Derivatives – Loans – Market Data – Collective Investment Vehicles – Corporate Actions – Payments – Portfolios and Holdings – …
  • 15. Experience with US Banks • Consulting for Cambridge Semantics – http://www.cambridgesemantics.com • Banks interest in heavyweight semantics, data integration for regulators reports automation • Proof of Concept with Interest Rate Swaps – Modelling – Exploration – Automatic classification (reasoning) • Cross-currency swaps • Single-currency swaps • Float-Fixed swaps • …
  • 17. Thank you for your attention Questions? Roberto García @rogargon roberto.garcia@udl.cat http://rhizomik.net/~roberto/ HCI and Data Integration Research Group Universitat de Lleida, Spain