
Triplificating and linking XBRL financial data


Financial reports based on the XML standard XBRL are mapped to RDF together with the associated XBRL taxonomies, which are mapped to OWL ontologies.

Published in: Technology, Education

  1. Triplificating and Linking XBRL Financial Data
     Roberto García, Rosa Gil
     HCI & Data Integration Research Group, Universitat de Lleida, Spain
     6th International Conference on Semantic Systems, I-SEMANTICS'10, 1-3 September 2010, Graz, Austria
  2. Introduction
       • Explore Semantic Web technologies for financial information
       • Most public financial data is available as XBRL
       • Reuse this data by mapping XBRL to RDF
           • Apply a generic mapping: ReDeFer
               • XSD2OWL, XML2RDF
       • Publish as Linked Data using Rhizomer
  3. Approach
       • Reuse XBRL data and schemas
       • Facilitate roundtrip:
           • XBRL → SW → XBRL
       • Showcase Semantic Web benefits
       • Test data: SEC's EDGAR voluntary program [1]
     [1] From 2009, 3-year phased-in schedule; by the end of 2011, all U.S. public companies plus 8,000 mutual funds
  4. ReDeFer
       • Part of the Rhizomik initiative
       • ReDeFer (in and out of the Semantic Web)
           • RDF2HTML+RDFa
           • RDF2SVG
           • XML2RDF
           • XSD2OWL
           • Use cases: MPEG-7, MPEG-21, …
  5. XBRL XML to RDF
       • ReDeFer XML2RDF: model the XML tree using triples
           • xsd:element and xsd:attribute → rdf:Property
           • xbrli:id and xbrli:identifier → rdf:Resource ID
           • Other resources are anonymous
     Figure: example RDF graph, rendered by RDF2SVG
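The tree-to-triples idea on this slide can be illustrated with a minimal Python sketch. It is not ReDeFer's XML2RDF implementation: the sample document, the `value` property for text content, the `#id` and `_:bN` naming, and the CIK value are all assumptions made for illustration, and only the `id` attribute (not `xbrli:identifier`) is used to name resources.

```python
import itertools
import xml.etree.ElementTree as ET

# Hypothetical, namespace-free sample; real XBRL instances use xbrli: namespaces.
SAMPLE = """<context id="c1">
  <entity>
    <identifier scheme="http://www.sec.gov/CIK">0000123456</identifier>
  </entity>
  <period><instant>2009-08-28</instant></period>
</context>"""


def xml_to_triples(xml_text):
    """Walk an XML tree and return (subject, predicate, object) triples."""
    counter = itertools.count(1)
    triples = []

    def node_id(elem):
        # Elements with an id get a stable name; all others are anonymous.
        ident = elem.get("id")
        return f"#{ident}" if ident else f"_:b{next(counter)}"

    def visit(elem, subject):
        # Attributes become properties with literal values.
        for name, value in elem.attrib.items():
            if name != "id":
                triples.append((subject, name, value))
        for child in elem:
            text = (child.text or "").strip()
            if len(child) == 0 and not child.attrib:
                # Plain leaf element: property with a literal value.
                triples.append((subject, child.tag, text))
            else:
                # Structured element: property pointing at a new resource.
                obj = node_id(child)
                triples.append((subject, child.tag, obj))
                if text:
                    triples.append((obj, "value", text))
                visit(child, obj)

    root = ET.fromstring(xml_text)
    visit(root, node_id(root))
    return triples


if __name__ == "__main__":
    for triple in xml_to_triples(SAMPLE):
        print(triple)
```

The context keeps its own identifier (`#c1`) because it carries an `id`, while the nested entity, identifier and period nodes become anonymous resources.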
  6. XBRL XSD to OWL
       • XBRL schemas: XBRL 2.1, US GAAP [1], …
       • ReDeFer XSD2OWL
     Figure: core classes and properties for XBRL Instance
     XSD2OWL mappings (XML Schema → OWL):
       • element | attribute → rdf:Property, owl:DatatypeProperty, owl:ObjectProperty
       • [email_address] → rdfs:subPropertyOf
       • [email_address] → rdfs:range
       • complexType → owl:Class
       • complexType//element → owl:Restriction
       • extension@base | restriction@base → rdfs:subClassOf
       • @maxOccurs, @minOccurs → owl:maxCardinality, owl:minCardinality
       • sequence, choice → owl:intersectionOf, owl:unionOf
     [1] Ontologies available from
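A few rows of the mapping table can be sketched in Python as follows. This is a simplified illustration rather than the actual XSD2OWL tool: it covers only named complexTypes, `extension@base` / `restriction@base`, and the occurrence bounds of nested elements, and the schema fragment (`ExampleItemType`, `BaseItemType`, `note`) is hypothetical.

```python
import xml.etree.ElementTree as ET

XS = "{http://www.w3.org/2001/XMLSchema}"

# Hypothetical schema fragment, not taken from the XBRL 2.1 or US GAAP schemas.
SAMPLE_XSD = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:complexType name="ExampleItemType">
    <xs:complexContent>
      <xs:extension base="BaseItemType">
        <xs:sequence>
          <xs:element name="note" minOccurs="0" maxOccurs="unbounded"/>
        </xs:sequence>
      </xs:extension>
    </xs:complexContent>
  </xs:complexType>
</xs:schema>"""


def xsd_to_owl(xsd_text):
    """Describe each named complexType as an OWL-like class record."""
    classes = []
    for ctype in ET.fromstring(xsd_text).iter(XS + "complexType"):
        name = ctype.get("name")
        if name is None:
            continue  # anonymous types are skipped in this sketch
        description = {"class": name, "subClassOf": [], "restrictions": []}
        # extension@base | restriction@base -> rdfs:subClassOf
        for tag in ("extension", "restriction"):
            for node in ctype.iter(XS + tag):
                description["subClassOf"].append(node.get("base"))
        # complexType//element -> owl:Restriction with cardinality bounds
        for elem in ctype.iter(XS + "element"):
            max_occurs = elem.get("maxOccurs", "1")  # XML Schema default is 1
            description["restrictions"].append({
                "onProperty": elem.get("name") or elem.get("ref"),
                "minCardinality": int(elem.get("minOccurs", "1")),
                "maxCardinality": None if max_occurs == "unbounded" else int(max_occurs),
            })
        classes.append(description)
    return classes


if __name__ == "__main__":
    print(xsd_to_owl(SAMPLE_XSD))
```

An unbounded `maxOccurs` is recorded as `None`, meaning no owl:maxCardinality restriction would be emitted for that property.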
  7. XML2RDF plus XSD2OWL
       • Enrich the RDF with links to the classes for the corresponding XSD complexTypes
  8. XBRL Sources
       • SEC's EDGAR filings data files
     Diagram labels: XML2RDF, XSD2OWL, Calculations, Labels, Presentation, Definition
  9. XBRL Ontologies
     Diagram labels: XBRL 2.1, US-GAAP, Non-GAAP; adbe-20090828.owl, adbe-20090828.rdf, adbe-20090529.rdf, adbe-20090529.owl, …, adbe-20080916.rdf, …, adbe-20080829.owl
  10. Ontology Alignment
       • Integrate ontologies for different filings
           • E.g.: adbe-20080530.owl:InvestmentLeaseReceivable = adbe-20080829.owl:InvestmentInLeaseReceivable
           • Ontology alignment tools (edit distance)
           • Perform queries across filings
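The slide does not name the alignment tool, so the following Python sketch only illustrates the edit-distance idea with a plain Levenshtein distance. The threshold `max_distance=2` and the non-matching concept names (`Goodwill`, `Revenues`) are assumptions made for illustration.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings (dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def align(names_a, names_b, max_distance=2):
    """Pair each name in names_a with its closest name in names_b.

    Pairs farther apart than max_distance are dropped. names_b is
    assumed to be non-empty.
    """
    pairs = []
    for name in names_a:
        best = min(names_b, key=lambda other: edit_distance(name, other))
        if edit_distance(name, best) <= max_distance:
            pairs.append((name, best))
    return pairs


if __name__ == "__main__":
    filing_a = ["InvestmentLeaseReceivable", "Goodwill"]
    filing_b = ["InvestmentInLeaseReceivable", "Revenues"]
    print(align(filing_a, filing_b))
```

The two concept names from the slide differ only by the inserted "In", which gives an edit distance of 2, so a small threshold is enough to pair them while unrelated names stay unmatched.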
  11. Semantic XBRL
       • Dataset size
           • Sept 2010: 9.19 million triples from 2,974 XBRL filings
           • Available from
  12. Semantic XBRL
       • Linked Data:
           • Links to DBpedia:
               • Companies: Central Index Key (CIK), name or ticker through dbpprop:redirect
                   • dbpedia:Microsoft is dbpprop:redirect of dbpedia:MSFT
               • Units: e.g. USD → dbpedia:United_States_dollar
           • Links to Corporate Ownership RDF Data:
               • (Individual ownership and subsidiaries)
               • Companies: CIK
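As a rough illustration of the DBpedia linking described on this slide, the Python sketch below resolves a company name or ticker through a redirect table and maps a unit code to a DBpedia resource. The in-memory tables hold only the two examples given on the slide; a real system would presumably look the redirects up in DBpedia itself, and the function names are hypothetical.

```python
DBPEDIA = "http://dbpedia.org/resource/"

# Only the examples from the slide; real tables would come from DBpedia.
REDIRECTS = {DBPEDIA + "MSFT": DBPEDIA + "Microsoft"}
UNITS = {"USD": DBPEDIA + "United_States_dollar"}


def resolve_company(label):
    """Turn a name or ticker into a DBpedia URI, following a known redirect."""
    candidate = DBPEDIA + label.strip().replace(" ", "_")
    return REDIRECTS.get(candidate, candidate)


def link_unit(code):
    """Return the DBpedia URI for a unit code, or None if it is unknown."""
    return UNITS.get(code.upper())


if __name__ == "__main__":
    print(resolve_company("MSFT"))
    print(link_unit("USD"))
```

Both the ticker and the company name end up at the same resource, which is what allows filings that identify a company differently to be linked together.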
  13. Architecture
  14. Prototype Demo
       • Rhizomer:
           • publish
           • query
           • browse
           • edit
           • mashup (map, timeline, …)
           • etc.
  15. Get a context description from its URL
  16. Reuse Wikipedia data (DBpedia)
  17. Get all Investment[In]LeaseReceivable
      SELECT ?o WHERE { ?s ?p ?o . ?p <> <> . }
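The query on this slide has lost its two IRIs, so the Python sketch below only mirrors its shape: find every property `?p` linked to a target concept, then collect the objects of all statements that use such a property. The `ex:alignedWith` predicate, the `ex:LeaseReceivableConcept` target, the `a:` / `b:` prefixes and the values are hypothetical stand-ins, not the missing IRIs.

```python
# Hypothetical triples from two filings plus alignment links.
TRIPLES = [
    ("a:InvestmentLeaseReceivable", "ex:alignedWith", "ex:LeaseReceivableConcept"),
    ("b:InvestmentInLeaseReceivable", "ex:alignedWith", "ex:LeaseReceivableConcept"),
    ("a:fact1", "a:InvestmentLeaseReceivable", "100"),
    ("b:fact1", "b:InvestmentInLeaseReceivable", "120"),
    ("b:fact2", "b:Goodwill", "999"),
]


def values_for_aligned_property(triples, link_predicate, target):
    """Evaluate the join { ?s ?p ?o . ?p link_predicate target } and return ?o."""
    # Second pattern: bind ?p to every property linked to the target.
    properties = {s for (s, p, o) in triples if p == link_predicate and o == target}
    # First pattern: collect ?o for statements whose predicate is such a ?p.
    return [o for (s, p, o) in triples if p in properties]


if __name__ == "__main__":
    print(values_for_aligned_property(
        TRIPLES, "ex:alignedWith", "ex:LeaseReceivableConcept"))
```

Because both filing-specific properties are linked to the same target, one query returns the values from both filings and leaves unrelated facts out.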
  18. Integrate different CIK variants
  19. Conclusions
       • XBRL tools: for editing and maintaining financial information (formulae, …)
       • Semantic Web tools: data integration (linked data), cross querying, …
           • Across filings, companies, accounting principles, …
     Diagram labels: XSD2OWL, XML2RDF
  20. Future Work
       • Systematic mappings across filings for the same company
       • Explore mappings:
           • Across companies
           • Across accounting principles, …
       • Develop demo web apps:
           • E.g.: show accounting data for ownership/subsidiary networks of companies, …
  21. Thank you for your attention
      Roberto García [email_address]
      HCI & Data Integration Research Group, Universitat de Lleida, Spain