Linked Data for Enterprise Information
Integration
Sören Auer
© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS
The Web evolves into a Web of Data
2
Linked O...
© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS
The Evolution of the Web
3
Web 1.0 - Hypertex...
© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS
Linked Data Principles
1. Use URIs to identif...
© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS
Linked Enterprise Data Principles
1. Evolve e...
© Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS
Linked Enterprise Data Advantages
• Light-wei...
Creating Knowledge
out of Interlinked Data
Inter-linking/
Fusing
Classifi-cation/
Enrichment
Quality
Analysis
Evolution /
...
Creating Knowledge
out of Interlinked Data
Extraction
Inter-
linking
Enrichm
ent
Quality
Analysis
Evolution
Repair
Explora...
Creating Knowledge
out of Interlinked Data
From unstructured sources
• NLP, text mining, annotation
From semi-structured s...
Creating Knowledge
out of Interlinked Data
Many different approaches: D2R, Virtuoso RDF Views, Triplify,
No agreement on a...
Creating Knowledge
out of Interlinked Data
• Rationale: Exploit existing formalisms
(SQL, SPARQL Construct) as much as
pos...
Creating Knowledge
out of Interlinked Data
Storage and Querying
Inter-
linking
Enrichm
ent
Quality
Analysis
Evolution
Repa...
Authoring
Inter-
linking
Enrichm
ent
Quality
Analysis
Evolution
Repair
Explora-
tion
Extrac-
tion
tore
uery
Author
ing
Creating Knowledge
out of Interlinked Data
1. Semantic (Text) Wikis
• Authoring of semantically
annotated texts
2. Semanti...
Creating Knowledge
out of Interlinked Data
The situation at Daimler (€97.76 billion revenue, 250.000
employees):
• 3.000 h...
Creating Knowledge
out of Interlinked Data
16
Search before
Creating Knowledge
out of Interlinked Data
Creating Knowledge
out of Interlinked Data
OntoWiki
with loaded
car model
data
Creating Knowledge
out of Interlinked Data
Management of Enterprise Taxonomies with OntoWiki
Based on the W3C SKOS standar...
Creating Knowledge
out of Interlinked Data
Search after
Showing recommondations
from the knowledge base
integrating car mo...
Creating Knowledge
out of Interlinked Data
You can search for „Kombi“
(station wagon) and find T-
Models (Daimler term for...
FromIntranettoEnterpriseDataWebaroundaknowledgehub
Auer, Frischmuth, Klímek, Unbehauen, Holzweißig, Marquardt: Linked Data...
Creating Knowledge
out of Interlinked Data
© CC-BY-NC-ND by ~Dezz~ (residae on flickr)
Linking
Inter-
linking
Enrichm
ent
...
Creating Knowledge
out of Interlinked Data
In an uncontrolled
environment as the Data
Web, there will be a
proliferation o...
Creating Knowledge
out of Interlinked Data
Enrichment
Inter-
linking
Enrichm
ent
Quality
Analysis
Evolution
Repair
Explora...
Creating Knowledge
out of Interlinked Data
Linked Data is mainly instance data!!!
ORE (Ontology Repair and Enrichment) too...
Creating Knowledge
out of Interlinked Data
Analysis
Quality
Inter-
linking
Enrichm
ent
Quality
Analysis
Evolution
Repair
E...
Creating Knowledge
out of Interlinked Data
Quality on the Data Web is varying a lot
• Hand crafted or expensively curated ...
Creating Knowledge
out of Interlinked Data
Evolution © CC-BY-SA by alasis on flickr)
Inter-
linking
Enrichm
ent
Quality
An...
Creating Knowledge
out of Interlinked Data
Exploration
Inter-
linking
Enrichm
ent
Quality
Analysis
Evolution
Repair
Explor...
Creating Knowledge
out of Interlinked Data
An ecosystem of LOD visualizations
LODExploration
Widgets
Spatial faceted-
brow...
Creating Knowledge
out of Interlinked Data
LOD Life-(Washing-)cycle supported by Debian
based LOD2 Stack
http://stack.lod2...
Creating Knowledge
out of Interlinked Data
Linked Enterprise Intra Data Webs fill the gap
between Intra-/Extranets and EIS...
Creating Knowledge
out of Interlinked Data
• Linked Data is a promising technology for closing the
gap between SOA and uns...
Creating Knowledge
out of Interlinked Data
Thanks for your attention!
Sören Auer
http://www.informatik.uni-leipzig.de/~aue...
Upcoming SlideShare
Loading in …5
×

Linked data for Enterprise Data Integration

3,369 views

Published on

The Web evolves into a Web of Data. In parallel Intranets of large companies will evolve into Data Intranets based on the Linked Data principles. Linked Data has the potential to complement the SOA paradigm with a light-weight, adaptive data integration approach.

Published in: Technology, Education
0 Comments
14 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
3,369
On SlideShare
0
From Embeds
0
Number of Embeds
39
Actions
Shares
0
Downloads
78
Comments
0
Likes
14
Embeds 0
No embeds

No notes for slide
  • http://www.flickr.com/photos/residae/2560241604/#/
  • http://www.flickr.com/photos/alasis/3541341601/sizes/l/in/photostream/
  • Linked data for Enterprise Data Integration

    1. 1. Linked Data for Enterprise Information Integration Sören Auer
    2. 2. © Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS The Web evolves into a Web of Data 2 Linked Open Data Facebook Open Graph
    3. 3. © Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS The Evolution of the Web 3 Web 1.0 - Hypertext  Static Web pages  Hyperlinks  Link directories Web 2.0 – Social Apps  Social Web  Crowd-sourcing  Mashups Web 3.0 – Linked Data  REST APIs, RDF, JSON-LD  Vocabularies  Rich-snippets, Semantic Search 1990 2000 2010 Intranet 1.0 - Hypertext  Static Intranet pages  Keyword search  Hyperlinks Intranet 2.0 – Social Enterprise Apps  Salesforce  Crowd-sourcing  Mashups Intranet 3.0 – Enterprise Data Intranet  URI Scheme  Enterprise taxonomies / knowledge bases  RDB2RDF Mapping 1995 2005 2015 & Enterprise Intranets
    4. 4. © Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS Linked Data Principles 1. Use URIs to identify the “things” in your data 2. Use http:// URIs so people (and machines) can look them up on the web 3. When a URI is looked up, return a description of the thing (in RDF format) 4. Include links to related things http://www.w3.org/DesignIssues/LinkedData.html 4
    5. 5. © Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS Linked Enterprise Data Principles 1. Evolve existing existing taxonomies into enterprise knowledge bases/hubs 2. Establish a enterprise wide URI scheme 3. Equip existing information systems in your intranet with Linked Data interfaces 4. Establish links between related information 5
    6. 6. © Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS Linked Enterprise Data Advantages • Light-weight linked data integration complements more complex SOA architectures • Unified data (access) model simplifies data integration • Increase standardization while preserving diversity • Facilitate information flows along supply and value creation chains  Dramatically reduce data integration costs, increase enterprise flexibility 6
    7. 7. Creating Knowledge out of Interlinked Data Inter-linking/ Fusing Classifi-cation/ Enrichment Quality Analysis Evolution / Repair Search/ Browsing/ Exploration Extraction Storage/ Querying Manual revision/ authoring Linked Data Lifecycle
    8. 8. Creating Knowledge out of Interlinked Data Extraction Inter- linking Enrichm ent Quality Analysis Evolution Repair Explora- tion Extrac- tion Store Query Author ing
    9. 9. Creating Knowledge out of Interlinked Data From unstructured sources • NLP, text mining, annotation From semi-structured sources • DBpedia, LinkedGeoData, DataCube From structured sources • RDB2RDF Extraction
    10. 10. Creating Knowledge out of Interlinked Data Many different approaches: D2R, Virtuoso RDF Views, Triplify, No agreement on a formal semantics of RDF2RDF mapping • LOD readiness, SPARQL-SQL translation W3C RDB2RDF WG Extraction Relational Data Tool Triplify Sparqlify D2RQ Virtuoso RDF Views Technology Scripting languages (PHP) Java Java Whole middleware solution SPARQL endpoint - X X X Mapping language SQL SPARQL CONSTRUCT Views + SQL RDF based RDF based Mapping generation Manual Semi- automatic Semi- automatic Manual Scalability Medium- high (but no SPARQL) Very high Medium High Malhotra, Auer, Erling, Hausenblas: W3C RDB2RDF Incubator Group Report. W3C RDB2RDF Incubator Group, 2009.
    11. 11. Creating Knowledge out of Interlinked Data • Rationale: Exploit existing formalisms (SQL, SPARQL Construct) as much as possible • flexible & versatile mapping language • translating one SPARQL query into exactly one efficiently executable SQL query • Solid theoretical formalization based on SPARQL-relational algebra transformations • Extremely scalable through elaborated view candidate selection mechanism • Used to publish 20B triples for LinkedGeoData Sparqlify Stadler, Unbehauen, Auer, Lehmann: Sparqlify – Very Large Scale Linked Data Publication from Relational Databases. Submitted to VLDB-Journal. SPARQL Construct SQL View Bridge
    12. 12. Creating Knowledge out of Interlinked Data Storage and Querying Inter- linking Enrichm ent Quality Analysis Evolution Repair Explora- tion Extrac- tion Store Query Author ing
    13. 13. Authoring Inter- linking Enrichm ent Quality Analysis Evolution Repair Explora- tion Extrac- tion tore uery Author ing
    14. 14. Creating Knowledge out of Interlinked Data 1. Semantic (Text) Wikis • Authoring of semantically annotated texts 2. Semantic Data Wikis • Direct authoring of structured information (i.e. RDF, RDF-Schema, OWL) Two Kinds of Semantic Wikis
    15. 15. Creating Knowledge out of Interlinked Data The situation at Daimler (€97.76 billion revenue, 250.000 employees): • 3.000 heterogeneous IT systems • Different units (car, bus, truck etc.) with very different views • No common language • Inability to identify crucial entities (parts, locations etc.) enterprise wide There is no (can not be a) single Enterprise Information Model A distributed, iterative, bottom-up integration approach such as Linked Data might be able to help (pay-as-you-go). Can Linked Data help to solve the EII problem in a fortune-500 company?
    16. 16. Creating Knowledge out of Interlinked Data 16 Search before
    17. 17. Creating Knowledge out of Interlinked Data
    18. 18. Creating Knowledge out of Interlinked Data OntoWiki with loaded car model data
    19. 19. Creating Knowledge out of Interlinked Data Management of Enterprise Taxonomies with OntoWiki Based on the W3C SKOS standard Corporate Language Management at Daimler: 500k concepts in 20 languages
    20. 20. Creating Knowledge out of Interlinked Data Search after Showing recommondations from the knowledge base integrating car model data and enterprise taxonomy
    21. 21. Creating Knowledge out of Interlinked Data You can search for „Kombi“ (station wagon) and find T- Models (Daimler term for station waggon)
    22. 22. FromIntranettoEnterpriseDataWebaroundaknowledgehub Auer, Frischmuth, Klímek, Unbehauen, Holzweißig, Marquardt: Linked Data in Enterprise Information Integration Submitted to Semantic Web Journal 2012.
    23. 23. Creating Knowledge out of Interlinked Data © CC-BY-NC-ND by ~Dezz~ (residae on flickr) Linking Inter- linking Enrichm ent Quality Analysis Evolution Repair Explora- tion Extrac- tion Store Query Author ing
    24. 24. Creating Knowledge out of Interlinked Data In an uncontrolled environment as the Data Web, there will be a proliferation of equivalent or similar entity identifiers Manual Link discovery: • Sindice integration into UIs • Semantic Pingback Semi-automatic: • SILK • LIMES Automatic/ Supervised: • Raven [1] Linking Entities on the Data Web [1] Ngonga, Lehmann, Auer, Höffner: RAVEN -- Active Learning of Link Specifications, OM@ISWC, 2011.
    25. 25. Creating Knowledge out of Interlinked Data Enrichment Inter- linking Enrichm ent Quality Analysis Evolution Repair Explora- tion Extrac- tion Store Query Author ing
    26. 26. Creating Knowledge out of Interlinked Data Linked Data is mainly instance data!!! ORE (Ontology Repair and Enrichment) tool allows to improve an OWL ontology by fixing inconsistencies & making suggestions for adding further axioms. • Ontology Debugging: OWL reasoning to detect inconsistencies and satisfiable classes + detect the most likely sources for the problems. user can create a repair plan, while maintaining full control. • Ontology Enrichment: uses the DL-Learner framework to suggest definitions & super classes for existing classes in the KB. works if instance data is available for harmonising schema and data. http://aksw.org/Projects/ORE Enrichment & Repair Lehmann, Auer, Tramp: Class Expression Learning for Ontology Engineering. Journal of Web Semantics (JWS), 2011.
    27. 27. Creating Knowledge out of Interlinked Data Analysis Quality Inter- linking Enrichm ent Quality Analysis Evolution Repair Explora- tion Extrac- tion Store Query Author ing CC BY SA Wikipedia
    28. 28. Creating Knowledge out of Interlinked Data Quality on the Data Web is varying a lot • Hand crafted or expensively curated knowledge base (e.g. DBLP, UMLS) vs. extracted from text or Web 2.0 sources (DBpedia) Research Challenge • Establish measures for assessing the authority, provenance, reliability of Data Web resources Opportunity for EII: Employ crowd-sourced knowledge from the Data Web in the Enterprise Linked Data Quality Analysis FP7-IP DIACHRON Managing the Evolution and Preservation of the Data Web Started April 2013
    29. 29. Creating Knowledge out of Interlinked Data Evolution © CC-BY-SA by alasis on flickr) Inter- linking Enrichm ent Quality Analysis Evolution Repair Explora- tion Extrac- tion Store Query Author ing
    30. 30. Creating Knowledge out of Interlinked Data Exploration Inter- linking Enrichm ent Quality Analysis Evolution Repair Explora- tion Extrac- tion Store Query Author ing
    31. 31. Creating Knowledge out of Interlinked Data An ecosystem of LOD visualizations LODExploration Widgets Spatial faceted- browsing Faceted- browsing Statistical visualization Entity-/faceted- Based browsing Domain specific visualizations … … LODDatasetsChoreography layer • Dataset analysis (size, vocabularies, property histograms etc.) • Selection of suitable visualization widgets Brunetti, Auer, García: The Linked Data Visualization Model. To appear in IJSWIS, 2012.
    32. 32. Creating Knowledge out of Interlinked Data LOD Life-(Washing-)cycle supported by Debian based LOD2 Stack http://stack.lod2.eu
    33. 33. Creating Knowledge out of Interlinked Data Linked Enterprise Intra Data Webs fill the gap between Intra-/Extranets and EIS/ERP Unstructured Information Management Structured Information Management Support the long tail of enterprise information domains • Human-resources • Requirements engineering • Supply-chains
    34. 34. Creating Knowledge out of Interlinked Data • Linked Data is a promising technology for closing the gap between SOA and unstructured information management • wealth of knowledge available as LOD can be leveraged as background knowledge for Enterprise applications • The application of Linked Data in the enterprise is still largely unexplored (opportunity) • Linked Data will make Enterprise Information Integration more flexible, iterative, cost effective Take home messages Auer, Frischmuth, Klímek, Tramp, Unbehauen, Holzweißig, Marquardt: Linked Data in Enterprise Information Integration Submitted to Semantic Web Journal.
    35. 35. Creating Knowledge out of Interlinked Data Thanks for your attention! Sören Auer http://www.informatik.uni-leipzig.de/~auer | http://aksw.org | http://lod2.org auer@cs.uni-bonn.de

    ×