ISWC 2015 – Linked Data at Wolters Kluwer
Adopting Linked Data principles for
accelerating business
transformation processes in Wolters Kluwer
Quentin Reul, PhD
Christian Dirschl
Nuria Casellas, PhD
William Flannery, PhD10/13/2015
ISWC 2015 – Linked Data at Wolters Kluwer
ABOUT WOLTERS KLUWER
ISWC 2015 – Linked Data at Wolters Kluwer
Wolters Kluwer
3
Legal & Regulatory Health
Tax & Accounting Governance, Risk & Compliance
ISWC 2015 – Linked Data at Wolters Kluwer
Wolters Kluwer
4
Legal & Regulatory Health
Tax & Accounting Governance, Risk & Compliance
Wolters Kluwer is an information service and
publishing company providing information and tools to
empower legal, tax, finance, and healthcare
professionals in making the most informed decisions.
ISWC 2015 – Linked Data at Wolters Kluwer
Wolters Kluwer
5
ISWC 2015 – Linked Data at Wolters Kluwer
Wolters Kluwer
Wolters Kluwer provides solutions to
customers in over 170 countries and provides
content in at least a dozen languages.
6
ISWC 2015 – Linked Data at Wolters Kluwer
Wolters Kluwer Transformation
35%
43% 49% 54% 58%
68%
13%
13%
15%
15%
16%
12%52%
44%
36% 31% 26% 20%
2004 2006 2008 2010 2012 2014
Print
Services
Digital
7
ISWC 2015 – Linked Data at Wolters Kluwer
Wolters Kluwer Transformation
35%
43% 49% 54% 58%
68%
13%
13%
15%
15%
16%
12%52%
44%
36% 31% 26% 20%
2004 2006 2008 2010 2012 2014
Print
Services
Digital
8
In 2014, 80% of Wolters Kluwer revenues were
coming from online, software and services.
ISWC 2015 – Linked Data at Wolters Kluwer
Wolters Kluwer Transformation
Static
content
 Books
 CD-Rom
Online
offerings
 One size fits all
 Research driven
Workflow
 Integrate software
 End-to-end
workflow system
Big Data
 Leveraging data
exhaust for
valuable insights
Traditional
information
Segmented
solutions
Targeted
offerings
LOW OPPORTUNITY; PENETRATED HIGH OPPORTUNITY; LOWER PENETRATION
Deliveredvalue
Incremental offerings (additive, not mutually exclusive)
9
ISWC 2015 – Linked Data at Wolters Kluwer
WOLTERS KLUWER CONTENT
STANDARD
ISWC 2015 – Linked Data at Wolters Kluwer
What is content?
Content is the [timely] presentation of
information for a purpose to an
audience through a channel in a form.
11
ISWC 2015 – Linked Data at Wolters Kluwer
Content Standard Evolution
DTD’s (1986) Schema’s (2001) Ontologies (2003)
Scoped to documents
Scoped to documents /
content objects
Scoped to inter-related
content objects
SGML / XML XML XHTML / RDF / RDFa / OWL
Weak/No data-typing Strong data-typing
Reuse of Strong data-typing
from other schema’s
Tight binding between model
and instance
Tight binding between model
and instance
Loose binding between model
and a collection of instances
Non-Formal descriptions of
inter-document dependencies
Non-Formal descriptions of
inter-object dependencies
Formal descriptions of inter-
object dependencies
Fragile extensibility at the
cost of precision
Limited extensibility at the
cost of complexity
Native Extensibility
12
ISWC 2015 – Linked Data at Wolters Kluwer
Content Standard
• Content standard is based on industry Web standards;
• It aims to capture, validate and manage single source content
through separation concerns;
• It is not only intended for human consumption, but is also intended
for enabling task-focused software applications (e.g. workflows,
etc.);
• It aims to improve time to market and reduce costs through a
shared and extendable content model.
13
ISWC 2015 – Linked Data at Wolters Kluwer
Semantic Model
Document Structure
hasDocumentInstance
hasDocumentParthasDocumentInstance
FileResource
FileResource
Fragment
Document
14
ISWC 2015 – Linked Data at Wolters Kluwer
Semantic Model
Document Structure Document Metadata
hasDocumentInstance
hasDocumentParthasDocumentInstance
FileResource
FileResource
Fragment
Document
PID
2014-05-01
Title
displayTitle
localId
sortOn
publisher region
court judge
15
ISWC 2015 – Linked Data at Wolters Kluwer
User Feedback
16
• Words != meaning
• SMEs can’t always convey semantics
• Lack of human-readable documentation
ISWC 2015 – Linked Data at Wolters Kluwer
Lesson Learned
17
• Centralized management
• Rigid logical constraints hinder re-use
• Difficult to define mappings between
content formats
• RDF/XML != XML
• Words != meaning
• SMEs can’t always convey semantics
• Lack of human-readable documentation
ISWC 2015 – Linked Data at Wolters Kluwer
Refactored Semantic Model
18
ISWC 2015 – Linked Data at Wolters Kluwer
USE CASES
ISWC 2015 – Linked Data at Wolters Kluwer
Linked Data Traffic Pattern
20
ISWC 2015 – Linked Data at Wolters Kluwer
Content Delivery Channels
21
Semantic
Integrator
ISWC 2015 – Linked Data at Wolters Kluwer
Content Delivery Channels
Semantic
Integrator
A Semantic Integrator allows content from heterogeneous sources
to be shared across platform based on meaning (and not syntax).
22
ISWC 2015 – Linked Data at Wolters Kluwer
Cross-Source Queries
23
ISWC 2015 – Linked Data at Wolters Kluwer
Cross-Source Queries
24
ISWC 2015 – Linked Data at Wolters Kluwer
Cross-Source Queries
25
Documents indexed against WK topics can be
retrieved using topics defined in external
knowledge organization systems.
ISWC 2015 – Linked Data at Wolters Kluwer
CONCLUSION
26
ISWC 2015 – Linked Data at Wolters Kluwer 27
- Enabling agile integration of distributed content
- Enabling agile integration of legacy content
- Enabling task-oriented workflows
ISWC 2015 – Linked Data at Wolters Kluwer 28
- Enabling agile integration of distributed content
- Enabling agile integration of legacy content
- Enabling task-oriented workflows
- Leveraging the curation of open data
- Leveraging automated content enrichment
- Identifying content impacted new published item
ISWC 2015 – Linked Data at Wolters Kluwer 29
- Enabling agile integration of distributed content
- Enabling agile integration of legacy content
- Enabling task-oriented workflows
- Leveraging the curation of open data
- Leveraging automated content enrichment
- Identifying content impacted new published item
- Supporting the management of chunked content
- Enabling responsive content based on users’ context
- Reporting on customers’ behaviours / needs
- Enabling iterative content enrichment
ISWC 2015 – Linked Data at Wolters Kluwer 30

Adopting linked data principles for accelerating business transformation processes in Wolters Kluwer

  • 1.
    ISWC 2015 –Linked Data at Wolters Kluwer Adopting Linked Data principles for accelerating business transformation processes in Wolters Kluwer Quentin Reul, PhD Christian Dirschl Nuria Casellas, PhD William Flannery, PhD10/13/2015
  • 2.
    ISWC 2015 –Linked Data at Wolters Kluwer ABOUT WOLTERS KLUWER
  • 3.
    ISWC 2015 –Linked Data at Wolters Kluwer Wolters Kluwer 3 Legal & Regulatory Health Tax & Accounting Governance, Risk & Compliance
  • 4.
    ISWC 2015 –Linked Data at Wolters Kluwer Wolters Kluwer 4 Legal & Regulatory Health Tax & Accounting Governance, Risk & Compliance Wolters Kluwer is an information service and publishing company providing information and tools to empower legal, tax, finance, and healthcare professionals in making the most informed decisions.
  • 5.
    ISWC 2015 –Linked Data at Wolters Kluwer Wolters Kluwer 5
  • 6.
    ISWC 2015 –Linked Data at Wolters Kluwer Wolters Kluwer Wolters Kluwer provides solutions to customers in over 170 countries and provides content in at least a dozen languages. 6
  • 7.
    ISWC 2015 –Linked Data at Wolters Kluwer Wolters Kluwer Transformation 35% 43% 49% 54% 58% 68% 13% 13% 15% 15% 16% 12%52% 44% 36% 31% 26% 20% 2004 2006 2008 2010 2012 2014 Print Services Digital 7
  • 8.
    ISWC 2015 –Linked Data at Wolters Kluwer Wolters Kluwer Transformation 35% 43% 49% 54% 58% 68% 13% 13% 15% 15% 16% 12%52% 44% 36% 31% 26% 20% 2004 2006 2008 2010 2012 2014 Print Services Digital 8 In 2014, 80% of Wolters Kluwer revenues were coming from online, software and services.
  • 9.
    ISWC 2015 –Linked Data at Wolters Kluwer Wolters Kluwer Transformation Static content  Books  CD-Rom Online offerings  One size fits all  Research driven Workflow  Integrate software  End-to-end workflow system Big Data  Leveraging data exhaust for valuable insights Traditional information Segmented solutions Targeted offerings LOW OPPORTUNITY; PENETRATED HIGH OPPORTUNITY; LOWER PENETRATION Deliveredvalue Incremental offerings (additive, not mutually exclusive) 9
  • 10.
    ISWC 2015 –Linked Data at Wolters Kluwer WOLTERS KLUWER CONTENT STANDARD
  • 11.
    ISWC 2015 –Linked Data at Wolters Kluwer What is content? Content is the [timely] presentation of information for a purpose to an audience through a channel in a form. 11
  • 12.
    ISWC 2015 –Linked Data at Wolters Kluwer Content Standard Evolution DTD’s (1986) Schema’s (2001) Ontologies (2003) Scoped to documents Scoped to documents / content objects Scoped to inter-related content objects SGML / XML XML XHTML / RDF / RDFa / OWL Weak/No data-typing Strong data-typing Reuse of Strong data-typing from other schema’s Tight binding between model and instance Tight binding between model and instance Loose binding between model and a collection of instances Non-Formal descriptions of inter-document dependencies Non-Formal descriptions of inter-object dependencies Formal descriptions of inter- object dependencies Fragile extensibility at the cost of precision Limited extensibility at the cost of complexity Native Extensibility 12
  • 13.
    ISWC 2015 –Linked Data at Wolters Kluwer Content Standard • Content standard is based on industry Web standards; • It aims to capture, validate and manage single source content through separation concerns; • It is not only intended for human consumption, but is also intended for enabling task-focused software applications (e.g. workflows, etc.); • It aims to improve time to market and reduce costs through a shared and extendable content model. 13
  • 14.
    ISWC 2015 –Linked Data at Wolters Kluwer Semantic Model Document Structure hasDocumentInstance hasDocumentParthasDocumentInstance FileResource FileResource Fragment Document 14
  • 15.
    ISWC 2015 –Linked Data at Wolters Kluwer Semantic Model Document Structure Document Metadata hasDocumentInstance hasDocumentParthasDocumentInstance FileResource FileResource Fragment Document PID 2014-05-01 Title displayTitle localId sortOn publisher region court judge 15
  • 16.
    ISWC 2015 –Linked Data at Wolters Kluwer User Feedback 16 • Words != meaning • SMEs can’t always convey semantics • Lack of human-readable documentation
  • 17.
    ISWC 2015 –Linked Data at Wolters Kluwer Lesson Learned 17 • Centralized management • Rigid logical constraints hinder re-use • Difficult to define mappings between content formats • RDF/XML != XML • Words != meaning • SMEs can’t always convey semantics • Lack of human-readable documentation
  • 18.
    ISWC 2015 –Linked Data at Wolters Kluwer Refactored Semantic Model 18
  • 19.
    ISWC 2015 –Linked Data at Wolters Kluwer USE CASES
  • 20.
    ISWC 2015 –Linked Data at Wolters Kluwer Linked Data Traffic Pattern 20
  • 21.
    ISWC 2015 –Linked Data at Wolters Kluwer Content Delivery Channels 21 Semantic Integrator
  • 22.
    ISWC 2015 –Linked Data at Wolters Kluwer Content Delivery Channels Semantic Integrator A Semantic Integrator allows content from heterogeneous sources to be shared across platform based on meaning (and not syntax). 22
  • 23.
    ISWC 2015 –Linked Data at Wolters Kluwer Cross-Source Queries 23
  • 24.
    ISWC 2015 –Linked Data at Wolters Kluwer Cross-Source Queries 24
  • 25.
    ISWC 2015 –Linked Data at Wolters Kluwer Cross-Source Queries 25 Documents indexed against WK topics can be retrieved using topics defined in external knowledge organization systems.
  • 26.
    ISWC 2015 –Linked Data at Wolters Kluwer CONCLUSION 26
  • 27.
    ISWC 2015 –Linked Data at Wolters Kluwer 27 - Enabling agile integration of distributed content - Enabling agile integration of legacy content - Enabling task-oriented workflows
  • 28.
    ISWC 2015 –Linked Data at Wolters Kluwer 28 - Enabling agile integration of distributed content - Enabling agile integration of legacy content - Enabling task-oriented workflows - Leveraging the curation of open data - Leveraging automated content enrichment - Identifying content impacted new published item
  • 29.
    ISWC 2015 –Linked Data at Wolters Kluwer 29 - Enabling agile integration of distributed content - Enabling agile integration of legacy content - Enabling task-oriented workflows - Leveraging the curation of open data - Leveraging automated content enrichment - Identifying content impacted new published item - Supporting the management of chunked content - Enabling responsive content based on users’ context - Reporting on customers’ behaviours / needs - Enabling iterative content enrichment
  • 30.
    ISWC 2015 –Linked Data at Wolters Kluwer 30