Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A Marriage Made in Heaven?


Published on

DITA was conceived as a model for improving reuse through topic-oriented modularization of content. Instead of creating new content or copying and pasting information which may or may not be current and authoritative, organizations manage a repository of content assets – or DITA topics – that can be centrally managed, maintained and reused across the enterprise. This helps to accelerate the creation and maintenance of documents and other deliverables and to ensure the quality and consistency of the content organizations publish. But the next frontier of DITA adoption is leveraging semantic technologies—taxonomies, ontologies and text analytics—to automate the delivery of targeted content. For example, a service incident from a customer is automatically matched with the appropriate response, which is authored and managed as a DITA topic. Learn how organizations can leverage DITA, semantics, content management, dynamic documents, and linked data to fully utilize the value of their information.

Published in: Technology, News & Politics

DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A Marriage Made in Heaven?

  1. 1. DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A Marriage Made in Heaven? Linked Data Planet June 18, 2008 Amber Swope, Senior Solutions Consultant, JustSystems Paul Wlodarczyk, VP Solutions Consulting, JustSystems
  2. 2. Introductions <ul><li>Amber Swope </li></ul><ul><ul><li>Senior Solutions Consultant, JustSystems </li></ul></ul><ul><ul><li>Almost 20 years of industry experience </li></ul></ul><ul><ul><li>Author of papers/presentations on information development and information architecture, including the DITA Maturity Model </li></ul></ul><ul><ul><li>Supported information developers producing DITA content in Rational group at IBM </li></ul></ul><ul><ul><li>Implements DITA solutions for JustSystems clients </li></ul></ul><ul><li>Paul Wlodarczyk </li></ul><ul><ul><li>Vice President, Solutions Consulting, JustSystems </li></ul></ul><ul><ul><li>25 years experience in content lifecycle technologies </li></ul></ul><ul><ul><li>Author and speaker on various aspects of the content lifecycle </li></ul></ul><ul><ul><li>Background in software development, professional services, content management, document XML </li></ul></ul><ul><ul><li>Business experience in manufacturing, high tech, retail, aerospace, pharma, insurance </li></ul></ul>
  3. 3. Agenda <ul><li>Historical perspective: Document XML </li></ul><ul><li>Use cases for semantics in documents </li></ul><ul><li>DITA Maturity Model </li></ul><ul><li>DITA and Semantics </li></ul><ul><li>Dynamic Documents </li></ul><ul><li>Examples / Case Studies </li></ul>
  4. 4. Content Management System Technical Authors, Subject Matter Experts, and Reviewers XML Assembly and Publishing Why XML Documents? <ul><li>XML separates content, structure, and format </li></ul><ul><ul><li>XML documents are tagged for structure (e.g. Paragraph, Heading, List Item, etc.) </li></ul></ul><ul><ul><li>Structural tagging enables style sheet formatting </li></ul></ul>Because it separates content, structure, and format, XML is perfectly suited for structured authoring and publishing – enabling single-source authoring / multi-channel publishing Print/PDF Wireless Handheld Web/HTML CD-ROM Multiple Outputs Multiple Document Types Manuals Training Help Web Content Promotional Materials Knowledge Bases
  5. 5. Content Management System Technical Authors, Subject Matter Experts, and Reviewers XML Assembly and Publishing The rise of semantics in XML XML adopters soon discovered that true single sourcing required shared or “reused” content Shared content requires metadata for the files that are shared. Semantic metadata facilitates XML content reuse by helping authors share and find shared content at the file level. Print/PDF Wireless Handheld Web/HTML CD-ROM Multiple Outputs Multiple Document Types Manuals Training Help Web Content Promotional Materials Knowledge Bases
  6. 6. Rising use of semantics in XML <ul><li>Some XML adopters implemented sophisticated electronic documents </li></ul><ul><li>Semantic markup within the text (e.g. tagging procedures, part references, other named entities) enables dynamic document behaviors </li></ul><ul><ul><li>Conditional text </li></ul></ul><ul><ul><li>Configurable manuals </li></ul></ul><ul><ul><li>Integrated search </li></ul></ul><ul><ul><li>Data driven fields </li></ul></ul><ul><ul><li>Context menu behaviors </li></ul></ul><ul><ul><li>Applications integration </li></ul></ul><ul><ul><li>Auto-linking </li></ul></ul><ul><ul><li>Embeddable into devices </li></ul></ul><ul><ul><li>Etc. </li></ul></ul>
  7. 7. Use cases for Semantics in XML Documents <ul><li>Authors: Discover existing documents </li></ul><ul><li>Authors: Classify new documents </li></ul><ul><li>Authors: Tag items inline </li></ul><ul><li>Consumers: Navigate, search, retrieve </li></ul>Author Consumer Documents Discover Classify Tag Navigate Search Retrieve
  8. 8. DITA Standard <ul><li>Darwin Information Typing Architecture </li></ul><ul><li>A standardized framework for management and extensibility of XML document types </li></ul><ul><li>The Next Step in XML Manageability </li></ul><ul><ul><li>Interoperability and tool independence </li></ul></ul><ul><ul><li>Reuse </li></ul></ul><ul><ul><li>Collaborative authoring </li></ul></ul><ul><li>Originally developed by IBM </li></ul><ul><li>Published as an OASIS Specification in May 2005 </li></ul><ul><li>80%+ of all new document XML projects are DITA </li></ul>
  9. 9. DITA Maturity Model
  10. 10. Level 1: Topics <ul><li>Single file (document) that contains multiple topics </li></ul><ul><li>Topics are different types, hence the different shapes and colors </li></ul><ul><li>Achieve simple single sourcing </li></ul>
  11. 11. Level 2: Scalable Reuse <ul><li>Multiple maps referencing topics that are stored in individual files </li></ul><ul><li>Same topics can appear in multiple maps </li></ul><ul><li>Achieve flexible reuse </li></ul>Map 1 Map 2
  12. 12. Level 3: Specialization and Customization <ul><li>All topic types are from Topic </li></ul><ul><li>You can develop specializations from any topic type </li></ul><ul><li>Achieve quality and consistency </li></ul>Create the right topic type for your content Topic Task Concept Reference Event Announcement or… Insurance Claim or… Use Case Specification or… Tutorial Policy Report Proposal Services Proposal Product Proposal
  13. 13. Level 4: Automation and Integration <ul><li>Multiple users can create/share/use content from multiple repositories </li></ul><ul><li>Multiple repositories contain multiple topics </li></ul><ul><li>Achieve speed and efficiency </li></ul>
  14. 14. Level 5: Semantics On-Demand <ul><li>Users can create/share/use information (content and data) stored in multiple content and data repositories </li></ul><ul><li>Combination of content and data allows dynamic publishing and mash-ups </li></ul><ul><li>Achieve dynamic personalization </li></ul>Content Repositories Data Sources Map Hubs Taxonomies Dynamic Publishing Mash-ups
  15. 15. Level 6: Universal Semantic Ecosystem <ul><li>All content become usable by all stakeholders </li></ul><ul><li>Achieve universal knowledge management </li></ul>Publishing Company Product Company Government
  16. 16. DITA Paradigm Shifts <ul><li>People </li></ul><ul><ul><li>Writer  Knowledge Worker </li></ul></ul><ul><ul><li>Format  Structure </li></ul></ul><ul><ul><li>File  Tag </li></ul></ul><ul><ul><li>Create  Reuse </li></ul></ul><ul><li>Technology </li></ul><ul><ul><li>DTP  XML/DITA </li></ul></ul><ul><ul><li>Desktop  Enterprise </li></ul></ul><ul><ul><li>File system  Component CMS </li></ul></ul><ul><li>Process </li></ul><ul><ul><li>Publish  Render </li></ul></ul><ul><ul><li>Document-centric  Content-centric </li></ul></ul><ul><ul><li>Manual  Automated </li></ul></ul><ul><li>Content </li></ul><ul><ul><li>Application-specific  Standards-based </li></ul></ul><ul><ul><li>Monoliths  Compound documents </li></ul></ul><ul><ul><li>Structural mark-up  Semantic mark-up </li></ul></ul><ul><li>Topics become the unit of content lifecycle management (semantic base) </li></ul><ul><li>Topics can become the publication in a dynamic publishing scenario </li></ul>
  17. 17. Use cases for Semantics in DITA Documents <ul><li>Authors: Discover topics </li></ul><ul><li>Authors: Classify topics </li></ul><ul><li>Authors: Tag elements </li></ul><ul><li>Consumers: Inquire (answered with topics) </li></ul><ul><li>Consumers: Navigate, search, retrieve topics </li></ul>Author Consumer Topics Discover Classify Tag Inquire Navigate Search Retrieve
  18. 18. Classes of content <ul><li>DITA changes the way all content is created, managed, and consumed </li></ul><ul><ul><li>Component content management necessitates content classification </li></ul></ul><ul><ul><li>Think about how these would change in a component-oriented worlds, where everything is tagged at the topic level (and perhaps lower)? </li></ul></ul><ul><li>Product content </li></ul><ul><ul><li>Documentation </li></ul></ul><ul><ul><li>Technical Support </li></ul></ul><ul><ul><li>Help systems </li></ul></ul><ul><ul><li>Product Training </li></ul></ul><ul><li>Process content </li></ul><ul><ul><li>Policies & Procedures </li></ul></ul><ul><ul><li>SOPs </li></ul></ul><ul><ul><li>Training </li></ul></ul><ul><ul><li>Regulatory filings </li></ul></ul><ul><li>Branded content </li></ul><ul><ul><li>Web content </li></ul></ul><ul><ul><li>Customer correspondence </li></ul></ul><ul><ul><li>Data sheets, glossies </li></ul></ul><ul><ul><li>Advertising copy / media </li></ul></ul>
  19. 19. DMM revisited: Organizations mature to leverage semantics <ul><li>While many organizations adopt DITA initially for the benefits of single-source publishing, then in short order for the reuse (content management) benefits, they will ultimately mature to rely on DITA for its benefits as knowledge management technology </li></ul>
  20. 20. Dynamic Documents and Dynamic Publishing <ul><li>Dynamic Publishing: Using automation, content (structured and unstructured) is “pushed” to the point of consumption. </li></ul><ul><ul><li>Examples: RSS feeds, portals, active / dynamic documents </li></ul></ul><ul><li>Dynamic Documents: Documents that are connected via database queries and web services to authoritative sources that can self-update. Data and documents are combined in a “document application.” </li></ul><ul><ul><li>Content always up-to-date, authoritative, eliminates the need to publish </li></ul></ul><ul><li>Dynamic publishing puts semantic DITA content to use </li></ul>
  21. 21. Process Example: Healthcare payer policies and procedures <ul><li>Step 1: Convert to DITA for the benefit of faster search and retrieval of relevant topics in a call center / claims center environment (Level 1) </li></ul><ul><li>Step 2: Embed the DITA topics in a dynamic document interface that combines member, claim, provider data to create a context for recalling the appropriate policy/procedure topics (Level 5) </li></ul><ul><li>Step 3: Extend the dynamic document interface to capture information about the customer encounter (Level 6) </li></ul>
  22. 22. Product Example: Technical Service Manuals <ul><li>Step 1: Convert to DITA for the benefit of content reuse across related products and multichannel publishing (Levels 1 and 2) </li></ul><ul><li>Step 2: Create a dynamic document viewer that renders the document based upon fault conditions and other data (e.g. diagnostics), and integrates enterprise data such as job ticketing, parts logistics, part catalogs, etc. (Level 5) </li></ul><ul><li>Step 3: Extend the semantic use of DITA to support multi-faceted taxonomies (classify training, documentation, and knowledge base by user/task, product, technology, and faults) to improve information reuse across publications and formats (Level 6) </li></ul><ul><li>Step 4: Provide improved discovery tools to improve relevancy of search to use the new taxonomy (Level 6) </li></ul>
  23. 23. Brand Example: Customer Response Management <ul><li>Step 1: Convert customer responses to DITA for the benefit of content reuse across customer responses (e.g. across multiple proposals, inquiries, or correspondence) (Level 2) </li></ul><ul><li>Step 2: Create a dynamic document application for automatically generating consistent responses that combines DITA topics with enterprise data (e.g. a proposal generator) (Level 3-4) </li></ul><ul><li>Step 3: Extend the semantic use of DITA to support auto-responding (e.g. automated proposal generation, finding “questions” in RFPs and matching “answers” from the DITA topics in the CMS) (Level 5) </li></ul><ul><li>Step 4: Expose the auto-response system directly to customers through the web site (an instance of dynamic publishing) (Level 6) </li></ul>
  24. 24. Summary <ul><li>DITA is fundamentally semantic technology </li></ul><ul><li>Migrating to DITA provides a foundation for a Universal Semantic Ecosystem </li></ul><ul><li>DITA has applicability to Product, Process, and Brand content </li></ul><ul><li>DITA sets the stage for a migration away from static to dynamic publishing </li></ul><ul><li>Dynamic documents can combine semantically tagged DITA content with enterprise data to create context-rich, content-rich applications </li></ul>
  25. 25. <ul><li>Global Presence </li></ul><ul><ul><li>1,000 employees, ‘07 revenues of $110M </li></ul></ul><ul><ul><li>HQ in Japan; Corporate Offices in NJ, Vancouver and London; Sales Offices Worldwide </li></ul></ul><ul><li>Our Experience </li></ul><ul><ul><li>Established in 1979 </li></ul></ul><ul><ul><li>Market leader with over 2,500 customers </li></ul></ul><ul><li>Our Expertise </li></ul><ul><ul><li>Global provider of office productivity, information management, consumer & enterprise software </li></ul></ul><ul><ul><li>Framework for XML-based content creation, integration, visualization and delivery </li></ul></ul><ul><li>Our Credibility </li></ul><ul><ul><li>Gartner “Cool Vendor” in CM 2008 </li></ul></ul><ul><ul><li>KM World “100 Companies that Matter” 2007 </li></ul></ul><ul><ul><li>IBM CTO Innovation Award 2006 </li></ul></ul>About JustSystems 2,500 Customers, Marquee Brands
  26. 26. For more information <ul><li>Visit our website: </li></ul><ul><ul><li>http:// </li></ul></ul><ul><li>DITA Maturity Model landing page: </li></ul><ul><ul><li> </li></ul></ul><ul><li>Email us: </li></ul><ul><ul><li>[email_address] </li></ul></ul><ul><li>Call us: </li></ul><ul><ul><li>(866) 793-1542 </li></ul></ul>