SlideShare a Scribd company logo
1 of 32
Confidential & Proprietarywww.dclab.comwww.dclab.com
Converting and Integrating Content When
Moving to a New CMS
Greg Fagan,
Sales Director,
Data Conversion Laboratory (DCL)
Confidential & Proprietarywww.dclab.com 2
Valuable Content Transformed
• Document Digitization
• XML and HTML Conversion
• eBook Production
• Hosted Solutions
• Big Data Automation
• Conversion Management
• Editorial Services
• Harmonizer
Confidential & Proprietarywww.dclab.com 3
Experience the DCL Difference
DCL blends years of conversion experience with cutting-edge technology and the
infrastructure to make the process easy and efficient.
• World-Class Services
• Leading-Edge Technology
• Unparalleled Infrastructure
• US-Based Management
• Complex-Content Expertise
• 24/7 Online Project Tracking
• Automated Quality Control
• Global Capabilities
Confidential & Proprietarywww.dclab.com
We Serve a Very Broad Client Base . . .
4
Confidential & Proprietarywww.dclab.com 5
. . . Spanning All Industries
• Aerospace
• Associations
• Defense
• Distribution
• Education
• Financial
• Government
• Libraries
• Life Sciences
• Manufacturing
• Medical
• Museums
• Periodicals
• Professional
• Publishing
• Reference
• Research
• Societies
• Software
• STM
• Technology
• Telecommunications
• Universities
• Utilities
Confidential & Proprietarywww.dclab.com 6
So You’re Implementing a New CMS – Great! Now
What?
Confidential & Proprietarywww.dclab.com
Key Considerations
• Content structure
• More than likely, that structure will be XML
• Which XML schema is appropriate – DITA, DocBook, XHTML?
• Is one schema better than the others?
• What’s the plan for legacy content?
7
Confidential & Proprietarywww.dclab.com
• Multichannel publishing
• Content repurposing
• Content reuse
• Easier updating
• Avoiding multiple conversions
• Some/all of the above
8
What Are Your Business Drivers?
Confidential & Proprietarywww.dclab.com
• Can’t just be moved from one format to another
• Non-XML sources embed formatting – not applicable to other
outputs
• Tool-specific formats make your content dependent on
functionality of that tool
9
Content Reuse is Hard!
Confidential & Proprietarywww.dclab.com
So You Need XML, but Which Schema?
10
Confidential & Proprietarywww.dclab.com
So Why DITA?
• Works across all outputs
• Can be customized to different content types (educational,
financial, legal, etc.)
• Can produce both HTML5 and EPUB from DITA with open-
source tools
• Can do everything DocBook can, but reverse not true
• XHTML not a true schema
11
Confidential & Proprietarywww.dclab.com
DITA with Your CMS
• Your CMS should support several different output targets
• DITA provides the consistent structure and flexibility to do
that
• New content will be authored in DITA
• But what about…
12
Confidential & Proprietarywww.dclab.com
Your Legacy Content?!
13
Confidential & Proprietarywww.dclab.com
• Not as scary as it seems
• Prioritize and convert in stages
• Consider conversion before selecting a CMS
• Consider a pilot program before committing fully
14
Converting and Integrating Your Legacy Content
Confidential & Proprietarywww.dclab.comwww.dclab.com
Lessons from 12 DITA
Implementations
DCL Survey of 12 Companies
Confidential & Proprietarywww.dclab.com
Why DITA? Top 3 Answers
• Reduce need for composition
• Content reuse
• Reduce translation costs
16
Confidential & Proprietarywww.dclab.com
What Were the Business Drivers?
• Top answer was multi-purposing
– Ability for various teams to use content to suit their
particular needs
– Deploying chunks of content for multiple purposes
dramatically reduced costs, and improved overall reliability
17
Confidential & Proprietarywww.dclab.com
How Long Did It Take?
• Average implementation took three years
• Some took only two years; others five
• Some respondents believed implementation to be an ongoing
process (never completed)
• Across the board, however, it took far longer than planned
18
Confidential & Proprietarywww.dclab.com
When Did They Choose CMS?
• Half of respondents selected CMS at beginning of process
• Other half after running pilot programs
• Companies that implemented CMS later converted content
first then selected CMS based on data requirements
• Two companies switched to different CMS during testing
phase
19
Confidential & Proprietarywww.dclab.com
How Was Success Measured?
• Multi-purposing was top criterion, with these notable
benefits:
– Publishing content in multiple formats such as PDF and print
– Developing training and help systems
– Customizing marketing and sales collateral
– Changing styling, layout, and design while maintaining the copy
– Producing HTML and eBooks since content was standardized
20
Confidential & Proprietarywww.dclab.com
How Are You Maximizing Benefits of Content Reuse?
• Only two of the 12 companies actively reusing content
• Built extensive rewriting phase into plan
• Extended implementation time but critical to overall success
• Other companies cited size of project and drastic change to
content creation process as reasons for not implementing
reuse plan up front
21
Confidential & Proprietarywww.dclab.com
Do You Often Need Translation?
• Four of 12 companies doing heavy translation
• All reported significant saving from data standardization, even
without content reuse
• Rest of companies viewed translation as essential to future
plans
22
Confidential & Proprietarywww.dclab.com
How Did Conversion Go?
• All 12 companies felt it went smoothly
• Many didn’t do conversion at initial stages and opted for
extensive rewriting, which they regretted
• Same companies held off on legacy conversion until after
implementation, which like rewriting, wasn’t efficient
23
Confidential & Proprietarywww.dclab.com
Did DITA Work Out of the Box?
• All 12 companies reported that it did
• Only 3 reported using DITA specialization, mostly for minor
items
• All were working with technical documents, so specialization
wasn’t an issue in these cases
24
Confidential & Proprietarywww.dclab.com
Lessons Learned
• Consensus for more data clean-up before conversion
• Small pilot programs are useful
• Underestimated adapting to DITA authoring and training
needs
• Support from management crucial
25
Confidential & Proprietarywww.dclab.com 26
Confidential & Proprietarywww.dclab.com 27
Benefits and Drivers for
Content Conversion
Confidential & Proprietarywww.dclab.com 28
The Value of Structured Content
Increase Revenues
 Improve customer service
 Decrease time to market
 Expand into new markets
 Create data versatility
 Enhance discoverability
Decrease Expenses
 Increase authoring productivity
 Reduce publishing costs
 Increase information reuse
 Reduce translation costs
 Future-proof data
Successful business strategies are driven by content!
Confidential & Proprietarywww.dclab.com 29
Can your content keep up with changing technology?
 Data drives every aspect of a business from engineering and development
to maintenance, repair and operations, sales, customer service, marketing,
and more
 Documents are often converted in order to comply with law, industry
standards, or to support distribution partners and meet consumers'
expectations
 Data conversion is most desirable for its potential to lower costs by making
data easier to manage, update, reproduce, and syndicate
 Structured formatting enables content to be delivered any where at any
time on any device imaginable
Confidential & Proprietarywww.dclab.com 30
Re-purposing
Searching
Component Reuse
Enforce Data Standards
Interchange with Vendors, Customers, & World
 Creating new versions of data suitable for derivative uses
(e.g. the web, diagnostic equipment, hand-held devices,
voice devices)
 Ability to find information through text searches and
through more advanced searches that depend on context
and “understanding”
 Ability to reuse portions of data for different products and
different documentation sets
 Ability to assure that the information produced is
produced consistently and meets corporate standards
 Ability for others to use your information for
communications with others and to incorporate into
products belonging to other organizations
Various Uses for Structured Content
Confidential & Proprietarywww.dclab.com
• Plan… plan… plan
• Prepare your teams and manage attitudes and expectations
accordingly
• Phase your project for increased manageability
• Establish multiple checkpoints and test often
• DON’T GO IT ALONE!
31
Key Takeaways
Confidential & Proprietarywww.dclab.com 32
Q&A
Greg Fagan
Sales Director,
Data Conversion Laboratory
(908) 723-1884
gfagan@dclab.com
@dclaboratory

More Related Content

What's hot

Wheeles Webinar Slides - 9-16
Wheeles Webinar Slides - 9-16Wheeles Webinar Slides - 9-16
Wheeles Webinar Slides - 9-16
Dana Wheeles
 

What's hot (20)

Managing Deliverable-Specific Link Anchors: New Suggested Best Practice for Keys
Managing Deliverable-Specific Link Anchors: New Suggested Best Practice for KeysManaging Deliverable-Specific Link Anchors: New Suggested Best Practice for Keys
Managing Deliverable-Specific Link Anchors: New Suggested Best Practice for Keys
 
Converting and Integrating Content When Implementing a New CMS
Converting and Integrating Content When Implementing a New CMSConverting and Integrating Content When Implementing a New CMS
Converting and Integrating Content When Implementing a New CMS
 
Introduction to Structured Authoring
Introduction to Structured AuthoringIntroduction to Structured Authoring
Introduction to Structured Authoring
 
Anticipating Lightweight DITA
Anticipating Lightweight DITAAnticipating Lightweight DITA
Anticipating Lightweight DITA
 
Optimizing the DITA Authoring Experience
Optimizing the DITA Authoring ExperienceOptimizing the DITA Authoring Experience
Optimizing the DITA Authoring Experience
 
New Directions 2015 – Changes in Content Best Practices
New Directions 2015 – Changes in Content Best PracticesNew Directions 2015 – Changes in Content Best Practices
New Directions 2015 – Changes in Content Best Practices
 
Sprinting to Success: Why Agile and DITA Work So Well Together
Sprinting to Success: Why Agile and DITA Work So Well TogetherSprinting to Success: Why Agile and DITA Work So Well Together
Sprinting to Success: Why Agile and DITA Work So Well Together
 
Is DITA Right for You? - STC Summit 2017
Is DITA Right for You? - STC Summit 2017Is DITA Right for You? - STC Summit 2017
Is DITA Right for You? - STC Summit 2017
 
Produce Reliable Content with DITA CMS
Produce Reliable Content with DITA CMSProduce Reliable Content with DITA CMS
Produce Reliable Content with DITA CMS
 
Using Markdown and Lightweight DITA in a Collaborative Environment
Using Markdown and Lightweight DITA in a Collaborative EnvironmentUsing Markdown and Lightweight DITA in a Collaborative Environment
Using Markdown and Lightweight DITA in a Collaborative Environment
 
GraphTalks Stuttgart - Einführung in Graphdatenbanken und Neo4j
GraphTalks Stuttgart - Einführung in Graphdatenbanken und Neo4jGraphTalks Stuttgart - Einführung in Graphdatenbanken und Neo4j
GraphTalks Stuttgart - Einführung in Graphdatenbanken und Neo4j
 
Wheeles Webinar Slides - 9-16
Wheeles Webinar Slides - 9-16Wheeles Webinar Slides - 9-16
Wheeles Webinar Slides - 9-16
 
Introduction to structured authoring
Introduction to structured authoringIntroduction to structured authoring
Introduction to structured authoring
 
How to Make your Graph DB Project Successful with Neo4j Services
How to Make your Graph DB Project Successful with Neo4j ServicesHow to Make your Graph DB Project Successful with Neo4j Services
How to Make your Graph DB Project Successful with Neo4j Services
 
Streaming with Oracle Data Integration
Streaming with Oracle Data IntegrationStreaming with Oracle Data Integration
Streaming with Oracle Data Integration
 
Unleashing the value of metadata with Talend
Unleashing the value of metadata with Talend Unleashing the value of metadata with Talend
Unleashing the value of metadata with Talend
 
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachSlides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
 
Agile BI with Data Virtualization (session 2 from Packed Lunch Webinar Series)
Agile BI with Data Virtualization (session 2 from Packed Lunch Webinar Series)Agile BI with Data Virtualization (session 2 from Packed Lunch Webinar Series)
Agile BI with Data Virtualization (session 2 from Packed Lunch Webinar Series)
 
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time ResponsesDenodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
 
Lower Cost and Complexity with Azure and StorSimple Hybrid Cloud Solutions
Lower Cost and Complexity with Azure and StorSimple Hybrid Cloud SolutionsLower Cost and Complexity with Azure and StorSimple Hybrid Cloud Solutions
Lower Cost and Complexity with Azure and StorSimple Hybrid Cloud Solutions
 

Viewers also liked

Viewers also liked (13)

DITA for Small Teams: An Open Source Approach to DITA Content Management
DITA for Small Teams: An Open Source Approach to DITA Content ManagementDITA for Small Teams: An Open Source Approach to DITA Content Management
DITA for Small Teams: An Open Source Approach to DITA Content Management
 
DITA's New Thang: Going Mapless!
DITA's New Thang: Going Mapless!DITA's New Thang: Going Mapless!
DITA's New Thang: Going Mapless!
 
Minimalism Revisited — Let’s Stop Developing Content that No One Wants
Minimalism Revisited — Let’s Stop Developing Content that No One WantsMinimalism Revisited — Let’s Stop Developing Content that No One Wants
Minimalism Revisited — Let’s Stop Developing Content that No One Wants
 
10 Mistakes When Moving to Topic-Based Authoring
10 Mistakes When Moving to Topic-Based Authoring10 Mistakes When Moving to Topic-Based Authoring
10 Mistakes When Moving to Topic-Based Authoring
 
Precision Content™ Tools, Techniques, and Technology
Precision Content™ Tools, Techniques, and TechnologyPrecision Content™ Tools, Techniques, and Technology
Precision Content™ Tools, Techniques, and Technology
 
There's Gold in Them Thar Data
There's Gold in Them Thar DataThere's Gold in Them Thar Data
There's Gold in Them Thar Data
 
Using HTML5 to Deliver and Monetize Your Mobile Content
Using HTML5 to Deliver and Monetize Your Mobile ContentUsing HTML5 to Deliver and Monetize Your Mobile Content
Using HTML5 to Deliver and Monetize Your Mobile Content
 
Content Conversion Done Right Saves More Than Money
Content Conversion Done Right Saves More Than MoneyContent Conversion Done Right Saves More Than Money
Content Conversion Done Right Saves More Than Money
 
When Conversion Makes Sense
When Conversion Makes SenseWhen Conversion Makes Sense
When Conversion Makes Sense
 
Metadata Matters
Metadata MattersMetadata Matters
Metadata Matters
 
Demystifying SPL for Medical Devices
Demystifying SPL for Medical DevicesDemystifying SPL for Medical Devices
Demystifying SPL for Medical Devices
 
DITA, EPUB, and HTML5: An Update for 2015
DITA, EPUB, and HTML5: An Update for 2015DITA, EPUB, and HTML5: An Update for 2015
DITA, EPUB, and HTML5: An Update for 2015
 
Out of the Silos and Into the Farm
Out of the Silos and Into the FarmOut of the Silos and Into the Farm
Out of the Silos and Into the Farm
 

Similar to Converting and Integrating Legacy Data and Documents When Implementing a New CMS

Emea partners recruitment webinar
Emea partners recruitment webinarEmea partners recruitment webinar
Emea partners recruitment webinar
MongoDB
 

Similar to Converting and Integrating Legacy Data and Documents When Implementing a New CMS (20)

451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
 
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
Why Business is Better in the Cloud
Why Business is Better in the CloudWhy Business is Better in the Cloud
Why Business is Better in the Cloud
 
Neo4j PartnerDay Amsterdam 2017
Neo4j PartnerDay Amsterdam 2017Neo4j PartnerDay Amsterdam 2017
Neo4j PartnerDay Amsterdam 2017
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
Agile Data Architecture
Agile Data ArchitectureAgile Data Architecture
Agile Data Architecture
 
Webinar: The 5 Most Critical Things to Understand About Modern Data Integration
Webinar: The 5 Most Critical Things to Understand About Modern Data IntegrationWebinar: The 5 Most Critical Things to Understand About Modern Data Integration
Webinar: The 5 Most Critical Things to Understand About Modern Data Integration
 
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced AnalyticsADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
 
Partner Recruitment Webinar: "Join the Most Productive Ecosystem in Big Data ...
Partner Recruitment Webinar: "Join the Most Productive Ecosystem in Big Data ...Partner Recruitment Webinar: "Join the Most Productive Ecosystem in Big Data ...
Partner Recruitment Webinar: "Join the Most Productive Ecosystem in Big Data ...
 
Emea partners recruitment webinar
Emea partners recruitment webinarEmea partners recruitment webinar
Emea partners recruitment webinar
 
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy:  A Simple, Scalable Solution for Getting Started with HadoopBig Data Made Easy:  A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
 
The Future of IT Infrastructure is Hybrid and on Demand
The Future of IT Infrastructure is Hybrid and on DemandThe Future of IT Infrastructure is Hybrid and on Demand
The Future of IT Infrastructure is Hybrid and on Demand
 
Webinar: How Partners Can Benefit from our New Program (EMEA)
Webinar: How Partners Can Benefit from our New Program (EMEA)Webinar: How Partners Can Benefit from our New Program (EMEA)
Webinar: How Partners Can Benefit from our New Program (EMEA)
 
Data integration case study: Oil & Gas industry
Data integration case study: Oil & Gas industryData integration case study: Oil & Gas industry
Data integration case study: Oil & Gas industry
 
Choosing Public vs. Private vs. Hybrid Cloud Computing
Choosing Public vs. Private vs. Hybrid Cloud ComputingChoosing Public vs. Private vs. Hybrid Cloud Computing
Choosing Public vs. Private vs. Hybrid Cloud Computing
 
Modernize Your Content Publishing Process with Smart Content
Modernize Your Content Publishing Process with Smart ContentModernize Your Content Publishing Process with Smart Content
Modernize Your Content Publishing Process with Smart Content
 
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT InfrastructuresOPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
 
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data WarehouseHadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
 
10 Million Dita Topics Can't Be Wrong
10 Million Dita Topics Can't Be Wrong10 Million Dita Topics Can't Be Wrong
10 Million Dita Topics Can't Be Wrong
 

More from dclsocialmedia

Finding Role Clarity in UX Chaos
Finding Role Clarity in UX ChaosFinding Role Clarity in UX Chaos
Finding Role Clarity in UX Chaos
dclsocialmedia
 
Coming Up to Speed with XML Authoring in Adobe FrameMaker
Coming Up to Speed with XML Authoring in Adobe FrameMakerComing Up to Speed with XML Authoring in Adobe FrameMaker
Coming Up to Speed with XML Authoring in Adobe FrameMaker
dclsocialmedia
 

More from dclsocialmedia (6)

Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
 
Converting Your Legacy Data to S1000D
Converting Your Legacy Data to S1000DConverting Your Legacy Data to S1000D
Converting Your Legacy Data to S1000D
 
Marketing and Strategy and Bears... oh my!
Marketing and Strategy and Bears... oh my!Marketing and Strategy and Bears... oh my!
Marketing and Strategy and Bears... oh my!
 
Finding Role Clarity in UX Chaos
Finding Role Clarity in UX ChaosFinding Role Clarity in UX Chaos
Finding Role Clarity in UX Chaos
 
Managing Documentation Projects in Nearly Any Environment
Managing Documentation Projects in Nearly Any EnvironmentManaging Documentation Projects in Nearly Any Environment
Managing Documentation Projects in Nearly Any Environment
 
Coming Up to Speed with XML Authoring in Adobe FrameMaker
Coming Up to Speed with XML Authoring in Adobe FrameMakerComing Up to Speed with XML Authoring in Adobe FrameMaker
Coming Up to Speed with XML Authoring in Adobe FrameMaker
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

Converting and Integrating Legacy Data and Documents When Implementing a New CMS

  • 1. Confidential & Proprietarywww.dclab.comwww.dclab.com Converting and Integrating Content When Moving to a New CMS Greg Fagan, Sales Director, Data Conversion Laboratory (DCL)
  • 2. Confidential & Proprietarywww.dclab.com 2 Valuable Content Transformed • Document Digitization • XML and HTML Conversion • eBook Production • Hosted Solutions • Big Data Automation • Conversion Management • Editorial Services • Harmonizer
  • 3. Confidential & Proprietarywww.dclab.com 3 Experience the DCL Difference DCL blends years of conversion experience with cutting-edge technology and the infrastructure to make the process easy and efficient. • World-Class Services • Leading-Edge Technology • Unparalleled Infrastructure • US-Based Management • Complex-Content Expertise • 24/7 Online Project Tracking • Automated Quality Control • Global Capabilities
  • 4. Confidential & Proprietarywww.dclab.com We Serve a Very Broad Client Base . . . 4
  • 5. Confidential & Proprietarywww.dclab.com 5 . . . Spanning All Industries • Aerospace • Associations • Defense • Distribution • Education • Financial • Government • Libraries • Life Sciences • Manufacturing • Medical • Museums • Periodicals • Professional • Publishing • Reference • Research • Societies • Software • STM • Technology • Telecommunications • Universities • Utilities
  • 6. Confidential & Proprietarywww.dclab.com 6 So You’re Implementing a New CMS – Great! Now What?
  • 7. Confidential & Proprietarywww.dclab.com Key Considerations • Content structure • More than likely, that structure will be XML • Which XML schema is appropriate – DITA, DocBook, XHTML? • Is one schema better than the others? • What’s the plan for legacy content? 7
  • 8. Confidential & Proprietarywww.dclab.com • Multichannel publishing • Content repurposing • Content reuse • Easier updating • Avoiding multiple conversions • Some/all of the above 8 What Are Your Business Drivers?
  • 9. Confidential & Proprietarywww.dclab.com • Can’t just be moved from one format to another • Non-XML sources embed formatting – not applicable to other outputs • Tool-specific formats make your content dependent on functionality of that tool 9 Content Reuse is Hard!
  • 10. Confidential & Proprietarywww.dclab.com So You Need XML, but Which Schema? 10
  • 11. Confidential & Proprietarywww.dclab.com So Why DITA? • Works across all outputs • Can be customized to different content types (educational, financial, legal, etc.) • Can produce both HTML5 and EPUB from DITA with open- source tools • Can do everything DocBook can, but reverse not true • XHTML not a true schema 11
  • 12. Confidential & Proprietarywww.dclab.com DITA with Your CMS • Your CMS should support several different output targets • DITA provides the consistent structure and flexibility to do that • New content will be authored in DITA • But what about… 12
  • 14. Confidential & Proprietarywww.dclab.com • Not as scary as it seems • Prioritize and convert in stages • Consider conversion before selecting a CMS • Consider a pilot program before committing fully 14 Converting and Integrating Your Legacy Content
  • 15. Confidential & Proprietarywww.dclab.comwww.dclab.com Lessons from 12 DITA Implementations DCL Survey of 12 Companies
  • 16. Confidential & Proprietarywww.dclab.com Why DITA? Top 3 Answers • Reduce need for composition • Content reuse • Reduce translation costs 16
  • 17. Confidential & Proprietarywww.dclab.com What Were the Business Drivers? • Top answer was multi-purposing – Ability for various teams to use content to suit their particular needs – Deploying chunks of content for multiple purposes dramatically reduced costs, and improved overall reliability 17
  • 18. Confidential & Proprietarywww.dclab.com How Long Did It Take? • Average implementation took three years • Some took only two years; others five • Some respondents believed implementation to be an ongoing process (never completed) • Across the board, however, it took far longer than planned 18
  • 19. Confidential & Proprietarywww.dclab.com When Did They Choose CMS? • Half of respondents selected CMS at beginning of process • Other half after running pilot programs • Companies that implemented CMS later converted content first then selected CMS based on data requirements • Two companies switched to different CMS during testing phase 19
  • 20. Confidential & Proprietarywww.dclab.com How Was Success Measured? • Multi-purposing was top criterion, with these notable benefits: – Publishing content in multiple formats such as PDF and print – Developing training and help systems – Customizing marketing and sales collateral – Changing styling, layout, and design while maintaining the copy – Producing HTML and eBooks since content was standardized 20
  • 21. Confidential & Proprietarywww.dclab.com How Are You Maximizing Benefits of Content Reuse? • Only two of the 12 companies actively reusing content • Built extensive rewriting phase into plan • Extended implementation time but critical to overall success • Other companies cited size of project and drastic change to content creation process as reasons for not implementing reuse plan up front 21
  • 22. Confidential & Proprietarywww.dclab.com Do You Often Need Translation? • Four of 12 companies doing heavy translation • All reported significant saving from data standardization, even without content reuse • Rest of companies viewed translation as essential to future plans 22
  • 23. Confidential & Proprietarywww.dclab.com How Did Conversion Go? • All 12 companies felt it went smoothly • Many didn’t do conversion at initial stages and opted for extensive rewriting, which they regretted • Same companies held off on legacy conversion until after implementation, which like rewriting, wasn’t efficient 23
  • 24. Confidential & Proprietarywww.dclab.com Did DITA Work Out of the Box? • All 12 companies reported that it did • Only 3 reported using DITA specialization, mostly for minor items • All were working with technical documents, so specialization wasn’t an issue in these cases 24
  • 25. Confidential & Proprietarywww.dclab.com Lessons Learned • Consensus for more data clean-up before conversion • Small pilot programs are useful • Underestimated adapting to DITA authoring and training needs • Support from management crucial 25
  • 27. Confidential & Proprietarywww.dclab.com 27 Benefits and Drivers for Content Conversion
  • 28. Confidential & Proprietarywww.dclab.com 28 The Value of Structured Content Increase Revenues  Improve customer service  Decrease time to market  Expand into new markets  Create data versatility  Enhance discoverability Decrease Expenses  Increase authoring productivity  Reduce publishing costs  Increase information reuse  Reduce translation costs  Future-proof data Successful business strategies are driven by content!
  • 29. Confidential & Proprietarywww.dclab.com 29 Can your content keep up with changing technology?  Data drives every aspect of a business from engineering and development to maintenance, repair and operations, sales, customer service, marketing, and more  Documents are often converted in order to comply with law, industry standards, or to support distribution partners and meet consumers' expectations  Data conversion is most desirable for its potential to lower costs by making data easier to manage, update, reproduce, and syndicate  Structured formatting enables content to be delivered any where at any time on any device imaginable
  • 30. Confidential & Proprietarywww.dclab.com 30 Re-purposing Searching Component Reuse Enforce Data Standards Interchange with Vendors, Customers, & World  Creating new versions of data suitable for derivative uses (e.g. the web, diagnostic equipment, hand-held devices, voice devices)  Ability to find information through text searches and through more advanced searches that depend on context and “understanding”  Ability to reuse portions of data for different products and different documentation sets  Ability to assure that the information produced is produced consistently and meets corporate standards  Ability for others to use your information for communications with others and to incorporate into products belonging to other organizations Various Uses for Structured Content
  • 31. Confidential & Proprietarywww.dclab.com • Plan… plan… plan • Prepare your teams and manage attitudes and expectations accordingly • Phase your project for increased manageability • Establish multiple checkpoints and test often • DON’T GO IT ALONE! 31 Key Takeaways
  • 32. Confidential & Proprietarywww.dclab.com 32 Q&A Greg Fagan Sales Director, Data Conversion Laboratory (908) 723-1884 gfagan@dclab.com @dclaboratory

Editor's Notes

  1. Good afternoon, everyone! Thanks for joining us for this webinar. Today we’re going to discuss the best formats and practices for content conversion when you’re migrating to a new content management system. I’m Greg Fagan, and I’m the Sales Director for the publishing and financial industries at DCL. Because you’re all busy people, I’ve tried to keep this presentation as concise as possible. I’ll talk for about 15-20 minutes and then open the floor to your questions.
  2. Just some quick background information on DCL. We’re content conversion experts. We take content in any format you might have it and convert it to reusable formats for digital output such as XML, SGML, HTML5, DITA, and EPUB. We not only convert your content, but we can enrich it to make it more discoverable, usable, and deliverable to any output format or device. Aside from conversion, we offer a suite of services, including hosting, editorial services, and project management.
  3. Our deep experience, sophisticated infrastructure, and ferocious commitment to quality are what set us apart from the pack.
  4. We serve a broad range of clients. Myriad large, global companies from many different sectors entrust their content to us.
  5. And our clients span a wide array of industries, which speaks to our familiarity and fluency with many different XML schemas. Publishers, societies, pharmaceutical companies, defense contractors, and government agencies are just a few of the types of clients and industries we serve.
  6. So you’re implementing a new content management system. Or maybe you’re upgrading an existing one. This means that you’re serious about organizing your content to make it more searchable and retrievable, and that you’re probably keen to reuse and repurpose your content in multiple ways and to multiple outputs. That’s good business practice, and it’s something every organization that provides content should do. Delivering content to your internal and external users in the way that they want it is critical to your overall success. So now that you’ve decided to move forward with this new CMS, what’s next?
  7. Any content management system requires content to be in some kind of structured format. In most industries, from publishing to financial services to aerospace, just to list a few examples, that structure will likely be some flavor of XML. But which XML schema should you use? Is one better or more appropriate than the others? Sure, you have a plan for the new content that you’ll be entering into your CMS, and you very likely have an authoring or data input tool designed to work with that system. But what about your legacy content? How many years’ worth of content do you have? How is it currently stored – mostly paper, PDFs, spreadsheets – and what’s your plan for integrating it into your CMS? Do you need to convert all of your content now, or can you prioritize and do it in stages? These are all important considerations, and hopefully you’ve thought about them before you decided to implement a new CMS.
  8. Think about your business drivers for developing a CMS. Is the goal to publish to multiple channels – print, Web, mobile apps, streaming audio/video – and to both internal and external users? Is it to reuse your content across your enterprise from a single source so that you can streamline content creation and avoid redundancies? Do you want to make updating your content easier? Or maybe you’ve seen the inefficiency of converting your content multiple times for different outputs. More than likely, your business drivers involve some combination or even all of these reasons. After all, the whole point of implementing a CMS is to get your content into a structure that provides greater control and flexibility.
  9. In addition to the financial challenges of converting content from one source to another multiple times, the content cannot simply be “moved” from one design to another. Content for books is written to be read from beginning to end. This approach creates dependencies that make it difficult to use the same content in a different order or for a different purpose, such as a mobile app. For example, wording such as “on the previous page” is not appropriate after conversion to digital, reflowable text. Non-XML sources embed the formatting into the content. When an author applies a format in a source file, such as an InDesign file, the styling is embedded into the content. Because this styling is not usually applicable in another deliverable, the formatting must be updated for each deliverable type. Tool-specific files lock your content into a dependency on the functionality, including output generation, for that tool. All of these factors contribute to limiting the delivery possibilities for your content.
  10. For the most flexibility across all content types, my recommendation would be DITA, which, if you’re not familiar with it, stands for Darwin Information Typing Architecture. DITA is an open standard for creating, managing, and publishing modular content, which is what will be stored in your content management system. It supports the definition of new content types within a comprehensive content ecosystem, and it has been increasingly adopted across a wide range of content disciplines and industries.
  11. A few years ago, the common wisdom was that if you were developing narrative content, you should use DocBook, and if you were developing modular or topic-based content, you should use DITA. That was true to an extent but was always somewhat misleading, in my view. Books can be written with DITA and modular content can be authored with DocBook. DITA has really advanced in the last couple of years, to the point where I think it’s superior to DocBook, especially when implementing a CMS. DITA can be published to all outputs, and it can be easily customized (or specialized, to use the preferred terminology of DITA advocates) to many different content types, such as educational, financial, and legal, just to name a few. That’s important for the development of mobile content, apps, and training content. You can produce both HTML5 and EPUB with readily available open-source tools from DITA. And while both have their strong points, DITA is the more flexible schema of the two: it can do everything DocBook can, but the reverse isn’t always true. For example, DITA is better-suited to granular storage of content that you see in a most content management systems. What about XHTML? Although it’s often thought of as one, XHTML is not a true schema; it’s really a document styling format and thus not structured enough for a CMS.
  12. One of the essential functions of any content management system is that it should support most if not all output targets, and DITA provides both a consistent XML structure and the flexibility of specialization to do that. How many of us foresaw the advent of all the current deliverable types five years ago? Can you predict the quantity or variation of the deliverables your company will need to meet changing user needs in the next five years? If you separate your content from its delivery now, which is what a good CMS does, then you don’t have to try to predict the future; instead, you can future-proof your content so that it’s ready to be transformed into whatever outputs your customers need. Now it’s one thing to do that with new content. Piece of cake, right? You simply set up templates and tools that integrate with your CMS. But what about…
  13. That’s a different challenge. So let’s talk about that.
  14. Integrating your legacy content with a new CMS is no easy task, but with a logical, well-planned approach, it’s not as daunting as it might seem. A phased approach makes a lot of sense, as it helps you to avoid costly mistakes, like realizing you’ve implemented a system that doesn’t work before you get too far down the road. Prioritizing and converting your content in stages, doing some conversion and learning more about your content requirements before choosing a CMS, and running small pilot programs before getting locked into a CMS, at a cost of hundreds of thousands of dollars, are all good examples of a phased approach.
  15. With that in mind, let’s discuss some relevant DITA and CMS lessons in detail. DCL recently conducted a series of interviews with DITA implementers at twelve companies. The intent of the study was to better understand the reality of live implementations vs. the perceptions that exist in the industry. We promised anonymity so we could ensure the results would be representative of the group’s actual findings.
  16. The three most popular answers were: Reduced need for composition. That’s a rather publishing-centric term that refers to typesetting, but in this context, it can refer to any step that gets content or data ready for output. Content reuse and reduced translation costs. All three resulted in cost savings, decreased time to market and improved internal efficiencies. This isn’t surprising. We know from our own years of experience that having content in a structured format in a content management system has many benefits, with these three among the most cited.
  17. The top answer to this question was the ability for various teams to multi-purpose content to suit their particular requirements. Utilizing chunks of content for multiple purposes dramatically reduced costs and improved overall reliability. I referred to multi-purposing in an earlier slide, and it’s highly likely that it’s at or near the top of any organization’s list for moving to a structured content format within a CMS.
  18. [Read bullets.] This one came as a surprise to us. But there are ways to speed the process. After all… time is money!
  19. Half of the respondents selected their CMS at the beginning of the process. The other half followed after running various pilot programs. The companies that selected a CMS later started doing conversion and getting comfortable with the data first, then selected a management system when they had a better understanding of their own data requirements. Two of the companies had switched from their initial selection to another CMS during the testing phase, which highlights the value and wisdom of running small pilot programs before full implementation. Absent that testing, they might very well have continued down their respective paths with content management systems that weren’t meeting their needs. That would have meant large sums of money spent for poorly implemented solutions. And it also would have resulted in walking papers for those decision makers.
  20. Once again, the ability to multi-purpose content was the number one criteria for measuring success and return on investment. Some of the notable savings came from the improved ease and efficiency of the following: [Read bullets 2-6.]
  21. Only two of the twelve companies we interviewed were actively taking advantage of content reuse. Yes… we were surprised by this as well. Those two companies had decided upfront to build an extensive re-writing phase into their implementation plan. While this additional phase extended the implementation time, the upfront planning was critical to the success of their overall project. The most common reasons for not implementing a reuse plan up front included projects being too large for anyone to manage or requiring too much rewriting. Notably, many also mentioned the drastic change that would be required for the content creators to move to a more modular writing mode and to work more collaboratively and with more guidelines than typically they were accustomed to.
  22. Four of the twelve companies were actively doing a lot of translation. All four reported major savings. Even without content reuse, the savings of standardized data in terms of translation were vast and long-lasting. The eight companies who did not translate their data stated that it was a likely future endeavor but that right now, even with globalization, they were able to get away with English alone. All of the respondents said that translation was definitely a future requirement.
  23. All twelve companies felt that the conversion went smoothly. However, many didn’t do much conversion in their initial stages. Several had decided, to their later regret, to rewrite most materials from scratch, which simply took way too much time. These organizations ultimately left most of their legacy data unconverted until after the CMS implementation. Two companies initially thought that having the writers do it themselves would be good training, but noted that, in retrospect, this wasn’t a good idea.
  24. A major reason for attraction to DITA was that it works out of the box, at least for most. Others can expand its benefits by applying specializations when necessary. The companies we interviewed all agreed that for their materials DITA pretty much worked out of the box, and that standard composition software was for the most part suitable for their needs. Only three of the companies reported using specialization, and those were for minor items like customized document covers. Of course, these organizations were all working with technical documents of one kind or another, which are the type of documents DITA was originally designed for. Other kinds of documents would likely need more specialization, although there are a number of emerging standardized “specializations” for different document types.
  25. Let’s talk about lessons learned. When asked what they would do differently, the most common response was “more cleanup of data before conversion.” Many wanted smaller—and simpler—pilots, as well as more time to experiment. They felt they had focused too much on the complex outliers in their pilot, and jumped into production too quickly without enough time to adjust for lessons learned in the pilot. Underestimating the human factor was a common note. Allowing more time for people to adapt to the new system and the philosophy of DITA, as well as earlier training, were also prominent suggestions in hindsight. Finally, buy-in and support from upper management was viewed as critical by all respondents.
  26. Here is a table of common content pain points and how DITA implementation solves them.
  27. So what are the benefits and drivers for content conversion, specifically when converting to a new CMS?
  28. Well-structured content has many benefits, with the most important being that it can increase revenue by decreasing time to market and enabling new product development. It also decreases expenses, such as publishing and translation costs, over time, which makes it a smart investment. Often legacy content is more complex and difficult to manage than new content. In many cases, it was designed for one specific output and not much thought was given to proper storage, retrieval, or reusability. There are also different document types, formats, and levels of complexity, like heavy math and tabular material that was never meant for digital output. This is where the help of a trusted partner can be invaluable in helping you identify, categorize, and convert your content to a well-structured format. Your content should drive your business strategy.
  29. But you can’t structure your content and think your work is done. It’s an ongoing process to keep up with industry standards, compliance, and constantly evolving outputs. Once the major work is done, however, the changes are much easier to manage, and your content is ready for delivery to any output. Content drives every aspect of your business, so make sure yours is ready to take you in the right direction.
  30. Structured content has many uses, with reuse and repurposing the most important in my mind. Why? Because they generate revenue. The others are important, too. Different industries have differing degrees of importance, but money talks in all of them. When your content is structured at a granular level, you can assemble the different components into new products and new revenue sources.
  31. So the key considerations for conversion when implementing a CMS are as follows: You must plan thoroughly and then be prepared to adjust once theory turns into practice. To quote General Dwight Eissenhower, “No battle was ever won according to plan, but no battle was ever one without one.” Prepare your teams and manage expectations. Try to anticipate problems before they occur. That’s easier said than done sometimes, but it’s the key to good project management. Implement your conversion and your CMS in phases. Pilot projects are a great way to discover and head off potential problems before you head too far down the wrong path. Establish multiple checkpoints and milestones and test your system often with real users and real content. The people who will have to use the CMS every day are the people who will provide the most valuable feedback. And finally, we give this advice often, but it’s always worth repeating: Don’t go it alone! For a project of this scope, you’ll need outside expertise. Bringing in the right expertise is almost always more cost-effective than trying to manage every aspect of a large-scale project inhouse.
  32. I’d like to thank you for tuning in today. Feel free to contact me directly anytime; my contact information is there on the screen. Now I’m happy to take your questions.