PDF/A
Addressing the challenges of digitizing and preserving
paper-based documents in the Government of Canada (GoC)
Jeff Brand
October 26, 2012




© ADLIB 2012. THIS SLIDE PRESENTATION CONTAINS PROPRIETARY AND/OR CONFIDENTIAL INFORMATION.
Adlib – Who We Are




•   Software company – Burlington, Ontario, Canada
•   Leading expert in document-to-PDF transformation
•   Improve document intensive business processes
•   10+ years of experience
•   5,000+ Customers Worldwide
•   50+ Countries
•   100+ Partners
Bringing Value To Many Industries

•   Financial Services
•   Life Sciences
•   Legal
•   Mfg
•   Health Care
•   Gov’t
•   Other
Key Partners
Agenda

 Physical and Digital Media

 • Physical and Digital Archiving – Advantages and Disadvantages
 • Overview of PDF and PDF/A

 Approaches to Digitization

 • Maximize the retention of knowledge
 • Consider Security Implications

 Management of Digitized artifacts

 • Revisit Retention and Disposition policies
 • Maximize the value to Canada

 Opportunities for Savings

 • Time & Cost
 • Increased Flexibility to mitigate future costs

 Summary
Physical Archiving
• Preserving and Storing the original or
  exemplary specimen in original, physical form
Physical Archiving - Advantages

             Assuming time has not deteriorated the
             media…

             • The physical archive is the original so
               there is no variance from the original

             • Relatively little / no question about
               authenticity, accuracy

             • Technology – All you need are eyes.
Physical Archiving - Advantages




• Sentimental Value




Some original documents should be preserved in
physical form for as long as possible...
Physical Archiving - Disadvantages




Time




Time heals all wounds…and destroys all
documents
Physical Archiving - Disadvantages




Cost




Elaborate and costly physical storage and
preservation
Physical Archiving - Disadvantages




Availability




There’s only one. Options to make it available to
citizens are limited and require manual effort
Physical Archiving - Disadvantages




Effort to retrieve




Locating relevant documents relies on an
appropriate and accurate taxonomy applied
during onboarding
Physical Archiving - Disadvantages




Environmental
Impact




Preserving documents requires chemicals, such as
3M’s Novec 7100 Engineering Fluid
Digital Archiving
• Preserving and Storing the original or
  exemplary specimen in a digitized form.
Digital Archiving - Advantages




Space

128GB USB Flash Drive ($80 CDN) = 56,140,800 pages




An entire warehouse of text can fit in a USB
Thumb Drive
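The page count above can be sanity-checked with a little arithmetic. The sketch below only works backwards from the two figures on the slide (128 GB and 56,140,800 pages) to the page size they imply; whether the original calculation used decimal or binary gigabytes is not stated, so both are shown as assumptions.

```python
# Figures taken from the slide.
PAGES = 56_140_800
CAPACITY_GB = 128


def bytes_per_page(bytes_per_gb: int) -> float:
    """Return the average page size implied by the slide's figures."""
    return CAPACITY_GB * bytes_per_gb / PAGES


# Assumption: the slide does not say which definition of "GB" was used.
decimal_estimate = bytes_per_page(10**9)  # 1 GB = 1,000,000,000 bytes
binary_estimate = bytes_per_page(2**30)   # 1 GB = 1,073,741,824 bytes

print(f"Pages per GB: {PAGES // CAPACITY_GB:,}")
print(f"Implied page size (decimal GB): {decimal_estimate:,.0f} bytes")
print(f"Implied page size (binary GB):  {binary_estimate:,.0f} bytes")
```

Either way the figure corresponds to a little over 2 KB per page, which is plausible for plain text but far smaller than a scanned page image.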
Digital Archiving - Advantages




Availability




Digital copies can be shared with an unlimited
number of people with little or no effort.
Digital Archiving - Advantages




Effort




Technologies such as full-text search make
finding relevant documents easier and less
dependent on taxonomy
Digital Archiving - Advantages




Automation




Automatically execute Retention and Disposition
policies, Audit and more without manual
intervention
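As a rough illustration of what automated disposition could look like, the sketch below flags records whose retention period has elapsed. The `Record` fields, the helper names and the fixed-number-of-years rule are all hypothetical; real retention schedules are usually driven by policy metadata and trigger events rather than a simple creation date.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Record:
    """Hypothetical digital record with a simple retention rule."""
    record_id: str
    created: date
    retention_years: int


def disposition_date(record: Record) -> date:
    """Return the date on which the record becomes eligible for disposition."""
    target_year = record.created.year + record.retention_years
    try:
        return record.created.replace(year=target_year)
    except ValueError:
        # 29 February in a non-leap target year: fall back to 28 February.
        return record.created.replace(year=target_year, day=28)


def due_for_disposition(records: list[Record], today: date) -> list[str]:
    """List the IDs of records whose retention period has elapsed."""
    return [r.record_id for r in records if disposition_date(r) <= today]


records = [
    Record("A", date(2000, 1, 1), 10),
    Record("B", date(2010, 6, 1), 7),
]
print(due_for_disposition(records, date(2012, 10, 26)))
```

A scheduled job running a check like this against the archive's metadata is one way such a policy could be executed and audited without manual intervention.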
Digital Archiving - Advantages




Flexibility




Flexibility to support changing Policy and
Requirements easily
Digital Archiving - Advantages




Cost




Digital archives typically cost 90% less to operate
and maintain
Digital Archiving - Challenges



• Digital Dark Age

• Wide variety of formats that require special
  technology to view

• Reduced Sentimental Value
Digital Archiving - Considerations




Digital Dark Age




Ensuring files are accessible tomorrow…
Digital Archiving - TIFF
Tagged Image File Format

• Used by FAX Machines (CCITT Group 4)
• Very common image format
• Supports multiple pages
• Significant increase in file size for digitally-born
  content
• No Search capability
• Not designed for Long-Term Archiving
Digital Archiving – PDF/A
Portable Document Format
(For Archive)
• Adopted by ISO for long-term preservation of
  documents
• Based on PDF – the most popular document
  format on the Web today
• Highest-Quality representation of document
• Smallest possible file sizes
• Guaranteed to look the same forever
• Universally Viewable – Hardware / Software
  independent
What is PDF?


  Portable Document Format

  Originally created by Adobe in the 1990s; became an
  open ISO standard (ISO 32000-1) in 2008



  The most popular file format on the web today   (FileInfo.com)
What else is PDF Used For?


•   Contracts
•   Agreements
•   Sales Proposals
•   Product Literature
•   Publications
•   Reports
•   Standard Operating Procedures
•   Long-Term Archiving
•   Sharing documents and content with others
•   So Much More
Isn’t PDF Free?




       Many applications can save to PDF
Isn’t PDF Free?




    A quick Google search shows dozens of free
          applications for creating PDF…
Isn’t PDF Free?            - Can your doc change?

Original Excel Chart              Free PDF Rendition




     Fidelity or quality of conversion is often the cost
Isn’t PDF Free?         - Can your doc change?




     Original Word Doc            Free PDF Rendition

Content Re-Flow - Font Substitution - Complex Formats
Isn’t PDF Free?                - Can you comply?




  PDF features required for compliance, or even for optimal
  document conversion, are often missing in free or low-cost
  solutions
Isn’t PDF Free?                  - Can you merge?




It can be difficult and time-consuming to merge the content
     from multiple applications into a single document.

        Few if any free PDF solutions enable this.
Isn’t PDF Free?              - Can you keep up?




Workers spend far too much time dealing with low quality
       and manual PDF rendering technologies
Isn’t PDF Free?

• Free or Low-Cost software can cost you:
  • Hours of lost productivity

  • Lost opportunities

  • Miscommunication

  • Business delays

  • Fines
PDF/A
PDF/A - What is it?
• A stricter subset of the PDF specification

• Specifically designed for the purpose of long-term
  preservation of documents

• Audio, video, JavaScript, executables, encryption and
  external references are all restricted

• Designed to be 100% Self Contained – All fonts must
  be embedded
PDF/A - What is it?
Based on PDF, Initially Released in 2005
(3 years ahead of PDF as an ISO Standard!)


• PDF/A-1 (a/b)
    • Based on PDF 1.4 Specification
• PDF/A-2 (a/b/u)
    • Based on ISO 32000-1
    • JPEG 2000, transparency, layers, OpenType fonts,
      embedded PDF/A files
• PDF/A-3 (a/b/u)
    • Arbitrary files can be embedded
PDF/A - What is it?
What makes a PDF a PDF/A?

• A special metadata entry (in the document's XMP
  metadata) indicating that the document presents
  itself as PDF/A

• Compliance with the PDF/A standard
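The first of these two points can be illustrated with a short sketch. PDF/A files identify themselves through the `pdfaid:part` and `pdfaid:conformance` properties in their XMP metadata; the function below (a hypothetical helper, not part of any library) simply scans the raw bytes for them, assuming the XMP packet is stored uncompressed. It reports only what the file claims to be. Confirming actual compliance, the second point, needs a dedicated PDF/A validator.

```python
import re

# XMP may express the properties as attributes (pdfaid:part="1")
# or as elements (<pdfaid:part>1</pdfaid:part>); accept both forms.
_PART = re.compile(rb'pdfaid:part(?:\s*=\s*["\']|\s*>)\s*(\d)')
_CONFORMANCE = re.compile(rb'pdfaid:conformance(?:\s*=\s*["\']|\s*>)\s*([A-Za-z])')


def claimed_pdfa_level(data: bytes):
    """Return (part, conformance) as claimed in the XMP metadata, or None.

    This checks the claim only; it does not validate the file.
    """
    part = _PART.search(data)
    if part is None:
        return None
    conformance = _CONFORMANCE.search(data)
    level = conformance.group(1).decode("ascii").upper() if conformance else None
    return int(part.group(1)), level


# Minimal made-up fragment standing in for the bytes of a real file.
sample = (
    b"%PDF-1.4 ... <rdf:Description>"
    b"<pdfaid:part>1</pdfaid:part>"
    b"<pdfaid:conformance>B</pdfaid:conformance>"
    b"</rdf:Description> ..."
)
print(claimed_pdfa_level(sample))
```

For a real file, the same function could be applied to the result of `open(path, "rb").read()`.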
PDF/A – Disadvantages




File Size – Embedding fonts in each document
means file sizes are larger when compared to
PDF

(This is still significantly better than alternatives such as TIFF)
PDF/A – Summary


• PDF provides many benefits over alternatives
  such as TIFF
  •   Small size
  •   High-quality
  •   Searchable
  •   Highly viewable
  •   Portable


• PDF/A Builds on this and ensures the long-term
  viability of content stored in this format
The AIIM Document Life Cycle
[Diagram: the AIIM document life cycle, annotated "Optimize with Searchable Content". Callouts: OCR (searchable content) and metadata retention; support for PDF/A and TIFF; format PDF, enhancements & watermarks, document assembly, personalization, security & approvals.]
PDF/A at Library and Archives Canada
[Diagram, shown in several builds. Components: four input channels (Upload, FTP, eMail, Scan), each with its own Module; Staging Repositories; Structured Data; DAM and DAM Storage; Data Warehouse; Metadata & Social Models; Templates; Web Store (local/cloud); Web; and three Services blocks across the top. Each build places the Adlib label at a different point in the diagram: beside the input modules, above each of the three Services blocks, beside Metadata & Social Models, and beside the Web Store.]
Digitization
Preparation – Typical Document Process
Digitization – Processing Large Volumes
• Digitizing entire libraries of content can be more
  than daunting but help is available:


        Seek out industry experts to ensure a
         successful transition of knowledge
Digitization – Processing Large Volumes
• Digitizing entire libraries of content can be more
  than daunting but help is available:

   • In-Sourcing and Out-Sourcing : Build a plan
     of action that considers Security
     requirements
      • Is the content potentially sensitive?
      • Is there risk of loss?
      • Is there a risk of contamination / degradation of
        the original content?
Digitization – Processing Large Volumes
• Digitizing entire libraries of content can be more
  than daunting but help is available:


   • Hardware & Software Investments
      • What do you need today and tomorrow?
      • Consider leasing for short-term
        requirements
      • Provision for the future
Digitization – Processing Large Volumes
• Measure Twice, Cut Once
  • Plan ahead and consider the future use of the
    content when defining requirements
  • Understand the entire lifecycle of the content when
    architecting the process
     •   How long will we keep it?
     •   How will we share it?
     •   How will people find it?
     •   How will we dispose of it?
     •   Will we maintain the originals after digitization?
     •   What are the specific requirements for each step in the
         process?
Digitization – Processing Large Volumes
• Start with Quality

   • Pay special attention to the digitization process
   • Higher quality at the IMAGING stage pays off
   • Files can be reduced as necessary later, you can
     never ADD quality
   • Consider pre-processing when scanning documents
     of questionable quality
   • Ensure highly accurate OCR is applied before, or as
     part of, onboarding into the system
Digitization – Maintaining Taxonomy
Classification and indexes need to be maintained,
but how?

• Purely Physical
  • Index Cards, Catalogs, Within Content
• Modernized Physical
  • Library systems & databases
Digitization – Maintaining Taxonomy


This is often achieved by making the classification
data available on a cover sheet in front of each
document.

This can be extracted from the Library System /
DB, or pulled directly from an Index Card and
even processed from a Catalog (Even if it’s
physical!)
Digitization – Approaches
There are two approaches to digitizing a collection:

1. Batch
  •   Everything is performed in one or multiple batches
      and the sequence of batching is pre-determined


2. Scan-On-Demand
  •   More opportunistic, existing Archives are digitized
      as requested
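The difference between the two approaches can be sketched in a few lines. This is only an illustration under simplifying assumptions: items are identified by plain strings, "digitizing" is reduced to bookkeeping, and the names `batch_plan` and `ScanOnDemand` are hypothetical.

```python
def batch_plan(collection, batch_size):
    """Batch: split the whole collection into batches in a pre-determined order."""
    items = sorted(collection)  # the sequence is fixed up front
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]


class ScanOnDemand:
    """Scan-on-demand: an item is digitized only when someone requests it."""

    def __init__(self, collection):
        self.pending = set(collection)
        self.digitized = set()

    def request(self, item):
        """Return True if this request triggered a scan, False if already digitized."""
        if item in self.digitized:
            return False
        if item not in self.pending:
            raise KeyError(f"{item!r} is not part of the collection")
        self.pending.remove(item)
        self.digitized.add(item)
        return True


print(batch_plan(["census-1911", "atlas-1906", "gazette-1867"], batch_size=2))

archive = ScanOnDemand(["census-1911", "atlas-1906", "gazette-1867"])
archive.request("atlas-1906")
print(sorted(archive.pending))
```

Batch processing finishes the whole collection on a known schedule, while scan-on-demand spends effort only on material people actually ask for and leaves the rest in physical form until it is requested.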
Digitization - Security
Preventing Loss
  •   Chain of custody
  •   Limited transportation choices
  •   Escorted Content


Selective Outsourcing
  •   Assess the risk
  •   Employ multiple tiers for Outsourcing
  •   In-Source for the most critical artifacts
Management of Digitized Artifacts
• Revisit Retention and Disposition Policies
  • Can we keep digital records longer? Indefinitely?


• Maximizing the value to Canada
  • Making content available to Canadians
  • Using Search to maximize value and enhance
    classification paradigms in use today
Sharing Canada’s Digitized Artifacts



                 Maximizing the value to
                 Canada:

                   • Education

                   • Legal

                   • Innovation
Cost Savings


• Physical Storage

• Management and Execution of Retention and
  Disposition Policies

• Flexibility to support changing Policy and
  Requirements easily
Cost Savings
Price per GB

  Year    Price per GB
  1990      $36,659.20
  1995         $819.20
  2000       $1,433.60
  2005          $10.00
  2010           $0.10
  2012           $0.05
  2020           $0.02

[Chart: price per GB by year, 1990 to 2020; vertical axis from $0 to $40,000.]
Cost Savings
[Chart: price per GB, zoomed in on recent years: 2010 $0.10, 2012 $0.05, 2020 $0.02; vertical axis from $0 to $0.12.]
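To make the trend concrete, the sketch below applies the price-per-GB figures exactly as printed on the slides to a hypothetical 1,000 GB archive. The archive size is an assumption chosen only for illustration, and the result covers raw storage media, not the cost of operating an archive.

```python
# Price per GB in dollars, as printed on the slides.
PRICE_PER_GB = {
    1990: 36_659.20,
    1995: 819.20,
    2000: 1_433.60,
    2005: 10.00,
    2010: 0.10,
    2012: 0.05,
    2020: 0.02,
}

ARCHIVE_GB = 1_000  # assumption: hypothetical archive size


def storage_cost(gigabytes: float, year: int) -> float:
    """Return the raw storage cost for the given size at that year's price."""
    return gigabytes * PRICE_PER_GB[year]


for year in sorted(PRICE_PER_GB):
    print(f"{year}: ${storage_cost(ARCHIVE_GB, year):>15,.2f}")
```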
Summary


• Hire an Expert – Or Become One!
  • Do it once and do it right


• Digitize Everything
  • On Demand / Disposition


• Physically preserve only sentimental and
  historic originals
The AIIM Document Life Cycle
[Diagram repeated from earlier in the deck: the AIIM document life cycle, annotated "Optimize with Searchable Content". Callouts: OCR (searchable content) and metadata retention; support for PDF/A and TIFF; format PDF, enhancements & watermarks, document assembly, personalization, security & approvals.]
Adlib PDF Enterprise

Input:
•   MS Office
•   MS InfoPath
•   MS Project
•   Various CAD
•   Various PDF
•   Images
•   OpenOffice
•   HTML
•   Over 400 File Types

Process:
•   Conversion
•   Recognition (OCR)
•   Publication
    •   Merge
    •   TOC
    •   Bookmarks
    •   Headers/Footers
    •   Digital Signatures

Output:
•   PDF
•   PDF/A
•   XPS
•   XML
•   TIFF/JPG/BMP/PNG
•   TXT
•   HTML
Adlib PDF Architecture

• Content Stores
• Connectors: SharePoint, Folder, Generic
• Connector Framework (Java 1.6/.NET)
• Management Console UI
• WCF / SOAP Services Interface
• Managers: System Manager, System Database, System Manager
• Engines: Transformation Engine, Transformation Engine
Adlib Software…
                     …The PDF Experts!




          Your partner for Quality,
          Automated Document
          Transformation
Contact Information



Matt Woodworth
Manager, Public Sector, North America
613 218 6778
mwoodworth@adlibsoftware.com

More Related Content

What's hot

072810aiimwebinar share point and your information architecture
072810aiimwebinar share point and your information architecture072810aiimwebinar share point and your information architecture
072810aiimwebinar share point and your information architectureRich Blank
 
Oracle - Document Life - 6apr2012
Oracle - Document Life - 6apr2012Oracle - Document Life - 6apr2012
Oracle - Document Life - 6apr2012Agora Group
 
Enterprise Content Management 101 for Government
Enterprise Content Management 101 for GovernmentEnterprise Content Management 101 for Government
Enterprise Content Management 101 for GovernmentAlfresco Software
 
AIIM/ARMA Cloud Collaboration Presentation
AIIM/ARMA Cloud Collaboration PresentationAIIM/ARMA Cloud Collaboration Presentation
AIIM/ARMA Cloud Collaboration PresentationPorter-Roth Associates
 
Lessons learned from & best practices for migrating to SharePoint
Lessons learned from & best practices for migrating to SharePointLessons learned from & best practices for migrating to SharePoint
Lessons learned from & best practices for migrating to SharePointGareth Davies
 
What They Won't Tell You About DITA
What They Won't Tell You About DITAWhat They Won't Tell You About DITA
What They Won't Tell You About DITAAlan Houser
 
The New DRS: Plan for Metadata Migration
The New DRS: Plan for Metadata MigrationThe New DRS: Plan for Metadata Migration
The New DRS: Plan for Metadata Migrationkevin_donovan
 
Planning a Migration to Office 365
Planning a Migration to Office 365Planning a Migration to Office 365
Planning a Migration to Office 365Doug Hemminger
 
Microsoft Office 365 Migration Tips for Government Agencies
Microsoft Office 365 Migration Tips for Government AgenciesMicrosoft Office 365 Migration Tips for Government Agencies
Microsoft Office 365 Migration Tips for Government AgenciesAventis Systems, Inc.
 
Share Point Sat Share Point 2010 And Content Migration
Share Point Sat Share Point 2010 And Content MigrationShare Point Sat Share Point 2010 And Content Migration
Share Point Sat Share Point 2010 And Content MigrationNadir Kamdar
 
Some DSpace Customisations
Some DSpace CustomisationsSome DSpace Customisations
Some DSpace CustomisationsGavin Henrick
 
Digital Asset Management with Alfresco
Digital Asset Management with AlfrescoDigital Asset Management with Alfresco
Digital Asset Management with Alfrescorivetlogic
 
CARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenterCARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCentercara4oraclewebcenter
 
A Practical Approach to Managed Shared Drives
A Practical Approach to Managed Shared DrivesA Practical Approach to Managed Shared Drives
A Practical Approach to Managed Shared DrivesTAB
 
InfoPath 2010 Scaling up 1 to 100
InfoPath 2010 Scaling up 1 to 100InfoPath 2010 Scaling up 1 to 100
InfoPath 2010 Scaling up 1 to 100Chris Grist
 
Xedapp - Overview
Xedapp - OverviewXedapp - Overview
Xedapp - OverviewXedapp
 

What's hot (20)

072810aiimwebinar share point and your information architecture
072810aiimwebinar share point and your information architecture072810aiimwebinar share point and your information architecture
072810aiimwebinar share point and your information architecture
 
Oracle - Document Life - 6apr2012
Oracle - Document Life - 6apr2012Oracle - Document Life - 6apr2012
Oracle - Document Life - 6apr2012
 
Enterprise Content Management 101 for Government
Enterprise Content Management 101 for GovernmentEnterprise Content Management 101 for Government
Enterprise Content Management 101 for Government
 
AIIM/ARMA Cloud Collaboration Presentation
AIIM/ARMA Cloud Collaboration PresentationAIIM/ARMA Cloud Collaboration Presentation
AIIM/ARMA Cloud Collaboration Presentation
 
Lessons learned from & best practices for migrating to SharePoint
Lessons learned from & best practices for migrating to SharePointLessons learned from & best practices for migrating to SharePoint
Lessons learned from & best practices for migrating to SharePoint
 
What They Won't Tell You About DITA
What They Won't Tell You About DITAWhat They Won't Tell You About DITA
What They Won't Tell You About DITA
 
The New DRS: Plan for Metadata Migration
The New DRS: Plan for Metadata MigrationThe New DRS: Plan for Metadata Migration
The New DRS: Plan for Metadata Migration
 
Planning a Migration to Office 365
Planning a Migration to Office 365Planning a Migration to Office 365
Planning a Migration to Office 365
 
Microsoft Office 365 Migration Tips for Government Agencies
Microsoft Office 365 Migration Tips for Government AgenciesMicrosoft Office 365 Migration Tips for Government Agencies
Microsoft Office 365 Migration Tips for Government Agencies
 
Documentum Overview
Documentum OverviewDocumentum Overview
Documentum Overview
 
Share Point Sat Share Point 2010 And Content Migration
Share Point Sat Share Point 2010 And Content MigrationShare Point Sat Share Point 2010 And Content Migration
Share Point Sat Share Point 2010 And Content Migration
 
Office 365: The latest and greatest in the cloud
Office 365:  The latest and greatest in the cloudOffice 365:  The latest and greatest in the cloud
Office 365: The latest and greatest in the cloud
 
What is PDF/A?
What is PDF/A?What is PDF/A?
What is PDF/A?
 
Some DSpace Customisations
Some DSpace CustomisationsSome DSpace Customisations
Some DSpace Customisations
 
Workflow Toolkit
Workflow ToolkitWorkflow Toolkit
Workflow Toolkit
 
Digital Asset Management with Alfresco
Digital Asset Management with AlfrescoDigital Asset Management with Alfresco
Digital Asset Management with Alfresco
 
CARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenterCARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenter
 
A Practical Approach to Managed Shared Drives
A Practical Approach to Managed Shared DrivesA Practical Approach to Managed Shared Drives
A Practical Approach to Managed Shared Drives
 
InfoPath 2010 Scaling up 1 to 100
InfoPath 2010 Scaling up 1 to 100InfoPath 2010 Scaling up 1 to 100
InfoPath 2010 Scaling up 1 to 100
 
Xedapp - Overview
Xedapp - OverviewXedapp - Overview
Xedapp - Overview
 

Similar to PRESENTATION: Challenges of Digitization (November 2012)

Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?weisinger
 
January 2006 Archival Storage Strategies and Technologies Presentation
January 2006 Archival Storage Strategies and Technologies PresentationJanuary 2006 Archival Storage Strategies and Technologies Presentation
January 2006 Archival Storage Strategies and Technologies PresentationJohn Wang
 
AIIM Cloud Collaboration Presentation Jan. 2012
AIIM Cloud Collaboration Presentation Jan. 2012AIIM Cloud Collaboration Presentation Jan. 2012
AIIM Cloud Collaboration Presentation Jan. 2012Porter-Roth Associates
 
Case Study – Deploying SharePoint Based eTMF in the Cloud
Case Study – Deploying SharePoint Based eTMF in the CloudCase Study – Deploying SharePoint Based eTMF in the Cloud
Case Study – Deploying SharePoint Based eTMF in the CloudMontrium
 
Document Archiving & Sharing System
Document Archiving & Sharing SystemDocument Archiving & Sharing System
Document Archiving & Sharing SystemAshik Iqbal
 
Wed van horik_handson_research data management
Wed van horik_handson_research data managementWed van horik_handson_research data management
Wed van horik_handson_research data managementeswcsummerschool
 
MBE Summit 2012
MBE Summit 2012MBE Summit 2012
MBE Summit 2012dopsahl
 
9/28/11 Slides - Introduction to DuraCloud, Slides
9/28/11 Slides - Introduction to DuraCloud, Slides9/28/11 Slides - Introduction to DuraCloud, Slides
9/28/11 Slides - Introduction to DuraCloud, SlidesDuraSpace
 
Digitisation Workshop Pres 2008(V1)
Digitisation Workshop Pres 2008(V1)Digitisation Workshop Pres 2008(V1)
Digitisation Workshop Pres 2008(V1)Mal Booth
 
October 2006 Impact of PDF/A on Content Management by Christy Hubbard
October 2006 Impact of PDF/A on Content Management by Christy HubbardOctober 2006 Impact of PDF/A on Content Management by Christy Hubbard
October 2006 Impact of PDF/A on Content Management by Christy HubbardJohn Wang
 
MongoDB Deployment Checklist
MongoDB Deployment ChecklistMongoDB Deployment Checklist
MongoDB Deployment ChecklistMongoDB
 
Webinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it BetterWebinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it BetterStorage Switzerland
 
Aras Connected Cloud for PLM
Aras Connected Cloud for PLMAras Connected Cloud for PLM
Aras Connected Cloud for PLMAras
 
Presentation1
Presentation1Presentation1
Presentation1f6aim
 
Digitisation workshop pres 2009(v1)
Digitisation workshop pres 2009(v1)Digitisation workshop pres 2009(v1)
Digitisation workshop pres 2009(v1)Mal Booth
 

Similar to PRESENTATION: Challenges of Digitization (November 2012) (20)

Real world rm in share point 2013
Real world rm in share point 2013Real world rm in share point 2013
Real world rm in share point 2013
 
Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?Gilbane 2009 -- How Can Content Management Software Keep Pace?
Gilbane 2009 -- How Can Content Management Software Keep Pace?
 
Single Source Publishing: Utilizing XML and DITA
Single Source Publishing: Utilizing XML and DITASingle Source Publishing: Utilizing XML and DITA
Single Source Publishing: Utilizing XML and DITA
 
PDF vs. TIFF, An Evaluation of Document Scanning File Formats

PRESENTATION: Challenges of Digitization (November 2012)

  • 1. PDF/A Addressing the challenges of digitizing and preserving paper-based documents in GoC Jeff Brand October 26, 2012 © ADLIB 2012. THIS SLIDE PRESENTATION CONTAINS PROPRIETARY AND/OR CONFIDENTIAL INFORMATION.
  • 2. Adlib – Who We Are • Software company – Burlington, Ontario, Canada • Leading expert in document-to-PDF transformation • Improve document intensive business processes • 10+ years experience • 5,000+ Customers Worldwide • 50+ Countries • 100+ Partners
  • 3. Bringing Value To Many Industries: Financial Services, Life Sciences, Legal, Mfg, Health Care, Gov’t, Other
  • 5. Agenda Physical and Digital Media • Physical and Digital Archiving – Advantages and Disadvantages • Overview of PDF and PDF/A Approaches to Digitization • Maximize the retention of knowledge • Consider Security Implications Management of Digitized artifacts • Revisit Retention and Disposition policies • Maximize the value to Canada Opportunities for Savings • Time & Cost • Increased Flexibility to mitigate future costs Summary
  • 6. Physical Archiving • Preserving and Storing the original or exemplary specimen in original, physical form
  • 7. Physical Archiving - Advantages Assuming time has not deteriorated the media… • The physical archive is the original so there is no variance from the original • Relatively little / no question about authenticity, accuracy • Technology – All you need are eyes.
  • 8. Physical Archiving - Advantages • Sentimental Value People will want certain original documents maintained for as long as possible...
  • 9. Physical Archiving - Disadvantages Time Time heals all wounds…and destroys all documents
  • 10. Physical Archiving - Disadvantages Cost Elaborate and costly physical storage and preservation
  • 11. Physical Archiving - Disadvantages Availability There’s only one. Options to make it available to citizens are limited and require manual effort
  • 12. Physical Archiving - Disadvantages Effort to retrieve Locating relevant documents relies on appropriate and accurate taxonomy during onboarding
  • 13. Physical Archiving - Disadvantages Environmental Impact Preserving documents requires chemicals, such as 3M’s Novec 7100 Engineering Fluid
  • 14. Digital Archiving • Preserving and Storing the original or exemplary specimen in a digitized form.
  • 15. Digital Archiving - Advantages Space 128GB USB Flash Drive ($80 CDN) = 56,140,800 Pages. An entire warehouse of text can fit in a USB Thumb Drive
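The slide does not say how the 56,140,800-page figure was derived. As a rough sanity check, the sketch below works backwards from the slide's own numbers to the page size the figure implies. Whether "128GB" means decimal or binary gigabytes is an assumption, so both are computed; the function names are illustrative only.

```python
# Back-of-the-envelope check of the slide's "128GB = 56,140,800 pages" claim.
# The slide gives no page size, so we derive the one it implies.

PAGES_ON_SLIDE = 56_140_800   # figure quoted on the slide
DRIVE_PRICE_CDN = 80.0        # price quoted on the slide


def implied_bytes_per_page(capacity_bytes: int, pages: int = PAGES_ON_SLIDE) -> float:
    """Average number of bytes available per page for a given capacity."""
    return capacity_bytes / pages


def cost_per_page(price: float = DRIVE_PRICE_CDN, pages: int = PAGES_ON_SLIDE) -> float:
    """Storage cost per page in the same currency as `price`."""
    return price / pages


# Assumption: "128GB" could be decimal (10**9) or binary (2**30) gigabytes.
decimal_estimate = implied_bytes_per_page(128 * 10**9)
binary_estimate = implied_bytes_per_page(128 * 2**30)

if __name__ == "__main__":
    print(f"decimal GB: {decimal_estimate:.0f} bytes per page")
    print(f"binary GB:  {binary_estimate:.0f} bytes per page")
    print(f"cost per page: ${cost_per_page():.8f} CDN")
```

Either reading gives a little over 2 KB per page, which fits a page of plain text and matches the slide's wording ("an entire warehouse of text"). Scanned page images would need far more space per page.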
  • 16. Digital Archiving - Advantages Availability Digital copies can be shared with an unlimited number of people with little or no effort.
  • 17. Digital Archiving - Advantages Effort Technologies such as Full-Text-Searching make finding relevant documents easier and less dependent on taxonomy
  • 18. Digital Archiving - Advantages Automation Automatically execute Retention and Disposition policies, Audit and more without manual intervention
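As a minimal sketch of what automated retention and disposition could look like, the example below flags records whose retention period has elapsed. The `Record` fields, the year-based retention rule, and the function names are all hypothetical; they are not taken from the presentation or from any Adlib product.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Record:
    """Hypothetical archive record with a simple year-based retention rule."""
    record_id: str
    created: date
    retention_years: int


def disposition_date(record: Record) -> date:
    """Date on which the record becomes eligible for disposition."""
    target_year = record.created.year + record.retention_years
    try:
        return record.created.replace(year=target_year)
    except ValueError:
        # Created on 29 February and the target year is not a leap year.
        return date(target_year, 3, 1)


def due_for_disposition(records: list[Record], today: date) -> list[str]:
    """IDs of records whose retention period has ended as of `today`."""
    return [r.record_id for r in records if disposition_date(r) <= today]


if __name__ == "__main__":
    archive = [
        Record("memo-001", date(2000, 1, 1), 10),
        Record("deed-002", date(2010, 6, 1), 25),
    ]
    print(due_for_disposition(archive, date(2012, 10, 26)))
```

A scheduled job could run a check like this daily and write each decision to an audit log, so no one has to review the archive by hand.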
  • 19. Digital Archiving - Advantages Flexibility Flexibility to support changing Policy and Requirements easily
  • 20. Digital Archiving - Advantages Cost Digital archives typically cost 90% less to operate and maintain
  • 21. Digital Archiving - Challenges • Digital Dark Age • Wide variety of formats that require special technology to view • Reduced Sentimental Value
  • 22. Digital Archiving - Considerations Digital Dark Age Ensuring files are accessible tomorrow…
  • 23. Digital Archiving - TIFF Tagged Image File Format • Used by FAX Machines (CCITT Group 4) • Very common image format • Supports multiple pages • Significant increase in file size for digitally-born content • No Search capability • Not designed for Long-Term Archiving
  • 24. Digital Archiving – PDF/A Portable Document Format (For Archive) • Adopted by ISO for long-term preservation of documents • Based on PDF – the most popular document format on the Web today • Highest-Quality representation of document • Smallest possible file sizes • Guaranteed to look the same forever • Universally Viewable – Hardware / Software independent
  • 25. What is PDF? Portable Document Format Originally created by Adobe in the 1990s, it became an open ISO Standard (ISO 32000-1) in 2008 The most popular file format on the web today (FileInfo.com)
  • 26. What else is PDF Used For? • Contracts • Agreements • Sales Proposals • Product Literature • Publications • Reports • Standard Operating Procedures • Long-Term Archiving • Sharing documents and content with others • So Much More
  • 27. Isn’t PDF Free? Many applications can save to PDF
  • 28. Isn’t PDF Free? A quick Google search shows dozens of free applications for creating PDF…
  • 29. Isn’t PDF Free? - Can your doc change? Original Excel Chart Free PDF Rendition Fidelity or quality of conversion is often the cost
  • 30. Isn’t PDF Free? - Can your doc change? Original Word Doc Free PDF Rendition Content Re-Flow - Font Substitution - Complex Formats
  • 31. Isn’t PDF Free? - Can you comply? PDF Features that are often required for compliance or even optimal document conversion are often missing in free or low-cost solutions
  • 32. Isn’t PDF Free? - Can you merge? It can be difficult and time-consuming to merge the content from multiple applications into a single document. Few if any free PDF solutions enable this.
  • 33. Isn’t PDF Free? - Can you keep up? Workers spend far too much time dealing with low quality and manual PDF rendering technologies
  • 34. Isn’t PDF Free? • Free or Low-Cost software can cost you: • Hours of lost productivity • Lost opportunities • Miscommunication • Business delays • Fines
  • 35. PDF/A
  • 36. PDF/A - What is it? • A more strict subset of the PDF specification • Specifically designed for the purpose of long-term preservation of documents • Audio, Video, JavaScript and Executables, Encryption, External references are all restricted • Designed to be 100% Self Contained – All fonts must be embedded
  • 37. PDF/A - What is it? Based on PDF, Initially Released in 2005 (3 years ahead of PDF as an ISO Standard!) • PDF/A-1 (a/b) • Based on PDF 1.4 Specification • PDF/A-2 (a/b/u) • Based on ISO 32000-1 • JPEG2000, Transparency, Layers, OpenType Fonts, PDF/A File Embedded • PDF/A-3 (a/b/u) • Arbitrary files can be embedded
  • 38. PDF/A - What is it? What makes a PDF a PDF/A? • A Special metadata tag that indicates that the document presents itself as PDF/A • Compliance to the PDF/A Standard
  • 39. PDF/A – Disadvantages File Size – Embedding fonts in each document means file sizes are larger when compared to PDF (This is still significantly better than alternatives such as TIFF)
  • 40. PDF/A – Summary • PDF provides many benefits over alternatives such as TIFF • Small size • High-quality • Searchable • Highly viewable • Portable • PDF/A Builds on this and ensures the long-term viability of content stored in this format
  • 41. The AIIM Document Life Cycle Optimize with Searchable Content OCR- Searchable Content Metadata Retention - Format PDF - Enhancements & Watermarks Support for: - Document Assembly - PDF/A - Personalization - TIFF - Security & Approvals
  • 42-48. PDF/A at Library and Archives Canada [Architecture diagram repeated across seven slides, with the Adlib label appearing at different positions on slides 43 to 48. Diagram labels: Services; Upload Module; FTP Module; eMail Module; Scan Module; Staging; DAM; Web Store; Repositories; Local/cloud Storage; Structured Data; Templates; Metadata Warehouse & Social Models]
  • 50. Preparation – Typical Document Process
  • 51. Digitization – Processing Large Volumes • Digitizing entire libraries of content can be more than daunting but help is available: Seek out industry experts to ensure a successful transition of knowledge
  • 52. Digitization – Processing Large Volumes • Digitizing entire libraries of content can be more than daunting but help is available: • In-Sourcing and Out-Sourcing: Build a plan of action that considers Security requirements • Is the content potentially sensitive? • Is there risk of loss? • Is there a risk of contamination / degradation of the original content?
  • 53. Digitization – Processing Large Volumes • Digitizing entire libraries of content can be more than daunting but help is available: • Hardware & Software Investments • What do you need Today & Tomorrow • Consider Lease for Short Term requirements • Provision for the future
  • 54. Digitization – Processing Large Volumes • Measure Twice, Cut Once • Plan ahead and consider the future use of the content when defining requirements • Understand the entire lifecycle of the content when architecting the process • How long will we keep it? • How will we share it? • How will people find it? • How will we dispose of it? • Will we maintain the originals after digitization? • What are the specific requirements for each step in the process?
  • 55. Digitization – Processing Large Volumes • Start with Quality • Pay special attention to the digitization process • Higher quality at the IMAGING stage pays off • Files can be reduced as necessary later, but you can never ADD quality • Consider pre-processing when scanning documents of questionable quality • Ensure highly accurate OCR is applied prior to onboarding into the system, or as a part of the onboarding process
  • 56. Digitization – Maintaining Taxonomy Classification and indexes need to be maintained, but how? • Purely Physical • Index Cards, Catalogs, Within Content • Modernized Physical • Library systems & databases
  • 57. Digitization – Maintaining Taxonomy This is often achieved by making the classification data available on a cover sheet in front of each document. This can be extracted from the Library System / DB, or pulled directly from an Index Card and even processed from a Catalog (Even if it’s physical!)
  • 58. Digitization – Approaches There are 2 methods for digitizing a collection: 1. Batch • Everything is performed in one or multiple batches and the sequence of batching is pre-determined 2. Scan-On-Demand • More opportunistic, existing Archives are digitized as requested
  • 59. Digitization - Security Preventing Loss • Chain of custody • Limited transportation choices • Escorted Content Selective Outsourcing • Assess the risk • Employ multiple tiers for Outsourcing • In-Source for the most critical artifacts
  • 60. Management of Digitized Artifacts • Revisit Retention and Disposition Policies • Can we keep digital records longer? Indefinitely? • Maximizing the value to Canada • Making content available to Canadians • Using Search to maximize value and enhance classification paradigms in use today
  • 61. Sharing Canada’s Digitized Artifacts Maximizing the value to Canada: • Education • Legal • Innovation
  • 62. Cost Savings • Physical Storage • Management and Execution of Retention and Disposition Policies • Flexibility to support changing Policy and Requirements easily
  • 63. Cost Savings [Chart: Price per GB] 1990 $36,659.20; 1995 $819.20; 2000 $1,433.60; 2005 $10.00; 2010 $0.10; 2012 $0.05; 2020 $0.02
  • 64. Cost Savings [Chart: Price per GB, same data with the axis zoomed to 2010-2020] 2010 $0.10; 2012 $0.05; 2020 $0.02
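To make the price-per-GB figures on slides 63 and 64 concrete, the sketch below computes what a fixed amount of storage would cost in each year. The prices are copied from the slides as given. Since the deck dates from 2012, the 2020 value is presumably a projection. The function names and the 128 GB example size are illustrative.

```python
# Price per GB by year, copied as-is from the slides' chart data.
PRICE_PER_GB = {
    1990: 36659.20,
    1995: 819.20,
    2000: 1433.60,
    2005: 10.00,
    2010: 0.10,
    2012: 0.05,
    2020: 0.02,  # presumably a projection, since the deck dates from 2012
}


def storage_cost(gigabytes: float, year: int) -> float:
    """Cost of `gigabytes` of storage at the slide's price for `year`."""
    return gigabytes * PRICE_PER_GB[year]


def relative_drop(from_year: int, to_year: int) -> float:
    """Fractional price decrease between two years (0.5 means 50% cheaper)."""
    return 1 - PRICE_PER_GB[to_year] / PRICE_PER_GB[from_year]


if __name__ == "__main__":
    for year in sorted(PRICE_PER_GB):
        print(f"{year}: ${storage_cost(128, year):,.2f} for 128 GB")
    print(f"2010 to 2012 drop: {relative_drop(2010, 2012):.0%}")
```

At these prices, a capacity that cost millions of dollars in 1990 costs a few dollars in 2012. That is the basis for the slide's cost-savings argument.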
  • 65. Summary • Hire an Expert – Or Become One! • Do it once and do it right • Digitize Everything • On Demand / Disposition • Physically preserve only sentimental and historic originals
  • 66. The AIIM Document Life Cycle Optimize with Searchable Content OCR- Searchable Content Metadata Retention - Format PDF - Enhancements & Watermarks Support for: - Document Assembly - PDF/A - Personalization - TIFF - Security & Approvals
  • 67. Adlib PDF Enterprise Input: • MS Office • MS InfoPath • MS Project • Various CAD • Various PDF • Images • OpenOffice • HTML • Over 400 File Types Process: • Conversion • Recognition (OCR) • Publication • Merge • TOC • Bookmarks • Headers/Footers • Digital Signatures Output: • PDF • PDF/A • XPS • XML • TIFF/JPG/BMP/PNG • TXT • HTML
  • 68. Adlib PDF Architecture [Diagram labels: Content Stores; Connectors (SharePoint, Folder, Generic); Management Console UI; Connector Framework (Java 1.6/.NET); WCF / SOAP Services Interface; System Managers; System Database; Transformation Engines]
  • 69. Adlib Software… …The PDF Experts! Your partner for Quality, Automated Document Transformation
  • 70. Contact Information Matt Woodworth Manager, Public Sector. North America. 613 218 6778 mwoodworth@adlibsoftware.com