SlideShare a Scribd company logo
1 of 6
Engineering                                                                                                                      What’s the Problem?
              100-year Digital Preservation†                                                                                      Digital infrastructure in its infancy; paper infrastructure has
                                                                                                                                     been slowly refined for over 3000 years
                                Most content starts in digital form.                                                              Society has demonstrated little commitment to digital
                                                                                                                                     preservation
                            Some content is fundamentally digital.
                                                                                                                                  After a dozen years of archives ’ efforts, no existing digital
                  People want to save some of it for the long term.                                                                  preservation program can match Internet scales

                                                                                                                                      “[No] concise and well-developed strategy that represents the views
                          What's needed to preserve bit-streams?
                                                                                                                                      of a broad community has yet emerged. Since 1989 at least twelv   e
                What will be needed to interpret saved bit-streams?                                                                   have been published. … we need to re-think how we are
                                                                                                                                      approaching research … [and] need to engage … researchers in
               What can be done to make the content trustworthy?                                                                      this process, and especially those with a strong computing science
                                                                                                                                      and engineering background.”       [Ross, ECDL 2007 keynote address]
          How can convenient software support be implemented?
                                                                                                                                  We sketch a solution to every technical challenge
          H.M. Gladney , Economics and Engineering for Preserving Digital Content submitted for publication, December
                                                                                ,
          2007. Draft available at http://home.pacbell.net/hgladney/LDPeng.pdf
                                                                             .
          †
              Adapted from a March 2006 U.S. NIST workshop presentation

        © 2008, H.M. Gladney, HMG Consulting                         http://home.pacbell.net/hgladney/                        1   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   2




                                                                                                                                                     Digital Archiving and
                       Scope Limitations and Status
                                                                                                                                                     Digital Preservation
       Digital Preservation h mitigation of the deleterious effects
         of technology obsolescence, media degradation, and
                                                                                                                                  “Archiving” consists of
         fading human memory —effects that might impair the
         value of digital object copies long after originals were
                                                                                                                                  •   Exploiting digital library technology
         created.
                                                                                                                                  •   Tailoring client services
       Technical components only (e.g., not addressing professional
          education needed).
                                                                                                                                  •   Curating collections and catalogs
       Increment to available document and digital repository
         technology                                                                                                               •   Providing search services
                                                                                                                                  •   Sharing with other institutions
       Preservation is a different topic than                                                                                     •   Long-term preservation
        repository management
                                                                                                                                  These can be treated independently.
        © 2008, H.M. Gladney, HMG Consulting                         http://home.pacbell.net/hgladney/                        3   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   4




                                                                                                                                               Information Producers’
                     End Users’ Requirements
                                                                                                                                                    Requirements
        What might a descendant want of content stored
        today? For any document preserved somewhere                                                                               Citizens will want to preserve information without
        in the world, help                                                                                                        asking anybody’ s permission, with convenient tools
                                                                                                                                          anybody’
                                                                                                                                  and infrastructure:
        • To find and retrieve a copy of its representing bit-string;
        • To use the content as its producers intended;                                                                           •   To preserve any content, doing so that descendants
        • To decide whether the content is trustworthy;                                                                               can use this content as specified in the prior slide;
                                                                                                                                  •   To submit such content to repositories that reliably
        • To exploit embedded references to reliably identify and
                                                                                                                                      promise to save it.
          retrieve contextual objects recursively; and
        • To achieve all this without technical difficulties.

        L. Graham and P.T. Metaxas , quot;Of Course It's True; I Saw It on the Internetquot; Critical Thinki g in the Internet Era,
                                                                                                   n
        Comm. ACM 46(5), 70 -75, May 2003.


        © 2008, H.M. Gladney, HMG Consulting                         http://home.pacbell.net/hgladney/                        5   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   6




© 2006, H.M. Gladney, HMG Consulting                                                                                                                                                                             1
Scale of the Digital
                     Repository Institutions’
                                                                                                                       Preservation Challenge
                        Requirements
                                                                                                       # of archivists                                  ~ 5000 and stable
       What technology will repository institutions want? In                                           # of software engineers                          ~ 500,000 and stable
       addition to perfect world digital library technology, they
                                                                                                                                                        >106 and growing
                                                                                                       # of information producers
       will want support for:
                                                                                                                                                        >101 0 and growing
                                                                                                       # of digital objects
        • Continuing their day -to-day content management without
                                                                                                       What fraction is worth saving?                   Nobody knows!
          disruption;
        • Sharing content and metadata semi-automatically;
                                                                                                           Implication: existing cultural repositories cannot keep up
        • Scaling to immense numbers of large documents; and
                                                                                                           without radical change
        • Sharing preservation effort to avoid costs & burdens.
                                                                                                           Plausible increases in repository funding don’t change this
                                                                                                           Even if the above estimates are incorrect by a factor of 10
        © 2008, H.M. Gladney, HMG Consulting         http://home.pacbell.net/hgladney/             7   © 2008, H.M. Gladney, HMG Consulting       http://home.pacbell.net/hgladney/      8




                   Implications of the Scaling                                                                                    TDO† Approach
                            Challenge                                                                  Divide and conquer
                                                                                                       Assume the worst and mitigate its potential effects
                                                                                                       Ø
         Preservation of a significant fraction of what’s
                                                   what’                                                     Repositories are lost. Institutions disappear.
                                                                                                       Ø
         created can be achieved only by:                                                                    Clever rogues alter information feloniously.
                                                                                                       Ø     Disgruntled or careless employees defeat procedures.
           • Automating everything possible;
                                                                                                       Ø     The last guy with critical information is dead.
           • Shifting preservation work from archivists to
                                                                                                       Starting with a “worst case” solution, find least
             information creators;
                                                                                                         expensive means for simple, common cases.
           • Embedding preservation preparation into
                                                                                                       Focus on digital objects.æ
             existing document processing; and
                                                                                                                              †
                                                                                                                                  TDO h Trustworthy Digital Object
           • Effective partnership of archivists with SW                                               æ
                                                                                                           End users don’t care how archives work, just that they deliver what was stored!
             engineers and information producers.
        © 2008, H.M. Gladney, HMG Consulting         http://home.pacbell.net/hgladney/             9   © 2008, H.M. Gladney, HMG Consulting       http://home.pacbell.net/hgladney/     10




             Digital Archiving Models, Representation                                                      Preservation: a special case of Communication
                     Languages and Standards
                                                                                                       Information consumer cannot obtain producer’s explanations.
        Canonical representation for archiving?       TDO (Trustworthy Digital Object) structure

        How to compress data?                         Not a digital preservation topic.

        How to manage interoperability?               Current EDP standards.

        How to prepare content for archival           (1) Add metadata that archivists want.
           submission.                                (2) Encode bit -strings to be durable.
                                                      (3) Sign and seal content packages.

        Trustworthiness of archived information?      Durable Evidence (this talk)?

        Role of standards in information packages?    Starting point for Durable Encoding.
        OAIS packages and submission interfaces?      TDO format (this talk)?

        Taxonomies, Thesauri and Ontologies           Not a preservation topic.

                                                                                                       Talk about the users, not about repositories!
        Achieving Semantics Interoperability          “the Holy Grail of knowledge management ”

                                                                                                       Distinguishing current needs is politically pragmatic.
        © 2008, H.M. Gladney, HMG Consulting         http://home.pacbell.net/hgladney/         11      © 2008, H.M. Gladney, HMG Consulting       http://home.pacbell.net/hgladney/     12




© 2006, H.M. Gladney, HMG Consulting                                                                                                                                                         2
OAIS Repository
               An Information Communication Model
                                                                                                                                            (the conventional focus)?




          Can describe transformations objectively, except for 0è1 and 9è10.
          What the information consumer receives should not depend on the
           transmission path from the information producer. (Implies an OAIS
                             DIP to OAIS SIP relationship.) ?
        © 2008, H.M. Gladney, HMG Consulting                   http://home.pacbell.net/hgladney/                    13   © 2008, H.M. Gladney, HMG Consulting                      http://home.pacbell.net/hgladney/                      14




                                                                                                                         Digital Object (or a Collection of Anything)?
       Part of the Context: Nested Digital Repositories
                                                                                                                                                                                                                     Tightly bound
                                                                                                                                                                                                                     package.
                                                                                                                                                                                                                     Including
                                                                                                                                                                                                                     metadata.
                                                                                                                                                                                                                     Linked schema
                                                                                                                                                                                                                     & standards.
                                                                                                                                                                                                                     Nested
                                                                                                                                                                                                                     versions.
                                                                                                                                                                                                                     Using ternary
                                                                                                                                                                                                                     relationships.
                                                                                                                                                                                                                     Saving every
                                                                                                                                                                                                                     important
                                                                                                                                                                                                                     version.
                                                                                                                                                                                                                     Recursive and
                                                                                                                                                                                                                     grounded.
        © 2008, H.M. Gladney, HMG Consulting                   http://home.pacbell.net/hgladney/                    15   © 2008, H.M. Gladney, HMG Consulting                      http://home.pacbell.net/hgladney/                      16




                                                                                                                         Durable Evidence for Info. Packages
       Bit-string Durability: Replication (as for books)?
                                                                                                                         UUIDs for digital objects, for material objects, and for object collections
        LOCKSS* successful, but embeds too much policy
        Avatar implementing rules for replication                                                                        Relationships whose elements are identifiers and scalars
                                                                                                                         Certificates with recursion grounded in a few institutions
        è     Generalize
              Design rules for                                                                                           Share keys by personal connections
              replication
                                                                                                                         All packaged with XML
              Design rules for
              access control                                                                                             Signed and sealed just as
                                                                                                                             in the age-old practice
                                                                                                                             (Japanese ukiyo-e)?




            *Reich & Rosenthal, LOCKSS: A Permanent Web Publishing and Access System , D-Lib Magazine 7(6), 2001.
                                                                                                                         Gladney , Trustworthy … Objects: Evidence After Every Witness is Dead, ACM Trans. Info. Sys. 22(3), 406 -436, 2004.

        © 2008, H.M. Gladney, HMG Consulting                   http://home.pacbell.net/hgladney/                    17   © 2008, H.M. Gladney, HMG Consulting                      http://home.pacbell.net/hgladney/                      18




© 2006, H.M. Gladney, HMG Consulting                                                                                                                                                                                                           3
Durable Encoding for Content Blobs
       Durable Encoding for Content Bit-strings
       Starting point is a few widely understood standards
       Ø Unicode/UTF, small XML core, BNF, ASN.1, identifier syntax
       Universal Virtual Machine (enhanced Turing Machine)?
       Ø   Bit-addressed, any number of registers, condition codes
       Ø   Segmented memory     —some segments private, others shared
       Ø   Input/output streams,
       Ø   Still needs multiprogramming and real -time instructions
       Need one UVC emulator for each HW architecture                                                                      Objective: render for human intelligibility and/or
       Need one UVC application for each file type (~1000 types)?                                                          Make available to be program input
                                                                                                                           One UVC program for every file type (~1000 world- wide)?
       Don’t save anything irrelevant, e.g., O/S portions
                                                                                                                           One Restore Application per future computer architecture
       Save the original file together with UVC application
                                                                                                                           Problem with proprietary formats, e.g., MS Word files
        Lorie & vanDiessen, A Universal Virtual Computer for long-term preservation,IBM Research Report, RJ 10338, 2005.
        Gladney & Lorie Trustworthy … Digital Objects: Durable Encoding… , ACM Trans. Info. Sys. 23(3), 299-324, 2005.

        © 2008, H.M. Gladney, HMG Consulting                     http://home.pacbell.net/hgladney/                    19   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   20




                   Properties of TDO Methodology                                                                                                Summary and Status
                                                                                                                           Digital Preservation h mitigation of the deleterious effects
       Format migration (which is error-prone) is not required
                                                                                                                             of technology obsolescence, media degradation, and
       Consumers can evaluate authenticity without human                                                                     fading human memory .
         administrative help
       Metadata-to-object dissociation rare and detectable                                                                 Every conceptual challenge answered; no criticism received.
                                                                                                                           Engineering design is available.
       Insensitive to Internet security risks
                                                                                                                           Ø Plug-in compatible with any digital library software package
       Links have authentication codes for reliability
                                                                                                                           Ø Compatible with existing & emerging standards
       Implementable without disrupting existing repositories
       Conformance with regulatory requirements will be easy                                                               Next steps: building a prototype and running a pilot
       Implementation easy (based on XML and text editors)?
       Scalable


        © 2008, H.M. Gladney, HMG Consulting                     http://home.pacbell.net/hgladney/                    21   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   22




                                               References                                                                    Recommended Epistemological Texts
                                                                                                                           Ludwig Wittgenstein, Tractatus Logico-Philosophicus ,
       HMG, Preserving Digital Information, Springer Verlag, 2007.
                                                                                                                              1921; also Philosophical Investigations, 1953.
       HMG, Digital Preservation in a National Context: Questions
                                                                                                                           Rudolf Carnap, Logical Structure of the World, 1926.
        and Views of an NDIIPP Outsider, D-Lib Mag. 13(1), 2007.
                                                                                                                           Gilbert Ryle, The Concept of Mind, 1949.
           http://www.dlib.org/dlib/january07/gladney/01gladney.html
       S. Ross, Digital Preservation, Archival Science and                                                                 Ernst Cassirer, The Problem of Knowledge: Philosophy,
         Methodological Foundations for Digital Libraries , 11th                                                              Science, and History Since Hegel, 1950.
         European Conference on Digital Libraries, 2007.                                                                   Michael Polanyi, Personal Knowledge: Towards a Post-
           http://www.ecdl2007.org/Keynote_ECDL2007_SROSS.pdf
                                                                                                                              Critical Philosophy, 1958.
       HMG, Economics and Engineering for Preserving Digital
                                                                                                                           Willard Van Orman Quine, Word and Object, 1960.
        Content, to appear 2008.
                                                                                                                           J. Alberto Coffa, The Semantic Tradition from Kant to
           http://home.pacbell.net/hgladney/LDPeng.pdf
                                                                                                                              Carnap to the Vienna Station, 1991.


        © 2008, H.M. Gladney, HMG Consulting                     http://home.pacbell.net/hgladney/                    23   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   24




© 2006, H.M. Gladney, HMG Consulting                                                                                                                                                                       4
Backup Slides                                          A Practical Definition for “Authentic”
                                                                                        • Given a derivation statement R,           “V is a copy of Y ( V=C(Y) )”,
                                                                                        • a provenance statement S, “X said or created Y as part of event
                                                                                          Z”, and
                                                                                        • a copy function,       “C(y) = Tn (… (T2( T1(y) ))), ”
                                                                                        • we say that V is a derivative of Y if V is related to Y according to R.
                                                                                        • We say that “by X as part of event Z” is a true provenance of V if R
                                                                                          and S are true.
                                                                                        • We say that V is sufficiently faithful to Y if C conforms to social
                                                                                          conventions for the genre and for the circumstances at hand.
                                                                                        • We say that V is an authentic copy of Y if it is a sufficiently faithful
                                                                                          derivative with true provenance.




        © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   25   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/     26




               Trustworthy Digital Object (TDO)?                                             Essential & Accidental Information
                                                                                        What ’s essential depends on what somebody specific
                                                                                          wants to accomplish (information producer or consumer).
                                                                                        Accidental content cannot be avoided.




                                                                                        Imperfect knowledge capture is a fact of life; many
                                                                                          examples in engineering specifications.
        © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   27   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/     28




                                                                                        Software Layering and Rules Administration
                  Schema for a Digital Collection




        © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   29   © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/     30




© 2006, H.M. Gladney, HMG Consulting                                                                                                                                      5
Some Conclusions
        Preservation of manufacturing data does not present
          unique technical challenges è look at what members of
          other disciplines are writing and saying.
        But manufacturing does have specialized file formats and standar ds.
        Divide and conquer è treat preservation as content mgmt.
          increment.
        Focus on data structure first; later on data management.
        Be sensitive to the teachings of scientific philosophy.
        Age-old procedures for paper have digital counterparts.
        Big challenge: persuading information providers to write
          metadata
        We have an architecture, and are ready to start
         software design.


        © 2008, H.M. Gladney, HMG Consulting   http://home.pacbell.net/hgladney/   31




© 2006, H.M. Gladney, HMG Consulting                                                    6

More Related Content

What's hot

Making sense out of disagreement, University of Limerick Interaction Design C...
Making sense out of disagreement, University of Limerick Interaction Design C...Making sense out of disagreement, University of Limerick Interaction Design C...
Making sense out of disagreement, University of Limerick Interaction Design C...jodischneider
 
EMC Academic Alliance - Program Guide
EMC Academic Alliance - Program GuideEMC Academic Alliance - Program Guide
EMC Academic Alliance - Program Guideacademicalliance
 
AIIM Ottawa Presentation Digital Preservation A Wicked Problem
 AIIM Ottawa Presentation Digital Preservation A Wicked Problem  AIIM Ottawa Presentation Digital Preservation A Wicked Problem
AIIM Ottawa Presentation Digital Preservation A Wicked Problem Debra Power
 
WICSA 2012 tutorial
WICSA 2012 tutorialWICSA 2012 tutorial
WICSA 2012 tutorialLen Bass
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open DataDerilinx
 
4Projects Capital Asset Owner Brochure
4Projects Capital Asset Owner Brochure4Projects Capital Asset Owner Brochure
4Projects Capital Asset Owner Brochure4Projects
 
英文書報0517 詹竣策M0017106
英文書報0517 詹竣策M0017106英文書報0517 詹竣策M0017106
英文書報0517 詹竣策M0017106cambrain99
 
Xml finland-2011-sami-poikonen-ea nokia
Xml finland-2011-sami-poikonen-ea nokiaXml finland-2011-sami-poikonen-ea nokia
Xml finland-2011-sami-poikonen-ea nokiaSami Poikonen
 
Manfred Linking the Real World
Manfred Linking the Real WorldManfred Linking the Real World
Manfred Linking the Real Worldsssw2012
 
Data centers and the cloud
Data centers and the cloudData centers and the cloud
Data centers and the cloudPim Bilderbeek
 
D Space Proposal Tvm 1407
D Space Proposal Tvm 1407D Space Proposal Tvm 1407
D Space Proposal Tvm 1407Anil Mishra
 
Fi arch design-principles-new_seeds-v0.7.4
Fi arch design-principles-new_seeds-v0.7.4Fi arch design-principles-new_seeds-v0.7.4
Fi arch design-principles-new_seeds-v0.7.4Ioanna Papafili
 
What is an information professional?
What is an information professional?What is an information professional?
What is an information professional?John Mancini
 
Making Digital Library Content Interoperable @ IRCDL 2010
Making Digital Library Content Interoperable @ IRCDL 2010 Making Digital Library Content Interoperable @ IRCDL 2010
Making Digital Library Content Interoperable @ IRCDL 2010 Leonardo Candela
 
What happens when data start living their own life?
What happens when data start living their own life?What happens when data start living their own life?
What happens when data start living their own life?Fing
 
Webinar: Enterprise Social Networking to Foster Employee Engagement
Webinar: Enterprise Social Networking  to Foster Employee Engagement Webinar: Enterprise Social Networking  to Foster Employee Engagement
Webinar: Enterprise Social Networking to Foster Employee Engagement tibbr
 
Knowledge Management
Knowledge ManagementKnowledge Management
Knowledge ManagementAdrianMalone
 

What's hot (20)

Brunel University
Brunel University Brunel University
Brunel University
 
Making sense out of disagreement, University of Limerick Interaction Design C...
Making sense out of disagreement, University of Limerick Interaction Design C...Making sense out of disagreement, University of Limerick Interaction Design C...
Making sense out of disagreement, University of Limerick Interaction Design C...
 
EMC Academic Alliance - Program Guide
EMC Academic Alliance - Program GuideEMC Academic Alliance - Program Guide
EMC Academic Alliance - Program Guide
 
AIIM Ottawa Presentation Digital Preservation A Wicked Problem
 AIIM Ottawa Presentation Digital Preservation A Wicked Problem  AIIM Ottawa Presentation Digital Preservation A Wicked Problem
AIIM Ottawa Presentation Digital Preservation A Wicked Problem
 
Database Management
Database ManagementDatabase Management
Database Management
 
WICSA 2012 tutorial
WICSA 2012 tutorialWICSA 2012 tutorial
WICSA 2012 tutorial
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
4Projects Capital Asset Owner Brochure
4Projects Capital Asset Owner Brochure4Projects Capital Asset Owner Brochure
4Projects Capital Asset Owner Brochure
 
英文書報0517 詹竣策M0017106
英文書報0517 詹竣策M0017106英文書報0517 詹竣策M0017106
英文書報0517 詹竣策M0017106
 
SunGard Case Study 3i
SunGard Case Study 3iSunGard Case Study 3i
SunGard Case Study 3i
 
Xml finland-2011-sami-poikonen-ea nokia
Xml finland-2011-sami-poikonen-ea nokiaXml finland-2011-sami-poikonen-ea nokia
Xml finland-2011-sami-poikonen-ea nokia
 
Manfred Linking the Real World
Manfred Linking the Real WorldManfred Linking the Real World
Manfred Linking the Real World
 
Data centers and the cloud
Data centers and the cloudData centers and the cloud
Data centers and the cloud
 
D Space Proposal Tvm 1407
D Space Proposal Tvm 1407D Space Proposal Tvm 1407
D Space Proposal Tvm 1407
 
Fi arch design-principles-new_seeds-v0.7.4
Fi arch design-principles-new_seeds-v0.7.4Fi arch design-principles-new_seeds-v0.7.4
Fi arch design-principles-new_seeds-v0.7.4
 
What is an information professional?
What is an information professional?What is an information professional?
What is an information professional?
 
Making Digital Library Content Interoperable @ IRCDL 2010
Making Digital Library Content Interoperable @ IRCDL 2010 Making Digital Library Content Interoperable @ IRCDL 2010
Making Digital Library Content Interoperable @ IRCDL 2010
 
What happens when data start living their own life?
What happens when data start living their own life?What happens when data start living their own life?
What happens when data start living their own life?
 
Webinar: Enterprise Social Networking to Foster Employee Engagement
Webinar: Enterprise Social Networking  to Foster Employee Engagement Webinar: Enterprise Social Networking  to Foster Employee Engagement
Webinar: Enterprise Social Networking to Foster Employee Engagement
 
Knowledge Management
Knowledge ManagementKnowledge Management
Knowledge Management
 

Viewers also liked

Link In Slide File
Link In Slide FileLink In Slide File
Link In Slide Fileskeelys
 
Guerrilla Readers - marketig pro knihovny
Guerrilla Readers - marketig pro knihovnyGuerrilla Readers - marketig pro knihovny
Guerrilla Readers - marketig pro knihovnyGuerrilla Readers
 
Essence Of Leadership
Essence Of LeadershipEssence Of Leadership
Essence Of LeadershipRavindran B
 
Records Conversion & Redaction With Food Final Print
Records Conversion & Redaction With Food Final PrintRecords Conversion & Redaction With Food Final Print
Records Conversion & Redaction With Food Final Printinfotokevin
 
Succeed To Lead Business Orientation
Succeed To Lead Business OrientationSucceed To Lead Business Orientation
Succeed To Lead Business OrientationByron Cherry
 

Viewers also liked (7)

Link In Slide File
Link In Slide FileLink In Slide File
Link In Slide File
 
GR (R)Evoluci spousti...
GR (R)Evoluci spousti...GR (R)Evoluci spousti...
GR (R)Evoluci spousti...
 
Guerrilla Readers - marketig pro knihovny
Guerrilla Readers - marketig pro knihovnyGuerrilla Readers - marketig pro knihovny
Guerrilla Readers - marketig pro knihovny
 
Essence Of Leadership
Essence Of LeadershipEssence Of Leadership
Essence Of Leadership
 
Records Conversion & Redaction With Food Final Print
Records Conversion & Redaction With Food Final PrintRecords Conversion & Redaction With Food Final Print
Records Conversion & Redaction With Food Final Print
 
Succeed To Lead Business Orientation
Succeed To Lead Business OrientationSucceed To Lead Business Orientation
Succeed To Lead Business Orientation
 
Report z Vyškov
Report z VyškovReport z Vyškov
Report z Vyškov
 

Similar to Ldp Executive Slides

Open Text And Timberland
Open Text And TimberlandOpen Text And Timberland
Open Text And TimberlandRich_C07
 
20120605 icse zurich
20120605 icse zurich20120605 icse zurich
20120605 icse zurichArian Zwegers
 
Neil Moore Delivery Management
Neil Moore   Delivery ManagementNeil Moore   Delivery Management
Neil Moore Delivery ManagementSocitm
 
National Digital Library and LLD
National Digital Library and LLDNational Digital Library and LLD
National Digital Library and LLDHansung University
 
Preparation, Proceed and Review of preservation of Digital Library
Preparation, Proceed and Review of preservation of Digital Library Preparation, Proceed and Review of preservation of Digital Library
Preparation, Proceed and Review of preservation of Digital Library Asheesh Kamal
 
Bringing Digital Curation to Architects to promote innovation in the Construc...
Bringing Digital Curation to Architects to promote innovation in the Construc...Bringing Digital Curation to Architects to promote innovation in the Construc...
Bringing Digital Curation to Architects to promote innovation in the Construc...Ruggero Lancia
 
Online Assignment - Digital Resources
Online Assignment - Digital ResourcesOnline Assignment - Digital Resources
Online Assignment - Digital Resourcessubhalekshmi2013
 
Preserving CAD: A briefing - Emerging trends - Architecture
Preserving CAD: A briefing - Emerging trends - Architecture Preserving CAD: A briefing - Emerging trends - Architecture
Preserving CAD: A briefing - Emerging trends - Architecture Ruggero Lancia
 
Content Digitization Services: Transforming Information for the Digital Age.pdf
Content Digitization Services: Transforming Information for the Digital Age.pdfContent Digitization Services: Transforming Information for the Digital Age.pdf
Content Digitization Services: Transforming Information for the Digital Age.pdfAcadecraft Pvt. Ltd.
 
Kreeo - collaborate for knowledge (Kreeo)
Kreeo - collaborate for knowledge (Kreeo)Kreeo - collaborate for knowledge (Kreeo)
Kreeo - collaborate for knowledge (Kreeo)ProductNation/iSPIRT
 
Defrag Keynote: Social Computing and the Enterprise-Bridging the Gap
Defrag Keynote: Social Computing and the Enterprise-Bridging the GapDefrag Keynote: Social Computing and the Enterprise-Bridging the Gap
Defrag Keynote: Social Computing and the Enterprise-Bridging the GapMark Koenig
 
Babak Hosseinzadeh IT Portfolio Management In Shared Services & CC
Babak Hosseinzadeh   IT Portfolio Management In Shared Services & CCBabak Hosseinzadeh   IT Portfolio Management In Shared Services & CC
Babak Hosseinzadeh IT Portfolio Management In Shared Services & CCBabak Hosseinzadeh
 
iDRM – Interoperability Mechanisms for Open Rights Management Platforms
iDRM – Interoperability Mechanisms for Open Rights Management PlatformsiDRM – Interoperability Mechanisms for Open Rights Management Platforms
iDRM – Interoperability Mechanisms for Open Rights Management PlatformsCarlos Serrao
 
Big Data Public Private Forum (BIG) @ European Data Forum 2013
Big Data Public Private Forum (BIG) @ European Data Forum 2013Big Data Public Private Forum (BIG) @ European Data Forum 2013
Big Data Public Private Forum (BIG) @ European Data Forum 2013Edward Curry
 
BBC DMI Project Info for Backstage (Part 1)
BBC DMI Project Info for Backstage (Part 1)BBC DMI Project Info for Backstage (Part 1)
BBC DMI Project Info for Backstage (Part 1)Ian Forrester
 
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...Cloudera, Inc.
 
(Apr 2009) Comparing Curricula for Digital Library and Digital Curation Educa...
(Apr 2009) Comparing Curricula for Digital Library and Digital Curation Educa...(Apr 2009) Comparing Curricula for Digital Library and Digital Curation Educa...
(Apr 2009) Comparing Curricula for Digital Library and Digital Curation Educa...Carolyn Hank
 
Higher education IAM-seminar Turku 10.12.2009
Higher education IAM-seminar Turku 10.12.2009Higher education IAM-seminar Turku 10.12.2009
Higher education IAM-seminar Turku 10.12.2009Kim Westerlund
 

Similar to Ldp Executive Slides (20)

Open Text And Timberland
Open Text And TimberlandOpen Text And Timberland
Open Text And Timberland
 
20120605 icse zurich
20120605 icse zurich20120605 icse zurich
20120605 icse zurich
 
Neil Moore Delivery Management
Neil Moore   Delivery ManagementNeil Moore   Delivery Management
Neil Moore Delivery Management
 
National Digital Library and LLD
National Digital Library and LLDNational Digital Library and LLD
National Digital Library and LLD
 
Preparation, Proceed and Review of preservation of Digital Library
Preparation, Proceed and Review of preservation of Digital Library Preparation, Proceed and Review of preservation of Digital Library
Preparation, Proceed and Review of preservation of Digital Library
 
Bringing Digital Curation to Architects to promote innovation in the Construc...
Bringing Digital Curation to Architects to promote innovation in the Construc...Bringing Digital Curation to Architects to promote innovation in the Construc...
Bringing Digital Curation to Architects to promote innovation in the Construc...
 
Online Assignment - Digital Resources
Online Assignment - Digital ResourcesOnline Assignment - Digital Resources
Online Assignment - Digital Resources
 
Preserving CAD: A briefing - Emerging trends - Architecture
Preserving CAD: A briefing - Emerging trends - Architecture Preserving CAD: A briefing - Emerging trends - Architecture
Preserving CAD: A briefing - Emerging trends - Architecture
 
Content Digitization Services: Transforming Information for the Digital Age.pdf
Content Digitization Services: Transforming Information for the Digital Age.pdfContent Digitization Services: Transforming Information for the Digital Age.pdf
Content Digitization Services: Transforming Information for the Digital Age.pdf
 
Kreeo - collaborate for knowledge (Kreeo)
Kreeo - collaborate for knowledge (Kreeo)Kreeo - collaborate for knowledge (Kreeo)
Kreeo - collaborate for knowledge (Kreeo)
 
Defrag Keynote: Social Computing and the Enterprise-Bridging the Gap
Defrag Keynote: Social Computing and the Enterprise-Bridging the GapDefrag Keynote: Social Computing and the Enterprise-Bridging the Gap
Defrag Keynote: Social Computing and the Enterprise-Bridging the Gap
 
Corrado -- Establishing the Landscape
Corrado -- Establishing the LandscapeCorrado -- Establishing the Landscape
Corrado -- Establishing the Landscape
 
Babak Hosseinzadeh IT Portfolio Management In Shared Services & CC
Babak Hosseinzadeh   IT Portfolio Management In Shared Services & CCBabak Hosseinzadeh   IT Portfolio Management In Shared Services & CC
Babak Hosseinzadeh IT Portfolio Management In Shared Services & CC
 
People not products
People not productsPeople not products
People not products
 
iDRM – Interoperability Mechanisms for Open Rights Management Platforms
iDRM – Interoperability Mechanisms for Open Rights Management PlatformsiDRM – Interoperability Mechanisms for Open Rights Management Platforms
iDRM – Interoperability Mechanisms for Open Rights Management Platforms
 
Big Data Public Private Forum (BIG) @ European Data Forum 2013
Big Data Public Private Forum (BIG) @ European Data Forum 2013Big Data Public Private Forum (BIG) @ European Data Forum 2013
Big Data Public Private Forum (BIG) @ European Data Forum 2013
 
BBC DMI Project Info for Backstage (Part 1)
BBC DMI Project Info for Backstage (Part 1)BBC DMI Project Info for Backstage (Part 1)
BBC DMI Project Info for Backstage (Part 1)
 
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...
Hadoop World 2011: Security Considerations for Hadoop Deployments - Jeremy Gl...
 
(Apr 2009) Comparing Curricula for Digital Library and Digital Curation Educa...
(Apr 2009) Comparing Curricula for Digital Library and Digital Curation Educa...(Apr 2009) Comparing Curricula for Digital Library and Digital Curation Educa...
(Apr 2009) Comparing Curricula for Digital Library and Digital Curation Educa...
 
Higher education IAM-seminar Turku 10.12.2009
Higher education IAM-seminar Turku 10.12.2009Higher education IAM-seminar Turku 10.12.2009
Higher education IAM-seminar Turku 10.12.2009
 

Recently uploaded

My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 

Recently uploaded (20)

My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 

Ldp Executive Slides

  • 1. Engineering What’s the Problem? 100-year Digital Preservation† Digital infrastructure in its infancy; paper infrastructure has been slowly refined for over 3000 years Most content starts in digital form. Society has demonstrated little commitment to digital preservation Some content is fundamentally digital. After a dozen years of archives ’ efforts, no existing digital People want to save some of it for the long term. preservation program can match Internet scales “[No] concise and well-developed strategy that represents the views What's needed to preserve bit-streams? of a broad community has yet emerged. Since 1989 at least twelv e What will be needed to interpret saved bit-streams? have been published. … we need to re-think how we are approaching research … [and] need to engage … researchers in What can be done to make the content trustworthy? this process, and especially those with a strong computing science and engineering background.” [Ross, ECDL 2007 keynote address] How can convenient software support be implemented? We sketch a solution to every technical challenge H.M. Gladney , Economics and Engineering for Preserving Digital Content submitted for publication, December , 2007. Draft available at http://home.pacbell.net/hgladney/LDPeng.pdf . † Adapted from a March 2006 U.S. NIST workshop presentation © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 1 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 2 Digital Archiving and Scope Limitations and Status Digital Preservation Digital Preservation h mitigation of the deleterious effects of technology obsolescence, media degradation, and “Archiving” consists of fading human memory —effects that might impair the value of digital object copies long after originals were • Exploiting digital library technology created. • Tailoring client services Technical components only (e.g., not addressing professional education needed). • Curating collections and catalogs Increment to available document and digital repository technology • Providing search services • Sharing with other institutions Preservation is a different topic than • Long-term preservation repository management These can be treated independently. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 3 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 4 Information Producers’ End Users’ Requirements Requirements What might a descendant want of content stored today? For any document preserved somewhere Citizens will want to preserve information without in the world, help asking anybody’ s permission, with convenient tools anybody’ and infrastructure: • To find and retrieve a copy of its representing bit-string; • To use the content as its producers intended; • To preserve any content, doing so that descendants • To decide whether the content is trustworthy; can use this content as specified in the prior slide; • To submit such content to repositories that reliably • To exploit embedded references to reliably identify and promise to save it. retrieve contextual objects recursively; and • To achieve all this without technical difficulties. L. Graham and P.T. Metaxas , quot;Of Course It's True; I Saw It on the Internetquot; Critical Thinki g in the Internet Era, n Comm. ACM 46(5), 70 -75, May 2003. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 5 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 6 © 2006, H.M. Gladney, HMG Consulting 1
  • 2. Scale of the Digital Repository Institutions’ Preservation Challenge Requirements # of archivists ~ 5000 and stable What technology will repository institutions want? In # of software engineers ~ 500,000 and stable addition to perfect world digital library technology, they >106 and growing # of information producers will want support for: >101 0 and growing # of digital objects • Continuing their day -to-day content management without What fraction is worth saving? Nobody knows! disruption; • Sharing content and metadata semi-automatically; Implication: existing cultural repositories cannot keep up • Scaling to immense numbers of large documents; and without radical change • Sharing preservation effort to avoid costs & burdens. Plausible increases in repository funding don’t change this Even if the above estimates are incorrect by a factor of 10 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 7 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 8 Implications of the Scaling TDO† Approach Challenge Divide and conquer Assume the worst and mitigate its potential effects Ø Preservation of a significant fraction of what’s what’ Repositories are lost. Institutions disappear. Ø created can be achieved only by: Clever rogues alter information feloniously. Ø Disgruntled or careless employees defeat procedures. • Automating everything possible; Ø The last guy with critical information is dead. • Shifting preservation work from archivists to Starting with a “worst case” solution, find least information creators; expensive means for simple, common cases. • Embedding preservation preparation into Focus on digital objects.æ existing document processing; and † TDO h Trustworthy Digital Object • Effective partnership of archivists with SW æ End users don’t care how archives work, just that they deliver what was stored! engineers and information producers. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 9 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 10 Digital Archiving Models, Representation Preservation: a special case of Communication Languages and Standards Information consumer cannot obtain producer’s explanations. Canonical representation for archiving? TDO (Trustworthy Digital Object) structure How to compress data? Not a digital preservation topic. How to manage interoperability? Current EDP standards. How to prepare content for archival (1) Add metadata that archivists want. submission. (2) Encode bit -strings to be durable. (3) Sign and seal content packages. Trustworthiness of archived information? Durable Evidence (this talk)? Role of standards in information packages? Starting point for Durable Encoding. OAIS packages and submission interfaces? TDO format (this talk)? Taxonomies, Thesauri and Ontologies Not a preservation topic. Talk about the users, not about repositories! Achieving Semantics Interoperability “the Holy Grail of knowledge management ” Distinguishing current needs is politically pragmatic. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 11 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 12 © 2006, H.M. Gladney, HMG Consulting 2
  • 3. OAIS Repository An Information Communication Model (the conventional focus)? Can describe transformations objectively, except for 0è1 and 9è10. What the information consumer receives should not depend on the transmission path from the information producer. (Implies an OAIS DIP to OAIS SIP relationship.) ? © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 13 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 14 Digital Object (or a Collection of Anything)? Part of the Context: Nested Digital Repositories Tightly bound package. Including metadata. Linked schema & standards. Nested versions. Using ternary relationships. Saving every important version. Recursive and grounded. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 15 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 16 Durable Evidence for Info. Packages Bit-string Durability: Replication (as for books)? UUIDs for digital objects, for material objects, and for object collections LOCKSS* successful, but embeds too much policy Avatar implementing rules for replication Relationships whose elements are identifiers and scalars Certificates with recursion grounded in a few institutions è Generalize Design rules for Share keys by personal connections replication All packaged with XML Design rules for access control Signed and sealed just as in the age-old practice (Japanese ukiyo-e)? *Reich & Rosenthal, LOCKSS: A Permanent Web Publishing and Access System , D-Lib Magazine 7(6), 2001. Gladney , Trustworthy … Objects: Evidence After Every Witness is Dead, ACM Trans. Info. Sys. 22(3), 406 -436, 2004. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 17 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 18 © 2006, H.M. Gladney, HMG Consulting 3
  • 4. Durable Encoding for Content Blobs Durable Encoding for Content Bit-strings Starting point is a few widely understood standards Ø Unicode/UTF, small XML core, BNF, ASN.1, identifier syntax Universal Virtual Machine (enhanced Turing Machine)? Ø Bit-addressed, any number of registers, condition codes Ø Segmented memory —some segments private, others shared Ø Input/output streams, Ø Still needs multiprogramming and real -time instructions Need one UVC emulator for each HW architecture Objective: render for human intelligibility and/or Need one UVC application for each file type (~1000 types)? Make available to be program input One UVC program for every file type (~1000 world- wide)? Don’t save anything irrelevant, e.g., O/S portions One Restore Application per future computer architecture Save the original file together with UVC application Problem with proprietary formats, e.g., MS Word files Lorie & vanDiessen, A Universal Virtual Computer for long-term preservation,IBM Research Report, RJ 10338, 2005. Gladney & Lorie Trustworthy … Digital Objects: Durable Encoding… , ACM Trans. Info. Sys. 23(3), 299-324, 2005. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 19 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 20 Properties of TDO Methodology Summary and Status Digital Preservation h mitigation of the deleterious effects Format migration (which is error-prone) is not required of technology obsolescence, media degradation, and Consumers can evaluate authenticity without human fading human memory . administrative help Metadata-to-object dissociation rare and detectable Every conceptual challenge answered; no criticism received. Engineering design is available. Insensitive to Internet security risks Ø Plug-in compatible with any digital library software package Links have authentication codes for reliability Ø Compatible with existing & emerging standards Implementable without disrupting existing repositories Conformance with regulatory requirements will be easy Next steps: building a prototype and running a pilot Implementation easy (based on XML and text editors)? Scalable © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 21 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 22 References Recommended Epistemological Texts Ludwig Wittgenstein, Tractatus Logico-Philosophicus , HMG, Preserving Digital Information, Springer Verlag, 2007. 1921; also Philosophical Investigations, 1953. HMG, Digital Preservation in a National Context: Questions Rudolf Carnap, Logical Structure of the World, 1926. and Views of an NDIIPP Outsider, D-Lib Mag. 13(1), 2007. Gilbert Ryle, The Concept of Mind, 1949. http://www.dlib.org/dlib/january07/gladney/01gladney.html S. Ross, Digital Preservation, Archival Science and Ernst Cassirer, The Problem of Knowledge: Philosophy, Methodological Foundations for Digital Libraries , 11th Science, and History Since Hegel, 1950. European Conference on Digital Libraries, 2007. Michael Polanyi, Personal Knowledge: Towards a Post- http://www.ecdl2007.org/Keynote_ECDL2007_SROSS.pdf Critical Philosophy, 1958. HMG, Economics and Engineering for Preserving Digital Willard Van Orman Quine, Word and Object, 1960. Content, to appear 2008. J. Alberto Coffa, The Semantic Tradition from Kant to http://home.pacbell.net/hgladney/LDPeng.pdf Carnap to the Vienna Station, 1991. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 23 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 24 © 2006, H.M. Gladney, HMG Consulting 4
  • 5. Backup Slides A Practical Definition for “Authentic” • Given a derivation statement R, “V is a copy of Y ( V=C(Y) )”, • a provenance statement S, “X said or created Y as part of event Z”, and • a copy function, “C(y) = Tn (… (T2( T1(y) ))), ” • we say that V is a derivative of Y if V is related to Y according to R. • We say that “by X as part of event Z” is a true provenance of V if R and S are true. • We say that V is sufficiently faithful to Y if C conforms to social conventions for the genre and for the circumstances at hand. • We say that V is an authentic copy of Y if it is a sufficiently faithful derivative with true provenance. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 25 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 26 Trustworthy Digital Object (TDO)? Essential & Accidental Information What ’s essential depends on what somebody specific wants to accomplish (information producer or consumer). Accidental content cannot be avoided. Imperfect knowledge capture is a fact of life; many examples in engineering specifications. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 27 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 28 Software Layering and Rules Administration Schema for a Digital Collection © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 29 © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 30 © 2006, H.M. Gladney, HMG Consulting 5
  • 6. Some Conclusions Preservation of manufacturing data does not present unique technical challenges è look at what members of other disciplines are writing and saying. But manufacturing does have specialized file formats and standar ds. Divide and conquer è treat preservation as content mgmt. increment. Focus on data structure first; later on data management. Be sensitive to the teachings of scientific philosophy. Age-old procedures for paper have digital counterparts. Big challenge: persuading information providers to write metadata We have an architecture, and are ready to start software design. © 2008, H.M. Gladney, HMG Consulting http://home.pacbell.net/hgladney/ 31 © 2006, H.M. Gladney, HMG Consulting 6