Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of Practice

Artefactual Systems - Archivematica
Artefactual Systems - ArchivematicaArtefactual Systems - Archivematica
Avoiding the 927 Problem:
Standards, Digital Preservation, and Communities of Practice
Dan Gillean
PASIG NYC 2016
October 26, 2016
What is a standard?
•A model or basis of comparison
•An agreed-upon set of characteristics,
definitions, and/or practices
•A minimum acceptable benchmark allowing for
quantitative or qualitative judgement
http://www.cas.edu/
De Jure vs De Facto
• “According to law,” “By right”
• Declared to be standards by an
authority
• Top-down distribution
• Can be formalized from de facto
standards; can become de facto
as well via adoption
• Generally open
• “In reality,” “As a matter of fact”
• Grow to be standards via
adoption
• Dependent on market or
community uptake
• Can become de jure standard
• Can be open or closed
De Jure De Facto
Open vs Proprietary
•Open can sometimes just refer to availability – royalty free
•Open source: community-driven, open exchange of ideas
•Open proprietary: Privately developed or owned but
freely available for implementation
•Closed proprietary: Privately developed/owned, must pay
licensing fee to implement
Standards
allow us to
communicate
across space
and time
https://pixabay.com/p-624054
Standards
allow us to
communicate
across space
and time
…but not to just anyonehttps://pixabay.com/p-624054
Communities of practice
•Shared craft, domain, or profession
•Shared common interest in improvement
•Established via mutual engagement, joint
enterprise, and shared repertoire
Crowd, by James Cridland. https://www.flickr.com/photos/jamescridland/613445810
Designated community:
• An identified group of potential
Consumers who should be able to
understand the preserved information
“Since a key purpose of an OAIS is to
preserve information for a Designated
Community, the OAIS must understand the
Knowledge Base of its Designated
Community to understand the minimum
Representation Information that must be
maintained.“ (p. 2-4)
Standards are only useful if
we use them
http://www.salon.com/2016/06/16/black_holes_are_colliding_scientists_confirm_ripples_in_spacetime_partner/
Standards are only useful if
we use them
https://commons.wikimedia.org/wiki/File:Snowflake_01.svg
Special!Special!
Special!
Standards are only useful if
we use them
The 927 problem:
https://xkcd.com/927/
https://en.wikipedia.org/wiki/Archive#/media/File:WikiXDC_
National_Archives_Tour_Hall_-_Stierch.jpg
Our standards
should be:
• Open
• Non-proprietary
• Widely adopted
• Evaluated by experts
• Endorsed by our community of
practice
• Agnostic and interoperable
ISO 14721:2002 ISO 16363:2012
ISO 14721 and 16363
ISO 14721
A reference model – not a
systems architecture!
https://wiki.archivematica.org/Overview
• Governance
• Organizational structure
• Staffing
• Procedural accountability
• Preservation policy framework
• Documentation
• Financial sustainability
• Security
ISO 16363
Reminds us that much of digital
preservation readiness is not technical
– it’s organizational
ISO 16363
??????
Meet Archivematica
https://www.archivematica.org
What is Archivematica?
Archivematica is a web-
and standards-based,
open-source application
which allows your
institution to preserve
long-term access to
trustworthy, authentic
and reliable digital
content.
Standards based
Open source
Customizable
Integrated w 3rd
party systems
Active community
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of Practice
PREMIS in METS XML
Archivematica AIP structure
Packaged according to BagIt specifications
Virus scan, normalization report, extraction log, etc
For browsing in Archivematica
Original + normalized
objects, submission
docs, original metadata
included at SIP creation
• Originally developed for exchange between
California Digital Library and Library of
Congress; specifications written up by IETF in
2008
• System agnostic, interoperable format for
storage and exchange
• “Bag and tag” approach: mandatory tag file
contains a manifest listing every file in the
payload together with its corresponding
checksum
BagIt
BagIt is a hierarchical file packaging format
designed to support disk-based or network-
based storage and transfer of arbitrary digital
content.
• It provides a wrapper for other metadata, such
as PREMIS and Dublin Core.
• It defines relationships between digital objects
and other digital objects, and between digital
objects and their metadata.
• It can be used to provide technical metadata
about digital objects (although Archivematica
doesn’t implement it that way: we wrap PREMIS
in it instead)
METS, or Metadata Encoding and
Transmission Standard, was designed to
support inter-repository data exchange.METS
• It captures technical information about an object in order
to support the implementation of preservation strategies
such as normalization, migration or emulation (PREMIS
Object)
• It describes relationships between digital objects (PREMIS
Object)
• It provides an audit trail of actions taken by the digital
preservation repository to preserve the object (PREMIS
Event)
• It names the individuals, organizations and software tools
responsible for taking actions to preserve digital objects
(PREMIS Agent)
• It specifies the actions a repository is allowed to take to
preserve digital objects (PREMIS Rights)
PREMIS
PREMIS, or Preservation Metadata
Implementation Strategies, is the
recognized standard for metadata
about objects in a digital
preservation system.
<mets:amdSec>
<mets:techMD>
PREMIS: OBJECT
<mets:rightsMD>
PREMIS: RIGHTS
<mets:digiprovMD>
PREMIS: EVENT
<mets:digiprovMD>
PREMIS: AGENT
PREMIS in METS
METS SECTIONS
<metsHdr> METS header
<dmdSec> Descriptive metadata
<amdSec> Administrative metadata
<fileSec> File section
<structMap> Structural Map
PREMIS in METS
<mets:amdSec ID="amdSec_1">
<mets:techMD ID="techMD_1">
<mets:mdWrap MDTYPE="PREMIS:OBJECT">
<mets:xmlData>
<premis:object xmlns:premis="info:lc/xmlns/premis-v2" xsi:type="premis:file"
xsi:schemaLocation="info:lc/xmlns/premis-v2
http://www.loc.gov/standards/premis/v2/premis-v2-2.xsd" version="2.2">
<premis:objectIdentifier>
<premis:objectIdentifierType>UUID</premis:objectIdentifierType>
<premis:objectIdentifierValue>bb52e3a0-2c5...</premis:objectIdentifierValue>
…etc
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of Practice
http://www.totallylocalvc.com/art-of-teamwork/
Interoperability
Consistency
Intelligibility
Collaboration
Exchange
Standards allow for…
Between agents of a
designated community
across space and time
dan@artefactual.com
Thanks!
Dan Gillean
PASIG NYC 2016
October 26, 2016
1 of 28

Recommended

Workshop slides - Introduction to AtoM and Archivematica by
Workshop slides - Introduction to AtoM and ArchivematicaWorkshop slides - Introduction to AtoM and Archivematica
Workshop slides - Introduction to AtoM and ArchivematicaArtefactual Systems - Archivematica
659 views30 slides
Archivematica presentation to SJSU iSchool Colloquia series by
Archivematica presentation to SJSU iSchool Colloquia seriesArchivematica presentation to SJSU iSchool Colloquia series
Archivematica presentation to SJSU iSchool Colloquia seriesArtefactual Systems - Archivematica
541 views10 slides
Digital Preservation with Archivematica: An Introduction by
Digital Preservation with Archivematica: An IntroductionDigital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An IntroductionArtefactual Systems - Archivematica
3.3K views71 slides
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser... by
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...Artefactual Systems - Archivematica
1K views40 slides
Archivematica Community Update - SAA 2016 by
Archivematica Community Update - SAA 2016Archivematica Community Update - SAA 2016
Archivematica Community Update - SAA 2016Artefactual Systems - Archivematica
408 views20 slides
Digital Preservation with Archivematica by
Digital Preservation with ArchivematicaDigital Preservation with Archivematica
Digital Preservation with ArchivematicaArtefactual Systems - Archivematica
632 views31 slides

More Related Content

What's hot

Archivematica and the digital archival chain of custody by
Archivematica and the digital archival chain of custodyArchivematica and the digital archival chain of custody
Archivematica and the digital archival chain of custodyArtefactual Systems - Archivematica
1.2K views39 slides
An Introduction to AtoM, Archivematica, and Artefactual Systems by
An Introduction to AtoM, Archivematica, and Artefactual SystemsAn Introduction to AtoM, Archivematica, and Artefactual Systems
An Introduction to AtoM, Archivematica, and Artefactual SystemsArtefactual Systems - AtoM
4.4K views57 slides
Introduction to Archivematica by
Introduction to ArchivematicaIntroduction to Archivematica
Introduction to ArchivematicaArtefactual Systems - Archivematica
764 views21 slides
Personal Digital Archiving 2015 - NYU - Workshop by
Personal Digital Archiving 2015 - NYU - WorkshopPersonal Digital Archiving 2015 - NYU - Workshop
Personal Digital Archiving 2015 - NYU - WorkshopArtefactual Systems - Archivematica
473 views31 slides
Report: Archivematica hosting in the cloud by
Report: Archivematica hosting in the cloudReport: Archivematica hosting in the cloud
Report: Archivematica hosting in the cloudArtefactual Systems - Archivematica
1.1K views10 slides

What's hot(20)

Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools... by Artefactual Systems - AtoM
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
UBC Library's Digital Preservation Strategy by UBC Library
UBC Library's Digital Preservation StrategyUBC Library's Digital Preservation Strategy
UBC Library's Digital Preservation Strategy
UBC Library3.1K views
COAR Next Generation Repositories Working Group by Paul Walk
COAR Next Generation Repositories Working GroupCOAR Next Generation Repositories Working Group
COAR Next Generation Repositories Working Group
Paul Walk1.4K views
DSpace Update from Open Repositories 2014 by Repository Fringe
DSpace Update from Open Repositories 2014DSpace Update from Open Repositories 2014
DSpace Update from Open Repositories 2014
Repository Fringe1.2K views
Rioxx 2 repository fringe by Paul Walk
Rioxx 2 repository fringeRioxx 2 repository fringe
Rioxx 2 repository fringe
Paul Walk1.7K views
State of the HydraSphere from Hydra Connect 3 (Sept 2015) by Tom-Cramer
State of the HydraSphere  from Hydra Connect 3 (Sept 2015)State of the HydraSphere  from Hydra Connect 3 (Sept 2015)
State of the HydraSphere from Hydra Connect 3 (Sept 2015)
Tom-Cramer3.4K views
Jabes 2008 - Conférence inaugurale, la grande révélation : penser les ressour... by ABES
Jabes 2008 - Conférence inaugurale, la grande révélation : penser les ressour...Jabes 2008 - Conférence inaugurale, la grande révélation : penser les ressour...
Jabes 2008 - Conférence inaugurale, la grande révélation : penser les ressour...
ABES112 views
Implementing RIOXX by Paul Walk
Implementing RIOXXImplementing RIOXX
Implementing RIOXX
Paul Walk921 views
ROTLD DNSSEC Implementation by Kevin Meynell
ROTLD DNSSEC ImplementationROTLD DNSSEC Implementation
ROTLD DNSSEC Implementation
Kevin Meynell128 views

Viewers also liked

Using and Developing with Open Source Digital Forensics Software in Digital A... by
Using and Developing with Open Source Digital Forensics Software in Digital A...Using and Developing with Open Source Digital Forensics Software in Digital A...
Using and Developing with Open Source Digital Forensics Software in Digital A...Mark Matienzo
4.4K views35 slides
Tackling File Characterization and Analysis in Archivematica by
Tackling File Characterization and Analysis in ArchivematicaTackling File Characterization and Analysis in Archivematica
Tackling File Characterization and Analysis in ArchivematicaCourtney Mumma
481 views15 slides
Accessioning-Based Metadata Extraction and Iterative Processing: Notes From t... by
Accessioning-Based Metadata Extraction and Iterative Processing: Notes From t...Accessioning-Based Metadata Extraction and Iterative Processing: Notes From t...
Accessioning-Based Metadata Extraction and Iterative Processing: Notes From t...Mark Matienzo
1.7K views11 slides
Seminar: OAIS Model application in digital preservation projects by
Seminar: OAIS Model application in digital preservation projectsSeminar: OAIS Model application in digital preservation projects
Seminar: OAIS Model application in digital preservation projectsMichael Day
2.4K views77 slides
One Core Preservation System for all your Data. No Exceptions! Marco Klindt a... by
One Core Preservation System for all your Data. No Exceptions! Marco Klindt a...One Core Preservation System for all your Data. No Exceptions! Marco Klindt a...
One Core Preservation System for all your Data. No Exceptions! Marco Klindt a...12th International Conference on Digital Preservation (iPRES 2015)
768 views43 slides
Financing Digital Preservation: Making digital preservation affordable - Valu... by
Financing Digital Preservation: Making digital preservation affordable - Valu...Financing Digital Preservation: Making digital preservation affordable - Valu...
Financing Digital Preservation: Making digital preservation affordable - Valu...Simon Tanner
859 views21 slides

Viewers also liked(20)

Using and Developing with Open Source Digital Forensics Software in Digital A... by Mark Matienzo
Using and Developing with Open Source Digital Forensics Software in Digital A...Using and Developing with Open Source Digital Forensics Software in Digital A...
Using and Developing with Open Source Digital Forensics Software in Digital A...
Mark Matienzo4.4K views
Tackling File Characterization and Analysis in Archivematica by Courtney Mumma
Tackling File Characterization and Analysis in ArchivematicaTackling File Characterization and Analysis in Archivematica
Tackling File Characterization and Analysis in Archivematica
Courtney Mumma481 views
Accessioning-Based Metadata Extraction and Iterative Processing: Notes From t... by Mark Matienzo
Accessioning-Based Metadata Extraction and Iterative Processing: Notes From t...Accessioning-Based Metadata Extraction and Iterative Processing: Notes From t...
Accessioning-Based Metadata Extraction and Iterative Processing: Notes From t...
Mark Matienzo1.7K views
Seminar: OAIS Model application in digital preservation projects by Michael Day
Seminar: OAIS Model application in digital preservation projectsSeminar: OAIS Model application in digital preservation projects
Seminar: OAIS Model application in digital preservation projects
Michael Day2.4K views
Financing Digital Preservation: Making digital preservation affordable - Valu... by Simon Tanner
Financing Digital Preservation: Making digital preservation affordable - Valu...Financing Digital Preservation: Making digital preservation affordable - Valu...
Financing Digital Preservation: Making digital preservation affordable - Valu...
Simon Tanner859 views
Lotar 101 Overview Current Jan 2009 by Rick Zuray
Lotar 101 Overview Current Jan 2009Lotar 101 Overview Current Jan 2009
Lotar 101 Overview Current Jan 2009
Rick Zuray1.5K views
POWRR Tools: Lessons learned from an IMLS National Leadership Grant by Lynne Thomas
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
Lynne Thomas1K views
Processing at the University of Michigan Bentley Historical Library by mikeum
Processing at the University of Michigan Bentley Historical LibraryProcessing at the University of Michigan Bentley Historical Library
Processing at the University of Michigan Bentley Historical Library
mikeum1.1K views
Digital Preservation Best Practices: Lessons Learned From Across the Pond by Benoit Pauwels
Digital Preservation Best Practices: Lessons Learned From Across the PondDigital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the Pond
Benoit Pauwels2.4K views
The lifecycle of a short story by Lynne Thomas
The lifecycle of a short storyThe lifecycle of a short story
The lifecycle of a short story
Lynne Thomas265 views
Mapping the Digital Preservation Wilderness: What you need to know by Jody DeRidder
Mapping the Digital Preservation Wilderness:  What you need to knowMapping the Digital Preservation Wilderness:  What you need to know
Mapping the Digital Preservation Wilderness: What you need to know
Jody DeRidder520 views

Similar to Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of Practice

ERA CoBioTech Data Management Webinar by
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarFAIRDOM
520 views40 slides
Collins, Hammer, Jones, and Lagace "NISO Update: Interoperability of Systems:... by
Collins, Hammer, Jones, and Lagace "NISO Update: Interoperability of Systems:...Collins, Hammer, Jones, and Lagace "NISO Update: Interoperability of Systems:...
Collins, Hammer, Jones, and Lagace "NISO Update: Interoperability of Systems:...National Information Standards Organization (NISO)
82 views50 slides
Data accessibilityandchallenges by
Data accessibilityandchallengesData accessibilityandchallenges
Data accessibilityandchallengesjyotikhadake
159 views33 slides
Cloud - NDT - Presentation by
Cloud - NDT - PresentationCloud - NDT - Presentation
Cloud - NDT - PresentationÉric Dusablon
251 views25 slides
Competency framework: engineers, statisticians, data scientists, librarians, ... by
Competency framework: engineers, statisticians, data scientists, librarians, ...Competency framework: engineers, statisticians, data scientists, librarians, ...
Competency framework: engineers, statisticians, data scientists, librarians, ...African Open Science Platform
382 views24 slides
Introduction to Digital Humanities: Metadata standards and ontologies by
Introduction to Digital Humanities: Metadata standards and ontologies Introduction to Digital Humanities: Metadata standards and ontologies
Introduction to Digital Humanities: Metadata standards and ontologies LIBIS
1K views100 slides

Similar to Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of Practice(20)

ERA CoBioTech Data Management Webinar by FAIRDOM
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management Webinar
FAIRDOM520 views
Data accessibilityandchallenges by jyotikhadake
Data accessibilityandchallengesData accessibilityandchallenges
Data accessibilityandchallenges
jyotikhadake159 views
Introduction to Digital Humanities: Metadata standards and ontologies by LIBIS
Introduction to Digital Humanities: Metadata standards and ontologies Introduction to Digital Humanities: Metadata standards and ontologies
Introduction to Digital Humanities: Metadata standards and ontologies
LIBIS1K views
What Do Records Managers Need to Know About Open Source, Open Standards, Open... by Cheryl McKinnon
What Do Records Managers Need to Know About Open Source, Open Standards, Open...What Do Records Managers Need to Know About Open Source, Open Standards, Open...
What Do Records Managers Need to Know About Open Source, Open Standards, Open...
Cheryl McKinnon934 views
Making DMPs actionable and public by Stephanie Simms
Making DMPs actionable and publicMaking DMPs actionable and public
Making DMPs actionable and public
Stephanie Simms3.5K views
FAIRDOM data management support for ERACoBioTech Proposals by FAIRDOM
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM1.2K views
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data by Storage Switzerland
Webinar: End NAS Sprawl - Gain Control Over Unstructured DataWebinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De... by Blackboard APAC
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...
Blackboard APAC1.1K views
Building blocks for success: criteria for trusted institutional repositories by Ina Smith
Building blocks for success: criteria for trusted institutional repositoriesBuilding blocks for success: criteria for trusted institutional repositories
Building blocks for success: criteria for trusted institutional repositories
Ina Smith451 views
The most trusted, proven enterprise-class Cloud:Closer than you think by Uni Systems S.M.S.A.
The most trusted, proven enterprise-class Cloud:Closer than you think The most trusted, proven enterprise-class Cloud:Closer than you think
The most trusted, proven enterprise-class Cloud:Closer than you think
FAIRy stories: tales from building the FAIR Research Commons by Carole Goble
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research Commons
Carole Goble1.4K views
The state of global research data initiatives: observations from a life on th... by Projeto RCAAP
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
Projeto RCAAP154 views
FAIRsharing - ENVRI-FAIR Webinar by Peter McQuilton
FAIRsharing - ENVRI-FAIR WebinarFAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR Webinar
Peter McQuilton246 views
Data Description Registry Interoperability WG at Research Data Alliance Third... by amiraryani
Data Description Registry Interoperability WG at Research Data Alliance Third...Data Description Registry Interoperability WG at Research Data Alliance Third...
Data Description Registry Interoperability WG at Research Data Alliance Third...
amiraryani468 views
DSpace-CRIS Workshop OR2015: Slides by Andrea Bollini
DSpace-CRIS Workshop OR2015: SlidesDSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: Slides
Andrea Bollini2.9K views

More from Artefactual Systems - Archivematica

Automation tools: making things go... (March 2019) by
Automation tools: making things go... (March 2019)Automation tools: making things go... (March 2019)
Automation tools: making things go... (March 2019)Artefactual Systems - Archivematica
573 views23 slides
Acts of maintenance by
Acts of maintenanceActs of maintenance
Acts of maintenanceArtefactual Systems - Archivematica
178 views24 slides
Archivematica Community Profile: University of Texas, San Antonio by Julianna... by
Archivematica Community Profile: University of Texas, San Antonio by Julianna...Archivematica Community Profile: University of Texas, San Antonio by Julianna...
Archivematica Community Profile: University of Texas, San Antonio by Julianna...Artefactual Systems - Archivematica
157 views16 slides
Archivematica Community Profile: University of Houston by Bethany Scott by
Archivematica Community Profile: University of Houston by Bethany ScottArchivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany ScottArtefactual Systems - Archivematica
242 views8 slides
Archivematica Technical Training Diagnostics Guide (September 2018) by
Archivematica Technical Training Diagnostics Guide (September 2018)Archivematica Technical Training Diagnostics Guide (September 2018)
Archivematica Technical Training Diagnostics Guide (September 2018)Artefactual Systems - Archivematica
546 views29 slides
Introduction to the Archivematica API (September 2018) by
Introduction to the Archivematica API (September 2018)Introduction to the Archivematica API (September 2018)
Introduction to the Archivematica API (September 2018)Artefactual Systems - Archivematica
745 views40 slides

More from Artefactual Systems - Archivematica(11)

Recently uploaded

Case Study Copenhagen Energy and Business Central.pdf by
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdfAitana
16 views3 slides
Mini-Track: Challenges to Network Automation Adoption by
Mini-Track: Challenges to Network Automation AdoptionMini-Track: Challenges to Network Automation Adoption
Mini-Track: Challenges to Network Automation AdoptionNetwork Automation Forum
13 views27 slides
The Research Portal of Catalonia: Growing more (information) & more (services) by
The Research Portal of Catalonia: Growing more (information) & more (services)The Research Portal of Catalonia: Growing more (information) & more (services)
The Research Portal of Catalonia: Growing more (information) & more (services)CSUC - Consorci de Serveis Universitaris de Catalunya
80 views25 slides
Melek BEN MAHMOUD.pdf by
Melek BEN MAHMOUD.pdfMelek BEN MAHMOUD.pdf
Melek BEN MAHMOUD.pdfMelekBenMahmoud
14 views1 slide
PharoJS - Zürich Smalltalk Group Meetup November 2023 by
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023Noury Bouraqadi
132 views17 slides
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...Jasper Oosterveld
19 views49 slides

Recently uploaded(20)

Case Study Copenhagen Energy and Business Central.pdf by Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana16 views
PharoJS - Zürich Smalltalk Group Meetup November 2023 by Noury Bouraqadi
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023
Noury Bouraqadi132 views
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by Jasper Oosterveld
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N... by James Anderson
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
James Anderson92 views
"Running students' code in isolation. The hard way", Yurii Holiuk by Fwdays
"Running students' code in isolation. The hard way", Yurii Holiuk "Running students' code in isolation. The hard way", Yurii Holiuk
"Running students' code in isolation. The hard way", Yurii Holiuk
Fwdays17 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software280 views
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc11 views
Voice Logger - Telephony Integration Solution at Aegis by Nirmal Sharma
Voice Logger - Telephony Integration Solution at AegisVoice Logger - Telephony Integration Solution at Aegis
Voice Logger - Telephony Integration Solution at Aegis
Nirmal Sharma39 views
SAP Automation Using Bar Code and FIORI.pdf by Virendra Rai, PMP
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdf
Business Analyst Series 2023 - Week 3 Session 5 by DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10300 views
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院 by IttrainingIttraining
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by Bernd Ruecker
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
Bernd Ruecker40 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive

Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of Practice

  • 1. Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of Practice Dan Gillean PASIG NYC 2016 October 26, 2016
  • 2. What is a standard? •A model or basis of comparison •An agreed-upon set of characteristics, definitions, and/or practices •A minimum acceptable benchmark allowing for quantitative or qualitative judgement http://www.cas.edu/
  • 3. De Jure vs De Facto • “According to law,” “By right” • Declared to be standards by an authority • Top-down distribution • Can be formalized from de facto standards; can become de facto as well via adoption • Generally open • “In reality,” “As a matter of fact” • Grow to be standards via adoption • Dependent on market or community uptake • Can become de jure standard • Can be open or closed De Jure De Facto
  • 4. Open vs Proprietary •Open can sometimes just refer to availability – royalty free •Open source: community-driven, open exchange of ideas •Open proprietary: Privately developed or owned but freely available for implementation •Closed proprietary: Privately developed/owned, must pay licensing fee to implement
  • 5. Standards allow us to communicate across space and time https://pixabay.com/p-624054
  • 6. Standards allow us to communicate across space and time …but not to just anyonehttps://pixabay.com/p-624054
  • 7. Communities of practice •Shared craft, domain, or profession •Shared common interest in improvement •Established via mutual engagement, joint enterprise, and shared repertoire Crowd, by James Cridland. https://www.flickr.com/photos/jamescridland/613445810
  • 8. Designated community: • An identified group of potential Consumers who should be able to understand the preserved information “Since a key purpose of an OAIS is to preserve information for a Designated Community, the OAIS must understand the Knowledge Base of its Designated Community to understand the minimum Representation Information that must be maintained.“ (p. 2-4)
  • 9. Standards are only useful if we use them http://www.salon.com/2016/06/16/black_holes_are_colliding_scientists_confirm_ripples_in_spacetime_partner/
  • 10. Standards are only useful if we use them https://commons.wikimedia.org/wiki/File:Snowflake_01.svg Special!Special! Special!
  • 11. Standards are only useful if we use them The 927 problem: https://xkcd.com/927/
  • 12. https://en.wikipedia.org/wiki/Archive#/media/File:WikiXDC_ National_Archives_Tour_Hall_-_Stierch.jpg Our standards should be: • Open • Non-proprietary • Widely adopted • Evaluated by experts • Endorsed by our community of practice • Agnostic and interoperable
  • 13. ISO 14721:2002 ISO 16363:2012 ISO 14721 and 16363
  • 14. ISO 14721 A reference model – not a systems architecture! https://wiki.archivematica.org/Overview
  • 15. • Governance • Organizational structure • Staffing • Procedural accountability • Preservation policy framework • Documentation • Financial sustainability • Security ISO 16363 Reminds us that much of digital preservation readiness is not technical – it’s organizational
  • 18. What is Archivematica? Archivematica is a web- and standards-based, open-source application which allows your institution to preserve long-term access to trustworthy, authentic and reliable digital content. Standards based Open source Customizable Integrated w 3rd party systems Active community
  • 20. PREMIS in METS XML Archivematica AIP structure Packaged according to BagIt specifications Virus scan, normalization report, extraction log, etc For browsing in Archivematica Original + normalized objects, submission docs, original metadata included at SIP creation
  • 21. • Originally developed for exchange between California Digital Library and Library of Congress; specifications written up by IETF in 2008 • System agnostic, interoperable format for storage and exchange • “Bag and tag” approach: mandatory tag file contains a manifest listing every file in the payload together with its corresponding checksum BagIt BagIt is a hierarchical file packaging format designed to support disk-based or network- based storage and transfer of arbitrary digital content.
  • 22. • It provides a wrapper for other metadata, such as PREMIS and Dublin Core. • It defines relationships between digital objects and other digital objects, and between digital objects and their metadata. • It can be used to provide technical metadata about digital objects (although Archivematica doesn’t implement it that way: we wrap PREMIS in it instead) METS, or Metadata Encoding and Transmission Standard, was designed to support inter-repository data exchange.METS
  • 23. • It captures technical information about an object in order to support the implementation of preservation strategies such as normalization, migration or emulation (PREMIS Object) • It describes relationships between digital objects (PREMIS Object) • It provides an audit trail of actions taken by the digital preservation repository to preserve the object (PREMIS Event) • It names the individuals, organizations and software tools responsible for taking actions to preserve digital objects (PREMIS Agent) • It specifies the actions a repository is allowed to take to preserve digital objects (PREMIS Rights) PREMIS PREMIS, or Preservation Metadata Implementation Strategies, is the recognized standard for metadata about objects in a digital preservation system.
  • 24. <mets:amdSec> <mets:techMD> PREMIS: OBJECT <mets:rightsMD> PREMIS: RIGHTS <mets:digiprovMD> PREMIS: EVENT <mets:digiprovMD> PREMIS: AGENT PREMIS in METS METS SECTIONS <metsHdr> METS header <dmdSec> Descriptive metadata <amdSec> Administrative metadata <fileSec> File section <structMap> Structural Map
  • 25. PREMIS in METS <mets:amdSec ID="amdSec_1"> <mets:techMD ID="techMD_1"> <mets:mdWrap MDTYPE="PREMIS:OBJECT"> <mets:xmlData> <premis:object xmlns:premis="info:lc/xmlns/premis-v2" xsi:type="premis:file" xsi:schemaLocation="info:lc/xmlns/premis-v2 http://www.loc.gov/standards/premis/v2/premis-v2-2.xsd" version="2.2"> <premis:objectIdentifier> <premis:objectIdentifierType>UUID</premis:objectIdentifierType> <premis:objectIdentifierValue>bb52e3a0-2c5...</premis:objectIdentifierValue> …etc

Editor's Notes

  1. It’s difficult to come up with a broad definition of standards without straying into the uselessly general, but essentially, standards give us a means and method for evaluation, comparison, and use. They are a descriptive declaration of a set of features or characteristics, with which we can measure an implementation.
  2. De Jure example: ISO 8601, the International standard for date and time representations (YYYY-MM-DD). Its purpose is to provide an unambiguous and well-defined method of representing dates and times, especially in an international context where national and local conventions may vary greatly. De Facto example: VHS format for videotape recorders, which won out over Betamax not because it was a better specification, but thanks to broader market adoption.
  3. Open Source example: PCDM is quickly becoming a de facto, community-driven standard for Hydra implementers EXAMPLE of evolution: development of PDF format as a way to share documents with embedded fonts and images across diverse computer platforms in the early 1990’s. Developed first internally at Adobe as a closed proprietary standard, it was quickly released as an open proprietary standard in 1993. Through wide adoption, it became a de facto standard throughout the late 90’s and 2000’s. In 2008 it was formally released as an open standard, and adopted as ISO 32000-1:2008, making it an open de jure standard.
  4. Within the practice of digital preservation, this is most useful for considering its implications across space and time – standards provide a method of contextualizing and interpreting our data and our metadata so it can be understood and used by others.
  5. But let’s not forget that standards are not a Rosetta stone – they often come rife with presuppositions about the knowledge base of the reader. Digital preservation is a complex field full of jargon, and concepts requiring time and training to acquire. So to whom exactly do our standards communicate across space and time, then?
  6. I find it useful to think of the utility of standards within the framework of a community of practice. Originally coined as a pedagogical and social anthropology term, a community of practice refers to a group of people united via a shared craft, domain, or profession, with common goals and an interest in improvement. The term was first popularized by Jean Lave and Etienne Wagner, but it is useful to consider digital preservation as a domain bounded by a community of practice – and to conceptualize our standards as both an expression of this community, and an outcome of its shared goals.
  7. In fact, in one of the touchstone standards of our field – the OAIS reference model, now recognized as ISO 14721 – we frame the long-term goals and intelligibility of our digital preservation efforts within a concept very similar to a community of practice. The reference model speaks of a “Designated Community”: those to whom the preserved information should remain understandable, based on their presupposed knowledge base. Standards - being a useful tool for evaluation, comparison, and use - therefore comprise a key part of the knowledge base that we will require to make the information we preserve accessible and comprehensible in the future.
  8. All of this comes down to stating the obvious – Standards are only useful to a community of practice if we use them – correctly, and consistently, across time and space. Failing to do so can be akin to relegating our materials to a black hole – without the proper context and framework to interpret and evaluate the preserved information, how can we guarantee the information will be accessible and intelligible in the future?
  9. There are many reasons why digital preservation standards might NOT be used. At the institution level, one common culprit is the special snowflake effect: “our records, our workflows, our needs are so unique and specialized, the existing standards cannot possibly meet our needs.” This can lead to custom metadata profiles and bespoke systems. More knowledge required for preservation becomes siloed to specific individuals or systems; the burden of documentation is higher, and the efforts required to migrate environments or share access across institutional boundaries become increasingly challenging.
  10. Within our community of practice however, we can sometimes make perfect the enemy of the good – or good enough. This is the 927 problem: seeking the magic bullet format, technology, or standard that will supersede all previous efforts and bring about a golden age of universal adoption. The 927 problem is the reinvention of the wheel, over and over again, sometimes at the expense of previous efforts. It can often mean just adding one more option to a crowded field, and further bisecting our efforts along parallel but separate paths. So what standards should we be using then? How can we evaluate?
  11. If we expect our standards to be available for use and evaluation in the future, we should choose open standards – favoring openness is digital preservation best practice, from standards to formats to tools, and so on. Ideally they will be non-proprietary as well, to ensure they remain open. A standard need not be De Jure for it to be used in digital preservation, but we want to ensure that we’ve brought our collective expertise to bear in evaluating its validity and utility towards achieving our stated goals of long term preservation and access. And finally, no standard we adopt should force us to use a single tool or platform. With these criteria in mind, let’s take a look at just a few standards we can use in service of the shared goals of our community of practice.
  12. The starting point for any digital preservation standardization has become the OAIS Reference Model, AKA ISO 14721, and TRAC, or ISO 16363. Both OAIS and TRAC have become De Jure ISO standards with widespread adoption in our community of practice. OAIS provides us with a reference model for the functions and activities we need to consider for creating a comprehensive digital preservation environment, while TRAC supplements this with a series of metrics and requirements for auditing, monitoring, and evaluation needed to achieve full OAIS compliance.
  13. It’s worth quickly noting that OAIS is NOT a systems architecture, but a conceptual model. In practice, it is highly unlikely and possibly even undesirable to consider your preservation environment as a single monolithic system. Instead, it will be many different tools, platforms, and locations, each able to do one job well instead of many tasks poorly.
  14. Equally important to note is that digital preservation is not all tools and systems – much of it is organizational, covering internal policies and procedures, workflow documentation and accountability chains, mission statements, budgeting, staffing and succession planning. Regardless of your resources or the technical expertise you have in-house, considering and prioritizing these important aspects means that you can start working on digital preservation today.
  15. Section 4 of TRAC is where things start getting really technical, and where many institutions shake their head and defer action. How should we document every action taken during the preservation process? How should we capture all agents, both human and machine, involved in the process? What do we use to extract all the necessary technical, administrative, and preservation metadata, and how can we encode this in a standards-based way for it to be reusable and interoperable?
  16. I’m going to talk about a few standards that can help you do just that, in the context of Archivematica.
  17. Archivematica is an open-source digital preservation system that attempts to support standards-based workflows and outputs. Think of it as a standards-based sausage maker for generating what the OAIS reference model refers to as SIPs, AIPs, and DIPs – you provide the content to be preserved (the filling), implement format policies based on your institutional needs and Archivematica will add the standards based “casing”, generating administrative, technical, and preservation metadata that is platform independent and storage agnostic.
  18. Archivematica’s web based dashboard was designed with the OAIS reference model in mind. There are no magical turnkey solutions to digital preservation (and you should be wary of anyone promising such), but Archivematica can help cover some of the more technical aspects of creating your preservation workflow in a standards-based manner.
  19. Here’s a brief overview of an Archivematica Archival Information Package, or AIP. We package all AIPs according to the Library of Congress BagIT specification, and capture all relevant technical, administrative, preservation, and descriptive metadata using PREMIS, embedded in the METS XML included with each AIP and DIP, or Dissemination Information Package. The AIP is platform agnostic and interoperable – you choose your repository environment for long-term storage. There’s nothing about Archivematica’s AIPs or DIPs that requires Archivematica to open them in the future. Let’s quickly look a bit closer at each of these standards I’ve referenced.
  20. Here’s a quick look at how we embed PREMIS in METS. METS provides us with different sections for descriptive and administrative metadata, and within the administrative section we can embed PREMIS objects, rights, events, and agents.
  21. Here we can see an example of how a PREMIS object-level metadata is nested in the METS techMD. We simply use a wrapping element in METS, declare the standard used, and embed the PREMIS XML inside of it.
  22. Here we have a real example of a PREMIS normalization event as captured in Archivematica’s METS XML – we capture information about the format policy used, the type of event, the tool used, the outcome of the action, and the agents involved the in the event.