Preserving Digital Public Television: A Status Report
Upcoming SlideShare
Loading in...5
×
 

Preserving Digital Public Television: A Status Report

on

  • 1,682 views

Some findings from the Preserving Digital Public Television Project as it enters its final month. The project was a collaboration between WNET, WGBH, PBS, and NYU as part of the Library of Congress's ...

Some findings from the Preserving Digital Public Television Project as it enters its final month. The project was a collaboration between WNET, WGBH, PBS, and NYU as part of the Library of Congress's National Digital Information Infrastructure and Preservation Program (NDIIPP). This talk was prepared for the Society of Motion Picture and Television Engineers NY Section Meeting, February 24, 2010.

Statistics

Views

Total Views
1,682
Views on SlideShare
1,671
Embed Views
11

Actions

Likes
1
Downloads
36
Comments
0

2 Embeds 11

http://www.slideshare.net 8
http://www.linkedin.com 3

Accessibility

Categories

Upload Details

Uploaded via as Apple Keynote

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • <br />
  • Early context for the project was that public television was supposed to depositing one copy of all programs at LC. That hasn&#x2019;t been happening. <br />
  • WNET and WGBH combined produce about 60% of nationally broadcast programming in the US. But they also produce and distribute local programming. PBS is only a distributor - not a content creator. NYU had expertise in building digital libraries (which public broadcasting did not) and an existing PR. LC is the funder, but also a repository. <br />
  • Why are we doing this? <br /> <br /> - Jacques Cousteau story. <br />
  • <br />
  • <br />
  • We&#x2019;re not dealing with digitizing things. <br />
  • <br />
  • <br />
  • <br />
  • An archive in this sense is not a server. It is processes, procedures, people with a mission of preservation, for which it is responsible into the indefinite future. <br />
  • <br />
  • &#x201C;The PR is designed as a set of loosely-coupled components communicating over stable interfaces.&#x201D; <br /> Storage Resource Broker = supports shared collections that can be distributed across multiple organizations and heterogeneous storage systems. Can be used as a Data Grid Management System. <br /> Dspace = DSpace open source software, used for Archival Storage, Data Management, Dissemenation functions of OAIS <br /> <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • These steps need just to process any one of those SIPs. These were the basic steps, but they were slightly different for each SIP class because of different metadata, different file formats, different PREMIS. <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • Processing is the same for both SIP classes (production masters and source files). <br />
  • <br />
  • <br />
  • The &#x201C;p&#x201D; word was never used. It actually made sense to make changes during transition, and no practices were too entrenched yet, but they didn&#x2019;t want to say it was for archival reasons, just that this was the way it was going to be done. <br /> <br />
  • <br />
  • Settings: Frame Size (1080i), Aspect Ratio (16/9), Frame Rate (29.97), Data Rate (117 mbps) <br />
  • <br />
  • Including technical & descriptive metadata <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • <br />
  • Consider a grid or distributed but federated system <br />
  • Combined with local storage for most frequently accessed materials with Grid solutions. Make sure your approached is managed. Take a look at the AVATARm project in the UK for more info. <br />
  • <br />
  • One word. You are going to need a lot of it. Video files are not self describing. Filenames are not good for search and retrieval, file-level metadata is not searchable. <br />
  • If there are no cataloging rules for descriptive metadata, everyone will input differently. Combine cataloging rules with controlled vocabularies. <br />
  • <br />
  • Because: you won&#x2019;t have to reinvent the wheel (elements, definitions, vocab), facilitates exchange, there is a support community. <br />
  • A few things about file formats: There is still no standard format for video preservation, especially for born digital, because it is born compressed. The most important thing to do is choose an open, widely support encoding format, that can be used in all systems in your core business processes without transcoding. MXF or QT (FCP) containers. <br />
  • <br />
  • <br />
  • Preservation does not happen in a vacuum. There must be ongoing commitment, funds, staffing, reviewed and updated policies and procedures, etc. <br />
  • <br />
  • <br />
  • <br />

Preserving Digital Public Television: A Status Report Preserving Digital Public Television: A Status Report Presentation Transcript

  • PRESERVING DIGITAL PUBLIC TELEVISION Part of the NDIIPP Program of the Library of Congress A STATUS REPORT Kara Van Malssen Senior Research Scholar & Metadata Specialist New York University February 24, 2010 SYMPTE-NY Section Meeting
  • NDIIPP = National Digital Information Infrastructure and Preservation Program of the Library of Congress www.digitalpreservation.gov Image by Smiley Man with a Hat via Flickr http://www.flickr.com/photos/smileymanwithahat/2477365291/
  • PRESERVING DIGITAL PUBLIC TELEVISION PARTNERS WNET WGBH NYU PBS Library of Congress
  • by massdistraction via Flickr http://www.flickr.com/photos/sharynmorrow/3718174646/in/set-72157621271414097/ DIGITAL ARCHAEOLOGY?
  • Design and build an PDPTV OAIS-compliant preservation repository GOALS for born digital public television
  • Implement and PDPTV recommend standards for metadata, wrapper GOALS and encoding formats, production workflow practices
  • PDPTV Recommend selection criteria for long-term GOALS retention
  • PDPTV Examine and recommend strategies for long term GOALS sustainability
  • “ An OAIS is an archive, consisting of an organization of people and systems, that has accepted the responsibility to preserve information and make it available for a designated community.... this distinguishes it “ from other uses of the term ‘archive.’ - Reference Model for an Open Archival Information System, ISO 14721:2003
  • PRESERVATION PLANNING DESCRIPTIVE DESCRIPTIVE P INFO INFO C R DATA O O MANAGEMENT queries N D S result sets U SIP INGEST ACCESS orders U C M ARCHIVAL E E STORAGE DIP R AIP R AIP ADMINISTRATION MANAGEMENT OAIS Functional Model
  • TECHNOLOGIES { some of the REPOSITORY PROJECT SPECIFIC CODE
  • } METADATA MODEL
  • by bredgur via Flickr: http://www.flickr.com/photos/bredgur/1323025528/ PHASE 2006-2008
  • SUBMISSION wnet NYU wgbh repository pbs
  • SUBMISSION wnet NYU wgbh repository pbs SIP Class A
  • SUBMISSION SD wnet ESS EN CE A NYU wgbh repository pbs SIP Class A
  • SUBMISSION SD wnet ES ME SEN TA CE DA TA A 1 NYU wgbh repository pbs SIP Class A
  • SUBMISSION SD wnet ES ME SEN TA CE DA TA A 1 NYU wgbh repository C EC EN E SS SD pbs SIP Class A
  • SUBMISSION SD wnet ES ME SEN TA CE DA TA A 1 NYU wgbh ATA2 repository AD CE C ET EN M E SS SD pbs SIP Class A
  • SUBMISSION SD wnet ES ME SEN TA CE DA TA B A 1 NYU wgbh ATA2 repository AD CE C ET EN M E SS SD pbs SIP Class B
  • SUBMISSION SD wnet ES ME SEN TA CE DA TA A 1 NYU wgbh AT A2 repository AD CE C ET EN M E SS ED D NC S SE ES pbs HD SIP Class C
  • SUBMISSION wnet NYU wgbh ATA2 repository AD CE C ET EN M E SS SD pbs SIP Class D
  • SUBMISSION wnet SD ESSENCE E METADATA 3 NYU wgbh ATA2 repository AD CE C ET EN M E SS SD pbs SIP Class D
  • 1. Aggregate content 2. Normalize filenames * 3. Aggregate & map descriptive metadata to PBCore * 4. Extract Technical Metadata 5. Map Technical Metadata to PBCore * 6. Generate PBCore * 7. Hunt down creating app info 8. Determine playback reqs * 9. Generate PREMIS * 10. Generate checksums 11. Generate METS * REPOSITORY 12. INGEST INGEST TASKS
  • by kapoue via Flickr http://www.flickr.com/photos/kapoue/2563697039/ PHASE 2008-2009
  • SUBMISSION NYU wnet repository SIP Class A
  • SUBMISSION HD ESSENCE NYU wnet repository SIP Class A
  • SUBMISSION HD ESSENCE PBCORE METADATA NYU wnet repository SIP Class A
  • SUBMISSION NYU wnet repository SIP Class B
  • SUBMISSION NYU wnet SOURCE FILES repository SIP Class B
  • SUBMISSION PBCORE METADATA NYU wnet SOURCE FILES repository SIP Class B
  • 1. Normalize filenames 2. Generate PREMIS 3. Generate METS 4. Validate checksum 5. INGEST REPOSITORY INGEST TASKS
  • PHASE 1 PHASE 2 12 5 processing steps processing steps
  • ? What changed
  • [preservation-ready] FILE-BASED WORKFLOW by Brian Daniel Eisenberg via Flickr http://www.flickr.com/photos/pplpwrd/2673102206/
  • File & Folder naming conventions for production and post- production by drpritch via Flickr http://www.flickr.com/photos/drpritch/305053820/
  • Standard MXF DV100 Recording formats & settings for MXF DV100 Transfer to HDD recording, QT Ingest & Re- editing, DV100 wrap broadcast, QT Edit archiving DV100 (i.e. no transcoding QT during workflow) DV100 Playout
  • Technical metadata extraction in house mediainfo.sourceforge.net/en
  • PBCore records created in-house pbcore.vermicel.li
  • Archiving can integrate seamlessly into file-based broadcast workflows if the right practices are introduced early on
  • a few more LESSONS LEARNED
  • 3 DIGITAL PRESERVATION REQUIREMENTS: 1. Bit Preservation 2. Accessibility of Content 3. Organizational Commitment
  • 1 BIT PRESERVATION
  • ONE COPY IS NO COPY by NightRPStar via Flickr http://www.flickr.com/photos/ninjanoodles/153893226/
  • “ (rules define how many copies to make, and which locations to put these in, with a typical strategy being 3 copies in 3 geographically separate “ locations) - M. Addis, et al “Sustainable Archiving and Storage Management of Audiovisual Digital Assets” SMPTE Motion Imaging Journal, Nov/Dec 2009
  • Photo by quapan via Flickr http://www.flickr.com/photos/hinkelstone/2435823037/ Consider federated storage models for cost and sustainability reasons
  • 2 CONTENT ACCESSIBILITY
  • by Shira Golding via Flickr http://www.flickr.com/photos/boojee/3743753784/
  • Define minimum metadata creation and collection requirements & rules, throughout the production & broadcast workflow
  • by ScrapyGraphics via Flickr http://www.flickr.com/photos/scrapygraphics/2515645664/ including file and folder naming conventions
  • Metadata should be standards-based
  • by DG Jones via Flickr http://www.flickr.com/photos/dgjones/1225183400/ a few words about file formats...
  • “ Businesses may use different encoding formats for different business processes, but should strive to avoid transcoding wherever possible, because it “ introduces a generation and thus reduces quality. - Peter Thomas “File Formats in Television Archiving and Content Exchange” SMPTE Motion Imaging Journal, Nov/Dec 2009
  • 3 ORGANIZATIONAL COMMITMENT
  • “ Benign neglect is the default stewardship, collection policy. Physical world, even more so “ in the digital world. - Cathy Marshall, Senior Researcher, Microsoft Research Keynote at Code4Lib, February 23, 2010. via Twitter @jschneider
  • TRUSTWORTHY REPOSITORIES AUDIT AND CERTIFICATION: CRITERIA AND CHECKLIST
  • coming soon... THE AMERICAN ARCHIVE of public broadcasting
  • THANK YOU www.thirteen.org/ptvdigitalarchive kvm211@nyu.edu http://www.slideshare.net/kvanmalssen