Adventures in Digital Asset Management: Fedora at the National ...

509 views

Published on

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
509
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Adventures in Digital Asset Management: Fedora at the National ...

  1. 1. Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales [email_address]
  2. 2. Contents <ul><li>The National Library of Wales </li></ul><ul><li>Why the NLW choose Fedora </li></ul><ul><li>The pilot </li></ul><ul><li>Theoretical look into preservation </li></ul><ul><li>Data Models </li></ul>
  3. 3. The National Library of Wales <ul><li>Nature of NLW </li></ul><ul><li>Collecting </li></ul><ul><ul><li>Variety of data types and formats </li></ul></ul><ul><li>Preserving </li></ul><ul><ul><li>Obsolescence </li></ul></ul><ul><ul><li>Lack of context information </li></ul></ul><ul><ul><li>Persistent identifiers </li></ul></ul><ul><ul><li>Integration </li></ul></ul><ul><li>Access </li></ul><ul><ul><li>Open collections </li></ul></ul>
  4. 4. Why we choose Fedora <ul><li>Comparison with D-Space </li></ul><ul><li>Fundamental issues </li></ul><ul><ul><li>Suitability for wide range of data types </li></ul></ul><ul><ul><li>Suitability for distribution of data types </li></ul></ul><ul><ul><li>Support for collection structures </li></ul></ul><ul><ul><li>Scalability </li></ul></ul><ul><ul><li>`Future-proof’ architecture </li></ul></ul>
  5. 5. The Pilot <ul><li>Understand the Fedora System </li></ul><ul><li>Experiment with different data types </li></ul><ul><li>Allow access to Digital Assets </li></ul><ul><li>Investigate workflows for moving digital material into the repository </li></ul>
  6. 6. Examples <ul><li>Ingested Digitised Images </li></ul><ul><li>E-Thesis </li></ul><ul><li>Ingested web pages </li></ul><ul><li>Born Digital Object </li></ul><ul><li>Basic authentication and rights management </li></ul>
  7. 7. E-Thesis <ul><li>Abstract </li></ul><ul><li>Word Document Thesis </li></ul><ul><ul><li>Original, PDF, Text, HTML and Tiff page images </li></ul></ul><ul><li>Video Composition </li></ul><ul><ul><li>Original, DivX and Web Viewer </li></ul></ul>
  8. 8. Web Pages <ul><li>Complex Digital Objects </li></ul><ul><ul><li>Arrive in a Compressed File </li></ul></ul><ul><li>Dissemination 1 Uncompress tar and serve </li></ul><ul><ul><li>Simple </li></ul></ul><ul><ul><li>Difficult to migrate formats </li></ul></ul><ul><li>Dissemination 2 Extracted and ingested into Fedora </li></ul><ul><ul><li>More complicated </li></ul></ul><ul><ul><li>Can do format migration without breaking links </li></ul></ul><ul><ul><li>HTML converted to XHTML </li></ul></ul><ul><ul><li>Meta data can be assigned to each page, image or movie. </li></ul></ul>
  9. 9. Digitised Images <ul><li>Problems: </li></ul><ul><ul><li>Obsolete Formats </li></ul></ul><ul><ul><li>Loss of context information </li></ul></ul><ul><ul><li>Persistent identifiers and URLs </li></ul></ul><ul><ul><li>Integration </li></ul></ul><ul><ul><li>Access </li></ul></ul>
  10. 10. Digital Images – Obsolete Formats <ul><li>What if we move from jpeg to jpeg2000 </li></ul><ul><ul><li>Website would have to be updated </li></ul></ul><ul><ul><li>All links would break: </li></ul></ul><ul><ul><ul><li>http://cairsweb.llgc.org.uk/images/mst/mst00001.jpg </li></ul></ul></ul><ul><ul><li>Special Viewer? </li></ul></ul><ul><li>Fedora’s Solution </li></ul><ul><ul><li>Find all images </li></ul></ul><ul><ul><li>Add a disseminator to convert jpg files to jpeg2000 </li></ul></ul><ul><ul><li>Links not file specific: </li></ul></ul><ul><ul><ul><li>http://teilo:9080/llgc/getImage/getMedium?PID=llgctest1:189 </li></ul></ul></ul><ul><ul><li>Record conversion in meta data </li></ul></ul><ul><ul><li>History automatically saved </li></ul></ul>
  11. 11. Digital Images – Context information <ul><li>Fedora’s Solution </li></ul><ul><ul><li>Mets Document in object as Data Stream </li></ul></ul><ul><ul><li>Version history so changes saved </li></ul></ul><ul><ul><li>Can store any type of Meta data: </li></ul></ul><ul><ul><ul><li>Mets Rights </li></ul></ul></ul><ul><ul><ul><li>PREMIS Preservation Meta data </li></ul></ul></ul><ul><ul><li>Could even store the intro page located on the Digital Mirror </li></ul></ul>
  12. 12. Digital Images – Persistent Identifiers <ul><li>Fedora’s Solution </li></ul><ul><ul><li>Data Type independent URLs </li></ul></ul><ul><ul><ul><li>http://teilo:9080/llgc/getImage/getMedium?PID=llgctest1:189 </li></ul></ul></ul><ul><ul><li>Fedora PID constant even through upgrades </li></ul></ul><ul><ul><li>Can add any Identifiers using Fedora relationships </li></ul></ul><ul><ul><li>URLs link to Servlets for redirection </li></ul></ul><ul><ul><ul><li>GetMedium Servlet1 </li></ul></ul></ul><ul><ul><ul><ul><li>Find pid llgctest1:189 </li></ul></ul></ul></ul><ul><ul><ul><ul><li>Get Mid sized image Data Stream </li></ul></ul></ul></ul><ul><ul><ul><ul><li>Convert to JPEG2000 </li></ul></ul></ul></ul><ul><ul><ul><ul><li>Return Image </li></ul></ul></ul></ul><ul><ul><ul><li>GetMedium Servlet2 </li></ul></ul></ul><ul><ul><ul><ul><li>Find pid llgctest1:189 </li></ul></ul></ul></ul><ul><ul><ul><ul><li>Resize large image Data Stream </li></ul></ul></ul></ul><ul><ul><ul><ul><li>Convert to JPEG2000 </li></ul></ul></ul></ul><ul><ul><ul><ul><li>Return Image </li></ul></ul></ul></ul>
  13. 13. Digital Images - Integration <ul><li>Existing Digital Content from the Digital Mirror </li></ul><ul><li>http://teilo:8080/cocoon/METS/MST00001/frames?div=1&subdiv=0&locale=en&mode=thumbnail </li></ul><ul><ul><li>Ingest existing Mets documents into Fedora </li></ul></ul><ul><ul><ul><li>No change to existing workflow </li></ul></ul></ul><ul><ul><li>Ingest images into Fedora </li></ul></ul><ul><ul><ul><li>Better preservation </li></ul></ul></ul><ul><ul><li>Allow original look and feel to website </li></ul></ul><ul><ul><ul><li>One line change to configuration file </li></ul></ul></ul><ul><ul><li>Enhanced Version (PDF of Book) </li></ul></ul>
  14. 14. Digital Images - Access <ul><li>3 Types </li></ul><ul><ul><li>Through Catalogue </li></ul></ul><ul><ul><ul><li>Difficult with Geac </li></ul></ul></ul><ul><ul><ul><li>New System OAI Harvesting? </li></ul></ul></ul><ul><ul><li>By Browsing </li></ul></ul><ul><ul><ul><li>Current Digital Mirror </li></ul></ul></ul><ul><ul><ul><li>Relationships </li></ul></ul></ul><ul><ul><ul><ul><li>View all digitised collection: </li></ul></ul></ul></ul><ul><ul><ul><ul><li>http://teilo:8080/cocoon/ViewCollection/llgctest1:DigitisedCollection </li></ul></ul></ul></ul><ul><ul><li>By searching repository </li></ul></ul><ul><ul><ul><li>Ambfish </li></ul></ul></ul><ul><ul><ul><li>Indexes Mets Documents </li></ul></ul></ul>
  15. 15. Data Models <ul><li>Object: Fish Book </li></ul><ul><li>PID: llgctest1:108 </li></ul><ul><li>DS1.0 Page 1 Large </li></ul><ul><li>DS2.0 Page 1 Mid </li></ul><ul><li>DS3.0 Page 2 Large </li></ul><ul><li>DS4.0 Page 2 Mid </li></ul><ul><li>DS5.0 Page 2 Large </li></ul><ul><li>DS6.0 Page 2 Mid </li></ul><ul><li>DS7.0 MIX Meta for Page 1 </li></ul><ul><li>DS8.0 MIX Meta for Page 2 </li></ul><ul><li>DS9.0 MIX Meta for Page 3 </li></ul><ul><li>DS10.0 METS Document </li></ul>
  16. 16. New Model Object: Page1 Fish Book PID: llgctest1:109 DS1.0 Image Large DS2.0 Image Mid DS3.0 MIX Meta about DS1.0 Object: Page2 Fish Book PID: llgctest1:110 DS1.0 Image Large DS2.0 Image Mid DS3.0 MIX Meta about DS1.0 Object: Fish Book PID: llgctest1:108 DS1.0 Mets Doc Is Part Of Is Part Of
  17. 17. Summary <ul><li>Fedora as DAMs </li></ul><ul><li>Fedora Community </li></ul><ul><li>Moving towards OAIS and Trusted Digital Repository status </li></ul>
  18. 18. Questions and Answers <ul><li>Glen Robson gmr@llgc.org.uk </li></ul>

×