Systems and Processes: making order out of chaos

Presentation from Digital Curator Dave Thompson on systems and processes for digitisation at the Wellcome Library for our fourth Digitisation Open Day.

Published in: Technology

    1. Systems & processes; making order out of chaos.
       Digitisation Open Day, January 2014
       Dave Thompson, Digital Curator, Wellcome Library
    2. Digitisation – process overview
       [Process diagram. Recoverable labels: "Identify resources" (funding, staff, equipment, IT, storage, data management planning); "Plan project"; "Identify material"; "Plan process"; "Digitise process"; "Deliver"; "Refine & review processes"; "Document & share" (Digitisation Open Days).]
    3. Let's be clear. Sticking something under a camera or on a scanner is the last step in a longer process.
    4. Thinking conceptually … OAIS
       http://www.fachlabor-gubler.ch/eng/consulting/underlying-principles/the-oais-reference-model.html
    5. There are simpler models…
       http://www.library.cornell.edu/dlit/MathArc/web/StoryFrameset.html
    6. We have three basic systems…
       1. Workflow management system – ‘Goobi’ – production.
       2. Digital object repository – ‘Safety Deposit Box’ – storage.
       3. Front end – ‘the player’ – access.
       Remember, this doesn’t include cataloguing or bibliographic systems. Here we’re just talking about the process of creating, storing & delivering digital content. You have to assume that those other systems are also in place.
    7. The formats
       • JPEG2000 is our master image format.
       • Create dissemination images (JPEG) on the fly.
       • Also use PDF, MPEG2, MP3.
       We don’t have a system of ‘preferred formats’ for digitisation. We use a small number of ‘master’ formats for efficient data management, but we give consideration to the way in which we disseminate information. JPEG2000 is a flexible format that allows us to present digitised content in a variety of ways, whilst allowing for the automated creation of different sizes of JPEG.
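One reason JPEG2000 lends itself to creating JPEGs of different sizes is that a codestream holds a pyramid of resolution levels, each half the width and height of the one before, so a decoder can read a reduced image without decoding the full master. The sketch below only does the arithmetic of choosing a level. The function names, the `max_levels` default and the example dimensions are illustrative assumptions, not a description of the Wellcome Library's configuration.

```python
import math


def level_size(width, height, level):
    """Pixel size of an image decoded at a reduced resolution level.

    Each level halves both dimensions, rounding up (image offset assumed 0).
    """
    scale = 2 ** level
    return math.ceil(width / scale), math.ceil(height / scale)


def pick_level(width, height, target_width, max_levels=5):
    """Deepest reduction level whose width still covers target_width.

    max_levels is an assumed number of decomposition levels in the master.
    Returns 0 (full resolution) if even level 1 would be too small.
    """
    best = 0
    for level in range(max_levels + 1):
        reduced_width, _ = level_size(width, height, level)
        if reduced_width >= target_width:
            best = level
        else:
            break
    return best


if __name__ == "__main__":
    master = (4000, 6000)  # hypothetical master dimensions
    level = pick_level(*master, target_width=800)
    print(level, level_size(*master, level))
```

Decoding at the chosen level and then scaling down to the exact requested width keeps the work proportional to the size of the derivative, not the size of the master.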
    8. Goobi
       • Manages & tracks the production of content.
       • Workflow driven. Highly automated. Project based.
       • Allows us to set very granular access conditions.
       • Scalable & highly adaptable to different projects.
       Goobi is our workflow tracking & management system for the production of digital content. Automating as many of Goobi’s processes as possible allows our work to be both efficient & scalable. Goobi is also the system with which humans interact the most.
    9. How Goobi works – a workflow
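The workflow diagram from this slide is not preserved in the transcript. As a rough illustration of what "workflow driven" and "highly automated" can mean, the sketch below models a process as an ordered list of steps, some automatic and some waiting on a person. It is not Goobi's data model or API; the class names, methods and step names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Step:
    name: str
    automatic: bool  # True: run by a script; False: waits for a person
    done: bool = False


@dataclass
class Process:
    title: str
    steps: list = field(default_factory=list)

    def current(self):
        """First step that is not yet done, or None when finished."""
        return next((s for s in self.steps if not s.done), None)

    def run_automatic(self):
        """Complete automatic steps until a manual step or the end is reached."""
        completed = []
        while (step := self.current()) is not None and step.automatic:
            step.done = True  # a real system would invoke the script here
            completed.append(step.name)
        return completed

    def complete_manual(self, name):
        """Mark the open manual step as done; reject anything out of order."""
        step = self.current()
        if step is None or step.automatic or step.name != name:
            raise ValueError(f"{name!r} is not the open manual step")
        step.done = True
```

Tracking falls out of the same structure: `current()` tells you where any item is, and the split between automatic and manual steps shows where human time is being spent.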
    10. How Goobi works – METS editing
        [Screenshot annotations: pagination as per original; structure; descriptive metadata from imported MARC.]
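To make the three annotations concrete, here is a minimal sketch of where pagination, structure and descriptive metadata sit in a METS document, built with Python's standard library. The element and attribute names (`dmdSec`, `fileSec`, `structMap`, `div`, `ORDER`, `ORDERLABEL`, `fptr`) come from the METS schema; the IDs, the placeholder `title` element and the `build_mets` function are assumptions for illustration, and the output has not been validated against the schema or compared with what Goobi writes.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
ET.register_namespace("mets", METS_NS)


def q(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def build_mets(title, page_labels):
    mets = ET.Element(q("mets"))

    # Descriptive metadata: a placeholder standing in for the record
    # that would be imported from MARC.
    dmd = ET.SubElement(mets, q("dmdSec"), ID="DMD1")
    wrap = ET.SubElement(dmd, q("mdWrap"), MDTYPE="OTHER", OTHERMDTYPE="EXAMPLE")
    xml_data = ET.SubElement(wrap, q("xmlData"))
    ET.SubElement(xml_data, "title").text = title

    # One file entry per page image (hypothetical IDs).
    file_sec = ET.SubElement(mets, q("fileSec"))
    group = ET.SubElement(file_sec, q("fileGrp"), USE="MASTER")
    for order in range(1, len(page_labels) + 1):
        ET.SubElement(group, q("file"), ID=f"FILE_{order:04d}", MIMETYPE="image/jp2")

    # Structure and pagination: ORDER is the image sequence,
    # ORDERLABEL is the page number as printed on the original.
    smap = ET.SubElement(mets, q("structMap"), TYPE="PHYSICAL")
    book = ET.SubElement(smap, q("div"), TYPE="physSequence", DMDID="DMD1")
    for order, label in enumerate(page_labels, start=1):
        page = ET.SubElement(book, q("div"), TYPE="page",
                             ORDER=str(order), ORDERLABEL=label)
        ET.SubElement(page, q("fptr"), FILEID=f"FILE_{order:04d}")
    return mets


if __name__ == "__main__":
    doc = build_mets("Example title", ["i", "ii", "1", "2"])
    print(ET.tostring(doc, encoding="unicode"))
```

Keeping `ORDER` and `ORDERLABEL` separate is what allows "pagination as per original": the third image can carry the label "1" when the first two pages are numbered in roman numerals.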
    11. Safety Deposit Box
        • Workflow based; easily ‘talks’ to other systems.
        • Content agnostic.
        • Creates administrative metadata on ingest.
        • Preservation orientated.
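The slide does not say which administrative metadata SDB records on ingest. As an assumption about the sort of facts a preservation repository commonly captures, the sketch below records a file's name, size, checksum and the time of recording. The function name and the dictionary keys are hypothetical and are not SDB's schema.

```python
import hashlib
import os
from datetime import datetime, timezone


def administrative_metadata(path):
    """Collect basic fixity and size information for one file."""
    sha256 = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in 1 MiB chunks so large master images do not fill memory.
        for chunk in iter(lambda: fh.read(1024 * 1024), b""):
            sha256.update(chunk)
    return {
        "filename": os.path.basename(path),
        "size_bytes": os.path.getsize(path),
        "sha256": sha256.hexdigest(),
        "recorded_at": datetime.now(timezone.utc).isoformat(),
    }
```

Because the function never inspects the file's contents beyond its bytes, it works the same for an image, a PDF or an audio file, which is one practical meaning of "content agnostic".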
    12. How SDB works
    13. How SDB works – behind the scenes
        • No public access to SDB.
        • Little direct staff access to SDB content.
        • High levels of automation of ingest from Goobi.
        • Platform for dissemination mediated by the player.
        A centralised repository of & for digital content is a key part of both preservation of & access to your content. It’s a single place where we both store & manage our content.
    14. The player
        Wellcomelibrary.org/player
    15. How the player works
        • Makes an HTTP request to SDB for content based on the SDB PUID (the object’s unique & permanent ID).
        • Draws & implements access conditions from the METS file.
        • Permitted user actions are drawn from METS.
        • Draws DMD from the live catalogue.
        The player acts as a single point of access to our content: we have a unified delivery mechanism through which all content is delivered. The aim is to make access to all digital content as seamless & as easy as possible, with an interface that is easy for the user to understand & with which they can quickly become familiar.
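The bullets above describe a lookup followed by a permission check. The sketch below illustrates that shape only: the base URL, the `accessCondition` element name, the condition values and the action names are all hypothetical, since the slides do not give SDB's URL scheme or how access conditions are encoded in the METS file.

```python
import xml.etree.ElementTree as ET
from urllib.parse import quote

# Hypothetical endpoint; the real SDB URL scheme is not given in the slides.
SDB_BASE_URL = "https://sdb.example.org/content/"


def content_url(puid):
    """URL the player would request for an object, keyed by its PUID."""
    return SDB_BASE_URL + quote(puid, safe="")


def access_condition(mets_xml):
    """Text of the first accessCondition element (assumed name), or None."""
    root = ET.fromstring(mets_xml)
    for element in root.iter():
        local_name = element.tag.rsplit("}", 1)[-1]  # drop any namespace
        if local_name == "accessCondition":
            return (element.text or "").strip().lower()
    return None


def permitted_actions(mets_xml, logged_in=False):
    """Map an access condition to user actions; unknown conditions deny all."""
    condition = access_condition(mets_xml)
    if condition == "open":
        return {"view", "download"}
    if condition == "requires registration" and logged_in:
        return {"view"}
    return set()
```

Denying by default when the condition is missing or unrecognised means a malformed METS file results in less access, not more.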
    16. The systems overview
        • Goobi. Manages & tracks the production of digitised content.
        • SDB. Repository that stores digitised content along with its DMD & AMD.
        • Player. User interface to view digitised material.
    17. Lessons from Goobi
        • Design your workflows (human & digital) in advance. But be flexible.
        • Automate as much as possible; it saves time & is more efficient.
        • Document processes & procedures.
        • Share what you learn.
    18. Lessons from SDB
        • Plan your systems integration: which system talks to which, and how.
        • Plan workflows & processes.
        • Data management plan. Your eggs in one basket.
        • Plan what you’ll do when it all turns to custard.
    19. Lessons from the player
        • The point of digitisation is access, & managed access is part of preservation.
        • Automate access in terms of what a user can do with content.
        • Single point of access for all digital content.
        • Test the user interface & develop with the user in mind!
    20. So, to wrap up…
        • Digitisation is an end-to-end process that brings together objects & metadata.
        • You have to think about the whole system to deliver results. The process is one of combining metadata from different systems.
        • Document plans & document process.
        • Be prepared to be flexible & to change as necessary. But try to stick to the plan!
    21. Thank you
        Questions now, questions later…?
        Dave Thompson, Digital Curator, Wellcome Library
        d.thompson@wellcome.ac.uk - @d_n_t
        http://wellcomelibrary.org/