Managing large scale digitisation at the         Wellcome Library                    Christy Henshaw                   Pro...
The Wellcome Trust• A global charitable foundation• Achieving extraordinary improvements in human and animal health• Suppo...
The Wellcome Library The Wellcome Library• Major resource for the study of medical history• Collections of books, manuscri...
The Wellcome Library
The Wellcomethe story so far Digitisation – Library• Image library created from transparencies/prints, and on demand  phot...
The Library Transformation Strategy 2009 - 2014To provide global access to, and expert interpretation of, a world class co...
The Wellcome Digital Library pilot2010-2013Genetics and its Modern FoundationsA new online resource for everyone intereste...
Archival material – 1.1m imagesWellcome Library - 600,000 imagesExternal – 500,000 images
Books related to genetic research - 600,000 images
ProQuest, Early European Books – 5.5m images
Born digital material – initially small but growing
Digitisation strategy  Then                          Now  Small projects (<10,000 pp)   Large projects (>100,000 pp)  Rela...
Digitisation processes  Then                         Now  Manual processes             Automated processes  Centralised co...
Programme management - strategic• Specific strategy groups – cross-Library/ cross-Trust: Digital Library  IT Group, Engage...
Programme management – operational• Programme consists of 17 workpackages – “projects”• Project managers – one for each wo...
Digitisation strategy - selection• Thematic – relates to the theme of the pilot• Comprehensive - complete collections; cov...
Digitisation strategy - workflow• In-house digitisation – our own collections, digitised by Library staff  on-site (archiv...
Streamlining digitisation• Staff dedicated to specific projects, or streams of work• Carry out sample workflow tests for n...
Streamlining digitisation• Photographers do the photography…• Prepare materials separately• Leave loose pages and bindings...
Upscaling and streamlining digitsation  requires a higher level of project            management
Streamlining project management
What is it?• Web-based workflow system• Open source (core system)• Used by many libraries in Germany, and half a dozen oth...
What does it do?• Task-focused, customisable workflows developed by Intranda• User-specific “dashboard”• Import/export and...
User tasks
Digital asset management                   • Lightroom used to convert RAW to TIFF File conversion   • LuraWave converts T...
In-house (RAW)                   External (TIFF)                        External (JP2)           Temp                     ...
Thank you!    Christy Henshawc.henshaw@wellcome.ac.uk
Upcoming SlideShare
Loading in...5
×

Managing Large Scale Digitisation at the Wellcome Library

3,064

Published on

Published in: Education
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
3,064
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
5
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Managing Large Scale Digitisation at the Wellcome Library

  1. 1. Managing large scale digitisation at the Wellcome Library Christy Henshaw Programme Manager Wellcome Digital Library Sync or Sink: Opportunities for Libraries In the Digital Age Birkbeck College 24 Nov. 2011
  2. 2. The Wellcome Trust• A global charitable foundation• Achieving extraordinary improvements in human and animal health• Supporting the brightest minds in biomedical research and the medical humanities• Exploring medicine in historical and cultural contexts
  3. 3. The Wellcome Library The Wellcome Library• Major resource for the study of medical history• Collections of books, manuscripts, archives, films and pictures on the history of medicine from the earliest times to the present day.• Provide insight and information to anyone seeking to understand medicine and its role in society, past and present.• Provide access to a growing collection of contemporary biomedical information resources relating to consumer health, popular science, biomedical ethics and the public understanding of science.
  4. 4. The Wellcome Library
  5. 5. The Wellcomethe story so far Digitisation – Library• Image library created from transparencies/prints, and on demand photography – 300,000 images• Journal backfiles digitisation – (funder) Med. Hist., BMJ, etc. in PMC• Wellcome Film – 500+ titles (also Wellcome Film YouTube channel)• AIDS posters project – 3,000 posters• Arabic manuscripts – 500 manuscripts• 17th century recipe books – 74 manuscripts• Contributions to Europeana via the Europeana Libraries project, and World Digital Library
  6. 6. The Library Transformation Strategy 2009 - 2014To provide global access to, and expert interpretation of, a world class collection that explores medicine in its cultural contexts• Targeted collecting – putting challenges in context• Expert interpretation – engaging (new) audiences• Strategic digitisation – online access to our collections
  7. 7. The Wellcome Digital Library pilot2010-2013Genetics and its Modern FoundationsA new online resource for everyone interested in the history of human and animal health.Aims• build sustainable/expandable mechanism – foundation stone for WDL• digitise key library holdings - relating to a major Trust challenge area• digitise important third party content – linked to theme• use innovative content and tools – to encourage discovery and use• explore commercial partnerships – enhance access to non- theme material
  8. 8. Archival material – 1.1m imagesWellcome Library - 600,000 imagesExternal – 500,000 images
  9. 9. Books related to genetic research - 600,000 images
  10. 10. ProQuest, Early European Books – 5.5m images
  11. 11. Born digital material – initially small but growing
  12. 12. Digitisation strategy Then Now Small projects (<10,000 pp) Large projects (>100,000 pp) Relatively ad-hoc Major strategic programme SMT & Project teams Programme Board, advisors Library-centric W. Trust, external stakeholders Entirely open access Commercial partnerships Little impact on IT systems Requires major IT development Examples Everything (within reason)
  13. 13. Digitisation processes Then Now Manual processes Automated processes Centralised conservation Distributed conservation Low QA Increased QA, error minimization TIFF JPEG 2000 Individual tracking lists Centralised tracking system Incremental storage growth Completely new storage strategy Detailed, painstaking Streamlined, pragmatic
  14. 14. Programme management - strategic• Specific strategy groups – cross-Library/ cross-Trust: Digital Library IT Group, Engagement Strategy Group• WDL Project team – Library senior managers and Programme Manager, key decision makers, ensure cross-departmental communication in the Library, take papers to the Programme Board for approval• Advisory Committee – includes external HoM experts to advise on content selection• Programme Board – cross-Trust + external members: overall responsibility of the programme direction, approve budgets, staffing appointments, report to Trust Executive Board
  15. 15. Programme management – operational• Programme consists of 17 workpackages – “projects”• Project managers – one for each workpackage, most in Digital Services, some in Discovery and Engagement, chair project teams• Programme manager – manage specific projects, ensure communication between projects, manage programme budgets, project plan, contribute to overall strategy, etc.
  16. 16. Digitisation strategy - selection• Thematic – relates to the theme of the pilot• Comprehensive - complete collections; cover-to-cover; large-scale• Exemplar – demonstrate feasibility of the WDL to manage Library’s core materials, full-text searching, high-throughput digitisation, commercial partnerships• Ready – in a good condition to be digitised; catalogued• Approved – selection and prioritisation by the Advisory Committee
  17. 17. Digitisation strategy - workflow• In-house digitisation – our own collections, digitised by Library staff on-site (archives, photography on demand )• In-house commercial digitisation – our own collections, digitised by contracted staff on-site (ProQuest, maybe books, some on-demand photography)• External commercial digitisation – our own collections, digitised off-site by external suppliers (maybe books)• External partner digitisation – external collections, digitised by host institution, funded by the Wellcome Trust and destined for the WDL (archives from CSHL and UCL)
  18. 18. Streamlining digitisation• Staff dedicated to specific projects, or streams of work• Carry out sample workflow tests for new types of material• The right equipment for the right job – eliminate the “fiddly bits” • Live-view monitors • Easy-clean surfaces • Foot-pedals • Custom-made supports
  19. 19. Streamlining digitisation• Photographers do the photography…• Prepare materials separately• Leave loose pages and bindings as they are, they are easier to digitise that way!• Use existing staff as support – moving items to and from stack• Minimise movement• Keep plenty of shelving, working space at hand• Find a preferred supplier for ad hoc support
  20. 20. Upscaling and streamlining digitsation requires a higher level of project management
  21. 21. Streamlining project management
  22. 22. What is it?• Web-based workflow system• Open source (core system)• Used by many libraries in Germany, and half a dozen other European libraries• Intranda version developed by Intranda to meet Wellcome Library specific requirements
  23. 23. What does it do?• Task-focused, customisable workflows developed by Intranda• User-specific “dashboard”• Import/export and store metadata• Encode data as METS• Display progress of tasks, statistics on activities• Tracks projects, batches, and units (location, current activity)• “Command central” for 3rd party systems
  24. 24. User tasks
  25. 25. Digital asset management • Lightroom used to convert RAW to TIFF File conversion • LuraWave converts TIFF to JP2K • Validation of JP2K conversion coming soon – via Goobi •Automated ingest workflow in the DAM (Safety Deposit Box Ingest - SDB) – via Goobi •One file serves as master and dissemination file • DAM is a preservation system • Manages all preservation actions (characterisation, format Preservation migration) • API to allow 3rd party systems access to content • Master files backed up offsite to WORM storage drive Storage • WORM = Write Once Read Many – permanent storage • Self-healing of errors on main storage system from WORM
  26. 26. In-house (RAW) External (TIFF) External (JP2) Temp Temp Temp QA QA QALightroom - post-processing, convert to TIFF Hotfolder Hotfolder LuraWave automatically converts Hotfolder files to JP2 and outputs to a folder Goobi automatically triggers validation Person triggers ingest via Goobi WORM backup Pillar permanent Really permanent SDB ingests
  27. 27. Thank you! Christy Henshawc.henshaw@wellcome.ac.uk
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×