Grant presents a case study of the 19th Century Pamphlets digitisation project, covering the decisions made in planning the project, the challenges encountered, and key lessons learned.
Putting it into practice: a digitisation case study
1. JISC Digital Media Seminar 15 September 2009 Putting it into practice: a digitisation case study Grant Young Digitisation & Digital Preservation Specialist Cambridge University Library CAMBRIDGE UNIVERSITY LIBRARY
4. Content What did we create? JSTOR collection http://www.jstor.org/
5.
6.
7.
8.
9.
10. Libraries Collections Pamphlets (approx.) Durham Earls Grey Family collection 1,000 Liverpool Earls of Derby (Knowsley) Family collection 1,500 Newcastle Joseph Cowen (1829-1900) Personal collection 1,500 UCL Joseph Hume (1777-1855) Personal collection 5,000 Manchester Foreign Office & Colonial Office collections Government collections Local and anti-slavery collections 5,000 Bristol Selections from 19 th Century collection 5,000 LSE Selections from 19 th Century collection 7,000 26,000
11. Project How did we do it? Scoping study http://www.jisc.ac.uk/publications/documents/pub_digi_scopingstudy. aspx Project plan http://www.jisc.ac.uk/media/documents/programmes/digitisation/pampp. pdf Final report http://www.britishpamphlets.org.uk/docs/about/PamphletsFinalReport.pdf
12.
13.
14. Ensure discoverability Pamphlet Collection Google Scholar Search Copac Academic & National Library Catalogue Catalogues of libraries holding pamphlets JSTOR’s search interface 19 th Century Pamphlets Web Guide Pamphlet level (bibliographic) Full text search JSTOR Mimas Links from other JSTOR content Many other services, resources & collections CrossRef, OAI… Regular Google Search
15.
16.
17.
18. Technical standards Images: 600 bitonal for text; 300 grey for images OCR: 97-98% character accuracy Metadata: METS – structural MODS – bibliographic METS – technical* PREMIS – preservation* *selective use
23. Lessons What did we learn? Final report http://www.britishpamphlets.org.uk/docs/about/PamphletsFinalReport.pdf
24.
25.
26.
27.
28.
29. Must pay close attention to the workflow! Insufficient scanning rate detected New scanners Need for more pamphlets detected Additional pamphlets & month extension
30. Must pay close attention to the workflow! 20 seconds to write two-page grey image to file = significant operator delay Scanner is ready by time next page is set up
31.
32.
33.
34.
35.
36.
37.
38. Any questions or comments? With thanks to JISC, JSTOR and the university libraries of Cambridge and Southampton, particularly Christine Fowler of Southampton CAMBRIDGE UNIVERSITY LIBRARY