Just Digitise It by Daniel Wilksch of the Public Records Office Victoria. Presented at the 2018 Community Heritage Grants (CHG) Preservation and Collection Management Training Workshops
Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
1. 31 October 2017
Just Digitise It
Community Heritage Grants Program digitisation workshop
2. Session outline
1.35 Arrival and welcome
1.40 - 2.40 pm Planning a digitisation project
• Setting standards
• Resources needed
• Care of your originals
• Care of your copies
2.40 - 3.00 pm Digitisation facility tour
Wan Wong, NLA
3.00 - 3.20 pm Afternoon tea
3.30 - 4.00 pm Negotiating rights before you digitise
Michael Proud, NLA
4.00 – 4.30 pm Providing access
• Getting images online
• Metadata and sharing images
4.30 - 5.00 pm Q & A
3. Further notes and resources online
Museums and Galleries NSW fact sheets
https://mgnsw.org.au/sector/resources/online-resources/
Just Digitise It (video demo, but quite old now)
http://prov.vic.gov.au/community/managing-your-collection/just-
digitise-it
Digital Access to Collections
http://www.digitalcollections.org.au/
Museums Australia (Victoria) training videos
https://mavic.asn.au/training-videos
Australian Libraries Copyright Council
http://libcopyright.org.au/content/resources
5. Matching standards to the project
Setting standards
Two main impetus…es for digitisation
• Preservation
– OHIO (only handle it once)
– colour management, ‘master’ copies
• Access
– search/ discoverability
– crowdsourcing
6. Factors
Setting standards
• How much material to copy?
• What condition? (preservation needs assesment)
• How much time/ money do you have?
• Has somebody already digitised it? (books…)
• What is its significance? (significance statement, etc.)
7. How do we see?
Setting standards
http://en.wikipedia.org/wiki/Color_vision
8. Describing light with numbers
Smooth gradient
Broken into 16 steps (4-bit)
No intensity
(0 in 8-bit scale)
Step 10 of 16
(160 in 8-bit
scale)
Step 16
(255 in 8-bit
scale)
Hint: the smooth gradient is in 8-bit steps – each level of intensity is 2 px wide in
the original drawing.
9. Bit-depth
Red 255 ff 1111 1111
Green 255 ff 1111 1111
Blue 255 ff 1111 1111
24 BInary digiTs
11. Resolution
Setting standards
1 inch 1 inch
1 inch 1 inch
@300dpi
= 90,000 pixels
@72dpi
= 5,184 pixels
‘Screen’ resolution Standard ‘Print’ resolution
12. How many dpi is enough?
Original 75 x 53 mm (VPRS 8609/P30 unit 3, item 6/108)
600 dpi
22. The most colours vs. the right colours
-- ‘positive’ images: scan for colour fidelity
-- negatives: scan for maximum tonal range
(consider 16/48-bit)
It’s better to make a good scan than correct a
‘poor’ one. ‘Levels’ adjustment when scanning is
different to adjusting an existing digital image.
Fidelity
24. Still image standards
Setting standards
National Library of Australia
http://www.nla.gov.au/standards/image-capture
Public Record Office Victoria
http://prov.vic.gov.au/government/standards-and-policy/capture
26. Sound and moving pictures (advice)
Setting standards
National Film and Sound Archives
http://www.nfsa.gov.au/preservation/care/caring-for-film/
http://www.nfsa.gov.au/preservation/care/caring-for-audio/
http://www.nfsa.gov.au/preservation/services/
30. Physical resources
Resources needed
• Space
– managed, secure (fire, flood, pests, ancient wiring, not about to be reclaimed by Council for
boutique carparks, etc.)
– flat (shelving, tables)
• Supplies
– rehousing materials for copied originals
– acid free paper, plastic film, gloves, pencils, spirit level, measuring tape/ rulers, gaffer
tape, extension cables, USB sticks, random things that aren’t too grubby
32. Human resources
Resources needed
• Project manager
• Project committee
(for when the manager heads off to Noosa)
• Tame experts
• Volunteers
– what do you need from them?
– what do they get out of it?
33. Documentation
Resources needed
• Digitisation policy/ strategy/ plan
• Project statement/ plan
• Risk management framework
• Specific policies/ procedures
• Written agreements with donors and digitisers
• Passwords. Write them down.
36. Collection management
Care of your originals
• Are the items catalogued? (Does the catalogue make sense?)
• Are they securely stored?
• Do you know who owns what?
… things go missing.
37. Preservation management
Care of your originals
• put it in a box (controls light, humidity, physical safety)
• wrapped in plastic (anything except PVC)
• write on the enclosure, not the object
• only take it out when you have to
… things get old.
38. Relationship management
Care of your originals
• have some handling rules (gloves, induction)
• digitisation providers should be able to describe their
security and preservation measures
• don’t break the original to digitise it
… things get dropped.
39. Further further reading
Care of your originals
National Standards for Museums and Galleries
• http://www.collectionsaustralia.net/sector_info_item/107
Keeping Archives
41. Hardware failure
Care of your copies
• backups, offsite preferably
• understand the limits of the storage technology
• checksums – or just look at your images every so often
43. Software obsolescence
Care of your copies
• open formats
(image formats have been stable for decades)
• open applications
(separate the data from the program)
• plan for and budget migrations
44. Poor management and documentation
Care of your copies
• Bill is your IT guy. He has
just fallen under a bus.
• Try not to implement
systems you don’t
understand.
45. Hardware failure
Digital Preservation
• backups, offsite preferably
• understand the limits of the storage technology
• checksums – or just look at your images every so often
46. Summary
Metadata
• All of your data needs to be easily extractable from the software it’s in.
• Create a simple file structure and make sure people stick to it.
• Manage your backups properly (no lending to people, manage your risks, NO shortcuts).
49. Originals and renditions
Getting images online
Constraints on delivering raw images
• Connection speed and bandwidth
• Screen size and resolution
• Control over rights to the image
50. Options for publishing
Getting images online
• Don’t publish at all…
• Use existing commercial tools and services (Flickr, Facebook, eHive)
• Use existing community services (Victorian Collections)
• Your own site (Wordpress, Omeka)
51. What and why to put online
Getting images online
• Marketing your organisation (‘going viral’)
• Online archive (TROVE)
• Storytelling
• Online communities
54. Definition
Metadata
• Data about data (and data systems)
• Metadata shares and translates between
– collections and users
– collections and collections.
55. Possible stages in metadata usage
• in-house catalogue of collection
• standardisation of data captured
card catalogue --> small museums cataloguing standard
• computerisation of data
card catalogue --> InMagic database
• web publishing of data
InMagic database --> CIDOC RDF in JSON via RESTful interface (http://data.culturehack.org.uk/dataset/37251018-British-Museum-object-catalog)
57. Common metadata formats
• CSV (comma separated values)
• XML
• JSON
RDF (resource description framework) is a set of standards and formats and vocabularies that aims to encompass everything talked about.
58. Metadata
• Think about how your existing data can be:
• Categorised into different functions (descriptive, discovery, preservation, etc.)
• Standardised (eg. Dublin Core) enabling matches with other collections and websites.
59. Metadata
• How does what you are recording 'translate' across systems?
– Identity (title, ‘control symbol’)
– Classification (subject, function)
– History (dates, purposes)
– [Description]
– Relationships
60. Identity
Metadata
• Some items may not have titles. What is the thing that distinguishes one item from the next in a collection?
• Remember physical cues not same as digital. Perhaps the filename of your image is the title?
• ‘Control Symbol’: Catalogue / collection / record-keeping number.
61. Classification
Metadata
• Information to manage items and help narrow down searches.
• Library: ‘subject’ – what is it about?
• Archive: ‘function’ – what does it do?
• Internet: ‘tagging’ – where did I put it again?
• Subject/ topic list for images: http://www.picturethesaurus.gov.au/
62. History
Metadata
• Archive/ Museum: ‘provenance’ – where is it from? (which collection, which donor)
• Management history: what has happened to it? ie., what date was it scanned?
• Scanning is another layer to existing management history that might be recorded in your collection database.
63. Description
Metadata
• Extended stories about item (mum on a bike)
• Description of physical original – dimensions, special features
• Description of digital copy – dpi, file format
64. Relationships
• This object is part of this collection
• This book was owned by this person between these dates
The Resource Description Framework expresses all metadata in terms of relationships
subject -> predicate -> object
65. Modern web applications
• Every URL is a reference to a resource
(book, photo, object, etc.)
• Information about that resource is attached to that URL in human-readable (web page) or
machine-readable (Dublin Core, Resource Description Framework, etc.) form
66. Description
Metadata
• Extended stories about item (mum on a bike)
• Description of physical original – dimensions, special features
• Description of digital copy – dpi, file format
67. Description
Metadata
• Extended stories about item (mum on a bike)
• Description of physical original – dimensions, special features
• Description of digital copy – dpi, file format
68. Description
Metadata
• Extended stories about item (mum on a bike)
• Description of physical original – dimensions, special features
• Description of digital copy – dpi, file format
69.
70.
71.
72. Contributing to Trove
An example of how to share metadata.
•Harvesting tools built in to commonly used applications:
http://help.nla.gov.au/trove/becoming-partner/for-smaller-collections
•Metadata embedded on pages (Sitemap method):
http://help.nla.gov.au/trove/html-record