A crowd of specimens: Digitising Collections at the Natural History Museum, London

A crowd of specimens
Digitising Collections at the Natural History Museum,
London
Helen Hardy, Digital Collections Programme Manager
Dr John Tweddle, Head of the Angela Marmont Centre for UK Biodiversity

80 million objects
• 30m insects
• 7m fossils
• 5,000 meteorites
• 250 yrs of collecting
• Almost 5b years
represented

‘It shall be the duty of the Trustees to secure, so far as appears to them to be
practicable, that the objects comprised in the collections of the Museum… are, when
required for inspection by members of the public, made available …’
British Museum Act 1963 section 3(3)

Mass digitisation?
Image (specimen, labels, register?)
Transcription (labels, register?)
Geo-referencing
Diagnostic
Images
Number of specimens
Computabledata

15,000+ transcriptions
for 5,000+ specimens
with 1,000+ volunteers

Automated
solutions (OCR
etc)
Crowd Paid and/or expert
A possible scenario...?
IconsfromTheNounProject

Connection with collections &
science

• NHM, U. Oxford (Zooniverse), UC
Davis, Botanical Society of Britain
and Ireland, funded by AHRC
• Explores impact of climate change
• By building an extended record of
flowering phenology for UK orchids
using two citizen science datasets:
museum specimens + field obsns.
• Experimental approach: combines
the two main natural history citizen
science approaches: field+online
• An experiment!
Fly Orchid (Roger Powley)

Outdoors: find and photograph 29 species of UK orchid (Spring-
Autumn 2015) and then upload the photos to the project’s webpages
Early-purple Orchid
(Mike Waller)

Online: identify species in photos + assign flowering stage

Transcribe & extract phenology data from historical specimens
Analytical approach: combine the new field observations and historical specimen
data with other observations from the field naturalist community, co-publish

Early findings on
Orchid Observers:
Did we meet our science
and engagement goals?
Pyramidal Orchid
(Fred Rumsey)
Based on three data sources:
• Scientific data (n=>20k)
• Patterns of participation (n=>55k)
• Social science questionnaire (n=126)

Dataset Core date
range
3,700 historical
specimens
1780-1980
Field obs. (BSBI) 1970-2014
1,700 new field obs. 2015
• Different data biases, but that’s OK
• Time-series spans >200 years
• Passion, expertise and legacy of
volunteer naturalists and other
‘citizen scientists’ over >3 Centuries!
The datasets generated through
the project are complementary
Green-winged Orchid
(field contributor)

Accuracy of online
consensus (n=1,462)
No.
species
100% 13
90-99% 12
79-85% 4
• Self-assigned confidence = reliable guide
• Online consensus of 5 people sufficient
Data quality is very high
Confidence of field
identifier (n=1,604)
Accuracy
Certain 99.2%
Likely 89.7%
Uncertain 66.7%
Musk Orchid (Fred Rumsey)

• We can look at patterns of flowering over time, the effects of different
climate components & predict response per unit change in e.g. Spring temp.
• More projects of this type as collections data become available open access
Green-winged Orchid
Because of these two factors, the science looks promising!
Advances four
days per degree
Celsius increase in
Feb-April
temperature
Peak flowering date
from 1800-2015

Bee Orchid
(collected in 1918)
Who took part and why?
• Field naturalists, Zooniverse, and
first-time citizen scientists (17%)
• Necessitated 3 promotion routes
• 292 people (field), 1,745 (online)
Top 3 reasons for taking part
• Interest in botany (66%)
• To contribute to science (63%)
• Enjoyment of Zooniverse (59%)
• Outdoor natural history (32%)

Participant group % group that contributed to
online activities
Field photos Specimens Both
Online (n=1,745) 35% 52% 14%
Field (n=292) 13% 4% 7%
How did people take part? Two key observations:
1. Participants generally focused on one activity
• Most kept doing what they already enjoyed doing (existing interests or
areas of confidence?), with limited - but important - cross-over.

How did people take part? Two key observations:
2. For each task we see a range of participation
• This includes a small number of ‘super-contributors’ and numerous
one-off participants
Task Contribution to data made by:
Top 5% 1-off
Online – field images (n=1,143) 74% (57) 0.9% (183)
Field activities (n=292) 53% (15) 8.0% (137)

Next step: participants
perspectives on the project
(confidence, skills, connection)
Autumn Lady’s-tresses
(field contributor)

With thanks to everyone that contributed their time, expertise
and photos: participants, BSBI, NHM and Zooniverse teams
Bee Orchid
(Kath Castillo)

Five conclusions
1. Experimental, but worked: appealed to
existing citizen scientists, attracted new
contributors, produced sound data
2. Project design is more complex: trade-
off between complexity, accessibility
and online environment
3. Many benefits to working with both
communities, but limited cross-over
4. Two areas to improve: extent to which
participants felt they were valued and
how connected they felt with team
5. …transcription still remains niche!
Bee Orchid
(collected in 1918)

A crowd of specimens: Digitising Collections at the Natural History Museum, London

Recommended

Recommended

More Related Content

Similar to A crowd of specimens: Digitising Collections at the Natural History Museum, London

Similar to A crowd of specimens: Digitising Collections at the Natural History Museum, London (20)

Recently uploaded

Recently uploaded (20)

A crowd of specimens: Digitising Collections at the Natural History Museum, London

Editor's Notes