SlideShare a Scribd company logo
1 of 33
Digitised collections:
Toward a digital strategy for
for the NHM, London
Vince Smith
Workshop 3, pro-iBiosphere, Berlin
23 May 2013
Digital Ambition: NHM Science Strategy 2013-2017
A New Voyage of Discovery
Three Focal Areas
1. Scientific discovery
2. Scientific Infrastructure
3. Scientific engagement
Five Challenges
1. The Digital NHM
2. Origins, evolution & futures
3. Biodiversity discovery
4. Natural resources & hazards
5. Science, society & skills
Resources & funding
Measuring success
data.nhm.ac.uk/globe/
A New Voyage of Discovery
Three Focal Areas
1. Scientific discovery
2. Scientific Infrastructure
3. Scientific engagement
Five Challenges
1. The Digital NHM
2. Origins, evolution & futures
3. Biodiversity discovery
4. Natural resources & hazards
5. Science, society & skills
Resources & funding
Measuring success
Digital Ambition: NHM Science Strategy 2013-2017
Scientific impact 1,000 papers in leading journals
Digital access 20M specimens available digitally
Engagement 1M face-to-face engagements
Collections Globally important collections
Diagnostic tools Diagnostic tools for key groups
Deep time Timeline of key transitions
Science & society Articulate of the role of science
UK network Act as a national museum
Earth sciences Earth Sciences Centre
Funding £10M for Five Challenge Areas
Overview
1. Existing digital content, sources & formats
• Research data
• Collections data
2. Making collections data digital
• Priorities
• Protocols & pathfinder activities
• Crowdsourcing transcription
3. Aggregation & delivery
• The NHM data portal
• Data visualisation, data sub-portals
4. Identifiers, links & interoperability
• DataCite DOIs
• Third party aggregators
• Portal API’s, download & analytical functions
5. Timeline & constraints
• Data policies
• Next steps
Digitisation
activities
Data
portal
NHM Research Outputs
• 49 papers, 45 available online
(4 print only or behind pay walls)
• 9 had supplementary data files
• 39 papers with tables, charts & other data
o >1000 sequences
o 826 figures
o 76 tables
o 1 genome
• No collective view of these data (37 journals)
• No consistent way of citing NHM data
• No consistent mechanism to access data
• Effectively invisible at the institutional level
One Month of NHM Science group papers
Data via Carolyn Lowry e-mail, 13th Feb. 2013
1. Existing digital content
NHM Collections Outputs: data
• Huge investment in NHM collection management system
• ≠ Imaging
• Most research projects need spatio-temporal records
• Different requirements for different purposes
NHM COLLECTIONS April 2013
Collection area
Estimate no of
specimens
No. records in
database
% collection in
database
% records with
location info
Botany 6,000,000 626,000 ~ 10% 96%
Entomology 32,000,000 316,000 <1% 68%
Mineralogy 500,000 422,000 ~ 95% 79%
Palaeontology 9,000,000 342,000 ~ 3% 89%
Zoology 28,000,000 1,131,000 ~ 60% via lots) 69%
TOTAL 76,000,000 2,837,000 3% (23% )
1. Existing digital content
• Many, many imaging projects (highly fragmented)
• Circa 40 TB for major collections (excluding library)
• 120,000 images in KE EMu (many others not in KE!)
• Circa 250,000 via NHM Photo unit (limited metadata)
Collection area No. image files Disk space
Botany 140,133 35,302
Entomology 529,106 3,172
Mineralogy 14,000 6
Palaeontology 122,548 993
Zoology 12,975 1,598
TOTAL 818,762 41,070
NHM Collections Outputs: images
1. Existing digital content
Current data formats
• Darwin Core Archive (DwCA) & extensions (collections)
• Circa 2020 fields mapped to 50 fields to generate archive
• Images mainly JPG & TIFF
• Metadata using EML & Genesis II standard
• Research data files in a wide array of formats (blob files)
Nexus (character data and Newick formatted
phylogenetic trees)
Non-NHM specimen lists (as Darwin Core
Archive files)
PhyloXML (an XML standard for representing
phylogenetic trees)
Output from the Imaging and Analysis Centre
(Micro CT datafile formats)
NeXML (an XML standard for representing
character data)
Collections of images from digitisation projects
(as a collection of links or a zipped archive)
Sequence trace files (.scf sequence
chromatogram format files)
Environmental sequence files
Taxon checklists (as Darwin Core Archive files) Collection level descriptions
1. Existing digital content
• Priorities linked to science strategic priorities
o Disease, sustainability, crop wild relatives, pests etc.
• Tiered approach, different needs for different collections
• Low hanging fruit (2D objects e.g. herb. sheets & slides)
2. Making collections data digital
Digitisation Priorities
• Priorities linked to science strategic priorities
o Disease, sustainability, crop wild relatives, pests etc.
• Tiered approach, different needs for different collections
• Low hanging fruit (2D objects e.g. herb. sheets & slides)
• Linked to strategic collaborations & financial opportunities
o e.g RBG Kew, RBG Edinburgh, Nat. Mum. Wales, Hunterian etc.
• Priorities dictate order – we plan to do it all (eventually)!
2. Making collections data digital
Digitisation Priorities
• Exercise to develop digitisation protocols across collection
o Slides, spirit, herbarium sheets, pinned, multispecimen/drawer
• Protocols mapped to high level collections descriptions
• Workflow software supporting rapid digitisation (to KE & DAMS)
2. Making collections data digital
Digitisation Protocols
• Exercise to develop digitisation protocols across collection
o Slides, spirit, herbarium sheets, pinned, multispecimen/drawer
• Protocols mapped to high level collections descriptions
• Workflow software supporting rapid digitisation (to KE & DAMS)
• Pathfinder activities for less well understood projects
o Entomological dry material (30 M specimens)
- iCollections (specimen-by-specimen) approach
- SatScan (drawer level multi-specimen) approach
2. Making collections data digital
Digitisation Protocols
• Specimen-by-specimen, traditional, dedicated 6 person team
• Digitising British Isles Lepidoptera collection
• ~500,000 specimens, 5,000 drawers
• Re-curation & specimen imaging
• Complete label information including georeferencing
• For use in Climate Change initiative
2. Making collections data digital
iCollections Initiative
• 4-6 people over 3 years, work broken into small tasks by teams
• Average imaging rate 163 specimen/day*person
• Averaging >3min per specimen (prep., imaging & databasing)
• >£1/specimen
• BUT: 6,800 person years for the entire collection
2. Making collections data digital
iCollections Initiative
• Drawer level digitisation, segmented down to specimens
• Very fast imaging, no specimen handling, just one view
• No label information, but some data extracted from drawer
• Specimens retrospectively cropped & annotated
2. Making collections data digital
SatScan Initiative
• Drawer level digitisation, segmented down to specimens
• Very fast imaging, no specimen handling, just one view
• No label information, but some data extracted from drawer
• Specimens retrospectively cropped & annotated
2. Making collections data digital
SatScan Initiative
• Dedicated specimen-level rapid annotation software
2. Making collections data digital
SatScan Initiative
Crowdsourcing & Transcription
• We have a massive transcription problem
• Experiments via Notes-from-Nature (a Zooniverse project)
• Transcribing the NHM ornithological accession registers
• Wikimedian in Residence (Wikisource transcription)
• 4 Month project, including specimen label transcription
2. Making collections data digital
data.nhm.ac.uk
• A focus for deposition and discovery of major NHM data sets
• Promote innovation though re-use of museum data
• Open Access, at a dedicated subdomain of the NHM website
• Started Jan. 2013 (3 years), consultation throughout 2012
NHM Data Portal
Functional
components
of the data
portal
3. Aggregation & Delivery
Search
Datasets
matching
criteria
Individual dataset
Results
Browse &
search
criteria
Advanced display
options
• Dataset registry, for dataset discovery, modeled on data.gov.uk
• Uses CKAN, an open-source data portal software platform
3. Aggregation & Delivery
NHM Data Portal: Registry
Metadata
about the
dataset
Name
Geographic
scope
Tags
“Social”
Authors
License
Download
Developer tools
Technical
Info.
(extracted
from data
file)
• Dataset metadata discovery
3. Aggregation & Delivery
NHM Data Portal: Registry
• Simple datasets upload workflow for non-collections data
1. Name the
dataset 2. Upload / link
the data file
3. Describe the
data file
4. Theme &
tag
5. Add additional
resources
6. Temporal
coverage
7. Geographic
coverage
8. Save & finish
3. Aggregation & Delivery
NHM Data Portal: Dataset upload
Zoomable
map
Applied
filters
Toggle map, table &
stats views
Search, download &
display options
No. records
No.
Georef.
records
• Dedicated interface to visualise & explore major datasets
• Focused on collections data, based on Canadensys.net, uses CartoDB
3. Aggregation & Delivery
NHM Data Portal: Data visualisation
Collections views
Statistical
summary
Specimen record views
Data field
mappings
Summary
preview
Full
record
Tables
Download
3. Aggregation & Delivery
NHM Data Portal: Data visualisation
• Using DataCite DOIs in the data portal
• datasets (2014) & specimens (2015)
• Unique, persistent and resolvable identifiers
• Easy to cite, alias existing specimen identifiers
• Conform to minimum DataCite requirements
• Landing page, min. metadata standard, fee, min. 10 yr. contract, DOI (pre)fixes
NHM Data Portal & DataCite
Breaks us out of the biodiversity data silo
4. Identifiers, links & interoperability
• Content within the NHM data portal will be highly accessible
o Collections harvestable (e.g. by GBIF as a DwCA)
o Download DwCAs on any search facet
o Wide set of API’s available of datasets (part of CKAN)
• Sub-portals (selected content, themed by topic)
o e.g Virtual Herbarium, NHM Science initiatives, geographic regions
• Analytical interface planned for 2015 (but not specified)
Data Aggregation, APIs & download
4. Identifiers, links & interoperability
• Data portal will be “open-by-default”
• Ambiguity in what this means & top down schizophrenia
• Conflicting mandates on open access & revenue opportunities
• Lots of guidance available, will use to form a common policy
• A cross institutional policy would be useful (but challenging)
Data Policies & Next Steps
5. Timeline & constraints
Jan 2013 Jan 2014 Jan 2015 Jan 2016
Requirements
& dataset discovery
Private alpha Stable public
beta
Full release &
sub-portals
Internal feedback, data
visualisation & DOIs
Subportals &
analytical tools
Project start
NHM Data portal timeline
Next 6 months
• More documentation (PID and Tech Spec)
• Consultation and advocacy (internal and external)
• Data mapping from KE EMu and software testing
• Development
o website wireframe design
o drafting data visualisation subcontract
o Construction of private alpha release
5. Timeline & constraints
Data Policies & Next Steps
Jan 2013 2014 2018
Path-finding &
Programme
development
Private alpha Stable public
beta
20 Million!!Project start
NHM digitisation timeline
Next 6 months
• Initial conclusions from path-finding digitisation activities
• Initial grant funding bids developed
• Advocacy, outreach & development of a digitisation “programme”
• Investigate possibilities for gallery development
• Develop crowdsourcing strategy
2015 2016 2017
Major funding
applications &
a new gallery?
Digitisie… Digitisie… Digitisie…
5. Timeline & constraints
Data Policies & Next Steps
QUESTIONS
Digitisation Priorities
• Priorities linked to science strategic priorities
o Disease, sustainability, crop wild relatives, pests etc.
0
100
200
300
400
500
600
700
Crop Wild Relatives (accepted taxa only)
2. Making collections data digital
• Priorities linked to science strategic priorities
o Disease, sustainability, crop wild relatives, pests etc.
• Tiered approach, different needs for different collections
Nick Poole, UK Collections Trust
2. Making collections data digital
Digitisation Priorities

More Related Content

What's hot

Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Dag Endresen
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021Dag Endresen
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeVince Smith
 
GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016Dag Endresen
 
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...Kyle Copas
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementDag Endresen
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenHeinz Pampel
 
GBIF towards 2030 (November 2018)
GBIF towards 2030 (November 2018)GBIF towards 2030 (November 2018)
GBIF towards 2030 (November 2018)Dag Endresen
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyICZN
 
EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 M...
EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 M...EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 M...
EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 M...Dag Endresen
 
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...Dag Endresen
 
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014Dag Endresen
 
GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015Dag Endresen
 
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019Dag Endresen
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Stella Wisdom
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...Carole Goble
 
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...Dag Endresen
 
nternational Biodiversity Projects and Natural History Museums: Current stat...
nternational Biodiversity Projects and Natural History Museums:  Current stat...nternational Biodiversity Projects and Natural History Museums:  Current stat...
nternational Biodiversity Projects and Natural History Museums: Current stat...Klaus Riede
 

What's hot (20)

Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)
 
GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021GBIF and Biodiversity informatics for museums, 15 March 2021
GBIF and Biodiversity informatics for museums, 15 March 2021
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics Landscape
 
GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016GBIF BIFA mentoring, Day 4b Event core, July 2016
GBIF BIFA mentoring, Day 4b Event core, July 2016
 
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
Intro to GBIF: Infrastructures and Platforms for Environmental Crowd Sensing ...
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
FAIR and open biodiversity collection data management
FAIR and open biodiversity collection data managementFAIR and open biodiversity collection data management
FAIR and open biodiversity collection data management
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
 
GBIF towards 2030 (November 2018)
GBIF towards 2030 (November 2018)GBIF towards 2030 (November 2018)
GBIF towards 2030 (November 2018)
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
 
EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 M...
EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 M...EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 M...
EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 M...
 
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...
 
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
 
GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015
 
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
GBIF-Norway node story lightning talk at GB26 in Leiden, October 2019
 
GBIF Work Programme 2016 Update
GBIF Work Programme 2016 UpdateGBIF Work Programme 2016 Update
GBIF Work Programme 2016 Update
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
 
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
GBIF & GRScicoll, Høstseminar Norges museumsforbunds Seksjon for natur, 2021-...
 
nternational Biodiversity Projects and Natural History Museums: Current stat...
nternational Biodiversity Projects and Natural History Museums:  Current stat...nternational Biodiversity Projects and Natural History Museums:  Current stat...
nternational Biodiversity Projects and Natural History Museums: Current stat...
 

Viewers also liked

Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith
 
Assisted restructure of web content for paper-based presentation: a look at w...
Assisted restructure of web content for paper-based presentation: a look at w...Assisted restructure of web content for paper-based presentation: a look at w...
Assisted restructure of web content for paper-based presentation: a look at w...Vince Smith
 
Consolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review PresentationsConsolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review PresentationsVince Smith
 
Scratchpad 2014-introduction
Scratchpad 2014-introductionScratchpad 2014-introduction
Scratchpad 2014-introductionVince Smith
 
Don't make me think: biodiversity data publishing made easy
Don't make me think: biodiversity data publishing made easyDon't make me think: biodiversity data publishing made easy
Don't make me think: biodiversity data publishing made easyVince Smith
 
Next generation sequencing requires next generation publishing: the Biodivers...
Next generation sequencing requires next generation publishing: the Biodivers...Next generation sequencing requires next generation publishing: the Biodivers...
Next generation sequencing requires next generation publishing: the Biodivers...Vince Smith
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeVince Smith
 
SYNTHESYS 3 Overview
SYNTHESYS 3 OverviewSYNTHESYS 3 Overview
SYNTHESYS 3 OverviewVince Smith
 
Use it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructuresUse it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructuresVince Smith
 
FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...Vince Smith
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveVince Smith
 

Viewers also liked (11)

Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notext
 
Assisted restructure of web content for paper-based presentation: a look at w...
Assisted restructure of web content for paper-based presentation: a look at w...Assisted restructure of web content for paper-based presentation: a look at w...
Assisted restructure of web content for paper-based presentation: a look at w...
 
Consolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review PresentationsConsolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review Presentations
 
Scratchpad 2014-introduction
Scratchpad 2014-introductionScratchpad 2014-introduction
Scratchpad 2014-introduction
 
Don't make me think: biodiversity data publishing made easy
Don't make me think: biodiversity data publishing made easyDon't make me think: biodiversity data publishing made easy
Don't make me think: biodiversity data publishing made easy
 
Next generation sequencing requires next generation publishing: the Biodivers...
Next generation sequencing requires next generation publishing: the Biodivers...Next generation sequencing requires next generation publishing: the Biodivers...
Next generation sequencing requires next generation publishing: the Biodivers...
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 
SYNTHESYS 3 Overview
SYNTHESYS 3 OverviewSYNTHESYS 3 Overview
SYNTHESYS 3 Overview
 
Use it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructuresUse it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructures
 
FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspective
 

Similar to Digitised collections: Toward a digital strategy for for the NHM, London

Biodiversity Informatics at the Natural History Museum
Biodiversity Informatics at the Natural History MuseumBiodiversity Informatics at the Natural History Museum
Biodiversity Informatics at the Natural History MuseumEdward Baker
 
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
10-31-13 “Researcher Perspectives of Data Curation” Presentation SlidesDuraSpace
 
Delivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information ageDelivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information ageVince Smith
 
Optique presentation
Optique presentationOptique presentation
Optique presentationDBOnto
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopAaike De Wever
 
Steve Mc Eachern Australian Data Archive
Steve Mc Eachern Australian Data ArchiveSteve Mc Eachern Australian Data Archive
Steve Mc Eachern Australian Data ArchiveFuture Perfect 2012
 
Green Shoots: Research Data Management Pilot at Imperial College London
Green Shoots:Research Data Management Pilot at Imperial College LondonGreen Shoots:Research Data Management Pilot at Imperial College London
Green Shoots: Research Data Management Pilot at Imperial College LondonTorsten Reimer
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareRobin Rice
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptxvijayapraba1
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12ASIS&T
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
Digital Preservation
Digital PreservationDigital Preservation
Digital PreservationSmita Chandra
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservationsmtcd
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 

Similar to Digitised collections: Toward a digital strategy for for the NHM, London (20)

Biodiversity Informatics at the Natural History Museum
Biodiversity Informatics at the Natural History MuseumBiodiversity Informatics at the Natural History Museum
Biodiversity Informatics at the Natural History Museum
 
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
10-31-13 “Researcher Perspectives of Data Curation” Presentation Slides
 
Delivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information ageDelivering biodiversity knowledge in the information age
Delivering biodiversity knowledge in the information age
 
Optique presentation
Optique presentationOptique presentation
Optique presentation
 
Baker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated AudiencesBaker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated Audiences
 
Big Data
Big Data Big Data
Big Data
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshop
 
Steve Mc Eachern Australian Data Archive
Steve Mc Eachern Australian Data ArchiveSteve Mc Eachern Australian Data Archive
Steve Mc Eachern Australian Data Archive
 
Researh data management
Researh data managementResearh data management
Researh data management
 
Green Shoots: Research Data Management Pilot at Imperial College London
Green Shoots:Research Data Management Pilot at Imperial College LondonGreen Shoots:Research Data Management Pilot at Imperial College London
Green Shoots: Research Data Management Pilot at Imperial College London
 
Rdm slides march 2014
Rdm slides march 2014Rdm slides march 2014
Rdm slides march 2014
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Elab 16 5-13-re3data-scholze-final
Elab 16 5-13-re3data-scholze-finalElab 16 5-13-re3data-scholze-final
Elab 16 5-13-re3data-scholze-final
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 

More from Vince Smith

DiSSCo institutional benefits
DiSSCo institutional benefitsDiSSCo institutional benefits
DiSSCo institutional benefitsVince Smith
 
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...Bibliography of Life: Comprehensive services for biodiversity bibliographic r...
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...Vince Smith
 
Scratchpads: the Virtual Research Environment for biodiversity data
Scratchpads: the Virtual Research Environment for biodiversity dataScratchpads: the Virtual Research Environment for biodiversity data
Scratchpads: the Virtual Research Environment for biodiversity dataVince Smith
 
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...Vince Smith
 
Don’t make me think: biodiversity data publishing made easy
Don’t make me think: biodiversity data publishing made easyDon’t make me think: biodiversity data publishing made easy
Don’t make me think: biodiversity data publishing made easyVince Smith
 
Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Vince Smith
 
2013 02 data portal science group update -v smith
2013 02 data portal science group update -v smith2013 02 data portal science group update -v smith
2013 02 data portal science group update -v smithVince Smith
 
Sharing, linking and publishing biodiversity data the ViBRANT way
Sharing, linking and publishing biodiversity data the ViBRANT waySharing, linking and publishing biodiversity data the ViBRANT way
Sharing, linking and publishing biodiversity data the ViBRANT wayVince Smith
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Vince Smith
 
Scratchpad 2, Virtual Research Environment: Project Update
 Scratchpad 2, Virtual Research Environment: Project Update Scratchpad 2, Virtual Research Environment: Project Update
Scratchpad 2, Virtual Research Environment: Project UpdateVince Smith
 
A timescale for the evolution of lice
A timescale for the evolution of liceA timescale for the evolution of lice
A timescale for the evolution of liceVince Smith
 
ViBRANT: linking communities and services
ViBRANT: linking communities and servicesViBRANT: linking communities and services
ViBRANT: linking communities and servicesVince Smith
 
Thoughts on addressing data citation challenges: experiences of Vibrant project
Thoughts on addressing data citation challenges: experiences of Vibrant projectThoughts on addressing data citation challenges: experiences of Vibrant project
Thoughts on addressing data citation challenges: experiences of Vibrant projectVince Smith
 

More from Vince Smith (14)

DiSSCo institutional benefits
DiSSCo institutional benefitsDiSSCo institutional benefits
DiSSCo institutional benefits
 
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...Bibliography of Life: Comprehensive services for biodiversity bibliographic r...
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...
 
Scratchpads: the Virtual Research Environment for biodiversity data
Scratchpads: the Virtual Research Environment for biodiversity dataScratchpads: the Virtual Research Environment for biodiversity data
Scratchpads: the Virtual Research Environment for biodiversity data
 
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...
 
Don’t make me think: biodiversity data publishing made easy
Don’t make me think: biodiversity data publishing made easyDon’t make me think: biodiversity data publishing made easy
Don’t make me think: biodiversity data publishing made easy
 
Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...
 
2013 02 data portal science group update -v smith
2013 02 data portal science group update -v smith2013 02 data portal science group update -v smith
2013 02 data portal science group update -v smith
 
Sharing, linking and publishing biodiversity data the ViBRANT way
Sharing, linking and publishing biodiversity data the ViBRANT waySharing, linking and publishing biodiversity data the ViBRANT way
Sharing, linking and publishing biodiversity data the ViBRANT way
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
 
Scratchpad 2, Virtual Research Environment: Project Update
 Scratchpad 2, Virtual Research Environment: Project Update Scratchpad 2, Virtual Research Environment: Project Update
Scratchpad 2, Virtual Research Environment: Project Update
 
A timescale for the evolution of lice
A timescale for the evolution of liceA timescale for the evolution of lice
A timescale for the evolution of lice
 
ViBRANT: linking communities and services
ViBRANT: linking communities and servicesViBRANT: linking communities and services
ViBRANT: linking communities and services
 
Thoughts on addressing data citation challenges: experiences of Vibrant project
Thoughts on addressing data citation challenges: experiences of Vibrant projectThoughts on addressing data citation challenges: experiences of Vibrant project
Thoughts on addressing data citation challenges: experiences of Vibrant project
 

Recently uploaded

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 

Recently uploaded (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 

Digitised collections: Toward a digital strategy for for the NHM, London

  • 1. Digitised collections: Toward a digital strategy for for the NHM, London Vince Smith Workshop 3, pro-iBiosphere, Berlin 23 May 2013
  • 2. Digital Ambition: NHM Science Strategy 2013-2017 A New Voyage of Discovery Three Focal Areas 1. Scientific discovery 2. Scientific Infrastructure 3. Scientific engagement Five Challenges 1. The Digital NHM 2. Origins, evolution & futures 3. Biodiversity discovery 4. Natural resources & hazards 5. Science, society & skills Resources & funding Measuring success
  • 4. A New Voyage of Discovery Three Focal Areas 1. Scientific discovery 2. Scientific Infrastructure 3. Scientific engagement Five Challenges 1. The Digital NHM 2. Origins, evolution & futures 3. Biodiversity discovery 4. Natural resources & hazards 5. Science, society & skills Resources & funding Measuring success Digital Ambition: NHM Science Strategy 2013-2017 Scientific impact 1,000 papers in leading journals Digital access 20M specimens available digitally Engagement 1M face-to-face engagements Collections Globally important collections Diagnostic tools Diagnostic tools for key groups Deep time Timeline of key transitions Science & society Articulate of the role of science UK network Act as a national museum Earth sciences Earth Sciences Centre Funding £10M for Five Challenge Areas
  • 5. Overview 1. Existing digital content, sources & formats • Research data • Collections data 2. Making collections data digital • Priorities • Protocols & pathfinder activities • Crowdsourcing transcription 3. Aggregation & delivery • The NHM data portal • Data visualisation, data sub-portals 4. Identifiers, links & interoperability • DataCite DOIs • Third party aggregators • Portal API’s, download & analytical functions 5. Timeline & constraints • Data policies • Next steps Digitisation activities Data portal
  • 6. NHM Research Outputs • 49 papers, 45 available online (4 print only or behind pay walls) • 9 had supplementary data files • 39 papers with tables, charts & other data o >1000 sequences o 826 figures o 76 tables o 1 genome • No collective view of these data (37 journals) • No consistent way of citing NHM data • No consistent mechanism to access data • Effectively invisible at the institutional level One Month of NHM Science group papers Data via Carolyn Lowry e-mail, 13th Feb. 2013 1. Existing digital content
  • 7. NHM Collections Outputs: data • Huge investment in NHM collection management system • ≠ Imaging • Most research projects need spatio-temporal records • Different requirements for different purposes NHM COLLECTIONS April 2013 Collection area Estimate no of specimens No. records in database % collection in database % records with location info Botany 6,000,000 626,000 ~ 10% 96% Entomology 32,000,000 316,000 <1% 68% Mineralogy 500,000 422,000 ~ 95% 79% Palaeontology 9,000,000 342,000 ~ 3% 89% Zoology 28,000,000 1,131,000 ~ 60% via lots) 69% TOTAL 76,000,000 2,837,000 3% (23% ) 1. Existing digital content
  • 8. • Many, many imaging projects (highly fragmented) • Circa 40 TB for major collections (excluding library) • 120,000 images in KE EMu (many others not in KE!) • Circa 250,000 via NHM Photo unit (limited metadata) Collection area No. image files Disk space Botany 140,133 35,302 Entomology 529,106 3,172 Mineralogy 14,000 6 Palaeontology 122,548 993 Zoology 12,975 1,598 TOTAL 818,762 41,070 NHM Collections Outputs: images 1. Existing digital content
  • 9. Current data formats • Darwin Core Archive (DwCA) & extensions (collections) • Circa 2020 fields mapped to 50 fields to generate archive • Images mainly JPG & TIFF • Metadata using EML & Genesis II standard • Research data files in a wide array of formats (blob files) Nexus (character data and Newick formatted phylogenetic trees) Non-NHM specimen lists (as Darwin Core Archive files) PhyloXML (an XML standard for representing phylogenetic trees) Output from the Imaging and Analysis Centre (Micro CT datafile formats) NeXML (an XML standard for representing character data) Collections of images from digitisation projects (as a collection of links or a zipped archive) Sequence trace files (.scf sequence chromatogram format files) Environmental sequence files Taxon checklists (as Darwin Core Archive files) Collection level descriptions 1. Existing digital content
  • 10. • Priorities linked to science strategic priorities o Disease, sustainability, crop wild relatives, pests etc. • Tiered approach, different needs for different collections • Low hanging fruit (2D objects e.g. herb. sheets & slides) 2. Making collections data digital Digitisation Priorities
  • 11. • Priorities linked to science strategic priorities o Disease, sustainability, crop wild relatives, pests etc. • Tiered approach, different needs for different collections • Low hanging fruit (2D objects e.g. herb. sheets & slides) • Linked to strategic collaborations & financial opportunities o e.g RBG Kew, RBG Edinburgh, Nat. Mum. Wales, Hunterian etc. • Priorities dictate order – we plan to do it all (eventually)! 2. Making collections data digital Digitisation Priorities
  • 12. • Exercise to develop digitisation protocols across collection o Slides, spirit, herbarium sheets, pinned, multispecimen/drawer • Protocols mapped to high level collections descriptions • Workflow software supporting rapid digitisation (to KE & DAMS) 2. Making collections data digital Digitisation Protocols
  • 13. • Exercise to develop digitisation protocols across collection o Slides, spirit, herbarium sheets, pinned, multispecimen/drawer • Protocols mapped to high level collections descriptions • Workflow software supporting rapid digitisation (to KE & DAMS) • Pathfinder activities for less well understood projects o Entomological dry material (30 M specimens) - iCollections (specimen-by-specimen) approach - SatScan (drawer level multi-specimen) approach 2. Making collections data digital Digitisation Protocols
  • 14. • Specimen-by-specimen, traditional, dedicated 6 person team • Digitising British Isles Lepidoptera collection • ~500,000 specimens, 5,000 drawers • Re-curation & specimen imaging • Complete label information including georeferencing • For use in Climate Change initiative 2. Making collections data digital iCollections Initiative
  • 15. • 4-6 people over 3 years, work broken into small tasks by teams • Average imaging rate 163 specimen/day*person • Averaging >3min per specimen (prep., imaging & databasing) • >£1/specimen • BUT: 6,800 person years for the entire collection 2. Making collections data digital iCollections Initiative
  • 16. • Drawer level digitisation, segmented down to specimens • Very fast imaging, no specimen handling, just one view • No label information, but some data extracted from drawer • Specimens retrospectively cropped & annotated 2. Making collections data digital SatScan Initiative
  • 17. • Drawer level digitisation, segmented down to specimens • Very fast imaging, no specimen handling, just one view • No label information, but some data extracted from drawer • Specimens retrospectively cropped & annotated 2. Making collections data digital SatScan Initiative
  • 18. • Dedicated specimen-level rapid annotation software 2. Making collections data digital SatScan Initiative
  • 19. Crowdsourcing & Transcription • We have a massive transcription problem • Experiments via Notes-from-Nature (a Zooniverse project) • Transcribing the NHM ornithological accession registers • Wikimedian in Residence (Wikisource transcription) • 4 Month project, including specimen label transcription 2. Making collections data digital
  • 20. data.nhm.ac.uk • A focus for deposition and discovery of major NHM data sets • Promote innovation though re-use of museum data • Open Access, at a dedicated subdomain of the NHM website • Started Jan. 2013 (3 years), consultation throughout 2012 NHM Data Portal Functional components of the data portal 3. Aggregation & Delivery
  • 21. Search Datasets matching criteria Individual dataset Results Browse & search criteria Advanced display options • Dataset registry, for dataset discovery, modeled on data.gov.uk • Uses CKAN, an open-source data portal software platform 3. Aggregation & Delivery NHM Data Portal: Registry
  • 22. Metadata about the dataset Name Geographic scope Tags “Social” Authors License Download Developer tools Technical Info. (extracted from data file) • Dataset metadata discovery 3. Aggregation & Delivery NHM Data Portal: Registry
  • 23. • Simple datasets upload workflow for non-collections data 1. Name the dataset 2. Upload / link the data file 3. Describe the data file 4. Theme & tag 5. Add additional resources 6. Temporal coverage 7. Geographic coverage 8. Save & finish 3. Aggregation & Delivery NHM Data Portal: Dataset upload
  • 24. Zoomable map Applied filters Toggle map, table & stats views Search, download & display options No. records No. Georef. records • Dedicated interface to visualise & explore major datasets • Focused on collections data, based on Canadensys.net, uses CartoDB 3. Aggregation & Delivery NHM Data Portal: Data visualisation
  • 25. Collections views Statistical summary Specimen record views Data field mappings Summary preview Full record Tables Download 3. Aggregation & Delivery NHM Data Portal: Data visualisation
  • 26. • Using DataCite DOIs in the data portal • datasets (2014) & specimens (2015) • Unique, persistent and resolvable identifiers • Easy to cite, alias existing specimen identifiers • Conform to minimum DataCite requirements • Landing page, min. metadata standard, fee, min. 10 yr. contract, DOI (pre)fixes NHM Data Portal & DataCite Breaks us out of the biodiversity data silo 4. Identifiers, links & interoperability
  • 27. • Content within the NHM data portal will be highly accessible o Collections harvestable (e.g. by GBIF as a DwCA) o Download DwCAs on any search facet o Wide set of API’s available of datasets (part of CKAN) • Sub-portals (selected content, themed by topic) o e.g Virtual Herbarium, NHM Science initiatives, geographic regions • Analytical interface planned for 2015 (but not specified) Data Aggregation, APIs & download 4. Identifiers, links & interoperability
  • 28. • Data portal will be “open-by-default” • Ambiguity in what this means & top down schizophrenia • Conflicting mandates on open access & revenue opportunities • Lots of guidance available, will use to form a common policy • A cross institutional policy would be useful (but challenging) Data Policies & Next Steps 5. Timeline & constraints
  • 29. Jan 2013 Jan 2014 Jan 2015 Jan 2016 Requirements & dataset discovery Private alpha Stable public beta Full release & sub-portals Internal feedback, data visualisation & DOIs Subportals & analytical tools Project start NHM Data portal timeline Next 6 months • More documentation (PID and Tech Spec) • Consultation and advocacy (internal and external) • Data mapping from KE EMu and software testing • Development o website wireframe design o drafting data visualisation subcontract o Construction of private alpha release 5. Timeline & constraints Data Policies & Next Steps
  • 30. Jan 2013 2014 2018 Path-finding & Programme development Private alpha Stable public beta 20 Million!!Project start NHM digitisation timeline Next 6 months • Initial conclusions from path-finding digitisation activities • Initial grant funding bids developed • Advocacy, outreach & development of a digitisation “programme” • Investigate possibilities for gallery development • Develop crowdsourcing strategy 2015 2016 2017 Major funding applications & a new gallery? Digitisie… Digitisie… Digitisie… 5. Timeline & constraints Data Policies & Next Steps
  • 32. Digitisation Priorities • Priorities linked to science strategic priorities o Disease, sustainability, crop wild relatives, pests etc. 0 100 200 300 400 500 600 700 Crop Wild Relatives (accepted taxa only) 2. Making collections data digital
  • 33. • Priorities linked to science strategic priorities o Disease, sustainability, crop wild relatives, pests etc. • Tiered approach, different needs for different collections Nick Poole, UK Collections Trust 2. Making collections data digital Digitisation Priorities

Editor's Notes

  1. NHM has huge amount of digital ambition. As an institution we have a new science strategy taking us to to 2017, and “digital” as a concept runs through all most every aspect of that strategy. Just to underscore this, we put it on the front cover of the strategy.
  2. This visualisation shows the are 400k specimens across all the departments in the NHM that have good geo-locative data, and the length of the line corresponds to the collecting effort in that spot. Its not the most informative visualisation but the intention is that these globe will grow with more points over time as we digitise. In fact its going to have to grow a lot over the next 5 years.
  3. Our science strategy commits us to digitise 20M specimens over the next five year. This will involve an enormous ramping up of effort, given that at the moment we only have about 2.8M records.
  4. So my talk today is really about how we are going to ramp up to achieve this 20 M figure, and I have structured this presentation according to the points that we were asked me to speak on. So first off I’ll say a little about the digital content that we already have. How we are creating new digital content; how we are delivering that content; how we are going to link that content up; and finally what is the timeline for doing this work. And I’m going to focus on the digitisation activities and the data portal since these are the parts of this work that I am most closely associated with…
  5. So first off then what digital content do we (as an institution) already have. Well in the context of or research this is best represented by the papers we publish. On average the NHM produced about 50 papers a month and about 80% of these have a significant amount of digital data associated with them. However, this content is mostly invisible to the institution. Its only accessible through the papers.
  6. Showing all geotagged specimens on a map. You can click one of the specimen records to get an overview of the record. Then click through to see the full record.
  7. Shows all the information related to the record. You can also click through to see the data mapped to Darwin Core fields.
  8. DataCite
  9. Aggregation and access
  10. Open by default
  11. Data portal timeline
  12. Digitisation timeline
  13. Questions
  14. Example of how we set digitisation priorities
  15. The choice of what digitisation granularity we need is linked to the outcome for the data.