Databasing the Lyman Entomological Museum: challenges and opportunities
Amélie Grégoire Taillefer and Terry A. Wheeler
Dept. of Natural Resource Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
6 weeks sampling
Background and history
The Lyman Entomological Museum began as the private collection of Henry H. Lyman,
which was bequeathed to McGill University in 1914. The largest university insect
collection in Canada, it holds specimens in all orders, with worldwide terrestrial
coverage, collected from 1860 to the present. Since the mid-1990s much of the focus in
collection development has been on the Diptera, although ongoing research projects,
donations and exchanges continue to add material in all orders, particularly
Coleoptera. Of the Diptera specimens, 70% were collected in Canada.

Digitization (recording specimen collection labels in digital form) is a time-consuming
and laborious process. Retrospective digitization of large collections is a costly
undertaking, but the benefits in terms of data sharing and accessibility far outweigh the
costs. Canadensys (canadensys.net), the Canadian biodiversity open database,
compiles taxonomic, geographic, temporal, numerical, and historical information about
three megadiverse groups (plants, insects and fungi) housed in 18 institutions across
Canada, which collectively hold several million specimens. About 1.3 million
specimen records are currently available on Canadensys; the Lyman Entomological
Museum contributes 20% of that total.

Steps in digitization
1. Preparation
• Identify specimens to the lowest taxonomic level possible
• Verify the status of the taxonomic name
• Add a unique identifier to each specimen

Databases create opportunities
A digitized collection is a rich source of primary biodiversity data for a range of
applications in taxonomy, inventories, catalogs, and ecology. Data can be searched via
maps (as in the map of georeferenced records) or in list format. Shared, open,
accessible data create opportunities for building large datasets for analysis of
large-scale patterns. Extracting data from Canadensys for a particular taxon, locality
or set of samples is quick and easy.
Collection databases have traditionally been used for curation, loan management or
taxonomic research. Digitization facilitates all these functions. However, because of the
extensive spatial, temporal and ecological data associated with specimen records,
these databases are also valuable resources for ecological and conservation research.
The dataset can easily be managed for loans and systematic research, and queried for
taxonomic coverage within an area for systematic, ecological or conservation
purposes. The databases provide baseline data, as well as evidence of change over
time, for regions or biotas that may have experienced habitat change.

Challenges
1. Implementing an efficient, standard data entry procedure
2. Old labels with minimal information
3. Georeferencing old specimen localities
4. Errors in coordinates or localities on labels
5. Misidentified specimens
6. Data cleaning, validation and correction
7. Training volunteers and staff to search the data and enter new records

Acknowledgments
Major funding for Canadensys was provided by the Canada Foundation for Innovation.
Canadensys coordinates ongoing open access to our database. We thank David
Shorthouse and Carole Sinou for all their help and advice on data cleaning and
formatting for publication on Canadensys.

Future work
No database is ever complete. Data checking and verification are an ongoing process
as taxonomic experts verify identifications or provide finer taxonomic resolution. New
specimens added to the collection require ongoing commitment by collection staff,
students or volunteers in data entry and publication. For example, more than 150,000
Arctic Diptera and new accessions from other regions currently await digitization in the
Lyman Museum.
Specimen identifier label: Lyman Museum LEM-0013538
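
Identifiers on this poster appear in two forms: LEM-0013538 on the label and LEM0249541 in the Canadensys record. The following is a minimal Python sketch of how such identifiers could be normalized before checking for duplicates. The canonical form chosen here (LEM, a hyphen, seven zero-padded digits) and the function names are assumptions for illustration, not the museum's documented convention.

```python
import re
from typing import Iterable, List

# Assumed pattern: "LEM", an optional hyphen or space, then up to 7 digits.
# The poster shows both "LEM-0013538" and "LEM0249541", so the separator
# and the padding width used below are guesses made for this sketch.
_ID_PATTERN = re.compile(r"^\s*LEM[-\s]?(\d{1,7})\s*$", re.IGNORECASE)


def normalize_catalog_number(raw: str) -> str:
    """Return the identifier in the assumed canonical form LEM-NNNNNNN."""
    match = _ID_PATTERN.match(raw)
    if match is None:
        raise ValueError(f"not a recognizable LEM identifier: {raw!r}")
    return f"LEM-{int(match.group(1)):07d}"


def find_duplicates(identifiers: Iterable[str]) -> List[str]:
    """List identifiers that occur more than once after normalization."""
    seen, duplicates = set(), set()
    for raw in identifiers:
        normalized = normalize_catalog_number(raw)
        if normalized in seen:
            duplicates.add(normalized)
        seen.add(normalized)
    return sorted(duplicates)
```

Normalizing before comparison means that two spellings of the same number are caught as one specimen rather than entered twice.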
Progress to date
Order                                   Geographic scope   Specimens databased
Diptera                                 Worldwide          240,000+
Neuroptera                              Canada             2,700+
Coleoptera (Buprestidae, Dermestidae)   Canada             2,600+
Hymenoptera (Vespidae, Eumenidae)       Canada             2,900+
Araneae                                 Canada             4,500+
Source: Lyman Entomological Museum georeferenced records (253,061), Canadensys, Google Earth (accessed on 2013-10-11).
LEM0249541, from McGill University, http://dataset.canadensys.net/lemq-specimens (accessed on 2013-10-11)
Biota 2-The Biodiversity Database Manager, R.K. Colwell, University of Connecticut,
http://viceroy.eeb.uconn.edu/Biota/biota, specimen and collection record tables.
About 10% (253,000 specimens) of the
Lyman collection has been databased with
Canadensys support. Our database is freely
hosted by Canadensys and shared
internationally via the Global Biodiversity
Information Facility (www.gbif.org).
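
As a rough consistency check on the figures quoted on this poster, the arithmetic can be sketched in Python. All inputs are the rounded values given in the text and in the progress table; the variable names are illustrative only.

```python
# Lower bounds from the "Progress to date" table (each row is given as "N+").
databased_by_group = {
    "Diptera": 240_000,
    "Neuroptera": 2_700,
    "Coleoptera": 2_600,
    "Hymenoptera": 2_900,
    "Araneae": 4_500,
}

databased_specimens = 253_000    # "About 10% (253,000 specimens)"
databased_percent = 10           # rounded percentage quoted in the text
canadensys_records = 1_300_000   # "About 1.3 million specimen records"

# Sum of the table rows, to compare with the quoted total.
table_total = sum(databased_by_group.values())

# Fraction of all Canadensys records contributed by the museum.
share_of_canadensys = databased_specimens / canadensys_records

# Collection size implied by "about 10%" (integer arithmetic, rounded inputs).
implied_collection_size = databased_specimens * 100 // databased_percent

print(f"table total:             {table_total:,}")
print(f"share of Canadensys:     {share_of_canadensys:.1%}")
print(f"implied collection size: {implied_collection_size:,}")
```

The table rows sum to 252,700, consistent with the rounded total of 253,000, and 253,000 of about 1.3 million records is roughly 19.5%, in line with the stated 20%. Because the 10% figure is itself approximate, the implied collection size of about 2.5 million specimens is only an order-of-magnitude estimate.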
2. Databasing
• BIOTA 2 program used at Lyman
• Data entry requires frequent data verification
• Georeference records
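
The verification and georeferencing steps lend themselves to simple automated checks. The sketch below, in Python, flags records with missing, out-of-range or implausible coordinates. The `Record` fields, the rough bounding box for Canada and the flag messages are all assumptions made for this example; they do not describe Biota or the museum's actual workflow.

```python
from dataclasses import dataclass
from typing import List, Optional

# Rough bounding box for Canada (an approximation chosen for this sketch).
CANADA_LAT = (41.0, 84.0)
CANADA_LON = (-142.0, -52.0)


@dataclass
class Record:
    catalog_number: str
    country: str
    latitude: Optional[float]    # decimal degrees, north positive
    longitude: Optional[float]   # decimal degrees, east positive


def _in_canada_box(lat: float, lon: float) -> bool:
    return (CANADA_LAT[0] <= lat <= CANADA_LAT[1]
            and CANADA_LON[0] <= lon <= CANADA_LON[1])


def coordinate_problems(rec: Record) -> List[str]:
    """Return flags for a record; an empty list means nothing was flagged."""
    if rec.latitude is None or rec.longitude is None:
        return ["missing coordinates: needs georeferencing"]
    lat, lon = rec.latitude, rec.longitude
    if not (-90 <= lat <= 90 and -180 <= lon <= 180):
        return ["coordinates outside valid range"]
    problems = []
    if lat == 0 and lon == 0:
        problems.append("0,0 placeholder coordinates")
    if rec.country == "Canada" and not _in_canada_box(lat, lon):
        # A positive longitude for a Canadian locality usually means the
        # minus sign for "west" was dropped during data entry.
        if _in_canada_box(lat, -lon):
            problems.append("longitude sign probably flipped")
        else:
            problems.append("coordinates fall outside Canada")
    return problems
```

For example, a hypothetical record near Ste-Anne-de-Bellevue entered as `Record("LEM-0000001", "Canada", 45.4, 73.9)` would be flagged for a probable sign error, while the same record with longitude -73.9 passes. Flags of this kind only prioritize records for a person to review; they do not replace checking the label.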
3. Data publication
• Export data as text file
• Add columns and formulas for accepted data format
• Convert database information into Darwin Core (an internationally accepted biodiversity information standard)
• Add collection metadata
• Serve data via Canadensys and GBIF
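
The publication steps above can be sketched as a small Python script that reads a tab-delimited text export and rewrites it with Darwin Core column names. The export column names on the left of `COLUMN_MAP` and the sample record are invented for this illustration and are not Biota's actual field names; the names on the right are standard Darwin Core terms.

```python
import csv
import io

# Hypothetical export column -> Darwin Core term.
COLUMN_MAP = {
    "SpecimenCode": "catalogNumber",
    "Species": "scientificName",
    "OrderName": "order",
    "FamilyName": "family",
    "Country": "country",
    "Province": "stateProvince",
    "LocalityName": "locality",
    "Lat": "decimalLatitude",
    "Long": "decimalLongitude",
    "DateCollected": "eventDate",
    "Collector": "recordedBy",
}

# Columns added with the same value for every specimen record.
CONSTANTS = {"basisOfRecord": "PreservedSpecimen"}


def to_darwin_core(export_text: str) -> str:
    """Convert a tab-delimited export into tab-delimited Darwin Core text."""
    reader = csv.DictReader(io.StringIO(export_text), delimiter="\t")
    out = io.StringIO()
    fieldnames = list(COLUMN_MAP.values()) + list(CONSTANTS)
    writer = csv.DictWriter(out, fieldnames=fieldnames,
                            delimiter="\t", lineterminator="\n")
    writer.writeheader()
    for row in reader:
        # Rename each column; missing values become empty strings.
        record = {dwc: (row.get(src) or "").strip()
                  for src, dwc in COLUMN_MAP.items()}
        record.update(CONSTANTS)
        writer.writerow(record)
    return out.getvalue()


# Invented sample record, for demonstration only.
SAMPLE_EXPORT = "\t".join(COLUMN_MAP) + "\n" + "\t".join([
    "LEM-0000001", "Musca domestica", "Diptera", "Muscidae", "Canada",
    "Quebec", "Ste-Anne-de-Bellevue", "45.4", "-73.9", "2010-07-15",
    "A. Collector",
]) + "\n"

if __name__ == "__main__":
    print(to_darwin_core(SAMPLE_EXPORT))
```

Keeping the mapping in a single dictionary makes it easy to review against the Darwin Core term list. Collection metadata, which the steps above list separately, is not covered by this sketch.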
