Bridging Discrepancies across North
American Butterfly Naming Authorities:
Supporting Citizen Science Data
Integration
Dana L Campbell, Anne E Thessen, Leslie Ries
annethessen@gmail.com
http://www.slideshare.net/athessen
@diatomsRcool
The Data Detektiv
Acknowledgements
• DBI – 1052875
• DEB - 1541557
Lepidopteran Monitoring
© Cláudio Dias Timm CC BY-NC-SA via Flickr and EOL
© 1999 California Academy of Sciences CC BY-NC-SA via CalPhotos and EOL
© michael_hodge CC BY via iNaturalist and EOL
By Andy Potter CC BY-SA via Wikimedia Commons and EOL
Lepidopteran Monitoring
Photo by Bernard Dupont
Bioindicator
© Ted Kropiewnicki CC BY-NC-SA via Tree of Life Web Project and EOL
Pollinator
Prey
Lepidopteran Monitoring
By Gail Gilliland, blogs.plos.org
wildresearch.ca
burrenbeo.com
By Kelly Donovan
Lepidopteran Monitoring
• North American Butterfly Monitoring Network
• Standardize protocols (Pollard Walk)
• Develop data management systems (PollardBase)
Lepidopteran Monitoring
Lepidopteran Classifications
• ITIS
• NABA 2001
• Pelham 2014
• Opler & Warren 2005
Base List       Number of taxa  Number of species
Pelham          2827            820
Opler & Warren  1043            784
NABA            1090            731
ITIS            2810            866
Lists compared  Subfamily Deviation  Genus Deviation  Lumping/Splitting  Spelling  Species Name  Unmatched Taxon
ITIS/NABA       66                   28               63                 13        28            21
ITIS/OW         110                  15               11                 2         9             13
ITIS/Pelham     49                   16               7                  1         5             19
OW/NABA         69                   34               62                 11        22            17
OW/Pelham       159                  28               13                 3         14            27
NABA/Pelham     105                  35               63                 11        34            37
Project Lists
Project       Start Year  Base List  Number of Species  Base List Species Deviations  Area (sq mi)
Illinois      1986        NABA       143                4                             7,000
Michigan      2011        NABA       136                2                             97,000
Colorado      2013        OW         244                22                            104,000
Tennessee     2014        NABA       136                8                             42,000
Irvine Ranch  2012        OW         80                 8                             50,000 acres
Cascades      2011        Pelham     152                4
Iowa          2007        NABA       119                2                             56,000
Florida       2003        OW         192                8                             66,000
MPG Ranch                 Pelham     101                15
Ohio          1995        NABA       165                7                             45,000
Project Lists
Project       Species Complex  Spelling  Generic  Species name  Unmatched Taxon  Lumping/Splitting  Total
Illinois      2                2         1        0             1                0                  6
Michigan      1                1         0        0             0                0                  2
Colorado      0                2         12       1             7                0                  22
Tennessee     2                1         5        0             0                0                  8
Irvine Ranch  0                0         1        0             6                1                  8
Cascades      0                0         1        1             0                2                  4
Iowa          0                0         2        0             0                0                  2
Florida       0                0         4        0             3                1                  8
MPG Ranch     4                2         3        0             0                6                  15
Ohio          0                4         1        0             1                1                  7
Total         9                12        30       2             18               11                 80
Doing the Alignment
• Manual alignment of the four base lists
• Projects declared a base list and recorded discrepancies
• Used nomenclature and geography
Celastrina ladon complex
Red = C. ladon
Blue = C. lucia
Green = C. neglecta
Purple = C. idella
Grey = C. echo
Yellow = C. humulus
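
The alignment described on this slide was done by hand. Purely as an illustration of how the discrepancy categories in the tables might be told apart for a pair of binomials, here is a minimal Python sketch. The function name `classify_pair`, the 0.8 similarity threshold, and the rules themselves are assumptions made for this example, not the authors' definitions, and the "Genusone"/"Genustwo" names are made-up placeholders. Anything that is not a plain two-word name is handed back for manual review.

```python
from difflib import SequenceMatcher


def classify_pair(name_a, name_b, spelling_threshold=0.8):
    """Give a rough, assumed category for the difference between two names."""
    if name_a == name_b:
        return "match"
    parts_a, parts_b = name_a.split(), name_b.split()
    # Trinomials and other non-binomial strings are left to a human.
    if len(parts_a) != 2 or len(parts_b) != 2:
        return "needs review"
    (genus_a, epithet_a), (genus_b, epithet_b) = parts_a, parts_b
    # Character-level similarity of the two species epithets, between 0 and 1.
    similarity = SequenceMatcher(None, epithet_a, epithet_b).ratio()
    if genus_a == genus_b:
        # Same genus: a near-identical epithet is treated as a spelling variant.
        return "spelling" if similarity >= spelling_threshold else "species name"
    if epithet_a == epithet_b:
        # Same epithet placed in a different genus.
        return "generic"
    return "unmatched"


# Placeholder names, not real taxa.
for pair in [
    ("Genusone marginata", "Genusone marginatta"),
    ("Genusone marginata", "Genustwo marginata"),
    ("Speyeria hesperis", "Speyeria atlantis hesperis"),
]:
    print(pair, "->", classify_pair(*pair))
```

A heuristic like this can only triage candidates. Lumping/splitting and species-complex cases depend on taxonomic judgment and geography, which is why they still need a person.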
Data Access
• Manual alignments
• Darwin Core Archive
• https://github.com/diatomsRcool/butterfly
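
A Darwin Core Archive is a zip file holding a `meta.xml` descriptor plus one or more delimited text files. The sketch below shows one way to read the core table from such an archive using only the Python standard library. It is a minimal reader under stated assumptions: the function name `read_core_rows` is invented for this example, quoting via `fieldsEnclosedBy` is not handled, and nothing here reflects the actual layout of the files in the repository above.

```python
import csv
import io
import xml.etree.ElementTree as ET
import zipfile

# XML namespace used by Darwin Core Archive meta.xml descriptors.
NS = {"dwc": "http://rs.tdwg.org/dwc/text/"}


def read_core_rows(archive):
    """Return the core table of a Darwin Core Archive as a list of dicts.

    `archive` is a path or a binary file-like object. Columns are keyed by
    the short term name (e.g. "scientificName"); the id column is keyed "id".
    """
    with zipfile.ZipFile(archive) as zf:
        core = ET.fromstring(zf.read("meta.xml")).find("dwc:core", NS)
        location = core.find("dwc:files/dwc:location", NS).text.strip()
        # meta.xml writes the delimiter as an escape sequence such as "\t".
        delimiter = core.get("fieldsTerminatedBy", ",").encode().decode("unicode_escape")
        header_lines = int(core.get("ignoreHeaderLines", "0"))
        columns = {}
        id_element = core.find("dwc:id", NS)
        if id_element is not None:
            columns[int(id_element.get("index"))] = "id"
        for field in core.findall("dwc:field", NS):
            if field.get("index") is not None:
                columns[int(field.get("index"))] = field.get("term").rsplit("/", 1)[-1]
        text = zf.read(location).decode(core.get("encoding", "UTF-8"))

    rows = []
    for line_number, values in enumerate(csv.reader(io.StringIO(text), delimiter=delimiter)):
        if line_number < header_lines or not values:
            continue
        rows.append({name: values[i] for i, name in columns.items() if i < len(values)})
    return rows
```

With a local copy of an archive, `read_core_rows("butterfly-dwca.zip")` would return one dictionary per taxon row; the file name is hypothetical.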
Data Integration
• MPG Ranch Speyeria hesperis
• Equivalent in Ohio?
MPG Ranch: Speyeria hesperis
Pelham: Speyeria hesperis
NABA: Speyeria atlantis hesperis
Ohio: Speyeria atlantis
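
The chain on this slide can be read as a series of crosswalk links from one list to the next. As a minimal sketch of that idea in Python, the example below stores each link as a `(list, name)` pair and follows the links until the target list is reached. The `CROSSWALK` dictionary and the `translate` function are hypothetical names for this illustration, and treating the NABA subspecies as rolling up to the Ohio species name is our reading of the slide rather than a stated rule.

```python
# Each key is (list, name as used in that list); each value is the next link.
# The four entries come from the slide; the linking structure is an assumption.
CROSSWALK = {
    ("MPG Ranch", "Speyeria hesperis"): ("Pelham", "Speyeria hesperis"),
    ("Pelham", "Speyeria hesperis"): ("NABA", "Speyeria atlantis hesperis"),
    ("NABA", "Speyeria atlantis hesperis"): ("Ohio", "Speyeria atlantis"),
}


def translate(source_list, name, target_list, crosswalk=CROSSWALK):
    """Follow crosswalk links until target_list is reached.

    Returns (translated name or None, path of (list, name) pairs visited).
    """
    current = (source_list, name)
    path = [current]
    visited = {current}
    while current[0] != target_list:
        following = crosswalk.get(current)
        # Stop on a missing link or a cycle instead of looping forever.
        if following is None or following in visited:
            return None, path
        path.append(following)
        visited.add(following)
        current = following
    return current[1], path


ohio_name, path = translate("MPG Ranch", "Speyeria hesperis", "Ohio")
print(ohio_name)
for list_name, taxon in path:
    print(f"  {list_name}: {taxon}")
```

Returning the path alongside the result keeps the intermediate base-list names visible, which matters here because the match passes through a subspecies in NABA before reaching the Ohio name.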
Data Integration
• Colorado Adelpha eulalia
• Equivalent in Cascades?
Colorado: Adelpha eulalia
O&W: Adelpha bredowii
Pelham: Adelpha californica, Adelpha eulalia
Cascades: Adelpha californica
© Jerry Oldenettel CC BY-NC-SA via EOL and Flickr
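
This case differs from a simple rename because one O&W name corresponds to two Pelham names. A minimal Python sketch of how such a split could be represented follows. The `SPLITS` table and the helper names are hypothetical, and matching the Colorado and Cascades records to Pelham by identical name is an assumption made for the illustration.

```python
# One lumped O&W name mapped to the Pelham names it is split into (from the slide).
SPLITS = {
    "Adelpha bredowii": frozenset({"Adelpha californica", "Adelpha eulalia"}),
}


def to_pelham(name):
    """Expand an O&W name into Pelham names; unsplit names pass through."""
    return SPLITS.get(name, frozenset({name}))


def to_ow(name):
    """Collapse a Pelham name to its lumped O&W name, if it has one."""
    for lumped, parts in SPLITS.items():
        if name in parts:
            return lumped
    return name


def relation(name_a, name_b):
    """Describe how two recorded names relate under the two classifications."""
    if name_a == name_b:
        return "same name"
    if to_ow(name_a) == to_ow(name_b):
        return "same O&W species, different Pelham species"
    return "different species in both lists"


print(relation("Adelpha eulalia", "Adelpha californica"))
print(sorted(to_pelham("Adelpha bredowii")))
```

Under this representation, whether the Colorado and Cascades records count as equivalent depends on which classification the analysis adopts, so an integration step has to pick one explicitly.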
Conclusions
• Local projects and citizen scientists are important for Lepidopteran monitoring
• Large-scale problems require data integration across projects
• We successfully built crosswalks between four classifications and ten project lists
• Names infrastructure for integrating butterfly data
Questions?
CC BY-NC via Flickr and EOL

Editor's Notes

  1. Overview of leps and their life stages
  2. Why leps are important
  3. This is what butterfly monitoring looks like. Very localized. Very analog.
  4. Starting to organize. Trying to standardize protocols and share data. Pollard Walk data are most useful for tracking population trends and shifts in phenology, which matter most from a global change and conservation perspective. Contrast with iNaturalist observations of butterflies.
  5. Each project uses its own names list.
  6. Pelham has more than twice as many taxa as OW, but a similar number of species. Pelham and ITIS have similar numbers of species and taxa. OW and NABA have similar numbers of species and taxa. All lists are North American except for ITIS. Unmatched taxa are mostly strays. The lumping/splitting deviations are a problem for alignment.
  7. Project lists are just lists of names, with no higher-level taxonomy. Each program chooses its own list of names and its base list. We can’t force people to use a standard list, or they wouldn’t use PollardBase. We had to match things up rather than force conformity.
  8. Project lists are just lists of names, no higher level taxonomy
  9. Project lists are just lists of names, no higher level taxonomy
  10. Project lists are just lists of names, no higher level taxonomy