SlideShare a Scribd company logo
1 of 24
Download to read offline
GBIF Checklist Bank
Indexing & Backbone
Checklist Scope
1.846 datasets registered 18 million name records
Plazi (1.131), Pensoft (178), CoL GSDs (156)
Denormalized Checklist
Normalized Checklist
Checklist Challenges
• Highly relational taxonomic data, almost all records linked in tree & basionym
• Wrong or missing records destroy dataset integrity, not just a single record!
• Different to flat, unrelated occurrence records
• Data Quality
• broken referential integrity
• bad names or placeholders (e.g. «Unallocated Family»)
• missing or unused controlled vcabularies, e.g. «art» for rank species
• Name strings can be published in several ways
• ScientificName
• ScientificName + Authorship
• Genus + SpeciesEpitheton + Rank + InfraspecificEpitheton + Authorship
• Classifications can be published in several ways
• Normalised via parentNameUsageID
• Normalised via parentNameUsage
• Denormalised via Kingdom,Phylum,Class,Order,Family,Genus
Checklist Indexing
• Basic archive validation
• unique ids
• Checklist Normalizer
• resolve relations
• create implicit taxa from denormalised classification
• interpret controlled vocabularies, e.g. rank
• match to backbone
• match to previous version to keep GBIF ids stable
• Checklist Importer
• Inserts data to PostgresDB and solr index for searches
• Checklist Analyser
• generate dataset metrics
Organizing Occurrences
• GBIF needs a single, consistent taxonomy
• for metrics, search, maps
• considerable variation in higher taxa
• synonymies can be very large
• Catalog of Life is largest single source
• ~90% of GBIF occurrence records (thanks to birds)
• ~50% of GBIF occurrence names (35% in 2010)
• GBIF needs to assemble a taxonomy
• originally merged (noisy) names found 

in occurrences. Resulted in lots of duplicates
• improved by stitching together checklist datasets
Cronquist classification
Mimosaceae: 3,200 species
Caesalpiniaceae: 2,000 species
Fabaceae: 14,000 species
“Modern” classification
Fabaceae: 19,200 species
Mimosoideae: 3,200 species
Cæsalpinioideae: 2,000 species
Faboideae: 14,000 species
Current Backbone Issues
• Far too many accepted species (acc/syn)
• Cactaceae: GBIF 12.062 (342 syn), TPL 2.233 (5.422 syn) + 5.500 unknown
• Genus Weingartia: GBIF 129 (0 syn), TPL 8 (26 syn) + 68 unknown
• Many accepted names based on the same basionym
• Sulcorebutia breviflora Backeb.
• Weingartia breviflora (Backeb.) Hentzschel & K.Augustin
• No synonyms with different authors possible
• Poa pubescens R.Br. synonym of Eragrostis pubescens (R.Br.) Steud.
• Poa pubescens Lej. synonym of Poa pratensis L.
• merged all names with exact same canonical name
• list of known homonym genera (IRMNG) used to disambiguate between larger groups
Backbone Building
• Overlay ordered sources
• Start with Catalog of Life
• Primary source defines status
• Create new name if kingdom, canonical name & authorship do not exist in
current nub
• Ignore source name if …
• not a major Linnean rank (infraspecifc ranks are included)
• higher ranks above family (configurable per source)
• status conflicts with already existing status
• hybrid formula, cultivar, candidatus or placeholder names !!!
Catalogue of Life
Fauna
Europaea
GRIN
Mammal
Species
World
Observations
Specimens 8000 Species Lists
10s of taxonomic resources
Me
Backbone Assembling
Animalia
Archaea
Bacteria
Chromista
Fungi
Plantae
Protozoa
Viruses
incertae sedis
• Nub build starts with 8
kingdoms
Backbone Assembling
Plantae
Magnoliophyta
Magnoliopsida
Asterales
Asteraceae
Helianthus L.
Helianthus anuus L.
• Catalog of Life is added
• Defines higher classification
Plantae
Magnoliophyta
Magnoliopsida
Asterales
Asteraceae
Helianthus L.
Helianthus anuus L.
Backbone Assembling
Plantae
Magnoliophyta
Magnoliopsida
Asterales
Asteraceae
Helianthus L.
Helianthus anuus L.
Cichorium
Cichorium intybus L.
• Missing genera are created
• Tribe is ignored
Asteraceae
Cichorieae Lam & DC. [tribe]
Cichorium intybus L.
Backbone Assembling
Plantae
Magnoliophyta
Magnoliopsida
Asterales
Asteraceae
Helianthus L.
Helianthus anuus L.
Cichorium Linneaus
Cichorium intybus L.
= C. balearicum Porta
= C. byzantinum Clementi
• Synonyms respect authors
• Author match very loose
• Existing genus author updated
Plantae
Asteraceae
Cichorium Linneaus
Cichorium intybus Linneaus
= Cichorium balearicum Porta
= Cichorium byzantinum Clem.
= Cichorium byzantinum Clementi
Backbone Assembling
Plantae
Magnoliophyta
Magnoliopsida
Asterales
Asteraceae
Helianthus L.
Helianthus anuus L.
Cichorium L.
Cichorium intybus L.
= C. balearicum Porta
= C. byzantinum Clem.
• Prefer authors from
nomenclators
Asteraceae
Cichorium L.
Cichorium byzantinum Clem.
Backbone Assembling
Asteraceae
Helianthus L.
Helianthus anuus L.
Agoseris
Agoseris apargioides (Less.) Greene
= A. maritima Eastw.
A. a. var. eastwoodiae (Fedde) Munz
A. a. var. maritima (E. Sheld.) Baird
Cichorium L.
Cichorium intybus L.
= C. balearicum Porta
= C. byzantinum Clem.
• Infraspecifics are included
Asteraceae
Agoseris apargioides (Less.) Greene
= A. maritima Eastw.
A. a. var. eastwoodiae (Fedde) Munz
A. a. var. maritima (E. Sheld.) Baird
Backbone Assembling
Asteraceae
Helianthus L.
Helianthus anuus L.
Agoseris
Agoseris apargioides (Less.) Greene
= A. maritima Eastw.
A. a. var. eastwoodiae (Fedde) Munz
A. a. var. maritima (E. Sheld.) Baird
Agoseris eastwoodiae Fedde
Agoseris maritima E. Sheld.
Cichorium L.
Cichorium intybus L.
= C. balearicum Porta
= C. byzantinum Clem.
• Other source treats them

as species
• Same canonical maritima
allowed twice - author different
Asteraceae
Agoseris eastwoodiae Fedde
Agoseris maritima E. Sheld.
Final Cleanup - Basionyms
Asteraceae
Helianthus L.
Helianthus anuus L.
Agoseris
Agoseris apargioides (Less.) Greene
= A. maritima Eastw.
A. a. var. eastwoodiae (Fedde) Munz
= Agoseris eastwoodiae Fedde
A. a. var. maritima (E. Sheld.) Baird
= Agoseris maritima E. Sheld.
Cichorium L.
Cichorium intybus L.
= C. balearicum Porta
= C. byzantinum Clem.
• Finally basionyms are detected
• by terminal epithet & author
within a family
• Only 1 accepted per group
• the most trusted first stays
Final Cleanup - Autonyms
Asteraceae
Helianthus L.
Helianthus anuus L.
Agoseris
Agoseris apargioides (Less.) Greene
= A. maritima Eastw.
A. a. var. apargioides
A. a. var. eastwoodiae (Fedde) Munz
= Agoseris eastwoodiae Fedde
A. a. var. maritima (E. Sheld.) Baird
= Agoseris maritima E. Sheld.
Cichorium L.
Cichorium intybus L.
= C. balearicum Porta
= C. byzantinum Clem.
• Create missing autonyms
Backbone Building Rules
• Create missing genus or species in classification
• only for accepted taxa
• Create missing autonyms for infraspecific
• Detect basionyms based on terminal epithet & authorship
• Assumes epithet & authorship in family is unique
• Converts all but one accepted to synonyms
• Flag taxa as doubtful
• genus or higher taxon without any species (IRMNG)
• species (or infrasp.) with a parent genus (or species) considered to be a synonym
• moved to newly accepted genus (or species)
• the case for potential children of synonymised basionym combination
Backbone Sources
• GBIF Backbone Patch
• Catalogue of Life
• World Register of Marine Species
• Dyntaxa - Svensk taxonomisk databas
• GRIN Taxonomy
• Fauna Europaea
• Integrated Taxonomic Information System
• Euro+Med Plantbase
• Interim Register of Marine and Nonmarine Genera
• The Clements Checklist
• IOC World Bird Names
• Mammal Species of the World
• Paleobiology Database
• Nomenclators
• International Plant Names Index
• Index Fungorum
• ZooBank
• Prokaryotic Nomenclature Up-to-
date
• ICTV Master Species List
• Organisations
• Species Files
• Biodiversity Data Journal (Pensoft)
• ZooKeys (Pensoft)
• PhytoKeys (Pensoft)
• Plazi ???
Backbone Matching
• Occurrence
• fuzzy name match
• classification match
• allow higher rank matches
• Checklist
• match kingdom
• require straight canonical match
• incl authorship comparison
• no webservice yet, only embedded
NameUsageParsed Name
Backbone Match
Citation
Dataset Metrics
Verbatim Record
Metrics
Extensions
• Checklists & Nub

same structure
• Parent-child
hierarchy
• normalized classification
• flexible ranks
• synonyms accepted rel.
• Dataset metrics

as timeseries
• Basionym relation
Schema
CLB Supported Extensions
• Description: human paragraphs about some topic
• Distribution: area ranges with statuses
• Identifier: additional identifier for the record
• Multimedia: image, video, sound
• Literature references: bibliography
• Occurrence (indexed via occurrence workflows)
• Species Profile: extinct, marine, freshwater, terrestrial flags
• Types and specimens: (overlaps with Occurrence)
• Vernacular names: name with language & region
http://rs.gbif.org/extension/gbif/1.0/
Normalizing Classifications

More Related Content

Viewers also liked

ENJ 200- Derecho Constitucional
ENJ 200- Derecho ConstitucionalENJ 200- Derecho Constitucional
ENJ 200- Derecho ConstitucionalENJ
 
CLH Hoisting Equipment Booklet
CLH Hoisting Equipment BookletCLH Hoisting Equipment Booklet
CLH Hoisting Equipment BookletJason W
 
Formatos finanzas e impuestos 2015 3(1)
Formatos finanzas e impuestos 2015 3(1)Formatos finanzas e impuestos 2015 3(1)
Formatos finanzas e impuestos 2015 3(1)karenidaniela
 
hindi project for class 10
hindi project for class 10hindi project for class 10
hindi project for class 10Bhavesh Sharma
 
ppt of our Solar system in hindi
ppt of our Solar system in hindippt of our Solar system in hindi
ppt of our Solar system in hindivethics
 
Ayurveda aapka swasthya aap ke haath by rajiv dixit
Ayurveda aapka swasthya aap ke haath by rajiv dixitAyurveda aapka swasthya aap ke haath by rajiv dixit
Ayurveda aapka swasthya aap ke haath by rajiv dixitBhim Upadhyaya
 
ENJ-100 Constitucionalismo y Estado de Derecho
ENJ-100 Constitucionalismo y Estado de DerechoENJ-100 Constitucionalismo y Estado de Derecho
ENJ-100 Constitucionalismo y Estado de Derechoguest8b5fd0
 

Viewers also liked (17)

RulesBharatRatna&PadmaAwards_0
RulesBharatRatna&PadmaAwards_0RulesBharatRatna&PadmaAwards_0
RulesBharatRatna&PadmaAwards_0
 
Mapa conceptual derecho c
Mapa conceptual derecho cMapa conceptual derecho c
Mapa conceptual derecho c
 
Requerimientos
Requerimientos Requerimientos
Requerimientos
 
Microsoft pdf excel
Microsoft pdf excelMicrosoft pdf excel
Microsoft pdf excel
 
mapas ilustrado finanzas
mapas ilustrado finanzas mapas ilustrado finanzas
mapas ilustrado finanzas
 
Hindi for android mobile Power Point Presentation
Hindi for android mobile Power Point PresentationHindi for android mobile Power Point Presentation
Hindi for android mobile Power Point Presentation
 
ENJ 200- Derecho Constitucional
ENJ 200- Derecho ConstitucionalENJ 200- Derecho Constitucional
ENJ 200- Derecho Constitucional
 
CLH Hoisting Equipment Booklet
CLH Hoisting Equipment BookletCLH Hoisting Equipment Booklet
CLH Hoisting Equipment Booklet
 
Poder electoral
Poder electoralPoder electoral
Poder electoral
 
Hemeroteca
HemerotecaHemeroteca
Hemeroteca
 
Formatos finanzas e impuestos 2015 3(1)
Formatos finanzas e impuestos 2015 3(1)Formatos finanzas e impuestos 2015 3(1)
Formatos finanzas e impuestos 2015 3(1)
 
hindi project for class 10
hindi project for class 10hindi project for class 10
hindi project for class 10
 
ppt of our Solar system in hindi
ppt of our Solar system in hindippt of our Solar system in hindi
ppt of our Solar system in hindi
 
Ayurveda aapka swasthya aap ke haath by rajiv dixit
Ayurveda aapka swasthya aap ke haath by rajiv dixitAyurveda aapka swasthya aap ke haath by rajiv dixit
Ayurveda aapka swasthya aap ke haath by rajiv dixit
 
Poder Ejecutivo
Poder EjecutivoPoder Ejecutivo
Poder Ejecutivo
 
Ensayo el Poder Ciudadano
Ensayo el Poder CiudadanoEnsayo el Poder Ciudadano
Ensayo el Poder Ciudadano
 
ENJ-100 Constitucionalismo y Estado de Derecho
ENJ-100 Constitucionalismo y Estado de DerechoENJ-100 Constitucionalismo y Estado de Derecho
ENJ-100 Constitucionalismo y Estado de Derecho
 

Similar to GBIF Checklist bank and the backbone

GBIF ChecklistBank and Backbone building
GBIF ChecklistBank and Backbone building GBIF ChecklistBank and Backbone building
GBIF ChecklistBank and Backbone building Markus Döring
 
The Flora of Southern Illinois - Lecture 1
The Flora of Southern Illinois - Lecture 1The Flora of Southern Illinois - Lecture 1
The Flora of Southern Illinois - Lecture 1Christopher Benda
 
Saarela arctic change 2014
Saarela arctic change 2014Saarela arctic change 2014
Saarela arctic change 2014Jeff Saarela
 
Q4-Classification-of-Living-Things.pptx
Q4-Classification-of-Living-Things.pptxQ4-Classification-of-Living-Things.pptx
Q4-Classification-of-Living-Things.pptxAaronJade2
 
Angiosperm systematics and biodiversity
Angiosperm systematics and biodiversityAngiosperm systematics and biodiversity
Angiosperm systematics and biodiversityDrReshma Sonwalkar
 
Introduction to plant Systematics by sarah Ashfaq.pptx
Introduction to plant Systematics by sarah Ashfaq.pptxIntroduction to plant Systematics by sarah Ashfaq.pptx
Introduction to plant Systematics by sarah Ashfaq.pptxSarahAshfaq4
 
Fbip specify2015
Fbip specify2015Fbip specify2015
Fbip specify2015wcoetzer
 
Something general on Eukaryotic Taxonomy
Something general on  Eukaryotic TaxonomySomething general on  Eukaryotic Taxonomy
Something general on Eukaryotic TaxonomyEukRef
 
Exploring Patterns of Darkling Beetle Distributions in the Genus Eleodes
Exploring Patterns of Darkling Beetle Distributions in the Genus EleodesExploring Patterns of Darkling Beetle Distributions in the Genus Eleodes
Exploring Patterns of Darkling Beetle Distributions in the Genus EleodesMAndrewJ
 
10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?Tony Rees
 
Variety of life, Binomial Nomenclature
Variety of life, Binomial NomenclatureVariety of life, Binomial Nomenclature
Variety of life, Binomial NomenclatureIram Qaiser
 
Biological Names Talk 01
Biological Names Talk 01Biological Names Talk 01
Biological Names Talk 01rwakefor
 
Biological Names Talk 01
Biological Names Talk 01Biological Names Talk 01
Biological Names Talk 01scitech
 
Classificationnomenclature
ClassificationnomenclatureClassificationnomenclature
ClassificationnomenclatureJohn Gruber
 
Taxonomy_Classification_17_.ppt
Taxonomy_Classification_17_.pptTaxonomy_Classification_17_.ppt
Taxonomy_Classification_17_.pptaprilrances1
 
Binomial Classification of Animals and Taxonomy
Binomial Classification of Animals and TaxonomyBinomial Classification of Animals and Taxonomy
Binomial Classification of Animals and Taxonomyadamraymanlunas2
 
Global Names Architecture - Remsen
Global Names Architecture - RemsenGlobal Names Architecture - Remsen
Global Names Architecture - RemsenDavid Remsen
 
Classification Mr. Binder
Classification Mr. BinderClassification Mr. Binder
Classification Mr. Binderbinderline
 
Introduce the kingdam of animalia
Introduce the kingdam of  animaliaIntroduce the kingdam of  animalia
Introduce the kingdam of animaliaGANAPATHIS16
 

Similar to GBIF Checklist bank and the backbone (20)

GBIF ChecklistBank and Backbone building
GBIF ChecklistBank and Backbone building GBIF ChecklistBank and Backbone building
GBIF ChecklistBank and Backbone building
 
The Flora of Southern Illinois - Lecture 1
The Flora of Southern Illinois - Lecture 1The Flora of Southern Illinois - Lecture 1
The Flora of Southern Illinois - Lecture 1
 
Saarela arctic change 2014
Saarela arctic change 2014Saarela arctic change 2014
Saarela arctic change 2014
 
Q4-Classification-of-Living-Things.pptx
Q4-Classification-of-Living-Things.pptxQ4-Classification-of-Living-Things.pptx
Q4-Classification-of-Living-Things.pptx
 
Angiosperm systematics and biodiversity
Angiosperm systematics and biodiversityAngiosperm systematics and biodiversity
Angiosperm systematics and biodiversity
 
Introduction to plant Systematics by sarah Ashfaq.pptx
Introduction to plant Systematics by sarah Ashfaq.pptxIntroduction to plant Systematics by sarah Ashfaq.pptx
Introduction to plant Systematics by sarah Ashfaq.pptx
 
Remsen Lect04
Remsen Lect04Remsen Lect04
Remsen Lect04
 
Fbip specify2015
Fbip specify2015Fbip specify2015
Fbip specify2015
 
Something general on Eukaryotic Taxonomy
Something general on  Eukaryotic TaxonomySomething general on  Eukaryotic Taxonomy
Something general on Eukaryotic Taxonomy
 
Exploring Patterns of Darkling Beetle Distributions in the Genus Eleodes
Exploring Patterns of Darkling Beetle Distributions in the Genus EleodesExploring Patterns of Darkling Beetle Distributions in the Genus Eleodes
Exploring Patterns of Darkling Beetle Distributions in the Genus Eleodes
 
10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?
 
Variety of life, Binomial Nomenclature
Variety of life, Binomial NomenclatureVariety of life, Binomial Nomenclature
Variety of life, Binomial Nomenclature
 
Biological Names Talk 01
Biological Names Talk 01Biological Names Talk 01
Biological Names Talk 01
 
Biological Names Talk 01
Biological Names Talk 01Biological Names Talk 01
Biological Names Talk 01
 
Classificationnomenclature
ClassificationnomenclatureClassificationnomenclature
Classificationnomenclature
 
Taxonomy_Classification_17_.ppt
Taxonomy_Classification_17_.pptTaxonomy_Classification_17_.ppt
Taxonomy_Classification_17_.ppt
 
Binomial Classification of Animals and Taxonomy
Binomial Classification of Animals and TaxonomyBinomial Classification of Animals and Taxonomy
Binomial Classification of Animals and Taxonomy
 
Global Names Architecture - Remsen
Global Names Architecture - RemsenGlobal Names Architecture - Remsen
Global Names Architecture - Remsen
 
Classification Mr. Binder
Classification Mr. BinderClassification Mr. Binder
Classification Mr. Binder
 
Introduce the kingdam of animalia
Introduce the kingdam of  animaliaIntroduce the kingdam of  animalia
Introduce the kingdam of animalia
 

Recently uploaded

Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |aasikanpl
 
Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10ROLANARIBATO3
 
Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2John Carlo Rollon
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsHajira Mahmood
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantadityabhardwaj282
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxFarihaAbdulRasheed
 
Welcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayWelcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayZachary Labe
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 

Recently uploaded (20)

Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Hauz Khas Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
 
Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10
 
Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutions
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are important
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
 
Welcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayWelcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work Day
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 

GBIF Checklist bank and the backbone

  • 2. Checklist Scope 1.846 datasets registered 18 million name records Plazi (1.131), Pensoft (178), CoL GSDs (156)
  • 5. Checklist Challenges • Highly relational taxonomic data, almost all records linked in tree & basionym • Wrong or missing records destroy dataset integrity, not just a single record! • Different to flat, unrelated occurrence records • Data Quality • broken referential integrity • bad names or placeholders (e.g. «Unallocated Family») • missing or unused controlled vcabularies, e.g. «art» for rank species • Name strings can be published in several ways • ScientificName • ScientificName + Authorship • Genus + SpeciesEpitheton + Rank + InfraspecificEpitheton + Authorship • Classifications can be published in several ways • Normalised via parentNameUsageID • Normalised via parentNameUsage • Denormalised via Kingdom,Phylum,Class,Order,Family,Genus
  • 6. Checklist Indexing • Basic archive validation • unique ids • Checklist Normalizer • resolve relations • create implicit taxa from denormalised classification • interpret controlled vocabularies, e.g. rank • match to backbone • match to previous version to keep GBIF ids stable • Checklist Importer • Inserts data to PostgresDB and solr index for searches • Checklist Analyser • generate dataset metrics
  • 7. Organizing Occurrences • GBIF needs a single, consistent taxonomy • for metrics, search, maps • considerable variation in higher taxa • synonymies can be very large • Catalog of Life is largest single source • ~90% of GBIF occurrence records (thanks to birds) • ~50% of GBIF occurrence names (35% in 2010) • GBIF needs to assemble a taxonomy • originally merged (noisy) names found 
 in occurrences. Resulted in lots of duplicates • improved by stitching together checklist datasets Cronquist classification Mimosaceae: 3,200 species Caesalpiniaceae: 2,000 species Fabaceae: 14,000 species “Modern” classification Fabaceae: 19,200 species Mimosoideae: 3,200 species Cæsalpinioideae: 2,000 species Faboideae: 14,000 species
  • 8. Current Backbone Issues • Far too many accepted species (acc/syn) • Cactaceae: GBIF 12.062 (342 syn), TPL 2.233 (5.422 syn) + 5.500 unknown • Genus Weingartia: GBIF 129 (0 syn), TPL 8 (26 syn) + 68 unknown • Many accepted names based on the same basionym • Sulcorebutia breviflora Backeb. • Weingartia breviflora (Backeb.) Hentzschel & K.Augustin • No synonyms with different authors possible • Poa pubescens R.Br. synonym of Eragrostis pubescens (R.Br.) Steud. • Poa pubescens Lej. synonym of Poa pratensis L. • merged all names with exact same canonical name • list of known homonym genera (IRMNG) used to disambiguate between larger groups
  • 9. Backbone Building • Overlay ordered sources • Start with Catalog of Life • Primary source defines status • Create new name if kingdom, canonical name & authorship do not exist in current nub • Ignore source name if … • not a major Linnean rank (infraspecifc ranks are included) • higher ranks above family (configurable per source) • status conflicts with already existing status • hybrid formula, cultivar, candidatus or placeholder names !!! Catalogue of Life Fauna Europaea GRIN Mammal Species World Observations Specimens 8000 Species Lists 10s of taxonomic resources Me
  • 11. Backbone Assembling Plantae Magnoliophyta Magnoliopsida Asterales Asteraceae Helianthus L. Helianthus anuus L. • Catalog of Life is added • Defines higher classification Plantae Magnoliophyta Magnoliopsida Asterales Asteraceae Helianthus L. Helianthus anuus L.
  • 12. Backbone Assembling Plantae Magnoliophyta Magnoliopsida Asterales Asteraceae Helianthus L. Helianthus anuus L. Cichorium Cichorium intybus L. • Missing genera are created • Tribe is ignored Asteraceae Cichorieae Lam & DC. [tribe] Cichorium intybus L.
  • 13. Backbone Assembling Plantae Magnoliophyta Magnoliopsida Asterales Asteraceae Helianthus L. Helianthus anuus L. Cichorium Linneaus Cichorium intybus L. = C. balearicum Porta = C. byzantinum Clementi • Synonyms respect authors • Author match very loose • Existing genus author updated Plantae Asteraceae Cichorium Linneaus Cichorium intybus Linneaus = Cichorium balearicum Porta = Cichorium byzantinum Clem. = Cichorium byzantinum Clementi
  • 14. Backbone Assembling Plantae Magnoliophyta Magnoliopsida Asterales Asteraceae Helianthus L. Helianthus anuus L. Cichorium L. Cichorium intybus L. = C. balearicum Porta = C. byzantinum Clem. • Prefer authors from nomenclators Asteraceae Cichorium L. Cichorium byzantinum Clem.
  • 15. Backbone Assembling Asteraceae Helianthus L. Helianthus anuus L. Agoseris Agoseris apargioides (Less.) Greene = A. maritima Eastw. A. a. var. eastwoodiae (Fedde) Munz A. a. var. maritima (E. Sheld.) Baird Cichorium L. Cichorium intybus L. = C. balearicum Porta = C. byzantinum Clem. • Infraspecifics are included Asteraceae Agoseris apargioides (Less.) Greene = A. maritima Eastw. A. a. var. eastwoodiae (Fedde) Munz A. a. var. maritima (E. Sheld.) Baird
  • 16. Backbone Assembling Asteraceae Helianthus L. Helianthus anuus L. Agoseris Agoseris apargioides (Less.) Greene = A. maritima Eastw. A. a. var. eastwoodiae (Fedde) Munz A. a. var. maritima (E. Sheld.) Baird Agoseris eastwoodiae Fedde Agoseris maritima E. Sheld. Cichorium L. Cichorium intybus L. = C. balearicum Porta = C. byzantinum Clem. • Other source treats them
 as species • Same canonical maritima allowed twice - author different Asteraceae Agoseris eastwoodiae Fedde Agoseris maritima E. Sheld.
  • 17. Final Cleanup - Basionyms Asteraceae Helianthus L. Helianthus anuus L. Agoseris Agoseris apargioides (Less.) Greene = A. maritima Eastw. A. a. var. eastwoodiae (Fedde) Munz = Agoseris eastwoodiae Fedde A. a. var. maritima (E. Sheld.) Baird = Agoseris maritima E. Sheld. Cichorium L. Cichorium intybus L. = C. balearicum Porta = C. byzantinum Clem. • Finally basionyms are detected • by terminal epithet & author within a family • Only 1 accepted per group • the most trusted first stays
  • 18. Final Cleanup - Autonyms Asteraceae Helianthus L. Helianthus anuus L. Agoseris Agoseris apargioides (Less.) Greene = A. maritima Eastw. A. a. var. apargioides A. a. var. eastwoodiae (Fedde) Munz = Agoseris eastwoodiae Fedde A. a. var. maritima (E. Sheld.) Baird = Agoseris maritima E. Sheld. Cichorium L. Cichorium intybus L. = C. balearicum Porta = C. byzantinum Clem. • Create missing autonyms
  • 19. Backbone Building Rules • Create missing genus or species in classification • only for accepted taxa • Create missing autonyms for infraspecific • Detect basionyms based on terminal epithet & authorship • Assumes epithet & authorship in family is unique • Converts all but one accepted to synonyms • Flag taxa as doubtful • genus or higher taxon without any species (IRMNG) • species (or infrasp.) with a parent genus (or species) considered to be a synonym • moved to newly accepted genus (or species) • the case for potential children of synonymised basionym combination
  • 20. Backbone Sources • GBIF Backbone Patch • Catalogue of Life • World Register of Marine Species • Dyntaxa - Svensk taxonomisk databas • GRIN Taxonomy • Fauna Europaea • Integrated Taxonomic Information System • Euro+Med Plantbase • Interim Register of Marine and Nonmarine Genera • The Clements Checklist • IOC World Bird Names • Mammal Species of the World • Paleobiology Database • Nomenclators • International Plant Names Index • Index Fungorum • ZooBank • Prokaryotic Nomenclature Up-to- date • ICTV Master Species List • Organisations • Species Files • Biodiversity Data Journal (Pensoft) • ZooKeys (Pensoft) • PhytoKeys (Pensoft) • Plazi ???
  • 21. Backbone Matching • Occurrence • fuzzy name match • classification match • allow higher rank matches • Checklist • match kingdom • require straight canonical match • incl authorship comparison • no webservice yet, only embedded
  • 22. NameUsageParsed Name Backbone Match Citation Dataset Metrics Verbatim Record Metrics Extensions • Checklists & Nub
 same structure • Parent-child hierarchy • normalized classification • flexible ranks • synonyms accepted rel. • Dataset metrics
 as timeseries • Basionym relation Schema
  • 23. CLB Supported Extensions • Description: human paragraphs about some topic • Distribution: area ranges with statuses • Identifier: additional identifier for the record • Multimedia: image, video, sound • Literature references: bibliography • Occurrence (indexed via occurrence workflows) • Species Profile: extinct, marine, freshwater, terrestrial flags • Types and specimens: (overlaps with Occurrence) • Vernacular names: name with language & region http://rs.gbif.org/extension/gbif/1.0/