SlideShare a Scribd company logo
1 of 33
Anchoring Biodiversity Information: From Sherborne to the 21 st  century and beyond Biodiversity Informatics – GBIFs role in linking information through scientific names. David Remsen Senior Programme Officer Global Biodiversity Information Facility (GBIF) 28 October 2011
BIODIVERSITY INFORMATICS
“ All  accumulated information  of a species is tied to a scientific name, a name that serves as a link between what has been learned in the past and what we today add to the body of knowledge.”
PRIMARY BIODIVERSITY DATA
PRIMARY BIODIVERSITY DATA
Primary Biodiversity Databases
GBIF IS A FEDERATED NETWORK A “network of networks”
COMMON COMMUNICATIONS REGISTRY
GLOBAL DATA INDEX
310,132,149 data records 9,290 datasets 6,112,683 “names”
DATA PORTAL DISCOVERY ACCESS
Use of Primary Biodiversity Data
Build Provisional Species Lists
Validate range maps
Predict species distributon
Change in suitability for cultivating common bean across the world, from present to 2020, showing a global loss in suitability, especially in Africa. Predict distribution changes
CBD Access & Benefit Sharing (Nagoya) Protocol
Challenges to presenting Taxon-oriented data
Tealia crassicornis  (Müller) Urticina crassicornis  (Müller)
Urticina felina  (Linnaeus 1767) Tealia felina, Tealia felina, Urticina crassicornis, Urticina columbiana, Tealia crassicornis, Urticina felina, Urticina coriacea, Stomphia churchiae, Rhodactinia crassicornis, Tealia lofotensis, Leiotealia spetsbergensis, Madoniactis lofotensis, Tealia tuberculata, Bolocera eques, Tealia greenii, Cereus coriaceus, Actinia felina, Bunodes crassicornis, Actinea tuberculata, Actinea coriacea, Actinia gemmacea, Actinia crassicornis, Actinia dævisii, Actinia coriacea, Actinia holsatica,
Gap in Synonymies
UNTRUSTWORTHY SCIENCE Trochilidae UNTRUSTWORTHY TAXONOMY
TRUSTED TAXONOMY BETTER SCIENCE
Unraveling Homonyms Oenanthe Plantae Magnoliophyta Magnoliopsida Apiales Umbelliferae Oenanthe Plantae Oenanthe Oenanthe Plantae Magnoliophyta Magnoliopsida Apiales Apiaceae Oenanthe ? Orchidaceae Oenanthe Animalia Chordata Aves Passeriformes Muscicapidae Oenanthe Animalia Chordata Aves Passeriformes Turdidae Oenanthe
Difficult for user to interpret Accurate search results Yesterday Today Unraveling Homonyms
 
A need for nomenclators Actinobacillus actimomycetemcomitans Actinobacillus actimycetemcomitans Actinobacillus actinmycetemcomitans Actinobacillus actinomicetemcomitans Actinobacillus actinomy Actinobacillus actinomyce Actinobacillus actinomycemcomitans Actinobacillus actinomyceremcomitans Actinobacillus actinomycetam Actinobacillus actinomycetamcomitans Actinobacillus actinomycetecomitans Actinobacillus actinomycetemcmitans Actinobacillus actinomycetemcomintans Actinobacillus actinomycetemcomitance Actinobacillus actinomycetemcomitans Actinobacillus actinomycetemcomitants Actinobacillus actinomycetemcommitans Actinobacillus actinomycetemocimitans Actinobacillus actinomycetencomitans Actinobacillus actinomycetum Actinobacillus actinomyctemcomitans Actinobacillus actinomyectomcomitans Actinobacillus actinomyetemcomitans Actinobacillus actinonmycetemcomitans Actinobacillus actionomycetemcomitans Actinobacillus actynomicetemcomitans Actinobacillus antinomycetemcomitans … and TaxaMatch
Agalinus paupercula borealis Agalinus pauperculum borealis Agalinis paupercula var. Borealis Agalinus pauperculum var. borealis Agalinus paupercula var. borealis Agalinus paupercula var. borealis Pennell Agalinus paupercula Britton var. borealis Pennell Agalinus paupercula (Gray) Britt. var. borealis Pennell Agalinis paupercula (A.Gray) Britton var. borealis Pennell Agalinus paupercula (Gray) Britton var. borealis (Pennell) Zenkert 1934 Issues of Orthography Reconciling different forms of the same name
Parsing
Dictionaries
Rapidly mines names from literature
Effective Biodiversity Informatics  requires Taxonomic and nomenclatural authority files &  services ,[object Object],[object Object],[object Object],[object Object],[object Object]
A GLOBAL NAMES ARCHITECTURE Another federated network ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

What's hot

Application of Whole Genome Sequencing in the infectious disease’ in vitro di...
Application of Whole Genome Sequencing in the infectious disease’ in vitro di...Application of Whole Genome Sequencing in the infectious disease’ in vitro di...
Application of Whole Genome Sequencing in the infectious disease’ in vitro di...ExternalEvents
 
Biology folding
Biology foldingBiology folding
Biology foldingSydgold15
 
Project Unity: The Way of the Future for Plant Breeding
Project Unity: The Way of the Future for Plant BreedingProject Unity: The Way of the Future for Plant Breeding
Project Unity: The Way of the Future for Plant BreedingPhenome Networks
 
Molecular Systematics and Biodiversity
Molecular Systematics and BiodiversityMolecular Systematics and Biodiversity
Molecular Systematics and BiodiversitySarwar A.D
 
The Chills and Thrills of Whole Genome Sequencing
The Chills and Thrills of Whole Genome SequencingThe Chills and Thrills of Whole Genome Sequencing
The Chills and Thrills of Whole Genome SequencingEmiliano De Cristofaro
 
Role of computer science in biotechnology
Role of computer science in biotechnologyRole of computer science in biotechnology
Role of computer science in biotechnologyParanjay Manchanda
 
Sharing the trail : Inspiring your students through GenOmics and other Social...
Sharing the trail : Inspiring your students through GenOmics and other Social...Sharing the trail : Inspiring your students through GenOmics and other Social...
Sharing the trail : Inspiring your students through GenOmics and other Social...gwardis
 
DNA Technology
DNA TechnologyDNA Technology
DNA Technologymgsonline
 
Human genome project[1]
Human genome project[1]Human genome project[1]
Human genome project[1]somsscience7
 
Human genome project
Human genome projectHuman genome project
Human genome projectDilip jaipal
 
Human genome project
Human genome projectHuman genome project
Human genome projectAmjad Afridi
 
Photosynthetic euglenids
Photosynthetic euglenidsPhotosynthetic euglenids
Photosynthetic euglenidsEukRef
 
Pallavi online assignment
Pallavi online assignmentPallavi online assignment
Pallavi online assignmentreshmafmtc
 

What's hot (18)

Application of Whole Genome Sequencing in the infectious disease’ in vitro di...
Application of Whole Genome Sequencing in the infectious disease’ in vitro di...Application of Whole Genome Sequencing in the infectious disease’ in vitro di...
Application of Whole Genome Sequencing in the infectious disease’ in vitro di...
 
SNPs in whales
SNPs in whalesSNPs in whales
SNPs in whales
 
Biology folding
Biology foldingBiology folding
Biology folding
 
Project Unity: The Way of the Future for Plant Breeding
Project Unity: The Way of the Future for Plant BreedingProject Unity: The Way of the Future for Plant Breeding
Project Unity: The Way of the Future for Plant Breeding
 
Crispr technology
Crispr  technologyCrispr  technology
Crispr technology
 
Molecular Systematics and Biodiversity
Molecular Systematics and BiodiversityMolecular Systematics and Biodiversity
Molecular Systematics and Biodiversity
 
The Chills and Thrills of Whole Genome Sequencing
The Chills and Thrills of Whole Genome SequencingThe Chills and Thrills of Whole Genome Sequencing
The Chills and Thrills of Whole Genome Sequencing
 
Role of computer science in biotechnology
Role of computer science in biotechnologyRole of computer science in biotechnology
Role of computer science in biotechnology
 
Sharing the trail : Inspiring your students through GenOmics and other Social...
Sharing the trail : Inspiring your students through GenOmics and other Social...Sharing the trail : Inspiring your students through GenOmics and other Social...
Sharing the trail : Inspiring your students through GenOmics and other Social...
 
DNA Technology
DNA TechnologyDNA Technology
DNA Technology
 
Human genome project[1]
Human genome project[1]Human genome project[1]
Human genome project[1]
 
Michelle Poster Draft
Michelle Poster DraftMichelle Poster Draft
Michelle Poster Draft
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
DNA in the Laboratory
DNA in the LaboratoryDNA in the Laboratory
DNA in the Laboratory
 
Photosynthetic euglenids
Photosynthetic euglenidsPhotosynthetic euglenids
Photosynthetic euglenids
 
Pallavi online assignment
Pallavi online assignmentPallavi online assignment
Pallavi online assignment
 
Magnetic phenomenon-cv19-vials
Magnetic phenomenon-cv19-vialsMagnetic phenomenon-cv19-vials
Magnetic phenomenon-cv19-vials
 

Viewers also liked

Nodes Portal Toolkit Primer
Nodes Portal Toolkit PrimerNodes Portal Toolkit Primer
Nodes Portal Toolkit PrimerDavid Remsen
 
Remsen celebration of discovery
Remsen celebration of discoveryRemsen celebration of discovery
Remsen celebration of discoveryDavid Remsen
 
Biodiversity capecod short
Biodiversity capecod shortBiodiversity capecod short
Biodiversity capecod shortDavid Remsen
 
Collaboration Forum Keynote
Collaboration Forum KeynoteCollaboration Forum Keynote
Collaboration Forum KeynoteDavid Remsen
 
Emergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBLEmergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBLDavid Remsen
 
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)İbrahim ATAY
 
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)İbrahim ATAY
 

Viewers also liked (9)

Nodes Portal Toolkit Primer
Nodes Portal Toolkit PrimerNodes Portal Toolkit Primer
Nodes Portal Toolkit Primer
 
Remsen celebration of discovery
Remsen celebration of discoveryRemsen celebration of discovery
Remsen celebration of discovery
 
Tdwg 2-remsen
Tdwg 2-remsenTdwg 2-remsen
Tdwg 2-remsen
 
Tdwg 1-remsen
Tdwg 1-remsenTdwg 1-remsen
Tdwg 1-remsen
 
Biodiversity capecod short
Biodiversity capecod shortBiodiversity capecod short
Biodiversity capecod short
 
Collaboration Forum Keynote
Collaboration Forum KeynoteCollaboration Forum Keynote
Collaboration Forum Keynote
 
Emergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBLEmergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBL
 
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
ASP.Net MVC ile Web Uygulamaları - 1(Giriş)
 
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
ASP.Net MVC ile Web Uygulamaları -17(MVCContrib)
 

Similar to Remsen sherborne

Plant Pathology Seminar
Plant Pathology SeminarPlant Pathology Seminar
Plant Pathology SeminarBongsoo Park
 
"The Quest for A field Guide to the Microbes" talk by Jonathan Eisen February...
"The Quest for A field Guide to the Microbes" talk by Jonathan Eisen February..."The Quest for A field Guide to the Microbes" talk by Jonathan Eisen February...
"The Quest for A field Guide to the Microbes" talk by Jonathan Eisen February...Jonathan Eisen
 
2015 Soil Science of America Meeting
2015 Soil Science of America Meeting2015 Soil Science of America Meeting
2015 Soil Science of America MeetingAdina Chuang Howe
 
Applying agricultural biotechnology tools and capabilities to enhance food se...
Applying agricultural biotechnology tools and capabilities to enhance food se...Applying agricultural biotechnology tools and capabilities to enhance food se...
Applying agricultural biotechnology tools and capabilities to enhance food se...ExternalEvents
 
The Ginés‐Mera Fellowship Fund for Postgraduates Studies in Biodiversity
The Ginés‐Mera Fellowship Fund for Postgraduates Studies in BiodiversityThe Ginés‐Mera Fellowship Fund for Postgraduates Studies in Biodiversity
The Ginés‐Mera Fellowship Fund for Postgraduates Studies in BiodiversityCIAT
 
Development of genomics pipelines and its integration with breeding
Development of genomics pipelines and its integration with breedingDevelopment of genomics pipelines and its integration with breeding
Development of genomics pipelines and its integration with breedingCIAT
 
Biosafety of gmos and the role of entomologists
Biosafety of gmos and the role of entomologistsBiosafety of gmos and the role of entomologists
Biosafety of gmos and the role of entomologistsDr. Abiodun Denloye
 
Phyloinformatics: Introduction
Phyloinformatics: IntroductionPhyloinformatics: Introduction
Phyloinformatics: IntroductionRoderic Page
 
iPlant Tree of Life
iPlant Tree of LifeiPlant Tree of Life
iPlant Tree of LifeNaim Matasci
 
Eumicrobedb - Oomycetes Genomics Database
Eumicrobedb - Oomycetes Genomics Database Eumicrobedb - Oomycetes Genomics Database
Eumicrobedb - Oomycetes Genomics Database Arup Ghosh
 
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...mohd younus wani
 
Text-mining and ontologies - new approaches to knowledge discovery of microbi...
Text-mining and ontologies - new approaches to knowledge discovery of microbi...Text-mining and ontologies - new approaches to knowledge discovery of microbi...
Text-mining and ontologies - new approaches to knowledge discovery of microbi...Claire Nedellec
 

Similar to Remsen sherborne (20)

Plant Pathology Seminar
Plant Pathology SeminarPlant Pathology Seminar
Plant Pathology Seminar
 
Big data nebraska
Big data nebraskaBig data nebraska
Big data nebraska
 
"The Quest for A field Guide to the Microbes" talk by Jonathan Eisen February...
"The Quest for A field Guide to the Microbes" talk by Jonathan Eisen February..."The Quest for A field Guide to the Microbes" talk by Jonathan Eisen February...
"The Quest for A field Guide to the Microbes" talk by Jonathan Eisen February...
 
Big Data Field Museum
Big Data Field MuseumBig Data Field Museum
Big Data Field Museum
 
2015 Soil Science of America Meeting
2015 Soil Science of America Meeting2015 Soil Science of America Meeting
2015 Soil Science of America Meeting
 
The Garden Of Eden
The Garden Of EdenThe Garden Of Eden
The Garden Of Eden
 
H177 Midterm Dizon
H177 Midterm DizonH177 Midterm Dizon
H177 Midterm Dizon
 
Big data nebraska
Big data nebraskaBig data nebraska
Big data nebraska
 
Applying agricultural biotechnology tools and capabilities to enhance food se...
Applying agricultural biotechnology tools and capabilities to enhance food se...Applying agricultural biotechnology tools and capabilities to enhance food se...
Applying agricultural biotechnology tools and capabilities to enhance food se...
 
The Ginés‐Mera Fellowship Fund for Postgraduates Studies in Biodiversity
The Ginés‐Mera Fellowship Fund for Postgraduates Studies in BiodiversityThe Ginés‐Mera Fellowship Fund for Postgraduates Studies in Biodiversity
The Ginés‐Mera Fellowship Fund for Postgraduates Studies in Biodiversity
 
Development of genomics pipelines and its integration with breeding
Development of genomics pipelines and its integration with breedingDevelopment of genomics pipelines and its integration with breeding
Development of genomics pipelines and its integration with breeding
 
Biosafety of gmos and the role of entomologists
Biosafety of gmos and the role of entomologistsBiosafety of gmos and the role of entomologists
Biosafety of gmos and the role of entomologists
 
Currsci Sep25 2004
Currsci Sep25 2004Currsci Sep25 2004
Currsci Sep25 2004
 
Phyloinformatics: Introduction
Phyloinformatics: IntroductionPhyloinformatics: Introduction
Phyloinformatics: Introduction
 
Plant genome project(aribidopsis)
Plant genome project(aribidopsis)Plant genome project(aribidopsis)
Plant genome project(aribidopsis)
 
iPlant Tree of Life
iPlant Tree of LifeiPlant Tree of Life
iPlant Tree of Life
 
Eumicrobedb - Oomycetes Genomics Database
Eumicrobedb - Oomycetes Genomics Database Eumicrobedb - Oomycetes Genomics Database
Eumicrobedb - Oomycetes Genomics Database
 
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
Bioinformatics and its Applications in Agriculture/Sericulture and in other F...
 
Text-mining and ontologies - new approaches to knowledge discovery of microbi...
Text-mining and ontologies - new approaches to knowledge discovery of microbi...Text-mining and ontologies - new approaches to knowledge discovery of microbi...
Text-mining and ontologies - new approaches to knowledge discovery of microbi...
 
Session 7: Probiotic diets to increase Queensland fruit fly male performance ...
Session 7: Probiotic diets to increase Queensland fruit fly male performance ...Session 7: Probiotic diets to increase Queensland fruit fly male performance ...
Session 7: Probiotic diets to increase Queensland fruit fly male performance ...
 

More from David Remsen

Use and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological InformaticsUse and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological InformaticsDavid Remsen
 
uBio presentation to UMLS group of NLM / NIH
uBio presentation to UMLS group of NLM / NIHuBio presentation to UMLS group of NLM / NIH
uBio presentation to UMLS group of NLM / NIHDavid Remsen
 
uBio presentation to Jim Edwards 2006
uBio presentation to Jim Edwards 2006uBio presentation to Jim Edwards 2006
uBio presentation to Jim Edwards 2006David Remsen
 
uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004David Remsen
 
National Biodiversity Informatics Goals
National Biodiversity Informatics GoalsNational Biodiversity Informatics Goals
National Biodiversity Informatics GoalsDavid Remsen
 
Nodes Portal Toolkit primer
Nodes Portal Toolkit primerNodes Portal Toolkit primer
Nodes Portal Toolkit primerDavid Remsen
 
Remsen EOL Content Summit
Remsen EOL Content SummitRemsen EOL Content Summit
Remsen EOL Content SummitDavid Remsen
 
Global Names Architecture - Remsen
Global Names Architecture - RemsenGlobal Names Architecture - Remsen
Global Names Architecture - RemsenDavid Remsen
 
D3 02 Vernacular Names
D3 02 Vernacular NamesD3 02 Vernacular Names
D3 02 Vernacular NamesDavid Remsen
 
D3 02 National Checklists
D3 02 National ChecklistsD3 02 National Checklists
D3 02 National ChecklistsDavid Remsen
 
Cataloging Taxonomic Data
Cataloging Taxonomic DataCataloging Taxonomic Data
Cataloging Taxonomic DataDavid Remsen
 
Digitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current ApproachesDigitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current ApproachesDavid Remsen
 

More from David Remsen (14)

Use and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological InformaticsUse and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological Informatics
 
uBio presentation to UMLS group of NLM / NIH
uBio presentation to UMLS group of NLM / NIHuBio presentation to UMLS group of NLM / NIH
uBio presentation to UMLS group of NLM / NIH
 
uBio presentation to Jim Edwards 2006
uBio presentation to Jim Edwards 2006uBio presentation to Jim Edwards 2006
uBio presentation to Jim Edwards 2006
 
Thomson Reuters
Thomson ReutersThomson Reuters
Thomson Reuters
 
uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004
 
National Biodiversity Informatics Goals
National Biodiversity Informatics GoalsNational Biodiversity Informatics Goals
National Biodiversity Informatics Goals
 
Nodes Portal Toolkit primer
Nodes Portal Toolkit primerNodes Portal Toolkit primer
Nodes Portal Toolkit primer
 
Remsen EOL Content Summit
Remsen EOL Content SummitRemsen EOL Content Summit
Remsen EOL Content Summit
 
Remsen sherborne
Remsen sherborneRemsen sherborne
Remsen sherborne
 
Global Names Architecture - Remsen
Global Names Architecture - RemsenGlobal Names Architecture - Remsen
Global Names Architecture - Remsen
 
D3 02 Vernacular Names
D3 02 Vernacular NamesD3 02 Vernacular Names
D3 02 Vernacular Names
 
D3 02 National Checklists
D3 02 National ChecklistsD3 02 National Checklists
D3 02 National Checklists
 
Cataloging Taxonomic Data
Cataloging Taxonomic DataCataloging Taxonomic Data
Cataloging Taxonomic Data
 
Digitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current ApproachesDigitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current Approaches
 

Recently uploaded

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 

Recently uploaded (20)

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 

Remsen sherborne

  • 1. Anchoring Biodiversity Information: From Sherborne to the 21 st century and beyond Biodiversity Informatics – GBIFs role in linking information through scientific names. David Remsen Senior Programme Officer Global Biodiversity Information Facility (GBIF) 28 October 2011
  • 3. “ All accumulated information of a species is tied to a scientific name, a name that serves as a link between what has been learned in the past and what we today add to the body of knowledge.”
  • 7. GBIF IS A FEDERATED NETWORK A “network of networks”
  • 10. 310,132,149 data records 9,290 datasets 6,112,683 “names”
  • 12. Use of Primary Biodiversity Data
  • 16. Change in suitability for cultivating common bean across the world, from present to 2020, showing a global loss in suitability, especially in Africa. Predict distribution changes
  • 17. CBD Access & Benefit Sharing (Nagoya) Protocol
  • 18. Challenges to presenting Taxon-oriented data
  • 19. Tealia crassicornis (Müller) Urticina crassicornis (Müller)
  • 20. Urticina felina (Linnaeus 1767) Tealia felina, Tealia felina, Urticina crassicornis, Urticina columbiana, Tealia crassicornis, Urticina felina, Urticina coriacea, Stomphia churchiae, Rhodactinia crassicornis, Tealia lofotensis, Leiotealia spetsbergensis, Madoniactis lofotensis, Tealia tuberculata, Bolocera eques, Tealia greenii, Cereus coriaceus, Actinia felina, Bunodes crassicornis, Actinea tuberculata, Actinea coriacea, Actinia gemmacea, Actinia crassicornis, Actinia dævisii, Actinia coriacea, Actinia holsatica,
  • 22. UNTRUSTWORTHY SCIENCE Trochilidae UNTRUSTWORTHY TAXONOMY
  • 24. Unraveling Homonyms Oenanthe Plantae Magnoliophyta Magnoliopsida Apiales Umbelliferae Oenanthe Plantae Oenanthe Oenanthe Plantae Magnoliophyta Magnoliopsida Apiales Apiaceae Oenanthe ? Orchidaceae Oenanthe Animalia Chordata Aves Passeriformes Muscicapidae Oenanthe Animalia Chordata Aves Passeriformes Turdidae Oenanthe
  • 25. Difficult for user to interpret Accurate search results Yesterday Today Unraveling Homonyms
  • 26.  
  • 27. A need for nomenclators Actinobacillus actimomycetemcomitans Actinobacillus actimycetemcomitans Actinobacillus actinmycetemcomitans Actinobacillus actinomicetemcomitans Actinobacillus actinomy Actinobacillus actinomyce Actinobacillus actinomycemcomitans Actinobacillus actinomyceremcomitans Actinobacillus actinomycetam Actinobacillus actinomycetamcomitans Actinobacillus actinomycetecomitans Actinobacillus actinomycetemcmitans Actinobacillus actinomycetemcomintans Actinobacillus actinomycetemcomitance Actinobacillus actinomycetemcomitans Actinobacillus actinomycetemcomitants Actinobacillus actinomycetemcommitans Actinobacillus actinomycetemocimitans Actinobacillus actinomycetencomitans Actinobacillus actinomycetum Actinobacillus actinomyctemcomitans Actinobacillus actinomyectomcomitans Actinobacillus actinomyetemcomitans Actinobacillus actinonmycetemcomitans Actinobacillus actionomycetemcomitans Actinobacillus actynomicetemcomitans Actinobacillus antinomycetemcomitans … and TaxaMatch
  • 28. Agalinus paupercula borealis Agalinus pauperculum borealis Agalinis paupercula var. Borealis Agalinus pauperculum var. borealis Agalinus paupercula var. borealis Agalinus paupercula var. borealis Pennell Agalinus paupercula Britton var. borealis Pennell Agalinus paupercula (Gray) Britt. var. borealis Pennell Agalinis paupercula (A.Gray) Britton var. borealis Pennell Agalinus paupercula (Gray) Britton var. borealis (Pennell) Zenkert 1934 Issues of Orthography Reconciling different forms of the same name
  • 31. Rapidly mines names from literature
  • 32.
  • 33.

Editor's Notes

  1. Biodiversity informatics fills a space between traditional bioinformatics with its focus on genomes and Ecoinformatics that looks at entire landscapes and their interaction with the physical world. Biodiversity informatics focuses on taxa and their interactions among each other.
  2. Nomenclature and taxonomy plays a central role within the handling of biodiversity information because nearly every piece of information or data related to a species (or more specifically – a taxon) is labeled with a scientific name.
  3. GBIF has a specific focus within biodiversity information in that our scope is restricted to the mobilisation, discovery, and use of primary biodiversity data. Primary biodiversity data are the digital text or multimedia data records that detail the instance of an organism – the ‘what, where, when, how and by whom’ of the organism’s occurrence and recording. One major class of primary biodiversity data is that derived from natural history collections.
  4. A second class of primary biodiversity data originate with observations of species and there are numerous instances of observational data networks that collect millions of species observations every year.
  5. These different classes of biodiversity information are typically stored in databases of some sort that are hosted throughout the world. These databases may contribute to larger networks or act as standalone data access systems. In most cases, the data are made available for access to the Internet through a variety of gateways or portals.
  6. GBIF represents a federated network that is composed of thousands of different primary biodiversity databases located all over the world.
  7. The thing that makes all of these different databases part of the GBIF network are: These data are made available on the Internet using a common set of communications protocols and data formats. A registry, representing a list of all members of the network and the location of the data itself (often a URL) serves as a master network directory.
  8. The registry and communications protocols are utilised to poll each database in the network and retrieve an index of the biodiversity data records they contain. The index includes the key taxonomic, geospatial, and provenance elements of the data record. This allows the data to be visually represented, for instance, on a map of the Earth.
  9. Currently the GBIF index stands at over 310 million records from over 9000 different databases. Each of these data records records the name of the taxon, usually a species, that the record is associated with. The total number of scientific names in this virtual dataset exceeds 6 million different text strings – far exceeding the number of known species. Correctly interpreting this list of names is a key requirement in enabling effective use of the index.
  10. This graph shows the growth of the GBIF occurrence index since 2007.
  11. Before I describe the challenges inherent to the index, I’d like to illustrate how biodiversity data has been used in various scientific and biodiversity policy-related contexts.
  12. In this example, occurrence data from the GBIF network has been geospatially joined with world protected area boundaries to generate provisional species lists and data distribution summaries for the protected area.
  13. Occurrence data has been combined with IUCN species range maps both to validate the distribution and identify potential gaps in coverage.
  14. Species occurrence data is geo-spatially integrated with additional data types such as climatic data to create an ecological profile for the species. Aquamaps uses ecological niche modeling to predict the distribution of marine species.
  15. In the example illustrated here, the model outputs project changes in distribution of a crop species based on possible climate change scenarios.
  16. Researchers at Lancaster University have utilised GBIF data mining tools and occurrence index to extract over 65,000 species names from the US and Worlds Patent indices and determine the distribution of these species among the worlds nations in order to inform Access and Benefit Sharing processes demanded by developing countries as a component of the Convention on Biological Diversity
  17. The uses illustrated here require access to primary biodiversity data that is organised around taxa – either species or higher groups like familes. This organisation is challenged by a number of different factors which I would like to illustrate.
  18. In a federated data environment, specimens may be labeled with different names that refer to the same species. Here is an example of a pair of nomenclatural synonyms that are initially interpreted as distinct taxa and subsequently result in distinct occurrence data maps.
  19. Access to authoritative synonymised species checklists, when properly annotated and interpreted, enable data records labeled with different names to be linked to the same taxon. This clearly impacts the resultant data distribution output and any subsequent uses of these data. A challenge for GBIF has been in 1) gaining access to taxonomic authority files. Until recently the only major taxonomic data source was the Catalogue of Life – a wonderful resource but one that only partially addressed this problem within the GBIF index.
  20. Edward Dickonson mentioned the problem with synonymy in birds and their compilations being scattered among a range of resources. A consequence of this is illustrated here where the Catalogue of Life provides the correct name for the blue tit, it does not include the original combination of the name coined by Linnaeus and as a consequence, misses the majority of occurrences in the index.
  21. Without access to sufficient authoritative taxonomic data, we have been forced to rely on less-accurate classification data originating in occurrence datasets. These datasets often contain errors such as illustrated here where a synonym of a European bird species was mistakenly placed in the hummingbird family. This creates knock-on effects that impact use beyond the single species to the entire family.
  22. With access to a more complete array of authoritative taxonomic sources, we are able to match more taxa and improve the taxonomic backbone used to organise and present species data records.
  23. The lack of a comprehensive multi-regnal nomenclator means that we have no clear indication of the number of homonyms that exist nor a method for determining which classification is ‘correct’ As a result the GBIF index may provide a confusing array of options for a user. Illustrated above is a typical case where we have a number of different Oenanthe but lack sufficient external taxonomic resources to reconcile this number any further.
  24. Access to a wider array of nomenclatural sources reveals there are exactly two genera with this name and includes a common name to help distinguish them.
  25. Difficulties with orthography in scientific names starts at the source. Here are some examples of insect specimen labels that have been transcribed to electronic databases.
  26. It may come as no surprise, therefore to see the sort of variation that may exist in a federated dataset for some of the more complex scientific names. Considerable work has gone into the development of ‘fuzzy matching’ algorithms, notably Tony Rees’ TaxaMatch. But it’s only authoritative nomenclatural sources that can inform us which is the correctly spelled version of the name.
  27. Reconciling orthograpny and nomenclature presents problems beyond simple misspellings. Nomenclatural formats include authorship, infraspecific ranks, and other notation, For a computer, all of these strings represent different names and present challenges to properly organising data records in a federated environment.
  28. Taxonomic name parsing services provide a solution for matching different forms of the same name whenever biodiversity data needs to be integrated from multiple sources. The service atomises name into recognisable constituent parts and reassembles a simplified canonical form that can will be equivalent for the different versions of the name.
  29. These name parsers – combined with authoritative nomenclatural data – extend the utility of this service by providing the raw materials for creating specialised taxonomic name dictionaries.
  30. These dictionaries, combined with software, result in name-mining services that can locate scientific names in literature– on specimen labels – and other full-text publications. It can rapidly and accurately extract all scientific names from large compilations of literature. Such services are employed by the BHL to develop taxonomic indices and by the CBD data mining example I cited earlier
  31. How do we facilitate this?
  32. At GBIF we are working today on extending our architectural framework to serve as a contributor to a Global Names Architecture. A framework that supports the discovery of, and access to, a range of nomenclatural and taxonomic resources. To enable the development of new integrated resources such as a consolidated nomenclatural index that can serve as a core authoritative names dictionary from which different taxonomies may be tied. And to promote the development of name services that enable taxonomy to serve as the core organisational framework for all biodiversity information. Thank you.