SlideShare a Scribd company logo
1 of 25
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Newt: as concept
• Triturus viridescens Rafinesque 1820
• String
• a single specimen
• Nomenclatural concept
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
viridis - to become green
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Concepts:Nomenclatural
• Triturus viridescens Rafinesque 1820
• Notopthalmus viridescens Baird 1850
• Notophthalmus viridescens Gray 1850
• Notophthalma viridescens Gray 1858 msp.
• Diemyctylus viridescens Hallowell 1856
• Triton viridescens Strauch, 1870
• Molge viridescens Boulanger, 1872
• Diemyctylus minatus viridescens Yarrow
•…
Common origin in a single real specimen (homotypic)
Creation of the new nomen concept is subjective
Relationship among them is not
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Concepts: Nomenclatural
• Triturus viridescens dorsalis - Bishop, 1943
• Diemyctylus viridescens dorsalis Schmidt 1953
• Notophthalmus viridescens dorsalis - Smith, 1953
• Triturus viridescens louisianae - Strecker 1928
• Triturus viridescens louisianensis - Bishop, 1943
• Diemyctylus viridescens louisianensis Schmidt 1953
• Notophthalmus viridescens louisianensis - Smith, 1953
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Concepts:Taxonomic
• Notopthalmus viridescens Valid name
•Triturus viridescens
• Notopthalmus viridescens
• Notophthalmus viridescens
• Notophthalma viridescens
• Diemyctylus viridescens
• Triton viridescens
• Molge viridescens
• Diemyctylus minatus viridescens
• Triturus viridescens dorsalis
• Diemyctylus viridescens dorsalis
• Notophthalmus viridescens dorsalis
•… 24 others
Frost 2005 AMNH
• Notopthalmus viridescens viridescens
•Triturus viridescens
• Notopthalmus viridescens
• Notophthalmus viridescens
• Notophthalma viridescens
• Diemyctylus viridescens
• Triton viridescens
• Molge viridescens
• Notophthalmus viridescens dorsalis
• Triturus viridescens dorsalis
• Diemyctylus viridescens dorsalis
• Notophthalmus viridescens louisianensis
Dolbe 2004
Expert interpretation of the original specimens
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Concepts:Taxonomic
• Amphibia
• Urodela
• Salamandridae
• Notophthalmus
• Notopthalmus viridescens
Frost 2005 AMNH
• Amphibia
• Batrachia
• Caudata
• Salamandroidea
• Salamandridae
• Notophthalmus
• Notopthalmus viridescens
NCBI 2005
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Concepts:Summary
• Factual
• Inter-relationships are objective
• No new science required
• (except to make new ones)
• Stable
• Expert scrutiny useful, not required
• Compilation potentially FAST
• uBio 1 million/year
• share (no opinion attached)
Nomenclatural Concepts
• Opinion
• Interelationships are subjective
• Derived from nomenclatural concepts
• Expert scrutiny is required
• Unstable
• Compilation slow
• CoL 50K / year
• Diptera 200K/15 years
• sharing concerns - opinions attached
Taxonomic Concepts
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Why this is a problem
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Don’t forget common names
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
And additionally…
5-10% scientific names become invalid per decade
Scientific names aren’t unique
Acalyptus
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Names Challenges within PubMed
476 unique
Name (Nomenclatural Synonyms) PMID Date Unique
Notophthalmus viridescens 350 1965 349
Diemictylus viridescens 36 1959 36
Triturus viridescens 87 1949 86
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Names Challenges within PubMed
4208 unique PMID
Name (Taxonomic Synonyms) Total Unique
Brucella melitensis 1078 840 78.1%
Brucella abortus (Bacterium abortus) 3109 2852 91.7%
Brucella canis 178 146 82.0%
Brucella neotomae 12 4 33.3%
Brucella ovis 233 168 84.9%
Brucella suis 286 198 69.2%
Brucella melitensis DSMZ 2005
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
How big is the problem?
• Not sure
• No comprehensive listing of
names
• 1.75M valid species names
• 2-?M+ invalid names
• 2-?M+ vernacular names
• + Misspellings, lexical forms
• 14,000 avian genera
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
uBio
• Library service
• “System” must account for all names
• Any classifications
• Biological Name Server
• 2 million nomenclatural concepts
• 1.7 taxon concepts
• (60 classifications)
• SOAP/WSDL web services
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
uBio
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Major Impediment to progress
• Different taxon concepts/needs
• Same nomenclatural concepts
• No obj/subj distinction
• Duplication
• Interconnectivity issues
NameBank
• Repository for all nomen
concepts
•Insulates taxonomic systems
from “bad” nomen concepts
•Consensus data only
• Common to any taxon
concept
• Shareable
• Distributable
NameBank
• NameBank is not a nomenclator nor are nomenclators
NameBank
• NameBank is an index of factually-derived name concepts that
include a much more broad names definition
• It overlaps, and is supported by nomenclators and should, I
think, provide a service on top of NameBank.
• NameBank provides an underlying unified index to systems like
IF that contain authoritative nomenclatural metadata.
• NameBank accomodates strings outside the scope of
nomenclators
NameBank: UMLS Metathesaurus style
NameBank: Current
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
NameBank
• Repository for all nomen
concepts
• Insulates taxonomic systems
from “bad” nomen concepts
• Consensus data only
• Layered
• Shareable
• Distributable
• Independent compilation
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Share
QuickTime™ and a
TIFF (LZW) decompressor
are needed to see this picture.
• NameBank is a big job
• Catalog all names
• Map all factually derived relationships
• Share them for increased data access
•Proactive
• NCBI
• CBOL
• new submissions
•
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
Federate
• Layered architecture
• Common Foundation
• Diverse expression
• Enhanced Interchange
• Cooperation
• Efficient
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.

More Related Content

Similar to uBio presentation to UMLS group of NLM / NIH

Fbip specify2015
Fbip specify2015Fbip specify2015
Fbip specify2015
wcoetzer
 
[Bio1] ch 1 evolution the themes of biology and scientific inquiry
[Bio1] ch 1 evolution the themes of biology and scientific inquiry[Bio1] ch 1 evolution the themes of biology and scientific inquiry
[Bio1] ch 1 evolution the themes of biology and scientific inquiry
RandomDude4
 
Parfrey smbe euk_2013_final
Parfrey smbe euk_2013_finalParfrey smbe euk_2013_final
Parfrey smbe euk_2013_final
Laura_Parfrey
 

Similar to uBio presentation to UMLS group of NLM / NIH (20)

Fbip specify2015
Fbip specify2015Fbip specify2015
Fbip specify2015
 
Evolution by Dr. Azharuddin Daphedar
Evolution by Dr. Azharuddin DaphedarEvolution by Dr. Azharuddin Daphedar
Evolution by Dr. Azharuddin Daphedar
 
The Good Species
The Good SpeciesThe Good Species
The Good Species
 
uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004uBio presentation to Species 2000 May 2004
uBio presentation to Species 2000 May 2004
 
[Bio1] ch 1 evolution the themes of biology and scientific inquiry
[Bio1] ch 1 evolution the themes of biology and scientific inquiry[Bio1] ch 1 evolution the themes of biology and scientific inquiry
[Bio1] ch 1 evolution the themes of biology and scientific inquiry
 
Citizen mycology
Citizen mycologyCitizen mycology
Citizen mycology
 
The Good species
The Good speciesThe Good species
The Good species
 
ArtificialSelection.ppt
ArtificialSelection.pptArtificialSelection.ppt
ArtificialSelection.ppt
 
Sbc174 evolution 2014 week2
Sbc174 evolution 2014 week2Sbc174 evolution 2014 week2
Sbc174 evolution 2014 week2
 
Species delimitation - species limits and character evolution
Species delimitation - species limits and character evolutionSpecies delimitation - species limits and character evolution
Species delimitation - species limits and character evolution
 
Taxonomy of prokaryotes
Taxonomy of prokaryotesTaxonomy of prokaryotes
Taxonomy of prokaryotes
 
Thomson Reuters
Thomson ReutersThomson Reuters
Thomson Reuters
 
I unit part 1
I unit   part 1I unit   part 1
I unit part 1
 
Ch 2 biological classification
Ch 2 biological classificationCh 2 biological classification
Ch 2 biological classification
 
Parfrey smbe euk_2013_final
Parfrey smbe euk_2013_finalParfrey smbe euk_2013_final
Parfrey smbe euk_2013_final
 
Diving_Beetles_of_the_World_Systematics_and_Biolo_3343724_(z-lib.org).pdf
Diving_Beetles_of_the_World_Systematics_and_Biolo_3343724_(z-lib.org).pdfDiving_Beetles_of_the_World_Systematics_and_Biolo_3343724_(z-lib.org).pdf
Diving_Beetles_of_the_World_Systematics_and_Biolo_3343724_(z-lib.org).pdf
 
River Out of Eden
River Out of EdenRiver Out of Eden
River Out of Eden
 
Overview of unicellular elements of domains of life
Overview of unicellular elements of domains of lifeOverview of unicellular elements of domains of life
Overview of unicellular elements of domains of life
 
Theoryofevolution 110815170612-phpapp01
Theoryofevolution 110815170612-phpapp01Theoryofevolution 110815170612-phpapp01
Theoryofevolution 110815170612-phpapp01
 
Intro bioinformatics
Intro bioinformaticsIntro bioinformatics
Intro bioinformatics
 

More from David Remsen

More from David Remsen (18)

Use and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological InformaticsUse and Limits of Scientific Names in Biological Informatics
Use and Limits of Scientific Names in Biological Informatics
 
Biodiversity capecod short
Biodiversity capecod shortBiodiversity capecod short
Biodiversity capecod short
 
Emergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBLEmergent interdisciplinary research opportunity for the MBL
Emergent interdisciplinary research opportunity for the MBL
 
Remsen celebration of discovery
Remsen celebration of discoveryRemsen celebration of discovery
Remsen celebration of discovery
 
National Biodiversity Informatics Goals
National Biodiversity Informatics GoalsNational Biodiversity Informatics Goals
National Biodiversity Informatics Goals
 
Nodes Portal Toolkit primer
Nodes Portal Toolkit primerNodes Portal Toolkit primer
Nodes Portal Toolkit primer
 
Remsen EOL Content Summit
Remsen EOL Content SummitRemsen EOL Content Summit
Remsen EOL Content Summit
 
Remsen sherborne
Remsen sherborneRemsen sherborne
Remsen sherborne
 
Global Names Architecture - Remsen
Global Names Architecture - RemsenGlobal Names Architecture - Remsen
Global Names Architecture - Remsen
 
Nodes Portal Toolkit Primer
Nodes Portal Toolkit PrimerNodes Portal Toolkit Primer
Nodes Portal Toolkit Primer
 
Collaboration Forum Keynote
Collaboration Forum KeynoteCollaboration Forum Keynote
Collaboration Forum Keynote
 
Tdwg 2-remsen
Tdwg 2-remsenTdwg 2-remsen
Tdwg 2-remsen
 
Tdwg 1-remsen
Tdwg 1-remsenTdwg 1-remsen
Tdwg 1-remsen
 
Remsen sherborne
Remsen sherborneRemsen sherborne
Remsen sherborne
 
D3 02 Vernacular Names
D3 02 Vernacular NamesD3 02 Vernacular Names
D3 02 Vernacular Names
 
D3 02 National Checklists
D3 02 National ChecklistsD3 02 National Checklists
D3 02 National Checklists
 
Cataloging Taxonomic Data
Cataloging Taxonomic DataCataloging Taxonomic Data
Cataloging Taxonomic Data
 
Digitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current ApproachesDigitisation of Taxonomic Data: Current Approaches
Digitisation of Taxonomic Data: Current Approaches
 

uBio presentation to UMLS group of NLM / NIH

  • 1. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY
  • 2. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Newt: as concept • Triturus viridescens Rafinesque 1820 • String • a single specimen • Nomenclatural concept QuickTime™ and a TIFF (Uncompressed) decompressor are needed to see this picture. viridis - to become green
  • 3. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Concepts:Nomenclatural • Triturus viridescens Rafinesque 1820 • Notopthalmus viridescens Baird 1850 • Notophthalmus viridescens Gray 1850 • Notophthalma viridescens Gray 1858 msp. • Diemyctylus viridescens Hallowell 1856 • Triton viridescens Strauch, 1870 • Molge viridescens Boulanger, 1872 • Diemyctylus minatus viridescens Yarrow •… Common origin in a single real specimen (homotypic) Creation of the new nomen concept is subjective Relationship among them is not QuickTime™ and a TIFF (Uncompressed) decompressor are needed to see this picture.
  • 4. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Concepts: Nomenclatural • Triturus viridescens dorsalis - Bishop, 1943 • Diemyctylus viridescens dorsalis Schmidt 1953 • Notophthalmus viridescens dorsalis - Smith, 1953 • Triturus viridescens louisianae - Strecker 1928 • Triturus viridescens louisianensis - Bishop, 1943 • Diemyctylus viridescens louisianensis Schmidt 1953 • Notophthalmus viridescens louisianensis - Smith, 1953 QuickTime™ and a TIFF (Uncompressed) decompressor are needed to see this picture.
  • 5. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Concepts:Taxonomic • Notopthalmus viridescens Valid name •Triturus viridescens • Notopthalmus viridescens • Notophthalmus viridescens • Notophthalma viridescens • Diemyctylus viridescens • Triton viridescens • Molge viridescens • Diemyctylus minatus viridescens • Triturus viridescens dorsalis • Diemyctylus viridescens dorsalis • Notophthalmus viridescens dorsalis •… 24 others Frost 2005 AMNH • Notopthalmus viridescens viridescens •Triturus viridescens • Notopthalmus viridescens • Notophthalmus viridescens • Notophthalma viridescens • Diemyctylus viridescens • Triton viridescens • Molge viridescens • Notophthalmus viridescens dorsalis • Triturus viridescens dorsalis • Diemyctylus viridescens dorsalis • Notophthalmus viridescens louisianensis Dolbe 2004 Expert interpretation of the original specimens
  • 6. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Concepts:Taxonomic • Amphibia • Urodela • Salamandridae • Notophthalmus • Notopthalmus viridescens Frost 2005 AMNH • Amphibia • Batrachia • Caudata • Salamandroidea • Salamandridae • Notophthalmus • Notopthalmus viridescens NCBI 2005
  • 7. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Concepts:Summary • Factual • Inter-relationships are objective • No new science required • (except to make new ones) • Stable • Expert scrutiny useful, not required • Compilation potentially FAST • uBio 1 million/year • share (no opinion attached) Nomenclatural Concepts • Opinion • Interelationships are subjective • Derived from nomenclatural concepts • Expert scrutiny is required • Unstable • Compilation slow • CoL 50K / year • Diptera 200K/15 years • sharing concerns - opinions attached Taxonomic Concepts
  • 8. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Why this is a problem
  • 9. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Don’t forget common names
  • 10. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY And additionally… 5-10% scientific names become invalid per decade Scientific names aren’t unique Acalyptus
  • 11. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Names Challenges within PubMed 476 unique Name (Nomenclatural Synonyms) PMID Date Unique Notophthalmus viridescens 350 1965 349 Diemictylus viridescens 36 1959 36 Triturus viridescens 87 1949 86
  • 12. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Names Challenges within PubMed 4208 unique PMID Name (Taxonomic Synonyms) Total Unique Brucella melitensis 1078 840 78.1% Brucella abortus (Bacterium abortus) 3109 2852 91.7% Brucella canis 178 146 82.0% Brucella neotomae 12 4 33.3% Brucella ovis 233 168 84.9% Brucella suis 286 198 69.2% Brucella melitensis DSMZ 2005
  • 13. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY How big is the problem? • Not sure • No comprehensive listing of names • 1.75M valid species names • 2-?M+ invalid names • 2-?M+ vernacular names • + Misspellings, lexical forms • 14,000 avian genera
  • 14. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY uBio • Library service • “System” must account for all names • Any classifications • Biological Name Server • 2 million nomenclatural concepts • 1.7 taxon concepts • (60 classifications) • SOAP/WSDL web services
  • 15. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY
  • 16. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY
  • 17. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY uBio
  • 18. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Major Impediment to progress • Different taxon concepts/needs • Same nomenclatural concepts • No obj/subj distinction • Duplication • Interconnectivity issues
  • 19. NameBank • Repository for all nomen concepts •Insulates taxonomic systems from “bad” nomen concepts •Consensus data only • Common to any taxon concept • Shareable • Distributable
  • 20. NameBank • NameBank is not a nomenclator nor are nomenclators NameBank • NameBank is an index of factually-derived name concepts that include a much more broad names definition • It overlaps, and is supported by nomenclators and should, I think, provide a service on top of NameBank. • NameBank provides an underlying unified index to systems like IF that contain authoritative nomenclatural metadata. • NameBank accomodates strings outside the scope of nomenclators
  • 23. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY NameBank • Repository for all nomen concepts • Insulates taxonomic systems from “bad” nomen concepts • Consensus data only • Layered • Shareable • Distributable • Independent compilation
  • 24. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Share QuickTime™ and a TIFF (LZW) decompressor are needed to see this picture. • NameBank is a big job • Catalog all names • Map all factually derived relationships • Share them for increased data access •Proactive • NCBI • CBOL • new submissions •
  • 25. Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY Federate • Layered architecture • Common Foundation • Diverse expression • Enhanced Interchange • Cooperation • Efficient QuickTime™ and a TIFF (Uncompressed) decompressor are needed to see this picture.

Editor's Notes

  1. Depending on your Taxon concept these other