uBio presentation to Jim Edwards 2006
1. universal Biological indexer and organizer 1
New Dimensions in Managing Biological
Information @ the MBLWHOI LIBRARY
David Remsen
June 27, 2006
2. universal Biological indexer and organizer 2
All accumulated information of a species is tied to a scientific
name, a name that serves as a link between what has been learned
in the past and what we today add to the body of knowledge.
- Grimaldi & Engel, 2005, Evolution of the Insects
3. universal Biological indexer and organizer 3
a name that serves as a link between what has been learned in the past
From T.E. Glover, The Fishes of Southwestern Japan, c.1870
4. universal Biological indexer and organizer 4
Universal Biological Indexer and Organizer
Research Funded by the Andrew W. Mellon Foundation
MBL / WHOI LIBRARY
…and what we today add to the body of knowledge.
6. universal Biological indexer and organizer 6
The challenge of names as keywords
Finding this…
Type keyword…
With this…
7. universal Biological indexer and organizer 7
Names – the only universal metadata for Biology
Names offer a logical way to search for and index content
• Names annotate data objects
• All names annotate all data objects
• A compilation of all names ever used is the foundation of a universal index for biology
• or for a semantic web for biology
8. universal Biological indexer and organizer 8
• Many names refer to one concept
• Vernacular concept
• Lexical or Nominal synonym
• Nomenclatural synonym
• Taxonomic Synonym
• Single name refers to many concepts
• Homonyms
• Taxonomic concepts
• Vernacular concepts
• Taxonomic Groups/Classifications
The Taxonomic Names Problem in Biology
9. universal Biological indexer and organizer 9
Many to One: Vernacular Concepts
• Equivalence implicit through co-occurrence
11. universal Biological indexer and organizer 11
Retention of lexical & nomenclatural variation
Loligo pealeii
Loligo pealii
Loligo pealei
Doryteuthis pealei
13. universal Biological indexer and organizer 13
Taxonomic Concept
14. universal Biological indexer and organizer 14
Libraries
Publishers
Museums
Federal Agencies
Name IR impediments in current systems: NLM, JSTOR
15. universal Biological indexer and organizer 15
Name IR impediments in current systems: OBIS
One organism
4 scientific names
4 maps
We want one map
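The "one map" goal can be sketched as a merge of occurrence records by shared concept rather than by name string. This is only an illustration: the four name strings are borrowed from the earlier lexical-variation slide, while the concept ID, the coordinates, and the function name are hypothetical and do not describe how OBIS or uBio actually work.

```python
from collections import defaultdict

# Hypothetical lookup: each name string points to one shared concept ID.
# The ID "concept-1" is invented for this sketch.
NAME_TO_CONCEPT = {
    "Loligo pealeii": "concept-1",
    "Loligo pealii": "concept-1",
    "Loligo pealei": "concept-1",
    "Doryteuthis pealei": "concept-1",
}

def merge_occurrences(records):
    """Group (name, lat, lon) records by concept so all names share one map."""
    merged = defaultdict(list)
    for name, lat, lon in records:
        # Names missing from the lookup fall back to their own group.
        key = NAME_TO_CONCEPT.get(name, name)
        merged[key].append((lat, lon))
    return dict(merged)

# Invented coordinates, one record per name.
records = [
    ("Loligo pealeii", 41.5, -70.7),
    ("Loligo pealii", 40.9, -71.2),
    ("Loligo pealei", 39.8, -73.0),
    ("Doryteuthis pealei", 38.2, -74.5),
]
maps = merge_occurrences(records)
print(len(maps), "map(s) with", len(maps["concept-1"]), "points")
```

Without the lookup, the same records would fall into four groups, one per spelling.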
16. universal Biological indexer and organizer 16
• Basis for Relationships: Facts
• Vernacular concept
• Lexical or Nominal synonym
• Nomenclatural synonym
• Homonyms
• Basis for Relationship: Opinion
• Taxonomic Synonym
• Vernacular concepts
• Taxonomic Groups/Classifications
Division of Concepts
17. universal Biological indexer and organizer 17
Lexical Synonyms
Nomenclatural Synonyms
Vernacular Names
Taxonomic Hierarchies
Taxonomic Synonyms
Primary Components of uBio
Indexes to content
Indexes to taxonomic views
20. universal Biological indexer and organizer 20
Fitting In
22. universal Biological indexer and organizer 22
Network Service: Attribution
• Every datum sent out via the service is logged
– nameBankID
– datestamp
– Client IP
– Calling method
– requestorIP
• <client optional>
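A log entry with the fields listed above might look like the following sketch. The class, the function, the integer type for nameBankID, and the example values are all assumptions. Treating the requestor IP as the optional, client-supplied field is one reading of "<client optional>", not something the slide confirms.

```python
from dataclasses import dataclass, asdict
from datetime import datetime, timezone
from typing import Optional

@dataclass
class AttributionLogEntry:
    # Field names mirror the slide; types are assumptions.
    name_bank_id: int
    datestamp: str
    client_ip: str
    calling_method: str
    requestor_ip: Optional[str] = None  # assumed to be supplied optionally by the client

def log_datum(log, name_bank_id, client_ip, calling_method, requestor_ip=None):
    """Append one entry to the log for a single datum sent out by the service."""
    entry = AttributionLogEntry(
        name_bank_id=name_bank_id,
        datestamp=datetime.now(timezone.utc).isoformat(),
        client_ip=client_ip,
        calling_method=calling_method,
        requestor_ip=requestor_ip,
    )
    log.append(asdict(entry))
    return entry

# Example values are invented; the IP is from a documentation-only range.
log = []
log_datum(log, 12345, "192.0.2.1", "hypothetical_method")
```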
23. universal Biological indexer and organizer 23
Tools and Applications: FindIT
• Is trainable
• Locates names & authorities
• Finds names it doesn’t know
• Finds names mangled by OCR
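One way to picture the OCR case is approximate string matching against a list of known names. The sketch below uses Python's standard `difflib` as a stand-in technique; it is not FindIT's actual method, which the slides do not describe, and it does not cover training or the discovery of unknown names. The name list, the cutoff, and the function name are assumptions.

```python
import difflib

# Small stand-in for a name index; a real index would be far larger.
KNOWN_NAMES = ["Loligo pealeii", "Doryteuthis pealei"]

def match_ocr_name(candidate, known=KNOWN_NAMES, cutoff=0.8):
    """Return the closest known name, or None if nothing is similar enough."""
    matches = difflib.get_close_matches(candidate, known, n=1, cutoff=cutoff)
    return matches[0] if matches else None

# OCR commonly confuses the letter "l" with the digit "1".
print(match_ocr_name("Lo1igo pea1eii"))
print(match_ocr_name("Homo sapiens"))
```

The cutoff trades recall for precision: a lower value recovers more badly mangled names but also produces more false matches.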
24. universal Biological indexer and organizer 24
Tools and Applications: LinkIT
26. universal Biological indexer and organizer 27
Taxonomic intelligence applied to search
Synonymies expand the scope of queries
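Query expansion through synonymies can be sketched as follows: the search term is replaced by every name in its synonym group before matching. The group reuses the name strings from the lexical-variation slide. The data layout, the function names, and the example documents are hypothetical.

```python
# Each set holds names treated as equivalent for search purposes.
SYNONYM_GROUPS = [
    {"Loligo pealeii", "Loligo pealii", "Loligo pealei", "Doryteuthis pealei"},
]

def expand_query(name, groups=SYNONYM_GROUPS):
    """Return the queried name plus every name grouped with it."""
    expanded = {name}
    for group in groups:
        if name in group:
            expanded |= group
    return sorted(expanded)

def search(documents, name):
    """Return documents that mention the name or any of its synonyms."""
    terms = expand_query(name)
    return [doc for doc in documents if any(term in doc for term in terms)]

# Invented document titles for illustration.
docs = ["Notes on Doryteuthis pealei", "Birds of the Belgian Congo"]
hits = search(docs, "Loligo pealii")
```

A plain keyword search for "Loligo pealii" would miss the first document; the expanded query finds it.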
29. universal Biological indexer and organizer 30
Taxonomic hierarchies enhance data browsing
• Birds of the Belgian Congo
• 4500 pages
• One page has a species of dipteran
• How would someone interested find it?
• 50,000+ Diptera species to choose from
Both enhancements apply to all name-annotated content
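The browsing case can be sketched with parent links: a page annotated with a species name becomes reachable from any higher taxon above it. The slide does not name the dipteran, so Musca domestica is an illustrative stand-in, and the page numbers, tables, and function names are hypothetical.

```python
# Child -> parent links for one illustrative lineage.
PARENT = {
    "Musca domestica": "Musca",
    "Musca": "Muscidae",
    "Muscidae": "Diptera",
    "Diptera": "Insecta",
}

# Hypothetical page index: page number -> names found on that page.
PAGE_NAMES = {
    17: ["Corvus albus"],
    1203: ["Musca domestica"],
}

def ancestors(name):
    """Walk the parent links upward and collect every higher taxon."""
    found = []
    while name in PARENT:
        name = PARENT[name]
        found.append(name)
    return found

def pages_under(taxon):
    """Return pages annotated with the taxon itself or any name below it."""
    hits = []
    for page, names in PAGE_NAMES.items():
        if any(n == taxon or taxon in ancestors(n) for n in names):
            hits.append(page)
    return sorted(hits)

diptera_pages = pages_under("Diptera")
```

A reader browsing to Diptera gets the page directly, without having to guess which of the 50,000+ species names to search for.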
32. universal Biological indexer and organizer 33
Alternative names
Vernacular names
Expert view
More or less specific
Suggestions & corrections
Indexing power from NameBank
35. universal Biological indexer and organizer 36
content
certified linkouts to authoritative resources
XML source
Additional information from specific projects
37. universal Biological indexer and organizer 38
• data from various sources may be merged
• red dots on the map link back to the website that provided the geographical co-ordinates
Specimen distribution data from remote sources