For the meeting on Wednesday on legacy literature, we would like to ask you to give a brief (5-10min) outline of what your plans are with BHL, and especially your move into content. This would be helpful for a more informed following discussion.
Extensive: Aiming for a critical mass of biodiversity literature
Global: Originating in the US and UK, BHL now has nodes in Europe, China, Australia, Brazil, Egypt, and Africa
Open: Data is freely available for viewing, downloading, and re-use
On legacy literature: what are your plans with BHL, and especially your move into content?
Mention Neti Neti
You can see from this slide that accuracy goes way down when processing older blackletter-type typefaces.
On legacy literature: what are your plans with BHL, and especially your move into content?
• Growth
• More Global Content
• Taxon Names
• Article Metadata
• Microcitations and COinS
• API
• ZooBank
• OCR improvements through Gaming
• Crowdsource Markup
• WFO?
Natural history illustrations from the Biodiversity Heritage Library seem to leap across boundaries while being catalogued, emerging simultaneously as history, science and art. As historic documents, they paint a vibrant picture of the first time European scientists and explorers encountered exotic plants and animals in the 17th and 18th centuries, drawn by some of the finest illustrators of the world. Also, as biodiversity records, they provide valuable documentation of when, where, and who first observed a species, and some of them are our only surviving representations of extinct species. Finally, as aesthetic elements, they communicate human emotions and other values toward nature by exemplifying the mimesis in art and providing a vivid expression of human creativity and imagination.
This year, the Missouri Botanical Garden received a grant from the National Endowment for the Humanities (NEH) to support a project called The Art of Life: Data Mining and Crowdsourcing the Identification and Description of Natural History Illustrations from the Biodiversity Heritage Library (BHL).
The authors have worked on the development of an effective metadata schema for such natural history illustrations, but instead of developing yet another schema from scratch, they have identified existing schemas that meet the needs of the project and integrated a solution that combines the best in biodiversity informatics and image curation standards and best practices. This schema needs to support three main objectives: (1) to enable the discovery, description and use of the identified images by artists, biologists, humanities scholars, and educators; (2) to make BHL’s metadata and images available to other platforms; and (3) to import crowdsourced metadata generated in other platforms back into BHL.
A preliminary schema version will be presented to the TDWG community, explaining how we addressed metadata challenges specific to biodiversity data, in order to obtain feedback on the final version.
A new flora fauna mycota should...
What should a flora/fauna/mycota of the future be able to do for me?
William Ulate
BHL Technical Director
Global BHL Coordinator
Berlin, Germany
May 21, 2013
"Dear Sir / Madam, Can I just congratulate you on an absolutely brilliant online resource. I am compiling a report on an invasive hydromedusae and could not believe the ease and efficiency of this web page which genuinely saved me weeks of my life"
"Research that previously took months now takes only a few hours"
"La plus grande #bibliotheque #botanique & #zoologique online / The largest online botanical & zoological #library #BHL"
"The freeing of knowledge may lead to new discoveries and changes in the way the natural world is perceived"
More Online Content
[Chart: Pages (Millions) and Volumes (in Thousands) included in BHL, Oct-08 to Oct-12; series: Volumes (K), Pages (M)]
Global Replication & Serving
[Diagram labels: Replicated Data Center, Portal Application]
For me a future Flora/Fauna/Mycota should…
learn from (the errors of) the past...
Scientific Name Extraction
• TaxonFinder algorithm in production since 2008
  – More than 100 million candidate name strings
  – More than 1.5 million unique, verified names
  – Available through UI, APIs, Data Exports & Internet Archive
• New collaboration with Global Names
  – Improved algorithm, better precision & recall
  – More data with TaxonFinder and Neti Neti!
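To make the difference between "candidate name strings" and "unique, verified names" concrete, here is a minimal sketch in Python. It is not the TaxonFinder or Neti Neti algorithm: the regular expression, the function names, and the small `known_names` set are all assumptions made for illustration only.

```python
import re

# Naive assumption: a candidate binomial is a capitalized word followed by
# a lowercase word of at least three letters. This over-matches on purpose.
CANDIDATE = re.compile(r"\b([A-Z][a-z]+)\s+([a-z]{3,})\b")


def find_candidate_names(text):
    """Return unverified candidate name strings found in the text."""
    return [f"{genus} {epithet}" for genus, epithet in CANDIDATE.findall(text)]


def verify(candidates, known_names):
    """Keep only candidates present in a reference list; return unique, sorted names."""
    return sorted({name for name in candidates if name in known_names})


if __name__ == "__main__":
    page_text = "The oak Quercus alba grows near Homo sapiens."
    known_names = {"Quercus alba", "Homo sapiens"}  # hypothetical reference list
    candidates = find_candidate_names(page_text)
    print(candidates)                       # includes the false positive "The oak"
    print(verify(candidates, known_names))  # only names found in the reference list
```

The naive pattern picks up ordinary phrases such as "The oak" as candidates, which is why a verification step against a reference list of names is needed before a string is counted as a verified name.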
For me a future Flora/Fauna/Mycota should…
allow me to provide and harvest marked up content with names of people & organizations, places, taxa, specimens, illustrations, coordinates, citations, tables of contents and indexes.
[Slide shows a sample of raw OCR output that is almost entirely unreadable character noise.]
Crowdsource Markup

Display text                                  Species Profile Model category
General/summary                               TaxonBiology
Geographic range                              Distribution
Habitat                                       Habitat
Food sources and feeding behavior             TrophicStrategy
Physical description (general)                Description
Physical description (detailed morphology)    DiagnosticDescription
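The display-text-to-category pairs above could be held as a simple lookup when tagging crowdsourced markup. The following is a minimal sketch, assuming a plain Python dictionary; the names `SPM_CATEGORY` and `to_spm_category` are hypothetical and are not part of any BHL API.

```python
# Mapping from the label shown to users to the Species Profile Model category,
# copied from the slide. The variable and function names are illustrative only.
SPM_CATEGORY = {
    "General/summary": "TaxonBiology",
    "Geographic range": "Distribution",
    "Habitat": "Habitat",
    "Food sources and feeding behavior": "TrophicStrategy",
    "Physical description (general)": "Description",
    "Physical description (detailed morphology)": "DiagnosticDescription",
}


def to_spm_category(display_text):
    """Return the category for a display label, or None if the label is unknown."""
    return SPM_CATEGORY.get(display_text.strip())


if __name__ == "__main__":
    print(to_spm_category("Geographic range"))
    print(to_spm_category("Something else"))
```

Returning `None` for unknown labels, rather than raising an error, leaves it to the caller to decide how to handle markup that falls outside the listed categories.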
For me a future Flora/Fauna/Mycota should…
be digital, openly and freely accessible, marked up and mark-able by users, linked and registered in a Common Framework that allows for gradual crowd-sourced incremental semantic enrichment with proper attribution
For me a future Flora/Fauna/Mycota should…
be easy to integrate with other knowledge, allow versioning and track changes, hold and show conflicting opinions, be independent of format, mobile enabled & be continually growing.