Botanical Literature Goes Global: The Biodiversity Heritage Library
Upcoming SlideShare
Loading in...5
×
 

Botanical Literature Goes Global: The Biodiversity Heritage Library

on

  • 975 views

The BHL is an international collaboration of natural history libraries working together to make biodiversity literature available for use by the widest possible audience through open access and ...

The BHL is an international collaboration of natural history libraries working together to make biodiversity literature available for use by the widest possible audience through open access and sustainable management.

Statistics

Views

Total Views
975
Views on SlideShare
975
Embed Views
0

Actions

Likes
0
Downloads
3
Comments
0

0 Embeds 0

No embeds

Accessibility

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • It’s an honor to be among you and I extend personal thanks to Dr. Ma and the Shanghai Chenshan Botanic Garden for inviting to contribute to this impressive symposium. My subject is one that is dear to me as it is one of the most rewarding collaborations of my long career in libraries. I would like to tell you about the history of the BHL. We are nearly 5 years old, so it won’t take too long, and then I want to show you how it works and finish with some plans for the future.
  • We have seen so many rapid advances in technologies and their applications in science that a meeting called Libraries and Laboratories was held in London in March of 2005 where the idea of a shared digital library andone outcome was for natural history libraries to meet and consider if it was possible to build an integrated digital library that would have the look and feel of MBG’s “Botanicus”May 2005: Libraries & laboratories, MHM, LondonJune 2006: BHL organizational meeting at SmithsonianOct. 2006: Technical meeting at MBG in St. Louis (use Botanicus as the platform for BHL)Feb. 2007: BHL organization meeting at HarvardMay 2007: BHL portal launched
  • In 2006, BHL was formed as a consortium of natural history museums, botanical garden libraries and research institutions in the United States and London. The Academy of Natural Science and the California Academy of Sciences joined more recently and Cornell University will probably join in the near future. With so many new international BHL’s online, we are referring to this BHL as “BHL Classic” – and we are only 4 years old.
  • As the idea for BHL was brewing, E.O. Wilson, Harvard University’s noted Pulitzer Prize-winning biodiversity scientist, encouraged the development of the Encyclopedia of Life, EOL, the project intended to produce a web site for every species.5 Since the biodiversity literature is inextricably entwined with species descriptions and reviews, the BHL emerged as a key component of the EOL project. Users looking at species pages in EOL are linked directly to the literature in BHL. BHL, as the scanning and digitization arm of the EOL, has been funded by EOL, BHL member institutions, or member institution grants through the MacArthur Foundation, Sloan Foundation, and Moore Foundation.
  • The Internet Archive (http://www.archive.org/ ) is the scanning partner for the BHL. The Internet Archive (IA) provides low cost scanning of materials and provides free storage, OCR, and derivative products such as PDFs and JPEG 2000 images.
  • The BHL was first conceived to help scientists who study taxonomy to gather the depth and breadth of literature needed for systematic inquiry. Taxonomists require access to all of the historical published species descriptions in their specialty, thus the early focus on public domain literature suits the needs of this group.
  • One of the early challenges was to describe the scope of the project. With the help of OCLC, the large cataloging service provider that we all use, we merged all of our records and attempted to use OCLC’s collection analysis tools with limited success. However, we did come up with some estimates that we are all fairly comfortable with. Since copyright is a legal complication the earlier literature is more difficult to gain access to, we agreed to focus on biodiversity literature published prior to 1923. Harvard’s attorneys determined that non-US imprints must be restricted to pre 1909 literature so we operate under that additional constraint.Tools were created to “bid” on serials titles and to document monograph selections. A workflow tool called “Wonderfetch” was developed to manage the progress of every volume from the shelf to the scanning center and then the reverse. T
  • In the fall of 2009 BHL began to routinely harvest natural history content from other IA library contributors like the California Digital Library and the University of Toronto. The early “ingests” nearly doubled the content of BHL.There are now more than 43,000 titles in the BHL, representing barely half the total available public domain biodiversity literature. The next large content loads will come from European and Chinese digitization projects and will greatly enrich the current BHL corpus. The BHL Collections Committee reviewed the results of the early ingested materials and revised the ingest criteria to improve the relevance of the materials selected. We have defined core and supporting areas to apply to the ingest and to inform our scanning priorities as the scanning dollars diminish in the next two years. The Supporting subject areas relate to or consist of the disciplines that support biodiversity scholarship like ecology, economic botany, areas of forestry, conservation, etc.
  • The architecture is open and
  • June 2008: BHL-EuropeSept. 2009: BHL-ChinaFeb. 2010: BHL-Brazil kick off meeting June 2010: BHL-Australia kick off meeting Oct. 2010: BHL Global tech meeting at Woods Hole
  • IA opened its scanning center at the CAS over the summer and they are working with the Chinese and US technical teams to iron out the transfer of content from IA to BHL Classic.Ernest Henry Wilson's photographs courtesy of the Arnold Arboretum.No. 1: Men laden with "Brick Tea" for Thibet. One man's load weighs 317 lbs.; the other's 298 lbsMen carry this tea as far as Tachien-lu accomplishing about six miles per day over vile roads. Altitude 5,000 ft. July 30, 1908No. 2: Ficus lacor Hamilton. Near Feng-tu Hsein, Yangtsze River, Western Szechuan, China. Height 40 ft. Circumference 12 ft. Head 60 ft. through. With wayside shrine and Opium Poppy. April 6, 1908.No. 3: Western Szechuan. "Pai-lu" memorial arch to the memory of a virtuous widow - a common wayside feature in the west. Near Kuing-Chou. Altitude 2,000 ft. August 8, 1908
  • BHL China links to EOL species pages and is building in georeference coordinates via Google maps.
  • BHLClassic welcome screen: Search box (general or specific)Browse search by author, title, subject, etc. lists…Limit by language or year.Search for specific language“Now online” updated frequently.BHL updates include announcements, books of the week, news, etc.
  • Why use the BHL instead of Google books? First, the focus is exclusively biodiversity literature, and second, we have tools!Note that this scan came from the University of Toronto.
  • If you click on this box it drops down so that you can look at the catalog record, or select from a variety of download options. You can take the whole volume, just the OCR, or selected pages…
  • You are asked to contribute some basic metadata that describes your selection. It may look a bit cumbersome, but there is a good reason….
  • Here is the page where you can opt for only certain page ranges or just illustrations. There is a 100-page limit because after that its more efficient to process a download of the whole volume.
  • If the metadata you supplied is adequate, the pages you downloaded could be retrievable in CiteBank. This service is under development to upload, display, and manage articles, (e.g. depository for e-published taxonomic papers if the IBC code is changed next year), meet community demands to manage, be a repository for community vetted taxonomic bibliographies, deliver more open access tools. Incorporate standard reference works and link to associated literature. IAPT has agreed to let the BHL recreate and electronic TL-3 and the Smithsonian has received Seidell funds to rescan and build the foundation for this.
  • If you click on “About BHL” in Citebank you will be linked to the public Wiki that organizes BHL news and presents information on the BHL Developer Tools, tutorials for using the BHL, and links to other social networking sites.
  • Yes, you can tweet the BHL, friend us on Facebook, check out our good times on FLICKR and watch all of our PowerPoint presentations as many time as you want on SlideShare. And you can join the BHL group in LinkIn -- in your spare time.
  • There is also a BHL blog where we feature a book of the week and stream of our fan mail.
  • The BHL has actively sought out society publishers and offered an opt-in copyright model described here. This has been very welcome by many of our smaller societies.
  • The agreements are posted on the Wike under “permissions” and we have agreements with about 40 publishers, including our members who have agreed to scan all of our publications for the BHL.
  • This is a recent snapshot of BHL usage. Nearly 170,000 visits or more than 56,500 visits per monthFrom all over the world.Since Jan. 2008 there have been about 1.5 million from 231 countries & territories.There are 192-196 countries in the world. There are 61 territories and 6 disputed territories.http://www.infoplease.com/ipa/A0762461.h…
  • California Academy of Science Academy of Natural SciencesHarvard University’s Herbaria and Botany Libraries Ernst Mayr Library, Museum of Comparative ZoologyMissouri Botanical GardenNew York Botanical GardenSmithsonian Institutions U.S. National Herbarium
  • The first received by the Smithsonian to catalog SI’s field notes and related materials. In addition to exposing their own collections, Rusty Russell and Anne Van Camp proposed to create a cataloging tool kit for other collections to use and the offer to serve as a central repository for the material.A companion grant was prepared by several BHL partners including the California Academy of Sciences, the Academy of Natural Sciences, Harvard University’s Herbaria and Botany Libraries and its Museum of Comparative Zoology, the Missouri Botanical Garden, and the New York Botanical Garden, to demonstrate the utility of the tool kit and repository. We learned that the grant was funded just a couple of weeks ago.
  • Here are some of my hard working colleagues basking in the glow of receiving this award at ALA’s June meeting in DC

Botanical Literature Goes Global: The Biodiversity Heritage Library Botanical Literature Goes Global: The Biodiversity Heritage Library Presentation Transcript

  • Botanical Literature Goes Global:
    Getting the Most out of the
    Biodiversity Heritage Library.
  • What is the BHL?
    The Biodiversity Heritage Library is an international collaboration of natural history libraries working together to make biodiversity literature available for use by the widest possible audience through open access and sustainable management.
    “The cultivation of natural science cannot be efficiently
    carried on without reference to an extensive library.”
    C. Darwin et al 1847
    Darwin, C. R. et al. 1847. Memorial to the First Lord of the Treasury [Lord John Russell... Accounts and Papers 1847, paper no. 268, vol. xxxiv, 253 (13 April): 1-3.
  • BHL Members US/UK
    Natural History Museums
    • Academy of Natural Science
    • American Museum of Natural History
    • California Academy of Sciences
    • Field Museum
    • Natural History Museum, London
    • Smithsonian Institution
    Botanical Gardens
    • Missouri Botanical Garden
    • New York Botanical Garden
    • Royal Botanic Gardens, Kew
    Academic Libraries
    • Botany Libraries, Harvard University
    • Ernst Mayr Library of the
    Museum of Comparative Zoology, Harvard University
    Research Institute
    • Marine Biological Laboratory/
    Woods Hole Oceanographic Institution Library
  • The Encyclopedia of Life
    The EOL is an international effort to create an authoritative website for every species of the earth’s biota. The goal is to create a page for each species.
    Project components include:
    • Education and Outreach
    • Informatics
    • Scanning & Digitization Literature
    • Species Pages
    http://www.eol.org/
  • The Internet Archive
    IA founded of the Open Content Alliance and is dedicated
    to “universal access to human knowledge.”
    IA provides BHL with:
    • Low cost mass scanning
    • Archival storage of files
    • Image processing
    • Technology development
    http://www.archive.org/
  • Taxonomic Literature
    • Encompasses more than 250 years of systematic
    description of life
    • The cited half-life of publications in taxonomy is longer
    than in any other scientific discipline
    • The decay rate is longer than all other scientific disciplines
    • Total literature represented by 1.3 million catalogue records
  • How big is the Biodiversity domain?
    • 800,000 monographs
    • 40,000 journal titles (12,500 current)‏
    • About 40% published pre-1923
    • 73% are monographs, others are serials
    • 63% are in English ; German is next (9%)
  • BHL Scanning Priorities
    Anatomy
    Amphibia
    Algae
    Angiosperms
    Arthropoda
    Arachnida
    Atlases and gazetteers
    Biodiversity conservation
    Core Materials: subjects that relate directly to or are closely associated with root disciplines of biodiversity scholarship
    Botany
    Bryology
    Biological diversity
    Classification and nomenclature
    Cyanobacteria
    Extinction
    Evolution
    Endangered species
    Entomology
    Ferns and allies
    Fungi
    Gymnosperms
    Geographical distribution
    Ichthyology
    History of natural sciences
    Linnaean works
    Invertebrates
    Mollusca
    Medical botany
    Morphology
    Mammalia
    Marine biology
    Natural history biographies
    Natural history dictionaries & encyclopedias
    Paleozoology
    Paleobotany
    Ornithology
    Phylogenetic relationships
    Plant anatomy
    Porifera
    Primatology
    Pre-Linnaean works
    Reproduction
    Reptilia
    Protozoa
    Scientific illustration
    Specimen catalogs
    Taxonomy
    Systematics
    Zoology
    Scientific expeditions
  • BHL Europe, London
    BHL, St. Louis
    BHL Content Distribution
    1
    BHL, Woods Hole
    2
    2
    • Code available (open sourced, BSD licensed):  [1] http://code.google.com/p/bhl-bits/source/browse/trunk/utilities/grabby/grabbyd[2] http://code.google.com/p/bhl-bits/source/browse/trunk/utilities/bhl-sync.sh 
  • The Global BHL
  • The Global BHL
    BHL-Europe
    http://www.bhl-europe.eu/
    BHL-Australia http://www.ala.org.au/
    BHL-Brazil http://www.scielo.br/
  • BHL-China
    Chinese Academy of Sciences
    • Institute of Botany
    • Institute of Zoology
    • Institute of Microbiology
    • Institute of Oceanography
    Photographs by Ernest H. Wilson,1908. Courtesy of the Arnold Arboretum Archives.
  • Taxonomic Name Server Services
    Developed by MBL/WHOI
    Taxonomic intelligence is the inclusion of taxonomic practices, skills and knowledge within informatics services to manage information about organisms. It uses a sophisticated algorithm to locate likely name strings in OCR text and has “discovered” 10.7 million name strings in NameBank and serves as a name thesaurus.
  • BHL & Copyright Holders
    BHL supports an “opt-in” copyright model that will …
    • integrate professional societies’ publications into the portal
    in keeping with the goals of the organization
    • scan and deliver publications at no cost to the societies
    • provide files to the publishers for their use
  • Related Projects
    Retooling Special Collections Digitization in the Age of Mass Scanning
    The IMLS Planning Grant (2008-2009) allowed BHL partners to identify and develop a cost-effective and efficient large-scale digitization workflow and to explore ways to enhance metadata for library materials that are designated as “special collections.” The group held a series of meetings, communicated by email, and established a wiki to record meetings, track progress, and share documents about costs, statistics and workflows, and small-scale scanning tests. The report included extensive cost analyses and recommendations for equipment configurations to scan rare and oversized materials.
    Smithsonian Institution Atherton Seidell Grant
    Taxonomic Literature, Online Edition: TL-3
    The Seidell grant , with the endorsement of IAPT, will allow SI to rescan all volumes of TL-2 and the supplements and deliver content via BHL. BHL envisions a dynamically linked TL-3 that will connect citations to published references and allow for corrections and the addition of new content.
  • Related Projects
    Cataloging Hidden Special Collections & Archives Grant
    Exposing Biodiversity Field Books and Original Expedition Journals at the Smithsonian Institution -- The Smithsonian Institution National Herbarium & Archives
    The Smithsonian will catalog all of its field books, unpublished journals, loose notes, sketches documenting field research related to all disciplines of biology. It will also will build a cataloging tool to and create a central repository so that other institutions can contribute their holdings. The enhanced level of description will improve access to these important research materials that are frequently difficult to discover and access remotely.
    IMLS Grant for Advancing Digital Resources
    Connecting Content: A Collaboration to Link Field Notes to Specimens and Published Literature -- BHL Partner Libraries & Herbaria
    The grant will develop a system for integrating biological researchers’ field and specimen notes with museum specimens and related electronically published literature. The enhanced and integrated access to biological data will serve a wide variety of users, and will connect to other ongoing projects such as the Biodiversity Heritage Library, a consortium that joined forces to deliver important, page-level digital content representing the core of published literature on natural history.
  • BHL Successes
    • Administratively separate and geographically dispersed institutions can collaborate effectively
    • Taxonomic intelligence (species name finding) is highly effective across millions of pages against nearly 11 million names in NameBank
    • The project has generated excitement in the international community and many opportunities to develop new partnerships and sources of funding
    • Society journal publishers are enthusiastic about participation in the BHL opt-in copyright model
    • Partners have proven ability to generate significant financial support
    • High levels of OCR accuracy in late 19th and 20th century printing
  • American Library Association Award
    The Association for Library Collections & Technical Services (ALCTS) awarded their Outstanding Collaboration Citation to the BHL on June 27, 2010 in recognition of their outstanding collaborative partnership.
  • BHL Challenges
    • Standards: “The great thing about standards is there are so
    many to choose from.”
    • Delivering and preserving content through digitization
    & retrospective ingestion
    • Establishing international governance
    • Avoiding duplication
    • Delivering new services
    • Sustainability, Financial &Digital
  • 谢谢
    Thank you!
    Celebrating the Asa Gray ‘s
    Bicentennial
    1810-2010
    Judith Warnement
    Botany Libraries
    Harvard University Herbaria
    22 Divinity Avenue
    Cambridge, MA 02138 USA
    warnemen@oeb.harvard.edu
    http://www.huh.harvard.edu/libraries/