Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
The Biodiversity
  Heritage Library:
 Workflow Overview
      Martin R. Kalfatovic &
        Suzanne C. Pilsk
Smithsonian ...
How to make THIS …
into 0’s and 1’s
How to make THIS …
into 0’s and 1’s
If you digitize it …

Will they find it?

Search Gone BAD!
Metadata – failure to serve
-
  Specimen
-
  Plate or other visual image
-
  Taxonomic description
-
  Specimen
-
  Plate or other visual image
-
  Taxonomic description
Initial Metadata
 Analysis
We have 1.3 million catalogue
records
73% are monographs
(remainder are serials at title-
level...
Initial Metadata
 Analysis
Who has what?

What should we scan and
when?

Monographs vs Serials

Series treated as separate...
Selection Tools
Combined Serial list for
 selection of title to scan to
 avoid duplication of effort
Monographic “de-dupin...
Human Selection
Marine Biological
Laboratory/WHOI
> Marine monographs
> General Science
Museum of Comparative
Zoology
> MC...
Human Selection
University of Illinois
> Fieldiana
> Natural history of Illinois
American Museum of Natural
History
> AMNH...
Human Selection
Botany Collections
  Missouri Botanical Garden,
  New York Botanical Garden,
  Harvard Botany Libraries, a...
Human Selection
Smithsonian Libraries
> Smithsonian publications
> Entomology collection
> Marine mammals
> Fishes
> Selec...
Collections Coodinator
Collections Coordinator
on board in February
2009.
Bianca Lipscomb, based
at the Smithsonian, will
...
Mass Scanning
Workflow
Single Scribe Machine

Custom built by the
Internet Archive
Human operated
3,500 page per shift per...
Mass Scanning
Workflow
 Serial management
 Bid Lists

 Monograph Management
 Dedupper

 Pick Lists

 Packing Lists
Mass Scanning
Workflow
   Local data flow

   Vendor data flow
     WonderFetch tm

   Return of data

   Return of materi...
Mass Scanning
    Workflow
Flow of the Process


     Select Book ~Pull from Shelf

     Review Physically and
     Meta...
Mass Scanning
Workflow
Mass Scanning
    Workflow
Flow of the Process


     Book is scanned & QA

     Page images loaded to IA

     Derivat...
Mass Scanning
    Workflow
Flow of the Process


     Metadata files harvested from IA portal
     to BHL

     Taxonomi...
2007:
Cataloged, barcoded, inventoried
and created summary holdings for
1,738 serial titles and created
60,830 item record...
Staffing:                 Other things:

    Administration        
                              Travel

    Metadata ...
Items                     Pages

    “Cardboard to         
                              Approximated just
    Cardboar...
Picture Credits
Johann Christian Daniel von Schreber
  Die Saugthiere in Abbildungen
  nach der Natur mit Beschreibungen
 ...
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
The Biodiversity Heritage Library: Workflow Overview
Upcoming SlideShare
Loading in …5
×

The Biodiversity Heritage Library: Workflow Overview

1,023 views

Published on

The Biodiversity Heritage Library: Workflow Overview. Martin R. Kalfatovic and Suzanne C. Pilsk. BHL Australian Node Meeting: Melbourne Museum. 2 June 2010. Melbourne, Australia.

Published in: Technology, Education
  • Be the first to comment

  • Be the first to like this

The Biodiversity Heritage Library: Workflow Overview

  1. 1. The Biodiversity Heritage Library: Workflow Overview Martin R. Kalfatovic & Suzanne C. Pilsk Smithsonian Institution Libraries & Biodiversity Heritage Library BHL Australian Node Meeting ~ Museum Victoria ~ 2 June 2010
  2. 2. How to make THIS … into 0’s and 1’s
  3. 3. How to make THIS … into 0’s and 1’s
  4. 4. If you digitize it … Will they find it? Search Gone BAD!
  5. 5. Metadata – failure to serve
  6. 6. - Specimen - Plate or other visual image - Taxonomic description
  7. 7. - Specimen - Plate or other visual image - Taxonomic description
  8. 8. Initial Metadata Analysis We have 1.3 million catalogue records 73% are monographs (remainder are serials at title- level) 63% is English language material. The next most popular language (9%) is German. About 30% of material was published before 1923.
  9. 9. Initial Metadata Analysis Who has what? What should we scan and when? Monographs vs Serials Series treated as separates Can it be found and used once scanned?
  10. 10. Selection Tools Combined Serial list for selection of title to scan to avoid duplication of effort Monographic “de-duping” algorithm OCLC Collection Analysis
  11. 11. Human Selection Marine Biological Laboratory/WHOI > Marine monographs > General Science Museum of Comparative Zoology > MCZ publications > Herpetology monographs and serials > Ichthyology monographs and serials
  12. 12. Human Selection University of Illinois > Fieldiana > Natural history of Illinois American Museum of Natural History > AMNH publications > Ornithology Natural History Museum > NHM publications > Major natural history general serials
  13. 13. Human Selection Botany Collections Missouri Botanical Garden, New York Botanical Garden, Harvard Botany Libraries, and Royal Botanic Garden, Kew will cooperatively develop a methodology for botanical publications
  14. 14. Human Selection Smithsonian Libraries > Smithsonian publications > Entomology collection > Marine mammals > Fishes > Selected special collections materials
  15. 15. Collections Coodinator Collections Coordinator on board in February 2009. Bianca Lipscomb, based at the Smithsonian, will coordinate material selection across the BHL and contributing partners
  16. 16. Mass Scanning Workflow Single Scribe Machine Custom built by the Internet Archive Human operated 3,500 page per shift per day
  17. 17. Mass Scanning Workflow Serial management Bid Lists Monograph Management Dedupper Pick Lists Packing Lists
  18. 18. Mass Scanning Workflow Local data flow Vendor data flow WonderFetch tm Return of data Return of material Billing
  19. 19. Mass Scanning Workflow Flow of the Process  Select Book ~Pull from Shelf  Review Physically and Metadata  Establish viability and create Wonderfetch tm  Send to IA scanning center
  20. 20. Mass Scanning Workflow
  21. 21. Mass Scanning Workflow Flow of the Process  Book is scanned & QA  Page images loaded to IA  Derivatives created  Book returned  QA on returned book against images  Book returned to library
  22. 22. Mass Scanning Workflow Flow of the Process  Metadata files harvested from IA portal to BHL  Taxonomic Intelligence Added  Available through BHL
  23. 23. 2007: Cataloged, barcoded, inventoried and created summary holdings for 1,738 serial titles and created 60,830 item records in SIRIS for BHL 2008: Cataloged, barcoded, inventoried, and created summary holdings for 1,311 serial/journal titles and created 46,140 item records in SIRIS for the Biodiversity Heritage Library (BHL).
  24. 24. Staffing: Other things:  Administration  Travel  Metadata  Equipment  Collections support  Transportation  Database/Systems  Conservator  Technicians for pulling  Technicians for Quality Review
  25. 25. Items Pages  “Cardboard to  Approximated just Cardboard” over 300 pages in an  A barcoded “book” “item”  Estimated just over  Estimated just under 6,000 in a year 1,900,000 in a year  Cost: $70.26  Cost per page: 0.23
  26. 26. Picture Credits Johann Christian Daniel von Schreber Die Saugthiere in Abbildungen nach der Natur mit Beschreibungen (1826-) Richard Lydekker A hand-book to the marsupialia and monotremata (1896)

×