Smithsonian Libraries 2.0 and the  Biodiversity Heritage Library Project Martin R. Kalfatovic Smithsonian Institution Libr...
It's all about metrics!
Social Media / New Media What’s the R.O.I.?
Return on Investment Return on Intellect
Social Media in Use at SIL <ul><li>Social Media </li></ul><ul><li>Blog </li></ul><ul><li>Twitter </li></ul><ul><li>FaceBoo...
Existing Customers New Customers Existing Products New Products Leveraging SIL Content and Staff
New Media in Production <ul><li>Digital Imaging For: </li></ul><ul><li>Online project </li></ul><ul><li>Product Developmen...
Case Study
BHL Focus: Literature
BHL Focus: Literature
<ul><li>Over 250 years of systematic description of life </li></ul><ul><li>Systema naturae  (10 th  ed. 1758) by Carl von ...
<ul><li>Taxonomic descriptions must be published for the name to be valid </li></ul><ul><li>Publications must be available...
The Taxonomic Impediment “ The taxonomic impediment is a term that describes the gaps of knowledge in our taxonomic system...
Taxonomic Impediment <ul><li>Specimen collections </li></ul><ul><li>Databases </li></ul><ul><li>Publications </li></ul><ul...
Biologia Centrali-Americana Biologia Centrali-Americana Edited by Frederick Ducane Godman and Osbert Salvin London : Pub. ...
 
<ul><li>2003. Telluride. Encyclopedia of Life meeting </li></ul><ul><li>February 2005. London. Library and Laboratory: the...
American Museum of Natural History (New York)‏ Field Museum (Chicago)‏ Natural History Museum (London)‏ Smithsonian Instit...
<ul><li>BHL – Europe Launched in May 2009 </li></ul><ul><li>28 Institutions </li></ul><ul><li>14 countries </li></ul><ul><...
<ul><li>Smithsonian Libraries and BHL </li></ul><ul><li>Hosts the BHL Project Director (Tom Garnett) </li></ul><ul><li>Hos...
Initial grant from the MacArthur and Sloan Foundations (as part of the Encyclopedia of Life grant)‏ Additional support fro...
Costs 10 cents per page (scanning costs from Internet Archive) 13 cents per page for additional SIL provided work (adminis...
How much is there: Core literature pre-1923: 100 million pages (?) All pre-1923: 120-150 million pages All literature: 280...
… Names… Rectification of Names (Cheng Ming) What is necessary is to rectify names … If names be not correct, language is ...
 
<ul><li>Specimen </li></ul><ul><li>Plate or other visual image </li></ul><ul><li>Taxonomic description </li></ul>
<ul><li>11.1 million name strings in NameBank </li></ul><ul><li>Uses sophisticated algorithm (TaxonGrab) to locate likely ...
Build Content
What about copyright?
Permissions <ul><li>Seek permissions from copyright holders </li></ul><ul><li>Opt in Copyright Model: The BHL will activel...
BHL Advantages for publishers <ul><li>Use of the articles will increase as evidenced by citation upsurge </li></ul><ul><li...
How to make THIS into 0’s and 1’s
<ul><li>Smithsonian Institution Libraries </li></ul><ul><ul><li>Smithsonian publications </li></ul></ul><ul><ul><li>Entomo...
Single Scribe Machine Custom built by the Internet Archive Human operated 3,500 page per shift per day
<ul><li>Northeast Regional Scanning Center </li></ul><ul><ul><li>10 Scribe machines </li></ul></ul><ul><ul><li>MBL/WHOI </...
<ul><li>Washington, DC </li></ul><ul><ul><li>1 Scribe machine at Smithsonian Libraries </li></ul></ul><ul><ul><li>10 Scrib...
BHL Scanning Stats June 2009 Pages in production: 13,913,634 Items in production: 34,724 Titles in production: 13,108
Smithsonian Scanning Stats June 2009 Pages in production: 2,058,420 Items in production: 5,725 Titles in production: 3,38
Users January – May 2009 221,532 visitors 1,147,773 page views 2.11% of traffic comes from Wikipedia
The BHL Portal is not a library catalog
The BHL Portal!
 
Plant Names Specimens Plant Names Plant Names Specimens Descriptions Plant Names Plant Names Citations
 
BHL 2.0 <ul><li>BHL Blog for communication of technical notes and publicity </li></ul><ul><li>Twitter Announcements, comme...
Encyclopedia of Life … imagine for a moment that all the diversity of the world were finally revealed and then described, ...
 
H Informatics Marine Biological Laboratory Missouri Botanical Garden Species Pages &  Secretariat Smithsonian Education an...
Built from a variety of new and existing sources Views available for varying levels of expertise from novice to expert Leg...
Encyclopedia of Life
In any well-appointed Natural History Library there should be found every book and every edition of every book dealing in ...
 
 
 
 
Thanks for sticking around!
BHL Portal http://www.biodiversitylibrary.org Cite http://cite.biodiversitylibrary.org Internet Archive http://www.archive...
Credits <ul><li>Chris Freeland </li></ul><ul><li>Suzanne  Pilsk </li></ul><ul><li>Tom Garnett </li></ul><ul><li>Cathy Nort...
Upcoming SlideShare
Loading in...5
×

Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project

1,127

Published on

Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project. Martin R. Kalfatovic. Smithsonian Libraries Board Meeting. June 26, 2009. Landover, MD.

Published in: Education, Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
1,127
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
38
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide
  • BHL Focus: Literature
  • BHL Focus: Literature
  • Over 250 years of systematic description of life Systema naturae (10th ed. 1758) by Carl von Linné
  • Taxonomic Literature: Taxonomic descriptions must be published for the name to be valid Publications must be available to the public through trusted sources Libraries have been the traditional place
  • BHL Focus: Literature Core literature pre-1923: 100 million pages (?)‏ All pre-1923: 120-150 million pages All literature: 280-320 million pages
  • So, remembering that a key concept of Confucius was the Rectification of Names …
  • Suzanne But remember to have it really named it has to be PUBLISHED So this specimen is referenced in a book that says that it was found, identified, named and that it now “exists” – We won’t going into Buddhist theory of existence in this presentation.
  • Suzanne But remember to have it really named it has to be PUBLISHED So this specimen is referenced in a book that says that it was found, identified, named and that it now “exists” – We won’t going into Buddhist theory of existence in this presentation.
  • Taxonomic intelligence 10.7 million name strings in NameBank Uses sophisticated algorithm (TaxonGrab) to locate likely name strings in OCR text Iterative processing of BHL texts will both increase the number of name strings in NameBank and increase the accuracy of name string recognition
  • BHL and publishers
  • Permissions Seek permissions from copyright holders Opt in Copyright Model: The BHL will actively work with professional societies and associations to integrate their publications into the BHL in a way that serves the societies’ missions and goals BHL will digitize learned society backfiles and mount them through the BHL Portal at no cost. Will provide a set of files to the publishers for reuse as they see fit.
  • Permissions Seek permissions from copyright holders Opt in Copyright Model: The BHL will actively work with professional societies and associations to integrate their publications into the BHL in a way that serves the societies’ missions and goals BHL will digitize learned society backfiles and mount them through the BHL Portal at no cost. Will provide a set of files to the publishers for reuse as they see fit.
  • BHL advantages Use of the articles will increase as evidenced by citation upsurge Long-term management of the digital assets is provided by the BHL at no cost Publishers’ content is embedded in the emerging knowledge ecology that is sweeping biology in this century Structural markup of backfiles into conformance with NLM DTD (just starting)‏
  • Suzanne
  • Scribe Machine Single Scribe Machine Custom built by the Internet Archive Human operated 3,500 page per shift per day
  • Internet Archive/BHL scanning centers Northeast Regional Scanning Center 10 Scribe machines MBL/WHOI Harvard New York Public Library 10 Scribe machines AMNH NYBG
  • Internet Archive/BHL scanning centers Washington, DC 1 Scribe machine at Smithsonian Libraries 10 Scribe facility at Library of Congress with Fedlink (operational Spring 2008)‏
  • Scanning stats 5.5 million plus total pages scanned (and growing daily)‏ &lt;90,000 Fieldiana (via UIUC)‏ &gt;100,000 pages each Harvard, New York Botanical Garden, 225,000+ pages from the American Museum of Natural History 400,000+ from Smithsonian Libraries 500,000+ from the Natural History Museum, London 800,000 Missouri Botanical Garden Library 1,000,000+ from the MBL/WHOI library
  • Scanning stats 5.5 million plus total pages scanned (and growing daily)‏ &lt;90,000 Fieldiana (via UIUC)‏ &gt;100,000 pages each Harvard, New York Botanical Garden, 225,000+ pages from the American Museum of Natural History 400,000+ from Smithsonian Libraries 500,000+ from the Natural History Museum, London 800,000 Missouri Botanical Garden Library 1,000,000+ from the MBL/WHOI library
  • Scanning stats 5.5 million plus total pages scanned (and growing daily)‏ &lt;90,000 Fieldiana (via UIUC)‏ &gt;100,000 pages each Harvard, New York Botanical Garden, 225,000+ pages from the American Museum of Natural History 400,000+ from Smithsonian Libraries 500,000+ from the Natural History Museum, London 800,000 Missouri Botanical Garden Library 1,000,000+ from the MBL/WHOI library
  • Martin: BHL Portal
  • BHL and EOL
  • Structure of EOL Built from a variety of new and existing sources Views available for varying levels of expertise from novice to expert Legacy literature a key component of the EOL species pages
  • Suzanne Integrate Literature Launch in February
  • A Global Library for Life In any well-appointed Natural History Library there should be found every book and every edition of every book dealing in the remotest way with the subjects concerned. Charles Davies Sherborn, Epilogue to Index Animalium , March 1922
  • Species going extinct as we talk
  • Species going extinct as we talk
  • Species going extinct as we talk
  • Species going extinct as we talk
  • Demo
  • Transcript of "Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project"

    1. 1. Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project Martin R. Kalfatovic Smithsonian Institution Libraries Smithsonian Libraries :: SIL Board Meeting :: 26 June 2009
    2. 2. It's all about metrics!
    3. 3. Social Media / New Media What’s the R.O.I.?
    4. 4. Return on Investment Return on Intellect
    5. 5. Social Media in Use at SIL <ul><li>Social Media </li></ul><ul><li>Blog </li></ul><ul><li>Twitter </li></ul><ul><li>FaceBook </li></ul><ul><li>Flickr </li></ul><ul><li>Flickr Commons </li></ul><ul><li>LinkedIn </li></ul><ul><li>YouTube </li></ul><ul><li>Wiki </li></ul>
    6. 6. Existing Customers New Customers Existing Products New Products Leveraging SIL Content and Staff
    7. 7. New Media in Production <ul><li>Digital Imaging For: </li></ul><ul><li>Online project </li></ul><ul><li>Product Development & Licensing </li></ul><ul><li>Researcher needs </li></ul>
    8. 8. Case Study
    9. 9. BHL Focus: Literature
    10. 10. BHL Focus: Literature
    11. 11. <ul><li>Over 250 years of systematic description of life </li></ul><ul><li>Systema naturae (10 th ed. 1758) by Carl von Linné </li></ul>Taxonomic Literature
    12. 12. <ul><li>Taxonomic descriptions must be published for the name to be valid </li></ul><ul><li>Publications must be available to the public through trusted sources </li></ul><ul><li>Libraries have been the traditional place </li></ul>Taxonomic Literature
    13. 13. The Taxonomic Impediment “ The taxonomic impediment is a term that describes the gaps of knowledge in our taxonomic system” - Darwin Declaration, 1998
    14. 14. Taxonomic Impediment <ul><li>Specimen collections </li></ul><ul><li>Databases </li></ul><ul><li>Publications </li></ul><ul><li>Observations </li></ul><ul><li>‘ Gray’ literature </li></ul><ul><li>Index cards </li></ul><ul><li>Field notebooks </li></ul>
    15. 15. Biologia Centrali-Americana Biologia Centrali-Americana Edited by Frederick Ducane Godman and Osbert Salvin London : Pub. for the editors by R. H. Porter, 1879-1915 Chart showing distribution in public collections of the complete 63 volume sets held worldwide. 2 complete copies in Central America held at the Smithsonian Tropical Research Institute Library
    16. 17. <ul><li>2003. Telluride. Encyclopedia of Life meeting </li></ul><ul><li>February 2005. London. Library and Laboratory: the Marriage of Research, Data and Taxonomic Literature </li></ul><ul><li>May 2005. Washington. Ground work for the Biodiversity Heritage Library </li></ul><ul><li>June 2006. Washington. Organizational and Technical meeting </li></ul><ul><li>August 2006. New York Botanical Garden. BHL Director’s Meeting. </li></ul><ul><li>October 2006. St. Louis/San Francisco. Technical meetings </li></ul><ul><li>February 2007. Museum of Comparative Zoology. Organizational meeting </li></ul><ul><li>May 2007. Encyclopedia of Life and BHL Portal Launch. Washington DC. </li></ul>
    17. 18. American Museum of Natural History (New York)‏ Field Museum (Chicago)‏ Natural History Museum (London)‏ Smithsonian Institution Libraries (Washington) Missouri Botanical Garden (St. Louis)‏ New York Botanical Garden (New York)‏ Royal Botanic Garden, Kew Botany Libraries, Harvard University Ernst Mayr Library of the Museum of Comparative Zoology, Harvard University Marine Biological Laboratory / Woods Hole Oceanographic Institution Academy of Natural Sciences (Philadelphia) California Academy of Sciences (San Francisco)
    18. 19. <ul><li>BHL – Europe Launched in May 2009 </li></ul><ul><li>28 Institutions </li></ul><ul><li>14 countries </li></ul><ul><li>3.4 million funding for three years </li></ul><ul><li>Discussions underway with the Chinese Academy of Science and the Atlas of Living Australia for BHL components </li></ul>
    19. 20. <ul><li>Smithsonian Libraries and BHL </li></ul><ul><li>Hosts the BHL Project Director (Tom Garnett) </li></ul><ul><li>Hosts the BHL Collections Coordinator (Bianca Lipscomb) </li></ul><ul><li>Serves on the Institutional Council (Nancy Gwinn) </li></ul><ul><li>Serves on BHL Technical Committee (Martin Kalfatovic) </li></ul><ul><li>Provides technical workflow assistance in systems development (Keri Thompson) </li></ul><ul><li>Coordinates metadata across BHL partners (Suzanne Pilsk) </li></ul><ul><li>Provides selection advice (staff of Natural History Libraries) </li></ul>
    20. 21. Initial grant from the MacArthur and Sloan Foundations (as part of the Encyclopedia of Life grant)‏ Additional support from parent institutions Supplemental grants in place for specific development (e.g. Moore Foundation for Fedora) Additional grants being actively pursued by BHL and individual members
    21. 22. Costs 10 cents per page (scanning costs from Internet Archive) 13 cents per page for additional SIL provided work (administration, pulling materials, scanning quality review, metadata review, etc.) Average book length 304 pages Average cost per book: $70.00
    22. 23. How much is there: Core literature pre-1923: 100 million pages (?) All pre-1923: 120-150 million pages All literature: 280-320 million pages
    23. 24. … Names… Rectification of Names (Cheng Ming) What is necessary is to rectify names … If names be not correct, language is not in accordance with the truth of things. If language be not in accordance with the truth of things, affairs cannot be carried on to success. The Analects of Confucius Book 13, verse 3 (Legge translation, 1980)
    24. 26. <ul><li>Specimen </li></ul><ul><li>Plate or other visual image </li></ul><ul><li>Taxonomic description </li></ul>
    25. 27. <ul><li>11.1 million name strings in NameBank </li></ul><ul><li>Uses sophisticated algorithm (TaxonGrab) to locate likely name strings in OCR text </li></ul><ul><li>Iterative processing of BHL texts will both increase the number of name strings in NameBank and increase the accuracy of name string recognition </li></ul>Taxonomic Intelligence
    26. 28. Build Content
    27. 29. What about copyright?
    28. 30. Permissions <ul><li>Seek permissions from copyright holders </li></ul><ul><li>Opt in Copyright Model: The BHL will actively work with professional societies and associations to integrate their publications into the BHL in a way that serves the societies’ missions and goals </li></ul><ul><li>BHL will digitize learned society backfiles and mount them through the BHL Portal at no cost. </li></ul><ul><li>Will provide a set of files to the publishers for reuse as they see fit </li></ul>
    29. 31. BHL Advantages for publishers <ul><li>Use of the articles will increase as evidenced by citation upsurge </li></ul><ul><li>Long-term management of the digital assets is provided by the BHL at no cost </li></ul><ul><li>Publishers’ content is embedded in the emerging knowledge ecology that is sweeping biology in this century </li></ul><ul><li>Structural mark-up of backfiles into conformance with NLM DTD (just starting)‏ </li></ul>
    30. 32. How to make THIS into 0’s and 1’s
    31. 33. <ul><li>Smithsonian Institution Libraries </li></ul><ul><ul><li>Smithsonian publications </li></ul></ul><ul><ul><li>Entomology collection </li></ul></ul><ul><ul><li>Marine mammals </li></ul></ul><ul><ul><li>Fishes </li></ul></ul><ul><ul><li>Selected special collections materials </li></ul></ul><ul><ul><li>Filling in behind other libraries </li></ul></ul>Rough Selection
    32. 34. Single Scribe Machine Custom built by the Internet Archive Human operated 3,500 page per shift per day
    33. 35. <ul><li>Northeast Regional Scanning Center </li></ul><ul><ul><li>10 Scribe machines </li></ul></ul><ul><ul><li>MBL/WHOI </li></ul></ul><ul><ul><li>Harvard </li></ul></ul><ul><li>Jersey City Facility </li></ul><ul><ul><li>10 Scribe machines </li></ul></ul><ul><ul><li>AMNH </li></ul></ul><ul><ul><li>NYBG </li></ul></ul><ul><li>University of Illinois </li></ul><ul><ul><li>2 Scribe machines </li></ul></ul><ul><li>Natural History Museum, London </li></ul><ul><ul><li>1 Scribe machine </li></ul></ul><ul><li>Missouri Botanical Garden </li></ul><ul><ul><li>Non-Scribe operation </li></ul></ul>
    34. 36. <ul><li>Washington, DC </li></ul><ul><ul><li>1 Scribe machine at Smithsonian Libraries </li></ul></ul><ul><ul><li>10 Scribe facility at Library of Congress </li></ul></ul>
    35. 37. BHL Scanning Stats June 2009 Pages in production: 13,913,634 Items in production: 34,724 Titles in production: 13,108
    36. 38. Smithsonian Scanning Stats June 2009 Pages in production: 2,058,420 Items in production: 5,725 Titles in production: 3,38
    37. 39. Users January – May 2009 221,532 visitors 1,147,773 page views 2.11% of traffic comes from Wikipedia
    38. 40. The BHL Portal is not a library catalog
    39. 41. The BHL Portal!
    40. 43. Plant Names Specimens Plant Names Plant Names Specimens Descriptions Plant Names Plant Names Citations
    41. 45. BHL 2.0 <ul><li>BHL Blog for communication of technical notes and publicity </li></ul><ul><li>Twitter Announcements, commentary, etc. </li></ul><ul><li>Flickr Collection highlights, publicity </li></ul><ul><li>Other? SecondLife, LibraryThing, OpenLibrary </li></ul>
    42. 46. Encyclopedia of Life … imagine for a moment that all the diversity of the world were finally revealed and then described, say one page to a species. The description would contain the scientific name, a photograph or drawing, a brief diagnosis, and information of where the species if found. If published in conventional book form … this Great Encyclopedia of Life would occupy 60 meters of library shelf per million species … 100 million species of organisms … would extend through 6 kilometers of shelving … E.O. Wilson (1992)‏
    43. 48. H Informatics Marine Biological Laboratory Missouri Botanical Garden Species Pages & Secretariat Smithsonian Education and Outreach Smithsonian & Harvard Synthesis Center Field Museum
    44. 49. Built from a variety of new and existing sources Views available for varying levels of expertise from novice to expert Legacy literature a key component of the EOL species pages Encyclopedia of Life Species Pages
    45. 50. Encyclopedia of Life
    46. 51. In any well-appointed Natural History Library there should be found every book and every edition of every book dealing in the remotest way with the subjects concerned. Charles Davies Sherborn, Epilogue to Index Animalium , March 1922 A Global Library for Life
    47. 56. Thanks for sticking around!
    48. 57. BHL Portal http://www.biodiversitylibrary.org Cite http://cite.biodiversitylibrary.org Internet Archive http://www.archive.org Ubio http://www.ubio.org Links
    49. 58. Credits <ul><li>Chris Freeland </li></ul><ul><li>Suzanne Pilsk </li></ul><ul><li>Tom Garnett </li></ul><ul><li>Cathy Norton </li></ul><ul><li>David Remsen </li></ul>
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×