SlideShare a Scribd company logo
Smithsonian Institution Libraries
   “Metadata Mixing & Matching For
             Discovery”
                    LSC 888
    The Special Library/ Information Center



     Suzanne C. Pilsk ~ Smithsonian Institution Libraries ~ 2010
Facts and Figures
                  Smithsonian Institution Libraries


Washington, D.C.
   • Anacostia Museum & Center for African American History and Culture
     Library
   • Anthropology Library
   • Botany and Horticulture Library
   • The Dibner Library of the History of Science and Technology
   • Freer Gallery of Art and Arthur M. Sackler Gallery Library
   • Hirshhorn Museum and Sculpture Garden Library
   • Joseph F. Cullman 3rd Library of Natural History
Facts and Figures
Smithsonian Institution Libraries

    Washington, D.C. (continued)
       •   Museum Studies & Reference Library
       •   National Air and Space Museum Library
       •   National Museum of American History Library
       •   National Museum of Natural History Library
       •   National Postal Museum Library
       •   National Zoological Park Library
       •   Smithsonian American Art Museum/National Portrait Gallery Library
       •   Warren M. Robbins Library, National Museum of African Art
Facts and Figures
Smithsonian Institution Libraries
  Elsewhere
     Suitland, Md.
          • Museum Support Center Library
          • National Museum of the American Indian Library
     Edgewater, Md.
          • Smithsonian Environmental Research Center Library
     New York City
          • Cooper-Hewitt, National Design Museum Library
     Republic of Panama
          • Smithsonian Tropical Research Institute Library
Facts and Figures
            Smithsonian Institution Libraries
African Art                         Latino History and Culture
African American History and        Materials Research
    Culture                         Modern and Contemporary Art
Anthropology                        Museology
American Art                        Native American History and Culture
American History                    Natural History
Asian and Middle Eastern Art        Postal History
Aviation history and Space Flight   Tropical Biology
Design and Decorative Arts          Trade Literature
Environmental Management and        World’s Fair Ephemera
    Ecology
History of Science and Technology
What’s So Special?
                       Public Museum
Smithsonian Institution is the largest museum complex in the
                            world …
                   “The Nation’s Attic”
“Increase and Diffusion of Knowledge”


              Unlock the Mysteries of the Universe

              Understanding and Sustaining
              a Biodiverse Planet

              Valuing World Cultures

              Understanding the American Experience
SIL Mission
              (Smithsonian Directive 500)

As the largest and most diverse museum library
in the world, SIL leads the Smithsonian in taking
advantage of the opportunities of the digital
society. SIL provides authoritative information
and creates innovative services and programs for
Smithsonian Institution researchers, scholars and
curators, as well as the general public, to further
their quest for knowledge. Through paper
preservation and digital technologies, SIL ensures
broad and enduring access to the Libraries’
collections for all users.
SIL’s Strategic Plan “Focus on Service”
• GOAL 1: COLLABORATING ACROSS BOUNDARIES
   – SIL creates a compelling environment for connecting, collaborating and
     exploring across disciplines and information boundaries
• GOAL 2: DISCOVERING INFORMATION
   – SIL enhances and eases the discovery of information in our collections
     for SI scholars, researchers, scientists, and the larger world of learners
• GOAL 3: CONNECTING WITH USERS
   – SIL understands and meets user needs, serving users where they live
     and work
• GOAL 4: BUILDING EXPERTISE
   – SIL builds expertise on information discovery, navigation and
     management
• GOAL 5: ENABLING OUR MISSION
   – SIL ensures its success through increased financial strength, effective
     administrative support, and organizational excellence
Facts and Figures
          Smithsonian Institution Libraries
Total volumes
    > 1.7 million
    50,000 are rare books
    10,000 manuscripts
Trade Catalogs
     > 500, 000 items
     > 30,000 companies
    dating from the 1800s
Facts and Figures


          • 102 Smithsonian Libraries
            Staff

          • 17 Souls in Cataloging
            Services (with contractors)
• Traditional Library



• Traditional Services
Integrated Library System

Smithsonian Institution
  Research Information
  System (SIRIS)
– MARC
– AACR2r
– ISBD
– LC Classification
– LC Subject Headings
Traditional Cataloging

              •   Monographs
              •   Serials
              •   Videos
              •   Microfilm/fiche
              •   Sound Recordings
              •   CD/DVDs
              •   Electronic Resources
Traditional Cataloging

• OCLC

• Program for Cooperative
  Cataloging
   – NACO
   – SACO
   – BIBCO
SI Libraries Serves
•   Curators
•   Researchers
•   Post-Docs
•   Museum Administrators
•   Public
IFLA’s Functional Requirements for Bibliographic Data

                               To Find

                               To Identify

                               To Select

                               To Obtain

                               To USE
Determining Level of Metadata
•   What do you have?
•   What staff do you have?
•   Who are your users?
•   Where will it go?
•   Will it stay there or travel on and on and on
    and on and on and on and on and on
Metadata
Metadata – failure to serve
Metadata: MARC

         MARC

110 Oscar Mayer & Co.
650 Frankfurters
Metadata

Dublin Core

Creator:
  Oscar Mayer & Co.

Subject:
  Frankfurters
Metadata: Real MARC – Still failure to serve
02761nam 2200469
   4500001000700000005001700007008004100024010002300065019001300088035001400
   1010350023001150400061001380490027001990500015002261000042002412450193002
   8326000830047630000170055950403350057650501540091159001090106559000960117
   4650002601270945002101296945007301317945003101390945004801421945004801469
   9450047015179450079015649450044016439450046016879450048017339450076017819
   4500440185794500510190194500510195294500710200394500900207494500960216494
   5003102260-459797-20050131154400.0-731129m19021933enk b 000 0 lat
   c- -aagr03000069 //r582- -a14018362- -aABY6485LB- -a(OCoLC)ocm00751549- -aU.S.
   Dept. of Agr.
   Libr.-cRIU-dOCL-dCHS-dSER-dSMI-dWaOLN- -aSMI$-aSMIM-aSMIE-aSMIB-00-aQL354-b.S5-
   1 -aOscar Mayer & Co.-10-aPronto pup:-bhot dogs hamburgers/-ca Oscar Mayer and
   Company.- -aNew Orleans, La. :-bBourbon Street Foods,-c2000.
Metadata: MARCXML

<?xml version="1.0" encoding="UTF-8" ?>
<collection xmlns="http://www.loc.gov/MARC21/slim"
   xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
   xsi:schemaLocation="http://www.loc.gov/MARC21/slim
   http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd">
<record><leader>02761nam a2200469 4500</leader>
<controlfield tag="001">459797</controlfield>
<controlfield tag="005">20050131154400.0</controlfield>
<controlfield tag="008">731129m19021933enk b 000 0 lat
   c</controlfield>
<datafield tag="010" ind1=" " ind2=" ">
<subfield code="a">agr03000069 //r582</subfield>
</datafield>
How to make THIS into 0’s and 1’s
Virtual Library defined in the
                      Online Dictionary for
                Library and Information Science
A "library without walls" in which the collections do
not exist … [in] tangible form at a physical location but
are electronically accessible in digital format via
computer networks.
The term digital library is more appropriate because
virtual (borrowed from "virtual reality") suggests that
the experience of using such a library is not the same
as the "real" thing when in fact the experience of
reading or viewing a document on a computer screen
may be qualitatively different from reading the same
publication in print, but the information content is the
same regardless of format.
~ http://lu.com/odlis/odlis_v.cfm
Digital Library defined in the
                          Online Dictionary for
                    Library and Information Science

A library in which a significant proportion of the resources are
available in machine-readable format … . The digital content
may be locally held or accessed remotely via computer
networks. … In libraries, the process of digitization began with
the catalog, moved to periodical indexes and abstracting
services, then to periodicals and large reference works, and
finally to book publishing.
~ http://lu.com/odlis/odlis_v.cfm
Traditional Digital Library

• Electronic Journals &
  Databases
• Digital Editions
• Online Exhibitions
• Online Catalog
• Digital Reference
If you digitize it …



                       Will they find it?
Search Gone BAD!
- Specimen
- Plate or other visual image
- Taxonomic description
Beyond the Traditional

      Taxonomic Literature Needs/Requests



• Beyond the Scan
• Beyond the Re-Keyed
• Marking up the data in metadata schemas
MARC
             Milk, eggs, lactaid

Make dentist appt.
                     LCSH/LCCS
                               Feed the cat
        ISBD
                         AACR
                          Pick up dry cleaning
Access   relatedItem
                   MARC       Dublin Core
       XMP
                      Milk, eggs, lactaid
 METs         ISBD
      Faceted       RDA
                                LCSH/LCCS
Add hotdogs to grocery list
                              Feed the cat
               XML
  MODS                      Dewey AACR
            FRBR
                        Pick up dry cleaning
                 Hierarchical       TEI
                             ONIX
Discoverable
             Milk, eggs, lactaid

Make dentist appt.
                     Interoperability
                               Feed the cat
   Open Access
                             Collaboration
                          Pick up dry cleaning
Biodiversity Heritage Library (BHL)
EOL                               Bibliographic
 Curator         species                           Data from
 RequestEvaluate need                              SIRIS             Carts delivered to scanner
        title
Goin’ down is…
        Need
                                                                     Put on shipping cart,
                   “gap-fill”                  Picklist Database
the rows                                                             generate‘packinglist’ invoice
                   for other                   Stores Select /
                   BHL library                 reject / ship
                                                                     Update picklist if item record
                                               state & supplies
                                                                     has been changed
                                               item metadata         During cataloging touch-up
                                               to IA                 Circ to scanner

                        Select title
  serial?     no        in picklist,
                                                                                  Circ to cataloging
                        upload to
                        monograph de-duper                                        for MARC editing
  yes
                                       no      The Stacks          Reject in picklist,
                               Duplicate?                                                      fail
   Other         yes                                               Circ in Horizon
                                                                   Return to stacks
   library
   “bid” ?                                                                                 Meta-
             Reject in picklist,                                                           data
  no         return to stacks                                                              check       pass
   “Bid”                             Pull from stacks
                                                                                           Preser-
 on title,                           Circ in ILS                                           vation
 select in                           Preliminary metadata check                             review     pass
  picklist                           And physical check
                                                                                    fail
IA scanning process
                                                                      BHL Portal
                                   Unique IA id is assigned
                                   Metadata is gathered from          Periodically harvests
                                   SIRIS and the picklist db          Marc.xml (bib) and item
                                   And associated with the scan       Records, along with
                                   JP2000s generated                  JP2000 from
Carts delivered                    & transformed                      Archive.org
to scanner                         Served on archive.org
                                                                      To index and display
                                   QA is done by IA on 10%
                                                                      In the portal
Put on shipping cart,
generate ‘packinglist’         Books are returned,
Invoice, alert                 cart contents are
scanning center                verified against invoice

                               SIL does 20% QA                    Download .csv from
Update picklist                Checking for metadata matching
to indicate                                                       portal with SIL
                               With item, scan quality etc
rescan                                                            barcodes, Portal
                                                                  URLs
                         no               Pass QA?
                                                 yes
                              Updated in picklist as scanned
                              Circ in Horizon                      Send URLs to SIRIS
                              Place BHL sticker near barcode       Office for batch
                              Return to Stacks                     updates
BHL
Mass Scanning Workflow
  •Bid Lists
  •Serials Management
  •Pick Lists
  •Packing Lists
  •Monographic Management
  •Local data flow
  •WonderFetch          tm


  •Return of data
  •Return of material
  •Billing
                                                       Ernest Ingersoll
       Hand-book to the National Museum … Smithsonian Institution, 1886
BHL
1.  Select Book ~Pull from Shelf
2.  Review Physically and Metadata
3.  Establish viability and create
    Wonderfetchtm
4. Send to IA scanning center
5. Book is scanned & QA
6. Page images loaded
7. Derivatives created
8. Book returned to library
9. Files harvested from IA portal to
    BHL
10. Taxonomic Intelligence Added
11. Available through BHL
Monographic DeDuper
The BHL Portal is not a library catalog
Collections.SI.edu ~ SI Libraries

  842,000 Records in ILS
  27,805 Trade literature
74,613 Art and Artists files
4,000 SI Digital Repository
   (SI Research Online)
Not in
Collections.Si.Edu
Collections.SI.edu ~ Freer + Sackler
                             53% of the ENTIRE
                        collection at www.asia.si.edu
                         & collections.si.edu

                             12,269 objects online


 NOT: F/S G’s Study Collection – 10,872 objects only for
       study not for exhibit – will never go online
Collections.SI.edu ~ NPM
  12,000 Records
 Collections.si.edu

16,000 Records in the
       ARAGO

   214,000 Records
   in the database

6 Million objects
            = 0.2% in Collections.si.edu
Collections.SI.edu ~ NMNH
NMNH estimates 126 Million Specimens
Collections.SI.edu ~ NMNH
  NMNH estimates 126 Million Specimens

    5,400,000 Catalog Records in collection
            management system –
5,218,793 available on collections.nmnh.si.edu
        (181,207 records not available)
Collections.SI.edu ~ NMNH
          Coming soon:
 IZ 992,000 (68,000 with media)

 Bot 788,000 (1,300 with media)
Collections.SI.edu ~ NMNH
 NMNH estimates 126 Million Specimens
  5,400,000 Catalog Records in collection
management system – 5,218,793 available on
         collections.nmnh.si.edu
      (181,207 records not available)

       6 out of 10 units supplying data to
     collections.si.edu = 2,527,557 records
             (153,418 have images)
Collections.SI.edu

            4,600,000 Records
             445,000 Images
             40 Data sources

                 50%
    of the records are from 1 source
(NMNH and still growing 2,527,557 records
          with 153,418 images)
SI Wide Estimations

 • 136.9 MILLION objects

• 13 MILLION digital records

 • 821,000 digital images
“The worth and importance of
the Institution is not to be
estimated by what it
accumulates within the walls of
its building, but by what it sends
forth to the world.”

                 —Joseph Henry
     The Smithsonian Institution’s First Secretary
                       1852
Credits
    Thanks to staff at
      NMAI       SIL
NMNH     MBL/WHOI Library
    NPM        MoBot
  Freer/Sackler NYBG
           BHL
Smithsonian Institution Libraries
  “Metadata Mixing & Matching For
            Discovery”



              Suzanne C. Pilsk
       Smithsonian Institution Libraries
               PilskS@si.edu

More Related Content

What's hot

CUA 2008
CUA 2008CUA 2008
CUA 2008SCPilsk
 
Unlocking indexanimaliumstatic
Unlocking indexanimaliumstaticUnlocking indexanimaliumstatic
Unlocking indexanimaliumstaticSCPilsk
 
Building a Global Library of Taxonomic Literature
Building a Global Library of Taxonomic LiteratureBuilding a Global Library of Taxonomic Literature
Building a Global Library of Taxonomic Literature
Martin Kalfatovic
 
Suzanne Pilsk Presentation to SIL Board 2012
Suzanne Pilsk Presentation to SIL Board 2012Suzanne Pilsk Presentation to SIL Board 2012
Suzanne Pilsk Presentation to SIL Board 2012Smithsonian Libraries
 
Sherborn: Pilsk, Joel Richard & Kalfatovic - Unlocking the Index Animalium: F...
Sherborn: Pilsk, Joel Richard & Kalfatovic - Unlocking the Index Animalium: F...Sherborn: Pilsk, Joel Richard & Kalfatovic - Unlocking the Index Animalium: F...
Sherborn: Pilsk, Joel Richard & Kalfatovic - Unlocking the Index Animalium: F...
ICZN
 
Presentation ala2010
Presentation ala2010Presentation ala2010
Presentation ala2010SCPilsk
 
2009 05 20 Cimc Pilsk
2009 05 20 Cimc Pilsk2009 05 20 Cimc Pilsk
2009 05 20 Cimc PilskSCPilsk
 
Botany and the BHL: A Botanical Overview of the Biodiversity Heritage Library
Botany and the BHL: A Botanical Overview of the Biodiversity Heritage LibraryBotany and the BHL: A Botanical Overview of the Biodiversity Heritage Library
Botany and the BHL: A Botanical Overview of the Biodiversity Heritage Library
Martin Kalfatovic
 
PACSCL's "Hidden Collections" Processing Project
PACSCL's "Hidden Collections" Processing ProjectPACSCL's "Hidden Collections" Processing Project
PACSCL's "Hidden Collections" Processing Project
Holly Mengel
 
Usaf navy marine corps librarians 06 25-10
Usaf navy marine corps librarians 06 25-10Usaf navy marine corps librarians 06 25-10
Usaf navy marine corps librarians 06 25-10marciaadams
 
Tassonomia E Folksonomia
Tassonomia E FolksonomiaTassonomia E Folksonomia
Tassonomia E Folksonomia
funzionepubblica
 
Taxonomies and Folksonomies
Taxonomies and FolksonomiesTaxonomies and Folksonomies
Taxonomies and Folksonomies
K.G. Schneider
 
The Biodiversity Heritage Library: 30 Million Pages of Taxonomic Literature &...
The Biodiversity Heritage Library: 30 Million Pages of Taxonomic Literature &...The Biodiversity Heritage Library: 30 Million Pages of Taxonomic Literature &...
The Biodiversity Heritage Library: 30 Million Pages of Taxonomic Literature &...
Becky Morin
 
Best Genealogy Websites of 2013
Best Genealogy Websites of 2013Best Genealogy Websites of 2013
Best Genealogy Websites of 2013
May Chan
 
The Biodiversity Heritage Library
The Biodiversity Heritage LibraryThe Biodiversity Heritage Library
The Biodiversity Heritage Library
Martin Kalfatovic
 
ANTH140 Introduction to Prehistory
ANTH140 Introduction to PrehistoryANTH140 Introduction to Prehistory
ANTH140 Introduction to Prehistory
Cathy Cranston
 
ANTH140 - Introduction to Prehistory
ANTH140 - Introduction to PrehistoryANTH140 - Introduction to Prehistory
ANTH140 - Introduction to Prehistory
Cathy Cranston
 
The Biodiversity Heritage Library: Growing from Botanical Origins
The Biodiversity Heritage Library: Growing from Botanical OriginsThe Biodiversity Heritage Library: Growing from Botanical Origins
The Biodiversity Heritage Library: Growing from Botanical Origins
Martin Kalfatovic
 
OCLC Research: an overview and update for the National Library of Wales
OCLC Research: an overview and update for the National Library of WalesOCLC Research: an overview and update for the National Library of Wales
OCLC Research: an overview and update for the National Library of WalesJohn MacColl
 
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library ProjectSmithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
Martin Kalfatovic
 

What's hot (20)

CUA 2008
CUA 2008CUA 2008
CUA 2008
 
Unlocking indexanimaliumstatic
Unlocking indexanimaliumstaticUnlocking indexanimaliumstatic
Unlocking indexanimaliumstatic
 
Building a Global Library of Taxonomic Literature
Building a Global Library of Taxonomic LiteratureBuilding a Global Library of Taxonomic Literature
Building a Global Library of Taxonomic Literature
 
Suzanne Pilsk Presentation to SIL Board 2012
Suzanne Pilsk Presentation to SIL Board 2012Suzanne Pilsk Presentation to SIL Board 2012
Suzanne Pilsk Presentation to SIL Board 2012
 
Sherborn: Pilsk, Joel Richard & Kalfatovic - Unlocking the Index Animalium: F...
Sherborn: Pilsk, Joel Richard & Kalfatovic - Unlocking the Index Animalium: F...Sherborn: Pilsk, Joel Richard & Kalfatovic - Unlocking the Index Animalium: F...
Sherborn: Pilsk, Joel Richard & Kalfatovic - Unlocking the Index Animalium: F...
 
Presentation ala2010
Presentation ala2010Presentation ala2010
Presentation ala2010
 
2009 05 20 Cimc Pilsk
2009 05 20 Cimc Pilsk2009 05 20 Cimc Pilsk
2009 05 20 Cimc Pilsk
 
Botany and the BHL: A Botanical Overview of the Biodiversity Heritage Library
Botany and the BHL: A Botanical Overview of the Biodiversity Heritage LibraryBotany and the BHL: A Botanical Overview of the Biodiversity Heritage Library
Botany and the BHL: A Botanical Overview of the Biodiversity Heritage Library
 
PACSCL's "Hidden Collections" Processing Project
PACSCL's "Hidden Collections" Processing ProjectPACSCL's "Hidden Collections" Processing Project
PACSCL's "Hidden Collections" Processing Project
 
Usaf navy marine corps librarians 06 25-10
Usaf navy marine corps librarians 06 25-10Usaf navy marine corps librarians 06 25-10
Usaf navy marine corps librarians 06 25-10
 
Tassonomia E Folksonomia
Tassonomia E FolksonomiaTassonomia E Folksonomia
Tassonomia E Folksonomia
 
Taxonomies and Folksonomies
Taxonomies and FolksonomiesTaxonomies and Folksonomies
Taxonomies and Folksonomies
 
The Biodiversity Heritage Library: 30 Million Pages of Taxonomic Literature &...
The Biodiversity Heritage Library: 30 Million Pages of Taxonomic Literature &...The Biodiversity Heritage Library: 30 Million Pages of Taxonomic Literature &...
The Biodiversity Heritage Library: 30 Million Pages of Taxonomic Literature &...
 
Best Genealogy Websites of 2013
Best Genealogy Websites of 2013Best Genealogy Websites of 2013
Best Genealogy Websites of 2013
 
The Biodiversity Heritage Library
The Biodiversity Heritage LibraryThe Biodiversity Heritage Library
The Biodiversity Heritage Library
 
ANTH140 Introduction to Prehistory
ANTH140 Introduction to PrehistoryANTH140 Introduction to Prehistory
ANTH140 Introduction to Prehistory
 
ANTH140 - Introduction to Prehistory
ANTH140 - Introduction to PrehistoryANTH140 - Introduction to Prehistory
ANTH140 - Introduction to Prehistory
 
The Biodiversity Heritage Library: Growing from Botanical Origins
The Biodiversity Heritage Library: Growing from Botanical OriginsThe Biodiversity Heritage Library: Growing from Botanical Origins
The Biodiversity Heritage Library: Growing from Botanical Origins
 
OCLC Research: an overview and update for the National Library of Wales
OCLC Research: an overview and update for the National Library of WalesOCLC Research: an overview and update for the National Library of Wales
OCLC Research: an overview and update for the National Library of Wales
 
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library ProjectSmithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
Smithsonian Libraries 2.0 and the Biodiversity Heritage Library Project
 

Similar to Cua2010

The Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
The Wonderful Technicolor World Digital Goodness @ Smithsonian LibrariesThe Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
The Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
Martin Kalfatovic
 
Smithsonian Libraries Overview
Smithsonian Libraries OverviewSmithsonian Libraries Overview
Smithsonian Libraries Overview
Martin Kalfatovic
 
Exploring Cultural History Online -- Winding Rivers Library System Kickoff Event
Exploring Cultural History Online -- Winding Rivers Library System Kickoff EventExploring Cultural History Online -- Winding Rivers Library System Kickoff Event
Exploring Cultural History Online -- Winding Rivers Library System Kickoff Event
Recollection Wisconsin
 
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
Marcus Smith
 
Tanya Szrajber, The British Museum Collection Database
Tanya Szrajber, The British Museum Collection DatabaseTanya Szrajber, The British Museum Collection Database
Tanya Szrajber, The British Museum Collection Database
Andrew Prescott
 
Wi ls worldwho talk
Wi ls worldwho talkWi ls worldwho talk
Wi ls worldwho talkWiLS
 
The DPLA and NY Heritage for Tech Camp 2014
The DPLA and NY Heritage for Tech Camp 2014The DPLA and NY Heritage for Tech Camp 2014
The DPLA and NY Heritage for Tech Camp 2014
Larry Naukam
 
Eastern Shores Library System digitization project
Eastern Shores Library System digitization projectEastern Shores Library System digitization project
Eastern Shores Library System digitization project
Recollection Wisconsin
 
CUA LSC818 2007
CUA LSC818 2007CUA LSC818 2007
CUA LSC818 2007
SCPilsk
 
Promoting Digital Cultural Heritage Collections: Challenges and Opportunities
Promoting Digital Cultural Heritage Collections: Challenges and OpportunitiesPromoting Digital Cultural Heritage Collections: Challenges and Opportunities
Promoting Digital Cultural Heritage Collections: Challenges and Opportunities
UCD Library
 
Beyond the Silos of the LAMs: The Evolving and Converging Environment for t...
Beyond the Silos of the LAMs: The Evolving and Converging Environment for t...Beyond the Silos of the LAMs: The Evolving and Converging Environment for t...
Beyond the Silos of the LAMs: The Evolving and Converging Environment for t...
Visual Resources Association
 
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Rose Holley
 
Sharing Your Digital Collection
Sharing Your Digital CollectionSharing Your Digital Collection
Sharing Your Digital Collection
WiLS
 
Creating Content: Smithsonian Institution Libraries' Digital Library Program
Creating Content: Smithsonian Institution Libraries' Digital Library ProgramCreating Content: Smithsonian Institution Libraries' Digital Library Program
Creating Content: Smithsonian Institution Libraries' Digital Library Program
Martin Kalfatovic
 
Code4 lib 2015
Code4 lib 2015Code4 lib 2015
Code4 lib 2015
Megan Forbes
 
Department Brownbags : Division of Birds, NMNH
Department Brownbags : Division of Birds, NMNHDepartment Brownbags : Division of Birds, NMNH
Department Brownbags : Division of Birds, NMNH
Sonoe Nakasone
 
Jennings directors station
Jennings directors stationJennings directors station
Jennings directors stationAmanda Carlson
 
Using digital collections como2015
Using digital collections como2015Using digital collections como2015
Using digital collections como2015
LYRASIS_PRODEV
 
Librarians are everywhere_azla
Librarians are everywhere_azlaLibrarians are everywhere_azla
Librarians are everywhere_azla
Bryan Heidorn
 

Similar to Cua2010 (20)

The Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
The Wonderful Technicolor World Digital Goodness @ Smithsonian LibrariesThe Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
The Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
 
Smithsonian Libraries Overview
Smithsonian Libraries OverviewSmithsonian Libraries Overview
Smithsonian Libraries Overview
 
Exploring Cultural History Online -- Winding Rivers Library System Kickoff Event
Exploring Cultural History Online -- Winding Rivers Library System Kickoff EventExploring Cultural History Online -- Winding Rivers Library System Kickoff Event
Exploring Cultural History Online -- Winding Rivers Library System Kickoff Event
 
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
The Semantic Web and the Digital Archaeological Workflow: A Case Study from S...
 
Tanya Szrajber, The British Museum Collection Database
Tanya Szrajber, The British Museum Collection DatabaseTanya Szrajber, The British Museum Collection Database
Tanya Szrajber, The British Museum Collection Database
 
Wi ls worldwho talk
Wi ls worldwho talkWi ls worldwho talk
Wi ls worldwho talk
 
The DPLA and NY Heritage for Tech Camp 2014
The DPLA and NY Heritage for Tech Camp 2014The DPLA and NY Heritage for Tech Camp 2014
The DPLA and NY Heritage for Tech Camp 2014
 
Eastern Shores Library System digitization project
Eastern Shores Library System digitization projectEastern Shores Library System digitization project
Eastern Shores Library System digitization project
 
CUA LSC818 2007
CUA LSC818 2007CUA LSC818 2007
CUA LSC818 2007
 
Promoting Digital Cultural Heritage Collections: Challenges and Opportunities
Promoting Digital Cultural Heritage Collections: Challenges and OpportunitiesPromoting Digital Cultural Heritage Collections: Challenges and Opportunities
Promoting Digital Cultural Heritage Collections: Challenges and Opportunities
 
Ecdl2004
Ecdl2004Ecdl2004
Ecdl2004
 
Beyond the Silos of the LAMs: The Evolving and Converging Environment for t...
Beyond the Silos of the LAMs: The Evolving and Converging Environment for t...Beyond the Silos of the LAMs: The Evolving and Converging Environment for t...
Beyond the Silos of the LAMs: The Evolving and Converging Environment for t...
 
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
 
Sharing Your Digital Collection
Sharing Your Digital CollectionSharing Your Digital Collection
Sharing Your Digital Collection
 
Creating Content: Smithsonian Institution Libraries' Digital Library Program
Creating Content: Smithsonian Institution Libraries' Digital Library ProgramCreating Content: Smithsonian Institution Libraries' Digital Library Program
Creating Content: Smithsonian Institution Libraries' Digital Library Program
 
Code4 lib 2015
Code4 lib 2015Code4 lib 2015
Code4 lib 2015
 
Department Brownbags : Division of Birds, NMNH
Department Brownbags : Division of Birds, NMNHDepartment Brownbags : Division of Birds, NMNH
Department Brownbags : Division of Birds, NMNH
 
Jennings directors station
Jennings directors stationJennings directors station
Jennings directors station
 
Using digital collections como2015
Using digital collections como2015Using digital collections como2015
Using digital collections como2015
 
Librarians are everywhere_azla
Librarians are everywhere_azlaLibrarians are everywhere_azla
Librarians are everywhere_azla
 

Cua2010

  • 1. Smithsonian Institution Libraries “Metadata Mixing & Matching For Discovery” LSC 888 The Special Library/ Information Center Suzanne C. Pilsk ~ Smithsonian Institution Libraries ~ 2010
  • 2. Facts and Figures Smithsonian Institution Libraries Washington, D.C. • Anacostia Museum & Center for African American History and Culture Library • Anthropology Library • Botany and Horticulture Library • The Dibner Library of the History of Science and Technology • Freer Gallery of Art and Arthur M. Sackler Gallery Library • Hirshhorn Museum and Sculpture Garden Library • Joseph F. Cullman 3rd Library of Natural History
  • 3. Facts and Figures Smithsonian Institution Libraries Washington, D.C. (continued) • Museum Studies & Reference Library • National Air and Space Museum Library • National Museum of American History Library • National Museum of Natural History Library • National Postal Museum Library • National Zoological Park Library • Smithsonian American Art Museum/National Portrait Gallery Library • Warren M. Robbins Library, National Museum of African Art
  • 4. Facts and Figures Smithsonian Institution Libraries Elsewhere Suitland, Md. • Museum Support Center Library • National Museum of the American Indian Library Edgewater, Md. • Smithsonian Environmental Research Center Library New York City • Cooper-Hewitt, National Design Museum Library Republic of Panama • Smithsonian Tropical Research Institute Library
  • 5. Facts and Figures Smithsonian Institution Libraries African Art Latino History and Culture African American History and Materials Research Culture Modern and Contemporary Art Anthropology Museology American Art Native American History and Culture American History Natural History Asian and Middle Eastern Art Postal History Aviation history and Space Flight Tropical Biology Design and Decorative Arts Trade Literature Environmental Management and World’s Fair Ephemera Ecology History of Science and Technology
  • 6. What’s So Special? Public Museum Smithsonian Institution is the largest museum complex in the world … “The Nation’s Attic”
  • 7. “Increase and Diffusion of Knowledge” Unlock the Mysteries of the Universe Understanding and Sustaining a Biodiverse Planet Valuing World Cultures Understanding the American Experience
  • 8. SIL Mission (Smithsonian Directive 500) As the largest and most diverse museum library in the world, SIL leads the Smithsonian in taking advantage of the opportunities of the digital society. SIL provides authoritative information and creates innovative services and programs for Smithsonian Institution researchers, scholars and curators, as well as the general public, to further their quest for knowledge. Through paper preservation and digital technologies, SIL ensures broad and enduring access to the Libraries’ collections for all users.
  • 9. SIL’s Strategic Plan “Focus on Service” • GOAL 1: COLLABORATING ACROSS BOUNDARIES – SIL creates a compelling environment for connecting, collaborating and exploring across disciplines and information boundaries • GOAL 2: DISCOVERING INFORMATION – SIL enhances and eases the discovery of information in our collections for SI scholars, researchers, scientists, and the larger world of learners • GOAL 3: CONNECTING WITH USERS – SIL understands and meets user needs, serving users where they live and work • GOAL 4: BUILDING EXPERTISE – SIL builds expertise on information discovery, navigation and management • GOAL 5: ENABLING OUR MISSION – SIL ensures its success through increased financial strength, effective administrative support, and organizational excellence
  • 10. Facts and Figures Smithsonian Institution Libraries Total volumes > 1.7 million 50,000 are rare books 10,000 manuscripts Trade Catalogs > 500, 000 items > 30,000 companies dating from the 1800s
  • 11. Facts and Figures • 102 Smithsonian Libraries Staff • 17 Souls in Cataloging Services (with contractors)
  • 12. • Traditional Library • Traditional Services
  • 13. Integrated Library System Smithsonian Institution Research Information System (SIRIS) – MARC – AACR2r – ISBD – LC Classification – LC Subject Headings
  • 14. Traditional Cataloging • Monographs • Serials • Videos • Microfilm/fiche • Sound Recordings • CD/DVDs • Electronic Resources
  • 15. Traditional Cataloging • OCLC • Program for Cooperative Cataloging – NACO – SACO – BIBCO
  • 16. SI Libraries Serves • Curators • Researchers • Post-Docs • Museum Administrators • Public
  • 17.
  • 18. IFLA’s Functional Requirements for Bibliographic Data To Find To Identify To Select To Obtain To USE
  • 19. Determining Level of Metadata • What do you have? • What staff do you have? • Who are your users? • Where will it go? • Will it stay there or travel on and on and on and on and on and on and on and on
  • 22.
  • 23. Metadata: MARC MARC 110 Oscar Mayer & Co. 650 Frankfurters
  • 24. Metadata Dublin Core Creator: Oscar Mayer & Co. Subject: Frankfurters
  • 25. Metadata: Real MARC – Still failure to serve 02761nam 2200469 4500001000700000005001700007008004100024010002300065019001300088035001400 1010350023001150400061001380490027001990500015002261000042002412450193002 8326000830047630000170055950403350057650501540091159001090106559000960117 4650002601270945002101296945007301317945003101390945004801421945004801469 9450047015179450079015649450044016439450046016879450048017339450076017819 4500440185794500510190194500510195294500710200394500900207494500960216494 5003102260-459797-20050131154400.0-731129m19021933enk b 000 0 lat c- -aagr03000069 //r582- -a14018362- -aABY6485LB- -a(OCoLC)ocm00751549- -aU.S. Dept. of Agr. Libr.-cRIU-dOCL-dCHS-dSER-dSMI-dWaOLN- -aSMI$-aSMIM-aSMIE-aSMIB-00-aQL354-b.S5- 1 -aOscar Mayer & Co.-10-aPronto pup:-bhot dogs hamburgers/-ca Oscar Mayer and Company.- -aNew Orleans, La. :-bBourbon Street Foods,-c2000.
  • 26. Metadata: MARCXML <?xml version="1.0" encoding="UTF-8" ?> <collection xmlns="http://www.loc.gov/MARC21/slim" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd"> <record><leader>02761nam a2200469 4500</leader> <controlfield tag="001">459797</controlfield> <controlfield tag="005">20050131154400.0</controlfield> <controlfield tag="008">731129m19021933enk b 000 0 lat c</controlfield> <datafield tag="010" ind1=" " ind2=" "> <subfield code="a">agr03000069 //r582</subfield> </datafield>
  • 27. How to make THIS into 0’s and 1’s
  • 28. Virtual Library defined in the Online Dictionary for Library and Information Science A "library without walls" in which the collections do not exist … [in] tangible form at a physical location but are electronically accessible in digital format via computer networks. The term digital library is more appropriate because virtual (borrowed from "virtual reality") suggests that the experience of using such a library is not the same as the "real" thing when in fact the experience of reading or viewing a document on a computer screen may be qualitatively different from reading the same publication in print, but the information content is the same regardless of format. ~ http://lu.com/odlis/odlis_v.cfm
  • 29. Digital Library defined in the Online Dictionary for Library and Information Science A library in which a significant proportion of the resources are available in machine-readable format … . The digital content may be locally held or accessed remotely via computer networks. … In libraries, the process of digitization began with the catalog, moved to periodical indexes and abstracting services, then to periodicals and large reference works, and finally to book publishing. ~ http://lu.com/odlis/odlis_v.cfm
  • 30. Traditional Digital Library • Electronic Journals & Databases • Digital Editions • Online Exhibitions • Online Catalog • Digital Reference
  • 31.
  • 32. If you digitize it … Will they find it?
  • 34.
  • 35. - Specimen - Plate or other visual image - Taxonomic description
  • 36.
  • 37. Beyond the Traditional Taxonomic Literature Needs/Requests • Beyond the Scan • Beyond the Re-Keyed • Marking up the data in metadata schemas
  • 38.
  • 39.
  • 40. MARC Milk, eggs, lactaid Make dentist appt. LCSH/LCCS Feed the cat ISBD AACR Pick up dry cleaning
  • 41. Access relatedItem MARC Dublin Core XMP Milk, eggs, lactaid METs ISBD Faceted RDA LCSH/LCCS Add hotdogs to grocery list Feed the cat XML MODS Dewey AACR FRBR Pick up dry cleaning Hierarchical TEI ONIX
  • 42. Discoverable Milk, eggs, lactaid Make dentist appt. Interoperability Feed the cat Open Access Collaboration Pick up dry cleaning
  • 43.
  • 45.
  • 46. EOL Bibliographic Curator species Data from RequestEvaluate need SIRIS Carts delivered to scanner title Goin’ down is… Need Put on shipping cart, “gap-fill” Picklist Database the rows generate‘packinglist’ invoice for other Stores Select / BHL library reject / ship Update picklist if item record state & supplies has been changed item metadata During cataloging touch-up to IA Circ to scanner Select title serial? no in picklist, Circ to cataloging upload to monograph de-duper for MARC editing yes no The Stacks Reject in picklist, Duplicate? fail Other yes Circ in Horizon Return to stacks library “bid” ? Meta- Reject in picklist, data no return to stacks check pass “Bid” Pull from stacks Preser- on title, Circ in ILS vation select in Preliminary metadata check review pass picklist And physical check fail
  • 47. IA scanning process BHL Portal Unique IA id is assigned Metadata is gathered from Periodically harvests SIRIS and the picklist db Marc.xml (bib) and item And associated with the scan Records, along with JP2000s generated JP2000 from Carts delivered & transformed Archive.org to scanner Served on archive.org To index and display QA is done by IA on 10% In the portal Put on shipping cart, generate ‘packinglist’ Books are returned, Invoice, alert cart contents are scanning center verified against invoice SIL does 20% QA Download .csv from Update picklist Checking for metadata matching to indicate portal with SIL With item, scan quality etc rescan barcodes, Portal URLs no Pass QA? yes Updated in picklist as scanned Circ in Horizon Send URLs to SIRIS Place BHL sticker near barcode Office for batch Return to Stacks updates
  • 48. BHL Mass Scanning Workflow •Bid Lists •Serials Management •Pick Lists •Packing Lists •Monographic Management •Local data flow •WonderFetch tm •Return of data •Return of material •Billing Ernest Ingersoll Hand-book to the National Museum … Smithsonian Institution, 1886
  • 49. BHL 1. Select Book ~Pull from Shelf 2. Review Physically and Metadata 3. Establish viability and create Wonderfetchtm 4. Send to IA scanning center 5. Book is scanned & QA 6. Page images loaded 7. Derivatives created 8. Book returned to library 9. Files harvested from IA portal to BHL 10. Taxonomic Intelligence Added 11. Available through BHL
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58. The BHL Portal is not a library catalog
  • 59.
  • 60.
  • 61.
  • 62.
  • 63.
  • 64.
  • 65.
  • 66.
  • 67.
  • 68.
  • 69.
  • 70.
  • 71.
  • 72.
  • 73.
  • 74.
  • 75.
  • 76.
  • 77.
  • 78.
  • 79. Collections.SI.edu ~ SI Libraries 842,000 Records in ILS 27,805 Trade literature 74,613 Art and Artists files 4,000 SI Digital Repository (SI Research Online)
  • 81. Collections.SI.edu ~ Freer + Sackler 53% of the ENTIRE collection at www.asia.si.edu & collections.si.edu 12,269 objects online NOT: F/S G’s Study Collection – 10,872 objects only for study not for exhibit – will never go online
  • 82. Collections.SI.edu ~ NPM 12,000 Records Collections.si.edu 16,000 Records in the ARAGO 214,000 Records in the database 6 Million objects = 0.2% in Collections.si.edu
  • 83. Collections.SI.edu ~ NMNH NMNH estimates 126 Million Specimens
  • 84.
  • 85. Collections.SI.edu ~ NMNH NMNH estimates 126 Million Specimens 5,400,000 Catalog Records in collection management system – 5,218,793 available on collections.nmnh.si.edu (181,207 records not available)
  • 86. Collections.SI.edu ~ NMNH Coming soon: IZ 992,000 (68,000 with media) Bot 788,000 (1,300 with media)
  • 87. Collections.SI.edu ~ NMNH NMNH estimates 126 Million Specimens 5,400,000 Catalog Records in collection management system – 5,218,793 available on collections.nmnh.si.edu (181,207 records not available) 6 out of 10 units supplying data to collections.si.edu = 2,527,557 records (153,418 have images)
  • 88. Collections.SI.edu 4,600,000 Records 445,000 Images 40 Data sources 50% of the records are from 1 source (NMNH and still growing 2,527,557 records with 153,418 images)
  • 89. SI Wide Estimations • 136.9 MILLION objects • 13 MILLION digital records • 821,000 digital images
  • 90. “The worth and importance of the Institution is not to be estimated by what it accumulates within the walls of its building, but by what it sends forth to the world.” —Joseph Henry The Smithsonian Institution’s First Secretary 1852
  • 91.
  • 92. Credits Thanks to staff at NMAI SIL NMNH MBL/WHOI Library NPM MoBot Freer/Sackler NYBG BHL
  • 93. Smithsonian Institution Libraries “Metadata Mixing & Matching For Discovery” Suzanne C. Pilsk Smithsonian Institution Libraries PilskS@si.edu