Linked Library Data, LOD-LAM, &
     the Dublin Core Metadata Initiative




Presented 2012-06-01
Linked Ancient World Data Institute
Corey A Harper
Topical Overview


   • Talk outline:
     * A few principles that didn't come up yesterday
       - Open vs. Closed world assumption
       - Graph Merging
     * LOD-LAM
     * Libraries in the LOD-LAM
       - RDF Vocabs
            - Non-Library
                * BiBo
                * FRBR Fake
                * DCterms
            - Library
                * Official FRBR
                * ISBD & other IFLA
                * MARC 21
                * BibFrame
                * RDAdigi
       - Value Vocabs
            - id.loc
            - fast
            - viaf
       - Instance data
            - British Library stuff & more
       - W3C Incubator Group
       - Archives
       - ALA, IFLA, ExL, & more
     * DCMI
       - DCAM
       - Metadata Registries
       - Vocabulary Alignment
     * Where does this fit?
     * Services from Libraries & Library Data?
     * Use cases for LAWDI & AW Scholarship


3 Takeaways




   • Libraries are doing a lot in this
     space
   • We want to do more, including
     building infrastructure for your
     research
   • The Dublin Core Metadata Initiative
     is more than a set of vocabularies.
     We’re working on how to do this

Library Metadata




   • Rich stores of MARC, MODS, &c.
   • Robust Controlled Vocabularies
         Subject Heading lists
         Code lists
         Thesauri
   • Emerging data model in FR*


Era of “Perpetual Metadata Heterogeneity”

   --or--

Ethan Gruber’s Alphabet Soup




   MODS MARC METS DC EAD MADS TEI
    MEI CIDOC-CRM SKOS RDF RDFS
    XML FGDC FRBR EAC-CPF ISO-19115
    FRSAD MARC-XML VRA FRAD PB-CORE
    DARWIN-CORE &c &c &c


Bibliographic Vocabs




   • Bibliographic Ontology (Bibo)
         Zotero, Omeka, EPrints and Others
   • FRBR – unofficial
         And now Official (Thank you IFLA!)
   • International Standard for Bibliographic
     Description (ISBD)
   • Resource Description and Access (RDA)

Linked Library [Archive, Museum] Data




   • LIBRIS (Swedish Union Catalog)
   • Library of Congress (LCSH, OSI)
   • German National Library
   • Hungarian National Library
   • British Library
   • Europeana
   • Archives Hub, LOCAH & Linking Lives

LCSH in Dublin Core




   • Encoding Scheme for DC Subject
   • No easy way to draw on equivalent
     terms and cross-references
   • Abstract Model, RDF and SKOS are
     enabling applications to make use of the
     whole vocabulary
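
To make the last bullet concrete, here is a minimal sketch, assuming a tiny hand-made SKOS-style graph held as plain Python tuples. The `ex:` identifiers, the alternate labels, and the broader term are invented for illustration and are not real LCSH data. It shows what a bare subject string cannot offer: resolving a non-preferred label to its concept and following a cross-reference.

```python
# Hypothetical SKOS-style triples; identifiers and labels are illustrative only.
TRIPLES = [
    ("ex:c1", "skos:prefLabel", "World Wide Web"),
    ("ex:c1", "skos:altLabel", "WWW"),
    ("ex:c1", "skos:altLabel", "Web (World Wide Web)"),
    ("ex:c1", "skos:broader", "ex:c2"),
    ("ex:c2", "skos:prefLabel", "Hypertext systems"),
]

LABEL_PROPS = {"skos:prefLabel", "skos:altLabel"}


def find_concept(label):
    """Return the concept carrying this label (preferred or alternate), or None."""
    for s, p, o in TRIPLES:
        if p in LABEL_PROPS and o.lower() == label.lower():
            return s
    return None


def pref_label(concept):
    """Return the preferred label of a concept, or None."""
    for s, p, o in TRIPLES:
        if s == concept and p == "skos:prefLabel":
            return o
    return None


def broader(concept):
    """Return the preferred labels of the concept's broader concepts."""
    return [pref_label(o) for s, p, o in TRIPLES
            if s == concept and p == "skos:broader"]


concept = find_concept("www")
print(pref_label(concept), "->", broader(concept))
```

A real application would load the published vocabulary with an RDF library rather than hard-coding triples; the lookup logic stays the same in spirit.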


LCSH as a Web Service!




   • Uses principles of linked data
   • lcsh.info -> id.loc.gov
   • People noticed when taken down
   • Links to French Subject Headings
   • URIs for Literal String lookup
   • http://id.loc.gov/authorities/label/World Wide Web
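
The lookup URL above contains literal spaces, which a client has to percent-encode before sending. A minimal sketch of building such a URL follows; the helper name `label_lookup_url` is hypothetical, no request is made, and how the service responds to the encoded form is assumed rather than shown.

```python
from urllib.parse import quote

# Base path taken from the slide; the helper below is illustrative only.
BASE = "http://id.loc.gov/authorities/label/"


def label_lookup_url(label):
    """Percent-encode a heading string and append it to the label-lookup base."""
    return BASE + quote(label, safe="")


print(label_lookup_url("World Wide Web"))
```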



Swedish National Library
RDF Data




   http://blog.libris.kb.se/semweb/?p=7
   Image courtesy of Martin Malmsten
Image courtesy of Ed Summers
Image courtesy of Ed Summers
Other Vocabularies




   • Thesaurus for Economics
   • French Subject Headings
   • Swedish Subject Headings
   • IconClass (not on web yet)
   • OCLC Terminology Services
   • Dewey Decimal Classification
   • Virtual International Authority File (VIAF)
   • Faceted Application of Subject Terminology
     (FAST)

Virtual International Authority File (VIAF)




    • http://viaf.org/
    • http://viaf.org/viaf/data
    • 20+ Authority Files
    • Map/Reduce in Hadoop
    • ODC-By License
          For attribution at triple level, using OCLC
           URI is sufficient

Social Networks of Archival Context




  Image From: http://inkdroid.org/journal/2010/08/12/archival-context-on-the-web/
Linking Lives – Screenshots from P. Johnston




Linking Lives – Screenshots from P. Johnston




Resource Description and Access




   • Current focus on MARC
         Much criticism
         Within MARC, not a tremendous change
         Different problems outside of MARC
   • Possible focus outside of MARC
         RDA as realization of FRBR
         RDA as Metadata Vocabularies
         RDA as related to Bibo

RDA as Metadata Vocabularies




    Slide adapted from Diane Hillmann
    • RDA elements, roles and vocabularies
      have been provisionally registered
    • IFLA FRBRer and ISBD elements and
      vocabularies have been officially
      registered
    • Discussions about long term
      maintenance of both RDA and the
      vocabularies
    • Effort to create multi-language RDA
      Vocabularies
Metadata Registries




   • Formerly NSDL Registry
         Now “Open Metadata Registry”
         Managing Vocabularies
         Providing Vocabulary Services


   • RDA – Now adding translations
   • IFLA Work
         FRBR, FRAD, FRSAD, ISBD


BIBO and RDAVocab




   • Open question re: alignment
         DCMI Vocabulary Management & Alignment
   • Simplified view of Bib Data is useful
         Interlinking with more general data
         Interlinking with non-library domain data
   • FRBR as internal model for library
     domain
   • Examples
Dublin Core Efforts




    • Dublin Core Metadata Initiative, more
      than Dublin Core Metadata Element Set
    • Dublin Core Abstract Model
    • Application and Description Set Profiles
    • Harmonization, not Normalization
    • Vocabulary Management Infrastructure
          Including Mapping Efforts

DCAM Resource Model




DCAM Resource Model: Subtle Difference




   • Property-Value pairs built around notion
     of “Described Resource”
   • Centralizing triples around a single
     object
         Becoming Property-Value pairs
   • Preserving DC 1:1 Principle by including
     descriptions of “related resource”
   • How to validate???
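
One way to picture the "subtle difference" is the sketch below. It assumes plain Python tuples stand in for RDF triples and that a "description" is simply the set of property-value pairs sharing one described resource. The `ex:` identifiers and values are invented, and this is not the DCAM specification's own notation.

```python
from collections import defaultdict

# Hypothetical triples; "ex:" identifiers are placeholders, not real URIs.
triples = [
    ("ex:book1", "dcterms:title", "An Example Title"),
    ("ex:book1", "dcterms:creator", "ex:person1"),
    ("ex:person1", "foaf:name", "A. N. Author"),
]


def to_descriptions(triples):
    """Group triples by subject: one description per described resource."""
    descriptions = defaultdict(list)
    for subject, prop, value in triples:
        descriptions[subject].append((prop, value))
    return dict(descriptions)


description_set = to_descriptions(triples)
for resource, pairs in description_set.items():
    print(resource, pairs)
```

The author's name lands in a separate description of the related resource `ex:person1` instead of being folded into the book's description, which is the 1:1 principle from the third bullet.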
Conclusions




   • Distributed bibliographic control environment
         Linking Data
         Focus on identification over description
   • “In short, by treating values as non-literal
     resources and assigning URIs to them we give
     ourselves (and others) the hooks on which to
     hang further descriptions.” - Andy Powell
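
The quote can be illustrated with a small sketch, assuming two hypothetical sources whose graphs are plain Python sets of triples and treating graph merging as set union (a simplification that ignores blank nodes). All URIs and values are placeholders under example.org.

```python
# Placeholder URI shared by both hypothetical sources.
PERSON = "http://example.org/id/person1"
BOOK = "http://example.org/book/1"

library_graph = {(BOOK, "dcterms:creator", PERSON)}
authority_graph = {
    (PERSON, "foaf:name", "A. N. Author"),
    (PERSON, "ex:birthYear", "1900"),
}
# The same statement with a literal value instead of a URI.
literal_graph = {(BOOK, "dcterms:creator", "Author, A. N.")}


def describe(graph, resource):
    """Return all (property, value) pairs whose subject is `resource`."""
    return sorted((p, o) for s, p, o in graph if s == resource)


def creator_of(graph, book):
    """Return the value of dcterms:creator for `book` in `graph`."""
    return next(o for s, p, o in graph if s == book and p == "dcterms:creator")


merged = library_graph | authority_graph          # graph merge as set union
merged_literal = literal_graph | authority_graph

# The URI value is a hook: the authority data attaches to it after merging.
print(describe(merged, creator_of(merged, BOOK)))
# The literal value offers no hook: nothing further can be hung on it.
print(describe(merged_literal, creator_of(merged_literal, BOOK)))
```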


W3C Linked Library Data Incubator




   • Collected, Curated and Clustered over
     50 Use Cases
   • Mined use cases for functional
     requirements and design patterns
   • Recommendations to W3C
         Should lead to Working Groups
          (or new Community Groups)
   • http://www.w3.org/2005/Incubator/lld/
Other Activities



    • ALCTS/LITA Linked Library Data IG
    • IFLA Semantic Web IG
          https://wiki.d-nb.de/x/vA10Ag
    • Open Knowledge Foundation
          http://okfn.org/
    • CKAN Linked Library Data Group:
          http://ckan.net/group/lld
    • LC’s Bibliographic Framework Initiative
          http://www.loc.gov/marc/transition/
    • IGeLU Linked Data Interest Group
          http://igelu.org/special-interests/lod
Thanks!




                         Questions?

                       corey.harper@nyu.edu
                           212.998.2479
                              @chrpr

                http://delicious.com/charper/lawditalk

