Practical Linked Open Data for
Libraries, Archives & Museums
Jon Voss
Historypin Strategic Partnerships Director
We Are What We Do
jon.voss@wearewhatwedo.org
@jonvoss
@historypin
historypin.com
http://www.flickr.com/photos/ohiouniversitylibraries/3485066089/
http://www.evolutionoftheweb.com
Linked Open Data in Libraries, Archives and Museums
#LODLAM




    Culture              Technology                   Law
Culture             Technology   Law

New Paradigm
Making This A Reality
http://bit.ly/WlscLJ
http://research.culturalequity.org/rc-b2/get-audio-detailed-recording.do?recordingId=4231
http://www.paulhagon.com/2008/08/19/powerhouse-street-view-mashup/
History and Mashup Culture
2010 National Archives Photo Contest




                                       +
History and Mashup Culture
NARA mashup on Historypin




                        http://www.flickr.com/photos/37377809@N00/5304492185/in/pool-1633053@N21/
Swedish artist Sanna Dullaway
http://burritojustice.com/2010/02/10/yesterdays-baseball-is-tomorrows-safeway/
http://oldsf.org
http://shawnclover.com
Culture   Technology   Law
                         Law
Going from Tables to Graphs




                              http://www.flickr.com/photos/thomasjwoods-com/2264301251
Going from Tables to Graphs
Nodes and links in a graph
Going from Tables to Graphs
As computing power increases, the ability to build
more and more complex graphs becomes a reality.




                                    msulibraries      lookbackmaps
                                    msulibraries      internetarchive
                                    msulibraries      librarycongress
                                    lookbackmaps      internetarchive
                                    internetarchive   librarycongress
Introducing Triples
Nodes and Links




                                 follows
            jonvoss                                    1n9r1d




• Quite simply: Subject, Predicate, Object

• gives us the ability to describe entities in a way
  that is machine readable
What do we know about the person:
Ed Summers (aside from the fact that he rocks)?


       Bio: Hacker for
       libraries, digital
       archaeologist,
       pragmatist.              bio        knows




                            depiction of   knows




                                                   http://inkdroid.org/ehs.rdf
Triples for machines
• Triples can be serialized in many different ways,
  including Resource Description Framework,
  RDF/XML, RDFa, N3, Turtle, etc, but they all
  describe things in the
  <subject><predicate><object> format.
• Of course, we need to be consistent and
  predictable for machines to understand us.
• We need to follow simple rules and protocols
http://www.flickr.com/photos/oface/3306994117/
http://inkdroid.org/ehs.rdf
• Consider graph demo: http://civilwardata150.net

• Civil War vocabulary, or a way to link and traverse across datasets
  • Regiments, Battles, Places

• Building apps that use this data
http://conflicthistory.com
http://conflicthistory.com
http://conflicthistory.com
http://dsl.richmond.edu/emancipation/
http://digital.library.lse.ac.uk/collections/streetlifeinlondon
http://timwray.net/2011/12/canvas/
https://www.smalldemons.com
http://invisibleaustralians.org
Culture               Technology   Law

New Tools Enabling Better Sharing
Metadata vs. data, assets, digital surrogates, images
http://www.loc.gov/pictures/collection/cwp/item/2003653763/
http://www.loc.gov/pictures/item/2003653763/marc/
Legal Tools


Open Data

              CC BY

              CC0

              Public Domain Mark




 Public Domain Dedication and License (PDDL)
 Attribution License (ODC-By)
 Open Database License (ODC-ODbL)


Open (ish)

               CC BY-SA
Legal Tools


Published (NOT OPEN) Data

          CC BY-NC-ND

          CC BY-NC

          CC BY-ND

          CC BY-NC-SA
Linked Open Data in Libraries, Archives and Museums
#LODLAM




    Culture              Technology                   Law
2010

British Library
Stanford University
University of Michigan
Open Library
CERN

VIAF
ID.LOC.GOV

W3C LLD
Roy Tennant
OCLC Research
tennantr@oclc.org
@rtennant




                    Roy
The World of Linked Data




Roy Tennant
OCLC Research
The World of Linked Data
The World of Linked Data

         The Linked Data
         cloud as a whole
         grew by 300% in
              2010...

           ...whereas the
          amount of data
       relevant for libraries
          grew by nearly

          1000%
            http://swib.org/swib11/
Library Linked Data
British National Library

              Publication
     Series                          SeS
               Events


                            Author

Subject       Resource
OCLC Linked Data

  Virtual        Dewey Decimal                 FAST: Faceted
  International Classification                 Application of
  Authority File                               Subject
  (VIAF)         http://dewey.info             Terminology
                 HTML/RDFa, RDF/XML, Turtle,
                    JSON - SPARQL
http://viaf.org                                http://id.worldcat.org/fast
HTML, RDF/XML                                  HTML, RDF/XML -
                                               Download

                  WorldCat
                  http://worldcat.org/
                  Embedded RDFa - Download (part)
http://www.w3.org/community/schemabibex

• Short lived group
• Libraries, Publishers, Consumers, System Vendors, Linked Data, Standards
Bodies
• Library is only one focus
Emerging BIBFRAME Model
Our Brave New World


                “Moving from
                cataloging to
                catalinking”
 Eric Miller,
 Zepheira
The World of Linked
    Knowledge
Ingrid Mason
eResearch Analyst
Intersect Australia
ingrid.mason@intersect.org.au
@1n9r1d
www.huni.net.au
www.intersect.org.au




                                Ingrid

                                         Ingrid Mason
ALIA Online 2013 | Be Different. Do Different.
Humanities Networked Infrastructure (Virtual Lab)
Ingrid Mason | eResearch Analyst | Intersect Australia
http://lodlam.net/




                                 ???



http://linkeddata.org/   http://www.flickr.com/photos/49503832@N06/
Linked Data.

• Goal is to enable researchers to explore and interpret the commonalities or
  divergences in the data.


• Multiple significant scholarly humanities datasets to pipe in and aggregate
  with varying levels of standards and technologies.


• Information design challenge to build an ontology and use linked data and
  controlled vocabularies for data to be aligned and related.


• Provide a virtual environment to explore this data and process with tools.
HuNI Data.
 AusStage
 AUSTLANG
 Mura & Pathways
 Media Archives Project
 Encyclopedia of Australian
 Science
 Colonial Australia Popular Fiction
 Find and Connect Victoria
 Australian Women’s Register
 eMelbourne: the Encyclopedia of
 Melbourne
                                      Scholarly Data Providers
 eGold: Electronic Encyclopedia of
 Gold in Australia
 Chinese-Australian Historical
 Images in Australia
 Reason in Revolt, Source
 Documents of Australian
 Radicalism
 Guide to Australian Business
                                           104
 Records
 Australian Trade Unions Archive
 CircusOz Living Archive Video
 Collection
 Australian Film Institute Research
 Collection
Rethinking Resource Discovery.

• Goal is to enable researchers to explore and interpret the commonalities or
  divergences in the data.


      • Support researcher needs to discern facts (what is?) and locate more
        information (where is?) drawing on Australian cultural scholarly data.


      • Major task is to aggregate heterogeneous humanities data that may be
        related to digital representations e.g. digital text, audio-visual etc files.


      • Method is linked data, using ontology and controlled vocabulary
        development.


      • Outcomes are the researcher can move easily from: what is? to where
        is? in their information seeking.
Information Seeking.

• Information seeking is supported by intellectual access tools, which include
  library catalogues, archival finding aids, reference tools, subject databases,
  journal repositories and information resources (physical and digital).


• Critical step in information seeking is undertaken in preliminary visit to the
  “reference shelf” and it can often be revisited to satisfy research questions.


• What is the researcher question? What information can I find out about this
  person and their life, what they have done, etc. A classic biographical
  enquiry.


• Know the name and other facts associated with a person in the HuNI graph
  (in the lab) and follow a hyperlink to a range of web resources related to that
  person.
Metadata Mash Up.

• A mash up of reference tools and finding aids: factbooks, dictionaries,
  encyclopaedias, bibliographies and catalogues and archival registers.


• Link needs to be forged with the ontologies for domain resources e.g. TEI and
  resource discovery metadata.


• Exploring this with the crosswalk between CIDOC-CRM and FRBR-OO (latest
  draft) and keeping an eye on BibFrame.


• Same problem being explored in another NeCTAR virtual lab project: Human
  and Communication Science.


• Role of annotation and metadata in discovery of new knowledge or the
  means to elucidate new knowledge needs to be unpacked.
Australian Women’s Register
Ontology Development.

• Information design challenge to build an ontology and use linked data and
  controlled vocabularies for data to be aligned and related.


  • Reading the data. Characteristics of the data determine the ontological
    components selected and the major “entities” (aka “access points” in
    library lingo).


  • Identified early as: people, organisations, events, relationships, places,
    dates, resources, and subjects.


  • Components from ontologies already available are being reused or kept in
    our sights: CIDOC-CRM, FOAF, SKOS, FRBR, FRBR-OO, BibFrame and
    PROV-O.
Project Links

@1n9r1d
ingrid.mason@intersect.org.au
http://www.intersect.org.au

@HuNIVL (project Twitter profile)
info@huni.net.au (project email)
http://www.huni.net.au (project website)
http://apidictor.huni.net.au/ (project wiki)
http://corbicula.huni.net.au/ (data store)
http://corbicula.huni.net.au/gadget/ (data analysis tool)
Useful References

NISO. Information Standards Quarterly, Spring/Summer 2012
http://www.niso.org/publications/isq/2012
Europeana. Linked Open Data – What is it?
http://vimeo.com/36752317
Linked Open Data – Libraries, Archives, Museums (LODLAM)
http://lodlam.net/
W3C Library Linked Data Incubator Group
http://www.w3.org/2005/Incubator/lld/
Twitter hashtag: #lodlam
Google Group: lod-lam@googlegroups.com
More Useful References.

LinkedData.org
http://linkeddata.org/
LinkedDataTools.com Introducing Linked Data and the Semantic Web
http://www.linkeddatatools.com/semantic-web-basics
DATA.GOV.UK. What is Linked Data?
http://data.gov.uk/linked-data/what-is-linked-data
AusGOAL
http://www.ausgoal.gov.au/
Creative Commons Australia
http://creativecommons.org.au/
Wikipedia. Resource Description Framework.
http://en.wikipedia.org/wiki/Resource_Description_Framework
You’re not alone...
http://www.youtube.com/watch?v=0uaClHZVvD0
Linked Open Data in Libraries, Archives and Museums
#LODLAM
http://lodlam.net
San Francisco, DC, Atlanta, London, Wellington, New York,
Canberra, Melbourne, St. Louis




                                                            #
                                   #    # ##
                                         #




           #
           #
                   #
2011–2012




            http://wiki.creativecommons.org/CC0_use_for_data
Who’s got next?
Join the LODLAM movement
resources and community on http://lodlam.net
ask for help on Google Group or #lodlam on Twitter
http://openglam.org
http://outreach.wikimedia.org/wiki/GLAM

Contribute!
Start small, but START
Google Refine
http://freeyourmetadata.org
Google Refine
http://freeyourmetadata.org




                http://summit2013.lodlam.net/challenge/challenge-entries/
Linked Open Data in Libraries, Archives and Museums
#LODLAM




    Culture              Technology                   Law
http://mashupbreakdown.com
The DJ in the Library?
http://youtu.be/Mt3qf3G3_TU
Practical Linked Open Data for
Libraries, Archives & Museums

ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums