Europeana Network Association AGM 2016 - 9 November - Speaker: Joan Cobb
1. The Getty Datasets and the Semantic Web
Joan Cobb
Technical Lead, Getty Vocabularies, J. Paul Getty Trust
Europeana Network Association AGM Riga, Latvia, 8-9 November 2016
jcobb
@getty.edu
4. TheGettyDatasets
andtheSemanticWeb
jcobb
@getty.edu
The Getty’s Open Content Program
Began in August 2013 with the release of
roughly 6,400 high-resolution images
from the collections of the Getty Museum
and the Getty Research Institute
The Getty Search Gateway now contains
more than 1 million items
http://search.getty.edu/gateway/landing
6. TheGettyDatasets
andtheSemanticWeb
jcobb
@getty.edu
Some 2013 LOD Influences
Jan Feb Mar Apr May Jun Jul Aug Sep
Tim Berners-Lee TED Talk
http://www.ted.com/talks/tim_berners_lee_the_year_open_data_went_worldwide.html
Dr. Zeng delivered a 62
page report on why we
should care about the
semantic web
Wanted very much to align
with the Getty Vocabs
Eero Hyvönen’s book
Publishing and Using Cultural Heritage
Linked Data on the Semantic Web
Europeana Video –
Sometimes a picture is worth a
thousand words – in this case,
it’s a video
http://vimeo.com/36752317
7. TheGettyDatasets
andtheSemanticWeb
jcobb
@getty.edu
The Getty Vocabularies
were the first datasets to be published as LOD
Reasons:
• AAT, TGN, and ULAN were already linked by design
• The data was clean
• We had complete control of the custom, in-house developed, databases and applications that
support and publish the Getty vocabularies
• We had been receiving request for years to publish these datasets as LOD
• The Getty vocabularies would help to connect the rest of the Getty resources to each other
12. TheGettyDatasets
andtheSemanticWeb
jcobb
@getty.edu
Art & Architecture Thesaurus®
Current totals
57,824 concepts
370,310 terms
Scope includes generic terms for
work types, roles, materials, styles,
cultures, techniques, attributes,
abstract concepts
AAT
Released as LOD
20 February 2014
22. TheGettyDatasets
andtheSemanticWeb
jcobb
@getty.edu
Tracking Usage Can Be Difficult
• Once data is truly open it is difficult to track usage.
• Usage is often not visible because the LOD links are part of
the machine code that create what is visible.
• Ways we know the data is being used:
• Comments on Twitter
• Comments on other cultural heritage sites like Europeana or
LODLAM (Linked Open Data for Libraries, Archives, and Museums)
• Discussions on our public forum
• Publications
• Email from users
23. TheGettyDatasets
andtheSemanticWeb
jcobb
@getty.edu
Usage Stories
• To date, we have received 42 usage stories from people
kind enough to let us know how they are making use of
the LOD publications of AAT, TGN and ULAN.
• In the future we want to provide links to these resources
from the Getty’s LOD sites.
• The following slides show some examples of usage.
31. TheGettyDatasets
andtheSemanticWeb
jcobb
@getty.edu
What’s Next?
Connect the silos in the process
• Museum Objects
• Library resources
• Provenance
• Conservation
• Research
• Images
• Getty Vocabularies
Build on Vocabularies to
Transform Cultural Heritage Resources as “Open Content” through
Linked Open Data (LOD) and
International Image Interoperability Framework
32. TheGettyDatasets
andtheSemanticWeb
jcobb
@getty.edu
With thanks to my Getty colleagues:
Patricia Harpring
Managing Editor, Getty Vocabularies
pharpring@getty.edu
Gregg Garcia
Lead Developer, Getty Vocabularies
ggarcia@getty.edu
Murtha Baca, Head
Digital Art History Program
mbaca@getty.edu
Rob Sanderson
Semantic Architect
rsanderson@getty.edu
Joan Cobb
Technical Lead, Getty Vocabularies
The J. Paul Getty Trust
1200 Getty Center Drive
Los Angeles, CA 90049
jcobb@getty.edu
Editor's Notes
Conservation Thesaurus
Reason - Since it can not officially start until CONA-VCS is complete because of resource restraints, and since most of the work left to do on the Analysis project are being done by Joan, it makes sense to sequence them