Beyond the Record: OCLC & the Future of MARC (tfons)
A review of OCLC's contribution to RDA, OCLC's infrastructure related to the MARC standard and thoughts on moving beyond the emphasis on cataloging records to the description, discovery and exposure of work-level views of bibliographic metadata.
Streaming Day: an overview of Stream Reasoning
Logical reasoning in real time on multiple, heterogeneous, gigantic and inevitably noisy data streams in order to support the decision process of extremely large numbers of concurrent users.
-- S. Ceri, E. Della Valle, F. van Harmelen and H. Stuckenschmidt, 2010
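The definition above can be made concrete with a toy sketch: reasoning restricted to a sliding window over a timestamped triple stream. Everything here (the tiny schema, the prefixes, the window size) is invented for illustration and is not from the talk.

```python
from collections import deque

# Assumed toy schema: a subclass map standing in for an RDFS ontology.
SUBCLASS = {"Bus": "Vehicle", "Tram": "Vehicle"}

def window_types(stream, window=10):
    """Yield (time, entity, inferred_type) for rdf:type triples, keeping
    only stream triples whose timestamp falls in the last `window` ticks."""
    buf = deque()
    for t, s, p, o in stream:
        buf.append((t, s, p, o))
        while buf and buf[0][0] <= t - window:
            buf.popleft()                     # expire triples outside the window
        if p == "rdf:type" and o in SUBCLASS:
            yield (t, s, SUBCLASS[o])         # inferred supertype, in real time

stream = [(1, ":b12", "rdf:type", "Bus"), (3, ":t4", "rdf:type", "Tram")]
print(list(window_types(stream)))
# [(1, ':b12', 'Vehicle'), (3, ':t4', 'Vehicle')]
```

Real stream reasoners (e.g. C-SPARQL engines) combine such windows with full query evaluation; the point here is only the window-plus-inference shape of the problem.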
A brief history of the RDF4J Project and an overview of tools and code examples that demonstrate how to work with it in your applications.
Slides accompanying the Lotico Webinar event on May 14, 2020 - see http://www.lotico.com/index.php/Eclipse_RDF4J_-_Working_with_RDF_in_Java
An exploration of a possible pipeline for RDF datasets from Timbuctoo instances to the digital archive EASY.
- Get, verify, ingest, archive and disseminate (linked) data and metadata.
- What are the implications for an archive: serving linked data over (longer periods of) time
- Practical stuff.
WI2015 - Clustering of Linked Open Data - the LODeX tool (Laura Po)
Presentation of the tool LODeX (http://www.dbgroup.unimore.it/lodex2/testCluster) at the 2015 IEEE/WIC/ACM International Conference on Web Intelligence, Singapore, December 6-8, 2015
Presentation given* at the 13th International Semantic Web Conference (ISWC), in which we present a compressed format for representing RDF data streams. See the original article at: http://dataweb.infor.uva.es/wp-content/uploads/2014/07/iswc14.pdf
* Presented by Alejandro Llaves (http://www.slideshare.net/allaves)
The nature.com ontologies portal: nature.com/ontologies (Tony Hammond)
Presentation by Tony Hammond and Michele Pasin to Linked Science workshop, co-located with International Semantic Web Conference (ISWC) 2015, on October 12, 2015
Environment Canada's Data Management Service (Safe Software)
A brief history of time-series data at Environment Canada, and an enterprise view of how FME can be integrated into departmental data management activities.
Large-scale Reasoning with a Complex Cultural Heritage Ontology (CIDOC CRM) ... (Vladimir Alexiev, PhD, PMP)
Vladimir Alexiev, Dimitar Manov, Jana Parvanova and Svetoslav Petrov. In proceedings of the workshop Practical Experiences with CIDOC CRM and its Extensions (CRMEX 2013) at TPDL 2013, 26 Sep 2013, Valletta, Malta
Quickly re-publish CSV/TSV files from existing repositories as FAIR Data with just a few mouse clicks!
You select the columns to "project" as Linked Data, and the associated ontology terms. The FAIR Projector Builder will create a FAIR Projector for you: a Triple Pattern Fragment server to provide the Linked Data; a published DCAT Distribution containing metadata about those triples and their source; and an RML model (a syntactic and semantic description of the triples), to aid in third-party discovery of this novel projection.
(current status - first prototype, not ready for public consumption)
-------
Thanks to the NBDC/DBCLS for sponsoring the hackathon series.
MDW also funded by Ministerio de Economía y Competitividad grant number TIN2014-55993-RM
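The Triple Pattern Fragment interface the projector exposes is simple enough to sketch: a client sends an (s, p, o) pattern with wildcards and gets back one page of matching triples plus a count. This is an illustrative stand-in, not the FAIR Projector code; the data and names are invented.

```python
def match_fragment(triples, s=None, p=None, o=None, page=0, page_size=2):
    """Return one page of triples matching an (s, p, o) pattern
    (None = wildcard), in the spirit of a Triple Pattern Fragments server."""
    hits = [t for t in triples
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)]
    start = page * page_size
    return {"triples": hits[start:start + page_size], "totalItems": len(hits)}

# Hypothetical triples projected from two CSV rows:
data = [(":row1", ":gene", "BRCA2"), (":row1", ":species", "human"),
        (":row2", ":gene", "TP53")]
print(match_fragment(data, p=":gene"))
# {'triples': [(':row1', ':gene', 'BRCA2'), (':row2', ':gene', 'TP53')], 'totalItems': 2}
```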
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes (Ontotext)
This presentation provides a brief introduction to logical reasoning and an overview of the most popular semantic schema and ontology languages: RDFS and the profiles of OWL 2.
While automatic reasoning has always inspired the imagination, numerous projects have failed to deliver on their promises. The typical pitfalls related to ontologies and symbolic reasoning fall into three categories:
- Over-engineered ontologies. The selected ontology language and modeling patterns can be too expressive. This can make the results of inference hard to understand and verify, which in turn makes the KG hard to evolve and maintain. It can also impose performance penalties far greater than the benefits.
- Inappropriate reasoning support. There are many inference algorithms and implementation approaches which work well with taxonomies and conceptual models of a few thousand concepts, but cannot cope with KGs of millions of entities.
- Inappropriate data layer architecture. One such example is reasoning with virtual KG, which is often infeasible.
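A minimal sketch of why reasoning cost matters: even the simplest useful inference, materializing rdfs:subClassOf over rdf:type, is a fixpoint computation whose naive form rescans the whole fact set each round. This is an illustrative toy, not any vendor's implementation.

```python
def materialize_subclass(triples):
    """Naive forward chaining of rdfs:subClassOf over rdf:type triples.
    Repeats until fixpoint; fine for toy graphs, ruinous at KG scale."""
    facts = set(triples)
    changed = True
    while changed:
        changed = False
        new = set()
        for s, p, o in facts:
            if p == "rdf:type":
                for s2, p2, o2 in facts:
                    if p2 == "rdfs:subClassOf" and s2 == o:
                        inf = (s, "rdf:type", o2)
                        if inf not in facts:
                            new.add(inf)       # derived a new type assertion
        if new:
            facts |= new
            changed = True
    return facts

# Hypothetical mini-graph: Painter < Person < Agent
g = {(":rembrandt", "rdf:type", ":Painter"),
     (":Painter", "rdfs:subClassOf", ":Person"),
     (":Person", "rdfs:subClassOf", ":Agent")}
out = materialize_subclass(g)
assert (":rembrandt", "rdf:type", ":Agent") in out
```

Production reasoners replace this with indexed semi-naive evaluation precisely because the quadratic inner scan above does not survive contact with millions of entities.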
KnowledgeWiki: An OpenSource Tool for Creating Community-Curated Vocabulary, ... (Nishita Jaykumar)
Resource Description Framework (RDF) datasets can be created by transforming structured databases, extracting triples from semi-structured and unstructured sources, crowd-sourcing, or integrating existing datasets. The reliability and quality of these datasets can be improved by the participation of domain experts via a special-purpose tool or a crowd-sourced application. Wikidata and Semantic MediaWiki are platforms which facilitate this kind of crowd-sourced data curation.
We present our system, KnowledgeWiki, which is built upon the existing Semantic MediaWiki. We develop a novel extension by adopting the singleton property data model in our KnowledgeWiki. This extension allows various kinds of metadata about the RDF triples to be created in the wiki. We combine this extension with other extensions such as semantic forms to provide a user-friendly, wiki-like interface for domain experts with no prior technical expertise to easily curate data. We also present our new enhancement to Semantic MediaWiki, which facilitates importing existing RDF datasets into the wiki-based curating platform based on the singleton property approach, preserving the provenance of individual triples. We also describe how it is being used by the materials science community to create and curate consolidated vocabularies.
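The singleton property model the abstract relies on can be sketched in a few lines: instead of asserting (s, p, o) directly, a unique property instance is minted, linked back to the generic property, and the per-triple metadata is attached to that instance. The names and data below are hypothetical illustrations, not KnowledgeWiki's actual schema.

```python
from itertools import count

_ids = count(1)

def assert_with_provenance(graph, s, p, o, **meta):
    """Singleton property pattern: mint a unique property instance so that
    metadata (source, curator, ...) can be attached to this one triple."""
    sp = f"{p}#{next(_ids)}"                       # e.g. ":hasMeltingPoint#1"
    graph.append((sp, "rdf:singletonPropertyOf", p))
    graph.append((s, sp, o))
    for k, v in meta.items():
        graph.append((sp, f":{k}", v))             # provenance about this triple only
    return sp

g = []
assert_with_provenance(g, ":Alumina", ":hasMeltingPoint", "2072C",
                       source=":Handbook2015", curator=":expert42")
for t in g:
    print(t)
```

The price, as the paper's approach implies, is one extra node and triple per assertion, which is exactly the trade that makes per-triple provenance expressible in plain RDF.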
ESWC 2012 presentation: Supporting Linked Data Production for Cultural Heritage... (Victor de Boer)
Within the cultural heritage field, proprietary metadata and vocabularies are being transformed into public Linked Data. These efforts have mostly been at the level of large-scale aggregators such as Europeana where the original data is abstracted to a common format and schema. Although this approach ensures a level of consistency and interoperability, the richness of the original data is lost in the process. In this paper, we present a transparent and interactive methodology for ingesting, converting and linking cultural heritage metadata into Linked Data. The methodology is designed to maintain the richness and detail of the original metadata.
We introduce the XMLRDF conversion tool and describe how it is integrated in the ClioPatria semantic web toolkit. The methodology and the tools have been validated by converting the Amsterdam Museum metadata to a Linked Data version. In this way, the Amsterdam Museum became the first `small' cultural heritage institution with a node in the Linked Data cloud.
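The richness-preserving conversion idea can be illustrated with a minimal sketch: map each original XML field to its own predicate rather than abstracting to a common schema. This is not the XMLRDF tool itself; the record, URIs and field names are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical museum record in its original, institution-specific XML:
XML = """<record id="am-1234">
  <title>View of Amsterdam</title>
  <creator>Unknown</creator>
</record>"""

def record_to_triples(xml_text, base="http://example.org/am/"):
    """One predicate per original element name, so no detail is lost
    (the aggregator-style alternative would flatten both to dc:* terms)."""
    root = ET.fromstring(xml_text)
    subj = base + root.get("id")
    return [(subj, base + child.tag, child.text) for child in root]

for t in record_to_triples(XML):
    print(t)
```

A second, separate mapping pass can then link these institution-specific predicates to shared vocabularies, which is the interactive step the paper's methodology makes transparent.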
Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open... (Thomas Gottron)
The intensive growth of the Linked Open Data (LOD) Cloud has spawned a web of data where a multitude of data sources provides huge amounts of valuable information across different domains. Nowadays, when accessing and using Linked Data, the challenging question is increasingly not so much whether relevant data is available, but rather where it can be found, how it is structured and how to make the best use of it.
In this lecture I will start by giving a brief introduction to the concepts underlying LOD. Then I will focus on three aspects of current research:
(1) Managing Linked Data. Index structures play an important role for making use of the information in LOD cloud. I will give an overview of indexing approaches, present algorithms and discuss the ideas behind the index structures.
(2) Analysing Linked Data. I will present methods for analysing various aspects of LOD, from an information-theoretic analysis for measuring structural redundancy, through formal concept analysis for identifying alternative declarative descriptions, to a dynamics analysis for capturing the evolution of Linked Data sources.
(3) Making Use of Linked Data. Finally I will give a brief overview and outlook on where the presented techniques and approaches are of practical relevance in applications.
(Talk at the IRSS summerschool 2014 in Athens)
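The index structures of point (1) can be illustrated with the simplest useful case: a POS (predicate, object, subject) index that answers "which subjects have this value for this property?" without scanning the dataset. Purely a toy sketch; real LOD indexes add compression and persistence.

```python
from collections import defaultdict

def build_pos_index(triples):
    """POS index: predicate -> object -> set of subjects."""
    idx = defaultdict(lambda: defaultdict(set))
    for s, p, o in triples:
        idx[p][o].add(s)
    return idx

triples = [(":a", "rdf:type", ":City"), (":b", "rdf:type", ":City"),
           (":a", ":name", "Athens")]
idx = build_pos_index(triples)
print(sorted(idx["rdf:type"][":City"]))   # [':a', ':b']
```

Triple stores typically keep several such permutations (SPO, POS, OSP) so that any triple pattern can be answered from a sorted index.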
Similar to: Integration of collection data - A case study from the Oxford Museums and Libraries (OXLOD)
Presentation to the RDA and Rare Materials workshop in Edinburgh on the 6th of November 2015. Jointly presented by Prof. Nicholas Pickwoad and Dr Athanasios Velios.
Extra value from documentation: a proposal for Icon (Athanasios Velios)
A short presentation to the Icon Book and Paper Group during their 2015 AGM, introducing the Icon documentation network and a few ideas about open sharing of conservation data.
Presentation at the JISC data spring Sandpit in Birmingham. This was a proposal for a new project to examine the potential of the semantic desktop to capture contextual research data with case studies from the arts and humanities.
The Building Blocks of QuestDB, a Time Series Database (javier ramirez)
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open-source time-series database designed for speed. We will also review some of the changes we have made over the past two years to deal with late and unordered data, non-blocking writes, read replicas, and faster batch ingestion.
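The "late and unordered data" problem mentioned above can be sketched with a tiny staging buffer: late rows are slotted into a sorted staging area, and only rows older than a lag threshold are flushed as an ordered batch. This is an illustrative toy; QuestDB's actual out-of-order mechanism differs in detail.

```python
import bisect

class O3Buffer:
    """Toy out-of-order ingestion buffer: keep a small sorted staging area
    and flush rows older than `lag` ticks behind the newest timestamp."""
    def __init__(self, lag):
        self.lag = lag
        self.staged = []                  # kept sorted by timestamp

    def ingest(self, ts, value):
        bisect.insort(self.staged, (ts, value))
        cutoff = self.staged[-1][0] - self.lag
        flush = [r for r in self.staged if r[0] <= cutoff]
        self.staged = [r for r in self.staged if r[0] > cutoff]
        return flush                      # ordered rows now safe to commit

buf = O3Buffer(lag=5)
buf.ingest(10, "a")
buf.ingest(8, "b")                        # late arrival, slotted before 10
print(buf.ingest(20, "c"))                # flushes [(8, 'b'), (10, 'a')]
```

The lag is the usual trade-off: a larger value absorbs later stragglers at the cost of delaying commits.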
Learn SQL: from basic queries to advanced queries (manishkhaire30)
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
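The basic-to-advanced progression described above can be shown in a few lines using Python's built-in sqlite3: a plain GROUP BY aggregate, then a window function over the same (hypothetical) sales table.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE sales (region TEXT, amount INT)")
con.executemany("INSERT INTO sales VALUES (?, ?)",
                [("north", 10), ("north", 30), ("south", 20)])

# Foundations: total per region (retrieval + aggregation)
print(con.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall())                             # [('north', 40), ('south', 20)]

# Advanced: each row alongside its region's running total (window function)
print(con.execute(
    "SELECT region, amount, SUM(amount) OVER "
    "(PARTITION BY region ORDER BY amount) FROM sales"
).fetchall())
```

Window functions (available in SQLite 3.25+) are a good marker of the jump from basic aggregation to row-level analytical queries.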
Adjusting OpenMP PageRank: SHORT REPORT / NOTES (Subhajit Sahu)
For massive graphs that fit in RAM but not in GPU memory, it is possible to take advantage of a shared-memory system with multiple CPUs, each with multiple cores, to accelerate PageRank computation. If the NUMA architecture of the system is properly taken into account with good vertex partitioning, the speedup can be significant. To take steps in this direction, experiments are conducted to implement PageRank in OpenMP using two different approaches, uniform and hybrid. The uniform approach runs all primitives required for PageRank in OpenMP mode (with multiple threads). On the other hand, the hybrid approach runs certain primitives in sequential mode (i.e., sumAt, multiply).
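The primitives named above can be sketched sequentially in plain Python to show what one PageRank iteration does: a multiply-style step scales each rank by its out-degree, and a sumAt-style step gathers contributions from in-neighbours. (Illustrative only; the report's versions parallelise these loops with OpenMP.)

```python
def pagerank_step(ranks, out_deg, in_edges, damping=0.85):
    """One power-iteration step on a graph given as in-edge lists."""
    n = len(ranks)
    # multiply: each vertex's contribution per out-link
    contrib = [r / d if d else 0.0 for r, d in zip(ranks, out_deg)]
    # sumAt: gather contributions from in-neighbours of each vertex
    return [(1 - damping) / n + damping * sum(contrib[u] for u in in_edges[v])
            for v in range(n)]

# Hypothetical 3-node graph: 0 -> 1, 0 -> 2, 1 -> 2, 2 -> 0
out_deg = [2, 1, 1]
in_edges = [[2], [0], [0, 1]]
r = [1 / 3] * 3
for _ in range(50):
    r = pagerank_step(r, out_deg, in_edges)
print([round(x, 3) for x in r])           # [0.388, 0.215, 0.397]
```

In the uniform OpenMP variant both loops become `parallel for` regions; the hybrid variant keeps loops like these sequential when the threading overhead outweighs the work.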
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23... (John Andrews)
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working with unstructured data. Speakers will present related topics such as vector databases, LLMs, and managing data at scale. The intended audience includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs. This meetup was formerly the Milvus Meetup, and is sponsored by Zilliz, maintainers of Milvus.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake (Walaa Eldin Moustafa)
Dynamic policy enforcement is becoming an increasingly important topic in today’s world, where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) they are auto-generated from declarative data annotations; (2) they respect user-level consent and preferences; (3) they are context-aware, encoding a different set of transformations for different use cases; (4) they are portable: while the SQL logic is implemented in only one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
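The core idea of a compliance-enforcing view can be shown with built-in sqlite3: queries are routed to a view that masks a sensitive column according to a per-user consent table. All table, column and value names here are hypothetical, not LinkedIn's.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE members (id INT, email TEXT);
CREATE TABLE consent (id INT, allow_email INT);
INSERT INTO members VALUES (1, 'a@x.com'), (2, 'b@x.com');
INSERT INTO consent VALUES (1, 1), (2, 0);

-- The auto-generated, consent-respecting view the catalog would route to:
CREATE VIEW members_compliant AS
  SELECT m.id,
         CASE WHEN c.allow_email = 1 THEN m.email ELSE '<redacted>' END AS email
  FROM members m JOIN consent c ON m.id = c.id;
""")
print(con.execute("SELECT * FROM members_compliant ORDER BY id").fetchall())
# [(1, 'a@x.com'), (2, '<redacted>')]
```

Because the policy lives in the view definition, every engine that can resolve the view enforces it, which is the portability property the slides emphasise.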
10. OXLOD targets
● Test workflows and technologies
● Assess quality of records
– Estimate amount of future work
● Bring GLAM community together
– Knowledge Transfer
36. Implied relationships
● lining part of spine
● spine adhesive has type animal
● endband has technique Greek
● secondary thread has material silk
37. CRM relationships/properties
● lining part of spine
● spine adhesive has type animal
● endband has technique Greek
● secondary thread has material silk
● lining crm:P46_forms_part_of spine
● spine adhesive crm:P2_has_type animal
● making of endband crm:P32_used_general_technique Greek
● secondary thread crm:P45_consists_of silk
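The four mappings above can be written out as subject-predicate-object tuples, which is all an RDF serialisation of this slide amounts to (prefixes abbreviated; a real dataset would use full CIDOC CRM URIs and minted subject URIs):

```python
# The slide's four implied relationships, expressed as CRM-property triples.
triples = [
    (":lining",            "crm:P46_forms_part_of",            ":spine"),
    (":spine_adhesive",    "crm:P2_has_type",                  ":animal"),
    (":making_of_endband", "crm:P32_used_general_technique",   ":greek"),
    (":secondary_thread",  "crm:P45_consists_of",              ":silk"),
]
for s, p, o in triples:
    print(s, p, o)
```

Note the modelling shift on the third line, taken from the slide itself: the technique attaches to the *making of* the endband (an event), not to the endband, which is characteristic of CIDOC CRM's event-centric style.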
40. Linked Data
● World Wide Web Consortium
– Tim Berners-Lee five star open data
– http://5stardata.info/en/
● Resource Description Framework (RDF)
41. Implied relationships
● lining part of spine
● spine adhesive has type animal
● endband has technique Greek
● secondary thread has material silk
42. Resource Description Framework
[Diagram: a triple connects a Subject (what do we describe?) via a Predicate (which property of the subject?) to an Object (what value of the property?)]
43. Resource Description Framework
[Diagram: the same triple instantiated: MS. Ashmole 40 (subject), has material (predicate), parchment (object)]
45. Resource Description Framework
[Diagram: the subject xxxxxxx is located at https://medieval.bodleian.ox.ac.uk/catalog/manuscript_344; the object zzzzzzzz is located at http://vocab.getty.edu/aat/300011851; the predicate yyyyyy is not yet given a location]
46. Resource Description Framework
[Diagram: as in slide 45, with the predicate yyyyyy located at http://www.cidoc-crm.org/ P45_consists_of]
47. Linked Data
● World Wide Web Consortium
– Tim Berners-Lee five star open data
– http://5stardata.info/en/
● Resource Description Framework (RDF)
● Uniform Resource Identifiers (URI)
48. Resource Description Framework
[Diagram: the subject xxxxxxx located at https://medieval.bodleian.ox.ac.uk/catalog/manuscript_344, the predicate yyyyyy located at http://www.cidoc-crm.org/ P45_consists_of, and the object zzzzzzzz located at http://vocab.getty.edu/aat/300011851]
55. provenance
[Diagram: provenance event x, crm:P11 had participant, Giovan Battista Recanti, who is rdf:type crm:E21 Person; the birth of Giovan Recanti (crm:P89 was born, ...) has crm:P4 has time-span ...; a second Giovan Battista Recanti node is linked to Venice via wdt:P19]
56. provenance
[Diagram: as in slide 55, with the two Giovan Battista Recanti nodes reconciled via owl:sameAs]
57. provenance
[Diagram: as in slide 56, with further wdt: ... links added after reconciliation]
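The reconciliation step in the diagrams can be sketched as a merge over owl:sameAs links: once the local Recanti node and its Wikidata counterpart are declared the same, facts about either can be read through a single merged identity. The identifiers below (including `wd:Q123`) are hypothetical illustrations.

```python
def merge_same_as(triples):
    """Union-find over owl:sameAs links; rewrite all other triples
    so that equivalent nodes collapse to one representative."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            x = parent[x]
        return x

    for s, p, o in triples:
        if p == "owl:sameAs":
            parent[find(s)] = find(o)      # union the two identities
    return {(find(s), p, find(o))
            for s, p, o in triples if p != "owl:sameAs"}

g = [(":event_x", "crm:P11_had_participant", ":recanti_local"),
     (":recanti_local", "owl:sameAs", "wd:Q123"),   # reconciliation link
     ("wd:Q123", "wdt:P19", ":venice")]
m = merge_same_as(g)
assert (":event_x", "crm:P11_had_participant", "wd:Q123") in m
```

After the merge, the CRM provenance event and the Wikidata birthplace statement share one subject, which is exactly the payoff slide 57 shows.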
66. OXLOD: enriched data, e.g. links to China Biographical Database (Harvard)
76. Findings
● Policy for URI design
– Versioning
– Stability during iterative mappings
● Prefer local storage
● Reconciliation never ends
– Start with local authority lists
– Maintain institutional authority lists
– Use Wikidata(?) to save effort for external reconciliation
– Has to be done by domain experts
77. Findings
● Linked Data backend
– Solid research and querying tools
– Mature triple stores
– Licensing enforced prior to authentication
– Permission management at workflow level
● Linked Data frontend
– Exciting tools
– Easy dataset delivery
– Serendipitous links through searching/browsing
79. OXLOD in numbers
● 13,280,376 records (triples)
● 40,869 words of documentation text
● 325 minutes of workshop videos
● 63 workshop participants
● 26 datasets
● 10 workshops