Building a linked data based content discovery service for the RTÉ Archives

195 views

Published on

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
195
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Building a linked data based content discovery service for the RTÉ Archives

  1. 1. Dr Sandra Collins Director, Digital Repository of Ireland Royal Irish Academy
  2. 2. Mission DRI is a trusted digital repository for Humanities and Social Sciences Data - linking and preserving the rich data held by Irish institutions, with a central internet access point - Our Cultural & Social Heritage
  3. 3. App App Linked Logainm App DRI Platform Access Preservation Discovery Federated Archives, Storage
  4. 4. Growing Digital Preservation & Access Policy Interviews National Practice Survey National Steering Committee National Guidelines Government adoption www.oaireland.ie
  5. 5. Metadata
  6. 6. Formats
  7. 7. Global Good Data Practice Digital Preservation Data citation, Permanent IDs Metrics, funding, allowable costs, training Sustained e-infrastructure Copyright, IPR, licensing, data protection Open metadata, open access Research Data Alliance 2014 Policy, Services, Systems → Practice
  8. 8. Repository Open source components, custom code engineering
  9. 9. OAIS Model
  10. 10. Search setup Objects injested into Fedora Commons Use the Solrizer gem to create the Solr index Object metadata all CC0 Search will return metadata on all records Authorization system will restrict access to the objects Multi-lingual data (English and Irish at the moment) Indices for each language
  11. 11. User Access Primarily through the blacklight search interface Other routes • Curated collections and virtual galleries • Georeferenced data – mapping • Temporal data – timelines • User defined collections • DOI references in papers
  12. 12. DRI Presentation
  13. 13. 200RESEARCHERS 74MEURO 30PARTNERS 40INVESTIGATORS 8INSTITUTIONS 1CENTRE!
  14. 14. 7X RESEARCH STRANDS Work Packages Linked Data Personal Sensing Media Analytics Recommender Decision Systems Analytics Semantic Web Reasoning
  15. 15. Goal of Archive Discovery Project Linked data based discovery platform Across multiple RTE Archives, media formats Enhanced data discovery and delivery Enhanced workflows, digital practices, tools Digital Preservation, discovery and access
  16. 16. Two Key Ingredients 1. RDF – Resource Description Framework Graph based Data – nodes and arcs – Identifies objects (URIs) – Interlink information (Relationships) 2. Vocabularies (Ontologies) – provide shared understanding of a domain – organise knowledge in a machinecomprehensible way – give an exploitable meaning to the data
  17. 17. Linked Open Data cloud UK government Media User-generated Government BBC Publications Cross-domain Geo Life sciences Over 200 open data sets with more than 26 billion facts, interlinked by 400 million typed links, doubling every 10 month! LinkedGeoData
  18. 18. Development 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. Data & System survey Architecture Specification Selection of Pilot Data Retrieval and transformation of data Setup & integration of the metadata repository Metadata enhancement Implementation of content discovery Classification & evaluation of discovery content Demonstrator Performance KPIs Enhanced workflows for content processing
  19. 19. Schematic Overview Complex systems, customised software, grown and adapted over time and use
  20. 20. Authorisation Public user Academic RTÉ RTÉ researcher researcher journalist RTÉ Archivist RTÉ Archives administrator Search ✔ ✔ ✔ ✔ ✔ ✔ View ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ Amend Create Delete ✔ User mgmt. ✔
  21. 21. OUTCOMES FOR RTÉ
  22. 22. OUTCOMES FOR RTÉ Pilot discovery platform for all media formats Enabling cross-Institutional, cross-collection curation

×