0
Dr Sandra Collins
Director, Digital Repository of Ireland
Royal Irish Academy
Mission
DRI is a trusted digital repository for Humanities
and Social Sciences Data
- linking and preserving the rich data...
App

App

Linked
Logainm

App

DRI Platform
Access

Preservation

Discovery

Federated Archives, Storage
Growing Digital Preservation & Access Policy
Interviews
National Practice Survey
National Steering Committee
National Guid...
Metadata
Formats
Global Good Data Practice
Digital Preservation
Data citation, Permanent IDs
Metrics, funding, allowable costs, training
Su...
Repository
Open source components, custom code engineering
OAIS Model
Search setup
Objects injested into Fedora Commons
Use the Solrizer gem to create the Solr index
Object metadata all CC0
Se...
User Access
Primarily through the blacklight search interface
Other routes
• Curated collections and virtual galleries
• G...
DRI Presentation
200RESEARCHERS
74MEURO 30PARTNERS
40INVESTIGATORS 8INSTITUTIONS
1CENTRE!
7X RESEARCH
STRANDS
Work Packages
Linked
Data
Personal
Sensing

Media
Analytics

Recommender Decision
Systems
Analytics

S...
Goal of Archive Discovery Project
Linked data based discovery platform
Across multiple RTE Archives, media formats
Enhance...
Two Key Ingredients
1. RDF – Resource Description Framework
Graph based Data – nodes and arcs
– Identifies objects (URIs)
...
Linked Open Data cloud
UK government

Media

User-generated
Government

BBC
Publications

Cross-domain
Geo
Life sciences

...
Development
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.

Data & System survey
Architecture Specification
Selection of Pilot Data
Re...
Schematic Overview

Complex systems, customised software, grown and
adapted over time and use
Authorisation
Public user Academic RTÉ
RTÉ
researcher researcher journalist

RTÉ
Archivist

RTÉ Archives
administrator

Se...
OUTCOMES FOR RTÉ
OUTCOMES FOR RTÉ

Pilot
Pilot
discovery
discovery
platform
platform
for all
for all
media
media
formats
formats

Enabling ...
Sandra Collins - Building a linked data based content discovery service for the RTÉ Archives
Sandra Collins - Building a linked data based content discovery service for the RTÉ Archives
Sandra Collins - Building a linked data based content discovery service for the RTÉ Archives
Upcoming SlideShare
Loading in...5
×

Sandra Collins - Building a linked data based content discovery service for the RTÉ Archives

294

Published on

Presentation at WMPA2014 - The 1st Winter School on Multimedia Processing and Applications
Dublin, Ireland, January 6-8, 2014
Co-located with MMM 2014, The 20th Anniversary International Conference on MultiMedia Modeling.
Trinity College Dublin

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
294
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
4
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Insight is a large complex centre. It represents a coming together of 5 legacy centres to create a critical mass of 200 researchers across 7 institutions with more than 40 investigators and 30 industry partners, all under a single, unified centre brand.
  • Transcript of "Sandra Collins - Building a linked data based content discovery service for the RTÉ Archives"

    1. 1. Dr Sandra Collins Director, Digital Repository of Ireland Royal Irish Academy
    2. 2. Mission DRI is a trusted digital repository for Humanities and Social Sciences Data - linking and preserving the rich data held by Irish institutions, with a central internet access point - Our Cultural & Social Heritage
    3. 3. App App Linked Logainm App DRI Platform Access Preservation Discovery Federated Archives, Storage
    4. 4. Growing Digital Preservation & Access Policy Interviews National Practice Survey National Steering Committee National Guidelines Government adoption www.oaireland.ie
    5. 5. Metadata
    6. 6. Formats
    7. 7. Global Good Data Practice Digital Preservation Data citation, Permanent IDs Metrics, funding, allowable costs, training Sustained e-infrastructure Copyright, IPR, licensing, data protection Open metadata, open access Research Data Alliance 2014 Policy, Services, Systems → Practice
    8. 8. Repository Open source components, custom code engineering
    9. 9. OAIS Model
    10. 10. Search setup Objects injested into Fedora Commons Use the Solrizer gem to create the Solr index Object metadata all CC0 Search will return metadata on all records Authorization system will restrict access to the objects Multi-lingual data (English and Irish at the moment) Indices for each language
    11. 11. User Access Primarily through the blacklight search interface Other routes • Curated collections and virtual galleries • Georeferenced data – mapping • Temporal data – timelines • User defined collections • DOI references in papers
    12. 12. DRI Presentation
    13. 13. 200RESEARCHERS 74MEURO 30PARTNERS 40INVESTIGATORS 8INSTITUTIONS 1CENTRE!
    14. 14. 7X RESEARCH STRANDS Work Packages Linked Data Personal Sensing Media Analytics Recommender Decision Systems Analytics Semantic Web Reasoning
    15. 15. Goal of Archive Discovery Project Linked data based discovery platform Across multiple RTE Archives, media formats Enhanced data discovery and delivery Enhanced workflows, digital practices, tools Digital Preservation, discovery and access
    16. 16. Two Key Ingredients 1. RDF – Resource Description Framework Graph based Data – nodes and arcs – Identifies objects (URIs) – Interlink information (Relationships) 1. Vocabularies (Ontologies) – provide shared understanding of a domain – organise knowledge in a machinecomprehensible way – give an exploitable meaning to the data
    17. 17. Linked Open Data cloud UK government Media User-generated Government BBC Publications Cross-domain Geo Life sciences Over 200 open data sets with more than 26 billion facts, interlinked by 400 million typed links, doubling every 10 month! LinkedGeoData
    18. 18. Development 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. Data & System survey Architecture Specification Selection of Pilot Data Retrieval and transformation of data Setup & integration of the metadata repository Metadata enhancement Implementation of content discovery Classification & evaluation of discovery content Demonstrator Performance KPIs Enhanced workflows for content processing
    19. 19. Schematic Overview Complex systems, customised software, grown and adapted over time and use
    20. 20. Authorisation Public user Academic RTÉ RTÉ researcher researcher journalist RTÉ Archivist RTÉ Archives administrator Search ✔ ✔ ✔ ✔ ✔ ✔ View ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ Amend Create Delete ✔ User mgmt. ✔
    21. 21. OUTCOMES FOR RTÉ
    22. 22. OUTCOMES FOR RTÉ Pilot Pilot discovery discovery platform platform for all for all media media formats formats Enabling cross-Institutional, cross-collection curation
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×