Linked Data Publication and
Interlinking Research within the
SFI funded ADAPT Centre
Christophe Debruyne - ADAPT, Trinity College Dublin, Dublin, Ireland
christophe.debruyne@adaptcentre.ie
June 26th, 2019 @ LIBER LOD Workshop 2019
The ADAPT Centre is funded under the SFI Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund.
www.adaptcentre.ieSFI ADAPT Centre for Digital Content Technology
• Supported by Science Foundation Ireland, ADAPT combines the
expertise of researchers at four universities (TCD, DCU, UCD, and
TU Dublin). ADAPT engages with over 140 organizations to
produce ground-breaking digital content innovations.
• The ADAPT Research Centre focuses on developing next
generation digital technologies that aim to enhance people’s
lives by helping to analyse, personalise and deliver digital data
more effectively for businesses and individuals.
• ADAPT research and technologies enable businesses in all
sectors to harness global digital content and media technologies
to achieve unprecedented customer and community
engagement.
7/4/19 2
www.adaptcentre.ieSFI ADAPT Centre for Digital Content Technology
Theme A: Understanding
Global Content
Theme B: Transforming
Global Content
Theme D: Interacting with
Global Content
Theme C: Personalizing the
User Experience
Theme E: Managing
the Global
Conversation
7/4/19 3
NLP and
Machine Learning
Multimodal Interaction
And Search
Translation Automation
Personalization and
AdaptivityData and Privacy Management
www.adaptcentre.ieTheme E
• Within Theme E address the integration, management and
optimization of privacy-aware data flows between users
and content services.
• There are 3 “strands” within the theme:
1. Content Metadata for Privacy Management
à Led by Prof. Dave Lewis
2. Data Integration Mappings: User Interaction and
Continuous Management
à Led by Prof. Declan O’Sullivan
3. Data Governance and Data Value
à Led by Dr. Rob Brennan
7/4/19 4
www.adaptcentre.ieNAISC
7/4/19 5
NAISC – pronounced noshk – stands for Novel Authoritative
Interlinking of Schema and Concepts.
The work of Lucy McKenna – PhD student at the ADAPT
Currently given a presentation in another session, but at LIBER.
The remainder of the talk will be mostly about her research.
Based on:
• Lucy McKenna, Christophe Debruyne, Declan O'Sullivan: NAISC: An
Authoritative Linked Data Interlinking Approach for
the Library Domain. JCDL 2019 (to appear)
• Lucy McKenna, Christophe Debruyne, Declan O'Sullivan: Modelling the
Provenance of Linked Data Interlinks for the Library Domain. WWW
(Companion Volume) 2019: 954-958
• Lucy McKenna, Christophe Debruyne, Declan O'Sullivan: Understanding the
Position of Information Professionals with regards to Linked Data: A Survey of
Libraries, Archives and Museums. JCDL 2018: 7-16
www.adaptcentre.ieLibraries and Linked Data
• Using Linked Data (LD) could improve the discoverability,
searchability and interoperability of library data.
• Uptake of LD in libraries is relatively slow.
• Developed a LD interlinking approach for the library
domain.
• Aim: to improve LD interlinking accessibility for Information
Professionals (IPs) such as librarians, archivists and
cataloguers.
7/4/19 6
www.adaptcentre.ieLD Survey for Information Professionals
• Online questionnaire - 50 Questions - 185 participants
• Distributed to libraries, archives and museums.
• Challenges identified included:
• Lack of LD provenance data
• Technologically complex LD tooling
• LD tooling does not meet requirements of library
• Difficulty interlinking and integrating data
7/4/19 7
Interlinking method suitable for IPs
User friendly GUI supporting method
Provenance information
GOAL
www.adaptcentre.ieNAISC Framework
7/4/19 8
Staging
Characterization
Mappings in Use
Reuse
Matching
Align and Map
The focus is on the creation and management of interlinks.
www.adaptcentre.ieNAISC Provenance Model
Competency question driven ontology engineering
• Adoption of the PROV-O ontology
• Minimal extension
– introduction of
classes for NAISC
specific entities
and activities
• Prescription of
knowledge
organization –
clever use of
named graphs
7/4/19 9
www.adaptcentre.ieThe Tool – The collections you want to interlink
7/4/19 10
See the video at https://www.scss.tcd.ie/~mckennl3/naisc/
www.adaptcentre.ieThe Tool – Looking for secondary resources
7/4/19 11
See the video at https://www.scss.tcd.ie/~mckennl3/naisc/
www.adaptcentre.ie
The Tool – First find correspondences, then make relations
explicit.
7/4/19 12
See the video at https://www.scss.tcd.ie/~mckennl3/naisc/
www.adaptcentre.ieThe Tool
7/4/19 13
See the video at https://www.scss.tcd.ie/~mckennl3/naisc/
www.adaptcentre.ie( Lucy’s) Future work
• Refine method and tool based on feedback, and conduct a
final experiment.
• Investigate how named graphs and RDF* affect IPs.
How does this work fit in Theme E’s wider research agenda
on Linked Data publishing and interlinking?
7/4/19 14
www.adaptcentre.ieMapping Engineering and Governance
7/4/19 15
Staging
Characterization
Mappings in Use
Reuse
Matching
Align and Map
Christophe Debruyne, Brian Walshe, Declan O'Sullivan:
Towards a project centric metadata model and lifecycle for
ontology mapping governance. iiWAS 2015: 50:1-50:10
ReuseCharacterizationStaging
Scope
Data
Discovery
Requirements
Analysis
Data
Analysis
Discover
Evaluate
Reuse?
Matching
Evaluate Execute Plan
No
Align and Map
Create
Alignment
Prepare
Render
Mapping
Application
Yes
Source
Data
Target
Data
Candidate
Correspondences
Alignments
Mapping
Stakeholders
www.adaptcentre.ieMapping Engineering and Governance
7/4/19 16
Research on data integration mappings:
• Uplift: From non-RDF to RDF
• Extend alignment and mapping languages
• Traceable, transparent and self-contained
mappings
Christophe Debruyne, Harshvardhan J. Pandit, Dave
Lewis, Declan O'Sullivan: Towards Generating Policy-
Compliant Datasets. ICSC 2019: 199-203
Christophe Debruyne, Declan O'Sullivan: R2RML-F:
Towards Sharing and Executing Domain Logic in
R2RML Mappings. LDOW@WWW 2016
Staging
Characterization
Mappings in Use
Reuse
Matching
Align and Map
www.adaptcentre.ieMapping Engineering and Governance
7/4/19 17
Visual representation to facilitate creation, management and
understanding of uplift mappings. Broadens range of
stakeholders who can create mappings.
Ademar Crotti Junior, Christophe Debruyne, Declan
O'Sullivan: Juma Uplift: Using a Block Metaphor for
Representing Uplift Mappings. ICSC 2018: 211-218
Staging
Characterization
Mappings in Use
Reuse
Matching
Align and Map
www.adaptcentre.ieMapping Engineering and Governance
7/4/19 18
Monitoring the quality of produced Linked Data datasets
and the interlinks.
Alan Meehan, Dimitris Kontokostas, Markus Freudenberg,
Rob Brennan, Declan O'Sullivan: Validating Interlinks
Between Linked Data Datasets with the SUMMR
Methodology. OTM Conferences 2016: 654-672
Jeremy Debattista, Eamonn Clinton, Rob Brennan:
Assessing the Quality of Geospatial Linked Data -
Experiences from Ordnance Survey Ireland (OSi).
SEMANTICS Posters&Demos 2018
Jeremy Debattista, Christoph Lange, Sören Auer, Dominic
Cortis: Evaluating the quality of the LOD cloud: An
empirical investigation. Semantic Web 9(6): 859-901
(2018)
Staging
Characterization
Mappings in Use
Reuse
Matching
Align and Map
www.adaptcentre.ieSummary
• ADAPT – The Centre, its goals, and its themes
• Focus on Theme E about privacy and data management
• Recognized the need for authoritative interlinking for IPs
• Presented the work of Lucy McKenna on NAISC
• Presented some (mostly recent) research Linked Data
publication and interlinking within our Theme.
7/4/19 19

Linked Data Publication and Interlinking Research within the SFI funded ADAPT Centre

  • 1.
    Linked Data Publicationand Interlinking Research within the SFI funded ADAPT Centre Christophe Debruyne - ADAPT, Trinity College Dublin, Dublin, Ireland christophe.debruyne@adaptcentre.ie June 26th, 2019 @ LIBER LOD Workshop 2019 The ADAPT Centre is funded under the SFI Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund.
  • 2.
    www.adaptcentre.ieSFI ADAPT Centrefor Digital Content Technology • Supported by Science Foundation Ireland, ADAPT combines the expertise of researchers at four universities (TCD, DCU, UCD, and TU Dublin). ADAPT engages with over 140 organizations to produce ground-breaking digital content innovations. • The ADAPT Research Centre focuses on developing next generation digital technologies that aim to enhance people’s lives by helping to analyse, personalise and deliver digital data more effectively for businesses and individuals. • ADAPT research and technologies enable businesses in all sectors to harness global digital content and media technologies to achieve unprecedented customer and community engagement. 7/4/19 2
  • 3.
    www.adaptcentre.ieSFI ADAPT Centrefor Digital Content Technology Theme A: Understanding Global Content Theme B: Transforming Global Content Theme D: Interacting with Global Content Theme C: Personalizing the User Experience Theme E: Managing the Global Conversation 7/4/19 3 NLP and Machine Learning Multimodal Interaction And Search Translation Automation Personalization and AdaptivityData and Privacy Management
  • 4.
    www.adaptcentre.ieTheme E • WithinTheme E address the integration, management and optimization of privacy-aware data flows between users and content services. • There are 3 “strands” within the theme: 1. Content Metadata for Privacy Management à Led by Prof. Dave Lewis 2. Data Integration Mappings: User Interaction and Continuous Management à Led by Prof. Declan O’Sullivan 3. Data Governance and Data Value à Led by Dr. Rob Brennan 7/4/19 4
  • 5.
    www.adaptcentre.ieNAISC 7/4/19 5 NAISC –pronounced noshk – stands for Novel Authoritative Interlinking of Schema and Concepts. The work of Lucy McKenna – PhD student at the ADAPT Currently given a presentation in another session, but at LIBER. The remainder of the talk will be mostly about her research. Based on: • Lucy McKenna, Christophe Debruyne, Declan O'Sullivan: NAISC: An Authoritative Linked Data Interlinking Approach for the Library Domain. JCDL 2019 (to appear) • Lucy McKenna, Christophe Debruyne, Declan O'Sullivan: Modelling the Provenance of Linked Data Interlinks for the Library Domain. WWW (Companion Volume) 2019: 954-958 • Lucy McKenna, Christophe Debruyne, Declan O'Sullivan: Understanding the Position of Information Professionals with regards to Linked Data: A Survey of Libraries, Archives and Museums. JCDL 2018: 7-16
  • 6.
    www.adaptcentre.ieLibraries and LinkedData • Using Linked Data (LD) could improve the discoverability, searchability and interoperability of library data. • Uptake of LD in libraries is relatively slow. • Developed a LD interlinking approach for the library domain. • Aim: to improve LD interlinking accessibility for Information Professionals (IPs) such as librarians, archivists and cataloguers. 7/4/19 6
  • 7.
    www.adaptcentre.ieLD Survey forInformation Professionals • Online questionnaire - 50 Questions - 185 participants • Distributed to libraries, archives and museums. • Challenges identified included: • Lack of LD provenance data • Technologically complex LD tooling • LD tooling does not meet requirements of library • Difficulty interlinking and integrating data 7/4/19 7 Interlinking method suitable for IPs User friendly GUI supporting method Provenance information GOAL
  • 8.
    www.adaptcentre.ieNAISC Framework 7/4/19 8 Staging Characterization Mappingsin Use Reuse Matching Align and Map The focus is on the creation and management of interlinks.
  • 9.
    www.adaptcentre.ieNAISC Provenance Model Competencyquestion driven ontology engineering • Adoption of the PROV-O ontology • Minimal extension – introduction of classes for NAISC specific entities and activities • Prescription of knowledge organization – clever use of named graphs 7/4/19 9
  • 10.
    www.adaptcentre.ieThe Tool –The collections you want to interlink 7/4/19 10 See the video at https://www.scss.tcd.ie/~mckennl3/naisc/
  • 11.
    www.adaptcentre.ieThe Tool –Looking for secondary resources 7/4/19 11 See the video at https://www.scss.tcd.ie/~mckennl3/naisc/
  • 12.
    www.adaptcentre.ie The Tool –First find correspondences, then make relations explicit. 7/4/19 12 See the video at https://www.scss.tcd.ie/~mckennl3/naisc/
  • 13.
    www.adaptcentre.ieThe Tool 7/4/19 13 Seethe video at https://www.scss.tcd.ie/~mckennl3/naisc/
  • 14.
    www.adaptcentre.ie( Lucy’s) Futurework • Refine method and tool based on feedback, and conduct a final experiment. • Investigate how named graphs and RDF* affect IPs. How does this work fit in Theme E’s wider research agenda on Linked Data publishing and interlinking? 7/4/19 14
  • 15.
    www.adaptcentre.ieMapping Engineering andGovernance 7/4/19 15 Staging Characterization Mappings in Use Reuse Matching Align and Map Christophe Debruyne, Brian Walshe, Declan O'Sullivan: Towards a project centric metadata model and lifecycle for ontology mapping governance. iiWAS 2015: 50:1-50:10 ReuseCharacterizationStaging Scope Data Discovery Requirements Analysis Data Analysis Discover Evaluate Reuse? Matching Evaluate Execute Plan No Align and Map Create Alignment Prepare Render Mapping Application Yes Source Data Target Data Candidate Correspondences Alignments Mapping Stakeholders
  • 16.
    www.adaptcentre.ieMapping Engineering andGovernance 7/4/19 16 Research on data integration mappings: • Uplift: From non-RDF to RDF • Extend alignment and mapping languages • Traceable, transparent and self-contained mappings Christophe Debruyne, Harshvardhan J. Pandit, Dave Lewis, Declan O'Sullivan: Towards Generating Policy- Compliant Datasets. ICSC 2019: 199-203 Christophe Debruyne, Declan O'Sullivan: R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings. LDOW@WWW 2016 Staging Characterization Mappings in Use Reuse Matching Align and Map
  • 17.
    www.adaptcentre.ieMapping Engineering andGovernance 7/4/19 17 Visual representation to facilitate creation, management and understanding of uplift mappings. Broadens range of stakeholders who can create mappings. Ademar Crotti Junior, Christophe Debruyne, Declan O'Sullivan: Juma Uplift: Using a Block Metaphor for Representing Uplift Mappings. ICSC 2018: 211-218 Staging Characterization Mappings in Use Reuse Matching Align and Map
  • 18.
    www.adaptcentre.ieMapping Engineering andGovernance 7/4/19 18 Monitoring the quality of produced Linked Data datasets and the interlinks. Alan Meehan, Dimitris Kontokostas, Markus Freudenberg, Rob Brennan, Declan O'Sullivan: Validating Interlinks Between Linked Data Datasets with the SUMMR Methodology. OTM Conferences 2016: 654-672 Jeremy Debattista, Eamonn Clinton, Rob Brennan: Assessing the Quality of Geospatial Linked Data - Experiences from Ordnance Survey Ireland (OSi). SEMANTICS Posters&Demos 2018 Jeremy Debattista, Christoph Lange, Sören Auer, Dominic Cortis: Evaluating the quality of the LOD cloud: An empirical investigation. Semantic Web 9(6): 859-901 (2018) Staging Characterization Mappings in Use Reuse Matching Align and Map
  • 19.
    www.adaptcentre.ieSummary • ADAPT –The Centre, its goals, and its themes • Focus on Theme E about privacy and data management • Recognized the need for authoritative interlinking for IPs • Presented the work of Lucy McKenna on NAISC • Presented some (mostly recent) research Linked Data publication and interlinking within our Theme. 7/4/19 19