SlideShare a Scribd company logo
1 of 19
Download to read offline
RTÉ Content Discovery Project
Christophe Debruyne
c.debruyne@ria.ie
christophe.debruyne@insight-centre.orgchristophe.debruyne@insight-centre.org
MDN Workshop -- 4th of June 2014
Outline
• Context
• Goal and Challenges of the RTÉ Content Discovery Project
• Tasks and Data Annotation• Tasks and Data Annotation
• EBU Core – Identification of problems
• Addressing the issues
• Using the ontology
• Conclusions and Recommendations
Context
RTÉ, Ireland's National
Television and Radio
Broadcaster
National trusted digital
repository for Ireland's
social and cultural data.
Centre for Data Analytics
Documents
Television
Radio
Stills
Linking and preserving data
held by Irish Institutions with
central internet access point.
• Standards
• Cataloguing
• Archiving
• Preservation
• Insight @ NUIG = DERI
• Semantic Technologies
• Linked Data
• Data Analytics Platform
Goal of the RTÉ Content
Discovery Project
• Discover implicit knowledge
• across the different archives
• and the Web of Data
• To facilitate internal workflows (e.g., search)
• For wider reuse and repackaging RTÉ’s
Documents
Television
Radio
Stills
• For wider reuse and repackaging RTÉ’s
information
• Challenges
• Heterogeneous databases
• Different guidelines and practices
• Legacy data (from previous systems)
• … “Linking Open Data cloud diagram,
by R. Cyganiak and A. Jentzsch.
http://lod-cloud.net/”
Part of a wider ambition …
OUTCOMES FOR RTÉ
RTÉ Content Discovery
In this presentation we focus on Television and Radio archives
Documents Television Radio Stills
• In this presentation we focus on Television and Radio archives
• The Television and Radio archives
• Are maintained on two different instances of the same system
• A system that is EBU Core “compatible”
• Different content, different guidelines, …
Three main tasks
• Annotate the data.
• Using relevant standards, ontologies and vocabularies.
• Resource Description Framework (RDF).
• Obtain an integrated view of the different archives by
creating links between the RDF representations of RTÉ’s
archival assets across the different archives.
• Apply advanced methods for discovering related data for a
given subject in external sources such as the Linked Data
Cloud.
Data annotation
Relational
Database
D2RQ RDF Dump Triplestore
Television
Radio
Map symbols of
database to
predicates
(relations and
concepts) in
chosen ontologies
/ vocabularies
Use D2RQ to
generate RDF
dump
Store RDF dump in
adequate triple
store (Jena TDB)
Which ontologies?
• Dublin Core, DC Terms
• Foaf
• EBU Core OWL
• …
EBU Core OWL
• The RTÉ Content Discovery platform will rely on Semantic
Web technologies to reason. Ontologies will therefore need
to be correct.
• But … while adopting the EBU Core OWL ontology, several
problems where identified.
• We contacted EBU to resolve these issues.
• We provide an overview of some of these problems.
Problems
• (1) Forgotten concept unions
• The property ebucore:description has multiple domain axioms.
<rdfs:domain rdf:resource="&ebu;BusinessObject"/>
<rdfs:domain rdf:resource="&ebu;MediaResource"/>
• Unintentionally the wrong implicit information can be inferred.• Unintentionally the wrong implicit information can be inferred.
• (2a) Property unsatisfiability – via class axioms
<owl:Class rdf:about="&ebu;BusinessObject">
… <owl:disjointWith rdf:resource="&ebu;Resource"/> …
</owl:Class>
• Because of (1) and (2), the property description could not be
used
Problems
• (2b) Property unsatisfiability – role hierarchies and datatypes
• Duration has the range xsd:string
• The subproperties of duration have other ranges (e.g., double in
the case of duration in edit units)
• Because each subproperty also inherits the range of the• Because each subproperty also inherits the range of the
superproperty, all instances in the object of that property must
be at the same time a string, and a double. This type conflict
results in a contradiction.
• With (2a) and (2b) we identified 40 properties that lead to
problems.
Problems
• (3) Inconsistencies between formal and informal definitions
• BusinessObject is defined as: "An image, a document, an annotation
[…], a tag […], or an audiovisual media resource […]. Other types of
BusinessObjects may be defined as subclasses.“BusinessObjects may be defined as subclasses.“
• Resource is defined as: "A manifestation of a BusinessObject." and
disjoint with BusinessObjects. Meaning no individual can be an
element of BusinessObjects and Resources at the same time.
• The domain of a title is BusinessObject, yet, it’s definition is:
"Specifies the title or name given to the resource. […]"
Problems
• (4) User readable labels
• Many different properties have the same human readable label,
which could confuse the end user – e.g., when generating an
Interface.
• E.g., there were 11 properties with the label “Name”• E.g., there were 11 properties with the label “Name”
• Some properties had empty labels
• (5) Roles – Loss of context
• Agents were related to Business Objects (BO)
• Agents were related to a Role
• But … a role did not relate to agents in relationship with a BO
• This lead to a loss of context.
Addressing the issues
• Problems were addressed over email.
• The discussions are “lost”, traces are only known to us …
• The ontology-engineering activities of EBU Core should adopt
appropriate methods and tools for collaboration.
• Participation of others
• Traceability (!)
• The ontology is still being developed as we go along, and we
have been able to make (parts of it) work…
Using the ontology
Conclusions and Recommendations
• RTÉ Archives aims at a wider reuse and repackaging of their
archival content on digital platforms through the innovative
use of Semantic and Linked Data technologies.
• We adopted the EBU Core OWL ontology for annotating the
television and radio archives, yet identified some issues.
• We adopted the EBU Core OWL ontology for annotating the
television and radio archives, yet identified some issues.
• We collaborated on resolving those issues together with EBU
• However, we feel that appropriate collaborative methods and
tools should be adopted to facilitate the ontology-
engineering process and – more importantly – enable other
to participate AND have visible traceability of the decisions.
References
• D2RQ, http://d2rq.org/
• Digital Repository of Ireland, http://www.dri.ie/
• Insight, http://www.insight-centre.org/• Insight, http://www.insight-centre.org/
• Jena TDB, http://jena.apache.org/documentation/tdb/
• RTÉ Archives, http://www.rte.ie/archives

More Related Content

Viewers also liked

Semantic Interoperation of Information Systems by Evolving Ontologies through...
Semantic Interoperation of Information Systems by Evolving Ontologies through...Semantic Interoperation of Information Systems by Evolving Ontologies through...
Semantic Interoperation of Information Systems by Evolving Ontologies through...Christophe Debruyne
 
Publishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpacePublishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpaceChristophe Debruyne
 
Using Semantic Technologies to Create Virtual Families from Historical Vital ...
Using Semantic Technologies to Create Virtual Families from Historical Vital ...Using Semantic Technologies to Create Virtual Families from Historical Vital ...
Using Semantic Technologies to Create Virtual Families from Historical Vital ...Christophe Debruyne
 

Viewers also liked (7)

Semantic Interoperation of Information Systems by Evolving Ontologies through...
Semantic Interoperation of Information Systems by Evolving Ontologies through...Semantic Interoperation of Information Systems by Evolving Ontologies through...
Semantic Interoperation of Information Systems by Evolving Ontologies through...
 
Indulge At The Jupiter Hotel 2011
Indulge At The Jupiter Hotel 2011Indulge At The Jupiter Hotel 2011
Indulge At The Jupiter Hotel 2011
 
Publishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpacePublishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information Space
 
Using Semantic Technologies to Create Virtual Families from Historical Vital ...
Using Semantic Technologies to Create Virtual Families from Historical Vital ...Using Semantic Technologies to Create Virtual Families from Historical Vital ...
Using Semantic Technologies to Create Virtual Families from Historical Vital ...
 
Award Maker 4 Teachers
Award Maker 4 TeachersAward Maker 4 Teachers
Award Maker 4 Teachers
 
Handwriting Worksheet Maker
Handwriting Worksheet MakerHandwriting Worksheet Maker
Handwriting Worksheet Maker
 
Nameplate Maker 4 Teachers
Nameplate Maker 4 TeachersNameplate Maker 4 Teachers
Nameplate Maker 4 Teachers
 

Similar to 2014 06-04-presentation-mdn-2014

What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
Global lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataGlobal lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataMinerva Lin
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked DataAdrian Stevenson
 
Thinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowThinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowTerry Reese
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesDorothea Salo
 
Building OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsBuilding OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsMelanie Courtot
 
RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)Vladimir Alexiev, PhD, PMP
 
Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model   Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model Valentine Charles
 
Alexandria winer20100623
Alexandria winer20100623Alexandria winer20100623
Alexandria winer20100623Dov Winer
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
ArchivesSpace: Building a Next-Generation Archives Management Tool
ArchivesSpace: Building a Next-Generation Archives Management ToolArchivesSpace: Building a Next-Generation Archives Management Tool
ArchivesSpace: Building a Next-Generation Archives Management ToolMark Matienzo
 
FDO as building block for digitization technology stacks
FDO as building block for digitization technology stacksFDO as building block for digitization technology stacks
FDO as building block for digitization technology stacksRaul Palma
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebPascal-Nicolas Becker
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎Libcorpio
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 

Similar to 2014 06-04-presentation-mdn-2014 (20)

What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Global lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataGlobal lodlam_communities and open cultural data
Global lodlam_communities and open cultural data
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
 
Thinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage WorkflowThinking about Preservation: OSUL Content Manage Workflow
Thinking about Preservation: OSUL Content Manage Workflow
 
Digital Content Management
Digital Content ManagementDigital Content Management
Digital Content Management
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archives
 
Building OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsBuilding OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web tools
 
RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)RDF Data and Image Annotations in ResearchSpace (slides)
RDF Data and Image Annotations in ResearchSpace (slides)
 
Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model   Fondly Collisions: Archival hierarchy and the Europeana Data Model
Fondly Collisions: Archival hierarchy and the Europeana Data Model
 
Alexandria winer20100623
Alexandria winer20100623Alexandria winer20100623
Alexandria winer20100623
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
ArchivesSpace: Building a Next-Generation Archives Management Tool
ArchivesSpace: Building a Next-Generation Archives Management ToolArchivesSpace: Building a Next-Generation Archives Management Tool
ArchivesSpace: Building a Next-Generation Archives Management Tool
 
FDO as building block for digitization technology stacks
FDO as building block for digitization technology stacksFDO as building block for digitization technology stacks
FDO as building block for digitization technology stacks
 
NISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to RealityNISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to Reality
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
 
The future of the DCC
The future of the DCCThe future of the DCC
The future of the DCC
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 

More from Christophe Debruyne

One year of DALIDA Data Literacy Workshops for Adults: a Report
One year of DALIDA Data Literacy Workshops for Adults: a ReportOne year of DALIDA Data Literacy Workshops for Adults: a Report
One year of DALIDA Data Literacy Workshops for Adults: a ReportChristophe Debruyne
 
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologieProjet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologieChristophe Debruyne
 
Knowledge Graphs: Concept, mogelijkheden en aandachtspunten
Knowledge Graphs: Concept, mogelijkheden en aandachtspuntenKnowledge Graphs: Concept, mogelijkheden en aandachtspunten
Knowledge Graphs: Concept, mogelijkheden en aandachtspuntenChristophe Debruyne
 
Reusable SHACL Constraint Components for Validating Geospatial Linked Data
Reusable SHACL Constraint Components for Validating Geospatial Linked DataReusable SHACL Constraint Components for Validating Geospatial Linked Data
Reusable SHACL Constraint Components for Validating Geospatial Linked DataChristophe Debruyne
 
Hidden Amongst the Data: the Beyond 2022 Knowledge Graph
Hidden Amongst the Data: the Beyond 2022 Knowledge GraphHidden Amongst the Data: the Beyond 2022 Knowledge Graph
Hidden Amongst the Data: the Beyond 2022 Knowledge GraphChristophe Debruyne
 
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainFacilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainChristophe Debruyne
 
Using Maps for Interlinking Geospatial Linked Data
Using Maps for Interlinking Geospatial Linked DataUsing Maps for Interlinking Geospatial Linked Data
Using Maps for Interlinking Geospatial Linked DataChristophe Debruyne
 
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Christophe Debruyne
 
Towards Generating Policy-compliant Datasets (poster)
Towards GeneratingPolicy-compliant Datasets (poster)Towards GeneratingPolicy-compliant Datasets (poster)
Towards Generating Policy-compliant Datasets (poster)Christophe Debruyne
 
Towards Generating Policy-compliant Datasets
Towards Generating Policy-compliant DatasetsTowards Generating Policy-compliant Datasets
Towards Generating Policy-compliant DatasetsChristophe Debruyne
 
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsGenerating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsChristophe Debruyne
 
Uplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RMLUplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RMLChristophe Debruyne
 
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...Christophe Debruyne
 
Client-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
Client-side Processing of GeoSPARQL Functions with Triple Pattern FragmentsClient-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
Client-side Processing of GeoSPARQL Functions with Triple Pattern FragmentsChristophe Debruyne
 
Serving Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataServing Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataChristophe Debruyne
 
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)Christophe Debruyne
 
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML MappingsR2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML MappingsChristophe Debruyne
 
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...Christophe Debruyne
 
Creating and Consuming Metadata from Transcribed Historical Vital Records for...
Creating and Consuming Metadata from Transcribed Historical Vital Records for...Creating and Consuming Metadata from Transcribed Historical Vital Records for...
Creating and Consuming Metadata from Transcribed Historical Vital Records for...Christophe Debruyne
 

More from Christophe Debruyne (20)

One year of DALIDA Data Literacy Workshops for Adults: a Report
One year of DALIDA Data Literacy Workshops for Adults: a ReportOne year of DALIDA Data Literacy Workshops for Adults: a Report
One year of DALIDA Data Literacy Workshops for Adults: a Report
 
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologieProjet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
 
Knowledge Graphs: Concept, mogelijkheden en aandachtspunten
Knowledge Graphs: Concept, mogelijkheden en aandachtspuntenKnowledge Graphs: Concept, mogelijkheden en aandachtspunten
Knowledge Graphs: Concept, mogelijkheden en aandachtspunten
 
Reusable SHACL Constraint Components for Validating Geospatial Linked Data
Reusable SHACL Constraint Components for Validating Geospatial Linked DataReusable SHACL Constraint Components for Validating Geospatial Linked Data
Reusable SHACL Constraint Components for Validating Geospatial Linked Data
 
Hidden Amongst the Data: the Beyond 2022 Knowledge Graph
Hidden Amongst the Data: the Beyond 2022 Knowledge GraphHidden Amongst the Data: the Beyond 2022 Knowledge Graph
Hidden Amongst the Data: the Beyond 2022 Knowledge Graph
 
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainFacilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
 
Using Maps for Interlinking Geospatial Linked Data
Using Maps for Interlinking Geospatial Linked DataUsing Maps for Interlinking Geospatial Linked Data
Using Maps for Interlinking Geospatial Linked Data
 
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
 
Towards Generating Policy-compliant Datasets (poster)
Towards GeneratingPolicy-compliant Datasets (poster)Towards GeneratingPolicy-compliant Datasets (poster)
Towards Generating Policy-compliant Datasets (poster)
 
Towards Generating Policy-compliant Datasets
Towards Generating Policy-compliant DatasetsTowards Generating Policy-compliant Datasets
Towards Generating Policy-compliant Datasets
 
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsGenerating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
 
Uplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RMLUplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RML
 
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
 
Client-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
Client-side Processing of GeoSPARQL Functions with Triple Pattern FragmentsClient-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
Client-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
 
Serving Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataServing Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked Data
 
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
 
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML MappingsR2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
 
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
 
Creating and Consuming Metadata from Transcribed Historical Vital Records for...
Creating and Consuming Metadata from Transcribed Historical Vital Records for...Creating and Consuming Metadata from Transcribed Historical Vital Records for...
Creating and Consuming Metadata from Transcribed Historical Vital Records for...
 
What is Linked Data?
What is Linked Data?What is Linked Data?
What is Linked Data?
 

Recently uploaded

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Recently uploaded (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

2014 06-04-presentation-mdn-2014

  • 1. RTÉ Content Discovery Project Christophe Debruyne c.debruyne@ria.ie christophe.debruyne@insight-centre.orgchristophe.debruyne@insight-centre.org MDN Workshop -- 4th of June 2014
  • 2. Outline • Context • Goal and Challenges of the RTÉ Content Discovery Project • Tasks and Data Annotation• Tasks and Data Annotation • EBU Core – Identification of problems • Addressing the issues • Using the ontology • Conclusions and Recommendations
  • 3. Context RTÉ, Ireland's National Television and Radio Broadcaster National trusted digital repository for Ireland's social and cultural data. Centre for Data Analytics Documents Television Radio Stills Linking and preserving data held by Irish Institutions with central internet access point. • Standards • Cataloguing • Archiving • Preservation • Insight @ NUIG = DERI • Semantic Technologies • Linked Data • Data Analytics Platform
  • 4. Goal of the RTÉ Content Discovery Project • Discover implicit knowledge • across the different archives • and the Web of Data • To facilitate internal workflows (e.g., search) • For wider reuse and repackaging RTÉ’s Documents Television Radio Stills • For wider reuse and repackaging RTÉ’s information • Challenges • Heterogeneous databases • Different guidelines and practices • Legacy data (from previous systems) • … “Linking Open Data cloud diagram, by R. Cyganiak and A. Jentzsch. http://lod-cloud.net/”
  • 5. Part of a wider ambition …
  • 7. RTÉ Content Discovery In this presentation we focus on Television and Radio archives Documents Television Radio Stills • In this presentation we focus on Television and Radio archives • The Television and Radio archives • Are maintained on two different instances of the same system • A system that is EBU Core “compatible” • Different content, different guidelines, …
  • 8. Three main tasks • Annotate the data. • Using relevant standards, ontologies and vocabularies. • Resource Description Framework (RDF). • Obtain an integrated view of the different archives by creating links between the RDF representations of RTÉ’s archival assets across the different archives. • Apply advanced methods for discovering related data for a given subject in external sources such as the Linked Data Cloud.
  • 9. Data annotation Relational Database D2RQ RDF Dump Triplestore Television Radio Map symbols of database to predicates (relations and concepts) in chosen ontologies / vocabularies Use D2RQ to generate RDF dump Store RDF dump in adequate triple store (Jena TDB) Which ontologies? • Dublin Core, DC Terms • Foaf • EBU Core OWL • …
  • 10. EBU Core OWL • The RTÉ Content Discovery platform will rely on Semantic Web technologies to reason. Ontologies will therefore need to be correct. • But … while adopting the EBU Core OWL ontology, several problems where identified. • We contacted EBU to resolve these issues. • We provide an overview of some of these problems.
  • 11. Problems • (1) Forgotten concept unions • The property ebucore:description has multiple domain axioms. <rdfs:domain rdf:resource="&ebu;BusinessObject"/> <rdfs:domain rdf:resource="&ebu;MediaResource"/> • Unintentionally the wrong implicit information can be inferred.• Unintentionally the wrong implicit information can be inferred. • (2a) Property unsatisfiability – via class axioms <owl:Class rdf:about="&ebu;BusinessObject"> … <owl:disjointWith rdf:resource="&ebu;Resource"/> … </owl:Class> • Because of (1) and (2), the property description could not be used
  • 12. Problems • (2b) Property unsatisfiability – role hierarchies and datatypes • Duration has the range xsd:string • The subproperties of duration have other ranges (e.g., double in the case of duration in edit units) • Because each subproperty also inherits the range of the• Because each subproperty also inherits the range of the superproperty, all instances in the object of that property must be at the same time a string, and a double. This type conflict results in a contradiction. • With (2a) and (2b) we identified 40 properties that lead to problems.
  • 13. Problems • (3) Inconsistencies between formal and informal definitions • BusinessObject is defined as: "An image, a document, an annotation […], a tag […], or an audiovisual media resource […]. Other types of BusinessObjects may be defined as subclasses.“BusinessObjects may be defined as subclasses.“ • Resource is defined as: "A manifestation of a BusinessObject." and disjoint with BusinessObjects. Meaning no individual can be an element of BusinessObjects and Resources at the same time. • The domain of a title is BusinessObject, yet, it’s definition is: "Specifies the title or name given to the resource. […]"
  • 14. Problems • (4) User readable labels • Many different properties have the same human readable label, which could confuse the end user – e.g., when generating an Interface. • E.g., there were 11 properties with the label “Name”• E.g., there were 11 properties with the label “Name” • Some properties had empty labels • (5) Roles – Loss of context • Agents were related to Business Objects (BO) • Agents were related to a Role • But … a role did not relate to agents in relationship with a BO • This lead to a loss of context.
  • 15. Addressing the issues • Problems were addressed over email. • The discussions are “lost”, traces are only known to us … • The ontology-engineering activities of EBU Core should adopt appropriate methods and tools for collaboration. • Participation of others • Traceability (!) • The ontology is still being developed as we go along, and we have been able to make (parts of it) work…
  • 17.
  • 18. Conclusions and Recommendations • RTÉ Archives aims at a wider reuse and repackaging of their archival content on digital platforms through the innovative use of Semantic and Linked Data technologies. • We adopted the EBU Core OWL ontology for annotating the television and radio archives, yet identified some issues. • We adopted the EBU Core OWL ontology for annotating the television and radio archives, yet identified some issues. • We collaborated on resolving those issues together with EBU • However, we feel that appropriate collaborative methods and tools should be adopted to facilitate the ontology- engineering process and – more importantly – enable other to participate AND have visible traceability of the decisions.
  • 19. References • D2RQ, http://d2rq.org/ • Digital Repository of Ireland, http://www.dri.ie/ • Insight, http://www.insight-centre.org/• Insight, http://www.insight-centre.org/ • Jena TDB, http://jena.apache.org/documentation/tdb/ • RTÉ Archives, http://www.rte.ie/archives