IPTC Semantic Web November 2010


IPTC is exploring the use of the Semantic Web for the news industry.

This is the report I gave to the IPTC's November 2010 meeting in Rome. We are working on creating Linked Data (using SKOS), exploring the use of metadata in HTML (using RDFa and microformats) and creating a news ontology based on NewsML-G2 (using OWL).

Published in: Technology, Education

  1. IPTC and The Semantic Web
     Stuart Myles, Associated Press, 3rd November 2010
  2. IPTC’s Semantic Web Agenda
     • IPTC news codes and media codes as Linked Data
     • Metadata in HTML
     • RDFa and microformats for news
     • A news ontology
     Both Upper Case and lower case semweb
     © 2010 IPTC. All rights reserved.
  3. Semantic Web and Linked Data
  4. IPTC Linked Data
     • Turn IPTC subject codes into Linked Data
       – Connect related data across the web using URIs, HTTP & RDF
       – A set of principles from Tim Berners-Lee
         • Use URIs as names for things
         • Use HTTP URIs so that people can look up those names
         • When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL)
         • Include links to other URIs, so that they can discover more things
     • Apply the principles to IPTC’s subject codes
       – Already published as XML (G2 Knowledge Items)
       – And as HTML
       – The plan: convert XML into RDF, using SKOS and DC
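The plan on this slide (convert the XML into RDF, using SKOS) can be sketched roughly as follows. This is a minimal illustration, not IPTC's actual converter: the sample XML is a simplified, namespace-free stand-in for a G2 Knowledge Item, and the base URI and the function name `knowledge_item_to_triples` are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"
# Assumed base URI for subject-code concepts (illustrative only).
BASE = "http://cv.iptc.org/newscodes/subjectcode/"

# Simplified, hypothetical stand-in for a G2 Knowledge Item.
SAMPLE_XML = """
<knowledgeItem>
  <conceptSet>
    <concept>
      <conceptId qcode="subj:15000000"/>
      <name xml:lang="en-GB">sport</name>
    </concept>
  </conceptSet>
</knowledgeItem>
"""


def knowledge_item_to_triples(xml_text):
    """Return (subject, predicate, object) triples for each concept."""
    triples = []
    root = ET.fromstring(xml_text)
    for concept in root.iter("concept"):
        qcode = concept.find("conceptId").get("qcode")
        code = qcode.split(":", 1)[1]  # drop the "subj:" scheme alias
        subject = BASE + code  # an HTTP URI names the concept
        triples.append((subject, RDF_TYPE, SKOS + "Concept"))
        for name in concept.findall("name"):
            # Labels are kept as (text, language tag) pairs.
            label = (name.text, name.get(XML_LANG))
            triples.append((subject, SKOS + "prefLabel", label))
    return triples


for triple in knowledge_item_to_triples(SAMPLE_XML):
    print(triple)
```

Each concept becomes a `skos:Concept` named by an HTTP URI, which lines up with the first two principles listed on the slide.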
  5. TLAs and FLAs: AKA RDF? SKOS? DC?
     • RDF is a single model - Subject Predicate Object
     • With multiple syntaxes
       – We selected RDF/XML and RDF/Turtle
     • And multiple “vocabularies”
       – Such as SKOS, Dublin Core
     • SKOS = Simple Knowledge Organization System
       – Designed for representing thesauri and classification schemes
     • The Semantic Web “way” is
       – Use existing vocabularies as much as possible
       – When you invent a new term, link it to existing terms
     • We decided to use SKOS and DC as the main vocabs
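To make the "single model, multiple syntaxes" point concrete, here is a small sketch that holds statements as subject-predicate-object tuples and writes them out in a Turtle-like form. The concept URI, the helper names `term` and `to_turtle`, and the rule that any string starting with `http://` is a URI are all assumptions for illustration; a real project would more likely use an RDF library.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
PREFIXES = {"skos": SKOS}

# Assumed concept and scheme URIs (illustrative only).
CONCEPT = "http://cv.iptc.org/newscodes/subjectcode/15000000"
SCHEME = "http://cv.iptc.org/newscodes/subjectcode/"

# The RDF model: every statement is a (subject, predicate, object) triple.
TRIPLES = [
    (CONCEPT, SKOS + "prefLabel", "sport"),
    (CONCEPT, SKOS + "inScheme", SCHEME),
]


def term(value):
    """Render a URI as a prefixed name or <uri>; anything else as a literal."""
    if value.startswith("http://"):
        for prefix, namespace in PREFIXES.items():
            if value.startswith(namespace):
                return prefix + ":" + value[len(namespace):]
        return "<" + value + ">"
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return '"' + escaped + '"'


def to_turtle(triples):
    """Serialize triples as one Turtle statement per line."""
    lines = ["@prefix %s: <%s> ." % (p, ns) for p, ns in PREFIXES.items()]
    for s, p, o in triples:
        lines.append("%s %s %s ." % (term(s), term(p), term(o)))
    return "\n".join(lines)


print(to_turtle(TRIPLES))
```

The same list of tuples could be fed to a different writer to produce RDF/XML; only the syntax changes, not the triples.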
  6. Mapping IPTC Codes to RDF
     • In general, IPTC KnowledgeItems map well to RDF
       – SKOS concepts
       – Dublin Core properties
     • Certain KI properties don’t have a direct mapping
       – Created and updated timestamps of KnowledgeItem properties
     • Difficult to determine more advanced mappings
       – SKOS wiki had some documentation
       – SKOS email list seems dormant
       – SemanticOverflow is a great way to get questions answered
         • modified-properties-to-skos-do-i-need-to-reify
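The question referenced on this slide asks whether per-property timestamps require reification. As a sketch of what that would involve (one possible approach, not necessarily the one IPTC adopted), standard RDF reification describes a triple as an `rdf:Statement` and attaches the timestamp to that statement. The function name, the blank-node label and the timestamp value below are made up for the example.

```python
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DCTERMS = "http://purl.org/dc/terms/"
SKOS = "http://www.w3.org/2004/02/skos/core#"


def reify_with_modified(statement_id, triple, modified):
    """Describe `triple` as an rdf:Statement and attach a dcterms:modified value."""
    subject, predicate, obj = triple
    return [
        (statement_id, RDF + "type", RDF + "Statement"),
        (statement_id, RDF + "subject", subject),
        (statement_id, RDF + "predicate", predicate),
        (statement_id, RDF + "object", obj),
        (statement_id, DCTERMS + "modified", modified),
    ]


# Hypothetical label statement and timestamp, for illustration only.
label_triple = (
    "http://cv.iptc.org/newscodes/subjectcode/15000000",
    SKOS + "prefLabel",
    "sport",
)
reified = reify_with_modified("_:stmt1", label_triple, "2010-11-03T00:00:00Z")
for row in reified:
    print(row)
```

One timestamped property turns into five triples, which gives a sense of why this kind of mapping is harder to settle than the basic SKOS one.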
  7. Linking from IPTC Codes to Other Schemes
     “Include links to other URIs, so that they can discover more things”
     • Linking is the heart of linked data
     • But linking is more like mapping
       – owl:sameAs seems to have unintended consequences
       – SKOS’s mapping properties offer a range of options
         • closeMatch, exactMatch, broadMatch, narrowMatch, relatedMatch
     • We decided to map the 17 top level IPTC subject codes to DBpedia
       – Some top level terms are really “umbrella” terms – difficult to map to a single equivalent
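A mapping of this kind can be expressed as one triple per link, using one of the five SKOS mapping properties named on the slide. The sketch below is illustrative only: the base URIs, the helper `mapping_triple`, and the two example mappings are assumptions, not the mappings IPTC actually published.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
# Assumed base URIs (illustrative only).
IPTC_BASE = "http://cv.iptc.org/newscodes/subjectcode/"
DBPEDIA_BASE = "http://dbpedia.org/resource/"

# The five SKOS mapping properties listed on the slide.
MAPPING_PROPERTIES = (
    "closeMatch",
    "exactMatch",
    "broadMatch",
    "narrowMatch",
    "relatedMatch",
)


def mapping_triple(iptc_code, match, dbpedia_resource):
    """Build one SKOS mapping triple from an IPTC code to a DBpedia resource."""
    if match not in MAPPING_PROPERTIES:
        raise ValueError("not a SKOS mapping property: " + match)
    return (
        IPTC_BASE + iptc_code,
        SKOS + match,
        DBPEDIA_BASE + dbpedia_resource,
    )


# Hypothetical mappings: a narrow term gets closeMatch, while an
# "umbrella" term only gets the weaker relatedMatch.
links = [
    mapping_triple("15000000", "closeMatch", "Sport"),
    mapping_triple("01000000", "relatedMatch", "Culture"),
]
for link in links:
    print(link)
```

Choosing a weaker property such as `relatedMatch` for umbrella terms avoids asserting an equivalence that a single DBpedia resource cannot carry.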
  8. IPTC Linked Data
     • Demonstration of the Linked Data versions of IPTC News Codes
     • Discussion of Proposed IPTC/MINDS Linked Data project
  9. IPTC and Linked Data: Next Steps
     • Request a more complete Linked Data mapping of IPTC Subject Codes and Media Topics
     • Reach out to the broader Semantic Web and news communities for feedback and collaboration
     • Participate with MINDS on the potential of Linked Data for the news business