Introducing linked data into BBC News online

A presentation for the 2013 AGM of the IPTC held in Paris on June 24-26

  • - UK's most popular news website: 6 million unique browsers every day (3rd biggest site in the UK after Google and Facebook) - publish around 500 articles every day: local, national, global - publish in 27 languages as World Service (+ 2 UK languages alongside English) - hundreds of journalists, many working cross-media (TV/radio/online)
  • - articles created in a home-grown Content Management System - flat page publishing via FTP: good for high-load events, but limits our UX and data potential
  • - need to minimise impact on journalists - integration with existing tools and workflow as much as possible
  • - pilot: can we automate the production of the local news region sub-index pages? (maintaining these pages is currently a manual task) - GET articles about or mentioning places that fall within the BBC News region
  • - a simple ontology for people, organisations, places and intangibles (themes) and their intersection with events - based on rNews, the Event ontology and PA's SNaP Stuff ontology - annotate articles with events, where the event:place is Birmingham etc.
  • - IPTC rNews terms in RDFa - basic publishing metadata in the <head> for rich snippets - linked open data in the body
  • - immediate results - rich snippets for articles - apparently better ranking by topic (anecdotal)
  • - we introduced the change in the first week of May - by the end of May we were seeing some positive press coverage; people were noticing
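
The pilot described in the notes above (collecting articles about or mentioning places that fall within a news region) can be sketched as a simple set intersection. This is only an illustration under stated assumptions, not the BBC's implementation: the `Article` class, the `region_index` function, the example URLs and the place names other than Birmingham are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Article:
    """Hypothetical stand-in for an annotated news article."""
    url: str
    about: set = field(default_factory=set)     # places the article is primarily about
    mentions: set = field(default_factory=set)  # places the article only mentions


def region_index(articles, region_places):
    """Return URLs of articles about or mentioning any place in the region."""
    selected = []
    for article in articles:
        tagged_places = article.about | article.mentions
        # Keep the article if at least one tagged place falls within the region.
        if tagged_places & region_places:
            selected.append(article.url)
    return selected


# Hypothetical region definition and articles, for illustration only.
REGION_PLACES = {"Birmingham", "Coventry"}

ARTICLES = [
    Article("/news/a1", about={"Birmingham"}),
    Article("/news/a2", about={"London"}, mentions={"Coventry"}),
    Article("/news/a3", about={"Paris"}),
]

if __name__ == "__main__":
    print(region_index(ARTICLES, REGION_PLACES))
```

Treating "about" and "mentions" as separate sets keeps open the option of ranking primary coverage above passing mentions, which the sketch does not attempt.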

Transcript

  • 1. Linked Data in BBC News IPTC AGM June 25th 2013
  • 2. www.bbc.co.uk/news
  • 3. moving to linked data • moving from static HTML to dynamic, responsive site • introducing linked data to power content aggregations around related topics • starting to embed linked open data in every page as RDFa • using the IPTC rNews vocabulary to describe content in a machine-readable way
  • 4. impact on journalists • annotating (“tagging”) content with topics • tool embedded into existing CMS • concept extraction/NLP for topic suggestion • journalists accept/reject suggested topics for annotation
  • 5. pilot - local indexes
  • 6. learning from the pilot • generally - it works • but duplication for big events • also need pinning • concept extraction poor • journalists gaming the system
  • 7. corenews model
  • 8. pilot - publishing RDFa • using RDFa + rNews to embed machine-readable metadata in article source code • discoverability: rich snippets + better ranking • publish Linked Open Data: • <articleURI> rdf:type rnews:Article • <articleURI> rnews:about <thingURI> • etc...
  • 9. learning from the pilot
  • 10. learning from the pilot
  • 11. next steps • rolling out tagging to journalists throughout BBC News • making better use of rNews/RDFa - full mark-up integration • piloting the use of storyline in data-driven news
  • 12. more info • http://www.bbc.co.uk/blogs/internet/posts/News-Linked-Data-Ontology • http://www.bbc.co.uk/ontologies/news/2013-05-01.shtml • jeremy.tarling@bbc.co.uk • twitter: @jeremytarling
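
The triple pattern shown on slide 8 (`<articleURI> rdf:type rnews:Article`, `<articleURI> rnews:about <thingURI>`) can be illustrated with a few lines of plain Python that build the statements and print them as N-Triples. This is a rough sketch, not the markup the BBC publishes: the function names and the `example.org` URIs are hypothetical, and the rNews namespace URI used here is an assumption that should be checked against the IPTC documentation.

```python
# Standard RDF "type" property URI.
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
# Assumed rNews namespace; verify against the IPTC rNews documentation.
RNEWS = "http://iptc.org/std/rNews/2011-10-07#"


def article_triples(article_uri, thing_uris):
    """Build (subject, predicate, object) triples following the slide 8 pattern."""
    triples = [(article_uri, RDF_TYPE, RNEWS + "Article")]
    for thing_uri in thing_uris:
        triples.append((article_uri, RNEWS + "about", thing_uri))
    return triples


def to_ntriples(triples):
    """Serialise URI-only triples as N-Triples, one statement per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


if __name__ == "__main__":
    # Hypothetical article and topic URIs, for illustration only.
    triples = article_triples(
        "http://example.org/news/article-1",
        ["http://example.org/things/birmingham"],
    )
    print(to_ntriples(triples))
```

In the pilot the same statements are carried as RDFa attributes inside the article HTML rather than as a separate file; the sketch only shows the shape of the data that a consumer would extract.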