The Storyline Ontology

Jeremy Tarling
Senior Data Architect, BBC News
@jeremytarling
The Storyline Ontology
• the problem space
• collaborative model development
• using it with news content
• prepared data ...
•story telling is
fragmented into
articles
•links between
articles are handcurated by
journalists
•aggregations around
top...
collaborative model development
www.purl.org/ontology/storyline
title
comment

synopsis
rdfs:
domain

rdfs:
domain

Topic

rdfs:
range

rdfs:
domain

topi...
1. annotate assets with topics
Preferred
Label

CPS asset

CPS asset

preferred
Label

about

Slug

slug
mentions
CPS asse...
2. annotate assets with storylines
CPS asset

CPS asset

Title

about
about

title
Storyline

synopsis

CPS asset

about

...
3. annotate assets with storylines
and storylines with topics
Title

CPS asset

CPS asset

Synopsis

title
about
about

sy...
4. model complex storylines
#newsHack

• APIs are ready
• storylines and events have been made
• (tell us if you want more)
• content from multiple so...
Storyline - for #newsHACK 2013 - Jeremy Tarling
Storyline - for #newsHACK 2013 - Jeremy Tarling
Upcoming SlideShare
Loading in …5
×

Storyline - for #newsHACK 2013 - Jeremy Tarling

1,326 views

Published on

Presentation by @jeremytarling for #newsHACK 2013.

0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,326
On SlideShare
0
From Embeds
0
Number of Embeds
281
Actions
Shares
0
Downloads
17
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide
  • 8 minutes on the storyline ontology
    explain the problem space – how and did we come up with the storyline model
    look at applying the ontology to news content, starting simple then moving to more complex storylines
    finally point you at the data we have prepared, and explain how we can make more for you if needed
  • BBC News publishes ~500 articles + other content types every day
    manually maintained topic aggregations (indexes)
    manually maintained links between articles
    labour intensive
  • repetitive (much of an article will be repetition - 'clone' in CPS
    the story becomes fragmented - many articles and other content types to one storyline
    instead why not aggregate content as developments of updates in a storyline
  • collaboration - a broadcaster, a newspaper, a wire service, a search engine
    ran workshops, shared ideas, tested on real data
    produced a model, published it under a CC-SA license
    most of these guys are here today!
  • here’s v0.1 the model in it’s entirety
    best understood through it’s application to content
    take you through four ways the model can be used
  • 1 – basic annotation of content with people, places, orgs, etc
    allows dynamic aggregation by topic
    BBC already doing this for sport – recent articles about Man Utd
    others – Guardian?
  • 2 - as before, but here we have abstracted the Storyline as a concept
    ‘content about Edward Snowden’s assylum bid’
    some News orgs already doing this, eg itv.com/news
  • 3 – taking this firther we can annotate a Storyline with People, Places, Orgs and Themes
    allows for more accurate statements about content – not about Chine, but about an event that took place in China
    makes for useful onward journeys if we expose these 2-step annotations as links
  • 4 - a Storyline can have Slots
    Slots contain Events
    Slots can be related to each other (e.g. ‘follows’) to build narrative sequences of Events, or Storylines
    this is the space we really want to explore over the NewsHack event
  • over the past few weeks we have been seeding the BBC News Prototyping platform (Juicer) with these complex (level 4) storylines
    we’ve also annotated (‘tagged’) articles with events in these storylines
    list of the storylines we have created here:
    if there’s a story you love to build an app for that’s not here then tweet us with the #newsHack hash tag and myself or one of the other data monkeys will make it for you
  • Storyline - for #newsHACK 2013 - Jeremy Tarling

    1. 1. The Storyline Ontology Jeremy Tarling Senior Data Architect, BBC News @jeremytarling
    2. 2. The Storyline Ontology • the problem space • collaborative model development • using it with news content • prepared data for #newsHACK
    3. 3. •story telling is fragmented into articles •links between articles are handcurated by journalists •aggregations around topics (indexes) are hand-curated
    4. 4. collaborative model development
    5. 5. www.purl.org/ontology/storyline title comment synopsis rdfs: domain rdfs: domain Topic rdfs: range rdfs: domain topic Storyline rdfs: domain rdfs: subClassOf Storyline Component rdfs: subClassOf Event rdfs:domain rdfs:range attributedTo rdfs:range contais rdfs:domain rdfs:domain Attribution update rdfs: range rdfs:range Development follows rdfs:domain rdfs:domain time rdfs:domain index (ordinal)
    6. 6. 1. annotate assets with topics Preferred Label CPS asset CPS asset preferred Label about Slug slug mentions CPS asset etc topic short Label disambiguation Hint Short Label Disambiguation Hint
    7. 7. 2. annotate assets with storylines CPS asset CPS asset Title about about title Storyline synopsis CPS asset about Synopsis
    8. 8. 3. annotate assets with storylines and storylines with topics Title CPS asset CPS asset Synopsis title about about synopsis Storyline topic CPS asset Person about topic topic topic Organisation Theme Place
    9. 9. 4. model complex storylines
    10. 10. #newsHack • APIs are ready • storylines and events have been made • (tell us if you want more) • content from multiple sources

    ×