Building coherence at bbc.co.uk
Tom Scott
Online Information Conference


  1. Building coherence at bbc.co.uk Tom Scott
  2. 8 UK TV channels; 10 UK radio stations; 5 national TV and radio; 40 local radio stations; plus the World Service (in 32 languages)
  3. ...and a website... since 1994
  4. ... that all makes for a big archive!
  5. Historically the BBC has created a series of microsites – each coherent in its own right but not across the breadth of BBC content. Radio 4 Big Bang http://www.bbc.co.uk/radio4/bigbang/
  6. Which means I can’t find everything about “CERN”
  7. Which means I can’t find everything about “CERN”
  8. ...Paul Weller... Paul Weller http://www.flickr.com/photos/johnbullas/3410330728/
  9. ...Lion...
  10. ...or even Jeremy Clarkson
  11. I can’t follow my nose, I can’t browse by meaning, from one page to the next following a semantic thread Snickers http://www.flickr.com/photos/homer4k/386980596/
  12. But things are changing
  13. Linked Data has helped us build a coherent, scalable, sane service. One that we hope is a bit more human literate. Linked Data cloud diagram http://www4.wiwiss.fu-berlin.de/bizer/pub/lod-datasets_2009-03-05_colored.png
  14. Use URIs to identify things, not only documents. How it works: The Web http://flickr.com/photos/danbri/2415237566/
  15. Use HTTP URIs - globally unique names that anyone can dereference Colon Slash Slash http://www.flickr.com/photos/jeffsmallwood/299208539/
  16. Provide useful information [in RDF] when someone looks up a URI Information Desk http://www.flickr.com/photos/metropol2/149294506/
  17. Include links to other URIs to let people discover related information Links http://www.flickr.com/photos/ravages/2831688538/
  18. One implication of this is that I think there’s only URIs and metadata... nothing else Self-portraiture + metadata http://www.flickr.com/photos/saltatempo/323462998/
  19. URIs are used as identifiers for real world things ...like Polar Bears and Jeremy Clarkson
  20. Just as my passport is an identifier for me
  21. ...which in turn makes assertions about me
  22. ...which in turn makes assertions about me: Thomas Scott, 16th May 1972, United Kingdom
  23. bbc.co.uk/nature/species/tiger is an identifier for the tiger species with resources which make assertions about it
  24. bbc.co.uk/nature/species/tiger is an identifier for the tiger species with resources which make assertions about it
  25. Linked Data at the BBC Test Card X http://www.flickr.com/photos/marksmanuk/3098983708/
  26. A page (URI) per programme: bbc.co.uk/programmes/:pid
  27. ...and programme segments...
  28. In the music domain we have a page for every artist the BBC plays bbc.co.uk/music/artist/:musicbrainzID
  29. And in the natural history domain we have URIs of animals... bbc.co.uk/nature/:rank/:dbpediaID
  30. ...adaptations and behaviours... bbc.co.uk/nature/adaptations/:dbpediaID
  31. ...and habitats... bbc.co.uk/nature/habitats/:dbpediaID
  32. And because the web is about URIs not pages there are separate URIs for each resource
  33. These are our building blocks Silos http://www.flickr.com/photos/bottleleaf/2218990208/
  34. But context lies in the links between these domains
  35. Programmes featuring a species
  36. Clips from programmes about a species
  37. Clips live at /programmes but are transcluded onto other pages Silos http://www.flickr.com/photos/bottleleaf/2218990208/
  38. Tracks played in an episode
  39. Programmes that have played an artist
  40. How have we put the blocks together?
  41. DBpedia as a controlled vocabulary Silos http://www.flickr.com/photos/bottleleaf/2218990208/
  42. Different teams model their domain
  43. Different teams model their domain (diagram labels: Brands, Series, Programme, Episodes, Content, Service, Publishing, Version, Event, Broadcast)
  44. Link models together
  45. Linked Data allows loosely coupled, distributed teams to share data, share models and build on each other's work
  46. Thank you. Programmes ontology: http://www.bbc.co.uk/ontologies/programmes; Understanding the big BBC graph: http://blogs.talis.com/n2/archives/569; Music ontology: http://musicontology.com
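
The URI patterns on slides 26 to 31 and the cross-domain links on slides 34 to 39 can be illustrated with a small sketch. This is not the BBC's implementation: the templates are copied from the slides, but the `http://` prefix, the predicate names `features` and `played`, the programme `pid` and the MusicBrainz ID are all hypothetical placeholders chosen for illustration.

```python
# URI templates taken from the slides; ":name" marks a placeholder segment.
URI_TEMPLATES = {
    "programme": "bbc.co.uk/programmes/:pid",
    "artist": "bbc.co.uk/music/artist/:musicbrainzID",
    "nature": "bbc.co.uk/nature/:rank/:dbpediaID",
    "habitat": "bbc.co.uk/nature/habitats/:dbpediaID",
}


def build_uri(kind, **params):
    """Fill the ':name' placeholders of a template with the given values."""
    parts = []
    for segment in URI_TEMPLATES[kind].split("/"):
        if segment.startswith(":"):
            segment = params[segment[1:]]
        parts.append(segment)
    # Assumption: the slides omit the scheme, so "http://" is added here.
    return "http://" + "/".join(parts)


# The tiger URI matches the example on slide 23.
tiger = build_uri("nature", rank="species", dbpediaID="tiger")

# Hypothetical identifiers, not real BBC or MusicBrainz values.
episode = build_uri("programme", pid="b0000001")
artist = build_uri(
    "artist", musicbrainzID="00000000-0000-0000-0000-000000000000"
)

# Links between domains as (subject, predicate, object) triples.
# The predicate names are invented for this sketch.
GRAPH = [
    (episode, "features", tiger),
    (episode, "played", artist),
]


def subjects(graph, predicate, obj):
    """Return every subject linked to obj by the given predicate."""
    return [s for (s, p, o) in graph if p == predicate and o == obj]


# "Programmes featuring a species" (slide 35)
print(subjects(GRAPH, "features", tiger))
# "Programmes that have played an artist" (slide 39)
print(subjects(GRAPH, "played", artist))
```

Because each thing has its own URI, a species page or an artist page can be assembled by querying for links that point at that URI, rather than by each team maintaining its own copy of the other domain's data.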
