Online Information Conference

•Download as KEY, PDF•

8 likes•7,223 views

Tom Scott

Technology Entertainment & Humor

8 UK TV Channels
10 UK Radio Stations
5 National TV and radio
40 local radio stations
Plus the World Service (in 32 languages)

Historically the BBC has created a series of
microsites – each coherent in their own right but
not across the breadth of BBC content
Radio 4 Big Bang http://www.bbc.co.uk/radio4/bigbang/

Which means I can’t ﬁnd everything about “CERN”

...Paul Weller...

Paul Weller http://www.ﬂickr.com/photos/johnbullas/3410330728/

I can’t follow my nose, I can’t browse by meaning,
from one page to the next following a semantic
thread
Snickers http://www.ﬂickr.com/photos/homer4k/386980596/

Linked Data has helped us build a coherent,
scalable, sane service. One that we hope is a bit
more human literate.
Linked Data cloud diagram http://www4.wiwiss.fu-berlin.de/bizer/pub/lod-datasets_2009-03-05_colored.png

Use URIs to identify things not only documents

How it works: The Web http://ﬂickr.com/photos/danbri/2415237566/

Use HTTP URIs - globally unique names that
anyone can dereference

Colon Slash Slash http://www.ﬂickr.com/photos/jeffsmallwood/299208539/

Provide useful information [in RDF] when someone
looks up a URI

Information Desk http://www.ﬂickr.com/photos/metropol2/149294506/

Include links to other URIs to let people discover
related information

Links http://www.ﬂickr.com/photos/ravages/2831688538/

One implication of this is that I think there’s only
URIs and metadata... nothing else

Self-portraiture + metadata http://www.ﬂickr.com/photos/saltatempo/323462998/

URIs are used as identiﬁers for real world things
...like Polar Bears and Jeremy Clarkson

Just as my passport is an identiﬁer for me

...which in turn makes assertions about me

Thomas Scott
16th May 1972 United Kingdom

...which in turn makes assertions about me

bbc.co.uk/nature/species/tiger
is an identiﬁer for the tiger species with resources
which make assertions about it

Linked Data at the BBC

Test Card X http://www.ﬂickr.com/photos/marksmanuk/3098983708/

A page (URI) per programmes
bbc.co.uk/programmes/:pid

In the music domain we have a page for every
artist the BBC plays
bbc.co.uk/music/artist/:musicbrainzID

And in the natural history domain we have URIs of
animals...
bbc.co.uk/nature/:rank/:dbpediaID

...adaptations and behaviours...
bbc.co.uk/nature/adaptaion/:dbpediaID

...and habitats...
bbc.co.uk/nature/habitats/:dbpediaID

And because the web is about URIs not pages
there are separate URIs for each resource

These are our building blocks

Silos http://www.ﬂickr.com/photos/bottleleaf/2218990208/

But context lies in the links between these domains

Clips live at /programmes but are transcluded onto
other pages
Silos http://www.ﬂickr.com/photos/bottleleaf/2218990208/

DBpedia as a controlled vocabulary

Silos http://www.ﬂickr.com/photos/bottleleaf/2218990208/

Brands

Series Programme

Episodes
Content
Service
Publishing
Version

Event Broadcast

Different teams model their domain

Linked Data allows loosely coupled, distributed
teams to share data, share models and build on
each others work

Thank you
Programmes ontology
http://www.bbc.co.uk/ontologies/programmes
Understanding the big BBC graph
http://blogs.talis.com/n2/archives/569
Music ontology
http://musicontology.com

Similar to Online Information Conference

Emtacl12, mlibraries12 conferences, 2012Kerryn Amery

A presentation about the traces left behind on twitter about the conference "...Margarida Fonseca

Web 2.0 Setting The Stage For Extending Our Reach: Resource Guidekennbicknell

Essential Digital Resources 2010Danny Nicholson

Web 2.0 for Archivists, Powerpoint VersionArian Ravanbakhsh

Adding Value to Cultural Heritage - Olaf Janssen lecturing for the course "Di...Olaf Janssen

BBC Programmes Ontology XTech2008Tom Scott

Internet MashupsCesare Pautasso

Harsh Horizons For the SocialmediaforumIan Forrester

Science in the OpenCameron Neylon

A Semantic Multimedia Web (Part 3)Raphael Troncy

Dark Matter - - the dark matter of the internet is open, social, peer-to-peer...Michael Edson

Semantically Capturing and Representing News Stories on the WebJose Luis Redondo Garcia

URIplay for Google Tech Talk (2008)Chris Jackson

BBC Backstage Web Horizon 2007 PresentationIan Forrester

Using The Web To Work TogetherPhil Wilson

The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...MODUL Technology GmbH

The Social Semantic Web: An IntroductionJohn Breslin

Ensuring Continuity of Access To Our Published HeritageEDINA, University of Edinburgh

Prof. Hendrik Speck - IMEA 3 Heidelberg - Social MediaHendrik Speck

Similar to Online Information Conference (20)

Emtacl12, mlibraries12 conferences, 2012

A presentation about the traces left behind on twitter about the conference "...

Web 2.0 Setting The Stage For Extending Our Reach: Resource Guide

Essential Digital Resources 2010

Web 2.0 for Archivists, Powerpoint Version

Adding Value to Cultural Heritage - Olaf Janssen lecturing for the course "Di...

BBC Programmes Ontology XTech2008

Internet Mashups

Harsh Horizons For the Socialmediaforum

Science in the Open

A Semantic Multimedia Web (Part 3)

Dark Matter - - the dark matter of the internet is open, social, peer-to-peer...

Semantically Capturing and Representing News Stories on the Web

URIplay for Google Tech Talk (2008)

BBC Backstage Web Horizon 2007 Presentation

Using The Web To Work Together

The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...

The Social Semantic Web: An Introduction

Ensuring Continuity of Access To Our Published Heritage

Prof. Hendrik Speck - IMEA 3 Heidelberg - Social Media

Recently uploaded

A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3

Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh

2024 April Patch TuesdayIvanti

How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe

Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda

Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica

Connecting the Dots for Information Discovery.pdfNeo4j

Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica

Decarbonising Buildings: Making a net-zero built environment a realityIES VE

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765

A Journey Into the Emotions of Software DevelopersNicole Novielli

Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq

QCon London: Mastering long-running processes in modern architecturesBernd Ruecker

Top 10 Hubspot Development Companies in 2024TopCSSGallery

Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3

Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González

Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3

So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda

Recently uploaded (20)

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx

Generative AI - Gitex v1Generative AI - Gitex v1.pptx

2024 April Patch Tuesday

How AI, OpenAI, and ChatGPT impact business and software.

Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger

Glenn Lazarus- Why Your Observability Strategy Needs Security Observability

Connecting the Dots for Information Discovery.pdf

Zeshan Sattar- Assessing the skill requirements and industry expectations for...

Decarbonising Buildings: Making a net-zero built environment a reality

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration

A Journey Into the Emotions of Software Developers

Genislab builds better products and faster go-to-market with Lean project man...

QCon London: Mastering long-running processes in modern architectures

Top 10 Hubspot Development Companies in 2024

Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

Generative Artificial Intelligence: How generative AI works.pdf

Digital Identity is Under Attack: FIDO Paris Seminar.pptx

So einfach geht modernes Roaming fuer Notes und Nomad.pdf

Online Information Conference

1. Building coherence at bbc.co.uk Tom Scott

2. 8 UK TV Channels 10 UK Radio Stations 5 National TV and radio 40 local radio stations Plus the World Service (in 32 languages)

3. ...and a website... since 1994

4. ... that all makes for a big archive!

5. Historically the BBC has created a series of microsites – each coherent in their own right but not across the breadth of BBC content Radio 4 Big Bang http://www.bbc.co.uk/radio4/bigbang/

6. Which means I can’t ﬁnd everything about “CERN”

7. Which means I can’t ﬁnd everything about “CERN”

8. ...Paul Weller... Paul Weller http://www.ﬂickr.com/photos/johnbullas/3410330728/

9. ...Lion...

10. ...or even Jeremy Clarkson

11. I can’t follow my nose, I can’t browse by meaning, from one page to the next following a semantic thread Snickers http://www.ﬂickr.com/photos/homer4k/386980596/

12. But things are changing

13. Linked Data has helped us build a coherent, scalable, sane service. One that we hope is a bit more human literate. Linked Data cloud diagram http://www4.wiwiss.fu-berlin.de/bizer/pub/lod-datasets_2009-03-05_colored.png

14. Use URIs to identify things not only documents How it works: The Web http://ﬂickr.com/photos/danbri/2415237566/

15. Use HTTP URIs - globally unique names that anyone can dereference Colon Slash Slash http://www.ﬂickr.com/photos/jeffsmallwood/299208539/

16. Provide useful information [in RDF] when someone looks up a URI Information Desk http://www.ﬂickr.com/photos/metropol2/149294506/

17. Include links to other URIs to let people discover related information Links http://www.ﬂickr.com/photos/ravages/2831688538/

18. One implication of this is that I think there’s only URIs and metadata... nothing else Self-portraiture + metadata http://www.ﬂickr.com/photos/saltatempo/323462998/

19. URIs are used as identiﬁers for real world things ...like Polar Bears and Jeremy Clarkson

20. Just as my passport is an identiﬁer for me

21. ...which in turn makes assertions about me

22. Thomas Scott 16th May 1972 United Kingdom ...which in turn makes assertions about me

23. bbc.co.uk/nature/species/tiger is an identiﬁer for the tiger species with resources which make assertions about it

24. bbc.co.uk/nature/species/tiger is an identiﬁer for the tiger species with resources which make assertions about it

25. Linked Data at the BBC Test Card X http://www.ﬂickr.com/photos/marksmanuk/3098983708/

26. A page (URI) per programmes bbc.co.uk/programmes/:pid

27. ...and programme segments...

28. In the music domain we have a page for every artist the BBC plays bbc.co.uk/music/artist/:musicbrainzID

29. And in the natural history domain we have URIs of animals... bbc.co.uk/nature/:rank/:dbpediaID

30. ...adaptations and behaviours... bbc.co.uk/nature/adaptaion/:dbpediaID

31. ...and habitats... bbc.co.uk/nature/habitats/:dbpediaID

32. And because the web is about URIs not pages there are separate URIs for each resource

33. These are our building blocks Silos http://www.ﬂickr.com/photos/bottleleaf/2218990208/

34. But context lies in the links between these domains

35. Programmes featuring a species

36. Clips from programmes about a species

37. Clips live at /programmes but are transcluded onto other pages Silos http://www.ﬂickr.com/photos/bottleleaf/2218990208/

38. Tracks played in an episode

39. Programmes that have played an artist

40. How have we put the blocks together?

41. DBpedia as a controlled vocabulary Silos http://www.ﬂickr.com/photos/bottleleaf/2218990208/

42. Different teams model their domain

43. Brands Series Programme Episodes Content Service Publishing Version Event Broadcast Different teams model their domain

44. Link models together

45. Linked Data allows loosely coupled, distributed teams to share data, share models and build on each others work

46. Thank you Programmes ontology http://www.bbc.co.uk/ontologies/programmes Understanding the big BBC graph http://blogs.talis.com/n2/archives/569 Music ontology http://musicontology.com

Editor's Notes

Although I&#x2019;m in speaking in the semantic web strand of this conference I&#x2019;m not going to talk about RDF/XML. That&#x2019;s not because I don&#x2019;t think it&#x2019;s important, I do, but rather because RDF is often conflated with RDF/XML and I would rather consider the model for a bit - what it means and how we&#x2019;ve used it. So I guess what I really mean is that what I&#x2019;m going to be talking about is RDF the model not RDF the data format. If however that is something you are interested in that perhaps grab me after my talk because we are publishing lots and lots of RDF/XML.
The BBC is the largest broadcasting corporation in the world. Its mission is to enrich people's lives with programmes that inform, educate and entertain. It is a public service broadcaster, established by a Royal Charter and funded by the licence fee that is paid by UK households. The BBC uses the income from the licence fee to provide services, including... 8 national TV channels + regional variations and programming National TV and radio for Scotland, Wales and Northern Ireland plus 40 local radio stations
and that&#x2019;s before you get to the World Service which broadcasts to the world in 32 languages. We&#x2019;ve had a web presence since 1994 What all this means is that the BBC produces an incredible range, diversity and volume of content . This volume of content is a challenge in it&#x2019;s own right let alone before you consider the size of the existing archive
This size presents a number of challenges - how to organise, how to build
For starters traditional 'left hand nav' style navigation doesn't work. From a UX POV, nor from a coordination and governance POV. As a result the BBC has historically created a series of microsite. Each coherent in their own right but not across the breadth of BBC content. Consider for example I can navigate around a Radio 4 site about the opening of the LHC... but...
I can&#x2019;t find everything to BBC knows about CERN... but equally I can&#x2019;t find everything
I can&#x2019;t find everything to BBC knows about CERN... but equally I can&#x2019;t find everything
I can&#x2019;t find everything to BBC knows about CERN... but equally I can&#x2019;t find everything
I can&#x2019;t find everything to BBC knows about CERN... but equally I can&#x2019;t find everything
I can&#x2019;t find everything to BBC knows about CERN... but equally I can&#x2019;t find everything
Paul Weller, or any other artist, nor can I find everything
But things are changing.. Starting with the data and how people think about it rather than starting with the web page down. And when I say data I really mean starting with understanding what things people care about and giving each of those things a URI and returning appropriate representations...
Of course what I&#x2019;m talking about is Linked Data... even if we didn&#x2019;t quite realise that when we started. But the idea that we should care about our URIs, care about having one per concept, care about having machine representations for those resources instead of a separate API has helped us build a coherent, scalable, sane service. Linking Open Data is a grassroots project to use web technologies to expose data on the web. It is for many people synonymous with the semantic web, or worse web 3.0, a term I personally can&#x2019;t stand (esp when you consider that TimBLs original memo described a web of things). It does, as far as I&#x2019;m concerned, represent a very large subset of the semantic web project. But what is it? Well it can be described with 4 simple rules.
The web was designed to be a web of things, not just a web of documents. Those documents make assertions about things in the real world but that doesn&#x2019;t mean the identifiers can only be used to identify web documents. Minting URIs for things rather than pages helps make the web more human literate because it means we are identifying those things that people care about.
The beauty of the web is its ubiquitous nature - the fact it is decentralised and able to function on any platform. This is because of TimBL&#x2019;s key invention the HTTP URI. URI&#x2019;s are globally unique, open to all and decentralised. Don&#x2019;t go using DOI or any other identifier - on the web all you need is an HTTP URI.
And obviously you need to provide some information at that URI. When people dereference it you need to give them some data - ideally as RDF as well as HTML. Providing the data as RDF means that machines can process that information for people to use. Making it more useful.
And of course you also need to provide links to other resources so people can continue their journey. And that means contextual links to other resources elsewhere on the web, not just your site. And that&#x2019;s it. Pretty simple. And I would argue that, other than the RDF bit, these principles should be followed for any website - they just make sense.
Including that I look like this Was born here That my name is this (diff slide - my driving license is another identifier which also makes assertions about me)
Including that I look like this Was born here That my name is this (diff slide - my driving license is another identifier which also makes assertions about me)
Including that I look like this Was born here That my name is this (diff slide - my driving license is another identifier which also makes assertions about me)
Including that I look like this Was born here That my name is this (diff slide - my driving license is another identifier which also makes assertions about me)
Tigers look like this Sound like this Do these things This has happened to them They live here Do have this sort of way of life (adaptations)
Tigers look like this Sound like this Do these things This has happened to them They live here Do have this sort of way of life (adaptations)
Tigers look like this Sound like this Do these things This has happened to them They live here Do have this sort of way of life (adaptations)
Tigers look like this Sound like this Do these things This has happened to them They live here Do have this sort of way of life (adaptations)
People care about our programme brands - they search for them, love watching them and expect the BBC to provide footage/ clips of them.
And we have separate pages for every artist the BBC plays on the new music site.
And you can do the same thing for sounds, news stories, links, wikipedia etc
If you build things correctly then like lego we can stick things together to build more stuff
Information about a thing is important and it is interesting, but it&#x2019;s interest is somewhat limited. What&#x2019;s really interesting is the join the link between things.
What programmes or clips do we have about a given species?
Clips live at /programmes but are transcluded onto other pages
Which tracks were plaid on a particular show - linking through to the artist pages. Again the information about the artist &#x2018;lives&#x2019; at /music but it&#x2019;s pulled into the programme domain because
Which in turn tell you about which programmes and radio stations play that artist - with links through to the programme or station.
What probably isn&#x2019;t completely obvious is that we have modeled and structured the site around those things. So we have classes of object and relationships between them, and resources within each class. For example - a Lion is a Species and species have defined relationships to habitats, location, conservation status and adaptation. What this means is that when we create a new species it appears on it&#x2019;s habitat, adaptation page etc.

Online Information Conference

Recommended

Recommended

More Related Content

Similar to Online Information Conference

Similar to Online Information Conference (20)

Recently uploaded

Recently uploaded (20)

Online Information Conference

Editor's Notes