BBC News Labs at Yahoo! Hack Europe
  • Small, underfunded

    1. BBC News Labs: Yahoo! Hack Europe (London)
    2. BBC News Labs
       • Matt Shearer – Lead, BBC News Labs (@completedespair)
       • Jeremy Tarling – Data Architect, BBC News (@jeremytarling)
       • Paul Wilton – Tech Architect, Ontoba Ltd (@pwilton)
       • And Viktor Tron, Matt Haynes, Mark Ransby
    3. BBC News Labs
       1. History
       2. Now
       3. What’s possible
       4. APIs
       (no On-Stage, LIVE coding. sorry)
    4. History
       • The News Juicer is 1 year old
       • 2012: "All we have is a bunch of articles... We need a semantic prototyping platform!"
       • The News Juicer pipeline:
         1. Grab BBC News & Sport articles
         2. Extract concepts
         3. Match to DBpedia
         4. Annotate article
         5. Push to triplestore
         6. Expose via API
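The six pipeline steps above can be sketched in a few lines of Python. This is only a toy illustration, not the Juicer's actual code: the `KNOWN_CONCEPTS` lookup table, the `Article` class, the `http://example.org/about` predicate, and the sample article are all hypothetical stand-ins for the real extraction, matching, and triplestore components.

```python
from dataclasses import dataclass, field

# Hypothetical lookup table standing in for concept extraction and DBpedia matching.
KNOWN_CONCEPTS = {
    "David Cameron": "http://dbpedia.org/resource/David_Cameron",
    "Tesco": "http://dbpedia.org/resource/Tesco",
}


@dataclass
class Article:
    url: str
    text: str
    concepts: list = field(default_factory=list)


def extract_concepts(text):
    """Step 2: naive extraction, returning the known labels that occur in the text."""
    return [label for label in KNOWN_CONCEPTS if label in text]


def annotate(article):
    """Steps 3-4: map each label to a DBpedia URI and attach the URIs to the article."""
    article.concepts = [KNOWN_CONCEPTS[label] for label in extract_concepts(article.text)]
    return article


def to_triples(article, predicate="http://example.org/about"):
    """Step 5: build (subject, predicate, object) triples ready for a triplestore."""
    return [(article.url, predicate, uri) for uri in article.concepts]


# Step 1 is simulated with a made-up article; step 6 would serve `triples` over an API.
article = annotate(Article("http://example.org/news/1", "David Cameron visited a Tesco store."))
triples = to_triples(article)
```

A real pipeline would replace the lookup table with an entity extractor and a DBpedia matching service, but the data flow (text in, URIs attached, triples out) would look broadly similar.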
    5. History
    6. Now
       • 84,400+ News & Sport articles tagged
       • 4 semantic APIs (there were only 3 yesterday)
       • bbc.mashery.com (yahoo:hack)
    7. What’s possible
    8. APIs
       Jeremy Tarling - Data Architect, BBC News (@jeremytarling)
       The Juicer API has 4 endpoints:
       1. article semantic annotation
       2. article full text search
       3. concept co-occurrence
       4. concept search
    9. API 1 - article semantic annotation
       • GET a list of BBC News articles by concept
       • support for SPARQL queries
       • explore the DBpedia graph
       • examples:
         o "articles about Conservative politicians"
         o "articles about places within 25 miles of Chester"
         o "articles about companies in the aerospace industry"
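As a rough idea of what the "articles about Conservative politicians" example might look like, the sketch below builds a SPARQL query and encodes it into a GET URL using the standard `query` parameter. The endpoint URL and the `ex:about` tagging predicate are assumptions made for illustration; the slides do not give the Juicer's real endpoint path or vocabulary. `dbo:Person` and `dbo:party` are DBpedia ontology terms.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the real Juicer URL is not given in the slides.
ENDPOINT = "http://example.org/juicer/sparql"

# ex:about is an assumed predicate linking an article to a tagged concept.
QUERY = """
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX ex:  <http://example.org/vocab/>

SELECT DISTINCT ?article WHERE {
  ?article ex:about ?person .
  ?person a dbo:Person ;
          dbo:party <http://dbpedia.org/resource/Conservative_Party_(UK)> .
}
LIMIT 10
"""


def sparql_get_url(endpoint, query):
    """Encode a SPARQL query into a GET URL via the `query` parameter."""
    return endpoint + "?" + urlencode({"query": query.strip()})


url = sparql_get_url(ENDPOINT, QUERY)
```

The other two examples (places near Chester, aerospace companies) would follow the same pattern, with the graph pattern swapped for the relevant DBpedia properties.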
    10. API 2 - article full text search
       • perform full text search of BBC News articles
       • filter by section: "politics", "business"
       • specify date range, limit + offset
       • example: "find 5 articles containing the words horsemeat and Tesco from the UK section since Jan 1st 2012"
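The horsemeat example could translate into a request URL along the following lines. This is a sketch under assumptions: the base URL and the parameter names (`q`, `section`, `since`, `limit`, `offset`) are hypothetical, chosen only to mirror the options listed on the slide.

```python
from datetime import date
from urllib.parse import urlencode


def build_search_url(base, terms, section=None, since=None, limit=10, offset=0):
    """Build a full text search URL; all parameter names here are hypothetical."""
    params = {"q": " ".join(terms), "limit": limit, "offset": offset}
    if section:
        params["section"] = section
    if since:
        params["since"] = since.isoformat()
    return base + "?" + urlencode(params)


# "find 5 articles containing horsemeat and Tesco from the UK section since Jan 1st 2012"
url = build_search_url(
    "http://example.org/juicer/articles",  # placeholder base URL
    ["horsemeat", "Tesco"],
    section="uk",
    since=date(2012, 1, 1),
    limit=5,
)
```

Paging through further results would mean repeating the call with `offset` increased by `limit` each time.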
    11. API 3 - concept co-occurrence
       • select a DBpedia concept: <http://dbpedia.org/resource/David_Cameron>
       • specify the type: <http://dbpedia.org/ontology/Person>
       • returns an ordered list of people that also appear in BBC News articles alongside David Cameron, and their frequencies
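To make the idea of co-occurrence concrete, here is a minimal in-memory sketch of the computation such an endpoint might perform. It is not the Juicer's implementation; the `ARTICLES` data and the `example.org` concept URIs are made up purely for illustration.

```python
from collections import Counter

CAMERON = "http://dbpedia.org/resource/David_Cameron"
PERSON = "http://dbpedia.org/ontology/Person"
PLACE = "http://dbpedia.org/ontology/Place"

# Made-up toy data: each article is a set of (concept URI, type URI) tags.
ARTICLES = [
    {(CAMERON, PERSON), ("http://example.org/Person_A", PERSON), ("http://example.org/Place_X", PLACE)},
    {(CAMERON, PERSON), ("http://example.org/Person_A", PERSON), ("http://example.org/Person_B", PERSON)},
    {("http://example.org/Person_A", PERSON), ("http://example.org/Person_B", PERSON)},
]


def co_occurrences(articles, concept, type_uri):
    """Count concepts of the given type tagged on the same articles as `concept`."""
    counts = Counter()
    for tags in articles:
        if concept not in {uri for uri, _ in tags}:
            continue  # this article does not mention the selected concept
        counts.update(uri for uri, t in tags if t == type_uri and uri != concept)
    return counts.most_common()  # ordered by descending frequency


ranking = co_occurrences(ARTICLES, CAMERON, PERSON)
```

Passing a different type, such as `PLACE`, would rank places rather than people that appear alongside the selected concept.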
    12. API 4 - find concepts
       • full text search for concepts that are tagged on articles
       • specify a search term or phrase
       • returns OpenSearch Suggestions JSON for the semantic concepts
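OpenSearch Suggestions JSON is an array of the form `[query, [completions], [descriptions], [urls]]`, where the last two arrays are optional. The sketch below parses such a response into a list of records. The sample payload is invented, and the assumption that the fourth array carries DBpedia URIs is a guess rather than something stated in the slides.

```python
import json


def parse_suggestions(payload):
    """Parse OpenSearch Suggestions JSON into (query, list of suggestion dicts)."""
    data = json.loads(payload)
    query, completions = data[0], data[1]
    descriptions = data[2] if len(data) > 2 else []  # optional third array
    urls = data[3] if len(data) > 3 else []          # optional fourth array
    results = []
    for i, label in enumerate(completions):
        results.append({
            "label": label,
            "description": descriptions[i] if i < len(descriptions) else None,
            "url": urls[i] if i < len(urls) else None,
        })
    return query, results


# Invented sample response for the search term "camer".
SAMPLE = '["camer", ["David Cameron"], ["Person"], ["http://dbpedia.org/resource/David_Cameron"]]'
query, suggestions = parse_suggestions(SAMPLE)
```

Because the descriptions and URLs arrays are optional, the parser fills missing fields with `None` instead of raising an error.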
    13. Thanks and enjoy (@BBC_News_Labs)