© Copyright 2008 STI INNSBRUCK www.sti-innsbruck.at
Media Meets Semantic Web – How the BBC
Uses DBpedia and Linked Data to Make
Connections
Georgi Kobilarov et al. ESWC 2009
• BBC is working to integrate data and link documents across BBC
domains
• Collaboration with Freie Universität Berlin, Rattle Research (and
Ontotext)
• Semantic Web context: usage of Linked Data from MusicBrainz and
DBpedia
Problem
• BBC publishes large amounts of online content (text, video, audio)
• Mostly data for broadcast brands and domain-specific microsites
• Division of its services by domain, e.g. food, music, news
→ No interlinking between these domain-specific sites – the full
potential of the available data is not used
Objectives
• DBpedia to provide a common "controlled" vocabulary and
equivalency service, which in turn is used to add "topic badges" to
existing, legacy web pages
• Soft transition from the old to the new system
– Developing a new service that supports the branding of BBC radio stations, TV
channels and programmes (bbc.co.uk/programmes)
– Developing a new music offering (bbc.co.uk/music/beta) that builds on existing
open web standards and is fully integrated with the programme support service
– Simple navigational elements (i.e. Topic Badges and term extraction) to support
contextual, semantic navigation
– Common set of web-scale identifiers to help classify all BBC online content (and
external URLs) and to help create equivalency between multiple vocabularies
Cross-Linking Legacy Content with Legacy Systems
• Desire to link to further BBC domains (apart from programmes and music)
– Through an about-relationship between programmes, people, places and subjects
• Data was created with a legacy auto-categorization system called CIS.
• CIS holds a hierarchy of terms in five main top-level classes:
– Proper names
– Subjects
– Brands
– Time periods
– Places
→ Objects identified in /programmes and /music also appear in other
domains, so a mechanism is needed to map between equivalent terms
→ Linking CIS concepts to DBpedia
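The mapping between CIS terms and DBpedia described above could be modeled roughly as follows. This is a minimal sketch, not the BBC's implementation: the class names `CisTerm` and `EquivalenceIndex`, the identifier format `cis:places/cardiff`, and the in-memory dictionaries are all assumptions made for illustration. Only the five top-level classes come from the slide.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List, Optional


class CisClass(Enum):
    """The five top-level CIS classes named on the slide."""
    PROPER_NAMES = "Proper names"
    SUBJECTS = "Subjects"
    BRANDS = "Brands"
    TIME_PERIODS = "Time periods"
    PLACES = "Places"


@dataclass(frozen=True)
class CisTerm:
    term_id: str          # hypothetical identifier format
    label: str
    top_class: CisClass


class EquivalenceIndex:
    """Illustrative two-way lookup between CIS terms and DBpedia URIs."""

    def __init__(self) -> None:
        self._terms: Dict[str, CisTerm] = {}
        self._to_dbpedia: Dict[str, str] = {}

    def link(self, term: CisTerm, dbpedia_uri: str) -> None:
        """Record that a CIS term is equivalent to a DBpedia resource."""
        self._terms[term.term_id] = term
        self._to_dbpedia[term.term_id] = dbpedia_uri

    def dbpedia_for(self, term_id: str) -> Optional[str]:
        """Return the DBpedia URI for a CIS term, or None if unmapped."""
        return self._to_dbpedia.get(term_id)

    def terms_for(self, dbpedia_uri: str) -> List[CisTerm]:
        """Return every CIS term mapped to the given DBpedia URI."""
        return [self._terms[tid]
                for tid, uri in self._to_dbpedia.items()
                if uri == dbpedia_uri]
```

With such an index, a programme tagged with a CIS term and a music page tagged with another vocabulary can meet at the same DBpedia URI, which is the equivalency role the objectives assign to DBpedia.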
Linking BBC Domains
• DBpedia weighted Label Lookup using Wikipedia inter-article-links as weight
indicator
– links(redirect)*log2(weight(article))
• Context-Based Disambiguation
– Disambiguate candidate concept matches by clustering them to identify similar contexts among CIS terms
and finding the corresponding contexts in DBpedia
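The two steps above can be sketched as follows. The score function follows the formula on the slide literally; reading `links(redirect)` as a link count for the matched label and `weight(article)` as a link-based weight of the target article is an assumption, as are all counts in the usage example. The disambiguation step uses a simple Jaccard overlap between category sets as a stand-in for the clustering the slide mentions; it is not the authors' algorithm.

```python
import math
from typing import Dict, List, Set, Tuple


def lookup_score(redirect_links: int, article_weight: int) -> float:
    """Slide formula: links(redirect) * log2(weight(article))."""
    if redirect_links < 0 or article_weight < 1:
        return 0.0  # assumption: treat missing evidence as a zero score
    return redirect_links * math.log2(article_weight)


def rank_candidates(counts: Dict[str, Tuple[int, int]]) -> List[Tuple[str, float]]:
    """Rank candidate DBpedia URIs for one label, highest score first.

    `counts` maps URI -> (redirect_links, article_weight).
    """
    scored = [(uri, lookup_score(r, w)) for uri, (r, w) in counts.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Overlap of two sets, from 0.0 (disjoint) to 1.0 (identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def disambiguate(candidates: Dict[str, Set[str]], context: Set[str]) -> str:
    """Pick the candidate whose categories best overlap the context.

    `candidates` maps URI -> category names; `context` holds categories
    gathered from neighbouring CIS terms (a stand-in for clustering).
    """
    return max(candidates, key=lambda uri: jaccard(candidates[uri], context))


# Usage with made-up numbers and category names:
ranking = rank_candidates({
    "http://dbpedia.org/resource/Mercury_(planet)": (2, 4),
    "http://dbpedia.org/resource/Mercury_(element)": (1, 1024),
})
best = disambiguate(
    {
        "http://dbpedia.org/resource/Mercury_(planet)": {"Planets", "Solar System"},
        "http://dbpedia.org/resource/Mercury_(element)": {"Chemical elements", "Metals"},
    },
    context={"Planets", "Astronomy", "Solar System"},
)
```

The example shows why both steps are useful: the link-based score alone would favour the element here, while the context of the surrounding terms points to the planet.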
Linking Documents to Concepts
• Named entity extraction system Muddy Boots
– Chosen over solutions from OpenCalais, Twine and Zemanta because it reuses existing
web identifiers, i.e. Wikipedia/DBpedia URIs
• Recognizes entities in BBC News articles
• Uses DBpedia identifiers for those entities
• Content Link Tool to add or remove DBpedia identifiers from any given
BBC URL
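The flow on this slide, from recognized entities to identifiers attached to a URL, might look like the sketch below. It is illustrative only: `extract_entities` is a naive label matcher standing in for Muddy Boots, whose actual method is not described here, and `ContentLinks` is a hypothetical in-memory model of what the Content Link Tool does (add or remove DBpedia identifiers for a BBC URL). The example URL is invented.

```python
from collections import defaultdict
from typing import Dict, Set


def extract_entities(text: str, label_to_uri: Dict[str, str]) -> Set[str]:
    """Naive stand-in for entity extraction: case-insensitive label match."""
    lowered = text.lower()
    return {uri for label, uri in label_to_uri.items()
            if label.lower() in lowered}


class ContentLinks:
    """Keeps the DBpedia identifiers attached to each BBC URL."""

    def __init__(self) -> None:
        self._links: Dict[str, Set[str]] = defaultdict(set)

    def add(self, bbc_url: str, dbpedia_uri: str) -> None:
        self._links[bbc_url].add(dbpedia_uri)

    def remove(self, bbc_url: str, dbpedia_uri: str) -> None:
        # discard() is a no-op when the identifier was never attached
        self._links[bbc_url].discard(dbpedia_uri)

    def identifiers_for(self, bbc_url: str) -> Set[str]:
        """Return a copy so callers cannot mutate the stored set."""
        return set(self._links.get(bbc_url, set()))


# Usage with a made-up article URL and label index:
labels = {
    "Cardiff": "http://dbpedia.org/resource/Cardiff",
    "Wales": "http://dbpedia.org/resource/Wales",
}
url = "http://news.bbc.co.uk/example-article"
links = ContentLinks()
for uri in extract_entities("Flooding hits Cardiff city centre", labels):
    links.add(url, uri)
```

Keeping manual add and remove operations next to the automatic extraction reflects the slide's point that editors can correct the identifiers on any given URL.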
Create User Journeys:
Topic Pages and Navigation Badges
• Topic pages
– Creation of aggregation pages of unstructured and structured content
– Pull together the modeled world of BBC programmes (CIS identifiers mapped to
DBpedia) and the unstructured world of BBC News articles
• Navigational Badges
– Once a user has entered an area of BBC content there are few links through to other
related content
– Providing this link is the role of the navigation badge
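Once programmes and news articles share DBpedia identifiers, a topic page and its badges reduce to a grouping step. The sketch below assumes two dictionaries that map a content URL to its set of DBpedia URIs; the function names and the `/topics/<name>` path scheme are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set


def build_topic_page(topic_uri: str,
                     programme_topics: Dict[str, Set[str]],
                     article_topics: Dict[str, Set[str]]) -> Dict[str, object]:
    """Collect all programmes and articles tagged with one DBpedia URI."""
    programmes = sorted(url for url, topics in programme_topics.items()
                        if topic_uri in topics)
    articles = sorted(url for url, topics in article_topics.items()
                      if topic_uri in topics)
    return {"topic": topic_uri, "programmes": programmes, "articles": articles}


def navigation_badges(page_topics: Set[str]) -> List[str]:
    """Return hypothetical topic-page paths for the topics on a page."""
    return sorted("/topics/" + uri.rsplit("/", 1)[-1].lower()
                  for uri in page_topics)


# Usage with invented content URLs:
cardiff = "http://dbpedia.org/resource/Cardiff"
page = build_topic_page(
    cardiff,
    programme_topics={"/programmes/example-1": {cardiff}},
    article_topics={"/news/example-a": {cardiff}, "/news/example-b": set()},
)
badges = navigation_badges({cardiff})
```

The badge is then just a link from any page carrying the identifier to the aggregation page for that identifier.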
Conclusions
• User experience at the center of BBC efforts
• Semantics as enabler
• What we can learn from the BBC
– User should be at the center of efforts
– Pages not strictly structured according to domain model
– Semantics primarily enable smart interlinking to additional content
– Well hidden magic
– Simplicity of domain models is beauty
• For more information refer to the “Beyond the polar bear” presentation
– http://www.slideshare.net/reduxd/beyond-the-polar-bear
