Wikipedia as controlled vocabulary
Chris Sizemore, Silver Oliver
BBC
I’m about ‘Victorians’
BBC Topic Page: I’m about ‘Victorians’
Inside the BBC: BBC silo #1, BBC silo #2, BBC silo #3
Outside the BBC: NY Times, flickr, wikipedia; other-language labels such as viktorianisch, Ελληνικά
An index language exists primarily to:
[list items lost in extraction] F.W. Lancaster, Vocabulary control for information retrieval
Could Wikipedia be used as a universal language for identifying subjects?
Story of Wikipedia-as-CV
Story of Wikipedia-as-CV: personal origins
 
We needed a system to categorise movie & TV reviews.
So of course we built a categorisation system from scratch, including its own controlled vocab.
And when people saw the system, they always said: “Hey, that reminds me of Internet Movie Database…”
It struck me that the way Internet Movie Database is set up isn’t dissimilar to the structure of a thesaurus or a very flat taxonomy…
But it’s one where the emphasis is on “related to”, not broader/narrower, synonym, antonym, etc.
From then, I couldn’t help but be drawn to websites where the structure is clearly: “a single primary Concept per page, and pages for related Concepts link to each other”
Could those “one Concept per page” webpages be used as “terms”, as in a controlled vocabulary?
Are some websites actually “indexing languages” in disguise?
conText: a Wikipedia-as-CV auto-categoriser prototype
http://sells.welcomebackstage.com:5000/item/submit
Demo of conText: take text from audience!
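The slides do not say how conText works internally, so the following is only a toy sketch of the general idea of auto-categorising text against Wikipedia-as-CV. It assumes a tiny hand-built lookup table (`VOCAB`, hypothetical) from phrases to Wikipedia article URLs and suggests a tag whenever a phrase occurs in the text. The function name `suggest_tags` is also an illustration, not part of the prototype.

```python
import re

# Hypothetical mini-vocabulary: surface phrase -> Wikipedia article URL.
# Hand-picked for illustration; not taken from the conText prototype.
VOCAB = {
    "science fiction": "http://en.wikipedia.org/wiki/Science_fiction",
    "time travel": "http://en.wikipedia.org/wiki/Time_travel",
    "bbc": "http://en.wikipedia.org/wiki/BBC",
    "tardis": "http://en.wikipedia.org/wiki/Tardis",
}

def suggest_tags(text):
    """Return Wikipedia URLs whose phrase occurs in text, ordered by first appearance."""
    lowered = text.lower()
    hits = []
    for phrase, url in VOCAB.items():
        # Whole-phrase match only, so "bbc" does not match inside another word.
        match = re.search(r"\b" + re.escape(phrase) + r"\b", lowered)
        if match:
            hits.append((match.start(), url))
    return [url for _, url in sorted(hits)]

sample = "The BBC series mixes time travel with classic science fiction."
tags = suggest_tags(sample)
for url in tags:
    print(url)
```

A person would still review the suggestions, which is what makes the tagging semi-automatic rather than fully automatic.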
Wikipedia is already being used across the Web as a form of subject identification & disambiguation, in a grassroots way: in the form of hyperlinks embedded by authors in blog posts, news articles, music reviews, etc. everywhere!
http://en.wikipedia.org/wiki/British
http://en.wikipedia.org/wiki/Science_fiction
http://en.wikipedia.org/wiki/BBC
http://en.wikipedia.org/wiki/Time_travel
http://en.wikipedia.org/wiki/Dr_who
http://en.wikipedia.org/wiki/Tardis
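As a rough illustration of how such embedded hyperlinks could be harvested as subject identifiers, here is a minimal sketch using only the Python standard library. The HTML snippet and the class name `WikipediaLinkCollector` are assumptions made up for this example; the slides do not describe any particular extraction code.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse

class WikipediaLinkCollector(HTMLParser):
    """Collect the article names of Wikipedia links found in an HTML page."""

    def __init__(self):
        super().__init__()
        self.subjects = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        parts = urlparse(href)
        # Keep only links that point at a Wikipedia article path.
        if parts.netloc.endswith("wikipedia.org") and parts.path.startswith("/wiki/"):
            self.subjects.append(parts.path[len("/wiki/"):])

# Hypothetical page fragment written by an author who links to Wikipedia.
page = (
    '<p>A <a href="http://en.wikipedia.org/wiki/British">British</a> '
    '<a href="http://en.wikipedia.org/wiki/Science_fiction">sci-fi</a> show '
    'from the <a href="http://www.bbc.co.uk/">BBC</a>.</p>'
)

collector = WikipediaLinkCollector()
collector.feed(page)
subjects = collector.subjects
print(subjects)
```

Only the two Wikipedia links are kept; the link to the BBC homepage is ignored because it does not identify a subject in this scheme.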
These days, by convention, when you link to Wikipedia from your webpage, you are less likely to be saying “go and have a look at this other page” than to be giving a definition of a concept referred to in your content…
Also used in this way for specific domains are Internet Movie Database (for films & TV programmes), MySpace (for bands), Amazon (for books), etc.
For general knowledge, though, Wikipedia is becoming the Web’s de facto controlled vocabulary
http://en.wikipedia.org/wiki/Heerlen
http://en.wikipedia.org/wiki/Beethoven
http://en.wikipedia.org/wiki/Amsterdam
http://en.wikipedia.org/wiki/Van_Gogh_Museum
Wikipedia pages provide the best scope notes in the world
Wikipedia-as-CV benefits from being developed through a social process, maintained and kept current by the Wikipedia community.
Each concept represents a consensus view, and its meaning can be understood simply by reading the associated Wikipedia page.
For each Concept, the document edit history, the discussion around the concept’s definition, and the debate are important here…
 
So, we can tag fairly accurately and semi-automatically with globally unique subject identifiers using this approach… So what?
Un-silo your content repository quickly and cheaply, by connecting it to the Web via Wikipedia
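To make the "un-silo" point concrete, here is a small sketch under stated assumptions: three hypothetical silos whose items have already been tagged with Wikipedia URLs, and an index keyed on those URLs. The silo contents, item titles, and the function name `build_subject_index` are invented for illustration; only the URLs come from the slides.

```python
from collections import defaultdict

TIME_TRAVEL = "http://en.wikipedia.org/wiki/Time_travel"
SCIENCE_FICTION = "http://en.wikipedia.org/wiki/Science_fiction"

# Hypothetical silos: each item is (title, [Wikipedia URLs it is tagged with]).
silos = {
    "silo #1": [("Programme page", [TIME_TRAVEL, SCIENCE_FICTION])],
    "silo #2": [("News article", [TIME_TRAVEL])],
    "silo #3": [("Archive clip", [SCIENCE_FICTION])],
}

def build_subject_index(silos):
    """Map each Wikipedia URL to every (silo, title) pair tagged with it."""
    index = defaultdict(list)
    for silo_name, items in silos.items():
        for title, subject_urls in items:
            for url in subject_urls:
                index[url].append((silo_name, title))
    return dict(index)

index = build_subject_index(silos)
for silo_name, title in index[TIME_TRAVEL]:
    print(silo_name, title)
```

Because the key is a URL that anyone outside the organisation can also use, the same lookup works for external content tagged in the same way.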
 
 
 
 
Now playing vs. the Web
 
 
Why not bring BBC Archive materials into this service via Wikipedia-as-CV tagging and a linked data bridge between Wikipedia & MusicBrainz?
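One way to picture that bridge is as a simple lookup from Wikipedia URLs to MusicBrainz identifiers. The sketch below is purely illustrative: the identifier is a placeholder rather than a real MusicBrainz ID, and the archive items and the function name `archive_items_for_artist` are hypothetical.

```python
BEETHOVEN = "http://en.wikipedia.org/wiki/Beethoven"
AMSTERDAM = "http://en.wikipedia.org/wiki/Amsterdam"

# Hypothetical bridge: Wikipedia URL -> MusicBrainz artist identifier.
# The identifier below is a placeholder, not a real MusicBrainz ID.
wikipedia_to_musicbrainz = {BEETHOVEN: "mbid-placeholder-0001"}

# Hypothetical archive items tagged with Wikipedia URLs (Wikipedia-as-CV).
archive_tags = {
    "Archive documentary (hypothetical)": [BEETHOVEN],
    "Archive travel film (hypothetical)": [AMSTERDAM],
}

def archive_items_for_artist(mbid):
    """Find archive items whose Wikipedia tags bridge to the given artist identifier."""
    wikipedia_urls = {
        url for url, known_id in wikipedia_to_musicbrainz.items() if known_id == mbid
    }
    return [
        title
        for title, tags in archive_tags.items()
        if wikipedia_urls.intersection(tags)
    ]

now_playing = "mbid-placeholder-0001"
related = archive_items_for_artist(now_playing)
print(related)
```

The archive itself never needs to know about MusicBrainz; it only needs Wikipedia tags, and the bridge table does the rest.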
 
 
By using Wikipedia-as-CV, you can get your repository onto this diagram quickly, for free
 
A Web-scale, globally accessible index language accidentally exists:
Wikipedia is a controlled vocabulary
Much thanks! Questions, comments, & constructive criticism?
http://flickr.com/photos/deniscollette/1817034358/
