Digitale Bibliothek Unless noted differently this presentation and all of its parts can be copied and reused under the ter...
What do  UDC  and  Wikipedia have in common?
Mundaneum
<ul><ul><li>“ Imagine a world in which every single person on the planet is given free access to the  sum of all human kno...
<ul><ul><li>“ Imagine a world in which every single person on the planet is given free access to the sum of all human know...
 
http://en.wikipedia.org/wiki/Infinite_monkey_theorem
The Happy Accident Joseph Reagle:  Wikipedia: The Happy Accident. ACM Interactions.  Volume 16, Number 3 (2009), pp 42-45....
It’s a wiki! <ul><li>one database of pages </li></ul><ul><li>heavily hyperlinked </li></ul><ul><li>everyone can edit </li>...
It’s the authority! <ul><li>Most visited websites </li></ul><ul><li>Google* </li></ul><ul><li>Facebook </li></ul><ul><li>Y...
UDC and Wikipedia <ul><li>Monographic principle one article per topic (and language) </li></ul><ul><li>Multilinguality 24 ...
How is  Knowledge in  Wikipedia Organized ?
1. Wikipedia Categories <ul><ul><li>Download: http://stats.wikimedia.org/EN/CategoryOverviewIndex.htm  </li></ul></ul><ul>...
UDC 35 Public administration 352(493) Belgian cities 3 Sciences sociales 352 Lowest level of administration (4) Europe (49...
Wikipedia Categories (en) skipped 5 steps skipped 2 steps http://toolserver.org/~dapete/catgraph/ Europe Capitals in Europ...
Categories and Classifications
2. Wikipedia Article structure <ul><li>substructure (sections, intro, sentences) </li></ul><ul><li>links between articles,...
http://en.wikipedia.org/wiki/Brussels#History
WikiWord Thesaurus <ul><li>multilingual thesaurus in SKOS, build by mining the Wikipedia link structure </li></ul><ul><ul>...
RDF triples field born in capital of same PND authority file DBpedia (from Wikipedia) other databases Brussels Paul Otlet ...
http://dbpedia.org/resource/Brussels http://sws.geonames.org/2800866/
a hub on the Semantic Web DBPedia the  ” Semantic Web ” as of March 2009
<ul><li>Wikipedia Encyclopedia </li></ul><ul><li>WikiWord & DBPedia </li></ul><ul><li>Used for NLP, database Mapping and S...
Wikipedia and UDC <ul><li>Link UDC and Wikipedia </li></ul><ul><ul><li>Index by UDC, get Wikipedia </li></ul></ul><ul><ul>...
Thoughts and Questions?
Upcoming SlideShare
Loading in...5
×

Wikipedia as Knowledge Organization System

3,711

Published on

Talk held at the UDC 2009 seminar: http://www.udcc.org/seminar2009/

0 Comments
3 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
3,711
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
77
Comments
0
Likes
3
Embeds 0
No embeds

No notes for slide

Wikipedia as Knowledge Organization System

  1. 1. Digitale Bibliothek Unless noted differently this presentation and all of its parts can be copied and reused under the terms of the Creative Commons Attribution-Share Alike 3.0 Unported license. Jakob Voß Verbundzentrale des GBV (VZG) Wikipedia as Knowledge Organization System International UDC Seminar 2009 The Hague, 29 October
  2. 2. What do UDC and Wikipedia have in common?
  3. 3. Mundaneum
  4. 4. <ul><ul><li>“ Imagine a world in which every single person on the planet is given free access to the sum of all human knowledge . That's what we're doing.” </li></ul></ul>Paul Otlet
  5. 5. <ul><ul><li>“ Imagine a world in which every single person on the planet is given free access to the sum of all human knowledge. That's what we're doing.” </li></ul></ul>Paul Otlet Jimbo Wales Wikimedia Commons
  6. 7. http://en.wikipedia.org/wiki/Infinite_monkey_theorem
  7. 8. The Happy Accident Joseph Reagle: Wikipedia: The Happy Accident. ACM Interactions. Volume 16, Number 3 (2009), pp 42-45. DOI: 10.1145/1516016.1516026 World Wide Web (Infrastructure) Nupedia (Funding) Wiki (Tool) 2001
  8. 9. It’s a wiki! <ul><li>one database of pages </li></ul><ul><li>heavily hyperlinked </li></ul><ul><li>everyone can edit </li></ul><ul><li>every edit is traced and revertible </li></ul>
  9. 10. It’s the authority! <ul><li>Most visited websites </li></ul><ul><li>Google* </li></ul><ul><li>Facebook </li></ul><ul><li>Yahoo! </li></ul><ul><li>YouTube </li></ul><ul><li>Windows Live </li></ul><ul><li>Wikipedia </li></ul>* 95% of Wikipedia articles (en) rank in Google‘s top 10 http://www.alexa.com/topsites
  10. 11. UDC and Wikipedia <ul><li>Monographic principle one article per topic (and language) </li></ul><ul><li>Multilinguality 24 Wikipedias with > 1,000,000 articles 81 Wikipedias with > 1,000 articles </li></ul><ul><li>Perpetual modifications Recent changes, page history </li></ul>
  11. 12. How is Knowledge in Wikipedia Organized ?
  12. 13. 1. Wikipedia Categories <ul><ul><li>Download: http://stats.wikimedia.org/EN/CategoryOverviewIndex.htm </li></ul></ul><ul><ul><li>Jakob Voss (2006):  Collaborative thesaurus tagging the Wikipedia way. http://arxiv.org/abs/cs/0604036 </li></ul></ul><ul><li>Each article can be sorted into multiple categories </li></ul><ul><li>Multihierarchy of categories </li></ul><ul><li>It’s a dynamic thesaurus </li></ul><ul><li>partly faceted, and precoordinated </li></ul>
  13. 14. UDC 35 Public administration 352(493) Belgian cities 3 Sciences sociales 352 Lowest level of administration (4) Europe (493) Belgium Brusseles
  14. 15. Wikipedia Categories (en) skipped 5 steps skipped 2 steps http://toolserver.org/~dapete/catgraph/ Europe Capitals in Europa Geography Geography of Europe Capitals Brusseles Regions of Belgium Geography by Continent Geography by Place ... by Continent Continents Social Sciences Belgium
  15. 16. Categories and Classifications
  16. 17. 2. Wikipedia Article structure <ul><li>substructure (sections, intro, sentences) </li></ul><ul><li>links between articles, to other language editions and to external resources </li></ul><ul><li>redirects (synonyms) and disambiguation pages (homonyms) </li></ul><ul><li>lists, portals, and navigation boxes </li></ul><ul><li>structured infoboxes and geodata </li></ul><ul><li>(bibliographic) references </li></ul><ul><li>.... </li></ul>
  17. 18. http://en.wikipedia.org/wiki/Brussels#History
  18. 19. WikiWord Thesaurus <ul><li>multilingual thesaurus in SKOS, build by mining the Wikipedia link structure </li></ul><ul><ul><li>Daniel Kinzler (2008): Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia. </li></ul></ul><ul><ul><li>http://brightbyte.de/page/WikiWord </li></ul></ul>DBpedia <ul><li>extracts structured information from Wikipedia and converts it to RDF triples </li></ul><ul><ul><li>Kobilarov, Bizer, Auer & Lehmann (2009): DBpedia - A Linked Data Hub and Data </li></ul></ul><ul><ul><li>Source for Web and Enterprise Applications. http://www2009.eprints.org/228/ </li></ul></ul><ul><ul><li>http://dbpedia.org/ </li></ul></ul>
  19. 20. RDF triples field born in capital of same PND authority file DBpedia (from Wikipedia) other databases Brussels Paul Otlet Belgium Information science Paul Otlet
  20. 21. http://dbpedia.org/resource/Brussels http://sws.geonames.org/2800866/
  21. 22. a hub on the Semantic Web DBPedia the ” Semantic Web ” as of March 2009
  22. 23. <ul><li>Wikipedia Encyclopedia </li></ul><ul><li>WikiWord & DBPedia </li></ul><ul><li>Used for NLP, database Mapping and Semantic Tagging </li></ul><ul><ul><li>subject indexing with RDF concepts </li></ul></ul><ul><ul><li>for instance at BBC and by CommonTags </li></ul></ul>Wikipedia as KOS
  23. 24. Wikipedia and UDC <ul><li>Link UDC and Wikipedia </li></ul><ul><ul><li>Index by UDC, get Wikipedia </li></ul></ul><ul><ul><li>Index by Wikipedia, get UDC </li></ul></ul><ul><li>Make UDC part of the Semantic Web! </li></ul>
  24. 25. Thoughts and Questions?
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×