Wikidata presentation at SemTechBiz Berlin 2012


    1. Wikidata: The next big thing for Wikipedia. SemTechBiz Berlin, February 2012. Denny Vrandečić, KIT Karlsruhe Institute of Technology / Wikimedia Deutschland. Institut AIFB – Angewandte Informatik und Formale Beschreibungsverfahren. KIT – University of the State of Baden-Württemberg and National Large-scale Research Center of the Helmholtz Association. www.kit.edu
    2. 07.02.2012 Wikidata Denny Vrandečić
    3. Imagine a world in which every single person is given free access to the sum of all human knowledge.
    4. About 500 million views per day
    7. Top 200 website. 12M+ media files. All free to use.
    13. 20M+ articles. 1B+ edits. 280+ languages.
    14. Coverage by language. English, German, French, Dutch: 1 million+. 40 languages: 100,000+. 107 languages: 10,000+. But what about other languages?
    15. English
    16. French
    17. Italian
    18. Catalan
    19. Greek
    20. Russian
    21. Chinese
    22. What Wikipedia knows. Wikipedia has articles about … all cities … their populations … their mayors. So can I ask for a list of the world’s ten largest cities with a female mayor?
    23. Let’s see what happens…
    24. WIKIPEDIA’S ANSWER: LISTS
    41. COMPUTERS ARE STUPID
    42. What humans see
    43. What humans see: Berlin ... has a population of 3,490,445 ... is located in Germany ... has mayor Klaus Wowereit ... has an area of 892 km²
    44. What computers see
    45. What computers see: Berlin 3,490,445 Germany 892 km²
    46. COMPUTERS DON’T MAKE CONNECTIONS
    47. COMPUTERS NEED OUR HELP
    48. Berlin [edit]. From Wikidata. Capital of Germany [edit]. Also known as: City of Berlin [edit]. Continent: Europe [3 sources]. Country: Germany [2 sources]. Population: 3,490,445 [1 source]; 3,500,000 [2 sources]; [other values]. Calling code: 030 [2 sources]. Mayor: Klaus W| [0 sources], with suggestions: Klaus Wowereit (German politician), Klaus Wunderlich (German musician), Klaus Waldeck (Austrian musician and former lawyer), Klaus Wagner (German mathematician), Klaus Wagner (stalker of the British Royal Family). Vehicle registration: B [1 source]. Area: 891.85 km² [2 sources]. Twin city: Los Angeles [3 sources]. [new fact]. Sidebar: Main page, Contents, Access the API, Random page, Donate to Wikidata. Interaction: Help, About Wikidata, Community portal, Recent changes. Languages: Catalá, Cesky, Dansk, Deutsch, Eesti, Español, Esperanto, Français, Hrvatski, Italiano, O’zbek, Complete list.
    49. The same page in German: Berlin [edit]. From Wikidata. Hauptstadt von Deutschland [edit]. Auch bekannt als: Stadt Berlin [edit]. Kontinent: Europa [3 sources]. Land: Deutschland [2 sources]. Einwohner: 3.490.445 [1 source]; 3.500.000 [2 sources]; [weitere Werte]. Telefonvorwahl: 030 [2 sources]. Bürgermeister: Klaus Wowereit [2 sources]. Amtliches Kennzeichen: B [1 source]. Fläche: 891,85 km² [2 sources]. Partnerstadt: Los Angeles [3 sources]. [new fact]. Sidebar: Hauptseite, Inhalt, API, Zufällige Seite, Spende an Wikidata. Interaktion: Hilfe, Über Wikidata, Benutzerportal, Letzte Änderungen. Sprachen: Catalá, Cesky, Dansk, Eesti, English, Español, Esperanto, Français, Hrvatski, Italiano, O’zbek, Complete list.
    50. Berlin Continent Europe. Berlin Country Germany. Berlin Population 3490445. Berlin Calling_code 030. Berlin Vehicle_registration B. Berlin Mayor Klaus_Wowereit. Berlin Twin_city Los_Angeles.
    51. Klaus Wowereit Mayor Berlin
    52. Wikidata. Provide a database of the world’s knowledge that anyone can edit. Collect references and quotes for millions of data items. Engage a sustainable community that collects data from everywhere in a machine-readable way. Increase the quality and lower the maintenance costs of Wikipedia and related projects. Deliver software and community best practices enabling others to engage in projects of data collection and provisioning.
    53. Extracts facts from Wikipedia infoboxes. Publishes them in RDF. Shows potential of machine-readable data.
    54. Wikidata. Provide a database of the world’s knowledge that anyone can edit. Collect references and quotes for millions of data items. Engage a sustainable community that collects data from everywhere in a machine-readable way. Increase the quality and lower the maintenance costs of Wikipedia and related projects. Deliver software and community best practices enabling others to engage in projects of data collection and provisioning.
    55. Secondary database. Sources for every fact. Reflect diversity.
    56. Wikidata. Provide a database of the world’s knowledge that anyone can edit. Collect references and quotes for millions of data items. Engage a sustainable community that collects data from everywhere in a machine-readable way. Increase the quality and lower the maintenance costs of Wikipedia and related projects. Deliver software and community best practices enabling others to engage in projects of data collection and provisioning.
    57. Project plan: 3 phases. Phase 1: Language links. Phase 2: Infobox augmentation. Phase 3: Inline queries.
    58. Phase 1: Language links. Current: every language links to every other. In Wikidata: create one page for each entity, list representations in each language. In Wikipedias: pull language links from Wikidata.
    59. Phase 2: Infobox augmentation. Current: each article calls an infobox with values. In Wikidata: centralize the values. In Wikipedias: just call the infobox and populate it with values from Wikidata.
    60. Phase 3: Inline queries. Enable inline queries in Wikipedias. With several formats.
    61. Open source project. 400+ users. NASA, Europeana, Deutsche Telekom, … 20+ languages. World-wide community. Commercial support. Many extensions. semantic-mediawiki.org
    63. Conclusions. Editable, common resource for data. Enables much smaller contribution size. Freely reusable, machine-readable data. Able to answer questions. Available in 280+ languages.
    64. Imagine a world in which every single person is given free access to the sum of all human knowledge.
    65. Thank you! http://meta.wikipedia.org/wiki/Wikidata_WMDE Institut AIFB – Angewandte Informatik und Formale Beschreibungsverfahren. Presenting work done by Markus Krötzsch, Yaron Koren, Daniel Kinzler, Qamarniso Ismoilova, Sergey Chernishev, Max Völkel, Heiko Haller, Sebastian Blohm, Philipp Sorg, Peter Haase, Than Tran, Basil Ell, Daniel Herzig, Benedikt Kämpgen, Elena Simperl, Delia Rusu, Marko Grobelnik, Michael Cariaso, Amélie Cordier, Jean Lieber, Emmanuel Nauer, Yannick Toussaint, Pascal Molli, Hala Skaf-Molli, Joel Natividad, Daniel Hansch and the Ontoprise team, and many others. KIT – University of the State of Baden-Württemberg and National Large-scale Research Center of the Helmholtz Association. www.kit.edu
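
Slide 50 writes Berlin's facts as subject, property, value statements, and slide 22 asks for the world's ten largest cities with a female mayor. The following is a minimal Python sketch of how such statements might be stored and queried. It is an illustration under assumptions, not Wikidata's data model or API: the tuple layout, the helper names `index` and `largest_cities_with_female_mayor`, the `Gender` property, and every `City_*` / `Mayor_*` entry are hypothetical and do not appear on the slides.

```python
from collections import defaultdict

# Statements from slide 50, as (subject, property, value) triples.
SLIDE_50 = [
    ("Berlin", "Continent", "Europe"),
    ("Berlin", "Country", "Germany"),
    ("Berlin", "Population", 3490445),
    ("Berlin", "Calling_code", "030"),
    ("Berlin", "Vehicle_registration", "B"),
    ("Berlin", "Mayor", "Klaus_Wowereit"),
    ("Berlin", "Twin_city", "Los_Angeles"),
]

# Assumption: placeholder statements that are NOT on the slides,
# added only so the query below has something to filter on.
PLACEHOLDERS = [
    ("Klaus_Wowereit", "Gender", "male"),
    ("City_A", "Population", 500000),
    ("City_A", "Mayor", "Mayor_A"),
    ("Mayor_A", "Gender", "female"),
    ("City_B", "Population", 2000000),
    ("City_B", "Mayor", "Mayor_B"),
    ("Mayor_B", "Gender", "female"),
]


def index(triples):
    """Group triples as {subject: {property: [values]}}."""
    facts = defaultdict(lambda: defaultdict(list))
    for subject, prop, value in triples:
        facts[subject][prop].append(value)
    return facts


def largest_cities_with_female_mayor(facts, limit=10):
    """Return up to `limit` (city, population) pairs, largest first."""
    hits = []
    for city, props in facts.items():
        mayors = props.get("Mayor", [])
        populations = props.get("Population", [])
        if not mayors or not populations:
            continue  # not a city with both facts recorded
        has_female_mayor = any(
            "female" in facts.get(mayor, {}).get("Gender", [])
            for mayor in mayors
        )
        if has_female_mayor:
            hits.append((city, max(populations)))
    return sorted(hits, key=lambda pair: pair[1], reverse=True)[:limit]


result = largest_cities_with_female_mayor(index(SLIDE_50 + PLACEHOLDERS))
```

Each property maps to a list of values because the page mockup on slide 48 shows more than one population figure for Berlin; the sketch simply takes the largest one when ranking.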

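
Slide 58 contrasts the current scheme, in which every language edition links to every other, with one central page per entity that lists its representation in each language. A small Python sketch of that difference follows. The function names and the `BERLIN_ENTITY` record are assumptions made for illustration; the slides do not specify a storage format.

```python
def pairwise_link_count(languages):
    """Links stored when every language edition links to every other one."""
    n = len(languages)
    return n * (n - 1)


def central_link_count(languages):
    """Entries stored when one central page lists each language once."""
    return len(languages)


# Hypothetical central record for one entity: language code -> article title.
BERLIN_ENTITY = {"en": "Berlin", "de": "Berlin", "fr": "Berlin", "it": "Berlino"}


def language_links(entity, current_language):
    """Language links an article could pull from the central record."""
    return {
        lang: title
        for lang, title in entity.items()
        if lang != current_language
    }


links_for_german_article = language_links(BERLIN_ENTITY, "de")
```

For an entity covered in 280 languages, the pairwise scheme would store 280 × 279 = 78,120 links, while the central page would hold 280 entries.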