Introduction to Wikidata
British Library, 26/4/13
Andrew Gray
andrew.firstname.lastname@example.org | @generalising

Wikidata summary
● Central data repository for Wikimedia projects
● Human- and machine-readable
● Human- and machine-editable
● Fully multilingual
● Supports semantic relationships
www.wikidata.org

Overall plan
● Phase I
  – Centralise cross-language relationships
● Phase II
  – Centralise core structured data
● Phase III
  – Dynamic generation of list content

Phase I
● Centralising all “interwiki” cross-language links
  – Historically, a major maintenance headache!
● Single conceptual entity => many articles
  – ...some unexpected oddities arise; not all 1:1
● Almost all entities now listed
● Inclusion standards currently restricted

Phase II
● Building structured data on these entities
● “Phase 2.1” - harvesting data from Wikipedia
  – and supplemented from other sources
● “Phase 2.2” - displaying data on Wikipedia
  – autogenerated information templates

Phase III
● Automatic creation of lists and charts
● Expected for late 2013...

Wikidata entities
● Single entity corresponding to one or more Wikipedia articles
  – Name (in various languages) + WP links
  – Contains various Phase II properties
  – Properties can include sources/qualifiers
● No support (yet!) for entities not existing in WP
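
The entity structure described on this slide can be pictured as a small data model. The following is only an illustrative sketch, not Wikidata's actual storage format: the class names `Item` and `Statement`, their fields, and the helper methods are assumptions made for this example, and the item and property IDs are used purely as sample values.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Statement:
    """One property value on an item, with optional qualifiers and sources."""
    property_id: str                      # sample ID, used for illustration only
    value: str
    qualifiers: Dict[str, str] = field(default_factory=dict)
    sources: List[str] = field(default_factory=list)


@dataclass
class Item:
    """A single entity that may correspond to articles on many Wikipedias."""
    item_id: str
    labels: Dict[str, str] = field(default_factory=dict)      # language code -> name
    sitelinks: Dict[str, str] = field(default_factory=dict)   # wiki code -> article title
    statements: List[Statement] = field(default_factory=list)

    def label(self, lang: str, fallback: str = "en") -> Optional[str]:
        """Return the name in `lang`, falling back to another language if absent."""
        return self.labels.get(lang) or self.labels.get(fallback)

    def article(self, wiki: str) -> Optional[str]:
        """Return the linked article title on the given wiki, if there is one."""
        return self.sitelinks.get(wiki)


# Sample item: one entity, several language names, several article links.
germany = Item(
    item_id="Q183",
    labels={"en": "Germany", "de": "Deutschland", "fr": "Allemagne"},
    sitelinks={"enwiki": "Germany", "dewiki": "Deutschland"},
    statements=[Statement(property_id="P36", value="Q64", sources=["enwiki"])],
)
```

Here `germany.label("de")` gives the German name and `germany.article("frwiki")` gives `None`, reflecting that an entity need not have an article in every language.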

Phase II – initial properties
● Limited properties – gradual roll-out
● Single “main type”, but no restrictions on use
  – “the capital of Julius Caesar”
● Relational properties implemented
  – but no automatic reciprocity yet
● String datatypes created for identifiers
● 130 properties currently in use
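
Because relational properties are not yet reciprocated automatically, a reuser may want to detect one-sided relationships themselves. The sketch below shows one possible way to do that over plain (subject, property, object) triples. It is an assumption-laden illustration: the function name `missing_reciprocals`, the `inverse_of` mapping, and the placeholder item and property names are all hypothetical and are not part of Wikidata.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject item, property, object item)


def missing_reciprocals(statements: List[Triple],
                        inverse_of: Dict[str, str]) -> List[Triple]:
    """List the reverse statements that would be needed for full reciprocity."""
    existing = set(statements)
    missing: List[Triple] = []
    for subject, prop, obj in statements:
        inverse = inverse_of.get(prop)
        if inverse is None:
            continue  # this property has no declared inverse
        reciprocal = (obj, inverse, subject)
        if reciprocal not in existing and reciprocal not in missing:
            missing.append(reciprocal)
    return missing


# Placeholder data: item_A lists two children, but only item_B links back.
statements = [
    ("item_A", "child", "item_B"),
    ("item_B", "parent", "item_A"),
    ("item_A", "child", "item_C"),
]
inverse_of = {"child": "parent", "parent": "child"}

gaps = missing_reciprocals(statements, inverse_of)
```

With this sample data, `gaps` contains the single statement `("item_C", "parent", "item_A")`, which would have to be added by hand or by a bot.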

Phase II – future properties
● Properties created by community discussion
● Several awaiting datatypes:
  – time
  – geocoordinate
  – number (and dimension)
● Qualifiers yet to be added

Data reuse
● Permanent numeric identifier for all items
● API available (JSON)
  – but still being developed!
● Regular XML dumps – dumps.wikimedia.org
  – all item/property data licensed as CC-0
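
As a rough sketch of how the JSON API might be used, the example below builds a request URL for the `wbgetentities` action from an item's numeric identifier and extracts labels from a response. Since the API is still being developed, treat the details as assumptions: the helper names `entity_url` and `labels_from_response` are invented for this example, and `SAMPLE_RESPONSE` is a hand-written, heavily abbreviated stand-in for a real reply rather than captured output. No network request is made.

```python
import json
from urllib.parse import urlencode

API_ENDPOINT = "https://www.wikidata.org/w/api.php"


def entity_url(item_number, languages=("en",)):
    """Build a wbgetentities request URL for one item, e.g. 42 -> Q42."""
    params = {
        "action": "wbgetentities",
        "ids": f"Q{item_number}",
        "languages": "|".join(languages),
        "format": "json",
    }
    return f"{API_ENDPOINT}?{urlencode(params)}"


def labels_from_response(text):
    """Map each item ID in a JSON response to its {language: label} names."""
    data = json.loads(text)
    return {
        item_id: {
            lang: label["value"]
            for lang, label in entity.get("labels", {}).items()
        }
        for item_id, entity in data.get("entities", {}).items()
    }


# Hand-written, abbreviated sample of the assumed response shape.
SAMPLE_RESPONSE = """
{"entities": {"Q42": {"id": "Q42",
  "labels": {"en": {"language": "en", "value": "Douglas Adams"}}}}}
"""

url = entity_url(42)
labels = labels_from_response(SAMPLE_RESPONSE)
```

The resulting `url` can be fetched with any HTTP client; keeping URL construction and parsing separate from the network call makes both easy to check offline.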