Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

8th DBpedia meeting / California 2016

507 views

Published on

DBpedia state of affairs presentation at 8th DBpedia meeting in Sunnyvale / Yahoo OCt 27th 2016

Published in: Technology
  • Be the first to comment

8th DBpedia meeting / California 2016

  1. 1. DBpedia state of affairs 8th DBpedia Meeting, Oct 27th 2016, Sunnyvale CA Dimitris Kontokostas, DBpedia Association & AKSW/KILT http://dbpedia.org
  2. 2. Contents - Technical updates - Overview & updates - What we’re busy with - Organizational updates - DBpedia Association - Support & funding
  3. 3. Contents - Technical updates - Overview - What we’re busy with - Organizational updates - DBpedia Association - Support & funding
  4. 4. 2016-04 Release (highlights) - 9.5 billion facts - 130 languages, Commons, Wikidata - Citations & cited facts (exp.) - Raw duplicate property cleanup - Wikidata-based view
  5. 5. Data Quality - 38% improvement in schema conformant data (2015-04 -> 2016-04) - Measured with RDFUnit - Major data cleanup processes - Mapping validation tool (RDFUnit & RML) ○ From 18h to 30s ○ Mapping refinements ○ Ontology cleanup
  6. 6. Data Access - Main SPARQL endpoint see here for details - Now 11M Hits/Day (09/16) - Fun fact: Hoteli Maestral one of our very high traffic users - DataID - DCAT / VOID ++ extension - Dynamic Download page - Statistics generation - Dockerization
  7. 7. NLP / NIF Corpus Provide NIF Corpus (text & links) with each release - Start with abstracts plan for whole page - Fact extraction challenges NIF: NLP Interchange Format (RDF vocabulary)
  8. 8. Other things we are busy with - Shortening time-to-release (~5 months) - Wikidata data transform & integration - Global (inter-wikipedia) pagerank (to be announced soon) - Github-based Backlinking (to be announced soon) - Mapping alignment (UPM) - Mapping discovery / inference & provenance
  9. 9. Integration of (other) sources (Immediate) Goals: - Unify data from all Wikimedia projects ○ Start by basing on Wikidata IDs ○ Mint our own IDs and go beyond Wikimedia - Company dataset (in progress) - Media, publishing, law & legislation (in discussion)
  10. 10. Contents - Technical updates - Overview & updates - What we’re busy with - Organizational updates - DBpedia Association - Support & funding
  11. 11. DBpedia Association (non-profit) - Founded in 2014 - Operational since January 2016 - Support from DBpedia founding members (Soeren, Chris & Kinsley) - Draft charter online http://wiki.dbpedia.org/dbpedia-association ○ All data published by the DBpedia Association should be made available free of charge under a license equivalent to CC-0 or CC-BY without further restriction on commercial use and redistribution.
  12. 12. Membership type Joined Applying Students 18 6 individual/Self-Employed 1 18 SME, Research Institute 1 2 Start-up/Small Research Group 4 Voluntary Payment Option for membership fee - Applicants can apply for reduction - Board approval needed - Fees are reduced to 20€ per year - Member can pay full fee voluntarily DBpedia Members
  13. 13. Association goals (for 2017) - Official language & topic chapters - E.g. Dutch, German, Media, Law & Legislation, ... - Better community synchronization - Improve services and secure funding
  14. 14. Funding (Why) http://wiki.dbpedia.org/why-is-dbpedia-so-important - Basic side-financing via H2020 and German national funding for - Core staff - Some event & PR - However: - no funding for hosting - no sustainable funding (bound to projects) - Services are stable, but lot’s of “what if’s” We need organisations to join and provide support, links, backlinks & data.
  15. 15. Funding (What) - Increase quality of hosting and data - Provide support for community issues - Better publicity and exploitation - Merge community contributions - Systematically develop & improve DBpedia’s public data, software and services
  16. 16. Funding Strategies 4 main strategies developed during the DBpedia board discussion: ● Public fundraising (donation campaigns) ● Direct fundraising (direct company contact) ● Membership fees ● Community/Project fundraising (i.e. H2020, COST, ITN) Would love to hear your ideas to make DBpedia self-sustainable
  17. 17. Thank you for your attention! Questions? (Available until Tuesday to meet & discuss / ping me!!!) More information: http://dbpedia.org Join as a member: http://dbpedia.org/membership Slide contributions by Julia Holze & Sebastian Hellmann

×