4. DBpedia is a crowd-sourced community effort
to extract structured information from
Wikipedia and Wikidata
The DBpedia Community enriches the
extracted information with additional
taxonomies
Links: inbound / outbound / text
Data submission by DBpedians
DBpedia is … a fused, multi-domain, multilingual dataset
Ontology
6. DBpedia is … a public service and infrastructure
Main SPARQL Endpoint for querying with 99.3% uptime
source: http://sparqles.ai.wu.ac.at/endpoint?uri=http%3A%2F%2Fdbpedia.org%2Fsparql
Over 20 Language Chapter that volunteered to do
● local extraction and merging
● community building and linking
● hosting of local DBpedias and Spotlight
http://nl.dbpedia.org
http://nl.dbpedia.
org/resource/Den_Haag
8. DBpedia Association (non-profit)
● founded in 2014
● operational since January 2016
Furthermore:
● Markus Freudenberg, Robert Roessling, Jan Forberg
● Backed by AKSW research group and InfAI (over 50 dead-beat Linked Data
experts)
10. DBpedia Charter
● Public draft online: http://wiki.dbpedia.org/dbpedia-association
All data published by the DBpedia Association should be made available free of
charge under a license equivalent to CC-0 or CC-BY without further restriction
on commercial use and redistribution.
11. DBpedia Groups
Ontology Working Group
Communications Group - Facebook, Slack Discussion about communication channels
Internationalisation Group
DataID
(planned) Wikidata Group, Law?
- Ideas for groups? Association will help set up the group
13. Backlinking
https://github.com/dbpedia/links
- DBpedians can push their links to be loaded into the main endpoint
- (not yet published) Maven/Ant build system that updates, validates and
packages links
Please wait for the announcement and send a GitHub pull request
(Members can get help from the association for linking and submitting)
14. Wikidata
- Great source for DBpedia
- Mission is different from DBpedia (Collect core facts)
- Limited to notability
Integration, Enrichment
Quality Control
Increased Usefulness
15. Integration of sources
Two ways:
1. fully automated
a. automatic conversion
b. automatic error detection
2. highly-assisted workflows
a. semi-automatic suggestions
b. test-driven error detection of data errors
c. power tools
d. push to source
Mappings automation, validation & provenance
18. 99.3 % uptime
25 TB downloadable data
one DBpedia Ontology
decent linkage
decent data quality
Identifiers based on
Wikipedia/Wikidata
DBpedia Spotlight
DBpedia is … a very ambitious project
⇒ 99.99% uptime for all languages and chapters and spotlight,
scalable hosting via the cloud
⇒ 500 TB
⇒ many consistent domain sub-ontologies for each domain from
cars to gas turbines to amoeba to star systems to literature
⇒ highly inter- and backlinked, DBpedia can serve as an entry
point to find exactly the data (or knowledge) you need
⇒ improved testing and quality control
⇒ Identifiers based on all existing authoritative and robust
identifier providers… starting with company data
⇒ all languages, all domains, improved scalability
19. Funding
● current liquidity is around 10,000€
● some pre-financing via H2020 and German national funding
Services are stable, but lot’s of “what if’s”
● what if OpenLink stops hosting the main endpoint
● what opportunities do we miss without a properly oiled flagship
We need organisations to join and provide financial support, links and backlinks
and data.
20. Funding
Become a member, if …
● you profit from DBpedia in any way
● you wish to get more in touch with DBpedia
● the services mentioned on the flyer seem useful for your organisation
Include DBpedia in your proposals:
● deliverables
● subcontracting
● consortium member (either association or the national chapters)
21. Future Events
SEMANTiCS 2016 - The Linked Data Conference
Submission open: Industry talks / scientific paper
Sep 12-15 in Leipzig - http://semantics.cc
● Huge DBpedia meeting on 15th of September
22. Hope to see you in Leipzig
More information: http://dbpedia.org
Sign up: http://dbpedia.org/membership