Historical Knowledge Bases with Semantic MediaWiki
Talk at the conference "Transfer of Ideas in European Intellectual History: From Medieval Manuscripts to Interactive Online Content" at University of Innsbruck
Introduction
Managing partner atKM-A Knowledge
Management Associates
Active member of the Semantic MediaWiki
community ~ 20 years, board member of
MediaWiki Stakeholder’s Group
Knowledge graph/ wiki researcher at WU
Vienna
Knowledge Management lecturer at
university of applied sciences 2
2
• KM consulting
• KM training
• KM research
• open-source SMW stack
• professional hosting
Knowledge Graph
▪ Termwas made popular by Google
„Introducing the Knowledge Graph: things, not strings”
▪ Trivial definition: Ontology + Instances
https://blog.google/products/search/introducing-knowledge-graph-things-not/
actress woman
person birth date
???
is a
subclass of
has
has birthdate
Hedy Lamarr
Romy Schneider
Christiane Hörbiger
Paula Wessely
https://en.wikipedia.org/wiki/
Category:Actresses_from_Vienna
6.
A scientific definition(Paulheim 2016)
A knowledge graph
• mainly describes real world entities
and their interrelations, organized in a
graph,
• defines possible classes and relations
of entities in a schema,
• allows for potentially interrelating
arbitrary entities,
• covers various topical domains.
in MediaWiki
• real world entities = wiki pages
• classes = categories and relations of
entities = properties,
• interrelating entities = linking
• wiki topic
7.
Structures in MediaWiki
▪Formatted text (Headings, numerations, paragraphs, quotes)
▪ Templates
▪ Pages and subpages
▪ Namespaces
▪ Categories and subcategories
▪ Category „inflation“
▪ Manually curated lists
▪ No querying of data inside MediaWiki
8.
Knowledge Graphs andWikipedia
vs. custom KG
• extract structured information from Wikipedia and make
this information available on the Web
8
▪ free knowledge base that can be read and edited by
humans and machines alike… central storage for the data
that may be accessed by the client Wikipedias
▪ turns MediaWiki into a powerful and flexible knowledge
management system
▪ lets you store and query data within the wiki's pages
▪ a set of extensions for MediaWiki
9.
Semantic MediaWiki orWikibase?
https://www.mediawiki.org/wiki/Manual:Managing_data_in_MediaWiki
Semantic MediaWiki Wikibase
flexible data model data model of Wikidata
properties can be pre-defined or declared by annotating properties need to be pre-defined
properties (and datatypes) can be changed any time properties cannot be changed!
requires extensions for form-based input comes with a fixed, built-in edit interface
SPARQL only with external triplestore
internal query language (easier than SPARQL) no built-in querying of data
Website with text alongside data Data interface only (text properties possible)
9
Vienna History Wiki
▪City of Vienna
▪ https://www.geschichtewiki.wien.gv.at/
▪ German
▪ Editable by citizens, edits are checked by
the editorial team
▪ In operation since 2014
▪ Largest city wiki
People
Topographic
objects
Structures
Organizations
Events
Memorials
Maps
Terms
Other
SILVER Wiki: silver.kbr.be
▪Royal Library of
Belgium (KBR)
▪ Database of die-studies
for the Graeco-Roman
world. Estimate the
volume of ancient coin
production.
▪ Greek Overstrikes
Database: known
overstrikes for the
Greek world
Alcuin: Medieval texts
▪texts of the medieval magistri and scholarly authors
▪ real, doubtful, unreal texts
▪ University of Regensburg
▪ German
▪ unfortunatly, not live
▪ project info: www.alcuin.de
Authors
Manuscripts
Works
People
Editions
Publications
Series
Glossary
26.
REGEST: wiki.uibk.ac.at/regest/
▪ Universityof Innsbruck
▪ https://wiki.uibk.ac.at/regest/
▪ theological literature
translated from Greek to
Slavonic, starting from the
beginning of Slavonic literacy
in the 9th century until the
Ottoman conquest of the
Balkans in the 14th century.
SMW in historical/researchcontexts
• collaborative editing
of text
• version history
• unique identifiers
• open source, large
ecosystem
• API
• structured data
(Web database)
• form-based data entry
• internal querying
• data visualizations
• Semantic Web standards
• triplestore support