A Polyvocal and Contextualised Semantic Web


Marieke van Erp & Victor de Boer
ESWC 2021: Problems to solve before you die
What’s the problem?
Most knowledge graphs reflect the popular or majority vote


Most knowledge graphs are mined from contemporary sources


Most knowledge graphs represent a single perspective


This poses the danger of perpetuating certain views (e.g. gender-
biased, colonial-view…)
Why should I care?
Applications using single-perspective, contemporary
knowledge graphs cannot adequately deal with other
perspectives and/or diachronic knowledge


Problematic in for example:


Information retrieval and machine learning
What is a voice?
Hendrick Cornelis Vroom Een aantal Oostindiëvaarders voor de kust
Rijksmuseum SK-A-3108
Era of unbridled opportunity and
wealth in the Dutch Repulic
The Dutch maritime and military
prowess laid the foundations for
the first global multinational
corporation
The Dutch Golden Age shaped
Amsterdam and Dutch
architecture
Dutch scientific advancements
from this era were among the
most acclaimed in the world
What is a voice?
Hendrick Cornelis Vroom Een aantal Oostindiëvaarders voor de kust
Rijksmuseum SK-A-3108
Era of unbridled opportunity and
wealth in the Dutch Repulic
The Dutch maritime and military
prowess laid the foundations for
the first global multinational
corporation
The Dutch Golden Age shaped
Amsterdam and Dutch
architecture
Dutch scientific advancements
from this era were among the
most acclaimed in the world
What about the other side of the coin?


Slavery


Colonialism


….
What is a voice?
What is a voice?
perspective
perspective
perspective
perspective
perspective
perspective
perspective
perspective
perspective
perspective
perspective
perspective
VOICE
VOICE
2053 missionary objects from Africa
Loot
Gift
Idol
Fetish
Ancestor
PRESSING MATTER


OWNERSHIP, VALUE AND THE QUESTION OF COLONIAL HERITAGE IN MUSEUMS
Three challenges for polyvocal knowledge graphs
Identifying and acquiring polyvocal knowledge


Representation of polyvocality: datamodels and formalisms


Presentation and usage of polyvocal knowledge
Identifying and acquiring polyvocal knowledge


Identify existing voices in datasets


NLP/IE methods that are voice-aware


https:/
/www.create.humanities.uva.nl/education/unsilencing-the-archive/
M.Luthra and C. Jeurgens. Unsilencing the
VOC Testaments. DHBenelux 2021
Identifying and acquiring polyvocal knowledge


Metadata enrichment
and bias detection of
colonial architecture


Roz Sabir


Elicit information from polyvocal sources


Including through crowdsourcing


Methods that retain ‘disagreement’
Aroyo & Welty (2013) Crowd truth: Harnessing disagreement
in crowdsourcing a relation extraction gold standard
Data models


Formalisms


Design patterns


To represent disagreement on categorisation, provenance, etc.


Representation of polyvocality
Ockeloen, Niels, et al. "BiographyNet: Managing Provenance at Multiple Levels and from Different Perspectives." LISC@ ISWC. 2013.
Early work: Representing Traditional Knowledge
1 Where in the world do people practise
Yoni steaming?




2 How is the Sun dance ritual in the Hopi
tribe different from the Sun dance ritual
by the Cree people?




3 What traditional knowledge is practised
in Curacao?
Lois Hutubessy


Presentation and usage of polyvocal knowledge
How to visualise such polyvocal representations and frames
to a variety of end-users and source new enrichments from
these expert and non-expert users.


	
researchers


	
heritage professionals


	
general public


	
‘source communities’
Presentation and usage of polyvocal knowledge
Presentation and usage of polyvocal knowledge
Discussion
What we’re doing: The Cultural AI Lab


- Collaboration across 8 Dutch research and cultural heritage
institutions


- Interdisciplinary teams


- We don’t have all the answers


- We don’t have all the perspectives
Discussion
Where do we go from here?


Creating a polyvocal and contextualised Semantic Web is a community effort


It needs to be diverse and inclusive


It needs your input
https://cultural-ai.nl

A Polyvocal and Contextualised Semantic Web

  • 1.
    A Polyvocal andContextualised Semantic Web Marieke van Erp & Victor de Boer ESWC 2021: Problems to solve before you die
  • 2.
    What’s the problem? Mostknowledge graphs reflect the popular or majority vote Most knowledge graphs are mined from contemporary sources Most knowledge graphs represent a single perspective This poses the danger of perpetuating certain views (e.g. gender- biased, colonial-view…)
  • 3.
    Why should Icare? Applications using single-perspective, contemporary knowledge graphs cannot adequately deal with other perspectives and/or diachronic knowledge Problematic in for example: Information retrieval and machine learning
  • 4.
    What is avoice? Hendrick Cornelis Vroom Een aantal Oostindiëvaarders voor de kust Rijksmuseum SK-A-3108 Era of unbridled opportunity and wealth in the Dutch Repulic The Dutch maritime and military prowess laid the foundations for the first global multinational corporation The Dutch Golden Age shaped Amsterdam and Dutch architecture Dutch scientific advancements from this era were among the most acclaimed in the world
  • 5.
    What is avoice? Hendrick Cornelis Vroom Een aantal Oostindiëvaarders voor de kust Rijksmuseum SK-A-3108 Era of unbridled opportunity and wealth in the Dutch Repulic The Dutch maritime and military prowess laid the foundations for the first global multinational corporation The Dutch Golden Age shaped Amsterdam and Dutch architecture Dutch scientific advancements from this era were among the most acclaimed in the world What about the other side of the coin? Slavery Colonialism ….
  • 6.
    What is avoice?
  • 7.
    What is avoice? perspective perspective perspective perspective perspective perspective perspective perspective perspective perspective perspective perspective VOICE VOICE
  • 9.
    2053 missionary objectsfrom Africa Loot Gift Idol Fetish Ancestor PRESSING MATTER OWNERSHIP, VALUE AND THE QUESTION OF COLONIAL HERITAGE IN MUSEUMS
  • 10.
    Three challenges forpolyvocal knowledge graphs Identifying and acquiring polyvocal knowledge Representation of polyvocality: datamodels and formalisms Presentation and usage of polyvocal knowledge
  • 11.
    Identifying and acquiringpolyvocal knowledge Identify existing voices in datasets NLP/IE methods that are voice-aware https:/ /www.create.humanities.uva.nl/education/unsilencing-the-archive/ M.Luthra and C. Jeurgens. Unsilencing the VOC Testaments. DHBenelux 2021
  • 12.
    Identifying and acquiringpolyvocal knowledge Metadata enrichment and bias detection of colonial architecture 
 Roz Sabir 
 Elicit information from polyvocal sources Including through crowdsourcing Methods that retain ‘disagreement’ Aroyo & Welty (2013) Crowd truth: Harnessing disagreement in crowdsourcing a relation extraction gold standard
  • 13.
    Data models Formalisms Design patterns Torepresent disagreement on categorisation, provenance, etc. Representation of polyvocality
  • 14.
    Ockeloen, Niels, etal. "BiographyNet: Managing Provenance at Multiple Levels and from Different Perspectives." LISC@ ISWC. 2013.
  • 15.
    Early work: RepresentingTraditional Knowledge 1 Where in the world do people practise Yoni steaming? 
 2 How is the Sun dance ritual in the Hopi tribe different from the Sun dance ritual by the Cree people? 
 3 What traditional knowledge is practised in Curacao? Lois Hutubessy 

  • 16.
    Presentation and usageof polyvocal knowledge How to visualise such polyvocal representations and frames to a variety of end-users and source new enrichments from these expert and non-expert users. researchers heritage professionals general public ‘source communities’
  • 18.
    Presentation and usageof polyvocal knowledge
  • 19.
    Presentation and usageof polyvocal knowledge
  • 20.
    Discussion What we’re doing:The Cultural AI Lab - Collaboration across 8 Dutch research and cultural heritage institutions - Interdisciplinary teams - We don’t have all the answers - We don’t have all the perspectives
  • 21.
    Discussion Where do wego from here? Creating a polyvocal and contextualised Semantic Web is a community effort It needs to be diverse and inclusive It needs your input
  • 22.