The project Economics in the face of the New Economy
financed within the Regional Initiative for Excellence programme of the Minister of Science and
Higher Education of Poland, years 2019-2022, grant no. 004/RID/2018/19, financing 3,000,000 PLN
Włodzimierz Lewoniewski
Krzysztof Węcel
Witold Abramowicz
Reliability in Time: Evaluating the Web Sources
of Information on COVID-19 in Wikipedia
across Various Language Editions
from the Beginning of the Pandemic
Reliability of sources on Wikipedia
• Wikipedia articles should be based on reliable sources
• Problem:
• There are over a billion websites on the Web
• Only a few developed language versions of Wikipedia contain lists of popular sources, and these lists are non-exhaustive
• The reliability of the same source may change over time and can depend on the topic and language version
Source: https://en.wikipedia.org/wiki/Wikipedia:Reliable_sources/Perennial_sources
Wikipedia references – complex extraction
Wiki markup (wiki code)
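
As a rough illustration of why extraction from wiki markup is non-trivial, the sketch below pulls URLs out of `<ref>…</ref>` tags with regular expressions. It is a deliberately simplified example, not the extraction method used in the study: the function name `extract_reference_urls` and both patterns are assumptions made for illustration, and they ignore self-closing named references, attribute values containing `/`, and references generated by templates outside `<ref>` tags.

```python
import re

# Matches an opening <ref ...> tag (not the self-closing <ref ... /> form)
# and captures everything up to the matching </ref>.
REF_PATTERN = re.compile(r"<ref[^>/]*>(.*?)</ref>", re.DOTALL | re.IGNORECASE)

# Matches a URL up to the first character that usually ends it in wiki markup.
URL_PATTERN = re.compile(r"https?://[^\s|\]}<>\"]+")


def extract_reference_urls(wikitext):
    """Return all URLs found inside <ref>...</ref> bodies, in order."""
    urls = []
    for ref_body in REF_PATTERN.findall(wikitext):
        urls.extend(URL_PATTERN.findall(ref_body))
    return urls


# Hypothetical fragment of wiki markup used only for this example.
sample = (
    "Text.<ref>{{cite web |url=https://www.who.int/news |title=News}}</ref> "
    'More.<ref name="a">[https://example.org/page Example]</ref> '
    'Again.<ref name="a" />'
)
reference_urls = extract_reference_urls(sample)
```

The self-closing `<ref name="a" />` reuses an earlier reference and contributes no new URL here; a real pipeline would need to decide whether such reuse counts as another use of the source.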
Topic modeling – Wikidata
[Figure: structure of a Wikidata item, with its label, item identifier, aliases, property, statement group, values, qualifiers, and linked Wikipedia articles]
Topic modeling – DBpedia
https://dbpedia.org/page/COVID-19_pandemic_in_Germany

Property            Value
dbp:location        Germany
dbo:confirmedCases  3675296
rdf:type            dbo:Outbreak
…                   …
Website address identification
URL address: https://kie.ue.poznan.pl/en/
• 2nd-level domain: poznan.pl
• 3rd-level domain (subdomain of poznan.pl): ue.poznan.pl
• 4th-level domain (subdomain of ue.poznan.pl): kie.ue.poznan.pl
• The main website for this URL address is determined according to the Public Suffix List
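
The domain levels and the "main website" lookup can be sketched as follows. This is a minimal illustration under stated assumptions: `SAMPLE_SUFFIXES` is a tiny hand-picked stand-in for the real Public Suffix List (it assumes `poznan.pl` is listed as a public suffix), and the function names `domain_levels` and `main_website` are hypothetical. Real code would load the full list and also handle its wildcard and exception rules.

```python
from urllib.parse import urlsplit

# Assumption: a tiny stand-in for the real Public Suffix List.
SAMPLE_SUFFIXES = {"pl", "poznan.pl", "org", "com"}


def domain_levels(url):
    """Map each domain level n >= 2 to the last n labels of the host name."""
    host = (urlsplit(url).hostname or "").lower()
    labels = host.split(".")
    return {n: ".".join(labels[-n:]) for n in range(2, len(labels) + 1)}


def main_website(url, suffixes=SAMPLE_SUFFIXES):
    """Return the longest matching public suffix plus one label to its left."""
    host = (urlsplit(url).hostname or "").lower()
    labels = host.split(".")
    # Candidates go from longest to shortest, so the first hit is the
    # longest matching suffix.
    for i in range(len(labels)):
        if ".".join(labels[i:]) in suffixes:
            if i == 0:
                return None  # the host itself is a public suffix
            return ".".join(labels[i - 1:])
    return None  # no known suffix matched


levels = domain_levels("https://kie.ue.poznan.pl/en/")
site = main_website("https://kie.ue.poznan.pl/en/")
```

Under these assumptions, the URL from the slide resolves to `ue.poznan.pl` as its main website, because `poznan.pl` is treated as a suffix under which organisations register their own names.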
Popularity and reliability models
• F-model: based on the frequency (F) of source use.
• PR-model: based on the cumulative page views (P) of each article in which the source appears, divided by the number of references (R) in that article.
• PR2-model: similar to PR, but only human page views are counted.
Source: Lewoniewski, W., Węcel, K., Abramowicz, W. (2020). Modeling Popularity and Reliability of Sources in Multilingual Wikipedia.
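
One possible reading of the three models is sketched below. It is an illustration, not the authors' implementation: the function `score_sources`, the input layout, and the sample numbers are all assumptions. In particular, the sketch assumes that F counts every reference that uses a source, and that PR and PR2 add P/R once per article in which the source appears.

```python
from collections import defaultdict


def score_sources(articles):
    """Compute F, PR and PR2 scores per source.

    Each article is a dict with:
      'sources'         - one website name per reference in the article
      'pageviews'       - all page views of the article (P)
      'human_pageviews' - page views from human users only (P for PR2)
    """
    f = defaultdict(int)
    pr = defaultdict(float)
    pr2 = defaultdict(float)
    for article in articles:
        refs = article["sources"]
        r = len(refs)  # number of references (R) in this article
        if r == 0:
            continue
        for source in refs:
            f[source] += 1  # F-model: frequency of source use
        for source in set(refs):
            pr[source] += article["pageviews"] / r
            pr2[source] += article["human_pageviews"] / r
    return dict(f), dict(pr), dict(pr2)


# Made-up sample data, for illustration only.
sample_articles = [
    {"sources": ["who.int", "cdc.gov", "who.int", "bbc.com"],
     "pageviews": 1000, "human_pageviews": 800},
    {"sources": ["who.int", "nytimes.com"],
     "pageviews": 200, "human_pageviews": 100},
]
f_scores, pr_scores, pr2_scores = score_sources(sample_articles)
```

With this sample, `who.int` has F = 3, PR = 1000/4 + 200/2 = 350, and PR2 = 800/4 + 100/2 = 250. Dividing by R spreads an article's page views across its references, so an article with only a few references passes more weight to each source it cites.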
English Wikipedia: top web sources on the COVID-19 pandemic
[Figure panels: F-model, PR2-model]
https://kie.ue.poznan.pl wlodzimierz.lewoniewski@ue.poznan.pl
Questions?
The publication is available on:
• Wiki Workshop 2022
• ResearchGate
• arXiv
