SlideShare a Scribd company logo
1 of 44
Download to read offline
1
NoBias onboarding week March 2021
Biases in Social Media Research
Presenting: Miriam Fernandez
@miriam_fs
fernandezmiriam
@miriamfs
2
NoBias onboarding week March 2021
2 Before we start…
• 1.- This is an online talk…
– Hope you took the necessary precautions!
– PJs are allowed and higlighy recommended J
• 2.- It is an overview of biases and problems in
social media research
– If you were expecting something very complex this may not
be for you J
• 3.- I hate talking alone for long periods of time
– So get ready for questions and discussions at any point!
3
NoBias onboarding week March 2021
3
Understanding Social Media
4
NoBias onboarding week March 2021
Internet Users: 59.5% of the World’s Population
Source: https://datareportal.com/reports/digital-2021-global-overview-report
5
NoBias onboarding week March 2021
Social Media are 53.6% of the Global Population
Source: https://datareportal.com/reports/digital-2021-global-overview-report
6
NoBias onboarding week March 2021
After TV users concentrate most of their internet time
in using Social Media
Source: https://datareportal.com/reports/digital-2021-global-overview-report
7
NoBias onboarding week March 2021
The World’s Most-Used Social Platforms
Source: https://datareportal.com/reports/digital-2021-global-overview-report
8
NoBias onboarding week March 2021
8 Yes. If are not on Tick-Tock (like me), you are too old!
Image from :https://www.telegraph.co.uk/women/life/cant-one-bewildered-tiktok/
9
NoBias onboarding week March 2021
Full of Challenges
10
NoBias onboarding week March 2021
10
AI & Social Media
AI -> for Social Media
Social Media -> for AI
11
NoBias onboarding week March 2021
AI for Social Media
• Recommender Systems /
Personalisation Systems
– Suggest Information
– Suggest User Connections
– Suggest Events
– Suggest Products
• Search/Ranking Systems
– Provide Information
– Personalise Information
• NLP Systems
– ‘Understand’ text
– Extract knowledge
• Image Processing Systems
Social Media for AI
• Understand phenomena at scale
– Business/brand monitoring
– Political reactions
– Marketing
Decision Making
– Policy making
– Employability
• Address societal challenges
– Misinformation
– Hate
– Radicalisation
– Disaster management
– Child grooming
– Climate Change
12
NoBias onboarding week March 2021
12
AI -> for Social Media
(not the focus of this talk but..)
13
NoBias onboarding week March 2021
RS are affected by
popularity and
homogeneity
biases
Bellogin, P. Castells, I. Cantador,
Statistical biases in information retrieval
metrics for recommender systems,
Information Retrieval Journal 20 (6)
(2017) 606–634.[
D. Jannach, L. Lerche, I. Kamehkhosh,
M. Jugovac, What recommenders
recommend: an analysis of
recommendation biases and possible
countermeasures, User Modeling
andUser-Adapted Interaction 25 (5)
(2015) 427–491
14
NoBias onboarding week March 2021
Search/Ranking Biases
Search biases may influence
+ The local business that are found
+ The products that are bought
+ The candidates that are hired
+ The events that are attended
+ The dating / affective success
+ ….
Castillo, Carlos. "Fairness and transparency in ranking." ACM SIGIR Forum. Vol. 52. No. 2. New
York, NY, USA: ACM, 2019
15
NoBias onboarding week March 2021
Personalisation and Filtering
• What social media algorithms show and to whom?
lack of transparency and accountability
16
NoBias onboarding week March 2021
16
Social Media -> for AI
17
NoBias onboarding week March 2021
Targeting Societal Challenges by Analysing Social Media
Data
17
Social
Phenomena
18
NoBias onboarding week March 2021
Many studies seem to assume that social media
data, the methods used for its analysis, and the AI
applications created on top or it, are adequate,—
with little or no scrutiny
Olteanu, Alexandra, et al. "Social data: Biases, methodological pitfalls, and ethical
boundaries." Frontiers in Big Data 2 (2019): 13.
19
NoBias onboarding week March 2021
Issues when working with Social Data
20
NoBias onboarding week March 2021
20
Types of Biases
21
NoBias onboarding week March 2021
Population Biases
• Differences in demographics or other user characteristics between
a population of users represented in a dataset or platform and a
target population.
• E.g., can we really use social media to inform Policy Making? To
whom are we listening?
22
NoBias onboarding week March 2021
Population Biases
Can we investigate misinformation
spreading among senior citizens by
looking at tick-tock data?
https://shorensteincenter.org/information
-disorder-framework-for-research-and-
policymaking/
As a Policy Maker, can I really
understand issues affecting my
constituency if I don’t have geo-
located data?
23
NoBias onboarding week March 2021
Behavioural Biases
• Differences in user behaviour
across platforms or contexts, or
across users represented in
different datasets.
• Participatory Budgeting
Platforms. If users from certain
ideologies / socio-economical
backgrounds are more active
making and voting on proposals
-> urban inequality
Ivan Cantador, Maria E. Cortes-Cediel, Miriam Fernandez.
Exploiting Open Data to analyze discussion and controversy
in online citizen participation. Information Processing and
Management 2020.
24
NoBias onboarding week March 2021
Content Production Biases
• Behavioral biases that are expressed as lexical, syntactic, semantic, and
structural differences in the contents generated by users.
• English vs. Other languages. Higher amount of research / tools
produced for the English language -> unequal opportunity, particularly
for users of underrepresented languages. We also know and
understand less about the needs of those populations.
25
NoBias onboarding week March 2021
Linking Biases
• Behavioral biases that are expressed as differences in the
attributes of networks obtained from user connections,
interactions or activity.
Is this network really
representative of the
general US
population?
Careful with
homophily!
26
NoBias onboarding week March 2021
Temporal Biases
• Differences in populations or behaviours over time.
• Classifiers or models developed today, may be biased to the
entities (Persons, Organisations, Geographical locations, …)
discussed today, and be ineffective to categorise / classify / filter
/predict tomorrow’s content
Missinformation Detection Systems
trained with data from 2020
27
NoBias onboarding week March 2021
27
Sources of Biases
28
NoBias onboarding week March 2021
Data Source
• “garbage in – garbage out” A system that receives the wrong data
often gets the wrong conclusions (e.g., misrepresentation of the
population)
• E.g., understand user behaviour (e.g., citizen participation,
misinformation spreading across users, etc.) without filtering
accounts from organisations, media outlets, bots, etc.
Psss, I’m not a human!
29
NoBias onboarding week March 2021
Data Collection / Verification
• Biases introduced due to the selection of data sources, or by the
way in which data from these sources are acquired and prepared
30
NoBias onboarding week March 2021
• Fernandez, Miriam and Alani, Harith (2019). Artificial Intelligence and Online Extremism: Challenges and
Opportunities. In: McDaniel, John L.M. and Pease, Ken eds. Predictive Policing and Artificial Intelligence.
Taylor & Francis. http://oro.open.ac.uk/69799/1/Fernandez_Alani_final_pdf.pdf
31
NoBias onboarding week March 2021
Data Processing: definitions
• Biases introduced by data processing operations such as cleaning,
enrichment, annotation, and aggregation.
• Same messages are
judged very differently
in different parts of the
world
• Highlights differences
and potential bias
Not-Radical
Radical
Tie
NA: North America
SA: South America
ME: Middle East
AS: Asia
EU: Europe
AF: Africa
32
NoBias onboarding week March 2021
Data Processing: disagreements
Mensio, Martino and Alani, Harith (2019). News Source Credibility in the Eyes of Different
Assessors. In: Conference for Truth and Trust Online, 4-5 Oct 2019, London, UK, (In Press)
http://oro.open.ac.uk/62771/1/TTO2019_credibility.pdf
33
NoBias onboarding week March 2021
Data Processing: data gaps
• Do not account for data gaps/ data imbalances
• To produce systems that are equally
secure to protect different groups against
online hate we need to account for
differences about how such hate is
manifested across groups
Farrell Tracie, Fernandez Miriam, Novotny Jakub and Alani Harith (2019). Exploring
Misogyny across the Manosphere in Reddit. 10th ACM Conference on Web Science
34
NoBias onboarding week March 2021
Data Analyses / Usage
• Studying social phenomena without a control group, or with a
“wrong” control group
vs. vs.
Radicalisation Detection Algorithm
Radical User:
uses
radicalisation
terminology
“General User”:
talks about cats
and other things
Researchers, media agencies,
journalists, political figures,
religious non-radical individuals:
use radicalisation terminology
Radical Non Radical Radical
False Positives
35
NoBias onboarding week March 2021
Data Analyses / Usage
• Lack of robustness against over-time changes
Tweets
Conceptual.
Semantics.
Extraction
DBpedia
Semantic.Graph.
Representation
Frequent.Semantic.
Subgraph.Mining
Classifier.Training
Pipeline of detecting pro-ISIS stances using semantic sub-graph mining-based feature extraction
Extract and use the semantic interdependencies and relations between
ISIS
Syria
Jihadist Group
Country
(Military Intervention Against ISIL, place, Syria)
Entities Concepts Semantic Relations
Saif, Hassan, et al. "On the role of semantics for detecting pro-isis stances on social media."
36
NoBias onboarding week March 2021
Data Analyses / Usage
Lack of robustness against new
types of events
Burel, Grégoire, et al. "On semantics
and deep learning for event detection
in crisis situations." (2017).
37
NoBias onboarding week March 2021
Data Analysis/Usage
Network & propagation patterns
Information source
Content Text/images/videos
Context Lists of
misleading sites
specific features
(hashtags, mentions)
Misinformation?
Partial view of the problem/
available data
Fernandez, Miriam, and Harith
Alani. "Online misinformation:
Challenges and future
directions." The Web Conference
2018. 2018.
38
NoBias onboarding week March 2021
Data Analysis / Usage
• Bias towards certain research fields/methodologies
• Historical/contextual approaches
• Rich description of communities
• Qualitative attempts to characterise
the phenomena
• Exacerbating factors, both social and
technological
• Impacts on society and culture
• Small number of researchers/ data
• Mostly qualitative
• Observational studies
• Automatic detection and
categorisation
• Preference for certain
platforms
• Less attention to sociology/
psychology models and domain
knowledge
• Bias to time snapshots
39
NoBias onboarding week March 2021
Roots of Radicalisation & Radicalisation Influence
Micro or
Individual roots
Macro or
Global roots
Meso or
Group roots
= Radicalisation
Influence
Fernandez, Miriam, Moizzah Asif, and Harith
Alani. "Understanding the roots of
radicalisation on twitter." Proceedings of the
10th ACM Conference on Web Science. 2018.
40
NoBias onboarding week March 2021
Olson, L. N., Daggs, J. L., Ellevold, B. L.
and Rogers, T. K. K. (2007),
Entrapping the Innocent: Toward a
Theory of Child Sexual Predators’
Luring Communication.
Communication Theory, 17
Child Grooming
Grooming Trust
Development
Physical
Approach
other
Yes
No
No
No
Cano, Amparo et al "Detecting child grooming behaviour patterns on social media.” 2014
41
NoBias onboarding week March 2021
Data Analysis/Usage
• Bias towards the obtained results (classification performance is
not always enough, particularly when humans are involved!)
Simply presenting people with corrective information is likely to fail in changing
their salient beliefs and opinions, or may, even, reinforce them
Provide an
explanation
rather than a
simple refute
Expose the user
to related but
disconfirming
stories
Revealing the
demographic
similarity of the
opposing group
Expose the
users to “small
doses” of
misinformation
Combatting
misinformation
Facts
Early detection of
malicious accounts
Use of ranking and
selection strategies
based on corrective
information
42
NoBias onboarding week March 2021
Data Presentation/Explanation of Results
• Bias towards expert users
43
NoBias onboarding week March 2021
Evaluation and Interpretation
• The choice of metrics shapes a research study
– Even if a metric indicates good overall performance on a classification task, it is
hard to know what that implies, as errors may be concentrated in one
particular class or group of classes
• False positives and false negatives should not always weight the
same!
• Negative results are often overlooked
• Big problems with data sharing and reproducibility
44
NoBias onboarding week March 2021
44
Biases on Social Media Research
Miriam Fernandez
Knowledge Media Institute
Open University, UK
@miriam_fs
@miriamfs
Credit to all these fantastic people!

More Related Content

What's hot

How does fakenews spread understanding pathways of disinformation spread thro...
How does fakenews spread understanding pathways of disinformation spread thro...How does fakenews spread understanding pathways of disinformation spread thro...
How does fakenews spread understanding pathways of disinformation spread thro...Araz Taeihagh
 
Using social networks in reputation management A study on the governmental or...
Using social networks in reputation management A study on the governmental or...Using social networks in reputation management A study on the governmental or...
Using social networks in reputation management A study on the governmental or...İtibar Yönetimi Enstitüsü
 
Social media? It's serious! Understanding the dark side of social media
Social media? It's serious! Understanding the dark side of social mediaSocial media? It's serious! Understanding the dark side of social media
Social media? It's serious! Understanding the dark side of social mediaIan McCarthy
 
Literature Review of Information Behaviour on Social Media
Literature Review of Information Behaviour on Social MediaLiterature Review of Information Behaviour on Social Media
Literature Review of Information Behaviour on Social MediaDavid Thompson
 
Social media mining hicss 46 part 1
Social media mining   hicss 46 part 1Social media mining   hicss 46 part 1
Social media mining hicss 46 part 1Dave King
 
Dialogue-Earth:-Mining-Social-Media
Dialogue-Earth:-Mining-Social-MediaDialogue-Earth:-Mining-Social-Media
Dialogue-Earth:-Mining-Social-MediaTom Masterman
 
Newsout: 30 examples of government transparency
Newsout: 30 examples of government transparencyNewsout: 30 examples of government transparency
Newsout: 30 examples of government transparencyBill Densmore
 
Mass media and communication
Mass media and communication Mass media and communication
Mass media and communication NiraliMakvana1
 
WIA 2015 Executive Summary
WIA 2015 Executive SummaryWIA 2015 Executive Summary
WIA 2015 Executive SummaryEvan Beck
 
Do your employees think your slogan is “fake news?” A framework for understan...
Do your employees think your slogan is “fake news?” A framework for understan...Do your employees think your slogan is “fake news?” A framework for understan...
Do your employees think your slogan is “fake news?” A framework for understan...Ian McCarthy
 
An Online Social Network for Emergency Management
An Online Social Network for Emergency ManagementAn Online Social Network for Emergency Management
An Online Social Network for Emergency ManagementConnie White
 
Social Media, Crisis Communication and Emergency Management: Leveraging Web 2...
Social Media, Crisis Communication and Emergency Management: Leveraging Web 2...Social Media, Crisis Communication and Emergency Management: Leveraging Web 2...
Social Media, Crisis Communication and Emergency Management: Leveraging Web 2...Connie White
 
Leveraging social capital in university-industry knowledge transfer strategie...
Leveraging social capital in university-industry knowledge transfer strategie...Leveraging social capital in university-industry knowledge transfer strategie...
Leveraging social capital in university-industry knowledge transfer strategie...Ian McCarthy
 
Social Media for the Government
Social Media for the GovernmentSocial Media for the Government
Social Media for the GovernmentKady Chiu
 
Teaching data journalism (Abraji 2021)
Teaching data journalism (Abraji 2021)Teaching data journalism (Abraji 2021)
Teaching data journalism (Abraji 2021)Paul Bradshaw
 
Misinformation, Disinformation, Malinformation, fake news and libraries
Misinformation, Disinformation, Malinformation, fake news and librariesMisinformation, Disinformation, Malinformation, fake news and libraries
Misinformation, Disinformation, Malinformation, fake news and librariesDr Trivedi
 
Journalism fake news disinformation
Journalism fake news disinformationJournalism fake news disinformation
Journalism fake news disinformationVittorio Pasteris
 

What's hot (20)

How does fakenews spread understanding pathways of disinformation spread thro...
How does fakenews spread understanding pathways of disinformation spread thro...How does fakenews spread understanding pathways of disinformation spread thro...
How does fakenews spread understanding pathways of disinformation spread thro...
 
Using social networks in reputation management A study on the governmental or...
Using social networks in reputation management A study on the governmental or...Using social networks in reputation management A study on the governmental or...
Using social networks in reputation management A study on the governmental or...
 
Social media? It's serious! Understanding the dark side of social media
Social media? It's serious! Understanding the dark side of social mediaSocial media? It's serious! Understanding the dark side of social media
Social media? It's serious! Understanding the dark side of social media
 
Literature Review of Information Behaviour on Social Media
Literature Review of Information Behaviour on Social MediaLiterature Review of Information Behaviour on Social Media
Literature Review of Information Behaviour on Social Media
 
Social media mining hicss 46 part 1
Social media mining   hicss 46 part 1Social media mining   hicss 46 part 1
Social media mining hicss 46 part 1
 
Dialogue-Earth:-Mining-Social-Media
Dialogue-Earth:-Mining-Social-MediaDialogue-Earth:-Mining-Social-Media
Dialogue-Earth:-Mining-Social-Media
 
Society Meets Social Media at reyerson-2015
Society Meets Social Media at reyerson-2015Society Meets Social Media at reyerson-2015
Society Meets Social Media at reyerson-2015
 
Newsout: 30 examples of government transparency
Newsout: 30 examples of government transparencyNewsout: 30 examples of government transparency
Newsout: 30 examples of government transparency
 
8th Milestones meeting: Cyber violence roundtable
8th Milestones meeting: Cyber violence roundtable8th Milestones meeting: Cyber violence roundtable
8th Milestones meeting: Cyber violence roundtable
 
Order 32740459
Order 32740459Order 32740459
Order 32740459
 
Mass media and communication
Mass media and communication Mass media and communication
Mass media and communication
 
WIA 2015 Executive Summary
WIA 2015 Executive SummaryWIA 2015 Executive Summary
WIA 2015 Executive Summary
 
Do your employees think your slogan is “fake news?” A framework for understan...
Do your employees think your slogan is “fake news?” A framework for understan...Do your employees think your slogan is “fake news?” A framework for understan...
Do your employees think your slogan is “fake news?” A framework for understan...
 
An Online Social Network for Emergency Management
An Online Social Network for Emergency ManagementAn Online Social Network for Emergency Management
An Online Social Network for Emergency Management
 
Social Media, Crisis Communication and Emergency Management: Leveraging Web 2...
Social Media, Crisis Communication and Emergency Management: Leveraging Web 2...Social Media, Crisis Communication and Emergency Management: Leveraging Web 2...
Social Media, Crisis Communication and Emergency Management: Leveraging Web 2...
 
Leveraging social capital in university-industry knowledge transfer strategie...
Leveraging social capital in university-industry knowledge transfer strategie...Leveraging social capital in university-industry knowledge transfer strategie...
Leveraging social capital in university-industry knowledge transfer strategie...
 
Social Media for the Government
Social Media for the GovernmentSocial Media for the Government
Social Media for the Government
 
Teaching data journalism (Abraji 2021)
Teaching data journalism (Abraji 2021)Teaching data journalism (Abraji 2021)
Teaching data journalism (Abraji 2021)
 
Misinformation, Disinformation, Malinformation, fake news and libraries
Misinformation, Disinformation, Malinformation, fake news and librariesMisinformation, Disinformation, Malinformation, fake news and libraries
Misinformation, Disinformation, Malinformation, fake news and libraries
 
Journalism fake news disinformation
Journalism fake news disinformationJournalism fake news disinformation
Journalism fake news disinformation
 

Similar to Biases in Social Media Research (NoBias EU project)

Researching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media AnalysisResearching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media AnalysisFarida Vis
 
Big Data for International Development
Big Data for International DevelopmentBig Data for International Development
Big Data for International DevelopmentAlex Rascanu
 
Big Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBig Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBYTE Project
 
Using Data for Informed Decision Making
Using Data for Informed Decision MakingUsing Data for Informed Decision Making
Using Data for Informed Decision MakingINGovConf
 
Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...Mike Kujawski
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Nicola Osborne
 
Why aren't Evaluators using Digital Media Analytics?
Why aren't Evaluators using Digital Media Analytics?Why aren't Evaluators using Digital Media Analytics?
Why aren't Evaluators using Digital Media Analytics?CesToronto
 
The Challenges and Pitfalls of Aggregating Social Media Data
The Challenges and Pitfalls of Aggregating Social Media DataThe Challenges and Pitfalls of Aggregating Social Media Data
The Challenges and Pitfalls of Aggregating Social Media DataDataCards
 
Social Big Data For Service
Social Big Data For ServiceSocial Big Data For Service
Social Big Data For ServiceSaptarshi Ghosh
 
The State of Social Media in Federal Government - April 2012
The State of Social Media in Federal Government - April 2012The State of Social Media in Federal Government - April 2012
The State of Social Media in Federal Government - April 2012GovLoop
 
Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Amanda noonan
 
SOCIAL MEDIA MONITORING IN ELECTIONS
SOCIAL MEDIA MONITORING IN ELECTIONSSOCIAL MEDIA MONITORING IN ELECTIONS
SOCIAL MEDIA MONITORING IN ELECTIONSJamaity
 
humaniki User Research Report
humaniki User Research Report humaniki User Research Report
humaniki User Research Report Sejal Khatri
 
humaniki User Research Report
humaniki User Research Report humaniki User Research Report
humaniki User Research Report Sejal Khatri
 
Online Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsOnline Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsMiriam Fernandez
 
Exploring big ‘crisis’ data in action: potential positive and negative extern...
Exploring big ‘crisis’ data in action: potential positive and negative extern...Exploring big ‘crisis’ data in action: potential positive and negative extern...
Exploring big ‘crisis’ data in action: potential positive and negative extern...Trilateral Research
 
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...Galit Shmueli
 
Digital development and Online Gender-Based Violence
Digital development and Online Gender-Based ViolenceDigital development and Online Gender-Based Violence
Digital development and Online Gender-Based ViolenceAnand Sheombar
 

Similar to Biases in Social Media Research (NoBias EU project) (20)

Researching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media AnalysisResearching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media Analysis
 
Big Data for International Development
Big Data for International DevelopmentBig Data for International Development
Big Data for International Development
 
Big Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBig Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency Management
 
Using Data for Informed Decision Making
Using Data for Informed Decision MakingUsing Data for Informed Decision Making
Using Data for Informed Decision Making
 
Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...Practical Applications for Social Network Analysis in Public Sector Marketing...
Practical Applications for Social Network Analysis in Public Sector Marketing...
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...
 
Why aren't Evaluators using Digital Media Analytics?
Why aren't Evaluators using Digital Media Analytics?Why aren't Evaluators using Digital Media Analytics?
Why aren't Evaluators using Digital Media Analytics?
 
The Challenges and Pitfalls of Aggregating Social Media Data
The Challenges and Pitfalls of Aggregating Social Media DataThe Challenges and Pitfalls of Aggregating Social Media Data
The Challenges and Pitfalls of Aggregating Social Media Data
 
Social Big Data For Service
Social Big Data For ServiceSocial Big Data For Service
Social Big Data For Service
 
The State of Social Media in Federal Government - April 2012
The State of Social Media in Federal Government - April 2012The State of Social Media in Federal Government - April 2012
The State of Social Media in Federal Government - April 2012
 
Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015
 
SOCIAL MEDIA MONITORING IN ELECTIONS
SOCIAL MEDIA MONITORING IN ELECTIONSSOCIAL MEDIA MONITORING IN ELECTIONS
SOCIAL MEDIA MONITORING IN ELECTIONS
 
humaniki User Research Report
humaniki User Research Report humaniki User Research Report
humaniki User Research Report
 
Social media in social work spaces
Social media in social work spacesSocial media in social work spaces
Social media in social work spaces
 
humaniki User Research Report
humaniki User Research Report humaniki User Research Report
humaniki User Research Report
 
Online Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsOnline Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future Directions
 
Exploring big ‘crisis’ data in action: potential positive and negative extern...
Exploring big ‘crisis’ data in action: potential positive and negative extern...Exploring big ‘crisis’ data in action: potential positive and negative extern...
Exploring big ‘crisis’ data in action: potential positive and negative extern...
 
Delphi2 results (Cycle 2) and towards Delphi3
Delphi2 results (Cycle 2) and towards Delphi3Delphi2 results (Cycle 2) and towards Delphi3
Delphi2 results (Cycle 2) and towards Delphi3
 
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
 
Digital development and Online Gender-Based Violence
Digital development and Online Gender-Based ViolenceDigital development and Online Gender-Based Violence
Digital development and Online Gender-Based Violence
 

More from Miriam Fernandez

Mining Social Media Data For Policing
Mining Social Media Data For PolicingMining Social Media Data For Policing
Mining Social Media Data For PolicingMiriam Fernandez
 
Slides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxSlides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxMiriam Fernandez
 
Artificial Intelligence for Policing
Artificial Intelligence for PolicingArtificial Intelligence for Policing
Artificial Intelligence for PolicingMiriam Fernandez
 
OUSocial OUSocMed conference
OUSocial OUSocMed conference OUSocial OUSocMed conference
OUSocial OUSocMed conference Miriam Fernandez
 
On the use of social media for evidence-based policing
On the use of social media for evidence-based policingOn the use of social media for evidence-based policing
On the use of social media for evidence-based policingMiriam Fernandez
 
SocInfo2014 CityLabs Workshop
SocInfo2014 CityLabs WorkshopSocInfo2014 CityLabs Workshop
SocInfo2014 CityLabs WorkshopMiriam Fernandez
 
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...Miriam Fernandez
 
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookMiriam Fernandez
 
Wm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalWm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalMiriam Fernandez
 
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Miriam Fernandez
 

More from Miriam Fernandez (15)

Mining Social Media Data For Policing
Mining Social Media Data For PolicingMining Social Media Data For Policing
Mining Social Media Data For Policing
 
Slides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxSlides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptx
 
Artificial Intelligence for Policing
Artificial Intelligence for PolicingArtificial Intelligence for Policing
Artificial Intelligence for Policing
 
OUSocial OUSocMed conference
OUSocial OUSocMed conference OUSocial OUSocMed conference
OUSocial OUSocMed conference
 
On the use of social media for evidence-based policing
On the use of social media for evidence-based policingOn the use of social media for evidence-based policing
On the use of social media for evidence-based policing
 
SocInfo2014 CityLabs Workshop
SocInfo2014 CityLabs WorkshopSocInfo2014 CityLabs Workshop
SocInfo2014 CityLabs Workshop
 
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
 
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
 
ESWC 2014 Tutorial Part 4
ESWC 2014 Tutorial Part 4ESWC 2014 Tutorial Part 4
ESWC 2014 Tutorial Part 4
 
ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3
 
ESWC 2014 Tutorial part 2
ESWC 2014 Tutorial part 2ESWC 2014 Tutorial part 2
ESWC 2014 Tutorial part 2
 
ESWC 2014 Tutorial part 1
ESWC 2014 Tutorial part 1ESWC 2014 Tutorial part 1
ESWC 2014 Tutorial part 1
 
Wm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalWm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-final
 
CAEPIA 2011
CAEPIA 2011CAEPIA 2011
CAEPIA 2011
 
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
 

Recently uploaded

Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 

Recently uploaded (20)

Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 

Biases in Social Media Research (NoBias EU project)

  • 1. 1 NoBias onboarding week March 2021 Biases in Social Media Research Presenting: Miriam Fernandez @miriam_fs fernandezmiriam @miriamfs
  • 2. 2 NoBias onboarding week March 2021 2 Before we start… • 1.- This is an online talk… – Hope you took the necessary precautions! – PJs are allowed and higlighy recommended J • 2.- It is an overview of biases and problems in social media research – If you were expecting something very complex this may not be for you J • 3.- I hate talking alone for long periods of time – So get ready for questions and discussions at any point!
  • 3. 3 NoBias onboarding week March 2021 3 Understanding Social Media
  • 4. 4 NoBias onboarding week March 2021 Internet Users: 59.5% of the World’s Population Source: https://datareportal.com/reports/digital-2021-global-overview-report
  • 5. 5 NoBias onboarding week March 2021 Social Media are 53.6% of the Global Population Source: https://datareportal.com/reports/digital-2021-global-overview-report
  • 6. 6 NoBias onboarding week March 2021 After TV users concentrate most of their internet time in using Social Media Source: https://datareportal.com/reports/digital-2021-global-overview-report
  • 7. 7 NoBias onboarding week March 2021 The World’s Most-Used Social Platforms Source: https://datareportal.com/reports/digital-2021-global-overview-report
  • 8. 8 NoBias onboarding week March 2021 8 Yes. If are not on Tick-Tock (like me), you are too old! Image from :https://www.telegraph.co.uk/women/life/cant-one-bewildered-tiktok/
  • 9. 9 NoBias onboarding week March 2021 Full of Challenges
  • 10. 10 NoBias onboarding week March 2021 10 AI & Social Media AI -> for Social Media Social Media -> for AI
  • 11. 11 NoBias onboarding week March 2021 AI for Social Media • Recommender Systems / Personalisation Systems – Suggest Information – Suggest User Connections – Suggest Events – Suggest Products • Search/Ranking Systems – Provide Information – Personalise Information • NLP Systems – ‘Understand’ text – Extract knowledge • Image Processing Systems Social Media for AI • Understand phenomena at scale – Business/brand monitoring – Political reactions – Marketing Decision Making – Policy making – Employability • Address societal challenges – Misinformation – Hate – Radicalisation – Disaster management – Child grooming – Climate Change
  • 12. 12 NoBias onboarding week March 2021 12 AI -> for Social Media (not the focus of this talk but..)
  • 13. 13 NoBias onboarding week March 2021 RS are affected by popularity and homogeneity biases Bellogin, P. Castells, I. Cantador, Statistical biases in information retrieval metrics for recommender systems, Information Retrieval Journal 20 (6) (2017) 606–634.[ D. Jannach, L. Lerche, I. Kamehkhosh, M. Jugovac, What recommenders recommend: an analysis of recommendation biases and possible countermeasures, User Modeling andUser-Adapted Interaction 25 (5) (2015) 427–491
  • 14. 14 NoBias onboarding week March 2021 Search/Ranking Biases Search biases may influence + The local business that are found + The products that are bought + The candidates that are hired + The events that are attended + The dating / affective success + …. Castillo, Carlos. "Fairness and transparency in ranking." ACM SIGIR Forum. Vol. 52. No. 2. New York, NY, USA: ACM, 2019
  • 15. 15 NoBias onboarding week March 2021 Personalisation and Filtering • What social media algorithms show and to whom? lack of transparency and accountability
  • 16. 16 NoBias onboarding week March 2021 16 Social Media -> for AI
  • 17. 17 NoBias onboarding week March 2021 Targeting Societal Challenges by Analysing Social Media Data 17 Social Phenomena
  • 18. 18 NoBias onboarding week March 2021 Many studies seem to assume that social media data, the methods used for its analysis, and the AI applications created on top or it, are adequate,— with little or no scrutiny Olteanu, Alexandra, et al. "Social data: Biases, methodological pitfalls, and ethical boundaries." Frontiers in Big Data 2 (2019): 13.
  • 19. 19 NoBias onboarding week March 2021 Issues when working with Social Data
  • 20. 20 NoBias onboarding week March 2021 20 Types of Biases
  • 21. 21 NoBias onboarding week March 2021 Population Biases • Differences in demographics or other user characteristics between a population of users represented in a dataset or platform and a target population. • E.g., can we really use social media to inform Policy Making? To whom are we listening?
  • 22. 22 NoBias onboarding week March 2021 Population Biases Can we investigate misinformation spreading among senior citizens by looking at tick-tock data? https://shorensteincenter.org/information -disorder-framework-for-research-and- policymaking/ As a Policy Maker, can I really understand issues affecting my constituency if I don’t have geo- located data?
  • 23. 23 NoBias onboarding week March 2021 Behavioural Biases • Differences in user behaviour across platforms or contexts, or across users represented in different datasets. • Participatory Budgeting Platforms. If users from certain ideologies / socio-economical backgrounds are more active making and voting on proposals -> urban inequality Ivan Cantador, Maria E. Cortes-Cediel, Miriam Fernandez. Exploiting Open Data to analyze discussion and controversy in online citizen participation. Information Processing and Management 2020.
  • 24. 24 NoBias onboarding week March 2021 Content Production Biases • Behavioral biases that are expressed as lexical, syntactic, semantic, and structural differences in the contents generated by users. • English vs. Other languages. Higher amount of research / tools produced for the English language -> unequal opportunity, particularly for users of underrepresented languages. We also know and understand less about the needs of those populations.
  • 25. 25 NoBias onboarding week March 2021 Linking Biases • Behavioral biases that are expressed as differences in the attributes of networks obtained from user connections, interactions or activity. Is this network really representative of the general US population? Careful with homophily!
  • 26. 26 NoBias onboarding week March 2021 Temporal Biases • Differences in populations or behaviours over time. • Classifiers or models developed today, may be biased to the entities (Persons, Organisations, Geographical locations, …) discussed today, and be ineffective to categorise / classify / filter /predict tomorrow’s content Missinformation Detection Systems trained with data from 2020
  • 27. 27 NoBias onboarding week March 2021 27 Sources of Biases
  • 28. 28 NoBias onboarding week March 2021 Data Source • “garbage in – garbage out” A system that receives the wrong data often gets the wrong conclusions (e.g., misrepresentation of the population) • E.g., understand user behaviour (e.g., citizen participation, misinformation spreading across users, etc.) without filtering accounts from organisations, media outlets, bots, etc. Psss, I’m not a human!
  • 29. 29 NoBias onboarding week March 2021 Data Collection / Verification • Biases introduced due to the selection of data sources, or by the way in which data from these sources are acquired and prepared
  • 30. 30 NoBias onboarding week March 2021 • Fernandez, Miriam and Alani, Harith (2019). Artificial Intelligence and Online Extremism: Challenges and Opportunities. In: McDaniel, John L.M. and Pease, Ken eds. Predictive Policing and Artificial Intelligence. Taylor & Francis. http://oro.open.ac.uk/69799/1/Fernandez_Alani_final_pdf.pdf
  • 31. 31 NoBias onboarding week March 2021 Data Processing: definitions • Biases introduced by data processing operations such as cleaning, enrichment, annotation, and aggregation. • Same messages are judged very differently in different parts of the world • Highlights differences and potential bias Not-Radical Radical Tie NA: North America SA: South America ME: Middle East AS: Asia EU: Europe AF: Africa
  • 32. 32 NoBias onboarding week March 2021 Data Processing: disagreements Mensio, Martino and Alani, Harith (2019). News Source Credibility in the Eyes of Different Assessors. In: Conference for Truth and Trust Online, 4-5 Oct 2019, London, UK, (In Press) http://oro.open.ac.uk/62771/1/TTO2019_credibility.pdf
  • 33. 33 NoBias onboarding week March 2021 Data Processing: data gaps • Do not account for data gaps/ data imbalances • To produce systems that are equally secure to protect different groups against online hate we need to account for differences about how such hate is manifested across groups Farrell Tracie, Fernandez Miriam, Novotny Jakub and Alani Harith (2019). Exploring Misogyny across the Manosphere in Reddit. 10th ACM Conference on Web Science
  • 34. 34 NoBias onboarding week March 2021 Data Analyses / Usage • Studying social phenomena without a control group, or with a “wrong” control group vs. vs. Radicalisation Detection Algorithm Radical User: uses radicalisation terminology “General User”: talks about cats and other things Researchers, media agencies, journalists, political figures, religious non-radical individuals: use radicalisation terminology Radical Non Radical Radical False Positives
  • 35. 35 NoBias onboarding week March 2021 Data Analyses / Usage • Lack of robustness against over-time changes Tweets Conceptual. Semantics. Extraction DBpedia Semantic.Graph. Representation Frequent.Semantic. Subgraph.Mining Classifier.Training Pipeline of detecting pro-ISIS stances using semantic sub-graph mining-based feature extraction Extract and use the semantic interdependencies and relations between ISIS Syria Jihadist Group Country (Military Intervention Against ISIL, place, Syria) Entities Concepts Semantic Relations Saif, Hassan, et al. "On the role of semantics for detecting pro-isis stances on social media."
  • 36. 36 NoBias onboarding week March 2021 Data Analyses / Usage Lack of robustness against new types of events Burel, Grégoire, et al. "On semantics and deep learning for event detection in crisis situations." (2017).
  • 37. 37 NoBias onboarding week March 2021 Data Analysis/Usage Network & propagation patterns Information source Content Text/images/videos Context Lists of misleading sites specific features (hashtags, mentions) Misinformation? Partial view of the problem/ available data Fernandez, Miriam, and Harith Alani. "Online misinformation: Challenges and future directions." The Web Conference 2018. 2018.
  • 38. 38 NoBias onboarding week March 2021 Data Analysis / Usage • Bias towards certain research fields/methodologies • Historical/contextual approaches • Rich description of communities • Qualitative attempts to characterise the phenomena • Exacerbating factors, both social and technological • Impacts on society and culture • Small number of researchers/ data • Mostly qualitative • Observational studies • Automatic detection and categorisation • Preference for certain platforms • Less attention to sociology/ psychology models and domain knowledge • Bias to time snapshots
  • 39. 39 NoBias onboarding week March 2021 Roots of Radicalisation & Radicalisation Influence Micro or Individual roots Macro or Global roots Meso or Group roots = Radicalisation Influence Fernandez, Miriam, Moizzah Asif, and Harith Alani. "Understanding the roots of radicalisation on twitter." Proceedings of the 10th ACM Conference on Web Science. 2018.
  • 40. 40 NoBias onboarding week March 2021 Olson, L. N., Daggs, J. L., Ellevold, B. L. and Rogers, T. K. K. (2007), Entrapping the Innocent: Toward a Theory of Child Sexual Predators’ Luring Communication. Communication Theory, 17 Child Grooming Grooming Trust Development Physical Approach other Yes No No No Cano, Amparo et al "Detecting child grooming behaviour patterns on social media.” 2014
  • 41. 41 NoBias onboarding week March 2021 Data Analysis/Usage • Bias towards the obtained results (classification performance is not always enough, particularly when humans are involved!) Simply presenting people with corrective information is likely to fail in changing their salient beliefs and opinions, or may, even, reinforce them Provide an explanation rather than a simple refute Expose the user to related but disconfirming stories Revealing the demographic similarity of the opposing group Expose the users to “small doses” of misinformation Combatting misinformation Facts Early detection of malicious accounts Use of ranking and selection strategies based on corrective information
  • 42. 42 NoBias onboarding week March 2021 Data Presentation/Explanation of Results • Bias towards expert users
  • 43. 43 NoBias onboarding week March 2021 Evaluation and Interpretation • The choice of metrics shapes a research study – Even if a metric indicates good overall performance on a classification task, it is hard to know what that implies, as errors may be concentrated in one particular class or group of classes • False positives and false negatives should not always weight the same! • Negative results are often overlooked • Big problems with data sharing and reproducibility
  • 44. 44 NoBias onboarding week March 2021 44 Biases on Social Media Research Miriam Fernandez Knowledge Media Institute Open University, UK @miriam_fs @miriamfs Credit to all these fantastic people!