SlideShare a Scribd company logo
Automatically Identifying Islamophobia
in Social Media
Ted Pedersen
Department of Computer Science
University of Minnesota, Duluth
tpederse@umn.edu
@SeeTedTalk
http://www.d.umn.edu/~tpederse
Today’s Agenda
Islamophobia in general and in Minnesota
Collecting & Annotating Twitter Data
What We’ve Learned from Annotating
The Way Forward
Islamophobia
A legacy of colonial histories,
particularly those that view the
Muslim world as exotic,
savage, dangerous - all
leading to a “Clash of
Civlilizations.”
Orientalism (Said, 1978)
Islamophobia
A recent term for an older phenomena
Runnymede Trust (1997,
2017)
Unfounded hostility towards Islam
Practical consequences of such
hostility in unfair discrimination
against Muslim individuals and
communities
Exclusion of Muslims from mainstream
political and social affairs.
Common Anti-Muslim Tropes (Bridge Institute)
Islam and Muslims are inherently
violent.
Islam and Muslims are oppresive
to women.
Islam and Muslims are intolerant
towards other religions.
Islam is a political ideology, not
a religion.
In the West, Mulims are using
non-violent stealth jihad to
implement Sharia Law.
Islam is foreign, medieval, and
at odds with Western modernity.
Islam is a monolith.
All Muslims are Arab or Brown.
Many governments fueling Islamophobia
Meanwhile, back home in Minnesota …
New York Times, June 20, 2019
Sahan Journal, Sept 14, 2020
(Not True)
Goal : Identify Islamophobia in Social Media Text
Why?
Relatively understudied in NLP.
Highly intersectional problem since Muslim identity is multi-faceted.
Significant influence on events in the World, the USA, and Minnesota.
How?
Use ideas from NLP, especially Hate Speech Detection.
Create annotated corpora in order to understand problem better, and then
apply Machine Learning or Deep Learning.
Guiding Principles
This is not Just Another Classification Task. Seek out domain expertise, build
relationships, don’t reduce the problem to a data set.
Frey et al. (2018). Artificial Intelligence and Inclusion: Formerly Gang-Involved
Youth as Domain Experts for Analyzing Unstructured Twitter Data. Social
Science Computer Review.
Creating annotated data is likely necessary. Be careful to fully document the
decisions made along the way, paying special attention to annotator background.
Bender and Friedman (2018) Data Statements for NLP : Toward Mitigating
Systems Bias and Enabling Better Science. TACL.
How Can We Detect Islamophobia (with NLP)?
Carry out a Qualitative Analysis of text with input from domain experts.
Collect and annotate Tweets.
Seek out a diverse pool of annotators.
Develop annotation scheme / code book.
Be Iterative.
Carry out Quantitative Analysis using Machine Learning or Deep Learning.
Data Collection
Islamophobia is global, but has many local variations each with their own issues,
terminology, and ways of being expressed.
This suggests the need for the data to have a regional focus - Islamophobia in the
UK, France, India, the USA, Minnesota, etc.
While she is a national figure, Ilhan Omar is from Minnesota, and our data
collection starts with her.
Muslim, but also a black Somali woman who was an immigrant / refugee.
Highly intersectional identity.
Tweet Collection (using Twitter public API)
Collecting since April 2019, any tweet that includes one or more of :
‘Ilhan omar’, ilhan, omar, @ilhanmn, ilhanmn, #ilhanmn, #ilhanomar, #ilhan
Pilot Annotation based on April 2019 - April 2020, approx 5 million total tweets.
1020 Annotation based on Nov. 2019 - Oct. 2020, approx 10 million total tweets.
Twitter public API does not give you all tweets, downsamples.
1020 Annotation (October 2020)
9.6 million tweets (incl. RT) collected Nov 2019 - Oct 2020.
1 million unique tweets.
Selected random samples of 384 tweets for annotation.
Agreement improved with more extensive set of labels.
Began to consider profile descriptions of “speakers” (Tweeters).
1020 Annotation Labels
Neutral - apolitical or about someone
other than Ilhan Omar
Support - expresses support for position
or person of Ilhan Omar
Political - expression of political
difference of opinion with Ilhan Omar
Insult - personal insult directed at Ilhan
Omar not related to other labels
Immigration - Ilhan Omar has committed
fraud to remain in USA
Terrorist - Ilhan Omar is a terrorist or
supports them
Loyalty - Ilhan Omar is unAmerican,
disployal, or a traitor
Jail - Ilhan Omar should be prosecuted,
convicted, or incarcerated
Sharia - Ilhan Omar wants to replace US
law with Sharia Law
Adultery - Ilhan Omar is an adulterer or
married to her brother
Loyalty, Terrorism
Profile Description
Loyalty, Sharia, Insult
Profile Description
Most frequent 2 grams in (re)Tweeter profiles
#maga #kag (25416), trump supporter (18969), trump 2020 (14241), president
trump (13951), husband father (12562), pro life (11502), happily married (10383),
god family (9690), proud american (9281), god bless (9100), wife mother (8487),
lives matter (7699), love god (7609), wife mom (6833), #maga #trump2020 (6799),
maga kag (6195), jesus christ (6187), christian conservative (6103), #kag
#trump2020 (6096), family country (5749), business owner (5733), american
patriot (5055), bless america (4916), common sense (4672), #trump2020 #maga
(4478), black lives (4230), truth seeker (4138), conservative christian (4132),
father husband (3991), donald trump (3931), constitutional conservative (3908),
united states (3884), 2nd amendment (3841), mother grandmother (3811),
america great (3801), #maga #wwg1wga (3725), army veteran (3486), human
rights (3419), dog lover (3414), #wwg1wga #maga (3112), free speech (3044)
1 grams (muslim,islam,quran) in all 1020 Tweets
muslim (14,791), muslims (4,849), islamic (3,827),
islam (3,302,), islamist (1650), islamophobia (607),
islamists (600), quran (580), islamophobic (553),
congressmuslim (435)
2 grams
a muslim (2,446), muslim brotherhood (1,631), the
muslim (1,440), islamic terrorist (594), anti muslim
(591), radical muslim (512), muslims in (483), muslim
woman (477), radical islamic (427), islam is (412)
3 grams
the muslim brotherhood (624), is a muslim (488),
congressmuslim ilhan omar (376), as a muslim
(285), a muslim american (217), muslim ilhan omar
(197), muslim american trump (195), of the muslim
(192), a radical muslim (181), muslim anti
immigrant (181)
4 grams
as a muslim american (198),a muslim american trump
(195),muslim american trump admirer (191),ahmed as
a muslim (183),muslim anti immigrant anti (175),she is
a muslim (171),somali congressmuslim ilhan omar
(166),omar is a muslim (151),muslim brotherhood ilhan
omar (136),muslim refugee dalia al (119)
as a muslim american trump (193), a muslim american
trump admirer (191), muslim american trump admirer i
(187), ahmed as a muslim american (182), muslim anti
immigrant anti black (152), qanta ahmed as a muslim
(144), icg obama isis soros muslim (117), obama isis soros
muslim brotherhood (117), isis soros muslim brotherhood
ilhan (116),omar and the progressive islamist (115)
5 grams
Hashtags (muslim, islam) in all Tweets
#muslimbrotherhood (189), #muslim (155), #islam (127), #banislam (108),
#muslims (82), #islamic (72), #islamophobia (69), #islamist (39), #islamophobic
(20), #muslims4justice (17), #stopislam (14), #banmuslimbrotherhoods (12),
#islamicstate (11), #islamophobe (11), #islamofascist (11), #islamistheproblem
(10), #islamists (10),#muslimbrotherhood (10), #islamicterrorist (10),
#banthemuslimbrothethood (10), #radicalislamicterrorist (9), #muslimban (9),
#islamicterrorism (9), #banmuslimbrotherhood (8), #banmuslims (8),
#muslimprivilege (8), #scumoftheearthfilthymuslimdemonrats (7),
#islamicrepublicvirus (7), #dirtyfilthymuslimdemonrats (7), #muslimban (6),
#banmuslimsfromamerica (6), #banmuslimimmigration (6)
1020 Annotation Labels
Neutral - apolitical or about someone
other than Ilhan Omar
Support - expresses support for position
or person of Ilhan Omar
Political - expression of political
difference of opinion with Ilhan Omar
Insult - personal insult directed at Ilhan
Omar not related to other labels
Immigration - Ilhan Omar has committed
fraud to remain in USA
Terrorist - Ilhan Omar is a terrorist or
supports them
Loyalty - Ilhan Omar is unAmerican,
disployal, or a traitor
Jail - Ilhan Omar should be prosecuted,
convicted, or incarcerated
Sharia - Ilhan Omar wants to replace US
law with Sharia Law
Adultery - Ilhan Omar is an adulterer or
married to her brother
Lessons Learned
Impact of “lock her up” and “send her back” rhetoric clearly seen in annotation.
Annotation labels must be nuanced, can’t simply label as Islamophobic or not
since content may be based on gender, race, immigration or marital status,
political beliefs in addition to or instead of religion.
A highly visible or politicized personality attracts a lot of repetitive and viral content
based on most recent accusation or conspiracy.
Profile descriptions are an important clues.
Current Questions
Which public events are correlated with online Islamophobia?
What is the impact of Tweeter location and profile description?
How are less prominent public figures who are Muslim targeted?
Are political figures who are known to be Christian, Jewish, Hindu, and other
religions targeted to greater or lesser extents?
Can crowdsourcing be effective for more nuanced annotation problems?
Automatically Identifying Islamophobia
in Social Media
Ted Pedersen
Department of Computer Science
University of Minnesota, Duluth
tpederse@umn.edu
@SeeTedTalk
http://www.d.umn.edu/~tpederse

More Related Content

What's hot

GBarton-OpEd-TheMonthly-DamagedGoodsAsWeapons-Dec14
GBarton-OpEd-TheMonthly-DamagedGoodsAsWeapons-Dec14GBarton-OpEd-TheMonthly-DamagedGoodsAsWeapons-Dec14
GBarton-OpEd-TheMonthly-DamagedGoodsAsWeapons-Dec14
Greg Barton
 
Gangs And The Military 4of7
Gangs And The Military 4of7Gangs And The Military 4of7
Gangs And The Military 4of7
Carter F. Smith, J.D., Ph.D.
 
262_TeamBII_Letter_to_Congressional_Leaders_32011%5b1%5d
262_TeamBII_Letter_to_Congressional_Leaders_32011%5b1%5d262_TeamBII_Letter_to_Congressional_Leaders_32011%5b1%5d
262_TeamBII_Letter_to_Congressional_Leaders_32011%5b1%5d
dave lane
 
US Politics
US PoliticsUS Politics
US Politics
eminentoomph4388
 
What is Shariah?
What is Shariah?What is Shariah?
What is Shariah?
John Guandolo
 
Bloods Vs Crips
Bloods Vs CripsBloods Vs Crips
Bloods Vs Crips
guest7f8ab147
 
Gangs And The Military 5of7
Gangs And The Military 5of7Gangs And The Military 5of7
Gangs And The Military 5of7
Carter F. Smith, J.D., Ph.D.
 
Left-wing Support for Islamist Oppressors
Left-wing Support for Islamist OppressorsLeft-wing Support for Islamist Oppressors
Left-wing Support for Islamist Oppressors
John Guandolo
 
Iran In Context
Iran In ContextIran In Context
Iran In Context
Reza Firouzbakht
 
Gangs And The Military 3of7
Gangs And The Military 3of7Gangs And The Military 3of7
Gangs And The Military 3of7
Carter F. Smith, J.D., Ph.D.
 
The Salience of Sectarianism, Making Sect Stick in Syria and Iraq
The Salience of Sectarianism, Making Sect Stick in Syria and IraqThe Salience of Sectarianism, Making Sect Stick in Syria and Iraq
The Salience of Sectarianism, Making Sect Stick in Syria and Iraq
Craig Browne
 
Iran and the Bomb – Summary, Panel Discussion – Nov 18, 2013 – Elan Journo
Iran and the Bomb – Summary, Panel Discussion – Nov 18, 2013 – Elan JournoIran and the Bomb – Summary, Panel Discussion – Nov 18, 2013 – Elan Journo
Iran and the Bomb – Summary, Panel Discussion – Nov 18, 2013 – Elan Journo
cjhs
 
INTERSECTIONALITY
INTERSECTIONALITYINTERSECTIONALITY
INTERSECTIONALITY
tiffany_bednar
 
Persuasion of Hate
Persuasion of HatePersuasion of Hate
Persuasion of Hate
Jmclea01
 
2016 election already here for fringe hopefuls politico
2016 election already here for fringe hopefuls   politico2016 election already here for fringe hopefuls   politico
2016 election already here for fringe hopefuls politico
Temperance Lancecouncil
 
Term project presentation
Term project presentationTerm project presentation
Term project presentation
Laura Nicholas
 
Who arethetalibanrealcopy
Who arethetalibanrealcopyWho arethetalibanrealcopy
Who arethetalibanrealcopy
bubblehead160
 
Civil military relation
Civil military relationCivil military relation
Civil military relation
aungkokotoe
 
Obama Strategies; What Lies Beneath; Stupid or Subversive?
Obama Strategies; What Lies Beneath; Stupid or Subversive?Obama Strategies; What Lies Beneath; Stupid or Subversive?
Obama Strategies; What Lies Beneath; Stupid or Subversive?
Gerald Furnkranz
 
Northern Virginia’s ADAMS Center and Muslim Brotherhood Ties
Northern Virginia’s ADAMS Center and Muslim Brotherhood TiesNorthern Virginia’s ADAMS Center and Muslim Brotherhood Ties
Northern Virginia’s ADAMS Center and Muslim Brotherhood Ties
John Guandolo
 

What's hot (20)

GBarton-OpEd-TheMonthly-DamagedGoodsAsWeapons-Dec14
GBarton-OpEd-TheMonthly-DamagedGoodsAsWeapons-Dec14GBarton-OpEd-TheMonthly-DamagedGoodsAsWeapons-Dec14
GBarton-OpEd-TheMonthly-DamagedGoodsAsWeapons-Dec14
 
Gangs And The Military 4of7
Gangs And The Military 4of7Gangs And The Military 4of7
Gangs And The Military 4of7
 
262_TeamBII_Letter_to_Congressional_Leaders_32011%5b1%5d
262_TeamBII_Letter_to_Congressional_Leaders_32011%5b1%5d262_TeamBII_Letter_to_Congressional_Leaders_32011%5b1%5d
262_TeamBII_Letter_to_Congressional_Leaders_32011%5b1%5d
 
US Politics
US PoliticsUS Politics
US Politics
 
What is Shariah?
What is Shariah?What is Shariah?
What is Shariah?
 
Bloods Vs Crips
Bloods Vs CripsBloods Vs Crips
Bloods Vs Crips
 
Gangs And The Military 5of7
Gangs And The Military 5of7Gangs And The Military 5of7
Gangs And The Military 5of7
 
Left-wing Support for Islamist Oppressors
Left-wing Support for Islamist OppressorsLeft-wing Support for Islamist Oppressors
Left-wing Support for Islamist Oppressors
 
Iran In Context
Iran In ContextIran In Context
Iran In Context
 
Gangs And The Military 3of7
Gangs And The Military 3of7Gangs And The Military 3of7
Gangs And The Military 3of7
 
The Salience of Sectarianism, Making Sect Stick in Syria and Iraq
The Salience of Sectarianism, Making Sect Stick in Syria and IraqThe Salience of Sectarianism, Making Sect Stick in Syria and Iraq
The Salience of Sectarianism, Making Sect Stick in Syria and Iraq
 
Iran and the Bomb – Summary, Panel Discussion – Nov 18, 2013 – Elan Journo
Iran and the Bomb – Summary, Panel Discussion – Nov 18, 2013 – Elan JournoIran and the Bomb – Summary, Panel Discussion – Nov 18, 2013 – Elan Journo
Iran and the Bomb – Summary, Panel Discussion – Nov 18, 2013 – Elan Journo
 
INTERSECTIONALITY
INTERSECTIONALITYINTERSECTIONALITY
INTERSECTIONALITY
 
Persuasion of Hate
Persuasion of HatePersuasion of Hate
Persuasion of Hate
 
2016 election already here for fringe hopefuls politico
2016 election already here for fringe hopefuls   politico2016 election already here for fringe hopefuls   politico
2016 election already here for fringe hopefuls politico
 
Term project presentation
Term project presentationTerm project presentation
Term project presentation
 
Who arethetalibanrealcopy
Who arethetalibanrealcopyWho arethetalibanrealcopy
Who arethetalibanrealcopy
 
Civil military relation
Civil military relationCivil military relation
Civil military relation
 
Obama Strategies; What Lies Beneath; Stupid or Subversive?
Obama Strategies; What Lies Beneath; Stupid or Subversive?Obama Strategies; What Lies Beneath; Stupid or Subversive?
Obama Strategies; What Lies Beneath; Stupid or Subversive?
 
Northern Virginia’s ADAMS Center and Muslim Brotherhood Ties
Northern Virginia’s ADAMS Center and Muslim Brotherhood TiesNorthern Virginia’s ADAMS Center and Muslim Brotherhood Ties
Northern Virginia’s ADAMS Center and Muslim Brotherhood Ties
 

Similar to Muslims in Machine Learning workshop (NeurlPS 2021) - Automatically Identifying Islamophobia in Social Media

Automatically Identifying Islamophobia in Social Media
Automatically Identifying Islamophobia in Social MediaAutomatically Identifying Islamophobia in Social Media
Automatically Identifying Islamophobia in Social Media
University of Minnesota, Duluth
 
EXTREMISM A GENERAL CONCEPT
EXTREMISM A GENERAL CONCEPTEXTREMISM A GENERAL CONCEPT
EXTREMISM A GENERAL CONCEPT
maryam_arif
 
diference bitween islam and terrorism
diference bitween islam and terrorismdiference bitween islam and terrorism
diference bitween islam and terrorism
ouhida
 
Islam and terrorism
Islam and terrorismIslam and terrorism
Islam and terrorism
meriemu
 
Stereotyping, Muslim Stereotyping & Islamophobia by Abid Zafar
Stereotyping, Muslim Stereotyping & Islamophobia by Abid ZafarStereotyping, Muslim Stereotyping & Islamophobia by Abid Zafar
Stereotyping, Muslim Stereotyping & Islamophobia by Abid Zafar
Abid Zafar
 
Terrorism causes, effects, and solutions
Terrorism causes, effects, and solutionsTerrorism causes, effects, and solutions
Terrorism causes, effects, and solutions
Srun Sakada
 
Terrorism | Types of Terrorism | Impacts of terrorism
Terrorism | Types of Terrorism | Impacts of terrorism Terrorism | Types of Terrorism | Impacts of terrorism
Terrorism | Types of Terrorism | Impacts of terrorism
Mian Muhammad Zafar
 
September 11th Terrorist Attacks
September 11th Terrorist AttacksSeptember 11th Terrorist Attacks
September 11th Terrorist Attacks
Aaron Carn
 
Satanic Ritual Abuse And The Illuminati Part 1
Satanic Ritual Abuse And The Illuminati Part 1Satanic Ritual Abuse And The Illuminati Part 1
Satanic Ritual Abuse And The Illuminati Part 1
Mormons4justice
 
Global Terrorism Challenges & Response
Global Terrorism Challenges & ResponseGlobal Terrorism Challenges & Response
Global Terrorism Challenges & Response
Shahid Hussain Raja
 
Lecture 1 introduction to terrorism
Lecture 1   introduction to terrorismLecture 1   introduction to terrorism
Lecture 1 introduction to terrorism
James Feldkamp
 
reply to each post 150 words min .1. Explain what psychologic.docx
reply to each post 150 words min .1. Explain what psychologic.docxreply to each post 150 words min .1. Explain what psychologic.docx
reply to each post 150 words min .1. Explain what psychologic.docx
chris293
 
473 2015 up political tolerance competence (1 21-15)
473 2015 up political tolerance competence (1 21-15)473 2015 up political tolerance competence (1 21-15)
473 2015 up political tolerance competence (1 21-15)
mpeffl
 
Terrorism Detailed doc
Terrorism Detailed docTerrorism Detailed doc
Terrorism Detailed doc
Hassan Shahzad
 
Global Terrorism Challenges & Response
Global Terrorism Challenges & ResponseGlobal Terrorism Challenges & Response
Global Terrorism Challenges & Response
Shahid Hussain Raja
 

Similar to Muslims in Machine Learning workshop (NeurlPS 2021) - Automatically Identifying Islamophobia in Social Media (15)

Automatically Identifying Islamophobia in Social Media
Automatically Identifying Islamophobia in Social MediaAutomatically Identifying Islamophobia in Social Media
Automatically Identifying Islamophobia in Social Media
 
EXTREMISM A GENERAL CONCEPT
EXTREMISM A GENERAL CONCEPTEXTREMISM A GENERAL CONCEPT
EXTREMISM A GENERAL CONCEPT
 
diference bitween islam and terrorism
diference bitween islam and terrorismdiference bitween islam and terrorism
diference bitween islam and terrorism
 
Islam and terrorism
Islam and terrorismIslam and terrorism
Islam and terrorism
 
Stereotyping, Muslim Stereotyping & Islamophobia by Abid Zafar
Stereotyping, Muslim Stereotyping & Islamophobia by Abid ZafarStereotyping, Muslim Stereotyping & Islamophobia by Abid Zafar
Stereotyping, Muslim Stereotyping & Islamophobia by Abid Zafar
 
Terrorism causes, effects, and solutions
Terrorism causes, effects, and solutionsTerrorism causes, effects, and solutions
Terrorism causes, effects, and solutions
 
Terrorism | Types of Terrorism | Impacts of terrorism
Terrorism | Types of Terrorism | Impacts of terrorism Terrorism | Types of Terrorism | Impacts of terrorism
Terrorism | Types of Terrorism | Impacts of terrorism
 
September 11th Terrorist Attacks
September 11th Terrorist AttacksSeptember 11th Terrorist Attacks
September 11th Terrorist Attacks
 
Satanic Ritual Abuse And The Illuminati Part 1
Satanic Ritual Abuse And The Illuminati Part 1Satanic Ritual Abuse And The Illuminati Part 1
Satanic Ritual Abuse And The Illuminati Part 1
 
Global Terrorism Challenges & Response
Global Terrorism Challenges & ResponseGlobal Terrorism Challenges & Response
Global Terrorism Challenges & Response
 
Lecture 1 introduction to terrorism
Lecture 1   introduction to terrorismLecture 1   introduction to terrorism
Lecture 1 introduction to terrorism
 
reply to each post 150 words min .1. Explain what psychologic.docx
reply to each post 150 words min .1. Explain what psychologic.docxreply to each post 150 words min .1. Explain what psychologic.docx
reply to each post 150 words min .1. Explain what psychologic.docx
 
473 2015 up political tolerance competence (1 21-15)
473 2015 up political tolerance competence (1 21-15)473 2015 up political tolerance competence (1 21-15)
473 2015 up political tolerance competence (1 21-15)
 
Terrorism Detailed doc
Terrorism Detailed docTerrorism Detailed doc
Terrorism Detailed doc
 
Global Terrorism Challenges & Response
Global Terrorism Challenges & ResponseGlobal Terrorism Challenges & Response
Global Terrorism Challenges & Response
 

More from University of Minnesota, Duluth

What Makes Hate Speech : an interactive workshop
What Makes Hate Speech : an interactive workshopWhat Makes Hate Speech : an interactive workshop
What Makes Hate Speech : an interactive workshop
University of Minnesota, Duluth
 
Algorithmic Bias - What is it? Why should we care? What can we do about it?
Algorithmic Bias - What is it? Why should we care? What can we do about it? Algorithmic Bias - What is it? Why should we care? What can we do about it?
Algorithmic Bias - What is it? Why should we care? What can we do about it?
University of Minnesota, Duluth
 
Algorithmic Bias : What is it? Why should we care? What can we do about it?
Algorithmic Bias : What is it? Why should we care? What can we do about it?Algorithmic Bias : What is it? Why should we care? What can we do about it?
Algorithmic Bias : What is it? Why should we care? What can we do about it?
University of Minnesota, Duluth
 
Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection
Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection
Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection
University of Minnesota, Duluth
 
Who's to say what's funny? A computer using Language Models and Deep Learning...
Who's to say what's funny? A computer using Language Models and Deep Learning...Who's to say what's funny? A computer using Language Models and Deep Learning...
Who's to say what's funny? A computer using Language Models and Deep Learning...
University of Minnesota, Duluth
 
Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...
Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...
Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...
University of Minnesota, Duluth
 
Puns upon a midnight dreary, lexical semantics for the weak and weary
Puns upon a midnight dreary, lexical semantics for the weak and wearyPuns upon a midnight dreary, lexical semantics for the weak and weary
Puns upon a midnight dreary, lexical semantics for the weak and weary
University of Minnesota, Duluth
 
The horizon isn't found in a dictionary : Identifying emerging word senses a...
The horizon isn't found in a  dictionary : Identifying emerging word senses a...The horizon isn't found in a  dictionary : Identifying emerging word senses a...
The horizon isn't found in a dictionary : Identifying emerging word senses a...
University of Minnesota, Duluth
 
Screening Twitter Users for Depression and PTSD
Screening Twitter Users for Depression and PTSDScreening Twitter Users for Depression and PTSD
Screening Twitter Users for Depression and PTSD
University of Minnesota, Duluth
 
Duluth : Word Sense Discrimination in the Service of Lexicography
Duluth : Word Sense Discrimination in the Service of LexicographyDuluth : Word Sense Discrimination in the Service of Lexicography
Duluth : Word Sense Discrimination in the Service of Lexicography
University of Minnesota, Duluth
 
Pedersen masters-thesis-oct-10-2014
Pedersen masters-thesis-oct-10-2014Pedersen masters-thesis-oct-10-2014
Pedersen masters-thesis-oct-10-2014
University of Minnesota, Duluth
 
MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...
MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...
MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...
University of Minnesota, Duluth
 
What it's like to do a Master's thesis with me (Ted Pedersen)
What it's like to do a Master's thesis with me (Ted Pedersen)What it's like to do a Master's thesis with me (Ted Pedersen)
What it's like to do a Master's thesis with me (Ted Pedersen)
University of Minnesota, Duluth
 
Pedersen naacl-2013-demo-poster-may25
Pedersen naacl-2013-demo-poster-may25Pedersen naacl-2013-demo-poster-may25
Pedersen naacl-2013-demo-poster-may25
University of Minnesota, Duluth
 
Pedersen semeval-2013-poster-may24
Pedersen semeval-2013-poster-may24Pedersen semeval-2013-poster-may24
Pedersen semeval-2013-poster-may24
University of Minnesota, Duluth
 
Talk at UAB, April 12, 2013
Talk at UAB, April 12, 2013Talk at UAB, April 12, 2013
Talk at UAB, April 12, 2013
University of Minnesota, Duluth
 
Feb20 mayo-webinar-21feb2012
Feb20 mayo-webinar-21feb2012Feb20 mayo-webinar-21feb2012
Feb20 mayo-webinar-21feb2012
University of Minnesota, Duluth
 
Ihi2012 semantic-similarity-tutorial-part1
Ihi2012 semantic-similarity-tutorial-part1Ihi2012 semantic-similarity-tutorial-part1
Ihi2012 semantic-similarity-tutorial-part1
University of Minnesota, Duluth
 
Pedersen ACL Disco-2011 workshop
Pedersen ACL Disco-2011 workshopPedersen ACL Disco-2011 workshop
Pedersen ACL Disco-2011 workshop
University of Minnesota, Duluth
 
Pedersen acl2011-business-meeting
Pedersen acl2011-business-meetingPedersen acl2011-business-meeting
Pedersen acl2011-business-meeting
University of Minnesota, Duluth
 

More from University of Minnesota, Duluth (20)

What Makes Hate Speech : an interactive workshop
What Makes Hate Speech : an interactive workshopWhat Makes Hate Speech : an interactive workshop
What Makes Hate Speech : an interactive workshop
 
Algorithmic Bias - What is it? Why should we care? What can we do about it?
Algorithmic Bias - What is it? Why should we care? What can we do about it? Algorithmic Bias - What is it? Why should we care? What can we do about it?
Algorithmic Bias - What is it? Why should we care? What can we do about it?
 
Algorithmic Bias : What is it? Why should we care? What can we do about it?
Algorithmic Bias : What is it? Why should we care? What can we do about it?Algorithmic Bias : What is it? Why should we care? What can we do about it?
Algorithmic Bias : What is it? Why should we care? What can we do about it?
 
Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection
Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection
Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection
 
Who's to say what's funny? A computer using Language Models and Deep Learning...
Who's to say what's funny? A computer using Language Models and Deep Learning...Who's to say what's funny? A computer using Language Models and Deep Learning...
Who's to say what's funny? A computer using Language Models and Deep Learning...
 
Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...
Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...
Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...
 
Puns upon a midnight dreary, lexical semantics for the weak and weary
Puns upon a midnight dreary, lexical semantics for the weak and wearyPuns upon a midnight dreary, lexical semantics for the weak and weary
Puns upon a midnight dreary, lexical semantics for the weak and weary
 
The horizon isn't found in a dictionary : Identifying emerging word senses a...
The horizon isn't found in a  dictionary : Identifying emerging word senses a...The horizon isn't found in a  dictionary : Identifying emerging word senses a...
The horizon isn't found in a dictionary : Identifying emerging word senses a...
 
Screening Twitter Users for Depression and PTSD
Screening Twitter Users for Depression and PTSDScreening Twitter Users for Depression and PTSD
Screening Twitter Users for Depression and PTSD
 
Duluth : Word Sense Discrimination in the Service of Lexicography
Duluth : Word Sense Discrimination in the Service of LexicographyDuluth : Word Sense Discrimination in the Service of Lexicography
Duluth : Word Sense Discrimination in the Service of Lexicography
 
Pedersen masters-thesis-oct-10-2014
Pedersen masters-thesis-oct-10-2014Pedersen masters-thesis-oct-10-2014
Pedersen masters-thesis-oct-10-2014
 
MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...
MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...
MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...
 
What it's like to do a Master's thesis with me (Ted Pedersen)
What it's like to do a Master's thesis with me (Ted Pedersen)What it's like to do a Master's thesis with me (Ted Pedersen)
What it's like to do a Master's thesis with me (Ted Pedersen)
 
Pedersen naacl-2013-demo-poster-may25
Pedersen naacl-2013-demo-poster-may25Pedersen naacl-2013-demo-poster-may25
Pedersen naacl-2013-demo-poster-may25
 
Pedersen semeval-2013-poster-may24
Pedersen semeval-2013-poster-may24Pedersen semeval-2013-poster-may24
Pedersen semeval-2013-poster-may24
 
Talk at UAB, April 12, 2013
Talk at UAB, April 12, 2013Talk at UAB, April 12, 2013
Talk at UAB, April 12, 2013
 
Feb20 mayo-webinar-21feb2012
Feb20 mayo-webinar-21feb2012Feb20 mayo-webinar-21feb2012
Feb20 mayo-webinar-21feb2012
 
Ihi2012 semantic-similarity-tutorial-part1
Ihi2012 semantic-similarity-tutorial-part1Ihi2012 semantic-similarity-tutorial-part1
Ihi2012 semantic-similarity-tutorial-part1
 
Pedersen ACL Disco-2011 workshop
Pedersen ACL Disco-2011 workshopPedersen ACL Disco-2011 workshop
Pedersen ACL Disco-2011 workshop
 
Pedersen acl2011-business-meeting
Pedersen acl2011-business-meetingPedersen acl2011-business-meeting
Pedersen acl2011-business-meeting
 

Recently uploaded

Factors affecting undergraduate students’ motivation at a university in Tra Vinh
Factors affecting undergraduate students’ motivation at a university in Tra VinhFactors affecting undergraduate students’ motivation at a university in Tra Vinh
Factors affecting undergraduate students’ motivation at a university in Tra Vinh
AJHSSR Journal
 
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
7lkkjxt
 
UR BHATTI ACADEMY AND ONLINE COURSES.pdf
UR BHATTI ACADEMY AND ONLINE COURSES.pdfUR BHATTI ACADEMY AND ONLINE COURSES.pdf
UR BHATTI ACADEMY AND ONLINE COURSES.pdf
urbhattiacademy
 
HMS Facebook Stories All V1 06092024.docx
HMS Facebook Stories All V1 06092024.docxHMS Facebook Stories All V1 06092024.docx
HMS Facebook Stories All V1 06092024.docx
Charles Bayless
 
一比一原版(AU毕业证)英国阿伯丁大学毕业证如何办理
一比一原版(AU毕业证)英国阿伯丁大学毕业证如何办理一比一原版(AU毕业证)英国阿伯丁大学毕业证如何办理
一比一原版(AU毕业证)英国阿伯丁大学毕业证如何办理
anubug
 
Maximize Your Twitch Potential!..........
Maximize Your Twitch Potential!..........Maximize Your Twitch Potential!..........
Maximize Your Twitch Potential!..........
SocioCosmos
 
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISMSTUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
AJHSSR Journal
 
The Impact of Work Stress and Digital Literacy on Employee Performance at PT ...
The Impact of Work Stress and Digital Literacy on Employee Performance at PT ...The Impact of Work Stress and Digital Literacy on Employee Performance at PT ...
The Impact of Work Stress and Digital Literacy on Employee Performance at PT ...
AJHSSR Journal
 
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
9u4xjk4w
 
Dominate Reddit Discussions.............
Dominate Reddit Discussions.............Dominate Reddit Discussions.............
Dominate Reddit Discussions.............
SocioCosmos
 

Recently uploaded (10)

Factors affecting undergraduate students’ motivation at a university in Tra Vinh
Factors affecting undergraduate students’ motivation at a university in Tra VinhFactors affecting undergraduate students’ motivation at a university in Tra Vinh
Factors affecting undergraduate students’ motivation at a university in Tra Vinh
 
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
 
UR BHATTI ACADEMY AND ONLINE COURSES.pdf
UR BHATTI ACADEMY AND ONLINE COURSES.pdfUR BHATTI ACADEMY AND ONLINE COURSES.pdf
UR BHATTI ACADEMY AND ONLINE COURSES.pdf
 
HMS Facebook Stories All V1 06092024.docx
HMS Facebook Stories All V1 06092024.docxHMS Facebook Stories All V1 06092024.docx
HMS Facebook Stories All V1 06092024.docx
 
一比一原版(AU毕业证)英国阿伯丁大学毕业证如何办理
一比一原版(AU毕业证)英国阿伯丁大学毕业证如何办理一比一原版(AU毕业证)英国阿伯丁大学毕业证如何办理
一比一原版(AU毕业证)英国阿伯丁大学毕业证如何办理
 
Maximize Your Twitch Potential!..........
Maximize Your Twitch Potential!..........Maximize Your Twitch Potential!..........
Maximize Your Twitch Potential!..........
 
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISMSTUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
 
The Impact of Work Stress and Digital Literacy on Employee Performance at PT ...
The Impact of Work Stress and Digital Literacy on Employee Performance at PT ...The Impact of Work Stress and Digital Literacy on Employee Performance at PT ...
The Impact of Work Stress and Digital Literacy on Employee Performance at PT ...
 
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
 
Dominate Reddit Discussions.............
Dominate Reddit Discussions.............Dominate Reddit Discussions.............
Dominate Reddit Discussions.............
 

Muslims in Machine Learning workshop (NeurlPS 2021) - Automatically Identifying Islamophobia in Social Media

  • 1. Automatically Identifying Islamophobia in Social Media Ted Pedersen Department of Computer Science University of Minnesota, Duluth tpederse@umn.edu @SeeTedTalk http://www.d.umn.edu/~tpederse
  • 2. Today’s Agenda Islamophobia in general and in Minnesota Collecting & Annotating Twitter Data What We’ve Learned from Annotating The Way Forward
  • 3. Islamophobia A legacy of colonial histories, particularly those that view the Muslim world as exotic, savage, dangerous - all leading to a “Clash of Civlilizations.” Orientalism (Said, 1978)
  • 4. Islamophobia A recent term for an older phenomena Runnymede Trust (1997, 2017) Unfounded hostility towards Islam Practical consequences of such hostility in unfair discrimination against Muslim individuals and communities Exclusion of Muslims from mainstream political and social affairs.
  • 5. Common Anti-Muslim Tropes (Bridge Institute) Islam and Muslims are inherently violent. Islam and Muslims are oppresive to women. Islam and Muslims are intolerant towards other religions. Islam is a political ideology, not a religion. In the West, Mulims are using non-violent stealth jihad to implement Sharia Law. Islam is foreign, medieval, and at odds with Western modernity. Islam is a monolith. All Muslims are Arab or Brown.
  • 7. Meanwhile, back home in Minnesota … New York Times, June 20, 2019
  • 10.
  • 11. Goal : Identify Islamophobia in Social Media Text Why? Relatively understudied in NLP. Highly intersectional problem since Muslim identity is multi-faceted. Significant influence on events in the World, the USA, and Minnesota. How? Use ideas from NLP, especially Hate Speech Detection. Create annotated corpora in order to understand problem better, and then apply Machine Learning or Deep Learning.
  • 12. Guiding Principles This is not Just Another Classification Task. Seek out domain expertise, build relationships, don’t reduce the problem to a data set. Frey et al. (2018). Artificial Intelligence and Inclusion: Formerly Gang-Involved Youth as Domain Experts for Analyzing Unstructured Twitter Data. Social Science Computer Review. Creating annotated data is likely necessary. Be careful to fully document the decisions made along the way, paying special attention to annotator background. Bender and Friedman (2018) Data Statements for NLP : Toward Mitigating Systems Bias and Enabling Better Science. TACL.
  • 13. How Can We Detect Islamophobia (with NLP)? Carry out a Qualitative Analysis of text with input from domain experts. Collect and annotate Tweets. Seek out a diverse pool of annotators. Develop annotation scheme / code book. Be Iterative. Carry out Quantitative Analysis using Machine Learning or Deep Learning.
  • 14. Data Collection Islamophobia is global, but has many local variations each with their own issues, terminology, and ways of being expressed. This suggests the need for the data to have a regional focus - Islamophobia in the UK, France, India, the USA, Minnesota, etc. While she is a national figure, Ilhan Omar is from Minnesota, and our data collection starts with her. Muslim, but also a black Somali woman who was an immigrant / refugee. Highly intersectional identity.
  • 15. Tweet Collection (using Twitter public API) Collecting since April 2019, any tweet that includes one or more of : ‘Ilhan omar’, ilhan, omar, @ilhanmn, ilhanmn, #ilhanmn, #ilhanomar, #ilhan Pilot Annotation based on April 2019 - April 2020, approx 5 million total tweets. 1020 Annotation based on Nov. 2019 - Oct. 2020, approx 10 million total tweets. Twitter public API does not give you all tweets, downsamples.
  • 16. 1020 Annotation (October 2020) 9.6 million tweets (incl. RT) collected Nov 2019 - Oct 2020. 1 million unique tweets. Selected random samples of 384 tweets for annotation. Agreement improved with more extensive set of labels. Began to consider profile descriptions of “speakers” (Tweeters).
  • 17. 1020 Annotation Labels Neutral - apolitical or about someone other than Ilhan Omar Support - expresses support for position or person of Ilhan Omar Political - expression of political difference of opinion with Ilhan Omar Insult - personal insult directed at Ilhan Omar not related to other labels Immigration - Ilhan Omar has committed fraud to remain in USA Terrorist - Ilhan Omar is a terrorist or supports them Loyalty - Ilhan Omar is unAmerican, disployal, or a traitor Jail - Ilhan Omar should be prosecuted, convicted, or incarcerated Sharia - Ilhan Omar wants to replace US law with Sharia Law Adultery - Ilhan Omar is an adulterer or married to her brother
  • 22. Most frequent 2 grams in (re)Tweeter profiles #maga #kag (25416), trump supporter (18969), trump 2020 (14241), president trump (13951), husband father (12562), pro life (11502), happily married (10383), god family (9690), proud american (9281), god bless (9100), wife mother (8487), lives matter (7699), love god (7609), wife mom (6833), #maga #trump2020 (6799), maga kag (6195), jesus christ (6187), christian conservative (6103), #kag #trump2020 (6096), family country (5749), business owner (5733), american patriot (5055), bless america (4916), common sense (4672), #trump2020 #maga (4478), black lives (4230), truth seeker (4138), conservative christian (4132), father husband (3991), donald trump (3931), constitutional conservative (3908), united states (3884), 2nd amendment (3841), mother grandmother (3811), america great (3801), #maga #wwg1wga (3725), army veteran (3486), human rights (3419), dog lover (3414), #wwg1wga #maga (3112), free speech (3044)
  • 23. 1 grams (muslim,islam,quran) in all 1020 Tweets muslim (14,791), muslims (4,849), islamic (3,827), islam (3,302,), islamist (1650), islamophobia (607), islamists (600), quran (580), islamophobic (553), congressmuslim (435)
  • 24. 2 grams a muslim (2,446), muslim brotherhood (1,631), the muslim (1,440), islamic terrorist (594), anti muslim (591), radical muslim (512), muslims in (483), muslim woman (477), radical islamic (427), islam is (412)
  • 25. 3 grams the muslim brotherhood (624), is a muslim (488), congressmuslim ilhan omar (376), as a muslim (285), a muslim american (217), muslim ilhan omar (197), muslim american trump (195), of the muslim (192), a radical muslim (181), muslim anti immigrant (181)
  • 26. 4 grams as a muslim american (198),a muslim american trump (195),muslim american trump admirer (191),ahmed as a muslim (183),muslim anti immigrant anti (175),she is a muslim (171),somali congressmuslim ilhan omar (166),omar is a muslim (151),muslim brotherhood ilhan omar (136),muslim refugee dalia al (119)
  • 27. as a muslim american trump (193), a muslim american trump admirer (191), muslim american trump admirer i (187), ahmed as a muslim american (182), muslim anti immigrant anti black (152), qanta ahmed as a muslim (144), icg obama isis soros muslim (117), obama isis soros muslim brotherhood (117), isis soros muslim brotherhood ilhan (116),omar and the progressive islamist (115) 5 grams
  • 28. Hashtags (muslim, islam) in all Tweets #muslimbrotherhood (189), #muslim (155), #islam (127), #banislam (108), #muslims (82), #islamic (72), #islamophobia (69), #islamist (39), #islamophobic (20), #muslims4justice (17), #stopislam (14), #banmuslimbrotherhoods (12), #islamicstate (11), #islamophobe (11), #islamofascist (11), #islamistheproblem (10), #islamists (10),#muslimbrotherhood (10), #islamicterrorist (10), #banthemuslimbrothethood (10), #radicalislamicterrorist (9), #muslimban (9), #islamicterrorism (9), #banmuslimbrotherhood (8), #banmuslims (8), #muslimprivilege (8), #scumoftheearthfilthymuslimdemonrats (7), #islamicrepublicvirus (7), #dirtyfilthymuslimdemonrats (7), #muslimban (6), #banmuslimsfromamerica (6), #banmuslimimmigration (6)
  • 29. 1020 Annotation Labels Neutral - apolitical or about someone other than Ilhan Omar Support - expresses support for position or person of Ilhan Omar Political - expression of political difference of opinion with Ilhan Omar Insult - personal insult directed at Ilhan Omar not related to other labels Immigration - Ilhan Omar has committed fraud to remain in USA Terrorist - Ilhan Omar is a terrorist or supports them Loyalty - Ilhan Omar is unAmerican, disployal, or a traitor Jail - Ilhan Omar should be prosecuted, convicted, or incarcerated Sharia - Ilhan Omar wants to replace US law with Sharia Law Adultery - Ilhan Omar is an adulterer or married to her brother
  • 30.
  • 31. Lessons Learned Impact of “lock her up” and “send her back” rhetoric clearly seen in annotation. Annotation labels must be nuanced, can’t simply label as Islamophobic or not since content may be based on gender, race, immigration or marital status, political beliefs in addition to or instead of religion. A highly visible or politicized personality attracts a lot of repetitive and viral content based on most recent accusation or conspiracy. Profile descriptions are an important clues.
  • 32. Current Questions Which public events are correlated with online Islamophobia? What is the impact of Tweeter location and profile description? How are less prominent public figures who are Muslim targeted? Are political figures who are known to be Christian, Jewish, Hindu, and other religions targeted to greater or lesser extents? Can crowdsourcing be effective for more nuanced annotation problems?
  • 33. Automatically Identifying Islamophobia in Social Media Ted Pedersen Department of Computer Science University of Minnesota, Duluth tpederse@umn.edu @SeeTedTalk http://www.d.umn.edu/~tpederse