Comparative studies on detecting abusive language on twitter

ABUSIVE LANGUAGE
DETECTION—
COMPARATIVE STUDY AND
ITS APPLICATIONS
YOUNGHUN LEE

/ 48
ABUSIVE LANGUAGE DETECTION—COMPARATIVE STUDY AND ITS APPLICATIONS
BEFORE WE BEGIN..
▸ B.S. in Electrical and Computer Engineering
at Seoul National University
▸ M.S. in Information at the University of
Michigan
▸ (Current) Research Intern at Machine
Intelligence Lab @ Seoul National University
▸ (Current) Ph.D. applicant in Machine
Learning and Natural Language Processing
2

/ 48
WHAT THIS PRESENTATION IS COVERING
I. Introduction to Abusive Language Detection
▸ What is it? Why is it important to study?
▸ Recent studies
II. About the paper, “Comparative Studies of Detecting Abusive Language on Twitter”
▸ Research idea and its novelty
▸ Empirical results and ﬁndings
III. Future relevance with Naver
▸ Research possibilities and their challenges
3

/ 48
ACKNOWLEDGEMENTS
▸ Zeerak Waseem
▸ Ph.D. Candidate at the University of Shefﬁeld
▸ Organizer of the Abusive Language Online Workshop
4

I. INTRODUCTION TO
ABUSIVE LANGUAGE DETECTION

/ 48
I. INTRODUCTION TO ABUSIVE LANGUAGE DETECTION
▸ Varying deﬁnition and terminology
6
EARLY DEFINITION
Messages that most users consider to
be annoying or upsetting1

/ 48
7
RECENT
Messages that under-rate a person or a
group on the basis of characteristics
(race, ethnicity, gender, religion, etc.)2
EARLY DEFINITION

/ 48
8
EXAMPLE
1) Go fucking kill yourself useless
scumbag
2) Hell yeah! Go bitches!
3) Jews are lower class pigs
EARLY DEFINITION
RECENT

/ 48
9
EXAMPLE
scumbag
EARLY DEFINITION
RECENT

/ 48
10
EXAMPLE
scumbag
EARLY DEFINITION
RECENT

/ 48
11
TERMINOLOGY
abusive language
hate speech
offensive language
cyberbullying
hostile ﬂames
vulgar language
insults, profanity
…
EARLY DEFINITION
RECENT

/ 48
12
TERMINOLOGY
abusive language
hate speech
offensive language
cyberbullying
hostile ﬂames
vulgar language
insults, profanity
…
EARLY DEFINITION
RECENT
PRO)
Enables considering
diverse situations
CON)
Confusing deﬁnition
-> hard to annotate

/ 48
▸ Why is it important these days?
13
1) Increasing usage of social media3

/ 48
14
1) Increasing usage of social media
2) Social media signiﬁcantly affects
current day society

/ 48
15
4

/ 48
16
5
current day society
3) More and more users have been
experiencing online harassment

/ 48
17
current day society
3) More and more users have been
experiencing online harassment
4) Yet, major social media companies
fail to successfully resolve the issueArtist stencils hate speech tweets
outside Twitter HQ to highlight failure
to deal with offensive messages

/ 48
▸ Characteristics of Abusive Language Online
▸ Abusive messages might be inﬂuenced by…6
18
- its discourse context
- its co-occurring media (images, videos)
- world events
- identity of the author and target

/ 48
▸ Characteristics of Abusive Language Online
▸ Abusive messages might be inﬂuenced by…6
▸ What is it different from other advanced text classiﬁcation tasks such as
sarcasm detection and fake news detection?
19
- its discourse context
- its co-occurring media (images, videos)
- world events
- identity of the author and target
- CONTEXT is its core (e.g. ‘nigger’, ‘bitch’)

/ 48
▸ Recent studies
▸ Data crawling and annotation
▸ Feature-engineering models for accurate classiﬁcation
▸ Using meta-information such as age, gender, location as additional features
▸ Applying deep models
▸ Concerning fairness issues
▸ Resolving unintentional bias issue (Woman, Jew being abusive?)
20

/ 48
▸ Future focus
▸ Creating a gold-standard test set
▸ More reliable annotations
▸ Getting more context information
▸ Multi-modal abusive detection
21

/ 48
▸ Future focus
▸ Creating a gold-standard test set
▸ More reliable annotations
▸ Getting more context information
▸ Multi-modal abusive detection
22

II. ABOUT THE PAPER,
“COMPARATIVE STUDIES OF DETECTING
ABUSIVE LANGUAGE ON TWITTER”

/ 48
II. COMPARATIVE STUDIES OF DETECTING ABUSIVE LANGUAGE ON TWITTER
▸ Brief overview of the paper
Comparing the accuracy (F1 scores) of different machine learning models and
different features in detecting abusive language on a recently released Twitter
dataset
24

/ 48
▸ About the dataset, “Hate and Abusive Speech on Twitter” 7
▸ Approximately 100K labeled tweets (other datasets: 10K to 35K)
▸ Reduced overlapping labels by calculating correlation coefﬁcients
▸ E.g. ‘abusive’, ‘offensive’, ‘aggressive’, ‘cyberbullying’ ‘abusive’
▸ Multi-label dataset (4 labels: abusive, hateful, spam, none)
25
Label distribution of crawled tweets

/ 48
▸ Implemented models and features
26
• Naïve Bayes

• Logistic Regression

• Support Vector Machine

• Random Forests

• Gradient Boosted Trees

• Word-level / Char-level
FEATURE-ENGINEERING ML NEURAL NETWORKS
• Conv Neural Network

• Recurrent Neural Network


• Hybrid CNN

• Self-matching attention RNN

• Latent Topic Clustering

• Context Tweets

/ 48
27
NEURAL NETWORKS



• Hybrid CNN


• Context Tweets
Architecture of Hybrid CNN8

/ 48
28
NEURAL NETWORKS



• Hybrid CNN


• Context Tweets
Gated self-matching attention networks (Wang et al.,)9

/ 48
29
NEURAL NETWORKS



• Hybrid CNN


• Context Tweets
Hierarchical Recurrent Dual Encoder with
Latent Topic Clustering module10

/ 48
30
NEURAL NETWORKS



• Hybrid CNN



• Context Tweets
WHAT ARE CONTEXT TWEETS?
‣ Looking at the tweet one has
replied to or has quoted provides
signiﬁcant contextual information
‣ Assumption: ML models beneﬁt
from taking context tweets into
account in detecting abusive
language.

/ 48
31
NEURAL NETWORKS



• Hybrid CNN



• Context Tweets
SAMPLE CONTEXT TWEET
Who the HELL is “LIKE” ING this post?
Sick people….
@user_B LABELED TWEET

/ 48
32
NEURAL NETWORKS



• Hybrid CNN



• Context Tweets
SAMPLE CONTEXT TWEET
Survivors of #Syria Gas Attack Recount
‘a Cruel Scene’.
@user_A
Who the HELL is “LIKE” ING this post?
Sick people….
@user_B
CONTEXT TWEET
LABELED TWEET

/ 48
33
NEURAL NETWORKS



• Hybrid CNN



• Context Tweets
HOW TO INTEGRATE CONTEXT TWEETS
LABELED
TWEET
CONTEXT
TWEET
RNN
CNN
max_pooled
max_pooled
last_hidden
last_hidden
σ
σ

/ 48
34
Experimental results of learning models and their variants, followed by the context tweet models.
The top 2 scores are marked as bold for each metric.
‣ Neural network
models with word-
level features are
accurate in general
compared to feature-
engineering models

/ 48
35
‣ Neural network
models with word-
level features are
accurate in general
engineering models
‣ Char-level features
beneﬁt feature-
engineering ML
models while cripple
neural models

/ 48
36
‣ Neural network
models with word-
level features are
accurate in general
engineering models
‣ Char-level features
beneﬁt feature-
engineering ML
models while cripple
neural models
‣ Context data doesn’t
improve the overall
accuracy, however, it
is more effective in
classifying lower-
distributed labels

/ 48
37
▸ Discussion
BASELINE CONTEXT
p_normal

.1614

p_abusive
.5958
p_normal

.3709

p_abusive

.2582
LABEL: abusive
8 dudes added me in a group chat
telling me I should kill myself,
lmfaooo I think I reached a whole
new level of twitter.
@user_A
They hatin cause you a damn
queen who doesn’t answer to
anybody. Let em be mad. Keep
rolling mamas.
@user_B
‣ Only baseline model
was correct, but is
‘abusive’ label for
this tweet accurate?

/ 48
38
▸ Discussion
BASELINE CONTEXT
LABEL: normal
Oklahoma home invasion shooting:
No charges against man who killed
3 intruders.
@user_C
You have every right to protect your
family and home from worthless
thieves.
@user_D
‣ Context tweets gave
more information to
accurately classify
the tweet
p_normal

.3287

p_abusive

.3718
p_normal
.8104
p_abusive

.0962

/ 48
39
▸ Discussion
▸ How to efficiently train the classifier for imbalanced datasets?
▸ How can we better incorporate context tweets into the original data? How to
handle ‘no_context’?
▸ Can we annotate the dataset regarding context data? Specifically, can we
make annotators consider the content of its context tweet when deciding
abusiveness of the tweet?
▸ What are other applications that could benefit from using context data?

III. FUTURE RELEVANCE
WITH NAVER

/ 48
III. FUTURE RELEVANCE WITH NAVER
▸ Hate Speech in Korea
▸ 남성혐오 / 여성혐오 (워마드, 일베저장소)
▸ 극단적 진보 / 보수 정치 성향
▸ 사회적 문제로 확장
41

/ 48
▸ Possible Contribution
▸ DATASET!!!
42

/ 48
43

/ 48
44

/ 48
45
RICH TEXTUAL & CONTEXTUAL
INFORMATION FROM ONE NEWS ARTICLE
1) Article, title, and its topic
2) Comments and comments of comments
2) Emotions on the article
3) Number of comments, demographic
distribution of commenters
4) Upvotes and downvotes of comments
5) User meta-information

/ 48
▸ Possible applications
▸ Detecting abusive language in news article comments
▸ Enabling personalized settings for not encountering toxic messages
▸ CLOVA: read comments of news articles that are not abusive
▸ Challenges
▸ Korean text: not extensively studied compared to English
▸ Lack of user meta-information
46

/ 48
REFERENCE
1. Ellen Spertus. 1997. Smokey: Automatic recognition of hostile messages. In AAAI/IAAI, pages 1058– 1065.
2. John T. Nockleby. 2000. Hate Speech. In Leonard W. Levy, Kenneth L. Karst, and Dennis J. Mahoney, editors, Encyclopedia of
the American Constitution, pages 1277–1279. Macmillan, 2nd edition.
3. Aaron Smith and Monica Anderson. 2018. Social Media Use in 2018. Pew Research Center; accessed 8-December-2018.
4. Monica Anderson and Skye Toor. How social media users have discussed sexual harassment since #MeToo went viral. Pew
Research Center; accessed 8-December-2018.
5. Maeve Duggan. 2017. Online harassment 2017. Pew Research Center; accessed 8-December-2018.
6. Anna Schmidt and Michael Wiegand. 2017. A survey on hate speech detection using natural language processing. In
Proceedings of the Fifth International Workshop on Natural Language Processing for So- cial Media, pages 1–10.
7. Antigoni Founta, Constantinos Djouvas, Despoina Chatzakou, Ilias Leontiadis, Jeremy Blackburn, Gianluca Stringhini, Athena
Vakali, Michael Sirivianos, and Nicolas Kourtellis. 2018. Large scale crowdsourcing and characterization of twitter abusive
behavior. In Proceedings of the International AAAI Conference on Web and Social Media.
8. Ji Ho Park and Pascale Fung. 2017. One-step and two- step classiﬁcation for abusive language detection on twitter. In
Proceedings of the First Workshop on Abusive Language Online, pages 41–45.
9. Wenhui Wang, Nan Yang, Furu Wei, Baobao Chang, and Ming Zhou. 2017. Gated self-matching networks for reading
comprehension and question answering. In Proceedings of the 55th Annual Meeting of the Association for Computational
Linguistics, volume 1, pages 189–198.
10.Seunghyun Yoon, Joongbo Shin, and Kyomin Jung. 2018. Learning to rank question-answer pairs using hierarchical recurrent
encoder with latent topic clustering. In Proceedings of the 2018 Conference of the North American Chapter of the Association
for Computational Linguistics: Human Language Technologies, volume 1, pages 1575–1584.
47

Comparative studies on detecting abusive language on twitter

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Comparative studies on detecting abusive language on twitter

Similar to Comparative studies on detecting abusive language on twitter (20)

More from NAVER Engineering

More from NAVER Engineering (20)

Recently uploaded

Recently uploaded (20)

Comparative studies on detecting abusive language on twitter