1. DEEP LEARNING FOR
DETECTING
CYBERBULLYING ACROSS
MULTIPLE SOCIAL MEDIA
PLATFORMS
Sweta Agrawal, Amit Awekar
Indian Institute of Technology,
Guwahati
19 Jan 2018
Kamel Ben Kmala, 10/12/2018
3. INTRODUCTION
•What is cyberbullying?
•Between 10% and 40% of internet users
are victims of cyberbullying
• Definition of what constitutes
cyberbullying is quite subjective
4. Past Works VS This Work
•Past works target only one social media platform; this work targets three different types of social networks (Formspring, Twitter, Wikipedia).
•Past works cover only one topic of cyberbullying; this work covers three topics (personal attack, racism, and sexism).
•Past works rely on handcrafted features; this work requires no feature engineering.
11. Conclusion
DNN models can detect cyberbullying on various topics across multiple SMPs, as demonstrated with three datasets and four DNN models.
Cyberbullying detection models can be further improved to take a variety of actions depending on the perceived seriousness of the posts.
Cyberbullying has been defined by the National Crime Prevention Council as the use of the Internet, cell phones or other devices to send or post text or images intended to hurt or embarrass another person.
Detection of cyberbullying in social media is a challenging task. The definition of what constitutes cyberbullying is quite subjective. For example, frequent use of swear words might be considered bullying by the general population. However, on teen-oriented social media platforms such as Formspring, this does not necessarily indicate bullying.
This work avoids any feature engineering by developing deep learning based models combined with transfer learning.
The Twitter dataset contains examples of racism and sexism, and the Wikipedia dataset contains examples of personal attack. The Formspring dataset, however, is not specific to any single topic.
……………………………………………………………………………………………
- Formspring: the dataset includes 12K annotated question-and-answer pairs. Among these pairs, 825 were labeled as containing cyberbullying.
- Twitter: the dataset includes 16K annotated tweets. 3117 are labeled as sexist, 1937 as racist, and the remaining are marked as neither sexist nor racist.
- Wikipedia: the dataset includes over 100K labeled discussion comments from English Wikipedia's talk pages. In total, 13590 comments are labeled as personal attack.
Four DNN based models were evaluated for cyberbullying detection: CNN, LSTM, BLSTM, and BLSTM with attention.
These models are listed in increasing order of the complexity of their neural architecture and the amount of information they use.
-CNNs have recently been used for sentence classification tasks such as sentiment classification.
-Long Short Term Memory networks are a special kind of RNN, capable of learning long-term dependencies.
-Bidirectional LSTMs further increase the amount of input information available to the network by encoding information in both forward and backward direction.
-Attention mechanisms allow for a more direct dependence between the states of the model at different points in time.
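The attention idea above can be sketched in numpy: score each timestep's hidden state, normalize the scores with a softmax, and pool the states into a single weighted sum. This is an illustrative toy (the function name `attention_pool` and the scoring vector `w` are assumptions, not the paper's exact formulation):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))
    return e / e.sum()

def attention_pool(hidden_states, w):
    """Weighted sum of BLSTM hidden states.

    hidden_states: (timesteps, dim) matrix of per-word states.
    w: (dim,) scoring vector (in a real model this is learned).
    """
    scores = hidden_states @ w        # one relevance score per timestep
    alphas = softmax(scores)          # normalize into attention weights
    context = alphas @ hidden_states  # (dim,) sentence representation
    return context, alphas

# Toy example: 5 words, 4-dimensional hidden states.
rng = np.random.default_rng(0)
h = rng.normal(size=(5, 4))
w = rng.normal(size=4)
context, alphas = attention_pool(h, w)
```

The attention weights sum to one, so the pooled context vector is a convex combination of the per-word states, letting the classifier focus on the most bullying-indicative words.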
This is the general architecture used across all four models. The models differ only in the neural architecture layer; the remaining layers are identical.
……………………………………………
-The embedding layer processes a fixed-size sequence of words. Each word is represented as a real-valued vector, also known as a word embedding. Three methods were tried for initializing word embeddings: random, GloVe, and SSWE.
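A common way to mix pretrained and random initialization is to build one embedding matrix row per vocabulary word, copying the pretrained vector when one exists and falling back to a small random vector otherwise. A minimal sketch (the helper `build_embedding_matrix` and the tiny two-word "GloVe" table are hypothetical):

```python
import numpy as np

def build_embedding_matrix(vocab, pretrained, dim=50, seed=0):
    """Initialize one row per vocabulary word.

    vocab: dict mapping word -> integer index (0 reserved for padding).
    pretrained: dict mapping word -> vector (e.g. loaded from GloVe);
                words not found fall back to small random vectors.
    """
    rng = np.random.default_rng(seed)
    matrix = np.zeros((len(vocab) + 1, dim))  # row 0 stays zero for padding
    for word, idx in vocab.items():
        vec = pretrained.get(word)
        matrix[idx] = vec if vec is not None else rng.normal(scale=0.1, size=dim)
    return matrix

# Tiny illustration with a fake two-word pretrained table.
vocab = {"you": 1, "never": 2, "unseenword": 3}
pretrained = {"you": np.ones(50), "never": -np.ones(50)}
emb = build_embedding_matrix(vocab, pretrained)
```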
-To avoid overfitting, two dropout layers were used, one before the neural architecture layer and one after, with dropout rates of 0.25 and 0.5 respectively.
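Putting the layers together, the BLSTM variant of the shared architecture could be sketched in Keras as below. The sequence length, vocabulary size, and LSTM width are assumptions for illustration; only the layer order and the 0.25/0.5 dropout rates come from the description above:

```python
from tensorflow.keras import layers, models

MAX_LEN = 50     # fixed input sequence length (assumed)
VOCAB = 10000    # vocabulary size (assumed)
EMB_DIM = 50     # word embedding dimension (assumed)

def build_blstm(vocab=VOCAB, max_len=MAX_LEN, dim=EMB_DIM):
    """Embedding -> dropout(0.25) -> BLSTM -> dropout(0.5) -> sigmoid."""
    m = models.Sequential([
        layers.Input(shape=(max_len,)),
        layers.Embedding(vocab, dim),            # rows may be preloaded from GloVe/SSWE
        layers.Dropout(0.25),                    # before the neural architecture layer
        layers.Bidirectional(layers.LSTM(64)),   # swap this layer for the CNN/LSTM variants
        layers.Dropout(0.5),                     # after the neural architecture layer
        layers.Dense(1, activation="sigmoid"),   # bullying vs. non-bullying
    ])
    m.compile(optimizer="adam", loss="binary_crossentropy")
    return m

model = build_blstm()
```

Swapping only the `Bidirectional(LSTM(...))` line for a convolutional or plain LSTM layer yields the other model variants while keeping the rest of the stack identical.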
This table shows the results (F1 score) for traditional ML models.
Four models were tried: logistic regression (LR), support vector machine (SVM), random forest (RF), and naive Bayes (NB), as these were used in previous works.
Compared to the DNN models, the performance of all four traditional machine learning models was significantly lower.
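Such a traditional baseline is typically a bag-of-words classifier. A minimal sketch, assuming a TF-IDF representation feeding logistic regression (the toy posts and labels here are invented; the real experiments use the three SMP datasets):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy labelled posts (hypothetical), 1 = bullying.
posts = ["you are awesome", "great job today", "you are an idiot",
         "nobody likes you", "nice work friend", "go away loser"]
labels = [0, 0, 1, 1, 0, 1]

# TF-IDF features over word unigrams and bigrams, then a linear classifier.
baseline = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                         LogisticRegression())
baseline.fit(posts, labels)
preds = baseline.predict(posts)
```

Because the features are fixed n-gram counts, such models cannot adapt their representation to the task, which is one plausible reason they trail the DNN models here.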
The training datasets had a major class imbalance problem, with posts marked as bullying in the minority. As a result, all models were biased towards labeling posts as non-bullying. To remove this bias, the data from the bullying class was oversampled thrice; that is, bullying posts were replicated thrice in the training data.
Oversampling particularly helps the smallest dataset, Formspring, where the number of training instances for the bullying class is quite small (825).
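The oversampling step is simple to implement: walk the training set and emit each bullying post multiple times. A minimal sketch (the helper name and the interpretation of "thrice" as three total copies per bullying post are assumptions):

```python
def oversample_bullying(posts, labels, factor=3):
    """Replicate minority-class (bullying, label 1) posts `factor` times."""
    out_posts, out_labels = [], []
    for post, label in zip(posts, labels):
        reps = factor if label == 1 else 1
        out_posts.extend([post] * reps)
        out_labels.extend([label] * reps)
    return out_posts, out_labels

posts = ["hi there", "you idiot", "nice day"]
labels = [0, 1, 0]
op, ol = oversample_bullying(posts, labels)
```

Duplicating minority examples this way raises their weight in the loss without touching the majority class, at the cost of repeated samples in each epoch.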
– Datasets: F (Formspring), T (Twitter), W (Wikipedia)
– Datasets with oversampling of bullying posts: F+ (Formspring), T+ (Twitter), W+ (Wikipedia)
– Evaluation measures: P (Precision), R (Recall), F1 (F1 score)
Oversampling significantly improved the performance of all DNN models, with a major leap in all three evaluation measures. This table shows the effect of oversampling for a variety of word embedding methods, with BLSTM with attention as the detection model.