SlideShare a Scribd company logo
DINA: A Multi-Dialect Dataset for
Arabic Emotion Analysis
Muhammad Abdul-Mageed1,2, Hassan AlHuzliy1,
Duaa’ Abu Elhija1, Mona Diab2
Indiana University1, The George Washington
University2
Emotions
• Categories of emotion:
– Ekman (e.g., 1992) proposes there are 6 basic
emotions: anger, disgust, fear, happiness, sadness,
and surprise
– Plutchik (1980, 1985, 1994) adds trust and
anticipation
• Emotion on 3 dimensions:
– e.g., Francisco and Gervas (2006) mark the attributes
of pleasantness, activation, and dominance in the
genre of fairy tales.
– DINA is focused on the Ekman emotions.
2
Motivations
• Opinion Mining:
– Provides an enriching component beyond the mere binary
valence (i.e. positive and negative) of most sentiment
analysis systems.
• Health & Wellness
– Early detection of certain emotional disorders such as depression.
– Improving the well-being of people by exposing them to desired
emotions (since emotion is contagious [Kramer et al., 2014]).
• Education:
– Integrating emotionally-aware agents in intelligent
computer-assisted language learning, for example, should
prove useful and enhance the naturalness of the
pedagogical experience. 3
Motivations Cont.
• Marketing:
– e.g., emotion-sensitive language generation can help with
marketing (Heath et al., 2001; Tan et al., 2014), political
campaigning, etc.
• Security:
– Deflect potential hazards and anticipate dangerous
behaviors
• Author Profiling:
– Useful for predicting age and gender (Meina et al., 2013;
Flekova and Gurevych, 2013; Farias et al., 2013; Bamman
et al., 2014; Forner et al., 2013) and personality
(Mohammad and Kiritchenko, 2013)
4
Related Work
• SemEval-2007 Affective Text task (Strapparava and
Mihalcea, 2007) [SEM07]:
– Collection and classification of emotion and
valence in news headlines
• Aman and Szpakowicz (2007):
– Annotation and detection of emotions from blogs
• Qadir and Riloff (2014), Mohammad (2012), Wang
et al. (2012):
– use hashtags as an approximation of emotion
categories to collect emotion data
5
Arabic: Motivations
• Morphologically Rich Language
– Highly inflected: person, number, gender, case,
mood, aspect, voice
• Strategic Language:
– One of the 6 languages of UN, with ~ 300M
speakers worldwide
• Exponential Web growth:
– More than 2000% growth rate on the Web in 2010
onwards (www.internetworldstats.com).
6
Arabic Dialects
7
Data Collection
8
• Crawled Twitter data using a seed set of size < 10
phrases for each of the six Ekman emotion types.
• Each phrase is composed of an emotion word (e.g.,
“happy”) and the first personal pronoun “I”.
• We collect only tweets where a seed phrase occurs
in the tweet body text.
• This approach does not depend on hashtags.
• We collect 500 tweets from each of the 6 emotion
types. Total = 3,000.
• Seeds capture various Arabic dialects.
Seeds
9
Table 1. Example seeds
Annotation
10
• To verify the utility of this seeds approach, two
college-educated native speakers of Arabic
labeled the data.
• For labeling, we use one of four tags from the set
{“no-emotion/zero”, “weak-emotion”,
“moderate/fair-emotion”, “strong-emotion”}.
• We measure inter-annotator agreement as to
these intensity labels in Cohen’s Kappa.
• We also calculate the % of emotion-carrying
tweets per category (those that did not end up
assigned the label “no-emotion/zero”).
DINA: Agreement & % Emotion
11
Table 3. Agreement in fine-grained annotation and
average percentage of emotion
Gold Labels from Happiness Class
12
Table 2. Agreement in happiness annotation
Examples: Anger
13
Examples: Disgust
14
Examples: Fear
15
Examples: Happiness
16
Examples: Sadness
17
Examples: Surprise
18
Context of No- and Mixed Emotions
19
• Even with a list of well-crafted seeds, both
annotators assign “no-emotion” for 7.5% of the
data.
• This is a function of emotion being a
pragmatics-level phenomenon.
• Contexts for “no-emotion” include:
– Reported speech
– Sarcasm
Reported Speech
20
Sarcasm
21
Conclusion
22
• Emotion is like other pragmatic-level phenomena;
hence a seed-collection approach is useful, but not
perfect.
• Phenomena like reported speech and sarcasm interact
with our method for emotion data collection.
• DINA is multidialectal, but we do not have exact
dialect labels on the tweets.
• DINA is at 3,000 tweets, and we plan to grow the size.
• Full evaluation of DINA is only possible when we build
models exploiting these data, which we plan to do.

More Related Content

Similar to P05- DINA: A Multi-Dialect Dataset for Arabic Emotion Analysis

H010215561
H010215561H010215561
H010215561
IOSR Journals
 
ASERS-CNN: ARABIC SPEECH EMOTION RECOGNITION SYSTEM BASED ON CNN MODEL
ASERS-CNN: ARABIC SPEECH EMOTION RECOGNITION SYSTEM BASED ON CNN MODELASERS-CNN: ARABIC SPEECH EMOTION RECOGNITION SYSTEM BASED ON CNN MODEL
ASERS-CNN: ARABIC SPEECH EMOTION RECOGNITION SYSTEM BASED ON CNN MODEL
sipij
 
ASERS-CNN: Arabic Speech Emotion Recognition System based on CNN Model
ASERS-CNN: Arabic Speech Emotion Recognition System based on CNN ModelASERS-CNN: Arabic Speech Emotion Recognition System based on CNN Model
ASERS-CNN: Arabic Speech Emotion Recognition System based on CNN Model
sipij
 
Signal & Image Processing : An International Journal
Signal & Image Processing : An International JournalSignal & Image Processing : An International Journal
Signal & Image Processing : An International Journal
sipij
 
Emotions of Facebook Data
Emotions of Facebook DataEmotions of Facebook Data
Emotions of Facebook Data
CBS Competitiveness Platform
 
Interactive Visual Analysis of Human Emotions
Interactive Visual Analysis of Human EmotionsInteractive Visual Analysis of Human Emotions
Interactive Visual Analysis of Human Emotions
Michelle Zhou
 
Emotion detection on social media status in Myanmar language
Emotion detection on social media status in Myanmar language Emotion detection on social media status in Myanmar language
Emotion detection on social media status in Myanmar language
IJECEIAES
 
Colloquium on "factors influencing Emotional Intelligence among b.Ed. Student...
Colloquium on "factors influencing Emotional Intelligence among b.Ed. Student...Colloquium on "factors influencing Emotional Intelligence among b.Ed. Student...
Colloquium on "factors influencing Emotional Intelligence among b.Ed. Student...
Atul Thakur
 
3-540-45453-5_71.pdf
3-540-45453-5_71.pdf3-540-45453-5_71.pdf
3-540-45453-5_71.pdf
Jyoti863900
 
Emotional intelligence for personal growth
Emotional intelligence for personal growthEmotional intelligence for personal growth
Emotional intelligence for personal growth
Preeti Bhaskar
 
Emotional intelligence for personal growth
Emotional intelligence for personal growthEmotional intelligence for personal growth
Emotional intelligence for personal growth
Preeti Bhaskar
 
Emotion recognition based on the energy distribution of plosive syllables
Emotion recognition based on the energy distribution of plosive  syllablesEmotion recognition based on the energy distribution of plosive  syllables
Emotion recognition based on the energy distribution of plosive syllables
IJECEIAES
 
Emotional Profiling of Locations based on Social Media
Emotional Profiling of Locations based on Social MediaEmotional Profiling of Locations based on Social Media
Emotional Profiling of Locations based on Social Media
Dr. Juan Bernabe Moreno
 
The influence of user’s emotions in Recommender Systems for Decision Making
The influence of user’s emotions in Recommender Systems for Decision MakingThe influence of user’s emotions in Recommender Systems for Decision Making
The influence of user’s emotions in Recommender Systems for Decision Making
Marco Polignano
 
Recognizing Emotions in Spontaneous Speech
Recognizing Emotions in Spontaneous SpeechRecognizing Emotions in Spontaneous Speech
Recognizing Emotions in Spontaneous Speech
Sunil Kumar Kopparapu
 
Энэхүү илтгэлийг англи хэл дээр олон улсын хуралд илтгэл хэлэлцүүлсэн.
Энэхүү илтгэлийг англи хэл дээр олон улсын хуралд илтгэл хэлэлцүүлсэн.Энэхүү илтгэлийг англи хэл дээр олон улсын хуралд илтгэл хэлэлцүүлсэн.
Энэхүү илтгэлийг англи хэл дээр олон улсын хуралд илтгэл хэлэлцүүлсэн.
BayarmaaBuyantogtokh1
 
Emotional Multi-Agents System for Peer to peer E-Learning
Emotional Multi-Agents System for Peer to peer E-LearningEmotional Multi-Agents System for Peer to peer E-Learning
Emotional Multi-Agents System for Peer to peer E-Learning
Karlos Svoboda
 
Emotional intelligence.ppp
Emotional intelligence.pppEmotional intelligence.ppp
Emotional intelligence.ppp
Hannah Mungroo
 
emotional intel.PPP
emotional intel.PPPemotional intel.PPP
emotional intel.PPP
Hannah Mungroo
 
Positive Psychology in Organization
Positive Psychology in OrganizationPositive Psychology in Organization
Positive Psychology in Organization
Md Huzaifa Arshad
 

Similar to P05- DINA: A Multi-Dialect Dataset for Arabic Emotion Analysis (20)

H010215561
H010215561H010215561
H010215561
 
ASERS-CNN: ARABIC SPEECH EMOTION RECOGNITION SYSTEM BASED ON CNN MODEL
ASERS-CNN: ARABIC SPEECH EMOTION RECOGNITION SYSTEM BASED ON CNN MODELASERS-CNN: ARABIC SPEECH EMOTION RECOGNITION SYSTEM BASED ON CNN MODEL
ASERS-CNN: ARABIC SPEECH EMOTION RECOGNITION SYSTEM BASED ON CNN MODEL
 
ASERS-CNN: Arabic Speech Emotion Recognition System based on CNN Model
ASERS-CNN: Arabic Speech Emotion Recognition System based on CNN ModelASERS-CNN: Arabic Speech Emotion Recognition System based on CNN Model
ASERS-CNN: Arabic Speech Emotion Recognition System based on CNN Model
 
Signal & Image Processing : An International Journal
Signal & Image Processing : An International JournalSignal & Image Processing : An International Journal
Signal & Image Processing : An International Journal
 
Emotions of Facebook Data
Emotions of Facebook DataEmotions of Facebook Data
Emotions of Facebook Data
 
Interactive Visual Analysis of Human Emotions
Interactive Visual Analysis of Human EmotionsInteractive Visual Analysis of Human Emotions
Interactive Visual Analysis of Human Emotions
 
Emotion detection on social media status in Myanmar language
Emotion detection on social media status in Myanmar language Emotion detection on social media status in Myanmar language
Emotion detection on social media status in Myanmar language
 
Colloquium on "factors influencing Emotional Intelligence among b.Ed. Student...
Colloquium on "factors influencing Emotional Intelligence among b.Ed. Student...Colloquium on "factors influencing Emotional Intelligence among b.Ed. Student...
Colloquium on "factors influencing Emotional Intelligence among b.Ed. Student...
 
3-540-45453-5_71.pdf
3-540-45453-5_71.pdf3-540-45453-5_71.pdf
3-540-45453-5_71.pdf
 
Emotional intelligence for personal growth
Emotional intelligence for personal growthEmotional intelligence for personal growth
Emotional intelligence for personal growth
 
Emotional intelligence for personal growth
Emotional intelligence for personal growthEmotional intelligence for personal growth
Emotional intelligence for personal growth
 
Emotion recognition based on the energy distribution of plosive syllables
Emotion recognition based on the energy distribution of plosive  syllablesEmotion recognition based on the energy distribution of plosive  syllables
Emotion recognition based on the energy distribution of plosive syllables
 
Emotional Profiling of Locations based on Social Media
Emotional Profiling of Locations based on Social MediaEmotional Profiling of Locations based on Social Media
Emotional Profiling of Locations based on Social Media
 
The influence of user’s emotions in Recommender Systems for Decision Making
The influence of user’s emotions in Recommender Systems for Decision MakingThe influence of user’s emotions in Recommender Systems for Decision Making
The influence of user’s emotions in Recommender Systems for Decision Making
 
Recognizing Emotions in Spontaneous Speech
Recognizing Emotions in Spontaneous SpeechRecognizing Emotions in Spontaneous Speech
Recognizing Emotions in Spontaneous Speech
 
Энэхүү илтгэлийг англи хэл дээр олон улсын хуралд илтгэл хэлэлцүүлсэн.
Энэхүү илтгэлийг англи хэл дээр олон улсын хуралд илтгэл хэлэлцүүлсэн.Энэхүү илтгэлийг англи хэл дээр олон улсын хуралд илтгэл хэлэлцүүлсэн.
Энэхүү илтгэлийг англи хэл дээр олон улсын хуралд илтгэл хэлэлцүүлсэн.
 
Emotional Multi-Agents System for Peer to peer E-Learning
Emotional Multi-Agents System for Peer to peer E-LearningEmotional Multi-Agents System for Peer to peer E-Learning
Emotional Multi-Agents System for Peer to peer E-Learning
 
Emotional intelligence.ppp
Emotional intelligence.pppEmotional intelligence.ppp
Emotional intelligence.ppp
 
emotional intel.PPP
emotional intel.PPPemotional intel.PPP
emotional intel.PPP
 
Positive Psychology in Organization
Positive Psychology in OrganizationPositive Psychology in Organization
Positive Psychology in Organization
 

More from iwan_rg

Automatic text simplification evaluation aspects
Automatic text simplification  evaluation aspectsAutomatic text simplification  evaluation aspects
Automatic text simplification evaluation aspects
iwan_rg
 
تلخيص كتاب مقدمة في معالجة اللغة العربية
تلخيص كتاب مقدمة في معالجة اللغة العربيةتلخيص كتاب مقدمة في معالجة اللغة العربية
تلخيص كتاب مقدمة في معالجة اللغة العربية
iwan_rg
 
Building theoretical models using structured equation modeling
Building theoretical models using structured equation modelingBuilding theoretical models using structured equation modeling
Building theoretical models using structured equation modeling
iwan_rg
 
ورشة تضمين الكلمات في التعلم العميق Word embeddings workshop
ورشة تضمين الكلمات في التعلم العميق Word embeddings workshopورشة تضمين الكلمات في التعلم العميق Word embeddings workshop
ورشة تضمين الكلمات في التعلم العميق Word embeddings workshop
iwan_rg
 
Introduction to Arabic natural language processing (Infographics)
Introduction to Arabic natural language processing (Infographics)Introduction to Arabic natural language processing (Infographics)
Introduction to Arabic natural language processing (Infographics)
iwan_rg
 
Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...
iwan_rg
 
التقرير السنوي لمجموعة إيوان البحثية 1437هـ-1438هـ
التقرير السنوي لمجموعة إيوان البحثية 1437هـ-1438هـالتقرير السنوي لمجموعة إيوان البحثية 1437هـ-1438هـ
التقرير السنوي لمجموعة إيوان البحثية 1437هـ-1438هـ
iwan_rg
 
CHOOSING RESEARCH TOPICS AND WRITING RESEARCH PAPERS
CHOOSING RESEARCH TOPICS AND WRITING RESEARCH PAPERSCHOOSING RESEARCH TOPICS AND WRITING RESEARCH PAPERS
CHOOSING RESEARCH TOPICS AND WRITING RESEARCH PAPERS
iwan_rg
 
التقرير السنوي لمجموعة إيوان البحثية 1436هـ-1437هـ
التقرير السنوي لمجموعة إيوان البحثية 1436هـ-1437هـالتقرير السنوي لمجموعة إيوان البحثية 1436هـ-1437هـ
التقرير السنوي لمجموعة إيوان البحثية 1436هـ-1437هـ
iwan_rg
 
مركز تميز الحوسبة العربية المتقدمة
مركز تميز  الحوسبة العربية المتقدمةمركز تميز  الحوسبة العربية المتقدمة
مركز تميز الحوسبة العربية المتقدمة
iwan_rg
 
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
iwan_rg
 
P04- Toward an Arabic Punctuated Corpus: Annotation Guidelines and Evaluation
P04- Toward an Arabic Punctuated Corpus: Annotation Guidelines and Evaluation P04- Toward an Arabic Punctuated Corpus: Annotation Guidelines and Evaluation
P04- Toward an Arabic Punctuated Corpus: Annotation Guidelines and Evaluation
iwan_rg
 
P02- Towards a New Arabic Corpus of Dyslexic Texts
P02- Towards a New Arabic Corpus of Dyslexic TextsP02- Towards a New Arabic Corpus of Dyslexic Texts
P02- Towards a New Arabic Corpus of Dyslexic Texts
iwan_rg
 
P01- Toward a rich Arabic Speech Parallel Corpus for Algerian sub-Dialects
P01- Toward a rich Arabic Speech Parallel Corpus for Algerian sub-Dialects P01- Toward a rich Arabic Speech Parallel Corpus for Algerian sub-Dialects
P01- Toward a rich Arabic Speech Parallel Corpus for Algerian sub-Dialects
iwan_rg
 
Keynote - Computational Processing of Arabic Dialects: Challenges, Advances a...
Keynote - Computational Processing of Arabic Dialects: Challenges, Advances a...Keynote - Computational Processing of Arabic Dialects: Challenges, Advances a...
Keynote - Computational Processing of Arabic Dialects: Challenges, Advances a...
iwan_rg
 
OSACT2 LREC 2016 workshop proceedings
OSACT2 LREC 2016 workshop proceedingsOSACT2 LREC 2016 workshop proceedings
OSACT2 LREC 2016 workshop proceedings
iwan_rg
 
محاضرة المدونات اللغوية وأدواتها
محاضرة المدونات اللغوية وأدواتهامحاضرة المدونات اللغوية وأدواتها
محاضرة المدونات اللغوية وأدواتها
iwan_rg
 
لغويات المدونة الحاسوبية
لغويات المدونة الحاسوبيةلغويات المدونة الحاسوبية
لغويات المدونة الحاسوبية
iwan_rg
 
iWAN Annual Report 1435/1436H
 iWAN Annual Report 1435/1436H iWAN Annual Report 1435/1436H
iWAN Annual Report 1435/1436H
iwan_rg
 
التقرير السنوي لمجموعة إيوان البحثية لعام 1434 هـ - 1435هـ
التقرير السنوي لمجموعة إيوان البحثية لعام 1434 هـ -  1435هـالتقرير السنوي لمجموعة إيوان البحثية لعام 1434 هـ -  1435هـ
التقرير السنوي لمجموعة إيوان البحثية لعام 1434 هـ - 1435هـiwan_rg
 

More from iwan_rg (20)

Automatic text simplification evaluation aspects
Automatic text simplification  evaluation aspectsAutomatic text simplification  evaluation aspects
Automatic text simplification evaluation aspects
 
تلخيص كتاب مقدمة في معالجة اللغة العربية
تلخيص كتاب مقدمة في معالجة اللغة العربيةتلخيص كتاب مقدمة في معالجة اللغة العربية
تلخيص كتاب مقدمة في معالجة اللغة العربية
 
Building theoretical models using structured equation modeling
Building theoretical models using structured equation modelingBuilding theoretical models using structured equation modeling
Building theoretical models using structured equation modeling
 
ورشة تضمين الكلمات في التعلم العميق Word embeddings workshop
ورشة تضمين الكلمات في التعلم العميق Word embeddings workshopورشة تضمين الكلمات في التعلم العميق Word embeddings workshop
ورشة تضمين الكلمات في التعلم العميق Word embeddings workshop
 
Introduction to Arabic natural language processing (Infographics)
Introduction to Arabic natural language processing (Infographics)Introduction to Arabic natural language processing (Infographics)
Introduction to Arabic natural language processing (Infographics)
 
Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...Summary of Multilingual Natural Language Processing Applications: From Theory...
Summary of Multilingual Natural Language Processing Applications: From Theory...
 
التقرير السنوي لمجموعة إيوان البحثية 1437هـ-1438هـ
التقرير السنوي لمجموعة إيوان البحثية 1437هـ-1438هـالتقرير السنوي لمجموعة إيوان البحثية 1437هـ-1438هـ
التقرير السنوي لمجموعة إيوان البحثية 1437هـ-1438هـ
 
CHOOSING RESEARCH TOPICS AND WRITING RESEARCH PAPERS
CHOOSING RESEARCH TOPICS AND WRITING RESEARCH PAPERSCHOOSING RESEARCH TOPICS AND WRITING RESEARCH PAPERS
CHOOSING RESEARCH TOPICS AND WRITING RESEARCH PAPERS
 
التقرير السنوي لمجموعة إيوان البحثية 1436هـ-1437هـ
التقرير السنوي لمجموعة إيوان البحثية 1436هـ-1437هـالتقرير السنوي لمجموعة إيوان البحثية 1436هـ-1437هـ
التقرير السنوي لمجموعة إيوان البحثية 1436هـ-1437هـ
 
مركز تميز الحوسبة العربية المتقدمة
مركز تميز  الحوسبة العربية المتقدمةمركز تميز  الحوسبة العربية المتقدمة
مركز تميز الحوسبة العربية المتقدمة
 
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
 
P04- Toward an Arabic Punctuated Corpus: Annotation Guidelines and Evaluation
P04- Toward an Arabic Punctuated Corpus: Annotation Guidelines and Evaluation P04- Toward an Arabic Punctuated Corpus: Annotation Guidelines and Evaluation
P04- Toward an Arabic Punctuated Corpus: Annotation Guidelines and Evaluation
 
P02- Towards a New Arabic Corpus of Dyslexic Texts
P02- Towards a New Arabic Corpus of Dyslexic TextsP02- Towards a New Arabic Corpus of Dyslexic Texts
P02- Towards a New Arabic Corpus of Dyslexic Texts
 
P01- Toward a rich Arabic Speech Parallel Corpus for Algerian sub-Dialects
P01- Toward a rich Arabic Speech Parallel Corpus for Algerian sub-Dialects P01- Toward a rich Arabic Speech Parallel Corpus for Algerian sub-Dialects
P01- Toward a rich Arabic Speech Parallel Corpus for Algerian sub-Dialects
 
Keynote - Computational Processing of Arabic Dialects: Challenges, Advances a...
Keynote - Computational Processing of Arabic Dialects: Challenges, Advances a...Keynote - Computational Processing of Arabic Dialects: Challenges, Advances a...
Keynote - Computational Processing of Arabic Dialects: Challenges, Advances a...
 
OSACT2 LREC 2016 workshop proceedings
OSACT2 LREC 2016 workshop proceedingsOSACT2 LREC 2016 workshop proceedings
OSACT2 LREC 2016 workshop proceedings
 
محاضرة المدونات اللغوية وأدواتها
محاضرة المدونات اللغوية وأدواتهامحاضرة المدونات اللغوية وأدواتها
محاضرة المدونات اللغوية وأدواتها
 
لغويات المدونة الحاسوبية
لغويات المدونة الحاسوبيةلغويات المدونة الحاسوبية
لغويات المدونة الحاسوبية
 
iWAN Annual Report 1435/1436H
 iWAN Annual Report 1435/1436H iWAN Annual Report 1435/1436H
iWAN Annual Report 1435/1436H
 
التقرير السنوي لمجموعة إيوان البحثية لعام 1434 هـ - 1435هـ
التقرير السنوي لمجموعة إيوان البحثية لعام 1434 هـ -  1435هـالتقرير السنوي لمجموعة إيوان البحثية لعام 1434 هـ -  1435هـ
التقرير السنوي لمجموعة إيوان البحثية لعام 1434 هـ - 1435هـ
 

Recently uploaded

বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
eBook.com.bd (প্রয়োজনীয় বাংলা বই)
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
iammrhaywood
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
S. Raj Kumar
 
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Leena Ghag-Sakpal
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
Krassimira Luka
 
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
สมใจ จันสุกสี
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
BoudhayanBhattachari
 
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPLAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
RAHUL
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
How to Create a More Engaging and Human Online Learning Experience
How to Create a More Engaging and Human Online Learning Experience How to Create a More Engaging and Human Online Learning Experience
How to Create a More Engaging and Human Online Learning Experience
Wahiba Chair Training & Consulting
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
EduSkills OECD
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
Solutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptxSolutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptx
spdendr
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
TechSoup
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
Celine George
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
Nicholas Montgomery
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
adhitya5119
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
Amin Marwan
 
Constructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective CommunicationConstructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective Communication
Chevonnese Chevers Whyte, MBA, B.Sc.
 

Recently uploaded (20)

বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
 
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
 
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPLAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
How to Create a More Engaging and Human Online Learning Experience
How to Create a More Engaging and Human Online Learning Experience How to Create a More Engaging and Human Online Learning Experience
How to Create a More Engaging and Human Online Learning Experience
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
Solutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptxSolutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptx
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
 
Constructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective CommunicationConstructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective Communication
 

P05- DINA: A Multi-Dialect Dataset for Arabic Emotion Analysis

  • 1. DINA: A Multi-Dialect Dataset for Arabic Emotion Analysis Muhammad Abdul-Mageed1,2, Hassan AlHuzliy1, Duaa’ Abu Elhija1, Mona Diab2 Indiana University1, The George Washington University2
  • 2. Emotions • Categories of emotion: – Ekman (e.g., 1992) proposes there are 6 basic emotions: anger, disgust, fear, happiness, sadness, and surprise – Plutchik (1980, 1985, 1994) adds trust and anticipation • Emotion on 3 dimensions: – e.g., Francisco and Gervas (2006) mark the attributes of pleasantness, activation, and dominance in the genre of fairy tales. – DINA is focused on the Ekman emotions. 2
  • 3. Motivations • Opinion Mining: – Provides an enriching component beyond the mere binary valence (i.e. positive and negative) of most sentiment analysis systems. • Health & Wellness – Early detection of certain emotional disorders such as depression. – Improving the well-being of people by exposing them to desired emotions (since emotion is contagious [Kramer et al., 2014]). • Education: – Integrating emotionally-aware agents in intelligent computer-assisted language learning, for example, should prove useful and enhance the naturalness of the pedagogical experience. 3
  • 4. Motivations Cont. • Marketing: – e.g., emotion-sensitive language generation can help with marketing (Heath et al., 2001; Tan et al., 2014), political campaigning, etc. • Security: – Deflect potential hazards and anticipate dangerous behaviors • Author Profiling: – Useful for predicting age and gender (Meina et al., 2013; Flekova and Gurevych, 2013; Farias et al., 2013; Bamman et al., 2014; Forner et al., 2013) and personality (Mohammad and Kiritchenko, 2013) 4
  • 5. Related Work • SemEval-2007 Affective Text task (Strapparava and Mihalcea, 2007) [SEM07]: – Collection and classification of emotion and valence in news headlines • Aman and Szpakowicz (2007): – Annotation and detection of emotions from blogs • Qadir and Riloff (2014), Mohammad (2012), Wang et al. (2012): – use hashtags as an approximation of emotion categories to collect emotion data 5
  • 6. Arabic: Motivations • Morphologically Rich Language – Highly inflected: person, number, gender, case, mood, aspect, voice • Strategic Language: – One of the 6 languages of UN, with ~ 300M speakers worldwide • Exponential Web growth: – More than 2000% growth rate on the Web in 2010 onwards (www.internetworldstats.com). 6
  • 8. Data Collection 8 • Crawled Twitter data using a seed set of size < 10 phrases for each of the six Ekman emotion types. • Each phrase is composed of an emotion word (e.g., “happy”) and the first personal pronoun “I”. • We collect only tweets where a seed phrase occurs in the tweet body text. • This approach does not depend on hashtags. • We collect 500 tweets from each of the 6 emotion types. Total = 3,000. • Seeds capture various Arabic dialects.
  • 10. Annotation 10 • To verify the utility of this seeds approach, two college-educated native speakers of Arabic labeled the data. • For labeling, we use one of four tags from the set {“no-emotion/zero”, “weak-emotion”, “moderate/fair-emotion”, “strong-emotion”}. • We measure inter-annotator agreement as to these intensity labels in Cohen’s Kappa. • We also calculate the % of emotion-carrying tweets per category (those that did not end up assigned the label “no-emotion/zero”).
  • 11. DINA: Agreement & % Emotion 11 Table 3. Agreement in fine-grained annotation and average percentage of emotion
  • 12. Gold Labels from Happiness Class 12 Table 2. Agreement in happiness annotation
  • 19. Context of No- and Mixed Emotions 19 • Even with a list of well-crafted seeds, both annotators assign “no-emotion” for 7.5% of the data. • This is a function of emotion being a pragmatics-level phenomenon. • Contexts for “no-emotion” include: – Reported speech – Sarcasm
  • 22. Conclusion 22 • Emotion is like other pragmatic-level phenomena; hence a seed-collection approach is useful, but not perfect. • Phenomena like reported speech and sarcasm interact with our method for emotion data collection. • DINA is multidialectal, but we do not have exact dialect labels on the tweets. • DINA is at 3,000 tweets, and we plan to grow the size. • Full evaluation of DINA is only possible when we build models exploiting these data, which we plan to do.

Editor's Notes

  1. This paper, presented DINA, a multi-dialect data set for Arabic emotion analysis. We described an automatic approach of acquiring emotion data in MSA and dialectal Arabic from the Twitter genre, and evaluated that approach via human annotation. The DINA corpus comprises Modern Standard Arabic (MSA) and dialectal Arabic (DA) data crawled from Twitter
  2. Detection of emotion is a useful task, as it can have many practical applications. The following are only some examples:
  3. Emotion for personality: Mohammad & Kiritchenko, 2015 Mood: Bollen et al. (2011) use moods like tension, depression, anger, vigor, fatigue, and confusion to predict the stock market Other: Pearl and Steyvers (2010) describe work on identifying attitudes and intentions, and discuss attributes like politeness, rudeness, embarrassment, formality, persuasion, deception, confidence, and disbelief .
  4. This map distribution of Arabic Dialects within the Arabic countries
  5. Table 1. reports some features of the studied DA and MSA+DA corpora. Examples of the seeds words for each emotion type are provided in Table 1
  6. - As Table 2 shows, annotators agreed to assign one or another of the emotion intensity tags (i.e., ”weak,” ”fair,” and ”strong”) in 92.48% of the cases. This 92.48% agreement reflects the effectiveness of the phrase-based seed approach for automatically acquiring emotion data. - Table 2 also shows that the judges disagreed to varying degrees when choosing the specific positive emotion tag to assign, with agreement as high as 0.71% in the case of happiness and as low as 0.23% in the case of fear, and an average of 0.51%. We now discuss further details related to each of the 6 types of emotions.
  7. Table 6 below shows agreement between the annotators on labeling the automatically acquired happiness data. Table 6 also shows that the ”zero” is the most confused label and that it is confused with all other labels by one judge and by all other labels except ”weak” by the other judge
  8. - Annotators assign “no-emotion” because of the existence of “reported speech” here.
  9. - Annotators assign “no-emotion” because of the existence of “sarcasm” here.