thesis_palogiannidi

Aﬀective Analysis and Modeling of Spoken
Dialogue Transcripts
Thesis presentation
Elisavet Palogiannidi
Committee
Alexandros Potamianos (supervisor)
Polychronis Koutsakis (co-supervisor)
Aikaterini Mania
School of Electronic and Computer Engineering
Technical University of Crete
Chania, Crete
11 July 2016

Introduction Aﬀective models Experiments and Results Q&A Conclusions
What if there was no emotion?
Elisavet Palogiannidi TUC Aﬀective Analysis and Modeling of Spoken Dialogue Transcripts 2/49

What if there were no computers?

What is the relationship between computers and emotions?

What is all about?

Outline
1 Introduction
Motivation
Emotion
Contributions
2 Affective models
Semantic Affective Model
Compositional Affective Model
Sentence level Affective Models
3 Experiments and Results
Semantic - Affective model
Sentence level affective models
4 Q&A
5 Conclusions

Outline
1 Introduction
Motivation
Emotion
Contributions
2 Aﬀective models
4 Q&A
5 Conclusions

Motivation
Emotion detection from text
“Emotion is perceived in text and it can be elicited by its
content and form”
Goal:Assign continuous high quatlity affective scores on
various granularity lexical tokens, using semantic and affective
features, for multiple languages
Motivation: “Semantic similarity implies affective similarity”
Affective text labelling at the core of many applications

Motivation
Applications
Aﬀective text applications
Sentiment analysis of Social Media, news, product reviews
Emotion detection on spoken dialogue
Multimodal applications
Semantic aﬀective model (SAM) [Malandrakis et al. 2013]
Has been applied to tweets, sms and news headlines
Is applicable to words or n-grams and numerous dimensions
Valence, Arousal, Dominance, Concreteness, Imagability,
Familiarity, Gender Ladenness
We focus on the prediction of Valence, Arousal, Dominance

Emotion
Continuous Affective space
Introduction
• Goals: 1) Create an emotional resource for the Greek language
2) Use it to automatically estimate affective ratings of words
• Manually created resources have low language coverage (about 1K words)
• Computational models are used to expand manually created affective lexica
Affective (Emotional) Dimensions
Valence Arousal Dominance
Negative to positive Calming to exciting Controlled to controller
Valence-Arousal Distributions Across Languages
• Valence-Arousal distributions for different languages affective lexica
Greek affective lexicon ratings V-shape across languages
0.25
0.5
0.75
1
Arousal
flirtation
treasure
friend
happy
laugh
victory
poster
slave
sadness
pillow
syphilis
anger
commit suicide
failure
−1 −0.5 0 0.5 1
0.25
0.5
0.75
1
0.5
0.75
1
usal
L

Contributions
Annotated Resources: Greek ANEW
We created the ﬁrst Greek Aﬀective Lexicon

Contributions
Models for multiple languages
We extended SAM to multiple languages
We improved the mapping from semantic to aﬀective space
We tried various contextual features and weighting schemes

Contributions
Compositional Affective models
The meaning of complex lexical structures p is composed by
the meaning of the constituent words α, β
Compositional approaches in vector-based semantics:
Composition of semantic representation of the phrase’s
constituent words
Combine by addition and multiplication [Mitchell and Lapata.,
2008; Mitchell and Lapata, 2010]
.[Baroni and Zamparelli.,2010] compositional approach based
on POS tags
We assume that composition occurs in the affective space,
Combine affective ratings and not semantic representation of
constituent words

Contributions
Sentiment Analysis in Twitter
We achieved state of the art performance winning a word wide
competition
...................
Semantic Affective system (Baseline)
.
• Tools: POS-tagging, multiword expression, hashtag expan
– Semantic similarity implies affective similarity: SAM “Distr
Semantic Models for Affective Text Analysis, Malandrakis et al. 2013”
• Goal:estimate the affect of word pairs mor
curately than the non-compositional models
• Compositionality: the meaning of the w
is constructed form the meaning of the part
• Novelty: Applied on affective space
• Adopt modifier-head structure: p = m.h
• E.g., : p=“green parrot” and p=“dead par
– m : green/dead & h : parrot
– m modifies the affect of h
Continuous Affective spaces
• Valence - Arousal - Dominance
Semantic Affective Model (SAM
Semantic similarity implies affective similarity
tributional Semantic Models for Affective Text Analysis, Malandrakis et a
ˆυ(tj) = a0 +
N∑
i=1
aiυ(wi)S(tj, wi)
• ˆυ(tj): the affective rating of the unknown t
tj, w1..N: the seeds, υ(wi) and ai: the affe
rating and the weight of wi, a0: the bias,
semantic similarity between tokens
Each modifie
unique bahavi
Applied on
words &
word pairs!
number of seedsaffective rating
of the unknown token
bias
weights
assigned to seeds
Semantic similarity
between tokens
affective ratings
of seeds
• Two step feature selection, Naive Bayes (NB) tree classifi
....
Topic Modeling - based System (TM)
.
• Adapt semantic space on each tweet
• LDA → detect topics (16)→ split corpus →
..............................
In Subtask
is used as f
.
Subtask B
at SemEval 2016 Task 4
Sentiment Analysis in Twitter
using Semantic-Affective Model Ad

Contributions
Publications
1 Elisavet Palogiannidi, Elias Iosif, Polychronis Koutsakis and Alexandros Potamianos, “Valence, Arousal
and Dominance Estimation for English, German, Greek, Portuguese and Spanish Lexica using Semantic
Models”, in Proceedings of Interspeech, September 2015.
2 Elisavet Palogiannidi, Elias Iosif, Polychronis Koutsakis and Alexandros Potamianos “Affective lexicon
creation for the Greek language”, in Proceedings of the 10th edition of the Language Resources and
Evaluation Conference (LREC) 2016.
3 Elisavet Palogiannidi, Polychronis Koutsakis and Alexandros Potamianos, “A semantic-affective
compositional approach for the affective labelling of adjective-noun and noun-noun pairs”, in Proceedings
of WASSA 2016.
4 Elisavet Palogiannidi, Athanasia Kolovou, Fenia Christopoulou, Filippos Kokkinos, Elias Iosif, Nikolaos
Malandrakis, Harris Papageorgiou , Shrikanth Narayanan and Alexandros Potamianos, “Tweester:
Sentiment analysis in twitter using semantic-affective model adaptation”, in Proceedings of the 10th
International Workshop on Semantic Evaluation (SemEval) 2016.
5 Jose Lopes, Arodami Chorianopoulou, Elisavet Palogiannidi, Helena Moniz, Alberto Abad, Katerina Louka,
Elias Iosif and Aleandros Potamianos “The SpeDial Datasets: Datasets for Spoken Dialogue Systems
Analytics”, in Proceedings of the 10th edition of the Language Resources and Evaluation Conference
(LREC) 2016.
6 Spiros Georgiladakis, Georgia Athanasopoulou, Raveesh Meena, Jose Lopes, Arodami Chorianopoulou,
Elisavet Palogiannidi, Elias Iosif, Gabriel Skantze and Alexandros Potamianos “Root Cause Analysis of
Miscommunication Hotspots in Spoken Dialogue Systems”, in Proceedings of Interspeech 2016 (to appear).

Outline
1 Introduction
Motivation
Emotion
Contributions
2 Aﬀective models
4 Q&A
5 Conclusions

Semantic models
Building block for machine learning in NLP
Corpus based approach: Distributional Semantic Models
(DSM)
Semantic information extracted from word frequencies
(co-occurence counts, context vectors)
Context based semantic similarities
“Similarity of context implies similarity of meaning” [Harris ’54]
Contextual windows that contain words or character n-grams
Binary or PPMI weighting scheme
Semantic similarity between two words: cosine of their
contextual feature vectors

From Semantic to Affective Space
Affective model: Extension of [Turney and Littman, 2002],
proposed by [Malandrakis et al. 2013b]
The semantic model is built,
based on the corpus
Training phase for the semantic
to the affective mapping
Affective lexica are used for the
training, e.g., ANEW [Bradley
and Lang 1999]
[Malandrakis et al. 2014]

Affective model [Malandrakis et al. ’13]
Requires a small, manually annotated affective lexicon
Assumption: The affective score of a word can be expressed
as a linear combination of the affective ratings of seed words
weighted by semantic similarity and trainable weights αi
ˆυ(wj ) = α0 +
N
i=1
αi υ(wi )S(wj , wi ) (1)
ˆυ(wj ): estimated affective rating of the unknown word wj
w1..N : seed words
υ(wi ): affective rating of wi (valence, arousal or dominance)
αi : weight assigned to wi (α0: bias)
S(·): semantic similarity between wj and wi

Semantic - aﬀective mapping
Not all seeds are equally salient
Weights estimation (α0 · · · αN) through supervised learning



1 S(w1, w1)υ(w1) · · · S(w1, wN )υ(wN )
1
...
...
...
1 S(wK , w1)υ(w1) · · · S(wK , wN )υ(wN )


 ·



α0
...
αN


 =





1
υ(w1)
.
..
υ(wK )





(2)
A system of K linear equations with N + 1 (N < K) unknown
variables is solved using
Least Squares Estimation (LSE)
Ridge Regression (RR)

Compositionality
The meaning of the whole is constructed by the meaning of
the parts
New idea: Applied on affective instead of semantic space
Adopt a modifier-head (m − h) structure for word pairs
Assumption: each modifier has unique behavior that can be
learnt in a distributional approach
e.g., green parrot Vs. dead parrot
modifiers m modify the affective content of h

Compositional model (1/2)
The meaning of more complex lexical structures is composed
by the meaning of the constituent words

Compositional model (2/2)
The affective content of the word pair is the modified affective
content of the head
ˆυc(p) = β + W ˆυ(h)
β, W are modifier’s bahavior
ˆυ(h) is the affective content of the head
Applied on 1D (W , β are scalars ) and 3D (W ∈ IR3×3
,
β ∈ IR3
) affective spaces
Compositionality measure: Mean Squared Error over training
pairs
Measured between compositional and bigram SAM
High MSE → low compositional model appropriateness

Fusion of Compositional and non compositional models
Each word pair has diﬀerent compositionality degree
Non-compositional models
1 Unigram SAM (U-SAM): average of words’ aﬀective ratings
2 Bigram (B-SAM): apply SAM directly on word pair
Fusion schemes
Average (Avg) and Weighted average
MSE-based :
Estimate λ (pj ) = 0.5
1+e
−MSE(pj ) for each training pair
Average all λ (pj ) to learn the parameter λ(p) of the test pair
Weight compositional (C) and non-compositional (nC) models
based on λ(p), i.e., υφ(p) = λ(p)nC + (1 − λ(p))C

Fusion of words’ affective ratings
Sentence level affective rating approaches
1 Aggregation of the constituent words’ affective ratings
Average
Weighted Average
Maximum absolute affective rating
2 Classification based on affective features
Statistics of words’ affective ratings
POS-tag grouping

Tweester: Semantic aﬀective model system
Two - step feature selection
Naive Bayes tree classiﬁer

Outline
1 Introduction
Motivation
Emotion
Contributions
2 Aﬀective models
4 Q&A
5 Conclusions

Experimental Procedure
Goal
Estimate Valence, Arousal and Dominance scores of words in
multiple languages (English, German, Greek, Portuguese, Spanish)
Semantic similarity computation
Words (W) and character n-grams contextual features
Binary (B) and PPMI weighting schemes
Fusion: combine different types of contextual feature vectors
Evaluation datasets
The affective lexica of each language
10-fold cross validation: 90% train and 10% test
Evaluation Metrics: Pearson Correlation, Binary classification
accuracy (positive vs. negative values)

Valence performance as a function of the seeds
Valence correlation and classiﬁcation accuracy
Performance as a function of the seeds
Valence evaluation of ﬁve languages
0 100 200 300 400 500 600
0.65
0.7
0.75
0.8
0.85
0.9
Number of seeds
Correlation
0 100 200 300 400 500 600
0.7
0.75
0.8
0.85
0.9
Number of seeds
ClassificationAccuracy
English Greek German Portuguese Spanish

Comparison of aﬀective dimensions
Valence (a), Arousal (b), Dominance (c) clas. accuracy
0 100 200 300 400 500 600
0.7
0.75
0.8
0.85
0.9
Number of seeds
English Greek German Portuguese Spanish
(a)
0 100 200 300 400 500 600
0.65
0.7
0.75
0.8
0.85
0.9
Number of seeds
(b)
0 100 200 300 400 500 600
0.65
0.7
0.75
0.8
0.85
0.9
Number of seeds
(c)

Comparison of RR and LSE
10 200 400 600 900
0.2
0.4
0.6
0.7
0.8
Number of seeds
Correlation
Arousal
10 200 400 600 900
0.65
0.7
0.75
0.8
Number of seeds
Arousal
Spanish − RR Spanish − LSE Greek − LSE Greek − RR
Using RR with the appropriate λ
Performance stays robust for a large number of seeds
RR improves performance of Greek and Spanish on Arousal

Valence classiﬁcation accuracy for 600 seeds
PPMI works better than binary
Sem. Similarity English Greek Spanish Portuguese German
W-B 86.9 84.3 85.9 89.3 77.1
W-PPMI 90.9 87.6 85.3 90.8 85.2

Character n-grams work equally well with words
W-B 86.9 84.3 85.9 89.3 77.1
W-PPMI 90.9 87.6 85.3 90.8 85.2
4gram-PPMI 89.8 87.5 87.7 87.4 82.6

Concatenating diﬀerent contextual vectors does not improve
the performance
W-B 86.9 84.3 85.9 89.3 77.1
W-PPMI 90.9 87.6 85.3 90.8 85.2
4gram-PPMI 89.8 87.5 87.7 87.4 82.6
W/4gram-PPMI 90.5 87.2 87.9 89.3 83.0

Concatenating diﬀerent contextual vectors does not improve
the performance
W-B 86.9 84.3 85.9 89.3 77.1
W-PPMI 90.9 87.6 85.3 90.8 85.2
4gram-PPMI 89.8 87.5 87.7 87.4 82.6
W/4gram-PPMI 90.5 87.2 87.9 89.3 83.0
Weighting scheme is the most important parameter
English achieves highest performance
German achieves highest performance increase
Char. 4-gram-PPMI works almost always better than W-B

Experimental procedure
Goal
Estimate Valence scores of word pairs employing compositional
phenomena
Movie domain word pairs
1009 Adjective Noun (AN) and 357 Noun Noun (NN)
Training corpus: 116M web snippets
Extra training on fusion schemes for weights estimation

Classiﬁcation Accuracy for AN and NN word pairs
U−SAMB−SAM 1D 3D Avg W.Avg MSE−Based
74
76
80
84
86
88
Affective models
ClassificationAccuracy(%)
NN AN Chance − NN Chance − AN

74
76
80
84
86
88
Affective models
Compositional models work better than B-SAMs but worse
than U-SAMs

74
76
80
84
86
88
Affective models
than U-SAMs
Highest performance achieved for fusion of compositional and
non-compositional models

74
76
80
84
86
88
Affective models
than U-SAMs
Highest performance achieved for fusion of compositional and
non-compositional models
Small diﬀerences between 1D and 3D models

Evaluation on News Headlines
Valence estimation of 1000 news headlines aggregating
affective ratings
Affective Model Classification Accuracy (%)
Chance 52.6
Content Words All words
Average 72.4 70.9
Weighted Average 71.6 73.1
Maximum absolute valence 67 66.4

Evaluation on Movie Subtitles
Valence estimation of movie subtitles from 12 movies
Annotate subtitles on Valence through Crowdsourcing
Leave-one-movie-out scheme
Average performance for all the movies as a function of the
seeds
10 50 100 200 300 400 500 600 700 800
0.5
0.55
0.6
0.65
0.7
Movies subtitles Dataset
Seeds
10 50 100 200 300 400 500 600 700 800
0
0.1
0.2
0.3
0.4
Movies subtitles Dataset
Seeds
Correlation

Outline
1 Introduction
Motivation
Emotion
Contributions
2 Aﬀective models
4 Q&A
5 Conclusions

How the sentence level models perform on real data?
Twitter (written text)
Polarity detection task (positive vs. negative tweets)
Classifier with affective features trained on tweets
Evaluation metric: average recall of positive, negative class ρ
System ρ
Baseline 0.821
LYS (Spain 0.791
Amazon 0.784
Spoken Dialogue (transcriptions of speech)
The same utterance can be expressed with different emotion
Affective text models usually don’t work for short utterances
Moderate performance is reached for larger utterances of real
dialogues
Performance improves when fusing with speech system

Can SAM be applied on a language with no affective
lexicon lexicon?
1 Create a new affective lexicon
2 Use cross-language modeling
Translate the words of an already existing affective lexicon
Use the other language’s affective ratings
0 100 200 300 400 500 600
0.75
0.8
0.85
0.9
0.95
Seeds
S: Greek, T: Portuguese
S: English, T: Portuguese
S: Spanish, T: Portuguese
Portuguese
S: Greek, English, Spanish

Outline
1 Introduction
Motivation
Emotion
Contributions
2 Aﬀective models
4 Q&A
5 Conclusions

Conclusions
Affective models for emotion detection of various granularity
lexical units
We showed that SAM for words
Is language and affective dimension independent
Performance depends on the weights estimation method
We showed that Cross language SAM performs equally well
Compositional models can be applied on affective space
The nature of the written data determines the performance of
the sentence level model

Future work
Identify parameters that deﬁne compositionality
Employ compositional semantics on compositional model
Ambiguous interaction between the words of the word pair
Incorporate morphological information in diﬀerent languages’
SAMs
Compositional models for sentences

References
Malandrakis et al. 2013 N. Malandrakis, A. Potamianos, E. Iosif and S. Narayanan. 2013. “Distributional
Semantic Models for Aﬀective Text Analysis”. IEEE Transactions on Audio, Speech and Language Processing. 2013
Malandrakis et al. 2014 N.Malandrakis, A. Potamianos, K. J. Hsu , K. N. Babeva, M. C. Feng , G. C. Davison , S.
Narayanan, 2014 “Aﬀective Language Model Adaptation Via Corpus Selection”, ICASSP 2014
Turney and Littman 2002 P. Turney and M. L. Littman, “Unsupervised Learning of Semantic Orientation from a
Hundred-Billion-Word Corpus. Technical report ERC-1094 (NRC 44929),” National Research. Council of Canada,
2002.
Mitchell, J., and Lapata 2008 J. Mitchell and M. Lapata. Vector-based models of semantic composition. In Proc.
of (ACL), pages 236244. 2008
Mitchell, J., and Lapata 2010 Mitchell, J., and Lapata, M. Composition in distributional models of semantics.
Cognitive science 34, 8 (2010)
Baroni and Zamparelli 2010 Baroni, M., and Zamparelli., R. Nouns are vectors, adjectives are matrices:
Representing adjective-noun constructions in semantic space. In in Proc. of EMNLP (2010).

thesis_palogiannidi

Recommended

Recommended

More Related Content

What's hot

What's hot (14)

Viewers also liked

Viewers also liked (8)

Similar to thesis_palogiannidi

Similar to thesis_palogiannidi (20)

thesis_palogiannidi