Distributional semantic word representations allow Natural Language Processing systems to extract and model an immense amount of information about a language. This technique maps words into a high-dimensional continuous space through the use of a single-layer neural network, and it has enabled advances in many Natural Language Processing research areas and tasks. These representation models are evaluated with analogy tests: questions of the form ``If a is to a', then b is to what?'' are answered by composing multiple word vectors and searching the vector space. During the neural network training process, each word is examined as a member of its context. Generally, a word's context is taken to be the elements adjacent to it within a sentence. While some work has examined the effect of expanding this definition, very little exploration has been done in this area. Further, no inquiry has been conducted into the specific linguistic competencies of these models, or into whether modifying their contexts affects the information they extract. In this paper we propose a thorough analysis of the various lexical and grammatical competencies of distributional semantic models. We aim to leverage analogy tests to evaluate the most advanced distributional model across 14 different types of linguistic relationships. With this information we will then be able to investigate whether modifying the training context produces differences in quality across any of these categories. Ideally, we will identify training approaches that increase precision in specific linguistic categories, which will allow us to investigate whether these improvements can be combined, by joining the information used in the different training approaches, into a single, improved model.
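As a concrete illustration of how an analogy test queries the vector space, the following sketch answers ``a is to a' as b is to ?'' by vector offset and cosine similarity. The four-word embedding table here is a toy invented for illustration; real evaluations use vectors learned by a trained model.

```python
from math import sqrt

# Toy embedding table (invented values); a trained model supplies real vectors.
emb = {
    "man":   [1.0, 0.0, 0.2],
    "woman": [1.0, 1.0, 0.2],
    "king":  [0.2, 0.0, 1.0],
    "queen": [0.2, 1.0, 1.0],
}

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    return dot / (sqrt(sum(x * x for x in u)) * sqrt(sum(y * y for y in v)))

def analogy(a, a2, b, emb):
    """Answer "a is to a2 as b is to ?" via the offset v(a2) - v(a) + v(b),
    returning the nearest remaining word by cosine similarity."""
    target = [ya - xa + xb for xa, ya, xb in zip(emb[a], emb[a2], emb[b])]
    candidates = {w: v for w, v in emb.items() if w not in (a, a2, b)}
    return max(candidates, key=lambda w: cosine(target, candidates[w]))
```

Note that the query words themselves are excluded from the search, as is standard in analogy evaluation, since the offset vector often lies closest to one of the inputs.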
3. Big Five Personality Inventory
(Norman, 1963; Goldberg, 1981)
1. Openness to experience
2. Extraversion
3. Conscientiousness
4. Emotional stability
(vs. Neuroticism)
5. Agreeableness
(Diagram: language use as a signal of the five traits: Agreeable, Stable, Conscientious, Open, Extraverted)
4. Previous Works
1. Pennebaker and King (1999)
   a. Self-report Essays dataset with 2,468 instances
2. Automatic personality prediction based on text (Pennebaker et al., 2001)
   a. Extracted linguistic features using the Linguistic Inquiry and Word Count (LIWC) text analysis tool
3. Mohammad and Kiritchenko (2013) introduced new linguistic features
4. Tighe et al. (2016) applied feature-reduction techniques such as Principal Component Analysis (PCA) and Information Gain (IG)
5. Motivation
● Applications in daily-life domains
○ Dating websites
○ Anti-terrorism
● Character mining
○ Attribute extraction
6. Objectives
● Create a new Friends dataset for the task
● Present a novel approach to automatic personality prediction using attention-based neural networks with word embeddings
● Evaluate our models on both datasets
8. Big Five Theories (John et al., 1991)
Big Five Traits: Facets
Extraversion vs. introversion: sociable, forceful, energetic, adventurous, enthusiastic, outgoing
Agreeableness vs. antagonism: forgiving, not demanding, warm, not stubborn, not show-off, sympathetic
Conscientiousness vs. lack of direction: efficient, organized, not careless, thorough, not lazy, not impulsive
Neuroticism vs. emotional stability: tense, irritable, not contented, shy, moody, not self-confident
Openness vs. closedness to experience: curious, imaginative, artistic, wide interests, excitable, unconventional
9. Linguistic Inquiry and Word Count (LIWC)
Categories: Examples
Past tense: walked, were, had
Negations: no, never, not
Swear words: *****
Friends: pal, buddy, coworker
Positive emotions: happy, pretty, good
Anger: hate, kill, pissed
Assent: agree, OK, yes
Nonfluencies: uh, rr*
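LIWC-style features are relative frequencies of category words in a text. The sketch below uses a toy lexicon built from the examples above (the real LIWC dictionary is proprietary and far larger), and the helper name `liwc_features` is ours, not part of the LIWC tool.

```python
# Toy lexicon from the slide's examples; the real LIWC dictionary is proprietary.
LEXICON = {
    "negations": {"no", "never", "not"},
    "assent": {"agree", "ok", "yes"},
    "positive_emotions": {"happy", "pretty", "good"},
}

def liwc_features(text, lexicon=LEXICON):
    """Relative frequency of each category's words in the text."""
    tokens = text.lower().split()
    n = len(tokens) or 1  # avoid division by zero on empty input
    return {
        cat: sum(t.strip(".,!?") in words for t in tokens) / n
        for cat, words in lexicon.items()
    }
```

Each feature is normalized by text length, so essays of different sizes become comparable.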
18. Friends Dataset
- Not domain-specific
- Simple language

Dataset: Essays | Friends | EAR
Source: written | observation | spoken
Structure: monologue | dialogue | dialogue
Report type: self-report | observation | self-report & observation
Number of words: 1.9 million | 556,273 | 97,468
Instances: 2,468 | 3,488 | 96
Words per instance: 651 | 161 | 1,015
19. Sub-scene Extraction Process
1. Use a window technique to find a main speaker's frequency distribution in each scene
2. Choose peaks in each frequency distribution
3. Use the peaks to find the index range of each sub-scene
4. Extract multiple sub-scenes from each scene, thus increasing our data size
5. Optimize window_size and min_conversation_length
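The steps above can be sketched as follows. The peak threshold (the target speaker occupying more than half the window) and the default parameter values are our own assumptions for illustration, not the authors' tuned settings.

```python
def speaker_frequency(scene, speaker, window_size):
    """Sliding-window count of the target speaker's lines in a scene,
    where a scene is a list of (speaker, utterance) pairs."""
    return [
        sum(1 for spk, _ in scene[i:i + window_size] if spk == speaker)
        for i in range(len(scene) - window_size + 1)
    ]

def find_peaks(freqs, threshold):
    """Indices that are local maxima above the threshold."""
    peaks = []
    for i, f in enumerate(freqs):
        if f <= threshold:
            continue
        left = freqs[i - 1] if i > 0 else -1
        right = freqs[i + 1] if i < len(freqs) - 1 else -1
        if f >= left and f > right:
            peaks.append(i)
    return peaks

def extract_subscenes(scene, speaker, window_size=4, min_conversation_length=3):
    """Extract the window around each frequency peak as a sub-scene,
    keeping only sub-scenes of sufficient length."""
    freqs = speaker_frequency(scene, speaker, window_size)
    peaks = find_peaks(freqs, threshold=window_size // 2)
    return [
        scene[p:p + window_size]
        for p in peaks
        if len(scene[p:p + window_size]) >= min_conversation_length
    ]
```

Tuning `window_size` and `min_conversation_length` trades off sub-scene count against how much of each sub-scene the target speaker actually dominates.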
20. Annotation through Crowdsourcing (online annotation)
- Extracted 8,738 sub-scenes from the 10-season Friends transcript
- Had 3,448 sub-scenes from the first 4 seasons annotated through the Amazon Mechanical Turk platform
23. Final Annotation
Steps:
1. Keep the initial annotations unchanged [-1, 1]
2. Add the three annotators' scores, producing 7 classes [-3, 3]
3. Classes -3 and 3 are too small
4. Merge -3 with -2, and 3 with 2, producing 5 classes
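The merging step amounts to summing the three scores and clipping the extremes; a minimal sketch (the function name is ours):

```python
def final_class(annotations):
    """Sum three annotator scores (each in {-1, 0, 1}) into [-3, 3],
    then merge -3 with -2 and 3 with 2, leaving 5 classes in [-2, 2]."""
    total = sum(annotations)
    return max(-2, min(2, total))
```

Clipping rather than relabeling keeps the class ordering intact, which matters if the labels are later treated as an ordinal scale.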
24. Three formats of Friends dataset
Original Conversation
  Ross: Hi, Rachel.
  Rachel: Hi Ross.
  Ross: I have a bad day.
  Rachel: Oh.
  Ross: How is your day?

Single
  Ross: Hi, Rachel.
  Ross: I have a bad day.
  Ross: How is your day?

Single+Context
  Ross: Hi, Rachel.
  Ross: I have a bad day.
  Ross: How is your day?
  Rachel: Hi Ross.
  Rachel: Oh.

Target
  #Targ# Ross: Hi, Rachel.
  #NonTarg# Rachel: Hi Ross.
  #Targ# Ross: I have a bad day.
  #NonTarg# Rachel: Oh.
  #Targ# Ross: How is your day?
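The three formats can be derived mechanically from the original conversation, modeled as a list of (speaker, utterance) pairs. A minimal sketch (the function names are ours):

```python
def to_single(conv, target):
    """Keep only the target speaker's lines."""
    return [f"{s}: {u}" for s, u in conv if s == target]

def to_single_context(conv, target):
    """Target speaker's lines first, then the other speakers' lines."""
    tgt = [f"{s}: {u}" for s, u in conv if s == target]
    ctx = [f"{s}: {u}" for s, u in conv if s != target]
    return tgt + ctx

def to_target(conv, target):
    """Keep the original order, marking each line #Targ# or #NonTarg#."""
    return [
        ("#Targ# " if s == target else "#NonTarg# ") + f"{s}: {u}"
        for s, u in conv
    ]
```

Only the Target format preserves turn order, which is the information an attention mechanism over the dialogue could exploit.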
31. 6. Conclusion
● A new Friends dataset is created, and it shows the challenges of annotating dialogue text data
● A novel approach to automatic personality prediction is presented
● A new benchmark is achieved on the Essays dataset
● All models fail to work on the Friends dataset, indicating that the annotations lack consistency
32. Future Works
● LIWC-integrated CNN/LSTM with attention mechanism
● A platform to support the human annotation process by providing multimodal information
● Use of the Big Five Inventory questionnaire