Naturalness is fundamental for creating conversational agents that are truly smart and can integrate seamlessly into the lives of human beings. To this end they must be endowed with a Persona.
3. Where are we heading to?
Some recent advances in Conversational Agents,
naturalness is always a key element:
1.⯠learn to memorize
2.⯠learn to personalize
3.⯠learn to empathize
4.⯠learn to express emotions
In all aspects of this area, we encounter the challenge
of a lack of well-balanced and well-labeled data.
Pascale Fung - invited talk at ACL 2019
4. Structure
We must build agents with a Persona, i.e. with a
specific, persistent, consistent interaction style.
it is NOT just a technical problem
5. What is Persona* about?
â˘âŻ Three concepts, not to be confused:
ââŻPersonality: the way we behave/react to the world
ââŻPersona: our personal story and preferences
ââŻPersonalization: the ability to respond to otherâs
preferences
6. From Persona* to dialogues
â˘âŻ All these elements have an impact on specific
communication aspects.
â˘âŻ The impact is not limited to chit-chat agents but
also to task-oriented ones.
â˘âŻ Some examples follow: a task oriented CA with
and without naturalness/human-like skills.
Guerini, M., Falcone, S., & Magnini, B. (2018). A Methodology for Evaluating Interaction Strategies of
Task-Oriented Conversational Agents. In Proceedings of EMNLP SCAI workshop, (pp. 24-32).
7. CH1: Hello Andrea! What did you eat for
your last meal?
User: I ate a plate of spaghetti with
tomato
CH1: How much spaghetti with tomato did
you eat?
User: Unfortunately I messed up, it was a
generous helping...
CH1: Your meal consisting of a generous
plate (200g) of spaghetti with tomato
corresponds to 30.85 g of carbohydrates. I
hope I have been helpful! See you soon!
CH1: Hello Andrea! What did you eat for
your last meal?
User: I ate a plate of spaghetti with
tomato
CH1: How much spaghetti with tomato did
you eat?
User: Unfortunately I messed up, it was a
generous helping...
CH1: Donât worry about it, everyone
messes up sometimes! Anyway, your meal
consisting of a generous plate (200g) of
spaghetti with tomato corresponds to
30.85 g of carbohydrates. I hope I have
been helpful! See you soon!
Empathy
10. CH1: Good morning Andrea. What did you
eat for your last meal?
User: I ate an awful chicken salad
CH1: I am sorry to hear that, how much
chicken salad did you eat?
User: 100g
CH1: Your meal consisting of 100g of
chicken salad corresponds to 2.85 g of
carbohydrates. Thank you and have a nice
day.
CH1: whassup Andrea? Shoot me your last
meal!
User: I ate an awful chicken salad
CH1: that sucks man! How much chicken
salad did you eat?
User: 100g
CH1: uh-huh, 100g of chicken salad
corresponds to 2.85 g of carbohydrates.
Catch yaâ later!
Formality Level
14. BIG FIVE â Personality model
â˘âŻ An integration of personality research that
represents the various personality descriptions in
one common framework.
â˘âŻ Individual differences in social and emotional life
organized into a five-factor model of personality
15. BIG FIVE â Personality model
Idea: we can measure personality using specific questionnaires.
17. OCEAN traits
â˘âŻ Openness
inventive/curious vs. consistent/cautious
â˘âŻ Conscientiousness
efficient/organized vs. easy-going/careless
â˘âŻ Extraversion
outgoing/energetic vs. solitary/reserved
â˘âŻ Agreeableness
friendly/compassionate vs. challenging/detached
â˘âŻ Neuroticism
sensitive/nervous vs. secure/confident
18. A personality Model
â˘âŻ In principle we can express a
personality with 5 simple
numbers (or labels if we
binarize the data)âŚ
â˘âŻ How to connect these numbers
with linguistic production?
â˘âŻ How to connect these numbers
with linguistic interaction?
19. Rule Based Approaches
â˘âŻ Rely on the psychological findings that correlate
personality traits and language production.
Parametrize them and vary utterance characteristics
according to these parameters. E.g.
Extroversion â Verbosity
â˘âŻ Personage (PERSONAlity GEnerator), which feed these
parameters to a standard NLG pipeline to affect from
content planning to surface realization.
Mairesse, F., & Walker, M. A. (2009). Can Conversational Agents Express Big Five Personality
Traits through Language?: Evaluating a Psychologically-Informed Language Generator.
20. The Open Vocabulary Approach
700 million words, phrases, and topic instances
collected from the Facebook messages of 75,000
volunteers, who took standard personality tests.
1.⯠Users on FB take a personality questionnaire
2.⯠Download all their posts
3.⯠Correlate words with personality
Schwartz, H. Andrew, et al. "Personality, gender, and age in the language of social media:
The open-vocabulary approach." PloS one 8.9 (2013): e73791.
21. The Open Vocabulary Approach
â˘âŻ Some words are more
associated with a
personality trait than
others.
22. GlobalTrait Approach
â˘âŻ A multilingual model to recognize Big Five Personality
traits from text data in four different languages.
â˘âŻ Words having a similar semantic meaning in different
languages do not necessarily correspond to the same
personality traits.
â˘âŻ GlobalTrait: alignment method such that words that
correlate positively to a trait are close together in the
multilingual vector space.
Siddique, F. B., Bertero, D., & Fung, P. (2019). GlobalTrait: Personality Alignment of Multilingual
Word Embeddings. Proceedings of AAAI Conference on Artificial Intelligence 33 - pp. 7015-7022.
25. What is a Persona?
â˘âŻ The character that a CA plays during dialogues.
â˘âŻ A persona is a composite of elements of identity:
ââŻbackground facts or user profile
ââŻlanguage behavior
ââŻinteraction style
â˘âŻ A persona is also adaptive: present different facets
to different interlocutors.
Li, J., Galley, M., Brockett, C., Spithourakis, G. P., Gao, J., & Dolan, B. (2016). A persona-based
neural conversation model. In proceedings of ACL 2016.
26. Persona and Neural Models
â˘âŻ How to integrate a Persona into a neural model?
â˘âŻ What data do we need?
â˘âŻ Where can we take the data to train such models?
28. Issues with simple models
Common issues with chit-chat models include:
â˘âŻ Lack of a consistent personality - typically trained
over many dialogs each with different speakers.
â˘âŻ Lack of an explicit long-term memory - typically
trained to produce an utterance given only the
recent dialogue history
â˘âŻ Tendency to produce non-specific answers like âI
donât knowâ
31. Data annotated with Speaker ID.
â˘âŻ Use an open-domain corpus of Twitter
conversations
â˘âŻ dialog datasets comprising TV series scripts show
that leveraging persona
â˘âŻ Each user/character is a persona or
â˘âŻ Cluster characters by demographics
33. Flat use of the data (-)
Jack: Hi Steve. What are you doing here?
Steve: I'm meeting a friend here for dinner. How about you?
Jack: I'm on my way home but I needed to stop by the book store to buy a
text book.
Steve: Didn't you finish school yet?
Jack: I have one more year, and then I'm done.
Steve: What are you majoring in?
Jack: I'm majoring in Sociology
Steve: How do you like your major?
Jack: I really find the subject very interesting. I'm enjoying all my classes.
Steve: That's great.
Jack: I'd better go now. I don't want to miss the bus.
Steve: Alright, I'll talk to you later.
Jack: Okay. See you later.
34. Character based annotation (+)
Jack: Hi Steve. What are you doing here?
Steve: I'm meeting a friend here for dinner. How about you?
Jack: I'm on my way home but I needed to stop by the book store to buy a
text book.
Steve: Didn't you finish school yet?
Jack: I have one more year, and then I'm done.
Steve: What are you majoring in?
Jack: I'm majoring in Sociology
Steve: How do you like your major?
Jack: I really find the subject very interesting. I'm enjoying all my classes.
Steve: That's great.
Jack: I'd better go now. I don't want to miss the bus.
Steve: Alright, I'll talk to you later.
Jack: Okay. See you later.
35. Learn one Model for each Persona
Jack: Hi Steve. What are you doing here?
Steve: I'm meeting a friend here for dinner. How
about you?
Jack: I'm on my way home but I needed to stop by
the book store to buy a text book.
Steve: Didn't you finish school yet?
Jack: I have one more year, and then I'm done.
Steve: What are you majoring in?
Jack: I'm majoring in Sociology
Jack: Hi Steve. What are you doing here?
Steve: I'm meeting a friend here for dinner. How
about you?
Jack: I'm on my way home but I needed to stop by
the book store to buy a text book.
Steve: Didn't you finish school yet?
Jack: I have one more year, and then I'm done.
Steve: What are you majoring in?
Jack: I'm majoring in Sociology
Jack: I'm majoring in Sociology
Steve: What are you majoring in?
36. Learn all Persona in one Model
Steve: What are you majoring in?
Jack: Hi Steve. What are you doing here?
Steve: I'm meeting a friend here for dinner. How
about you?
Jack: I'm on my way home but I needed to stop
by the book store to buy a text book.
Steve: Didn't you finish school yet?
Jack: I have one more year, and then I'm done.
Steve: What are you majoring in?
Jack: I'm majoring in Sociology
STEVE | JACK
(i) Personae encoded in one hot vector
37. Learn all Persona in one Model
Steve: What are you majoring in?
Jack: Hi Steve. What are you doing here?
Steve: I'm meeting a friend here for dinner. How
about you?
Jack: I'm on my way home but I needed to stop
by the book store to buy a text book.
Steve: Didn't you finish school yet?
Jack: I have one more year, and then I'm done.
Steve: What are you majoring in?
Jack: I'm majoring in Sociology
< 0.5 , 0.3 , -0.7 >
(ii) Use a Personality Embedding
42. MN for storing dialogue
To correctly predict what is the next move in a
dialogue the system must look up its memory and
select the relevant turn (not necessarily the last).
43. MN for storing dialogue
To correctly predict what is the next move in a
dialogue the system must look up its memory and
select the relevant turn (not necessarily the last).
44. MN for Modeling Persona
Represent persona as a set of statements. Store them in MM. Collect
crowdsourced dialogues providing persona profiles.
Persona 1 Persona 2
I like to ski I am an artist
My wife does not like me anymore I have four children
I have went to Mexico 4 times this year I recently got a cat
I hate Mexican food I enjoy walking for exercise
I like to eat cheetos I love watching Game of Thrones
[PERSON 1:] Hi
[PERSON 2:] Hello ! How are you today ?
[PERSON 1:] I am good thank you , how are you.
[PERSON 2:] Great, thanks ! My children and I were just about to watch Game of Thrones.
[PERSON 1:] Nice ! How old are your children?
[PERSON 2:] I have four that range in age from 10 to 21. You?
[PERSON 1:] I do not have children at the moment.
[PERSON 2:] That just means you get to keep all the popcorn for yourself.
[PERSON 1:] And Cheetos at the moment!
[PERSON 2:] Good choice. Do you watch Game of Thrones?
[PERSON 1:] No, I do not have much time for TV.
[PERSON 2:] I usually spend my time painting: but, I love the show.
Table 2: Example dialog from the PERSONA-CHAT dataset. Person 1 is given their own persona (top
left) at the beginning of the chat, but does not know the persona of Person 2, and vice-versa. They
have to get to know each other during the conversation.
Zhang, S., Dinan, E., Urbanek, J., Szlam, A., Kiela, D., & Weston, J. (2018). Personalizing Dialogue
Agents: I have a dog, do you have pets too? Proceedings of ACL.
45. MN for Modeling Persona
Represent persona as a set of statements. Store them in MM. Collect
crowdsourced dialogues providing persona profiles.
Persona 1 Persona 2
I like to ski I am an artist
My wife does not like me anymore I have four children
I have went to Mexico 4 times this year I recently got a cat
I hate Mexican food I enjoy walking for exercise
I like to eat cheetos I love watching Game of Thrones
[PERSON 1:] Hi
[PERSON 2:] Hello ! How are you today ?
[PERSON 1:] I am good thank you , how are you.
[PERSON 2:] Great, thanks ! My children and I were just about to watch Game of Thrones.
[PERSON 1:] Nice ! How old are your children?
[PERSON 2:] I have four that range in age from 10 to 21. You?
[PERSON 1:] I do not have children at the moment.
[PERSON 2:] That just means you get to keep all the popcorn for yourself.
[PERSON 1:] And Cheetos at the moment!
[PERSON 2:] Good choice. Do you watch Game of Thrones?
[PERSON 1:] No, I do not have much time for TV.
[PERSON 2:] I usually spend my time painting: but, I love the show.
Table 2: Example dialog from the PERSONA-CHAT dataset. Person 1 is given their own persona (top
left) at the beginning of the chat, but does not know the persona of Person 2, and vice-versa. They
have to get to know each other during the conversation.
Zhang, S., Dinan, E., Urbanek, J., Szlam, A., Kiela, D., & Weston, J. (2018). Personalizing Dialogue
Agents: I have a dog, do you have pets too? Proceedings of ACL.
49. Synthetic Persona-Based Dialogues
Actual simulation of a dialogue with an elderly and a
young customer starting from the same outline.
Hi! Hi!
Greeting sir, what may I assist you with? Hey dude, whatâs up?
Iâd like to book a table for six in an
expensive restaurant that serves british
food
Iâd like to book a table for six in an
expensive restaurant that serves british
food
Iâll start the request now. May I know
where the restaurant should be located?
Iâm on it! Where should it be?
London London
Excellent sir, please give me a moment to
provide you with options
Ok, looking for options!
50. Evolution of Dialogue Architectures to fit Persona
From single memory
network to split
memory network.
Conversation History
and persona facts are
stored in different
memories.
53. Standardized evaluation - still an open issue
Single Components Whole Interaction
SYSTEMS
Grammaticality; Slots correctly
realized; Slots correctly recognized;
etc.
Success rate
SIMULATION - Wizard of Oz
additional possible dimensions: manual vs automatic, qualitative vs quantitative, ...
54. Standardized evaluation - still an open issue
Single Components Whole Interaction
SYSTEMS
Grammaticality; Slots correctly
realized; Slots correctly recognized;
etc.
Success rate
SIMULATION - Wizard of Oz
Evaluating human-like interaction capabilities
56. Wizard of OZ
â˘âŻ Time-consuming interactions to get the evaluation;
â˘âŻ Might be biased by confounding variables;
â˘âŻ Significant training: providing behavior instructions' to wizard and
possibly users.
â˘âŻ These 'behavior instructions' cannot describe every single reaction.
â˘âŻ Difficult for wizards to provide consistent responses across sessions;
â˘âŻ ...
59. Experimental Setting
We served experimental and
control condition together (within
subject design) with short
instructions on the task.
Forced decision between
alternatives.
CrowdFlower
Instructions
Question 1
Question 2
Question 3
Question 4
Question 5
B A
66. Take Away result: Different users prefer different
CAâs personality
An analysis at the gender level, revealed that
some personalities are preferred more according
to user gender.
Male Female
Linguistic
Alignment
0.61 0.68
Facing 0.76 0.64