SlideShare a Scribd company logo
From Naive Physics to Connotation:
Modeling Commonsense
in Frame Semantics
Yejin Choi
Paul G. Allen School of
Computer Science & Engineering
Intelligent Communication
è reading between the lines
Intelligent Communication
è Reading between the lines
Understanding
what is said
+
what is not said
“CHEESEBURGER STABBING”
• Someone stabbed a cheeseburger?
• A cheeseburger stabbed someone?
• A cheeseburger stabbed another cheeseburger?
• Someone stabbed someone else with a cheeseburger?
• Someone stabbed someone else over a cheeseburger?
Intelligent Communication
è Reading between the lines
Understanding
what is said
+
what is not said
“CHEESEBURGER STABBING”
• Someone stabbed a cheeseburger?
• A cheeseburger stabbed someone?
• A cheeseburger stabbed another cheeseburger?
• Someone stabbed someone else with a cheeseburger?
• Someone stabbed someone else over a cheeseburger?
Knowledge about the world
to fill in the gap!
Blueberry Muffins
Ingredients
1 cup milk
1 egg
1/3 cup vegetable oil
2 cups all-purpose flour
2 teaspoons baking powder
1/2 cup white sugar
1/2 cup fresh blueberries
Procedure
1. Preheat oven to 400 degrees F. Line a 12-cup muffin tin with paper liners.
2. In a large bowl, stir together milk, egg, and oil. Add flour, baking powder, sugar, and
blueberries; gently mix the batter with only a few strokes. Spoon batter into cups.
3. Bake for 30 minutes. Serve hot.
http://allrecipes.com/Recipe/Blueberry-Muffins-I/
Intelligent Communication
Bake what? where?
Knowledge about the world
to fill in the gap!
Encyclopedic knowledge
– Who is the president of which
country and born in what year…
Commonsense knowledge
– It’s not possible to stab someone
using a cheeseburger
– Stabbing a cheeseburger is not
newsworthy…
– Stabbing someone is generally
immoral
– How to make a cheeseburger
Types of Knowledge
Information Extraction
Naïve Physics
Social Norms
Procedural (How-to)
Knowledge
Plan
1. Commonsense Frame Semantics
– Naive physics
2. Modeling the World, not just Language
Verb Physics
relative physical knowledge about actions and objects
Maxwell	Forbes	et	al.	(ACL	2017)
Winograd Schema Challenge
• The trophy would not fit in the brown
suitcase because it was too big (small).
What was too big (small)?
Answer 0: the trophy
Answer 1: the suitcase
What	are	the	pre- and	
post-conditions	
of	the	action	
“to	put X into Y”?
Physical Knowledge about the World
“Hey,	Robot,	Fetch	me	a	container to	put this	pie.”	
If	I	drop this	styrofoam	ball	
into the	steel	table,	
will	either	break?
Support from Psychology Studies
• A familiar-size stroop effect. (Konkle, T., and Oliva, A. 2012.)
• Knowledge on size represented in a perceptual or analog format
(Moyer, 1973; Paivio, 1975; Rubinsten & Henik, 2002).
• Objects have a canonical visual size (Konkle & Oliva, 2011; Linsen,
Leyssen, Sammartino, & Palmer, 2011).
congruent incongruent
Overcoming Reporting Bias
• People don’t state the obvious
– (Van Durme 2010, Sorower et al., 2011)
“I am larger than a chair”
“I am larger than a chair”
“I am larger than a chair”
“I am larger than a ball”
“I am larger than a stone”
“I am larger than a pen”
“I am larger than a towel”
“The horse was
as small as
a dog!”
⟹ horse =size dog ?
• Language – absolute
estimation
• “car is * x * m”
• “person is * m tall”
• Vision – relative
estimation
area(box1)
area(box2 )
×
depth(box1)2
depth(box2 )2
size(Oi )
size(Oj )
=
Are Elephants Bigger than Butterflies?
(Bagherinezhad et al. @ AAAI 2016)
Overcoming Reporting Bias
• People don’t state the obvious
– Elephants are bigger than butterflies
• Thoughts last year
– Look beyond language, such as vision!
– (AAAI 2016, ACL 2016, ICCV 2015)
– Can’t learn diverse physical knowledge (weight,
strength, rigidness, strength…)
• Thoughts this year
– Back to language, but with a different game plan!
Key Insight
“I am larger than a chair”
“I am larger than a ball”
“I am larger than a stone”
“I am larger than a pen”
“I am larger than a towel”
“I threw the _____”
pen
stone
chair
ball
towel
x threw y
x is bigger than y
x weighs more than y
as a result, y will be moving faster than x
Let’s solve two related problems
Physical properties implied by predicates
“I	picked up the	<unk>.”
“The	<unk>	shattered when	it	hit	
the	ground
Physical properties of objects
“I	dropped a	cherry	into
the	<unk>.”
Along Five Dimensions
x >size y
x >weight y
x <rigidness y
x >strength y
x <speed y
Learning Knowledge from Language
Using IE Patterns
(Gordon et al., 2010, Gordon and Schubert, 2012, Narisawa
et al. 2013, Tandon et al. 2014)
Narrative Schemas, Script knowledge
(Chambers and Jurafsky. 2009, Pichotta and Mooney 2014,
2016)
Inferring Knowledge through Reasoning
Inverting Grice’s maxims
Mohammad S. Sorrower, et al., 2011
Natural logic, natural reasoning, and entailment
MacCartney and Manning, 2007, Angeli and Manning 2013,
2014, Bowman et al., 2015
Representation: VerbPhysics
Representation: VerbPhysics Frames
action
theme
agent
tool
goal
…
x threw y
⇒ x is larger than y
⇒ x is heavier than y
⇒ x is slower than y
R(agent,	theme)
Representation: VerbPhysics Frames
action
theme
agent
tool
goal
…
“He threw the ball”
“He threw the ball”
x threw y
⇒ x is larger than y
⇒ x is heavier than y
⇒ x is slower than y
“We walked into the house”
x walked into y
⇒ x is smaller than y
⇒ x is lighter than y
⇒ x is faster than y
R(agent,	theme)
Representation: VerbPhysics Frames
action
theme
agent
tool
goal
…
R(agent,	goal)
“He threw the ball”
x threw y
⇒ x is larger than y
⇒ x is heavier than y
⇒ x is slower than y
“We walked into the house”
x walked into y
⇒ x is smaller than y
⇒ x is lighter than y
⇒ x is faster than y
“I squashed the bug with my boot”
squashed x with y
⇒ x is smaller than y
⇒ x is lighter than y
⇒ x is weaker than y
⇒ x is less rigid than y
⇒ x is slower than y
R(agent,	theme) R(agent,	goal) R(theme,	goal)
Representation: VerbPhysics Frames
action
theme
agent
tool
goal
…
Model
…
…
…
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
object similarity
attribute similarity
selectional preference
group of RVs
Each variable represents an entailment knowledge
“If X throws Y, then X > Y in terms of size”
Or
“If X throws Y, then Z in size” where Z = {>, <, =}
(Using Levin action verbs (Levin, 1993))
random variable (RV)
group of RVs
Each variable represents an entailment knowledge
“If X throws Y, then X > Y in terms of size”
Or
“If X throws Y, then Z in size” where Z = {>, <, =}
(Using Levin action verbs (Levin, 1993))
~ 8 frames per predicate
x threw y
PERSON threw x into y
PERSON threw x in y
PERSON threw x on y
PERSON threw x
PERSON threw out x
…
x threw out y
“threw”
random variable (RV)
group of RVs
factor connects RV
frame similarity
Each variable represents an entailment knowledge
“If X throws Y, then X > Y in terms of size”
Or
“If X throws Y, then Z in size” where Z = {>, <, =}
(Using Levin action verbs (Levin, 1993))
~ 8 frames per predicate
x threw y
PERSON threw x into y
PERSON threw x in y
PERSON threw x on y
PERSON threw x
PERSON threw out x
…
x threw out y
“threw”
random variable (RV)
group of RVs
factor connects RV
frame similarity
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
Each variable represents relative object knowledge
“p is generally bigger than q in terms of size”
Or
“p Z q in size” where Z = {>, <, =}
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
Each variable represents relative object knowledge
“p is generally bigger than q in terms of size”
Or
“p Z q in size” where Z = {>, <, =}
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
object similarity
Each variable represents relative object knowledge
“p is generally bigger than q in terms of size”
Or
“p Z q in size” where Z = {>, <, =}
Each variable represents an entailment knowledge
“If X throws Y, then X > Y in terms of size”
Or
“If X throws Y, then Z in size” where Z = {>, <, =}
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
object similarity
selectional preference
Each variable represents relative object knowledge
“p is generally bigger than q in terms of size”
Or
“p Z q in size” where Z = {>, <, =}
Each variable represents an entailment knowledge
“If X throws Y, then X > Y in terms of size”
Or
“If X throws Y, then Z in size” where Z = {>, <, =}
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
object similarity
selectional preference
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
object similarity
attribute similarity
selectional preference
…
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
object similarity
attribute similarity
selectional preference
…
…
…
random variable (RV)
group of RVs
factor connects RV
factors connect
subset of RVs
verb similarity
frame similarity
object similarity
attribute similarity
selectional preference
Loopy	belief	propagation
Log-linear	factors	trained	using	embeddings
GloVe
One-hot	frame	type
0 00 0
0
1
1 1
0.75
0.70
0
0.2
0.4
0.6
0.8
ACCURACY(TEST)
FRAMES OBJECTS
To Conclude
• Reverse Engineering Commonsense Knowledge
• By solving both puzzles simultaneously:
– Learn VerbPhysics frames
– Learn object – object relations
• w.r.t. relative physical knowledge over 5
dimensions:
– Size, weight, strength, flexibility, speed
èJust from text, without embodiment
è Vision can’t do weight, strength, flexibility …
Zero-shot Activity Recognition with
Verb Attribute Induction
Rowan	Zellers	et	al.	@	EMNLP	2017
Zero-shot Activity Recognition
Verb Attribute Induction
Plan
1. Commonsense Frame Semantics
– Naive physics
– Social commonsense
2. Modeling the World, not just Language
Connotation Frames
Hannah	Rashkin et	al.	(ACL	2016)
How Language Describes the World
“All I know is what I read in the papers.”
– Will Rogers
How Language Describes the World
(per pro-life)
Supreme
court
Texas
Law
“endanger”
Women
abortion
“protect
lives and
health”
“refused to hear”
“blocking regulations”
“protect”
“violate”
clinic
“close,
require”
An Alternative World View
(per pro-choice)
Supreme
court
Texas
Law
“strike down”
Women
abortion
“impose”
“seeks,
obtains”
Clinics
“violate,
cause”
“make choices,
has a constitutional right”
“shutdown,
force,
threaten”
Prior Literature
• A ton of work on sentiment analysis
- Especially on product reviews and movie reviews
- (Pang and Lee 2009 for survey, Socher et al 2013)
• Recent work on implied sentiment
– (Greene and Resnik 2009 , Feng et al 2013; Choi and Wiebe 2014;
Mohammad and Turney 2010; Deng and Wiebe 2014)
Supreme
court
Texas
Law
“strike down”
Women
“impose”
“criticize” frame
per left-leaning sources:
Obama criticize someone?
or
Obama is critized by someone?
(Based on the Stream Corpus (2014) w.r.t. 30 news sources with known liberal / conservative leanings)
Connotative Meaning of Words
• Semantic prosody (Sinclair 1991, Louw 1993)
“______ caused ______”
• Wittgenstein’s view:
– the meaning of a word is its use in the language
• Connotation arises from the (typical) context
in which words are used
• Can we encode connotation in frame
semantics?
headache, cancer, …
appreciation, award?
Texas
Law “cause”
“violate” frame
agent theme
writer
reader
• Perspective (x->y) : directed sentiment
“violate” frame
agent theme
writer
reader
=
=
=
• Perspective (x->y) : directed sentiment
• Effect (x) : whether the event is good for (or bad for) x
• State (x) : the private state of x as a result of the event
• Value (x) : the intrinsic value of x
Connotation judgments are not always categorical, some are better to be modeled as
distributions. (analogously to de Marneffe et al. 2012 on veridicality assessment)
“criticize” frame
per left-leaning sources:
Obama criticize someone?
or
Obama is critized by someone?
(Based on the Stream Corpus (2014) w.r.t. 30 news sources with known liberal / conservative leanings)
Left/right leaning sources
Verb Role Left-leaning sources Right-leaning sources
accuse subject [-] Putin, Progressives, Limbaugh,
Gingrich
activist, U.S., protestor, Chavez
object [+] Official, rival, administration,
leader
Romney, Iran, Gingrich, regime,
opposition
attack subject [-] McCain, Trump, Limbaugh Obama, campaign, Biden, Israel
object [+] Gingrich, Obama, policy citizen, Zimmerman
criticize subject [-] Ugandans, rival, Romney, Tyson Britain, passage, Obama,
Maddow
object [+] sensation, Obama, Allen,
Cameron, Congress
threat, Pelosi, Romey, GOP,
Republicans
suffer subject
[+]
woman, victim, officer, father Christie, Rangers, people, man
object [-] attack, blow, burn, seizure,
casualty
injury, fool, loss, concussion
(Based on the Stream Corpus (2014) w.r.t. 30 news sources with known liberal / conservative leanings)
Connotation Frames
1. Analysis on large-scale news corpora
2. Crowdsourcing experiments
15 annotators answered 16 questions for each of the top 1000 verbs from the NY times corpus.
Average Krippendorff’s alpha is 0.25 and 2-way soft agreement is 95%.
3. Computational models
Writer: The judge upheld the verdict.
Questions: How the judge feels about the verdict:
Negative PositiveNeutral
Negative or
Neutral
Positive or
Neutral
Computational Models
Base Model:
Embedding to Connotation Frames
word embedding for verb v
(Levy and Goldberg 2014)
…
Log-linear
parameterization
• Frame lexicon induction
• Separate frame prediction for each predicate
Connotation Dynamics
agent theme
writer
1. Perspective Triads:
~ social balance theory (Cartwright 1956)
is positive towards xj, and xj is positive towards xk, then we expect xi is al
mics hold for the negative case.
Pw!xi = ¬ (Pw!xj Pxi!xj )
ate has a positive effect on one argument, then we expect that the interactio
e. Similar dynamics hold for the negative case.
Exi = Pxj !xi
xi is presupposed as valuable, then we expect that the writer also views x
or the negative case.
Vxi = Pw!xi
icate has a positive effect on an argument xi, then we expect that xi will gain
mics hold for the negative case.
Sxi = Exi
ped Relations: we propose models that parameterize these dynamics using lo
Connotation Dynamics
agent theme
writer
1. Perspective Triads:
~ social balance theory (Cartwright 1956)
2. Effect – Mental State:
is positive towards xj, and xj is positive towards xk, then we expect xi is al
mics hold for the negative case.
Pw!xi = ¬ (Pw!xj Pxi!xj )
ate has a positive effect on one argument, then we expect that the interactio
e. Similar dynamics hold for the negative case.
Exi = Pxj !xi
xi is presupposed as valuable, then we expect that the writer also views x
or the negative case.
Vxi = Pw!xi
icate has a positive effect on an argument xi, then we expect that xi will gain
mics hold for the negative case.
Sxi = Exi
ped Relations: we propose models that parameterize these dynamics using lo
= ¬ (Pw!xj Pxi!xj )
sitive effect on one argument, then we expect that the interaction
ynamics hold for the negative case.
Exi = Pxj !xi
pposed as valuable, then we expect that the writer also views xi
ive case.
Vxi = Pw!xi
ositive effect on an argument xi, then we expect that xi will gain a
or the negative case.
Sxi = Exi
ns: we propose models that parameterize these dynamics using log-linea
ously unseen verbs. For each verb predicate, we in
fer a connotation frame composed of 9 relationshi
aspects that represent: perspective {P(w ! t)
P(w ! a), P(a ! t)}, effect {E(t), E(a)}, valu
Connotation Dynamics
agent theme
writer
1. Perspective Triads:
~ social balance theory (Cartwright 1956)
2. Effect – Mental State:
3. Effect – Perspective:
is positive towards xj, and xj is positive towards xk, then we expect xi is al
mics hold for the negative case.
Pw!xi = ¬ (Pw!xj Pxi!xj )
ate has a positive effect on one argument, then we expect that the interactio
e. Similar dynamics hold for the negative case.
Exi = Pxj !xi
xi is presupposed as valuable, then we expect that the writer also views x
or the negative case.
Vxi = Pw!xi
icate has a positive effect on an argument xi, then we expect that xi will gain
mics hold for the negative case.
Sxi = Exi
ped Relations: we propose models that parameterize these dynamics using lo
= ¬ (Pw!xj Pxi!xj )
sitive effect on one argument, then we expect that the interaction
ynamics hold for the negative case.
Exi = Pxj !xi
pposed as valuable, then we expect that the writer also views xi
ive case.
Vxi = Pw!xi
ositive effect on an argument xi, then we expect that xi will gain a
or the negative case.
Sxi = Exi
ns: we propose models that parameterize these dynamics using log-linea
ously unseen verbs. For each verb predicate, we in
fer a connotation frame composed of 9 relationshi
aspects that represent: perspective {P(w ! t)
P(w ! a), P(a ! t)}, effect {E(t), E(a)}, valu
e towards xj, and xj is positive towards xk, then we expect xi is also
for the negative case.
!xi = ¬ (Pw!xj Pxi!xj )
positive effect on one argument, then we expect that the interaction
r dynamics hold for the negative case.
Exi = Pxj !xi
esupposed as valuable, then we expect that the writer also views xi
gative case.
Vxi = Pw!xi
a positive effect on an argument xi, then we expect that xi will gain a
for the negative case.
Sxi = Exi
Connotation Dynamics
agent theme
writer
1. Perspective Triads:
~ social balance theory (Cartwright 1956)
2. Effect – Mental State:
3. Effect – Perspective:
4. Value – Perspective:
is positive towards xj, and xj is positive towards xk, then we expect xi is al
mics hold for the negative case.
Pw!xi = ¬ (Pw!xj Pxi!xj )
ate has a positive effect on one argument, then we expect that the interactio
e. Similar dynamics hold for the negative case.
Exi = Pxj !xi
xi is presupposed as valuable, then we expect that the writer also views x
or the negative case.
Vxi = Pw!xi
icate has a positive effect on an argument xi, then we expect that xi will gain
mics hold for the negative case.
Sxi = Exi
ped Relations: we propose models that parameterize these dynamics using lo
= ¬ (Pw!xj Pxi!xj )
sitive effect on one argument, then we expect that the interaction
ynamics hold for the negative case.
Exi = Pxj !xi
pposed as valuable, then we expect that the writer also views xi
ive case.
Vxi = Pw!xi
ositive effect on an argument xi, then we expect that xi will gain a
or the negative case.
Sxi = Exi
ns: we propose models that parameterize these dynamics using log-linea
ously unseen verbs. For each verb predicate, we in
fer a connotation frame composed of 9 relationshi
aspects that represent: perspective {P(w ! t)
P(w ! a), P(a ! t)}, effect {E(t), E(a)}, valu
e towards xj, and xj is positive towards xk, then we expect xi is also
for the negative case.
xi = ¬ (Pw!xj Pxi!xj )
positive effect on one argument, then we expect that the interaction
dynamics hold for the negative case.
Exi = Pxj !xi
supposed as valuable, then we expect that the writer also views xi
gative case.
Vxi = Pw!xi
a positive effect on an argument xi, then we expect that xi will gain a
for the negative case.
Sxi = Exi
e towards xj, and xj is positive towards xk, then we expect xi is also
for the negative case.
!xi = ¬ (Pw!xj Pxi!xj )
positive effect on one argument, then we expect that the interaction
r dynamics hold for the negative case.
Exi = Pxj !xi
esupposed as valuable, then we expect that the writer also views xi
gative case.
Vxi = Pw!xi
a positive effect on an argument xi, then we expect that xi will gain a
for the negative case.
Sxi = Exi
Marginal Probability:
Factor Graph Model
Factor potentials:
Inference: Message passing:
58.81
63.71
67.93 68.26
55
58.75
62.5
66.25
70
Accuracy
41.46
47.30
53.17 53.50
35
41.25
47.5
53.75
60
F1 Scores
Experimental Results
train/dev/test sets of 300 predicates each, 3-way classification
Power and Agency Frames
in Modern Films
Maarten	Sap	et	al.	(EMNLP	2017	in	submission)
Bias in Modern Films
• Not a news per say, but more
insights from recent studies
((https://www.google.com/intl/en/about/main/gende
r-equality-films/)
• Bechdel test:
– At least two women
– talk to each other
– about something other than a man.
• Agarwal et al. 2015 report 40% of
the films pass the Bechdel test.
• However, those that pass the test
still have problems…
A character in Dykes to Watch Out For explains
the rules that later came to be known as the
Bechdel test (1985)
Character Portrayal in Films
“… Elsa objects, unleashing her powers…
Elsa discards her crown and creates a
palace… Elsa […] freezes Anna … She then
summons… Elsa ends the winter”
“… accidentally injures Anna … Hans
proposes to Anna … She gets lost …
chases Anna … Anna spots Hans … He
locks Anna in a room…”
“Cinderella lives an unhappy life… agrees to let
Cinderella go… he sees Cinderella… Cinderella
hears the clock… allowing the mice to free
Cinderella… Cinderella appears…”
Power and Agency Frame
agent theme
writer
readerAgency (x) Agency (x)
Power(x,y)
• Agency (x)
– Powerful, decisive, capable of pushing forward their own storyline
– “waiting” v.s. “searching” for your prince.
• Power (x,y)
– Differential authority level between the verb agent and theme
– “demanding” v.s. “begging” for mercy
Frame Verbs
Agency (+) terminate, silence, sue, command, invade,
throw, organize, authorize, persuade, battle
Agency (-) shimmer, remain, consist of, border, plummet,
age, whimper, inherit, recede, resemble
Power (agent) squash, seduce, assign, order, suspend,
regulate,
possess, eject, prohibit, ignite, destroy
Power (theme) obey, dread, ask, worship, belong to, trust,
bow to, serve, implore, require, salute
Collected using Mturk, annotated ~2000 verbs
Power and Agency Frame
Power and agency connotation frames run on the Wikipedia
plot summaries of Frozen (2013) and Cinderella (1950)
Bias in Modern Films
• Male characters tend to drive
the plot more through high
agency verbs
• Connotation frames provide
more nuanced analysis
compared to existing tests
(Bechdel, Mako Mori, Furiosa,
Sexy Lamp test)
Frame Gender
association
Agency (+) M**
Agency (-) F**
Power (agent) M**
Power (theme) n.s.
Gender association with connotation frames using a
logistic regression, controlling for number of words. **:
p<0.001 (Holm corrected)
772 scripts (Gorinski & Lapata, 2015), 21K characters with automatically extracted gender
To Conclude
• Aspects of connotation can be packed into
lexical and frame semantics
agent theme
writer
readerAgency
(x)
Agency
(x)
Power(x,
y)
NLP for Social Good!
• Identifying unwanted biases in language
– (Sap et al. @ EMNLP 2017!)
• graded deception detection in media
– www.politifact.org
– (Rashkin et al. @ EMNLP 2017!)
Game Plan
1. Commonsense Frame Semantics
– Naive physics
– Social commonsense
2. Modeling the World, not just Language
– Dynamic entities
– Action causalities
one vector
You can't cram the meaning of
a whole %&!$# sentence
into a single $&!#* vector!
a sequence of vectors?
by
byoutput
output
token
<S> token
tokendecode
decode
byinput tokenencode token
Neural Checklist Models
Chloe	Kiddon et	al.	(EMNLP	2016)
Task Definition
𝑓
• Input:
– Goal: an (optional) title phrase
– Agenda: a list of items to talk about
• Output: long text with many sentences
Blueberry Muffins
Ingredients
1 cup milk
1 egg
1/3 cup vegetable oil
2 cups all-purpose flour
2 teaspoons baking powder
1/2 cup white sugar
1/2 cup fresh blueberries
Procedure
1. Preheat oven to 400 degrees F. Line a 12-cup muffin tin with paper liners.
2. In a large bowl, stir together milk, egg, and oil. Add flour, baking powder, sugar, and
blueberries; gently mix the batter with only a few strokes. Spoon batter into cups.
3. Bake for 20 minutes. Serve hot.
http://allrecipes.com/Recipe/Blueberry-Muffins-I/
Cooking Instructions
How to generate recipes
planning	models
interpretation generation
abstraction planning
recipe	plan
recipe	text
Is it possible to simplify this?
Document-level Graph Parsing
(Kiddon et al., 2015)
Blueberry Muffins
Ingredients
1 cup milk
1 egg
1/3 cup vegetable oil
2 cups all-purpose flour
2 teaspoons baking powder
1/2 cup white sugar
1/2 cup fresh blueberries
Procedure
1. Preheat oven to 400 degrees F (205 degrees C). Line
a 12-cup muffin tin with paper liners.
2. In a large bowl, stir together milk, egg, and oil. Add
flour, baking powder, sugar, and blueberries; gently mix
the batter with only a few strokes. Spoon batter into
cups.
3. Bake for 20 minutes. Serve hot.
1. Predicate argument
structure with implicit
arguments (~ SRL)
2. How entities are
introduced and then flow
through different actions
(~ reference resolution)
3. Discourse-level parsing
Recipe generation as machine
translation?
• Encoder-decoder NNs work great for
machine translation (Cho et al.2014, Sutskever et
al. 2014)
by
byoutput
output
token
<S> token
tokendecode
decode
byinput tokenencode token
• We can translate a
recipe title into
the text
Encode title - decode recipe
Cut each sandwich in halves.
Sandwiches with sandwiches.
Sandwiches, sandwiches, Sandwiches,
sandwiches, sandwiches
sandwiches, sandwiches, sandwiches,
sandwiches, sandwiches, sandwiches, or
sandwiches or triangles, a griddle, each
sandwich.
Top each with a slice of cheese, tomato,
and cheese.
Top with remaining cheese mixture.
Top with remaining cheese.
Broil until tops are bubbly and cheese is
melted, about 5 minutes.
sausage sandwiches
Recipe generation vs machine
translation
by
byrecipe
recipe
token
<S> token
tokendecode
decode
recipe title
ingredient 1
ingredient 2
ingredient 3
ingredient 4
Two input sources
• Only ~6-10% words align
between input and output.
• The rest must be generated
from context (and implicit
knowledge about cooking)
• Contextual switch between
two different input sources
Challenges
1. Modeling global coherence and coverage
– May not be possible to compress everything on to
one hidden vector!
2. Need a mechanism to track an “agenda”
-- which ingredients have been already used
-- what need to be yet introduced
Neural checklist model
Let’s make salsa!
Garlic tomato salsa
tomatoes
onions
garlic
salt
Neural checklist model
LM
Chop
<S>
hidden state classifier:
non-ingredient
new ingredient
used ingredient
which ingredients
are still available
new hidden state
tomato salsagarlic
Neural checklist model
tomatoes
tomatoesChop
Chop
the
the<S>
0.85
0.10
0.04
0.01
.
non-
ingredient
new
ingredient
✓
Neural checklist model
onions
onionsDice
Dice
the
the
.
.
0.00
0.94
0.03
0.01
✓✓ ✓
Neural checklist model
tomatoes
tomatoesAdd
Add
to
to
.
.
0.94
0.04
0.01
0.01
used
ingredient
✓✓ ✓✓
Checklist is probabilistic
tomatoes
tomatoesAdd
Add
to
to
.
.
0.90
0.08
0.01
0.01
used
ingredient
0.85 1.001.000.85
0.85
1.00
0.020.02
0.02
0.040.04
0.04
= new ingredient prob. distribution
Hidden state classifier is soft
tomatoes
tomatoes
Add
Add
to
to
.
.
0.90
0.08
0.01
0.01
0.85 1.001.000.85
0.85
1.00
0.020.02
0.02
0.040.04
0.04
0.00
0.00
0.50
0.50
0.85
1.00
0.02
0.04
0.940.05
0.01
Neural
Recipe
Example #1
Cook eggplant in boiling water , covered , for 10 min .
Drain and cut in half lengthwise . scoop out insides
leaving 1/2 '' shell . Mash insides with cottage cheese ,
onion , bay leaf , basil , oregano and tomato sauce .
Preheat oven to 350 ^ stuff eggplant halves , place in
casserole dish and bake covered for 15 min .
Add a little water to bottom of pan to keep eggplant
moist . top with provolone cheese .
Bake 5 more min uncovered 1 serving =
In a small bowl , combine the cheese , eggplant ,
basil , oregano , tomato sauce and onion . Mix well .
Shape mixture into 6 patties , each about 3/4-inch
thick.
Place on baking sheet .
Bake at 350 degrees for 30 minutes or until lightly
browned .
Southern living magazine , sometime in 1980 .
Typed for you by nancy coleman .
eggplant
cheese cottage
lowfat
chopped onion
bay ground leaf
basil
oregano
tomato sauce
provolone
title: oven
eggplant
Example: skillet chicken rice
Ingredients
2 lb chicken pieces skinned
3 cups mushrooms fresh sliced
4 carrots peeled sliced 1/2”
3/4 cups long grain rice
1/2 c onion chopped
1 tsp poultry seasoning
1 tsp bouillon chicken granules
1/4 tsp salt
In a large skillet, brown chicken in oil.
Add chicken and rice; cook, stirring, until
chicken is no longer pink.
Add rice and broth; bring to a boil.
Reduce heat to low; cover and cook 5
minutes or until rice is tender and liquid is
absorbed. Stir in rice.
Heat to boiling; reduce heat.
Cover and simmer until rice is tender,
about 20 minutes.
Stir in rice. Heat to boiling; reduce heat.
Cover and simmer 20 minutes or until rice
is tender and liquid is absorbed and rice
is tender.
Baseline
In large skillet, heat rice and onion.
Add carrots and mushrooms; cook,
stirring constantly, until hot and
bubbly. Stir in seasonings. Cook
and stir over medium heat until hot
and bubbly. Stir in chicken and salt.
Cover and cook 5 minutes or until
chicken is no longer pink in center.
Serve over rice.
Checklist
Spray a 12-inch skillet with nonstick spray
coating.
Brown chicken pieces on all sides over
medium heat about 15 minutes.
Remove chicken.
Drain fat from skillet, if neccessary.
Add mushrooms, carrots, rice, onion,
bouillon, poultry seasoning, 2 cups
water, salt.
Place chicken atop rice mixture.
Cover; simmer 30 minutes or till chicken
and rice are done.
True recipe
Example: chocolate covered potato
chips
Ingredients
8 oz sweet milk chocolate
24 thick sliced ridged potato chips
Preheat oven to 350 degrees.
Grease and flour a 9 x 13 inch pan.
Bake at 350 degrees for 30 minutes.
Baseline
Bake in a 400 degree f oven for 15
minutes or until lightly browned.
Drain on paper towels. Melt
chocolate in top of double boiler
over simmering water. Remove from
heat and cool to room temp. Add
potato mixture to the potato
mixture; mix well. Cover tightly with
plastic wrap. Refrigerate until firm
enough to handle. Shape into
croquettes. Fry in hot oil until
golden brown. Drain on paper
towels. Serve hot.
Checklist
Melt 6 ounces of chocolate. Remove from
heat. While chocolate is still hot, mix the 2
ounces of chocolate into the melted
chocolate, in 3 additions, stirring until each
addition is completely melted before adding
the next. Dip the potato chips, 1 at a time, in
the chocolate. Coat completely and lift with a
small fork. Shake off excess chocolate by
rapping the fork on the edge of the bowl
lightly. Remove any drips from the bottom by
running the fork across the edge of the bowl.
Slide the chips onto a cookie sheet lined with
parchment or wax paper. Allow to cool until
solid. Let chips sit at room temperature or in
the refrigerator.
True recipe
Baselines
• Encoder-Decoder baseline (EncDec)
– Encode title and ingredients as sequence, decode
recipe
• Attention model baseline
– Use attention embedding to ingredient list during
output computation at each step
Human evaluation
• Amazon Mechanical Turk
0 1 2 3 4 5
Ingredient use
EncDec baseline
Checklist
True recipes
Human evaluation
• Amazon Mechanical Turk
0 1 2 3 4 5
Ingredient use
Follows goal
EncDec baseline
Checklist
True recipes
Human evaluation
• Amazon Mechanical Turk
0 1 2 3 4 5
Grammar
Ingredient use
Follows goal
EncDec baseline
Checklist
True recipes
Natural vs. Neural Language (on dev set)
Human Recipes
Checklist NN
Enc-Dec RNN
|V| = 5206
33% of
|V|
16% of
|V|
Neural checklist model for dialogue
generation
• Dialogue system output generation
– Hotel & restaurant informational systems
– Wen et al. 2015
– inform{name(Hotel Stratford),has_internet(no)}
• “Hotel Stratford does not have internet.”
Examples
inform(name=piperade;good_for_meal=dinner;food=basque)
piperade is a basque restaurant that is good for dinner.
inform(name='red door cafe';good_for_meal=breakfast;area='cathedral
hill';kids_allowed=no)
the red door cafe is good for breakfast, does not allow children and is in
the cathedral hill area.
inform(name='manna';area='hayes valley or inset';address='845 irving
street')
manna is in the hayes valley or inset area. the address is 845 irving street.
?select(dogs_allowed='yes';dogs_allowed='no')
would you like a hotel that allows dogs?
Hotel domain
56
48
87
61
91
61
0
25
50
75
100
BLEU METEOR
Handcrafted baseline SC-LSTM Checklist
Restaurant results
44 43
74
54
78
54
0
25
50
75
100
BLEU METEOR
Handcrafted baseline SC-LSTM Checklist
Dynamic Entity Representations in
Neural Language Models
Yangfeng Ji et	al.	@	EMNLP	2017
Static vs Dynamic Entity
Representations
• Human: what is your job?
• Machine: i’m a lawyer
• Human: what do you do?
• Machine: i’m a doctor
I am a lawyer
Example	from	Paperno et	al.	2016
…
I am a doctor …
Neural Process Network
Antoine	Bosselut et	al.	(In	Preparation)
What’s missing in the end-to-end…
• Title: deep-fried cauliflower
• Ingredients: cauliflower, frying oil, sauce, salt, pepper.
è
Wash and dry the cauliflower.
Heat the oil in a skillet and fry the sauce until they are
golden brown.
Drain on paper towels.
Add the sauce to the sauce and mix well.
Serve hot or cold.
Game Plan - Recap
1. Commonsense Frame Semantics
– naive physics
– Social commonsense
2. Modeling the World, not just Language
– Dynamic entities
– Action causalities
Choose Action:
slice
Choose Entity:
tomato
Read:
Slice the
tomatoes
State Changes:
shape separated
Choose Action:
bake
Choose Entity:
<IMPLICIT>
Read:
Bake in pan for
20 minutes.
State Changes:
cookedness cooked
temperature hot
location oven
Read:
<END_RECIPE>
Action Causality
Neural Process Networks
“understanding by simulation”
Concluding Remarks
Toward Intelligent Communication
1. Commonsense Frame Semantics
– naive physics
– Social commonsense
– Packing knowledge into lexical/frame
semantics
2. Modeling the World, not just Language
– Dynamic entities
– Action causalities
Fillmore Tribute Workshop @ ACL 2014
“X stabbing Y with Z”
agent theme
writer
reader=
=
=
Connotation
Frame
Perspective(wàX) = The writer considers X guilty.
Effect(Y) = What happenedis bad for Y.
Value(Y) = Y is inherently valuable.
VerbPhysics Y < Z in RIGID It’s not possible to stab someone
with a cheeseburger.
Thanks! Questions?

More Related Content

Similar to Yejin Choi - 2017 - From Naive Physics to Connotation: Modeling Commonsense in Frame Semantics

Science and Religion: A Power Point
Science and Religion: A Power PointScience and Religion: A Power Point
Science and Religion: A Power Point
evidenceforchristianity
 
Graphic Organizers for Vocabulary
Graphic Organizers for Vocabulary Graphic Organizers for Vocabulary
Graphic Organizers for Vocabulary
TDWolsey
 
Research for the terrified (and the tied up, the tired and the just plain tur...
Research for the terrified (and the tied up, the tired and the just plain tur...Research for the terrified (and the tied up, the tired and the just plain tur...
Research for the terrified (and the tied up, the tired and the just plain tur...
IL Group (CILIP Information Literacy Group)
 
Research methodology hypothesis construction
Research methodology   hypothesis constructionResearch methodology   hypothesis construction
Research methodology hypothesis construction
Sami Nighaoui
 
Soft computing
Soft computingSoft computing
Soft computing
rohith attaluri
 
Research for the terrified
Research for the terrifiedResearch for the terrified
Research for the terrified
Emma Coonan
 
first_order_logic.pdf
first_order_logic.pdffirst_order_logic.pdf
first_order_logic.pdf
AhmedNURHUSIEN
 
Logic
LogicLogic
Natural sciences 2017 18
Natural sciences 2017 18Natural sciences 2017 18
Natural sciences 2017 18
Kieran Ryan
 
Bayeasian inference
Bayeasian inferenceBayeasian inference
Bayeasian inference
Global Polis
 

Similar to Yejin Choi - 2017 - From Naive Physics to Connotation: Modeling Commonsense in Frame Semantics (10)

Science and Religion: A Power Point
Science and Religion: A Power PointScience and Religion: A Power Point
Science and Religion: A Power Point
 
Graphic Organizers for Vocabulary
Graphic Organizers for Vocabulary Graphic Organizers for Vocabulary
Graphic Organizers for Vocabulary
 
Research for the terrified (and the tied up, the tired and the just plain tur...
Research for the terrified (and the tied up, the tired and the just plain tur...Research for the terrified (and the tied up, the tired and the just plain tur...
Research for the terrified (and the tied up, the tired and the just plain tur...
 
Research methodology hypothesis construction
Research methodology   hypothesis constructionResearch methodology   hypothesis construction
Research methodology hypothesis construction
 
Soft computing
Soft computingSoft computing
Soft computing
 
Research for the terrified
Research for the terrifiedResearch for the terrified
Research for the terrified
 
first_order_logic.pdf
first_order_logic.pdffirst_order_logic.pdf
first_order_logic.pdf
 
Logic
LogicLogic
Logic
 
Natural sciences 2017 18
Natural sciences 2017 18Natural sciences 2017 18
Natural sciences 2017 18
 
Bayeasian inference
Bayeasian inferenceBayeasian inference
Bayeasian inference
 

More from Association for Computational Linguistics

Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal TextMuis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Association for Computational Linguistics
 
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Association for Computational Linguistics
 
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour AnalysisCastro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Association for Computational Linguistics
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Association for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Association for Computational Linguistics
 
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text SimplificationElior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Association for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Association for Computational Linguistics
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Association for Computational Linguistics
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Association for Computational Linguistics
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Association for Computational Linguistics
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
Association for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
Association for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Association for Computational Linguistics
 
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Association for Computational Linguistics
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
Association for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Association for Computational Linguistics
 

More from Association for Computational Linguistics (20)

Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal TextMuis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
 
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
 
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour AnalysisCastro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text SimplificationElior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 

Recently uploaded

Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
National Information Standards Organization (NISO)
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
Nguyen Thanh Tu Collection
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
سمير بسيوني
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
imrankhan141184
 
Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)
nitinpv4ai
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
siemaillard
 
A Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two HeartsA Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two Hearts
Steve Thomason
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
haiqairshad
 
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
TechSoup
 
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
ImMuslim
 
Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10
nitinpv4ai
 
SWOT analysis in the project Keeping the Memory @live.pptx
SWOT analysis in the project Keeping the Memory @live.pptxSWOT analysis in the project Keeping the Memory @live.pptx
SWOT analysis in the project Keeping the Memory @live.pptx
zuzanka
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
deepaannamalai16
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
Nguyen Thanh Tu Collection
 
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptxRESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
zuzanka
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
EduSkills OECD
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
Krassimira Luka
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
deepaannamalai16
 
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.pptLevel 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Henry Hollis
 

Recently uploaded (20)

Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
 
Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
 
A Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two HeartsA Visual Guide to 1 Samuel | A Tale of Two Hearts
A Visual Guide to 1 Samuel | A Tale of Two Hearts
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
 
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
 
Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10
 
SWOT analysis in the project Keeping the Memory @live.pptx
SWOT analysis in the project Keeping the Memory @live.pptxSWOT analysis in the project Keeping the Memory @live.pptx
SWOT analysis in the project Keeping the Memory @live.pptx
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
 
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptxRESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
 
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.pptLevel 3 NCEA - NZ: A  Nation In the Making 1872 - 1900 SML.ppt
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.ppt
 

Yejin Choi - 2017 - From Naive Physics to Connotation: Modeling Commonsense in Frame Semantics

  • 1. From Naive Physics to Connotation: Modeling Commonsense in Frame Semantics Yejin Choi Paul G. Allen School of Computer Science & Engineering
  • 3. Intelligent Communication è Reading between the lines Understanding what is said + what is not said “CHEESEBURGER STABBING” • Someone stabbed a cheeseburger? • A cheeseburger stabbed someone? • A cheeseburger stabbed another cheeseburger? • Someone stabbed someone else with a cheeseburger? • Someone stabbed someone else over a cheeseburger?
  • 4. Intelligent Communication è Reading between the lines Understanding what is said + what is not said “CHEESEBURGER STABBING” • Someone stabbed a cheeseburger? • A cheeseburger stabbed someone? • A cheeseburger stabbed another cheeseburger? • Someone stabbed someone else with a cheeseburger? • Someone stabbed someone else over a cheeseburger? Knowledge about the world to fill in the gap!
  • 5. Blueberry Muffins Ingredients 1 cup milk 1 egg 1/3 cup vegetable oil 2 cups all-purpose flour 2 teaspoons baking powder 1/2 cup white sugar 1/2 cup fresh blueberries Procedure 1. Preheat oven to 400 degrees F. Line a 12-cup muffin tin with paper liners. 2. In a large bowl, stir together milk, egg, and oil. Add flour, baking powder, sugar, and blueberries; gently mix the batter with only a few strokes. Spoon batter into cups. 3. Bake for 30 minutes. Serve hot. http://allrecipes.com/Recipe/Blueberry-Muffins-I/ Intelligent Communication Bake what? where? Knowledge about the world to fill in the gap!
  • 6. Encyclopedic knowledge – Who is the president of which country and born in what year… Commonsense knowledge – It’s not possible to stab someone using a cheeseburger – Stabbing a cheeseburger is not newsworthy… – Stabbing someone is generally immoral – How to make a cheeseburger Types of Knowledge Information Extraction Naïve Physics Social Norms Procedural (How-to) Knowledge
  • 7. Plan 1. Commonsense Frame Semantics – Naive physics 2. Modeling the World, not just Language
  • 8. Verb Physics relative physical knowledge about actions and objects Maxwell Forbes et al. (ACL 2017)
  • 9. Winograd Schema Challenge • The trophy would not fit in the brown suitcase because it was too big (small). What was too big (small)? Answer 0: the trophy Answer 1: the suitcase
  • 10. What are the pre- and post-conditions of the action “to put X into Y”? Physical Knowledge about the World “Hey, Robot, Fetch me a container to put this pie.” If I drop this styrofoam ball into the steel table, will either break?
  • 11. Support from Psychology Studies • A familiar-size stroop effect. (Konkle, T., and Oliva, A. 2012.) • Knowledge on size represented in a perceptual or analog format (Moyer, 1973; Paivio, 1975; Rubinsten & Henik, 2002). • Objects have a canonical visual size (Konkle & Oliva, 2011; Linsen, Leyssen, Sammartino, & Palmer, 2011). congruent incongruent
  • 12. Overcoming Reporting Bias • People don’t state the obvious – (Van Durme 2010, Sorower et al., 2011)
  • 13. “I am larger than a chair”
  • 14. “I am larger than a chair”
  • 15. “I am larger than a chair” “I am larger than a ball” “I am larger than a stone” “I am larger than a pen” “I am larger than a towel”
  • 16. “The horse was as small as a dog!” ⟹ horse =size dog ?
  • 17. • Language – absolute estimation • “car is * x * m” • “person is * m tall” • Vision – relative estimation area(box1) area(box2 ) × depth(box1)2 depth(box2 )2 size(Oi ) size(Oj ) = Are Elephants Bigger than Butterflies? (Bagherinezhad et al. @ AAAI 2016)
  • 18. Overcoming Reporting Bias • People don’t state the obvious – Elephants are bigger than butterflies • Thoughts last year – Look beyond language, such as vision! – (AAAI 2016, ACL 2016, ICCV 2015) – Can’t learn diverse physical knowledge (weight, strength, rigidness, strength…) • Thoughts this year – Back to language, but with a different game plan!
  • 20. “I am larger than a chair” “I am larger than a ball” “I am larger than a stone” “I am larger than a pen” “I am larger than a towel”
  • 21. “I threw the _____” pen stone chair ball towel
  • 22. x threw y x is bigger than y x weighs more than y as a result, y will be moving faster than x
  • 23. Let’s solve two related problems Physical properties implied by predicates “I picked up the <unk>.” “The <unk> shattered when it hit the ground Physical properties of objects “I dropped a cherry into the <unk>.”
  • 24. Along Five Dimensions x >size y x >weight y x <rigidness y x >strength y x <speed y
  • 25. Learning Knowledge from Language Using IE Patterns (Gordon et al., 2010, Gordon and Schubert, 2012, Narisawa et al. 2013, Tandon et al. 2014) Narrative Schemas, Script knowledge (Chambers and Jurafsky. 2009, Pichotta and Mooney 2014, 2016) Inferring Knowledge through Reasoning Inverting Grice’s maxims Mohammad S. Sorrower, et al., 2011 Natural logic, natural reasoning, and entailment MacCartney and Manning, 2007, Angeli and Manning 2013, 2014, Bowman et al., 2015
  • 28. x threw y ⇒ x is larger than y ⇒ x is heavier than y ⇒ x is slower than y R(agent, theme) Representation: VerbPhysics Frames action theme agent tool goal … “He threw the ball”
  • 29. “He threw the ball” x threw y ⇒ x is larger than y ⇒ x is heavier than y ⇒ x is slower than y “We walked into the house” x walked into y ⇒ x is smaller than y ⇒ x is lighter than y ⇒ x is faster than y R(agent, theme) Representation: VerbPhysics Frames action theme agent tool goal … R(agent, goal)
  • 30. “He threw the ball” x threw y ⇒ x is larger than y ⇒ x is heavier than y ⇒ x is slower than y “We walked into the house” x walked into y ⇒ x is smaller than y ⇒ x is lighter than y ⇒ x is faster than y “I squashed the bug with my boot” squashed x with y ⇒ x is smaller than y ⇒ x is lighter than y ⇒ x is weaker than y ⇒ x is less rigid than y ⇒ x is slower than y R(agent, theme) R(agent, goal) R(theme, goal) Representation: VerbPhysics Frames action theme agent tool goal …
  • 31. Model
  • 32. … … … random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity object similarity attribute similarity selectional preference
  • 33. group of RVs Each variable represents an entailment knowledge “If X throws Y, then X > Y in terms of size” Or “If X throws Y, then Z in size” where Z = {>, <, =} (Using Levin action verbs (Levin, 1993))
  • 34. random variable (RV) group of RVs Each variable represents an entailment knowledge “If X throws Y, then X > Y in terms of size” Or “If X throws Y, then Z in size” where Z = {>, <, =} (Using Levin action verbs (Levin, 1993)) ~ 8 frames per predicate x threw y PERSON threw x into y PERSON threw x in y PERSON threw x on y PERSON threw x PERSON threw out x … x threw out y “threw”
  • 35. random variable (RV) group of RVs factor connects RV frame similarity Each variable represents an entailment knowledge “If X throws Y, then X > Y in terms of size” Or “If X throws Y, then Z in size” where Z = {>, <, =} (Using Levin action verbs (Levin, 1993)) ~ 8 frames per predicate x threw y PERSON threw x into y PERSON threw x in y PERSON threw x on y PERSON threw x PERSON threw out x … x threw out y “threw”
  • 36. random variable (RV) group of RVs factor connects RV frame similarity
  • 37. random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity
  • 38. random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity Each variable represents relative object knowledge “p is generally bigger than q in terms of size” Or “p Z q in size” where Z = {>, <, =}
  • 39. random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity Each variable represents relative object knowledge “p is generally bigger than q in terms of size” Or “p Z q in size” where Z = {>, <, =}
  • 40. random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity object similarity Each variable represents relative object knowledge “p is generally bigger than q in terms of size” Or “p Z q in size” where Z = {>, <, =} Each variable represents an entailment knowledge “If X throws Y, then X > Y in terms of size” Or “If X throws Y, then Z in size” where Z = {>, <, =}
  • 41. random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity object similarity selectional preference Each variable represents relative object knowledge “p is generally bigger than q in terms of size” Or “p Z q in size” where Z = {>, <, =} Each variable represents an entailment knowledge “If X throws Y, then X > Y in terms of size” Or “If X throws Y, then Z in size” where Z = {>, <, =}
  • 42. random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity object similarity selectional preference
  • 43. random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity object similarity attribute similarity selectional preference
  • 44. … random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity object similarity attribute similarity selectional preference
  • 45. … … … random variable (RV) group of RVs factor connects RV factors connect subset of RVs verb similarity frame similarity object similarity attribute similarity selectional preference
  • 47. 0 00 0 0 1 1 1 0.75 0.70 0 0.2 0.4 0.6 0.8 ACCURACY(TEST) FRAMES OBJECTS
  • 48. To Conclude • Reverse Engineering Commonsense Knowledge • By solving both puzzles simultaneously: – Learn VerbPhysics frames – Learn object – object relations • w.r.t. relative physical knowledge over 5 dimensions: – Size, weight, strength, flexibility, speed èJust from text, without embodiment è Vision can’t do weight, strength, flexibility …
  • 49. Zero-shot Activity Recognition with Verb Attribute Induction Rowan Zellers et al. @ EMNLP 2017
  • 52. Plan 1. Commonsense Frame Semantics – Naive physics – Social commonsense 2. Modeling the World, not just Language
  • 54. How Language Describes the World “All I know is what I read in the papers.” – Will Rogers
  • 55. How Language Describes the World (per pro-life) Supreme court Texas Law “endanger” Women abortion “protect lives and health” “refused to hear” “blocking regulations” “protect” “violate” clinic “close, require”
  • 56. An Alternative World View (per pro-choice) Supreme court Texas Law “strike down” Women abortion “impose” “seeks, obtains” Clinics “violate, cause” “make choices, has a constitutional right” “shutdown, force, threaten”
  • 57. Prior Literature • A ton of work on sentiment analysis - Especially on product reviews and movie reviews - (Pang and Lee 2009 for survey, Socher et al 2013) • Recent work on implied sentiment – (Greene and Resnik 2009 , Feng et al 2013; Choi and Wiebe 2014; Mohammad and Turney 2010; Deng and Wiebe 2014) Supreme court Texas Law “strike down” Women “impose”
  • 58. “criticize” frame per left-leaning sources: Obama criticize someone? or Obama is critized by someone? (Based on the Stream Corpus (2014) w.r.t. 30 news sources with known liberal / conservative leanings)
  • 59. Connotative Meaning of Words • Semantic prosody (Sinclair 1991, Louw 1993) “______ caused ______” • Wittgenstein’s view: – the meaning of a word is its use in the language • Connotation arises from the (typical) context in which words are used • Can we encode connotation in frame semantics? headache, cancer, … appreciation, award? Texas Law “cause”
  • 60. “violate” frame agent theme writer reader • Perspective (x->y) : directed sentiment
  • 61. “violate” frame agent theme writer reader = = = • Perspective (x->y) : directed sentiment • Effect (x) : whether the event is good for (or bad for) x • State (x) : the private state of x as a result of the event • Value (x) : the intrinsic value of x Connotation judgments are not always categorical, some are better to be modeled as distributions. (analogously to de Marneffe et al. 2012 on veridicality assessment)
  • 62. “criticize” frame per left-leaning sources: Obama criticize someone? or Obama is critized by someone? (Based on the Stream Corpus (2014) w.r.t. 30 news sources with known liberal / conservative leanings)
  • 63. Left/right leaning sources Verb Role Left-leaning sources Right-leaning sources accuse subject [-] Putin, Progressives, Limbaugh, Gingrich activist, U.S., protestor, Chavez object [+] Official, rival, administration, leader Romney, Iran, Gingrich, regime, opposition attack subject [-] McCain, Trump, Limbaugh Obama, campaign, Biden, Israel object [+] Gingrich, Obama, policy citizen, Zimmerman criticize subject [-] Ugandans, rival, Romney, Tyson Britain, passage, Obama, Maddow object [+] sensation, Obama, Allen, Cameron, Congress threat, Pelosi, Romey, GOP, Republicans suffer subject [+] woman, victim, officer, father Christie, Rangers, people, man object [-] attack, blow, burn, seizure, casualty injury, fool, loss, concussion (Based on the Stream Corpus (2014) w.r.t. 30 news sources with known liberal / conservative leanings)
  • 64. Connotation Frames 1. Analysis on large-scale news corpora 2. Crowdsourcing experiments 15 annotators answered 16 questions for each of the top 1000 verbs from the NY times corpus. Average Krippendorff’s alpha is 0.25 and 2-way soft agreement is 95%. 3. Computational models Writer: The judge upheld the verdict. Questions: How the judge feels about the verdict: Negative PositiveNeutral Negative or Neutral Positive or Neutral
  • 66. Base Model: Embedding to Connotation Frames word embedding for verb v (Levy and Goldberg 2014) … Log-linear parameterization • Frame lexicon induction • Separate frame prediction for each predicate
  • 67. Connotation Dynamics agent theme writer 1. Perspective Triads: ~ social balance theory (Cartwright 1956) is positive towards xj, and xj is positive towards xk, then we expect xi is al mics hold for the negative case. Pw!xi = ¬ (Pw!xj Pxi!xj ) ate has a positive effect on one argument, then we expect that the interactio e. Similar dynamics hold for the negative case. Exi = Pxj !xi xi is presupposed as valuable, then we expect that the writer also views x or the negative case. Vxi = Pw!xi icate has a positive effect on an argument xi, then we expect that xi will gain mics hold for the negative case. Sxi = Exi ped Relations: we propose models that parameterize these dynamics using lo
  • 68. Connotation Dynamics agent theme writer 1. Perspective Triads: ~ social balance theory (Cartwright 1956) 2. Effect – Mental State: is positive towards xj, and xj is positive towards xk, then we expect xi is al mics hold for the negative case. Pw!xi = ¬ (Pw!xj Pxi!xj ) ate has a positive effect on one argument, then we expect that the interactio e. Similar dynamics hold for the negative case. Exi = Pxj !xi xi is presupposed as valuable, then we expect that the writer also views x or the negative case. Vxi = Pw!xi icate has a positive effect on an argument xi, then we expect that xi will gain mics hold for the negative case. Sxi = Exi ped Relations: we propose models that parameterize these dynamics using lo = ¬ (Pw!xj Pxi!xj ) sitive effect on one argument, then we expect that the interaction ynamics hold for the negative case. Exi = Pxj !xi pposed as valuable, then we expect that the writer also views xi ive case. Vxi = Pw!xi ositive effect on an argument xi, then we expect that xi will gain a or the negative case. Sxi = Exi ns: we propose models that parameterize these dynamics using log-linea ously unseen verbs. For each verb predicate, we in fer a connotation frame composed of 9 relationshi aspects that represent: perspective {P(w ! t) P(w ! a), P(a ! t)}, effect {E(t), E(a)}, valu
  • 69. Connotation Dynamics agent theme writer 1. Perspective Triads: ~ social balance theory (Cartwright 1956) 2. Effect – Mental State: 3. Effect – Perspective: is positive towards xj, and xj is positive towards xk, then we expect xi is al mics hold for the negative case. Pw!xi = ¬ (Pw!xj Pxi!xj ) ate has a positive effect on one argument, then we expect that the interactio e. Similar dynamics hold for the negative case. Exi = Pxj !xi xi is presupposed as valuable, then we expect that the writer also views x or the negative case. Vxi = Pw!xi icate has a positive effect on an argument xi, then we expect that xi will gain mics hold for the negative case. Sxi = Exi ped Relations: we propose models that parameterize these dynamics using lo = ¬ (Pw!xj Pxi!xj ) sitive effect on one argument, then we expect that the interaction ynamics hold for the negative case. Exi = Pxj !xi pposed as valuable, then we expect that the writer also views xi ive case. Vxi = Pw!xi ositive effect on an argument xi, then we expect that xi will gain a or the negative case. Sxi = Exi ns: we propose models that parameterize these dynamics using log-linea ously unseen verbs. For each verb predicate, we in fer a connotation frame composed of 9 relationshi aspects that represent: perspective {P(w ! t) P(w ! a), P(a ! t)}, effect {E(t), E(a)}, valu e towards xj, and xj is positive towards xk, then we expect xi is also for the negative case. !xi = ¬ (Pw!xj Pxi!xj ) positive effect on one argument, then we expect that the interaction r dynamics hold for the negative case. Exi = Pxj !xi esupposed as valuable, then we expect that the writer also views xi gative case. Vxi = Pw!xi a positive effect on an argument xi, then we expect that xi will gain a for the negative case. Sxi = Exi
  • 70. Connotation Dynamics agent theme writer 1. Perspective Triads: ~ social balance theory (Cartwright 1956) 2. Effect – Mental State: 3. Effect – Perspective: 4. Value – Perspective: is positive towards xj, and xj is positive towards xk, then we expect xi is al mics hold for the negative case. Pw!xi = ¬ (Pw!xj Pxi!xj ) ate has a positive effect on one argument, then we expect that the interactio e. Similar dynamics hold for the negative case. Exi = Pxj !xi xi is presupposed as valuable, then we expect that the writer also views x or the negative case. Vxi = Pw!xi icate has a positive effect on an argument xi, then we expect that xi will gain mics hold for the negative case. Sxi = Exi ped Relations: we propose models that parameterize these dynamics using lo = ¬ (Pw!xj Pxi!xj ) sitive effect on one argument, then we expect that the interaction ynamics hold for the negative case. Exi = Pxj !xi pposed as valuable, then we expect that the writer also views xi ive case. Vxi = Pw!xi ositive effect on an argument xi, then we expect that xi will gain a or the negative case. Sxi = Exi ns: we propose models that parameterize these dynamics using log-linea ously unseen verbs. For each verb predicate, we in fer a connotation frame composed of 9 relationshi aspects that represent: perspective {P(w ! t) P(w ! a), P(a ! t)}, effect {E(t), E(a)}, valu e towards xj, and xj is positive towards xk, then we expect xi is also for the negative case. xi = ¬ (Pw!xj Pxi!xj ) positive effect on one argument, then we expect that the interaction dynamics hold for the negative case. Exi = Pxj !xi supposed as valuable, then we expect that the writer also views xi gative case. Vxi = Pw!xi a positive effect on an argument xi, then we expect that xi will gain a for the negative case. Sxi = Exi e towards xj, and xj is positive towards xk, then we expect xi is also for the negative case. !xi = ¬ (Pw!xj Pxi!xj ) positive effect on one argument, then we expect that the interaction r dynamics hold for the negative case. Exi = Pxj !xi esupposed as valuable, then we expect that the writer also views xi gative case. Vxi = Pw!xi a positive effect on an argument xi, then we expect that xi will gain a for the negative case. Sxi = Exi
  • 71. Marginal Probability: Factor Graph Model Factor potentials: Inference: Message passing:
  • 72. 58.81 63.71 67.93 68.26 55 58.75 62.5 66.25 70 Accuracy 41.46 47.30 53.17 53.50 35 41.25 47.5 53.75 60 F1 Scores Experimental Results train/dev/test sets of 300 predicates each, 3-way classification
  • 73. Power and Agency Frames in Modern Films Maarten Sap et al. (EMNLP 2017 in submission)
  • 74. Bias in Modern Films • Not a news per say, but more insights from recent studies ((https://www.google.com/intl/en/about/main/gende r-equality-films/) • Bechdel test: – At least two women – talk to each other – about something other than a man. • Agarwal et al. 2015 report 40% of the films pass the Bechdel test. • However, those that pass the test still have problems… A character in Dykes to Watch Out For explains the rules that later came to be known as the Bechdel test (1985)
  • 75. Character Portrayal in Films “… Elsa objects, unleashing her powers… Elsa discards her crown and creates a palace… Elsa […] freezes Anna … She then summons… Elsa ends the winter” “… accidentally injures Anna … Hans proposes to Anna … She gets lost … chases Anna … Anna spots Hans … He locks Anna in a room…” “Cinderella lives an unhappy life… agrees to let Cinderella go… he sees Cinderella… Cinderella hears the clock… allowing the mice to free Cinderella… Cinderella appears…”
  • 76. Power and Agency Frame agent theme writer readerAgency (x) Agency (x) Power(x,y) • Agency (x) – Powerful, decisive, capable of pushing forward their own storyline – “waiting” v.s. “searching” for your prince. • Power (x,y) – Differential authority level between the verb agent and theme – “demanding” v.s. “begging” for mercy
  • 77. Frame Verbs Agency (+) terminate, silence, sue, command, invade, throw, organize, authorize, persuade, battle Agency (-) shimmer, remain, consist of, border, plummet, age, whimper, inherit, recede, resemble Power (agent) squash, seduce, assign, order, suspend, regulate, possess, eject, prohibit, ignite, destroy Power (theme) obey, dread, ask, worship, belong to, trust, bow to, serve, implore, require, salute Collected using Mturk, annotated ~2000 verbs Power and Agency Frame
  • 78. Power and agency connotation frames run on the Wikipedia plot summaries of Frozen (2013) and Cinderella (1950)
  • 79. Bias in Modern Films • Male characters tend to drive the plot more through high agency verbs • Connotation frames provide more nuanced analysis compared to existing tests (Bechdel, Mako Mori, Furiosa, Sexy Lamp test) Frame Gender association Agency (+) M** Agency (-) F** Power (agent) M** Power (theme) n.s. Gender association with connotation frames using a logistic regression, controlling for number of words. **: p<0.001 (Holm corrected) 772 scripts (Gorinski & Lapata, 2015), 21K characters with automatically extracted gender
  • 80. To Conclude • Aspects of connotation can be packed into lexical and frame semantics agent theme writer readerAgency (x) Agency (x) Power(x, y)
  • 81. NLP for Social Good! • Identifying unwanted biases in language – (Sap et al. @ EMNLP 2017!) • graded deception detection in media – www.politifact.org – (Rashkin et al. @ EMNLP 2017!)
  • 82. Game Plan 1. Commonsense Frame Semantics – Naive physics – Social commonsense 2. Modeling the World, not just Language – Dynamic entities – Action causalities
  • 83. one vector You can't cram the meaning of a whole %&!$# sentence into a single $&!#* vector! a sequence of vectors? by byoutput output token <S> token tokendecode decode byinput tokenencode token
  • 85. Task Definition 𝑓 • Input: – Goal: an (optional) title phrase – Agenda: a list of items to talk about • Output: long text with many sentences
  • 86. Blueberry Muffins Ingredients 1 cup milk 1 egg 1/3 cup vegetable oil 2 cups all-purpose flour 2 teaspoons baking powder 1/2 cup white sugar 1/2 cup fresh blueberries Procedure 1. Preheat oven to 400 degrees F. Line a 12-cup muffin tin with paper liners. 2. In a large bowl, stir together milk, egg, and oil. Add flour, baking powder, sugar, and blueberries; gently mix the batter with only a few strokes. Spoon batter into cups. 3. Bake for 20 minutes. Serve hot. http://allrecipes.com/Recipe/Blueberry-Muffins-I/ Cooking Instructions
  • 87. How to generate recipes planning models interpretation generation abstraction planning recipe plan recipe text Is it possible to simplify this?
  • 88. Document-level Graph Parsing (Kiddon et al., 2015) Blueberry Muffins Ingredients 1 cup milk 1 egg 1/3 cup vegetable oil 2 cups all-purpose flour 2 teaspoons baking powder 1/2 cup white sugar 1/2 cup fresh blueberries Procedure 1. Preheat oven to 400 degrees F (205 degrees C). Line a 12-cup muffin tin with paper liners. 2. In a large bowl, stir together milk, egg, and oil. Add flour, baking powder, sugar, and blueberries; gently mix the batter with only a few strokes. Spoon batter into cups. 3. Bake for 20 minutes. Serve hot. 1. Predicate argument structure with implicit arguments (~ SRL) 2. How entities are introduced and then flow through different actions (~ reference resolution) 3. Discourse-level parsing
  • 89. Recipe generation as machine translation? • Encoder-decoder NNs work great for machine translation (Cho et al.2014, Sutskever et al. 2014) by byoutput output token <S> token tokendecode decode byinput tokenencode token • We can translate a recipe title into the text
  • 90. Encode title - decode recipe Cut each sandwich in halves. Sandwiches with sandwiches. Sandwiches, sandwiches, Sandwiches, sandwiches, sandwiches sandwiches, sandwiches, sandwiches, sandwiches, sandwiches, sandwiches, or sandwiches or triangles, a griddle, each sandwich. Top each with a slice of cheese, tomato, and cheese. Top with remaining cheese mixture. Top with remaining cheese. Broil until tops are bubbly and cheese is melted, about 5 minutes. sausage sandwiches
  • 91. Recipe generation vs machine translation by byrecipe recipe token <S> token tokendecode decode recipe title ingredient 1 ingredient 2 ingredient 3 ingredient 4 Two input sources • Only ~6-10% words align between input and output. • The rest must be generated from context (and implicit knowledge about cooking) • Contextual switch between two different input sources
  • 92. Challenges 1. Modeling global coherence and coverage – May not be possible to compress everything on to one hidden vector! 2. Need a mechanism to track an “agenda” -- which ingredients have been already used -- what need to be yet introduced
  • 94. Let’s make salsa! Garlic tomato salsa tomatoes onions garlic salt
  • 95. Neural checklist model LM Chop <S> hidden state classifier: non-ingredient new ingredient used ingredient which ingredients are still available new hidden state tomato salsagarlic
  • 99. Checklist is probabilistic tomatoes tomatoesAdd Add to to . . 0.90 0.08 0.01 0.01 used ingredient 0.85 1.001.000.85 0.85 1.00 0.020.02 0.02 0.040.04 0.04 = new ingredient prob. distribution
  • 100. Hidden state classifier is soft tomatoes tomatoes Add Add to to . . 0.90 0.08 0.01 0.01 0.85 1.001.000.85 0.85 1.00 0.020.02 0.02 0.040.04 0.04 0.00 0.00 0.50 0.50 0.85 1.00 0.02 0.04 0.940.05 0.01
  • 101. Neural Recipe Example #1 Cook eggplant in boiling water , covered , for 10 min . Drain and cut in half lengthwise . scoop out insides leaving 1/2 '' shell . Mash insides with cottage cheese , onion , bay leaf , basil , oregano and tomato sauce . Preheat oven to 350 ^ stuff eggplant halves , place in casserole dish and bake covered for 15 min . Add a little water to bottom of pan to keep eggplant moist . top with provolone cheese . Bake 5 more min uncovered 1 serving = In a small bowl , combine the cheese , eggplant , basil , oregano , tomato sauce and onion . Mix well . Shape mixture into 6 patties , each about 3/4-inch thick. Place on baking sheet . Bake at 350 degrees for 30 minutes or until lightly browned . Southern living magazine , sometime in 1980 . Typed for you by nancy coleman . eggplant cheese cottage lowfat chopped onion bay ground leaf basil oregano tomato sauce provolone title: oven eggplant
  • 102. Example: skillet chicken rice Ingredients 2 lb chicken pieces skinned 3 cups mushrooms fresh sliced 4 carrots peeled sliced 1/2” 3/4 cups long grain rice 1/2 c onion chopped 1 tsp poultry seasoning 1 tsp bouillon chicken granules 1/4 tsp salt In a large skillet, brown chicken in oil. Add chicken and rice; cook, stirring, until chicken is no longer pink. Add rice and broth; bring to a boil. Reduce heat to low; cover and cook 5 minutes or until rice is tender and liquid is absorbed. Stir in rice. Heat to boiling; reduce heat. Cover and simmer until rice is tender, about 20 minutes. Stir in rice. Heat to boiling; reduce heat. Cover and simmer 20 minutes or until rice is tender and liquid is absorbed and rice is tender. Baseline In large skillet, heat rice and onion. Add carrots and mushrooms; cook, stirring constantly, until hot and bubbly. Stir in seasonings. Cook and stir over medium heat until hot and bubbly. Stir in chicken and salt. Cover and cook 5 minutes or until chicken is no longer pink in center. Serve over rice. Checklist Spray a 12-inch skillet with nonstick spray coating. Brown chicken pieces on all sides over medium heat about 15 minutes. Remove chicken. Drain fat from skillet, if neccessary. Add mushrooms, carrots, rice, onion, bouillon, poultry seasoning, 2 cups water, salt. Place chicken atop rice mixture. Cover; simmer 30 minutes or till chicken and rice are done. True recipe
  • 103. Example: chocolate covered potato chips Ingredients 8 oz sweet milk chocolate 24 thick sliced ridged potato chips Preheat oven to 350 degrees. Grease and flour a 9 x 13 inch pan. Bake at 350 degrees for 30 minutes. Baseline Bake in a 400 degree f oven for 15 minutes or until lightly browned. Drain on paper towels. Melt chocolate in top of double boiler over simmering water. Remove from heat and cool to room temp. Add potato mixture to the potato mixture; mix well. Cover tightly with plastic wrap. Refrigerate until firm enough to handle. Shape into croquettes. Fry in hot oil until golden brown. Drain on paper towels. Serve hot. Checklist Melt 6 ounces of chocolate. Remove from heat. While chocolate is still hot, mix the 2 ounces of chocolate into the melted chocolate, in 3 additions, stirring until each addition is completely melted before adding the next. Dip the potato chips, 1 at a time, in the chocolate. Coat completely and lift with a small fork. Shake off excess chocolate by rapping the fork on the edge of the bowl lightly. Remove any drips from the bottom by running the fork across the edge of the bowl. Slide the chips onto a cookie sheet lined with parchment or wax paper. Allow to cool until solid. Let chips sit at room temperature or in the refrigerator. True recipe
  • 104. Baselines • Encoder-Decoder baseline (EncDec) – Encode title and ingredients as sequence, decode recipe • Attention model baseline – Use attention embedding to ingredient list during output computation at each step
  • 105. Human evaluation • Amazon Mechanical Turk 0 1 2 3 4 5 Ingredient use EncDec baseline Checklist True recipes
  • 106. Human evaluation • Amazon Mechanical Turk 0 1 2 3 4 5 Ingredient use Follows goal EncDec baseline Checklist True recipes
  • 107. Human evaluation • Amazon Mechanical Turk 0 1 2 3 4 5 Grammar Ingredient use Follows goal EncDec baseline Checklist True recipes
  • 108. Natural vs. Neural Language (on dev set) Human Recipes Checklist NN Enc-Dec RNN |V| = 5206 33% of |V| 16% of |V|
  • 109. Neural checklist model for dialogue generation • Dialogue system output generation – Hotel & restaurant informational systems – Wen et al. 2015 – inform{name(Hotel Stratford),has_internet(no)} • “Hotel Stratford does not have internet.”
  • 110. Examples inform(name=piperade;good_for_meal=dinner;food=basque) piperade is a basque restaurant that is good for dinner. inform(name='red door cafe';good_for_meal=breakfast;area='cathedral hill';kids_allowed=no) the red door cafe is good for breakfast, does not allow children and is in the cathedral hill area. inform(name='manna';area='hayes valley or inset';address='845 irving street') manna is in the hayes valley or inset area. the address is 845 irving street. ?select(dogs_allowed='yes';dogs_allowed='no') would you like a hotel that allows dogs?
  • 112. Restaurant results 44 43 74 54 78 54 0 25 50 75 100 BLEU METEOR Handcrafted baseline SC-LSTM Checklist
  • 113. Dynamic Entity Representations in Neural Language Models Yangfeng Ji et al. @ EMNLP 2017
  • 114. Static vs Dynamic Entity Representations • Human: what is your job? • Machine: i’m a lawyer • Human: what do you do? • Machine: i’m a doctor I am a lawyer Example from Paperno et al. 2016 … I am a doctor …
  • 115. Neural Process Network Antoine Bosselut et al. (In Preparation)
  • 116. What’s missing in the end-to-end… • Title: deep-fried cauliflower • Ingredients: cauliflower, frying oil, sauce, salt, pepper. è Wash and dry the cauliflower. Heat the oil in a skillet and fry the sauce until they are golden brown. Drain on paper towels. Add the sauce to the sauce and mix well. Serve hot or cold.
  • 117. Game Plan - Recap 1. Commonsense Frame Semantics – naive physics – Social commonsense 2. Modeling the World, not just Language – Dynamic entities – Action causalities Choose Action: slice Choose Entity: tomato Read: Slice the tomatoes State Changes: shape separated Choose Action: bake Choose Entity: <IMPLICIT> Read: Bake in pan for 20 minutes. State Changes: cookedness cooked temperature hot location oven Read: <END_RECIPE> Action Causality
  • 120. Toward Intelligent Communication 1. Commonsense Frame Semantics – naive physics – Social commonsense – Packing knowledge into lexical/frame semantics 2. Modeling the World, not just Language – Dynamic entities – Action causalities
  • 122. “X stabbing Y with Z” agent theme writer reader= = = Connotation Frame Perspective(wàX) = The writer considers X guilty. Effect(Y) = What happenedis bad for Y. Value(Y) = Y is inherently valuable. VerbPhysics Y < Z in RIGID It’s not possible to stab someone with a cheeseburger.