Dr. Catherine Havasi's keynote talk from the AI Community Conference on Natural Language Processing (by NYAI.co) on Thurs, Jun 27th 2019 at Moody's Analytics.
Sponsored by Moody's Analytics, NYU Tandon Future Lab, NYAI.co
For more information & the full talk video, please visit nyai.co
6. We have built models of how people think about the world in 73
languages – called ConceptNet.
7. Languages in ConceptNet: multilingual coverage
English: 6.5 million edges
French: 4.9 million edges
German: 1.6 million edges
Italian: 1.1 million edges
Spanish: 830k edges
Japanese: 740k edges
Russian: 620k edges
Portuguese: 540k edges
Chinese: 500k edges
Finnish: 420k edges
Dutch: 400k edges
Swedish: 300k edges
…plus bg, pl, cs, sh, eo, ms, sl, ar, and more
Total: 24.6 million edges in 70+ languages
[Sources slide truncated in the transcript; the legible fragments mention common-sense knowledge resources and a multilingual WordNet.]
Open code
At http://conceptnet:
• Code on GitHub
• A browsable Web interface
• A Linked Data REST API
All data is available under a Creative Commons Attribution-ShareAlike 4.0 license.
8. Photo: Steve Hopson, CC-By
But then things moved on without us –
users changed how they interacted with search engines
9.
10. We found a new use of this data:
integrating into machine learning to
facilitate adaptability
Photo by: Chris Rodley
11. “I don’t have to actually experience crashing
my car into a wall a few hundred times before I
slowly start avoiding to do so.”
- Andrej Karpathy, OpenAI
17. Retrofitting
• Created by Manaal Faruqui in 2015
• Apply knowledge-based constraints after training
distributional word vectors
• Applying the constraints after training works better than applying them during training, for reasons that aren’t fully understood
18. Retrofitting
• Terms that are connected in the knowledge graph should
have vectors that are closer together
• Many extensions now, such as “antonyms should be farther
apart” (Mrkšić et al., 2016)
[Diagram: word vectors for “oak”, “tree”, and “furniture”]
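The idea above can be sketched in a few lines of NumPy. This is a simplified, hypothetical rendering of the Faruqui et al. (2015) update with uniform edge weights; the function name, the toy vocabulary, and the `alpha` anchoring weight are all illustrative, not the authors’ actual code.

```python
import numpy as np

def retrofit(vectors, edges, alpha=1.0, iters=10):
    """Pull each word's vector toward its knowledge-graph neighbors while
    staying anchored (weight alpha) to its original distributional vector."""
    new = {w: v.astype(float).copy() for w, v in vectors.items()}
    neighbors = {w: [] for w in vectors}
    for a, b in edges:
        if a in neighbors and b in neighbors:
            neighbors[a].append(b)
            neighbors[b].append(a)
    for _ in range(iters):
        for w, nbrs in neighbors.items():
            if not nbrs:
                continue  # no graph information: keep the original vector
            # average of the original vector and the current neighbor vectors
            new[w] = (alpha * vectors[w] + sum(new[n] for n in nbrs)) / (alpha + len(nbrs))
    return new

# Toy example: "oak" and "tree" are linked in the graph, "sofa" is not.
vecs = {"oak": np.array([1.0, 0.0]),
        "tree": np.array([0.0, 1.0]),
        "sofa": np.array([-1.0, 0.5])}
fitted = retrofit(vecs, edges=[("oak", "tree")])
```

After retrofitting, the connected pair ends up closer together than before, while the unconnected word is untouched — which is exactly the constraint the slide states.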
19. Retrofitting just works
• On intrinsic evaluations, the top-performing systems almost
always use retrofitting
– If you see a purely distributional algorithm claim “state of
the art on SimLex”, it may be “state of the art assuming
no knowledge graph”
20. • State-of-the-art word vectors
• Hybrid of ConceptNet and distributional
semantics
• Multilingual by design
• Open source, open data
22. Photo by: David Lapetina CC BY-SA 3.0.
In order to beat a human
player at chess, Google’s
AlphaZero had to play
68 million games against
itself.
23. You cannot simulate your call center
calling itself 68 million times.
Photo by: Nebiyu.s CC BY-SA 4.0.
24. What is domain adaptation?
• Domain-specific data: customer intents, product names, industry jargon, specific issues
• Domain-general data: common words, multiple languages, paraphrases, general sentiment
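One concrete way to picture the gap: a domain-specific term like a product name has no vector in the general-domain model at all. A common back-off (a generic illustration, not necessarily the approach described in the talk) is to give the unseen term the average of the general vectors of the words it co-occurs with; the term list and tiny vectors below are made up.

```python
import numpy as np

# Hypothetical general-domain vectors (tiny, for illustration only).
general = {"reset": np.array([1.0, 0.0]),
           "router": np.array([0.0, 1.0]),
           "password": np.array([0.5, 0.5])}

def embed_oov(context_words, vectors):
    """Back-off for a domain term the general model has never seen:
    average the general vectors of the words it co-occurs with."""
    known = [vectors[w] for w in context_words if w in vectors]
    return np.mean(known, axis=0) if known else None

# A made-up product name appears near these support-ticket words:
product_vec = embed_oov(["reset", "router", "password"], general)
```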
27. What are other examples of transfer learning?
• Pretraining
• Fine-tuning and layer freezing for ELMo and BERT (and GPT-2)
• Fast.ai’s ULMFiT (http://nlp.fast.ai/)
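The layer-freezing idea can be sketched without any deep-learning framework: hold a “pretrained” body fixed and take gradient steps only on a small task head. Everything here — the shapes, the learning rate, the toy batch — is made up for illustration; it is not the ELMo/BERT fine-tuning API.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for a pretrained encoder: frozen, i.e. never updated below.
W_body = rng.normal(size=(4, 8))
# Fresh task-specific head: the only trainable parameters.
W_head = np.zeros((8, 2))

def features(x):
    return np.maximum(x @ W_body, 0.0)  # frozen ReLU feature extractor

def loss(x, y, head):
    return 0.5 * np.sum((features(x) @ head - y) ** 2)

x = rng.normal(size=(3, 4))  # a tiny batch of "domain" examples
y = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]])

before = loss(x, y, W_head)
for _ in range(200):  # gradient descent on the head only; W_body never moves
    h = features(x)
    W_head -= 0.005 * h.T @ (h @ W_head - y)
after = loss(x, y, W_head)
```

Because only `W_head` is updated, the general-purpose features survive fine-tuning intact — the property that makes freezing attractive when domain data is scarce.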
30. Could you train networks to modify
networks?
Work Credit: Pedro Colon
31. John Hewitt and Christopher D. Manning: A Structural Probe for Finding Syntax in Word Representations
Andy Coenen, Emily Reif, Ann Yuan, Been Kim, Adam Pearce, Fernanda Viégas, Martin Wattenberg: Visualizing and Measuring the Geometry of BERT
40. One Size Does Not Fit All:
Language
Domain
Expertise
Customer Journey Stage
Engagement Goal
41. In 2017, the NYTimes reported Icelanders bemoaning a
decrease in younger people speaking Icelandic because
“voice activated” systems didn’t speak it.
Image Credit: Andreas Tille, CC-by-SA. NYTimes: Icelanders Seek to Keep Their Language Alive and Out of ‘the Latin Bin’
61. We are creating a new type of media.
Just for you.
We are building an engine for creators and brands to build personalized
experiences at scale.
Image Credit: Anna Dziubinska CC-by-SA
62. Let’s make a promise to our
users about their data.
63. 51% of consumers expect that companies will
anticipate their needs and make relevant
suggestions – even if they’ve never bought
from the brand before.
Source: Salesforce
64. 80% of shoppers are more likely to frequent and buy
from stores with more personalized experiences.
Source: Epsilon
65. Federated or Split Learning
Source: Split learning for health: Distributed deep learning without sharing raw patient data, Praneeth