Introduction to Natural Language Processing

An Introduction to
Machine Learning in
Natural Language Processing
The use of different techniques from computer science to
understand and manipulate human language and speech.
Isar Nejadgholi
isar@imrsv.ai

What do we want machines to
learn?
• Prediction or labeling
• Reasoning
• Understanding
• Language generation

LANGUAGE IS HARD
Irony and sarcasm
Elizabeth told Amanda
that she had a problem.
Reference resolution
The tank was full of water
I saw the military tank.
Lexical ambiguity
visiting relatives can
cause problems.
Syntactic (structural) ambiguity
I bought an iPhone a few
days ago. It was such a nice
phone. The touch screen
was really cool. The battery
life was not long, though and
my mother thought the
phone was too expensive.
Subjectivity of annotations
Tell me something
I don’t know.

APPLICATIONS
Natural
Language
Processing

Classical Natural Language Processing
Computational
Linguistics
Statistical and
Probabilistic
ML

TEXT REPRESENTATION IN CLASSICAL NLP
Term frequency vector
How to get a dense and informative representation?
… …

CLASSICAL NLP PIPELINE
• Frequency based representation
• Converting sparse representation to dense
vectors using SVD-based methods
• Statistical inference, probabilistic models,
similarity metrics
• Knowledge base of lexical rules and
word relations
• Character pattern matching

NLP Meets Deep Learning
Recent advances in NLP

DEEP LEARNING NLP MODEL
Embed, encode, attend, predict: The new deep learning formula for state-of-the-art NLP models
November 10, 2016, Matthew Honnibal

EMBED: WORD REPRESENTATION WITH WORD EMBEDDINGS
INPUT PROJECTION
OUTPUT
w(t)
w(t-2)
w(t+2)
w(t+1)
w(t-1)
Pretrained embeddings:
• Word2vec
• Glove
• Fasttext

REASONING WITH WORD VECTORS
Pair 1 Pair 2
China - Chinese Sri Lanka - Sri Lankan
Colombian - FARC Somalian - Alshabab
Roma - Hungarian Bahai - Iranian
Palestinian - Hamas Lebanon - Hezbollah
PRRA - Preremoval RPD - post hearing
Man is to woman as king is to ______?
Meaning(king) - meaning(man) + meaning(woman) = ?
(Mikolov et al. 2013)
Analogies in Legal Embedding

IMRSV WORD SUMMARIZER
Probabilistic PCA

ENCODE: SENTENCE REPRESENTATION WITH
RNNs
• Converts concatenation of word vectors (or bag of words) to a more meaningful sentence matrix
• Gets meaning of word in the context using memory and sequential modeling
• Can be bidirectional
• Popular models: GRU, LSTM, BiLSTM
• Can be used as decoder to generate text

Context vector or matrix
Predict
ATTEND: TEXT REPRESENTATION WITH
ATTENTION
• Reduces sentence matrix to sentence vector
• Without attention this conversion is done by averaging or max pooling
• Attention is a nonlinear weighted averaging of sentence matrix and context and learns what to keep

UNBALANCED AND MULTI-LABEL CLASSIFICATION PROBLEM

DATA PREPARATION
Cleaning
• Removing punctuations
• Autocorrect a set of critical words:
a$$ clowns -> ass clowns
sh!t -> shit
5hit -> shit
b l o o d y -> bloody
Augmentation with translation
• en -> fr -> en
• en -> de -> en
• en -> sp -> en

STRUCTURE
OF TOXICITY
CLASSIFIER

VIOLENCE IN ISIS FANBOY’S TWEETS

VIOLENT TWEETS OVER TIME
Explosion in Pakistan
ISIS commander killed
Explosion in Turkey
Numberofviolenttweets

WHERE DO I GET THE RIGHT DATA?
• Text is everywhere, but it is messy
• Volume, variety and velocity
• Annotation is subjective and expensive
Get messy!
Learn to clean!
Learn to augment!
Try to develop an intuition about your data!

Introduction to Natural Language Processing

Recommended

Recommended

More Related Content

Similar to Introduction to Natural Language Processing

Similar to Introduction to Natural Language Processing (20)

More from Jenny Midwinter

More from Jenny Midwinter (11)

Recently uploaded

Recently uploaded (20)

Introduction to Natural Language Processing