SlideShare a Scribd company logo
Deep Learning for
Semantic Relation
Classification
Sneha Rajana
Amazon
Natural Language Understanding
Natural Language Understanding
Human language is a rich, varied, and
growing source of information
Challenges of NLU
• Multiple words with same meanings (Synonyms)
• Words with multiple meanings (polysemy) some of
which are entirely opposite in nature (auto-
antonyms)
• Words which behave differently when used as noun
and verb
Challenges of NLU
• Multiple words with same meanings (Synonyms)
• Words with multiple meanings (polysemy) some of
which are entirely opposite in nature (auto-
antonyms)
• Words which behave differently when used as noun
and verb
Hot water/Cool water
Hot topic/Cool topic
Challenges of NLU
• Multiple words with same meanings (Synonyms)
• Words with multiple meanings (polysemy) some of
which are entirely opposite in nature (auto-
antonyms)
• Words which behave differently when used as noun
and verb
Words make sense contextually in natural
language which humans can comprehend
and distinguish easily, but machines can’t
Recognizing various semantic
relations between entity pairs in
sentences is an important task in
Natural Language Processing (NLP).
Lexical Semantic Relations
hot/cold
good/bad
friend/enemy
enemy/fan
cold/lukewarm
ascend/slip
up/down
spirited/spiritless
cool/not cool
honest/dishonest
penguin/clown
cold/chilly
boat/rudder
agree/disagree
absence/presence
acceptable/intolerable
hello/fine
voice/silent
love/hate
like/love
mouse/animal
nose/human
happy/joy
Lexical Semantic Relations
hot/cold
good/bad
friend/enemy
enemy/fan
cold/lukewarm
ascend/slip
up/down
spirited/spiritless
cool/not cool
honest/dishonest
penguin/clown
cold/chilly
boat/rudder
agree/disagree
absence/presence
acceptable/intolerable
hello/fine
voice/silent
love/hate
like/love
mouse/animal
nose/human
happy/joy
hypernym
meronym
synonym
Antonyms: Semantically related but not
semantically similar!
Possible Applications
What are Antonyms?
hot/cold
good/bad
friend/enemy
enemy/fan
cold/lukewarm
ascend/slip
up/down
spirited/spiritless
cool/not cool
honest/dishonest
penguin/clown
cold/chilly
boat/rudder
agree/disagree
absence/presence
acceptable/intolerable
hello/fine
voice/silent
love/hate
like/love
mouse/animal
nose/human
happy/joy
What are Antonyms?
hot/cold
good/bad
friend/enemy
enemy/fan
cold/lukewarm
ascend/slip
up/down
spirited/spiritless
cool/not cool
honest/dishonest
penguin/clown
cold/chilly
boat/rudder
agree/disagree
absence/presence
acceptable/intolerable
hello/fine
voice/silent
love/hate
like/love
mouse/animal
nose/human
happy/joy
strongly antonymous
not antonymous
semantically contrasting
Goal: Antonym Detection
Given two terms x and y, decide whether x
and y are antonyms of each other
Main Contributions:
• Learning antonyms with paraphrases
• Learning antonyms with a morphology-aware neural network
University of Pennsylvania
Possible Applications
Possible Applications
Deriving Antonyms from
Paraphrases
not allowed in here ~ not permitted
did not plan ~ had no intention
never mind about that ~ it matters not
Phrases expressing the same meaning usually
occurring in similar textual contexts or have
common translations in other languages
PPDB: The Paraphrase Database
An automatically extracted database containing
millions of paraphrases
• 22 different languages
• ~100M word and phrase pairs
• Big and noisy
• Currently the largest available collection of paraphrases
PPDB: The Paraphrase Database
An automatically extracted database containing
millions of paraphrases
Step 1: WordNet Seed Set
Direct
antonyms
E.g. clean/dirty
Indirect
antonyms
E.g. clean/foul
E.g. clean/grime
WORDNET
A large lexical English
database
Nouns, verbs, adjectives,
adverbs are grouped
into sets of cognitive
synonyms or sunsets
Synsets
Step 2: Antonyms from
Paraphrases
Negating word
(Not happy, unhappy)
-> (happy, unhappy)
Negating prefix
(unjustifiable,
unreasonable)
-> (justifiable,
unreasonable)
Used PPDB to retrieve
paraphrase mappings of
2 types
Negating word
(Not X, Y)
-> (X, Y)
Negating prefix
(Neg-Prefix(X), Y)
-> (X, Y)
Step 3: Indirect Antonyms
via Expansion
-> (happy, unhappy)
-> (happy, synsets(unhappy))
-> (synsets(happy), unhappy)
-> (justifiable, unreasonable)
-> (justifiable,
paraphrases(unreasonable))
-> (paraphrases(justifiable),
unreasonable)
(~X, Y)
-> (X, Y)
-> (X, synonyms(Y))
-> (synonyms(X), Y)
Synsets from
WordNet and
paraphrases from
PPDB
Antonym Generation
Direct Antonyms Indirect Antonyms
clean/dirty clean/foul
rise/fall rise/downfall
sleep/wake sleep/rise
above/below above/under
Paraphrase Pair Antonyms Pair
deactivated/turned off activated/turned off
unjustifiable/unreasonable justifiable/unreasonable
deforestation/destruction forestation/destruction
anti-hatred/non-hatred hatred/non-hatred
Paraphrase Pair Antonym Pair
not true/untrue true/untrue
not identical/different identical/different
not acceptable/objectionable acceptable/objectionable
not sufficient/insufficient sufficient/insufficient
WordNet expansion
Removal of negating word Removal of negating prefix
Antonyms derived from PPDB
0
22500
45000
67500
90000
Wordnet (direct) WordNet (indirect) (X,Y) from (~X,Y) Synset Expansion Paraphrase Expansion
81,221
35,686
80,669
14,9693,337
Number of unique antonyms generated
Classification of non-antonyms
Unrelated
long/rare
much/worthless
disability/present/
equality/gap
Paraphrases
simply/merely
correct/that’s right
till/until
right/alright
Other
twinkle/dark
access/available
valuable/premium
naw/not gonna
Entailment
valid/equally
valid
significant/
statistically
Categories
Africa/Asia
Jan/Feb
Black/Red
Blonde/
Brunette
Learning Antonyms with
Paraphrases and a Morphology-
aware Neural Network
*Sem 2017, Vancouver, Canada
Sneha Rajana*, Chris Callison-Burch*, Marianna
Appidianaki* 𝛹, Vered Shwartzϕ
*Computer and Information Science Department, University of
Pennsylvania, USA
𝛹LIMSI, CNRS, University Paris-Saclay, 91403 Orsay
ϕComputer Science Department, Bar-Ilan University, Israel
Background
• Prior work: Path-based, Distributional
• Integrated neural path-based (improved path-
based) and distributional method for detecting
Hypernymy - HypeNET [Vered et al., 2015]
• Integrated neural path-based (improved path-
based) and distributional method for detecting
multiple semantic relations - LexNET [Vered et al.,
2016]
Distributional Approach
Recognize the relation between x and y based on
their separate occurrences in the corpus
Distributional Hypothesis
Words that occur in similar contexts have similar meanings
Using x and y's word embeddings [Mikolov et al., 2013,
Pennington et al. 2014] as distributional vector representations
Supervised Distributional
Methods
• Represent (x, y) as a feature vector, based on the
term’s embeddings
• Train a classifier to predict whether y is a <relation>
of x
Concatenation[Baroni et al. 2012]
x + y
They don’t learn the relation between x and y, but mostly that is a
prototypical relation!
E.g. (x, fruit), (x, animal) are always hypernyms
Path-based Approach
Recognize the relation between x and y based on
their joint occurrences in the corpus
Hearst Patterns [Hearst, 1992]
Patterns connecting x and y may indicate
that x is a <relation> of y
X is a Y (Hypernym)
Neither X nor Y
(Antonym)
Patterns can be represented using
dependency paths
Supervised Path-based Method
• Features: all dependency paths that connect x and
y in a corpus
• Supervised: Labelled training data (word pairs)
• Trained a logistic regression classifier to predict a
relation
Feature space is too sparse!
Similar paths share no information
X inc. is a Y, X group is a Y, X organization is a Y
Neural path-based method
HypeNET
• Split each path between X and Y into edges
• Each edge consists of 4 components: lemma/POS/
dependency label/direction
• Learn embedding vectors for each component
LSTM LSTM LSTM LSTM
Neural path-based method
• Feed the edges sequentially to an LSTM
• Use the last output vector as the path embedding
• The LSTM may focus on edges that are more informative or
the classification task, while ignoring others
Neural path-based method
• The LSTM encodes a single path
• Each pair of terms occurs in multiple paths
• Represent a term-pair as its averaged path embedding
• Classify for hypernym (or other lexical relationship)
LSTM LSTM LSTM LSTM
LexNET: Multiple Semantic
Relations
• LexNET: An extension of HypeNET to classify
multiple semantic relations (E.g. meronymy,
synonymy, antonymy etc.)
•
Term-pair Classification
• Screenshot
AntNET
• Variant of LexNET
• Morphology aware path features
• Handles multi-word expressions
Improvement
X/NOUN/pobj/^/1 alongside/ADP/prep/V/0 non-negated(Y)/NOUN/conj/</2
LexNET
AntNET
non-negated(Y)/NOUN/conj/</2
lemma/POS/dep/direction
non-negated(lemma)/POS/dep/direction/neg
Replacement of word
embeddings
• Rare Paths: neither happy nor sad vs.
neither happy nor unhappy
• Seemingly negated words: valuable -
invaluable
• Multi-Word Expressions: not happy
AntNET: Network Architecture
• Screenshot
Term-Pair Classification (Binary or Multiclass)
Integrated Model
• Add distributional information with path information
• Concatenate x and y’s word embeddings to the averaged path
• Classify for antonymy (integrated network)
• dd
• dd
• dd
•
Experiments
Corpus and Dataset
Knowledge
resources
WikiPedia dump
English
May 2015
GloVe: Global
Vectors for Word
Representation
Unsupervised learning
algorithm for obtaining
vector representation of
words
Computed paths between
the most frequent
unigrams, bigrams, and
trigrams in Wikipedia
based on GloVe
vocabulary and the most
frequent 100K bigrams
and trigrams.
GloVe Embeddings
Used pre-trained word
embeddings of 50, 100,
and 200 dimensions
Vocabulary
PPDB words that were
contained in the most
common 400k words and
the most common 100k
bigrams and trigrams in
Wikipedia
Dataset
Generated from PPDB
Size so far: ~4000 pairs
Train/Test/Validation:
70/25/5
AntNET: Results
Metric Model Binary Multiclass
Precision
Path-based
Combined
0.732
0.803
0.652
0.746
Recall
Path-based
Combined
0.724
0.788
0.687
0.757
F1
Path-based
Combined
0.713
0.802**
0.661
0.746**
paired t-test, *p<0.1, **p<0.05
Effect of the negation-marking
feature
0.72
0.743
0.765
0.788
0.81
LexNET AntNET-neg AntNET AntNET-distance
0.734
0.746
0.7400.738
0.788
0.802
0.793
0.788
Binary Multiclass
Performance(F1Score)
lemma/POS/dep/
direction
lemma/POS/dep/
direction/neg
non-negated(lemma)/
POS/dep/direction/neg
non-negated(lemma)/
POS/dep/distance/neg
AntNET: Evaluation
0
0.225
0.45
0.675
0.9
Majority Class Word Embedding + SVM LexNET AntNET
0.750.74
0.34
0.30
0.800.79
0.44
0.39
Binary Multiclass
Performance(F1Score)
AntNET: Evaluation
Normalized Confusion Matrix
AntNET: Evaluation
absence-presence
absolute-relative
unfashionable-
fashionable
duck-stand up
imperviousness-
perviousness
ascertain-unclear
spiritless-spirited
ripe-rotten
turn-straight
cisc-risc
sawtoothed-toothless
interchange-unaltered
polite-sassy
black-white
large-minimum
indeterminate-influence
pear shaped - square
appropriately-ghastly
salutary-scary
irrelevant-discipline
T
T
F
F
gold
predicted
Artifacts
Code and Data
https://github.com/srajana/AntNET
Publication
http://www.aclweb.org/anthology/S/S17/S17-1002.pdf
Improvements
• In recent years, SOTA performance has been achieved using neural
models by incorporating lexical and syntactic features such as POS tags
and dependency trees.
• Although syntactic features are no doubt helpful, a known challenge is
that parsers are not available for every language, and even when
available, they may not be sufficiently robust, especially for out-of-domain
text, which may even hurt performance
• Recently, the NLP community has seen excitement around neural models
that make heavy use of pre-training based on language modeling
• Without using any external features, a simple BERT-based model can
achieve SOTA performance for Relation Extraction and Semantic Role
Labeling [Shi et. al. 2019, You et. al. 2019].
BERT-based models for
Multi-way classification of semantic
relations (SemEval)
The task is, given a sentence and two tagged nominals,
to predict the relation between those nominals and the
direction of the relation.
Model F1 Score
Matching-the-Blanks (Baldini Soares
et al., 2019) 89.5
R-BERT (Wu et al. 2019)
89.25
Multi-Attention CNN (Wang et al. 2016) 88.0
Entity Attention Bi-LSTM (Lee et al.,
2019) - RNN-based Model
85.2
Thank You!
• Questions? Email srajana@amazon.com
• Twitter: @sneha_rajana
• Medium: @sneharajana
• LinkedIn: www.linkedin.com/in/sneha-rajana

More Related Content

What's hot

Introduction to development of lexical databases
Introduction to development of lexical databasesIntroduction to development of lexical databases
Introduction to development of lexical databases
Muhammad Shoaib Chaudhary
 
Nlp (1)
Nlp (1)Nlp (1)
NLP_KASHK:POS Tagging
NLP_KASHK:POS TaggingNLP_KASHK:POS Tagging
NLP_KASHK:POS Tagging
Hemantha Kulathilake
 
OWL briefing
OWL briefingOWL briefing
OWL briefing
Frank van Harmelen
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...
Rajnish Raj
 
NLP
NLPNLP
Formal languages
Formal languagesFormal languages
Formal languages
Vestforsk.no
 
Lfg and gpsg
Lfg and gpsgLfg and gpsg
Lfg and gpsg
SubramanianMuthusamy3
 
Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignmentGuus Schreiber
 
Phrase structure grammar
Phrase structure grammarPhrase structure grammar
Phrase structure grammar
SubramanianMuthusamy3
 
Word sense dissambiguation
Word sense dissambiguationWord sense dissambiguation
Word sense dissambiguation
Ashwin Perti
 
NLP_KASHK:Context-Free Grammar for English
NLP_KASHK:Context-Free Grammar for EnglishNLP_KASHK:Context-Free Grammar for English
NLP_KASHK:Context-Free Grammar for English
Hemantha Kulathilake
 
Corpora analysis bruno natalia sarah
Corpora analysis   bruno natalia sarahCorpora analysis   bruno natalia sarah
Corpora analysis bruno natalia sarah
Bruno Bruno Bruno Bruno
 
NLP_KASHK:Morphology
NLP_KASHK:MorphologyNLP_KASHK:Morphology
NLP_KASHK:Morphology
Hemantha Kulathilake
 
Amharic WSD using WordNet
Amharic WSD using WordNetAmharic WSD using WordNet
Amharic WSD using WordNet
Seid Hassen
 
Modular Ontologies: Package-based Description Logics Approach
Modular Ontologies: Package-based Description Logics ApproachModular Ontologies: Package-based Description Logics Approach
Modular Ontologies: Package-based Description Logics ApproachJie Bao
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Taggingtheyaseen51
 
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
Seth Grimes
 

What's hot (20)

Introduction to development of lexical databases
Introduction to development of lexical databasesIntroduction to development of lexical databases
Introduction to development of lexical databases
 
Nlp (1)
Nlp (1)Nlp (1)
Nlp (1)
 
NLP_KASHK:POS Tagging
NLP_KASHK:POS TaggingNLP_KASHK:POS Tagging
NLP_KASHK:POS Tagging
 
OWL briefing
OWL briefingOWL briefing
OWL briefing
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...
 
NLP
NLPNLP
NLP
 
Ontology matching
Ontology matchingOntology matching
Ontology matching
 
Formal languages
Formal languagesFormal languages
Formal languages
 
Lfg and gpsg
Lfg and gpsgLfg and gpsg
Lfg and gpsg
 
NLTK
NLTKNLTK
NLTK
 
Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
 
Phrase structure grammar
Phrase structure grammarPhrase structure grammar
Phrase structure grammar
 
Word sense dissambiguation
Word sense dissambiguationWord sense dissambiguation
Word sense dissambiguation
 
NLP_KASHK:Context-Free Grammar for English
NLP_KASHK:Context-Free Grammar for EnglishNLP_KASHK:Context-Free Grammar for English
NLP_KASHK:Context-Free Grammar for English
 
Corpora analysis bruno natalia sarah
Corpora analysis   bruno natalia sarahCorpora analysis   bruno natalia sarah
Corpora analysis bruno natalia sarah
 
NLP_KASHK:Morphology
NLP_KASHK:MorphologyNLP_KASHK:Morphology
NLP_KASHK:Morphology
 
Amharic WSD using WordNet
Amharic WSD using WordNetAmharic WSD using WordNet
Amharic WSD using WordNet
 
Modular Ontologies: Package-based Description Logics Approach
Modular Ontologies: Package-based Description Logics ApproachModular Ontologies: Package-based Description Logics Approach
Modular Ontologies: Package-based Description Logics Approach
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Tagging
 
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
 

Similar to Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks

An Improved Approach to Word Sense Disambiguation
An Improved Approach to Word Sense DisambiguationAn Improved Approach to Word Sense Disambiguation
An Improved Approach to Word Sense DisambiguationSurabhi Verma
 
WORDNET: A Database of Lexical Relations
WORDNET: A Database of Lexical RelationsWORDNET: A Database of Lexical Relations
WORDNET: A Database of Lexical Relations
Ahmed Abd-Elwasaa
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Toine Bogers
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Saurabh Kaushik
 
DETECTING OXYMORON IN A SINGLE STATEMENT
DETECTING OXYMORON IN A SINGLE STATEMENTDETECTING OXYMORON IN A SINGLE STATEMENT
DETECTING OXYMORON IN A SINGLE STATEMENT
WarNik Chow
 
Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaning
fawzia
 
Word sense disambiguation using wsd specific wordnet of polysemy words
Word sense disambiguation using wsd specific wordnet of polysemy wordsWord sense disambiguation using wsd specific wordnet of polysemy words
Word sense disambiguation using wsd specific wordnet of polysemy words
ijnlc
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word Embeddings
Roelof Pieters
 
Principles of parameters
Principles of parametersPrinciples of parameters
Principles of parametersVelnar
 
Anaphora resolution in hindi language using gazetteer method
Anaphora resolution in hindi language using gazetteer methodAnaphora resolution in hindi language using gazetteer method
Anaphora resolution in hindi language using gazetteer method
ijcsa
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Saurabh Kaushik
 
Sentence Processing by Muhammad Saleem.pptx
Sentence Processing by Muhammad Saleem.pptxSentence Processing by Muhammad Saleem.pptx
Sentence Processing by Muhammad Saleem.pptx
E&S Education Department, KP
 
Svetlin Nakov - ArtsSemNet: From Bilingual Dictionary to Bilingual Semantic N...
Svetlin Nakov - ArtsSemNet: From Bilingual Dictionary to Bilingual Semantic N...Svetlin Nakov - ArtsSemNet: From Bilingual Dictionary to Bilingual Semantic N...
Svetlin Nakov - ArtsSemNet: From Bilingual Dictionary to Bilingual Semantic N...
Svetlin Nakov
 
ArtsSemNet - Thessaloniki Presentation
ArtsSemNet - Thessaloniki PresentationArtsSemNet - Thessaloniki Presentation
ArtsSemNet - Thessaloniki PresentationSvetlin Nakov
 
Introduction to Distributional Semantics
Introduction to Distributional SemanticsIntroduction to Distributional Semantics
Introduction to Distributional SemanticsAndre Freitas
 

Similar to Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks (20)

An Improved Approach to Word Sense Disambiguation
An Improved Approach to Word Sense DisambiguationAn Improved Approach to Word Sense Disambiguation
An Improved Approach to Word Sense Disambiguation
 
WORDNET: A Database of Lexical Relations
WORDNET: A Database of Lexical RelationsWORDNET: A Database of Lexical Relations
WORDNET: A Database of Lexical Relations
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Class14
Class14Class14
Class14
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
DETECTING OXYMORON IN A SINGLE STATEMENT
DETECTING OXYMORON IN A SINGLE STATEMENTDETECTING OXYMORON IN A SINGLE STATEMENT
DETECTING OXYMORON IN A SINGLE STATEMENT
 
Word Net
Word NetWord Net
Word Net
 
Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaning
 
Word sense disambiguation using wsd specific wordnet of polysemy words
Word sense disambiguation using wsd specific wordnet of polysemy wordsWord sense disambiguation using wsd specific wordnet of polysemy words
Word sense disambiguation using wsd specific wordnet of polysemy words
 
Exempler approach
Exempler approachExempler approach
Exempler approach
 
Syntax
SyntaxSyntax
Syntax
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word Embeddings
 
Principles of parameters
Principles of parametersPrinciples of parameters
Principles of parameters
 
NLP
NLPNLP
NLP
 
Anaphora resolution in hindi language using gazetteer method
Anaphora resolution in hindi language using gazetteer methodAnaphora resolution in hindi language using gazetteer method
Anaphora resolution in hindi language using gazetteer method
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
 
Sentence Processing by Muhammad Saleem.pptx
Sentence Processing by Muhammad Saleem.pptxSentence Processing by Muhammad Saleem.pptx
Sentence Processing by Muhammad Saleem.pptx
 
Svetlin Nakov - ArtsSemNet: From Bilingual Dictionary to Bilingual Semantic N...
Svetlin Nakov - ArtsSemNet: From Bilingual Dictionary to Bilingual Semantic N...Svetlin Nakov - ArtsSemNet: From Bilingual Dictionary to Bilingual Semantic N...
Svetlin Nakov - ArtsSemNet: From Bilingual Dictionary to Bilingual Semantic N...
 
ArtsSemNet - Thessaloniki Presentation
ArtsSemNet - Thessaloniki PresentationArtsSemNet - Thessaloniki Presentation
ArtsSemNet - Thessaloniki Presentation
 
Introduction to Distributional Semantics
Introduction to Distributional SemanticsIntroduction to Distributional Semantics
Introduction to Distributional Semantics
 

More from MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
MLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
MLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
MLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
MLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
MLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
MLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
MLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
MLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
MLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
MLconf
 
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
MLconf
 

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
 

Recently uploaded

The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 

Recently uploaded (20)

The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 

Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks

  • 1. Deep Learning for Semantic Relation Classification Sneha Rajana Amazon
  • 3. Natural Language Understanding Human language is a rich, varied, and growing source of information
  • 4. Challenges of NLU • Multiple words with same meanings (Synonyms) • Words with multiple meanings (polysemy) some of which are entirely opposite in nature (auto- antonyms) • Words which behave differently when used as noun and verb
  • 5. Challenges of NLU • Multiple words with same meanings (Synonyms) • Words with multiple meanings (polysemy) some of which are entirely opposite in nature (auto- antonyms) • Words which behave differently when used as noun and verb Hot water/Cool water Hot topic/Cool topic
  • 6. Challenges of NLU • Multiple words with same meanings (Synonyms) • Words with multiple meanings (polysemy) some of which are entirely opposite in nature (auto- antonyms) • Words which behave differently when used as noun and verb Words make sense contextually in natural language which humans can comprehend and distinguish easily, but machines can’t
  • 7. Recognizing various semantic relations between entity pairs in sentences is an important task in Natural Language Processing (NLP).
  • 8. Lexical Semantic Relations hot/cold good/bad friend/enemy enemy/fan cold/lukewarm ascend/slip up/down spirited/spiritless cool/not cool honest/dishonest penguin/clown cold/chilly boat/rudder agree/disagree absence/presence acceptable/intolerable hello/fine voice/silent love/hate like/love mouse/animal nose/human happy/joy
  • 9. Lexical Semantic Relations hot/cold good/bad friend/enemy enemy/fan cold/lukewarm ascend/slip up/down spirited/spiritless cool/not cool honest/dishonest penguin/clown cold/chilly boat/rudder agree/disagree absence/presence acceptable/intolerable hello/fine voice/silent love/hate like/love mouse/animal nose/human happy/joy hypernym meronym synonym Antonyms: Semantically related but not semantically similar!
  • 11. What are Antonyms? hot/cold good/bad friend/enemy enemy/fan cold/lukewarm ascend/slip up/down spirited/spiritless cool/not cool honest/dishonest penguin/clown cold/chilly boat/rudder agree/disagree absence/presence acceptable/intolerable hello/fine voice/silent love/hate like/love mouse/animal nose/human happy/joy
  • 12. What are Antonyms? hot/cold good/bad friend/enemy enemy/fan cold/lukewarm ascend/slip up/down spirited/spiritless cool/not cool honest/dishonest penguin/clown cold/chilly boat/rudder agree/disagree absence/presence acceptable/intolerable hello/fine voice/silent love/hate like/love mouse/animal nose/human happy/joy strongly antonymous not antonymous semantically contrasting
  • 13. Goal: Antonym Detection Given two terms x and y, decide whether x and y are antonyms of each other Main Contributions: • Learning antonyms with paraphrases • Learning antonyms with a morphology-aware neural network University of Pennsylvania
  • 16. Deriving Antonyms from Paraphrases not allowed in here ~ not permitted did not plan ~ had no intention never mind about that ~ it matters not Phrases expressing the same meaning usually occurring in similar textual contexts or have common translations in other languages
  • 17. PPDB: The Paraphrase Database An automatically extracted database containing millions of paraphrases • 22 different languages • ~100M word and phrase pairs • Big and noisy • Currently the largest available collection of paraphrases
  • 18. PPDB: The Paraphrase Database An automatically extracted database containing millions of paraphrases
  • 19. Step 1: WordNet Seed Set Direct antonyms E.g. clean/dirty Indirect antonyms E.g. clean/foul E.g. clean/grime WORDNET A large lexical English database Nouns, verbs, adjectives, adverbs are grouped into sets of cognitive synonyms or sunsets Synsets
  • 20. Step 2: Antonyms from Paraphrases Negating word (Not happy, unhappy) -> (happy, unhappy) Negating prefix (unjustifiable, unreasonable) -> (justifiable, unreasonable) Used PPDB to retrieve paraphrase mappings of 2 types Negating word (Not X, Y) -> (X, Y) Negating prefix (Neg-Prefix(X), Y) -> (X, Y)
  • 21. Step 3: Indirect Antonyms via Expansion -> (happy, unhappy) -> (happy, synsets(unhappy)) -> (synsets(happy), unhappy) -> (justifiable, unreasonable) -> (justifiable, paraphrases(unreasonable)) -> (paraphrases(justifiable), unreasonable) (~X, Y) -> (X, Y) -> (X, synonyms(Y)) -> (synonyms(X), Y) Synsets from WordNet and paraphrases from PPDB
  • 22. Antonym Generation Direct Antonyms Indirect Antonyms clean/dirty clean/foul rise/fall rise/downfall sleep/wake sleep/rise above/below above/under Paraphrase Pair Antonyms Pair deactivated/turned off activated/turned off unjustifiable/unreasonable justifiable/unreasonable deforestation/destruction forestation/destruction anti-hatred/non-hatred hatred/non-hatred Paraphrase Pair Antonym Pair not true/untrue true/untrue not identical/different identical/different not acceptable/objectionable acceptable/objectionable not sufficient/insufficient sufficient/insufficient WordNet expansion Removal of negating word Removal of negating prefix
  • 23. Antonyms derived from PPDB 0 22500 45000 67500 90000 Wordnet (direct) WordNet (indirect) (X,Y) from (~X,Y) Synset Expansion Paraphrase Expansion 81,221 35,686 80,669 14,9693,337 Number of unique antonyms generated
  • 24. Classification of non-antonyms Unrelated long/rare much/worthless disability/present/ equality/gap Paraphrases simply/merely correct/that’s right till/until right/alright Other twinkle/dark access/available valuable/premium naw/not gonna Entailment valid/equally valid significant/ statistically Categories Africa/Asia Jan/Feb Black/Red Blonde/ Brunette
  • 25. Learning Antonyms with Paraphrases and a Morphology- aware Neural Network *Sem 2017, Vancouver, Canada Sneha Rajana*, Chris Callison-Burch*, Marianna Appidianaki* 𝛹, Vered Shwartzϕ *Computer and Information Science Department, University of Pennsylvania, USA 𝛹LIMSI, CNRS, University Paris-Saclay, 91403 Orsay ϕComputer Science Department, Bar-Ilan University, Israel
  • 26. Background • Prior work: Path-based, Distributional • Integrated neural path-based (improved path- based) and distributional method for detecting Hypernymy - HypeNET [Vered et al., 2015] • Integrated neural path-based (improved path- based) and distributional method for detecting multiple semantic relations - LexNET [Vered et al., 2016]
  • 27. Distributional Approach Recognize the relation between x and y based on their separate occurrences in the corpus Distributional Hypothesis Words that occur in similar contexts have similar meanings Using x and y's word embeddings [Mikolov et al., 2013, Pennington et al. 2014] as distributional vector representations
  • 28. Supervised Distributional Methods • Represent (x, y) as a feature vector, based on the term’s embeddings • Train a classifier to predict whether y is a <relation> of x Concatenation[Baroni et al. 2012] x + y They don’t learn the relation between x and y, but mostly that is a prototypical relation! E.g. (x, fruit), (x, animal) are always hypernyms
  • 29. Path-based Approach Recognize the relation between x and y based on their joint occurrences in the corpus Hearst Patterns [Hearst, 1992] Patterns connecting x and y may indicate that x is a <relation> of y X is a Y (Hypernym) Neither X nor Y (Antonym) Patterns can be represented using dependency paths
  • 30. Supervised Path-based Method • Features: all dependency paths that connect x and y in a corpus • Supervised: Labelled training data (word pairs) • Trained a logistic regression classifier to predict a relation Feature space is too sparse! Similar paths share no information X inc. is a Y, X group is a Y, X organization is a Y
  • 31. Neural path-based method HypeNET • Split each path between X and Y into edges • Each edge consists of 4 components: lemma/POS/ dependency label/direction • Learn embedding vectors for each component LSTM LSTM LSTM LSTM
  • 32. Neural path-based method • Feed the edges sequentially to an LSTM • Use the last output vector as the path embedding • The LSTM may focus on edges that are more informative or the classification task, while ignoring others
  • 33. Neural path-based method • The LSTM encodes a single path • Each pair of terms occurs in multiple paths • Represent a term-pair as its averaged path embedding • Classify for hypernym (or other lexical relationship) LSTM LSTM LSTM LSTM
  • 34. LexNET: Multiple Semantic Relations • LexNET: An extension of HypeNET to classify multiple semantic relations (E.g. meronymy, synonymy, antonymy etc.) •
  • 36. AntNET • Variant of LexNET • Morphology aware path features • Handles multi-word expressions
  • 38. Replacement of word embeddings • Rare Paths: neither happy nor sad vs. neither happy nor unhappy • Seemingly negated words: valuable - invaluable • Multi-Word Expressions: not happy
  • 39. AntNET: Network Architecture • Screenshot Term-Pair Classification (Binary or Multiclass)
  • 40. Integrated Model • Add distributional information with path information • Concatenate x and y’s word embeddings to the averaged path • Classify for antonymy (integrated network) • dd • dd • dd •
  • 42. Corpus and Dataset Knowledge resources WikiPedia dump English May 2015 GloVe: Global Vectors for Word Representation Unsupervised learning algorithm for obtaining vector representation of words Computed paths between the most frequent unigrams, bigrams, and trigrams in Wikipedia based on GloVe vocabulary and the most frequent 100K bigrams and trigrams. GloVe Embeddings Used pre-trained word embeddings of 50, 100, and 200 dimensions Vocabulary PPDB words that were contained in the most common 400k words and the most common 100k bigrams and trigrams in Wikipedia Dataset Generated from PPDB Size so far: ~4000 pairs Train/Test/Validation: 70/25/5
  • 43. AntNET: Results Metric Model Binary Multiclass Precision Path-based Combined 0.732 0.803 0.652 0.746 Recall Path-based Combined 0.724 0.788 0.687 0.757 F1 Path-based Combined 0.713 0.802** 0.661 0.746** paired t-test, *p<0.1, **p<0.05
  • 44. Effect of the negation-marking feature 0.72 0.743 0.765 0.788 0.81 LexNET AntNET-neg AntNET AntNET-distance 0.734 0.746 0.7400.738 0.788 0.802 0.793 0.788 Binary Multiclass Performance(F1Score) lemma/POS/dep/ direction lemma/POS/dep/ direction/neg non-negated(lemma)/ POS/dep/direction/neg non-negated(lemma)/ POS/dep/distance/neg
  • 45. AntNET: Evaluation 0 0.225 0.45 0.675 0.9 Majority Class Word Embedding + SVM LexNET AntNET 0.750.74 0.34 0.30 0.800.79 0.44 0.39 Binary Multiclass Performance(F1Score)
  • 49. Improvements • In recent years, SOTA performance has been achieved using neural models by incorporating lexical and syntactic features such as POS tags and dependency trees. • Although syntactic features are no doubt helpful, a known challenge is that parsers are not available for every language, and even when available, they may not be sufficiently robust, especially for out-of-domain text, which may even hurt performance • Recently, the NLP community has seen excitement around neural models that make heavy use of pre-training based on language modeling • Without using any external features, a simple BERT-based model can achieve SOTA performance for Relation Extraction and Semantic Role Labeling [Shi et. al. 2019, You et. al. 2019].
  • 50. BERT-based models for Multi-way classification of semantic relations (SemEval) The task is, given a sentence and two tagged nominals, to predict the relation between those nominals and the direction of the relation. Model F1 Score Matching-the-Blanks (Baldini Soares et al., 2019) 89.5 R-BERT (Wu et al. 2019) 89.25 Multi-Attention CNN (Wang et al. 2016) 88.0 Entity Attention Bi-LSTM (Lee et al., 2019) - RNN-based Model 85.2
  • 51. Thank You! • Questions? Email srajana@amazon.com • Twitter: @sneha_rajana • Medium: @sneharajana • LinkedIn: www.linkedin.com/in/sneha-rajana