Word embeddings are used for several linguistic problems and NLP
applications, and their quality has improved considerably over the past few
years. However, embedding phrases so that their semantics remain consistent
with those of words has been challenging. We present a novel methodology using
Siamese deep neural networks to embed multi-word units and fine-tune the
current state-of-the-art word embeddings, keeping both in the same vector
space.
Vijay Prakash Dwivedi, CSED, MNNIT Allahabad, mail@vijaydwivedi.com.np
Manish Shrivastava, LTRC, IIIT Hyderabad, m.shrivastava@iiit.ac.in
Introduction
• Current compositional models build phrase embeddings mostly through
specific composition functions, with less emphasis on learning the
composition using deep learning.
• Our major contribution in this work is employing deep neural architectures to
transform the constituent word embeddings of a multi-word unit into its vector
representation.
Contribution
• The model is trained to predict the similarity between a word and a phrase,
leveraging a similarity dataset.
• We capture the sequence of words in a phrase as well as their interaction.
• The model learns to generate each phrase representation according to the
meaning of its closest single word.
The greater the semantic similarity between the word and the phrase, the closer
their similarity metric output is to 0; for dissimilar pairs it approaches 1.
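The output convention above (0 for similar, 1 for dissimilar) can be illustrated with a small sketch. The poster does not state which distance function is used, so the form below, 1 - exp(-L1 distance), is purely an assumption chosen because it is bounded and equals 0 for identical encodings; the function name `dissimilarity` is hypothetical.

```python
import math

def dissimilarity(h_word, h_phrase):
    """Map two equal-length encodings to a score in [0, 1).

    0 means identical encodings; values near 1 mean very different ones.
    Assumed form: 1 - exp(-L1 distance). The poster does not specify its metric.
    """
    if len(h_word) != len(h_phrase):
        raise ValueError("encodings must have the same length")
    l1 = sum(abs(a - b) for a, b in zip(h_word, h_phrase))
    return 1.0 - math.exp(-l1)

# Identical encodings give exactly 0; distant encodings approach 1.
same = dissimilarity([0.1, 0.2], [0.1, 0.2])
far = dissimilarity([0.0, 0.0], [5.0, 5.0])
```

Any bounded, monotone function of a distance would serve the same illustrative purpose.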
Approach
Input Word    Input Phrase      Output
remorse       deep regret       0
athletes      bring up          1
suez          the suez canal    0
payment       earth orbit       1
Table 1. Input-output samples. The output is the similarity metric output, i.e. 0 for similar and 1 for dissimilar.
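Samples in this input-output format could be assembled roughly as follows. This is only a sketch: the poster does not describe how dissimilar pairs are obtained, so pairing a word with a phrase drawn from a different entry is an assumption, and `build_pairs` is a hypothetical helper.

```python
import random

def build_pairs(paraphrases, seed=0):
    """Turn (word, phrase) paraphrase pairs into labelled samples.

    Returns (word, phrase, label) triples: 0 = similar, 1 = dissimilar.
    """
    if len(paraphrases) < 2:
        raise ValueError("need at least two entries to sample negatives")
    rng = random.Random(seed)
    samples = []
    for i, (word, phrase) in enumerate(paraphrases):
        samples.append((word, phrase, 0))
        # Assumption: a phrase taken from another entry counts as dissimilar.
        j = rng.choice([k for k in range(len(paraphrases)) if k != i])
        samples.append((word, paraphrases[j][1], 1))
    return samples

pairs = build_pairs([("remorse", "deep regret"), ("suez", "the suez canal")])
```

With only two entries, each negative is forced to use the other entry's phrase, so the result is deterministic regardless of the seed.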
Model Architecture: Siamese LSTM
Fig 1. Model architecture. The first input Xa is a single-word unit and the second input Xb is a multi-word unit.
We use 3 layers of stacked LSTMs in the Siamese network. The number of hidden
layers and the number of neurons were selected after repeated experiments;
this configuration gave the best results within our computational constraints.
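To make the weight sharing concrete, here is a minimal forward-pass sketch of a 3-layer stacked LSTM encoder applied to both inputs. It is not the authors' implementation: the dimensions are toy values, the weights are random rather than trained, and the helper names (`make_layer`, `lstm_layer`, `encode`) are hypothetical.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def make_layer(input_dim, hidden_dim, rng):
    """One LSTM layer: a weight row per gate unit over [x; h; 1]."""
    cols = input_dim + hidden_dim + 1  # +1 for the bias term
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
            for _ in range(4 * hidden_dim)]

def lstm_layer(weights, inputs, hidden_dim):
    """Run one LSTM layer over a sequence; return the hidden state per step."""
    n = hidden_dim
    h = [0.0] * n
    c = [0.0] * n
    outputs = []
    for x in inputs:
        z = list(x) + h + [1.0]
        pre = [sum(w * v for w, v in zip(row, z)) for row in weights]
        i = [sigmoid(p) for p in pre[0:n]]              # input gate
        f = [sigmoid(p) for p in pre[n:2 * n]]          # forget gate
        o = [sigmoid(p) for p in pre[2 * n:3 * n]]      # output gate
        g = [math.tanh(p) for p in pre[3 * n:4 * n]]    # candidate cell state
        c = [f[k] * c[k] + i[k] * g[k] for k in range(n)]
        h = [o[k] * math.tanh(c[k]) for k in range(n)]
        outputs.append(h)
    return outputs

def encode(layers, sequence, hidden_dim):
    """Pass a sequence of word vectors through the stacked layers."""
    states = sequence
    for weights in layers:
        states = lstm_layer(weights, states, hidden_dim)
    return states[-1]  # final hidden state of the top layer

rng = random.Random(0)
EMBED_DIM, HIDDEN_DIM = 4, 3  # toy sizes; the poster uses 200-d GloVe inputs
layers = [make_layer(EMBED_DIM, HIDDEN_DIM, rng),
          make_layer(HIDDEN_DIM, HIDDEN_DIM, rng),
          make_layer(HIDDEN_DIM, HIDDEN_DIM, rng)]

word = [[0.1, 0.2, 0.3, 0.4]]                           # Xa: one word vector
phrase = [[0.1, 0.2, 0.3, 0.4], [0.4, 0.3, 0.2, 0.1]]   # Xb: two word vectors

# Both branches call encode() with the same `layers`, i.e. shared weights.
h_a = encode(layers, word, HIDDEN_DIM)
h_b = encode(layers, phrase, HIDDEN_DIM)
```

Because both inputs pass through the same parameters, a single word and a multi-word phrase end up as vectors of the same size that can be compared directly.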
Data set and Baseline Word Vectors
We use the PPDB dataset (Ganitkevitch et al., 2013).
PPDB, size XL
N-gram    Count      Percentage
2-gram    298,536    79.40 %
3-gram    60,657     16.13 %
4-gram    13,132     3.49 %
5-gram    2,993      0.80 %
6-gram    682        0.18 %
Total     376,000    100.00 %
Table 2. Composition of the PPDB dataset of size XL. Only 0.18% of the phrases are longer than
5-grams; the corresponding shares are 0.15% and 0.12% for the XXL and XXXL sizes respectively.
As base word embeddings for the input layers, we use GloVe word vector
embeddings (Pennington et al., 2014) of dimension 200.
GloVe: https://nlp.stanford.edu/projects/glove
Experiments and Results
1. Word-Phrase Similarity Task
Fig 2. Accuracy of similarity over dataset size
2. SemEval2013 5(a) Task
Model         SemEval2013 Test
RAE           51.75
FCT-LM        67.22
FCT-Joint     70.64
ReNN          72.22
Our System    72.14
Table 3. Performance comparison. RAE: Recursive Auto-encoder; FCT: Feature
Rich Compositional Transformation Models; ReNN: Recursive Neural Networks
3. Nearest Words and Phrases
Word          Nearest Words                              Nearest Phrases
viewpoints    perspectives, opinions, viewpoint, views   differing views, the viewpoints, different opinions, different perspectives
upbeat        optimistic, cautious, outlook, gloomy      optimistic about, overly optimistic, more cautious, cautious
1600          1400, 1700, 1300, 1500                     1600 hours, 1300 hours, 1700 hours, 1100 hours
asem          apec, asean, g20, summit                   the asem process, apec leaders, apec economies, summit meetings

Phrase                     Nearest Phrases
are crucial                are important, are needed, are essential, are necessary
2005 to 2006               2005 to 2007, 2005 to 2008, 2003 to 2006, 2004 to 2005
president hosni mubarak    president mubarak, egyptian president hosni mubarak, hosni mubarak, the egyptian president hosni mubarak
the violence               the acts of violence, the violent, the prevalence of violence, the cycle of violence
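Because words and phrases live in the same vector space, neighbour lists like those above can be produced by a single lookup over one shared table. The sketch below assumes cosine similarity as the ranking measure, which the poster does not state; the 2-d vectors are invented for illustration and `nearest` is a hypothetical helper.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def nearest(query, vectors, k=2):
    """Return the k keys whose vectors are most cosine-similar to vectors[query]."""
    q = vectors[query]
    scored = [(cosine(q, v), key) for key, v in vectors.items() if key != query]
    scored.sort(reverse=True)
    return [key for _, key in scored[:k]]

# Toy 2-d vectors, invented for illustration only (not taken from the model).
space = {
    "upbeat": [1.0, 0.1],
    "optimistic": [0.9, 0.2],
    "overly optimistic": [0.8, 0.3],
    "earth orbit": [-0.2, 1.0],
}
neighbours = nearest("upbeat", space, k=2)
```

The same function serves word queries and phrase queries alike, since keys of both kinds index one dictionary.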
References
1. Juri Ganitkevitch, Benjamin Van Durme, and Chris Callison-Burch. 2013. PPDB: The Paraphrase Database. NAACL-HLT, 758-764.
2. Richard Socher, Jeffrey Pennington, Eric H. Huang, Andrew Y. Ng, and Christopher D. Manning. 2011. Semi-supervised Recursive Autoencoders for Predicting Sentiment Distributions. EMNLP, 151-161.
3. Richard Socher, John Bauer, Christopher D. Manning, and Andrew Ng. 2013. Parsing with Compositional Vector Grammars. ACL, 455-465.
4. Mo Yu and Mark Dredze. 2015. Learning Composition Models for Phrase Embeddings. ACL, 227-242.

Beyond Word2Vec: Embedding Words and Phrases in Same Vector Space
