Komachi Lab
M1 Ryosuke Miyazaki
2015/10/16
Cross-Lingual Sentiment Analysis using modified BRAE
Sarthak Jain and Shashank Batra
EMNLP 2015
EMNLP 2015 reading group
※ All figures in these slides are taken from the original paper
Abstract
✤ Goal: cross-lingual sentiment analysis
- They use a parallel corpus that pairs a resource-rich language (English) with a resource-poor language (Hindi)
✤ They create a new Movie Reviews Dataset in Hindi for evaluation
✤ Their model significantly outperforms the state of the art, especially when labeled data is scarce
Model and Training
BRAE Model
Bilingually Constrained Recursive Auto-encoder (BRAE)
First, consider a standard recursive auto-encoder (RAE) for each language separately
Construct the parent vector from the children vectors, then reconstruct the children vectors from the parent
Minimize the reconstruction error (Euclidean distance)
c: child vector
y, p: parent vector
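The encode-then-reconstruct step above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the `tanh` nonlinearity, the half squared Euclidean distance, and the names `enc_w`, `enc_b`, `dec_w`, `dec_b` are assumptions borrowed from the common RAE formulation, and the 4-dimensional vectors are toy values.

```python
import math
import random

def affine_tanh(weights, bias, vector):
    """Return tanh(W v + b) for a row-major weight matrix W."""
    return [
        math.tanh(sum(w * x for w, x in zip(row, vector)) + b)
        for row, b in zip(weights, bias)
    ]

def rae_step(c1, c2, enc_w, enc_b, dec_w, dec_b):
    """Encode two children into a parent, decode it, and score the result."""
    children = c1 + c2  # concatenation [c1; c2]
    parent = affine_tanh(enc_w, enc_b, children)
    reconstructed = affine_tanh(dec_w, dec_b, parent)
    # Reconstruction error: half the squared Euclidean distance (assumed form).
    error = 0.5 * sum((a - b) ** 2 for a, b in zip(children, reconstructed))
    return parent, error

def random_matrix(rows, cols, rng):
    """Small random weights, only for the toy run below."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
dim = 4  # toy size; the slides report 80-dimensional word vectors
enc_w, enc_b = random_matrix(dim, 2 * dim, rng), [0.0] * dim
dec_w, dec_b = random_matrix(2 * dim, dim, rng), [0.0] * (2 * dim)

c1 = [rng.uniform(-1, 1) for _ in range(dim)]
c2 = [rng.uniform(-1, 1) for _ in range(dim)]
parent, error = rae_step(c1, c2, enc_w, enc_b, dec_w, dec_b)
print(len(parent), error >= 0.0)
```

Applying `rae_step` bottom-up over a binary tree yields one vector per phrase; training would adjust the four parameter sets to lower the summed error.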
BRAE Model
Loss Function
They also produce a representation from the other language
Assumption
A phrase and its correct translation should share the same semantic meaning
Loss function for the source language
Transformation loss
Likewise, they define the loss for the target language
Objective function
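The equations on this slide did not survive extraction. As a hedged reconstruction, the labels above line up with the usual BRAE formulation sketched below. The symbols $W^{l}_{s}$, $b^{l}_{s}$, $f$, $N$ and $\lambda$ are assumptions taken from that formulation, not from the slide; $\alpha$ is the joint-error weight listed among the hyperparameters.

```latex
% Transformation loss: map the source phrase vector p_s into the target space
E_{sem}(s \mid t; \theta) = \tfrac{1}{2}\,\lVert p_t - f(W^{l}_{s}\, p_s + b^{l}_{s}) \rVert^{2}

% Loss for the source language: reconstruction error plus transformation loss
E(s \mid t; \theta) = \alpha\, E_{rec}(s; \theta) + (1 - \alpha)\, E_{sem}(s \mid t; \theta)

% The target-language loss E(t | s; theta) is defined symmetrically
E(s, t; \theta) = E(s \mid t; \theta) + E(t \mid s; \theta)

% Objective over N phrase pairs with L2 regularisation
J = \frac{1}{N} \sum_{(s,t)} E(s, t; \theta) + \frac{\lambda}{2}\,\lVert \theta \rVert^{2}
```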
Training (Unsupervised)
✤ Word embeddings are pre-trained with Word2Vec
✤ 1st: Pre-train ps and pt separately with the RAE
✤ 2nd: Fix pt and train ps with the BRAE
- Then vice versa: fix ps and train pt
- Set ps = p’s, pt = p’t when training reaches a local minimum
Training (Supervised)
✤ Modification for classifying sentiment
✤ Add softmax and cross-entropy error functions to the source language only (the resource-rich language)
✤ In this phase, a penalty term is included in the reconstruction error
✤ The transformation weights (θ^t_s, θ^s_t) are not updated in this phase
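The softmax layer and cross-entropy error added on the source side can be sketched as follows. This is an illustrative sketch only: the weight values, the two-class setup, and the function names are hypothetical, and the penalty term and the frozen transformation weights are not modelled here.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of raw class scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(predicted, gold):
    """Cross-entropy between a predicted and a gold label distribution."""
    return -sum(g * math.log(p) for g, p in zip(gold, predicted) if g > 0)

def sentiment_distribution(phrase_vector, label_weights, label_bias):
    """Softmax layer on top of a source phrase representation p_s."""
    scores = [
        sum(w * x for w, x in zip(row, phrase_vector)) + b
        for row, b in zip(label_weights, label_bias)
    ]
    return softmax(scores)

# Toy example: a 3-dimensional phrase vector and two sentiment classes.
p_s = [0.2, -0.4, 0.1]
label_weights = [[1.0, 0.0, 0.5], [-1.0, 0.3, 0.0]]  # hypothetical values
label_bias = [0.0, 0.0]
gold = [1.0, 0.0]  # one-hot gold label: first class

dist = sentiment_distribution(p_s, label_weights, label_bias)
loss = cross_entropy(dist, gold)
print(dist, loss)
```

During supervised training this loss would be added to the source-side error, so the phrase vectors are pushed to be predictive of sentiment as well as reconstructable.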
Training (Supervised)
✤ 1st: update only the parameters related to the resource-rich language
ce: cross entropy
✤ 2nd: update only the parameters related to the resource-poor language
- Since gold labels exist only for the resource-rich language, they use the transformation to obtain the sentiment distribution
✤ Predict the overall sentiment of resource-poor text
- Concatenate pt and p’s, then train a softmax regression with a weight matrix
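The final prediction step (concatenate pt and p’s, then fit a softmax regression) can be sketched as below. It is a toy under stated assumptions: the vectors, labels, learning rate, and epoch count are all hypothetical, and plain per-sample stochastic gradient descent stands in for whatever optimiser the authors used.

```python
import math

def softmax(scores):
    """Numerically stable softmax."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def predict_proba(weights, features):
    """Class probabilities for one concatenated feature vector."""
    return softmax([sum(w * x for w, x in zip(row, features)) for row in weights])

def train_softmax_regression(samples, labels, num_classes, lr=0.5, epochs=200):
    """Fit a weight matrix with per-sample gradient steps on cross-entropy."""
    dim = len(samples[0])
    weights = [[0.0] * dim for _ in range(num_classes)]
    for _ in range(epochs):
        for features, label in zip(samples, labels):
            probs = predict_proba(weights, features)
            for k in range(num_classes):
                # Gradient of cross-entropy w.r.t. row k: (p_k - y_k) * x
                delta = probs[k] - (1.0 if k == label else 0.0)
                for j in range(dim):
                    weights[k][j] -= lr * delta * features[j]
    return weights

# Hypothetical toy data: (p_t, p'_s) pairs with a binary sentiment label.
pairs = [
    ([1.0, 0.2], [0.9, 0.1], 0),
    ([0.8, 0.0], [1.0, 0.3], 0),
    ([-1.0, 0.1], [-0.8, 0.2], 1),
    ([-0.7, 0.3], [-0.9, 0.0], 1),
]
samples = [p_t + p_s_prime for p_t, p_s_prime, _ in pairs]  # concatenation
labels = [label for _, _, label in pairs]

weights = train_softmax_regression(samples, labels, num_classes=2)
predictions = [
    max(range(2), key=lambda k: predict_proba(weights, x)[k]) for x in samples
]
print(predictions)
```

Concatenating both vectors lets the classifier use the target-side representation together with its projection into the source space, where the sentiment labels were learned.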
Experiments
Experimental Settings
For the unsupervised phase
✤ HindMonoCorp 0.5 (44.49M sentences) and the English Gigaword Corpus for word embeddings
✤ Bilingual sentence-aligned data from HindEnCorp (273.9k sentence pairs)
For the supervised phase (MOSES is used to obtain bilingual phrase pairs)
✤ IMDB11 dataset (25000 pos, 25000 neg)
✤ Rotten Tomatoes Review dataset (4 documents, {0, 1, 2, 3})
Experimental Settings
Evaluation datasets
✤ Rating Based Hindi Movie Review Dataset (2945 movie reviews, ratings {1, 2, 3, 4}); they created this new dataset for evaluation
✤ Standard Movie Reviews Dataset (125 positive, 125 negative)
Hyperparameters
✤ learning rate: 0.05
✤ word vector dimension: 80
✤ joint error of BRAE (α): 0.2
✤ λL: 0.001
✤ λBRAE: 0.0001
Tuned by grid search with cross-validation
✤ κ: 0.2, η: 0.35
✤ λp: 0.01
✤ λS: 0.1
✤ λT: 0.04
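Grid search with cross-validation, as named on this slide, can be sketched generically. Everything below is an assumption made for illustration: the candidate grid, the fold count, and `toy_evaluate`, which is a stand-in scoring function and not a real experiment. Only the values 0.05 and 80 come from the slide.

```python
import itertools

def k_fold_indices(n_items, k):
    """Split range(n_items) into k contiguous validation folds."""
    fold_size, remainder = divmod(n_items, k)
    folds, start = [], 0
    for i in range(k):
        end = start + fold_size + (1 if i < remainder else 0)
        folds.append(list(range(start, end)))
        start = end
    return folds

def grid_search_cv(grid, data, evaluate, k=5):
    """Return the setting with the best mean validation score."""
    names = sorted(grid)
    best_params, best_score = None, float("-inf")
    for values in itertools.product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        scores = []
        for fold in k_fold_indices(len(data), k):
            held_out = set(fold)
            train = [x for i, x in enumerate(data) if i not in held_out]
            valid = [data[i] for i in fold]
            scores.append(evaluate(params, train, valid))
        mean_score = sum(scores) / len(scores)
        if mean_score > best_score:
            best_params, best_score = params, mean_score
    return best_params, best_score

# Hypothetical candidate grid; only 0.05 and 80 are values reported on the slide.
grid = {"learning_rate": [0.01, 0.05, 0.1], "word_dim": [40, 80]}

def toy_evaluate(params, train, valid):
    """Stand-in for 'train on train, return accuracy on valid' (not real results)."""
    return (
        1.0
        - abs(params["learning_rate"] - 0.05)
        - abs(params["word_dim"] - 80) / 1000
    )

best_params, best_score = grid_search_cv(
    grid, data=list(range(10)), evaluate=toy_evaluate, k=5
)
print(best_params)
```

In a real run, `evaluate` would train the model on `train` with the given setting and return its accuracy on `valid`.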
Results
✤ BRAE-U: neither includes the penalty term nor fixes the transformation weights
✤ BRAE-P: includes only the penalty term
✤ BRAE-F: includes the penalty term and fixes the transformation weights
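The three variants differ only in two switches, which can be written down as a small configuration table. The type and field names below are hypothetical and serve only to make the comparison explicit.

```python
from typing import NamedTuple

class BraeVariant(NamedTuple):
    name: str
    use_penalty_term: bool            # penalty term in the reconstruction error
    fix_transformation_weights: bool  # keep transformation weights frozen

VARIANTS = [
    BraeVariant("BRAE-U", use_penalty_term=False, fix_transformation_weights=False),
    BraeVariant("BRAE-P", use_penalty_term=True, fix_transformation_weights=False),
    BraeVariant("BRAE-F", use_penalty_term=True, fix_transformation_weights=True),
]

for variant in VARIANTS:
    print(variant)
```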
(Tables: results in monolingual and cross-lingual settings; confusion matrix for BRAE-F)
Results
(Figure: accuracy vs. amount of labeled training data used)
✤ Their model achieves the best performance even with 50% less data than the other methods
(Figure: accuracy vs. amount of unlabeled training data used)
Analysis
✤ Since movement in the semantic vector space is restricted, their model has an advantage on unknown words
✤ Their model was able to correctly infer the word sense of polysemous words
Example:
“Her acting of a schizophrenic mother made our hearts weep”
The baseline classifies this as negative because of “weep”, but their model correctly predicts positive
Error Analysis
Difficult situations
✤ Conflicting sentiments about two different aspects of the same object
✤ Presence of subtle contextual references
Examples of the latter case
✤ “His poor acting generally destroys a movie, but this time it didn’t”
- The correct label is positive, but the predicted rating is 2
✤ “This movie made his last one look good”
- Wrongly predicted as rating 3
