SlideShare a Scribd company logo
1 of 16
Download to read offline
ReVal: A Simple and Effec2ve Machine Transla2on
Evalua2on Metric Based on Recurrent Neural Networks
2018/01/10 B4 Hiroki Shimanaka
Rohit Gupta, Constan/n Ora ̆san, Josef van Genabith
EMNLP 2015
Abstract & Introduction (1)
l
l RNN LSTM
l 2
1
Abstract & Introhuction (2)
l
WMT-13 SICK
l
WMTl -14
2
3
lBLEU [Papineni et al., 2002], NIST [Doddington, 2002]
l
lMETEOR [Denkowski and Lavie, 2014], TERp[Snover et al., 2008]
lWordNet
lDISKOTK-PARTY-TUNED [Joty et al., 2014]
l WMT-14 metric shared
task
lLAYERED [Gautam and Bhat- tacharyya, 2014]
l WMT-14 metric shared
task 2
Related Work
4
WMTl -14 metric shared task
l (WMT SICK )
LSTM
Approach
5
l LSTM
l LSTM
Tree-LSTM
l LSTM !" #" $
Tree-LSTM
$
LSTMs and Tree-LSTMs
6
l
ℎ"#$, ℎ&"' ∶ LSTM
̂*+ :
,- ∶
./
∶ [1 2 … 4]
Evaluation Metric (1)
7
! " ∶ KL
$ : 2
Evalua&on Metric (2)
8
l
(RR)
WMTl -13 RR 5 1 5
1
l RR
Computing Similarity Scores from WMT Rankings (1)
9
Computing Similarity Scores from WMT Rankings (2)
10
Experiment (DataSet)
L: WMT-13 4
LNF: WMT-13 4
L+Sick: L Sick 4,500
Sick 500
XL+Sick: L+Sick WMT-13
11
Experiment (Model)
l GloVe [Pen- nington et al., 2014]
l LSTM Tree-LSTM
lLSTM
lBatch size : 25
lTrain rate : 0.05
lregularization strength : 0.0001
lHidden dimension : 50 or 100
lMemory dimension : 150 or 300
lTrain epochs: 10
lSystem-level scores are computed by ag-gregating and normalising segment-
level scores.
12
Result (1)
13
Result (2)
14
Analysis
Systeml -level segment-level Kendall
Tau
l :
01 :
01 (0.10), 04 (0.71), 02 (0.72), 03 (0.73)
02 :
02 (0.71), 01 (0.72), 03 (0.73), 04 (0.74)
:
01 (0.10), 02 (0.71), 03 (0.72), 04 (0.73)
l 01
Kendall Tau 02
02
l
15
Conclusions
lGupta Tree-LSTM
ReVal
lReVal

More Related Content

Similar to [論文紹介] ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks

2015 - 2016 ieee ns2 project titles
2015 - 2016 ieee ns2 project titles2015 - 2016 ieee ns2 project titles
2015 - 2016 ieee ns2 project titles
JPINFOTECH JAYAPRAKASH
 
Iciic 2010 114
Iciic 2010 114Iciic 2010 114
Iciic 2010 114
hanums1
 
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Peter Brusilovsky
 
Modeling and meta-modeling presentation at LTH, Sweden
Modeling and meta-modeling presentation at LTH, Sweden Modeling and meta-modeling presentation at LTH, Sweden
Modeling and meta-modeling presentation at LTH, Sweden
Saïd Assar
 

Similar to [論文紹介] ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks (19)

Game Assignments in computer Science
Game Assignments in computer ScienceGame Assignments in computer Science
Game Assignments in computer Science
 
Image Fusion Ehancement using DT-CWT Technique
Image Fusion Ehancement using DT-CWT TechniqueImage Fusion Ehancement using DT-CWT Technique
Image Fusion Ehancement using DT-CWT Technique
 
IRJET- Review on Raspberry Pi based Assistive Communication System for Blind,...
IRJET- Review on Raspberry Pi based Assistive Communication System for Blind,...IRJET- Review on Raspberry Pi based Assistive Communication System for Blind,...
IRJET- Review on Raspberry Pi based Assistive Communication System for Blind,...
 
2015 - 2016 ieee ns2 project titles
2015 - 2016 ieee ns2 project titles2015 - 2016 ieee ns2 project titles
2015 - 2016 ieee ns2 project titles
 
Iciap 2
Iciap 2Iciap 2
Iciap 2
 
IRJET- Next Generation System Assistant
IRJET- Next Generation System AssistantIRJET- Next Generation System Assistant
IRJET- Next Generation System Assistant
 
Iciic 2010 114
Iciic 2010 114Iciic 2010 114
Iciic 2010 114
 
BERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from TransformersBERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from Transformers
 
Improving performance of routing protocols using mrp framework
Improving performance of routing protocols using mrp frameworkImproving performance of routing protocols using mrp framework
Improving performance of routing protocols using mrp framework
 
Improving performance of routing protocols using mrp framework
Improving performance of routing protocols using mrp frameworkImproving performance of routing protocols using mrp framework
Improving performance of routing protocols using mrp framework
 
Predictive Business Process Monitoring with LSTM Neural Networks
Predictive Business Process Monitoring with LSTM Neural NetworksPredictive Business Process Monitoring with LSTM Neural Networks
Predictive Business Process Monitoring with LSTM Neural Networks
 
NTCIR-12 task proposal: Short Text Conversation (STC)
NTCIR-12 task proposal: Short Text Conversation (STC)NTCIR-12 task proposal: Short Text Conversation (STC)
NTCIR-12 task proposal: Short Text Conversation (STC)
 
Research Summary of Ryoichi Shinkuma, Kyoto University, Japan
Research Summary of Ryoichi Shinkuma, Kyoto University, JapanResearch Summary of Ryoichi Shinkuma, Kyoto University, Japan
Research Summary of Ryoichi Shinkuma, Kyoto University, Japan
 
Multiprocessor scheduling of dependent tasks to minimize makespan and reliabi...
Multiprocessor scheduling of dependent tasks to minimize makespan and reliabi...Multiprocessor scheduling of dependent tasks to minimize makespan and reliabi...
Multiprocessor scheduling of dependent tasks to minimize makespan and reliabi...
 
Interactive Information Retrieval inspired by Quantum Theory
Interactive Information Retrieval inspired by Quantum TheoryInteractive Information Retrieval inspired by Quantum Theory
Interactive Information Retrieval inspired by Quantum Theory
 
Reading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptxReading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptx
 
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
Supporting Cross-Device Web Search with Social Navigation-Based Mobile Touch ...
 
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
 
Modeling and meta-modeling presentation at LTH, Sweden
Modeling and meta-modeling presentation at LTH, Sweden Modeling and meta-modeling presentation at LTH, Sweden
Modeling and meta-modeling presentation at LTH, Sweden
 

More from Hiroki Shimanaka

More from Hiroki Shimanaka (8)

[Tutorial] Sentence Representation
[Tutorial] Sentence Representation[Tutorial] Sentence Representation
[Tutorial] Sentence Representation
 
[論文紹介] Reference Bias in Monolingual Machine Translation Evaluation
[論文紹介] Reference Bias in Monolingual Machine Translation Evaluation[論文紹介] Reference Bias in Monolingual Machine Translation Evaluation
[論文紹介] Reference Bias in Monolingual Machine Translation Evaluation
 
[Paper Reading] Unsupervised Learning of Sentence Embeddings using Compositi...
[Paper Reading]  Unsupervised Learning of Sentence Embeddings using Compositi...[Paper Reading]  Unsupervised Learning of Sentence Embeddings using Compositi...
[Paper Reading] Unsupervised Learning of Sentence Embeddings using Compositi...
 
[論文紹介] PARANMT-50M- Pushing the Limits of Paraphrastic Sentence Embeddings wi...
[論文紹介] PARANMT-50M- Pushing the Limits of Paraphrastic Sentence Embeddings wi...[論文紹介] PARANMT-50M- Pushing the Limits of Paraphrastic Sentence Embeddings wi...
[論文紹介] PARANMT-50M- Pushing the Limits of Paraphrastic Sentence Embeddings wi...
 
[論文紹介] AN EFFICIENT FRAMEWORK FOR LEARNING SENTENCE REPRESENTATIONS.
[論文紹介] AN EFFICIENT FRAMEWORK FOR LEARNING SENTENCE REPRESENTATIONS.[論文紹介] AN EFFICIENT FRAMEWORK FOR LEARNING SENTENCE REPRESENTATIONS.
[論文紹介] AN EFFICIENT FRAMEWORK FOR LEARNING SENTENCE REPRESENTATIONS.
 
[論文紹介] Are BLEU and Meaning Representation in Opposition?
[論文紹介] Are BLEU and Meaning Representation in Opposition?[論文紹介] Are BLEU and Meaning Representation in Opposition?
[論文紹介] Are BLEU and Meaning Representation in Opposition?
 
[論文紹介] Skip-Thought Vectors
[論文紹介] Skip-Thought Vectors[論文紹介] Skip-Thought Vectors
[論文紹介] Skip-Thought Vectors
 
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
 

Recently uploaded

DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
MayuraD1
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
jaanualu31
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 

Recently uploaded (20)

DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
Air Compressor reciprocating single stage
Air Compressor reciprocating single stageAir Compressor reciprocating single stage
Air Compressor reciprocating single stage
 
Rums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdfRums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdf
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
Bridge Jacking Design Sample Calculation.pptx
Bridge Jacking Design Sample Calculation.pptxBridge Jacking Design Sample Calculation.pptx
Bridge Jacking Design Sample Calculation.pptx
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 

[論文紹介] ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks