Evaluating Neural Machine Translation Models for English-Japanese Task

•

1 like•484 views

Association for Computational Linguistics

WAT - W15-5007

Education

Evaluating Neural Machine Translation
in English-Japanese Task
(TEAM ID: WEBLIO MT)
Zhongyuan Zhu
@raphaelshu
1

Empirically evaluate various models in EJ task
‣ Two network architectures
2
‣ Three recurrent units
‣ LSTM, GRU, IRNN
multi-layer encoder-decoder model soft-attention model
‣ Two kinds of training data
‣ naturally-ordered, pre-reordered

Results: evaluation scores
4
BLEU RIBES HUMAN JPO
Baseline phrase-based SMT 29.80 0.691
Baseline hierarchical phrase-based SMT 32.56 0.746
Baseline Tree-to-string SMT 33.44 0.758 30.00
Submitted system 1
(NMT)
34.19 0.802 43.50
Submitted system 2
(NMT + System combination)
36.21 0.809 53.75 3.81
Best competitor 1: NAIST
(Travatar System with NeuralMT Reranking)
38.17 0.813 62.25 4.04
Best competitor 2: naver
(SMT t2s + Spell correction + NMT reranking)
36.14 0.803 53.25 4.00

Finding & Insights
‣ Soft-attention models outperforms multi-layer
encoder-decoder models
‣ Training models on pre-reordered data hurts
the performance
‣ NMT models tend to make grammatically
valid but incomplete translations
5

What's hot

DL for sentence classification project Write-upHoàng Triều Trịnh

Multi-Layer PerceptronsESCOM

15 Machine Learning Multilayer PerceptronAndres Mendez-Vazquez

Neural networksHarshitGupta367

Introduction to Applied Machine LearningSheilaJimenezMorejon

DL for setence classification project presentationHoàng Triều Trịnh

Machine Learning: Introduction to Neural NetworksFrancesco Collova'

deep CNN vs conventional MLChao Han chaohan@vt.edu

Recurrent Neural Networks I (D2L2 Deep Learning for Speech and Language UPC 2...Universitat Politècnica de Catalunya

Mca2050 computer architecturesmumbahelp

02 Fundamental Concepts of ANNTamer Ahmed Farrag, PhD

Artificial neural networkIshaneeSharma

2021 03-01-on the relationship between self-attention and convolutional layersJAEMINJEONG5

Introduction to CNNShuai Zhang

Deep Belief NetworksHasan H Topcu

Introduction For seq2seq(sequence to sequence) and RNNHye-min Ahn

Clustering tutorialLio Gonçalves

Artificial Neural Networks Lect5: Multi-Layer Perceptron & BackpropagationMohammed Bennamoun

Learning Financial Market Data with Recurrent Autoencoders and TensorFlowAltoros

A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...iosrjce

What's hot (20)

DL for sentence classification project Write-up

Multi-Layer Perceptrons

15 Machine Learning Multilayer Perceptron

Neural networks

Introduction to Applied Machine Learning

DL for setence classification project presentation

Machine Learning: Introduction to Neural Networks

deep CNN vs conventional ML

Recurrent Neural Networks I (D2L2 Deep Learning for Speech and Language UPC 2...

Mca2050 computer architecture

02 Fundamental Concepts of ANN

Artificial neural network

2021 03-01-on the relationship between self-attention and convolutional layers

Introduction to CNN

Deep Belief Networks

Introduction For seq2seq(sequence to sequence) and RNN

Clustering tutorial

Artificial Neural Networks Lect5: Multi-Layer Perceptron & Backpropagation

Learning Financial Market Data with Recurrent Autoencoders and TensorFlow

A New Classifier Based onRecurrent Neural Network Using Multiple Binary-Outpu...

Viewers also liked

Visual-Semantic Embeddings: some thoughts on LanguageRoelof Pieters

State of Blockchain 2017: Smartnetworks and the Blockchain EconomyMelanie Swan

Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine TranslationAssociation for Computational Linguistics

Deep Learning & NLP: Graphs to the Rescue!Roelof Pieters

iPhone5c的最后猜测Yanbin Kong

Philosophy of Deep LearningMelanie Swan

Vectorland: Brief Notes from Using Text Embeddings for SearchBhaskar Mitra

Chris Dyer - 2017 - CoNLL Invited Talk: Should Neural Network Architecture Re...Association for Computational Linguistics

Blockchain Economic TheoryMelanie Swan

Chenchen Ding - 2015 - NICT at WAT 2015Association for Computational Linguistics

Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics

Neural Models for Document RankingBhaskar Mitra

Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics

John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics

Using Text Embeddings for Information RetrievalBhaskar Mitra

John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics

Advanced Node.JS MeetupLINAGORA

Cs231n 2017 lecture13 Generative ModelYanbin Kong

Exploring Session Context using Distributed Representations of Queries and Re...Bhaskar Mitra

Deep Learning in practice : Speech recognition and beyond - MeetupLINAGORA

Viewers also liked (20)

Visual-Semantic Embeddings: some thoughts on Language

State of Blockchain 2017: Smartnetworks and the Blockchain Economy

Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine Translation

Deep Learning & NLP: Graphs to the Rescue!

iPhone5c的最后猜测

Philosophy of Deep Learning

Vectorland: Brief Notes from Using Text Embeddings for Search

Chris Dyer - 2017 - CoNLL Invited Talk: Should Neural Network Architecture Re...

Blockchain Economic Theory

Chenchen Ding - 2015 - NICT at WAT 2015

Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop

Neural Models for Document Ranking

Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop

John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...

Using Text Embeddings for Information Retrieval

John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...

Advanced Node.JS Meetup

Cs231n 2017 lecture13 Generative Model

Exploring Session Context using Distributed Representations of Queries and Re...

Deep Learning in practice : Speech recognition and beyond - Meetup

Similar to Evaluating Neural Machine Translation Models for English-Japanese Task

Long Zhou - 2017 - Neural System Combination for Machine TransaltionAssociation for Computational Linguistics

MLConf 2013: Metronome and Parallel Iterative Algorithms on YARNJosh Patterson

Tokyo Webmining Talk1Kenta Oono

Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics

Presentation, navid khoobRealsim, Fanavaran Sharif

A Platform for Accelerating Machine Learning ApplicationsNVIDIA Taiwan

Supervised sequence labelling with recurrent neural networks ch1 6SungminYou

A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...Salford Systems

A seriously simple memetic approach with a high performanceFabio Caraffini

Efficient Implementation of Self-Organizing Map for Sparse Input Dataymelka

Ultrasound nerve segmentation, kaggle reviewEduard Tyantov

Extracted pages from Neural Fuzzy Systems.docxdannyabe

6조butest

Expert estimation from Multimodal FeaturesXavier Ochoa

Tridiagonal solver in gpuHunan University

MaPU-HPCA2016Shaolin Xie

Standardising the compressed representation of neural networksFörderverein Technische Fakultät

TiReX: Tiled Regular eXpression matching architectureNECST Lab @ Politecnico di Milano

Protocol Type Based Intrusion Detection Using RBF Neural NetworkWaqas Tariq

Architecture neural network deep optimizing based on self organizing feature ...journalBEEI

Similar to Evaluating Neural Machine Translation Models for English-Japanese Task (20)

Long Zhou - 2017 - Neural System Combination for Machine Transaltion

MLConf 2013: Metronome and Parallel Iterative Algorithms on YARN

Tokyo Webmining Talk1

Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...

Presentation, navid khoob

A Platform for Accelerating Machine Learning Applications

Supervised sequence labelling with recurrent neural networks ch1 6

A Hybrid Method of CART and Artificial Neural Network for Short Term Load For...

A seriously simple memetic approach with a high performance

Efficient Implementation of Self-Organizing Map for Sparse Input Data

Ultrasound nerve segmentation, kaggle review

Extracted pages from Neural Fuzzy Systems.docx

6조

Expert estimation from Multimodal Features

Tridiagonal solver in gpu

MaPU-HPCA2016

Standardising the compressed representation of neural networks

TiReX: Tiled Regular eXpression matching architecture

Protocol Type Based Intrusion Detection Using RBF Neural Network

Architecture neural network deep optimizing based on self organizing feature ...

Recently uploaded

The basics of sentences session 2pptx copy.pptxheathfieldcps1

Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019

Introduction to AI in Higher Education_draft.pptxpboyjonauth

Código Creativo y Arte de Software | Unidad 1Maestría en Comunicación Digital Interactiva - UNR

_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon

Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝9953056974 Low Rate Call Girls In Saket, Delhi NCR

URLs and Routing in the Odoo 17 Website AppCeline George

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar

Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732

Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha

call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR

Staff of Color (SOC) Retention Efforts DDSDDavid Douglas School District

CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2

Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand

Accessible design: Minimum effort, maximum impactdawncurless

Alper Gobel In Media Res Media ComponentInMediaRes1

Software Engineering Methodologies (overview)eniolaolutunde

TataKelola dan KamSiber Kecerdasan Buatan v022.pdfSarwono Sutikno, Dr.Eng.,CISA,CISSP,CISM,CSX-F

Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching

Recently uploaded (20)

The basics of sentences session 2pptx copy.pptx

Sanyam Choudhary Chemistry practical.pdf

Introduction to AI in Higher Education_draft.pptx

Código Creativo y Arte de Software | Unidad 1

_Math 4-Q4 Week 5.pptx Steps in Collecting Data

Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝

URLs and Routing in the Odoo 17 Website App

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx

Separation of Lanthanides/ Lanthanides and Actinides

Call Girls in Dwarka Mor Delhi Contact Us 9654467111

call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️

Staff of Color (SOC) Retention Efforts DDSD

CARE OF CHILD IN INCUBATOR..........pptx

Concept of Vouching. B.Com(Hons) /B.Compdf

Accessible design: Minimum effort, maximum impact

Alper Gobel In Media Res Media Component

Software Engineering Methodologies (overview)

TataKelola dan KamSiber Kecerdasan Buatan v022.pdf

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...

Evaluating Neural Machine Translation Models for English-Japanese Task

1. Evaluating Neural Machine Translation in English-Japanese Task (TEAM ID: WEBLIO MT) Zhongyuan Zhu @raphaelshu 1

2. Empirically evaluate various models in EJ task ‣ Two network architectures 2 ‣ Three recurrent units ‣ LSTM, GRU, IRNN multi-layer encoder-decoder model soft-attention model ‣ Two kinds of training data ‣ naturally-ordered, pre-reordered

3. Results: perplexities 3

4. Results: evaluation scores 4 BLEU RIBES HUMAN JPO Baseline phrase-based SMT 29.80 0.691 Baseline hierarchical phrase-based SMT 32.56 0.746 Baseline Tree-to-string SMT 33.44 0.758 30.00 Submitted system 1 (NMT) 34.19 0.802 43.50 Submitted system 2 (NMT + System combination) 36.21 0.809 53.75 3.81 Best competitor 1: NAIST (Travatar System with NeuralMT Reranking) 38.17 0.813 62.25 4.04 Best competitor 2: naver (SMT t2s + Spell correction + NMT reranking) 36.14 0.803 53.25 4.00

5. Finding & Insights ‣ Soft-attention models outperforms multi-layer encoder-decoder models ‣ Training models on pre-reordered data hurts the performance ‣ NMT models tend to make grammatically valid but incomplete translations 5

6. Thanks. 6

Evaluating Neural Machine Translation Models for English-Japanese Task

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to Evaluating Neural Machine Translation Models for English-Japanese Task

Similar to Evaluating Neural Machine Translation Models for English-Japanese Task (20)

More from Association for Computational Linguistics

More from Association for Computational Linguistics (20)

Recently uploaded

Recently uploaded (20)

Evaluating Neural Machine Translation Models for English-Japanese Task