SlideShare a Scribd company logo
1 of 1
Download to read offline
Source Phrase Segmentation and Translation
for Japanese to English Translation Using Dependency Structure 	
Junki Matsuo, Kenichi Ohwada, and Mamoru Komachi	
Graduate School of System Design, Tokyo Metropolitan University, Japan	
We	
  propose	
  a	
  linguis.cally	
  mo.vated	
  approach	
  
based	
  on	
  segmen.ng	
  a	
  source	
  phrase	
  using	
  
dependency	
  structure	
  and	
  transla.ng	
  each	
  phrase	
  
with	
  PBSMT.	
  This	
  work	
  presents	
  the	
  results	
  of	
  our	
  
method	
  on	
  Japanese-­‐English	
  transla.on	
  and	
  
discusses	
  poten.al	
  improvements.	
  
	
  
①	
  Abstract	
②	
  Source	
  Phrase	
  Segmenta.on	
  and	
  Transla.on	
  Using	
  Dependency	
  Structure	
④	
  Experimental	
  Results	
⑤	
  Error	
  Analysis	
  (in	
  100	
  sentences)	
③	
  Experimental	
  SeQngs	
Decoder	
  :	
  Moses	
  (2.1.1)	
  default	
  seQngs	
  
Corpus	
  :	
  The	
  ASPEC	
  (3	
  million	
  parallel	
   	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  sentences)	
  
Segmenta.on	
  :	
  JUMAN	
  (7.0)	
  
Alignment	
  :	
  GIZA++	
  (1.0.7)	
  
MERT	
  :	
  Performed	
  on	
  the	
  full	
  dev-­‐set	
  
Dependency	
  :	
  CaboCha(0.68)	
  
Baseline	
  :	
  Standard	
  output	
  
	
  
Our	
  method	
  only	
  uses	
  CaboCha	
  for	
  crea.ng	
  
a	
  basic	
  frame	
  and	
  dependent	
  phrases.	
  (Not	
  
segmenta.on)	
  
Algorithm	
  
	
  
1.  Our	
  method	
  creates	
  a	
  basic	
  frame	
  and	
  
dependent	
  phrases.	
  (Basic	
  frame	
  :	
  broken	
  line	
  
circle,	
  Dependent	
  phrases	
  :	
  solid	
  line	
  circle.)	
  
2.  The	
  basic	
  frame	
  and	
  the	
  dependent	
  phrases	
  
are	
  translated	
  by	
  a	
  decoder.	
  
3.  The	
  anchor	
  words	
  of	
  the	
  basic	
  frame	
  are	
  
replaced	
  with	
  the	
  transla.ons	
  of	
  the	
  
corresponding	
  dependent	
  phrases.	
  (Anchor	
  
words	
  :	
  the	
  yellow	
  under-­‐lined	
  words)	
  
⑥	
  Conclusion &	
  Future	
  Work	
An	
  example	
  of	
  the	
  second	
  method	
no-­‐seg&trans	
  	
  	
  	
  … Standard	
  Moses	
  output	
  
seg&trans	
  	
  	
  	
  …	
  	
  	
  	
  The	
  first	
  method	
  
seg&trans	
  	
  	
  	
  …	
  	
  	
  	
  The	
  second	
  method	
  
We	
  inves.gate	
  the	
  reason	
  why	
  BLEU	
  and	
  RIBES	
  fall	
  
off.	
  The	
  number	
  of	
  the	
  pairs	
  of	
  input	
  and	
  reference	
  
that	
  have	
  different	
  voices	
  (ac.ve	
  or	
  passive)	
  is	
  35.	
  
	
  
We	
  observed	
  that	
  transla.on	
  of	
  dependent	
  phrases	
  
is	
  the	
  most	
  frequent	
  error	
  type.	
  	
  It	
  is	
  because	
  the	
  
language	
  and	
  the	
  transla.on	
  models	
  are	
  not	
  
op.mized	
  for	
  transla.ng	
  phrases.	
  
•  Our	
  finding	
  is	
  that	
  our	
  proposed	
  methods	
  
have	
  three	
  problems:	
  a	
  dependency	
  
parsing,	
  transla.on	
  of	
  a	
  basic	
  frame	
  and	
  
transla.on	
  of	
  dependent	
  phrases.	
  	
  
•  We	
  plan	
  to	
  op.mize	
  the	
  language	
  and	
  the	
  
transla.on	
  models	
  for	
  transla.ng	
  phrases.	
  
“雷撃比は等しい”	
  
↓	
  
“The	
  equal	
  to	
  lightning	
  stroke	
  ra.o”	
  
→ Ungramma'cal	
  Sentence	
  
Error	
  Example	
・Transla.on	
  of	
  the	
  basic	
  frame	
  
・Transla.on	
  of	
  the	
  dependent	
  phrases	
  
“解決を”	
  
↓	
  
“We	
  solve”	
  
→ Not	
  an	
  NP	
  
“DERSに”	
  
↓	
  
“In	
  DERS”	
  
→	
  	
  Missing	
  context	
  
An	
  example	
  of	
  the	
  first	
  method	
The	
  difference	
  between	
  the	
  first	
  and	
  second	
  
methods	
  is	
  not	
  to	
  replace	
  a	
  preposi.on	
  that	
  
follows	
  the	
  predicate	
  in	
  the	
  basic	
  frame.	
  	
  
	
  
Because	
  the	
  transla.on	
  of	
  dependent	
  phrases	
  in	
  
our	
  first	
  method	
  uses	
  the	
  language	
  and	
  
transla.on	
  models	
  op.mized	
  for	
  a	
  sentence,	
  our	
  
first	
  method	
  might	
  not	
  be	
  able	
  to	
  use	
  a	
  model	
  
op.mized	
  for	
  transla.ng	
  phrases.	
  

More Related Content

What's hot

NLP Asignment Final Presentation [IIT-Bombay]
NLP Asignment Final Presentation [IIT-Bombay]NLP Asignment Final Presentation [IIT-Bombay]
NLP Asignment Final Presentation [IIT-Bombay]Sagar Ahire
 
Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translationHrishikesh Nair
 
5a use of annotated corpus
5a use of annotated corpus5a use of annotated corpus
5a use of annotated corpusThennarasuSakkan
 
Doppl development iteration #4
Doppl development   iteration #4Doppl development   iteration #4
Doppl development iteration #4Diego Perini
 
TCUK 2013 - Mike Unwalla - Patterns in language for POS disambiguation in a s...
TCUK 2013 - Mike Unwalla - Patterns in language for POS disambiguation in a s...TCUK 2013 - Mike Unwalla - Patterns in language for POS disambiguation in a s...
TCUK 2013 - Mike Unwalla - Patterns in language for POS disambiguation in a s...TCUK Conference
 
Lecture 3: Semantic Role Labelling
Lecture 3: Semantic Role LabellingLecture 3: Semantic Role Labelling
Lecture 3: Semantic Role LabellingMarina Santini
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)inventionjournals
 
Emnlp読み会@2017 02-15
Emnlp読み会@2017 02-15Emnlp読み会@2017 02-15
Emnlp読み会@2017 02-15sekizawayuuki
 
Mining Revision Log of Language Learning SNS for Automated Japanese Error Cor...
Mining Revision Log of Language Learning SNS for Automated Japanese Error Cor...Mining Revision Log of Language Learning SNS for Automated Japanese Error Cor...
Mining Revision Log of Language Learning SNS for Automated Japanese Error Cor...Tomoya Mizumoto
 

What's hot (18)

NLP Asignment Final Presentation [IIT-Bombay]
NLP Asignment Final Presentation [IIT-Bombay]NLP Asignment Final Presentation [IIT-Bombay]
NLP Asignment Final Presentation [IIT-Bombay]
 
Pxc3898474
Pxc3898474Pxc3898474
Pxc3898474
 
Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translation
 
5a use of annotated corpus
5a use of annotated corpus5a use of annotated corpus
5a use of annotated corpus
 
Doppl development iteration #4
Doppl development   iteration #4Doppl development   iteration #4
Doppl development iteration #4
 
TCUK 2013 - Mike Unwalla - Patterns in language for POS disambiguation in a s...
TCUK 2013 - Mike Unwalla - Patterns in language for POS disambiguation in a s...TCUK 2013 - Mike Unwalla - Patterns in language for POS disambiguation in a s...
TCUK 2013 - Mike Unwalla - Patterns in language for POS disambiguation in a s...
 
Lecture 3: Semantic Role Labelling
Lecture 3: Semantic Role LabellingLecture 3: Semantic Role Labelling
Lecture 3: Semantic Role Labelling
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
8 issues in pos tagging
8 issues in pos tagging8 issues in pos tagging
8 issues in pos tagging
 
NLP
NLPNLP
NLP
 
Lfg and gpsg
Lfg and gpsgLfg and gpsg
Lfg and gpsg
 
Chat adapted pos tagger for romanian language
Chat adapted pos tagger for romanian languageChat adapted pos tagger for romanian language
Chat adapted pos tagger for romanian language
 
NLP_KASHK:POS Tagging
NLP_KASHK:POS TaggingNLP_KASHK:POS Tagging
NLP_KASHK:POS Tagging
 
Emnlp読み会@2017 02-15
Emnlp読み会@2017 02-15Emnlp読み会@2017 02-15
Emnlp読み会@2017 02-15
 
Nlp
NlpNlp
Nlp
 
CLUE-Aligner: An Alignment Tool to Annotate Pairs of Paraphrastic and Transla...
CLUE-Aligner: An Alignment Tool to Annotate Pairs of Paraphrastic and Transla...CLUE-Aligner: An Alignment Tool to Annotate Pairs of Paraphrastic and Transla...
CLUE-Aligner: An Alignment Tool to Annotate Pairs of Paraphrastic and Transla...
 
Mining Revision Log of Language Learning SNS for Automated Japanese Error Cor...
Mining Revision Log of Language Learning SNS for Automated Japanese Error Cor...Mining Revision Log of Language Learning SNS for Automated Japanese Error Cor...
Mining Revision Log of Language Learning SNS for Automated Japanese Error Cor...
 
Phrase structure grammar
Phrase structure grammarPhrase structure grammar
Phrase structure grammar
 

More from Association for Computational Linguistics

Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Association for Computational Linguistics
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Association for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Association for Computational Linguistics
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Association for Computational Linguistics
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 

More from Association for Computational Linguistics (20)

Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal TextMuis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
 
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
 
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour AnalysisCastro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text SimplificationElior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 

Recently uploaded

30 ĐỀ PHÁT TRIỂN THEO CẤU TRÚC ĐỀ MINH HỌA BGD NGÀY 22-3-2024 KỲ THI TỐT NGHI...
30 ĐỀ PHÁT TRIỂN THEO CẤU TRÚC ĐỀ MINH HỌA BGD NGÀY 22-3-2024 KỲ THI TỐT NGHI...30 ĐỀ PHÁT TRIỂN THEO CẤU TRÚC ĐỀ MINH HỌA BGD NGÀY 22-3-2024 KỲ THI TỐT NGHI...
30 ĐỀ PHÁT TRIỂN THEO CẤU TRÚC ĐỀ MINH HỌA BGD NGÀY 22-3-2024 KỲ THI TỐT NGHI...Nguyen Thanh Tu Collection
 
، ژیانا ئینگلیزا ب کوردی ، ئینگلیزەکان ، راپورتی کوردی ، راپورتا مێژوی ، ژ...
، ژیانا ئینگلیزا ب کوردی ، ئینگلیزەکان ، راپورتی کوردی ،    راپورتا مێژوی ، ژ...، ژیانا ئینگلیزا ب کوردی ، ئینگلیزەکان ، راپورتی کوردی ،    راپورتا مێژوی ، ژ...
، ژیانا ئینگلیزا ب کوردی ، ئینگلیزەکان ، راپورتی کوردی ، راپورتا مێژوی ، ژ...Idrees.Hishyar
 
French Revolution (फ्रेंच राज्यक्रांती)
French Revolution  (फ्रेंच राज्यक्रांती)French Revolution  (फ्रेंच राज्यक्रांती)
French Revolution (फ्रेंच राज्यक्रांती)Shankar Aware
 
TUYỂN TẬP 25 ĐỀ THI HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2023 CÓ ĐÁP ÁN (SƯU...
TUYỂN TẬP 25 ĐỀ THI HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2023 CÓ ĐÁP ÁN (SƯU...TUYỂN TẬP 25 ĐỀ THI HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2023 CÓ ĐÁP ÁN (SƯU...
TUYỂN TẬP 25 ĐỀ THI HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2023 CÓ ĐÁP ÁN (SƯU...Nguyen Thanh Tu Collection
 
أَسَانِيدُ كُتُبِ وَأُصُولِ النَّشْرِ لِابْنِ الْجَزَرِيِّ وَالْوَصْلُ بِهَا....
أَسَانِيدُ كُتُبِ وَأُصُولِ النَّشْرِ لِابْنِ الْجَزَرِيِّ وَالْوَصْلُ بِهَا....أَسَانِيدُ كُتُبِ وَأُصُولِ النَّشْرِ لِابْنِ الْجَزَرِيِّ وَالْوَصْلُ بِهَا....
أَسَانِيدُ كُتُبِ وَأُصُولِ النَّشْرِ لِابْنِ الْجَزَرِيِّ وَالْوَصْلُ بِهَا....سمير بسيوني
 
TUYỂN TẬP 20 ĐỀ THI KHẢO SÁT HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2020 (CÓ Đ...
TUYỂN TẬP 20 ĐỀ THI KHẢO SÁT HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2020 (CÓ Đ...TUYỂN TẬP 20 ĐỀ THI KHẢO SÁT HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2020 (CÓ Đ...
TUYỂN TẬP 20 ĐỀ THI KHẢO SÁT HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2020 (CÓ Đ...Nguyen Thanh Tu Collection
 

Recently uploaded (6)

30 ĐỀ PHÁT TRIỂN THEO CẤU TRÚC ĐỀ MINH HỌA BGD NGÀY 22-3-2024 KỲ THI TỐT NGHI...
30 ĐỀ PHÁT TRIỂN THEO CẤU TRÚC ĐỀ MINH HỌA BGD NGÀY 22-3-2024 KỲ THI TỐT NGHI...30 ĐỀ PHÁT TRIỂN THEO CẤU TRÚC ĐỀ MINH HỌA BGD NGÀY 22-3-2024 KỲ THI TỐT NGHI...
30 ĐỀ PHÁT TRIỂN THEO CẤU TRÚC ĐỀ MINH HỌA BGD NGÀY 22-3-2024 KỲ THI TỐT NGHI...
 
، ژیانا ئینگلیزا ب کوردی ، ئینگلیزەکان ، راپورتی کوردی ، راپورتا مێژوی ، ژ...
، ژیانا ئینگلیزا ب کوردی ، ئینگلیزەکان ، راپورتی کوردی ،    راپورتا مێژوی ، ژ...، ژیانا ئینگلیزا ب کوردی ، ئینگلیزەکان ، راپورتی کوردی ،    راپورتا مێژوی ، ژ...
، ژیانا ئینگلیزا ب کوردی ، ئینگلیزەکان ، راپورتی کوردی ، راپورتا مێژوی ، ژ...
 
French Revolution (फ्रेंच राज्यक्रांती)
French Revolution  (फ्रेंच राज्यक्रांती)French Revolution  (फ्रेंच राज्यक्रांती)
French Revolution (फ्रेंच राज्यक्रांती)
 
TUYỂN TẬP 25 ĐỀ THI HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2023 CÓ ĐÁP ÁN (SƯU...
TUYỂN TẬP 25 ĐỀ THI HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2023 CÓ ĐÁP ÁN (SƯU...TUYỂN TẬP 25 ĐỀ THI HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2023 CÓ ĐÁP ÁN (SƯU...
TUYỂN TẬP 25 ĐỀ THI HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2023 CÓ ĐÁP ÁN (SƯU...
 
أَسَانِيدُ كُتُبِ وَأُصُولِ النَّشْرِ لِابْنِ الْجَزَرِيِّ وَالْوَصْلُ بِهَا....
أَسَانِيدُ كُتُبِ وَأُصُولِ النَّشْرِ لِابْنِ الْجَزَرِيِّ وَالْوَصْلُ بِهَا....أَسَانِيدُ كُتُبِ وَأُصُولِ النَّشْرِ لِابْنِ الْجَزَرِيِّ وَالْوَصْلُ بِهَا....
أَسَانِيدُ كُتُبِ وَأُصُولِ النَّشْرِ لِابْنِ الْجَزَرِيِّ وَالْوَصْلُ بِهَا....
 
TUYỂN TẬP 20 ĐỀ THI KHẢO SÁT HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2020 (CÓ Đ...
TUYỂN TẬP 20 ĐỀ THI KHẢO SÁT HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2020 (CÓ Đ...TUYỂN TẬP 20 ĐỀ THI KHẢO SÁT HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2020 (CÓ Đ...
TUYỂN TẬP 20 ĐỀ THI KHẢO SÁT HỌC SINH GIỎI MÔN TIẾNG ANH LỚP 6 NĂM 2020 (CÓ Đ...
 

Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese-English Translation Using Dependency Structure

  • 1. Source Phrase Segmentation and Translation for Japanese to English Translation Using Dependency Structure Junki Matsuo, Kenichi Ohwada, and Mamoru Komachi Graduate School of System Design, Tokyo Metropolitan University, Japan We  propose  a  linguis.cally  mo.vated  approach   based  on  segmen.ng  a  source  phrase  using   dependency  structure  and  transla.ng  each  phrase   with  PBSMT.  This  work  presents  the  results  of  our   method  on  Japanese-­‐English  transla.on  and   discusses  poten.al  improvements.     ①  Abstract ②  Source  Phrase  Segmenta.on  and  Transla.on  Using  Dependency  Structure ④  Experimental  Results ⑤  Error  Analysis  (in  100  sentences) ③  Experimental  SeQngs Decoder  :  Moses  (2.1.1)  default  seQngs   Corpus  :  The  ASPEC  (3  million  parallel                                          sentences)   Segmenta.on  :  JUMAN  (7.0)   Alignment  :  GIZA++  (1.0.7)   MERT  :  Performed  on  the  full  dev-­‐set   Dependency  :  CaboCha(0.68)   Baseline  :  Standard  output     Our  method  only  uses  CaboCha  for  crea.ng   a  basic  frame  and  dependent  phrases.  (Not   segmenta.on)   Algorithm     1.  Our  method  creates  a  basic  frame  and   dependent  phrases.  (Basic  frame  :  broken  line   circle,  Dependent  phrases  :  solid  line  circle.)   2.  The  basic  frame  and  the  dependent  phrases   are  translated  by  a  decoder.   3.  The  anchor  words  of  the  basic  frame  are   replaced  with  the  transla.ons  of  the   corresponding  dependent  phrases.  (Anchor   words  :  the  yellow  under-­‐lined  words)   ⑥  Conclusion &  Future  Work An  example  of  the  second  method no-­‐seg&trans        … Standard  Moses  output   seg&trans        …        The  first  method   seg&trans        …        The  second  method   We  inves.gate  the  reason  why  BLEU  and  RIBES  fall   off.  The  number  of  the  pairs  of  input  and  reference   that  have  different  voices  (ac.ve  or  passive)  is  35.     We  observed  that  transla.on  of  dependent  phrases   is  the  most  frequent  error  type.    It  is  because  the   language  and  the  transla.on  models  are  not   op.mized  for  transla.ng  phrases.   •  Our  finding  is  that  our  proposed  methods   have  three  problems:  a  dependency   parsing,  transla.on  of  a  basic  frame  and   transla.on  of  dependent  phrases.     •  We  plan  to  op.mize  the  language  and  the   transla.on  models  for  transla.ng  phrases.   “雷撃比は等しい”   ↓   “The  equal  to  lightning  stroke  ra.o”   → Ungramma'cal  Sentence   Error  Example ・Transla.on  of  the  basic  frame   ・Transla.on  of  the  dependent  phrases   “解決を”   ↓   “We  solve”   → Not  an  NP   “DERSに”   ↓   “In  DERS”   →    Missing  context   An  example  of  the  first  method The  difference  between  the  first  and  second   methods  is  not  to  replace  a  preposi.on  that   follows  the  predicate  in  the  basic  frame.       Because  the  transla.on  of  dependent  phrases  in   our  first  method  uses  the  language  and   transla.on  models  op.mized  for  a  sentence,  our   first  method  might  not  be  able  to  use  a  model   op.mized  for  transla.ng  phrases.