SlideShare a Scribd company logo
1 of 25
Download to read offline
KyotoEBMT  System  
Descrip3on  for  the    
2nd  Workshop  on  Asian  Transla3on	
John Richardson Raj Dabre Chenhui Chu
Fabien Cromières Toshiaki Nakazawa Sadao Kurohashi
	
1
Outline	
•  Overview	
  of	
  the	
  system	
  
•  Improvements	
  since	
  WAT2014	
  
•  Results	
  for	
  WAT2015	
  
•  Conclusion	
  
2
Overview  of  Kyoto-­‐EBMT	
3
KyotoEBMT  Overview	
•  Example-­‐Based	
  MT	
  paradigm	
  
•  Need	
  parallel	
  corpus	
  
•  Few	
  language-­‐specific	
  assumpJons	
  
•  sJll	
  a	
  few	
  language-­‐specific	
  rules	
  
•  Tree-­‐to-­‐Tree	
  Machine	
  TranslaJon	
  
•  Maybe	
  the	
  least	
  commonly	
  used	
  variant	
  of	
  x-­‐to-­‐x	
  
•  SensiJve	
  to	
  parsing	
  quality	
  of	
  both	
  source	
  and	
  target	
  languages	
  
•  Maximize	
  the	
  chances	
  of	
  preserving	
  informaJon	
  
•  Dependency	
  trees	
  
•  Less	
  commonly	
  used	
  than	
  ConsJtuent	
  trees	
  
•  Most	
  natural	
  for	
  Japanese	
  
•  Should	
  contain	
  all	
  important	
  semanJc	
  informaJon	
  
4
KyotoEBMT  pipeline	
•  Somehow	
  classic	
  pipeline	
  
•  1-­‐	
  Preprocessing	
  of	
  the	
  parallel	
  corpus	
  
•  2-­‐	
  Processing	
  of	
  input	
  sentence	
  
•  3-­‐	
  Decoding/Tuning/Reranking	
  
•  Tuning	
  and	
  reranking	
  done	
  with	
  kbMira	
  
•  seems	
  to	
  work	
  beVer	
  than	
  PRO	
  for	
  us	
  
5
Other  specifici3es	
•  No	
  “phrase-­‐table”	
  
•  all	
  translaJon	
  rules	
  computed	
  on-­‐the-­‐fly	
  for	
  each	
  input	
  
•  cons:	
  
•  possibly	
  slower	
  (but	
  not	
  so	
  slow)	
  
•  compuJng	
  significance/	
  sparse	
  features	
  more	
  complicated	
  
•  pros:	
  
•  full-­‐context	
  available	
  for	
  compuJng	
  features	
  
•  no	
  limit	
  on	
  the	
  size	
  of	
  matched	
  rules	
  
•  possibility	
  to	
  output	
  perfect	
  translaJon	
  when	
  input	
  is	
  very	
  similar	
  to	
  an	
  example	
  
•  “Flexible”	
  translaJon	
  rules	
  
•  OpJonal	
  words	
  
•  AlternaJve	
  inserJon	
  posiJons	
  
•  Decoder	
  can	
  process	
  flexible	
  rules	
  more	
  efficiently	
  than	
  a	
  long	
  list	
  of	
  alternaJve	
  rules	
  
•  some	
  “flexible	
  rules”	
  may	
  actually	
  encode	
  >millions	
  of	
  “standard	
  rules”	
  
6
Flexible  Rules  Extracted  on-­‐the-­‐fly	
7	
プラスチック	
  
	
  (plasJc)	
は	
石油	
  
から	
製造	
さ	
れる	
例えば(for	
  
	
  example)	
  
the	
水素	
は	
現在	
天然ガス	
や	
石油	
から	
製造	
さ	
れる	
hydrogen	
is	
produced	
from	
natural	
gas	
and	
petroleum	
at	
present	
raw	
X(plasJc)	
is	
petroleum	
produced	
from	
Y(for	
  example)	
?	
Y(for	
  example)	
Y(for	
  example)	
raw*	
opJonal	
  	
  
word	
Input:	
Matched	
  Example:	
Flexible	
  translaJon	
  rule	
  
created	
  on-­‐the-­‐fly:	
X:	
  Simple	
  case	
  	
  
(X	
  has	
  an	
  equivalent	
  in	
  
the	
  source	
  example)	
Y:	
  ambiguous	
  
inserJon	
  posiJon	
“raw”:	
  null-­‐aligned	
  -­‐>	
  
opJonal	
(encode	
  many	
  translaJon	
  opJons	
  at	
  once)
Other  specifici3es	
•  No	
  “phrase-­‐table”	
  
•  all	
  translaJon	
  rules	
  computed	
  on-­‐the-­‐fly	
  for	
  each	
  input	
  
•  cons:	
  
•  possibly	
  slower	
  (but	
  not	
  so	
  slow)	
  
•  compuJng	
  significance/	
  sparse	
  features	
  more	
  complicated	
  
•  pros:	
  
•  full-­‐context	
  available	
  for	
  compuJng	
  features	
  
•  no	
  limit	
  on	
  the	
  size	
  of	
  matched	
  rules	
  
•  possibility	
  to	
  output	
  perfect	
  translaJon	
  when	
  input	
  is	
  very	
  similar	
  to	
  an	
  example	
  
•  “Flexible”	
  translaJon	
  rules	
  
•  OpJonal	
  words	
  
•  AlternaJve	
  inserJon	
  posiJons	
  
•  Decoder	
  can	
  process	
  flexible	
  rules	
  more	
  efficiently	
  than	
  a	
  long	
  list	
  of	
  alternaJve	
  rules	
  
•  some	
  “flexible	
  rules”	
  may	
  actually	
  encode	
  >millions	
  of	
  “standard	
  rules”	
  
8
Improvements  since  WAT2014	
9
KyotoEBMT  improvements	
•  Our	
  system	
  is	
  very	
  sensiJve	
  to	
  
parsing	
  errors	
  
•  ConJnuous	
  improvements	
  to	
  
•  Juman	
  
•  KNP	
  
•  SKP	
  
•  Added	
  support	
  for	
  parse	
  forests	
  	
  
•  (compact	
  representaJons)	
  
Forest	
  
parses	
10
Forest  Input	
•  A	
  parJal	
  soluJon	
  to	
  the	
  issues	
  of	
  Tree-­‐to-­‐Tree	
  MT	
  
•  Can	
  help	
  with	
  parsing	
  errors	
  
•  Can	
  help	
  with	
  syntacJc	
  divergences	
  
•  In	
  WAT2014,	
  
•  we	
  used	
  20-­‐best	
  input	
  parses	
  
•  n-­‐best	
  list	
  of	
  all	
  inputs	
  merged	
  and	
  reranked	
  
•  Now,	
  with	
  forest:	
  
•  an	
  exponenJal	
  number	
  of	
  input	
  parses	
  can	
  be	
  encoded	
  
•  the	
  selecJon	
  of	
  parses	
  is	
  done	
  during	
  decoding	
  
11
KyotoEBMT  improvements	
•  System	
  is	
  also	
  very	
  sensiJve	
  to	
  
alignment	
  errors	
  
•  We	
  used	
  to	
  correct	
  alignments	
  by	
  
using	
  dependency	
  trees	
  (Nakazawa	
  
and	
  Kurohashi,	
  2012)	
  
•  Now	
  we	
  further	
  improve	
  them	
  with	
  
Nile	
  (Riesa	
  et	
  al.,	
  2011)	
  	
  
Forest	
  
parses	
12
Alignment  Improvements	
•  Used	
  Nile	
  (Riesa	
  et	
  al.,	
  2011)	
  	
  to	
  improve	
  the	
  alignment	
  
•  As	
  suggested	
  by	
  (Neubig	
  and	
  Duh,	
  2014)	
  
•  Require	
  us	
  to	
  parse	
  into	
  consJtuent	
  trees	
  as	
  well	
  
•  Ckylark	
  parser	
  for	
  Japanese	
  (Oda+,	
  2015)	
  
•  Berkeley	
  Parser	
  for	
  Chinese/English	
  
•  Nile	
  becomes	
  the	
  third	
  element	
  of	
  an	
  alignment	
  pipeline	
  	
  
Giza++	
(Nakazawa	
  and	
  
Kurohashi,	
  2012)	
 Nile	
with	
  dependency	
  trees	
 with	
  consJtuent	
  trees	
13	
F:0.63	
 F:0.69	
 F:0.75	
JC	
  alignment	
  F	
  -­‐>	
(Giza++	
  /	
  Nile	
  only	
  -­‐>	
  F:0.70)
KyotoEBMT  improvements	
•  Many	
  small	
  improvements	
  
•  BeVer	
  handling	
  of	
  flexible	
  rules	
  
•  Bug	
  fixes	
  
•  10	
  new	
  features	
  
•  alignment	
  score	
  
•  context	
  similarity	
  score	
  based	
  on	
  
word2vec	
  vectors	
  
•  …	
  
Forest	
  
parses	
14
KyotoEBMT  improvements	
• Reranking	
  
•  Previously	
  used	
  features:	
  
•  7-­‐gram	
  language	
  model	
  
•  RNNLM	
  language	
  model	
  
•  Now	
  also	
  using	
  a	
  Neural	
  MT	
  based	
  
bilingual	
  Language	
  Model	
  	
  
Forest	
  
parses	
15
Bilingual  Neural  Network  Language  Model	
•  Combine	
  Neural	
  MT	
  with	
  EBMT	
  
•  We	
  use	
  the	
  state-­‐of-­‐the-­‐art	
  model	
  described	
  by	
  (Bahdanau	
  et	
  al.,	
  2015)	
  
•  Model	
  seen	
  as	
  a	
  Language	
  Model	
  condiJonalized	
  on	
  the	
  input	
  
•  Remarks:	
  
•  Processing	
  Japanese	
  and	
  Chinese	
  as	
  sequences	
  of	
  characters	
  gave	
  good	
  results	
  
•  No	
  need	
  to	
  limit	
  vocabulary	
  (~4000/6000	
  characters	
  for	
  J/C)	
  
•  Avoid	
  segmentaJon	
  issues	
  
•  Faster	
  training	
  
•  Neural	
  MT	
  models	
  alone	
  produced	
  bad	
  translaJons	
  
•  eg.	
  Character	
  BLEU	
  for	
  C-­‐>J	
  almost	
  half	
  that	
  of	
  KyotoEBMT	
  
•  Reranking	
  performances	
  saturates	
  before	
  MT	
  performances	
16	
0	
  
10	
  
20	
  
30	
  
40	
  
50	
  
60	
  
10	
   20	
   40	
  
Reranked	
  BLEU/	
  NeuralMT	
  
char-­‐BLEU	
  vs	
  Epochs	
  for	
  J-­‐>C	
Neural	
  MT	
  cBLEU	
   reranked	
  BLEU	
  
KyotoEBMT	
  cBLEU	
  
KyotoEBMT  improvements	
•  Improved	
  working	
  methods	
  
(that	
  maVers!)	
  
•  automaJc	
  nightly	
  tesJng	
  for	
  
variaJons	
  in	
  BLEU/	
  asserJon	
  errors/	
  
memory	
  leaks	
  
•  Overall	
  improvements	
  across	
  all	
  
the	
  pipeline	
  
•  EsJmaJng	
  the	
  global	
  
contribuJon	
  of	
  each	
  element	
  is	
  
tough,	
  but	
  here	
  are	
  the	
  final	
  
results,	
  …	
  	
  
Forest	
  
parses	
17
Results	
18
Results  for  WAT2015	
Reranking	
 BLEU	
 RIBES	
 HUMAN	
J-­‐>E	
NO	
 21.31	
  (+0.71)	
 70.65	
  (+0.53)	
 16.50	
YES	
 22.89	
  (+1.82)	
 72.46	
  (+2.56)	
 32.50	
E-­‐>J	
NO	
   30.69	
  (+0.92)	
 76.78	
  (+1.57)	
 40.50	
YES	
 33.06	
  (+1.97)	
 78.95	
  (+2.99)	
 51.00	
J-­‐>C	
NO	
 29.99	
  (+2.78)	
 80.71	
  (+1.58)	
 16.00	
YES	
 31.40	
  (+3.83)	
 82.70	
  (+3.87)	
 12.50	
C-­‐>J	
NO	
 36.30	
  (+2.73)	
 81.97	
  (+1.87)	
 16.75	
YES	
 38.53	
  (+3.78)	
 84.07	
  (+3.81)	
 18.50	
19
Results  for  WAT2015	
Reranking	
 BLEU	
 RIBES	
 HUMAN	
J-­‐>E	
NO	
 21.31	
  (+0.71)	
 70.65	
  (+0.53)	
 16.50	
YES	
 22.89	
  (+1.82)	
 72.46	
  (+2.56)	
 32.50	
E-­‐>J	
NO	
   30.69	
  (+0.92)	
 76.78	
  (+1.57)	
 40.50	
YES	
 33.06	
  (+1.97)	
 78.95	
  (+2.99)	
 51.00	
J-­‐>C	
NO	
 29.99	
  (+2.78)	
 80.71	
  (+1.58)	
 16.00	
YES	
 31.40	
  (+3.83)	
 82.70	
  (+3.87)	
 12.50	
C-­‐>J	
NO	
 36.30	
  (+2.73)	
 81.97	
  (+1.87)	
 16.75	
YES	
 38.53	
  (+3.78)	
 84.07	
  (+3.81)	
 18.50	
+0.71	
+1.82	
+0.92	
+1.97	
+2.78	
+3.83	
+2.73	
+3.78	
The	
  various	
  improvements	
  
lead	
  to	
  good	
  changes	
  in	
  BLEU.	
  
Almost	
  +4	
  BLEU	
  for	
  the	
  JC/CJ	
20
Results  for  WAT2015	
Reranking	
 BLEU	
 RIBES	
 HUMAN	
J-­‐>E	
NO	
 21.31	
  (+0.71)	
 70.65	
  (+0.53)	
 16.50	
YES	
 22.89	
  (+1.82)	
 72.46	
  (+2.56)	
 32.50	
E-­‐>J	
NO	
   30.69	
  (+0.92)	
 76.78	
  (+1.57)	
 40.50	
YES	
 33.06	
  (+1.97)	
 78.95	
  (+2.99)	
 51.00	
J-­‐>C	
NO	
 29.99	
  (+2.78)	
 80.71	
  (+1.58)	
 16.00	
YES	
 31.40	
  (+3.83)	
 82.70	
  (+3.87)	
 12.50	
C-­‐>J	
NO	
 36.30	
  (+2.73)	
 81.97	
  (+1.87)	
 16.75	
YES	
 38.53	
  (+3.78)	
 84.07	
  (+3.81)	
 18.50	
Mystery!	
  
Only	
  for	
  J-­‐>C,	
  we	
  find	
  that	
  reranking	
  decreased	
  
Human	
  EvaluaJon	
  score.	
  
(While	
  sJll	
  improving	
  BLEU/RIBES)	
21
Results  for  WAT2015	
Reranking	
 BLEU	
 RIBES	
 HUMAN	
J-­‐>E	
NO	
 21.31	
  (+0.71)	
 70.65	
  (+0.53)	
 16.50	
YES	
 22.89	
  (+1.82)	
 72.46	
  (+2.56)	
 32.50	
E-­‐>J	
NO	
   30.69	
  (+0.92)	
 76.78	
  (+1.57)	
 40.50	
YES	
 33.06	
  (+1.97)	
 78.95	
  (+2.99)	
 51.00	
J-­‐>C	
NO	
 29.99	
  (+2.78)	
 80.71	
  (+1.58)	
 16.00	
YES	
 31.40	
  (+3.83)	
 82.70	
  (+3.87)	
 12.50	
C-­‐>J	
NO	
 36.30	
  (+2.73)	
 81.97	
  (+1.87)	
 16.75	
YES	
 38.53	
  (+3.78)	
 84.07	
  (+3.81)	
 18.50	
22
Code  is  available  and  Open-­‐sourced	
•  Version	
  1.0	
  released	
  
•  1	
  year	
  arer	
  version	
  0.1	
  
•  2	
  years	
  arer	
  development	
  started	
  
•  Downloadable	
  at:	
  hVp://nlp.ist.i.kyoto-­‐u.ac.jp/kyotoebmt/	
  
•  GPL	
  Licence	
23
Conclusion	
•  KyotoEBMT	
  is	
  a	
  (Dependency)	
  Tree-­‐to-­‐Tree	
  MT	
  system	
  with	
  state-­‐of-­‐
the-­‐art	
  results	
  
•  Open-­‐sourced	
  (hVp://nlp.ist.i.kyoto-­‐u.ac.jp/kyotoebmt/)	
  
•  Improvements	
  across	
  the	
  whole	
  pipeline	
  lead	
  us	
  to	
  close	
  to	
  +4	
  BLEU	
  
improvements	
  
•  Some	
  future	
  works:	
  
•  Make	
  more	
  use	
  of	
  the	
  target	
  structure	
  
•  Use	
  of	
  deep	
  learning	
  features	
  in	
  the	
  decoder	
  
•  eg.	
  as	
  in	
  (Devlin	
  et	
  al.,	
  2014)	
  
•  …	
  
24
Thank  you!	
25

More Related Content

Viewers also liked

Construisons ensemble le chatbot bancaire dedemain !
Construisons ensemble le chatbot bancaire dedemain !Construisons ensemble le chatbot bancaire dedemain !
Construisons ensemble le chatbot bancaire dedemain !LINAGORA
 
Hackathon 2014 NLP Hack
Hackathon 2014 NLP HackHackathon 2014 NLP Hack
Hackathon 2014 NLP HackRoelof Pieters
 
Deep Learning in practice : Speech recognition and beyond - Meetup
Deep Learning in practice : Speech recognition and beyond - MeetupDeep Learning in practice : Speech recognition and beyond - Meetup
Deep Learning in practice : Speech recognition and beyond - MeetupLINAGORA
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageRoelof Pieters
 
Advanced Node.JS Meetup
Advanced Node.JS MeetupAdvanced Node.JS Meetup
Advanced Node.JS MeetupLINAGORA
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalBhaskar Mitra
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
Neural Models for Document Ranking
Neural Models for Document RankingNeural Models for Document Ranking
Neural Models for Document RankingBhaskar Mitra
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
State of Blockchain 2017: Smartnetworks and the Blockchain Economy
State of Blockchain 2017:  Smartnetworks and the Blockchain EconomyState of Blockchain 2017:  Smartnetworks and the Blockchain Economy
State of Blockchain 2017: Smartnetworks and the Blockchain EconomyMelanie Swan
 
Exploring Session Context using Distributed Representations of Queries and Re...
Exploring Session Context using Distributed Representations of Queries and Re...Exploring Session Context using Distributed Representations of Queries and Re...
Exploring Session Context using Distributed Representations of Queries and Re...Bhaskar Mitra
 
Deep Learning for Chatbot (3/4)
Deep Learning for Chatbot (3/4)Deep Learning for Chatbot (3/4)
Deep Learning for Chatbot (3/4)Jaemin Cho
 
Recommender Systems, Matrices and Graphs
Recommender Systems, Matrices and GraphsRecommender Systems, Matrices and Graphs
Recommender Systems, Matrices and GraphsRoelof Pieters
 
Blockchain Economic Theory
Blockchain Economic TheoryBlockchain Economic Theory
Blockchain Economic TheoryMelanie Swan
 
Deep Learning for Chatbot (1/4)
Deep Learning for Chatbot (1/4)Deep Learning for Chatbot (1/4)
Deep Learning for Chatbot (1/4)Jaemin Cho
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Association for Computational Linguistics
 
Deep Learning Explained
Deep Learning ExplainedDeep Learning Explained
Deep Learning ExplainedMelanie Swan
 

Viewers also liked (20)

Construisons ensemble le chatbot bancaire dedemain !
Construisons ensemble le chatbot bancaire dedemain !Construisons ensemble le chatbot bancaire dedemain !
Construisons ensemble le chatbot bancaire dedemain !
 
Hackathon 2014 NLP Hack
Hackathon 2014 NLP HackHackathon 2014 NLP Hack
Hackathon 2014 NLP Hack
 
Deep Learning in practice : Speech recognition and beyond - Meetup
Deep Learning in practice : Speech recognition and beyond - MeetupDeep Learning in practice : Speech recognition and beyond - Meetup
Deep Learning in practice : Speech recognition and beyond - Meetup
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on Language
 
Advanced Node.JS Meetup
Advanced Node.JS MeetupAdvanced Node.JS Meetup
Advanced Node.JS Meetup
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information Retrieval
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Neural Models for Document Ranking
Neural Models for Document RankingNeural Models for Document Ranking
Neural Models for Document Ranking
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
State of Blockchain 2017: Smartnetworks and the Blockchain Economy
State of Blockchain 2017:  Smartnetworks and the Blockchain EconomyState of Blockchain 2017:  Smartnetworks and the Blockchain Economy
State of Blockchain 2017: Smartnetworks and the Blockchain Economy
 
Exploring Session Context using Distributed Representations of Queries and Re...
Exploring Session Context using Distributed Representations of Queries and Re...Exploring Session Context using Distributed Representations of Queries and Re...
Exploring Session Context using Distributed Representations of Queries and Re...
 
Deep Learning for Chatbot (3/4)
Deep Learning for Chatbot (3/4)Deep Learning for Chatbot (3/4)
Deep Learning for Chatbot (3/4)
 
Recommender Systems, Matrices and Graphs
Recommender Systems, Matrices and GraphsRecommender Systems, Matrices and Graphs
Recommender Systems, Matrices and Graphs
 
Blockchain Economic Theory
Blockchain Economic TheoryBlockchain Economic Theory
Blockchain Economic Theory
 
Deep Learning for Chatbot (1/4)
Deep Learning for Chatbot (1/4)Deep Learning for Chatbot (1/4)
Deep Learning for Chatbot (1/4)
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
 
Deep Learning Explained
Deep Learning ExplainedDeep Learning Explained
Deep Learning Explained
 

Similar to John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on Asian Translation

Introduction to OpenSees by Frank McKenna
Introduction to OpenSees by Frank McKennaIntroduction to OpenSees by Frank McKenna
Introduction to OpenSees by Frank McKennaopenseesdays
 
The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning Jeff Z. Pan
 
Java Thread and Process Performance for Parallel Machine Learning on Multicor...
Java Thread and Process Performance for Parallel Machine Learning on Multicor...Java Thread and Process Performance for Parallel Machine Learning on Multicor...
Java Thread and Process Performance for Parallel Machine Learning on Multicor...Saliya Ekanayake
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...Oscar Corcho
 
MODELS 2019: Querying and annotating model histories with time-aware patterns
MODELS 2019: Querying and annotating model histories with time-aware patternsMODELS 2019: Querying and annotating model histories with time-aware patterns
MODELS 2019: Querying and annotating model histories with time-aware patternsAntonio García-Domínguez
 
Data Stream Management
Data Stream ManagementData Stream Management
Data Stream Managementk_tauhid
 
OpenDiscovery
OpenDiscoveryOpenDiscovery
OpenDiscoverygwprice
 
20110524zurichngs 2nd pub
20110524zurichngs 2nd pub20110524zurichngs 2nd pub
20110524zurichngs 2nd pubsesejun
 
Go from a PHP Perspective
Go from a PHP PerspectiveGo from a PHP Perspective
Go from a PHP PerspectiveBarry Jones
 
Paper Study - Demand-Driven Computation of Interprocedural Data Flow
Paper Study - Demand-Driven Computation of Interprocedural Data FlowPaper Study - Demand-Driven Computation of Interprocedural Data Flow
Paper Study - Demand-Driven Computation of Interprocedural Data FlowMin-Yih Hsu
 
Juman++ v2: A Practical and Modern Morphological Analyzer
Juman++ v2: A Practical and Modern Morphological AnalyzerJuman++ v2: A Practical and Modern Morphological Analyzer
Juman++ v2: A Practical and Modern Morphological Analyzereiennohito
 
H2O World - What's New in H2O with Cliff Click
H2O World - What's New in H2O with Cliff ClickH2O World - What's New in H2O with Cliff Click
H2O World - What's New in H2O with Cliff ClickSri Ambati
 
Queuing theory and traffic analysis in depth
Queuing theory and traffic analysis in depthQueuing theory and traffic analysis in depth
Queuing theory and traffic analysis in depthIdcIdk1
 
Scalable up genomic analysis with ADAM
Scalable up genomic analysis with ADAMScalable up genomic analysis with ADAM
Scalable up genomic analysis with ADAMfnothaft
 
Resilience at exascale
Resilience at exascaleResilience at exascale
Resilience at exascaleMarc Snir
 

Similar to John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on Asian Translation (20)

Introduction to OpenSees by Frank McKenna
Introduction to OpenSees by Frank McKennaIntroduction to OpenSees by Frank McKenna
Introduction to OpenSees by Frank McKenna
 
The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning
 
Java Thread and Process Performance for Parallel Machine Learning on Multicor...
Java Thread and Process Performance for Parallel Machine Learning on Multicor...Java Thread and Process Performance for Parallel Machine Learning on Multicor...
Java Thread and Process Performance for Parallel Machine Learning on Multicor...
 
Searching Algorithms
Searching AlgorithmsSearching Algorithms
Searching Algorithms
 
N20190729
N20190729N20190729
N20190729
 
Streams in Java 8
Streams in Java 8Streams in Java 8
Streams in Java 8
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
 
MODELS 2019: Querying and annotating model histories with time-aware patterns
MODELS 2019: Querying and annotating model histories with time-aware patternsMODELS 2019: Querying and annotating model histories with time-aware patterns
MODELS 2019: Querying and annotating model histories with time-aware patterns
 
Data Stream Management
Data Stream ManagementData Stream Management
Data Stream Management
 
OpenDiscovery
OpenDiscoveryOpenDiscovery
OpenDiscovery
 
20110524zurichngs 2nd pub
20110524zurichngs 2nd pub20110524zurichngs 2nd pub
20110524zurichngs 2nd pub
 
Go from a PHP Perspective
Go from a PHP PerspectiveGo from a PHP Perspective
Go from a PHP Perspective
 
Paper Study - Demand-Driven Computation of Interprocedural Data Flow
Paper Study - Demand-Driven Computation of Interprocedural Data FlowPaper Study - Demand-Driven Computation of Interprocedural Data Flow
Paper Study - Demand-Driven Computation of Interprocedural Data Flow
 
Juman++ v2: A Practical and Modern Morphological Analyzer
Juman++ v2: A Practical and Modern Morphological AnalyzerJuman++ v2: A Practical and Modern Morphological Analyzer
Juman++ v2: A Practical and Modern Morphological Analyzer
 
TINET_FRnOG_2008_public
TINET_FRnOG_2008_publicTINET_FRnOG_2008_public
TINET_FRnOG_2008_public
 
Temporal Segment Network
Temporal Segment NetworkTemporal Segment Network
Temporal Segment Network
 
H2O World - What's New in H2O with Cliff Click
H2O World - What's New in H2O with Cliff ClickH2O World - What's New in H2O with Cliff Click
H2O World - What's New in H2O with Cliff Click
 
Queuing theory and traffic analysis in depth
Queuing theory and traffic analysis in depthQueuing theory and traffic analysis in depth
Queuing theory and traffic analysis in depth
 
Scalable up genomic analysis with ADAM
Scalable up genomic analysis with ADAMScalable up genomic analysis with ADAM
Scalable up genomic analysis with ADAM
 
Resilience at exascale
Resilience at exascaleResilience at exascale
Resilience at exascale
 

More from Association for Computational Linguistics

Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Association for Computational Linguistics
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Association for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Association for Computational Linguistics
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Association for Computational Linguistics
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Association for Computational Linguistics
 
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian TranslationToshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian TranslationAssociation for Computational Linguistics
 
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemHua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemAssociation for Computational Linguistics
 
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Association for Computational Linguistics
 
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Association for Computational Linguistics
 

More from Association for Computational Linguistics (20)

Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal TextMuis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
 
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
 
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour AnalysisCastro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text SimplificationElior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
 
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian TranslationToshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
 
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemHua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
 
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
 
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
 

Recently uploaded

Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 

Recently uploaded (20)

Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 

John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on Asian Translation

  • 1. KyotoEBMT  System   Descrip3on  for  the     2nd  Workshop  on  Asian  Transla3on John Richardson Raj Dabre Chenhui Chu Fabien Cromières Toshiaki Nakazawa Sadao Kurohashi 1
  • 2. Outline •  Overview  of  the  system   •  Improvements  since  WAT2014   •  Results  for  WAT2015   •  Conclusion   2
  • 4. KyotoEBMT  Overview •  Example-­‐Based  MT  paradigm   •  Need  parallel  corpus   •  Few  language-­‐specific  assumpJons   •  sJll  a  few  language-­‐specific  rules   •  Tree-­‐to-­‐Tree  Machine  TranslaJon   •  Maybe  the  least  commonly  used  variant  of  x-­‐to-­‐x   •  SensiJve  to  parsing  quality  of  both  source  and  target  languages   •  Maximize  the  chances  of  preserving  informaJon   •  Dependency  trees   •  Less  commonly  used  than  ConsJtuent  trees   •  Most  natural  for  Japanese   •  Should  contain  all  important  semanJc  informaJon   4
  • 5. KyotoEBMT  pipeline •  Somehow  classic  pipeline   •  1-­‐  Preprocessing  of  the  parallel  corpus   •  2-­‐  Processing  of  input  sentence   •  3-­‐  Decoding/Tuning/Reranking   •  Tuning  and  reranking  done  with  kbMira   •  seems  to  work  beVer  than  PRO  for  us   5
  • 6. Other  specifici3es •  No  “phrase-­‐table”   •  all  translaJon  rules  computed  on-­‐the-­‐fly  for  each  input   •  cons:   •  possibly  slower  (but  not  so  slow)   •  compuJng  significance/  sparse  features  more  complicated   •  pros:   •  full-­‐context  available  for  compuJng  features   •  no  limit  on  the  size  of  matched  rules   •  possibility  to  output  perfect  translaJon  when  input  is  very  similar  to  an  example   •  “Flexible”  translaJon  rules   •  OpJonal  words   •  AlternaJve  inserJon  posiJons   •  Decoder  can  process  flexible  rules  more  efficiently  than  a  long  list  of  alternaJve  rules   •  some  “flexible  rules”  may  actually  encode  >millions  of  “standard  rules”   6
  • 7. Flexible  Rules  Extracted  on-­‐the-­‐fly 7 プラスチック    (plasJc) は 石油   から 製造 さ れる 例えば(for    example)   the 水素 は 現在 天然ガス や 石油 から 製造 さ れる hydrogen is produced from natural gas and petroleum at present raw X(plasJc) is petroleum produced from Y(for  example) ? Y(for  example) Y(for  example) raw* opJonal     word Input: Matched  Example: Flexible  translaJon  rule   created  on-­‐the-­‐fly: X:  Simple  case     (X  has  an  equivalent  in   the  source  example) Y:  ambiguous   inserJon  posiJon “raw”:  null-­‐aligned  -­‐>   opJonal (encode  many  translaJon  opJons  at  once)
  • 8. Other  specifici3es •  No  “phrase-­‐table”   •  all  translaJon  rules  computed  on-­‐the-­‐fly  for  each  input   •  cons:   •  possibly  slower  (but  not  so  slow)   •  compuJng  significance/  sparse  features  more  complicated   •  pros:   •  full-­‐context  available  for  compuJng  features   •  no  limit  on  the  size  of  matched  rules   •  possibility  to  output  perfect  translaJon  when  input  is  very  similar  to  an  example   •  “Flexible”  translaJon  rules   •  OpJonal  words   •  AlternaJve  inserJon  posiJons   •  Decoder  can  process  flexible  rules  more  efficiently  than  a  long  list  of  alternaJve  rules   •  some  “flexible  rules”  may  actually  encode  >millions  of  “standard  rules”   8
  • 10. KyotoEBMT  improvements •  Our  system  is  very  sensiJve  to   parsing  errors   •  ConJnuous  improvements  to   •  Juman   •  KNP   •  SKP   •  Added  support  for  parse  forests     •  (compact  representaJons)   Forest   parses 10
  • 11. Forest  Input •  A  parJal  soluJon  to  the  issues  of  Tree-­‐to-­‐Tree  MT   •  Can  help  with  parsing  errors   •  Can  help  with  syntacJc  divergences   •  In  WAT2014,   •  we  used  20-­‐best  input  parses   •  n-­‐best  list  of  all  inputs  merged  and  reranked   •  Now,  with  forest:   •  an  exponenJal  number  of  input  parses  can  be  encoded   •  the  selecJon  of  parses  is  done  during  decoding   11
  • 12. KyotoEBMT  improvements •  System  is  also  very  sensiJve  to   alignment  errors   •  We  used  to  correct  alignments  by   using  dependency  trees  (Nakazawa   and  Kurohashi,  2012)   •  Now  we  further  improve  them  with   Nile  (Riesa  et  al.,  2011)     Forest   parses 12
  • 13. Alignment  Improvements •  Used  Nile  (Riesa  et  al.,  2011)    to  improve  the  alignment   •  As  suggested  by  (Neubig  and  Duh,  2014)   •  Require  us  to  parse  into  consJtuent  trees  as  well   •  Ckylark  parser  for  Japanese  (Oda+,  2015)   •  Berkeley  Parser  for  Chinese/English   •  Nile  becomes  the  third  element  of  an  alignment  pipeline     Giza++ (Nakazawa  and   Kurohashi,  2012) Nile with  dependency  trees with  consJtuent  trees 13 F:0.63 F:0.69 F:0.75 JC  alignment  F  -­‐> (Giza++  /  Nile  only  -­‐>  F:0.70)
  • 14. KyotoEBMT  improvements •  Many  small  improvements   •  BeVer  handling  of  flexible  rules   •  Bug  fixes   •  10  new  features   •  alignment  score   •  context  similarity  score  based  on   word2vec  vectors   •  …   Forest   parses 14
  • 15. KyotoEBMT  improvements • Reranking   •  Previously  used  features:   •  7-­‐gram  language  model   •  RNNLM  language  model   •  Now  also  using  a  Neural  MT  based   bilingual  Language  Model     Forest   parses 15
  • 16. Bilingual  Neural  Network  Language  Model •  Combine  Neural  MT  with  EBMT   •  We  use  the  state-­‐of-­‐the-­‐art  model  described  by  (Bahdanau  et  al.,  2015)   •  Model  seen  as  a  Language  Model  condiJonalized  on  the  input   •  Remarks:   •  Processing  Japanese  and  Chinese  as  sequences  of  characters  gave  good  results   •  No  need  to  limit  vocabulary  (~4000/6000  characters  for  J/C)   •  Avoid  segmentaJon  issues   •  Faster  training   •  Neural  MT  models  alone  produced  bad  translaJons   •  eg.  Character  BLEU  for  C-­‐>J  almost  half  that  of  KyotoEBMT   •  Reranking  performances  saturates  before  MT  performances 16 0   10   20   30   40   50   60   10   20   40   Reranked  BLEU/  NeuralMT   char-­‐BLEU  vs  Epochs  for  J-­‐>C Neural  MT  cBLEU   reranked  BLEU   KyotoEBMT  cBLEU  
  • 17. KyotoEBMT  improvements •  Improved  working  methods   (that  maVers!)   •  automaJc  nightly  tesJng  for   variaJons  in  BLEU/  asserJon  errors/   memory  leaks   •  Overall  improvements  across  all   the  pipeline   •  EsJmaJng  the  global   contribuJon  of  each  element  is   tough,  but  here  are  the  final   results,  …     Forest   parses 17
  • 19. Results  for  WAT2015 Reranking BLEU RIBES HUMAN J-­‐>E NO 21.31  (+0.71) 70.65  (+0.53) 16.50 YES 22.89  (+1.82) 72.46  (+2.56) 32.50 E-­‐>J NO   30.69  (+0.92) 76.78  (+1.57) 40.50 YES 33.06  (+1.97) 78.95  (+2.99) 51.00 J-­‐>C NO 29.99  (+2.78) 80.71  (+1.58) 16.00 YES 31.40  (+3.83) 82.70  (+3.87) 12.50 C-­‐>J NO 36.30  (+2.73) 81.97  (+1.87) 16.75 YES 38.53  (+3.78) 84.07  (+3.81) 18.50 19
  • 20. Results  for  WAT2015 Reranking BLEU RIBES HUMAN J-­‐>E NO 21.31  (+0.71) 70.65  (+0.53) 16.50 YES 22.89  (+1.82) 72.46  (+2.56) 32.50 E-­‐>J NO   30.69  (+0.92) 76.78  (+1.57) 40.50 YES 33.06  (+1.97) 78.95  (+2.99) 51.00 J-­‐>C NO 29.99  (+2.78) 80.71  (+1.58) 16.00 YES 31.40  (+3.83) 82.70  (+3.87) 12.50 C-­‐>J NO 36.30  (+2.73) 81.97  (+1.87) 16.75 YES 38.53  (+3.78) 84.07  (+3.81) 18.50 +0.71 +1.82 +0.92 +1.97 +2.78 +3.83 +2.73 +3.78 The  various  improvements   lead  to  good  changes  in  BLEU.   Almost  +4  BLEU  for  the  JC/CJ 20
  • 21. Results  for  WAT2015 Reranking BLEU RIBES HUMAN J-­‐>E NO 21.31  (+0.71) 70.65  (+0.53) 16.50 YES 22.89  (+1.82) 72.46  (+2.56) 32.50 E-­‐>J NO   30.69  (+0.92) 76.78  (+1.57) 40.50 YES 33.06  (+1.97) 78.95  (+2.99) 51.00 J-­‐>C NO 29.99  (+2.78) 80.71  (+1.58) 16.00 YES 31.40  (+3.83) 82.70  (+3.87) 12.50 C-­‐>J NO 36.30  (+2.73) 81.97  (+1.87) 16.75 YES 38.53  (+3.78) 84.07  (+3.81) 18.50 Mystery!   Only  for  J-­‐>C,  we  find  that  reranking  decreased   Human  EvaluaJon  score.   (While  sJll  improving  BLEU/RIBES) 21
  • 22. Results  for  WAT2015 Reranking BLEU RIBES HUMAN J-­‐>E NO 21.31  (+0.71) 70.65  (+0.53) 16.50 YES 22.89  (+1.82) 72.46  (+2.56) 32.50 E-­‐>J NO   30.69  (+0.92) 76.78  (+1.57) 40.50 YES 33.06  (+1.97) 78.95  (+2.99) 51.00 J-­‐>C NO 29.99  (+2.78) 80.71  (+1.58) 16.00 YES 31.40  (+3.83) 82.70  (+3.87) 12.50 C-­‐>J NO 36.30  (+2.73) 81.97  (+1.87) 16.75 YES 38.53  (+3.78) 84.07  (+3.81) 18.50 22
  • 23. Code  is  available  and  Open-­‐sourced •  Version  1.0  released   •  1  year  arer  version  0.1   •  2  years  arer  development  started   •  Downloadable  at:  hVp://nlp.ist.i.kyoto-­‐u.ac.jp/kyotoebmt/   •  GPL  Licence 23
  • 24. Conclusion •  KyotoEBMT  is  a  (Dependency)  Tree-­‐to-­‐Tree  MT  system  with  state-­‐of-­‐ the-­‐art  results   •  Open-­‐sourced  (hVp://nlp.ist.i.kyoto-­‐u.ac.jp/kyotoebmt/)   •  Improvements  across  the  whole  pipeline  lead  us  to  close  to  +4  BLEU   improvements   •  Some  future  works:   •  Make  more  use  of  the  target  structure   •  Use  of  deep  learning  features  in  the  decoder   •  eg.  as  in  (Devlin  et  al.,  2014)   •  …   24