SlideShare a Scribd company logo
1 of 75
Download to read offline
Translation Memory Retrieval Methods
[Bloodgood and Strauss, 2014] in Proc of 14th EACL
Koichi Akabe and Philip Arthur
NAIST MT Study
2014-07-03
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 1 / 27
Introduction
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 2 / 27
Translation Memory (TM)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
Translation Memory (TM)
▶ Most widely used computer-assisted translation (CAT) tool
▶ Suggest translations using other translations
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
Translation Memory (TM)
▶ Most widely used computer-assisted translation (CAT) tool
▶ Suggest translations using other translations
En The dog opened the door.
Ja 犬がドアを開けた。
En I saw a girl with a telescope.
Ja 僕は望遠鏡で少女を見た。
En John opened the door.
Ja
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
Translation Memory (TM)
▶ Most widely used computer-assisted translation (CAT) tool
▶ Suggest translations using other translations
En The dog opened the door.
Ja 犬がドアを開けた。
En I saw a girl with a telescope.
Ja 僕は望遠鏡で少女を見た。
En John opened the door.
Ja
1. Find the nearest source sentence
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
Translation Memory (TM)
▶ Most widely used computer-assisted translation (CAT) tool
▶ Suggest translations using other translations
En The dog opened the door.
Ja 犬がドアを開けた。
En I saw a girl with a telescope.
Ja 僕は望遠鏡で少女を見た。
En John opened the door.
Ja 犬がドアを開けた。 (fuzzy)
1. Find the nearest source sentence
2. Suggest a translation
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
Translation Memory (TM)
▶ Most widely used computer-assisted translation (CAT) tool
▶ Suggest translations using other translations
En The dog opened the door.
Ja 犬がドアを開けた。
En I saw a girl with a telescope.
Ja 僕は望遠鏡で少女を見た。
En John opened the door.
Ja 犬がドアを開けた。 (fuzzy)
1. Find the nearest source sentence
2. Suggest a translation
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
Translation Memory (TM)
▶ Most widely used computer-assisted translation (CAT) tool
▶ Suggest translations using other translations
En The dog opened the door.
Ja 犬がドアを開けた。
En I saw a girl with a telescope.
Ja 僕は望遠鏡で少女を見た。
En John opened the door.
Ja ジョンがドアを開けた。
1. Find the nearest source sentence
2. Suggest a translation
3. Post-editing
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
How to find the nearest source sentence?
TM finds the nearest source sentence using similarity metrics
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27
How to find the nearest source sentence?
TM finds the nearest source sentence using similarity metrics
▶ Edit distance (Leven-shtein distance)
−→ Widely used metric
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27
How to find the nearest source sentence?
TM finds the nearest source sentence using similarity metrics
▶ Edit distance (Leven-shtein distance)
−→ Widely used metric
▶ MT evaluation metrics [Simard and Fujita, 2012]
−→ WER, BLEU, NIST, VMeteor, Meteor as TM metrics
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27
How to find the nearest source sentence?
TM finds the nearest source sentence using similarity metrics
▶ Edit distance (Leven-shtein distance)
−→ Widely used metric
▶ MT evaluation metrics [Simard and Fujita, 2012]
−→ WER, BLEU, NIST, VMeteor, Meteor as TM metrics
▶ This paper
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27
Threshold of helpfulness
Matching algorithm always returns the nearest sentence
However, low score suggestions should not be shown
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27
Threshold of helpfulness
Matching algorithm always returns the nearest sentence
However, low score suggestions should not be shown
TM softwares set the threshold at 70% in practice
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27
Threshold of helpfulness
Matching algorithm always returns the nearest sentence
However, low score suggestions should not be shown
TM softwares set the threshold at 70% in practice −→ Why?
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27
Translation Memory Similarity Metrics
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 6 / 27
Definitions
TM Similarity Metrics compare M and C.
M: workload sentence
C: source language side of a candidate pre-existing translation
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 7 / 27
Definitions
TM Similarity Metrics compare M and C.
M: workload sentence
C: source language side of a candidate pre-existing translation
En The dog opened the door .
Ja 犬がドアを開けた。
En I saw a girl with a telescope .
Ja 僕は望遠鏡で少女を見た。
En John opened the door .
Ja 犬がドアを開けた。 (fuzzy)
M =John opened the door .
C1 =The dog opened the door .
C2 =I saw a girl with a telescope .
...
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 7 / 27
Translation Memory Similarity Metrics
Compare the following metrics:
▶ Percent Match
▶ Weighted Percent Match
▶ Edit Distance
▶ N-gram Precision
▶ Weighted N-gram Precision
▶ Modified Weighted N-gram Precision
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 8 / 27
Percent Match (PM)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27
Percent Match (PM)
The simplest metric
PM(M, C) =
|Munigrams ∩ Cunigrams|
|Munigrams|
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27
Percent Match (PM)
The simplest metric
PM(M, C) =
|Munigrams ∩ Cunigrams|
|Munigrams|
e.g.
M =John opened the door .
C =The dog opened the door .
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27
Percent Match (PM)
The simplest metric
PM(M, C) =
|Munigrams ∩ Cunigrams|
|Munigrams|
e.g.
M =John opened the door .
C =The dog opened the door .
PM(M, C) =
4
5
= 0.80
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27
Weighted Percent Match (WPM)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27
Weighted Percent Match (WPM)
We want to know translation of rare words
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27
Weighted Percent Match (WPM)
We want to know translation of rare words
PM with IDF weighting
WPM(M, C) =
∑
u∈{Munigrams∩Cunigrams}
idf(u, D)
∑
u∈Munigrams
idf(u, D)
where D is a set of all source sentences in the parallel corpus
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27
Problem of PM and WPM
PM and WPM only consider coverage of words
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27
Problem of PM and WPM
PM and WPM only consider coverage of words
−→ They cannnot see any context
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27
Problem of PM and WPM
PM and WPM only consider coverage of words
−→ They cannnot see any context
We show methods that consider contexts in next slides
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27
Edit Distance (ED)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
Edit Distance (ED)
Widely used metric
ED = max
(
1 −
edit-dist(M, C)
|Munigrams|
, 0
)
where edit-dist(M, C) is the number of word insertions, deletions,
and substitutions required to transform M into C
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
Edit Distance (ED)
Widely used metric
ED = max
(
1 −
edit-dist(M, C)
|Munigrams|
, 0
)
where edit-dist(M, C) is the number of word insertions, deletions,
and substitutions required to transform M into C
e.g.
M =John opened the door .
C =The dog opened the door .
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
Edit Distance (ED)
Widely used metric
ED = max
(
1 −
edit-dist(M, C)
|Munigrams|
, 0
)
where edit-dist(M, C) is the number of word insertions, deletions,
and substitutions required to transform M into C
e.g.
M =John opened the door .
C =The dog opened the door .
substitution: 1
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
Edit Distance (ED)
Widely used metric
ED = max
(
1 −
edit-dist(M, C)
|Munigrams|
, 0
)
where edit-dist(M, C) is the number of word insertions, deletions,
and substitutions required to transform M into C
e.g.
M =John opened the door .
C =The dog opened the door .
substitution: 1
insertion: 1
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
Edit Distance (ED)
Widely used metric
ED = max
(
1 −
edit-dist(M, C)
|Munigrams|
, 0
)
where edit-dist(M, C) is the number of word insertions, deletions,
and substitutions required to transform M into C
e.g.
M =John opened the door .
C =The dog opened the door .
substitution: 1
insertion: 1
ED(M, C) = 1 −
2
5
= 0.60
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
N-gram Precision (NGP)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27
N-gram Precision (NGP)
Mean of N-gram precision (like the BLEU metric)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27
N-gram Precision (NGP)
Mean of N-gram precision (like the BLEU metric)
However, BLEU → 0 when the precision of longer N-grams is 0
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27
N-gram Precision (NGP)
Mean of N-gram precision (like the BLEU metric)
However, BLEU → 0 when the precision of longer N-grams is 0
This work uses arithmetic mean instead of geometric mean
NGP =
1
N
N∑
n=1
pn
pn =
|Mn-grams ∩ Cn-grams|
Z ∗ |Mn-grams| + (1 − Z) ∗ |Cn-grams|
where Z is a parameter to control normalization,
and N is the maximum length of N-gram
N = 4 and Z = 0.75 in main experiments (discuss later)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27
Weighted N-gram Precision (WNGP)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 14 / 27
Weighted N-gram Precision (WNGP)
NGP with IDF weighting
WNGP =
N∑
n=1
1
N
wpn
wpn =
∑
i∈{Mn-grams∩Cn-grams}
w(i)
Z ∗


∑
i∈Mn-grams
w(i)

 + (1 − Z) ∗


∑
i∈Cn-grams
w(i)


w(i) =
∑
1-gram∈i
idf(1-gram, D)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 14 / 27
Modified Weighted N-gram Precision (MWNGP)
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 15 / 27
Modified Weighted N-gram Precision (MWNGP)
Shorter N-grams may help translators more than longer N-grams
WNGP =
N∑
n=1
1
N
wpn
MWNGP =
2N
2N − 1
N∑
n=1
1
2n
wpn
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 15 / 27
Experiment
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 16 / 27
Experiment
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
Experiment
Two different technicals domains with Two different language pairs
(Fr-En, Zn-En).
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
Experiment
Two different technicals domains with Two different language pairs
(Fr-En, Zn-En).
▶ Zn-En: OpenOffice3
▶ Fr-En: EMEA
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
Experiment
Two different technicals domains with Two different language pairs
(Fr-En, Zn-En).
▶ Zn-En: OpenOffice3
▶ Fr-En: EMEA
Preprocessing is performed on both source sides to produce valid
segment.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
Experiment
Two different technicals domains with Two different language pairs
(Fr-En, Zn-En).
▶ Zn-En: OpenOffice3
▶ Fr-En: EMEA
Preprocessing is performed on both source sides to produce valid
segment.
Some sentences are randomly sampled from corpus as M and C.
▶ Zn-En: 400 M and 10.000 C.
▶ Fr-En: 300 M and 10.000 C.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
Evaluation
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
Evaluation
Evaluation is performed with Human Evaluation using Amazon
Mechanical Turk.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
Evaluation
Evaluation is performed with Human Evaluation using Amazon
Mechanical Turk.
The Score is ranging from 1 to 5 (Not Helpful until Extremely
Helpful).
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
Evaluation
Evaluation is performed with Human Evaluation using Amazon
Mechanical Turk.
The Score is ranging from 1 to 5 (Not Helpful until Extremely
Helpful).
Each segment M is rated by 5 Turkers and we keep track which
metric performs best (ties is allowed).
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
Evaluation
Evaluation is performed with Human Evaluation using Amazon
Mechanical Turk.
The Score is ranging from 1 to 5 (Not Helpful until Extremely
Helpful).
Each segment M is rated by 5 Turkers and we keep track which
metric performs best (ties is allowed).
The scores of each M are averaged as Mean Opinion Score
(MOS).
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
Result and Analysis
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 19 / 27
Result: Which metric performs best?
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27
Result: Which metric performs best?
Table OO3 Zn-En
Metric Found Best Total C
PM 178 400
WPM 200 400
ED 193 400
NGP 251 400
WNGP 271 400
MWNGP 282 400
Table EMEA Fr-En
Metric Found Best Total C
PM 166 300
WPM 184 300
ED 148 300
NGP 188 300
WNGP 198 300
MWNGP 201 300
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27
Result: Which metric performs best?
Table OO3 Zn-En
Metric Found Best Total C
PM 178 400
WPM 200 400
ED 193 400
NGP 251 400
WNGP 271 400
MWNGP 282 400
Table EMEA Fr-En
Metric Found Best Total C
PM 166 300
WPM 184 300
ED 148 300
NGP 188 300
WNGP 198 300
MWNGP 201 300
Modified Weighted N-Gram Precision (MWNGP) achieved the
best result compared to any other metrics.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27
Result: Which metric performs best?
Table OO3 Zn-En
Metric Found Best Total C
PM 178 400
WPM 200 400
ED 193 400
NGP 251 400
WNGP 271 400
MWNGP 282 400
Table EMEA Fr-En
Metric Found Best Total C
PM 166 300
WPM 184 300
ED 148 300
NGP 188 300
WNGP 198 300
MWNGP 201 300
Modified Weighted N-Gram Precision (MWNGP) achieved the
best result compared to any other metrics.
There are slight different between WNGP and Modified-WNGP.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27
Scatterplot: OO3 Percent Match
1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
MOS
0.0
0.2
0.4
0.6
0.8
1.0
MetricValue
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 21 / 27
Scatterplot: OO3 Edit Distance
1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
MOS
0.0
0.2
0.4
0.6
0.8
1.0
MetricValue
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 22 / 27
Scatterplot: OO3 Modified N-Gram Precision
1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
MOS
0.0
0.2
0.4
0.6
0.8
1.0
MetricValue
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 23 / 27
The effect of Z: Adjusting for length preferences
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
The effect of Z: Adjusting for length preferences
Many of the metrics are using Z as parameters.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
The effect of Z: Adjusting for length preferences
Many of the metrics are using Z as parameters.
Z parameter can be used to control for length preferences.
Table EMEA Fr-En
Z Value Avg Length
0.00 9.9298
0.25 13.204
0.50 16.0134
0.75 19.6355
1.00 27.8829
Table OO3 Zn-En
Z Value Avg Length
0.00 7.2475
0.25 9.5600
0.50 11.1250
0.75 14.1825
1.00 25.0875
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
The effect of Z: Adjusting for length preferences
Many of the metrics are using Z as parameters.
Z parameter can be used to control for length preferences.
Table EMEA Fr-En
Z Value Avg Length
0.00 9.9298
0.25 13.204
0.50 16.0134
0.75 19.6355
1.00 27.8829
Table OO3 Zn-En
Z Value Avg Length
0.00 7.2475
0.25 9.5600
0.50 11.1250
0.75 14.1825
1.00 25.0875
Smaller Z prefered shorter match that are more precise and
increased precision.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
The effect of Z: Adjusting for length preferences
Many of the metrics are using Z as parameters.
Z parameter can be used to control for length preferences.
Table EMEA Fr-En
Z Value Avg Length
0.00 9.9298
0.25 13.204
0.50 16.0134
0.75 19.6355
1.00 27.8829
Table OO3 Zn-En
Z Value Avg Length
0.00 7.2475
0.25 9.5600
0.50 11.1250
0.75 14.1825
1.00 25.0875
Smaller Z prefered shorter match that are more precise and
increased precision.
Larger Z prefers longer match that contains many correct
translations and increased recall.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
Conclusion
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 25 / 27
Conclusion
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
Conclusion
▶ This paper compares TM similarity metrics.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
Conclusion
▶ This paper compares TM similarity metrics.
▶ The best method is Modified Weighted N-Gram Precision.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
Conclusion
▶ This paper compares TM similarity metrics.
▶ The best method is Modified Weighted N-Gram Precision.
▶ All the discussed metrics only consider source sides in the
calculation.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
Conclusion
▶ This paper compares TM similarity metrics.
▶ The best method is Modified Weighted N-Gram Precision.
▶ All the discussed metrics only consider source sides in the
calculation.
▶ Z parameter is used to adjust the length preferences of the
retrieved TM.
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
Thank you for your attention!
2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 27 / 27

More Related Content

Recently uploaded

Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 

Recently uploaded (20)

Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 

Featured (20)

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 

Presentation

  • 1. Translation Memory Retrieval Methods [Bloodgood and Strauss, 2014] in Proc of 14th EACL Koichi Akabe and Philip Arthur NAIST MT Study 2014-07-03 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 1 / 27
  • 2. Introduction 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 2 / 27
  • 3. Translation Memory (TM) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
  • 4. Translation Memory (TM) ▶ Most widely used computer-assisted translation (CAT) tool ▶ Suggest translations using other translations 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
  • 5. Translation Memory (TM) ▶ Most widely used computer-assisted translation (CAT) tool ▶ Suggest translations using other translations En The dog opened the door. Ja 犬がドアを開けた。 En I saw a girl with a telescope. Ja 僕は望遠鏡で少女を見た。 En John opened the door. Ja 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
  • 6. Translation Memory (TM) ▶ Most widely used computer-assisted translation (CAT) tool ▶ Suggest translations using other translations En The dog opened the door. Ja 犬がドアを開けた。 En I saw a girl with a telescope. Ja 僕は望遠鏡で少女を見た。 En John opened the door. Ja 1. Find the nearest source sentence 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
  • 7. Translation Memory (TM) ▶ Most widely used computer-assisted translation (CAT) tool ▶ Suggest translations using other translations En The dog opened the door. Ja 犬がドアを開けた。 En I saw a girl with a telescope. Ja 僕は望遠鏡で少女を見た。 En John opened the door. Ja 犬がドアを開けた。 (fuzzy) 1. Find the nearest source sentence 2. Suggest a translation 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
  • 8. Translation Memory (TM) ▶ Most widely used computer-assisted translation (CAT) tool ▶ Suggest translations using other translations En The dog opened the door. Ja 犬がドアを開けた。 En I saw a girl with a telescope. Ja 僕は望遠鏡で少女を見た。 En John opened the door. Ja 犬がドアを開けた。 (fuzzy) 1. Find the nearest source sentence 2. Suggest a translation 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
  • 9. Translation Memory (TM) ▶ Most widely used computer-assisted translation (CAT) tool ▶ Suggest translations using other translations En The dog opened the door. Ja 犬がドアを開けた。 En I saw a girl with a telescope. Ja 僕は望遠鏡で少女を見た。 En John opened the door. Ja ジョンがドアを開けた。 1. Find the nearest source sentence 2. Suggest a translation 3. Post-editing 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27
  • 10. How to find the nearest source sentence? TM finds the nearest source sentence using similarity metrics 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27
  • 11. How to find the nearest source sentence? TM finds the nearest source sentence using similarity metrics ▶ Edit distance (Leven-shtein distance) −→ Widely used metric 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27
  • 12. How to find the nearest source sentence? TM finds the nearest source sentence using similarity metrics ▶ Edit distance (Leven-shtein distance) −→ Widely used metric ▶ MT evaluation metrics [Simard and Fujita, 2012] −→ WER, BLEU, NIST, VMeteor, Meteor as TM metrics 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27
  • 13. How to find the nearest source sentence? TM finds the nearest source sentence using similarity metrics ▶ Edit distance (Leven-shtein distance) −→ Widely used metric ▶ MT evaluation metrics [Simard and Fujita, 2012] −→ WER, BLEU, NIST, VMeteor, Meteor as TM metrics ▶ This paper 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27
  • 14. Threshold of helpfulness Matching algorithm always returns the nearest sentence However, low score suggestions should not be shown 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27
  • 15. Threshold of helpfulness Matching algorithm always returns the nearest sentence However, low score suggestions should not be shown TM softwares set the threshold at 70% in practice 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27
  • 16. Threshold of helpfulness Matching algorithm always returns the nearest sentence However, low score suggestions should not be shown TM softwares set the threshold at 70% in practice −→ Why? 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27
  • 17. Translation Memory Similarity Metrics 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 6 / 27
  • 18. Definitions TM Similarity Metrics compare M and C. M: workload sentence C: source language side of a candidate pre-existing translation 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 7 / 27
  • 19. Definitions TM Similarity Metrics compare M and C. M: workload sentence C: source language side of a candidate pre-existing translation En The dog opened the door . Ja 犬がドアを開けた。 En I saw a girl with a telescope . Ja 僕は望遠鏡で少女を見た。 En John opened the door . Ja 犬がドアを開けた。 (fuzzy) M =John opened the door . C1 =The dog opened the door . C2 =I saw a girl with a telescope . ... 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 7 / 27
  • 20. Translation Memory Similarity Metrics Compare the following metrics: ▶ Percent Match ▶ Weighted Percent Match ▶ Edit Distance ▶ N-gram Precision ▶ Weighted N-gram Precision ▶ Modified Weighted N-gram Precision 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 8 / 27
  • 21. Percent Match (PM) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27
  • 22. Percent Match (PM) The simplest metric PM(M, C) = |Munigrams ∩ Cunigrams| |Munigrams| 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27
  • 23. Percent Match (PM) The simplest metric PM(M, C) = |Munigrams ∩ Cunigrams| |Munigrams| e.g. M =John opened the door . C =The dog opened the door . 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27
  • 24. Percent Match (PM) The simplest metric PM(M, C) = |Munigrams ∩ Cunigrams| |Munigrams| e.g. M =John opened the door . C =The dog opened the door . PM(M, C) = 4 5 = 0.80 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27
  • 25. Weighted Percent Match (WPM) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27
  • 26. Weighted Percent Match (WPM) We want to know translation of rare words 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27
  • 27. Weighted Percent Match (WPM) We want to know translation of rare words PM with IDF weighting WPM(M, C) = ∑ u∈{Munigrams∩Cunigrams} idf(u, D) ∑ u∈Munigrams idf(u, D) where D is a set of all source sentences in the parallel corpus 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27
  • 28. Problem of PM and WPM PM and WPM only consider coverage of words 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27
  • 29. Problem of PM and WPM PM and WPM only consider coverage of words −→ They cannnot see any context 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27
  • 30. Problem of PM and WPM PM and WPM only consider coverage of words −→ They cannnot see any context We show methods that consider contexts in next slides 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27
  • 31. Edit Distance (ED) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
  • 32. Edit Distance (ED) Widely used metric ED = max ( 1 − edit-dist(M, C) |Munigrams| , 0 ) where edit-dist(M, C) is the number of word insertions, deletions, and substitutions required to transform M into C 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
  • 33. Edit Distance (ED) Widely used metric ED = max ( 1 − edit-dist(M, C) |Munigrams| , 0 ) where edit-dist(M, C) is the number of word insertions, deletions, and substitutions required to transform M into C e.g. M =John opened the door . C =The dog opened the door . 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
  • 34. Edit Distance (ED) Widely used metric ED = max ( 1 − edit-dist(M, C) |Munigrams| , 0 ) where edit-dist(M, C) is the number of word insertions, deletions, and substitutions required to transform M into C e.g. M =John opened the door . C =The dog opened the door . substitution: 1 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
  • 35. Edit Distance (ED) Widely used metric ED = max ( 1 − edit-dist(M, C) |Munigrams| , 0 ) where edit-dist(M, C) is the number of word insertions, deletions, and substitutions required to transform M into C e.g. M =John opened the door . C =The dog opened the door . substitution: 1 insertion: 1 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
  • 36. Edit Distance (ED) Widely used metric ED = max ( 1 − edit-dist(M, C) |Munigrams| , 0 ) where edit-dist(M, C) is the number of word insertions, deletions, and substitutions required to transform M into C e.g. M =John opened the door . C =The dog opened the door . substitution: 1 insertion: 1 ED(M, C) = 1 − 2 5 = 0.60 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27
  • 37. N-gram Precision (NGP) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27
  • 38. N-gram Precision (NGP) Mean of N-gram precision (like the BLEU metric) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27
  • 39. N-gram Precision (NGP) Mean of N-gram precision (like the BLEU metric) However, BLEU → 0 when the precision of longer N-grams is 0 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27
  • 40. N-gram Precision (NGP) Mean of N-gram precision (like the BLEU metric) However, BLEU → 0 when the precision of longer N-grams is 0 This work uses arithmetic mean instead of geometric mean NGP = 1 N N∑ n=1 pn pn = |Mn-grams ∩ Cn-grams| Z ∗ |Mn-grams| + (1 − Z) ∗ |Cn-grams| where Z is a parameter to control normalization, and N is the maximum length of N-gram N = 4 and Z = 0.75 in main experiments (discuss later) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27
  • 41. Weighted N-gram Precision (WNGP) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 14 / 27
  • 42. Weighted N-gram Precision (WNGP) NGP with IDF weighting WNGP = N∑ n=1 1 N wpn wpn = ∑ i∈{Mn-grams∩Cn-grams} w(i) Z ∗   ∑ i∈Mn-grams w(i)   + (1 − Z) ∗   ∑ i∈Cn-grams w(i)   w(i) = ∑ 1-gram∈i idf(1-gram, D) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 14 / 27
  • 43. Modified Weighted N-gram Precision (MWNGP) 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 15 / 27
  • 44. Modified Weighted N-gram Precision (MWNGP) Shorter N-grams may help translators more than longer N-grams WNGP = N∑ n=1 1 N wpn MWNGP = 2N 2N − 1 N∑ n=1 1 2n wpn 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 15 / 27
  • 45. Experiment 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 16 / 27
  • 46. Experiment 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
  • 47. Experiment Two different technicals domains with Two different language pairs (Fr-En, Zn-En). 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
  • 48. Experiment Two different technicals domains with Two different language pairs (Fr-En, Zn-En). ▶ Zn-En: OpenOffice3 ▶ Fr-En: EMEA 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
  • 49. Experiment Two different technicals domains with Two different language pairs (Fr-En, Zn-En). ▶ Zn-En: OpenOffice3 ▶ Fr-En: EMEA Preprocessing is performed on both source sides to produce valid segment. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
  • 50. Experiment Two different technicals domains with Two different language pairs (Fr-En, Zn-En). ▶ Zn-En: OpenOffice3 ▶ Fr-En: EMEA Preprocessing is performed on both source sides to produce valid segment. Some sentences are randomly sampled from corpus as M and C. ▶ Zn-En: 400 M and 10.000 C. ▶ Fr-En: 300 M and 10.000 C. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27
  • 51. Evaluation 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
  • 52. Evaluation Evaluation is performed with Human Evaluation using Amazon Mechanical Turk. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
  • 53. Evaluation Evaluation is performed with Human Evaluation using Amazon Mechanical Turk. The Score is ranging from 1 to 5 (Not Helpful until Extremely Helpful). 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
  • 54. Evaluation Evaluation is performed with Human Evaluation using Amazon Mechanical Turk. The Score is ranging from 1 to 5 (Not Helpful until Extremely Helpful). Each segment M is rated by 5 Turkers and we keep track which metric performs best (ties is allowed). 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
  • 55. Evaluation Evaluation is performed with Human Evaluation using Amazon Mechanical Turk. The Score is ranging from 1 to 5 (Not Helpful until Extremely Helpful). Each segment M is rated by 5 Turkers and we keep track which metric performs best (ties is allowed). The scores of each M are averaged as Mean Opinion Score (MOS). 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27
  • 56. Result and Analysis 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 19 / 27
  • 57. Result: Which metric performs best? 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27
  • 58. Result: Which metric performs best? Table OO3 Zn-En Metric Found Best Total C PM 178 400 WPM 200 400 ED 193 400 NGP 251 400 WNGP 271 400 MWNGP 282 400 Table EMEA Fr-En Metric Found Best Total C PM 166 300 WPM 184 300 ED 148 300 NGP 188 300 WNGP 198 300 MWNGP 201 300 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27
  • 59. Result: Which metric performs best? Table OO3 Zn-En Metric Found Best Total C PM 178 400 WPM 200 400 ED 193 400 NGP 251 400 WNGP 271 400 MWNGP 282 400 Table EMEA Fr-En Metric Found Best Total C PM 166 300 WPM 184 300 ED 148 300 NGP 188 300 WNGP 198 300 MWNGP 201 300 Modified Weighted N-Gram Precision (MWNGP) achieved the best result compared to any other metrics. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27
  • 60. Result: Which metric performs best? Table OO3 Zn-En Metric Found Best Total C PM 178 400 WPM 200 400 ED 193 400 NGP 251 400 WNGP 271 400 MWNGP 282 400 Table EMEA Fr-En Metric Found Best Total C PM 166 300 WPM 184 300 ED 148 300 NGP 188 300 WNGP 198 300 MWNGP 201 300 Modified Weighted N-Gram Precision (MWNGP) achieved the best result compared to any other metrics. There are slight different between WNGP and Modified-WNGP. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27
  • 61. Scatterplot: OO3 Percent Match 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 MOS 0.0 0.2 0.4 0.6 0.8 1.0 MetricValue 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 21 / 27
  • 62. Scatterplot: OO3 Edit Distance 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 MOS 0.0 0.2 0.4 0.6 0.8 1.0 MetricValue 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 22 / 27
  • 63. Scatterplot: OO3 Modified N-Gram Precision 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 MOS 0.0 0.2 0.4 0.6 0.8 1.0 MetricValue 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 23 / 27
  • 64. The effect of Z: Adjusting for length preferences 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
  • 65. The effect of Z: Adjusting for length preferences Many of the metrics are using Z as parameters. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
  • 66. The effect of Z: Adjusting for length preferences Many of the metrics are using Z as parameters. Z parameter can be used to control for length preferences. Table EMEA Fr-En Z Value Avg Length 0.00 9.9298 0.25 13.204 0.50 16.0134 0.75 19.6355 1.00 27.8829 Table OO3 Zn-En Z Value Avg Length 0.00 7.2475 0.25 9.5600 0.50 11.1250 0.75 14.1825 1.00 25.0875 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
  • 67. The effect of Z: Adjusting for length preferences Many of the metrics are using Z as parameters. Z parameter can be used to control for length preferences. Table EMEA Fr-En Z Value Avg Length 0.00 9.9298 0.25 13.204 0.50 16.0134 0.75 19.6355 1.00 27.8829 Table OO3 Zn-En Z Value Avg Length 0.00 7.2475 0.25 9.5600 0.50 11.1250 0.75 14.1825 1.00 25.0875 Smaller Z prefered shorter match that are more precise and increased precision. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
  • 68. The effect of Z: Adjusting for length preferences Many of the metrics are using Z as parameters. Z parameter can be used to control for length preferences. Table EMEA Fr-En Z Value Avg Length 0.00 9.9298 0.25 13.204 0.50 16.0134 0.75 19.6355 1.00 27.8829 Table OO3 Zn-En Z Value Avg Length 0.00 7.2475 0.25 9.5600 0.50 11.1250 0.75 14.1825 1.00 25.0875 Smaller Z prefered shorter match that are more precise and increased precision. Larger Z prefers longer match that contains many correct translations and increased recall. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27
  • 69. Conclusion 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 25 / 27
  • 70. Conclusion 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
  • 71. Conclusion ▶ This paper compares TM similarity metrics. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
  • 72. Conclusion ▶ This paper compares TM similarity metrics. ▶ The best method is Modified Weighted N-Gram Precision. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
  • 73. Conclusion ▶ This paper compares TM similarity metrics. ▶ The best method is Modified Weighted N-Gram Precision. ▶ All the discussed metrics only consider source sides in the calculation. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
  • 74. Conclusion ▶ This paper compares TM similarity metrics. ▶ The best method is Modified Weighted N-Gram Precision. ▶ All the discussed metrics only consider source sides in the calculation. ▶ Z parameter is used to adjust the length preferences of the retrieved TM. 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27
  • 75. Thank you for your attention! 2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 27 / 27