Carolina Scarton - ESR 7 - USFD

Machine Translation
Quality Estimation with
QuEst++
Carolina Scarton – c.scarton@sheffield.ac.uk
University of Sheffield

Quality Estimation of Machine Translation

Predict quality of unseen data



Only few labelled data points for training




Only uses information from source and target → no references!




Only uses information from source and target → no references!

Reduce post-editing and revision time → minimize costs!

Spotting errors

Estimate post-editing effort and time

QuEst++

Framework for QE

Word-level, sentence-level and document-level

QuEst++

Framework for QE


Feature extraction module →features from source and target

plus information from MT system

QuEst++

Framework for QE


Feature extraction module →features from source and target

plus information from MT system

Machine learning module → use the features for building a QE
model

QE model → can predict the quality of unseen data

Target documents
(training)
Source documents
(training)
Training scenario – annotated data
Example: post-editing effort (1 to 5) – training data (with annotation)
Source MT Score
Barack Obama becomes the fourth American
president to receive the Nobel Peace Prize
Barack Obama se convierte en el cuarto
presidente estadounidense para recibir el
Premio Nobel de la Paz
4.5
The presidential couple then has a meeting
scheduled with King Harald V and Queen
Sonja of Norway.
La pareja presidencial entonces tiene una
reunión programada con el Rey Harald V y
Reina Sonja de Noruega.
4.0
Although, at the cost of the state falling
deeper into debt – next year the treasury
won't just be 163 billion short, but even more.
Aunque, a costa del estado cayendo más –
el año que viene el Tesoro no sólo se 163
millones de corto, pero aún más.
2.0
Transformer worth tens of millions of crowns
burns in Louny region
La pena de transformadores decenas de
millones de coronas Louny quemaduras en
la región
1.5

Target documents
(training)
Source documents
(training)
MT system
External resources/tools:
- SRILM
- GIZA++ tables
- TreeTagger
- StanfordParser
Information from the MT
system that translated the
documents (if available)
Feature extractor

Black-box:
- number of tokens
- number of punctuation
- LM perplexity
- n-gram counts
- POS counts
- syntactic tree
- lexical cohesion
Glass-box:
- n-best list information
Target documents
(training)
Feature extractor
Source documents
(training)
MT system
Features for QE

Quality labels

Post-editing effort

Post-editing time

HTER

BLEU

...
QE model training
ML algorithms:
- SVC
- SVR
- CRF
Target documents
(training)
Feature extractor
Source documents
(training)
MT system
Features for QE

QE model
Quality labels

Post-editing effort

Post-editing time

HTER

BLEU

...
QE model training
Target documents
(training)
Feature extractor
Source documents
(training)
MT system
Features for QE

Target documents
(unseen data)
Source documents
(unseen data)
Predicting labels for unseen data
Source MT Score
Mass Slaughter on a Personal Level El sacrificio masivo a nivel personal
?
People begin to ask why their leaders are
making them fight.
La gente empiece a preguntar por qué sus
líderes están haciendo ellos lucha. ?
As the community affairs officers moved into
the park in their light-blue windbreakers,
many protesters simply gathered their
belongings and left.
Asuntos como la Comunidad oficiales
movido en el parque en su luz-azul
windbreakers, muchos manifestantes
simplemente se reunieron sus pertenencias
y de la izquierda.
?
Some stories are about honor and bravery. Algunas historias son de honor y valentía.
?
Example: post-editing effort (1 to 5) – unseen data

Target documents
(unseen data)
Feature extractor
Source documents
(unseen data)
MT system

Features are the
same as the ones
extracted at training
time
Target documents
(unseen data)
Feature extractor
Source documents
(unseen data)
MT system
Features for QE

QE model
Can predict labels for
the new data → ML
magic!
Target documents
(unseen data)
Feature extractor
Source documents
(unseen data)
MT system
Features for QE

Source MT Score
Mass Slaughter on a Personal Level El sacrificio masivo a nivel personal
4.5
People begin to ask why their leaders are
making them fight.
La gente empiece a preguntar por qué sus
líderes están haciendo ellos lucha. 3.0
As the community affairs officers moved into
the park in their light-blue windbreakers,
many protesters simply gathered their
belongings and left.
Asuntos como la Comunidad oficiales
movido en el parque en su luz-azul
windbreakers, muchos manifestantes
simplemente se reunieron sus pertenencias
y de la izquierda.
1.8
Some stories are about honor and bravery. Algunas historias son de honor y valentía.
4.5
Example: post-editing effort (1 to 5) – unseen data → predictions!
Predicted
scores

Thank you!
QuEst++ download: http://www.quest.dcs.shef.ac.uk/

Carolina Scarton - ESR 7 - USFD

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (16)

Similar to Carolina Scarton - ESR 7 - USFD

Similar to Carolina Scarton - ESR 7 - USFD (20)

More from RIILP

More from RIILP (15)

Recently uploaded

Recently uploaded (20)

Carolina Scarton - ESR 7 - USFD