Applicative evaluation of bilingual
terminologies
Estelle Delpech
NODALIDA
12th May 2011

Outline
1. Context and scope of work
2. Comparable corpora and terminology
evaluation
3. Applicative evaluation protocol
4. Experimentation and results
5. Future improvements

Context of the work
• Bilingual terminology mining from
comparable corpora
• Application to:
– computer-aided translation
– computer-aided terminology

Scope of the work
• Find a way to show the "added-value" of
the acquired terminology when used for
technical translation
– do translators translate better and/or faster?

• Design and testing of an
"applicative" evaluation protocol for
bilingual terminologies

Comparable corpora
English texts on breast cancer:
– "It has been suggested that breast magnetic resonance imaging (MRI) is more accurate in the diagnosis of breast cancer..."
– "Histological evaluation revealed the presence of ductal carcinoma in situ."

French texts on breast cancer:
– "L'imagerie par résonance magnétique avec injection de gadolinium (IRM) est une technique indépendante de la densité mammaire..."
– "Un diagnostic histologique est nécessaire..."

Advantages of comparable
corpora
• More available
– new domains
– unprecedented language pairs

• Quality
– spontaneous language
– not influenced by source texts

Reference evaluation of
bilingual terminologies
• Reference evaluation:
– output of the program is compared with a list
of reference translations

• Precision:
– percentage of output translations which are in
the reference

$\text{precision} = \dfrac{|\text{output} \cap \text{reference}|}{|\text{output}|}$
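The precision formula above can be sketched in a few lines of Python. This is only an illustration: the function name `precision` and the toy term pairs are assumptions, not data from the evaluation.

```python
def precision(output, reference):
    """Share of output translation pairs that also appear in the reference."""
    output, reference = set(output), set(reference)
    if not output:
        return 0.0
    return len(output & reference) / len(output)


# Hypothetical (source term, translation) pairs, for illustration only.
output_pairs = {("histological", "histologique"), ("histological", "diagnostic")}
reference_pairs = {("histological", "histologique")}

print(precision(output_pairs, reference_pairs))  # 0.5
```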

Reference evaluation with
comparable corpora
• Output:
– source term → ordered list of candidate
translations

• Example:
– histological → diagnostic (rank 1), histologie (rank 2),
histologique (rank 3), …, nécessaire (rank n)

Reference evaluation with
comparable corpora
• Precision:
– percentage of source terms with a reference
translation among the Top 10 or Top 20
candidate translations
• State of the art:
– between 42% and 80% on Top 20,
depending on corpus size, corpus type and
nature of the translated elements [Morin and
Daille, 2009]
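A minimal sketch of Top-N precision over ranked candidate lists, under the assumption that a source term counts as correctly translated when at least one reference translation appears among its first N candidates. The function name `top_n_precision` and the toy data are hypothetical.

```python
def top_n_precision(candidates, reference, n=20):
    """Fraction of source terms with a reference translation in the Top N.

    candidates: dict mapping a source term to its ranked candidate list
    reference:  dict mapping a source term to its set of reference translations
    """
    # Only source terms that have a reference entry can be evaluated.
    evaluated = [term for term in candidates if term in reference]
    if not evaluated:
        return 0.0
    hits = sum(
        1 for term in evaluated
        if set(candidates[term][:n]) & set(reference[term])
    )
    return hits / len(evaluated)


# Toy data for illustration only (not taken from the experiments).
candidates = {
    "histological": ["diagnostic", "histologie", "histologique", "nécessaire"],
    "breast": ["cancer", "sein"],
}
reference = {"histological": {"histologique"}, "breast": {"sein"}}

print(top_n_precision(candidates, reference, n=1))  # 0.0
print(top_n_precision(candidates, reference, n=3))  # 1.0
```

Raising N can only keep or increase the score, which is why results are reported for a fixed cut-off such as Top 10 or Top 20.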

Reference vs. Applicative
evaluation
• Reference evaluation:
– ok for testing/developing the alignment
program
– fast, cheap, reproducible, objective

• Applicative evaluation:
– how much does the alignment program help
the end users?
– can the terminologies improve translation
quality?

Applicative evaluation scenario
[Figure: applicative evaluation scenario]

Questions raised
1) How do you assess translation quality?
2) Evaluate the translations as a whole or
technical terms only?

1) How do you assess translation
quality?
• Translation studies evaluation grids:
– SICAL, SAE J2450
– too complex, scarcely documented

• Machine translation objective metrics:
– BLEU, METEOR
– not suited to human translation
– reproducibility is not an advantage in our case

• Machine translation subjective
evaluation:
– translations evaluated by humans:
  • quality judgement: adequacy, fluency...
  • ranking
– use an inter-annotator agreement measure to check
that agreement between judges is sufficient
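One common agreement measure for two judges is Cohen's kappa. The sketch below is an assumption about how such a check could be done, not necessarily the measure used in this work; the function name, the three labels and the judgements are made up for illustration.

```python
from collections import Counter


def cohen_kappa(judge_a, judge_b):
    """Cohen's kappa for two judges labelling the same items."""
    if len(judge_a) != len(judge_b) or not judge_a:
        raise ValueError("both judges must label the same non-empty item list")
    n = len(judge_a)
    # Observed agreement: share of items given the same label by both judges.
    observed = sum(a == b for a, b in zip(judge_a, judge_b)) / n
    # Chance agreement: derived from each judge's own label distribution.
    count_a, count_b = Counter(judge_a), Counter(judge_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both judges used one identical label throughout
    return (observed - expected) / (1 - expected)


# Made-up judgements, for illustration only.
judge_a = ["correct", "correct", "wrong", "acceptable", "wrong", "correct"]
judge_b = ["correct", "acceptable", "wrong", "acceptable", "correct", "correct"]

print(round(cohen_kappa(judge_a, judge_b), 2))  # 0.48
```

Kappa corrects raw agreement for the agreement expected by chance, so two judges who agree on 4 items out of 6 score well below 0.67 here.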

2) Evaluate the whole text or
just some terms?
• Quality of a text translation = complex
interaction of several parameters
• Focus on those elements for which the
translator felt they needed a linguistic
resource:
– evaluates only the part of the translation on
which the terminology has an impact
– easier and faster

Applicative evaluation protocol
• Compare 3 different "situations of
translation"
– one situation = one type of resource

• Translators translate the texts and note down
the terms they had to look up
• The quality of the terms' translations is
assessed by human judges

Situations of translation
– Situation 0 (baseline): general language resources
– Situation 1: general language resources + terminology mined from comparable corpora
– Situation 2: general language resources + Web

Translations' assessment
1. Quality judgement:
– correct: standard term or expression
– acceptable: meaning is retained
– wrong: no meaning is retained

2. Ranking:
– from best to worst
– ties allowed
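Since ties are allowed, a ranking can be reduced to pairwise outcomes between two situations. The sketch below assumes each judged term comes with one rank per situation (1 = best, equal ranks = tie); the function name, the situation labels and the data are illustrative assumptions, not the protocol's actual implementation.

```python
from collections import Counter


def pairwise_outcomes(rankings, first, second):
    """Share of terms where `first` is ranked better than, tied with or worse than `second`."""
    counts = Counter()
    for ranks in rankings:
        if ranks[first] < ranks[second]:
            counts["better"] += 1
        elif ranks[first] == ranks[second]:
            counts["tie"] += 1
        else:
            counts["worse"] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {key: counts[key] / total for key in ("better", "tie", "worse")}


# Made-up ranks for four terms (1 = best); the labels are hypothetical.
rankings = [
    {"CC": 1, "GEN": 2, "WEB": 1},
    {"CC": 2, "GEN": 1, "WEB": 2},
    {"CC": 1, "GEN": 1, "WEB": 2},
    {"CC": 3, "GEN": 2, "WEB": 1},
]

print(pairwise_outcomes(rankings, "CC", "GEN"))
```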

Data
• Comparable corpora:
– breast cancer: 400k words/language
– water science: 2M words/language

• Texts to translate:
– research paper abstracts: ~500 words/domain
– lay science texts: ~500 words/domain

Translators' feedback
"Overall, 75% of technical words aren't in the
glossary, and for the other 25%, 99% have between
10 and 20 candidate translations and none has
been validated. So most of the time, you are just
partly sure, but you are never totally sure of your
translation. And in the worst cases, you translate
instinctively."

→ Translators were not prepared to use a bilingual
terminology with many candidate translations

→ The terminology only partially covered the
vocabulary of the texts to translate

Terminology coverage of texts to
translate
• Breast Cancer
– 94% of the vocabulary of the texts is in the
terminology
– fine-grained topic

• Water Science
– 14% of the vocabulary of the texts is in the
terminology
– topic is too general

Quality judgement / Breast Cancer
• equivalent proportion
of incorrect
translations
• The Internet gives the
most correct
translations, followed by the
comparable corpora

BREAST CANCER (K = 0.25)

              Sit. 0 / Gen. lang.   Sit. 1 / CC   Sit. 2 / Web
wrong         20%                   19%           18%
acceptable    42%                   38%           35%
correct       38%                   43%           47%

Quality judgement / Water Science
• Translations are
much better with
the Internet
• Comparable corpora
produce worse
translations than the
general resources

WATER SCIENCE (K = 0.42)

              Sit. 0 / Gen. lang.   Sit. 1 / CC   Sit. 2 / Web
wrong         18%                   21%           7%
acceptable    23%                   23%           16%
correct       59%                   56%           77%

Results seem incoherent
• Translations produced in situation 1 are
worse than translations produced
in situation 0 (baseline)
• But both situations share the same "general
language resource" basis:
– Baseline: general language resources
– Situation 1: general language resources + terminology mined from comparable corpora

Possible explanation

                             BASELINE   SITUATION 1            SITUATION 2
                                        (Comparable corpora)   (Web)
General language resource    43%        14%                    3%
Specialized resource         -          25%                    56%
Intuition                    79%        77%                    44%

• When translators have a specialized
resource, they tend to ignore the general
language resource
• If translators in situation 1 had always looked
up the general resource first, the translations of
situation 1 would have been at least as good
as the translations of situation 0

Ranking / Breast Cancer
BREAST CANCER (K = 0.69)
[Bar chart of ranking comparisons, CC vs. GEN. LANG. and CC vs. WEB; bar labels: 42%, 47%, 32%, 28%, 26%, 26%]

Ranking / Water science
WATER SCIENCE (K = 0.63)
[Bar chart of ranking comparisons, CC vs. GEN. LANG. and CC vs. WEB; bar labels: 49%, 41%, 43%, 33%, 18%, 16%]

Improvement 1: terminology
coverage
• dependency between:
– added-value of the bilingual terminology
– its coverage of the texts to translate

• any added-value measure should also
indicate to what extent the terminology
contains the vocabulary of the translated
texts

Improvement 1: terminology
coverage
• Perspectives:
– create a "coverage" measure
– find out the minimum coverage a
terminology needs to be "useful" for translating a given
text
– gather smaller but finer-grained corpora
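A "coverage" measure could, for instance, be the share of a text's vocabulary found in the terminology. The sketch below is one possible definition, not the one adopted in this work: the tokenisation (lowercased word types), the function name `coverage` and the sample data are all assumptions, and multi-word terms are ignored for simplicity.

```python
import re


def coverage(text, terminology):
    """Share of distinct word types in `text` that are entries of `terminology`."""
    # Lowercased alphabetic word types; digits and punctuation are dropped.
    vocabulary = set(re.findall(r"[^\W\d_]+", text.lower()))
    if not vocabulary:
        return 0.0
    entries = {entry.lower() for entry in terminology}
    return len(vocabulary & entries) / len(vocabulary)


# Toy example with hypothetical terminology entries.
text = "Histological evaluation revealed the presence of carcinoma"
terminology = {"histological", "evaluation", "carcinoma", "presence"}

print(coverage(text, terminology))
```

Here 4 of the 7 word types are covered. Restricting the vocabulary to technical words, or filtering out function words such as "the" and "of", would be natural refinements.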

Improvement 2: situations of
translations
• When translators have several resources
at their disposal, they tend to ignore the
general language resource
• Consequence: the same resource is used
differently depending on the situation
• This seems to be the cause of the incoherent
results

Improvement 2: situations of
translations
• Perspective: use 0 or 1 resource per
situation of translation
– Situation 0
– Situation 1: terminology mined from comparable corpora
– Situation 2: Web

Improvement 3: train translators
• Prepare translators to use "ambiguous",
unvalidated terminologies
• Run a first dry-run evaluation to:
– train the translators
– train the judges → results in higher
agreement

Acknowledgements
This work was funded by:
– French National Research Agency, grant
no. ANR-08-CORD-009
– Lingua et Machina, www.lingua-et-machina.com

Annotators:
– Clémence De Baudus
– Mathieu Delage
47

More Related Content

Viewers also liked

Bilingual Terminology Extraction based on Translation Patterns
Bilingual Terminology Extraction based on Translation PatternsBilingual Terminology Extraction based on Translation Patterns
Bilingual Terminology Extraction based on Translation Patterns
Alberto Simões
 
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
Association for Computational Linguistics
 
Chelo Vargas-Sierra
Chelo Vargas-SierraChelo Vargas-Sierra
Chelo Vargas-Sierra
Chelo Vargas
 
Enriching Transliteration Lexicon Using Automatic Transliteration Extraction
Enriching Transliteration Lexicon Using Automatic Transliteration ExtractionEnriching Transliteration Lexicon Using Automatic Transliteration Extraction
Enriching Transliteration Lexicon Using Automatic Transliteration Extraction
Sarvnaz Karimi
 
Michael Bloodgood - 2017 - Acquisition of Translation Lexicons for Historical...
Michael Bloodgood - 2017 - Acquisition of Translation Lexicons for Historical...Michael Bloodgood - 2017 - Acquisition of Translation Lexicons for Historical...
Michael Bloodgood - 2017 - Acquisition of Translation Lexicons for Historical...
Association for Computational Linguistics
 
Dealing with Lexicon Acquired from Comparable Corpora: post-edition and exchange
Dealing with Lexicon Acquired from Comparable Corpora: post-edition and exchangeDealing with Lexicon Acquired from Comparable Corpora: post-edition and exchange
Dealing with Lexicon Acquired from Comparable Corpora: post-edition and exchange
Estelle Delpech
 
Identification of Fertile Translations in Comparable Corpora: a Morpho-Compos...
Identification of Fertile Translations in Comparable Corpora: a Morpho-Compos...Identification of Fertile Translations in Comparable Corpora: a Morpho-Compos...
Identification of Fertile Translations in Comparable Corpora: a Morpho-Compos...
Estelle Delpech
 
Meng Zhang - 2017 - Adversarial Training for Unsupervised Bilingual Lexicon I...
Meng Zhang - 2017 - Adversarial Training for Unsupervised Bilingual Lexicon I...Meng Zhang - 2017 - Adversarial Training for Unsupervised Bilingual Lexicon I...
Meng Zhang - 2017 - Adversarial Training for Unsupervised Bilingual Lexicon I...
Association for Computational Linguistics
 
Cross-lingual ontology lexicalisation, translation and information extraction...
Cross-lingual ontology lexicalisation, translation and information extraction...Cross-lingual ontology lexicalisation, translation and information extraction...
Cross-lingual ontology lexicalisation, translation and information extraction...
Tobias Wunner
 
Bilingual terminology mining
Bilingual terminology miningBilingual terminology mining
Bilingual terminology mining
Estelle Delpech
 
A cognitive view of the bilingual lexicon
A cognitive view of the bilingual lexiconA cognitive view of the bilingual lexicon
A cognitive view of the bilingual lexicon
İrem Tümer
 
Macro economische analyse van brazilië
Macro economische analyse van braziliëMacro economische analyse van brazilië
Macro economische analyse van braziliëJan-Willem Lammens
 
Embedded Human Computation for Knowledge Extraction and Evaluation
Embedded Human Computation for Knowledge Extraction and EvaluationEmbedded Human Computation for Knowledge Extraction and Evaluation
Embedded Human Computation for Knowledge Extraction and Evaluation
webLyzard technology
 
Parallel text extraction from multimodal comparable corpora
Parallel text extraction from multimodal comparable corporaParallel text extraction from multimodal comparable corpora
Parallel text extraction from multimodal comparable corpora
Haithem Afli
 
Challenges in the linguistic exploitation of specialized republishable web co...
Challenges in the linguistic exploitation of specialized republishable web co...Challenges in the linguistic exploitation of specialized republishable web co...
Challenges in the linguistic exploitation of specialized republishable web co...
Adrien Barbaresi
 
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Estelle Delpech
 
Word Formation in English
Word Formation in EnglishWord Formation in English
Word Formation in English
teflang
 

Viewers also liked (17)

Bilingual Terminology Extraction based on Translation Patterns
Bilingual Terminology Extraction based on Translation PatternsBilingual Terminology Extraction based on Translation Patterns
Bilingual Terminology Extraction based on Translation Patterns
 
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
 
Chelo Vargas-Sierra
Chelo Vargas-SierraChelo Vargas-Sierra
Chelo Vargas-Sierra
 
Enriching Transliteration Lexicon Using Automatic Transliteration Extraction
Enriching Transliteration Lexicon Using Automatic Transliteration ExtractionEnriching Transliteration Lexicon Using Automatic Transliteration Extraction
Enriching Transliteration Lexicon Using Automatic Transliteration Extraction
 
Michael Bloodgood - 2017 - Acquisition of Translation Lexicons for Historical...
Michael Bloodgood - 2017 - Acquisition of Translation Lexicons for Historical...Michael Bloodgood - 2017 - Acquisition of Translation Lexicons for Historical...
Michael Bloodgood - 2017 - Acquisition of Translation Lexicons for Historical...
 
Dealing with Lexicon Acquired from Comparable Corpora: post-edition and exchange
Dealing with Lexicon Acquired from Comparable Corpora: post-edition and exchangeDealing with Lexicon Acquired from Comparable Corpora: post-edition and exchange
Dealing with Lexicon Acquired from Comparable Corpora: post-edition and exchange
 
Identification of Fertile Translations in Comparable Corpora: a Morpho-Compos...
Identification of Fertile Translations in Comparable Corpora: a Morpho-Compos...Identification of Fertile Translations in Comparable Corpora: a Morpho-Compos...
Identification of Fertile Translations in Comparable Corpora: a Morpho-Compos...
 
Meng Zhang - 2017 - Adversarial Training for Unsupervised Bilingual Lexicon I...
Meng Zhang - 2017 - Adversarial Training for Unsupervised Bilingual Lexicon I...Meng Zhang - 2017 - Adversarial Training for Unsupervised Bilingual Lexicon I...
Meng Zhang - 2017 - Adversarial Training for Unsupervised Bilingual Lexicon I...
 
Cross-lingual ontology lexicalisation, translation and information extraction...
Cross-lingual ontology lexicalisation, translation and information extraction...Cross-lingual ontology lexicalisation, translation and information extraction...
Cross-lingual ontology lexicalisation, translation and information extraction...
 
Bilingual terminology mining
Bilingual terminology miningBilingual terminology mining
Bilingual terminology mining
 
A cognitive view of the bilingual lexicon
A cognitive view of the bilingual lexiconA cognitive view of the bilingual lexicon
A cognitive view of the bilingual lexicon
 
Macro economische analyse van brazilië
Macro economische analyse van braziliëMacro economische analyse van brazilië
Macro economische analyse van brazilië
 
Embedded Human Computation for Knowledge Extraction and Evaluation
Embedded Human Computation for Knowledge Extraction and EvaluationEmbedded Human Computation for Knowledge Extraction and Evaluation
Embedded Human Computation for Knowledge Extraction and Evaluation
 
Parallel text extraction from multimodal comparable corpora
Parallel text extraction from multimodal comparable corporaParallel text extraction from multimodal comparable corpora
Parallel text extraction from multimodal comparable corpora
 
Challenges in the linguistic exploitation of specialized republishable web co...
Challenges in the linguistic exploitation of specialized republishable web co...Challenges in the linguistic exploitation of specialized republishable web co...
Challenges in the linguistic exploitation of specialized republishable web co...
 
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
 
Word Formation in English
Word Formation in EnglishWord Formation in English
Word Formation in English
 

Similar to Applicative evaluation of bilingual terminologies

Test adaptation by sumiran khatri
Test adaptation by sumiran khatriTest adaptation by sumiran khatri
Test adaptation by sumiran khatri
Sumiran Khatri
 
Methodology of MT Post-Editors Training
Methodology of MT Post-Editors TrainingMethodology of MT Post-Editors Training
Methodology of MT Post-Editors Training
Jakub Absolon
 
Auditing_COAs_TranslationSilviaZaragoza
Auditing_COAs_TranslationSilviaZaragozaAuditing_COAs_TranslationSilviaZaragoza
Auditing_COAs_TranslationSilviaZaragoza
Neuropsychological Research Organization s.l
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
robertstevens65
 
Challenges of cultural adaptation and translation of tests..pptx
Challenges of cultural adaptation and translation of tests..pptxChallenges of cultural adaptation and translation of tests..pptx
Challenges of cultural adaptation and translation of tests..pptx
rishavsingh94
 
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific VocabularyA Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
Timm Heuss
 
Lepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metric
Lifeng (Aaron) Han
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
Lifeng (Aaron) Han
 
Using Semantic and Domain-based Information in CLIR Systems
Using Semantic and Domain-based Information in CLIR SystemsUsing Semantic and Domain-based Information in CLIR Systems
Using Semantic and Domain-based Information in CLIR Systems
Mauro Dragoni
 
Improving Your Literature Reviews with NVivo 10 for Windows
Improving Your Literature Reviews with NVivo 10 for WindowsImproving Your Literature Reviews with NVivo 10 for Windows
Improving Your Literature Reviews with NVivo 10 for Windows
QSR International
 
2. Constantin Orasan (UoW) EXPERT Introduction
2. Constantin Orasan (UoW) EXPERT Introduction2. Constantin Orasan (UoW) EXPERT Introduction
2. Constantin Orasan (UoW) EXPERT Introduction
RIILP
 
Psychological test adaptation
Psychological test adaptationPsychological test adaptation
Psychological test adaptation
Carlo Magno
 
Communicating Uncertainty with Probability Phrases
Communicating Uncertainty with Probability PhrasesCommunicating Uncertainty with Probability Phrases
Communicating Uncertainty with Probability Phrases
ipcc-media
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...
RajkiranVeluri
 
Hl7 Standards
Hl7 StandardsHl7 Standards
Project Presentation
Project PresentationProject Presentation
Project Presentation
butest
 
EASE 2014 - Cochrane's translation strategy
EASE 2014 - Cochrane's translation strategyEASE 2014 - Cochrane's translation strategy
EASE 2014 - Cochrane's translation strategy
Juliane Ried
 
الرسالة النهائية مازن الفداوي
الرسالة النهائية مازن الفداويالرسالة النهائية مازن الفداوي
الرسالة النهائية مازن الفداوي
Mazen Al Fedawi
 
co:op-READ-Convention Marburg - Enrique Vidal
co:op-READ-Convention Marburg - Enrique Vidalco:op-READ-Convention Marburg - Enrique Vidal
co:op-READ-Convention Marburg - Enrique Vidal
ICARUS - International Centre for Archival Research
 
Hl7 Standards
Hl7 StandardsHl7 Standards

Similar to Applicative evaluation of bilingual terminologies (20)

Test adaptation by sumiran khatri
Test adaptation by sumiran khatriTest adaptation by sumiran khatri
Test adaptation by sumiran khatri
 
Methodology of MT Post-Editors Training
Methodology of MT Post-Editors TrainingMethodology of MT Post-Editors Training
Methodology of MT Post-Editors Training
 
Auditing_COAs_TranslationSilviaZaragoza
Auditing_COAs_TranslationSilviaZaragozaAuditing_COAs_TranslationSilviaZaragoza
Auditing_COAs_TranslationSilviaZaragoza
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 
Challenges of cultural adaptation and translation of tests..pptx
Challenges of cultural adaptation and translation of tests..pptxChallenges of cultural adaptation and translation of tests..pptx
Challenges of cultural adaptation and translation of tests..pptx
 
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific VocabularyA Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
 
Lepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metric
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
 
Using Semantic and Domain-based Information in CLIR Systems
Using Semantic and Domain-based Information in CLIR SystemsUsing Semantic and Domain-based Information in CLIR Systems
Using Semantic and Domain-based Information in CLIR Systems
 
Improving Your Literature Reviews with NVivo 10 for Windows
Improving Your Literature Reviews with NVivo 10 for WindowsImproving Your Literature Reviews with NVivo 10 for Windows
Improving Your Literature Reviews with NVivo 10 for Windows
 
2. Constantin Orasan (UoW) EXPERT Introduction
2. Constantin Orasan (UoW) EXPERT Introduction2. Constantin Orasan (UoW) EXPERT Introduction
2. Constantin Orasan (UoW) EXPERT Introduction
 
Psychological test adaptation
Psychological test adaptationPsychological test adaptation
Psychological test adaptation
 
Communicating Uncertainty with Probability Phrases
Communicating Uncertainty with Probability PhrasesCommunicating Uncertainty with Probability Phrases
Communicating Uncertainty with Probability Phrases
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...
 
Hl7 Standards
Hl7 StandardsHl7 Standards
Hl7 Standards
 
Project Presentation
Project PresentationProject Presentation
Project Presentation
 
EASE 2014 - Cochrane's translation strategy
EASE 2014 - Cochrane's translation strategyEASE 2014 - Cochrane's translation strategy
EASE 2014 - Cochrane's translation strategy
 
الرسالة النهائية مازن الفداوي
الرسالة النهائية مازن الفداويالرسالة النهائية مازن الفداوي
الرسالة النهائية مازن الفداوي
 
co:op-READ-Convention Marburg - Enrique Vidal
co:op-READ-Convention Marburg - Enrique Vidalco:op-READ-Convention Marburg - Enrique Vidal
co:op-READ-Convention Marburg - Enrique Vidal
 
Hl7 Standards
Hl7 StandardsHl7 Standards
Hl7 Standards
 

More from Estelle Delpech

Génération automatique de texte
Génération automatique de texteGénération automatique de texte
Génération automatique de texte
Estelle Delpech
 
Identification de compatibilités entre tages descriptifs de lieux
Identification de compatibilités entre tages descriptifs de lieuxIdentification de compatibilités entre tages descriptifs de lieux
Identification de compatibilités entre tages descriptifs de lieux
Estelle Delpech
 
Découverte du Traitement Automatique des Langues
Découverte du Traitement Automatique des LanguesDécouverte du Traitement Automatique des Langues
Découverte du Traitement Automatique des Langues
Estelle Delpech
 
Invited speaker, ATALA 2014 Ph. D. Thesis award
Invited speaker, ATALA 2014 Ph. D. Thesis awardInvited speaker, ATALA 2014 Ph. D. Thesis award
Invited speaker, ATALA 2014 Ph. D. Thesis awardEstelle Delpech
 
Corpus comparables et traduction assistée par ordinateur, contributions à la ...
Corpus comparables et traduction assistée par ordinateur, contributions à la ...Corpus comparables et traduction assistée par ordinateur, contributions à la ...
Corpus comparables et traduction assistée par ordinateur, contributions à la ...
Estelle Delpech
 
Identification de compatibilites sémantiques entre descripteurs de lieux
Identification de compatibilites sémantiques entre descripteurs de lieuxIdentification de compatibilites sémantiques entre descripteurs de lieux
Identification de compatibilites sémantiques entre descripteurs de lieux
Estelle Delpech
 
Usage du TAL dans des applications industrielles : gestion des contenus multi...
Usage du TAL dans des applications industrielles : gestion des contenus multi...Usage du TAL dans des applications industrielles : gestion des contenus multi...
Usage du TAL dans des applications industrielles : gestion des contenus multi...
Estelle Delpech
 
Nomao: data analysis for personalized local search
Nomao: data analysis for personalized local searchNomao: data analysis for personalized local search
Nomao: data analysis for personalized local search
Estelle Delpech
 
Nomao: carnet de bonnes adresses (entre amis)
Nomao: carnet de bonnes adresses (entre amis)Nomao: carnet de bonnes adresses (entre amis)
Nomao: carnet de bonnes adresses (entre amis)
Estelle Delpech
 
Nomao: local search and recommendation engine
Nomao: local search and recommendation engineNomao: local search and recommendation engine
Nomao: local search and recommendation engine
Estelle Delpech
 
Évaluation applicative des terminologies destinées à la traduction spécialisée
Évaluation applicative des terminologies destinées à la traduction spécialiséeÉvaluation applicative des terminologies destinées à la traduction spécialisée
Évaluation applicative des terminologies destinées à la traduction spécialisée
Estelle Delpech
 
R&D Lingua et Machina
R&D Lingua et MachinaR&D Lingua et Machina
R&D Lingua et Machina
Estelle Delpech
 
Robust rule-based parsing
Robust rule-based parsingRobust rule-based parsing
Robust rule-based parsing
Estelle Delpech
 
Experimenting the TextTiling Algorithm
Experimenting the TextTiling AlgorithmExperimenting the TextTiling Algorithm
Experimenting the TextTiling Algorithm
Estelle Delpech
 
Text Processing for Procedural Question Answering
Text Processing for Procedural Question AnsweringText Processing for Procedural Question Answering
Text Processing for Procedural Question Answering
Estelle Delpech
 

Applicative evaluation of bilingual terminologies

  • 1. Applicative evaluation of bilingual terminologies. Estelle Delpech, NODALIDA, 12th May 2011
  • 2. Outline 1. Context and scope of work 2. Comparable corpora and terminology evaluation 3. Applicative evaluation protocol 4. Experimentation and results 5. Future improvements
  • 3. Outline 1. Context and scope of work 2. Comparable corpora and terminology evaluation 3. Applicative evaluation protocol 4. Experimentation and results 5. Future improvements
  • 4. Context of the work • Bilingual terminology mining from comparable corpora • Application to: – computer-aided translation – computer-aided terminology
  • 5. Scope of the work • Find a way to show the "added value" of the acquired terminology when used for technical translation – do translators translate better and/or faster? • Design and testing of an "applicative" evaluation protocol for bilingual terminologies
  • 6. Outline 1. Context and scope of work 2. Comparable corpora and terminology evaluation 3. Applicative evaluation protocol 4. Experimentation and results 5. Future improvements
  • 7. Comparable corpora • English texts on breast cancer: "It has been suggested that breast magnetic resonance imaging (MRI) is more accurate in the diagnosis of breast cancer..." "Histological evaluation revealed the presence of DCIS..." • French texts on breast cancer: "L'imagerie par résonance magnétique avec injection de gadolinium (IRM) est une technique indépendante de la densité mammaire..." "Un diagnostic histologique est nécessaire..."
  • 8. Comparable corpora • English texts on breast cancer: "It has been suggested that breast magnetic resonance imaging (MRI) is more accurate in the diagnosis of breast cancer..." "Histological evaluation revealed the presence of DCIS..." • French texts on breast cancer: "L'imagerie par résonance magnétique avec injection de gadolinium (IRM) est une technique indépendante de la densité mammaire..." "Un diagnostic histologique est nécessaire..."
  • 9. Comparable corpora • English texts on breast cancer: "It has been suggested that breast magnetic resonance imaging (MRI) is more accurate in the diagnosis of breast cancer..." "Histological evaluation revealed the presence of ductal carcinoma in situ." • French texts on breast cancer: "L'imagerie par résonance magnétique avec injection de gadolinium (IRM) est une technique indépendante de la densité mammaire..." "Un diagnostic histologique est nécessaire..."
  • 10. Advantages of comparable corpora • More widely available – new domains – unprecedented language pairs • Quality – spontaneous language – not influenced by source texts
  • 11. Reference evaluation of bilingual terminologies • Reference evaluation: – the output of the program is compared with a list of reference translations • Precision: – percentage of output translations which are in the reference: precision = |output ∩ reference| / |output|
  • 12. Reference evaluation with comparable corpora • Output: – source term → ordered list of candidate translations • Example: – histological → diagnostic (rank 1), histologie (rank 2), histologique (rank 3), …, nécessaire (rank n)
  • 13. Reference evaluation with comparable corpora • Precision: – percentage of output translations which are in the reference when only the Top 10 or Top 20 candidate translations are taken into account • State of the art: – between 42% and 80% on the Top 20, depending on corpus size, corpus type and the nature of the translated elements [Morin and Daille, 2009]
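The Top-N precision described on slides 11 to 13 can be sketched as follows. This is only an illustration under stated assumptions: it counts a source term as correctly translated when at least one reference translation appears among its first N candidates, which is a common reading of the measure but is not spelled out on the slides. The function name `precision_at_n`, the data layout, and the "breast cancer" entry are hypothetical; the "histological" ranking is the example from slide 12.

```python
def precision_at_n(candidates, reference, n):
    """Share of source terms whose top-n candidates contain a reference translation.

    candidates: dict mapping a source term to its ranked list of candidate translations
    reference:  dict mapping a source term to the set of accepted translations
    """
    # Only source terms that have a reference entry can be evaluated.
    evaluated = [term for term in candidates if term in reference]
    if not evaluated:
        return 0.0
    hits = sum(
        1 for term in evaluated
        if set(candidates[term][:n]) & reference[term]
    )
    return hits / len(evaluated)


# Ranked output of a hypothetical alignment program.
candidates = {
    "histological": ["diagnostic", "histologie", "histologique", "nécessaire"],
    "breast cancer": ["cancer du sein", "cancer", "sein"],
}
reference = {
    "histological": {"histologique"},
    "breast cancer": {"cancer du sein"},
}

top1 = precision_at_n(candidates, reference, 1)
top3 = precision_at_n(candidates, reference, 3)
print(f"Top 1: {top1:.0%}, Top 3: {top3:.0%}")
```

With this toy data, "histological" is only found once the third candidate is included, so the score rises as N grows. This is why results are reported for a fixed cut-off such as Top 10 or Top 20.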
  • 14. Reference vs. applicative evaluation • Reference evaluation: – suitable for testing/developing the alignment program – fast, cheap, reproducible, objective • Applicative evaluation: – how much does the alignment program help the end users? – can the terminologies improve translation quality?
  • 15. Outline 1. Context and scope of work 2. Comparable corpora and terminology evaluation 3. Applicative evaluation protocol 4. Experimentation and results 5. Future improvements
  • 20. Questions raised 1) How do you assess translation quality? 2) Evaluate the whole translation or the technical terms only?
  • 21. 1) How do you assess translation quality? • Translation studies evaluation grids: – SICAL, SAE J 2450 – too complex, scarcely documented • Machine translation objective metrics: – BLEU, METEOR – not suited to human translation – reproducibility is not an advantage in our case
  • 22. 1) How do you assess translation quality? • Machine translation subjective evaluation – translations evaluated by humans: • quality judgement: adequacy, fluency... • ranking – use an annotator agreement measure to ensure that agreement between judges is sufficient
  • 23. 2) Evaluate the whole text or just some terms? • Quality of a text translation = complex interaction of several parameters • Focus on those elements for which the translator felt he/she needed a linguistic resource: – evaluates only the part of the translation on which the terminology has an impact – easier and faster
  • 24. Applicative evaluation protocol • Compare 3 different "translation situations" – one situation = one type of resource • Translators do the translation and note down the terms they had to look up • The quality of the terms' translations is assessed by human judges
  • 29. Translations' assessment 1. Quality judgement: – correct: standard term or expression – acceptable: meaning is retained – wrong: no meaning is retained 2. Ranking: – from best to worst – ties allowed
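Agreement between judges on the three quality labels above can be checked with a kappa statistic. The sketch below computes Cohen's kappa for two judges; the slides do not say which agreement coefficient was actually used, so this choice is an assumption, and the label sequences are invented for illustration.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two judges who labelled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both judges must label the same non-empty set of items")
    n = len(labels_a)
    # Observed agreement: share of items on which both judges gave the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each judge's label frequencies, summed over labels.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both judges used one identical label throughout
    return (observed - expected) / (1 - expected)


# Invented judgements on four translated terms.
judge_1 = ["correct", "correct", "acceptable", "wrong"]
judge_2 = ["correct", "acceptable", "acceptable", "wrong"]

kappa = cohen_kappa(judge_1, judge_2)
print(f"kappa = {kappa:.2f}")
```

Kappa corrects raw agreement for the agreement expected by chance: the judges here agree on 3 items out of 4, but part of that agreement would occur even with random labelling, so the coefficient is lower than 0.75.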
  • 30. Outline 1. Context and scope of work 2. Comparable corpora and terminology evaluation 3. Applicative evaluation protocol 4. Experimentation and results 5. Future improvements
  • 31. Data • Comparable corpora: – breast cancer: 400k words/language – water science: 2M words/language • Texts to translate: – research paper abstracts: ~500 words/domain – lay science texts: ~500 words/domain
  • 32. Translators' feedback "Globally, 75% of technical words aren't in the glossary, and for the other 25%, 99% have between 10 and 20 candidate translations and none has been validated. So most of the time, you are just partly sure, but you are never totally sure of your translation. And in the worst cases, you translate instinctively." • Translators were not prepared to use a bilingual terminology with many candidate translations • The terminology only partially covered the vocabulary of the texts to translate
  • 33. Terminology coverage of texts to translate • Breast Cancer – 94% of the vocabulary of the texts is in the terminology – fine-grained topic • Water Science – 14% of the vocabulary of the texts is in the terminology – topic is too general
  • 34. Quality judgement / Breast Cancer • equivalent proportion of incorrect translations • The Internet gives the most correct translations, followed by the comparable corpora. [Stacked bar chart, BREAST CANCER, K = 0.25; bars labelled Sit. 1 / CC, Sit. 0 / Gen. lang., Sit. 2 / Web; values as extracted: 20%, 19%, 18%; 42%, 38%, 35%; 38%, 43%, 47%]
  • 35. Quality judgement / Water Science • Translations are much better with the Internet • Comparable corpora produce worse translations than the general resources. [Stacked bar chart, WATER SCIENCE, K = 0.42; bars labelled Sit. 1 / CC, Sit. 0 / Gen. lang., Sit. 2 / Web; values as extracted: 18%, 21%, 23%; 23%, 7%, 16%; 77%, 59%, 56%]
  • 36. Results seem incoherent • Translations produced in situation 1 are worse than translations produced in situation 2 • But they share the same "general language resource" basis [Diagram: Baseline = general language resources; Situation 1 = general language resources + terminology mined from comparable corpora]
  • 37. Possible explanation • Resource used (Baseline / Situation 1, comparable corpora / Situation 2, Web): – general language resource: 43% / 14% / 3% – specialized resource: - / 25% / 56% – intuition: 79% / 77% / 44% • When translators have a specialized resource, they tend to ignore the general language resource
  • 38. Possible explanation • Resource used (Baseline / Situation 1, comparable corpora / Situation 2, Web): – general language resource: 43% / 14% / 3% – specialized resource: - / 25% / 56% – intuition: 79% / 77% / 44% • If translators in situation 1 had always looked up the general resource first, translations in situation 1 would have been at least as good as translations in situation 0
  • 39. Ranking / Breast Cancer [Bar chart, BREAST CANCER, K = 0.69; comparisons: CC vs. Gen. lang., CC vs. Web; values as extracted: 42%, 47%, 32%, 28%, 26%, 26%]
  • 40. Ranking / Water Science [Bar chart, WATER SCIENCE, K = 0.63; comparisons: CC vs. Gen. lang., CC vs. Web; values as extracted: 49%, 41%, 43%, 33%, 18%, 16%]
  • 41. Outline 1. Context and scope of work 2. Bilingual terminology mining: comparable vs. parallel corpora 3. Evaluation of bilingual terminologies 4. Applicative evaluation protocol 5. Experimentation and results 6. Future improvements
  • 42. Improvements: terminology coverage • dependency between: – added value of the bilingual terminology – its coverage of the texts to translate • any added-value measure should also indicate to what extent the terminology contains the vocabulary of the translated texts
  • 43. Improvement 1: terminology coverage • Perspectives: – create a "coverage" measure – find out the minimum coverage a terminology needs to be "useful" for translating a given text – gather smaller but finer-grained corpora
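One possible form for the "coverage" measure proposed above is the share of distinct words of the text to translate that also occur in the source side of the terminology. The sketch below is only an assumption about what such a measure could look like: the slides do not say how the 94% and 14% figures were computed, and the function name `coverage`, the tokenization, and the sample data are hypothetical.

```python
import re

# Runs of letters only (Unicode-aware), so digits and punctuation are ignored.
WORD = re.compile(r"[^\W\d_]+")


def coverage(text, source_terms):
    """Share of the distinct words of `text` found in the terminology's source terms."""
    vocabulary = set(WORD.findall(text.lower()))
    if not vocabulary:
        return 0.0
    known = {word for term in source_terms for word in WORD.findall(term.lower())}
    return len(vocabulary & known) / len(vocabulary)


text = "Histological evaluation revealed the presence of ductal carcinoma"
source_terms = ["histological evaluation", "ductal carcinoma"]

print(f"coverage = {coverage(text, source_terms):.0%}")
```

A real measure would probably work on lemmas, ignore function words such as "the" and "of", and treat multi-word terms as units; those refinements are left out to keep the sketch short.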
  • 44. Improvement 2: translation situations • When translators have several resources at their disposal, they tend to ignore the general language resource • Consequence: the same resource is used differently depending on the situation • This seems to be the cause of the incoherent results
  • 45. Improvement 2: translation situations • Perspective: use 0 or 1 resource per translation situation [Diagram: Situation 0; Situation 1 (terminology mined from comparable corpora); Situation 2 (Web)]
  • 46. Improvement 3: train translators • Prepare translators to use "ambiguous", unvalidated terminologies • Run a first dry-run evaluation to: – train the translators – train the judges → results in higher agreement
  • 47. Acknowledgements This work was funded by: – French National Research Agency, grant no. ANR-08-CORD-009 – Lingua et Machina, www.lingua-et-machina.com Annotators: – Clémence De Baudus – Mathieu Delage