SlideShare a Scribd company logo
1 of 17
An Empirical Simulation-based Study of 
Real-Time Speech Translation for 
Multilingual Global Project Teams 
Fabio Calefato 
Filippo Lanubile 
University of Bari, Italy 
Rafael Prikladnicki 
João Henrique Stocker Pinto 
PUCRS, Brazil 
ESEM'14 - Turin, Sept. 18-19, 2014 1
Motivation 
• Global software projects challenged by 
language differences 
– Especially requirements meetings 
• Speech translation technology for remote 
meetings in countries with 
– Opportunities for global projects 
– Lack of English speaking professionals 
• Goal: 
Evaluate the feasibility of adopting a real-time 
speech translation to support multilingual 
requirements meetings 
ESEM'14 - Turin, Sept. 18-19, 2014 2
Speech Translation 
Speech Recognition + 
• First prototypes date back 
to early 70s 
– Appropriate for dictation 
only, not for real-time 
captioning of speech 
YET 
– Recent progress, 
especially with mobile 
devices 
– Need for further 
investigation 
Machine Translation 
• First prototypes date back 
to 50s 
– Still far from 100% accurate 
for multilingual group 
communication 
YET 
– Not disruptive of the 
conversation flow 
– Does not prevent complex 
tasks completion 
– Even grants more balanced 
discussions
Research questions 
• RQ1: How well does speech translation work 
for continuous speech in global software 
projects? 
• RQ2: How does technical jargon affect speech 
translation in global software projects? 
ESEM'14 - Turin, Sept. 18-19, 2014 4
Simulation-based study 
• 8 Participants 
– Software engineering professionals 
– 4 from Bari (Italy) and 4 from Porto Alegre (Brazil) 
– 7 males, 1 female 
ESEM'14 - Turin, Sept. 18-19, 2014 5
Instrumentation 
• 60 sentences from 5 requirements workshop logs 
– Half containing jargon 
– Half generic 
– Increasing length (# words 5-30) 
• Manually translated EN -> IT, PT 
• Google Chrome Web Speech API Demo + Google 
Translate 
ESEM'14 - Turin, Sept. 18-19, 2014 6 
Speech transcript 
(IT / PT) 
Resulting 
translation 
(IT / PT / EN) 
Speech transcript 
(IT / PT)
Variables for 
Speech Recognition 
• Independent 
1. Source Language (IT, PT) 
2. Speaker (Speaker1-4, nested under Language) 
3. Lexicon (generic, jargon) 
4. Replication (R1-R30, nested under Lexicon) 
• Dependent 
1. Transcript accuracy 
′ = 푇 푎푐푐 + 1 2 
ESEM'14 - Turin, Sept. 18-19, 2014 7 
푇푎푐푐 = 
# 푟푒푐표푔푛푖푧푒푑 푤표푟푑푠 − # 푒푟푟표푟푠 
# 푤표푟푑푠 푖푛 푢푡푡푒푟푎푛푐푒 
푇푎푐푐
Variables for 
Machine Translation 
• Independent 
1. Language Pairs 
(ITEN, PTIT, PTEN, ITPT) 
2. Lexicon (generic, jargon) 
3. Replication (R1-R30, nested under Lexicon) 
• Dependent 
1. Translation adequacy 
ESEM'14 - Turin, Sept. 18-19, 2014 8
Translation Adequacy 
scoring scheme 
Category Description 
4 
Completely adequate. The translation clearly reflects the information 
contained in the original sentence. It is perfectly clear, intelligible, 
grammatically correct, and reads like ordinary tex. 
3 
Fairly adequate. The translation generally reflects the information 
contained in the original sentence, despite some inaccuracies or 
infelicities in the text. It is generally clear and intelligible, and one can 
(almost) immediately understand what it means. 
2 
Poorly adequate. The translation poorly reflects the information 
contained in the original sentence. It contains grammatical errors and/or 
poor word choices. The general idea of the text is intelligible only after 
considerable study. 
1 
Completely adequate. The translation is unintelligible and it is not 
possible to obtain the information contained in the original sentence. 
Studying the meaning of the text is hopeless and, even allowing for 
context, one feels that guessing would be too unreliable. 
Adapted from: D. Arnold et al. "Machine Translation: an Introductory Guide" (1994)
Results: Speech Recognition 
Accuracy (1/2) 
Mean 
• Minimal differences in mean accuracy 
– by Language and Lexicon 
– by Speaker (except PT-Speaker2) 
ESEM'14 - Turin, Sept. 18-19, 2014 10 
Language 
IT .81 
PT .75 
Lexicon 
Generic .80 
Jargon .77 
Speaker Language Mean 
PT-Speaker1 PT .78 
PT-Speaker2 PT .68 
PT-Speaker3 PT .79 
PT-Speaker4 PT .73 
IT-Speaker1 IT .76 
IT-Speaker2 IT .88 
IT-Speaker3 IT .78 
IT-Speaker4 IT .82
Results: Speech Recognition 
Accuracy (2/2) 
Source df 
Mean 
Square 
F Sig. 
Intercept 1 290.785 587.408 .017 
Language 1 .460 2.948 .144 
Speaker(Language) 6 .166 12.907 .003† 
Lexicon 1 .125 3.285 .104 
Replication(Lexicon) 58 .082 1.740 .018† 
Language * Lexicon 1 .003 .068 .797 
UNIANOVA 
• Speaker(Language) and Replication(Lexicon) 
the only significant factors 
ESEM'14 - Turin, Sept. 18-19, 2014 11 
Language * 
Replication(Lexicon) 
58 .047 3.004 .000 
Lexicon * Speaker(Language) 6 .013 .817 .557 
† Significant at 5% level
Results: Speech Translation 
Adequacy (1/2) 
Adequacy by Language pairs Adequacy by Lexicon 
152 
141 
165 
50 
49 
30 
70 
71 
90 
38 
50 
45 
82 
70 
75 
IT->PT 
IT->EN 
PT->IT 
ESEM'14 - Turin, Sept. 18-19, 2014 12 
88 
99 
75 
80 
160 
0% 20% 40% 60% 80% 100% 
IT->PT 
IT->EN 
PT->IT 
PT->EN 
Adequate 
(categories 4 and 3) 
Inadequate 
(categories 2 and 1) 
32 
88 
48 
72 
0 100 200 
PT->EN 
Jargon 
Adequate Inadequate 
Generic 
Adequate Inadequate 
41% 
31%
Results: Speech Translation 
Adequacy (2/2) 
Spearman's rho 
(N=120) 
PT->IT PT->EN IT->PT IT->EN 
Jargon Generic Jargon Generic Jargon Generic Jargon Generic 
Translation adequacy 
Transcription accuracy 
Translation adequacy 
Transcription accuracy 
Translation adequacy 
Transcription accuracy 
Translation adequacy 
Transcription accuracy 
Translation adequacy 
Transcription accuracy 
Translation adequacy 
Transcription accuracy 
Translation adequacy 
Transcription accuracy 
Translation adequacy 
Transcription accuracy 
Translat. 
adequacy 
Correl. 1.0 .55* 1.0 .54* 1.0 .63* 1.0 .59* 1.0 .71* 1.0 .55* 1.0 .72* 1.0 .61* 
Sig. . .00 . .00 . .00 . .00 . .00 . .00 . .00 . .00 
Transcript. 
accuracy 
Correl. .55* 1.0 .54* 1.0 .63* 1.0 .59* 1.0 .71* 1.0 .55* 1.0 .72* 1.0 .61* 1.0 
Sig. .00 . .00 . .00 . .00 . .00 . .00 . .00 . .00 . 
* Correlation significant at the 0.01 level (2-tailed). 
• Moderate positive correlation between 
Transcription Accuracy and Translation Adequacy
Conclusions: RQ1 
How well does speech translation work for 
continuous speech? 
• Our study setup: Simulation of a conversation 
– Similar to: Automatic generation of closed captioning 
from webcasts 
• Our findings 
– In line with baseline: 75% word accuracy [1] 
YET 
• Adequate speech translations 31-41% 
• Some domains more critical than others 
ESEM'14 - Turin, Sept. 18-19, 2014 14 
[1] Munteanu et al. “Collaborative editing for improved usefulness and usability of transcript-enhanced webcasts”, CHI’08
Conclusions: RQ2 
How does technical jargon affect speech 
translation? 
• No evidence that jargon generates worse 
speech translations 
– At least, in the CS domain 
HOWEVER 
• Professionals reads jargon differently 
– e.g., “SQL” → SEQUEL, spelled in Italian, in 
English… 
ESEM'14 - Turin, Sept. 18-19, 2014 15
Study limitation & Future work 
- Simulation-based study 
- What would happen in a 
real setting? 
- Refine transcription 
accuracy construct 
(errors) 
- One technology only 
- i.e., Google’s Web Speech 
API and Translate 
- Effect of accents, 
pronunciations, gender? 
- i.e., only 8 speakers, 1 
female 
+ Run a controlled 
experiment 
+ Multi-language group task 
+ Distinguish between 
incorrect and missing 
words 
+ Compare more speech 
translation solutions 
+ e.g., Nuance, Sphinx, Bing 
+ Involve more speakers in 
experiments 
+ Also include EN native 
speakers 
ESEM'14 - Turin, Sept. 18-19, 2014 16
Thanks! 
QUESTIONS? 
fabio.calefato@uniba.it

More Related Content

What's hot

Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionManthan Gandhi
 
Open-source machine translation for Icelandic: the Apertium platform as an o...
Open-source machine translation for Icelandic:
 the Apertium platform as an o...Open-source machine translation for Icelandic:
 the Apertium platform as an o...
Open-source machine translation for Icelandic: the Apertium platform as an o...Forcada Mikel
 
Efficient Speech Emotion Recognition using SVM and Decision Trees
Efficient Speech Emotion Recognition using SVM and Decision TreesEfficient Speech Emotion Recognition using SVM and Decision Trees
Efficient Speech Emotion Recognition using SVM and Decision TreesIRJET Journal
 
Lessons learned on localization testing
Lessons learned on localization testingLessons learned on localization testing
Lessons learned on localization testinggaoliang641
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translationMarcis Pinnis
 
Natural language identification
Natural language identificationNatural language identification
Natural language identificationShaktiTaneja
 
Internationalization & localization testing
Internationalization & localization testingInternationalization & localization testing
Internationalization & localization testingRobin0590
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translationguest873a50
 
Techniques in translation, computer assisted, machine translation, subtitling...
Techniques in translation, computer assisted, machine translation, subtitling...Techniques in translation, computer assisted, machine translation, subtitling...
Techniques in translation, computer assisted, machine translation, subtitling...Moses Altovar
 
Chapter 2 representation of algorithms
Chapter 2 representation of algorithmsChapter 2 representation of algorithms
Chapter 2 representation of algorithmsLi-Anne Serrano
 
virtual Touch keypad 12-key Chinese text input benchmark test report
virtual Touch keypad 12-key Chinese text input benchmark test reportvirtual Touch keypad 12-key Chinese text input benchmark test report
virtual Touch keypad 12-key Chinese text input benchmark test reportJohn Chen, Jun
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...John Tinsley
 
Translation j 2009-cocci (1)
Translation j 2009-cocci (1)Translation j 2009-cocci (1)
Translation j 2009-cocci (1)FabiolaPanetti
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overviewsajanazoya
 
What is Internationalization & Localization Testing?
What is Internationalization & Localization Testing?What is Internationalization & Localization Testing?
What is Internationalization & Localization Testing?QA InfoTech
 

What's hot (20)

Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Open-source machine translation for Icelandic: the Apertium platform as an o...
Open-source machine translation for Icelandic:
 the Apertium platform as an o...Open-source machine translation for Icelandic:
 the Apertium platform as an o...
Open-source machine translation for Icelandic: the Apertium platform as an o...
 
Efficient Speech Emotion Recognition using SVM and Decision Trees
Efficient Speech Emotion Recognition using SVM and Decision TreesEfficient Speech Emotion Recognition using SVM and Decision Trees
Efficient Speech Emotion Recognition using SVM and Decision Trees
 
Lessons learned on localization testing
Lessons learned on localization testingLessons learned on localization testing
Lessons learned on localization testing
 
Techniques in Translation
Techniques in TranslationTechniques in Translation
Techniques in Translation
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translation
 
Machine Tanslation
Machine TanslationMachine Tanslation
Machine Tanslation
 
Natural language identification
Natural language identificationNatural language identification
Natural language identification
 
Internationalization & localization testing
Internationalization & localization testingInternationalization & localization testing
Internationalization & localization testing
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translation
 
Techniques in translation, computer assisted, machine translation, subtitling...
Techniques in translation, computer assisted, machine translation, subtitling...Techniques in translation, computer assisted, machine translation, subtitling...
Techniques in translation, computer assisted, machine translation, subtitling...
 
Machine Translation
Machine TranslationMachine Translation
Machine Translation
 
Chapter 2 representation of algorithms
Chapter 2 representation of algorithmsChapter 2 representation of algorithms
Chapter 2 representation of algorithms
 
virtual Touch keypad 12-key Chinese text input benchmark test report
virtual Touch keypad 12-key Chinese text input benchmark test reportvirtual Touch keypad 12-key Chinese text input benchmark test report
virtual Touch keypad 12-key Chinese text input benchmark test report
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
 
Translation j 2009-cocci (1)
Translation j 2009-cocci (1)Translation j 2009-cocci (1)
Translation j 2009-cocci (1)
 
Machine translation
Machine translationMachine translation
Machine translation
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
What is Internationalization & Localization Testing?
What is Internationalization & Localization Testing?What is Internationalization & Localization Testing?
What is Internationalization & Localization Testing?
 
Bridge to r
Bridge to rBridge to r
Bridge to r
 

Viewers also liked

The translator (session 3)
The translator (session 3)The translator (session 3)
The translator (session 3)Victormorses
 
Instant speech translation 10BM60080 - VGSOM
Instant speech translation   10BM60080 - VGSOMInstant speech translation   10BM60080 - VGSOM
Instant speech translation 10BM60080 - VGSOMsathiyaseelanm
 
Conversational Speech Translation - Challenges and Techniques, by Chris Wendt...
Conversational Speech Translation - Challenges and Techniques, by Chris Wendt...Conversational Speech Translation - Challenges and Techniques, by Chris Wendt...
Conversational Speech Translation - Challenges and Techniques, by Chris Wendt...TAUS - The Language Data Network
 
Personalising speech to-speech translation
Personalising speech to-speech translationPersonalising speech to-speech translation
Personalising speech to-speech translationbehzad66
 
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...Kotaro Hara
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentationsamyakbhuta
 
Wearable Technology Design
Wearable Technology DesignWearable Technology Design
Wearable Technology DesignJeffrey Funk
 
fire detection and alarm system
fire detection and alarm systemfire detection and alarm system
fire detection and alarm systemsingh1515
 
Sign language translator ieee power point
Sign language translator ieee power pointSign language translator ieee power point
Sign language translator ieee power pointMadhuri Yellapu
 

Viewers also liked (9)

The translator (session 3)
The translator (session 3)The translator (session 3)
The translator (session 3)
 
Instant speech translation 10BM60080 - VGSOM
Instant speech translation   10BM60080 - VGSOMInstant speech translation   10BM60080 - VGSOM
Instant speech translation 10BM60080 - VGSOM
 
Conversational Speech Translation - Challenges and Techniques, by Chris Wendt...
Conversational Speech Translation - Challenges and Techniques, by Chris Wendt...Conversational Speech Translation - Challenges and Techniques, by Chris Wendt...
Conversational Speech Translation - Challenges and Techniques, by Chris Wendt...
 
Personalising speech to-speech translation
Personalising speech to-speech translationPersonalising speech to-speech translation
Personalising speech to-speech translation
 
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentation
 
Wearable Technology Design
Wearable Technology DesignWearable Technology Design
Wearable Technology Design
 
fire detection and alarm system
fire detection and alarm systemfire detection and alarm system
fire detection and alarm system
 
Sign language translator ieee power point
Sign language translator ieee power pointSign language translator ieee power point
Sign language translator ieee power point
 

Similar to 65 - An Empirical Simulation-based Study of Real-Time Speech Translation for Multilingual Global Project Teams

Research data as an aid in teaching technical competence in subtitling
Research data as an aid in teaching technical competence in subtitlingResearch data as an aid in teaching technical competence in subtitling
Research data as an aid in teaching technical competence in subtitlingUniversity of Warsaw
 
12EEE032- text 2 voice
12EEE032-  text 2 voice12EEE032-  text 2 voice
12EEE032- text 2 voiceNsaroj kumar
 
Methodology of MT Post-Editors Training
Methodology of MT Post-Editors TrainingMethodology of MT Post-Editors Training
Methodology of MT Post-Editors TrainingJakub Absolon
 
Respeaking as a part of translation and interpreting curriculum
Respeaking as a part of translation and interpreting curriculumRespeaking as a part of translation and interpreting curriculum
Respeaking as a part of translation and interpreting curriculumUniversity of Warsaw
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)TAUS - The Language Data Network
 
Quality Translation Enhancement Using Sequence Knowledge and Pruning in Stati...
Quality Translation Enhancement Using Sequence Knowledge and Pruning in Stati...Quality Translation Enhancement Using Sequence Knowledge and Pruning in Stati...
Quality Translation Enhancement Using Sequence Knowledge and Pruning in Stati...TELKOMNIKA JOURNAL
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...alessio_ferrari
 
G2 pil a grapheme to-phoneme conversion tool for the italian language
G2 pil a grapheme to-phoneme conversion tool for the italian languageG2 pil a grapheme to-phoneme conversion tool for the italian language
G2 pil a grapheme to-phoneme conversion tool for the italian languageijnlc
 
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...Hussein Ghaly
 
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdfsimonp16
 
English speaking proficiency assessment using speech and electroencephalograp...
English speaking proficiency assessment using speech and electroencephalograp...English speaking proficiency assessment using speech and electroencephalograp...
English speaking proficiency assessment using speech and electroencephalograp...IJECEIAES
 
IRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET Journal
 
Acceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation SystemAcceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation SystemMichele Thomas
 
The Future does not need Translators. Or does it?- Marcello Federico
The Future does not need Translators. Or does it?- Marcello FedericoThe Future does not need Translators. Or does it?- Marcello Federico
The Future does not need Translators. Or does it?- Marcello FedericoTAUS - The Language Data Network
 
Morphological Analyzer and Generator for Tamil Language
Morphological Analyzer and Generator for Tamil LanguageMorphological Analyzer and Generator for Tamil Language
Morphological Analyzer and Generator for Tamil LanguageLushanthan Sivaneasharajah
 
Direct Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete UnitsDirect Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete UnitsIJCI JOURNAL
 
Automatic subtitle generation
Automatic subtitle generationAutomatic subtitle generation
Automatic subtitle generationtanyasaxena1611
 
Human Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila CastilhoHuman Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila CastilhoSebastian Ruder
 

Similar to 65 - An Empirical Simulation-based Study of Real-Time Speech Translation for Multilingual Global Project Teams (20)

Research data as an aid in teaching technical competence in subtitling
Research data as an aid in teaching technical competence in subtitlingResearch data as an aid in teaching technical competence in subtitling
Research data as an aid in teaching technical competence in subtitling
 
**JUNK** (no subject)
**JUNK** (no subject)**JUNK** (no subject)
**JUNK** (no subject)
 
12EEE032- text 2 voice
12EEE032-  text 2 voice12EEE032-  text 2 voice
12EEE032- text 2 voice
 
Methodology of MT Post-Editors Training
Methodology of MT Post-Editors TrainingMethodology of MT Post-Editors Training
Methodology of MT Post-Editors Training
 
Respeaking as a part of translation and interpreting curriculum
Respeaking as a part of translation and interpreting curriculumRespeaking as a part of translation and interpreting curriculum
Respeaking as a part of translation and interpreting curriculum
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
 
Quality Translation Enhancement Using Sequence Knowledge and Pruning in Stati...
Quality Translation Enhancement Using Sequence Knowledge and Pruning in Stati...Quality Translation Enhancement Using Sequence Knowledge and Pruning in Stati...
Quality Translation Enhancement Using Sequence Knowledge and Pruning in Stati...
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...
 
G2 pil a grapheme to-phoneme conversion tool for the italian language
G2 pil a grapheme to-phoneme conversion tool for the italian languageG2 pil a grapheme to-phoneme conversion tool for the italian language
G2 pil a grapheme to-phoneme conversion tool for the italian language
 
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...
COMPUTATIONAL APPROACHES TO THE SYNTAX-PROSODY INTERFACE: USING PROSODY TO IM...
 
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
 
English speaking proficiency assessment using speech and electroencephalograp...
English speaking proficiency assessment using speech and electroencephalograp...English speaking proficiency assessment using speech and electroencephalograp...
English speaking proficiency assessment using speech and electroencephalograp...
 
IRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation System
 
srinu.pptx
srinu.pptxsrinu.pptx
srinu.pptx
 
Acceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation SystemAcceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation System
 
The Future does not need Translators. Or does it?- Marcello Federico
The Future does not need Translators. Or does it?- Marcello FedericoThe Future does not need Translators. Or does it?- Marcello Federico
The Future does not need Translators. Or does it?- Marcello Federico
 
Morphological Analyzer and Generator for Tamil Language
Morphological Analyzer and Generator for Tamil LanguageMorphological Analyzer and Generator for Tamil Language
Morphological Analyzer and Generator for Tamil Language
 
Direct Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete UnitsDirect Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete Units
 
Automatic subtitle generation
Automatic subtitle generationAutomatic subtitle generation
Automatic subtitle generation
 
Human Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila CastilhoHuman Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila Castilho
 

More from ESEM 2014

Keynote 2 - The 20% of software engineering practices that contribute to 80% ...
Keynote 2 - The 20% of software engineering practices that contribute to 80% ...Keynote 2 - The 20% of software engineering practices that contribute to 80% ...
Keynote 2 - The 20% of software engineering practices that contribute to 80% ...ESEM 2014
 
Keynote 1 - Engineering Software Analytics Studies
Keynote 1 - Engineering Software Analytics StudiesKeynote 1 - Engineering Software Analytics Studies
Keynote 1 - Engineering Software Analytics StudiesESEM 2014
 
33 - On Knowledge Transfer Skill in Pair Programming
33 - On Knowledge Transfer Skill in Pair Programming33 - On Knowledge Transfer Skill in Pair Programming
33 - On Knowledge Transfer Skill in Pair ProgrammingESEM 2014
 
222 - Design Pattern Decay: The Case for Class Grime
222 - Design Pattern Decay: The Case for Class Grime222 - Design Pattern Decay: The Case for Class Grime
222 - Design Pattern Decay: The Case for Class GrimeESEM 2014
 
210 - Software Population Pyramids: The Current and the Future of OSS Develop...
210 - Software Population Pyramids: The Current and the Future of OSS Develop...210 - Software Population Pyramids: The Current and the Future of OSS Develop...
210 - Software Population Pyramids: The Current and the Future of OSS Develop...ESEM 2014
 
169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...
169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...
169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...ESEM 2014
 
196 - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...
196  - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...196  - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...
196 - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...ESEM 2014
 
42- Using Templates to Elicit Implied Security Requirements from Functional R...
42- Using Templates to Elicit Implied Security Requirements from Functional R...42- Using Templates to Elicit Implied Security Requirements from Functional R...
42- Using Templates to Elicit Implied Security Requirements from Functional R...ESEM 2014
 
166 - ISBSG variables most frequently used for software effort estimation: A ...
166 - ISBSG variables most frequently used for software effort estimation: A ...166 - ISBSG variables most frequently used for software effort estimation: A ...
166 - ISBSG variables most frequently used for software effort estimation: A ...ESEM 2014
 
112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...
112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...
112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...ESEM 2014
 
224 - Factors Impacting Rapid Releases: An Industrial Case Study
224 - Factors Impacting Rapid Releases: An Industrial Case Study224 - Factors Impacting Rapid Releases: An Industrial Case Study
224 - Factors Impacting Rapid Releases: An Industrial Case StudyESEM 2014
 
215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...
215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...
215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...ESEM 2014
 
214 - Sampling Improvement in Software Engineering Surveys
214 - Sampling Improvement in Software Engineering Surveys214 - Sampling Improvement in Software Engineering Surveys
214 - Sampling Improvement in Software Engineering SurveysESEM 2014
 
201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...
201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...
201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...ESEM 2014
 
130 - Motivated software engineers are engaged and focused, while satisfied o...
130 - Motivated software engineers are engaged and focused, while satisfied o...130 - Motivated software engineers are engaged and focused, while satisfied o...
130 - Motivated software engineers are engaged and focused, while satisfied o...ESEM 2014
 
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...ESEM 2014
 
124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...
124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...
124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...ESEM 2014
 
18 - Impact of Process Conformance on the Effects of Test-driven Development
18 - Impact of Process Conformance on the Effects of Test-driven Development18 - Impact of Process Conformance on the Effects of Test-driven Development
18 - Impact of Process Conformance on the Effects of Test-driven DevelopmentESEM 2014
 
52 - The Impact of Test Ownership and Team Structure on the Reliability and E...
52 - The Impact of Test Ownership and Team Structure on the Reliability and E...52 - The Impact of Test Ownership and Team Structure on the Reliability and E...
52 - The Impact of Test Ownership and Team Structure on the Reliability and E...ESEM 2014
 
167 - Productivity for proof engineering
167 - Productivity for proof engineering167 - Productivity for proof engineering
167 - Productivity for proof engineeringESEM 2014
 

More from ESEM 2014 (20)

Keynote 2 - The 20% of software engineering practices that contribute to 80% ...
Keynote 2 - The 20% of software engineering practices that contribute to 80% ...Keynote 2 - The 20% of software engineering practices that contribute to 80% ...
Keynote 2 - The 20% of software engineering practices that contribute to 80% ...
 
Keynote 1 - Engineering Software Analytics Studies
Keynote 1 - Engineering Software Analytics StudiesKeynote 1 - Engineering Software Analytics Studies
Keynote 1 - Engineering Software Analytics Studies
 
33 - On Knowledge Transfer Skill in Pair Programming
33 - On Knowledge Transfer Skill in Pair Programming33 - On Knowledge Transfer Skill in Pair Programming
33 - On Knowledge Transfer Skill in Pair Programming
 
222 - Design Pattern Decay: The Case for Class Grime
222 - Design Pattern Decay: The Case for Class Grime222 - Design Pattern Decay: The Case for Class Grime
222 - Design Pattern Decay: The Case for Class Grime
 
210 - Software Population Pyramids: The Current and the Future of OSS Develop...
210 - Software Population Pyramids: The Current and the Future of OSS Develop...210 - Software Population Pyramids: The Current and the Future of OSS Develop...
210 - Software Population Pyramids: The Current and the Future of OSS Develop...
 
169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...
169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...
169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...
 
196 - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...
196  - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...196  - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...
196 - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...
 
42- Using Templates to Elicit Implied Security Requirements from Functional R...
42- Using Templates to Elicit Implied Security Requirements from Functional R...42- Using Templates to Elicit Implied Security Requirements from Functional R...
42- Using Templates to Elicit Implied Security Requirements from Functional R...
 
166 - ISBSG variables most frequently used for software effort estimation: A ...
166 - ISBSG variables most frequently used for software effort estimation: A ...166 - ISBSG variables most frequently used for software effort estimation: A ...
166 - ISBSG variables most frequently used for software effort estimation: A ...
 
112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...
112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...
112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...
 
224 - Factors Impacting Rapid Releases: An Industrial Case Study
224 - Factors Impacting Rapid Releases: An Industrial Case Study224 - Factors Impacting Rapid Releases: An Industrial Case Study
224 - Factors Impacting Rapid Releases: An Industrial Case Study
 
215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...
215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...
215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...
 
214 - Sampling Improvement in Software Engineering Surveys
214 - Sampling Improvement in Software Engineering Surveys214 - Sampling Improvement in Software Engineering Surveys
214 - Sampling Improvement in Software Engineering Surveys
 
201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...
201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...
201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...
 
130 - Motivated software engineers are engaged and focused, while satisfied o...
130 - Motivated software engineers are engaged and focused, while satisfied o...130 - Motivated software engineers are engaged and focused, while satisfied o...
130 - Motivated software engineers are engaged and focused, while satisfied o...
 
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
 
124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...
124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...
124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...
 
18 - Impact of Process Conformance on the Effects of Test-driven Development
18 - Impact of Process Conformance on the Effects of Test-driven Development18 - Impact of Process Conformance on the Effects of Test-driven Development
18 - Impact of Process Conformance on the Effects of Test-driven Development
 
52 - The Impact of Test Ownership and Team Structure on the Reliability and E...
52 - The Impact of Test Ownership and Team Structure on the Reliability and E...52 - The Impact of Test Ownership and Team Structure on the Reliability and E...
52 - The Impact of Test Ownership and Team Structure on the Reliability and E...
 
167 - Productivity for proof engineering
167 - Productivity for proof engineering167 - Productivity for proof engineering
167 - Productivity for proof engineering
 

Recently uploaded

What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 

Recently uploaded (20)

What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 

65 - An Empirical Simulation-based Study of Real-Time Speech Translation for Multilingual Global Project Teams

  • 1. An Empirical Simulation-based Study of Real-Time Speech Translation for Multilingual Global Project Teams Fabio Calefato Filippo Lanubile University of Bari, Italy Rafael Prikladnicki João Henrique Stocker Pinto PUCRS, Brazil ESEM'14 - Turin, Sept. 18-19, 2014 1
  • 2. Motivation • Global software projects challenged by language differences – Especially requirements meetings • Speech translation technology for remote meetings in countries with – Opportunities for global projects – Lack of English speaking professionals • Goal: Evaluate the feasibility of adopting a real-time speech translation to support multilingual requirements meetings ESEM'14 - Turin, Sept. 18-19, 2014 2
  • 3. Speech Translation Speech Recognition + • First prototypes date back to early 70s – Appropriate for dictation only, not for real-time captioning of speech YET – Recent progress, especially with mobile devices – Need for further investigation Machine Translation • First prototypes date back to 50s – Still far from 100% accurate for multilingual group communication YET – Not disruptive of the conversation flow – Does not prevent complex tasks completion – Even grants more balanced discussions
  • 4. Research questions • RQ1: How well does speech translation work for continuous speech in global software projects? • RQ2: How does technical jargon affect speech translation in global software projects? ESEM'14 - Turin, Sept. 18-19, 2014 4
  • 5. Simulation-based study • 8 Participants – Software engineering professionals – 4 from Bari (Italy) and 4 from Porto Alegre (Brazil) – 7 males, 1 female ESEM'14 - Turin, Sept. 18-19, 2014 5
  • 6. Instrumentation • 60 sentences from 5 requirements workshop logs – Half containing jargon – Half generic – Increasing length (# words 5-30) • Manually translated EN -> IT, PT • Google Chrome Web Speech API Demo + Google Translate ESEM'14 - Turin, Sept. 18-19, 2014 6 Speech transcript (IT / PT) Resulting translation (IT / PT / EN) Speech transcript (IT / PT)
  • 7. Variables for Speech Recognition • Independent 1. Source Language (IT, PT) 2. Speaker (Speaker1-4, nested under Language) 3. Lexicon (generic, jargon) 4. Replication (R1-R30, nested under Lexicon) • Dependent 1. Transcript accuracy ′ = 푇 푎푐푐 + 1 2 ESEM'14 - Turin, Sept. 18-19, 2014 7 푇푎푐푐 = # 푟푒푐표푔푛푖푧푒푑 푤표푟푑푠 − # 푒푟푟표푟푠 # 푤표푟푑푠 푖푛 푢푡푡푒푟푎푛푐푒 푇푎푐푐
  • 8. Variables for Machine Translation • Independent 1. Language Pairs (ITEN, PTIT, PTEN, ITPT) 2. Lexicon (generic, jargon) 3. Replication (R1-R30, nested under Lexicon) • Dependent 1. Translation adequacy ESEM'14 - Turin, Sept. 18-19, 2014 8
  • 9. Translation Adequacy scoring scheme Category Description 4 Completely adequate. The translation clearly reflects the information contained in the original sentence. It is perfectly clear, intelligible, grammatically correct, and reads like ordinary tex. 3 Fairly adequate. The translation generally reflects the information contained in the original sentence, despite some inaccuracies or infelicities in the text. It is generally clear and intelligible, and one can (almost) immediately understand what it means. 2 Poorly adequate. The translation poorly reflects the information contained in the original sentence. It contains grammatical errors and/or poor word choices. The general idea of the text is intelligible only after considerable study. 1 Completely adequate. The translation is unintelligible and it is not possible to obtain the information contained in the original sentence. Studying the meaning of the text is hopeless and, even allowing for context, one feels that guessing would be too unreliable. Adapted from: D. Arnold et al. "Machine Translation: an Introductory Guide" (1994)
  • 10. Results: Speech Recognition Accuracy (1/2) Mean • Minimal differences in mean accuracy – by Language and Lexicon – by Speaker (except PT-Speaker2) ESEM'14 - Turin, Sept. 18-19, 2014 10 Language IT .81 PT .75 Lexicon Generic .80 Jargon .77 Speaker Language Mean PT-Speaker1 PT .78 PT-Speaker2 PT .68 PT-Speaker3 PT .79 PT-Speaker4 PT .73 IT-Speaker1 IT .76 IT-Speaker2 IT .88 IT-Speaker3 IT .78 IT-Speaker4 IT .82
  • 11. Results: Speech Recognition Accuracy (2/2) Source df Mean Square F Sig. Intercept 1 290.785 587.408 .017 Language 1 .460 2.948 .144 Speaker(Language) 6 .166 12.907 .003† Lexicon 1 .125 3.285 .104 Replication(Lexicon) 58 .082 1.740 .018† Language * Lexicon 1 .003 .068 .797 UNIANOVA • Speaker(Language) and Replication(Lexicon) the only significant factors ESEM'14 - Turin, Sept. 18-19, 2014 11 Language * Replication(Lexicon) 58 .047 3.004 .000 Lexicon * Speaker(Language) 6 .013 .817 .557 † Significant at 5% level
  • 12. Results: Speech Translation Adequacy (1/2) Adequacy by Language pairs Adequacy by Lexicon 152 141 165 50 49 30 70 71 90 38 50 45 82 70 75 IT->PT IT->EN PT->IT ESEM'14 - Turin, Sept. 18-19, 2014 12 88 99 75 80 160 0% 20% 40% 60% 80% 100% IT->PT IT->EN PT->IT PT->EN Adequate (categories 4 and 3) Inadequate (categories 2 and 1) 32 88 48 72 0 100 200 PT->EN Jargon Adequate Inadequate Generic Adequate Inadequate 41% 31%
  • 13. Results: Speech Translation Adequacy (2/2) Spearman's rho (N=120) PT->IT PT->EN IT->PT IT->EN Jargon Generic Jargon Generic Jargon Generic Jargon Generic Translation adequacy Transcription accuracy Translation adequacy Transcription accuracy Translation adequacy Transcription accuracy Translation adequacy Transcription accuracy Translation adequacy Transcription accuracy Translation adequacy Transcription accuracy Translation adequacy Transcription accuracy Translation adequacy Transcription accuracy Translat. adequacy Correl. 1.0 .55* 1.0 .54* 1.0 .63* 1.0 .59* 1.0 .71* 1.0 .55* 1.0 .72* 1.0 .61* Sig. . .00 . .00 . .00 . .00 . .00 . .00 . .00 . .00 Transcript. accuracy Correl. .55* 1.0 .54* 1.0 .63* 1.0 .59* 1.0 .71* 1.0 .55* 1.0 .72* 1.0 .61* 1.0 Sig. .00 . .00 . .00 . .00 . .00 . .00 . .00 . .00 . * Correlation significant at the 0.01 level (2-tailed). • Moderate positive correlation between Transcription Accuracy and Translation Adequacy
  • 14. Conclusions: RQ1 How well does speech translation work for continuous speech? • Our study setup: Simulation of a conversation – Similar to: Automatic generation of closed captioning from webcasts • Our findings – In line with baseline: 75% word accuracy [1] YET • Adequate speech translations 31-41% • Some domains more critical than others ESEM'14 - Turin, Sept. 18-19, 2014 14 [1] Munteanu et al. “Collaborative editing for improved usefulness and usability of transcript-enhanced webcasts”, CHI’08
  • 15. Conclusions: RQ2 How does technical jargon affect speech translation? • No evidence that jargon generates worse speech translations – At least, in the CS domain HOWEVER • Professionals reads jargon differently – e.g., “SQL” → SEQUEL, spelled in Italian, in English… ESEM'14 - Turin, Sept. 18-19, 2014 15
  • 16. Study limitation & Future work - Simulation-based study - What would happen in a real setting? - Refine transcription accuracy construct (errors) - One technology only - i.e., Google’s Web Speech API and Translate - Effect of accents, pronunciations, gender? - i.e., only 8 speakers, 1 female + Run a controlled experiment + Multi-language group task + Distinguish between incorrect and missing words + Compare more speech translation solutions + e.g., Nuance, Sphinx, Bing + Involve more speakers in experiments + Also include EN native speakers ESEM'14 - Turin, Sept. 18-19, 2014 16

Editor's Notes

  1. A few bg info on Speech Translation, which is the combination of two technologies SR+MT SR Past decade research proved it appropriate for dictation only, not for real-time captioning of speech [1] recent technological progress in the field of automatic speech recognition has also found its way in mobile devices, something that definitely calls for further investigation, especially in combination with machine translation MT More established technology (~60 years in the making) Our findings indicate that state-of-the-art MT technology is already a viable solution for multilingual group communication since it is not disruptive of the conversation flow, it does not prevent group to complete complex tasks, and it even grants discussions that are more balanced. Yet, MT technology currently available is still far from 100% accurate and, as such, its adoption comes with costs. In fact, translations inaccuracies needs to be repaired by rephrasing the original content, thus causing a decrease in efficiency.
  2. As I am going to present a study, the overall goal of which is to assess the usage of real-time speech translation to support communication in multilingual requirements meetings. Let me discuss first the research questions addressed by this study RQ1 – Research from the past decade has shown evidence that the speech recognition technology available was unsuitable for providing real-time captioning or transcription of speech [14]. Although commercial speech recognition tools available today claim to achieve a word recognition accuracy as high as 99%, they have been developed for dictation rather than to produce a transcript from a continuous and unbroken stream without any punctuation [2][3][18]. RQ2 – When stakeholders communicate during requirements meetings, many technical words are used. On top of that, technical words might even be in a language different from the one used by speakers. For instance, lawyers sometimes use Latin jargon; computer scientists typically use technical words in English. As such, technical jargon is less likely to occur both in real communication and in training sets used to build language models for speech recognition engines. Therefore, it has been previously observed that speech recognition errors are more likely to occur in words given a very low probability by the language model [16].
  3. This this simulation-based study is a necessary preliminary step towards the design of future experiments that will involve real-time communication among individuals, augmented with speech translation.
  4. As the test set, we selected 60 sentences of growing length (word count, min. 5, max. 30). The sentences were selected from real chat logs in English, collected from five requirements workshops run as part of an experiment on the effects of text-based communication in distributed requirements engineering [6]. Participants in each workshop ranged from five to eight undergraduate students attending a requirements engineering course at the University of Victoria, Canada. During a workshop, the participants, either acting as a client or as a developer, had first to elicit the requirements specification of a web application (first session); then, they had to negotiate and reach closure on the previously collected requirements (second session). generic utterances, contained only words that are included in an Italian or Portuguese dictionary. jargon, contained one or more technical terms or characteristic acronyms used by software developers. Selected sentences they were manually translated by two of the researchers from English into both Italian and Brazilian Portuguese. The set of original utterances in English together with the manual translation in Italian and Portuguese formed the experimental sample. we were extremely careful at maintaining both the original meaning and the interaction style intact For each of the 60 utterances in the sample, a speaker started by clicking on the microphone icon and began speaking until the end of the utterance. Participants spoke in a colloquial style at their own pace. If the researcher realized that the spoken utterance was different from the original content or the speaker stopped before arriving to the end of the utterance, then the researcher invited the speaker to try again. On average, a speaker finished the simulation in about 30 minutes, at a pace of two utterances per minute.
  5. We have different independent and dependent vars for speech recognition and speech translation As for SR Lexicon is a fixed effect factor The 30 replication under each lexicon level are considered a random effect factor nested under lexicon Transcript accuracy is the standard measure used to evaluate speech recognition system performance Where Errors include missing and wrong words. As Tacc is defined in [1-,1], it is then normalized into 𝑇 𝑎𝑐𝑐 ′ = (𝑇 𝑎𝑐𝑐 +1) 2 to show its values as percentages ([0,1])
  6. As for MT The notation means that lang on the left is the source lang that the speaker used when reading senteces in, the other is the target languages, translated into Adequacy the effectiveness of a machine translation service relates to the fluency and fidelity of the translated output, the effectiveness of a speech recognition system relates to the correct number of words recognized in a spoken sentence. (because errors in the speech recognition process negatively affect the outcome of machine translation, )
  7. not too fine grained no middle values – avoids central tendency bias by forcing raters to judge a translation as either adequate (1-2) or inadequate (points 3-4) description clear to raters Once all the subjects completed their tasks, one researcher at UniBari rated the quality of translations of all the translations from Italian to English (IT->EN) and from Portuguese to Italian (PT->EN); likewise, one researcher at PUCRS rated the translations from Brazilian Portuguese to English (PT->EN) and from Italian to Brazilian Portuguese (IT->PT). We note that the set of language pairs rated by the two researchers were disjoint, so no inter-rater agreement thru Chronbach alfa could be measured
  8. Left-hand side Table reports the mean values of the transcript accuracy measured by language and lexicon. In both cases, we observe minimal differences. In fact, the mean accuracy for utterances spoke in Italian is 81%, whereas for Brazilian Portuguese it is 75%. Likewise, slightly better accuracy results were achieved on average for generic utterances (80%) as compared to jargon utterances (77%). Right-hand side Table , instead, reports the average accuracy per speaker. In addition, in this case we cannot observe large differences. The only noticeable result is the performance of Brazilian subject PT-Speaker2, who achieved the lowest accuracy of 68% especially compared to the best accuracy was achieved by the Italian subject IT-Speaker2 who achieved 88% of accurate transcript
  9. Finally, we tried to identify any difference produced by the factors and their interactions in the accuracy of transcripts . we run a univariate analysis of variance (UNIANOVA procedure) The analysis of variance showed that differences in the speakers and the sentences (replications) significantly affected the result of the speech recognition process
  10. Goal: identify differences in the quality of translation produced according to the various combinations of language pairs and lexicon we first evaluated translation results by language pairs, calculating how many sentences were rated adequate (i.e., categories 4 and 3) and inadequate (i.e., categories 1 and 2) by language pairs. Figure on left shows such a breakdown . We can observe a similar behavior for all the combinations, with a minimum of 75 (PT->IT, 31%) and a maximum of 99 (IT->EN, 41%) adequately translated utterances. Then, we performed a similar analysis evaluating adequacy of translation results further grouped by Lexicon. The figure on the right-hand side shows that that, for all the four language pairs, the inadequate translations again outnumber the others categories regardless of the lexicon. In other words, generic utterances were translated no more adequately than jargon utterances, independently of the language pair combination.
  11. Finally, because translation and recognition are clearly are interdependent, as translation adequacy results affected by the accuracy of the transcript produced in the first step of the process, we computed spearman’s rho to weigh this correlation The results in the table shows that, regardless of both the lexicon and the language pairs, there is a moderate positive correlation between transcription accuracy and translation adequacy. In other words, when the speech recognition component produced an inaccurate transcription, the machine translation tended to produce a less adequate translation. One explanation for the moderate only correlation is that while there may be cases where inadequate translation occurred with accurate transcriptions, the opposite, adequate translation from inaccurate transcriptions, can never happened.
  12. 75% is perfectly in line with our own findings. Yet, speech recognition in our case is the first of two steps. The results of final translations show that, no matter what the language pair was, the # of inadequate translations outnumbered the # of the adequate ones. Plus, the correlation test that showed only a moderate correlation between accurate transcriptions and adequate translations actually proves that speech recognition is the critical component of a speech translation systems , so 75% as a baseline might be too low And when we’re talking about domains, we’re talking about technical words part of the specific domain vocabulary, or in one word, jargon
  13. A more encouraging results is that, wrt RQ2, is that
  14. As future work, we intend to seek for confirmation of these initial results. In the simulation just described several professional developers read several utterances unrelated to each other into a speech translation system. Although collected from several real requirements meetings, such set does not fully represent an example of real requirements workshop meeting augmented with speech translation. In fact, our simulation does not take into account factors like task completion, communication flow, context and grounding. Therefore, we acknowledge the need to perform future controlled experiments that involve cross-language group communication augmented with speech translation. In particular, we will compare groups of people who communicate through a speech translations system, using either English or their native languages, to complete communication-intensive tasks in the context of globally distributed development teams. In our simulation we only used one speech translation system (Google Translate mobile). Therefore, findings might not extend to other existing speech translation technologies available. We acknowledge the need to compare the performance of more systems in our future work. Finally, our findings showed that accuracy in terms of speech recognition was significantly affected by speaker and utterance differences. As such, we acknowledge that the limited numbers of speakers (4 for each of the two source languages) and utterances (30 for each of the two kinds of lexicon) are not ideal from the statistical point of view. Such limitations will be addressed in future replications. Therefore, in future controlled experiments we will involve more subjects and possibly also native EN speakers to understand the effect of pronunciations of non-native speakers when using EN as a lingua franca in multilingual group