SlideShare a Scribd company logo
ENGAGING
POST-EDITORS
JOSÉ LUIS BONILLA SÁNCHEZ
October 14, 2015
TOPICS
•BACKGROUND: MT AT EBAY
•THE CHALLENGES OF POST-EDITING (INDUSTRY AND EBAY)
•THE EBAY EXPERIENCE: PROBLEMS AND SOLUTIONS
•A LOOK AT THE FUTURE
•DISCUSSION
ENGAGING POST-EDITORS 2
MT AT EBAY
•Since 2013
•Home-grown, statistical (Moses) engines
•Covering 10 language pairs (FR IT ESES ESLA DE BRPT
•RU ZH USEN UKEN)
•Content translated: listings (item titles and descriptions), keywords
ENGAGING POST-EDITORS 3
CHALLENGES OF POST-EDITING
ENGAGING POST-EDITORS 4
INDUSTRY CHALLLENGES
ENGAGING POST-EDITORS 5
Post-Editing projects add extra complications to the regular L10n flow. For
instance:
- Quality expectations can be unclear (definitions of light and
full post-editing vary).
- No universal agreement on rates (Per hour? Percentage
discount? Edit distance?).
- Every project (and engine) comes with its own difficulties:
statistical vs rule-based engines, technical vs social content…
EBAY CHALLENGES
ENGAGING POST-EDITORS 6
At eBay, we deal with especially complex projects:
- 12k+ eBay categories, many with their own terminology
- User-generated content: Unpredictable quality, slang, non-
standard acronyms and abbreviations…
- Very specific requirements: Our goal is not polished content,
but content which can be understood and useful to train the
engine
Our Solutions
ENGAGING POST-EDITORS 8
Modular Guidelines…
ENGAGING POST-EDITORS 9
…Structured to Facilitate Learning
ENGAGING POST-EDITORS 10
General
introduction
PE-Specific
Instructions
Item Titles Languages
RU
BRPT
ESLA
FR
IT
ESES
DE
ZH
Item
Descriptions
Queries
General
Translation
Instructions
Recorded Trainings
ENGAGING POST-EDITORS 11
The more specialized the training…
…the more important to preserve the
information in recorded format so future
post-editors can refer to it.
Escalation as Needed
ENGAGING POST-EDITORS 12
3rd fail triggers a call with the vendor.
• Participants:
Linguists, PjM, Quality Manager
• Agenda:
- Diagnosis
- Vendor Action Plan
- Feedback for Client
2014 RESULTS (FAILS VS PASSES)
ENGAGING POST-EDITORS 13
0
5
10
15
20
25
30
35
40
45
JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC
Reviews Pass Fail
ENGAGING POST-EDITORS 14
EARLY SAMPLE REVIEWS
ENGAGING POST-EDITORS 15
In-progress review for most
problematic language combinations:
First 2k words of the project,
in the first project week
ADDITIONAL REFERENCE MATERIAL
Providing vendors with MT
translations from 2 engines
(generic and customized) so they
pick the best
Out of Vocabulary words are unknown to
the system, and left in English, which
often makes them easy to mistake for
brand names. By automatically tagging
OOV, we allow the vendor to focus on
them.
eBay listing titles can be difficult
to understand without context
(images, descriptions).
Alternative MT
translation
Tagging “suspect”
terms (OOV)
Providing full context
(HTML files)
ENGAGING POST-EDITORS 16
RESULTS - PRODUCTIVITY
ENGAGING POST-EDITORS 17
0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
Productivity Increase
Alternative MT
Translation
Tagging
"suspect" terms
(OOV)
Providing full
context
RESULTS - QUALITY
ENGAGING POST-EDITORS 18
0
5
10
15
20
25
30
35
40
45
JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC
Pass/Fail over Time - 2014
Reviews Pass Fail
0
10
20
30
40
50
60
70
JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC
Pass/Fail over Time- 2015
Reviews Pass Fail
THE FUTURE
Divider sub-headline goes here
THE NEXT BREAKTHROUGHS
By predicting the quality of the MT output,
we could:
- Filter and send to our vendors the best (or
worst) translations.
- Map QE score to time spent, and use it to
calculate more accurate initial rates.
Tools like iOmegaT or Matecat offer time
tracking, edit distance analysis and even
action recording – usable to:
- Analyze post-editor behavior and identify
areas to help them improve.
- Calculate accurate rates.
The future is for online tools – they will allow
for more direct interaction (early sampling,
continuous communication, centralized
information repository), further integrating
the post-editors with the team.
Quality Estimation
Behavior Tracking
Online Collaboration
ENGAGING POST-EDITORS 20
DISCUSSION TIME
ENGAGING POST-EDITORS 21

More Related Content

Viewers also liked

Viewers also liked (11)

Common industry API for translation services presented by TAUS at FEISGILTT
Common industry API for translation services presented by TAUS at FEISGILTTCommon industry API for translation services presented by TAUS at FEISGILTT
Common industry API for translation services presented by TAUS at FEISGILTT
 
TAUS Best Practices Error Typology Guidelines
TAUS Best Practices Error Typology GuidelinesTAUS Best Practices Error Typology Guidelines
TAUS Best Practices Error Typology Guidelines
 
TAUS Best Practices Adequacy/Fluency Guidelines
TAUS Best Practices Adequacy/Fluency GuidelinesTAUS Best Practices Adequacy/Fluency Guidelines
TAUS Best Practices Adequacy/Fluency Guidelines
 
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...
 
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agendaTAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
 
Quality dashboard-may-2015
Quality dashboard-may-2015Quality dashboard-may-2015
Quality dashboard-may-2015
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Language Processing T...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Language Processing T...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Language Processing T...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Language Processing T...
 
WEBINAR: TAUS Outlook 2013
WEBINAR: TAUS Outlook 2013WEBINAR: TAUS Outlook 2013
WEBINAR: TAUS Outlook 2013
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
 
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engineTAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
 

Similar to How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)

Resume (Ronnie B. Pedarios)
Resume (Ronnie B. Pedarios)Resume (Ronnie B. Pedarios)
Resume (Ronnie B. Pedarios)
Ronnie Pedarios
 
Translation quality assessment redefined
Translation quality assessment redefinedTranslation quality assessment redefined
Translation quality assessment redefined
Denis Khamin
 
Curriculum Vitae v 6.0- Itamar Gelber
Curriculum Vitae v 6.0- Itamar GelberCurriculum Vitae v 6.0- Itamar Gelber
Curriculum Vitae v 6.0- Itamar Gelber
Itamar Gelber
 
Dtp, web design & presentation software revision
Dtp, web design & presentation software revisionDtp, web design & presentation software revision
Dtp, web design & presentation software revision
MrJRogers
 
Eliminating End Game - How to be a hero and streamline documentation production
Eliminating End Game - How to be a hero and streamline documentation productionEliminating End Game - How to be a hero and streamline documentation production
Eliminating End Game - How to be a hero and streamline documentation production
WebWorks
 
portfolio of products and processes
portfolio of products and processesportfolio of products and processes
portfolio of products and processes
Mark Stempski, Ph.D.
 

Similar to How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay) (20)

TAUS QE Summit 2017 eBay EN-DE MT Pilot
TAUS QE Summit 2017   eBay EN-DE MT PilotTAUS QE Summit 2017   eBay EN-DE MT Pilot
TAUS QE Summit 2017 eBay EN-DE MT Pilot
 
Machine Translation: Latest Innovations and their Impact on Commercial Transl...
Machine Translation: Latest Innovations and their Impact on Commercial Transl...Machine Translation: Latest Innovations and their Impact on Commercial Transl...
Machine Translation: Latest Innovations and their Impact on Commercial Transl...
 
TDC 2020 - Implementing a Mini-Language
TDC 2020 - Implementing a Mini-LanguageTDC 2020 - Implementing a Mini-Language
TDC 2020 - Implementing a Mini-Language
 
Resume (Ronnie B. Pedarios)
Resume (Ronnie B. Pedarios)Resume (Ronnie B. Pedarios)
Resume (Ronnie B. Pedarios)
 
Translation quality assessment redefined
Translation quality assessment redefinedTranslation quality assessment redefined
Translation quality assessment redefined
 
Test Driven Development: Part 2
Test Driven Development: Part 2Test Driven Development: Part 2
Test Driven Development: Part 2
 
Cognos Analytics V11 Report Authoring Demonstration
Cognos Analytics V11 Report Authoring DemonstrationCognos Analytics V11 Report Authoring Demonstration
Cognos Analytics V11 Report Authoring Demonstration
 
Machine Translation Quality - Are We There Yet? - Olga Beregovaya (Welocalize)
Machine Translation Quality - Are We There Yet? - Olga Beregovaya (Welocalize)Machine Translation Quality - Are We There Yet? - Olga Beregovaya (Welocalize)
Machine Translation Quality - Are We There Yet? - Olga Beregovaya (Welocalize)
 
Evaluation of MT Quality/Productivity at eBay - AMTA 2018
Evaluation of MT Quality/Productivity at eBay - AMTA 2018Evaluation of MT Quality/Productivity at eBay - AMTA 2018
Evaluation of MT Quality/Productivity at eBay - AMTA 2018
 
Curriculum Vitae v 6.0- Itamar Gelber
Curriculum Vitae v 6.0- Itamar GelberCurriculum Vitae v 6.0- Itamar Gelber
Curriculum Vitae v 6.0- Itamar Gelber
 
Confused CMS Presentation - Internet World London 2011 #iwexpo. Delivered on...
Confused CMS Presentation - Internet World London 2011 #iwexpo.  Delivered on...Confused CMS Presentation - Internet World London 2011 #iwexpo.  Delivered on...
Confused CMS Presentation - Internet World London 2011 #iwexpo. Delivered on...
 
Dtp, web design & presentation software revision
Dtp, web design & presentation software revisionDtp, web design & presentation software revision
Dtp, web design & presentation software revision
 
Eliminating End Game - How to be a hero and streamline documentation production
Eliminating End Game - How to be a hero and streamline documentation productionEliminating End Game - How to be a hero and streamline documentation production
Eliminating End Game - How to be a hero and streamline documentation production
 
Creating UI Marketers Won't F*Up
Creating UI Marketers Won't F*UpCreating UI Marketers Won't F*Up
Creating UI Marketers Won't F*Up
 
Webpresentation Mountain View
Webpresentation Mountain ViewWebpresentation Mountain View
Webpresentation Mountain View
 
15 tips for bullet proof requirements analysis on SharePoint projects
15 tips for bullet proof requirements analysis on SharePoint projects15 tips for bullet proof requirements analysis on SharePoint projects
15 tips for bullet proof requirements analysis on SharePoint projects
 
LeSS Like Adoption @ SAP
LeSS Like Adoption @ SAPLeSS Like Adoption @ SAP
LeSS Like Adoption @ SAP
 
Ankita_resume
Ankita_resumeAnkita_resume
Ankita_resume
 
portfolio of products and processes
portfolio of products and processesportfolio of products and processes
portfolio of products and processes
 
Testing in the future. today
Testing in the future.  today Testing in the future.  today
Testing in the future. today
 

More from TAUS - The Language Data Network

More from TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 

Recently uploaded

Introduction of Biology in living organisms
Introduction of Biology in living organismsIntroduction of Biology in living organisms
Introduction of Biology in living organisms
soumyapottola
 
527598851-ppc-due-to-various-govt-policies.pdf
527598851-ppc-due-to-various-govt-policies.pdf527598851-ppc-due-to-various-govt-policies.pdf
527598851-ppc-due-to-various-govt-policies.pdf
rajpreetkaur75080
 

Recently uploaded (14)

05232024 Joint Meeting - Community Networking
05232024 Joint Meeting - Community Networking05232024 Joint Meeting - Community Networking
05232024 Joint Meeting - Community Networking
 
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdf
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdfOracle Database Administration I (1Z0-082) Exam Dumps 2024.pdf
Oracle Database Administration I (1Z0-082) Exam Dumps 2024.pdf
 
Introduction of Biology in living organisms
Introduction of Biology in living organismsIntroduction of Biology in living organisms
Introduction of Biology in living organisms
 
Acorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutesAcorn Recovery: Restore IT infra within minutes
Acorn Recovery: Restore IT infra within minutes
 
Eureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationEureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 Presentation
 
The Canoga Gardens Development Project. PDF
The Canoga Gardens Development Project. PDFThe Canoga Gardens Development Project. PDF
The Canoga Gardens Development Project. PDF
 
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...
 
123445566544333222333444dxcvbcvcvharsh.pptx
123445566544333222333444dxcvbcvcvharsh.pptx123445566544333222333444dxcvbcvcvharsh.pptx
123445566544333222333444dxcvbcvcvharsh.pptx
 
527598851-ppc-due-to-various-govt-policies.pdf
527598851-ppc-due-to-various-govt-policies.pdf527598851-ppc-due-to-various-govt-policies.pdf
527598851-ppc-due-to-various-govt-policies.pdf
 
Getting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control TowerGetting started with Amazon Bedrock Studio and Control Tower
Getting started with Amazon Bedrock Studio and Control Tower
 
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...
Writing Sample 2 -Bridging the Divide: Enhancing Public Engagement in Urban D...
 
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22Pollinator Ambassador Earth Steward Day Presentation 2024-05-22
Pollinator Ambassador Earth Steward Day Presentation 2024-05-22
 
Hi-Tech Industry 2024-25 Prospective.pptx
Hi-Tech Industry 2024-25 Prospective.pptxHi-Tech Industry 2024-25 Prospective.pptx
Hi-Tech Industry 2024-25 Prospective.pptx
 
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
0x01 - Newton's Third Law:  Static vs. Dynamic Abusers0x01 - Newton's Third Law:  Static vs. Dynamic Abusers
0x01 - Newton's Third Law: Static vs. Dynamic Abusers
 

How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)

  • 1. ENGAGING POST-EDITORS JOSÉ LUIS BONILLA SÁNCHEZ October 14, 2015
  • 2. TOPICS •BACKGROUND: MT AT EBAY •THE CHALLENGES OF POST-EDITING (INDUSTRY AND EBAY) •THE EBAY EXPERIENCE: PROBLEMS AND SOLUTIONS •A LOOK AT THE FUTURE •DISCUSSION ENGAGING POST-EDITORS 2
  • 3. MT AT EBAY •Since 2013 •Home-grown, statistical (Moses) engines •Covering 10 language pairs (FR IT ESES ESLA DE BRPT •RU ZH USEN UKEN) •Content translated: listings (item titles and descriptions), keywords ENGAGING POST-EDITORS 3
  • 5. INDUSTRY CHALLLENGES ENGAGING POST-EDITORS 5 Post-Editing projects add extra complications to the regular L10n flow. For instance: - Quality expectations can be unclear (definitions of light and full post-editing vary). - No universal agreement on rates (Per hour? Percentage discount? Edit distance?). - Every project (and engine) comes with its own difficulties: statistical vs rule-based engines, technical vs social content…
  • 6. EBAY CHALLENGES ENGAGING POST-EDITORS 6 At eBay, we deal with especially complex projects: - 12k+ eBay categories, many with their own terminology - User-generated content: Unpredictable quality, slang, non- standard acronyms and abbreviations… - Very specific requirements: Our goal is not polished content, but content which can be understood and useful to train the engine
  • 10. …Structured to Facilitate Learning ENGAGING POST-EDITORS 10 General introduction PE-Specific Instructions Item Titles Languages RU BRPT ESLA FR IT ESES DE ZH Item Descriptions Queries General Translation Instructions
  • 11. Recorded Trainings ENGAGING POST-EDITORS 11 The more specialized the training… …the more important to preserve the information in recorded format so future post-editors can refer to it.
  • 12. Escalation as Needed ENGAGING POST-EDITORS 12 3rd fail triggers a call with the vendor. • Participants: Linguists, PjM, Quality Manager • Agenda: - Diagnosis - Vendor Action Plan - Feedback for Client
  • 13. 2014 RESULTS (FAILS VS PASSES) ENGAGING POST-EDITORS 13 0 5 10 15 20 25 30 35 40 45 JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC Reviews Pass Fail
  • 15. EARLY SAMPLE REVIEWS ENGAGING POST-EDITORS 15 In-progress review for most problematic language combinations: First 2k words of the project, in the first project week
  • 16. ADDITIONAL REFERENCE MATERIAL Providing vendors with MT translations from 2 engines (generic and customized) so they pick the best Out of Vocabulary words are unknown to the system, and left in English, which often makes them easy to mistake for brand names. By automatically tagging OOV, we allow the vendor to focus on them. eBay listing titles can be difficult to understand without context (images, descriptions). Alternative MT translation Tagging “suspect” terms (OOV) Providing full context (HTML files) ENGAGING POST-EDITORS 16
  • 17. RESULTS - PRODUCTIVITY ENGAGING POST-EDITORS 17 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% Productivity Increase Alternative MT Translation Tagging "suspect" terms (OOV) Providing full context
  • 18. RESULTS - QUALITY ENGAGING POST-EDITORS 18 0 5 10 15 20 25 30 35 40 45 JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC Pass/Fail over Time - 2014 Reviews Pass Fail 0 10 20 30 40 50 60 70 JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC Pass/Fail over Time- 2015 Reviews Pass Fail
  • 20. THE NEXT BREAKTHROUGHS By predicting the quality of the MT output, we could: - Filter and send to our vendors the best (or worst) translations. - Map QE score to time spent, and use it to calculate more accurate initial rates. Tools like iOmegaT or Matecat offer time tracking, edit distance analysis and even action recording – usable to: - Analyze post-editor behavior and identify areas to help them improve. - Calculate accurate rates. The future is for online tools – they will allow for more direct interaction (early sampling, continuous communication, centralized information repository), further integrating the post-editors with the team. Quality Estimation Behavior Tracking Online Collaboration ENGAGING POST-EDITORS 20