TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE


Using Moses To Win Business That Would Otherwise Be
Lost. Two Practical Use Cases at AVB Translations

10:00-10:20
Wednesday, 17 October

Joël Sigling
AVB Translations
Joël Sigling
Director Technology & Business Partners

Seattle, 17 October 2012
©AVB Translations, 2012
AVB Translations background

•   Amstelveens Vertaalburo: founded 1972 – traditional, high-quality agency

•   Translation World: founded 2002, tech-savvy all-round player

•   Merger in 2010 >> AVB Translations: premium brand with strong tech focus

•   Top 5 player in The Netherlands, 2011 turnover € 4.6 million

•   Core business: general translations – legal, financial, technical, …
    New branch: Single Language Vendor Dutch for global MLVs
History of MT interest
•   Member of TAUS since 2008, 1st round table Amsterdam

•   Visited TAUS User Conferences in US since 2009

•   Sense of urgency developed, merger distraction 2010

•   Action in 2011 after merger

•   2011: Our own choice: Dutch <> English legal domain engine

•   2012: At customer request, tourism engines English > X languages

•   Why SMT, why Moses? Quicker, cheaper, similar quality (research shows)
Case 1: Legal domain engine

•   Legal translations approx. 40% of AVB business, 80% Dutch <> English

•   Not the obvious choice: people said MT wouldn’t work for legal: sentences
    too long, material too intricate

•   Statistical MT suited to non-stylistic materials, e.g. legal

•   If this works, we can make MT happen for all other domains
Legal engine: Objectives

•   Increased productivity, no BLEU % target, but tangible, practical results.
    How much extra can a translator do when compared to HT?

•   Tool to offer usable quality with very quick turnarounds for high volume
    (typical “Friday afternoon lawyer requests”)

•   Becoming an MT front runner in the non-localization sector for Dutch
    (5th language in Europe after FIGS)
Legal engine: Development

•   Choice between in-house and external development
     • In-house: control, developing expertise, lower long-term cost
     • External: lower initial cost, much more expertise > best for now

•   Our pre-requisites for development option
     • ownership and free access to engine
     • assurance data will not be used or copied by builder
     • Acceptable costs for development & usage
     • Skilled partner > AsiaOnline, CrossLang, Pangeanic, LetsMT,
        SmartMate??

•   CrossLang > all of the above, closest to our office, independent
Legal engine: Input data

•   Highest quality AVB Dutch <> English legal translations: approx.
    700k words per language. Predominantly civil law.

•   Not fully reviewed AVB TM, still high-quality: approx. 10 million
    words per language. Predominantly civil law.

•   Legal translations harvested by CrossLang, more diverse legal
    material: 7 million words per language
Legal engine: Initial test results

  Various productivity tests done in the CrossLang and TAUS
  productivity assessment tools (very similar):

  productivity between 5% and 20% higher
  for post-editing than for human-only translation. These results held both for
  a very experienced legal translator and a translation novice (intern).

  Encouragement to continue…
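
A minimal sketch of how a productivity gain of this kind can be computed from timed tests. The throughput figures below are hypothetical and only illustrate the arithmetic; they are not measurements from the CrossLang or TAUS tools, and the function name is an assumption.

```python
def productivity_gain(ht_words_per_hour: float, pe_words_per_hour: float) -> float:
    """Relative gain of post-editing (PE) over human translation (HT), as a fraction."""
    if ht_words_per_hour <= 0:
        raise ValueError("baseline throughput must be positive")
    return (pe_words_per_hour - ht_words_per_hour) / ht_words_per_hour


# Hypothetical throughputs, chosen only to land inside the 5%-20% band.
baseline_ht = 500.0  # words/hour, human translation only (assumed)
post_edit = 575.0    # words/hour, post-editing MT output (assumed)

gain = productivity_gain(baseline_ht, post_edit)
print(f"Productivity gain: {gain:.0%}")
```

The same function can be applied per translator, which is how an experienced translator and a novice can be compared on equal terms.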
Legal engine: Results in practice
•   Live rush translations done in first few weeks for new customers:
     • 1,500 word trial done for law firm needing high volume in
        very short time. Post-edited in 75 minutes. Customer happy
        with quality/price ratio.
     • 25,000 words in two days with light PE effort by two post-
        editors. Quality estimate 80-90% of human translation.
     • 4,500 words in 3 hours with almost full PE effort by one
        post-editor. Quality estimate >90% of human translation
     • 15,000 words in one day, done by two post-editors. Quality
        estimate 80-90% of human translation
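
The job figures above can be turned into throughput numbers for comparison. This is a rough sketch: it assumes a single post-editor for the 1,500-word trial (the slide does not say), and because the length of a working day is not stated, the multi-day jobs are expressed per editor per day rather than per hour. Function names are illustrative.

```python
def words_per_hour(words: int, hours: float, editors: int = 1) -> float:
    """Throughput per post-editor per hour."""
    return words / (hours * editors)


def words_per_editor_day(words: int, days: float, editors: int) -> float:
    """Throughput per post-editor per working day."""
    return words / (days * editors)


# Jobs with a stated duration in minutes or hours.
trial = words_per_hour(1_500, 75 / 60)       # editor count assumed to be 1
near_full_pe = words_per_hour(4_500, 3)      # one post-editor, almost full PE

# Jobs with a stated duration in days, two post-editors each.
light_pe = words_per_editor_day(25_000, 2, 2)
one_day = words_per_editor_day(15_000, 1, 2)

print(f"Trial:        {trial:,.0f} words/hour")
print(f"Near-full PE: {near_full_pe:,.0f} words/hour")
print(f"Light PE:     {light_pe:,.0f} words/editor/day")
print(f"One-day job:  {one_day:,.0f} words/editor/day")
```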
Legal engine: more success
•   Other successes:
     • 7 documents translated and post-edited this way in one
        court case. Customer very happy, case won!

     • Three new customers secured because we were able to
        turn around a large volume in a very short time.

     • Some customers are ordering this type of translation
        instead of normal translation, even with normal deadlines.

     • Sales department have a new USP to sell to our legal
        customers. Interest is growing!
Case 2: Hotel booking engine
SITUATION
•   Big (4 million words) hotel booking site, partly available in 10
    languages

•   Booking funnel fully and best-selling hotels largely translated

Customer ambition:
    • Have all 4M words translated & do more languages
       BUT
    • No budget for full human translation/post-editing
    • Google too expensive and risk of being considered duplicate
      content at some point in the future
Booking engine: AVB solution
•   Combination of TM & MT to translate all content & differentiate in
    quality where necessary

•   Build up new TMs for “new” languages and use TM from “old”
    languages to create dataset for engines with harvested material

•   Customer to get any TM matches for free after paying for initial
    baseline (20,000 words)

•   Engine development paid for, but throughput almost free
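
The TM-plus-MT combination described above can be sketched as a simple routing step: reuse a TM translation where one exists, otherwise send the segment to the engine. This is an illustrative sketch only. It uses exact-match lookup (production TM tools also offer fuzzy matching), and the `mt` callable is a placeholder standing in for a call to a trained engine, not a real Moses API.

```python
from typing import Callable, Dict, List, Tuple


def translate_segments(
    segments: List[str],
    tm: Dict[str, str],
    mt: Callable[[str], str],
) -> List[Tuple[str, str]]:
    """Return (translation, origin) pairs, where origin is 'TM' or 'MT'."""
    results = []
    for segment in segments:
        key = segment.strip()
        if key in tm:
            # Exact TM match: reuse the stored translation.
            results.append((tm[key], "TM"))
        else:
            # No match: fall back to the MT engine.
            results.append((mt(key), "MT"))
    return results


# Hypothetical data and a stand-in engine, for illustration only.
sample_tm = {"Free WiFi": "WiFi gratuit"}


def fake_mt(text: str) -> str:
    return f"[MT] {text}"


for translation, origin in translate_segments(["Free WiFi", "Sea view"], sample_tm, fake_mt):
    print(origin, translation)
```

Keeping the origin label next to each translation makes it possible to differentiate quality afterwards, for example by sending only MT segments on high-traffic pages to post-editing.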
Booking engine: Challenges
•   Getting traction within client DMT (took two years)

•   Finding enough relevant data to build a decent engine (classic)

•   Deciding on KPIs from the customer: when would it be successful?

•   Technical: getting workable data in and out of customer CMS
Booking engine: Results
•   Two languages chosen for trial: French and Polish

•   Baseline TM built for these, all website TM matches replaced for
    French and Polish. Has already saved the customer thousands of euros.

•   Two Moses engines built by TauYou as trial: French and Polish

•   1st Polish engine results were put online as a test, bookings increased
    in spite of poor initial language quality

•   Customer has approved four more engines: Dutch, French, German,
    Spanish and Italian
Business conclusions
  •   We are already making money on MT. All investments paid back in
      half a year.

  •   Extra turnover forecast thanks to legal engine: EUR 50,000+ in
      2013 + EUR 20,000 in cost savings (bottom line).

  •   Turnover forecast for TM+MT solution for hotel booking site in
      2013: anywhere between EUR 50,000 and 200,000.


  FOR AVB, MT IS A GOOD INVESTMENT!
Phone:     +31 20 645.66.10
Mobile:    +31 625.025.475
E-mail:    joel.sigling@avb.nl
Twitter:   @JoelAVB
Address:   Ouderkerkerlaan 50
           1185 AD Amstelveen
           The Netherlands
Website:   www.avb.nl

More Related Content

Similar to TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Two Practical Use Cases at AVB Translations, Joel Sigling, AVB, 17 October 2012

TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...TAUS - The Language Data Network
 
Webinar automotive and engineering content 16.06.16
Webinar   automotive and engineering content 16.06.16Webinar   automotive and engineering content 16.06.16
Webinar automotive and engineering content 16.06.16kantanmt
 
Managing Translation Memories for Engineering and Automotive Translation
Managing Translation Memories for Engineering and Automotive TranslationManaging Translation Memories for Engineering and Automotive Translation
Managing Translation Memories for Engineering and Automotive TranslationPoulomi Choudhury
 
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...TAUS - The Language Data Network
 
iMT Language Solutions
iMT Language SolutionsiMT Language Solutions
iMT Language SolutionsSDL
 
Lexcelera MT Breaking Compromises
Lexcelera MT Breaking CompromisesLexcelera MT Breaking Compromises
Lexcelera MT Breaking CompromisesLoriThicke
 
LavaCon 2015: Efficient Translation Management - 5 Specific Metrics That Wil...
LavaCon 2015:  Efficient Translation Management - 5 Specific Metrics That Wil...LavaCon 2015:  Efficient Translation Management - 5 Specific Metrics That Wil...
LavaCon 2015: Efficient Translation Management - 5 Specific Metrics That Wil...Scott Carothers
 
Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32IXIASOFT
 
LavaCon Kinetic, Mangaing the Translation Process-A Peek Behind the Curtain
LavaCon  Kinetic, Mangaing the Translation Process-A Peek Behind the CurtainLavaCon  Kinetic, Mangaing the Translation Process-A Peek Behind the Curtain
LavaCon Kinetic, Mangaing the Translation Process-A Peek Behind the CurtainScott Carothers
 
LavaCon2014 Kinetic, Mangaing the Translation Process- A Peek Behind the Cu...
LavaCon2014   Kinetic, Mangaing the Translation Process- A Peek Behind the Cu...LavaCon2014   Kinetic, Mangaing the Translation Process- A Peek Behind the Cu...
LavaCon2014 Kinetic, Mangaing the Translation Process- A Peek Behind the Cu...Scott Carothers
 
2012 GALA Webinar: Machine Translation for LSPs. A practical guide
2012 GALA Webinar: Machine Translation for LSPs. A practical guide2012 GALA Webinar: Machine Translation for LSPs. A practical guide
2012 GALA Webinar: Machine Translation for LSPs. A practical guidetauyou
 
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...dclsocialmedia
 
Profoundis - Why OpenERP
Profoundis - Why OpenERPProfoundis - Why OpenERP
Profoundis - Why OpenERPArjun Pillai
 
An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013Welocalize
 
Language Quality Management: Models, Measures, Methodologies
Language Quality Management: Models, Measures, Methodologies Language Quality Management: Models, Measures, Methodologies
Language Quality Management: Models, Measures, Methodologies Sajan
 
How to Improve Translation Productivity
How to Improve Translation ProductivityHow to Improve Translation Productivity
How to Improve Translation Productivitykantanmt
 

Similar to TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Two Practical Use Cases at AVB Translations, Joel Sigling, AVB, 17 October 2012 (20)

TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
 
Webinar automotive and engineering content 16.06.16
Webinar   automotive and engineering content 16.06.16Webinar   automotive and engineering content 16.06.16
Webinar automotive and engineering content 16.06.16
 
Managing Translation Memories for Engineering and Automotive Translation
Managing Translation Memories for Engineering and Automotive TranslationManaging Translation Memories for Engineering and Automotive Translation
Managing Translation Memories for Engineering and Automotive Translation
 
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
 
iMT Language Solutions
iMT Language SolutionsiMT Language Solutions
iMT Language Solutions
 
Lexcelera MT Breaking Compromises
Lexcelera MT Breaking CompromisesLexcelera MT Breaking Compromises
Lexcelera MT Breaking Compromises
 
LavaCon 2015: Efficient Translation Management - 5 Specific Metrics That Wil...
LavaCon 2015:  Efficient Translation Management - 5 Specific Metrics That Wil...LavaCon 2015:  Efficient Translation Management - 5 Specific Metrics That Wil...
LavaCon 2015: Efficient Translation Management - 5 Specific Metrics That Wil...
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
Sujay_Resume
Sujay_ResumeSujay_Resume
Sujay_Resume
 
Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32
 
Quick Tips from the IT Trenches
Quick Tips from the IT TrenchesQuick Tips from the IT Trenches
Quick Tips from the IT Trenches
 
LeadDeskin kasvutarina
LeadDeskin kasvutarinaLeadDeskin kasvutarina
LeadDeskin kasvutarina
 
LavaCon Kinetic, Mangaing the Translation Process-A Peek Behind the Curtain
LavaCon  Kinetic, Mangaing the Translation Process-A Peek Behind the CurtainLavaCon  Kinetic, Mangaing the Translation Process-A Peek Behind the Curtain
LavaCon Kinetic, Mangaing the Translation Process-A Peek Behind the Curtain
 
LavaCon2014 Kinetic, Mangaing the Translation Process- A Peek Behind the Cu...
LavaCon2014   Kinetic, Mangaing the Translation Process- A Peek Behind the Cu...LavaCon2014   Kinetic, Mangaing the Translation Process- A Peek Behind the Cu...
LavaCon2014 Kinetic, Mangaing the Translation Process- A Peek Behind the Cu...
 
2012 GALA Webinar: Machine Translation for LSPs. A practical guide
2012 GALA Webinar: Machine Translation for LSPs. A practical guide2012 GALA Webinar: Machine Translation for LSPs. A practical guide
2012 GALA Webinar: Machine Translation for LSPs. A practical guide
 
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
 
Profoundis - Why OpenERP
Profoundis - Why OpenERPProfoundis - Why OpenERP
Profoundis - Why OpenERP
 
An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013
 
Language Quality Management: Models, Measures, Methodologies
Language Quality Management: Models, Measures, Methodologies Language Quality Management: Models, Measures, Methodologies
Language Quality Management: Models, Measures, Methodologies
 
How to Improve Translation Productivity
How to Improve Translation ProductivityHow to Improve Translation Productivity
How to Improve Translation Productivity
 

More from TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS - The Language Data Network
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...TAUS - The Language Data Network
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)TAUS - The Language Data Network
 

More from TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
 

Recently uploaded

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Two Practical Use Cases at AVB Translations, Joel Sigling, AVB, 17 October 2012

  • 1. TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE Using Moses To Win Business That Would Otherwise Be Lost. Two Practical Use Cases at AVB Translations 10:00-10:20 Wednesday, 17 October Joël Sigling AVB Translations
  • 2. Joël Sigling Director Technology & Business Partners Using Moses to win business that would otherwise be lost TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE Seattle, 17 October 2012 ©AVB Translations , 2012
  • 3. AVB Translations background • Amstelveens Vertaalburo: founded 1972 – traditional, high-quality agency • Translation World: founded 2002, tech-savvy all-round player • Merger in 2010 >> AVB Translations: premium brand with strong tech focus • Top 5 player in The Netherlands, 2011 turnover € 4.6 million • Core business: general translations – legal, financial, technical, … New branch: Single Language Vendor Dutch for global MLVs
  • 4. History of MT interest • Member of TAUS since 2008, 1st round table Amsterdam • Visited TAUS User Conferences in US since 2009 • Sense of urgency developed, merger distraction 2010 • Action in 2011 after merger • 2011: Our own choice: Dutch <> English legal domain engine • 2012: At customer request, tourism engines English > X languages • Why SMT, why Moses? Quicker, cheaper, similar quality (research shows)
  • 5. Case 1: Legal domain engine • Legal translations approx. 40% of AVB business, 80% Dutch <> English • Not the obvious choice: people said MT wouldn’t work for legal: sentences too long, material too intricate • Statistical MT suited to non-stylistic materials, e.g. legal • If this works, we can make MT happen for all other domains
  • 6. Legal engine: Objectives • Increased productivity, no BLEU % target, but tangible, practical results. How much extra can a translator do when compared to HT? • Tool to offer usable quality with very quick turnarounds for high volume (typical “Friday afternoon lawyer requests”) • Becoming an MT front runner in the non-localization sector for Dutch (5th language in Europe after FIGS)
  • 7. Legal engine: Development • Choice between in-house and external development • In-house: control, developing expertise, lower long-term cost • External: lower initial cost, much more expertise > best for now • Our prerequisites for development option • Ownership and free access to engine • Assurance data will not be used or copied by builder • Acceptable costs for development & usage • Skilled partner > AsiaOnline, CrossLang, Pangeanic, LetsMT, SmartMate?? • CrossLang > all of the above, closest to our office, independent
  • 8. Legal engine: Input data • Highest quality AVB Dutch <> English legal translations: approx. 700k words per language. Predominantly civil law. • Not fully reviewed AVB TM, still high-quality: approx. 10 million words per language. Predominantly civil law. • Legal translations harvested by CrossLang, more diverse legal material: 7 million words per language
  • 9. Legal engine: Initial test results Various productivity tests done in CrossLang and TAUS productivity assessment tools (very similar): productivity between 5% and 20% higher for post-editing than for human-only translation. These results held for both a very experienced legal translator and a translation novice (intern). Encouragement to continue…
  • 10. Legal engine: Results in practice • Live rush translations done in first few weeks for new customers: • 1,500 word trial done for law firm needing high volume in very short time. Post-edited in 75 minutes. Customer happy with quality/price ratio. • 25,000 words in two days with light PE effort by two post-editors. Quality estimate 80-90% of human translation. • 4,500 words in 3 hours with almost full PE effort by one post-editor. Quality estimate >90% of human translation. • 15,000 words in one day, done by two post-editors. Quality estimate 80-90% of human translation.
  • 11. Legal engine: more success • Other successes: • 7 documents translated and post-edited this way in one court case. Customer very happy, case won! • Three new customers secured because we were able to turn around a large volume in a very short time. • Some customers are ordering this type of translation instead of normal translation, even with normal deadlines. • Sales department have a new USP to sell to our legal customers. Interest is growing!
  • 12. Case 2: Hotel booking engine SITUATION • Big (4 million words) hotel booking site, partly available in 10 languages • Booking funnel fully and best-selling hotels largely translated Customer ambition: • Have all 4M words translated & do more languages BUT • No budget for full human translation/post-editing • Google too expensive and risk of being considered duplicate content at some point in the future
  • 13. Booking engine: AVB solution • Combination of TM & MT to translate all content & differentiate in quality where necessary • Build up new TMs for “new” languages and use TM from “old” languages to create dataset for engines with harvested material • Customer to get any TM matches for free after paying for initial baseline (20,000 words) • Engine development paid for, but throughput almost free
  • 14. Booking engine: Challenges • Getting traction within client DMT (took two years) • Finding enough relevant data to build a decent engine (classic) • Deciding on KPIs from the customer: when would it be successful? • Technical: getting workable data in and out of customer CMS
  • 15. Booking engine: Results • Two languages chosen for trial: French and Polish • Baseline TM built for these, all website TM matches replaced for French and Polish. This has already saved the customer thousands of euros. • Two Moses engines built by TauYou as trial: French and Polish • 1st Polish engine results were put online as a test; bookings increased in spite of poor initial language quality • Customer has approved four more engines: Dutch, French, German, Spanish and Italian
  • 16. Business conclusions • We are already making money on MT. All investments paid back in half a year. • Extra turnover forecast thanks to legal engine: EUR 50,000+ in 2013 + EUR 20,000 in cost savings (bottom line). • Turnover forecast for TM+MT solution for hotel booking site in 2013: anywhere between EUR 50,000 and 200,000. FOR AVB, MT IS A GOOD INVESTMENT!
  • 17. Phone: +31 20 645.66.10 Mobile: +31 625.025.475 E-mail: joel.sigling@avb.nl Twitter: @JoelAVB Address: Ouderkerkerlaan 50 1185 AD Amstelveen The Netherlands Website: www.avb.nl