SlideShare a Scribd company logo
TAUS USER CONFERENCE 2010
LANGUAGE BUSINESS INNOVATION
4 – 6 OCTOBER / PORTLAND (OR), USA




MONDAY 4 OCTOBER / 15.00

MAN, MACHINE AND ADVANCED TRANSLATION
MEMORY LEVERAGING
Daniel Gervais, MultiCorpora
Five New Technologies...


     ...that will change enterprise computing.

          Search – the Next Generation
          Environments to create Virtual Companies
          Virtualization Management Consoles
          Secure Cloud Creation
          Management Technologies

            Source:
            Eric Lundquist, Editor-in-Chief, eWeek
            smartertechnology.com




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
So, what does that mean for us?

     • elastic capacity                                         • Search – the Next Generation
     • fault tolerant                                           • Environments to create Virtual Companies
     • Scalable                                                 • Virtualization Management Consoles
     • Secure                                                   • Secure Cloud Creation
     • and easily maintained                                    • Management Technologies


     Cool concepts, but...
      How does this affect our industry?
      How do we access them?
      How do we harness them for greater productivity?
      What are the real benefits?
      What is the cost?
      What are the best practices?
      Where are they going?


© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
A brief roundup of SCbDS


      Super-Cloud based Data Sharing
           o      TDA
           o      MyMemory
           o      Google Translate
           o      Grand Dictionnaire Terminologique, Termium, IATE, ...
           o      EUR-Lex
           o      Other multilingual public-domain sources
           o      ...



© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
SCbDS upsides

      Advances in technology support large translation memories
       o Build vs. Existing
       o Proprietary vs. shared
       o Public domain mining
      Align large multilingual corpora
      Data mine within aligned corpora
      Measurable benefits have been obtained through ALTM on top of large
       memories
      BUT THERE’S A DANGER: Translation memory pollution & too much
       automation!




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Translation Memory Pollution is...

      Correctly aligned segments containing poor translation:
           o      Inadequate editing
           o      Poor post-mortem cleanup
      Incorrectly aligned segments:
           o      Poor alignment technology
           o      Inadequate post-alignment proofing
          Rogue tags
          Correct translation of undesired content
          Correct translation of obsolete source
          Obsolete translation of correct source
          Poor translation of poorly written source content:




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Translation Memory Pollution: overall conclusion

           Sentence-level leveraging in absence of contextual information is too
           simplistic and can lead to unsatisfactory results!




                    TM                                   ???
                                                                                          3§“§%!°“§$%“§$&$&/!


                                                                                                     




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
The Big Question

      Does increased
       matching through
       ALTM equate to
       REAL productivity
       gain?




          We say    YES!

© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Here‘s why we say YES!


      Large enterprise case
      Large government case
      Department of Justice
      Medax
      UNESCO
      Services Canada
© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
The main problem
      Wide variation of Document Types
      Legacy files in PDF
      No TM for certain customers
     Secondary problems
      Content is often complex
      Highly sensitive to context and style
      Highly client-specific




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Conventional TMs

     Mixed Results:
          No promised massive cost savings
          Useful enforcement tool
          Conventional terminology tool unwieldy
          Excel spreadsheets preferred!
     Time Investment Critical
      Therefore, selectivity of clients
      No ability to influence clients at the authoring stage - Documents are
       rarely repetitive on a traditional segment model
      Cost-benefit decisions: no TMs or truncated TMs




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
ALTM addressed needs for:

                 Context
                 Matches at the paragraph, level
                 Matches at the segment and sub-segment levels
                 Interfacing/Compatibility with external vendors who
                  used various TM tools
                 Better integration with terminology management, live
                  online deployment
                 Server-based solution to link global production platform




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
ALTM Benefits:


          Alignment automation = Low overhead for maintaining memory
          Rapid creation of larger memories = Faster project scoping and bidding
          Higher probability of matches
          Context provided at all times = Reduce research time
          Identification of sub-expressions = Result in more matches
          Terminology integration = Reduce research time, increase consistency


                       In general, more matches reduce revision time
                       Used to rebuild out-of-date conventional TM’s
                               Cost-effective competitiveness

© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Proof of proposal example


      Translation Bureau RFP
           o For 1200 licenses
           o Proof of Proposal – 5 consecutive business days:
                                Install full client-server, 20 workstations
                                Create a production TM of 15 000 pairs of unstructured
                                 documents in various formats (≈ 20 M source words)
                                1 day - 10 people user training
                                1 day – production simulation use
                                Ensure no productivity loss - compute gains
                                              MultiCorpora won the RFP

© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Harmonize legacy documents


      Department of Justice Canada
           o Laws & Regulations in French and English
           o No harmonization of ambiguous terms
           o ALTM allowed to extract terminology, see the translation
             discrepancies in context and identify corrections
           o ALTM combined with terminology allowed building TermBases of
             ambiguous terms from process on one document, and correct in all
             other documents
           o Continuous learning process, powered by ALTM
                                        Do in computing minutes
                                      what used to take people months

© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
• German Translation Service Provider
            •      geographically dispersed translator pool - roughly 250 doctors and pharmacists
            •      seven full-time employees oversee processing of nearly 5 million words per year
      • Historically no clear TM strategy
            •      Document types not conducive to TM
            •      Lacklustre productivity gains vs. overhead
      • Discovered ROI from the terminology management and sub-segment
        matching
            •      high number of shorter, domain-specific repeated sub-segment phrases
       Creates hybrid, partially pre-translated documents containing
        “pre-harmonized” terminology to send out
            o      90% comes from the TermBase, created by sub-segment matches, analysis
            o      Remaining 10% from the TextBase



© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
UNESCO

       “On the Fly” translation memories
            o Analyse docs against all translation memories
            o Identify which docs and memories are the most used
            o Re-build specific memories from UNESCO documents, and related
              organisations’ documents referenced in documents
            o Achieve higher degree of recycling from partner organisation’s
              documents
            o Ability to recycle / harmonize domain-specific terminology by
              example, powered by ALTM.
            o Continuous improvement virtuous circle
                Create a TM in minutes vs. what would take months to align
                                Add additional external content
                 Get domain-specific terminology though sub-expressions
© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Services Canada - Job Bank


      Distinctive Hybrid translation
       process
           o 90M words per year
           o TM / MT / post editing
           o Linguistic assets comprise
                                Previous job offers
                                Domain-specific terms
                                Shared data increased productivity




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Translation Memory Pollution: Antidote


      Content selection
           o Too much unstructured content
           o Need establish mining hierarchy

      Use of statistics
           o Generate usage & translation distribution statistics per content
             repositories
           o Standardize in “live” Terminology Databases

      Use human intelligence
           o Human needs to be involved. Too much automation only
             propagates pollution…
           o Virtuous improvement circle

© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
Other uses of ALTM


      Monolingual analysis
           o Identify single source candidates
           o Identify terms to standardize
           o Identify deviations of customized documents from
             baseline texts
           o Identify localization order prioritization of baseline
             documents - 15% savings potential
                                TextBase repetitions
                                Term repetitions




© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
The Journey Is Not Yet Finished

      More automation of the antidotes to pollution
      Recent improvement in term extraction algorithms can
       expose pollution sources
      Evangelization of the processes
      No quick fix: Human factor remains involved. Not yet at the
       vision of fully automated pre-translated ALTM.
      New collaboration models between linguists and TM
       systems
      Better support for linguistic decision-making
      Evangelization of the role of the post-editor


© 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
TAUS USER CONFERENCE 2010, Man, Machine and advanced translation memory leveraging
TAUS USER CONFERENCE 2010, Man, Machine and advanced translation memory leveraging

More Related Content

Viewers also liked

TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS - The Language Data Network
 
TAUS Moses Roundtable, Prague, 11 September 2013
TAUS Moses Roundtable, Prague, 11 September 2013TAUS Moses Roundtable, Prague, 11 September 2013
TAUS Moses Roundtable, Prague, 11 September 2013
TAUS - The Language Data Network
 
TAUS New Year's Reception 2014
TAUS New Year's Reception 2014TAUS New Year's Reception 2014
TAUS New Year's Reception 2014
TAUS - The Language Data Network
 
The Future of Technical Communication is Marketing
The Future of Technical Communication is MarketingThe Future of Technical Communication is Marketing
The Future of Technical Communication is Marketing
Scott Abel
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS - The Language Data Network
 
Antzinaroa eta erdi aroa nora taus
Antzinaroa eta erdi aroa nora tausAntzinaroa eta erdi aroa nora taus
Antzinaroa eta erdi aroa nora taus
Lourdes Macicior
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
TAUS - The Language Data Network
 
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engineTAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS - The Language Data Network
 
The cognitive era and the future of content
The cognitive era and the future of contentThe cognitive era and the future of content
The cognitive era and the future of content
Scott Abel
 

Viewers also liked (10)

TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013
 
TAUS Moses Roundtable, Prague, 11 September 2013
TAUS Moses Roundtable, Prague, 11 September 2013TAUS Moses Roundtable, Prague, 11 September 2013
TAUS Moses Roundtable, Prague, 11 September 2013
 
TAUS New Year's Reception 2014
TAUS New Year's Reception 2014TAUS New Year's Reception 2014
TAUS New Year's Reception 2014
 
The Future of Technical Communication is Marketing
The Future of Technical Communication is MarketingThe Future of Technical Communication is Marketing
The Future of Technical Communication is Marketing
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
 
TAUS MT Post-Editing Guidelines
TAUS MT Post-Editing GuidelinesTAUS MT Post-Editing Guidelines
TAUS MT Post-Editing Guidelines
 
Antzinaroa eta erdi aroa nora taus
Antzinaroa eta erdi aroa nora tausAntzinaroa eta erdi aroa nora taus
Antzinaroa eta erdi aroa nora taus
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engineTAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
 
The cognitive era and the future of content
The cognitive era and the future of contentThe cognitive era and the future of content
The cognitive era and the future of content
 

Similar to TAUS USER CONFERENCE 2010, Man, Machine and advanced translation memory leveraging

DEC-16-UNLEASH THE POWER OF HUMAN COLLABORATION
DEC-16-UNLEASH THE POWER OF HUMAN COLLABORATIONDEC-16-UNLEASH THE POWER OF HUMAN COLLABORATION
DEC-16-UNLEASH THE POWER OF HUMAN COLLABORATIONMichael G. Schwarzwalder
 
Insights in the MT Market, by Jaap van der Meer, TAUS
Insights in the MT Market, by Jaap van der Meer, TAUSInsights in the MT Market, by Jaap van der Meer, TAUS
Insights in the MT Market, by Jaap van der Meer, TAUS
TAUS - The Language Data Network
 
Intro to watson bluemix services
Intro to watson bluemix servicesIntro to watson bluemix services
Intro to watson bluemix services
Vikas Manoria
 
Tempo - Mobile access with Governance
Tempo - Mobile access with GovernanceTempo - Mobile access with Governance
Tempo - Mobile access with GovernanceGabe Faraone
 
SDL Server 2009 Launch Presentation
SDL Server 2009 Launch PresentationSDL Server 2009 Launch Presentation
SDL Server 2009 Launch Presentation
anthonytate88
 
Lotusphere BP304: Looking For the Right Document Management Alternative
Lotusphere BP304: Looking For the Right Document Management AlternativeLotusphere BP304: Looking For the Right Document Management Alternative
Lotusphere BP304: Looking For the Right Document Management Alternative
Roland Driesen
 
Tely Labs Webinar Intro May 22nd 2014
Tely Labs Webinar Intro May 22nd 2014Tely Labs Webinar Intro May 22nd 2014
Tely Labs Webinar Intro May 22nd 2014
Paul Richards
 
Exponential e-unified-communications-presentations
Exponential e-unified-communications-presentationsExponential e-unified-communications-presentations
Exponential e-unified-communications-presentations
Exponential_e
 
Unified Communications - Collaborative services that deliver greater busines...
Unified Communications  - Collaborative services that deliver greater busines...Unified Communications  - Collaborative services that deliver greater busines...
Unified Communications - Collaborative services that deliver greater busines...
Exponential_e
 
Open Source in Government / Graham Taylor
Open Source in Government / Graham TaylorOpen Source in Government / Graham Taylor
Open Source in Government / Graham Taylor
Paris Open Source Summit
 
The Human ROI: Past, Present and Future of Localization
The Human ROI: Past, Present and Future of LocalizationThe Human ROI: Past, Present and Future of Localization
The Human ROI: Past, Present and Future of Localization
Michael Meinhardt
 
UG Software Technologies
UG Software TechnologiesUG Software Technologies
UG Software Technologies
Ug Webmart
 
Case Studies in Enterprise Messaging Federation
Case Studies in Enterprise Messaging FederationCase Studies in Enterprise Messaging Federation
Case Studies in Enterprise Messaging Federation
Alan Quayle
 
Evolve Com Tec Presentation
Evolve Com Tec PresentationEvolve Com Tec Presentation
Evolve Com Tec Presentationsamanthahubbard
 
Microsoft Skype for Business and the quest for legacy video interoperability
Microsoft Skype for Business and the quest for legacy video interoperabilityMicrosoft Skype for Business and the quest for legacy video interoperability
Microsoft Skype for Business and the quest for legacy video interoperability
Anders Løkke
 
OpenText PowerDOCS: A Cloud Solution for Document Generation
OpenText PowerDOCS: A Cloud Solution for Document GenerationOpenText PowerDOCS: A Cloud Solution for Document Generation
OpenText PowerDOCS: A Cloud Solution for Document Generation
Marc St-Pierre
 
IBM Lotus Sametime - IM for the Enterprise
IBM Lotus Sametime - IM for the EnterpriseIBM Lotus Sametime - IM for the Enterprise
IBM Lotus Sametime - IM for the Enterprise
Dvir Reznik
 
The Growing Research that Open Source Owns the Future in Cloud
The Growing Research that Open Source Owns the Future in CloudThe Growing Research that Open Source Owns the Future in Cloud
The Growing Research that Open Source Owns the Future in Cloud
All Things Open
 
Train foundation model for domain-specific language model
Train foundation model for domain-specific language modelTrain foundation model for domain-specific language model
Train foundation model for domain-specific language model
Benjaminlapid1
 
MiTiN 2013 Keynote in Detroit Michigan
MiTiN 2013 Keynote in Detroit MichiganMiTiN 2013 Keynote in Detroit Michigan
MiTiN 2013 Keynote in Detroit Michigan
Kirti Vashee
 

Similar to TAUS USER CONFERENCE 2010, Man, Machine and advanced translation memory leveraging (20)

DEC-16-UNLEASH THE POWER OF HUMAN COLLABORATION
DEC-16-UNLEASH THE POWER OF HUMAN COLLABORATIONDEC-16-UNLEASH THE POWER OF HUMAN COLLABORATION
DEC-16-UNLEASH THE POWER OF HUMAN COLLABORATION
 
Insights in the MT Market, by Jaap van der Meer, TAUS
Insights in the MT Market, by Jaap van der Meer, TAUSInsights in the MT Market, by Jaap van der Meer, TAUS
Insights in the MT Market, by Jaap van der Meer, TAUS
 
Intro to watson bluemix services
Intro to watson bluemix servicesIntro to watson bluemix services
Intro to watson bluemix services
 
Tempo - Mobile access with Governance
Tempo - Mobile access with GovernanceTempo - Mobile access with Governance
Tempo - Mobile access with Governance
 
SDL Server 2009 Launch Presentation
SDL Server 2009 Launch PresentationSDL Server 2009 Launch Presentation
SDL Server 2009 Launch Presentation
 
Lotusphere BP304: Looking For the Right Document Management Alternative
Lotusphere BP304: Looking For the Right Document Management AlternativeLotusphere BP304: Looking For the Right Document Management Alternative
Lotusphere BP304: Looking For the Right Document Management Alternative
 
Tely Labs Webinar Intro May 22nd 2014
Tely Labs Webinar Intro May 22nd 2014Tely Labs Webinar Intro May 22nd 2014
Tely Labs Webinar Intro May 22nd 2014
 
Exponential e-unified-communications-presentations
Exponential e-unified-communications-presentationsExponential e-unified-communications-presentations
Exponential e-unified-communications-presentations
 
Unified Communications - Collaborative services that deliver greater busines...
Unified Communications  - Collaborative services that deliver greater busines...Unified Communications  - Collaborative services that deliver greater busines...
Unified Communications - Collaborative services that deliver greater busines...
 
Open Source in Government / Graham Taylor
Open Source in Government / Graham TaylorOpen Source in Government / Graham Taylor
Open Source in Government / Graham Taylor
 
The Human ROI: Past, Present and Future of Localization
The Human ROI: Past, Present and Future of LocalizationThe Human ROI: Past, Present and Future of Localization
The Human ROI: Past, Present and Future of Localization
 
UG Software Technologies
UG Software TechnologiesUG Software Technologies
UG Software Technologies
 
Case Studies in Enterprise Messaging Federation
Case Studies in Enterprise Messaging FederationCase Studies in Enterprise Messaging Federation
Case Studies in Enterprise Messaging Federation
 
Evolve Com Tec Presentation
Evolve Com Tec PresentationEvolve Com Tec Presentation
Evolve Com Tec Presentation
 
Microsoft Skype for Business and the quest for legacy video interoperability
Microsoft Skype for Business and the quest for legacy video interoperabilityMicrosoft Skype for Business and the quest for legacy video interoperability
Microsoft Skype for Business and the quest for legacy video interoperability
 
OpenText PowerDOCS: A Cloud Solution for Document Generation
OpenText PowerDOCS: A Cloud Solution for Document GenerationOpenText PowerDOCS: A Cloud Solution for Document Generation
OpenText PowerDOCS: A Cloud Solution for Document Generation
 
IBM Lotus Sametime - IM for the Enterprise
IBM Lotus Sametime - IM for the EnterpriseIBM Lotus Sametime - IM for the Enterprise
IBM Lotus Sametime - IM for the Enterprise
 
The Growing Research that Open Source Owns the Future in Cloud
The Growing Research that Open Source Owns the Future in CloudThe Growing Research that Open Source Owns the Future in Cloud
The Growing Research that Open Source Owns the Future in Cloud
 
Train foundation model for domain-specific language model
Train foundation model for domain-specific language modelTrain foundation model for domain-specific language model
Train foundation model for domain-specific language model
 
MiTiN 2013 Keynote in Detroit Michigan
MiTiN 2013 Keynote in Detroit MichiganMiTiN 2013 Keynote in Detroit Michigan
MiTiN 2013 Keynote in Detroit Michigan
 

More from TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
TAUS - The Language Data Network
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
TAUS - The Language Data Network
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
TAUS - The Language Data Network
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
TAUS - The Language Data Network
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
TAUS - The Language Data Network
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
TAUS - The Language Data Network
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
TAUS - The Language Data Network
 

More from TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
 

Recently uploaded

State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 

Recently uploaded (20)

State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 

TAUS USER CONFERENCE 2010, Man, Machine and advanced translation memory leveraging

  • 1. TAUS USER CONFERENCE 2010 LANGUAGE BUSINESS INNOVATION 4 – 6 OCTOBER / PORTLAND (OR), USA MONDAY 4 OCTOBER / 15.00 MAN, MACHINE AND ADVANCED TRANSLATION MEMORY LEVERAGING Daniel Gervais, MultiCorpora
  • 2.
  • 3. Five New Technologies... ...that will change enterprise computing.  Search – the Next Generation  Environments to create Virtual Companies  Virtualization Management Consoles  Secure Cloud Creation  Management Technologies Source: Eric Lundquist, Editor-in-Chief, eWeek smartertechnology.com © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 4. So, what does that mean for us? • elastic capacity • Search – the Next Generation • fault tolerant • Environments to create Virtual Companies • Scalable • Virtualization Management Consoles • Secure • Secure Cloud Creation • and easily maintained • Management Technologies Cool concepts, but...  How does this affect our industry?  How do we access them?  How do we harness them for greater productivity?  What are the real benefits?  What is the cost?  What are the best practices?  Where are they going? © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 5. A brief roundup of SCbDS  Super-Cloud based Data Sharing o TDA o MyMemory o Google Translate o Grand Dictionnaire Terminologique, Termium, IATE, ... o EUR-Lex o Other multilingual public-domain sources o ... © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 6. SCbDS upsides  Advances in technology support large translation memories o Build vs. Existing o Proprietary vs. shared o Public domain mining  Align large multilingual corpora  Data mine within aligned corpora  Measurable benefits have been obtained through ALTM on top of large memories  BUT THERE’S A DANGER: Translation memory pollution & too much automation! © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 7. Translation Memory Pollution is...  Correctly aligned segments containing poor translation: o Inadequate editing o Poor post-mortem cleanup  Incorrectly aligned segments: o Poor alignment technology o Inadequate post-alignment proofing  Rogue tags  Correct translation of undesired content  Correct translation of obsolete source  Obsolete translation of correct source  Poor translation of poorly written source content: © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 8. Translation Memory Pollution: overall conclusion Sentence-level leveraging in absence of contextual information is too simplistic and can lead to unsatisfactory results! TM ??? 3§“§%!°“§$%“§$&$&/!  © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 9. The Big Question  Does increased matching through ALTM equate to REAL productivity gain?  We say YES! © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 10. Here‘s why we say YES!  Large enterprise case  Large government case  Department of Justice  Medax  UNESCO  Services Canada © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 11. The main problem  Wide variation of Document Types  Legacy files in PDF  No TM for certain customers Secondary problems  Content is often complex  Highly sensitive to context and style  Highly client-specific © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 12. Conventional TMs Mixed Results:  No promised massive cost savings  Useful enforcement tool  Conventional terminology tool unwieldy  Excel spreadsheets preferred! Time Investment Critical  Therefore, selectivity of clients  No ability to influence clients at the authoring stage - Documents are rarely repetitive on a traditional segment model  Cost-benefit decisions: no TMs or truncated TMs © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 13. ALTM addressed needs for:  Context  Matches at the paragraph, level  Matches at the segment and sub-segment levels  Interfacing/Compatibility with external vendors who used various TM tools  Better integration with terminology management, live online deployment  Server-based solution to link global production platform © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 14. ALTM Benefits:  Alignment automation = Low overhead for maintaining memory  Rapid creation of larger memories = Faster project scoping and bidding  Higher probability of matches  Context provided at all times = Reduce research time  Identification of sub-expressions = Result in more matches  Terminology integration = Reduce research time, increase consistency In general, more matches reduce revision time Used to rebuild out-of-date conventional TM’s Cost-effective competitiveness © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 15. Proof of proposal example  Translation Bureau RFP o For 1200 licenses o Proof of Proposal – 5 consecutive business days:  Install full client-server, 20 workstations  Create a production TM of 15 000 pairs of unstructured documents in various formats (≈ 20 M source words)  1 day - 10 people user training  1 day – production simulation use  Ensure no productivity loss - compute gains MultiCorpora won the RFP © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 16. Harmonize legacy documents  Department of Justice Canada o Laws & Regulations in French and English o No harmonization of ambiguous terms o ALTM allowed to extract terminology, see the translation discrepancies in context and identify corrections o ALTM combined with terminology allowed building TermBases of ambiguous terms from process on one document, and correct in all other documents o Continuous learning process, powered by ALTM Do in computing minutes what used to take people months © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 17. • German Translation Service Provider • geographically dispersed translator pool - roughly 250 doctors and pharmacists • seven full-time employees oversee processing of nearly 5 million words per year • Historically no clear TM strategy • Document types not conducive to TM • Lacklustre productivity gains vs. overhead • Discovered ROI from the terminology management and sub-segment matching • high number of shorter, domain-specific repeated sub-segment phrases  Creates hybrid, partially pre-translated documents containing “pre-harmonized” terminology to send out o 90% comes from the TermBase, created by sub-segment matches, analysis o Remaining 10% from the TextBase © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 18. UNESCO  “On the Fly” translation memories o Analyse docs against all translation memories o Identify which docs and memories are the most used o Re-build specific memories from UNESCO documents, and related organisations’ documents referenced in documents o Achieve higher degree of recycling from partner organisation’s documents o Ability to recycle / harmonize domain-specific terminology by example, powered by ALTM. o Continuous improvement virtuous circle Create a TM in minutes vs. what would take months to align Add additional external content Get domain-specific terminology though sub-expressions © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 19. Services Canada - Job Bank  Distinctive Hybrid translation process o 90M words per year o TM / MT / post editing o Linguistic assets comprise  Previous job offers  Domain-specific terms  Shared data increased productivity © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 20. Translation Memory Pollution: Antidote  Content selection o Too much unstructured content o Need establish mining hierarchy  Use of statistics o Generate usage & translation distribution statistics per content repositories o Standardize in “live” Terminology Databases  Use human intelligence o Human needs to be involved. Too much automation only propagates pollution… o Virtuous improvement circle © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 21. Other uses of ALTM  Monolingual analysis o Identify single source candidates o Identify terms to standardize o Identify deviations of customized documents from baseline texts o Identify localization order prioritization of baseline documents - 15% savings potential  TextBase repetitions  Term repetitions © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.
  • 22. The Journey Is Not Yet Finished  More automation of the antidotes to pollution  Recent improvement in term extraction algorithms can expose pollution sources  Evangelization of the processes  No quick fix: Human factor remains involved. Not yet at the vision of fully automated pre-translated ALTM.  New collaboration models between linguists and TM systems  Better support for linguistic decision-making  Evangelization of the role of the post-editor © 2009 – 2010 | This confidential document is the property of MultiCorpora and cannot be shared, reproduced, distributed or used without permission.