TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)


Published on

The proposed developments were wide-reaching and have significant implications for how the industry conducts business. We received 680 usable responses to the consultation questionnaire and a wealth of new ideas on further new features and services. Responses came from every stakeholder group in the industry: translators, corporate buyers, public sector buyers, service providers, technology vendors, academia, and consultants / sector analysts / commentators.

A very large majority recognize the benefits of sharing translation memories. There was a strong endorsement of plans to provide users and members with greater intelligent access and easier access to data through translation matching and open APIs for services. View the presentation to see how people voted, what has been prioritized, and when new services will be delivered.

Published in: Technology
  • Be the first to comment

  • Be the first to like this

TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)

  1. 1. TAUS Data Association Roadmap ConsultationWebinar<br />5PM CEST / 8 AM PDT<br />12 May 2010<br />
  2. 2. Hosts<br />Jaap van der Meer, Director<br />Rahzeb Choudhury, Operations Director<br />
  3. 3. Webinar Participants<br /><ul><li>280 registered
  4. 4. 50% translators
  5. 5. 30% service providers
  6. 6. 10% buyers providers (held buyer webinar 28 April)
  7. 7. 5% technology providers
  8. 8. 5% other</li></ul>From all corners of the globe and all have reviewed the following three tables and indicated needs and preferences:<br />
  9. 9. How People Use TDA Today<br />
  10. 10. Proposed New Features & Services<br />Private: members may limit sharing of TMs to their own selection of registered users (‘private vaults’).<br />Integration: API’s for all services will be available to everyone.<br />
  11. 11. Use Scenarios <br />Registration and acceptance of terms is required for all use scenarios. <br />
  12. 12. Agenda<br /><ul><li>Implement an industry vision
  13. 13. Background
  14. 14. Status
  15. 15. Consultation
  16. 16. Roadmap
  17. 17. To share or not to share
  18. 18. Beyond current proposals
  19. 19. Questions
  20. 20. Next steps
  21. 21. Thanks again</li></li></ul><li>Implement an industry vision<br />See TAUS animation on 2000 years of translation on YouTube or dotSUB – Translation: yesterday, today, tomorrow<br />Share translation memories ….<br /><ul><li>to extend our investments
  22. 22. to translate much more content
  23. 23. to help world communicate better!</li></li></ul><li>Background - from closed to open industry<br />The evolution is irreversible .…From fragmentation to consolidation …. From closed to open …. From desk-top to enterprise server to industry shared platforms. <br />TM<br />TM<br />TM<br />Industry-shared<br />language data<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />TM<br />Sharing language data accelerates automation and innovation in the translation industry.<br />
  24. 24. Background - milestones<br />
  25. 25. Status – TDA and the market<br /><ul><li>Sixty-five member organizations
  26. 26. 7 billion words downloaded, 2.6 billion uploaded
  27. 27. Sharing translation memories growing industry practice
  28. 28. Translation tools moving to the “Cloud”</li></ul>TDA is the only member-driven and industry-sanctioned choice<br />
  29. 29. Consultation<br /><ul><li>680 usable responses to survey
  30. 30. 45% had not heard of TDA before the consultation
  31. 31. Overall 85% believe/agree sharing translation memories brings improvements to term use, leveraging, machine translation quality
  32. 32. 94% want to make use of TAUS Search
  33. 33. 75% want TAUS Search integrated into own environment
  34. 34. 86% want to be able to download data with translation matches
  35. 35. 70% want to private vaults for limited sharing</li></li></ul><li>Roadmap in implementation<br />TM Cleaning already in place – tag filters, remove corrupt characters, remove corrupt XML, reject duplicates, flag missing translations, users report errors via TAUS Search<br />Free TAUS Search API available by end May<br />Translation Matching using scaled up open-source GlobalSight engine scheduled for October (sponsored by Welocalize)<br /><ul><li>Free translation matching API to reuse short strings
  36. 36. TDA integration layer enabling connectivity with any CAT tool using XLIFF</li></ul>Private Vaults for limited sharing – by end of year<br />
  37. 37. Roadmap in pilot or testing<br />TM Cleaning to come – Use statistical algorithms to flag bad translations. Proven in research now being tested for scalability<br />Matching Scores – for more granular data selection for MT training & TM leveraging being tested<br />MT Trainer – currently piloting<br />To enable fully automated customization of machine translation engines. Prospective buyer loads sample file & own data. This is sent with additional data from TDA to MT vendors who train engines. Output quality is benchmarked to provide objective reporting across engines from various providers. Will allow small specialized solution providers to connect with buyers.<br />Results to be presented 20 May during the TAUS Executive Forum in Copenhagen<br />
  38. 38. Roadmap<br />Improvements to TAUS Search, matrix search and matrix TM, genre identification, full MT trainer service, tool compliance (saving in tool specific formats) will be delivered next year<br />
  39. 39. To share or not to share<br /><ul><li>50% willing to share TMs for others to download
  40. 40. 60% willing to share for TAUS Search only
  41. 41. 70% happy to share in private vaults</li></li></ul><li>Beyond current proposals<br />Responses to consultation include a wealth of ideas on new features:<br /> Forums, open source middleware, glossary creation tools, flagging jargon and much more<br /><ul><li>People / companies offering services instead of sharing data
  42. 42. We will report more use cases, provide more guidance on good practices with shared data, and create a blog
  43. 43. Assessing potential to open-source TDA technology</li></li></ul><li>Questions<br />30 minutes were taken to answer questions on a wide variety of issues:<br />A few answers :<br />Member fees represent a collective investment by the all stakeholders in the industry to create a common infrastructure for everyone to win from sharing and benefit from reuse of translation memories<br />Yes, TDA is partnering with other industry organizations<br />Legal issues are covered by the sharing and pooling conditions found via the JOIN TDA web page<br />
  44. 44. Next steps<br />May 19 - Members vote on reduced entry-level member fees (EGM in Copenhagen)<br />May 20 - Report results of MT Trainer Pilot (Copenhagen)<br />End May - Open API for TAUS Search<br />June - TAUS Data Association blog<br /> Report more use cases<br />MultiCorpora’sMultiTrans fully integrated for seamless upload and download<br />July - Lionbridge’s Translator Workspace fully integrated <br />October - Translation matching live with open API and XLIFF <br /> GlobalSight fully integrated<br />TDA Annual General Meeting, Portland (OR), USA<br />
  45. 45. Thanks!<br />Questions and comments are very welcome<br />jaap@translationautomation.com<br />rahzeb@translationautomation.com<br />