MATI 2009 Conference: CAT Tools

1,896 views

Published on

This is the official presentation "Computer-Assisted Translation Tools" from the 2009 MATI conference.

Published in: Business, Lifestyle, Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,896
On SlideShare
0
From Embeds
0
Number of Embeds
10
Actions
Shares
0
Downloads
41
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

MATI 2009 Conference: CAT Tools

  1. 1. 2009 MATI Annual Conference 2009 MATI Annual Conference Dierk Seeburg IcoText 1 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  2. 2. 2009 MATI Annual Conference Computer-Assisted Translation Tools Dierk Seeburg IcoText (Presentation available at icotext.com in a few days) (CC) Creative Commons. Some Rights Reserved September 12, 2009
  3. 3. 2009 MATI Annual Conference 3 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  4. 4. 2009 MATI Annual Conference Agenda • Intro • PM, TMS, L18n • History • Terminology • ITEs • MT • Questions • Questions • Break 4 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  5. 5. 2009 MATI Annual Conference Purpose •Intro •Overview (incomplete) •Decision Guide 5 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  6. 6. 2009 MATI Annual Conference Dierk Seeburg IcoText 6 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  7. 7. 2009 MATI Annual Conference Marburg 7 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  8. 8. 2009 MATI Annual Conference Dierk Seeburg IcoText 8 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  9. 9. 2009 MATI Annual Conference Dierk Seeburg IcoText 9 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  10. 10. 2009 MATI Annual Conference Phoenix 10 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  11. 11. 2009 MATI Annual Conference Dierk Seeburg IcoText 11 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  12. 12. 2009 MATI Annual Conference Dierk Seeburg IcoText 12 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  13. 13. 2009 MATI Annual Conference Survey •Are you a freelance translator? 13 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  14. 14. 2009 MATI Annual Conference Survey •Are you a freelance interpreter? 14 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  15. 15. 2009 MATI Annual Conference Survey •Do you work for an LSP? 15 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  16. 16. 2009 MATI Annual Conference Survey •Do you work in-house? 16 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  17. 17. 2009 MATI Annual Conference Glossary • G11n = Globalization • I18n = Internationalization • L10n = Localization • T9n = Translation • ITE = Integrated Translation Environment • TMS = Translation Management System • MTS = Machine Translation System • WCMS = Web Content Management System • GMS = Globalization Management System 17 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  18. 18. 2009 MATI Annual Conference Translators and Technology • Translators & • 300,000 interpreters worldwide • Translators regularly • 30-35,000 using Internet • Translators using • 10-15,000 translation memory technology Source: Jeff Allen 18 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  19. 19. 2009 MATI Annual Conference History "Disdain on the side of the professional translators for the hilarious and stupid MT mistakes gave birth to a new variant of MT called translation memory (TM). TM started off as a lower-level feature of commercial MT systems (...). But the success of TM came with dedicated products such as IBM TM/2 and Trados. The marketing message was tuned in to what the professional translation industry wanted to hear: "Forget about MT; it doesn't work well. Instead, use our TM product because it leaves you in full control of the process." Jaap van der Meer in MultiLingual 71 (2005) 19 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  20. 20. 2009 MATI Annual Conference History "The message worked well, within a period of 10 to 15 years, TM products have found their way to the workstations of more than 50,000 translators in the world. But the message had also caused a "cognitive disorder" in the translation industry, namely that TM is good and MT is evil, foregoing the fact that TM is just a new variant of MT (...). The damage is done, however, and it will take years to convince the community of business translators that post-editing fuzzy matches from TM databases is, in fact, not different from post-editing fuzzy matches from any other MT system." Jaap van der Meer in MultiLingual 71 (2005) 20 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  21. 21. 2009 MATI Annual Conference Development • Market demands • Higher effciencies • Evolutionary approach: – Letter – FAX – PC – Internet – Translation memory technology • Spiral goes up: what's next? 21 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  22. 22. 2009 MATI Annual Conference Survey •Do you have more than 1 year experience? 22 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  23. 23. 2009 MATI Annual Conference Survey •Do you have more than 5 years experience? 23 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  24. 24. 2009 MATI Annual Conference Survey •Do you have more than 10 years experience? 24 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  25. 25. 2009 MATI Annual Conference Survey •Do you have more than 15 years experience? 25 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  26. 26. 2009 MATI Annual Conference Survey •Do you have more than 20 years experience? 26 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  27. 27. 2009 MATI Annual Conference Translation Process • Transfer • Conversion • Import • Pretranslation • Translation • Editing • QA • Terminology Management • Back-Conversion/-Transfer 27 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  28. 28. 2009 MATI Annual Conference CAT Tools •Computer Assisted Translation tools: – All tools that aid in the translation process 28 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  29. 29. 2009 MATI Annual Conference CAT Tools – Tool Cat's • Integrated Translation Environments (ITEs) • Translation Management System (TMS): GlobalSight, LTC Organiser, Translation Offce 3000, Vertimus, ]project-open[ • Term extraction tools: Xerox XTS, LogiTerm, TermExtract, SDLPhraseFinder, Okapi • Aligners: AlignFactory, NoBabel AutoAligner • Term Management tools: Bitext2tmx, Olifant • Concordancers: AntConc, WebCONC, Lucon • Word count tools: PractiCount, AnyCount • Localization tools: Catalyst, Sisulizer, Lingobit • Machine translation programs: Apertium, Moses, OpenLogos • Lookup tools: europeana.eu, perseus.tufts.edu, dict.leo.org, dict.cc 29 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  30. 30. 2009 MATI Annual Conference Survey •Have you used CAT Tools in the past? 30 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  31. 31. 2009 MATI Annual Conference Survey •Are you currently using a CAT Tool? 31 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  32. 32. 2009 MATI Annual Conference Survey •Are you planning on using a CAT Tool in the future? 32 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  33. 33. 2009 MATI Annual Conference ITE Categories •Tools with MS Word as their main translation interface •Tools with independent translation interface •Tools for translating online 33 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  34. 34. 2009 MATI Annual Conference Translation Memories The 3 different technologies behind translation memory: •Reference-based matching on segment level (Star Transit) •Database-based matching on segment level (most tools) •Database-based matching on sub-segment level (Multitrans, Logiterm, Find) 34 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  35. 35. 2009 MATI Annual Conference Obstacles: File Types XLIFF, HTML/XHTML, XML, SGML, SVG, PDF, Plain Text, OpenOffce, StarOffce, AbiWord, Corel WordPerfect, CorelDraw, PO/POT (GNU Gettext), CSV, TSV, ASP, PHP, Adobe FrameMaker (MIF), Adobe InDesign (INX), Adobe PageMaker, Adobe Illustrator, Adobe Photoshop, Interleaf /Quicksilver, XGate for QuarkXPress, AutoCAD, DocBook, DITA, Java Properties, JavaScript, RTF, Trados Tagged RTF, Trados TTX, MS Offce 2003 XML, MS Offce 2007, MS Visio, MS Publisher, ResX (Windows .NET Resources), RC (Windows C/C++ Resources), TMX, TBX, XLIFF, SRX, xml:tm, W3C ITS 35 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  36. 36. 2009 MATI Annual Conference Obstacles: Standards • XLIFF • Orgs: • TMX – ISO • TBX – LISA – OASIS • SRX – ATA • xml:tm – FIT • ITS – ITI 36 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  37. 37. 2009 MATI Annual Conference Obstacles: Other  Interoperability  User Interface  Integration 37 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  38. 38. 2009 MATI Annual Conference Strategies •SWOT •Six Questions •See-Think-Draw – Situation – Goals – Route 38 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  39. 39. 2009 MATI Annual Conference SWOT 39 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  40. 40. 2009 MATI Annual Conference SWOT • Strengths: attributes of the organization that are helpful to achieving the objective. • Weaknesses: attributes of the organization that are harmful to achieving the objective. • Opportunities: external conditions that are helpful to achieving the objective. • Threats: external conditions which could do damage to the business's performance. Source: Wikipedia 40 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  41. 41. 2009 MATI Annual Conference Six Questions •Who •What •Where •When •Why •How 41 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  42. 42. 2009 MATI Annual Conference Tools with MS Word as their main translation interface • Anaphraseus (http://anaphraseus.sourceforge.net/) • Anymem (http://www.anymem.com/) • MetaTexis (http://www.metatexis.com) • MultiTrans (http://www.multicorpora.com) • Similis (http://www.lingua-et-machina.com) • Snowball (http://www.trmem.com) • TinyTM (http://tinytm.sourceforge.net/) • Trados (http://www.trados.com) • Wordfast (http://www.wordfast.net) • WordFisher (http://www.wordfsher.com) 42 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  43. 43. 2009 MATI Annual Conference Anaphraseus 43 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  44. 44. 2009 MATI Annual Conference Anymem 44 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  45. 45. 2009 MATI Annual Conference MetaTexis 45 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  46. 46. 2009 MATI Annual Conference MultiTrans 46 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  47. 47. 2009 MATI Annual Conference Snowball 47 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  48. 48. 2009 MATI Annual Conference TinyTM 48 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  49. 49. 2009 MATI Annual Conference Trados 49 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  50. 50. 2009 MATI Annual Conference Wordfast 50 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  51. 51. 2009 MATI Annual Conference Wordfsher 51 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  52. 52. 2009 MATI Annual Conference Tools with independent translation interface • Across (http://www.across.net) • AppleTrans (http://www.apple.com/) • Cafetran (http://www.cafetran.republika.pl/) • Catalyst (http://www.alchemysoftware.ie/) • Déjà Vu (http://www.atril.com/) • Esperantilo (http://www.esperantilo.org/tm/index.html) • ForeignDesk (http://www.foreigndesk.net/) • Fusion (http://www.jivefusiontech.com/) • Heartsome (http://www.heartsome.net/) • Kbabel (http://kbabel.kde.org/) • LogiTerm (http://www.terminotix.com/) 52 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  53. 53. 2009 MATI Annual Conference Across 53 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  54. 54. 2009 MATI Annual Conference AppleTrans 54 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  55. 55. 2009 MATI Annual Conference Cafetran 55 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  56. 56. 2009 MATI Annual Conference Catalyst 56 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  57. 57. 2009 MATI Annual Conference Déjà Vu 57 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  58. 58. 2009 MATI Annual Conference EsperantiloTM 58 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  59. 59. 2009 MATI Annual Conference ForeignDesk 59 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  60. 60. 2009 MATI Annual Conference Fusion 60 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  61. 61. 2009 MATI Annual Conference Heartsome 61 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  62. 62. 2009 MATI Annual Conference KBabel 62 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  63. 63. 2009 MATI Annual Conference LogiTerm 63 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  64. 64. 2009 MATI Annual Conference Tools with independent translation interface • MemoQ (http://www.kilgray.com/) • OmegaT (http://www.omegat.org/) • Open Language Editor (http://open-language-tools.dev.java.net/) • SDL Trados (http://www.sdlx.com/) • Swordfsh (http://www.maxprograms.com/) • Sisulizer (http://www.sisulizer.com/) • Transit (http://www.star-transit.com/) • Transolution (http://transolution.python-hosting.com/) • Virtaal (http://translate.sourceforge.net/wiki/virtaal/index) • WordForge (http://www.khmeros.info/drupal/? q=en/download/Translation_Editor) 64 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  65. 65. 2009 MATI Annual Conference MemoQ 65 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  66. 66. 2009 MATI Annual Conference OmegaT 66 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  67. 67. 2009 MATI Annual Conference Open Language Tools 67 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  68. 68. 2009 MATI Annual Conference SDL Trados 68 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  69. 69. 2009 MATI Annual Conference Swordfsh 69 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  70. 70. 2009 MATI Annual Conference Transolution 70 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  71. 71. 2009 MATI Annual Conference Virtaal 71 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  72. 72. 2009 MATI Annual Conference Tools for translating online •Lingotek (http://www.lingotek.com/) •Logoport (http://www.lionbridge.com/) •Pootle (http://translate.sourceforge.net/wiki/po otle/) 72 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  73. 73. 2009 MATI Annual Conference Break 73 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  74. 74. 2009 MATI Annual Conference ITE Features •Leveraging translated content •Building and maintaining terminology databases •Work in complex fle formats •Methods for analysis •Quality assurance •Project management capabilities 74 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  75. 75. 2009 MATI Annual Conference ITEs – The Similarities • Freely confgurable terminology component • Unicode and TMX support (with caveats) • Alignment features, e.g. – AlignFactory – NoBabel AutoAligner • Similar range of supported fle formats (s. Wikipedia) • Concordance searches • Analysis features 75 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  76. 76. 2009 MATI Annual Conference ITEs – Differentiators • Batch vs. individual fle processing • Terminology handling • Support for competing formats / TMs / termbases • Support for co-workers • Quality assurance • Remote connectivity • Code handling • Support and Training 76 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  77. 77. 2009 MATI Annual Conference ITEs – Differentiators •Batch vs. individual fle processing •Déjà Vu vs. Trados 77 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  78. 78. 2009 MATI Annual Conference ITEs – Differentiators •Terminology handling •Transit vs. Déjà Vu vs. Wordfast 78 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  79. 79. 2009 MATI Annual Conference ITEs – Differentiators •Quality assurance •Trados vs. MemoQ 79 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  80. 80. 2009 MATI Annual Conference ITEs – Differentiators •Remote connectivity •Wordfast vs. Lingotek 80 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  81. 81. 2009 MATI Annual Conference ITEs – Differentiators •Support and Training •Purchase = frst step only •Compare to car maintenance 81 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  82. 82. 2009 MATI Annual Conference ITEs – Differentiators •Support and Training – Tool Vendor Contract – Consultant – College classes – ATA seminars – Jost's Translators Toolkit – Websites: translatorstraining.com, proz.com, translatorscafe.com 82 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  83. 83. 2009 MATI Annual Conference Terminology – Prejudices • TM tool = emphasis is on the translation memory ? • More immediate gain: perfect and fuzzy matches • TMs = easily built up by aligning existing translated fle pairs/texts and/or automatically by translating new texts • Construction of terminology databases = tedious 83 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  84. 84. 2009 MATI Annual Conference Terminology – The Reality • The terminology database = investment – defning words and phrases • Grammatically • Contextually • by contrast • "Living dictionaries" for each segment translated • Term entries contain meta data relevance • Consistency throughout workgroups 84 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  85. 85. 2009 MATI Annual Conference Translation and Technology: Machine Translation •Statistically-based machine translation (SBMT) •Rules-based machine translation (RBMT) •Example-based machine translation (EBMT) •Human-assisted machine translation (HAMT) •Hybrid machine translation (HMT) 85 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  86. 86. 2009 MATI Annual Conference Translation and Technology: Machine Translation • Statistically-based machine translation works (sort of) • Rules-based approach does not work on its own • Translation can only be evaluated by its effectiveness, not by some elusive goal of "perfection" • Amount of relevant data can not be translated by humans • Microsoft • Symantec • Systran • AppTek • Promt • Google 86 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  87. 87. 2009 MATI Annual Conference 87 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  88. 88. 2009 MATI Annual Conference 88 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  89. 89. 2009 MATI Annual Conference Survey •Are you planning on using an ITE in the future? 89 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  90. 90. 2009 MATI Annual Conference Survey •Are you planning on using a TMS in the future? 90 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  91. 91. 2009 MATI Annual Conference Survey •Are you planning on using a GMS in the future? 91 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  92. 92. 2009 MATI Annual Conference Survey •Are you planning on using an MTS in the future? 92 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  93. 93. 2009 MATI Annual Conference Trends • Consolidation • Machine Translation (SMT, RBMT, EBMT, HAMT) • Standards (XLIFF, TMX, TBX, SRX) • Translation Management (CMS, GMS) • Groups (Yahoo, Google, LinkedIn) • Blogs (LTD, ThoughtsOnT9n, Masked T9r) • Translators Sites (ProZ, Translators Cafe, Aquarius) • Social Sites (Ning, Facebook, Twitter) • Instant Messaging (SMS, Skype, GTalk) 93 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  94. 94. 2009 MATI Annual Conference Trends • Desktop to Server • Server to Web • Web to Mobile • Pretranslation • Sub-sentence Fuzzy Matching • UX = User Experience • Integration • Collaboration • Open Source 94 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  95. 95. 2009 MATI Annual Conference Survey •Have you attended the ATA Translation Tools Forum in the past? 95 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  96. 96. 2009 MATI Annual Conference Survey •Are you planning on attending the ATA Translation Tools Forum in the future? 96 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  97. 97. 2009 MATI Annual Conference Take-Home •Don't buy a golden hammer •Subscribe to LTD News 97 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  98. 98. 2009 MATI Annual Conference Take-Home 98 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  99. 99. 2009 MATI Annual Conference Acknowledgements Jost Zetzsche, Rosalie Wells Di Wu & MATI ATA Prof'l. Dev. Committee ATA LT Division ATA staff, volunteers You 99 (CC) Creative Commons. Some Rights Reserved September 12, 2009
  100. 100. 2009 MATI Annual Conference Thank you! See you in New York in October! Download slides at <icotext.com>. (CC) Creative Commons. Some Rights Reserved September 12, 2009
  101. 101. 2009 MATI Annual Conference Q&A (CC) Creative Commons. Some Rights Reserved September 12, 2009

×