A generation ago, an emerging group of localization service providers were able to exploit the opportunities of their time and earn rich rewards, leaving the traditional industry in their wake.
We are at a pivotal moment once again. New opportunities with Technology, Data, Metrics and Connectivity are already fueling growth in our industry. There will be winners and losers.
This Webinar covers the market landscape and TAUS activity on:
Translation automation, Language data sharing, Translation quality evaluation, Interoperability
3. Hosts
Jaap van der Meer
Director, TAUS
Rahzeb Choudhury
Operations Director, TAUS
This slide may not be used or copied without permission from TAUS
4. What TAUS is about...
... not solving yesterday’s problems
... not telling you what you already know
BUT...
Thinking ahead of the curve
Researching the future of crosslingual communications
Establishing a platform for industry-shared services
Helping global companies and entrepreneurs in the
translation industry grow their business.
Helping the world communicate better.
5. Who we are. How we work.
Industry Organization
Partner Structure
International team in Amsterdam
Outsourced Development
6. What Inspires TAUS ...
Nature of demand - Translation as a Utility
Translation will be a feature in every app and on every screen. The translation industry is
reaching billions of "new customers", all connected on the internet. For the first time the
translation industry has the opportunity to scale up and become a large mature global
industry offering a wider range of services and solutions.
Sharing data and knowledge - TAUS approach
Sharing data and knowledge on an industry level brings us all to a higher level of
competence and intelligence. We will be competing on a level that really matters, making a
giant leap forward as an industry.
Think of:
the Human Genome Project: a $3.8 Billion investment in sharing data about the human genome
drove $796 Billion in economic impact, and spurred growth in the life sciences industry.
7. TAUS Overview: an Evolving Role
Think Tank (since 2005)
Reports, Best Practices
Research, Directories
Events, User Forums
Workshops
Pushing the industry agenda and sharing knowledge
Industry-Shared Services
1. Technology: Promoting MT (since 2006)
2. Data: Sharing Translation Data (since 2008)
3. Interoperability: Translation API (since 2011)
4. Metrics: Dynamic Quality Framework (since 2011)
Reducing costs and driving innovation and efficiency for members
8. Rationale of Industry-Shared Services
Every sector needs agreement on practices, transactions and
definitions of deliverables in order to build customer confidence and
grow a mature industry.
Think of IATA for the travel industry and SWIFT for the banking
industry.
TAUS focuses on four industry-shared service programs:
1. Technology
2. Data
3. Metrics
4. Interoperability
9. Promoting Translation Automation
Market Landscape
Opportunities are too Great
Why:
Opportunities are too great to ignore. Machine translation technology leads to innovation
inside and outside the translation industry. Embracing the technology will allow
companies to grow and reach many new customers with new services and solutions.
Mind you:
MT technology needs to improve continuously: from generic language translation to
industry-specific, domain training and personalization.
10. Promoting Translation Automation
TAUS Activity
What’s been done
Research: a library of reports, use cases, best practices, technology briefings
Directory: TAUS Tracker with listings of MT and translation and language technologies
MT Users Forum: monthly meeting and knowledge base (FAQ).
Moses Showcases: EC funded events sharing experiences of dozens of Moses users
Latest Report: "Advancing Best Practices in MT Evaluation" report (July 2012)
Focus 2013 - Shared Knowledge
Report: Translation Technology Landscape
Research: test engines with TAUS Data and share reports for benchmarking
MT Showcase: events with MT providers demonstrating the best and the latest
Moses tutorial: online step-by-step training for Moses implementors
Developing talent: collaboration with universities worldwide training test engines with TAUS Data
MT User Forum: developing the FAQ Forum
Want to get (more) involved
Learn from reports and the MT FAQ Forum
Take part in MT User Forums (buyers only)
Take part in MT Showcases
Take part in “Developing Talent” project (for academics)
11. Sharing Translation Data
Market Landscape
Multiply your Benefits
Why:
Multiply the benefits of your translation memories. Give your data and get access to all
the shared industry data to train MT engines and improve translation quality.
Mind you:
Most translations are published on the internet and used already for MT training (e.g.
Google, Bing, Baidu, Yandex). The TAUS Data repository is the only legitimate industry-
shared repository of trusted translation data.
12. Sharing Translation Data
TAUS Activity
What’s been done
Search: Translators find terms, fuzzy and full matches in data repository (500K searches per month).
TM Upload and Download: 4 billion words uploaded and more than 20 billion words downloaded since 2009.
Matrix: increased the volume of downloadable and searchable data from 4 billion words in 370 language pairs to 50 billion words in more than 2,000 language pairs.
API: Search integrated in 10 products.
Focus 2013 – Shared R&D
Private Vaults: all the benefits of TAUS Data without public sharing
Data Collector: a supervised crawler and data aligner, allowing users to collect parallel data from trusted sites and validate the aligned data before uploading to TAUS Data.
Term Extractor: an online feature allowing users to upload a file and retrieve all the matching terms and phrases from the TAUS Data repository.
Matching Scores: semantic clustering.
Data Marketplace: users can ask for bids for data.
Want to get (more) involved
Use TAUS Data to train new language engines or customize engines.
Pool data (members can take 5 times their uploads) or buy credits.
Use API to integrate TAUS Search and TM Upload and Download in your product or web site.
Participate in new Data Marketplace project (see separate presentation).
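The pooling rule above (members can take 5 times their uploads, or buy credits) amounts to simple accounting. The sketch below is only an illustration of that arithmetic, not the TAUS Data implementation: the function name is hypothetical, and it assumes that purchased credits are counted in words.

```python
POOLING_RATIO = 5  # from the slide: members can take 5 times their uploads


def download_allowance(words_uploaded: int,
                       words_downloaded: int,
                       purchased_credits: int = 0) -> int:
    """Return how many words a member may still download.

    Hypothetical accounting: uploads earn POOLING_RATIO words each,
    purchased credits (assumed to be counted in words) are added on top,
    and words already downloaded are subtracted.
    """
    if min(words_uploaded, words_downloaded, purchased_credits) < 0:
        raise ValueError("word counts must not be negative")
    earned = words_uploaded * POOLING_RATIO
    return max(0, earned + purchased_credits - words_downloaded)


# A member who uploaded 1 million words and already took 3 million
remaining = download_allowance(1_000_000, 3_000_000)
```

Under these assumptions the member in the example could still download 2 million words before needing to upload more data or buy credits.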
13. TAUS Dynamic Quality Framework
Market Landscape
Problem:
Quality evaluation (QE) in the translation industry is problematic:
It's static: one translation quality fits all purposes and all content types
It's subjective: it lacks industry-level agreed methods and metrics
The QE problem is aggravated by the introduction of MT technology and the growing need to
translate new content types
Solution:
TAUS proposes:
A dynamic approach to QE
An industry-shared QE platform for MT benchmarking
The TAUS Dynamic Quality Framework brings credibility and transparency to MT quality evaluation,
solving bottlenecks, saving time and costs and increasing efficiency and performance of MT engines.
14. Work Already Done
TAUS Activity
Dynamic Quality Framework Report (November 2011):
Starting in Q1 2011 TAUS explored best practices in translation quality evaluation and researched new methodologies
and metrics. In the report following this study TAUS presents a dynamic QE framework, which offers a more flexible
alternative to the common static QE models. The Dynamic QE framework is based on the three parameters of utility,
time and sentiment (UTS). Download the 32-page report.
DQF Knowledge Base (June 2012):
In consultation with sixteen enterprise members TAUS developed a knowledge base providing a framework of the best-fit
translation quality evaluation models based on content types, intended usage, tools, processes and other variables.
The DQF knowledge base describes seven different QE models: adequacy/fluency, community-based, customer
feedback, error typology, readability, usability, adherence to regulatory markets. See DQF knowledge base.
Content Profiling (June 2012):
The participating enterprise members ranked their different content types using the UTS scoring method. Eight different
content categories were defined based on this scoring. For each category the best suited QE model is assigned. New
users can select one of the eight defined content categories to find recommendations for QE or they can go through an
exercise of UTS scoring to map their content types to the agreed general categories. See content profiling.
DQF Tools (Q3 2012):
In July 2012 TAUS released two different quality evaluation tools for beta-testing by a select number of TAUS
members: a productivity testing tool for machine translation output and a quality evaluation tool (adequacy-fluency and
error typology). These tools will be released for general use by all TAUS members in the fourth quarter of 2012.
15. Plans Going Forward
TAUS Activity
Benchmarking (Q1 2013):
In the first quarter of 2013 TAUS wants to release benchmarking services on the DQF platform that
will allow users to evaluate and compare the quality of MT engines that they develop or use internally
and to also benchmark their MT engines with those of other MT users and developers.
Attributes
Benchmarking data collected are: language pair, industry sector, content category, sentence length,
date, name of engine, company name. Users will be able to compare the performance of their MT
engines against averages in selected groupings based on these attributes.
Evaluation Models and Metrics
The following evaluation models and metrics will be available:
Productivity: time spent per 1,000 words and number of edits per 1,000 words, compared with
productivity on no-match segments
Error typology: number of errors per category
Adequacy: on a scale from 1 to 5
Fluency: on a scale from 1 to 5
Readability: on a scale from 1 to 5
Better – Worse: e.g. 5 (better) to 1 (worse)
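The productivity metrics and the comparison against group averages described above can be sketched roughly as follows. This is a minimal illustration, not the DQF platform's implementation: the record fields loosely mirror the attributes listed on this slide, while the class name, function names and sample figures are hypothetical.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Evaluation:
    # Fields loosely mirror the benchmarking attributes on the slide;
    # the names themselves are illustrative assumptions.
    language_pair: str
    industry_sector: str
    engine: str
    words: int       # words post-edited in the test
    seconds: float   # time spent post-editing
    edits: int       # number of edits made


def per_thousand(value: float, words: int) -> float:
    """Normalize a raw count or duration to a per-1,000-words rate."""
    if words <= 0:
        raise ValueError("words must be positive")
    return value * 1000 / words


def group_average(records, language_pair, industry_sector):
    """Average time and edits per 1,000 words for one grouping."""
    group = [r for r in records
             if r.language_pair == language_pair
             and r.industry_sector == industry_sector]
    if not group:
        return None  # no data for this grouping
    return {
        "seconds_per_1000": mean(per_thousand(r.seconds, r.words) for r in group),
        "edits_per_1000": mean(per_thousand(r.edits, r.words) for r in group),
    }


# Made-up sample data for illustration only
records = [
    Evaluation("en-de", "software", "engine-a", 2000, 3600.0, 240),
    Evaluation("en-de", "software", "engine-b", 1000, 2400.0, 180),
    Evaluation("en-fr", "software", "engine-a", 1500, 2700.0, 150),
]
benchmark = group_average(records, "en-de", "software")
```

A user could then compare the per-1,000-words figures of a single engine with `benchmark` to see whether it performs above or below the average of the selected grouping.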
16. Use Cases – KB and Content Profiling
TAUS Activity
Target users
The Knowledge Base and Content Profiling services are aimed at all translation professionals.
Access to DQF Knowledge Base (June 2012)
Use case: User needs to identify best practices, templates, tools and metrics for new, more dynamic
quality evaluation models. The DQF Knowledge Base gives a good overview of the industry's
collective intelligence on translation quality evaluation.
Benefits:
reduced overhead costs for QE
industry-referenceable best practices increase credibility and transparency of QE.
Content Profiling (June 2012)
Use case: User needs to identify best matching QE model for content type and best matching content
types for MT technology.
Benefits:
industry-referenceable best practices increase credibility and transparency for MT usage.
17. Use Cases – MT Evaluation and Benchmarking
TAUS Activity
Target users
The MT Evaluation and Benchmarking services are aimed at MT users and developers.
MT Internal Evaluation (October 2012)
Use case: User needs to compare the quality and performance of different MT engines in order to
select the best engine for their own use. S/he can upload the machine-translated files, fill in the
attributes, assign evaluators/editors and retrieve metrics for comparison. Options are to do productivity
testing, error typology, adequacy/fluency and better-worse evaluation.
Benefits:
industry-standard tools and metrics increase reliability of the evaluation scores
vendor-independent evaluation increases flexibility and reliability
reduced overhead costs for MT evaluation
MT Industry Benchmarking (January 2013)
Use case: User needs to compare the quality and performance of their (best) MT engines for
business analysis. Options are to benchmark productivity scores, error typology, adequacy/fluency
and better-worse evaluation scores.
Benefits:
Benchmarking data are essential for translation strategy decisions
Benchmarking data help to continuously improve the performance of MT engines
Benchmarking data help to set referenceable industry data for productivity
18. Translation Interoperability
Market Landscape
Why:
Resources, files and jobs need to "travel" without friction between vendors and tool platforms for customers
and users to be happy, grow their business and increase their translation volumes.
Mind you:
Standards move very slowly. It is extremely complex to deal with numerous file formats and different
interests from so many parties in the legacy translation industry.
Translation volumes are shifting from file-based to web-based: many small jobs and transactions on
the internet rather than heavy doc sets.
TAUS is now focusing on interoperability of web-based translation activities.
19. Translation Interoperability
What’s been done
Technical Committees: TAUS participates in TCs for XLIFF and TMX standards.
Translation Technology Roundtable: TAUS organized a first meeting exclusive for translation technology providers.
Translation API: Defined and reviewed translation services API specifications with TAUS members.
Launched Translation API and validation service.
Focus 2013
Translation API: provide a self-help kit to integrate translation API in workflows.
Translation Technology Roundtable: Bring industry players from supply side together in focused events.
Want to get (more) involved
Implement and help enhance the Translation API.
Join the Translation Technology Roundtable events (for technology providers).
Contribute to Translation Technology Landscape Report.
20. Massive Growth in Business Use of APIs
http://blog.programmableweb.com/2012/05/22/6000-apis-its-business-its-social-and-its-happening-quickly/
21. TAUS Events
Conference Program
TAUS QE Summit (March 14-15)
Hosted by Microsoft in Dublin
TAUS Tokyo Executive Forum (April 11-12)
Hosted by Oracle Japan
TAUS Asia Summit (September)
Park Plaza Hotel, Beijing
TAUS European Summit (June 10-11)
Westin Hotel, Dublin
TAUS User Conference (October 7-8)
Governor Hotel, Portland (OR)
“Virtual Meetings”
MT User Forums
Teleconference meeting for MT Users, every last Thursday of even months
DQF User Forums
Teleconference meeting for DQF Users, every last Thursday of odd months
Quarterly Member Webinar
Once a quarter we invite all members for a briefing webinar on industry-shared programs
Semi-Annual Webinar
Twice a year we hold a webinar for all candidate members to share updates on industry perspectives
22. Your feedback
Which of the TAUS action lines is most relevant and
useful for you:
1. Promoting Translation Automation
2. Sharing Translation Data
3. Dynamic Quality Framework for Quality Evaluation
4. Translation Interoperability
23. Come to the TAUS events in 2013
24. Thank you. We hope you found the session helpful.
For more information, contact us at:
memberservices@translationautomation.com