The document discusses TAUS's mission to increase the size and significance of the translation industry through open sharing of data and terminology. It highlights four focus areas: technology, data, translation as a utility, and metrics. It notes that terminology work is important but current sources lack resources, quality, and verification. TAUS aims to address these issues by centralizing data, terminology, and crowdsourcing input from language workers and the general public. The goal is to improve machine translation and help the world communicate better by building a central hub for industry-scale sharing of translation resources and data.
2. TAUS Mission Knowledge
Sharing Data &
Our an industry-level in an
mission is to increase
…on
the size and significance
open and transparent
of the translation all to a
landscape brings us
industry to help the
higher level of competence.
world communicate
better.
14. Importance of Terminology Work
14.8%
1.8%
43.5%
Very important
Quite important
Less important
Not important
39.9%
Source: TaaS User Needs Survey, 2012. 1735 responses (approx 40% technical
writers, 30% translators, plus others)
15. Information Sources
Source: TaaS User Needs Survey, 2012. 1735 responses (approx 40% technical
writers, 30% translators, plus others)
16. Information Sources
Source: TaaS User Needs Survey, 2012. 1735 responses (approx 40% technical
writers, 30% translators, plus others)
17. Information Sources
Source: TaaS User Needs Survey, 2012. 1735 responses (approx 40% technical
writers, 30% translators, plus others)
18. Information Sources
Source: TaaS User Needs Survey, 2012. 1735 responses (approx 40% technical
writers, 30% translators, plus others)
19. Information Sources
Source: TaaS User Needs Survey, 2012. 1735 responses (approx 40% technical
writers, 30% translators, plus others)
20. Main Problems
9.4%
Lack of
resources/Insufficient
terminology
management
Poor quality/Up-todateness
20.6%
Lack of information
12.2%
36.0%
11.5%
10.3%
Lack of convincing
verification/Misleading
information online
Rest
Source: TaaS User Needs Survey, 2012. 1735 responses (approx 40% technical
writers, 30% translators, plus others)
35. TAUS Mission Knowledge
Sharing Data &
Our an industry-level in an
mission is to increase
…on
the size and significance
open and transparent
of the translation all to a
landscape brings us
industry to help the
higher level of competence.
world communicate
better.
36. Central Sources of Data and Terminology
For language workers, CAT Tools & MT Systems
Own Data – Private Vault
Shared Data – In domain data
Web Data – Data Collector
Own Terms – Build Own Collections
Shared Term – In-domain terms
Web Terms – Term Collector
But what about the crowd?
37. Main Problems
9.4%
Lack of
resources/Insufficient
terminology
management
Poor quality/Up-todateness
20.6%
Lack of information
12.2%
36.0%
11.5%
10.3%
Lack of convincing
verification/Misleading
information online
Rest
Source: TaaS User Needs Survey, 2012. 1735 responses (approx 40% technical
writers, 30% translators, plus others)
38. Central Sourcing of Data and Terminology
But what about the crowd?
The crowd must source!
The crowd must verify!
Web Data – Data Collector
Web Terms – Term Collector
39. Too many sources.
Takes time. to
We maintain the
Unless the crowd helps
source is duplicated.
and verify…….
Effort
status quo..
Results questionable.