Pre:D Say What You Mean

2,866 views

Published on

The right term for translation - every time.

Published in: Data & Analytics
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
2,866
On SlideShare
0
From Embeds
0
Number of Embeds
2,780
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Pre:D Say What You Mean

  1. 1. Pre:D stands for “predisambiguation” – finding the exact meanings of terms in the original language, before translation, to select the right vocabulary in other languages.
  2. 2. disambiguation: figuring out the right sense of a term that has multiple meanings When you say “seal”, do you mean: or
  3. 3. Nicole: We got a letter from Champittet!
  4. 4. Nicole: We got a letter from Champittet! Tati (Daddy): Did you read that word, or did you recognize the seal?
  5. 5. Nicole: We got a letter from Champittet! Tati (Daddy): Did you read that word, or did you recognize the seal? Nicole: Tati, that’s not a seal, it’s a dolphin!
  6. 6. While reading a website in French, I came across a word I didn’t know. I clicked the button for Google to translate. Because my Google was set to Swahili, this is the result I saw. I was mystified, because “plagi” is an electrical plug or outlet.
  7. 7. This picture from Wiktionary shows the idea that translates to Swahili as “plagi”. Why was Google telling me that Innovaud will help Kamusi establish totally new electrical sockets?
  8. 8. Switching the translation language to English, you can see what happened. The French word was translated as “outlet”.
  9. 9. “Outlet” is a fine suggestion.
  10. 10. However, “outlet” has a lot of different meanings in English. Those different meanings translate to a lot of different terms in French.
  11. 11. Those different meanings also translate to a lot of different terms in Swahili. And the different meanings of “outlet” will be translated to a variety of different terms within each other language. Look up “outlet” in an English-Italian dictionary: “presa” is the top result. That makes sense. “Electrical outlet” is statistically your most likely choice. “Commercial outlet” is the bottom of the list.
  12. 12. Here is Google saying in Italian that Innovaud will help establish completely new electrical sockets. But I didn’t go to Innovaud.ch because we have a new electrical socket. I went because Kamusi has a completely new approach to inter-language communication, which will revolutionize the translation industry when we bring it to market.
  13. 13. To translate something, a source document must be analyzed to find the possible meanings of a term in its original language. The problem with current Machine Translation (MT) is that meaning is selected based on estimated frequency across millions of known uses. If “outlet” is electrical 70% of the time in the literature, that sense is often a good bet.
  14. 14. Look at Google’s prediction graphs for their translation of “the spring in her step”. You can see how good bets end badly. To the right you can see Google’s estimated likelihoods for “spring” and “step” in French. Below, you can see those probabilities in action.
  15. 15. In state-of-the-art statistical translation, the machine makes its best guess. It waves a magic wand, and… Presto: PRESA, PLAGI, electrical outlet. At Kamusi, we know what the possible meanings are, for “outlet” and 100,000 other English words. We have separated out the translation options for over 200,000 specific meanings, with more than 10 million links among 1.2 million terms in twenty languages, and counting.
  16. 16. Pre:D will show you all the definitions that match the word in your source language, sentence by sentence. We’ll rank your options statistically, but you’ll be the final judge. You choose your meaning. Even if the expression is multiple words, like “spring in her step”. Even if a word is a grammatical form, like “drove” and “driving”. Even if the expression has multiple words and grammatical forms, separated by many other words, like “drives [everybody she meets] crazy”. We give you the matching concept in your translation language. English, French, Swahili, Finnish, Thai …. any language we’ve got, and we’re adding languages all the time.
  17. 17. For example: ◎ early (for a meeting) ◉ early (in the day) ◎ early (in formative stages) Kamusi gives you translations that have already been mapped by human intelligence. Pre:D can then integrate with MT software to put those words into well-structured sentences, in one translation language or one hundred. And if we do not already have the terms you need in the languages you need, we have innovative systems to quickly learn your words from native speakers.
  18. 18. Imagine a company that needs to translate one document to 80 languages. With Pre:D, they predisambiguate once, and all their translators know the intended meanings of every word in the source document. And each translation team member sees the recommended vocabulary for their target language. Imagine someone in Lausanne chatting in real time with a business client in Shanghai. With Pre:D, they select their meaning in French, and they are certain that the client understands their meaning in Chinese.
  19. 19. The Language Services Market: • $35 billion globally (Common Sense Advisory, 2014) • 5% annual growth • Current industry serves major language pairs, particularly with English. Pre:D will offer unparalleled vocabulary accuracy for these translators • Kamusi technology will also enable thousands of valuable new non-English translation pairs (theoretical maximum for all 7000 world languages is 24,496,500 pairs – the chart on the left shows early parallel-word figures from our Big Data Beta) telephone 1916 telephone 2016 translation 2016 translation 2026
  20. 20. Expected Users: • Multinational organizations – need to translate same document to many languages • One person makes sense of the document • Translators make sense in their target languages • Professional translators – homing in on precise vocabulary, including terminology and idioms • Language-aware civilians – much more control than Google Translate • Non-English pairs – thousand of new combinations, eliminates most noise from English pivot • English<->underserved languages – kamusi is gathering new parallel data for 100s of languages
  21. 21. Kamusi Pre:D No guessing No magic wands In one language after another, just one thing that will erupt the language industry: The right translation

×