Predictive Analysis in Machine Translation is Business Intelligence.

•Download as PPTX, PDF•

0 likes•553 views

Tony O’Dowd (KantanMT). KantanMT enables its community to generate meaningful business intelligence that helps them identify the scope of their customised machine translation projects. More importantly, it helps them schedule and scale those projects to achieve maximum translation productivity and a positive ROI.

Presentations & Public Speaking

Tony O’Dowd
Founder & Chief Architect
tonyod@kantanmt.com
Predictive Analysis in
Machine Translation
is Business Intelligence

What we aim to cover today?
 What is KantanMT.com?
 Types of Quality Estimation
 Comparative Quality Estimations
 Predictive Quality Estimations
 Benefits to Industry
 Product Scope Determination
 Tiered Pricing Capabilities
 Conclusions

What is KantanMT.com?
 Statistical MT Platform
 Cloud-based
 Highly scalable
 Inexpensive to operate
 Fusion of TM & MT & rules
 High speed, high quality
translations
 Our Vision
 To put Machine Translation
 Customisation
 Improvement
 Deployment
 into your hands
Active KantanMT Engines
7,783
Training Words Uploaded
143,078,042,293
Member Words Translated
4,259,399,846
www.kantanMT.com

Types of MT Quality Estimation
 Comparative MT Quality Estimation
 Uses a reference translation to calculate:-
 Word recall & precision
 Text Similarities
 Word Order correlations
 Linguistic similarities

 F-Measure Score
 Recall & Precision calculation
 Closely linked to the relevancy of word selection
for MT systems
Types of MT Quality Estimation
KantanBuildAnalytics™

 BLEU Score
 Improvement upon F-Measure
 Takes word-order into consideration
 Linked to a sense of translation ‘fluency’
Types of MT Quality Estimation
KantanBuildAnalytics™

Types of MT Quality Estimation
 TER Score
 A method to help in predict the post-editing effort
 TER is quick to use and correlates highly with actual post-
editing effort
KantanBuildAnalytics™

Types of MT Quality Estimation
 Useful for
 Engine Development
 Baseline measurements
 Determination of ‘possible’ engine
quality and relevancy
 Reference set of comparative
translations required
 Does not work on unseen translations
 Of limited use in determining
 PE effort
 Resources
 Costs
Kantan BuildAnalytics™

Kantan TotalRecall – Advanced TM
% of TM hits in this job
KantanMT – automated translations
% of automated translations for this job
Range of QE Scores
QE range defined to match existing fuzzy match ranges used by
L10N industry
Quality Estimation Scores
Segment level QE scores – akin to fuzzy match scores
Word Counts – Project Stats
Can be used to develop Project TimeLine and Tiered Pricing
Model for Post-Editing Projects
Placeholder & Tag Counts
Used by PM for complexity sur-charges
Types of MT Quality Estimation
KantanAnalytics™

Types of MT Quality Estimation
 KantanAnalytics™
 No Reference set reqd.
 Predictive, not comparative
 Benefits
 Tiered Pricing Model
 Prioritise PE activity
 Schedule
 Resources
 Cost
 Seamlessly integrated into all
CAT tools
KantanAnalytics™ - a predictive quality
estimation technology

Tony O’Dowd
Founder & Chief Architect
tonyod@kantanmt.com

Predictive Analysis in Machine Translation is Business Intelligence.

What's hot

Nesma autumn conference - the gains of unit based pricing - Sytse van der Schaaf

Nesma

Introduction KPIs and OEE

JvdZ

The challenge of IT Outsourcing

Nesma

EAMT Workshop 2015 - KantanMT

kantanmt

Nesma autumn conference - Contracting & Performance management - Cees Kuijpers

Nesma

Analytics and Data as a Keystone Technology for Translation Companies, Doron ...

TAUS - The Language Data Network

The Nesma perspective on FSM automation

Nesma

Tmj disorders

Indian dental academy

Getting From Understanding to Execution: Making Implicit Processes Actionable...

Nathaniel Palmer

Building Advice energy assessment technology

AirAdvice

Ac2017 3. cast software-metricsincontracts

Nesma

Workforce Management & BPM Integration

Nathaniel Palmer

Dimensional planning (Devoxx 2009)

inxin

Translation is supposed to be unmeasurable. However, as the famous statistician David S. Moore said, “If you don’t know what to measure, measure anyway. You’ll learn what to measure.” In fact, measuring allows decision-makers to reduce uncertainty and the risk of wasting money. During the presentation, I will go through the following questions that must be answered before making a measurement: What decision a measurement is supposed to support? What is the definition of the thing being measured? How does this thing matter to the decision? What is the current level of uncertainty? What is the value of additional information?

Measuring Translation, Luigi Muzii, sQuid

TAUS - The Language Data Network

Erp for construction

eresource infotech pvt ltd

Process automation for Technical Writing

Amsi Academy

BRD document for test automation estimation

Software Testing Board

Nesma autumn conference 2015 - Bye bye productivity, hello Business Value - F...

Nesma

Selling commercial Solar +/or Energy Storage solutions? You need GridMAP!

Iain Beveridge

Ac2017 2. added value!

Nesma

What's hot (20)

Nesma autumn conference - the gains of unit based pricing - Sytse van der Schaaf

Introduction KPIs and OEE

The challenge of IT Outsourcing

EAMT Workshop 2015 - KantanMT

Nesma autumn conference - Contracting & Performance management - Cees Kuijpers

Analytics and Data as a Keystone Technology for Translation Companies, Doron ...

The Nesma perspective on FSM automation

Tmj disorders

Getting From Understanding to Execution: Making Implicit Processes Actionable...

Building Advice energy assessment technology

Ac2017 3. cast software-metricsincontracts

Workforce Management & BPM Integration

Dimensional planning (Devoxx 2009)

Measuring Translation, Luigi Muzii, sQuid

Erp for construction

Process automation for Technical Writing

BRD document for test automation estimation

Nesma autumn conference 2015 - Bye bye productivity, hello Business Value - F...

Selling commercial Solar +/or Energy Storage solutions? You need GridMAP!

Ac2017 2. added value!

Viewers also liked

The music-loving Baltic countries are a multilingual hotspot in Europe, with the majority of citizens speaking (and singing) three languages on a daily basis. At the same time, the melodious Baltic languages are famously complex and morphologically rich, containing lots of ambiguity and intricate word agreements. Taken together, these factors make the region a prime spot for driving innovation in language technologies. Tilde, a language technology company specializing in custom MT and terminology services, has leveraged its extensive linguistic experience in the Baltic region to create custom MT systems for a wide variety of languages and domains, helping EU and global companies to boost translation productivity and make their applications multilingual. Tilde recently embarked on the challenging task of building a large-scale MT service for the Latvian government, Hugo.lv. This service was adapted to create a communication tool for the 2015 EU Presidency. The presentation will introduce the audience to languages and MT in the Baltic region and highlight these two case studies, which showcased the crucial role of language technology in enabling multilingual communication in the digital age.

Why the Baltics are a prime region for driving innovation in language technol...

TAUS - The Language Data Network

Plantilla hecha bien 2

laura lopez sanchez

Nietzsche

crisandriu

شهاده خبره محمد جلال

Mahmoud Aly

Quality estimation: the Holy Grail in the MT scene (Gábor Bessenyei, CEO of M...

TAUS - The Language Data Network

Notas ingles

Bryan Ivan

In all of our translation production activities we are producing data, lots of data. We are not talking now about the actual translations that are stored as translation memory data. These translation memory data have proven to be very valuable over the years and recently again as training data for Machine Translation engines. But in this session we are talking about the other data: data about the translation process. How much time was spent on different tasks, for different languages, content types, per project? What was the quality score for the translator, for the vendor? What was the user feedback on this machine translated support article? How is our MT engine performing? And has it improved since last year, since we have added 13 million more words in the training set? Some of the buyers and providers of translation are further ahead with the use of all these translation management data than others. The TAUS Dynamic Quality Framework (DQF) tracks translation management data through plug-ins that are already available for various translation tools and platforms. The vision is becoming very clear: the translation industry can have its own “Big Data”. In the past couple of months TAUS enterprise members have contributed their wishes and requirements for an industry benchmarking platform for translation quality and productivity. In this session several TAUS members will share and discuss their plans for using DQF and the Quality Dashboard. What data would you like to track? Session host: Daniel Goldschmidt (Microsoft) Presenters and panelists are: Annya Sedakova-Bertram (EMC), Fred Tuinstra (Lionbridge), Achim Ruopp (TAUS)

What Data would you like to Track? - Fred Tuinstra

TAUS - The Language Data Network

In this session, with clear focus on Machine Translation (MT) quality, we will discuss different ways to improve MT engines. Which engine do you use and how do you measure improvement? What are the right metrics to evaluate MT quality for the specific content types? How do you interpret and act on the evaluation results? It's fine when errors are labeled and analyzed, but how can that help improve your engine? Are there best practices available? And how about Neural MT? Should we measure that differently? After some use cases shared by the speakers, these questions will be addressed in the break-out session.

Topic 2: How to Pump up Your MT Quality (5)

TAUS - The Language Data Network

Data and Linguistics: Delivering Machine Translation with Subject Matter Expe...

Iconic Translation Machines

Improving Translator Productivity with MT: A Patent Translation Case Study

Iconic Translation Machines

Formatos capacitacion aplicados

luisafernandaalex

Viewers also liked (11)

Why the Baltics are a prime region for driving innovation in language technol...

Plantilla hecha bien 2

Nietzsche

شهاده خبره محمد جلال

Quality estimation: the Holy Grail in the MT scene (Gábor Bessenyei, CEO of M...

Notas ingles

What Data would you like to Track? - Fred Tuinstra

Topic 2: How to Pump up Your MT Quality (5)

Data and Linguistics: Delivering Machine Translation with Subject Matter Expe...

Improving Translator Productivity with MT: A Patent Translation Case Study

Formatos capacitacion aplicados

Similar to Predictive Analysis in Machine Translation is Business Intelligence.

Maximising Machine Translation Return on Investment (KantanMT/Medialocate)

kantanmt

Working with MOSES and building high quality MT systems is not for the faint hearted. It requires a wide range of technical and linguistic based knowledge that is often difficult to find and develop within organisations. Consequently, only the biggest organisations have the financial muscle to invest and reap the awards of MT. This puts the small-to-medium sized organisations at a distinct disadvantage. KantanMT changes everything! KantanMT is a cloud-based implementation of MOSES which enables SMEs to embrace the advantages of MT - quickly and economically. This presentation will demonstrate the KantanMT approach to rapid engine training and tuning, data analytics used to predict MT quality and create tiered pricing structures and instantaneous engine deployment - all of which are driving the new MT Revolution!

TAUS MT Showcase 2014, Enabling MT for the Everyone! Tony O’Dowd, KantanMT

TAUS - The Language Data Network

This presentation is a part of the MosesCore project that encourages the development and usage of open source machine translation tools, notably the Moses statistical MT toolkit.   MosesCore is supported by the European Commission Grant Number 288487 under the 7th Framework Programme.      For the latest updates go to http://www.statmt.org/mosescore/ or follow us on Twitter - #MosesCore

TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...

TAUS - The Language Data Network

Tony O’Dowd takes us through some of the most innovative technologies offered on the KantanMT.com platform which are helping a growing community of KantanMT users to develop and self-manage custom Machine Translation engines in the cloud. Maxim Khalilov then illustrates bmmt’s journey with Machine Translation on KantanMT. He discusses what they have achieved so far in terms of MT engine development and showcases the value that his team is bringing to their growing international client base through the use of Machine Translation.

New Breakthroughs in Machine Transation Technology

kantanmt

TAUS Evaluating Post-Editor Performance Guidelines

TAUS - The Language Data Network

In this joint presentation, Tony O’Dowd, Founder and Chief Architect of KantanMT and Maxim Khalilov, Technical Lead of bmmt deliver an overview of the MT technology currently available in the language technology market, the challenges of operating MT systems at scale and speed, and their opinions on the future trajectory of MT. Each presentation will be grounded with client examples, and how they’ve successfully integrated MT into their localization workflows. Finally, both presenters will finish off with a 5 point checklist for successful MT deployment based on both the MT provider and LSP point of view. If you have any questions about this presentation or want to get in touch with either company please contact: Louise Irwin, Marketing Specialist at KantanMT (louisei@kantanmt.com) Peggy Linder, Operations Manager at bmmt (peggy.lindner@bmmt.eu)

5 challenges of scaling l10n workflows KantanMT/bmmt webinar

kantanmt

KantanLQR

Poulomi Choudhury

Tony (Chief Architect, KantanMT.com) opens the proceedings with a temporal look at how MT technology has progressed. While embracing Rule Based MT in the 1970s, the industry switched over to Statistical MT around 2002 and is now faced with a new paradigm of Neural MT in 2016. For each technology progression, improved translation quality and fluency were achieved. Summary: https://www.youtube.com/watch?v=19yyDa6mAsc Full video: https://www.youtube.com/watch?v=EtbML0DTNHk

KantanFest: Tony O'Dowd

kantanmt

KantanMT

kantanmt

TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...

TAUS - The Language Data Network

Redefining Business Collaboration

Juan Manuel Mogollón

This slide deck on achieving agile localization for high-volume content with the help of Machine Translation was presented by Tony O’Dowd, Founder and Chief Architect at KantanMT during the annual tcworld conference 2015, which was held in Stuttgart, Germany. It outlines the best practices for developing and implementing a dynamic and agile localization strategy that integrates Custom Machine Translation (CMT) into the localization workflow, with the final aim of developing a scalable localization strategy that makes it possible to create and publish high-volume multilingual content.

How to Achieve Agile Localization for High-Volume Content with Machine Transl...

kantanmt

CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...

ABBYY Language Serivces

Learn the different approaches to machine translation and how to improve the ...

SDL

TAUS Quality Evaluation Summit - 28 May 2015, Dublin Machine translation (MT) has been the hot-topic of the localisation industry for some time now. Buyers of localisation know they should be maximising the use of MT in their workflows, but it can be difficult to decide how, when and where to use it. Building an MT infrastructure and deploying it as part of your workflow comes at a cost – and it can be very difficult to calculate ROI. Quite often, investing in MT can come with a leap of faith. This informative presentation, presented by Tom Shaw from Capita Translation and Interpreting (Capita TI), will talk about how the TAUS DQF tools can be used to evaluate the outputs of MT from two different systems. The results of the Productivity and Quality tests can be used to help benchmark MT outputs, and can be used to help decide which engine to go with (if any), and what the associated ROI can be.

MT Benchmarking and Business Intelligence - Tom Shaw (Capita TI)

TAUS - The Language Data Network

Machine Translation (MT) has experienced a surge in popularity in recent years. However, achieving the right level of quality output can be challenging, even for the most expert MT engineers. MT engines learn from carefully selected bilingual and monolingual training data, and engine quality is enhanced through the use of terminology, fine tuning and a series of pre and post processing steps. Since these practices have a significant effect on the results of an MT workflow, it’s important to map out each step and develop a clear training strategy before deploying an MT solution. Joining KantanMT’s Founder and Chief Architect, Tony O’Dowd is Selçuk Özcan, Co-founder of Transistent Language Automation Services. Transistent helps companies invest and integrate new language automation procedures into translation workflows. It is also the first company to focus on MT and quality automation services in Turkey and the Middle East. During this webinar, Selçuk will talk about Transistent’s experience using KantanMT.com to build and deploy high quality KantanMT engines. During this webinar you will learn: • About the potential uses of Machine Translation • Importance of training data and how it impacts on MT quality • Tips for Preparing Training Data for High Quality MT

Tips for Preparing Training Data for High Quality Machine Translation

kantanmt

"Empower" is a buzz word that has been pushed around extensively by many Machine Translation (MT) vendors. Empowerment implies by its very nature that you are required to put in some effort to have the control that empowerment promises and of course that you have the necessary experience and skills required to be empowered. In reality, few MT vendors offer little more than the ability to upload translation memories. True MT empowerment comes by having total control and transparency in the entire customization and translation process. MT empowerment also enables the business as a whole to expand its capabilities and reach by performing tasks that were previously unobtainable with a human only translation approach. This showcase demonstrates on the how Language Studio™ empowers organizations to use MT optimally and strategically by enabling project managers to control and define the MT customization process. Language Studio™ provides a wide range of tools and processes that enable customers to have complete control over their custom MT engines. With the guidance of Language Studio™ Linguists, the process is streamlined with expertise gained from building thousands of custom engines. This expertise is leveraged to meet your specific custom MT requirements. Just like a human translation project, every custom engine is unique and is managed in a similar manner to human translation projects with term definition, style guides and quality assurance.

User Empowered Machine Translation. Dion Wiggins, Asia Online

ABBYY Language Serivces

Lucia Specia - Estimativa de qualidade em TA

I Conferência Internacional de Tradução e Tecnologia

Qtp interview questions_1

Ramu Palanki

Are Function Points Still Relevant?

Premios Group

Similar to Predictive Analysis in Machine Translation is Business Intelligence. (20)

Maximising Machine Translation Return on Investment (KantanMT/Medialocate)

TAUS MT Showcase 2014, Enabling MT for the Everyone! Tony O’Dowd, KantanMT

TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...

New Breakthroughs in Machine Transation Technology

TAUS Evaluating Post-Editor Performance Guidelines

5 challenges of scaling l10n workflows KantanMT/bmmt webinar

KantanLQR

KantanFest: Tony O'Dowd

KantanMT

TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...

Redefining Business Collaboration

How to Achieve Agile Localization for High-Volume Content with Machine Transl...

CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...

Learn the different approaches to machine translation and how to improve the ...

MT Benchmarking and Business Intelligence - Tom Shaw (Capita TI)

Tips for Preparing Training Data for High Quality Machine Translation

User Empowered Machine Translation. Dion Wiggins, Asia Online

Lucia Specia - Estimativa de qualidade em TA

Qtp interview questions_1

Are Function Points Still Relevant?

More from TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...

TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...

TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...

TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...

TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...

TAUS - The Language Data Network

As contents published on the Internet are becoming more and more dominated by videos, requirements on the language translation have also changed. Specifically, video publishers and distributors have a significant interest in balancing both the translation time and the accuracy. To this end, Pactera has invested in solutions, which leverage machine translation to reduce the overall translation time, and recruit human translators to improve the accuracy in a Wikipedia-like fashion. At Pactera, we aim to help video contents to reach billions of people that were not possible before.

Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...

TAUS - The Language Data Network

Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)

TAUS - The Language Data Network

Review processes as the last step in quality assurance workflows are “notorious for causing delays and frustrations”. The reason normally is a flawed process: Many manual steps for the PMs, the lack of intuitive, layout-oriented collaboration software, plus the expectation of review to “fix a broken translation” in the last second rather than giving strategic process input. globalReview shifts this paradigm: As an integrated, collaborative platform with full layout editing it provides a positive review experience. At the same time, it pushes quality upstream applying DQF principles: Flexible content profiles define precise quality expectations; issue categories and scoring effectively gauge and also track translation quality over time; a sampling module allows for fast yet accurate quality evaluation. Put together, this allows the customer to raise the process from painful review to strategic quality management and gain valuable business intelligence.

Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...

TAUS - The Language Data Network

A translation memory P2P trading platform - to make global translation memory...

TAUS - The Language Data Network

The presentation will introduce the NLP technologies used in Shiyibao and the main product features, covering the following points: Function of giving automatic grades for translations based on translation quality automatic evaluation algorithm; Function of giving automatic comments based on rules matching; Function of sorting translations according to their similarity or some specific fragments to dramatically improve the efficiency of reviewing and commenting on translations.

Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...

TAUS - The Language Data Network

In today’s digital economy, content is becoming smaller, more fragmented, and in need of on-demand translation in minutes and around the clock. Traditional localization models are no longer sufficient in meeting these always-on, agile, fast, and small translation requirements of the digital age. This is why mobile translation services like Stepes that are able to deliver quality, speed, and scalability are poised to see tremendous growth. During this 6-minute presentation, Stepes will demonstrate live its instant human translation service for micro content. Powered by human translators from around the world, Stepes is the world’s first mobile translation ecosystem delivering quality translation services using a networking model similar to Uber and Lyft.

Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...

TAUS - The Language Data Network

Farmer Lv (TrueTran)

TAUS - The Language Data Network

Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...

TAUS - The Language Data Network

Computer Aided Translation Training System (CATS) provides a package solutions to the problems of translation translation. CATS combines artificial intelligence, data collection, and visualization of information technology, which makes the translation teaching, class management and monitoring on one single platform areality. Translation and interpretaton teaching resources on CATS are updated regularly into detailed categories, making the teaching materials easy to access. CATS supports translation and interpretation teaching and practices, company internships as well as scientific research.

The Theory and Practice of Computer Aided Translation Training System, Liu Q...

TAUS - The Language Data Network

Translation Technology Showcase in Shenzhen

TAUS - The Language Data Network

Most of LSPs have not converted the translated bilingual documents to TM till now. Even the LSPs have established TMs, they are also confronted with disordered management of TMs and low efficiency. This report will share the way of quick TM establishment with Tmxmall Cloud-Based Smart Aligner, the way of Management of large-scale TMs with Private Cloud-Based TM for achieving pre-translation with large-scale TMs and team cooperation and etc.. Besides, the report will introduce Tmxmall TM marketplace, which is expected to promote TM sharing. Finally, we will share the experience of LSPs on alignment and Private Cloud-Based TM management for reducing translation costs and increasing profits.

How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)

TAUS - The Language Data Network

SDL is the leader in global content management and language translation solutions. With more than 20 years of experience, SDL helps companies build relevant online experiences that deliver transformative business results on a global scale. Translation Industry continues to grow, and Freelancers, LSPs and Corporate clients all see increased demand as more and more content is created, so we have to address them all. As a Market-leading translation productivity tool, SDL Trados Studio is trusted by over 200,000 translation professionals to boost productivity, control quality and aid collaboration. SDL has launched Trados Studio 2017. This presentation will introduce SDL Trados Studio 2017 and highlight SDL’s new productivity booster- UPLIFT, which is well welcomed by global clients.

SDL Trados Studio 2017, Jocelyn He (SDL)

TAUS - The Language Data Network

How we train post-editors - Yongpeng Wei (Lingosail)

TAUS - The Language Data Network

A use-case for getting MT into your company, Kerstin Berns (berns language c...

TAUS - The Language Data Network

QE integrated in XTM, by Bob Willans (XTM)

TAUS - The Language Data Network

More from TAUS - The Language Data Network (20)