SDL provides machine translation solutions to customers. They have a team of over 50 professionals across various locations that work on driving MT adoption, building custom engines, and conducting linguistic projects. SDL's approach involves evaluating data, training machine translation engines, testing outputs, and refining engines through an iterative process with a focus on maximizing quality. They provide customized solutions through domain-specific engines and language verticals to meet the needs of different customers and content types.
High Volume, Rapid Turn Around Localization: Lessons LearnedSDL
Customer success story by Johnson & Johnson on best practices and lesson learned in localization process. Learn the importance of prioritization, in-country review, terminology database, teamwork and collaboration. Delivered at the SDL Customer Success Summit Montreal 2016.
SDL introduces two groundbreaking TM and MT technologies: 1) upLIFT leverages segment fragments from TM in new ways, and 2) AdaptiveMT self-learns in real time from translators’ output, based on each post-edit sent back to the engine. Carla Schelfhout (SDL)
MT best practices for price, speed AND quality, as well as Lexcelera’s machine translation case studies and services including training, integration, post-editing and hosted MT
High Volume, Rapid Turn Around Localization: Lessons LearnedSDL
Customer success story by Johnson & Johnson on best practices and lesson learned in localization process. Learn the importance of prioritization, in-country review, terminology database, teamwork and collaboration. Delivered at the SDL Customer Success Summit Montreal 2016.
SDL introduces two groundbreaking TM and MT technologies: 1) upLIFT leverages segment fragments from TM in new ways, and 2) AdaptiveMT self-learns in real time from translators’ output, based on each post-edit sent back to the engine. Carla Schelfhout (SDL)
MT best practices for price, speed AND quality, as well as Lexcelera’s machine translation case studies and services including training, integration, post-editing and hosted MT
Jaap van der Meer will present key findings from the MT Market Report that TAUS published. For more information, see: https://www.taus.net/think-tank/reports/translate-reports/mt-market-report-2014
Delivered at the European Patent Office's Patent Information Conference.
November 11th 2015
Miami, Florida.
In this talk, we talk about recent advances in MT for patents and introduce our IPTranslator.com application for on-demand translation.
How to take the most advantage of the translation data of our company over the past 10 years. How to convert these history data to effective language assets and use the combined translation data of the industry in order to improve our translation efficiency and quality, save translation costs, so as to further enhance the core competitiveness of our company in the industry. I will present some difficulties and challenges in the production and use of corpus.
Currently, MT is taking on a new cross-paradigm development characterized by the hybridization of data-driven statistical approaches and linguistic knowledge models. This presentation introduces the structural design of CCID's hybrid MT system including extracting term and lexicon entries from parallel corpora, extracting multi-word expressions (MWEs) in native English from the three-tuple comparable corpora, and new approach in statistical post-editing of Rule-based MT for domain adaptation. Research and Application of MT and language corpora in areas such as Corporate financial narratives will also be discussed in this session.
It’s no longer a question of whether technologies will reshape language service industry; it's just a matter of how quickly and how deeply. The future of translation industry will be prosperously stimulated by the further development of economic globalization and the popularization of internet usage, but clients' requirement will be further diverse in terms of quality, turnaround, volume, content, and cost. To deliver matched services, language service providers need to enable agile translation, machine translation and scalable translation capacity. However, it is hard to deliver all these diverse services without the help of technology and an innovative business model. New technologies, such as workflow automation, online service, crowdsourcing supported by new technologies and internet becoming a competitive edge for LSPs, and new relationships with resources (translators) are also a must after millennial generation dominate the labor market, the succeed of a LSP will be more technologies- and innovations-driven. This presentation will focus on how technology/Internet is reshaping the translation industry in China.
Delivered at Machine Translation Summit during a special workshop on post-editing.
November 3rd 2015
Miami, Florida.
In this talk, we describe the latest advances in the world of commercial and academic machine translation development that are having the effect of improving acceptance of the technology and keeping its users happy.
Delivered at Machine Translation Summit during a special workshop on MT for patent and scientific literature.
October 30th 2015
Miami, Florida.
In this talk, we describe how we adapted machine translation for patents to help a translation company improve their productivity.
As a member of TAUS, Lingosail provides language technologies for LSPs, especially in the intellectual property industry. Aiming at Human-Machine Cooperation we have developed and deployed MT systems. Currently we use MT mainly in three scenarios: post-editing production, terminology extraction and cross-language information retrieval. This presentation will show some of our experiences in these use cases, and some of our expectations for MT's feature utilities.
Brief overview of the history of the controlled language movement and the benefits of using a controlled language; includes examples, references, and resources.
Language Quality Management: Models, Measures, Methodologies Sajan
With growing content, shorter release dates and many target languages, it’s important for global companies to have a process in place to track and measure translation effectiveness. Learn how big companies like Microsoft, Snap-on Diagnostics and Symantec manage translation quality.
Delivered at the 29th LocWorld conference.
October 16th 2015
Santa Clara, CA, USA.
In this talk, we describe how we carried out a successful large scale evaluation and deployment of machine translation at RWS.
Welocalize Throughputs and Post-Editing Productivity Webinar Laura CasanellasWelocalize
Welocalize language tools expert Laura Casanellas details key topics related to human translation and machine translation post-editing, production, throughputs and measuring success. This is the presentation used in a recent online webinar you can find at http://www.welocalize.com/wemt/wemt-webinars/
Topics for this recorded webinar include:
- Defining throughputs for human translation and machine translation post-editing
- How to accurately compare individual throughputs for translating and post-editing
- What are the most common deviations in throughputs
- How to spot progress and performance improvement
- Who really benefits from post-editing
Jaap van der Meer will present key findings from the MT Market Report that TAUS published. For more information, see: https://www.taus.net/think-tank/reports/translate-reports/mt-market-report-2014
Delivered at the European Patent Office's Patent Information Conference.
November 11th 2015
Miami, Florida.
In this talk, we talk about recent advances in MT for patents and introduce our IPTranslator.com application for on-demand translation.
How to take the most advantage of the translation data of our company over the past 10 years. How to convert these history data to effective language assets and use the combined translation data of the industry in order to improve our translation efficiency and quality, save translation costs, so as to further enhance the core competitiveness of our company in the industry. I will present some difficulties and challenges in the production and use of corpus.
Currently, MT is taking on a new cross-paradigm development characterized by the hybridization of data-driven statistical approaches and linguistic knowledge models. This presentation introduces the structural design of CCID's hybrid MT system including extracting term and lexicon entries from parallel corpora, extracting multi-word expressions (MWEs) in native English from the three-tuple comparable corpora, and new approach in statistical post-editing of Rule-based MT for domain adaptation. Research and Application of MT and language corpora in areas such as Corporate financial narratives will also be discussed in this session.
It’s no longer a question of whether technologies will reshape language service industry; it's just a matter of how quickly and how deeply. The future of translation industry will be prosperously stimulated by the further development of economic globalization and the popularization of internet usage, but clients' requirement will be further diverse in terms of quality, turnaround, volume, content, and cost. To deliver matched services, language service providers need to enable agile translation, machine translation and scalable translation capacity. However, it is hard to deliver all these diverse services without the help of technology and an innovative business model. New technologies, such as workflow automation, online service, crowdsourcing supported by new technologies and internet becoming a competitive edge for LSPs, and new relationships with resources (translators) are also a must after millennial generation dominate the labor market, the succeed of a LSP will be more technologies- and innovations-driven. This presentation will focus on how technology/Internet is reshaping the translation industry in China.
Delivered at Machine Translation Summit during a special workshop on post-editing.
November 3rd 2015
Miami, Florida.
In this talk, we describe the latest advances in the world of commercial and academic machine translation development that are having the effect of improving acceptance of the technology and keeping its users happy.
Delivered at Machine Translation Summit during a special workshop on MT for patent and scientific literature.
October 30th 2015
Miami, Florida.
In this talk, we describe how we adapted machine translation for patents to help a translation company improve their productivity.
As a member of TAUS, Lingosail provides language technologies for LSPs, especially in the intellectual property industry. Aiming at Human-Machine Cooperation we have developed and deployed MT systems. Currently we use MT mainly in three scenarios: post-editing production, terminology extraction and cross-language information retrieval. This presentation will show some of our experiences in these use cases, and some of our expectations for MT's feature utilities.
Brief overview of the history of the controlled language movement and the benefits of using a controlled language; includes examples, references, and resources.
Language Quality Management: Models, Measures, Methodologies Sajan
With growing content, shorter release dates and many target languages, it’s important for global companies to have a process in place to track and measure translation effectiveness. Learn how big companies like Microsoft, Snap-on Diagnostics and Symantec manage translation quality.
Delivered at the 29th LocWorld conference.
October 16th 2015
Santa Clara, CA, USA.
In this talk, we describe how we carried out a successful large scale evaluation and deployment of machine translation at RWS.
Welocalize Throughputs and Post-Editing Productivity Webinar Laura CasanellasWelocalize
Welocalize language tools expert Laura Casanellas details key topics related to human translation and machine translation post-editing, production, throughputs and measuring success. This is the presentation used in a recent online webinar you can find at http://www.welocalize.com/wemt/wemt-webinars/
Topics for this recorded webinar include:
- Defining throughputs for human translation and machine translation post-editing
- How to accurately compare individual throughputs for translating and post-editing
- What are the most common deviations in throughputs
- How to spot progress and performance improvement
- Who really benefits from post-editing
Webinar automotive and engineering content 16.06.16kantanmt
High quality translations that are delivered quickly are a result of a seamless and efficient translation process, but getting to this stage requires a well thought out plan, rigorous content preprocessing techniques and most importantly, clear and transparent communication between the automated translation vendor and language service provider.
In this webinar, Christian Taube and Brian Coyle discusses how the Matrix and KantanMT partnership delivers a high quality, scalable solution that increases translation productivity and supports engineering and automotive terminology standards. The webinar uses specific case study examples including a discussion on what types of content to focus on and preparing and managing Translation Memory data. Discussion includes:
• Managing content for best results
• Preparing TM data
• Tools that generate high quality results
WeMT Tools and Processes Welocalize TAUS Showcase October 2013 Localization W...Welocalize
WeMT Tools and Processes, a presentation by Olga Beregovaya at Localization World 2013 in Silicon Valley. Presented during TAUS Showcase. Discussion of automation and machine translation programs. Welocalize is the leader in localization and translation solutions.
Managing Translation Memories for Engineering and Automotive TranslationPoulomi Choudhury
High quality translations that are delivered quickly are a result of a seamless and efficient translation process, but getting to this stage requires a well thought out plan, rigorous content preprocessing techniques and most importantly, clear and transparent communication between the automated translation vendor and language service provider.
In this webinar, Christian Taube and Brian Coyle discusses how the Matrix and KantanMT partnership delivers a high quality, scalable solution that increases translation productivity and supports engineering and automotive terminology standards. The webinar uses specific case study examples including a discussion on what types of content to focus on and preparing and managing Translation Memory data. Discussion includes:
• Managing content for best results
• Preparing TM data
• Tools that generate high quality results
This presentation is a part of the MosesCore project that encourages the development and usage of open source machine translation tools, notably the Moses statistical MT toolkit. MosesCore is supported by the European Commission Grant Number 288487 under the 7th Framework Programme.
For the latest updates go to http://www.statmt.org/mosescore/
or follow us on Twitter - #MosesCore
A summary of the philosophy and approach taken by the TravelBird Data Science team (and company as a whole) that allows rapid development of new machine learning algorithms, data insights, and integration into production and operations.
5 challenges of scaling l10n workflows KantanMT/bmmt webinarkantanmt
In this joint presentation, Tony O’Dowd, Founder and Chief Architect of KantanMT and Maxim Khalilov, Technical Lead of bmmt deliver an overview of the MT technology currently available in the language technology market, the challenges of operating MT systems at scale and speed, and their opinions on the future trajectory of MT.
Each presentation will be grounded with client examples, and how they’ve successfully integrated MT into their localization workflows.
Finally, both presenters will finish off with a 5 point checklist for successful MT deployment based on both the MT provider and LSP point of view.
If you have any questions about this presentation or want to get in touch with either company please contact:
Louise Irwin, Marketing Specialist at KantanMT (louisei@kantanmt.com)
Peggy Linder, Operations Manager at bmmt (peggy.lindner@bmmt.eu)
MT Summit 2013 Welocalize Getting the MT Recipe Right by L Casanellas and L MargWelocalize
Connectivity, Adaptability, Productivity, Quality, Price - Getting the MT (Machine Translation) Recipe Right. MT Summit 2013 Welocalize Presentation in Nice, France. Localization, Translation, Language Tools, Language Service Providers, Automation, Technology
Machine Translation Master Class at the EUATC Conference by Diego Bartolometauyou
Machine translation enables new business models to create new revenue sources for your business. However, integrating it into your workflow might be challenging. In this Master Class, Diego Bartolome will cover the most important aspects and lessons learned during the past seven years, which include technology, people, and processes.
We had 2,784 respondents across 115 countries and in 9 languages, from translation professionals across the entire translation supply chain – enterprises, corporates, government
bodies, language service providers and freelance translators.
New Breakthroughs in Machine Transation Technologykantanmt
Tony O’Dowd takes us through some of the most innovative technologies offered on the KantanMT.com platform which are helping a growing community of KantanMT users to develop and self-manage custom Machine Translation engines in the cloud.
Maxim Khalilov then illustrates bmmt’s journey with Machine Translation on KantanMT. He discusses what they have achieved so far in terms of MT engine development and showcases the value that his team is bringing to their growing international client base through the use of Machine Translation.
MT providers claim that (customized) MT "helps you translate more words and grow your business". It "boosts productivity". And it can even "increase revenues" or "optimize customer service and support". What is the reality? Is MT improving or did we reach a plateau? Speakers at last year's QE Summit agreed: one of the main problems in the translation industry today is the lack of benchmarking. The output of MT engines cannot be compared to industry averages or standards because these are not yet available. Automated scores are meaningless outside the “laboratory”. At the same time, buyers of translation services are increasingly interested in translated content of different quality levels. They also want to know how the different engines are performing on different content types and in different language pairs. How do we know? Can we predict the output quality? Shouldn’t MT providers become more transparent to help buyers of these technologies make informed decisions?
Session leader: Dag Schmidtke (Microsoft)
Panelists: John Tinsley (Iconic), Olga Beregovaya (Welocalize), Olga Pospelova (eBay)
Similar to Learn the different approaches to machine translation and how to improve the quality of your global strategy with machine translation (20)
The Marketer's Dilemma in Today's Global Digital Era - Liesl Leary and Henry ...SDL
Liesl Leary, Product Marketing Director at SDL and Henry Barfoot-Saunt, SVP Northern Europe Sales at SDL present at SDL Connect, Palo Alto, November 2016.
In this presentation you will learn the benefits of a global video strategy, including:
*The different types of video localization, when to use each type, and the relative cost for each method
*Tips for creating videos from the start that are easy to localize
*Project management strategies to make video localization easier, faster, and less expensive
Subtitling, audio dubbing, and transcreation. Tips to conceptualize videos from the start so they are easier and cheaper to localize. Emerging technologies and techniques that separate audio and subtitling from videos, easing the translation management and multilingual deployment. Delivered at the SDL Customer Success Summit Montreal 2016.
Transcreation for Deep Cross-Cultural ConnectionSDL
What is transcreation and how do you determine you need it? Customer success story on overcoming the challenges of the transcreation process delivered at the SDL Customer Success Summit Montreal 2016.
Philips Healthcare: A Case Study. Adoptiong a Test Center Approach to Launch...SDL
As Philips Healthcare was preparing to launch its newest advanced defibrillator/monitor, it wanted to ensure a high-quality user experience for its non-English speaking as well as English speaking customers. Key to that was accurately localizing the software, the voice prompts and remaining user interface for markets around the world.
How to Extend Your Content Marketing Plan to a Global AudienceSDL
You're churning out amazing content and working tirelessly to get that content in front of your best prospects. But if you're only creating content in English, you might be ignoring copious amounts of potential customers. 71% (gasp!) of the global market prefers to be spoken to in a language other than English.
How do you reach these prospects without breaking the bank? Good news: it's not nearly as difficult as you think it is.
View our webinar, “How to Extend your Content Marketing Plan to a Global Audience,” to learn best practices for creating content that's easy to translate and localize. You'll also discover workflow processes that make the whole process easier, so you can realize maximum ROI.
For more detail, please copy and paste the following link itnto the browser of your choice: http://www.sdl.com/olp/language/how-to-extend-your-content-marketing-plan-to-a-global-audience.html
Redefine Your Global Video Strategy: Video LocalizationSDL
The SDL Language presentation on localizing video content to boost brand's global digital marketing effectiveness delivered at the Customer Success Summit Montreal 2015.
Fashion Days with Howard Beader & Andreas Meier at Forrester #CXNYC 2015SDL
At Forrester CXNYC 2015 in New York Howard Beader, Vice President Product Marketing at SDL discussed brand relationships and customer experience. He then joined forces with Andreas Meier, Head of Business Development for Fashion Days Group to talk about customer experience in retail.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
Learn the different approaches to machine translation and how to improve the quality of your global strategy with machine translation
1. SDL Proprietary and Confidential
How to Attain
Maximum Machine
Translation Quality
Rodrigo Fuentes Corradi, MT Consultant
SDL Language Customer Success Summit | June 7, 2016
2. 2
Overview: The SDL MT Team
Who we are
First to commercialize Statistical
Machine Translation
o 50+ Professionals
o Over 10 Nationalities
o Across 5 Time Zones
o 8 Locations
o Computational
Linguists
o Project
Managers
Widespread team of language lovers:
o Data
Specialists
o Post-
Editors
…all gathered from the
four corners of SDL!
What we do
Drive MT Adoption:
Educate, promote and support MT
usage in existing SDL accounts
& new opportunities
o Design
o Create
o Test
o Implement
o Monitor
Custom Engine Builds:
…custom
Statistical Machine
Translation
engines
Linguistic Projects:
Semantic annotation projects
for US Government bodies
& academic institutions
How we do it
o Los Angeles, CA
o Cambridge, UK
Two Research Labs:
o 30+ Production offices
resourcing MTPE
o Custom Training for MTPE
resources
o Investment in Universities
and future supply chain
We’re Evangelists…about
Machine Translation, using
automation to accelerate
productivity
PE Production offices
3. 3
Post-Edit
SDL’s Intelligent Machine Translation (iMT):
Key steps in MT life cycle
Evaluate Train MT Test
SDL Approach
Refine
Engineers Developers ScientistsPost-Editor
Process Workflow
Resource Pool
Computational
Linguists
4. 4
Teamwork for MT success
○ The MT market is undergoing
radical transformation
○ Scepticism remains in terms
of what benefit MT can bring
to business
○ Increasing numbers of mature
MT players opt for a structured
MT approach to match current
communication demands
○ The secret of MTPE success
lies in a step-by-step,
resource-by-resource approach
to Enterprise scale Post-Editing
Account Managers
& Consultants
o Technical consulting
o Research & implement
specific solutions
o Sales support
PJMs
o Communications
o Project coordinate
o Reporting
o Support for
consulting
Linguists
o Prepare
customized
material
o Give trainings
online or on-site
Linguists
o Data cleaning
o Expert training
o Engine testing
o Maintenance
Engineers
o Data evaluation
o Alignment
o Conversion
Translation
Manager
o Consolidate
feedback on quality
o Run PE Certification
to improve quality
SDL MT Team Roles
Post-Edit Training
Engine Building
& Testing
Data Analysis
& Management
Quality Management
Project Management
6. 6
○ Faster throughput without sacrificing quality
○ To meet aggressive turnarounds
○ Ability to handle increasing content volume / volume fluctuation
○ Lower production costs
○ For high volume, MT can be more consistent
The demand for MT solutions is growing quickly & post-editing
is rapidly becoming a basic skill for translators
Why companies use MT post-editing
7. 7
Right translation method, right price, right time
Quality
Volume
Human Translation Machine Translation
Blogs
User Forums
Reviews
Chat
Email
Support
FAQ
Websites
Wikis
Knowledge
Base
Alerts/
Notifications
Help
User
Guides
Documentation
Post-Edit
Newsletters
Advertising Marketing
Legal
Light Post-Edit
8. 8
SDL’s solutions for increasing MT quality
Customized
Engines
Domain
Verticals
Baselines
Language
Verticals
10. 10
Good data for customized engines
How much?
What
content?
What style?
Engineers
Vertical engines or baselines may work better if you don’t have enough or the right type of content
Computational
Linguists
o More is better. The statistical algorithms work better with many
words to analyse. Upwards of one million words for best
success. For very consistent, clean data, half of that may work.
o Content should all be from one content type, using similar
terminology. A mix of content types (e.g., technical
manuals, advertising, etc.) may produce poor results.
o Style should be consistent. The algorithms learn patterns from
similarities, and perform better if data is in similar form. Very
long sentences, or creative and varied styles, can negatively
affect trainings.
11. 11
Types of training data
Bilingual
Parallel
Terminology
Source Only
Target Only
o Core training data: translated content, usually in a translation memory.
This is the content that works best and can be processed the fastest.
o Translated content, but in separate files. This can be used if the content has
been translated exactly, and the format is the same. If for example the
document has extra tables in one language, or has been rewritten
substantially to fit a different market, it is hard to find matching sentences.
o Added to the training data to ensure corporate terms and brands are
translated consistently. This can be a termbase or a simple bilingual word list.
o Representative documents of the content that will be translated. They
are used in initial evaluations of suitability for MT and to test the quality of
the engine. Depending on their size, some 50-100 documents are ideal.
o Representative documents in the translated language. They are used
during the training and contribute to the fluency of the output. To have
an effect, large numbers are needed, several million words are ideal.
12. 12
Goals:
o Enable volume translations
o Migrate content from HT to PE
o Provide accuracy and term
consistency
o Provide productivity increases
Feedback
New MT customization workflow
Utility and / or
Productivity Testing
SDL
Assessment
Client
Request
Engine
Trainings
Auto Eval
Metrics
Data Intake
&
Processing
Blind Human
Evaluation
Deploy
Engine
Method
o Iterative engine trainings, with several
engines created with the best being deployed
o Output matches your style and terminology
o Engines “learn” from your Translation
Memories and terminology
o Work in combination with Baseline language
engines
Post-Editor
Computational
Linguists
14. 14
MT testing approaches
Automated Measures
o Useful to compare competing engines and identify the best engine with a high reliability
o No predictive value for Post-editing productivity but can validate post-editor’s feedback on MT output
o All automated measures have their flaws, but SDL has found a weighted combination of measures that gives
significant insights.
Human - Quality Scoring
o Resources are asked to score the MT output according to instructions, with a focus on understandability.
o Advantage of method: Human evaluation is considered more robust to alternative, but also valid translations.
Note: Human evaluations are prone to subjectivity so you need multiple test subjects. Performing this kind of test is more
expensive and time consuming than an automated approach, but can give an absolute value for one engine, not just a comparison.
Human – Productivity Testing
o Productivity gain for MT is calculated by comparing post-editing speed with conventional translation speed so
evaluators can assess how much value post-editing would add in a production environment.
o Advantage of method: For Post-Editing, results are a good indicator of the suitability of the MT output.
Note: Productivity increase is a difficult factor to predict for all cases and It’s also the most expensive and time consuming
test of the three.
Engineers Developers
MT evaluations should be relevant to your content, from the method of testing
(Automatic vs. Human Evaluation) to the testbed. It should represent true
life scenarios, taking the available Science and applying it commercially.
Computational
Linguists
15. 15
SDL’s custom MT evaluation platform
○ Data is presented to evaluators in a blind test scenario
in order to safeguard validity of results
○ Evaluation speed is recorded per segment
○ Multiple evaluators assess the same set of sentences
○ Each individual performance is compared to ensure
consistency
Additional measures for productivity tests:
○ Productivity increase from HT to PE
○ Translator’s editing actions (insert, copy-paste, pause)
○ Percentage of MT segments that do not require editing
○ Levenshtein edit distance from MT to final translation
1,127
1,510
1,026
1,188
1,123
1,816
1,470 1,414
Client
resource
SDL
resource 1
SDL
resource 2
Total
Speed (WPH)
Human
Baseline
Can evaluate both Sentence level quality & post-edit
productivity gain via a custom testing platform and
ensure the validity of results
3.15 3.04 3.09
3.01 2.92 2.97
0.13 0.12 0.13
evaluator1 evaluator2 Average total
Customization-Baseline: Average scores
Customization
Baseline
Delta
17. 17
Achieving effective post-editing process
Raw output:
Building blocks
are in place
Linguists focus on
refining the output
Terminology &
style are applied
At high volume,
MT can deliver
greater consistency
Trained linguists
certified in MT
post-editing
Post-Editor
18. 18
Post-editing quality guidelines
When post-editing to publishable quality, the following basic principles still apply:
o The same
references must
be used as
for conventional
translation (project-
specific guidelines,
TMs, glossaries,
termbases, etc.)
o Grammar,
spelling and
punctuation
must be correct
o Appropriate
style & correct
terminology
must be used
consistently
o The translation
must read well
and be suitable
for its intended
purpose
Customer
User Guide
19. 19
What is your quality requirement?
Error Category Specific Issue
Translation
($$$)
Publishable PE
($$)
Light PE
($)
Mistranslation Error
Terminology Glossary adherence
Consistency x
Accuracy Omissions/Additions
Language
Grammar x
Spelling x
Punctuation x
Style General Style x
Country
Country Standards x
Register & Tone x
21. 21
Technical
support
Product
development
Product
development
iMT
consultants
Scientific
development
Hotfix
Terms & brands
Python filters to
protect and
transform patterns
Fundamental
problem
Influence long term
scientific strategy
iMT
consultants
Scheduled fix for
future product
release
Analysis of setup,
technical advice
Major tool issue
Minor tool issue
Protected content
translated, wrong
terminology Translation errors
following patterns,
like dates
Expected MT
behaviour
Linguistic
Technical
The effects of post-editor feedback
22. 22
Post-editors identify expected SMT misbehavior
Incorrect
formatting
Additional or
missing words
Words not
localised
Gender, number,
agreement or
verb inflection
issues
Compound
formation issues
Syntax and word
order issues
Wrong
punctuation
Inconsistent or
non-compliant
terminology
Mistranslations
23. 23
Punctuation not following
the specific language rules
Syntax and word order issues
very frequently observed
Inconsistent or wrong terminology
very frequently observed
Examples of unexpected misbehavior
HTML entities instead of the correct
character (i.e. & instead of &)
Words in a language other than
the target
Engineers
Scientists
Post-Editor
Computational
Linguists
25. 25
SDL iMT Group are constantly researching
ways to improve Vertical and Customized
MT Engines
SDL Research Scientists are continuously
improving the Statistical Machine Translation
algorithms (e.g. Language Models, Translation
Models, Reordering Models, Syntax,
Transliteration, Rule-Based Components, etc…)
SDL Data Engineers are
continuously mining large
amounts of good data used
by the statistical algorithms
Continuous improvement
28. 28
Legacy MT systems are static
MT Provider Post-Editor
MT
Engine
xx x xxx
xx xxxxx
xxxx xxx
x x xx x
xxx x xx
PE Edited
xx x xxx
xx xxxxx
xxxx xxx
x x xx x
xxx x xx
MT Output
29. 29
SDL MT innovation – Adaptive MT
○ New technology developed by SDL Research
○ An Adaptive MT engine that learns interactively from
the post-editor’s edits
SDL Adaptive MT Post-Editor
MT
Engine
Adaptive MT
Processor
xx x xxx
xx xxxxx
xxxx xxx x
x xx x xxx
x xx
PE Edited
xx x xxx
xx xxxxx
xxxx xxx
x x xx x
xxx x xx
MT Output
30. 30
Adaptive MT key Features & Benefits
○ Creates a personal
adaptive MT engine
for the user
○ Interactive
o Improves
post-editor’s
productivity
○ Reduces the
frustration of editing
the same incorrect MT
○ Cumulative learning
over time – saved
from job to job
○ No need to wait for a
retrain
31. 31
French
Le service était exceptionnel
Lits très à l'aise
La vue était breathtaking
French Translation
Le service clientèle était exceptionnel
Lits très confortables à l'aise
La vue était à couper le souffle breathtaking
English Document
The customer service was outstanding
Very comfortable beds
The view was breathtaking
French Translation
Le service ____ était excellent
Les lits étaient très à l'aise
Quelle breathtaking vue!
User
Feedback
English Document
The customer service was excellent
The beds were very comfortable
What a breathtaking view!
Before Adaptive MT
Machine
Translation
32. 32
French
Le service était exceptionnel
Lits très à l'aise
La vue était breathtaking
French Translation
Le service clientèle était exceptionnel
Lits très confortables à l'aise
La vue était à couper le souffle breathtaking
English Document
The customer service was outstanding
Very comfortable beds
The view was breathtaking
French Translation
Le service clientèle était excellent
Les lits étaient très confortables
Quelle vue à couper le souffle!
User
Feedback
English Document
The customer service was excellent
The beds were very comfortable
What a breathtaking view!
Machine
Translation
Adaptive MT
Engineers Post-Editor Developers Scientists
Computational
Linguists
With Adaptive MT
34. 34
Focus on Canada’s market challenges
Flavor
requirements
Large retail
projects, no or
small starting
TMs
High
turnover
High quality
requirements
Traditional
offer (SDL
prior to 2014,
Google, Bing)
Mixed
French
flavor
Mixed
domains,
no retail
vertical
Lack of suitable
generic solutions
prevent MTPE
from the start
Lack of flavor &
domain-specific
terminology
increase PE
effort and review
costs
36. 36
SDL’s solution maturity roadmap
Generic
FR-CA
solutions
o Win clients
o Meet deadlines
o Collect project-specific data
Customizations
o Improve productivity
& quality
o Collect more data and
share feedback
Retrainings
o Further
improvement to
productivity and
quality
M A T U R I T Y
37. 37
SDL’s answer to Canada’s market challenges
Flavor
requirements
Large retail
projects, no or
small starting
TMs
High
turnover
High quality
requirements
SDL’s offer
after 2014
Training
material is
handpicked to
ensure correct
flavor
We have grown
retail solutions
to fit current
& new
opportunites
We have a
portfolio of
training material
& success
recipes for a
quick start
Combination of
adapted MT
solutions &
shrewd testing
and feedback
processes
39. 39
How do I get started?
Let’s have a conversation:
What content do you
need translated?
What are your quality
requirements?
What can you use for
a training corpus?
40. 40
Takeaway
o Measure
& improve
1 2 3 4 5
o MT can be
complex, so
choose your
MT provider
wisely
o Document
your quality
requirement
o Integrate MT
within your
larger
localization
infrastructure
o Use trained,
certified
post-editors
Fix technical issues
Get useful hints for retraining the engines
Empower post-editors and make them part of the process
Gather experience and improve MT in the long run through continuous research
Major tool issue:No tags, every second word Spanish, systematic errors, hotfix product management
Minor tool issue: doesn’t hamper everyday work, is registered, goes to product development but can take a bit longer to fix
Building on over 16 years of Machine Translation leadership, SDL has developed the next generation machine translation platform – SDL XMT
While the legacy SDL MT platform was designed as monolithic phrase-based system, SDL XMT has a unique modular design, which allows the rapid development and integration of special-purpose modules that can address specific challenges.
SDL XMT incorporates all previous innovations and algorithms but also enables rapid transition of new technologies and innovations thanks to its unique modular and robust design.
Different translation technologies can be applied to different language pairs, depending on what produces the best translation quality. You will begin to see much higher language quality, particularly for languages such as English to Japanese and Chinese.
If you are not using MT in your Translation process you are missing out on a powerful tool.
User provides feedback by fixing machine translation errors
Machine cannot learn from the feedback
Without Language Learning, Machine repeats the same errors again and again
With Language Learning, the Machine learns in real time, seamlessly, and continuously from the user feedback
The machine improves over time and does not repeat the same mistakes
Your MT provider must have the right combo: Technology, people, process