SlideShare a Scribd company logo
Beyond MT?
A few premature reflections on the
use of AI in translation
TAUS Global Content Summit Amsterdam, 6 March 2019
Dieter Rummel, EC, Directorate General for Translation
2
Directorate General
for Translation
Main document types
2015
38
16%
14%
6%
1% 11%
2%
2%
5%
2% 3%
1 EU law, including the legislative process
2 Guardian of the Treaties/Implementation of EU law
3 Correspondence
4 Political documents
5 Relations with other EU institutions
6 Communication, web, media, publications
7 Budget, budgetary procedure
8 Documents linked to international organisations and non-EU countries
9 Notices for publication in OJ
10 Commission working or internal documents
11 Other3
Evolution 2012-2018 : Number of translated pages and number of DGT staff
2200
2250
2300
2350
2400
2450
2500
2550
2600
0
500,000
1,000,000
1,500,000
2,000,000
2,500,000
2012 2013 2014 2015 2016 2017 2018
Pages
Staff
Context
Long-standing use of language technology + CAT tools
"More (better) with less"
More complexity, new formats, new ways of working
Stronger recourse to outsourcing
Shift from documents to content
Machine Translation as integral part of the resource mix
EC
Systran/ECMT
Rule-based MT
Ca. 1976 to 2010
MT@EC
Statistical MT
Moses Decoder
2013 - 2018
eTranslation
Neural MT
Connecting Europe
Facility (CEF)
From 2018
Machine translation at DGT
eTranslation use in DGT (up to Q3/2018)
Origin of translated segments
Buzz kill – or why I hate “AI”
• Beware of the images
• Neural MT vs. Recursive hetero-associative memories for translation
• Artificial intelligence is not about intelligence
• Neural networks have little to do with actual neurons
• Big data + neurons + deep learning + magic = Amazing stuff
happens!
• Do we really have big(-ish) data?
• Believe the hype - but in moderation
• Technology is not a solution
• Poor processes don’t get better through AI
• Doing the same and expecting different results = insanity
So, this had to be said.
But it’s pretty cool anyway.
• The technology has become accessible.
• “Big data” discussions have shown the possibilities of correlating
data from different sources.
• New ways of transforming data into usable information?
Describe
What is
happening?
Diagnose
Why did it
happen?
Predict
What will
happen?
Decide
What
should I
do?
Big data? - Big Questions!
What we translate
• What is the
document/content about?
• Is the document difficult, i.e.
demanding or complex?
• Are we working on
something similar?
• Do we have reliable
resources for this
document?
• How well will MT work for
this document?
Organising work
• How should this content be
best translated?
• Who is most suitable to
translate/revise the
document?
• How should the content be
split between several
translators (=meaningful
clustering)?
• What is our capacity to
translate?
• Are there meaningful
alternatives to the existing
forecasting model?
External service
providers
• How good is the contractor’s
work?
• How confident are we that
they will deliver good
quality?
• How reliable are they?
• Can we correlate
freelancer/agency, history of
evaluations, domain,
document type, document
complexity to calculate a
“reliability indicator” that
could support outsourcing
decisions?
More Big Questions!
Quality
• How good is a given translation?
• How good are our language
resources?
• Can we automatically detect
technically and linguistically poor
or suspect?
• How can we learn from mistakes?
Customers
• What are the common issues in
source documents?
• What do they have in common?
• Do we have the linguistic
resources to handle their
documents?
• What are their request patterns?
What next?
•Multi-disciplinary
•Explore use cases and
questions
•Break silos
•Validate or reject ideas
and assumptions in a
cost-effective way
•Training (also for
managers!)
•Learn what we do not
know
•Develop skills
•Translation memories
•Terminology
•XLIFF
•“Bad data”
•Missing data
Think about
Data
Create
understanding
and capacity
Incubate!Experiment
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflections on the use of AI in translation. By Dieter Rummel (Head of Informatics, DGT European Commission)

More Related Content

Similar to TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflections on the use of AI in translation. By Dieter Rummel (Head of Informatics, DGT European Commission)

TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS - The Language Data Network
 
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUSThe TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
TAUS - The Language Data Network
 
Monetize Big Data
Monetize Big DataMonetize Big Data
TAUS New Year's Reception 2014
TAUS New Year's Reception 2014TAUS New Year's Reception 2014
TAUS New Year's Reception 2014
TAUS - The Language Data Network
 
Sample
Sample Sample
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
Elena Simperl
 
Ima g ine2014_8c1report
Ima g ine2014_8c1reportIma g ine2014_8c1report
Ima g ine2014_8c1report
Remetey-Fülöpp Gábor
 
Data Modeling for communication
Data Modeling for communicationData Modeling for communication
Data Modeling for communicationRichard Freggi
 
Gift presentation
Gift presentationGift presentation
Gift presentation
Lucy Chambers
 
XML Drafting Discussion - PCC IT Conference 2013
XML Drafting Discussion - PCC IT Conference 2013XML Drafting Discussion - PCC IT Conference 2013
XML Drafting Discussion - PCC IT Conference 2013
Gareth Oakes
 
Translation_integration_into_the_documentation_process_en
Translation_integration_into_the_documentation_process_enTranslation_integration_into_the_documentation_process_en
Translation_integration_into_the_documentation_process_en
Vyacheslav Guzovsky
 
elgendy2014.pdf
elgendy2014.pdfelgendy2014.pdf
elgendy2014.pdf
Akuhuruf
 
Language First Protocol from QSi
Language First Protocol from QSiLanguage First Protocol from QSi
Language First Protocol from QSi
John O'Gorman
 
Mapping the content ecosystem
Mapping the content ecosystemMapping the content ecosystem
Mapping the content ecosystem
Rob Hanna, ECMs
 
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
IMPACT Centre of Competence
 
GATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaGATE: a text analysis tool for social media
GATE: a text analysis tool for social media
Diana Maynard
 
VisibleThread Users Conference 2018 - Welcome
VisibleThread Users Conference 2018 - WelcomeVisibleThread Users Conference 2018 - Welcome
VisibleThread Users Conference 2018 - Welcome
VisibleThread
 
Open Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceOpen Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 Conference
GSDI Association
 
Making Inter-operability Visible
Making Inter-operability VisibleMaking Inter-operability Visible
Making Inter-operability Visibleliddy
 

Similar to TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflections on the use of AI in translation. By Dieter Rummel (Head of Informatics, DGT European Commission) (20)

TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
 
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUSThe TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
 
Monetize Big Data
Monetize Big DataMonetize Big Data
Monetize Big Data
 
TAUS New Year's Reception 2014
TAUS New Year's Reception 2014TAUS New Year's Reception 2014
TAUS New Year's Reception 2014
 
Sample
Sample Sample
Sample
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
Ima g ine2014_8c1report
Ima g ine2014_8c1reportIma g ine2014_8c1report
Ima g ine2014_8c1report
 
Data Modeling for communication
Data Modeling for communicationData Modeling for communication
Data Modeling for communication
 
Gift presentation
Gift presentationGift presentation
Gift presentation
 
XML Drafting Discussion - PCC IT Conference 2013
XML Drafting Discussion - PCC IT Conference 2013XML Drafting Discussion - PCC IT Conference 2013
XML Drafting Discussion - PCC IT Conference 2013
 
Translation_integration_into_the_documentation_process_en
Translation_integration_into_the_documentation_process_enTranslation_integration_into_the_documentation_process_en
Translation_integration_into_the_documentation_process_en
 
elgendy2014.pdf
elgendy2014.pdfelgendy2014.pdf
elgendy2014.pdf
 
Language First Protocol from QSi
Language First Protocol from QSiLanguage First Protocol from QSi
Language First Protocol from QSi
 
Mapping the content ecosystem
Mapping the content ecosystemMapping the content ecosystem
Mapping the content ecosystem
 
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
 
Martinez treasury 4 11
Martinez treasury 4 11Martinez treasury 4 11
Martinez treasury 4 11
 
GATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaGATE: a text analysis tool for social media
GATE: a text analysis tool for social media
 
VisibleThread Users Conference 2018 - Welcome
VisibleThread Users Conference 2018 - WelcomeVisibleThread Users Conference 2018 - Welcome
VisibleThread Users Conference 2018 - Welcome
 
Open Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceOpen Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 Conference
 
Making Inter-operability Visible
Making Inter-operability VisibleMaking Inter-operability Visible
Making Inter-operability Visible
 

More from TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS - The Language Data Network
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
TAUS - The Language Data Network
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
TAUS - The Language Data Network
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
TAUS - The Language Data Network
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
TAUS - The Language Data Network
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
TAUS - The Language Data Network
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
TAUS - The Language Data Network
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
TAUS - The Language Data Network
 
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
TAUS - The Language Data Network
 

More from TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
 
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
 

Recently uploaded

JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 

Recently uploaded (20)

JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflections on the use of AI in translation. By Dieter Rummel (Head of Informatics, DGT European Commission)

  • 1. Beyond MT? A few premature reflections on the use of AI in translation TAUS Global Content Summit Amsterdam, 6 March 2019 Dieter Rummel, EC, Directorate General for Translation
  • 3. Main document types 2015 38 16% 14% 6% 1% 11% 2% 2% 5% 2% 3% 1 EU law, including the legislative process 2 Guardian of the Treaties/Implementation of EU law 3 Correspondence 4 Political documents 5 Relations with other EU institutions 6 Communication, web, media, publications 7 Budget, budgetary procedure 8 Documents linked to international organisations and non-EU countries 9 Notices for publication in OJ 10 Commission working or internal documents 11 Other3
  • 4. Evolution 2012-2018 : Number of translated pages and number of DGT staff 2200 2250 2300 2350 2400 2450 2500 2550 2600 0 500,000 1,000,000 1,500,000 2,000,000 2,500,000 2012 2013 2014 2015 2016 2017 2018 Pages Staff
  • 5. Context Long-standing use of language technology + CAT tools "More (better) with less" More complexity, new formats, new ways of working Stronger recourse to outsourcing Shift from documents to content Machine Translation as integral part of the resource mix
  • 6. EC Systran/ECMT Rule-based MT Ca. 1976 to 2010 MT@EC Statistical MT Moses Decoder 2013 - 2018 eTranslation Neural MT Connecting Europe Facility (CEF) From 2018 Machine translation at DGT
  • 7. eTranslation use in DGT (up to Q3/2018)
  • 9. Buzz kill – or why I hate “AI” • Beware of the images • Neural MT vs. Recursive hetero-associative memories for translation • Artificial intelligence is not about intelligence • Neural networks have little to do with actual neurons • Big data + neurons + deep learning + magic = Amazing stuff happens! • Do we really have big(-ish) data? • Believe the hype - but in moderation • Technology is not a solution • Poor processes don’t get better through AI • Doing the same and expecting different results = insanity
  • 10. So, this had to be said. But it’s pretty cool anyway. • The technology has become accessible. • “Big data” discussions have shown the possibilities of correlating data from different sources. • New ways of transforming data into usable information? Describe What is happening? Diagnose Why did it happen? Predict What will happen? Decide What should I do?
  • 11. Big data? - Big Questions! What we translate • What is the document/content about? • Is the document difficult, i.e. demanding or complex? • Are we working on something similar? • Do we have reliable resources for this document? • How well will MT work for this document? Organising work • How should this content be best translated? • Who is most suitable to translate/revise the document? • How should the content be split between several translators (=meaningful clustering)? • What is our capacity to translate? • Are there meaningful alternatives to the existing forecasting model? External service providers • How good is the contractor’s work? • How confident are we that they will deliver good quality? • How reliable are they? • Can we correlate freelancer/agency, history of evaluations, domain, document type, document complexity to calculate a “reliability indicator” that could support outsourcing decisions?
  • 12. More Big Questions! Quality • How good is a given translation? • How good are our language resources? • Can we automatically detect technically and linguistically poor or suspect? • How can we learn from mistakes? Customers • What are the common issues in source documents? • What do they have in common? • Do we have the linguistic resources to handle their documents? • What are their request patterns?
  • 13. What next? •Multi-disciplinary •Explore use cases and questions •Break silos •Validate or reject ideas and assumptions in a cost-effective way •Training (also for managers!) •Learn what we do not know •Develop skills •Translation memories •Terminology •XLIFF •“Bad data” •Missing data Think about Data Create understanding and capacity Incubate!Experiment