SlideShare a Scribd company logo
Translation Quality Assessment:
Five Easy Steps
Using Multidimensional Quality Metrics to
improve quality assessment and management
Prepared by the QTLaunchPad project (info@qt21.eu)
version 1.0 (26.April 2013)
Who does this apply to?
 Requesters of translation services looking for relevant
quality metrics
 Language Service Providers (LSPs) delivering translation
services to their clients
 The following materials will apply to negotiation
between requesters and providers
 This description does not apply to individual translators
(although they may want to be aware of the contents)
Step 1: Specifications
Basic questions about your project
E.g.,
 What languages are you working in?
 What is your subject field?
 What sort of project is it (e.g., user
interface, documentation, advertising)?
 What technology are you using (MT, CAT, etc.)?
 What register and style are you using?
Step 2. Select Metrics
Based on your specifications…
 MQM recommendation tool will:
 suggest a pre-defined metric used for similar projects, or
 recommend a custom metric that applies to your project
 You are free to modify the metric as needed
 Create a metrics specification file that
 defines the issues to be examined
 provides weights (descriptions of how important the issues
are)
 Metrics specification file can be used by an MQM-
compliant tool
Step 3: Evaluation Method
Three options:
1. Sampling: Examine a portion of the text to determine
whether to pass or fail the entire text. Sampling can
utilize quality estimation for better results
2. Full error analysis: Review the entire text (needed for
critical legal or safety texts)
3. Rubric: Rate the text on a numerical scale (suitable
for quick assessment of suitability)
Automated Metrics
 If sampling is used, MQM’s quality estimation tools will
help focus sampling on those parts of the text that need
attention
 Automatic metrics can be used in some cases where
human evaluation is too expensive or time-consuming
Step 4: Evaluation
Evaluation…
 Can be conducted by the requester or LSP in accordance
with the agreement between the parties
 Follows the method chosen in Step 3 (evaluation
method)
 Issues must match the metric chosen in Step 2: issues
not found in the metric should not be considered errors
MQM provides capabilities
 For human evaluation
 Inline markup provides an audit trail:
 Allows independent verification of errors
 Helps ensure that issues are corrected
 Full reporting functions:
 See what types of errors are reported
 Understand where errors come from
 For automatic evaluation
 Integrated use of existing quality metrics to help provide
evaluation
translate5
 These capabilities are being integrated into an open-
source editing tool, translate5
(http://www.translate5.net)
 All results are free to implement in additional tools
(both open source and proprietary)
 Parties interested in development should contact
info@qt21.eu
The source matters
 Full MQM evaluation includes the source
 Source quality evaluation can help identify reasons for
problems and resolve them
 Translators can be rewarded for addressing source
deficiencies (scores over 100% are possible!)
Step 5: Scoring
Scoring Formula
 (Q = whatever set of issues being counted within the bigger
formula)
 Provides consistency with LISA QA Model scoring method
 Can be customized to support other legacy systems
 Can be applied to individual parts of the overall formula:
i.e., fluency, accuracy, grammar, etc. subscores can be
derived
 Weights (not shown) can be used to adjust importance of
various issue types
Scores help guide decisions
 Scores are given on a 100% basis
 Scores can be broken down into more fine-grained
reports.
 E.g., a score of 96% could have 100% accuracy but 92%
fluency.
 Helps target actions for quality control.
Example
1. Specifications
Parameter Value
Language/Locale Source: English; Target: Japanese
Subject field/domain Medical
Text type Narrative
Audience Educated readers with an interest in medicine
Purpose Education about a new procedure for managing
diabetes
Register Moderately formal
Style no specified style – match source if possible
Content correspondence Literal translation
Output modality subtitles (speech to text)
File format Time-coded XML for dotSub
Production technology human translation
2. Recommended Metric
Issue type Weight (high,
medium, low)
Notes
Fluency
Orthography High
Grammar High
Accuracy
Mistranslation High
Omission Low Due to nature as captions,
some information loss is
expected. Captions should be
60% of spoken dialogue
Untranslated High
Legal
requirements
High Must make sure that legal
claims are admissible under
Japanese law
Chosen from…
Issue types are a subset of the full catalog of types
Chosen from…
Quality Formula (1)
TQ = (Atr + At - As) + (Ft – Fs)
with respect to specifications
 TQ = translation quality
 Atr = accuracy (transfer)
 At = accuracy for the target text
 As = accuracy for the source text
 Ft = fluency score for target text
 Fs = fluency score for source text
Quality Formula (2)
TQ = (Atr + At - As) + (Ft – Fs)
with respect to specifications
Definition: A quality translation demonstrates required
accuracy and fluency for the audience and purpose and
complies with all other negotiated specifications, taking
into account end-user needs.
The gold portion = dimensions (specifications)
3. Evaluation method
 In this example, portions of the text are marketing:
sampling is an acceptable evaluation method for these
parts
 Other portions contain legal and regulatory claims: full
error analysis is required for those portions
 Inline markup can be used via MQM namespace (because
text is in XML) to ensure corrections are made.
4. Evaluation
• Evaluation includes
subsegment markup
with issues in metric
• Issues stored in MQM
namespace to allow
audit and revision
• Users can select three severity levels:
• critical: the issue renders the text unusable
• major: the issue leaves the text usable, but is an obstacle to
understanding
• minor: the issue does not impact usability of the text
screenshot: translate5.net showing MQM markup tool
5. Scoring
Issue type Weight Minor Major Critical Penalty Adjusted Total
Fluency
Orthography 1.0 8 2 1 28 28 97.2%
Grammar 1.0 6 2 0 16 16 98.4%
Subtotal 44 95.6%
Accuracy
Mistranslation 1.0 4 0 0 4 4 99.6%
Omission 0.2 12 4 1 42 8.4 99.2%
Untranslated 1.0 1 0 0 1 1 99.9%
Legal
requirements
1.0 0 0 1 10 10 99.0%
Subtotal 23.4 97.7%
Total 67.4 93.3%
Assumes 1000-word sample
Because Omission is considered a low
priority in this case, it is given a low
weight
5. Scoring
 Without weighting of Omission, the score would be
89.9%
 We can see that the translator has more problems with
fluency than with accuracy
5. Full scoring (including source)
Issue type Source Target Adjusted
Fluency
Orthography 96.1% 97.2% 101.1%
Grammar 99.0% 98.4% 99.6%
Subtotal 95.1% 95.6% ☞ 100.5%
Accuracy
Mistranslation (100%) 99.6% 99.6%
Omission (100%) 99.2% 99.2%
Untranslated (100%) 99.9% 99.9%
Legal requirements (100%) 99.0% 99.0%
Subtotal 100% 97.7% 97.7%
Total 95.1% 89.9% 98.2%
Assumes 1000-word sample. Source accuracy set to 100% for computational purposes.
5. Scoring (including source)
 In many cases, some problems in a translation are not
caused by the translator.
 In this case, the translator fixed problems in the
source, resulting in better quality for fluency in the
target. The translator should be recognized for this
work.
For more information
 Please visit http://www.qt21.eu/launchpad/
 Write to info@qt21.eu

More Related Content

What's hot

Top Trans Survey Translation Issues
Top Trans Survey Translation IssuesTop Trans Survey Translation Issues
Top Trans Survey Translation Issues
Raya Wasser
 
Serge astm-presentation-chicago-2014-final
Serge astm-presentation-chicago-2014-finalSerge astm-presentation-chicago-2014-final
Serge astm-presentation-chicago-2014-final
Serge Gladkoff
 
Lucia Specia - Estimativa de qualidade em TA
Lucia Specia - Estimativa de qualidade em TALucia Specia - Estimativa de qualidade em TA
Lucia Specia - Estimativa de qualidade em TA
I Conferência Internacional de Tradução e Tecnologia
 
TAUS Evaluating Post-Editor Performance Guidelines
TAUS Evaluating Post-Editor Performance GuidelinesTAUS Evaluating Post-Editor Performance Guidelines
TAUS Evaluating Post-Editor Performance Guidelines
TAUS - The Language Data Network
 
Tech capabilities with_sa
Tech capabilities with_saTech capabilities with_sa
Tech capabilities with_sa
Robert Martin
 
18. Alessandro Cattelan (Translated) Terminology
18. Alessandro Cattelan (Translated) Terminology18. Alessandro Cattelan (Translated) Terminology
18. Alessandro Cattelan (Translated) Terminology
RIILP
 
A Survey Of Agile Development Methodologies
A Survey Of Agile Development MethodologiesA Survey Of Agile Development Methodologies
A Survey Of Agile Development Methodologies
Abdul Basit
 
ATA grading rubric
ATA grading rubricATA grading rubric
ATA grading rubric
danyshrek2013
 
Welocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize Throughputs and Post-Editing Productivity Webinar Laura CasanellasWelocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize
 
High Volume, Rapid Turn Around Localization: Lessons Learned
High Volume, Rapid Turn Around Localization: Lessons LearnedHigh Volume, Rapid Turn Around Localization: Lessons Learned
High Volume, Rapid Turn Around Localization: Lessons Learned
SDL
 
The Latest Advances in Patent Machine Translation
The Latest Advances in Patent Machine TranslationThe Latest Advances in Patent Machine Translation
The Latest Advances in Patent Machine Translation
Iconic Translation Machines
 
Auditing_COAs_TranslationSilviaZaragoza
Auditing_COAs_TranslationSilviaZaragozaAuditing_COAs_TranslationSilviaZaragoza
Auditing_COAs_TranslationSilviaZaragoza
Neuropsychological Research Organization s.l
 
Tools-Driven Content Curation & Engine Training ATMA 2014
Tools-Driven Content Curation & Engine Training ATMA 2014Tools-Driven Content Curation & Engine Training ATMA 2014
Tools-Driven Content Curation & Engine Training ATMA 2014
Welocalize
 
Building and Implementing MT systems @ eBay – TAUS Global Content Summit 2019
Building and Implementing MT systems @ eBay – TAUS Global Content Summit 2019Building and Implementing MT systems @ eBay – TAUS Global Content Summit 2019
Building and Implementing MT systems @ eBay – TAUS Global Content Summit 2019
Jose Luis Bonilla Sánchez
 
Goods procurement­
Goods procurement­Goods procurement­
Unit1
Unit1Unit1
Documentation Checklist
Documentation ChecklistDocumentation Checklist
Documentation Checklist
Raghunath (Gautam) Soman
 

What's hot (17)

Top Trans Survey Translation Issues
Top Trans Survey Translation IssuesTop Trans Survey Translation Issues
Top Trans Survey Translation Issues
 
Serge astm-presentation-chicago-2014-final
Serge astm-presentation-chicago-2014-finalSerge astm-presentation-chicago-2014-final
Serge astm-presentation-chicago-2014-final
 
Lucia Specia - Estimativa de qualidade em TA
Lucia Specia - Estimativa de qualidade em TALucia Specia - Estimativa de qualidade em TA
Lucia Specia - Estimativa de qualidade em TA
 
TAUS Evaluating Post-Editor Performance Guidelines
TAUS Evaluating Post-Editor Performance GuidelinesTAUS Evaluating Post-Editor Performance Guidelines
TAUS Evaluating Post-Editor Performance Guidelines
 
Tech capabilities with_sa
Tech capabilities with_saTech capabilities with_sa
Tech capabilities with_sa
 
18. Alessandro Cattelan (Translated) Terminology
18. Alessandro Cattelan (Translated) Terminology18. Alessandro Cattelan (Translated) Terminology
18. Alessandro Cattelan (Translated) Terminology
 
A Survey Of Agile Development Methodologies
A Survey Of Agile Development MethodologiesA Survey Of Agile Development Methodologies
A Survey Of Agile Development Methodologies
 
ATA grading rubric
ATA grading rubricATA grading rubric
ATA grading rubric
 
Welocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize Throughputs and Post-Editing Productivity Webinar Laura CasanellasWelocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
 
High Volume, Rapid Turn Around Localization: Lessons Learned
High Volume, Rapid Turn Around Localization: Lessons LearnedHigh Volume, Rapid Turn Around Localization: Lessons Learned
High Volume, Rapid Turn Around Localization: Lessons Learned
 
The Latest Advances in Patent Machine Translation
The Latest Advances in Patent Machine TranslationThe Latest Advances in Patent Machine Translation
The Latest Advances in Patent Machine Translation
 
Auditing_COAs_TranslationSilviaZaragoza
Auditing_COAs_TranslationSilviaZaragozaAuditing_COAs_TranslationSilviaZaragoza
Auditing_COAs_TranslationSilviaZaragoza
 
Tools-Driven Content Curation & Engine Training ATMA 2014
Tools-Driven Content Curation & Engine Training ATMA 2014Tools-Driven Content Curation & Engine Training ATMA 2014
Tools-Driven Content Curation & Engine Training ATMA 2014
 
Building and Implementing MT systems @ eBay – TAUS Global Content Summit 2019
Building and Implementing MT systems @ eBay – TAUS Global Content Summit 2019Building and Implementing MT systems @ eBay – TAUS Global Content Summit 2019
Building and Implementing MT systems @ eBay – TAUS Global Content Summit 2019
 
Goods procurement­
Goods procurement­Goods procurement­
Goods procurement­
 
Unit1
Unit1Unit1
Unit1
 
Documentation Checklist
Documentation ChecklistDocumentation Checklist
Documentation Checklist
 

Viewers also liked

Mekheimer
MekheimerMekheimer
Wind Power in Maine
Wind Power in MaineWind Power in Maine
Wind Power in Maine
Hall-Dale
 
Quality metrics in project management
Quality metrics in project managementQuality metrics in project management
Quality metrics in project management
selinasimpson1601
 
Project quality (and test process) metrics
Project quality (and test process) metricsProject quality (and test process) metrics
Project quality (and test process) metrics
Zbyszek Mockun
 
Quality metrics project management
Quality metrics project managementQuality metrics project management
Quality metrics project management
selinasimpson1501
 
5 of 5 Quality Metrics & Science by Obaid Ali
5 of 5  Quality Metrics & Science by Obaid Ali5 of 5  Quality Metrics & Science by Obaid Ali
5 of 5 Quality Metrics & Science by Obaid Ali
Obaid Ali / Roohi B. Obaid
 
Software Quality Metrics for Testers - StarWest 2013
Software Quality Metrics for Testers - StarWest 2013Software Quality Metrics for Testers - StarWest 2013
Software Quality Metrics for Testers - StarWest 2013
XBOSoft
 
Quality Metrics
Quality Metrics Quality Metrics
Quality Metrics
Haroon Abbu
 
Quality Metrics
Quality MetricsQuality Metrics
Construction quality management plan (Construction Productivity Analysis)
Construction quality management plan (Construction Productivity Analysis)Construction quality management plan (Construction Productivity Analysis)
Construction quality management plan (Construction Productivity Analysis)
Jayson Narito
 
Quality management slides
Quality management slidesQuality management slides
Quality management slides
Saba Afaq
 
Project quality management - PMI PMBOK Knowledge Area
Project quality management - PMI PMBOK Knowledge AreaProject quality management - PMI PMBOK Knowledge Area
Project quality management - PMI PMBOK Knowledge Area
Imran Jamil
 
Project Metrics & Measures
Project Metrics & MeasuresProject Metrics & Measures
Project Metrics & Measures
Anand Subramaniam
 

Viewers also liked (13)

Mekheimer
MekheimerMekheimer
Mekheimer
 
Wind Power in Maine
Wind Power in MaineWind Power in Maine
Wind Power in Maine
 
Quality metrics in project management
Quality metrics in project managementQuality metrics in project management
Quality metrics in project management
 
Project quality (and test process) metrics
Project quality (and test process) metricsProject quality (and test process) metrics
Project quality (and test process) metrics
 
Quality metrics project management
Quality metrics project managementQuality metrics project management
Quality metrics project management
 
5 of 5 Quality Metrics & Science by Obaid Ali
5 of 5  Quality Metrics & Science by Obaid Ali5 of 5  Quality Metrics & Science by Obaid Ali
5 of 5 Quality Metrics & Science by Obaid Ali
 
Software Quality Metrics for Testers - StarWest 2013
Software Quality Metrics for Testers - StarWest 2013Software Quality Metrics for Testers - StarWest 2013
Software Quality Metrics for Testers - StarWest 2013
 
Quality Metrics
Quality Metrics Quality Metrics
Quality Metrics
 
Quality Metrics
Quality MetricsQuality Metrics
Quality Metrics
 
Construction quality management plan (Construction Productivity Analysis)
Construction quality management plan (Construction Productivity Analysis)Construction quality management plan (Construction Productivity Analysis)
Construction quality management plan (Construction Productivity Analysis)
 
Quality management slides
Quality management slidesQuality management slides
Quality management slides
 
Project quality management - PMI PMBOK Knowledge Area
Project quality management - PMI PMBOK Knowledge AreaProject quality management - PMI PMBOK Knowledge Area
Project quality management - PMI PMBOK Knowledge Area
 
Project Metrics & Measures
Project Metrics & MeasuresProject Metrics & Measures
Project Metrics & Measures
 

Similar to Overview of Multidimensional Quality Metrics (QTLaunchPad)

Algorithm ExampleFor the following taskUse the random module .docx
Algorithm ExampleFor the following taskUse the random module .docxAlgorithm ExampleFor the following taskUse the random module .docx
Algorithm ExampleFor the following taskUse the random module .docx
daniahendric
 
Improving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case StudyImproving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case Study
Iconic Translation Machines
 
Workshop on the tauyou machine translation platform
Workshop on the tauyou machine translation platformWorkshop on the tauyou machine translation platform
Workshop on the tauyou machine translation platform
tauyou
 
Quality is in the Eye of the Beholder - Part 2
Quality is in the Eye of the Beholder - Part 2Quality is in the Eye of the Beholder - Part 2
Quality is in the Eye of the Beholder - Part 2
Think Latin America
 
Quality is in the Eye of the Beholder, by Eva Klaudinyova
Quality is in the Eye of the Beholder, by Eva KlaudinyovaQuality is in the Eye of the Beholder, by Eva Klaudinyova
Quality is in the Eye of the Beholder, by Eva Klaudinyova
Ccaps Translation and Localization
 
ICALT 2010: Supporting Exception Handling in Scripted Collaborative Courses
ICALT 2010: Supporting Exception Handling in Scripted Collaborative CoursesICALT 2010: Supporting Exception Handling in Scripted Collaborative Courses
ICALT 2010: Supporting Exception Handling in Scripted Collaborative Courses
Roberto Perez-Rodriguez
 
What machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happyWhat machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happy
Iconic Translation Machines
 
Software testing
Software testingSoftware testing
Software testing
thaneofife
 
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
dclsocialmedia
 
Tm challenges
Tm challengesTm challenges
Tm challenges
ITIRussia
 
TAUS Best Practices Error Typology Guidelines
TAUS Best Practices Error Typology GuidelinesTAUS Best Practices Error Typology Guidelines
TAUS Best Practices Error Typology Guidelines
TAUS - The Language Data Network
 
Language Quality Management: Models, Measures, Methodologies
Language Quality Management: Models, Measures, Methodologies Language Quality Management: Models, Measures, Methodologies
Language Quality Management: Models, Measures, Methodologies
Sajan
 
Error Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsError Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation Outputs
Parisa Niksefat
 
Sda 4
Sda   4Sda   4
Check upload1
Check upload1Check upload1
Check upload1
Referral Bhai
 
Check upload1
Check upload1Check upload1
Check upload1
Nitish Bhardwaj
 
Prvt file test
Prvt file testPrvt file test
Prvt file test
dummyuser1analytics
 
Top Natural Language Processing |aitech.studio
Top Natural Language Processing |aitech.studioTop Natural Language Processing |aitech.studio
Top Natural Language Processing |aitech.studio
AITechStudio
 
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
TAUS - The Language Data Network
 
Machine Translation Master Class at the EUATC Conference by Diego Bartolome
Machine Translation Master Class at the EUATC Conference by Diego BartolomeMachine Translation Master Class at the EUATC Conference by Diego Bartolome
Machine Translation Master Class at the EUATC Conference by Diego Bartolome
tauyou
 

Similar to Overview of Multidimensional Quality Metrics (QTLaunchPad) (20)

Algorithm ExampleFor the following taskUse the random module .docx
Algorithm ExampleFor the following taskUse the random module .docxAlgorithm ExampleFor the following taskUse the random module .docx
Algorithm ExampleFor the following taskUse the random module .docx
 
Improving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case StudyImproving Translator Productivity with MT: A Patent Translation Case Study
Improving Translator Productivity with MT: A Patent Translation Case Study
 
Workshop on the tauyou machine translation platform
Workshop on the tauyou machine translation platformWorkshop on the tauyou machine translation platform
Workshop on the tauyou machine translation platform
 
Quality is in the Eye of the Beholder - Part 2
Quality is in the Eye of the Beholder - Part 2Quality is in the Eye of the Beholder - Part 2
Quality is in the Eye of the Beholder - Part 2
 
Quality is in the Eye of the Beholder, by Eva Klaudinyova
Quality is in the Eye of the Beholder, by Eva KlaudinyovaQuality is in the Eye of the Beholder, by Eva Klaudinyova
Quality is in the Eye of the Beholder, by Eva Klaudinyova
 
ICALT 2010: Supporting Exception Handling in Scripted Collaborative Courses
ICALT 2010: Supporting Exception Handling in Scripted Collaborative CoursesICALT 2010: Supporting Exception Handling in Scripted Collaborative Courses
ICALT 2010: Supporting Exception Handling in Scripted Collaborative Courses
 
What machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happyWhat machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happy
 
Software testing
Software testingSoftware testing
Software testing
 
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
 
Tm challenges
Tm challengesTm challenges
Tm challenges
 
TAUS Best Practices Error Typology Guidelines
TAUS Best Practices Error Typology GuidelinesTAUS Best Practices Error Typology Guidelines
TAUS Best Practices Error Typology Guidelines
 
Language Quality Management: Models, Measures, Methodologies
Language Quality Management: Models, Measures, Methodologies Language Quality Management: Models, Measures, Methodologies
Language Quality Management: Models, Measures, Methodologies
 
Error Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsError Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation Outputs
 
Sda 4
Sda   4Sda   4
Sda 4
 
Check upload1
Check upload1Check upload1
Check upload1
 
Check upload1
Check upload1Check upload1
Check upload1
 
Prvt file test
Prvt file testPrvt file test
Prvt file test
 
Top Natural Language Processing |aitech.studio
Top Natural Language Processing |aitech.studioTop Natural Language Processing |aitech.studio
Top Natural Language Processing |aitech.studio
 
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
 
Machine Translation Master Class at the EUATC Conference by Diego Bartolome
Machine Translation Master Class at the EUATC Conference by Diego BartolomeMachine Translation Master Class at the EUATC Conference by Diego Bartolome
Machine Translation Master Class at the EUATC Conference by Diego Bartolome
 

Recently uploaded

Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 

Recently uploaded (20)

Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 

Overview of Multidimensional Quality Metrics (QTLaunchPad)

  • 1. Translation Quality Assessment: Five Easy Steps Using Multidimensional Quality Metrics to improve quality assessment and management Prepared by the QTLaunchPad project (info@qt21.eu) version 1.0 (26.April 2013)
  • 2. Who does this apply to?  Requesters of translation services looking for relevant quality metrics  Language Service Providers (LSPs) delivering translation services to their clients  The following materials will apply to negotiation between requesters and providers  This description does not apply to individual translators (although they may want to be aware of the contents)
  • 4. Basic questions about your project E.g.,  What languages are you working in?  What is your subject field?  What sort of project is it (e.g., user interface, documentation, advertising)?  What technology are you using (MT, CAT, etc.)?  What register and style are you using?
  • 5. Step 2. Select Metrics
  • 6. Based on your specifications…  MQM recommendation tool will:  suggest a pre-defined metric used for similar projects, or  recommend a custom metric that applies to your project  You are free to modify the metric as needed  Create a metrics specification file that  defines the issues to be examined  provides weights (descriptions of how important the issues are)  Metrics specification file can be used by an MQM- compliant tool
  • 8. Three options: 1. Sampling: Examine a portion of the text to determine whether to pass or fail the entire text. Sampling can utilize quality estimation for better results 2. Full error analysis: Review the entire text (needed for critical legal or safety texts) 3. Rubric: Rate the text on a numerical scale (suitable for quick assessment of suitability)
  • 9. Automated Metrics  If sampling is used, MQM’s quality estimation tools will help focus sampling on those parts of the text that need attention  Automatic metrics can be used in some cases where human evaluation is too expensive or time-consuming
  • 11. Evaluation…  Can be conducted by the requester or LSP in accordance with the agreement between the parties  Follows the method chosen in Step 3 (evaluation method)  Issues must match the metric chosen in Step 2: issues not found in the metric should not be considered errors
  • 12. MQM provides capabilities  For human evaluation  Inline markup provides an audit trail:  Allows independent verification of errors  Helps ensure that issues are corrected  Full reporting functions:  See what types of errors are reported  Understand where errors come from  For automatic evaluation  Integrated use of existing quality metrics to help provide evaluation
  • 13. translate5  These capabilities are being integrated into an open- source editing tool, translate5 (http://www.translate5.net)  All results are free to implement in additional tools (both open source and proprietary)  Parties interested in development should contact info@qt21.eu
  • 14. The source matters  Full MQM evaluation includes the source  Source quality evaluation can help identify reasons for problems and resolve them  Translators can be rewarded for addressing source deficiencies (scores over 100% are possible!)
  • 16. Scoring Formula  (Q = whatever set of issues being counted within the bigger formula)  Provides consistency with LISA QA Model scoring method  Can be customized to support other legacy systems  Can be applied to individual parts of the overall formula: i.e., fluency, accuracy, grammar, etc. subscores can be derived  Weights (not shown) can be used to adjust importance of various issue types
  • 17. Scores help guide decisions  Scores are given on a 100% basis  Scores can be broken down into more fine-grained reports.  E.g., a score of 96% could have 100% accuracy but 92% fluency.  Helps target actions for quality control.
  • 19. 1. Specifications Parameter Value Language/Locale Source: English; Target: Japanese Subject field/domain Medical Text type Narrative Audience Educated readers with an interest in medicine Purpose Education about a new procedure for managing diabetes Register Moderately formal Style no specified style – match source if possible Content correspondence Literal translation Output modality subtitles (speech to text) File format Time-coded XML for dotSub Production technology human translation
  • 20. 2. Recommended Metric Issue type Weight (high, medium, low) Notes Fluency Orthography High Grammar High Accuracy Mistranslation High Omission Low Due to nature as captions, some information loss is expected. Captions should be 60% of spoken dialogue Untranslated High Legal requirements High Must make sure that legal claims are admissible under Japanese law
  • 21. Chosen from… Issue types are a subset of the full catalog of types
  • 23. Quality Formula (1) TQ = (Atr + At - As) + (Ft – Fs) with respect to specifications  TQ = translation quality  Atr = accuracy (transfer)  At = accuracy for the target text  As = accuracy for the source text  Ft = fluency score for target text  Fs = fluency score for source text
  • 24. Quality Formula (2) TQ = (Atr + At - As) + (Ft – Fs) with respect to specifications Definition: A quality translation demonstrates required accuracy and fluency for the audience and purpose and complies with all other negotiated specifications, taking into account end-user needs. The gold portion = dimensions (specifications)
  • 25. 3. Evaluation method  In this example, portions of the text are marketing: sampling is an acceptable evaluation method for these parts  Other portions contain legal and regulatory claims: full error analysis is required for those portions  Inline markup can be used via MQM namespace (because text is in XML) to ensure corrections are made.
  • 26. 4. Evaluation • Evaluation includes subsegment markup with issues in metric • Issues stored in MQM namespace to allow audit and revision • Users can select three severity levels: • critical: the issue renders the text unusable • major: the issue leaves the text usable, but is an obstacle to understanding • minor: the issue does not impact usability of the text screenshot: translate5.net showing MQM markup tool
  • 27. 5. Scoring Issue type Weight Minor Major Critical Penalty Adjusted Total Fluency Orthography 1.0 8 2 1 28 28 97.2% Grammar 1.0 6 2 0 16 16 98.4% Subtotal 44 95.6% Accuracy Mistranslation 1.0 4 0 0 4 4 99.6% Omission 0.2 12 4 1 42 8.4 99.2% Untranslated 1.0 1 0 0 1 1 99.9% Legal requirements 1.0 0 0 1 10 10 99.0% Subtotal 23.4 97.7% Total 67.4 93.3% Assumes 1000-word sample Because Omission is considered a low priority in this case, it is given a low weight
  • 28. 5. Scoring  Without weighting of Omission, the score would be 89.9%  We can see that the translator has more problems with fluency than with accuracy
  • 29. 5. Full scoring (including source) Issue type Source Target Adjusted Fluency Orthography 96.1% 97.2% 101.1% Grammar 99.0% 98.4% 99.6% Subtotal 95.1% 95.6% ☞ 100.5% Accuracy Mistranslation (100%) 99.6% 99.6% Omission (100%) 99.2% 99.2% Untranslated (100%) 99.9% 99.9% Legal requirements (100%) 99.0% 99.0% Subtotal 100% 97.7% 97.7% Total 95.1% 89.9% 98.2% Assumes 1000-word sample. Source accuracy set to 100% for computational purposes.
  • 30. 5. Scoring (including source)  In many cases, some problems in a translation are not caused by the translator.  In this case, the translator fixed problems in the source, resulting in better quality for fluency in the target. The translator should be recognized for this work.
  • 31. For more information  Please visit http://www.qt21.eu/launchpad/  Write to info@qt21.eu