SlideShare a Scribd company logo
Translation Studies
Simplification and Explicitation Universals




                                  Claudiu Mih˘il˘
                                             a a

                             Faculty of Computer Science
                        ”Alexandru Ioan Cuza” University of Ia¸i
                                                              s


                                   21 April 2010
Outline

Introduction
    Motivation
    Translation studies
Simplification
   Definiton
   Simplification pros
   Simplification cons
Explicitation
   Definiton
   Explicitation pros
   Explicitation cons
Conclusions
 2 of 13
Motivation
• The questions
  ◦ Is there a difference between original and translated language?
  ◦ If so, is it automatically detectable?
  ◦ And if so, does it improve NLP quality?




3 of 13
Motivation
• The questions
  ◦ Is there a difference between original and translated language?
  ◦ If so, is it automatically detectable?
  ◦ And if so, does it improve NLP quality?
• The answers
  ◦ Yes!
  ◦ Yes: up to 97.62% for simplification
  ◦ Yes:
          • Human translator (self-)assessment
          • Statistical machine translation
          • Multilingual plagiarism detection




3 of 13
Translation studies
• Specific lexico-grammatical and syntactic characteristics




4 of 13
Translation studies
• Specific lexico-grammatical and syntactic characteristics
• Translationese - Gellerstam (1986)
  ◦ ”Fingerprints” left behind by the translation process




4 of 13
Translation studies
• Specific lexico-grammatical and syntactic characteristics
• Translationese - Gellerstam (1986)
  ◦ ”Fingerprints” left behind by the translation process
• Translation laws - Toury (1983)
  ◦ Standardisation, Interference




4 of 13
Translation studies
• Specific lexico-grammatical and syntactic characteristics
• Translationese - Gellerstam (1986)
  ◦ ”Fingerprints” left behind by the translation process
• Translation laws - Toury (1983)
  ◦ Standardisation, Interference
• Translation universals - Baker (1993)
  ◦ Simplification, Explicitation, Convergence, Normalisation




4 of 13
Simplification
• Tendency to produce simpler and easier-to-follow texts




5 of 13
Simplification
• Tendency to produce simpler and easier-to-follow texts
• Laviosa (2002)
  ◦ Study on small corpus
  ◦ Features for simplification
  ◦ Insufficient evidence




5 of 13
Simplification pros
• Baroni (2006)
  ◦ Detect originals and translations in an Italian corpus
  ◦ Uni-, bi-, tri-grams, word forms, lemmas, and POS tags
  ◦ Supervised learning system
  ◦ Accuracy up to 87%




6 of 13
Simplification pros
• Baroni (2006)
  ◦ Detect originals and translations in an Italian corpus
  ◦ Uni-, bi-, tri-grams, word forms, lemmas, and POS tags
  ◦ Supervised learning system
  ◦ Accuracy up to 87%
• Corpas (2008a)
  ◦ English-into-Spanish and Spanish medical and technical texts
  ◦ Validated for lexical richness
  ◦ Contradicted for complex sentences, sentence length, ambiguity,
    information load, depth of syntactic trees




6 of 13
Simplification pros
• Baroni (2006)
  ◦ Detect originals and translations in an Italian corpus
  ◦ Uni-, bi-, tri-grams, word forms, lemmas, and POS tags
  ◦ Supervised learning system
  ◦ Accuracy up to 87%
• Corpas (2008a)
  ◦ English-into-Spanish and Spanish medical and technical texts
  ◦ Validated for lexical richness
  ◦ Contradicted for complex sentences, sentence length, ambiguity,
    information load, depth of syntactic trees
• Corpas (2008b)
  ◦ Validated for lexical richness and density, number of discourse
    markers, complex sentences, sentence length
  ◦ More visible for technical domain

6 of 13
Simplification pros
• Ilisei (2010)
   ◦ 21 language-independent features
   ◦ Supervised machine learning - 8 classifiers
   ◦ Accuracy of 97.62%
   ◦ Most salient features - InfoGain, ChiSquare
          • Lexical richness
          • Sentence length
          • Proportions of pronouns, conjunctions, grammatical and lexical words




7 of 13
Simplification cons
• Jantunen (2001)
   ◦ Boosters in Finnish translations - hyvin, kovin, oikein
   ◦ typical lexical combinations in most cases




8 of 13
Simplification cons
• Jantunen (2001)
   ◦ Boosters in Finnish translations - hyvin, kovin, oikein
   ◦ typical lexical combinations in most cases
• Jantunen (2004)
   ◦ Boosters in Finnish translations - hyvin, kovin, oikein
   ◦ untypical lexical combinations in translations
   ◦ similar colligations in originals and translations




8 of 13
Explicitation
• Introducing overt information into the translation that is implicit in
  the source language




9 of 13
Explicitation
• Introducing overt information into the translation that is implicit in
  the source language
• Classification - Pym (2005)
   ◦ Obligatory explicitation
          • Forced by language specificity or grammar
   ◦ Voluntary explicitation
          • Optional information to avoid misinterpretations




9 of 13
Explicitation pros
• Burnett (1999)
  ◦ BNC vs. TEC
  ◦ suggest, admit, claim, think, believe, hope, know




10 of 13
Explicitation pros
• Burnett (1999)
  ◦ BNC vs. TEC
  ◦ suggest, admit, claim, think, believe, hope, know
• Olohan (2000)
  ◦ BNC vs. TEC
  ◦ say / tell + that / zero connective




10 of 13
Explicitation pros
• Burnett (1999)
  ◦ BNC vs. TEC
  ◦ suggest, admit, claim, think, believe, hope, know
• Olohan (2000)
  ◦ BNC vs. TEC
  ◦ say / tell + that / zero connective
• Olohan (2001)
  ◦ BNC vs. TEC
  ◦ promise + that / zero connective




10 of 13
Explicitation cons
• Cheong (2006)
  ◦ Explicitation vs. implicitation
  ◦ English-into-Korean translations
  ◦ The phenomena appear equally
  ◦ The direction of translation influences their behaviour




11 of 13
Conclusions
• Simplification
  ◦ Many studies supporting it
  ◦ Many studies contradicting it
  ◦ Not yet clearly confirmed




12 of 13
Conclusions
• Simplification
  ◦ Many studies supporting it
  ◦ Many studies contradicting it
  ◦ Not yet clearly confirmed
• Explicitation
  ◦ Occuring often to avoid misinterpretations
  ◦ Implicitation needs to be considered as well




12 of 13
Conclusions
• Simplification
  ◦ Many studies supporting it
  ◦ Many studies contradicting it
  ◦ Not yet clearly confirmed
• Explicitation
  ◦ Occuring often to avoid misinterpretations
  ◦ Implicitation needs to be considered as well
• Usefulness
  ◦ SMT
  ◦ Multilingual plagiarism detection
  ◦ (Self-)assessment of translators’s work




12 of 13
Thank you!
• Questions?




13 of 13

More Related Content

Similar to Simplification and Explicitation Universals

Identification of Translationese: A Machine Learning Approach
Identification of Translationese: A Machine Learning ApproachIdentification of Translationese: A Machine Learning Approach
Identification of Translationese: A Machine Learning Approach
iustinailisei
 
2106 PRSLLS
2106 PRSLLS2106 PRSLLS
2106 PRSLLS
WarNik Chow
 
Filled pauses and L2 proficiency: Finnish Australians speaking English
Filled pauses and L2 proficiency: Finnish Australians speaking EnglishFilled pauses and L2 proficiency: Finnish Australians speaking English
Filled pauses and L2 proficiency: Finnish Australians speaking EnglishWybo Wiersma
 
語言議題
語言議題語言議題
語言議題
Kevin Chun-Hsien Hsu
 
The effects of learner characteristics and beliefs on usage of ASR-CALL systems
The effects of learner characteristics and beliefs on usage of ASR-CALL systemsThe effects of learner characteristics and beliefs on usage of ASR-CALL systems
The effects of learner characteristics and beliefs on usage of ASR-CALL systems
Bindi Clements
 
NLP_guest_lecture.pdf
NLP_guest_lecture.pdfNLP_guest_lecture.pdf
NLP_guest_lecture.pdf
Soha82
 
Listening & speaking part 1
Listening & speaking part 1Listening & speaking part 1
Listening & speaking part 1Nombre Apellidos
 
Winke, gass, and syderenko presentation
Winke, gass, and syderenko presentationWinke, gass, and syderenko presentation
Winke, gass, and syderenko presentation
John Whalen
 
Scientific and Technical Translation in English - Week 2 2019
Scientific and Technical Translation in English - Week 2 2019Scientific and Technical Translation in English - Week 2 2019
Scientific and Technical Translation in English - Week 2 2019
Ron Martinez
 
Scientific and Technical Translation in English: Week 2
Scientific and Technical Translation in English: Week 2Scientific and Technical Translation in English: Week 2
Scientific and Technical Translation in English: Week 2
Ron Martinez
 
Dictogloss replication study: ESSE Brno 2018
Dictogloss replication study: ESSE Brno 2018Dictogloss replication study: ESSE Brno 2018
Dictogloss replication study: ESSE Brno 2018
Shona Whyte
 
How to Paraphrase a Sentence & Effective Word Choice
How to Paraphrase a Sentence & Effective Word ChoiceHow to Paraphrase a Sentence & Effective Word Choice
How to Paraphrase a Sentence & Effective Word Choice
sejin cheon
 
Speech Technology and how it will Transform Medicine
Speech Technology and how it will Transform MedicineSpeech Technology and how it will Transform Medicine
Speech Technology and how it will Transform Medicine
Nick van Terheyden
 
Tpological Universals & SLA (Linguistic Typology)
Tpological Universals & SLA (Linguistic Typology)Tpological Universals & SLA (Linguistic Typology)
Tpological Universals & SLA (Linguistic Typology)
Seray Tanyer
 
Tamura & Kusanagi (2014) CELES
Tamura & Kusanagi (2014) CELESTamura & Kusanagi (2014) CELES
Tamura & Kusanagi (2014) CELESYu Tamura
 
eufimism.ppt
eufimism.ppteufimism.ppt
eufimism.ppt
ayu dharma
 
My orthodontic journey 2014
My orthodontic journey 2014My orthodontic journey 2014
Reys_Poster_April_27_ FINAL
Reys_Poster_April_27_ FINALReys_Poster_April_27_ FINAL
Reys_Poster_April_27_ FINALKelly Allen
 
Classroom Management of a Child with APD
Classroom Management of a Child with APDClassroom Management of a Child with APD
Classroom Management of a Child with APD
kjmagnon
 
Barriers faced by international students on understanding the spoken content ...
Barriers faced by international students on understanding the spoken content ...Barriers faced by international students on understanding the spoken content ...
Barriers faced by international students on understanding the spoken content ...
decolonisingdmu
 

Similar to Simplification and Explicitation Universals (20)

Identification of Translationese: A Machine Learning Approach
Identification of Translationese: A Machine Learning ApproachIdentification of Translationese: A Machine Learning Approach
Identification of Translationese: A Machine Learning Approach
 
2106 PRSLLS
2106 PRSLLS2106 PRSLLS
2106 PRSLLS
 
Filled pauses and L2 proficiency: Finnish Australians speaking English
Filled pauses and L2 proficiency: Finnish Australians speaking EnglishFilled pauses and L2 proficiency: Finnish Australians speaking English
Filled pauses and L2 proficiency: Finnish Australians speaking English
 
語言議題
語言議題語言議題
語言議題
 
The effects of learner characteristics and beliefs on usage of ASR-CALL systems
The effects of learner characteristics and beliefs on usage of ASR-CALL systemsThe effects of learner characteristics and beliefs on usage of ASR-CALL systems
The effects of learner characteristics and beliefs on usage of ASR-CALL systems
 
NLP_guest_lecture.pdf
NLP_guest_lecture.pdfNLP_guest_lecture.pdf
NLP_guest_lecture.pdf
 
Listening & speaking part 1
Listening & speaking part 1Listening & speaking part 1
Listening & speaking part 1
 
Winke, gass, and syderenko presentation
Winke, gass, and syderenko presentationWinke, gass, and syderenko presentation
Winke, gass, and syderenko presentation
 
Scientific and Technical Translation in English - Week 2 2019
Scientific and Technical Translation in English - Week 2 2019Scientific and Technical Translation in English - Week 2 2019
Scientific and Technical Translation in English - Week 2 2019
 
Scientific and Technical Translation in English: Week 2
Scientific and Technical Translation in English: Week 2Scientific and Technical Translation in English: Week 2
Scientific and Technical Translation in English: Week 2
 
Dictogloss replication study: ESSE Brno 2018
Dictogloss replication study: ESSE Brno 2018Dictogloss replication study: ESSE Brno 2018
Dictogloss replication study: ESSE Brno 2018
 
How to Paraphrase a Sentence & Effective Word Choice
How to Paraphrase a Sentence & Effective Word ChoiceHow to Paraphrase a Sentence & Effective Word Choice
How to Paraphrase a Sentence & Effective Word Choice
 
Speech Technology and how it will Transform Medicine
Speech Technology and how it will Transform MedicineSpeech Technology and how it will Transform Medicine
Speech Technology and how it will Transform Medicine
 
Tpological Universals & SLA (Linguistic Typology)
Tpological Universals & SLA (Linguistic Typology)Tpological Universals & SLA (Linguistic Typology)
Tpological Universals & SLA (Linguistic Typology)
 
Tamura & Kusanagi (2014) CELES
Tamura & Kusanagi (2014) CELESTamura & Kusanagi (2014) CELES
Tamura & Kusanagi (2014) CELES
 
eufimism.ppt
eufimism.ppteufimism.ppt
eufimism.ppt
 
My orthodontic journey 2014
My orthodontic journey 2014My orthodontic journey 2014
My orthodontic journey 2014
 
Reys_Poster_April_27_ FINAL
Reys_Poster_April_27_ FINALReys_Poster_April_27_ FINAL
Reys_Poster_April_27_ FINAL
 
Classroom Management of a Child with APD
Classroom Management of a Child with APDClassroom Management of a Child with APD
Classroom Management of a Child with APD
 
Barriers faced by international students on understanding the spoken content ...
Barriers faced by international students on understanding the spoken content ...Barriers faced by international students on understanding the spoken content ...
Barriers faced by international students on understanding the spoken content ...
 

More from Claudiu Mihăilă

News Search Using Discourse Analytics
News Search Using Discourse AnalyticsNews Search Using Discourse Analytics
News Search Using Discourse Analytics
Claudiu Mihăilă
 
Analysing Entity Type Variation across Biomedical Subdomains
Analysing Entity Type Variation across Biomedical SubdomainsAnalysing Entity Type Variation across Biomedical Subdomains
Analysing Entity Type Variation across Biomedical SubdomainsClaudiu Mihăilă
 
Translation studies: Simplification and Explicitation Universals
Translation studies: Simplification and Explicitation UniversalsTranslation studies: Simplification and Explicitation Universals
Translation studies: Simplification and Explicitation Universals
Claudiu Mihăilă
 
Functional Dependency Grammar
Functional Dependency GrammarFunctional Dependency Grammar
Functional Dependency Grammar
Claudiu Mihăilă
 
To Be or Not to be a Zero Pronoun: A Machine Learning Approach for Romanian
To Be or Not to be a Zero Pronoun: A Machine Learning Approach for RomanianTo Be or Not to be a Zero Pronoun: A Machine Learning Approach for Romanian
To Be or Not to be a Zero Pronoun: A Machine Learning Approach for Romanian
Claudiu Mihăilă
 
TEDDY - Thesaurus Editor: Design and Definition Yarn
TEDDY - Thesaurus Editor: Design and Definition YarnTEDDY - Thesaurus Editor: Design and Definition Yarn
TEDDY - Thesaurus Editor: Design and Definition Yarn
Claudiu Mihăilă
 
Nature-inspired methods for the Semantic Web
Nature-inspired methods for the Semantic WebNature-inspired methods for the Semantic Web
Nature-inspired methods for the Semantic Web
Claudiu Mihăilă
 
Zemanta: A Content Recommendation Engine
Zemanta: A Content Recommendation EngineZemanta: A Content Recommendation Engine
Zemanta: A Content Recommendation Engine
Claudiu Mihăilă
 
Modelling social Web applications via tinydb
Modelling social Web applications via tinydbModelling social Web applications via tinydb
Modelling social Web applications via tinydbClaudiu Mihăilă
 

More from Claudiu Mihăilă (9)

News Search Using Discourse Analytics
News Search Using Discourse AnalyticsNews Search Using Discourse Analytics
News Search Using Discourse Analytics
 
Analysing Entity Type Variation across Biomedical Subdomains
Analysing Entity Type Variation across Biomedical SubdomainsAnalysing Entity Type Variation across Biomedical Subdomains
Analysing Entity Type Variation across Biomedical Subdomains
 
Translation studies: Simplification and Explicitation Universals
Translation studies: Simplification and Explicitation UniversalsTranslation studies: Simplification and Explicitation Universals
Translation studies: Simplification and Explicitation Universals
 
Functional Dependency Grammar
Functional Dependency GrammarFunctional Dependency Grammar
Functional Dependency Grammar
 
To Be or Not to be a Zero Pronoun: A Machine Learning Approach for Romanian
To Be or Not to be a Zero Pronoun: A Machine Learning Approach for RomanianTo Be or Not to be a Zero Pronoun: A Machine Learning Approach for Romanian
To Be or Not to be a Zero Pronoun: A Machine Learning Approach for Romanian
 
TEDDY - Thesaurus Editor: Design and Definition Yarn
TEDDY - Thesaurus Editor: Design and Definition YarnTEDDY - Thesaurus Editor: Design and Definition Yarn
TEDDY - Thesaurus Editor: Design and Definition Yarn
 
Nature-inspired methods for the Semantic Web
Nature-inspired methods for the Semantic WebNature-inspired methods for the Semantic Web
Nature-inspired methods for the Semantic Web
 
Zemanta: A Content Recommendation Engine
Zemanta: A Content Recommendation EngineZemanta: A Content Recommendation Engine
Zemanta: A Content Recommendation Engine
 
Modelling social Web applications via tinydb
Modelling social Web applications via tinydbModelling social Web applications via tinydb
Modelling social Web applications via tinydb
 

Recently uploaded

SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 

Recently uploaded (20)

SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 

Simplification and Explicitation Universals

  • 1. Translation Studies Simplification and Explicitation Universals Claudiu Mih˘il˘ a a Faculty of Computer Science ”Alexandru Ioan Cuza” University of Ia¸i s 21 April 2010
  • 2. Outline Introduction Motivation Translation studies Simplification Definiton Simplification pros Simplification cons Explicitation Definiton Explicitation pros Explicitation cons Conclusions 2 of 13
  • 3. Motivation • The questions ◦ Is there a difference between original and translated language? ◦ If so, is it automatically detectable? ◦ And if so, does it improve NLP quality? 3 of 13
  • 4. Motivation • The questions ◦ Is there a difference between original and translated language? ◦ If so, is it automatically detectable? ◦ And if so, does it improve NLP quality? • The answers ◦ Yes! ◦ Yes: up to 97.62% for simplification ◦ Yes: • Human translator (self-)assessment • Statistical machine translation • Multilingual plagiarism detection 3 of 13
  • 5. Translation studies • Specific lexico-grammatical and syntactic characteristics 4 of 13
  • 6. Translation studies • Specific lexico-grammatical and syntactic characteristics • Translationese - Gellerstam (1986) ◦ ”Fingerprints” left behind by the translation process 4 of 13
  • 7. Translation studies • Specific lexico-grammatical and syntactic characteristics • Translationese - Gellerstam (1986) ◦ ”Fingerprints” left behind by the translation process • Translation laws - Toury (1983) ◦ Standardisation, Interference 4 of 13
  • 8. Translation studies • Specific lexico-grammatical and syntactic characteristics • Translationese - Gellerstam (1986) ◦ ”Fingerprints” left behind by the translation process • Translation laws - Toury (1983) ◦ Standardisation, Interference • Translation universals - Baker (1993) ◦ Simplification, Explicitation, Convergence, Normalisation 4 of 13
  • 9. Simplification • Tendency to produce simpler and easier-to-follow texts 5 of 13
  • 10. Simplification • Tendency to produce simpler and easier-to-follow texts • Laviosa (2002) ◦ Study on small corpus ◦ Features for simplification ◦ Insufficient evidence 5 of 13
  • 11. Simplification pros • Baroni (2006) ◦ Detect originals and translations in an Italian corpus ◦ Uni-, bi-, tri-grams, word forms, lemmas, and POS tags ◦ Supervised learning system ◦ Accuracy up to 87% 6 of 13
  • 12. Simplification pros • Baroni (2006) ◦ Detect originals and translations in an Italian corpus ◦ Uni-, bi-, tri-grams, word forms, lemmas, and POS tags ◦ Supervised learning system ◦ Accuracy up to 87% • Corpas (2008a) ◦ English-into-Spanish and Spanish medical and technical texts ◦ Validated for lexical richness ◦ Contradicted for complex sentences, sentence length, ambiguity, information load, depth of syntactic trees 6 of 13
  • 13. Simplification pros • Baroni (2006) ◦ Detect originals and translations in an Italian corpus ◦ Uni-, bi-, tri-grams, word forms, lemmas, and POS tags ◦ Supervised learning system ◦ Accuracy up to 87% • Corpas (2008a) ◦ English-into-Spanish and Spanish medical and technical texts ◦ Validated for lexical richness ◦ Contradicted for complex sentences, sentence length, ambiguity, information load, depth of syntactic trees • Corpas (2008b) ◦ Validated for lexical richness and density, number of discourse markers, complex sentences, sentence length ◦ More visible for technical domain 6 of 13
  • 14. Simplification pros • Ilisei (2010) ◦ 21 language-independent features ◦ Supervised machine learning - 8 classifiers ◦ Accuracy of 97.62% ◦ Most salient features - InfoGain, ChiSquare • Lexical richness • Sentence length • Proportions of pronouns, conjunctions, grammatical and lexical words 7 of 13
  • 15. Simplification cons • Jantunen (2001) ◦ Boosters in Finnish translations - hyvin, kovin, oikein ◦ typical lexical combinations in most cases 8 of 13
  • 16. Simplification cons • Jantunen (2001) ◦ Boosters in Finnish translations - hyvin, kovin, oikein ◦ typical lexical combinations in most cases • Jantunen (2004) ◦ Boosters in Finnish translations - hyvin, kovin, oikein ◦ untypical lexical combinations in translations ◦ similar colligations in originals and translations 8 of 13
  • 17. Explicitation • Introducing overt information into the translation that is implicit in the source language 9 of 13
  • 18. Explicitation • Introducing overt information into the translation that is implicit in the source language • Classification - Pym (2005) ◦ Obligatory explicitation • Forced by language specificity or grammar ◦ Voluntary explicitation • Optional information to avoid misinterpretations 9 of 13
  • 19. Explicitation pros • Burnett (1999) ◦ BNC vs. TEC ◦ suggest, admit, claim, think, believe, hope, know 10 of 13
  • 20. Explicitation pros • Burnett (1999) ◦ BNC vs. TEC ◦ suggest, admit, claim, think, believe, hope, know • Olohan (2000) ◦ BNC vs. TEC ◦ say / tell + that / zero connective 10 of 13
  • 21. Explicitation pros • Burnett (1999) ◦ BNC vs. TEC ◦ suggest, admit, claim, think, believe, hope, know • Olohan (2000) ◦ BNC vs. TEC ◦ say / tell + that / zero connective • Olohan (2001) ◦ BNC vs. TEC ◦ promise + that / zero connective 10 of 13
  • 22. Explicitation cons • Cheong (2006) ◦ Explicitation vs. implicitation ◦ English-into-Korean translations ◦ The phenomena appear equally ◦ The direction of translation influences their behaviour 11 of 13
  • 23. Conclusions • Simplification ◦ Many studies supporting it ◦ Many studies contradicting it ◦ Not yet clearly confirmed 12 of 13
  • 24. Conclusions • Simplification ◦ Many studies supporting it ◦ Many studies contradicting it ◦ Not yet clearly confirmed • Explicitation ◦ Occuring often to avoid misinterpretations ◦ Implicitation needs to be considered as well 12 of 13
  • 25. Conclusions • Simplification ◦ Many studies supporting it ◦ Many studies contradicting it ◦ Not yet clearly confirmed • Explicitation ◦ Occuring often to avoid misinterpretations ◦ Implicitation needs to be considered as well • Usefulness ◦ SMT ◦ Multilingual plagiarism detection ◦ (Self-)assessment of translators’s work 12 of 13