SlideShare a Scribd company logo
Statistical Machine Translation  Waleed Oransa, M.Sc. College of Computing and Information Technology Arab Academy for Science and Technology Cairo, Egypt [email_address]
Agenda ,[object Object],[object Object],[object Object],[object Object],[object Object]
Agenda ,[object Object],[object Object],[object Object],[object Object],[object Object]
Why is Machine Translation so Hard? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
MT Approaches ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Interlingua Semantic Syntactic  Structure Word s Semantic  Syntactic  Structure Word s Direct Syntactic Transfer Semantic Transfer Source Language Text Target Language Text Conceptual  Generation Semantic   Generation Syntactic  Generation Morphological  Generation Conceptual  Analysis Semantic Analysis Parsing  Morphological Analysis Better Quality & More difficulty
Why Statistical Machine Translation (SMT)? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Example of a parallel corpus ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Statistical Machine Translation (SMT) ,[object Object],[object Object],[object Object],[object Object],[object Object]
How to build SMT System ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object]
SMT Training Phase English Sentences PBSMT System Training Language  Model (Arabic) Input:  Training Corpus Arabic/English Bi-Text Output:  Language Model  and Translation Model Arabic Sentences Translation Model (English/Arabic) Language   Modeling Training (Tool: SRILM toolkit) Translation   Modeling Training (Tool: Giza++ & Moses toolkit) What is the Language Model?
Language Model (LM) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
LM Role in SMT مشرف  يجتمعوا مشرف  تجتمع مشرف  يجتمع مشرف  يتقابل  مشرف  يقابل  مشرف  يجتمعا مشرف  تجتمعا  مشرف  يجتمعن مشرف  يجتمعون مشرف  يتقابلان Language Model  مشرف   xxxx مع  كبار المسؤولين المدنيين والعسكريين Language Model  يجتمع يجتمعوا تجتمع يجتمع يتقابل يقابل يجتمعا تجتمعا يجتمعن يجتمعون يتقابلان
[object Object]
SMT Training Phase English Sentences PBSMT System Training Language  Model (Arabic) Input:  Training Corpus Arabic/English Bi-Text Output:  Language Model  and Translation Model Arabic Sentences Translation Model (English/Arabic) Language   Modeling Training (Tool: SRILM toolkit) Translation   Modeling Training (Tool: Giza++ & Moses toolkit) What is the Translation Model?
Translation Model (TM) ,[object Object],[object Object],[object Object]
Translation Model (TM) ,[object Object],[object Object],[object Object],[object Object]
Translation Model (TM) ,[object Object],[object Object]
SMT Translation Phase SMT System (Decoder) Source Text Target Text Language  Model (Arabic) Translation Model (English/Arabic) Musharraf Meets with Senior Civilian  ,[object Object],[object Object],[object Object],[object Object],[object Object],Initial N-best hypotheses p=0.13 p=0.21 p=0.23 p=0.12 p=0.18 ,[object Object],[object Object],[object Object],[object Object],[object Object],p=0.53 p=0.42 p=0.37 p=0.22 p=0.48 Final N-best hypotheses ,[object Object]
How TM & LM work together? Musharraf Meets with Senior Civilian and Military Officials مشرف  *****  مع  كبار المسؤولين المدنيين والعسكريين يجتمع Language Model  Translation Model  يجتمعوا تجتمع يجتمع يتقابل يقابل يجتمعا تجتمعا يجتمعن يجتمعون يتقابلان
Agenda ,[object Object],[object Object],[object Object],[object Object]
PBSMT Approach ,[object Object],[object Object],[object Object],[object Object],ولد The prophet Mohamed was born في سنة  570  ميلادية in 570 A.D الرسول محمد
PBSMT Training Phase English Sentences PBSMT System Training Language  Model (Arabic) Input:  Training Corpus Arabic/English Bi-Text Output:  Language Model  and Translation Model PBSMT Normal Training Phase Arabic Sentences Translation Model  Phrase Table (English/Arabic) Language   Modeling Training (Tool: SRILM toolkit) Translation   Modeling Training (Tool: Giza++ & Moses toolkit)
Phrase based alignment (The prophet,  الرسول ) (The prophet Mohamed,  الرسول محمد ) (great man,  رجل عظيم ) (Mohamed is a great man,  محمد رجل عظيم ) (The prophet Mohamed is a great man,  الرسول محمد رجل عظيم ) etc. Extract all phrase: English to Arabic word alignment  Arabic to English word alignment  Intersection of both alignments
PBSMT  drawbacks ,[object Object],[object Object],[object Object]
Agenda ,[object Object],[object Object],[object Object],[object Object]
MT Evaluation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Human Evaluation 5 Adequacy (1-5) 4 Fluency (1-5) مشرف  يتقابل مع  كبار المسؤولين المدنيين والعسكريين  بخصوص  ملف الحد من الانتشار النووي MT Musharraf Meets with Senior Civilian and Military Officials to Examine Nuclear Anti-Proliferation Dossier Source
Automatic Evaluation ,[object Object],[object Object],[object Object],[object Object],Higher BLEU Score مشرف يجتمع بكبار  المسؤولين المدنيين والعسكريين  لبحث  ملف الحد من الانتشار النووي Ref3 مشرف  يجتمع  مع  كبار المسؤولين المدنيين   والعسكريين  لدرس  ملف الحد من الانتشار النووي Ref2 مشرف يجتمع  بكبار  المسؤولين المدنيين والعسكريين  لدرس  ملف الحد من الانتشار النووي Ref1 مشرف   يجتمع   بالقادة   ال مدنيين و   الع سكريين   رفيعي المستوى  ل دراسة   ملف  وقف  الانتشار النووي MT2 مشرف   يتقابل   مع  كبار المسؤولين المدنيين والعسكريين   بخصوص   ملف الحد من الانتشار النووي MT1 Musharraf Meets with Senior Civilian and Military Officials to Examine Nuclear Anti-Proliferation Dossier Source
Agenda ,[object Object],[object Object],[object Object],[object Object]
Online MT Services Review البنتان قالتا أنّنا جيّدين SK 8 قال الاثنان بنات  " نحن جيّد " SY 7 ان فتاتين  " نحن جيدة " MS 6 الفتاتين وقال  " نحن جاهزون " GO 5 قالت الفتاتان  " نحن جيدات " The two girls said "we are good" B خمسة عشر بنتًا Sakhr Trjem (SK) 4 خمسة عشر بنات Systran translator  (SY) 3 خمسة عشر الفتيات MS-Bing Translator (MS) 2 خمسة عشر فتيات Google Translate (GO) 1 خمس عشرة فتاة Fifteen girls A Arabic Translation  Sentence/Translation Service
Thank you  شكراً

More Related Content

What's hot

SMT3
SMT3SMT3
NLP_KASHK:Text Normalization
NLP_KASHK:Text NormalizationNLP_KASHK:Text Normalization
NLP_KASHK:Text Normalization
Hemantha Kulathilake
 
NLP_KASHK:POS Tagging
NLP_KASHK:POS TaggingNLP_KASHK:POS Tagging
NLP_KASHK:POS Tagging
Hemantha Kulathilake
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translation
khyati gupta
 
CBAS: CONTEXT BASED ARABIC STEMMER
CBAS: CONTEXT BASED ARABIC STEMMERCBAS: CONTEXT BASED ARABIC STEMMER
CBAS: CONTEXT BASED ARABIC STEMMER
ijnlc
 
Ijetcas14 575
Ijetcas14 575Ijetcas14 575
Ijetcas14 575
Iasir Journals
 
Assamese to English Statistical Machine Translation
Assamese to English Statistical Machine TranslationAssamese to English Statistical Machine Translation
Assamese to English Statistical Machine Translation
Kalyanee Baruah
 
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesParallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Antonio Toral
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.ppt
butest
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language Model
Hemantha Kulathilake
 
7 probability and statistics an introduction
7 probability and statistics an introduction7 probability and statistics an introduction
7 probability and statistics an introduction
ThennarasuSakkan
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
Rushdi Shams
 
Intro to NLP. Lecture 2
Intro to NLP.  Lecture 2Intro to NLP.  Lecture 2
Intro to NLP. Lecture 2
Ekaterina Chernyak
 
Evaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutionsEvaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutions
Sajeed Mahaboob
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translation
guest873a50
 
HMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDIHMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDI
cscpconf
 
Barreiro-Batista-LR4NLP@Coling2018-presentation
Barreiro-Batista-LR4NLP@Coling2018-presentationBarreiro-Batista-LR4NLP@Coling2018-presentation
Barreiro-Batista-LR4NLP@Coling2018-presentation
INESC-ID (Spoken Language Systems Laboratory - L2F)
 
5a use of annotated corpus
5a use of annotated corpus5a use of annotated corpus
5a use of annotated corpus
ThennarasuSakkan
 
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABIRULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
ijnlc
 
Arabic MT Project
Arabic MT ProjectArabic MT Project
Arabic MT Project
Hind Abdulkhaleq
 

What's hot (20)

SMT3
SMT3SMT3
SMT3
 
NLP_KASHK:Text Normalization
NLP_KASHK:Text NormalizationNLP_KASHK:Text Normalization
NLP_KASHK:Text Normalization
 
NLP_KASHK:POS Tagging
NLP_KASHK:POS TaggingNLP_KASHK:POS Tagging
NLP_KASHK:POS Tagging
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translation
 
CBAS: CONTEXT BASED ARABIC STEMMER
CBAS: CONTEXT BASED ARABIC STEMMERCBAS: CONTEXT BASED ARABIC STEMMER
CBAS: CONTEXT BASED ARABIC STEMMER
 
Ijetcas14 575
Ijetcas14 575Ijetcas14 575
Ijetcas14 575
 
Assamese to English Statistical Machine Translation
Assamese to English Statistical Machine TranslationAssamese to English Statistical Machine Translation
Assamese to English Statistical Machine Translation
 
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesParallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.ppt
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language Model
 
7 probability and statistics an introduction
7 probability and statistics an introduction7 probability and statistics an introduction
7 probability and statistics an introduction
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
 
Intro to NLP. Lecture 2
Intro to NLP.  Lecture 2Intro to NLP.  Lecture 2
Intro to NLP. Lecture 2
 
Evaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutionsEvaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutions
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translation
 
HMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDIHMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDI
 
Barreiro-Batista-LR4NLP@Coling2018-presentation
Barreiro-Batista-LR4NLP@Coling2018-presentationBarreiro-Batista-LR4NLP@Coling2018-presentation
Barreiro-Batista-LR4NLP@Coling2018-presentation
 
5a use of annotated corpus
5a use of annotated corpus5a use of annotated corpus
5a use of annotated corpus
 
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABIRULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
 
Arabic MT Project
Arabic MT ProjectArabic MT Project
Arabic MT Project
 

Similar to Pbsmt presenation waleed_oransa_29_april2010

Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
Kalyanee Baruah
 
Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.
Sheeyam Shellvacumar
 
A deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationA deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine Translation
Lifeng (Aaron) Han
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
Shashank Shisodia
 
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Association for Computational Linguistics
 
Personalising speech to-speech translation
Personalising speech to-speech translationPersonalising speech to-speech translation
Personalising speech to-speech translation
behzad66
 
Hps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodHps a hierarchical persian stemming method
Hps a hierarchical persian stemming method
ijnlc
 
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
S URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELSS URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELS
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
ijnlc
 
"Machine Translation 101" and the Challenge of Patents
"Machine Translation 101" and the Challenge of Patents"Machine Translation 101" and the Challenge of Patents
"Machine Translation 101" and the Challenge of Patents
Iconic Translation Machines
 
Machine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsMachine Translation Approaches and Design Aspects
Machine Translation Approaches and Design Aspects
IOSR Journals
 
NLP
NLPNLP
NLP
NLPNLP
D2 anandkumar
D2 anandkumarD2 anandkumar
D2 anandkumar
Jasline Presilda
 
**JUNK** (no subject)
**JUNK** (no subject)**JUNK** (no subject)
**JUNK** (no subject)
muthukumaran.tdr95
 
Machine translation
Machine translationMachine translation
Machine translation
mohamed hassan
 
Effectof morphologicalsegmentation&de segmentationonmachinetranslation
Effectof morphologicalsegmentation&de segmentationonmachinetranslationEffectof morphologicalsegmentation&de segmentationonmachinetranslation
Effectof morphologicalsegmentation&de segmentationonmachinetranslation
Sunayana Gawde
 
Svetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov - Improved Word Alignments Using the Web as a CorpusSvetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov
 
English to Bangla Translation
English to Bangla TranslationEnglish to Bangla Translation
English to Bangla Translation
Saugata Bose
 
Machine translation ppt by shantanu arora
Machine translation ppt by shantanu aroraMachine translation ppt by shantanu arora
Machine translation ppt by shantanu arora
VaishnaviKhandelwal6
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
GeekNightHyderabad
 

Similar to Pbsmt presenation waleed_oransa_29_april2010 (20)

Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
 
Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.
 
A deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationA deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine Translation
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
 
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
 
Personalising speech to-speech translation
Personalising speech to-speech translationPersonalising speech to-speech translation
Personalising speech to-speech translation
 
Hps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodHps a hierarchical persian stemming method
Hps a hierarchical persian stemming method
 
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
S URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELSS URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELS
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
 
"Machine Translation 101" and the Challenge of Patents
"Machine Translation 101" and the Challenge of Patents"Machine Translation 101" and the Challenge of Patents
"Machine Translation 101" and the Challenge of Patents
 
Machine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsMachine Translation Approaches and Design Aspects
Machine Translation Approaches and Design Aspects
 
NLP
NLPNLP
NLP
 
NLP
NLPNLP
NLP
 
D2 anandkumar
D2 anandkumarD2 anandkumar
D2 anandkumar
 
**JUNK** (no subject)
**JUNK** (no subject)**JUNK** (no subject)
**JUNK** (no subject)
 
Machine translation
Machine translationMachine translation
Machine translation
 
Effectof morphologicalsegmentation&de segmentationonmachinetranslation
Effectof morphologicalsegmentation&de segmentationonmachinetranslationEffectof morphologicalsegmentation&de segmentationonmachinetranslation
Effectof morphologicalsegmentation&de segmentationonmachinetranslation
 
Svetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov - Improved Word Alignments Using the Web as a CorpusSvetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov - Improved Word Alignments Using the Web as a Corpus
 
English to Bangla Translation
English to Bangla TranslationEnglish to Bangla Translation
English to Bangla Translation
 
Machine translation ppt by shantanu arora
Machine translation ppt by shantanu aroraMachine translation ppt by shantanu arora
Machine translation ppt by shantanu arora
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 

Recently uploaded

PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
Dr. Shivangi Singh Parihar
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
Colégio Santa Teresinha
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
TechSoup
 
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPLAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
RAHUL
 
Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5
sayalidalavi006
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
Nicholas Montgomery
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
RitikBhardwaj56
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
amberjdewit93
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
Celine George
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
AyyanKhan40
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
How to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
Celine George
 
Life upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for studentLife upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for student
NgcHiNguyn25
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 

Recently uploaded (20)

PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
 
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPLAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UP
 
Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
How to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
 
Life upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for studentLife upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for student
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 

Pbsmt presenation waleed_oransa_29_april2010

  • 1. Statistical Machine Translation Waleed Oransa, M.Sc. College of Computing and Information Technology Arab Academy for Science and Technology Cairo, Egypt [email_address]
  • 2.
  • 3.
  • 4.
  • 5.
  • 6. Interlingua Semantic Syntactic Structure Word s Semantic Syntactic Structure Word s Direct Syntactic Transfer Semantic Transfer Source Language Text Target Language Text Conceptual Generation Semantic Generation Syntactic Generation Morphological Generation Conceptual Analysis Semantic Analysis Parsing Morphological Analysis Better Quality & More difficulty
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12. SMT Training Phase English Sentences PBSMT System Training Language Model (Arabic) Input: Training Corpus Arabic/English Bi-Text Output: Language Model and Translation Model Arabic Sentences Translation Model (English/Arabic) Language Modeling Training (Tool: SRILM toolkit) Translation Modeling Training (Tool: Giza++ & Moses toolkit) What is the Language Model?
  • 13.
  • 14. LM Role in SMT مشرف يجتمعوا مشرف تجتمع مشرف يجتمع مشرف يتقابل مشرف يقابل مشرف يجتمعا مشرف تجتمعا مشرف يجتمعن مشرف يجتمعون مشرف يتقابلان Language Model مشرف xxxx مع كبار المسؤولين المدنيين والعسكريين Language Model يجتمع يجتمعوا تجتمع يجتمع يتقابل يقابل يجتمعا تجتمعا يجتمعن يجتمعون يتقابلان
  • 15.
  • 16. SMT Training Phase English Sentences PBSMT System Training Language Model (Arabic) Input: Training Corpus Arabic/English Bi-Text Output: Language Model and Translation Model Arabic Sentences Translation Model (English/Arabic) Language Modeling Training (Tool: SRILM toolkit) Translation Modeling Training (Tool: Giza++ & Moses toolkit) What is the Translation Model?
  • 17.
  • 18.
  • 19.
  • 20.
  • 21. How TM & LM work together? Musharraf Meets with Senior Civilian and Military Officials مشرف ***** مع كبار المسؤولين المدنيين والعسكريين يجتمع Language Model Translation Model يجتمعوا تجتمع يجتمع يتقابل يقابل يجتمعا تجتمعا يجتمعن يجتمعون يتقابلان
  • 22.
  • 23.
  • 24. PBSMT Training Phase English Sentences PBSMT System Training Language Model (Arabic) Input: Training Corpus Arabic/English Bi-Text Output: Language Model and Translation Model PBSMT Normal Training Phase Arabic Sentences Translation Model Phrase Table (English/Arabic) Language Modeling Training (Tool: SRILM toolkit) Translation Modeling Training (Tool: Giza++ & Moses toolkit)
  • 25. Phrase based alignment (The prophet, الرسول ) (The prophet Mohamed, الرسول محمد ) (great man, رجل عظيم ) (Mohamed is a great man, محمد رجل عظيم ) (The prophet Mohamed is a great man, الرسول محمد رجل عظيم ) etc. Extract all phrase: English to Arabic word alignment Arabic to English word alignment Intersection of both alignments
  • 26.
  • 27.
  • 28.
  • 29. Human Evaluation 5 Adequacy (1-5) 4 Fluency (1-5) مشرف يتقابل مع كبار المسؤولين المدنيين والعسكريين بخصوص ملف الحد من الانتشار النووي MT Musharraf Meets with Senior Civilian and Military Officials to Examine Nuclear Anti-Proliferation Dossier Source
  • 30.
  • 31.
  • 32. Online MT Services Review البنتان قالتا أنّنا جيّدين SK 8 قال الاثنان بنات " نحن جيّد " SY 7 ان فتاتين " نحن جيدة " MS 6 الفتاتين وقال " نحن جاهزون " GO 5 قالت الفتاتان " نحن جيدات " The two girls said "we are good" B خمسة عشر بنتًا Sakhr Trjem (SK) 4 خمسة عشر بنات Systran translator (SY) 3 خمسة عشر الفتيات MS-Bing Translator (MS) 2 خمسة عشر فتيات Google Translate (GO) 1 خمس عشرة فتاة Fifteen girls A Arabic Translation Sentence/Translation Service
  • 33. Thank you شكراً