SlideShare a Scribd company logo
1 of 19
Download to read offline
How to Know Best Machine
Translation System in Advance
before Translating a
Sentence?
Bibekananda Kundu and Sanjay Kumar Choudhury
Centre for Development of Advanced
Computing
December 19, 2014
{ bibekananda.kundu,sanjay.choudhury }@cdac.in
2/19
Contents
* Research Problem
* Methodology and Contributions
* Feature Set for Selecting Best MT System
* Experiments
* Results and Discussion
* Conclusions
3/19
* Research Problem
....................
Was my camera repaired already?
.
MT1
.
MT2
.
MT3
...
আমার ক ােমরা িক ইিতমে মরামত করা হে িছল ?
.
আমার ক ােমরা িক ইিতমে মরামত করা িছেলা ?
.
িক আমার ক ােমরা ইতঃ েব মরামত করা হেয়িছল ?
.
আমার ক ােমরা ইিতমে মরামত করা ?
.
আমার ক ােমরা ইিতমে িছল মরামত ?
How to identify a MT system from a set of multiple MT
systems in advance, capable of producing most appropriate
translation for a source sentence without having any idea
about working principle of these MT systems.
4/19
5/19
6/19
7/19
8/19
* Feature Set for Selecting Best MT System
* Phrase-structure features : represent structural
complexity of a sentence.
* Dependency based features: represent how words in
a sentence depend on each other even for long distances.
* Probabilistic features: represent complexity in
term of out-of-vocabulary (OOV), likelihood of a
sentence, likelihood of a dependency relation, mapping
capability of a source word to multiple target words or
vice versa.
9/19
* Feature Set for Selecting Best MT System
* Phrase-structure features :
.
Number of Unique POS Tags (NUPT)
.
POS Tag Density (PTD)
.
Maximum and Mean Depth
.
Number of Internal nodes
.
Maximum and Mean Number of Child Nodes for
each Node
....S1...
..SQ.....
......
..?
.
....
..VP.....
..ADVP...
..RB...
..already
.
..
..VBN...
..repaired
.
....
..NP.....
..NN...
..camera
.
..
..PRP$...
..my
.
..
..AUX...
..Was
10
/19
* Feature Set for Selecting Best MT System
* Probabilistic features :
.
Joint Probability of Input Sentence (JPIS): We
have approximated JPIS using trigram sequences.
P(S = w1w2w3 · · · wn) = P(w1)
× P(w2|w1)
× P(w3|w1w2)
× · · ·
× P(wn|wn−2wn−1)
11/19
* Feature Set for Selecting Best MT System
* Probabilistic features :
Joint Probability Using N-gram Dependency (JPUND):
Dependency based language model is reported in
(Shen et al. 2008). JPUND for the dependency tree is
calculated as :
JPUND = PT (repaired)
× PL(camera | repairedhead )
× PL(my | camerahead )
× PL(was | my, camerahead )
× PR (already | repairedhead )
× PR (? | already, repairedhead )
..
Was
.
my
.
camera
.
repaired
.
already
.
nsubj
.
pos
.
advmod
.
cop
.
?
.
punct
.
ROOT
Figure : A dependency tree.
12/19
* Feature Set for Selecting Best MT System
* Dependency based features :
.
Number of Dependency Link (NDL)
.
Maximum Dependency Distance (MDD)
.
Maximum amongst the Number of Dependent of a
Word (MNDW)
..
Was
.
my
.
camera
.
repaired
.
already
.
nsubj
.
pos
.
advmod
.
cop
.
?
.
punct
.
ROOT
Figure : A dependency tree.
13/19
* Experiemnts
* Questions to Answer. Can features extracted from source sentences predict
the quality of a MT system?
. Which machine learning algorithm is most
appropriate for this classification task?
. How selection of different types of features influences
the performances of classifiers?
14/19
* Experiemnts
* English-Bangla MT Systems
. AnglaMT: http://tdil-dc.in
. GoogleMT: https://translate.google.co.in
* Data Preparation
. 20K Basic Travel Expression Corpus (BTEC)
. 50K ILCI corpus: http://www.tdil-dc.in/
* Tools used in this experiments
. WEKA: http://www.cs.waikato.ac.nz/ ml/weka
. Charniak parser: http://cs.brown.edu/ ec/
. Malt parser: http://www.maltparser.org/
. Moses toolkit: http://www.statmt.org/moses/
15/19
* Results and Discussion
16/19
* Conclusions
. A machine learning approach for selecting a MT system
producing most appropriate translation before translating the
input sentence.
. Our approach uses phrase-structure, probabilistic and
dependency features.
. Features used in this paper can also be applied on similar
NLP tasks where measuring confidence of the system is
required.
. Experiment shows IB1 classifier provides best performance
when compare to other classifiers.
17/19
18/19
* Courtesy
19/19
Thank you

More Related Content

What's hot

ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
csandit
 
Adaptive blind multiuser detection under impulsive noise using principal comp...
Adaptive blind multiuser detection under impulsive noise using principal comp...Adaptive blind multiuser detection under impulsive noise using principal comp...
Adaptive blind multiuser detection under impulsive noise using principal comp...
csandit
 
Image compression introductory presentation
Image compression introductory presentationImage compression introductory presentation
Image compression introductory presentation
Tariq Abbas
 

What's hot (11)

IRJET - Clustering Algorithm for Brain Image Segmentation
IRJET - Clustering Algorithm for Brain Image SegmentationIRJET - Clustering Algorithm for Brain Image Segmentation
IRJET - Clustering Algorithm for Brain Image Segmentation
 
IMAGE AUTHENTICATION THROUGH ZTRANSFORM WITH LOW ENERGY AND BANDWIDTH (IAZT)
IMAGE AUTHENTICATION THROUGH ZTRANSFORM WITH LOW ENERGY AND BANDWIDTH (IAZT)IMAGE AUTHENTICATION THROUGH ZTRANSFORM WITH LOW ENERGY AND BANDWIDTH (IAZT)
IMAGE AUTHENTICATION THROUGH ZTRANSFORM WITH LOW ENERGY AND BANDWIDTH (IAZT)
 
Automatic Determination Number of Cluster for NMKFC-Means Algorithms on Image...
Automatic Determination Number of Cluster for NMKFC-Means Algorithms on Image...Automatic Determination Number of Cluster for NMKFC-Means Algorithms on Image...
Automatic Determination Number of Cluster for NMKFC-Means Algorithms on Image...
 
Lecture 4 Relationship between pixels
Lecture 4 Relationship between pixelsLecture 4 Relationship between pixels
Lecture 4 Relationship between pixels
 
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
MULTI-OBJECTIVE ENERGY EFFICIENT OPTIMIZATION ALGORITHM FOR COVERAGE CONTROL ...
 
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
 
Adaptive blind multiuser detection under impulsive noise using principal comp...
Adaptive blind multiuser detection under impulsive noise using principal comp...Adaptive blind multiuser detection under impulsive noise using principal comp...
Adaptive blind multiuser detection under impulsive noise using principal comp...
 
Image compression introductory presentation
Image compression introductory presentationImage compression introductory presentation
Image compression introductory presentation
 
BER Performance of Antenna Array-Based Receiver using Multi-user Detection in...
BER Performance of Antenna Array-Based Receiver using Multi-user Detection in...BER Performance of Antenna Array-Based Receiver using Multi-user Detection in...
BER Performance of Antenna Array-Based Receiver using Multi-user Detection in...
 
Multimedia lossy compression algorithms
Multimedia lossy compression algorithmsMultimedia lossy compression algorithms
Multimedia lossy compression algorithms
 
An Efficient Multiplierless Transform algorithm for Video Coding
An Efficient Multiplierless Transform algorithm for Video CodingAn Efficient Multiplierless Transform algorithm for Video Coding
An Efficient Multiplierless Transform algorithm for Video Coding
 

Similar to How to Know Best Machine Translation System in Advance before Translating a Sentence?

Internship project report,Predictive Modelling
Internship project report,Predictive ModellingInternship project report,Predictive Modelling
Internship project report,Predictive Modelling
Amit Kumar
 
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
Mladen Jovanovic
 

Similar to How to Know Best Machine Translation System in Advance before Translating a Sentence? (20)

Methodological study of opinion mining and sentiment analysis techniques
Methodological study of opinion mining and sentiment analysis techniquesMethodological study of opinion mining and sentiment analysis techniques
Methodological study of opinion mining and sentiment analysis techniques
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
 
Internship project report,Predictive Modelling
Internship project report,Predictive ModellingInternship project report,Predictive Modelling
Internship project report,Predictive Modelling
 
Itc542 network design research
Itc542 network design researchItc542 network design research
Itc542 network design research
 
Methodological Study Of Opinion Mining And Sentiment Analysis Techniques
Methodological Study Of Opinion Mining And Sentiment Analysis Techniques  Methodological Study Of Opinion Mining And Sentiment Analysis Techniques
Methodological Study Of Opinion Mining And Sentiment Analysis Techniques
 
Kaggle kenneth
Kaggle kennethKaggle kenneth
Kaggle kenneth
 
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
 
Machine learning_ Replicating Human Brain
Machine learning_ Replicating Human BrainMachine learning_ Replicating Human Brain
Machine learning_ Replicating Human Brain
 
IRJET- Factoid Question and Answering System
IRJET-  	  Factoid Question and Answering SystemIRJET-  	  Factoid Question and Answering System
IRJET- Factoid Question and Answering System
 
Session-Based Recommendations with Recurrent Neural Networks (Balazs Hidasi, ...
Session-Based Recommendations with Recurrent Neural Networks(Balazs Hidasi, ...Session-Based Recommendations with Recurrent Neural Networks(Balazs Hidasi, ...
Session-Based Recommendations with Recurrent Neural Networks (Balazs Hidasi, ...
 
Algoritma Random Forest beserta aplikasi nya
Algoritma Random Forest beserta aplikasi nyaAlgoritma Random Forest beserta aplikasi nya
Algoritma Random Forest beserta aplikasi nya
 
IRJET- Intrusion Detection based on J48 Algorithm
IRJET- Intrusion Detection based on J48 AlgorithmIRJET- Intrusion Detection based on J48 Algorithm
IRJET- Intrusion Detection based on J48 Algorithm
 
CONSTRUCTING A FUZZY NETWORK INTRUSION CLASSIFIER BASED ON DIFFERENTIAL EVOLU...
CONSTRUCTING A FUZZY NETWORK INTRUSION CLASSIFIER BASED ON DIFFERENTIAL EVOLU...CONSTRUCTING A FUZZY NETWORK INTRUSION CLASSIFIER BASED ON DIFFERENTIAL EVOLU...
CONSTRUCTING A FUZZY NETWORK INTRUSION CLASSIFIER BASED ON DIFFERENTIAL EVOLU...
 
Applications of Pattern Recognition Algorithms in Agriculture: A Review
Applications of Pattern Recognition Algorithms in Agriculture: A ReviewApplications of Pattern Recognition Algorithms in Agriculture: A Review
Applications of Pattern Recognition Algorithms in Agriculture: A Review
 
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
 
Natural Computing for Vehicular Networks
Natural Computing for Vehicular NetworksNatural Computing for Vehicular Networks
Natural Computing for Vehicular Networks
 
Text Summarization of Food Reviews using AbstractiveSummarization and Recurre...
Text Summarization of Food Reviews using AbstractiveSummarization and Recurre...Text Summarization of Food Reviews using AbstractiveSummarization and Recurre...
Text Summarization of Food Reviews using AbstractiveSummarization and Recurre...
 
ON THE PERFORMANCE OF INTRUSION DETECTION SYSTEMS WITH HIDDEN MULTILAYER NEUR...
ON THE PERFORMANCE OF INTRUSION DETECTION SYSTEMS WITH HIDDEN MULTILAYER NEUR...ON THE PERFORMANCE OF INTRUSION DETECTION SYSTEMS WITH HIDDEN MULTILAYER NEUR...
ON THE PERFORMANCE OF INTRUSION DETECTION SYSTEMS WITH HIDDEN MULTILAYER NEUR...
 
On The Performance of Intrusion Detection Systems with Hidden Multilayer Neur...
On The Performance of Intrusion Detection Systems with Hidden Multilayer Neur...On The Performance of Intrusion Detection Systems with Hidden Multilayer Neur...
On The Performance of Intrusion Detection Systems with Hidden Multilayer Neur...
 
Real Estate Investment Advising Using Machine Learning
Real Estate Investment Advising Using Machine LearningReal Estate Investment Advising Using Machine Learning
Real Estate Investment Advising Using Machine Learning
 

Recently uploaded

Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 

Recently uploaded (20)

Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 

How to Know Best Machine Translation System in Advance before Translating a Sentence?

  • 1. How to Know Best Machine Translation System in Advance before Translating a Sentence? Bibekananda Kundu and Sanjay Kumar Choudhury Centre for Development of Advanced Computing December 19, 2014 { bibekananda.kundu,sanjay.choudhury }@cdac.in
  • 2. 2/19 Contents * Research Problem * Methodology and Contributions * Feature Set for Selecting Best MT System * Experiments * Results and Discussion * Conclusions
  • 3. 3/19 * Research Problem .................... Was my camera repaired already? . MT1 . MT2 . MT3 ... আমার ক ােমরা িক ইিতমে মরামত করা হে িছল ? . আমার ক ােমরা িক ইিতমে মরামত করা িছেলা ? . িক আমার ক ােমরা ইতঃ েব মরামত করা হেয়িছল ? . আমার ক ােমরা ইিতমে মরামত করা ? . আমার ক ােমরা ইিতমে িছল মরামত ? How to identify a MT system from a set of multiple MT systems in advance, capable of producing most appropriate translation for a source sentence without having any idea about working principle of these MT systems.
  • 8. 8/19 * Feature Set for Selecting Best MT System * Phrase-structure features : represent structural complexity of a sentence. * Dependency based features: represent how words in a sentence depend on each other even for long distances. * Probabilistic features: represent complexity in term of out-of-vocabulary (OOV), likelihood of a sentence, likelihood of a dependency relation, mapping capability of a source word to multiple target words or vice versa.
  • 9. 9/19 * Feature Set for Selecting Best MT System * Phrase-structure features : . Number of Unique POS Tags (NUPT) . POS Tag Density (PTD) . Maximum and Mean Depth . Number of Internal nodes . Maximum and Mean Number of Child Nodes for each Node ....S1... ..SQ..... ...... ..? . .... ..VP..... ..ADVP... ..RB... ..already . .. ..VBN... ..repaired . .... ..NP..... ..NN... ..camera . .. ..PRP$... ..my . .. ..AUX... ..Was
  • 10. 10 /19 * Feature Set for Selecting Best MT System * Probabilistic features : . Joint Probability of Input Sentence (JPIS): We have approximated JPIS using trigram sequences. P(S = w1w2w3 · · · wn) = P(w1) × P(w2|w1) × P(w3|w1w2) × · · · × P(wn|wn−2wn−1)
  • 11. 11/19 * Feature Set for Selecting Best MT System * Probabilistic features : Joint Probability Using N-gram Dependency (JPUND): Dependency based language model is reported in (Shen et al. 2008). JPUND for the dependency tree is calculated as : JPUND = PT (repaired) × PL(camera | repairedhead ) × PL(my | camerahead ) × PL(was | my, camerahead ) × PR (already | repairedhead ) × PR (? | already, repairedhead ) .. Was . my . camera . repaired . already . nsubj . pos . advmod . cop . ? . punct . ROOT Figure : A dependency tree.
  • 12. 12/19 * Feature Set for Selecting Best MT System * Dependency based features : . Number of Dependency Link (NDL) . Maximum Dependency Distance (MDD) . Maximum amongst the Number of Dependent of a Word (MNDW) .. Was . my . camera . repaired . already . nsubj . pos . advmod . cop . ? . punct . ROOT Figure : A dependency tree.
  • 13. 13/19 * Experiemnts * Questions to Answer. Can features extracted from source sentences predict the quality of a MT system? . Which machine learning algorithm is most appropriate for this classification task? . How selection of different types of features influences the performances of classifiers?
  • 14. 14/19 * Experiemnts * English-Bangla MT Systems . AnglaMT: http://tdil-dc.in . GoogleMT: https://translate.google.co.in * Data Preparation . 20K Basic Travel Expression Corpus (BTEC) . 50K ILCI corpus: http://www.tdil-dc.in/ * Tools used in this experiments . WEKA: http://www.cs.waikato.ac.nz/ ml/weka . Charniak parser: http://cs.brown.edu/ ec/ . Malt parser: http://www.maltparser.org/ . Moses toolkit: http://www.statmt.org/moses/
  • 15. 15/19 * Results and Discussion
  • 16. 16/19 * Conclusions . A machine learning approach for selecting a MT system producing most appropriate translation before translating the input sentence. . Our approach uses phrase-structure, probabilistic and dependency features. . Features used in this paper can also be applied on similar NLP tasks where measuring confidence of the system is required. . Experiment shows IB1 classifier provides best performance when compare to other classifiers.
  • 17. 17/19