SlideShare a Scribd company logo
Answer Selection and Validation for 
Arabic Questions 
Ahmed Magdy Ezzeldin, Yasser El-Sonbaty, Mohamed Hamed Kholief 
College of Computing and Information Technology, AASTMT 
Alexandria, Egypt
Outline 
● Arabic QA and its significance 
● QA pipeline 
● Importance of answer selection to QA 
● QA4MRE (Question Answering for Machine Reading Evaluation) 
● Motivation 
● Objectives 
● Test-set & Evaluation metrics 
● Related Works 
● Tools and resources 
● ALQASIM 1.0 
● ALQASIM 2.0 
● Evaluation and discussion 
● Future work
Arabic QA and its significance 
● Question Answering (QA) : automatically 
providing an answer for a question posed by a 
human in a natural language 
● Significance of Arabic QA 
– Arabic has more than 350 million native speakers 
– A lot of Arabic content on the Internet 
– High need for fast precise information 
– IR based approaches do not satisfy this need
QA pipeline 
● Question Analysis 
● Passage Retrieval 
● Answer Extraction
Importance of Answer Selection to QA 
● Error propagation in QA pipeline makes 
systems hit an upper bound of 60% 
● Most QA systems provide 5 answer choices 
for each question and most of the time the 
answer is not in the first choice 
● Systems should be certain about the questions 
they answer
QA4MRE 
● Question Answering for Machine Reading Evaluation 
● Skips the answer generation tasks of QA (Passage 
Retrieval) 
● Focuses only on the answer selection and 
validation subtasks 
● A typical QA4MRE system chooses 1 answer form 5 
choices supported with 1 document that contains 
the answer 
● QA4MRE is used interchangeably with Answer 
Selection and Validation
Motivation 
● Create an efficient answer selection and 
validation module to: 
– Improve the performance of any Arabic QA 
system 
– Reduce the effect of error that propagates 
through the QA pipeline by selecting the correct 
answers only 
– Enhance the certainty of Arabic QA systems
Objectives 
● Mimic the human behavior by analyzing the 
reading test document instead of just 
analyzing the question 
● Apply syntactic & semantic analysis and 
expansion to the reading test document and 
the question 
● Make use of background knowledge to gain 
more contextual knowledge about the test 
document and question keywords
Test-set 
● The QA4MRE test-set is composed of: 
– 4 topics ("AIDS", "Climate change", "Music and Society", and "Alzheimer") 
– 16 test documents (4 documents per topic) 
– 160 questions (10 questions per document) 
– 800 answer choices/options (5 per question) 
– 4 background collections of documents (1 per topic) 
● Various question types 
– Factoid: (where, when, by-whom) 
– Causal: (what was the cause/result of event X?) 
– Method: (how did X do Y? or in what way did X come about?) 
– Purpose: (why was X brought about? or what was the reason for doing X?) 
– Which is true: (what can a 14 year old girl do?)
Evaluation Metrics 
● Accuracy & C@1 (Correctness at 1) 
● C@1 is introduced in CLEF (Conference and Labs of the 
Evaluation Forum) 2011 by Penas et al. 
● Gives partial credit for systems that leave some 
questions unanswered in cases of uncertainty
Related Works 
● The 2 Arabic QA4MRE systems in CLEF 2012 
- Trigui et al. 2012 
- Abouenour et al. 2012 (IDRAAQ) 
● Best performing QA4MRE system on the same 
test-set uses the English test-set 
- Bhaskar et al. 2012 (English QA4MRE)
Related Works [continued] 
● Abouenour et al. 2012 (IDRAAQ) 
– Accuracy : 0.13 
– C@1 : 0.21 
● Used JIRS (Java Information Retrieval System) for 
passage retrieval (Distance Density N-gram Model) 
● Semantic expansion using Arabic WordNet (AWN) 
● Did not use the CLEF background collections
Related Works [continued] 
● Trigui et al. 2012: 
– Accuracy : 0.19 
– C@1 : 0.19 
● Has not marked any questions as unanswered 
● Retrieves the passages that have the question keywords and 
aligns them with the answer choices 
● Finds the best answer choice in the retrieved passages 
● Expands the answer choices that could not be found in the 
retrieved passages using some inference rules on the 
background collection 
● Depends on the background collection as it offers enough 
redundancy for the passage retrieval module
Related Works [continued] 
● Bhaskar et al. 2012 (on the English test-set) 
– Accuracy : 0.53 
– C@1 : 0.65 
● Combines each answer choice with the question in 
a hypothesis 
● Searches for the hypothesis keywords in the 
document 
● Uses textual entailment to rank the retrieved 
passages
Related Works [continued] 
● The 2 Arabic QA4MRE systems 
– Use the typical QA pipeline that depend mainly on 
passage retrieval 
– Do not analyze the reading test document 
● The English system (Bhaskar et al.) 
– Analyzes the reading test document 
– Uses many English specific NLP tools that are not 
found in Arabic (at least in the public domain) like: 
● semantic parsing 
● semantic role labeling 
● dependency analysis
●Tools and resources 
● MADA+TOKAN (Morphological Analyzer) 
– PoS tags, gender, number & light stem 
● Arabic WordNet (AWN) 
– Semantic expansion by (synonyms) 
● Root Stemmers 
– Khoja 
– ISRI 
– Tashaphyne
ALQASIM 1.0 
● Ezzeldin, A. M., Kholief, M. H., & El-Sonbaty, Y. 
(September 2013). ALQASIM: Arabic language 
question answer selection in machines. In 
Information Access Evaluation. Multilinguality, 
Multimodality, and Visualization (pp. 100-103). 
Springer Berlin Heidelberg.
ALQASIM 1.0
Comparison between ALQASIM 1.0 and 
other Arabic QA4MRE systems
ALQASIM 1.0 Pros & Cons 
● Pros 
- Analyzing the reading test document provides a 
better context for the morphological analyzer 
- Distance between the keywords and between 
Q & A found snippets. 
- Weights according to PoS tags 
● Cons 
- Does not consider the natural boundary of 
sentences 
- Prefer distance over keyword weights 
- Has many manually adjusted weights and 
thresholds that may make the system over-fit for 
the test-set.
ALQASIM 1.0 Pros & Cons [continued] 
● ALQASIM 1.0 could not identify the question snippet 
because its keywords are a bit far from each other.
ALQASIM 2.0 
● Ezzeldin, A. M., El-Sonbaty, Y., & Kholief, M. H. 
(October 2014). Exploring the Effects of Root 
Expansion, Sentence Splitting and Ontology 
on Arabic Answer Selection. In Proceedings of 
the 11th International Workshop on Natural 
Language Processing and Cognitive Science 
(NLPCS 2014), Venice, Italy.
High Level Architecture
Document Analysis Module
Question Analysis Module
Answer Selection Module
ALQASIM 2.0 [continued] 
A basic ontology of hypernyms and their hyponyms 
generated automatically from the background 
collection
Example 1: Sentence Splitting
Example 2: Ontology Expansion
Example 3: Root Expansion
Evaluation 
Description Correct Unanswered Wrong Accuracy C@1 
ALQASIM 1.0 49 30 81 0.31 0.36 
Run (1) Baseline 46 51 63 0.29 0.38 
Run (2) Baseline + 
45 46 69 0.28 0.36 
AWN Semantic 
Expansion 
Run (3) Baseline + Bg 
Ontology Expansion 
51 41 68 0.32 0.40 
Run (4) Baseline + Bg 
Ontology Expansion + 
Root Expansion 
57 29 74 0.36 0.42
Evaluation [continued]
Evaluation [continued] 
Root Stemmer Accuracy C@1 
ALQASIM 2.0 without root expansion 0.32 0.4 
ISRI 0.36 0.42 
Khoja 0.32 0.4 
Tashaphyne 0.31 0.36
Discussion 
● Sentence splitting helps identify the correct 
and the incorrect answers 
● AWN expansion degrades performance by 1 
to 2% due to its generic nature 
● Root expansion helps answer selection 
especially with highly derivational texts 
● An automatically generated ontology 
specifically created for the test topic improves 
performance and can be used to boost some 
question patterns
Root Stemmers 
● Khoja vs ISRI on 15163 words from the test-set. 
Khoja ISRI 
Possible root (a token that is not 
equal to the light stem or the original 
word) 
36 0.23% 11324 74.68 
% 
Light stem (already generated by 
MADA) 
3647 24.05% 3204 21.13 
% 
Original Word 11480 75.71% 635 4.18% 
Tashaphyne 
Root 
Original Words English Translation 
ماء" ، "سوءا" ، "سواء" ، "وباء" ء " “water”, “worse”, “alike”, “epidemic” 
آلت" ، "آفات" آا " “machinery”, “pests” 
أي" ، "أتى" أ " “any”, “came”
Future Work 
● Automatically building better ontologies 
● Anaphora resolution 
● Semantic parsing and semantic role labeling 
● Applying Arabic specific rule-based 
techniques
Published Papers 
● Ezzeldin, A. M., Kholief, M. H., & El-Sonbaty, Y. (September 2013). 
ALQASIM: Arabic language question answer selection in machines. In 
Information Access Evaluation. Multilinguality, Multimodality, and 
Visualization (pp. 100-103). Springer Berlin Heidelberg. 
● Ezzeldin, A. M., El-Sonbaty, Y., & Kholief, M. H. (October 2014). 
Exploring the Effects of Root Expansion, Sentence Splitting and 
Ontology on Arabic Answer Selection. In Proceedings of the 11th 
International Workshop on Natural Language Processing and Cognitive 
Science (NLPCS 2014), Venice, Italy.

More Related Content

What's hot

Practical machine learning - Part 1
Practical machine learning - Part 1Practical machine learning - Part 1
Practical machine learning - Part 1
Traian Rebedea
 
Question Answering System using machine learning approach
Question Answering System using machine learning approachQuestion Answering System using machine learning approach
Question Answering System using machine learning approach
Garima Nanda
 
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Chinese Character Decomposition for  Neural MT with Multi-Word ExpressionsChinese Character Decomposition for  Neural MT with Multi-Word Expressions
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Lifeng (Aaron) Han
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...
alessio_ferrari
 
Searching with vectors
Searching with vectorsSearching with vectors
Searching with vectors
Simon Hughes
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and Challenges
Jens Lehmann
 
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an OverviewNatural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
alessio_ferrari
 
Scaling Quality on Quora Using Machine Learning
Scaling Quality on Quora Using Machine LearningScaling Quality on Quora Using Machine Learning
Scaling Quality on Quora Using Machine Learning
Vo Viet Anh
 
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
Lifeng (Aaron) Han
 
Vectors in Search - Towards More Semantic Matching
Vectors in Search - Towards More Semantic MatchingVectors in Search - Towards More Semantic Matching
Vectors in Search - Towards More Semantic Matching
Simon Hughes
 
Sentence representations and question answering (YerevaNN)
Sentence representations and question answering (YerevaNN)Sentence representations and question answering (YerevaNN)
Sentence representations and question answering (YerevaNN)
YerevaNN research lab
 
Meta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsMeta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methods
Lifeng (Aaron) Han
 
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Lifeng (Aaron) Han
 
Question answering
Question answeringQuestion answering
Question answering
Nafiseh Navabpour
 
Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLP
GVS Chaitanya
 
Offline evaluation of recommender systems: all pain and no gain?
Offline evaluation of recommender systems: all pain and no gain?Offline evaluation of recommender systems: all pain and no gain?
Offline evaluation of recommender systems: all pain and no gain?
Mark Levy
 
VOC real world enterprise needs
VOC real world enterprise needsVOC real world enterprise needs
VOC real world enterprise needs
Ivan Berlocher
 
How useful are semantic links for the detection of implicit references in csc...
How useful are semantic links for the detection of implicit references in csc...How useful are semantic links for the detection of implicit references in csc...
How useful are semantic links for the detection of implicit references in csc...
Traian Rebedea
 
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel CorporaMultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
Lifeng (Aaron) Han
 
Ajeb First Seminar
Ajeb First SeminarAjeb First Seminar
Ajeb First SeminarKhaled Sayed
 

What's hot (20)

Practical machine learning - Part 1
Practical machine learning - Part 1Practical machine learning - Part 1
Practical machine learning - Part 1
 
Question Answering System using machine learning approach
Question Answering System using machine learning approachQuestion Answering System using machine learning approach
Question Answering System using machine learning approach
 
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Chinese Character Decomposition for  Neural MT with Multi-Word ExpressionsChinese Character Decomposition for  Neural MT with Multi-Word Expressions
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...
 
Searching with vectors
Searching with vectorsSearching with vectors
Searching with vectors
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and Challenges
 
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an OverviewNatural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
 
Scaling Quality on Quora Using Machine Learning
Scaling Quality on Quora Using Machine LearningScaling Quality on Quora Using Machine Learning
Scaling Quality on Quora Using Machine Learning
 
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
 
Vectors in Search - Towards More Semantic Matching
Vectors in Search - Towards More Semantic MatchingVectors in Search - Towards More Semantic Matching
Vectors in Search - Towards More Semantic Matching
 
Sentence representations and question answering (YerevaNN)
Sentence representations and question answering (YerevaNN)Sentence representations and question answering (YerevaNN)
Sentence representations and question answering (YerevaNN)
 
Meta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsMeta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methods
 
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
 
Question answering
Question answeringQuestion answering
Question answering
 
Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLP
 
Offline evaluation of recommender systems: all pain and no gain?
Offline evaluation of recommender systems: all pain and no gain?Offline evaluation of recommender systems: all pain and no gain?
Offline evaluation of recommender systems: all pain and no gain?
 
VOC real world enterprise needs
VOC real world enterprise needsVOC real world enterprise needs
VOC real world enterprise needs
 
How useful are semantic links for the detection of implicit references in csc...
How useful are semantic links for the detection of implicit references in csc...How useful are semantic links for the detection of implicit references in csc...
How useful are semantic links for the detection of implicit references in csc...
 
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel CorporaMultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
 
Ajeb First Seminar
Ajeb First SeminarAjeb First Seminar
Ajeb First Seminar
 

Viewers also liked

Style Me Confident Red Nose Day
Style Me Confident Red Nose DayStyle Me Confident Red Nose Day
Style Me Confident Red Nose Daystylemeconfident
 
Make Earth Day Every Day
Make Earth Day Every Day Make Earth Day Every Day
Make Earth Day Every Day
Julie-Ann
 
Ible or able
Ible or ableIble or able
Ible or able
lgio64
 
Comic Relief and Red Nose Day
Comic Relief and Red Nose DayComic Relief and Red Nose Day
Comic Relief and Red Nose Day
Sandy Millin
 
Human digestive system
Human digestive systemHuman digestive system
Human digestive systemSimren Cena
 
The Digestive System Powerpoint
The Digestive System   PowerpointThe Digestive System   Powerpoint
The Digestive System Powerpointangellacx
 
The Digestive System
The Digestive SystemThe Digestive System
The Digestive System
science_lablinks
 

Viewers also liked (9)

Style Me Confident Red Nose Day
Style Me Confident Red Nose DayStyle Me Confident Red Nose Day
Style Me Confident Red Nose Day
 
Make Earth Day Every Day
Make Earth Day Every Day Make Earth Day Every Day
Make Earth Day Every Day
 
Ible or able
Ible or ableIble or able
Ible or able
 
Bullying powerpoint
Bullying powerpointBullying powerpoint
Bullying powerpoint
 
Comic relief lesson
Comic relief lessonComic relief lesson
Comic relief lesson
 
Comic Relief and Red Nose Day
Comic Relief and Red Nose DayComic Relief and Red Nose Day
Comic Relief and Red Nose Day
 
Human digestive system
Human digestive systemHuman digestive system
Human digestive system
 
The Digestive System Powerpoint
The Digestive System   PowerpointThe Digestive System   Powerpoint
The Digestive System Powerpoint
 
The Digestive System
The Digestive SystemThe Digestive System
The Digestive System
 

Similar to Answer Selection and Validation for Arabic Questions

Cser13.ppt
Cser13.pptCser13.ppt
Search Solutions 2015: Towards a new model of search relevance testing
Search Solutions 2015:  Towards a new model of search relevance testingSearch Solutions 2015:  Towards a new model of search relevance testing
Search Solutions 2015: Towards a new model of search relevance testing
Charlie Hull
 
Detection of SOA Antipatterns
Detection of SOA AntipatternsDetection of SOA Antipatterns
Detection of SOA Antipatterns
Francis Palma
 
QUrdPro: Query processing system for Urdu Language
QUrdPro: Query processing system for Urdu LanguageQUrdPro: Query processing system for Urdu Language
QUrdPro: Query processing system for Urdu Language
IJERA Editor
 
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
IRJET Journal
 
Lepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLifeng (Aaron) Han
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
Lifeng (Aaron) Han
 
Development and evaluation of a web based question answering system for arabi...
Development and evaluation of a web based question answering system for arabi...Development and evaluation of a web based question answering system for arabi...
Development and evaluation of a web based question answering system for arabi...
csandit
 
Triantafyllia Voulibasi
Triantafyllia VoulibasiTriantafyllia Voulibasi
Triantafyllia Voulibasi
ISSEL
 
Cser13.ppt
Cser13.pptCser13.ppt
Cser13.ppt
Ptidej Team
 
Artificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software TestingArtificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software Testing
Lionel Briand
 
How to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated TestingHow to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated Testing
TechWell
 
Orthogonal array testing course
Orthogonal array testing courseOrthogonal array testing course
Orthogonal array testing course
Narayanan Palani
 
Natural Language Processing using Java
Natural Language Processing using JavaNatural Language Processing using Java
Natural Language Processing using Java
Sangameswar Venkatraman
 
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking ConversationsAsking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
Mohammad Aliannejadi
 
Final presentation
Final presentationFinal presentation
Final presentation
Nitish Upreti
 
Sound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software TestingSound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software Testing
Jaguaraci Silva
 
Application of hidden markov model in question answering systems
Application of hidden markov model in question answering systemsApplication of hidden markov model in question answering systems
Application of hidden markov model in question answering systems
ijcsa
 
A_Review_of_Question_Answering_Systems.pdf
A_Review_of_Question_Answering_Systems.pdfA_Review_of_Question_Answering_Systems.pdf
A_Review_of_Question_Answering_Systems.pdf
ssuser98a1af
 
Answer extraction and passage retrieval for
Answer extraction and passage retrieval forAnswer extraction and passage retrieval for
Answer extraction and passage retrieval for
Waheeb Ahmed
 

Similar to Answer Selection and Validation for Arabic Questions (20)

Cser13.ppt
Cser13.pptCser13.ppt
Cser13.ppt
 
Search Solutions 2015: Towards a new model of search relevance testing
Search Solutions 2015:  Towards a new model of search relevance testingSearch Solutions 2015:  Towards a new model of search relevance testing
Search Solutions 2015: Towards a new model of search relevance testing
 
Detection of SOA Antipatterns
Detection of SOA AntipatternsDetection of SOA Antipatterns
Detection of SOA Antipatterns
 
QUrdPro: Query processing system for Urdu Language
QUrdPro: Query processing system for Urdu LanguageQUrdPro: Query processing system for Urdu Language
QUrdPro: Query processing system for Urdu Language
 
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
 
Lepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metric
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
 
Development and evaluation of a web based question answering system for arabi...
Development and evaluation of a web based question answering system for arabi...Development and evaluation of a web based question answering system for arabi...
Development and evaluation of a web based question answering system for arabi...
 
Triantafyllia Voulibasi
Triantafyllia VoulibasiTriantafyllia Voulibasi
Triantafyllia Voulibasi
 
Cser13.ppt
Cser13.pptCser13.ppt
Cser13.ppt
 
Artificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software TestingArtificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software Testing
 
How to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated TestingHow to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated Testing
 
Orthogonal array testing course
Orthogonal array testing courseOrthogonal array testing course
Orthogonal array testing course
 
Natural Language Processing using Java
Natural Language Processing using JavaNatural Language Processing using Java
Natural Language Processing using Java
 
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking ConversationsAsking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
 
Final presentation
Final presentationFinal presentation
Final presentation
 
Sound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software TestingSound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software Testing
 
Application of hidden markov model in question answering systems
Application of hidden markov model in question answering systemsApplication of hidden markov model in question answering systems
Application of hidden markov model in question answering systems
 
A_Review_of_Question_Answering_Systems.pdf
A_Review_of_Question_Answering_Systems.pdfA_Review_of_Question_Answering_Systems.pdf
A_Review_of_Question_Answering_Systems.pdf
 
Answer extraction and passage retrieval for
Answer extraction and passage retrieval forAnswer extraction and passage retrieval for
Answer extraction and passage retrieval for
 

More from Ahmed Magdy Ezzeldin, MSc.

Distributed RDBMS: Challenges, Solutions & Trade-offs
Distributed RDBMS: Challenges, Solutions & Trade-offsDistributed RDBMS: Challenges, Solutions & Trade-offs
Distributed RDBMS: Challenges, Solutions & Trade-offs
Ahmed Magdy Ezzeldin, MSc.
 
Win any Interview like a Boss
Win any Interview like a BossWin any Interview like a Boss
Win any Interview like a Boss
Ahmed Magdy Ezzeldin, MSc.
 
A survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsA survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithms
Ahmed Magdy Ezzeldin, MSc.
 
GATE : General Architecture for Text Engineering
GATE : General Architecture for Text EngineeringGATE : General Architecture for Text Engineering
GATE : General Architecture for Text Engineering
Ahmed Magdy Ezzeldin, MSc.
 
Networks and Natural Language Processing
Networks and Natural Language ProcessingNetworks and Natural Language Processing
Networks and Natural Language Processing
Ahmed Magdy Ezzeldin, MSc.
 
Objective C Memory Management
Objective C Memory ManagementObjective C Memory Management
Objective C Memory Management
Ahmed Magdy Ezzeldin, MSc.
 
Object Role Modeling
Object Role ModelingObject Role Modeling
Object Role Modeling
Ahmed Magdy Ezzeldin, MSc.
 

More from Ahmed Magdy Ezzeldin, MSc. (11)

Distributed RDBMS: Challenges, Solutions & Trade-offs
Distributed RDBMS: Challenges, Solutions & Trade-offsDistributed RDBMS: Challenges, Solutions & Trade-offs
Distributed RDBMS: Challenges, Solutions & Trade-offs
 
Win any Interview like a Boss
Win any Interview like a BossWin any Interview like a Boss
Win any Interview like a Boss
 
A survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsA survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithms
 
GATE : General Architecture for Text Engineering
GATE : General Architecture for Text EngineeringGATE : General Architecture for Text Engineering
GATE : General Architecture for Text Engineering
 
Networks and Natural Language Processing
Networks and Natural Language ProcessingNetworks and Natural Language Processing
Networks and Natural Language Processing
 
Distributed Coordination-Based Systems
Distributed Coordination-Based SystemsDistributed Coordination-Based Systems
Distributed Coordination-Based Systems
 
Distributed Systems Naming
Distributed Systems NamingDistributed Systems Naming
Distributed Systems Naming
 
Cyclcone a safe dialect of C
Cyclcone a safe dialect of CCyclcone a safe dialect of C
Cyclcone a safe dialect of C
 
Objective C Memory Management
Objective C Memory ManagementObjective C Memory Management
Objective C Memory Management
 
Bash Scripting Workshop
Bash Scripting WorkshopBash Scripting Workshop
Bash Scripting Workshop
 
Object Role Modeling
Object Role ModelingObject Role Modeling
Object Role Modeling
 

Recently uploaded

UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 

Recently uploaded (20)

UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 

Answer Selection and Validation for Arabic Questions

  • 1. Answer Selection and Validation for Arabic Questions Ahmed Magdy Ezzeldin, Yasser El-Sonbaty, Mohamed Hamed Kholief College of Computing and Information Technology, AASTMT Alexandria, Egypt
  • 2. Outline ● Arabic QA and its significance ● QA pipeline ● Importance of answer selection to QA ● QA4MRE (Question Answering for Machine Reading Evaluation) ● Motivation ● Objectives ● Test-set & Evaluation metrics ● Related Works ● Tools and resources ● ALQASIM 1.0 ● ALQASIM 2.0 ● Evaluation and discussion ● Future work
  • 3. Arabic QA and its significance ● Question Answering (QA) : automatically providing an answer for a question posed by a human in a natural language ● Significance of Arabic QA – Arabic has more than 350 million native speakers – A lot of Arabic content on the Internet – High need for fast precise information – IR based approaches do not satisfy this need
  • 4. QA pipeline ● Question Analysis ● Passage Retrieval ● Answer Extraction
  • 5. Importance of Answer Selection to QA ● Error propagation in QA pipeline makes systems hit an upper bound of 60% ● Most QA systems provide 5 answer choices for each question and most of the time the answer is not in the first choice ● Systems should be certain about the questions they answer
  • 6. QA4MRE ● Question Answering for Machine Reading Evaluation ● Skips the answer generation tasks of QA (Passage Retrieval) ● Focuses only on the answer selection and validation subtasks ● A typical QA4MRE system chooses 1 answer form 5 choices supported with 1 document that contains the answer ● QA4MRE is used interchangeably with Answer Selection and Validation
  • 7. Motivation ● Create an efficient answer selection and validation module to: – Improve the performance of any Arabic QA system – Reduce the effect of error that propagates through the QA pipeline by selecting the correct answers only – Enhance the certainty of Arabic QA systems
  • 8. Objectives ● Mimic the human behavior by analyzing the reading test document instead of just analyzing the question ● Apply syntactic & semantic analysis and expansion to the reading test document and the question ● Make use of background knowledge to gain more contextual knowledge about the test document and question keywords
  • 9. Test-set ● The QA4MRE test-set is composed of: – 4 topics ("AIDS", "Climate change", "Music and Society", and "Alzheimer") – 16 test documents (4 documents per topic) – 160 questions (10 questions per document) – 800 answer choices/options (5 per question) – 4 background collections of documents (1 per topic) ● Various question types – Factoid: (where, when, by-whom) – Causal: (what was the cause/result of event X?) – Method: (how did X do Y? or in what way did X come about?) – Purpose: (why was X brought about? or what was the reason for doing X?) – Which is true: (what can a 14 year old girl do?)
  • 10. Evaluation Metrics ● Accuracy & C@1 (Correctness at 1) ● C@1 is introduced in CLEF (Conference and Labs of the Evaluation Forum) 2011 by Penas et al. ● Gives partial credit for systems that leave some questions unanswered in cases of uncertainty
  • 11. Related Works ● The 2 Arabic QA4MRE systems in CLEF 2012 - Trigui et al. 2012 - Abouenour et al. 2012 (IDRAAQ) ● Best performing QA4MRE system on the same test-set uses the English test-set - Bhaskar et al. 2012 (English QA4MRE)
  • 12. Related Works [continued] ● Abouenour et al. 2012 (IDRAAQ) – Accuracy : 0.13 – C@1 : 0.21 ● Used JIRS (Java Information Retrieval System) for passage retrieval (Distance Density N-gram Model) ● Semantic expansion using Arabic WordNet (AWN) ● Did not use the CLEF background collections
  • 13. Related Works [continued] ● Trigui et al. 2012: – Accuracy : 0.19 – C@1 : 0.19 ● Has not marked any questions as unanswered ● Retrieves the passages that have the question keywords and aligns them with the answer choices ● Finds the best answer choice in the retrieved passages ● Expands the answer choices that could not be found in the retrieved passages using some inference rules on the background collection ● Depends on the background collection as it offers enough redundancy for the passage retrieval module
  • 14. Related Works [continued] ● Bhaskar et al. 2012 (on the English test-set) – Accuracy : 0.53 – C@1 : 0.65 ● Combines each answer choice with the question in a hypothesis ● Searches for the hypothesis keywords in the document ● Uses textual entailment to rank the retrieved passages
  • 15. Related Works [continued] ● The 2 Arabic QA4MRE systems – Use the typical QA pipeline that depend mainly on passage retrieval – Do not analyze the reading test document ● The English system (Bhaskar et al.) – Analyzes the reading test document – Uses many English specific NLP tools that are not found in Arabic (at least in the public domain) like: ● semantic parsing ● semantic role labeling ● dependency analysis
  • 16. ●Tools and resources ● MADA+TOKAN (Morphological Analyzer) – PoS tags, gender, number & light stem ● Arabic WordNet (AWN) – Semantic expansion by (synonyms) ● Root Stemmers – Khoja – ISRI – Tashaphyne
  • 17. ALQASIM 1.0 ● Ezzeldin, A. M., Kholief, M. H., & El-Sonbaty, Y. (September 2013). ALQASIM: Arabic language question answer selection in machines. In Information Access Evaluation. Multilinguality, Multimodality, and Visualization (pp. 100-103). Springer Berlin Heidelberg.
  • 19.
  • 20. Comparison between ALQASIM 1.0 and other Arabic QA4MRE systems
  • 21. ALQASIM 1.0 Pros & Cons ● Pros - Analyzing the reading test document provides a better context for the morphological analyzer - Distance between the keywords and between Q & A found snippets. - Weights according to PoS tags ● Cons - Does not consider the natural boundary of sentences - Prefer distance over keyword weights - Has many manually adjusted weights and thresholds that may make the system over-fit for the test-set.
  • 22. ALQASIM 1.0 Pros & Cons [continued] ● ALQASIM 1.0 could not identify the question snippet because its keywords are a bit far from each other.
  • 23. ALQASIM 2.0 ● Ezzeldin, A. M., El-Sonbaty, Y., & Kholief, M. H. (October 2014). Exploring the Effects of Root Expansion, Sentence Splitting and Ontology on Arabic Answer Selection. In Proceedings of the 11th International Workshop on Natural Language Processing and Cognitive Science (NLPCS 2014), Venice, Italy.
  • 28. ALQASIM 2.0 [continued] A basic ontology of hypernyms and their hyponyms generated automatically from the background collection
  • 29. Example 1: Sentence Splitting
  • 30. Example 2: Ontology Expansion
  • 31. Example 3: Root Expansion
  • 32. Evaluation Description Correct Unanswered Wrong Accuracy C@1 ALQASIM 1.0 49 30 81 0.31 0.36 Run (1) Baseline 46 51 63 0.29 0.38 Run (2) Baseline + 45 46 69 0.28 0.36 AWN Semantic Expansion Run (3) Baseline + Bg Ontology Expansion 51 41 68 0.32 0.40 Run (4) Baseline + Bg Ontology Expansion + Root Expansion 57 29 74 0.36 0.42
  • 34. Evaluation [continued] Root Stemmer Accuracy C@1 ALQASIM 2.0 without root expansion 0.32 0.4 ISRI 0.36 0.42 Khoja 0.32 0.4 Tashaphyne 0.31 0.36
  • 35. Discussion ● Sentence splitting helps identify the correct and the incorrect answers ● AWN expansion degrades performance by 1 to 2% due to its generic nature ● Root expansion helps answer selection especially with highly derivational texts ● An automatically generated ontology specifically created for the test topic improves performance and can be used to boost some question patterns
  • 36. Root Stemmers ● Khoja vs ISRI on 15163 words from the test-set. Khoja ISRI Possible root (a token that is not equal to the light stem or the original word) 36 0.23% 11324 74.68 % Light stem (already generated by MADA) 3647 24.05% 3204 21.13 % Original Word 11480 75.71% 635 4.18% Tashaphyne Root Original Words English Translation ماء" ، "سوءا" ، "سواء" ، "وباء" ء " “water”, “worse”, “alike”, “epidemic” آلت" ، "آفات" آا " “machinery”, “pests” أي" ، "أتى" أ " “any”, “came”
  • 37. Future Work ● Automatically building better ontologies ● Anaphora resolution ● Semantic parsing and semantic role labeling ● Applying Arabic specific rule-based techniques
  • 38. Published Papers ● Ezzeldin, A. M., Kholief, M. H., & El-Sonbaty, Y. (September 2013). ALQASIM: Arabic language question answer selection in machines. In Information Access Evaluation. Multilinguality, Multimodality, and Visualization (pp. 100-103). Springer Berlin Heidelberg. ● Ezzeldin, A. M., El-Sonbaty, Y., & Kholief, M. H. (October 2014). Exploring the Effects of Root Expansion, Sentence Splitting and Ontology on Arabic Answer Selection. In Proceedings of the 11th International Workshop on Natural Language Processing and Cognitive Science (NLPCS 2014), Venice, Italy.