SlideShare a Scribd company logo
1 of 13
Download to read offline
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
DOI: 10.5121/ijnlc.2016.5202 13
DEVELOPMENT OF LEXICONS GENERATION TOOLS
FOR ARABIC: CASE OF AN OPEN SOURCE
CONJUGATOR
Mourchid Mohamed1
, El Faddouli Nour-eddine2
and Amali Said3
1
MIC search team, Laboratory MISC, Faculty of sciences, Ibn Tofail University Kenitra-
Morocco
2
RIME search team, LRIE laboratory, Mahammadia School of Engineers, Mohammed
Vth
University in Rabat, Morocco
3
OMEGA search team, Laboratory LERES, Faculty FSJES, Moulay Ismaïl University
Meknes-Morocco
ABSTRACT
The dictionary resources are very important for Natural Language Processing (NLP). Generating high
quality dictionary resources is a crucial step for the success and effectiveness of NLP application.
Linguistic information about lexical database is complex, large size and various (ie, phonological,
morphological, syntactic, semantic and pragmatic). Among such lexical database entries, we find
conjugated verbs. To this end, we present in this paper the open source mobile application of our
conjugator that we developed in Java platform under the Android. This Conjugator allowed us to generate
a lexicon of more than 18667 conjugated verbs. This lexicon will be used to generate textual words. The
resultant lexicon can be used in various applications such as morphological analysis (lexical approach),
text indexing, etc…
KEYWORDS
NLP, Lexical databases, conjugator, Root, Pattern, textual words, morphological analysis.
1. INTRODUCTION
In the Arabic language, each word having a meaning consists of a root and a pattern. So we can
represent all the Arabic words by a matrix in which the patterns are the columns and roots are the
rows. A word is simply an element in this matrix [1] [3] [5] [9] (see Figure. 1).
Figure 1. Matrix Root/Pattern
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
14
With this matrix, we can perform two operations [7][10]:
- Analysis: extract the pattern and the root of a given word.
- Generation: building a word from a column (pattern) and a row (root).
Figure 2 shows an example of using this matrix to extract the root and the pattern of the word
ٌ َ َ ْ َ (MaKtaba , Library) and to generate the word ٌ ِ‫ﺗ‬ َ from the root (Root KTB,
Write)and the pattern ٌ ِ َ .
Figure 2. Operations: Analysis and Generation
In this paper, we present the approach of our conjugator developed in Java in two versions. The
first is a mobile application under android, while the second is a desktop application to power a
lexicon for morphological analysis.
The principle of morphological generation is presented in the second section.
In the third section, we describe the different approach steps of our conjugator and the used
repository.
The fourth section is dedicated to description of the conjugator mobile application.
The exploitation of conjugator to enrich the lexicon of morphological analysis by generating
textual words is described in the fifth section.
In conclusion, we discuss the results and the main perspectives of our research.
2. MORPHOLOGICAL GENERATION PRINCIPLE
Morphological generation is a succession of morphological operations applied to an initial word
accompanied by a set of attributes, to produce a final form of the word.
These attributes are for :
• Names: Number (singular, dual, plural); case (nominative, accusative, gerund);
determination (defined, undefined), annexation (annexed)
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
15
• Verbs: Aspect (perfect, imperfect, imperative); voice (passive, active); Number (singular,
dual, plural); gender (male, female); person (first person, second person, third person);
case ( nominative, accusative, apocope ); insistence.
2.1. Generation Methods
There are three generation methods differentiated by their approaches [5].
2.1.1. Successive Transformations Method
This method relies on applying transformations progressively until obtaining of the final form.
These transformations are shown as follows:
Initial word + AT1====> word1
word1+ AT2 ====> word2
….
wordn-1 + ATn ====> final word
Where ATi is an attribute.
The figure 3 shows how to get final word َ‫م‬ ُ ُ ْ ‫ا‬ (the sciences) from the initial word ٌ ْ ِ (science )
using the set of attributes(Plural , Defined , Accusative).
Figure 3. The final word is َ‫م‬ ُ ُ ْ ‫ا‬ (AL-OLOMA, the sciences)
2.1.2. Method of Pre-established Models
This method is based on the use of a set of predetermined models each of which is associated
with a class of words having the same characteristics.
Thereby the word generation is done in two steps:
• Determination of suitable model from the characteristics of the original word.
• Performing a substitution operation from the initial word and the corresponding model.
The figure 4 depicts the steps followed to obtain the mould from a set attributes.
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
16
Figure 4. Obtaining the mould
2.1.3. MIXED METHOD
We apply first the pre-established model method. Then we apply the successive transformations
method on the result of the first step (See figure 5).
Figure 5. The final word is ‫ُوا‬‫ؤ‬َ ْ َ (YAQoRaAu, They read)
3. CONJUGATOR OF VERBS [5]
The method adopted for conjugation of the verb is the mixed method.
Thus, the conjugation operation of a verb proceeds in six steps:
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
17
• Determining the verb class (see Figure. 6) by consulting the lexicon t of triliteral verbs or
make a treatment based on the length of the verb and the location of certain consonants.
Figure 6. Determining of the verb class
• The adequate model is determined from the triple (class tense, time, voice). (see Figure
7).
Figure 7. Determining the model.
• The triple (person, number, gender) and the model number allow to extract the desired
mould (see Figure 8).
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
18
Figure 8. Determination of the mould
• Substitution operation consist to replace the mould numbers with the corresponding
consonants of the verb as shown in the example of table 1
Table 1. The substitution operation.
Verb Mould Substitution
َ َ َ‫د‬ (DaKaLa, Enter) َ‫ي‬1ْ2ُ3ُ ُ ُ ْ"َ (YaDoKuLu,Enter)
• Application of the transformation rules (T.R) to deal the case (Apocope, accusative,
insistence) of imperfect tense.
For each case, we define the actions to perform on the consonants and vowels of the verb. Table 2
shows an example of the transformations actions:
Table 2. Some transformations actions.
Action
number
Action Coding of actions
1 Change the last character of the vowel part by " َ◌" M
2 remove the last character of the consonant part S
3 remove the last character of the vowel part S
4 Add the character '‫ا‬' to the last position A
Table 3 is an example of application of the transformation rules.
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
19
Table 3. Application of the RT.
Rule number Transformations actions Example ( Before) Example ( After)
1 1 ُ َ ْ َ َ َ ْ َ
3 2,3 ِ‫ن‬ َ%َ ْ َ َ%َ ْ َ
4 2,3,4 َ‫ن‬ ُ%َ ْ َ ‫ا‬ ُ%َ ْ َ
• Spellchecking: This step involves performing the possible corrections on the verb, based
on the lexicon of the corresponding anomalies and corrections; an example is giv-en in
table 4.
Table 4. Anomaly and correction.
Verb anomaly Verb after correcting
ُ&َ ْ‫أ‬َ‫أ‬ َ&َ ‫آ‬
4. IMPLEMENTATION AND TESTING OF OUR CONJUGATOR
We realised our conjuguator based on the rules outlined in the previous sections. It consists of
four modules (see Figure 9).
Figure 9. Conjugator modules
• The presentation module enables communication with the user, which can be a person or
an application in which our conjugator can be integrated. This module makes the
necessary checks on the verb to conjugate and determines the pattern to apply.
• The substitution and transformation module enables to apply the transformation rules on
the pattern determined by the previous module.
• The correction module applies correction rules on the result provided by the
transformation module to obtain the final shape.
• The acquisition module and control will be used by a linguist to feed the repository by the
verbs, the models and the correction rules missing.
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
20
The repository of our conjugutor contains all relevant data: verbs, models and correction rules.
These data can be stored in several forms (relational DB, XML, Jason ...).
We implemented our conjugator, in its first version, as an Android mobile application using the
Java language for coding, the DBMS SQLite for managing the repository and XML to generate
mobile interfaces. The mobile application is open source and can be downloaded from the RIME
search team web site ( http://rime.emi.ac.ma/arabic_conjugator/conjugator.apk).
Figure 10 shows the interface that allows the user to enter the verb, choose the aspect, the voice,
and the mode of conjugation.
Figure 10. Data entry
These data will be recovered by the presentation module that will determine the model to apply.
The transformation rules for the latter will be applied by the substitution module whose result will
be processed by the correction module. The final result obtained will be communicated to the user
(see Figure 11).
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
21
Figure 11. Conjugation of the verb ' ' at the accomplied tense
The current repository of our conjugator contains:
Table 5. Statistics.
Arabic term Number
Roots 6413
Triliteral roots 5635
Roots quadrilitaires 592
Canonical schemas 199
Irreducible trilitaires verbs ‫دة‬ *( ) 6138
Reducible trilitaires verbs ( ‫"ة‬ + ) 11832
Irreducible quadrilitaires verbs ‫دة‬ *( ) 452
Reducible quadrilitaire verbs ‫"ة‬ +( ) 245
5. GENERATION OF TEXTUAL WORDS
In the previous section, we used our conjugator as core of a mobile application.
In this section, we use our conjugator to generate textual words using the lexicon of verbs noted
M0. These textual words will enrich the dictionary that can be exploited for morphological
analysis.
5.1. GENERATION OF INFLECTED FORMS
The Arabic words are built or bending. The words built have an invariant form whatever their
position in the text; this is the case of tools words (prepositions, conjunctions, etc...).
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
22
The bending words have a termination that changes depending on the case: nominative,
accusative, apocope, etc…
We try to generate the inflected forms (lexical kernel of the textual word), noted M0C, for each
verb in the lexicon M0. The principle of generation processing of inflected forms is as follows:
M0 is the set of verbs
M0 = v1, v2,...
For each m in M0
if m.VALG =’ imperfect’ then
+ Inflected_Form1 Generate_Accusative(m)
+ Inflected_Form2 Generate_ Apocope (m)
+ Add(Inflected_Form1, M0C) {to add Inflected_Form1 to the lexicon M0C}
+ Add(Inflected_Form2, M0C) {to add Inflected_Form1 to the lexicon M0C}
End if
End for
Where: VALG = Grammatical value of the verb (perfect, imperfect, imperative)
5.2. GENERATION OF TEXTUAL WORDS
The Lexicons M0, M0C represent respectively the lexical words and textual words not having
prefixes and suffixes. Arabic prefixes are sets of letters attached to the beginning of the lexical
word and written as part of it ( for example, prepositions ‘ِ‫ب‬’ ‘Bi’ ‘with’ , ‘ِ‫ل‬’ , ‘Li’, ‘for’, ‘َ‫ك‬’
‘Ka’, ‘like’ and conjunctions ‘َ‫و‬’ ‘Wa’, ‘and’ , ‘ َ‫ف‬’ ‘Fa’, ‘and’, the determiner '‫ال‬,' '| aL', 'the’ ) ,
while suffixes are sets of letters and pronouns attached to the end of the word and written as part
of ( for example, the object pronoun ‘ ‫ھ‬‘ ‘hum‘ , ‘them’) [6].
5.2.1. CONCEPT OF MICRO-SYNTAX [2][4] [8]
We have defined a micro-syntax that represents the set of rules for arranging correctly the
attachment of prefixes and suffixes to a word of the lexicons M0 or M0C to generate a textual
word.
Formally, we can write: Textual word = + FS + with:
Pi: is a prefix.
FS: is a simple form that can be a lexical word owned at M0 or M0C.
Si: is a suffix.
The attachment of prefixes and suffixes to words presents four major problems:
1) The sequences of prefixes ( ) and suffixes ( ) are not arbitrary.
Example: the sequence ‘‫ل‬ 1‫و‬’ (‘WaBi|aL’, ‘and with the ’) is valid while ‘‫ال‬ 1’
(‘BiWa|aL’) is not.
2) Some prefixes or suffixes can be attached only to specified simple forms; for example,
the prefix "‫س‬" (Sa, will) can be attached to verbal forms at the imperfect, while the prefix
"‫ال‬" (| aL, the) can be only attached to nominal forms.
3) Some prefixes cannot be glued to a given word, at the same time as certain suffix; for
example, we can say "" ‫ا‬" ( |aLWaLaDu, the boy ) or "‫"ه‬ ‫و‬" ( WaLaDuHu, his son ) but
we can’t say "‫"ه‬ ‫ا‬"( |aLWaLaDuHu).
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
23
4) The attachement of certain prefixes or suffixes to a FS is not limited to a simple
concatenation, it implies in many cases a change in the FS termination.
Example:
5.2.2. GENERATION ALGORITHM OF TEXTUAL WORDS
Let P be the set of simple prefixes and compounds: P = {P1, P2 ...}.
The constitution of these sequences is done manually from any particles that exist in Arabic
(preposition, conjunctions ...).
Analogously to the prefixes, we build the set of suffixes: S= {S1, S2 …}.
We define valid combinations between those elements of P and S. It constitutes thus the Pi
couples, Sj accompanied by a motion to indicate for which categories of words could be apply
(name, verb, etc. ...). This gives what we call a text morphological generation rule (TMR):
Pi , Sj , Motion
Motion which may be of type:
Noun: all word of type noun, adjective are compatible with this Pi couple Sj.
Verb: perfective, imperfective or imperative are compatible with this Pi couple Sj.
Verb imperfective: Only words with the unaccomplished are compatible with this Pi couple Sj.
Examples :
‘‫وس‬’ ( WaSa, and will) , ‘ ‫ھ‬’(Hum, them) , imperfective
‘‫وب‬’, (Wabi, and with) , ‘ ‫ھ‬’(Him, them) , Noun
In practice, items are handled by classes micro-syntactic. For example, the two previous
generation rules will be represented by:
[Wabi] , [Him] , Noun
[WaSa] , [Hum] , imperfect
The proposed generation algorithm of textual words composing lexicon noted M1 is:
M=M0 ∪ M0C is the set of words to process
Let R be the lexicon rules of textual morphological generation: R= {r1,r2,….}
For each m in M
For each r in R
textual_word Generate_textual_word(m, r) { apply the rule r to the word m }
Add(textual_word, M1) {to add textual_word to the lexicon M1}
End for
End for
‫ى‬َ ْ َmeaning)(+ُ‫ه‬‫اه‬َ ْ َhis meaning)(
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
24
6. CONCLUSIONS
In this paper, we presented the architecture of a system of conjugation of Arab verbs. It operates
according to a five-step process: determining the class of verb, determining the model, the
substitution operation, applying transformation rules, and spells correcting. The results of the
performed tests are very satisfactory.
We used the conjugator to enrich the lexical database by the verbs for morphological analysis
using the dictionary-based approach. We proposed also a micro-syntax and an algorithm to
generate the textual words to enrich the same lexical database.
We intend to use our conjugator as the core of a learning environment for Arabic and especially
conjugation for kids and non-native speakers.
To complete the necessary dictionary for morphological analysis, we intend to generate the
textual words for nouns.
REFERENCES
[1] Ali Nabil (1988) “Arabic Language and Computer”, [in Arabic], Ta’areeb.
[2] Hlal Yahya, (1979) “Learning Methods for Morphosyntactic Analysis (Experienced in the case of
Arabic and French)”, thesis doctoral degree, University Paris.
[3] Hlal Yahya, (1987) “Generation from the root and pattern”, Conference on the progress of linguistics
in Arab countries.
[4] Hlal Yahya, (1990) “Morphology and syntax of the Arabic language”, In Proceedings of the Arab
School of Science and Technology Applied Arabic Linguistics for Informatics (pp.201).
[5] Mourchid Mohamed, (1999) “Generation Morphological and Applications”, Specialty thesis of 3rd
round, Mohammed V University in Rabat-Morocco.
[6] Nizar Y. Habash, (2010) Introduction to Arabic Natural Language Processing, Morgan & Claypool
Publishers series.
[7] Abdelhadi Soudi, Antal van den BoschNizar, & G¨unter Neumann (2007) Arabic Computational
Morphology, Knowledge-based and Empirical Methods, Springer
[8] Joseph Dichy, Ali Farghaly, (2003) “Roots & Patterns vs. Stems plus Grammar-Lexis Specifications:
on what basis should a multilingual lexical database centred on Arabic be built?” Workshop on
Machine Translation for Semitic Languages: issues and approaches – New Orleans, USA.
[9] Riyad Al-Shalabi, (2005) “Pattern-based Stemmer for Finding Arabic Roots”, Information
Technology Journal, 4(1): p. 38-43.
[10] A.Yousfi, (2010) “The morphological analysis of Arabic verbs by using the surface patterns”, IJCSI
International Journal of Computer Science Issues,7(3(11)): p. 33-36.
International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016
25
AUTHORS
M. MOURCHID Doctorate degree in Computer Science in 1999; Assistant Professor
at the Computer Science Department at the Faculty of sciences, Ibn Tofail University
in Kenitra Morocco; Ongoing research interests: Natural Language Processing, Web
Semantic, and information systems.
N. El Faddouli Doctorate degree in Computer Science in 1999; Assistant Professor at
the Computer Science Department at the Mohammadia School of Engineers (EMI); 15
recent publications papers between 2007 and 2015; Ongoing research interests: e-
learning, Big Da ta, Natural Language Processing, and information systems. Language
Processing, and information systems
S. AMALI Doctorate degree in Computer Science in 1999; Assistant Professor at the
Computer Science Department at the Moulay Ismaïl University; Ongoing research
interests: e-learning, Natural Language Processing, and information systems.

More Related Content

Viewers also liked

Implementation of the Open Source Virtualization Technologies in Cloud Computing
Implementation of the Open Source Virtualization Technologies in Cloud ComputingImplementation of the Open Source Virtualization Technologies in Cloud Computing
Implementation of the Open Source Virtualization Technologies in Cloud Computingijccsa
 
Machine Learning Techniques with Ontology for Subjective Answer Evaluation
Machine Learning Techniques with Ontology for Subjective Answer EvaluationMachine Learning Techniques with Ontology for Subjective Answer Evaluation
Machine Learning Techniques with Ontology for Subjective Answer Evaluationijnlc
 
An Innovative Moving Object Detection and Tracking System by Using Modified R...
An Innovative Moving Object Detection and Tracking System by Using Modified R...An Innovative Moving Object Detection and Tracking System by Using Modified R...
An Innovative Moving Object Detection and Tracking System by Using Modified R...sipij
 
A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...
A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...
A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...ijfls
 
Routing in All-Optical Networks Using Recursive State Space Technique
Routing in All-Optical Networks Using Recursive State Space TechniqueRouting in All-Optical Networks Using Recursive State Space Technique
Routing in All-Optical Networks Using Recursive State Space Techniquesipij
 
Design and Development of Arm-Based Control System for Nursing Bed
Design and Development of Arm-Based Control System for Nursing Bed Design and Development of Arm-Based Control System for Nursing Bed
Design and Development of Arm-Based Control System for Nursing Bed IJCSES Journal
 
Effective Road Model for Congestion Control in VANETS
Effective Road Model for Congestion Control in VANETSEffective Road Model for Congestion Control in VANETS
Effective Road Model for Congestion Control in VANETSijwmn
 
Analyse Performance of Fractional Fourier Transform on Timing and Carrier Fr...
 Analyse Performance of Fractional Fourier Transform on Timing and Carrier Fr... Analyse Performance of Fractional Fourier Transform on Timing and Carrier Fr...
Analyse Performance of Fractional Fourier Transform on Timing and Carrier Fr...ijwmn
 
On the Covering Radius of Codes Over Z6
On the Covering Radius of Codes Over Z6 On the Covering Radius of Codes Over Z6
On the Covering Radius of Codes Over Z6 ijitjournal
 
Efficient Reversible Data Hiding Algorithms Based on Dual Prediction
Efficient Reversible Data Hiding Algorithms Based on Dual PredictionEfficient Reversible Data Hiding Algorithms Based on Dual Prediction
Efficient Reversible Data Hiding Algorithms Based on Dual Predictionsipij
 
Implementation of Fuzzy Controlled Photo Voltaic Fed Dynamic Voltage Restorer...
Implementation of Fuzzy Controlled Photo Voltaic Fed Dynamic Voltage Restorer...Implementation of Fuzzy Controlled Photo Voltaic Fed Dynamic Voltage Restorer...
Implementation of Fuzzy Controlled Photo Voltaic Fed Dynamic Voltage Restorer...ijfls
 
Dynamic K-Means Algorithm for Optimized Routing in Mobile Ad Hoc Networks
Dynamic K-Means Algorithm for Optimized Routing in Mobile Ad Hoc Networks Dynamic K-Means Algorithm for Optimized Routing in Mobile Ad Hoc Networks
Dynamic K-Means Algorithm for Optimized Routing in Mobile Ad Hoc Networks IJCSES Journal
 
A Survey on Object Detection Methods in Visual Sensor Networks
A Survey on Object Detection Methods in Visual Sensor Networks A Survey on Object Detection Methods in Visual Sensor Networks
A Survey on Object Detection Methods in Visual Sensor Networks ijassn
 
Fuzzy Clustering Based Segmentation of Vertebrae in T1-Weighted Spinal MR Images
Fuzzy Clustering Based Segmentation of Vertebrae in T1-Weighted Spinal MR ImagesFuzzy Clustering Based Segmentation of Vertebrae in T1-Weighted Spinal MR Images
Fuzzy Clustering Based Segmentation of Vertebrae in T1-Weighted Spinal MR Imagesijfls
 
Design and Implementation a New Energy Efficient Clustering Algorithm Using t...
Design and Implementation a New Energy Efficient Clustering Algorithm Using t...Design and Implementation a New Energy Efficient Clustering Algorithm Using t...
Design and Implementation a New Energy Efficient Clustering Algorithm Using t...ijmnct
 
Optimal Alternative Selection Using MOORA in Industrial Sector - A
Optimal Alternative Selection Using MOORA in Industrial Sector - A  Optimal Alternative Selection Using MOORA in Industrial Sector - A
Optimal Alternative Selection Using MOORA in Industrial Sector - A ijfls
 
Software Testing Using Genetic Algorithms
Software Testing Using Genetic AlgorithmsSoftware Testing Using Genetic Algorithms
Software Testing Using Genetic AlgorithmsIJCSES Journal
 
Comparative Design of Regular Structured Modified Booth Multiplier
Comparative Design of Regular Structured Modified Booth MultiplierComparative Design of Regular Structured Modified Booth Multiplier
Comparative Design of Regular Structured Modified Booth MultiplierVLSICS Design
 
Study and Analysis of Multidimensional Hilbert Space Filling Curve and Its Ap...
Study and Analysis of Multidimensional Hilbert Space Filling Curve and Its Ap...Study and Analysis of Multidimensional Hilbert Space Filling Curve and Its Ap...
Study and Analysis of Multidimensional Hilbert Space Filling Curve and Its Ap...ijcseit
 
Optimized Biometric System Based on Combination of Face Images and Log Transf...
Optimized Biometric System Based on Combination of Face Images and Log Transf...Optimized Biometric System Based on Combination of Face Images and Log Transf...
Optimized Biometric System Based on Combination of Face Images and Log Transf...sipij
 

Viewers also liked (20)

Implementation of the Open Source Virtualization Technologies in Cloud Computing
Implementation of the Open Source Virtualization Technologies in Cloud ComputingImplementation of the Open Source Virtualization Technologies in Cloud Computing
Implementation of the Open Source Virtualization Technologies in Cloud Computing
 
Machine Learning Techniques with Ontology for Subjective Answer Evaluation
Machine Learning Techniques with Ontology for Subjective Answer EvaluationMachine Learning Techniques with Ontology for Subjective Answer Evaluation
Machine Learning Techniques with Ontology for Subjective Answer Evaluation
 
An Innovative Moving Object Detection and Tracking System by Using Modified R...
An Innovative Moving Object Detection and Tracking System by Using Modified R...An Innovative Moving Object Detection and Tracking System by Using Modified R...
An Innovative Moving Object Detection and Tracking System by Using Modified R...
 
A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...
A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...
A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...
 
Routing in All-Optical Networks Using Recursive State Space Technique
Routing in All-Optical Networks Using Recursive State Space TechniqueRouting in All-Optical Networks Using Recursive State Space Technique
Routing in All-Optical Networks Using Recursive State Space Technique
 
Design and Development of Arm-Based Control System for Nursing Bed
Design and Development of Arm-Based Control System for Nursing Bed Design and Development of Arm-Based Control System for Nursing Bed
Design and Development of Arm-Based Control System for Nursing Bed
 
Effective Road Model for Congestion Control in VANETS
Effective Road Model for Congestion Control in VANETSEffective Road Model for Congestion Control in VANETS
Effective Road Model for Congestion Control in VANETS
 
Analyse Performance of Fractional Fourier Transform on Timing and Carrier Fr...
 Analyse Performance of Fractional Fourier Transform on Timing and Carrier Fr... Analyse Performance of Fractional Fourier Transform on Timing and Carrier Fr...
Analyse Performance of Fractional Fourier Transform on Timing and Carrier Fr...
 
On the Covering Radius of Codes Over Z6
On the Covering Radius of Codes Over Z6 On the Covering Radius of Codes Over Z6
On the Covering Radius of Codes Over Z6
 
Efficient Reversible Data Hiding Algorithms Based on Dual Prediction
Efficient Reversible Data Hiding Algorithms Based on Dual PredictionEfficient Reversible Data Hiding Algorithms Based on Dual Prediction
Efficient Reversible Data Hiding Algorithms Based on Dual Prediction
 
Implementation of Fuzzy Controlled Photo Voltaic Fed Dynamic Voltage Restorer...
Implementation of Fuzzy Controlled Photo Voltaic Fed Dynamic Voltage Restorer...Implementation of Fuzzy Controlled Photo Voltaic Fed Dynamic Voltage Restorer...
Implementation of Fuzzy Controlled Photo Voltaic Fed Dynamic Voltage Restorer...
 
Dynamic K-Means Algorithm for Optimized Routing in Mobile Ad Hoc Networks
Dynamic K-Means Algorithm for Optimized Routing in Mobile Ad Hoc Networks Dynamic K-Means Algorithm for Optimized Routing in Mobile Ad Hoc Networks
Dynamic K-Means Algorithm for Optimized Routing in Mobile Ad Hoc Networks
 
A Survey on Object Detection Methods in Visual Sensor Networks
A Survey on Object Detection Methods in Visual Sensor Networks A Survey on Object Detection Methods in Visual Sensor Networks
A Survey on Object Detection Methods in Visual Sensor Networks
 
Fuzzy Clustering Based Segmentation of Vertebrae in T1-Weighted Spinal MR Images
Fuzzy Clustering Based Segmentation of Vertebrae in T1-Weighted Spinal MR ImagesFuzzy Clustering Based Segmentation of Vertebrae in T1-Weighted Spinal MR Images
Fuzzy Clustering Based Segmentation of Vertebrae in T1-Weighted Spinal MR Images
 
Design and Implementation a New Energy Efficient Clustering Algorithm Using t...
Design and Implementation a New Energy Efficient Clustering Algorithm Using t...Design and Implementation a New Energy Efficient Clustering Algorithm Using t...
Design and Implementation a New Energy Efficient Clustering Algorithm Using t...
 
Optimal Alternative Selection Using MOORA in Industrial Sector - A
Optimal Alternative Selection Using MOORA in Industrial Sector - A  Optimal Alternative Selection Using MOORA in Industrial Sector - A
Optimal Alternative Selection Using MOORA in Industrial Sector - A
 
Software Testing Using Genetic Algorithms
Software Testing Using Genetic AlgorithmsSoftware Testing Using Genetic Algorithms
Software Testing Using Genetic Algorithms
 
Comparative Design of Regular Structured Modified Booth Multiplier
Comparative Design of Regular Structured Modified Booth MultiplierComparative Design of Regular Structured Modified Booth Multiplier
Comparative Design of Regular Structured Modified Booth Multiplier
 
Study and Analysis of Multidimensional Hilbert Space Filling Curve and Its Ap...
Study and Analysis of Multidimensional Hilbert Space Filling Curve and Its Ap...Study and Analysis of Multidimensional Hilbert Space Filling Curve and Its Ap...
Study and Analysis of Multidimensional Hilbert Space Filling Curve and Its Ap...
 
Optimized Biometric System Based on Combination of Face Images and Log Transf...
Optimized Biometric System Based on Combination of Face Images and Log Transf...Optimized Biometric System Based on Combination of Face Images and Log Transf...
Optimized Biometric System Based on Combination of Face Images and Log Transf...
 

Similar to Development of Lexicons Generation Tools for Arabic: Case of an Open Source Conjugator

Phonetic distance based accent
Phonetic distance based accentPhonetic distance based accent
Phonetic distance based accentsipij
 
An expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicAn expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicijnlc
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARijcseit
 
5215ijcseit01
5215ijcseit015215ijcseit01
5215ijcseit01ijcsit
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARijcseit
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARijcseit
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARijcseit
 
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKSENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKijnlc
 
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural NetworkSentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Networkkevig
 
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Waqas Tariq
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKijistjournal
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKijistjournal
 
A Novel approach for Document Clustering using Concept Extraction
A Novel approach for Document Clustering using Concept ExtractionA Novel approach for Document Clustering using Concept Extraction
A Novel approach for Document Clustering using Concept ExtractionAM Publications
 
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...mlaij
 
Acceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation SystemAcceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation SystemMichele Thomas
 
A hybrid composite features based sentence level sentiment analyzer
A hybrid composite features based sentence level sentiment analyzerA hybrid composite features based sentence level sentiment analyzer
A hybrid composite features based sentence level sentiment analyzerIAESIJAI
 
Natural Language Interface for Java Programming: Survey
Natural Language Interface for Java Programming: SurveyNatural Language Interface for Java Programming: Survey
Natural Language Interface for Java Programming: Surveyrahulmonikasharma
 
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...ijaia
 

Similar to Development of Lexicons Generation Tools for Arabic: Case of an Open Source Conjugator (20)

Phonetic distance based accent
Phonetic distance based accentPhonetic distance based accent
Phonetic distance based accent
 
An expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicAn expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabic
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
 
5215ijcseit01
5215ijcseit015215ijcseit01
5215ijcseit01
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
 
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKSENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
 
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural NetworkSentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
 
50120140505010
5012014050501050120140505010
50120140505010
 
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
 
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTKPUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
PUNJABI SPEECH SYNTHESIS SYSTEM USING HTK
 
A Novel approach for Document Clustering using Concept Extraction
A Novel approach for Document Clustering using Concept ExtractionA Novel approach for Document Clustering using Concept Extraction
A Novel approach for Document Clustering using Concept Extraction
 
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
 
Acceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation SystemAcceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation System
 
A hybrid composite features based sentence level sentiment analyzer
A hybrid composite features based sentence level sentiment analyzerA hybrid composite features based sentence level sentiment analyzer
A hybrid composite features based sentence level sentiment analyzer
 
Natural Language Interface for Java Programming: Survey
Natural Language Interface for Java Programming: SurveyNatural Language Interface for Java Programming: Survey
Natural Language Interface for Java Programming: Survey
 
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
 
D111823
D111823D111823
D111823
 

Recently uploaded

Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaOmar Fathy
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.Kamal Acharya
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdfKamal Acharya
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfRagavanV2
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf203318pmpc
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfRagavanV2
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringmulugeta48
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 

Recently uploaded (20)

Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 

Development of Lexicons Generation Tools for Arabic: Case of an Open Source Conjugator

  • 1. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 DOI: 10.5121/ijnlc.2016.5202 13 DEVELOPMENT OF LEXICONS GENERATION TOOLS FOR ARABIC: CASE OF AN OPEN SOURCE CONJUGATOR Mourchid Mohamed1 , El Faddouli Nour-eddine2 and Amali Said3 1 MIC search team, Laboratory MISC, Faculty of sciences, Ibn Tofail University Kenitra- Morocco 2 RIME search team, LRIE laboratory, Mahammadia School of Engineers, Mohammed Vth University in Rabat, Morocco 3 OMEGA search team, Laboratory LERES, Faculty FSJES, Moulay Ismaïl University Meknes-Morocco ABSTRACT The dictionary resources are very important for Natural Language Processing (NLP). Generating high quality dictionary resources is a crucial step for the success and effectiveness of NLP application. Linguistic information about lexical database is complex, large size and various (ie, phonological, morphological, syntactic, semantic and pragmatic). Among such lexical database entries, we find conjugated verbs. To this end, we present in this paper the open source mobile application of our conjugator that we developed in Java platform under the Android. This Conjugator allowed us to generate a lexicon of more than 18667 conjugated verbs. This lexicon will be used to generate textual words. The resultant lexicon can be used in various applications such as morphological analysis (lexical approach), text indexing, etc… KEYWORDS NLP, Lexical databases, conjugator, Root, Pattern, textual words, morphological analysis. 1. INTRODUCTION In the Arabic language, each word having a meaning consists of a root and a pattern. So we can represent all the Arabic words by a matrix in which the patterns are the columns and roots are the rows. A word is simply an element in this matrix [1] [3] [5] [9] (see Figure. 1). Figure 1. Matrix Root/Pattern
  • 2. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 14 With this matrix, we can perform two operations [7][10]: - Analysis: extract the pattern and the root of a given word. - Generation: building a word from a column (pattern) and a row (root). Figure 2 shows an example of using this matrix to extract the root and the pattern of the word ٌ َ َ ْ َ (MaKtaba , Library) and to generate the word ٌ ِ‫ﺗ‬ َ from the root (Root KTB, Write)and the pattern ٌ ِ َ . Figure 2. Operations: Analysis and Generation In this paper, we present the approach of our conjugator developed in Java in two versions. The first is a mobile application under android, while the second is a desktop application to power a lexicon for morphological analysis. The principle of morphological generation is presented in the second section. In the third section, we describe the different approach steps of our conjugator and the used repository. The fourth section is dedicated to description of the conjugator mobile application. The exploitation of conjugator to enrich the lexicon of morphological analysis by generating textual words is described in the fifth section. In conclusion, we discuss the results and the main perspectives of our research. 2. MORPHOLOGICAL GENERATION PRINCIPLE Morphological generation is a succession of morphological operations applied to an initial word accompanied by a set of attributes, to produce a final form of the word. These attributes are for : • Names: Number (singular, dual, plural); case (nominative, accusative, gerund); determination (defined, undefined), annexation (annexed)
  • 3. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 15 • Verbs: Aspect (perfect, imperfect, imperative); voice (passive, active); Number (singular, dual, plural); gender (male, female); person (first person, second person, third person); case ( nominative, accusative, apocope ); insistence. 2.1. Generation Methods There are three generation methods differentiated by their approaches [5]. 2.1.1. Successive Transformations Method This method relies on applying transformations progressively until obtaining of the final form. These transformations are shown as follows: Initial word + AT1====> word1 word1+ AT2 ====> word2 …. wordn-1 + ATn ====> final word Where ATi is an attribute. The figure 3 shows how to get final word َ‫م‬ ُ ُ ْ ‫ا‬ (the sciences) from the initial word ٌ ْ ِ (science ) using the set of attributes(Plural , Defined , Accusative). Figure 3. The final word is َ‫م‬ ُ ُ ْ ‫ا‬ (AL-OLOMA, the sciences) 2.1.2. Method of Pre-established Models This method is based on the use of a set of predetermined models each of which is associated with a class of words having the same characteristics. Thereby the word generation is done in two steps: • Determination of suitable model from the characteristics of the original word. • Performing a substitution operation from the initial word and the corresponding model. The figure 4 depicts the steps followed to obtain the mould from a set attributes.
  • 4. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 16 Figure 4. Obtaining the mould 2.1.3. MIXED METHOD We apply first the pre-established model method. Then we apply the successive transformations method on the result of the first step (See figure 5). Figure 5. The final word is ‫ُوا‬‫ؤ‬َ ْ َ (YAQoRaAu, They read) 3. CONJUGATOR OF VERBS [5] The method adopted for conjugation of the verb is the mixed method. Thus, the conjugation operation of a verb proceeds in six steps:
  • 5. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 17 • Determining the verb class (see Figure. 6) by consulting the lexicon t of triliteral verbs or make a treatment based on the length of the verb and the location of certain consonants. Figure 6. Determining of the verb class • The adequate model is determined from the triple (class tense, time, voice). (see Figure 7). Figure 7. Determining the model. • The triple (person, number, gender) and the model number allow to extract the desired mould (see Figure 8).
  • 6. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 18 Figure 8. Determination of the mould • Substitution operation consist to replace the mould numbers with the corresponding consonants of the verb as shown in the example of table 1 Table 1. The substitution operation. Verb Mould Substitution َ َ َ‫د‬ (DaKaLa, Enter) َ‫ي‬1ْ2ُ3ُ ُ ُ ْ"َ (YaDoKuLu,Enter) • Application of the transformation rules (T.R) to deal the case (Apocope, accusative, insistence) of imperfect tense. For each case, we define the actions to perform on the consonants and vowels of the verb. Table 2 shows an example of the transformations actions: Table 2. Some transformations actions. Action number Action Coding of actions 1 Change the last character of the vowel part by " َ◌" M 2 remove the last character of the consonant part S 3 remove the last character of the vowel part S 4 Add the character '‫ا‬' to the last position A Table 3 is an example of application of the transformation rules.
  • 7. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 19 Table 3. Application of the RT. Rule number Transformations actions Example ( Before) Example ( After) 1 1 ُ َ ْ َ َ َ ْ َ 3 2,3 ِ‫ن‬ َ%َ ْ َ َ%َ ْ َ 4 2,3,4 َ‫ن‬ ُ%َ ْ َ ‫ا‬ ُ%َ ْ َ • Spellchecking: This step involves performing the possible corrections on the verb, based on the lexicon of the corresponding anomalies and corrections; an example is giv-en in table 4. Table 4. Anomaly and correction. Verb anomaly Verb after correcting ُ&َ ْ‫أ‬َ‫أ‬ َ&َ ‫آ‬ 4. IMPLEMENTATION AND TESTING OF OUR CONJUGATOR We realised our conjuguator based on the rules outlined in the previous sections. It consists of four modules (see Figure 9). Figure 9. Conjugator modules • The presentation module enables communication with the user, which can be a person or an application in which our conjugator can be integrated. This module makes the necessary checks on the verb to conjugate and determines the pattern to apply. • The substitution and transformation module enables to apply the transformation rules on the pattern determined by the previous module. • The correction module applies correction rules on the result provided by the transformation module to obtain the final shape. • The acquisition module and control will be used by a linguist to feed the repository by the verbs, the models and the correction rules missing.
  • 8. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 20 The repository of our conjugutor contains all relevant data: verbs, models and correction rules. These data can be stored in several forms (relational DB, XML, Jason ...). We implemented our conjugator, in its first version, as an Android mobile application using the Java language for coding, the DBMS SQLite for managing the repository and XML to generate mobile interfaces. The mobile application is open source and can be downloaded from the RIME search team web site ( http://rime.emi.ac.ma/arabic_conjugator/conjugator.apk). Figure 10 shows the interface that allows the user to enter the verb, choose the aspect, the voice, and the mode of conjugation. Figure 10. Data entry These data will be recovered by the presentation module that will determine the model to apply. The transformation rules for the latter will be applied by the substitution module whose result will be processed by the correction module. The final result obtained will be communicated to the user (see Figure 11).
  • 9. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 21 Figure 11. Conjugation of the verb ' ' at the accomplied tense The current repository of our conjugator contains: Table 5. Statistics. Arabic term Number Roots 6413 Triliteral roots 5635 Roots quadrilitaires 592 Canonical schemas 199 Irreducible trilitaires verbs ‫دة‬ *( ) 6138 Reducible trilitaires verbs ( ‫"ة‬ + ) 11832 Irreducible quadrilitaires verbs ‫دة‬ *( ) 452 Reducible quadrilitaire verbs ‫"ة‬ +( ) 245 5. GENERATION OF TEXTUAL WORDS In the previous section, we used our conjugator as core of a mobile application. In this section, we use our conjugator to generate textual words using the lexicon of verbs noted M0. These textual words will enrich the dictionary that can be exploited for morphological analysis. 5.1. GENERATION OF INFLECTED FORMS The Arabic words are built or bending. The words built have an invariant form whatever their position in the text; this is the case of tools words (prepositions, conjunctions, etc...).
  • 10. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 22 The bending words have a termination that changes depending on the case: nominative, accusative, apocope, etc… We try to generate the inflected forms (lexical kernel of the textual word), noted M0C, for each verb in the lexicon M0. The principle of generation processing of inflected forms is as follows: M0 is the set of verbs M0 = v1, v2,... For each m in M0 if m.VALG =’ imperfect’ then + Inflected_Form1 Generate_Accusative(m) + Inflected_Form2 Generate_ Apocope (m) + Add(Inflected_Form1, M0C) {to add Inflected_Form1 to the lexicon M0C} + Add(Inflected_Form2, M0C) {to add Inflected_Form1 to the lexicon M0C} End if End for Where: VALG = Grammatical value of the verb (perfect, imperfect, imperative) 5.2. GENERATION OF TEXTUAL WORDS The Lexicons M0, M0C represent respectively the lexical words and textual words not having prefixes and suffixes. Arabic prefixes are sets of letters attached to the beginning of the lexical word and written as part of it ( for example, prepositions ‘ِ‫ب‬’ ‘Bi’ ‘with’ , ‘ِ‫ل‬’ , ‘Li’, ‘for’, ‘َ‫ك‬’ ‘Ka’, ‘like’ and conjunctions ‘َ‫و‬’ ‘Wa’, ‘and’ , ‘ َ‫ف‬’ ‘Fa’, ‘and’, the determiner '‫ال‬,' '| aL', 'the’ ) , while suffixes are sets of letters and pronouns attached to the end of the word and written as part of ( for example, the object pronoun ‘ ‫ھ‬‘ ‘hum‘ , ‘them’) [6]. 5.2.1. CONCEPT OF MICRO-SYNTAX [2][4] [8] We have defined a micro-syntax that represents the set of rules for arranging correctly the attachment of prefixes and suffixes to a word of the lexicons M0 or M0C to generate a textual word. Formally, we can write: Textual word = + FS + with: Pi: is a prefix. FS: is a simple form that can be a lexical word owned at M0 or M0C. Si: is a suffix. The attachment of prefixes and suffixes to words presents four major problems: 1) The sequences of prefixes ( ) and suffixes ( ) are not arbitrary. Example: the sequence ‘‫ل‬ 1‫و‬’ (‘WaBi|aL’, ‘and with the ’) is valid while ‘‫ال‬ 1’ (‘BiWa|aL’) is not. 2) Some prefixes or suffixes can be attached only to specified simple forms; for example, the prefix "‫س‬" (Sa, will) can be attached to verbal forms at the imperfect, while the prefix "‫ال‬" (| aL, the) can be only attached to nominal forms. 3) Some prefixes cannot be glued to a given word, at the same time as certain suffix; for example, we can say "" ‫ا‬" ( |aLWaLaDu, the boy ) or "‫"ه‬ ‫و‬" ( WaLaDuHu, his son ) but we can’t say "‫"ه‬ ‫ا‬"( |aLWaLaDuHu).
  • 11. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 23 4) The attachement of certain prefixes or suffixes to a FS is not limited to a simple concatenation, it implies in many cases a change in the FS termination. Example: 5.2.2. GENERATION ALGORITHM OF TEXTUAL WORDS Let P be the set of simple prefixes and compounds: P = {P1, P2 ...}. The constitution of these sequences is done manually from any particles that exist in Arabic (preposition, conjunctions ...). Analogously to the prefixes, we build the set of suffixes: S= {S1, S2 …}. We define valid combinations between those elements of P and S. It constitutes thus the Pi couples, Sj accompanied by a motion to indicate for which categories of words could be apply (name, verb, etc. ...). This gives what we call a text morphological generation rule (TMR): Pi , Sj , Motion Motion which may be of type: Noun: all word of type noun, adjective are compatible with this Pi couple Sj. Verb: perfective, imperfective or imperative are compatible with this Pi couple Sj. Verb imperfective: Only words with the unaccomplished are compatible with this Pi couple Sj. Examples : ‘‫وس‬’ ( WaSa, and will) , ‘ ‫ھ‬’(Hum, them) , imperfective ‘‫وب‬’, (Wabi, and with) , ‘ ‫ھ‬’(Him, them) , Noun In practice, items are handled by classes micro-syntactic. For example, the two previous generation rules will be represented by: [Wabi] , [Him] , Noun [WaSa] , [Hum] , imperfect The proposed generation algorithm of textual words composing lexicon noted M1 is: M=M0 ∪ M0C is the set of words to process Let R be the lexicon rules of textual morphological generation: R= {r1,r2,….} For each m in M For each r in R textual_word Generate_textual_word(m, r) { apply the rule r to the word m } Add(textual_word, M1) {to add textual_word to the lexicon M1} End for End for ‫ى‬َ ْ َmeaning)(+ُ‫ه‬‫اه‬َ ْ َhis meaning)(
  • 12. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 24 6. CONCLUSIONS In this paper, we presented the architecture of a system of conjugation of Arab verbs. It operates according to a five-step process: determining the class of verb, determining the model, the substitution operation, applying transformation rules, and spells correcting. The results of the performed tests are very satisfactory. We used the conjugator to enrich the lexical database by the verbs for morphological analysis using the dictionary-based approach. We proposed also a micro-syntax and an algorithm to generate the textual words to enrich the same lexical database. We intend to use our conjugator as the core of a learning environment for Arabic and especially conjugation for kids and non-native speakers. To complete the necessary dictionary for morphological analysis, we intend to generate the textual words for nouns. REFERENCES [1] Ali Nabil (1988) “Arabic Language and Computer”, [in Arabic], Ta’areeb. [2] Hlal Yahya, (1979) “Learning Methods for Morphosyntactic Analysis (Experienced in the case of Arabic and French)”, thesis doctoral degree, University Paris. [3] Hlal Yahya, (1987) “Generation from the root and pattern”, Conference on the progress of linguistics in Arab countries. [4] Hlal Yahya, (1990) “Morphology and syntax of the Arabic language”, In Proceedings of the Arab School of Science and Technology Applied Arabic Linguistics for Informatics (pp.201). [5] Mourchid Mohamed, (1999) “Generation Morphological and Applications”, Specialty thesis of 3rd round, Mohammed V University in Rabat-Morocco. [6] Nizar Y. Habash, (2010) Introduction to Arabic Natural Language Processing, Morgan & Claypool Publishers series. [7] Abdelhadi Soudi, Antal van den BoschNizar, & G¨unter Neumann (2007) Arabic Computational Morphology, Knowledge-based and Empirical Methods, Springer [8] Joseph Dichy, Ali Farghaly, (2003) “Roots & Patterns vs. Stems plus Grammar-Lexis Specifications: on what basis should a multilingual lexical database centred on Arabic be built?” Workshop on Machine Translation for Semitic Languages: issues and approaches – New Orleans, USA. [9] Riyad Al-Shalabi, (2005) “Pattern-based Stemmer for Finding Arabic Roots”, Information Technology Journal, 4(1): p. 38-43. [10] A.Yousfi, (2010) “The morphological analysis of Arabic verbs by using the surface patterns”, IJCSI International Journal of Computer Science Issues,7(3(11)): p. 33-36.
  • 13. International Journal on Natural Language Computing (IJNLC) Vol. 5, No.2, April 2016 25 AUTHORS M. MOURCHID Doctorate degree in Computer Science in 1999; Assistant Professor at the Computer Science Department at the Faculty of sciences, Ibn Tofail University in Kenitra Morocco; Ongoing research interests: Natural Language Processing, Web Semantic, and information systems. N. El Faddouli Doctorate degree in Computer Science in 1999; Assistant Professor at the Computer Science Department at the Mohammadia School of Engineers (EMI); 15 recent publications papers between 2007 and 2015; Ongoing research interests: e- learning, Big Da ta, Natural Language Processing, and information systems. Language Processing, and information systems S. AMALI Doctorate degree in Computer Science in 1999; Assistant Professor at the Computer Science Department at the Moulay Ismaïl University; Ongoing research interests: e-learning, Natural Language Processing, and information systems.