SlideShare a Scribd company logo
1 of 10
Download to read offline
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
DOI: 10.5121/ijnlc.2017.6103 23
STANDARD ARABIC VERBS INFLECTIONS USING
NOOJ PLATFORM
Mohammed Mourchid
1
Ilham Blanchete 2
and Abdelaziz Mouloudi3
1
MIC search team, Laboratory MISC, Ibn Tofail University Kenitra- Morocco
2
Department of Computer Science FSK, Ibn Tofail University, Kenitra, Morocco
3
MIC search team, Laboratory MISC, Ibn Tofail University Kenitra- Morocco
ABSTRACT
This article describes the morphological analysis of a standard Arabic natural language processing, as a
part of an electronic dictionary-constricting phase. A fully 3-lettered inflected verbs model are formalized
based on a linguistic classification, using NOOJ platform, the classification gives certain representative
verbs that will considered as lemmas, this verbs form our dictionary entries, they are also conjugated
according to our inflection paradigm relying on certain specific morphological properties. This dictionary
will be considered as an Arabic resource, which will help NLP applications and NOOJ platform to analyse
sophisticated Arabic corpora.
KEYWORDS
Morphological analysis, NOOJ, ANLP & Arabic verb inflections
1. INTRODUCTION
The Arabic natural language applications need a fully and automatic Arabic dictionary to analyse
the sophisticated corpora, as a first phase of building this dictionary we started by formalizing the
trilateral verbs based on a linguistic verbs classification [6]. The linguistic analysis must go
through a first step of lexical and morphological analysis, which consists in testing membership
of each word of the text to the Arabic vocabulary [1] we started from a basic kind of verbs, which
are called trilateral, verbs that contain three letters. Using a specific linguistic
classification of these verbs we guarantee that we are going to cover all Arabic trilateral verbs [7],
this verbs will also attached to their inflectional paradigms to cover all conjugated forms, in this
paper we give examples of our implemented dictionary and grammars in NooJ platform as
figures.
2. DEFINITIONS
2.1. Nooj Platform
NooJ is a linguistic developmental environment, which can analyze texts of several million words
in real time. It includes tools to construct, test and maintain large coverage of lexical resources,
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
24
as well as morphological and syntactic grammars. Dictionaries and grammars are applied to texts
in order to locate morphological, lexicological and syntactic patterns, remove ambiguities, and
tag simple and compound words [5]. NooJ platform works on cascade model; the result of each
analysis step is the input of the next one. For more information please consult the official NooJ
website. We adopted this platform because it allows us to:
- Implement all linguistic analysis phases: morphological, syntactical and semantic
analysis.
- Create our own corpora and apply search option using special queries.
- To implement our grammars and dictionary using its linguistic engine.
- To analyse our text by giving morphological, syntactical and semantic properties of each
word/sentence.
2.2.Nooj Architecture
NooJ platform is Programmed using C#/.net Framework. NooJ follows a component-based
software approach, which is a step beyond the object oriented programming paradigm . The
system consists of three modules, corpus handling, lexicon and grammar development that are
integrated into a single intuitive graphical user interface (command line operation is also
available). NooJ processes texts and corpora (i.e. sets of text files) at the Orthographical, Lexical,
Morphological, Syntactic and Semantic levels. All linguistic information (at any level) is
represented by annotations that are stored in the Text Annotation Structure (TAS)[5]. We use this
platform to formalize the Arabic 3-lettered Verbs model as a first step of Arabic dictionary
constructing phase; starting by building our dictionary that contains the previous verb category
and linking it with the our productive grammars that give all inflectional forms for each
dictionary entry, we will detail this in next sections.
2.3. Natural language
Is a human spoken and/or written languages like Arabic, French, and English.
2.4. Natural Language Processing
Is a subfield of Artificial Intelligence and linguistic, devoted to make computers understand the
Statements or words written in human languages. A natural language also known as a spoken or
written language by people for general-purpose communication [2].
2.5. Arabic Natural Language
Arabic is a Semitic language spoken by more than 330 million people as a native language, in an
area extending from the Arabian/Persian Gulf in the East to the Atlantic Ocean in the West.
Arabic is a highly structured and derivational language where morphology plays a very important
role [2]. Morphology is central in working on Arabic NLP because of its important interactions
with both orthography and syntax. Arabic’s rich morphology is perhaps the most studied and
written about aspect of Arabic. As a result, there is a wealth of terminology, some of it
inconsistent that may intimidate and confuse new researchers [3].
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
25
2.6. Arabic Natural Language Processing
Over the last few years, Arabic natural language processing (ANLP) has gained increasing
importance, and several state-of-the-art systems have been developed for a wide range of
applications, including machine translation, information retrieval and extraction, speech
synthesis and recognition, localization and multilingual information retrieval systems, text to
speech, and tutoring systems. These applications had to deal with several complex problems
pertinent to the nature and structure of the Arabic language. Most ANLP systems developed in
the Western world focus on tools to enable non-Arabic speakers make sense of Arabic texts.
Since understanding Arabic language becomes a point of interest for non Arabic speakers,
funding became available for companies and research centers to develop tools such as named
entity recognition, machine translation, especially spoken machine translation, document
categorization, etc [4].
3. NLP STEPS
There are 3 phases involved in natural language processing: Morphological Analysis, Syntactic
Analysis and Semantic Analysis. The first step will be detailed in section 6. , we will define
briefly the other steps.
3.1. Syntactic Analysis
This involves analysation of the words in a sentence to depict the grammatical structure of the
sentence. The words are transformed into structure that shows how the words are related to each
other Eg. “The girl the go to the school”. This would definitely be rejected by the English
syntactic analyzer [2].
3.2. Semantic Analysis
This abstracts the dictionary meaning or the exact meaning from context. The structures, which
are created by the syntactic analyser, are assigned meaning. There is a mapping between the
syntactic structures and the objects in task domain. Eg. “Colorless blue idea”. The analyser would
reject this as colorless blue do not make any sense together [2]. NooJ helps us to implement this
steps, using their linguistic engine to build our dictionary and grammars rules - as it will
explained in next sections - that gives the inflectional forms for each dictionary,
4. 3-LETTEREDVERBS
Most Arabic words are derived from three-letter Verbs or 3- lettered verbs, we started by
formalizing this kind of verbs standing on their linguistic classification as Figure 1 shows; this
figure will be detailed in section 5.
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
26
Figure 1. Arabic 3-lettered verbs classification, regular verbs part
As each Arabic verb has its morphological prosperities like root and pattern [8], we attached
each verb/dictionary entry with this properties, and with their conjugation form, for instance the
verb (to write - َ َ َ - KaTaBa ) takes ‫ب‬ ‫ت‬ ‫ك‬ as roots letters ( in Arabic the root letters are
separated َ َ َ faEala as pattern and ُ ُ ْ َ yafEalo as conjugation form . The conjugation form of
the previous verb: َ َ َ KaTaBa is ُ ُ ْ َ yaKToBo according to the matching process between the
pattern and the conjugational form ( َ َ َ faEala and ُ ُ ْ َ yafEalo ), There are 3 types of the
3-lettered verbs patterns in Arabic they are distinguished according to the second letter diacritic:
( fatha َ◌kasra ِ◌, dama ُ◌).
َ َ َ – faEala is a pattern of the verb: (to write َ َ َ ) .
َ ُ َ faEula is a pattern of the verb : (to grow – َ ُ َ ).
َ َ ِ –faEila is a pattern of :(to play- َ ِ َ ).
May takes three conjugation forms as table 1 shows.
Table 1. All possible patterns with their conjugation forms in 3-letter Arabic verbs
pattern Conjugation form Conjugation form Conjugation form
faEala َ َ َ yafEalo ُ ُ ْ َ yafEilo ُ ِ ْ َ yafEolo ُ ُ ْ َ
faEula َ ُ َ yafEalo ُ ُ ْ َ yafEilo ُ ِ ْ َ yafEolo ُ ُ ْ َ
faEila َ ِ َ yafEalo ُ ُ ْ َ yafEilo ُ ِ ْ َ yafEolo ُ ُ ْ َ
For instance: the verb (to write - kataba - ) is the result of matching its root with its
pattern, by switching root letters with patterns one; without any changing on the pattern
diacritics : ( X َ َ َ ), as their conjugation form is(yafEolo ُ ُ ْ َ ) it takes this
model : ( ُ ُ ْ َ X ) to be conjugated.
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
27
5. Arabic 3-Letterd Verbs Classification:
The following classification covers all 3-lettered verbs in standard Arabic language, each
representative verbs may takes the three previous patterns ( faEala َ َ َ / faEula َ ُ َ / faEila َ ِ َ ) ,
and each pattern may take 3 conjugation forms ( yafEolo ُ ُ ْ َ / yafEalo ُ َ ْ َ / yafEilo ُ ِ ْ َ ) ,for
example: The path in the following classification as shown in figure 1 : [verb] [regular verbs]
[* ]: represents the verbs that their last letter neither a (n )‫ن‬ nor a (t ‫)ت‬ character , and it takes
all this representative verbs:
( َ َ َ -FaTaHa-to open) as : ( faEala َ َ َ - yafEalo ُ َ ْ َ ).
(َ َ َ - KaTaBa-to write ) as : ( faEala َ َ َ - yafEolo ُ ُ ْ َ ).
(َ َ َ - JaLaSa- to set) as : ( faEala َ َ َ - yafEilo ُ ِ ْ َ ).
(َ ُ َ -KaBoRa- to grow) as : (faEula َ ُ َ - yafEolo ُ ُ ْ َ ).
(َ ِ َ -AaLiMa- to know) as : (faEila َ ِ َ - yafEalo ُ َ ْ َ ).
Figure 1 gives a part of the Arabic 3-lettered verbs classification; here are the definitions of
some used abbreviation.
REG verbs: regular verbs (afeal sahiha – َ ْ ِ َ! ‫ا‬ ‫$ل‬َ ْ َ%‫ا‬ ) contains three verbs kind [
Hamzated verbs – duplicated verbs – salim verbs ].
Hamzated verbs (mahouza – ‫'ز‬ُ(ْ)َ( ‫ا‬ ) verbs that contain the‫أ‬ hamza character
Duplicated verbs (modaEafa – $+‫ﻣ‬ ) verbs that contain a duplicated character.
Salim verbs: (salim $-) verbs that are neither hamzated nor duplicated.
* : verbs that their last letter neither a (n )‫ن‬ nor a (t ‫)ت‬ character , n : last character is ‫ن‬ , t :
last character is ‫.ت‬
1st,2nd,3rd: the first and second and third character in the verb.
IRRG : verbs that contains one of the Arabic long characters : alif . ‫ا‬ , yae ‫$ء‬ , waw ‫واو‬
or the hamza character. In this paper I am going to present only the regular verbs.
Each representative verb considered as dictionary entry that will be assigned to a unique
inflectional paradigm, only and only if the verb accept to be conjugated in standard Arabic,
then all verbs that are conjugate in the same manner, will take the same inflectional
paradigm, tow verbs are conjugated with the same manner or with the same inflectional
paradigm if they have the same conjugation form.
6. Morphological Analysis
The lexicon of a language is its vocabulary that includes its words and expressions, while
morphological Analysis involves dividing a text into paragraphs, words and the sentences
and its main role is to represent the Atomic Language Unit (ALUs) which is the smallest
elements that make up the sentence, we are going to define these ALUs/3-lettered verbs as
dictionary entries that represent the language vocabulary these entries are associated
with their morphological properties which enrich it with linguistic information like: s
means singular, p means plural , as basic properties while we add our specific verb
morphological properties like:(Root/Pattern/Category Numb),that will be used in advanced
analysis phases ,Figure 2 shows our constructed dictionary that calls at first our inflectional
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
28
grammars G_Verbs ,as it is shown in figure 3, the dictionary contains the language vocabulary
with their special morphological properties: V: verb, Tr : transitive verb , 1: verb category
which determines verbs conjugation form( ُ ُ ْ َ - yafEolo), pattern (َ َ َ : faEala ).
Figure 2. Our constructed dictionary
The FLX paradigm represents all inflectional forms in active and passive voice for each
dictionary entry, this FLXs are represented using our defined rules graphs as Figure 3 shows,
we preferred to describe this inflectional paradigms using NooJ’s graphical rules interface that
is equivalent to the textual rule editor, here is our inflectional paradigms that are assigned to
dictionary, each dictionary entry has an inflectional paradigm, that generate all its inflectional
and derivational forms.
Figure 3: Verbs inflectional paradigm
For instance the lexical entry (to write – kataba) has an inflectional
paradigms: FLX=V_Kataba that matches any form in the set of {they write ‫'ا‬
KaTaBo, he writes KaTaBa, we write 1 NaKToBo, you wrote َ2ْ َ َ
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
29
KaTaBTa,..} NooJ recognizes all this forms even if they are semi or non- vowelized.
Each verb is conjugated in 12 deferent tenses as it is shown in Figure 4. This figure shows
all possible verb inflections in both of active and passive voice. ACC_Kataba presents the
past tens, we used the pre-defined NOOJ operators to define the inflections of this entry
here is some special operators <Z>,<T> AND <M> that are defined only for Arabic
language for more information please read NOOJ manual.
Figure 4: all possible verb inflections
Figure 5 shows how we easily define the past tens for the verb (to write) using NOOJ
predefined operators <B> : will erase the last character, <Z> will add ‫ت‬ character while the
last character of the given verb is not a ‫,ت‬ else it will add ‫ﱡ‬‫ت‬ , chadda appearance
refers to a duplication for instance : the conjugated form of the verb ( to write ) for the first
person is I Wrote ُ2ْ َ َ so the operator <Z> adds ‫ت‬ character while the last character of the
given verb is not a ‫ت‬ else it will add ‫ﱡ‬‫ت‬ for all verbs that finished with ‫ت‬ like 2 4 will
converted to ‫ﱡ‬2َ َ4 while its last character is ‫ت‬ . As its mentioned above this graph
recognizes all appearances of the past conjugation set{ I wrote , you wrote ,...}, and
annotate them with the morphological defined properties which appear under each
node for instance { A+I+1+s } : the first singular person in active voice . A: active voice,
I: past / 1: first person and s : singular .
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
30
Figure 5: inflectional graph in active voice
7. ANNOTATIONS
Once our dictionary is compiled, we can use it as resource to analyze any Arabic text, that
contains only 3-letterad verbs or their inflectional forms, when parsing a text or a corpus, NooJ
builds a Text Annotation Structure (TAS) in which each linguistic unit is
represented by a corresponding annotation. An annotation stores the position in the text of
the text unit to be Represented, its length, and linguistic information, (Silberztein 2007).
NooJ adds annotations to the TAS automatically at various stages of the analyses phase,
morphological and syntactic parsers provide tools to add, remove and export annotations to
TAS in morphological level, and the morphological parser typically applies dictionaries to
the text to produce annotations. When the parser recognizes any lemma or
inflectional/derivational form in the text it produces the corresponding morphological
annotations according to the morphological properties that we have assigned before, the
figure 6 shows our text that will be morphologically analyzed on NOOJ platform, it contains
different conjugated forms of the verb ( to write ) vowelized , semi or non vowelized .
Figure 6: text to be analyzed
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
31
The vowelized ALU like َ َ َ KaTaBa he writes will take an expected annotation as it refers
to [the third person, masculine singular], but in case of the non vowelized ALU, Nooj
Table Annotation Structure gives all possible annotations ,this annotations triggered when
the parser matches any inflectional form of a dictionary lemma that we are defined in the
inflectional paradigm with the text ALU’s, for instance the ALU 2 takes all possible
annotations, as it is non- vowelized and the annotations of this inflectional forms
{‫ُ'ا‬ َ َ , َ‫ُ'ن‬ ُ ْ َ ,2 , َ َ َ …..}Will be triggered, as they are defined in the inflectional
paradigm V_Kataba . The first annotation form is dedicated for the fully diarized verb , the
annotation shows that this entry is a verb V , and it is a transitive verb Tr, gives their root 2 6‫ك‬
, and their pattern is , NOOJ TAS gives all this properties as Representative verbs that
are considered as dictionary entries, all verbs that have the same conjugation form will be
assigned to the same inflectional paradigm. Once the dictionary is compiled we can easily
use it as a linguistic recourse to analyze the sophisticated corpora.
8. CONCLUSION AND PRESPECTIVES
Using the given 3-lettered linguistic classification, we constructed fully inflected verbal Arabic
recourses of the previous verbs category, using NOOJ platform.
Lemma –based verbs are used
as dictionary entries; an inflectional paradigm may be assigned to a dictionary entry that gives
all possible conjugated forms of the entry. The dictionary contains representative verbs for each
leaf of the given linguistic classification tree, each leaf has at most 9 representative verbs that
are considered as dictionary entries, all verbs that have the same conjugation form will be
assigned to the same inflectional paradigm. Once the dictionary is compiled we can easily use it
as a linguistic recourse to analyze the sophisticated corpora. The perspective opened over this
work is to extend our dictionary to the inflections of the rest verbs categories, nouns,
adjectives also to add some morphological grammars in order to generate broken plural, ALU’s
with affixes and other sophisticated morphological phenomena like ibdal and ielal and idgham .
REFERENCES
[1] D. Revuz ,( 1991)”Dictionnaires et lexiques”,Thesis, Paris 7 University. : methodes et algorithmes
(Doctoral dissertation).
[2] A. Chopra, A. Prashar and C. Sain, "Natural Language Processing", the International journal of
technology enhancement and emerging engineering research, vol. 1, issue 4, ISSN 2
[3] N. Habash, (2010) Introduction to Arabic Natural Language Processing, Morgan & Claypool
Publishers series.
[4] K. Shaalan, A. Farghaly, (2009) ” Arabic Natural Language Processing: Challenges and Solutions”,
ACM Transactions on Asian Language Information Processing (TALIP) 8.4,14.
[5] S. Max,”Nooj Manual” , www.nooj-association.org.
[6] M.El-Ghalayani,(2004) “Jamie aldorous alaarabia”,al moassassa al haditha lilkitab, Tripoli,Lebanon,.
[7] M Mohamed, “Generation Morphological and Applications”, Specialty thesis of 3rd round,
Mohammed V University in Rabat-Morocco, 1999.
[8] A.Yousfi, (2010) “The morphological analysis of Arabic verbs by using the surface patterns”, IJCSI
International Journal of Computer Science Issues,7(3(11)): p. 33-36.
International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017
32
Authors
M. Mourchid Doctorate Degree In Computer Science In 1999; Associate
Professor At The Computer Science Department At The Faculty Of
Sciences, Ibn Tofail University In Kenitra Morocco; On Going Research
Interests: Natural Language Processing, Web Semantic, And Information
Systems.
I. Blanchete, Phd Student At Ibn Tofail University Department Of
Computer Science, Laboratory Of Misc Kenitra , Morocco ;Graduated
From Damascus University Faculty Of It Engineering 2010.
A. Mouloudi was born in 1959 in Morocco. He received the B.S degreein
applied mathematics from Mohamed V University in Morocco at
1982;Master in Computer Science from the University Mohammed V, at
1984; Ph.D in Computer Science from the same university, at 1988. He
obtains Habilitation to direct academic research in Computer Science, from
Ibn Tofail University in Morocco, at 2008. Currently, A. MOULOUDI is
the Director of the laboratory MISC (Information Modelling and
Communication System), at the sciences faculty, Ibn Tofail University, in
Morocco.

More Related Content

What's hot

Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...iosrjce
 
Grapheme-To-Phoneme Tools for the Marathi Speech Synthesis
Grapheme-To-Phoneme Tools for the Marathi Speech SynthesisGrapheme-To-Phoneme Tools for the Marathi Speech Synthesis
Grapheme-To-Phoneme Tools for the Marathi Speech SynthesisIJERA Editor
 
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEijnlc
 
Smart grammar a dynamic spoken language understanding grammar for inflective ...
Smart grammar a dynamic spoken language understanding grammar for inflective ...Smart grammar a dynamic spoken language understanding grammar for inflective ...
Smart grammar a dynamic spoken language understanding grammar for inflective ...ijnlc
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerijnlc
 
T URN S EGMENTATION I NTO U TTERANCES F OR A RABIC S PONTANEOUS D IALOGUES ...
T URN S EGMENTATION I NTO U TTERANCES F OR  A RABIC  S PONTANEOUS D IALOGUES ...T URN S EGMENTATION I NTO U TTERANCES F OR  A RABIC  S PONTANEOUS D IALOGUES ...
T URN S EGMENTATION I NTO U TTERANCES F OR A RABIC S PONTANEOUS D IALOGUES ...ijnlc
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...ijma
 
Hps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodHps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodijnlc
 
Transliteration by orthography or phonology for hindi and marathi to english ...
Transliteration by orthography or phonology for hindi and marathi to english ...Transliteration by orthography or phonology for hindi and marathi to english ...
Transliteration by orthography or phonology for hindi and marathi to english ...ijnlc
 
HANDLING CHALLENGES IN RULE BASED MACHINE TRANSLATION FROM MARATHI TO ENGLISH
HANDLING CHALLENGES IN RULE BASED MACHINE TRANSLATION FROM MARATHI TO ENGLISHHANDLING CHALLENGES IN RULE BASED MACHINE TRANSLATION FROM MARATHI TO ENGLISH
HANDLING CHALLENGES IN RULE BASED MACHINE TRANSLATION FROM MARATHI TO ENGLISHijnlc
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShashank Shisodia
 
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMSCOMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMSIJMIT JOURNAL
 
Identification of prosodic features of punjabi for enhancing the pronunciatio...
Identification of prosodic features of punjabi for enhancing the pronunciatio...Identification of prosodic features of punjabi for enhancing the pronunciatio...
Identification of prosodic features of punjabi for enhancing the pronunciatio...ijnlc
 
A New Approach to Parts of Speech Tagging in Malayalam
A New Approach to Parts of Speech Tagging in MalayalamA New Approach to Parts of Speech Tagging in Malayalam
A New Approach to Parts of Speech Tagging in Malayalamijcsit
 

What's hot (17)

Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
 
Grapheme-To-Phoneme Tools for the Marathi Speech Synthesis
Grapheme-To-Phoneme Tools for the Marathi Speech SynthesisGrapheme-To-Phoneme Tools for the Marathi Speech Synthesis
Grapheme-To-Phoneme Tools for the Marathi Speech Synthesis
 
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGEADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
ADVANCEMENTS ON NLP APPLICATIONS FOR MANIPURI LANGUAGE
 
Smart grammar a dynamic spoken language understanding grammar for inflective ...
Smart grammar a dynamic spoken language understanding grammar for inflective ...Smart grammar a dynamic spoken language understanding grammar for inflective ...
Smart grammar a dynamic spoken language understanding grammar for inflective ...
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzer
 
T URN S EGMENTATION I NTO U TTERANCES F OR A RABIC S PONTANEOUS D IALOGUES ...
T URN S EGMENTATION I NTO U TTERANCES F OR  A RABIC  S PONTANEOUS D IALOGUES ...T URN S EGMENTATION I NTO U TTERANCES F OR  A RABIC  S PONTANEOUS D IALOGUES ...
T URN S EGMENTATION I NTO U TTERANCES F OR A RABIC S PONTANEOUS D IALOGUES ...
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
 
Hps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodHps a hierarchical persian stemming method
Hps a hierarchical persian stemming method
 
Transliteration by orthography or phonology for hindi and marathi to english ...
Transliteration by orthography or phonology for hindi and marathi to english ...Transliteration by orthography or phonology for hindi and marathi to english ...
Transliteration by orthography or phonology for hindi and marathi to english ...
 
HANDLING CHALLENGES IN RULE BASED MACHINE TRANSLATION FROM MARATHI TO ENGLISH
HANDLING CHALLENGES IN RULE BASED MACHINE TRANSLATION FROM MARATHI TO ENGLISHHANDLING CHALLENGES IN RULE BASED MACHINE TRANSLATION FROM MARATHI TO ENGLISH
HANDLING CHALLENGES IN RULE BASED MACHINE TRANSLATION FROM MARATHI TO ENGLISH
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
 
Interpreter
InterpreterInterpreter
Interpreter
 
Aw32322326
Aw32322326Aw32322326
Aw32322326
 
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMSCOMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS
 
Identification of prosodic features of punjabi for enhancing the pronunciatio...
Identification of prosodic features of punjabi for enhancing the pronunciatio...Identification of prosodic features of punjabi for enhancing the pronunciatio...
Identification of prosodic features of punjabi for enhancing the pronunciatio...
 
A New Approach to Parts of Speech Tagging in Malayalam
A New Approach to Parts of Speech Tagging in MalayalamA New Approach to Parts of Speech Tagging in Malayalam
A New Approach to Parts of Speech Tagging in Malayalam
 

Viewers also liked

A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLPA NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLPijnlc
 
DESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGY
DESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGYDESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGY
DESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGYVLSICS Design
 
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)ijnlc
 
BUILDING A SYLLABLE DATABASE TO SOLVE THE PROBLEM OF KHMER WORD SEGMENTATION
BUILDING A SYLLABLE DATABASE TO SOLVE THE PROBLEM OF KHMER WORD SEGMENTATIONBUILDING A SYLLABLE DATABASE TO SOLVE THE PROBLEM OF KHMER WORD SEGMENTATION
BUILDING A SYLLABLE DATABASE TO SOLVE THE PROBLEM OF KHMER WORD SEGMENTATIONijnlc
 
MULTIMODAL BIOMETRICS RECOGNITION FROM FACIAL VIDEO VIA DEEP LEARNING
MULTIMODAL BIOMETRICS RECOGNITION FROM FACIAL VIDEO VIA DEEP LEARNINGMULTIMODAL BIOMETRICS RECOGNITION FROM FACIAL VIDEO VIA DEEP LEARNING
MULTIMODAL BIOMETRICS RECOGNITION FROM FACIAL VIDEO VIA DEEP LEARNINGsipij
 
INNOVATIVE AND SOCIAL TECHNOLOGIES ARE REVOLUTIONIZING SAUDI CONSUMER ATTITUD...
INNOVATIVE AND SOCIAL TECHNOLOGIES ARE REVOLUTIONIZING SAUDI CONSUMER ATTITUD...INNOVATIVE AND SOCIAL TECHNOLOGIES ARE REVOLUTIONIZING SAUDI CONSUMER ATTITUD...
INNOVATIVE AND SOCIAL TECHNOLOGIES ARE REVOLUTIONIZING SAUDI CONSUMER ATTITUD...ijsptm
 
ANALYSIS OF MMSE SPEECH ESTIMATION IMPACT IN WEST SUMATRA'S NOISES
ANALYSIS OF MMSE SPEECH ESTIMATION IMPACT IN WEST SUMATRA'S NOISESANALYSIS OF MMSE SPEECH ESTIMATION IMPACT IN WEST SUMATRA'S NOISES
ANALYSIS OF MMSE SPEECH ESTIMATION IMPACT IN WEST SUMATRA'S NOISESsipij
 
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTSA PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTSijsc
 
ANALYSIS OF MWES IN HINDI TEXT USING NLTK
ANALYSIS OF MWES IN HINDI TEXT USING NLTKANALYSIS OF MWES IN HINDI TEXT USING NLTK
ANALYSIS OF MWES IN HINDI TEXT USING NLTKijnlc
 
A PREDICTION METHOD OF GESTURE TRAJECTORY BASED ON LEAST SQUARES FITTING MODEL
A PREDICTION METHOD OF GESTURE TRAJECTORY BASED ON LEAST SQUARES FITTING MODELA PREDICTION METHOD OF GESTURE TRAJECTORY BASED ON LEAST SQUARES FITTING MODEL
A PREDICTION METHOD OF GESTURE TRAJECTORY BASED ON LEAST SQUARES FITTING MODELVLSICS Design
 
A CASE STUDY ON AUTO SOCIALIZATION IN ONLINE PLATFORMS
A CASE STUDY ON AUTO SOCIALIZATION IN ONLINE PLATFORMSA CASE STUDY ON AUTO SOCIALIZATION IN ONLINE PLATFORMS
A CASE STUDY ON AUTO SOCIALIZATION IN ONLINE PLATFORMSIJMIT JOURNAL
 
TRUST: DIFFERENT VIEWS, ONE GOAL
TRUST: DIFFERENT VIEWS, ONE GOALTRUST: DIFFERENT VIEWS, ONE GOAL
TRUST: DIFFERENT VIEWS, ONE GOALijsptm
 
A STUDY ON QUANTITATIVE PARAMETERS OF SPECTRUM HANDOFF IN COGNITIVE RADIO NET...
A STUDY ON QUANTITATIVE PARAMETERS OF SPECTRUM HANDOFF IN COGNITIVE RADIO NET...A STUDY ON QUANTITATIVE PARAMETERS OF SPECTRUM HANDOFF IN COGNITIVE RADIO NET...
A STUDY ON QUANTITATIVE PARAMETERS OF SPECTRUM HANDOFF IN COGNITIVE RADIO NET...ijwmn
 
NETWORK PERFORMANCE ENHANCEMENT WITH OPTIMIZATION SENSOR PLACEMENT IN WIRELES...
NETWORK PERFORMANCE ENHANCEMENT WITH OPTIMIZATION SENSOR PLACEMENT IN WIRELES...NETWORK PERFORMANCE ENHANCEMENT WITH OPTIMIZATION SENSOR PLACEMENT IN WIRELES...
NETWORK PERFORMANCE ENHANCEMENT WITH OPTIMIZATION SENSOR PLACEMENT IN WIRELES...ijwmn
 
DEVELOPMENT OF AN ANDROID APPLICATION FOR OBJECT DETECTION BASED ON COLOR, SH...
DEVELOPMENT OF AN ANDROID APPLICATION FOR OBJECT DETECTION BASED ON COLOR, SH...DEVELOPMENT OF AN ANDROID APPLICATION FOR OBJECT DETECTION BASED ON COLOR, SH...
DEVELOPMENT OF AN ANDROID APPLICATION FOR OBJECT DETECTION BASED ON COLOR, SH...ijma
 
A NOVEL METHOD FOR PERSON TRACKING BASED K-NN : COMPARISON WITH SIFT AND MEAN...
A NOVEL METHOD FOR PERSON TRACKING BASED K-NN : COMPARISON WITH SIFT AND MEAN...A NOVEL METHOD FOR PERSON TRACKING BASED K-NN : COMPARISON WITH SIFT AND MEAN...
A NOVEL METHOD FOR PERSON TRACKING BASED K-NN : COMPARISON WITH SIFT AND MEAN...sipij
 
A NOVEL IMAGE STEGANOGRAPHY APPROACH USING MULTI-LAYERS DCT FEATURES BASED ON...
A NOVEL IMAGE STEGANOGRAPHY APPROACH USING MULTI-LAYERS DCT FEATURES BASED ON...A NOVEL IMAGE STEGANOGRAPHY APPROACH USING MULTI-LAYERS DCT FEATURES BASED ON...
A NOVEL IMAGE STEGANOGRAPHY APPROACH USING MULTI-LAYERS DCT FEATURES BASED ON...ijma
 
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...sipij
 
A COLLAGE IMAGE CREATION & “KANISEI” ANALYSIS SYSTEM BY COMBINING MULTIPLE IM...
A COLLAGE IMAGE CREATION & “KANISEI” ANALYSIS SYSTEM BY COMBINING MULTIPLE IM...A COLLAGE IMAGE CREATION & “KANISEI” ANALYSIS SYSTEM BY COMBINING MULTIPLE IM...
A COLLAGE IMAGE CREATION & “KANISEI” ANALYSIS SYSTEM BY COMBINING MULTIPLE IM...ijma
 
ASPHALTIC MATERIAL IN THE CONTEXT OF GENERALIZED POROTHERMOELASTICITY
ASPHALTIC MATERIAL IN THE CONTEXT OF GENERALIZED POROTHERMOELASTICITYASPHALTIC MATERIAL IN THE CONTEXT OF GENERALIZED POROTHERMOELASTICITY
ASPHALTIC MATERIAL IN THE CONTEXT OF GENERALIZED POROTHERMOELASTICITYijsc
 

Viewers also liked (20)

A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLPA NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
 
DESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGY
DESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGYDESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGY
DESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGY
 
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)
 
BUILDING A SYLLABLE DATABASE TO SOLVE THE PROBLEM OF KHMER WORD SEGMENTATION
BUILDING A SYLLABLE DATABASE TO SOLVE THE PROBLEM OF KHMER WORD SEGMENTATIONBUILDING A SYLLABLE DATABASE TO SOLVE THE PROBLEM OF KHMER WORD SEGMENTATION
BUILDING A SYLLABLE DATABASE TO SOLVE THE PROBLEM OF KHMER WORD SEGMENTATION
 
MULTIMODAL BIOMETRICS RECOGNITION FROM FACIAL VIDEO VIA DEEP LEARNING
MULTIMODAL BIOMETRICS RECOGNITION FROM FACIAL VIDEO VIA DEEP LEARNINGMULTIMODAL BIOMETRICS RECOGNITION FROM FACIAL VIDEO VIA DEEP LEARNING
MULTIMODAL BIOMETRICS RECOGNITION FROM FACIAL VIDEO VIA DEEP LEARNING
 
INNOVATIVE AND SOCIAL TECHNOLOGIES ARE REVOLUTIONIZING SAUDI CONSUMER ATTITUD...
INNOVATIVE AND SOCIAL TECHNOLOGIES ARE REVOLUTIONIZING SAUDI CONSUMER ATTITUD...INNOVATIVE AND SOCIAL TECHNOLOGIES ARE REVOLUTIONIZING SAUDI CONSUMER ATTITUD...
INNOVATIVE AND SOCIAL TECHNOLOGIES ARE REVOLUTIONIZING SAUDI CONSUMER ATTITUD...
 
ANALYSIS OF MMSE SPEECH ESTIMATION IMPACT IN WEST SUMATRA'S NOISES
ANALYSIS OF MMSE SPEECH ESTIMATION IMPACT IN WEST SUMATRA'S NOISESANALYSIS OF MMSE SPEECH ESTIMATION IMPACT IN WEST SUMATRA'S NOISES
ANALYSIS OF MMSE SPEECH ESTIMATION IMPACT IN WEST SUMATRA'S NOISES
 
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTSA PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
 
ANALYSIS OF MWES IN HINDI TEXT USING NLTK
ANALYSIS OF MWES IN HINDI TEXT USING NLTKANALYSIS OF MWES IN HINDI TEXT USING NLTK
ANALYSIS OF MWES IN HINDI TEXT USING NLTK
 
A PREDICTION METHOD OF GESTURE TRAJECTORY BASED ON LEAST SQUARES FITTING MODEL
A PREDICTION METHOD OF GESTURE TRAJECTORY BASED ON LEAST SQUARES FITTING MODELA PREDICTION METHOD OF GESTURE TRAJECTORY BASED ON LEAST SQUARES FITTING MODEL
A PREDICTION METHOD OF GESTURE TRAJECTORY BASED ON LEAST SQUARES FITTING MODEL
 
A CASE STUDY ON AUTO SOCIALIZATION IN ONLINE PLATFORMS
A CASE STUDY ON AUTO SOCIALIZATION IN ONLINE PLATFORMSA CASE STUDY ON AUTO SOCIALIZATION IN ONLINE PLATFORMS
A CASE STUDY ON AUTO SOCIALIZATION IN ONLINE PLATFORMS
 
TRUST: DIFFERENT VIEWS, ONE GOAL
TRUST: DIFFERENT VIEWS, ONE GOALTRUST: DIFFERENT VIEWS, ONE GOAL
TRUST: DIFFERENT VIEWS, ONE GOAL
 
A STUDY ON QUANTITATIVE PARAMETERS OF SPECTRUM HANDOFF IN COGNITIVE RADIO NET...
A STUDY ON QUANTITATIVE PARAMETERS OF SPECTRUM HANDOFF IN COGNITIVE RADIO NET...A STUDY ON QUANTITATIVE PARAMETERS OF SPECTRUM HANDOFF IN COGNITIVE RADIO NET...
A STUDY ON QUANTITATIVE PARAMETERS OF SPECTRUM HANDOFF IN COGNITIVE RADIO NET...
 
NETWORK PERFORMANCE ENHANCEMENT WITH OPTIMIZATION SENSOR PLACEMENT IN WIRELES...
NETWORK PERFORMANCE ENHANCEMENT WITH OPTIMIZATION SENSOR PLACEMENT IN WIRELES...NETWORK PERFORMANCE ENHANCEMENT WITH OPTIMIZATION SENSOR PLACEMENT IN WIRELES...
NETWORK PERFORMANCE ENHANCEMENT WITH OPTIMIZATION SENSOR PLACEMENT IN WIRELES...
 
DEVELOPMENT OF AN ANDROID APPLICATION FOR OBJECT DETECTION BASED ON COLOR, SH...
DEVELOPMENT OF AN ANDROID APPLICATION FOR OBJECT DETECTION BASED ON COLOR, SH...DEVELOPMENT OF AN ANDROID APPLICATION FOR OBJECT DETECTION BASED ON COLOR, SH...
DEVELOPMENT OF AN ANDROID APPLICATION FOR OBJECT DETECTION BASED ON COLOR, SH...
 
A NOVEL METHOD FOR PERSON TRACKING BASED K-NN : COMPARISON WITH SIFT AND MEAN...
A NOVEL METHOD FOR PERSON TRACKING BASED K-NN : COMPARISON WITH SIFT AND MEAN...A NOVEL METHOD FOR PERSON TRACKING BASED K-NN : COMPARISON WITH SIFT AND MEAN...
A NOVEL METHOD FOR PERSON TRACKING BASED K-NN : COMPARISON WITH SIFT AND MEAN...
 
A NOVEL IMAGE STEGANOGRAPHY APPROACH USING MULTI-LAYERS DCT FEATURES BASED ON...
A NOVEL IMAGE STEGANOGRAPHY APPROACH USING MULTI-LAYERS DCT FEATURES BASED ON...A NOVEL IMAGE STEGANOGRAPHY APPROACH USING MULTI-LAYERS DCT FEATURES BASED ON...
A NOVEL IMAGE STEGANOGRAPHY APPROACH USING MULTI-LAYERS DCT FEATURES BASED ON...
 
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
 
A COLLAGE IMAGE CREATION & “KANISEI” ANALYSIS SYSTEM BY COMBINING MULTIPLE IM...
A COLLAGE IMAGE CREATION & “KANISEI” ANALYSIS SYSTEM BY COMBINING MULTIPLE IM...A COLLAGE IMAGE CREATION & “KANISEI” ANALYSIS SYSTEM BY COMBINING MULTIPLE IM...
A COLLAGE IMAGE CREATION & “KANISEI” ANALYSIS SYSTEM BY COMBINING MULTIPLE IM...
 
ASPHALTIC MATERIAL IN THE CONTEXT OF GENERALIZED POROTHERMOELASTICITY
ASPHALTIC MATERIAL IN THE CONTEXT OF GENERALIZED POROTHERMOELASTICITYASPHALTIC MATERIAL IN THE CONTEXT OF GENERALIZED POROTHERMOELASTICITY
ASPHALTIC MATERIAL IN THE CONTEXT OF GENERALIZED POROTHERMOELASTICITY
 

Similar to STANDARD ARABIC VERBS INFLECTIONS USING NOOJ PLATFORM

Construction of Amharic-arabic Parallel Text Corpus for Neural Machine Transl...
Construction of Amharic-arabic Parallel Text Corpus for Neural Machine Transl...Construction of Amharic-arabic Parallel Text Corpus for Neural Machine Transl...
Construction of Amharic-arabic Parallel Text Corpus for Neural Machine Transl...gerogepatton
 
CONSTRUCTION OF AMHARIC-ARABIC PARALLEL TEXT CORPUS FOR NEURAL MACHINE TRANSL...
CONSTRUCTION OF AMHARIC-ARABIC PARALLEL TEXT CORPUS FOR NEURAL MACHINE TRANSL...CONSTRUCTION OF AMHARIC-ARABIC PARALLEL TEXT CORPUS FOR NEURAL MACHINE TRANSL...
CONSTRUCTION OF AMHARIC-ARABIC PARALLEL TEXT CORPUS FOR NEURAL MACHINE TRANSL...gerogepatton
 
Arabic words stemming approach using arabic wordnet
Arabic words stemming approach using arabic wordnetArabic words stemming approach using arabic wordnet
Arabic words stemming approach using arabic wordnetIJDKP
 
Deterministic Finite State Automaton of Arabic Verb System: A Morphological S...
Deterministic Finite State Automaton of Arabic Verb System: A Morphological S...Deterministic Finite State Automaton of Arabic Verb System: A Morphological S...
Deterministic Finite State Automaton of Arabic Verb System: A Morphological S...CSCJournals
 
XMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic LanguageXMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic LanguageWaqas Tariq
 
Building of Database for English-Azerbaijani Machine Translation Expert System
Building of Database for English-Azerbaijani Machine Translation Expert SystemBuilding of Database for English-Azerbaijani Machine Translation Expert System
Building of Database for English-Azerbaijani Machine Translation Expert SystemWaqas Tariq
 
DEVELOPING A SIMPLIFIED MORPHOLOGICAL ANALYZER FOR ARABIC PRONOMINAL SYSTEM
DEVELOPING A SIMPLIFIED MORPHOLOGICAL ANALYZER FOR ARABIC PRONOMINAL SYSTEMDEVELOPING A SIMPLIFIED MORPHOLOGICAL ANALYZER FOR ARABIC PRONOMINAL SYSTEM
DEVELOPING A SIMPLIFIED MORPHOLOGICAL ANALYZER FOR ARABIC PRONOMINAL SYSTEMkevig
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silencepaperpublications3
 
Hybrid approaches for automatic vowelization of arabic texts
Hybrid approaches for automatic vowelization of arabic textsHybrid approaches for automatic vowelization of arabic texts
Hybrid approaches for automatic vowelization of arabic textsijnlc
 
Using automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityUsing automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityijaia
 
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...ijnlc
 
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITYUSING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITYijaia
 
An expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicAn expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicijnlc
 
Hybrid Phonemic and Graphemic Modeling for Arabic Speech Recognition
Hybrid Phonemic and Graphemic Modeling for Arabic Speech RecognitionHybrid Phonemic and Graphemic Modeling for Arabic Speech Recognition
Hybrid Phonemic and Graphemic Modeling for Arabic Speech RecognitionWaqas Tariq
 
The Arabic Speech Database: PADAS
The Arabic Speech Database: PADASThe Arabic Speech Database: PADAS
The Arabic Speech Database: PADASCSCJournals
 
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICONFURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICONkevig
 
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICONFURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICONijnlc
 
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...kevig
 
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...kevig
 

Similar to STANDARD ARABIC VERBS INFLECTIONS USING NOOJ PLATFORM (20)

Construction of Amharic-arabic Parallel Text Corpus for Neural Machine Transl...
Construction of Amharic-arabic Parallel Text Corpus for Neural Machine Transl...Construction of Amharic-arabic Parallel Text Corpus for Neural Machine Transl...
Construction of Amharic-arabic Parallel Text Corpus for Neural Machine Transl...
 
CONSTRUCTION OF AMHARIC-ARABIC PARALLEL TEXT CORPUS FOR NEURAL MACHINE TRANSL...
CONSTRUCTION OF AMHARIC-ARABIC PARALLEL TEXT CORPUS FOR NEURAL MACHINE TRANSL...CONSTRUCTION OF AMHARIC-ARABIC PARALLEL TEXT CORPUS FOR NEURAL MACHINE TRANSL...
CONSTRUCTION OF AMHARIC-ARABIC PARALLEL TEXT CORPUS FOR NEURAL MACHINE TRANSL...
 
Arabic words stemming approach using arabic wordnet
Arabic words stemming approach using arabic wordnetArabic words stemming approach using arabic wordnet
Arabic words stemming approach using arabic wordnet
 
Deterministic Finite State Automaton of Arabic Verb System: A Morphological S...
Deterministic Finite State Automaton of Arabic Verb System: A Morphological S...Deterministic Finite State Automaton of Arabic Verb System: A Morphological S...
Deterministic Finite State Automaton of Arabic Verb System: A Morphological S...
 
XMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic LanguageXMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic Language
 
Building of Database for English-Azerbaijani Machine Translation Expert System
Building of Database for English-Azerbaijani Machine Translation Expert SystemBuilding of Database for English-Azerbaijani Machine Translation Expert System
Building of Database for English-Azerbaijani Machine Translation Expert System
 
DEVELOPING A SIMPLIFIED MORPHOLOGICAL ANALYZER FOR ARABIC PRONOMINAL SYSTEM
DEVELOPING A SIMPLIFIED MORPHOLOGICAL ANALYZER FOR ARABIC PRONOMINAL SYSTEMDEVELOPING A SIMPLIFIED MORPHOLOGICAL ANALYZER FOR ARABIC PRONOMINAL SYSTEM
DEVELOPING A SIMPLIFIED MORPHOLOGICAL ANALYZER FOR ARABIC PRONOMINAL SYSTEM
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
 
Hybrid approaches for automatic vowelization of arabic texts
Hybrid approaches for automatic vowelization of arabic textsHybrid approaches for automatic vowelization of arabic texts
Hybrid approaches for automatic vowelization of arabic texts
 
Jq3616701679
Jq3616701679Jq3616701679
Jq3616701679
 
Using automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityUsing automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivity
 
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
 
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITYUSING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
 
An expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicAn expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabic
 
Hybrid Phonemic and Graphemic Modeling for Arabic Speech Recognition
Hybrid Phonemic and Graphemic Modeling for Arabic Speech RecognitionHybrid Phonemic and Graphemic Modeling for Arabic Speech Recognition
Hybrid Phonemic and Graphemic Modeling for Arabic Speech Recognition
 
The Arabic Speech Database: PADAS
The Arabic Speech Database: PADASThe Arabic Speech Database: PADAS
The Arabic Speech Database: PADAS
 
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICONFURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
 
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICONFURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
FURTHER INVESTIGATIONS ON DEVELOPING AN ARABIC SENTIMENT LEXICON
 
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
 
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
A GRAMMATICALLY AND STRUCTURALLY BASED PART OF SPEECH (POS) TAGGER FOR ARABIC...
 

Recently uploaded

Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

STANDARD ARABIC VERBS INFLECTIONS USING NOOJ PLATFORM

  • 1. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 DOI: 10.5121/ijnlc.2017.6103 23 STANDARD ARABIC VERBS INFLECTIONS USING NOOJ PLATFORM Mohammed Mourchid 1 Ilham Blanchete 2 and Abdelaziz Mouloudi3 1 MIC search team, Laboratory MISC, Ibn Tofail University Kenitra- Morocco 2 Department of Computer Science FSK, Ibn Tofail University, Kenitra, Morocco 3 MIC search team, Laboratory MISC, Ibn Tofail University Kenitra- Morocco ABSTRACT This article describes the morphological analysis of a standard Arabic natural language processing, as a part of an electronic dictionary-constricting phase. A fully 3-lettered inflected verbs model are formalized based on a linguistic classification, using NOOJ platform, the classification gives certain representative verbs that will considered as lemmas, this verbs form our dictionary entries, they are also conjugated according to our inflection paradigm relying on certain specific morphological properties. This dictionary will be considered as an Arabic resource, which will help NLP applications and NOOJ platform to analyse sophisticated Arabic corpora. KEYWORDS Morphological analysis, NOOJ, ANLP & Arabic verb inflections 1. INTRODUCTION The Arabic natural language applications need a fully and automatic Arabic dictionary to analyse the sophisticated corpora, as a first phase of building this dictionary we started by formalizing the trilateral verbs based on a linguistic verbs classification [6]. The linguistic analysis must go through a first step of lexical and morphological analysis, which consists in testing membership of each word of the text to the Arabic vocabulary [1] we started from a basic kind of verbs, which are called trilateral, verbs that contain three letters. Using a specific linguistic classification of these verbs we guarantee that we are going to cover all Arabic trilateral verbs [7], this verbs will also attached to their inflectional paradigms to cover all conjugated forms, in this paper we give examples of our implemented dictionary and grammars in NooJ platform as figures. 2. DEFINITIONS 2.1. Nooj Platform NooJ is a linguistic developmental environment, which can analyze texts of several million words in real time. It includes tools to construct, test and maintain large coverage of lexical resources,
  • 2. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 24 as well as morphological and syntactic grammars. Dictionaries and grammars are applied to texts in order to locate morphological, lexicological and syntactic patterns, remove ambiguities, and tag simple and compound words [5]. NooJ platform works on cascade model; the result of each analysis step is the input of the next one. For more information please consult the official NooJ website. We adopted this platform because it allows us to: - Implement all linguistic analysis phases: morphological, syntactical and semantic analysis. - Create our own corpora and apply search option using special queries. - To implement our grammars and dictionary using its linguistic engine. - To analyse our text by giving morphological, syntactical and semantic properties of each word/sentence. 2.2.Nooj Architecture NooJ platform is Programmed using C#/.net Framework. NooJ follows a component-based software approach, which is a step beyond the object oriented programming paradigm . The system consists of three modules, corpus handling, lexicon and grammar development that are integrated into a single intuitive graphical user interface (command line operation is also available). NooJ processes texts and corpora (i.e. sets of text files) at the Orthographical, Lexical, Morphological, Syntactic and Semantic levels. All linguistic information (at any level) is represented by annotations that are stored in the Text Annotation Structure (TAS)[5]. We use this platform to formalize the Arabic 3-lettered Verbs model as a first step of Arabic dictionary constructing phase; starting by building our dictionary that contains the previous verb category and linking it with the our productive grammars that give all inflectional forms for each dictionary entry, we will detail this in next sections. 2.3. Natural language Is a human spoken and/or written languages like Arabic, French, and English. 2.4. Natural Language Processing Is a subfield of Artificial Intelligence and linguistic, devoted to make computers understand the Statements or words written in human languages. A natural language also known as a spoken or written language by people for general-purpose communication [2]. 2.5. Arabic Natural Language Arabic is a Semitic language spoken by more than 330 million people as a native language, in an area extending from the Arabian/Persian Gulf in the East to the Atlantic Ocean in the West. Arabic is a highly structured and derivational language where morphology plays a very important role [2]. Morphology is central in working on Arabic NLP because of its important interactions with both orthography and syntax. Arabic’s rich morphology is perhaps the most studied and written about aspect of Arabic. As a result, there is a wealth of terminology, some of it inconsistent that may intimidate and confuse new researchers [3].
  • 3. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 25 2.6. Arabic Natural Language Processing Over the last few years, Arabic natural language processing (ANLP) has gained increasing importance, and several state-of-the-art systems have been developed for a wide range of applications, including machine translation, information retrieval and extraction, speech synthesis and recognition, localization and multilingual information retrieval systems, text to speech, and tutoring systems. These applications had to deal with several complex problems pertinent to the nature and structure of the Arabic language. Most ANLP systems developed in the Western world focus on tools to enable non-Arabic speakers make sense of Arabic texts. Since understanding Arabic language becomes a point of interest for non Arabic speakers, funding became available for companies and research centers to develop tools such as named entity recognition, machine translation, especially spoken machine translation, document categorization, etc [4]. 3. NLP STEPS There are 3 phases involved in natural language processing: Morphological Analysis, Syntactic Analysis and Semantic Analysis. The first step will be detailed in section 6. , we will define briefly the other steps. 3.1. Syntactic Analysis This involves analysation of the words in a sentence to depict the grammatical structure of the sentence. The words are transformed into structure that shows how the words are related to each other Eg. “The girl the go to the school”. This would definitely be rejected by the English syntactic analyzer [2]. 3.2. Semantic Analysis This abstracts the dictionary meaning or the exact meaning from context. The structures, which are created by the syntactic analyser, are assigned meaning. There is a mapping between the syntactic structures and the objects in task domain. Eg. “Colorless blue idea”. The analyser would reject this as colorless blue do not make any sense together [2]. NooJ helps us to implement this steps, using their linguistic engine to build our dictionary and grammars rules - as it will explained in next sections - that gives the inflectional forms for each dictionary, 4. 3-LETTEREDVERBS Most Arabic words are derived from three-letter Verbs or 3- lettered verbs, we started by formalizing this kind of verbs standing on their linguistic classification as Figure 1 shows; this figure will be detailed in section 5.
  • 4. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 26 Figure 1. Arabic 3-lettered verbs classification, regular verbs part As each Arabic verb has its morphological prosperities like root and pattern [8], we attached each verb/dictionary entry with this properties, and with their conjugation form, for instance the verb (to write - َ َ َ - KaTaBa ) takes ‫ب‬ ‫ت‬ ‫ك‬ as roots letters ( in Arabic the root letters are separated َ َ َ faEala as pattern and ُ ُ ْ َ yafEalo as conjugation form . The conjugation form of the previous verb: َ َ َ KaTaBa is ُ ُ ْ َ yaKToBo according to the matching process between the pattern and the conjugational form ( َ َ َ faEala and ُ ُ ْ َ yafEalo ), There are 3 types of the 3-lettered verbs patterns in Arabic they are distinguished according to the second letter diacritic: ( fatha َ◌kasra ِ◌, dama ُ◌). َ َ َ – faEala is a pattern of the verb: (to write َ َ َ ) . َ ُ َ faEula is a pattern of the verb : (to grow – َ ُ َ ). َ َ ِ –faEila is a pattern of :(to play- َ ِ َ ). May takes three conjugation forms as table 1 shows. Table 1. All possible patterns with their conjugation forms in 3-letter Arabic verbs pattern Conjugation form Conjugation form Conjugation form faEala َ َ َ yafEalo ُ ُ ْ َ yafEilo ُ ِ ْ َ yafEolo ُ ُ ْ َ faEula َ ُ َ yafEalo ُ ُ ْ َ yafEilo ُ ِ ْ َ yafEolo ُ ُ ْ َ faEila َ ِ َ yafEalo ُ ُ ْ َ yafEilo ُ ِ ْ َ yafEolo ُ ُ ْ َ For instance: the verb (to write - kataba - ) is the result of matching its root with its pattern, by switching root letters with patterns one; without any changing on the pattern diacritics : ( X َ َ َ ), as their conjugation form is(yafEolo ُ ُ ْ َ ) it takes this model : ( ُ ُ ْ َ X ) to be conjugated.
  • 5. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 27 5. Arabic 3-Letterd Verbs Classification: The following classification covers all 3-lettered verbs in standard Arabic language, each representative verbs may takes the three previous patterns ( faEala َ َ َ / faEula َ ُ َ / faEila َ ِ َ ) , and each pattern may take 3 conjugation forms ( yafEolo ُ ُ ْ َ / yafEalo ُ َ ْ َ / yafEilo ُ ِ ْ َ ) ,for example: The path in the following classification as shown in figure 1 : [verb] [regular verbs] [* ]: represents the verbs that their last letter neither a (n )‫ن‬ nor a (t ‫)ت‬ character , and it takes all this representative verbs: ( َ َ َ -FaTaHa-to open) as : ( faEala َ َ َ - yafEalo ُ َ ْ َ ). (َ َ َ - KaTaBa-to write ) as : ( faEala َ َ َ - yafEolo ُ ُ ْ َ ). (َ َ َ - JaLaSa- to set) as : ( faEala َ َ َ - yafEilo ُ ِ ْ َ ). (َ ُ َ -KaBoRa- to grow) as : (faEula َ ُ َ - yafEolo ُ ُ ْ َ ). (َ ِ َ -AaLiMa- to know) as : (faEila َ ِ َ - yafEalo ُ َ ْ َ ). Figure 1 gives a part of the Arabic 3-lettered verbs classification; here are the definitions of some used abbreviation. REG verbs: regular verbs (afeal sahiha – َ ْ ِ َ! ‫ا‬ ‫$ل‬َ ْ َ%‫ا‬ ) contains three verbs kind [ Hamzated verbs – duplicated verbs – salim verbs ]. Hamzated verbs (mahouza – ‫'ز‬ُ(ْ)َ( ‫ا‬ ) verbs that contain the‫أ‬ hamza character Duplicated verbs (modaEafa – $+‫ﻣ‬ ) verbs that contain a duplicated character. Salim verbs: (salim $-) verbs that are neither hamzated nor duplicated. * : verbs that their last letter neither a (n )‫ن‬ nor a (t ‫)ت‬ character , n : last character is ‫ن‬ , t : last character is ‫.ت‬ 1st,2nd,3rd: the first and second and third character in the verb. IRRG : verbs that contains one of the Arabic long characters : alif . ‫ا‬ , yae ‫$ء‬ , waw ‫واو‬ or the hamza character. In this paper I am going to present only the regular verbs. Each representative verb considered as dictionary entry that will be assigned to a unique inflectional paradigm, only and only if the verb accept to be conjugated in standard Arabic, then all verbs that are conjugate in the same manner, will take the same inflectional paradigm, tow verbs are conjugated with the same manner or with the same inflectional paradigm if they have the same conjugation form. 6. Morphological Analysis The lexicon of a language is its vocabulary that includes its words and expressions, while morphological Analysis involves dividing a text into paragraphs, words and the sentences and its main role is to represent the Atomic Language Unit (ALUs) which is the smallest elements that make up the sentence, we are going to define these ALUs/3-lettered verbs as dictionary entries that represent the language vocabulary these entries are associated with their morphological properties which enrich it with linguistic information like: s means singular, p means plural , as basic properties while we add our specific verb morphological properties like:(Root/Pattern/Category Numb),that will be used in advanced analysis phases ,Figure 2 shows our constructed dictionary that calls at first our inflectional
  • 6. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 28 grammars G_Verbs ,as it is shown in figure 3, the dictionary contains the language vocabulary with their special morphological properties: V: verb, Tr : transitive verb , 1: verb category which determines verbs conjugation form( ُ ُ ْ َ - yafEolo), pattern (َ َ َ : faEala ). Figure 2. Our constructed dictionary The FLX paradigm represents all inflectional forms in active and passive voice for each dictionary entry, this FLXs are represented using our defined rules graphs as Figure 3 shows, we preferred to describe this inflectional paradigms using NooJ’s graphical rules interface that is equivalent to the textual rule editor, here is our inflectional paradigms that are assigned to dictionary, each dictionary entry has an inflectional paradigm, that generate all its inflectional and derivational forms. Figure 3: Verbs inflectional paradigm For instance the lexical entry (to write – kataba) has an inflectional paradigms: FLX=V_Kataba that matches any form in the set of {they write ‫'ا‬ KaTaBo, he writes KaTaBa, we write 1 NaKToBo, you wrote َ2ْ َ َ
  • 7. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 29 KaTaBTa,..} NooJ recognizes all this forms even if they are semi or non- vowelized. Each verb is conjugated in 12 deferent tenses as it is shown in Figure 4. This figure shows all possible verb inflections in both of active and passive voice. ACC_Kataba presents the past tens, we used the pre-defined NOOJ operators to define the inflections of this entry here is some special operators <Z>,<T> AND <M> that are defined only for Arabic language for more information please read NOOJ manual. Figure 4: all possible verb inflections Figure 5 shows how we easily define the past tens for the verb (to write) using NOOJ predefined operators <B> : will erase the last character, <Z> will add ‫ت‬ character while the last character of the given verb is not a ‫,ت‬ else it will add ‫ﱡ‬‫ت‬ , chadda appearance refers to a duplication for instance : the conjugated form of the verb ( to write ) for the first person is I Wrote ُ2ْ َ َ so the operator <Z> adds ‫ت‬ character while the last character of the given verb is not a ‫ت‬ else it will add ‫ﱡ‬‫ت‬ for all verbs that finished with ‫ت‬ like 2 4 will converted to ‫ﱡ‬2َ َ4 while its last character is ‫ت‬ . As its mentioned above this graph recognizes all appearances of the past conjugation set{ I wrote , you wrote ,...}, and annotate them with the morphological defined properties which appear under each node for instance { A+I+1+s } : the first singular person in active voice . A: active voice, I: past / 1: first person and s : singular .
  • 8. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 30 Figure 5: inflectional graph in active voice 7. ANNOTATIONS Once our dictionary is compiled, we can use it as resource to analyze any Arabic text, that contains only 3-letterad verbs or their inflectional forms, when parsing a text or a corpus, NooJ builds a Text Annotation Structure (TAS) in which each linguistic unit is represented by a corresponding annotation. An annotation stores the position in the text of the text unit to be Represented, its length, and linguistic information, (Silberztein 2007). NooJ adds annotations to the TAS automatically at various stages of the analyses phase, morphological and syntactic parsers provide tools to add, remove and export annotations to TAS in morphological level, and the morphological parser typically applies dictionaries to the text to produce annotations. When the parser recognizes any lemma or inflectional/derivational form in the text it produces the corresponding morphological annotations according to the morphological properties that we have assigned before, the figure 6 shows our text that will be morphologically analyzed on NOOJ platform, it contains different conjugated forms of the verb ( to write ) vowelized , semi or non vowelized . Figure 6: text to be analyzed
  • 9. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 31 The vowelized ALU like َ َ َ KaTaBa he writes will take an expected annotation as it refers to [the third person, masculine singular], but in case of the non vowelized ALU, Nooj Table Annotation Structure gives all possible annotations ,this annotations triggered when the parser matches any inflectional form of a dictionary lemma that we are defined in the inflectional paradigm with the text ALU’s, for instance the ALU 2 takes all possible annotations, as it is non- vowelized and the annotations of this inflectional forms {‫ُ'ا‬ َ َ , َ‫ُ'ن‬ ُ ْ َ ,2 , َ َ َ …..}Will be triggered, as they are defined in the inflectional paradigm V_Kataba . The first annotation form is dedicated for the fully diarized verb , the annotation shows that this entry is a verb V , and it is a transitive verb Tr, gives their root 2 6‫ك‬ , and their pattern is , NOOJ TAS gives all this properties as Representative verbs that are considered as dictionary entries, all verbs that have the same conjugation form will be assigned to the same inflectional paradigm. Once the dictionary is compiled we can easily use it as a linguistic recourse to analyze the sophisticated corpora. 8. CONCLUSION AND PRESPECTIVES Using the given 3-lettered linguistic classification, we constructed fully inflected verbal Arabic recourses of the previous verbs category, using NOOJ platform.
Lemma –based verbs are used as dictionary entries; an inflectional paradigm may be assigned to a dictionary entry that gives all possible conjugated forms of the entry. The dictionary contains representative verbs for each leaf of the given linguistic classification tree, each leaf has at most 9 representative verbs that are considered as dictionary entries, all verbs that have the same conjugation form will be assigned to the same inflectional paradigm. Once the dictionary is compiled we can easily use it as a linguistic recourse to analyze the sophisticated corpora. The perspective opened over this work is to extend our dictionary to the inflections of the rest verbs categories, nouns, adjectives also to add some morphological grammars in order to generate broken plural, ALU’s with affixes and other sophisticated morphological phenomena like ibdal and ielal and idgham . REFERENCES [1] D. Revuz ,( 1991)”Dictionnaires et lexiques”,Thesis, Paris 7 University. : methodes et algorithmes (Doctoral dissertation). [2] A. Chopra, A. Prashar and C. Sain, "Natural Language Processing", the International journal of technology enhancement and emerging engineering research, vol. 1, issue 4, ISSN 2 [3] N. Habash, (2010) Introduction to Arabic Natural Language Processing, Morgan & Claypool Publishers series. [4] K. Shaalan, A. Farghaly, (2009) ” Arabic Natural Language Processing: Challenges and Solutions”, ACM Transactions on Asian Language Information Processing (TALIP) 8.4,14. [5] S. Max,”Nooj Manual” , www.nooj-association.org. [6] M.El-Ghalayani,(2004) “Jamie aldorous alaarabia”,al moassassa al haditha lilkitab, Tripoli,Lebanon,. [7] M Mohamed, “Generation Morphological and Applications”, Specialty thesis of 3rd round, Mohammed V University in Rabat-Morocco, 1999. [8] A.Yousfi, (2010) “The morphological analysis of Arabic verbs by using the surface patterns”, IJCSI International Journal of Computer Science Issues,7(3(11)): p. 33-36.
  • 10. International Journal on Natural Language Computing (IJNLC) Vol. 6, No.1, February 2017 32 Authors M. Mourchid Doctorate Degree In Computer Science In 1999; Associate Professor At The Computer Science Department At The Faculty Of Sciences, Ibn Tofail University In Kenitra Morocco; On Going Research Interests: Natural Language Processing, Web Semantic, And Information Systems. I. Blanchete, Phd Student At Ibn Tofail University Department Of Computer Science, Laboratory Of Misc Kenitra , Morocco ;Graduated From Damascus University Faculty Of It Engineering 2010. A. Mouloudi was born in 1959 in Morocco. He received the B.S degreein applied mathematics from Mohamed V University in Morocco at 1982;Master in Computer Science from the University Mohammed V, at 1984; Ph.D in Computer Science from the same university, at 1988. He obtains Habilitation to direct academic research in Computer Science, from Ibn Tofail University in Morocco, at 2008. Currently, A. MOULOUDI is the Director of the laboratory MISC (Information Modelling and Communication System), at the sciences faculty, Ibn Tofail University, in Morocco.