1. 1
UNIVERSITI UTARA MALAYSIA
SCHOOL OF EDUCATION AND MODERN LANGUAGES
MASTER IN APPLIED LINGUISTICS
Sociolinguistics
SCLE5233
Prepared For:
Dr. Hariharan a / l N.Krishnasamy
Prepared By:
Moustafa Mohammad Shalabi
817137
Egyptian Arabic
2. 2
“Everyone knows that language is variable.” Sapir (1921: 147)
Introduction
The two articles that were chosen to response is one about “Dialectal Arabic” – “Standard
Arabic” and the second is about “a Comparison of Egyptian Arabic and Modern Standard Arabic
Sociolinguistic Variables through Formality Effects”
"The language is a morally human and non- voluptuous of delivery of thoughts and feelings and
wishes through a system of voluntary production of symbols". These symbols, in the first
occurrence, hear and is produced, including the so-called ‘organs of speech’ (Sapir 1921).
Social linguistics is a vivid revision of the properties of any and all features of civilization,
including traditional standards, expectations, and the approach language is used, and the
consequences of the use of language in the Community.
Ahlan Wa Sahlan! – Welcome
Egypt صرِم Miṣr صرَم Maṣr, The country officially Arab Republic of Egypt pancontinental
extends across continents in north-east and south-west of the corner of Asia, to the formation of a
bridge from the Sinai Peninsula, near only eurafrasian nation more than the land of Egypt, from
“1,010,408” square kilometers (390,000 square miles) is located in the River Nile.
3. 3
Egypt is located on the coast of the Mediterranean Sea ,surrounded by
the Aqaba Gulf to the east , to the north-east Gaza Strip in the middle of the
South came Sudan and in the west Libya. With more than 100 million
people, Egypt is a very populated state located in the north of Africa .Egypt
is the core of the Arab world. It is rated as number three large countries in
the continent of Africa.
Approximately half of the Egyptian people stay in city areas, and the
biggest dense inhabited axes of Great Cairo, the second biggest city
Alexandria added to the main towns along the River Nile.
Egypt has deep historical roots since the tenth century BC, one of the first
States. Some of the earliest developments it is considered the birth place of
urbanization. Venerable Egypt practiced the early development in
agriculture, inscription, Sculpture organized beliefs, urbanization and the
fundamental government through olden times.
Ancient Egypt known as the monuments icon, including the great Sphinx
and Giza pyramids, as also Memphis ruins in Thebes, Karnak and the Valley
of the Kings. Still fascinating public attention and archaeologists researches
all over the world.
Egypt rich cultural heritage is an important part of its domestic
uniqueness, through ages suffered and absorbed many outer effects, as well
as the Persian, Greek, Romanian, Arabic, European and Ottoman.
EGYPT
4. 4
Egyptian Arabic language, identified as Spoken Egyptian
Arabic, Massry, Masri, Ordinary Egyptian language and Egyptian Informal
language, is a variability of Arabic which has its origins to the Semitic family
language.
Approximately “52,500,000” people speak this language in Maser. It is
too spoken as a second dialectal in different states in the Arab countries, for
example “Saudi Arabia, Kuwait, Jordan, Libya, Iraq, Yemen and United
Arab Emirates”. It is estimated totally more than 54 million.
Before the Arabic invasion to Egypt in the 1600 AD, Egypt people
speak Coptic. Coptic grammatically thoroughly parallel to old Egyptian,
written in the Hieroglyphic scripts that came after the Arab conquest, there
was an extended era when both Arabic and Coptic were used to talk through
the Egyptian state.
Egypt Verbal Arabic, as distinctive as Standard Arabic with the diversity
spoken by the Arab occupations, advanced in the earliest capital of the
Islamic state, nowadays part of capital Cairo. With augmented inspiration of
Islamic religion and country arabization, people of Egypt shifted to Arabic
gradually substituting Coptic.
But, there is no proof that the Copts do not continue to be spoken, even
in 17th-century. In spite of the Copts do not know, today the first language
speakers, and still feel the language that aid such as the language used inside
the Egyptian Church.
EGYPT
5. 5
Egypt Pronounced language is the dialectal of a broader way as an exchange of information
and de facto Etymological in Egypt, even though the certified language in Maser is ‘Modern
Standard Arabic’ (MSA). Egyptians apply their language in literature, containing dramas, poems
in addition to novels, popularly used in mass media for instance in “marketing, comedy, a lot of
newspapers, also to transcribe common songs. MSA is used in mostly in further printed
newspapers side by side with TV programs”.
MSA is used in all official circumstances and for all ceremonial dedications. Egyptian Arabic
speech generally easy to be understood, in Arab countries, as Egypt was the prevailing force in the
movie industry added the broad casting. Egypt is considered as the first Arab-speaking Egyptian
movie production and films and widespread in all parts of the Arab world. The Egyptian film
production has over “3,000” movies ever since “1924” has been named the "Hollywood on the
Nile". Therefore, the speeches in Arabic Egyptian regularly choice spoken language Arabic
language education students a foreign language.
Egyptian language seems to preserve a noteworthy Coptic substrate Lexis, phonology and
syntax. Coptic was the latest phase of the original Egyptian language is spoken, until the mid-17th-
century as it lastly completely replaced by Egyptian Arabic. Nearly the structures originally shares
Arab Egyptian ancient Egyptian language include some of the earlier and verbal pairing some
uncertain and glottalized consonants, in addition to a large number of bilateral triliteral verbal
correspondence.
Two syntax characteristics specific to Egyptian language transferred from Coptic are:
The delay of demonstratives "this" and "that" are sited after the noun.
6. 6
Illustrations: " this man " /ir-rˤaːɡil da/ ده (الراجلlit. "the man this"; in Literary Arabic / haːðaː r-
raɡul /) الرجل هذاand "this girl" / il-bitt di /دي البت (lit. " the girl this " ; in Literary Arabic / haːðihi
l-bint / )البنت .هذه
WH words االستفهام كلمات (i.e. "when", "why", "who" exist in their "logical" locations in a
sentence rather than being proposed, or relocated to the front of the sentence, as in Literary
Arabic or English).
Examples:
راحمصرإمتا؟) ) /rˤaːħ masˤrI ʔimta / ("When (/ʔimta/) did he go to Egypt / Cairo?"
(lit. "He went to Egypt/Cairo when?")
( راحمصرليه؟ ) /rˤaːħ masˤrI leːh/ "Why (/leːh/) did he go to Egypt/Cairo? (lit. "He
went to Egypt/Cairo why?")
مين[اللى]راحمصر؟) )/miːn rˤaːħ masˤr/ or /miːn illi rˤaːħ masˤr/ ("Who (/miːn/) went
to Egypt/Cairo? (literally - same order)
In Literary Arabic the same sentences in the beginning of the sentence) (with all the question words
(wh-words) would be:
/mataː ðahaba ʔilaː misˤr/ متىذهبإلىمصر؟
/lima ðahaba ʔilaː misˤr/ َمِلذهبإلىمصر؟
/man ðahaba ʔilaː misˤr/ منذهبإلىمصر؟
Since Coptic languages such as North Africa, the interdental space. consonants missing it can
affect the appearance of these incidents in typical Arabic /θ/ð) /ðˤ) counterparts in the dental /T)
/d) stressed dentists /dˤ).
7. 7
The fast grow of Egyptian Arabic usage clarifies the importance of this lexicon which lies in
the fact that it is the first resource of its kind bridging multiple variants of Arabic with English.
Furthermore, it is a wide coverage lexical resource containing over “73,000” Egyptian entries.
Tharwa is publicly available. The authors believe it will have a significant impact on both
Theoretical Linguistics as well as Computational Linguistics research. Egyptian Arabic in most
cases with Arab modern classics usually only in writing and in the religious and/or official event.
Nevertheless in Egyptian Arabic, there is an extensive variety of change. Still, inside Egyptian
Arabic, there is a comprehensive array of distinction.
Al-Said Badawi identified three levels are different from the language of the Egyptian-based
primarily on the number of non-Arab verbally in lexis products : عامية ()المثقفين (`Āmmiyyat al-
Musaqqafīn) (Talk about intellectual slang or official Arabic), ( عاميةالمتنورين ) (`Āmmiyyat al-
Mutanawwirīn) (Tolerant or Literate Idiomatic), ( عاميةاالميين ) and(`Āmmiyyat al-
'Ummiyīn) (Uneducated Idiomatic).
Well-educated Conversational/Official Spoken Arabic is distinguishing of the cultured
modules and is the language of conversation of high-level topics, but however Egyptian Arabic;
it is featured by technical terms usage of imported words taken from foreign languages and MSA,
as well as closer attention to the pronunciation of certain letters (in particular qāf).
It is comparatively consistent and, seems to be closer to the typical, is agreed properly all
over the Arab world. On the contradictory end of the continuum, Illiterate Colloquial, communal
to countryside areas and to working-class districts in the cities, has an almost completely Arabic
terminology; loanwords are usually either very old borrowings (e.g. gambari
8. 8
جمبرى, [ɡæmˈbæɾi] "shrimp," from Italian gambari, "shrimp" (pl.)) or to use some technological
terms or words that find no or poor equals in Arabic as in: تلفزيون til (i)vizyōn/
til(i)fezyōn [tel(e)vezˈjoːn, tel(e)fezˈjoːn], television).
There are some contradiction between MSA and Egyptian Arabic will be discussed later on ,
but the varsity of usage in Egyptian Arabic through its long history of colonization, number of
immigrants and it is the route of world trade gives Egypt its distinctive features, this makes it
acquires a different unique style for instance the Educated Informal (`Āmmiyyat al-Mutanawwirīn)
)المتنورين (أميةis the language of those who have had some education and are comparatively wealthy;
the loanwords tend to denote to pop-cultural items, consumer products, and fashions. It is also
understood widely in the Arab world, as it is the lingua franca of Egyptian movie and TV.
Comparing MSA with other variabilities of Arabic language, Egyptian Arabic enjoys a formula
of the T-V distinction. For instance the singular form, انت inta للمذكر /inti للمؤنث is satisfactory in
many situations, nevertheless after speaking in clear societal seniors (e.g. persons older than
oneself, superiors at work(manager), definite administration administrators), the form حضرتك
ḥaḍritak/ḥaḍritik, significance "Your Grace" is favored (c.f. Spanish usted). The usage
of ḥaḍritak/ḥaḍritik is related to the classification of honorifics respect in daily Egyptian speech.
This shows the relation between the speaker and their occupation. Let’s have a look at the
following table:
9. 9
Samples honorifics in Egyptian language
Honorific IPA Derivation/meaning Usage and notes
1 Siyadtak [seˈjættæk,
seˈjædtæk]
Standard Arabic
siyādatuka,"Your
Lordship"
Social people is much higher than the
speakers, especially in the work. It also
applies to senior government officials, as
well as the President. An equal in practice
to " Your Excellency " or " Your Honor ".
2 sa`adtak [sæˈʕættæk,
sæˈʕædtæk]
Typical Arabic
sa`ādatuka, "Your
Happiness"
Representatives and other social position
much higher. The value of the state
environments "Your Excellency," or
"Your Honor” when addressing the judge.
3 ma`alīk [mæʕæˈliːk] Typical Arabic ma`ālīka,
"Your Highness"
Minister of the government. An equal in
practice to "Your Excellency" or "The
Right Honourable."
4 ḥagg/ḥagga [ˈħæɡ(ɡ)]/[ˈħæɡɡæ] Typical Arabic ḥāǧ Conventionally, each Muslim pilgrimage,
or Christians make the pilgrimage to
Jerusalem. In today's time uses the notion
of respect for older people.
5 Bāsha [ˈbæːʃæ] Ottoman Turkish pasha Familiar speech to men is equal to or less
than the social situation. The
corresponding of "Man" or "Dude" in the
English word 'non-official.
10. 10
This table illustrate the following aspects:
1. ""سيادتك When personnel with a faraway advanced societal higher than the talker, mostly in
place of your job. Similarly functional to extraordinary administration administrators, as
well as the Commander. Equal in real-world relations to "Your Excellency" or "The Most
Honourable."
2. ""سعادتك Government administrators and others with significantly high-class.
Corresponding in official circumstances "Your Excellency," or "Your Honor" when
speaking to the judge in the court.
3. " "معاليك Government ministers. Equal in everyday languages to "Your Excellency" or "The
Right Honourable."
4. "/ حاج"حاجة Usually, any Muslim when performing the Hajj, also the Christian who did
pilgrimage in Jerusalem. Now it is spoken generally as a form of admiration and respect
for all old people.
5. " باشا " Casual speech to a male of equal or lesser societal rank. Approximately equal to
"man" ""رجل or "dude" "المتأنق,مديني" in casual English speech.
We can notice some distinguishing lexis and utterances in Egyptian Arabic for instance:
[ezˈzæjjæk] ("How are you [m.]") إزيكللمذكر ,[ezˈzæjjek] ("How are you [f.]") إزيك
[,للمؤنثezzæjˈjoko] ("How are you [pl.]") إزيكوللجمع ,[ˈʔeː ˈdæ] ايهده ("What's all this?", "What's the
point", "What's this?" –a phrase denote frustration)
Ex.: إنتهبتقوللهمعلياكدهليه,ايهده؟ [entæ betʔolˈlohom ʕæˈlæjjæ ˈkedæ ˈleː ˈʔeː dæ] ("Why are you
telling them such things about me, what's all this?").خالص [xɑˈlɑːsˤ]: several meanings, though its
main meaning is "enough", often adverbial خالص, [xɑˈlɑːsˤ]: several meanings, though its main
11. 11
meaning is "enough", often adverbial, "Stop it!" Ex.: زهقت,خالص [zeˈheʔte xɑˈlɑːsˤ] ("I'm annoyed,
stop it! ") "It's over!", "finally, eventually"
Ex.: [ˈmɑmti kæːnet ʕajˈjæːnæ wˈmæːtet xɑˈlɑːsˤ]| مامتىكانتعيانهوماتت,خالص ("My mother was
ill and died finally." [or "...and it's over now"])."Ok, then!" Ex.: [xɑˈlɑːsˤ ʔæˈʃuːfæk ˈbokɾɑ] خالص,
أشوفكبكرا ("I'll see you tomorrow then"). [ˈxɑːlesˤ]خالص ("at all"). ماعندناشحاجهنقولهاخالص
[mæʕændeˈnæːʃ ˈħæːɡæ nˈʔolhæ ˈxɑːlesˤ] ("We have nothing at all to say"). [keˈfæːjæ]كفاية ("It's
enough!" or "That's enough"). [ˈjæʕni] يعنى ("that's to say" or "meaning" or "y'know")
As answer to [entæ ˈʕæːmel ˈ(ʔ)eː] إنتاعاملإيه؟ ("How do you do [m.]?") (as an answer: [meʃ
ˈʔædde ˈkedæ] مشأدكده "I am so so" or [ˈnosˤse ˈnosˤ] نصنص "half half" = مشتمام [meʃ tæˈmæːm]
"not perfect"قوي .)مش [jæʕni ˈʔeː] يعنىايه؟ ("What does that mean?")
[ˈemtæ hɑtˈxɑllɑsˤ ˈjæʕni] إمتاهتخلصيعنى؟ ("When are you finishing exactly, then?)بقى [ˈbæʔæ]
(particle of enforcement → "just" in imperative clauses and "well,...then?" in questions)
[ˈhæːto ˈbæʔæ] هاتهبقى ("Just give it to me!)" [ˈʕæmæl ˈ(ʔ)eː ˈbæʔæ] أعملايهبقى؟ or [ˈʕæmæl ˈ(ʔ)eː
ˈbæʔæ] ("Well, what did he do then?")
I moved through Egypt from south to north and from east to west I noticed that there are several
dialects Egyptian Arab usually divided into the language of the large collections.
Lower Egypt (Northern). Cairene Arabic (Cairo speech), the prestigious dialect spoken
in Cairo, is a Lower Egyptian dialect.
Upper Egypt (Southern, also called Sa`idi)صعيدي Common; Cairo to Sudan border. Enjoys
fairly little prestige, though it is extensively articulated.
Let us move closer to Egyptian Arabic structure and sound system, depending on Ethnologue,
talkers of Cairene(Cairo dialect) Arabic find it somehow difficult to understand speakers
of (Sa`idi) الصعيدية اللهجة upper Egypt, on the other hand speakers of(Sa`idi) apprehend talkers of
12. 12
Cairene Arabic. This type of non-reciprocal unambiguousness normally happens among (high
prestige) urban speakers and (low prestige) rural speakers.
One of Egyptian Spoken Arabic contradictions is it possesses more vowels than Modern
Standard Arabic. The following table demonstrate these vowels they are six long and four short
vowels, comparatively with three long and three short vowels in “MSA”. The table blew shows
the vowel phonemes and said almost as Egyptian Arab.
Front Back
Close i, i: u:
Close-mid e, e: o
Mid æ, æ:
Open ɑ, ɑ:
Talk about some other properties expanded Egyptian Arabic vowels:
“All long vowels become shortened in unstressed positions and before consonant
clusters”.
“Short /i/ and /u/ are often dropped when another vowel is added to a word,
e.g., kaatib, ‘having written’ (masculine) becomes katba, ‘having written’
(feminine)”.
“MSA diphthongs /ay/ and /aw/ have become long vowels in Egyptian Arabic”.
List of compatible almost as phonemes and spoke of Egyptian Arab similar to the MSA price, but
there are some differences. The following table shows the consistent almost as phonemes and
spoke of Egyptian Arab.
13. 13
Bilabial
Labio-
dental Alveolar
Palato-
alveolar Velar Uvular Pharyngeal Glottal
Plain Emphatic Plain Emphatic
Stops
voiceless (p) t tˤ xx k q ʔ
voiced (bˤ) d dˤ g
Fricatives
voiceless f s sˤ ʃ x ħ h
voiced (v) z zˤ (ʒ) ɣ ʕ
Affricates x x x
Nasals m (mˤ) n x x
Laterals l x
Tap or trill ɾ~r ɾˤ~rˤ x xx x
Approximants w j x
“(p), (v), (ʒ) occur mostly in loanwords”
“(bˤ), (mˤ) have marginal status”
“/tˤ, dˤ, sˤ, zˤ, rˤ, ɾˤ, ɾˤ/ are pharyngealized consonants that have no equivalents in English.
Not all speakers of Egyptian Arabic can pronounce these consonants”.
“/ʔ/ = sound between the vowels in uh-oh”.
“/ʃ/ = sh in sheep”
“/ʒ/ = s in vision”
“/q, χ, ʁ, ħ, ʕ/ have no equivalents in English”.
When we talk about the stress we will find that Egyptian Arabic has five syllable categories:
“CV, CV: , CVC, CV:C”, and
“CVCC (where C = consonant, V = short vowel, V: = long vowel”.
“CV:, CV:C”, and
“CVCC are long, or heavy, syllables. Going from right to left in a word, stress falls on
the first encountered”
“CV:, CV:C, or CVCC syllable”.
14. 14
When we talk about Grammar there are some differences between MSA and Egyptian Spoken
Arabic regarding Noun phrase, Verb phrase, Word order, Vocabulary and writing:
Unlike MSA, nouns in Egyptian Arabic are not inflected for case.
Nouns have both direct and indirect object clitics that follow direct object clitics and
precede the negative marker -s.
There is no dual number.
The plural is usually molded by adding a suffix to the end of a word. In some instances, the
plural is expressed by changing the vowel structure of a word. There are many patterns of
broken plurals that depend on the structure of the root.
Object pronouns are attached as clitics to the end of a nouns, verbs or prepositions,
e.g., béet ‘house’,béet-i ‘my house’.
, past, or future. Colloquial Egyptian Arabic has also developed a future tense marker ħa-,
e.g., hayiktib, ‘he will write‘.
The normal system of Egyptian language is Subject-Verb-Object. The
demonstratives this and that come after the nouns they modify. Wh- question words are not
moved to the front of the sentence as in MSA, e.g., ráh maṣr ʔ imta?إمته مصر راح Literally, ‘He
went when? ‘Like further varieties of Arabic, Egyptian language originates most of its lexis by
applying various vowel insertion patterns and templates to consonant roots.
Egyptian Arabs rarely written as Arab standard is usually written. But the Egyptian Arabic is the
use of the Arabic language in writing novels and plays and poems, as well as affected by moving
the texts, the texts of the spoken language, declarations, and in some newspapers. Mission
subsistence allowance in most print media and television. Usually written Egyptian Language
15. 15
There is an urgent need for an electronic dictionary application three-way, Tharwa, including
Dialectal Arabic, Modern Standard Arabic and English correspondents.
Already there is a clear gap in DA resources, especially those that bridge over variable and English.
Some calculation methods to dialectal processing, such as Abu Bakr and others. (2008) , Salloum
and Habash (2011), stared the gap by extending BAMA / SAMA (Buckwalter, 2004; .. Graf et al,
2009), to accept DA prefixes and suffixes.
Tharwa is primarily a lemma based resource, namely all the DA and MSA and ENG entries are
chosen conventionalized citation forms. It currently serves as a nucleus to be extended to other
Arabic variants. Let’s have a look at the Phonological Variation, Morphological Variation,
affixation, Case inflection, and Derivational differences, Lexical Variation, Identical, Semantic
Cognates and Homographs/Homophones:
As is the case for many languages and dialects and pronunciation of some phonemes of MSA
shifted in EGY. Some of the shifts are quite regular such as / q / ( of the letter ق ) becoming a
glottal stop /’/ except for few words borrowed from MSA or Classical Arabic, e.g., the word قلب
‘heart’ is pronounced /qalb/ in MSA but /’alb / in EGY.
Another example is the MSA /θ / phoneme (of the letter )ثwhich shifts in some words to /t/
and in others to /s/, e.g., ثالثة vlAvp ‘three’ is pronounced as MSA /θ ala:θ a/ or EGY/ tala:ta /, and
ثروة ‘wealth, fortune’ is pronounced as MSA /θ arwa / and EGY /sarwa/. The differences in the
phonology affect how people write, especially given the absence of an orthographic standard for
EGY. Habash et al. (2012b).
16. 16
EGY morphology shows a significant difference from the MSA in each of the inflectional
derivational morphology. It is noted that the derivational differences are more suitable for the
development of vocabulary resources such Tharwa. (Habash 0.2010).
EGY has some unique prefixes, suffixes and clitic morphemes that are not shared by MSA,
e.g., the EGY future tense prefixes + هـ ha+ 1
and ح Ha + are notably different from the MSA
future prefix + س sa + .
While MSA has a complex case system, EGY does not. Different inflected forms in MSA map to
the same form in EGY, e.g., MSA موظفون mwZfwn, ‘employees [nom.]’ and MSA موظفين
mwZfyn, ‘employees [acc. /gen.]’ map to EGY موظفين mwZfyn, ‘employees’.
MSA and EGY have comparability in word formation mechanisms, particularly because
derivational morphology depends on roots and patterns. However, EGY has some morphological
patterns which are not used in MSA such as, استخبي Aisotaxab~aY ‘to hide’. In addition EGY
utilizes non-MSA morphological patterns to represent the passive voice or the unaccusative form
of some verbs such as (e.g. اتكتب Aitokatab ‘to be written’), (e.g. وراتص AitoSaw~ ar ‘to have his
picture taken’), and (e.g. اكلات Ait∼Akil ‘to be eaten’).
The EGY lexicon comprises entries that differ as well as overlap with MSA:
EGY and MSA: words that are identical in all respects phonological, orthographic,
morphological, and semantic, e.g.نشيط na$iyT, ‘active’.
EGY and MSA that share the same meaning but with some regular phonological and/or
orthographic variation, e.g., EGY verb لعب liEib ‘to play’ corresponds to MSA verb لعب laEib.
EGY and MSA that have the same orthography and pronunciation but different
meanings, e.g. حاجه HAjap is ‘necessity’ in MSA, but could mean both ‘thing’ as well as
‘necessity’ in EGY.
17. 17
Words that belong uniquely to only one of the varieties EGY or MSA, e.g. مش mi$ ‘not’, بس
bas ‘only, enough’, and دغري dugoriy ‘straight-ahead’ are only used in EGY.
All the resources go through a process of standardization for the EGY entries to be rendered
CODA compliant.
A round of clean-up is performed on the entries correcting for spelling mistakes especially paying
particular attention for Hamza variants, and Alif-Maqsura ي Y versus Ya ي y cases. Providing the
full diacritization for each of the entries allows for variation in the POS tag associated with an
entry. The undiacritized forms pack several POS tags in addition to the semantic homonyms and
synonyms. For instance, the entry >أمرmr as rendered in the original BADAWI lexicon once
diacritized is split into three different entries: (a) adjective, >amar ‘more bitter’, (b) noun,>amor
‘order’, and (c) verb, >amar ‘to order’. As mentioned above, some of the resources provide POS
tag information.
CODA This the diacritized conventional orthography lemma form of the EGY entries (Habash
et al., 2012b).. The following statistics show the level of overlap between the EGY entry and their
MSA equivalent as defined before These statistics are calculated on the lemma entries only
amounting to entries.33.5% of the entries are Identical (meaning and diacritized form) to MSA
words, e.g.بخيل baxiyl ‘miserly, cheap’; 14.4% are semantic cognates, modulo some regular
homographic/homophonic variation with MSA, e.g.,
EGY رًساتك Aitokas~ar and MSA رًستك takas~ar ‘become 13.2% are homographs/homophones
but with additional senses not in MSA, e.g., EGY حاجه HAjap and MSA شيئ $ay’ ‘thing’; and,
38.9% are completely distinct EGY entries, e.g., EGY .بس bas and فقط faqaT ‘only’.
18. 18
This field lists alternative naturally occurring orthographic variants of the EGY CODA entries
as obtained from their original sources BADAWI, and ECAL. This field can have multiple variants
both diacritized and undiacritized, e.g., EGY entry كثير kiviyr ‘many, a lot’ (pronounced/kitiyr/)
has the variant كتير kitiyr.
We have two POS tag fields, one for EGY and one for MSA. The POS tags comprises 34 tags
including verb, noun, adjective, adverb, particle, demonstrative, proper noun, and vbn (deverbal
nouns).
Each entry is marked as being a lemma or surface inflected form. Every surface form entry in
Tharwa is linked to its lemma entry.
The semantic features number, gender and rationality. For more information, see (Habash,
2010; Alkuhlani and Habash, 2011).
This is the root consonant radicals of the word before any derivation or inflection takes place.
The root information is provided for both EGY and MSA entries, e.g. root: ktb ‘writing related’,
has 39 derived lemmas in Tharwa.
This is the templatic structure of the word. We provide both the morphological (ordeep)
patterns and morpho-phonological (or surface) patterns for words, for both EGY lexical entries as
well as the MSA equivalents, e.g. EGY: اكتتب Aikotatab ‘subscribe’.
This is the corresponding MSA word of the EGY entry. The MSA words are fully diacritized
and are in the same morphological form of the EGY entry, e.g., EGY منكتب minokitib and MSA
مكتوب makotuwb ‘written’. ENG Equivalent This is the equivalent translation into English.
19. 19
Conclusion
The current Egyptians Arabic. Talked by approximately most Egyptian vernacular often known
locally. Originally came from the Arabic language brought to Egypt in the seventh century by the
Islamic invasion, influenced by the original Coptic Turkish/Ottoman Turkish, Italian, French,
While it is primarily a spoken language and in written form in novels, plays and poems dialect
(literature), as well as in some newspapers advertising and some texts of popular songs. In most
printed media and the Arab language news TV uses.
Literary Arabic standard language based on the language of the Quran, a "traditional". The
Egyptian vernacular almost everywhere Arabic alphabet for local consumption not written,
although it is usually transcribed into Latin letters or in the international phonetic alphabet in
linguistics to teaching text and textbooks, English-speaking students designed. In addition, it is
written form when chatting on line or sending SMSs. Currently there is a demand for such new
modern advanced dictionaries.
20. 20
References
Abo Bakr, H., Shaalan, K., and Ziedan, I. (2008). A Hybrid Approach for Converting
Written Egyptian Colloquial Dialect into Diacritized Arabic. In The 6th International
Conference on Informatics and Systems, INFOS2008.Cairo University.
Alkuhlani, S. and Habash, N. (2011). A Corpus for Modeling Morpho-Syntactic
Agreement in Arabic: Gender, Number and Rationality. In Proceedings of the 49th Annual
Meeting of the Association for Computational Linguistics (ACL’11), Portland, Oregon,
USA.
Badawi, E.-S. and Hinds, M. (1986). A Dictionary of Egyptian Arabic. Librairie du Liban.
Brustad, K. (2000). The Syntax of Spoken Arabic: A Comparative Study of Moroccan,
Egyptian, Syrian, and Kuwaiti Dialects. Georgetown University Press.
Buckwalter, T. (2004). Buckwalter Arabic Morphological Analyzer Version 2.0. LDC
catalog numberLDC2004L02, ISBN 1-58563-324-0.Elfardy, H. and Diab, M. (2012).
Elfardy, H. and Diab, M. (2012).Token level identification of linguistic code switching. In
Proceedings of the 24th International Conference on Computational Linguistics
(COLING), IIT Mumbai, India.
Elfardy, H. and Diab, M. (2013). Sentence level dialect identification in Arabic. In
Proceedings of ACL, Sofia, Bulgaria.
Ferguson, C. F. (1959). Diglossia. Word, 15(2):325–340. Gadalla, H., Kilany, H., Arram,
H., Yacoub, A., El-Habashi, A., Shalaby, A., Karins, K., Rowson, E., Mac- Intyre, R.,
Kingsbury, P., Graff, D., and McLemore, C. (1997). CALLHOME Egyptian Arabic
Transcripts. In Linguistic Data Consortium, Philadelphia.
21. 21
Graff, D., Maamouri, M., Bouziri, B., Krouna, S., Kulick,S., and Buckwalter, T. (2009).
Standard Arabic Morphological Analyzer (SAMA) Version 3.1. Linguistic Data
Consortium LDC2009E73.
Habash, N., Soudi, A., and Buckwalter, T. (2007). On Arabic transliteration. In Soudi, A.,
Neumann, G., and van den Bosch, A., editors, Arabic Computational Morphology, volume
38 of Text, Speech and Language Technology, chapter 2, pages 15–22. Springer.
Habash, N., Eskander, R., and Hawwari, A. (2012a).A Morphological Analyzer for
Egyptian Arabic. In NAACL-HLT 2012 Workshop on Computational Morphology and
Phonology (SIGMORPHON2012), pages1–9.
Habash, N., Diab, M., and Rabmow, O. (2012b). Conventional Orthography for Dialectal
Arabic. In Proceedings of the Language Resources and Evaluation Conference (LREC),
Istanbul.
Habash, N. (2010). Introduction to Arabic Natural Language Processing. Morgan &
Claypool Publishers. Kilany, H., Gadalla, H., Arram, H., Yacoub, A., El-Habashi, A., and
McLemore, C. (2002). Egyptian Colloquial Arabic Lexicon. LDC catalog
numberLDC99L22.
Maamouri, M., Bies, A., Buckwalter, T., Diab, M., Habash, N., Rambow, O., and Tabessi,
D. (2006). Developing and using a pilot dialectal Arabic treebank. In LREC,Genoa, Italy.
Och, F. J. and Ney, H. (2000). Improved statistical alignment models. In ACL ’00:
Proceedings of the 38th Annual Meeting on Association for Computational Linguistics,
pages 440–447, Morristown, NJ, USA. Association for Computational Linguistics.
22. 22
Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J. (2002).BLEU: a Method for Automatic
Evaluation of Machine Translation. In Proceedings of the 40th Annual Meeting of the
Association for Computational Linguistics, pages311–318, Philadelphia, PA.
Pasha, A., Al-Badrashiny, M., Kholy, A. E., Eskander, R., Diab, M., Habash, N., Pooleery,
M., Rambow, O., and Roth, R. (2014). Madamira: A fast, comprehensive tool for
morphological analysis and disambiguation of Arabic .In In Proceedings of LREC,
Reykjavik, Iceland.
Saleh, I. and Habash, N. (2009). Automatic extraction oflemma-based bilingual
dictionaries for morphologically rich languages. In Third Workshop on Computational
Approaches to Arabic Script-based Languages at the MT Summit XII, Ottawa, Canada.
Salloum, W. and Habash, N. (2011). Dialectal to Standard Arabic Paraphrasing to Improve
Arabic-English Statistical Machine Translation. In Proceedings of the First Workshop on
Algorithms and Resources for Modelling of Dialects and Language Varieties, pages 10–
21, Edinburgh, Scotland.
Salloum, W. and Habash, N. (2013). Dialectal Arabic to English Machine Translation:
Pivoting through Modern Standard Arabic. In The 2013 Conference of the North American
Chapter of the Association for Computational Linguistics: Human Language Technologies
(NAACLHLT2013).
Schmid, H. (1995). Treetagger, a language independent part-of-speech tagger. Technical
report, Institut f¨urMaschinelle Sprachverarbeitung, Universit¨at Stuttgart.Spiro, S. (1895).
An Arabic-English Vocabulary of the Colloquial Arabic of Egypt. Al-Mokattam printing
office.
23. 23
Spiro, S. (1987). Arabic-English Dictionary of the ColloquialArabic of Egypt. Librairie Du
Liban.
Abdel-Massih, Ernest T.; A. Fathy Bahig (1978). Comprehensive Study of Egyptian
Arabic: Conversation Texts, Folk Literature, Cultural Ethnological and Socio Linguistic
Notes. Ann Arbor: University of Michigan. ISBN 0-932098-11-8.
Peter, Behnstedt; Manfred Woidich (1985). Die ägyptisch-arabischen Dialekte, vols. I, II.
Wiesbaden: L. Reichert.
Gary, Judith Olmsted, & Saad Gamal-Eldin. 1982. Cairene Egyptian Colloquial Arabic.
Lingua Descriptive Studies 6. Amsterdam: North Holland.
Haeri, Niloofar (2003). Sacred Language, Ordinary People: Dilemmas of Culture and
Politics in Egypt. Palgrave Macmillan. ISBN 0-312-23897-5.
Harrell, Richard S. 1957. The Phonology of Colloquial Egyptian Arabic. American Council
of Learned Societies Program in Oriental Languages Publications Series B, Aids, Number
9. New York: American Council of Learned Societies.
Hinds, Martin; El-Said Badawi (1987). A Dictionary of Egyptian Arabic. French &
European Pubns. ISBN 0-8288-0434-6.
Mitchell, T.F. 1956. An Introduction to Egyptian Colloquial Arabic. Oxford: Oxford
University Press.
Mitchell, T.F. 1962. Colloquial Arabic: the Living Language of Egypt. London:
The English universities Press.
Presse, Karl G.; Katrine Blanford; Elisabeth A. Moestrup; Iman El-Shoubary (2000). 5
Egyptian-Arabic One Act Plays: A First Reader (Bilingual edition ed.). Museum
Tusculanum. ISBN 87-7289-612-4.
24. 24
Youssef, Ahmad Abdel-Hamid (2003). From Pharaoh's Lips: Ancient Egyptian Language
in the Arabic of Today. American University in Cairo Press. ISBN 977-424-708-6.
Tomiche, Nada. 1964. Le parler arabe du Caire. Paris: Mouton.
Versteegh, Kees (2001). The Arabic Language. Edinburgh: Edinburgh University
Press. ISBN 0-7486-1436-2.
Watson, Janet (2002), The Phonology and Morphology of Arabic, New York: Oxford
University Press.
Arabist’s guide to Egyptian colloquial Arabic.pdf. (n.d.).
Abdel-massih, E. T., Abdel-malek, Z. N., Mccarus, E. N., & Arbor, A. (2011). Proverbs
and Metaphoric Expressions.
Egyptian_Colloquial_Arabic_1000199538.pdf. (n.d.).
Essentials, I., Ii, S., Nouns, G., Adjectives, P., Adverbs, V., Conjunctions, P., … Which,
Q. (n.d.). PRO version.
Abdel-massih, E. T., & Arbor, A. (2011). Egyptian Arabic.
https://youtu.be/YAD139oTu10?t=66
https://youtu.be/lWtG-GQ-cfg