Speech Recognition for Arabic

Arabic_NLP_ImamU2013

By:
Dhaifah AL-ammari
Wafa AL-shehri

Arabic linguistic varieties

Table 1: Some differences between Modern Standard Arabic (MSA) and
Egyptian Colloquial Arabic (ECA)

GLOSS          ECA          MSA
summer         se:f         saif
‘he speaks’    yitkallim    yatakallam
table          tarabeeza    Tawila

Writing System
Arabic is written in a cursive script, from right to left. The
alphabet consists of twenty-eight letters, twenty-five
of which represent consonants. The remaining three
letters represent the long vowels of Arabic and, where
applicable, the corresponding semivowels. Each letter can
appear in up to four different shapes, depending on whether it
occurs at the beginning, in the middle, or at the end of
a word, or in isolation. Letters are mostly connected,
and there is no capitalization.
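
The four positional shapes can be inspected through Unicode. The sketch below is only an illustration, not part of the original slides: it assumes the letter BEH as an example and uses the Arabic Presentation Forms-B code points U+FE8F to U+FE92, which encode its isolated, final, initial, and medial shapes. The helper name describe_forms is hypothetical.

```python
import unicodedata

# Assumption for illustration: the four presentation forms of the letter BEH
# (isolated, final, initial, medial), whose base letter is U+0628.
BEH_FORMS = ["\uFE8F", "\uFE90", "\uFE91", "\uFE92"]


def describe_forms(forms):
    """Return (Unicode name, base letter after NFKC normalization) for each form."""
    return [
        (unicodedata.name(ch), unicodedata.normalize("NFKC", ch))
        for ch in forms
    ]


for name, base in describe_forms(BEH_FORMS):
    # Every shape normalizes back to the same underlying letter.
    print(f"{name:40s} -> U+{ord(base):04X}")
```

In practice text is stored with the base letter only and the shape is chosen at rendering time, so a recognizer's output vocabulary does not need to distinguish the four shapes.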

Arabic diacritics

Morphology
Examples of MSA pronominal and possessive
affixes (separated from the stem by '-').

Error rates on conversational speech, by contrast, are
unacceptably high. The best error rate currently
reported, 55.5%, is higher than those
for comparable data in other languages.
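
The slide does not say how the error rate is measured. Assuming it is the usual word error rate (substitutions, deletions, and insertions divided by the number of reference words), a minimal sketch of the computation could look as follows. The function name and the English example sentences are illustrative only.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One substitution (speaks -> speak) and one deletion (at) over 5 words.
print(word_error_rate("he speaks arabic at home", "he speak arabic home"))
```

Under this definition, an error rate of 55.5% means that more than half of the reference words are affected by an error.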

Problems
• the mismatch between spoken and written representation
(missing pronunciation information in Arabic script);
• the lack of conversational training data;
• morphological complexity.
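
The first problem can be made concrete with a small sketch. The three vocalized words below are chosen here for illustration and do not come from the slides. Once the short-vowel diacritics are removed, as in ordinary written Arabic, all three collapse to the same letter string, so the script alone does not tell a recognizer which pronunciation is meant.

```python
import unicodedata

# Base letters and short-vowel diacritics, written as code points.
KAF, TEH, BEH = "\u0643", "\u062A", "\u0628"
FATHA, DAMMA, KASRA = "\u064E", "\u064F", "\u0650"


def strip_diacritics(word):
    """Drop Unicode combining marks, which include the Arabic short-vowel signs."""
    return "".join(ch for ch in word if not unicodedata.combining(ch))


# Illustrative examples: three different pronunciations over the letters k-t-b.
vocalized = {
    "kataba (he wrote)": KAF + FATHA + TEH + FATHA + BEH + FATHA,
    "kutiba (it was written)": KAF + DAMMA + TEH + KASRA + BEH + FATHA,
    "kutub (books)": KAF + DAMMA + TEH + DAMMA + BEH,
}

written = {strip_diacritics(form) for form in vocalized.values()}
print(len(vocalized), "pronunciations ->", len(written), "written form")
```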

Projects and contributions
IBM was the first to build a system that recognizes spoken
Arabic and converts it to text (on OS/2).
After that came two versions for Windows.
IBM then introduced ViaVoice Millennium. Speech
recognition was also used in systems that answer phone calls
and respond to users' voice commands.
The problem was the need for a very large vocabulary:
about 200,000 words to cover 97% of the language in
modern use.
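
The coverage figure refers to how many distinct words a recognizer must know so that a given share of running text is in its vocabulary. A minimal sketch of that calculation, assuming a tokenized corpus is available, is shown below. The function name and the toy corpus are hypothetical, and the toy numbers say nothing about real Arabic data.

```python
from collections import Counter


def vocabulary_for_coverage(tokens, target):
    """Smallest number of most-frequent word types that covers `target` of the tokens."""
    counts = Counter(tokens)
    total = sum(counts.values())
    covered = 0
    # Add word types from most to least frequent until the target is reached.
    for size, (_, count) in enumerate(counts.most_common(), start=1):
        covered += count
        if covered / total >= target:
            return size
    return len(counts)


# Toy transliterated corpus: 10 tokens, 4 distinct words.
toy_corpus = "wa wa wa wa fi fi fi min min kitaab".split()
print(vocabulary_for_coverage(toy_corpus, 0.90))  # 3 word types cover 9 of 10 tokens
print(vocabulary_for_coverage(toy_corpus, 0.97))  # all 4 word types are needed
```

Because Arabic attaches many affixes to each stem, the number of distinct word forms grows quickly, which is one reason the required vocabulary is so large.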

Recent Works
• Alghamdi et al. (2009) developed an Arabic broadcast news
transcription system.
• Elmahdy et al. (2009) used acoustic models trained on a large MSA
news broadcast speech corpus as multilingual or multi-accent
models to decode colloquial Arabic.
• Selouani and Alotaibi (2011) presented genetic algorithms to
adapt HMMs for non-native speech in a large-vocabulary speech
recognition system for MSA.
• Saon et al. (2010) described an Arabic broadcast transcription
system.
• Kuo et al. (2010) studied various syntactic and morphological
context features incorporated in a neural network language model
(NNLM) for Arabic speech recognition.
