SlideShare a Scribd company logo
1 of 8
Download to read offline
International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023
DOI:10.5121/ijaia.2023.14602 13
ENGLISH TO ARABIC MACHINE TRANSLATION OF
MATHEMATICAL DOCUMENTS
Mustapha Eddahibi and Mohammed Mensouri
IMIS Laboratory, Ibnou Zohr University – Agadir Morocco
ABSTRACT
This paper is about the development of a machine translation system tailored specifically for LATEX
mathematical documents. The system focuses on translating English LATEX mathematical documents into
Arabic LATEX, catering to the growing demand for multilingual accessibility in scientific and
mathematical literature. With the vast proliferation of LATEX mathematical documents the need for an
efficient and accurate translation system has become increasingly essential. This paper addresses the
necessity for a robust translation tool that enables seamless communication and comprehension of complex
mathematical content across language barriers. The proposed system leverages a Transformer model as
the core of the translation system, ensuring enhanced accuracy and fluency in the translated Arabic
LATEX documents. Furthermore, the integration of RyDArab, an Arabic mathematical TEX extension,
along with a rule-based translator for Arabic mathematical expressions, contributes to the precise
rendering of complex mathematical symbols and equations in the translated output. The paper discusses
the architecture, methodology, of the developed system, highlighting its efficacy in bridging the language
gap in the domain of mathematical documentation.
KEYWORDS
Machine Translation System, LATEX mathematical Documents, Arabic mathematical notation, Text-to-
Text Transfer Transformer, RyDArab, Language Processing.
1. INTRODUCTION
Machine Translation (MT), is the conversion of text from one language to another using
computers. This process involves the utilization of programming languages and software to
accomplish the translation tasks [1]. Various types of content, ranging from commercial and
business documents to scientific papers, instruction manuals, textbooks, and online materials,
require translation services. Presently, machine translation has found widespread application in
the language services industry and has become a common tool for professional translators.
However, machine translation has limitations and is not suitable for every type of textual content.
The challenges in Machine Translation stem from the disparities in grammatical structures
existing between the source and target languages. Furthermore, the complexity of translation can
be influenced by the specific category of the text. While the translation of everyday spoken
language is relatively straightforward, rendering poetry, philosophical treatises, scientific or
technical documents poses more intricate challenges [2]. The seamless translation of complex
mathematical documents from English to Arabic stands as a big challenge in the machine
translation field. Significant progress has been achieved in this domain, the intricacies of
mathematical language, particularly in the context of LATEX [3] syntax, present a critical gap
that demands attention.
This study delves into uncharted territory, aiming to bridge the gap in the translation of
mathematical content into the Arabic language, a domain remains relatively unexplored in the
International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023
14
current body of literature. Leveraging cutting-edge methodologies such as LATEX parsing,
mathematical expression tokenization, and the utilization of a Transformer for natural language
translation, alongside a rule-based translator for managing mathematical expressions, our
research aspires to contribute significantly to the burgeoning field of automated translation.
By building upon recent advancements in pre-trained language models, we seek to unravel the
complex interplay between mathematical expressions and linguistic structures, thereby unlocking
the potential for comprehensive and accurate translation.
Despite the acknowledging the inherent intricacies linked with handling symbolic mathematical
content, the absence of a suitable corpus remains a notable limitation. The intricate nature of
mathematical language, coupled with the unique challenges posed by Arabic notation,
necessitates a meticulous and nuanced approach to the translation process.
This research holds substantial promise for the educational landscape, benefitting researchers,
educators, and students alike. By facilitating a deeper comprehension of intricate mathematical
documents, our work endeavours to streamline the creation, modification, and dissemination of
these materials in both English and Arabic contexts. The digitized representation of such
documents not only fosters their creation and archiving but also enables their seamless exchange
via various communication networks. However, realizing this objective necessitates overcoming
several challenges, particularly in dealing with the distinctive nature of mathematical expressions
in Arabic notation.
2. RELATED WORK
The prominence of English as the universal language of science emerged a mere four centuries
ago, marking a significant turning point in global communication. Its unparalleled expansion
across the world post-World War II solidified its dominance, surpassing the influence of any
other language in history [4]. This trend has underscored the democratization of scientific
knowledge, granting access to a wider audience beyond traditional linguistic barriers.
Nonetheless, it emphasizes the critical importance of promoting education in one's native
language, highlighting the necessity for striking a balance between the advantages of a common
scientific language and the preservation of cultural and linguistic diversity for comprehensive
learning.
The recognition of the significance of automated translation emerged many years back.
Numerous studies have focused on machine translation within the scientific realm. Tehseen et al.
introduced an English-to-Urdu scientific text translator that employs term tagging and domain-
specific translation [5]. This translator was specifically designed for the translation of computer
science documents and was assessed using a self-generated corpus within the computer science
domain. The translator algorithm didn’t show mathematical expressions translation or used file
format.
Because of the widespread adoption of LATEX stems owing to its capacity to manage complex
mathematical expressions, equations, and symbols with unparalleled precision and elegance, the
translation of documents in such format is a great opportunity. Ohri et al. developed a machine
translation system named Polymath for converting LATEX documents with mathematical text
[6]. This system is capable of translating English LATEX content to French LATEX. It operates
by transforming the main content of an input LATEX document into English sentences with
mathematical tokens, followed by the application of a pre-trained Transformer-based translator
model.
International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023
15
The process of translating LATEX documents from English to French is relatively
straightforward compared to the challenges encountered when translating from English to Arabic.
This discrepancy arises from the fundamental differences between the Arabic and English
mathematical notations. The Arabic mathematical system, distinct in its structure and
representation, introduces complexities that are not encountered in the French translation process.
The nuances of Arabic mathematical symbols and expressions often require specific handling
techniques and an in-depth understanding of the language's unique grammatical rules and
conventions. Despite the potential complexities, the English to French LATEX translation
process proves to be more seamless due to the greater similarity in notation and linguistic
constructs between the two languages.
3. ARABIC MATHEMATICAL NOTATION
In Arabic scientific documents there are two models of mathematical expressions: mixed and
Pure Arabic notations. Mixed mathematical notation used in some countries like Morocco, is the
outcome of a literal word-to-word translation of mathematical French books. The syntactic
models of several formulations have been transported to the new language. The symbolic writing
was imported just as it was, without any changes.
Figure 1. Mixed Mathematical Notation
In the pure Arabic presentation, mathematical expressions spread from right to left and use
Arabic symbols from its alphabet. These symbols are used to note unknown variables and
functions names. As for common functions that are replaced by their abbreviated names. In this
notation we can distinguish two types. The first uses some mirrored latin symbols like the sum
. The second type uses some Arabic alphabet-based symbols like for the sum.
Figure 2. Pure Arabic Notation with mirrored symbols
International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023
16
Figure 3. Pure Arabic Notation with Arabic alphabet-based symbols
4. TRANSLATION OF THE LATEX DOCUMENT
The process of translating the LATEX document comprises two distinct phases, each contributing
to the accurate conversion of its contents. Initially, the document undergoes a comprehensive
parsing phase, allowing for the precise identification and isolation of its constituent elements.
Subsequently, the second phase entails the systematic extraction of mathematical expressions,
subsequently replaced by an indexed list of tokens (Exprr1, Exprr2, etc.). These identified
expressions, denoted as Exprri, serve as novel entities during the natural language translation
process, functioning as placeholders for the mathematical components. Consequently, the
mathematical expressions are stored within a list, with the corresponding tokens acting as their
respective indices. The subsequent translation process relies on a rule-based translation function
tailored to the unique characteristics of mathematical notations. This function operates using the
specific mathematical expression, its associated token, and the corresponding notation type,
ensuring the seamless and accurate transformation of the mathematical content within the
document.
4.1. English to Arabic Natural Language Translator
The natural language translation part of the system is based on transformer architecture [7].
Opting for transformers over RNNs is supported by their capacity to process input data in
parallel. They are not susceptible to the vanishing gradient issue. Transformers proficiently grasp
intricate connections among various components of the input and utilize attention mechanisms to
focus on relevant portions of the input sequence [8].
Before the translation a pre-processing phase is done. The content of parsed LATEX blocs is
segmented to sentences. Certain formatting commands, such as textbf and textit, which alter the
style of individual or multiple words within a sentence, are eliminated to streamline the text. This
process aims to enhance the translation, particularly considering that Arabic typographic
conventions do not employ bold, italic, or roman styles, among others.
The transformer undergoes training and validation using datasets comprising English words,
sentences, and paragraphs along with their corresponding Arabic translations. Its architecture
consists of two primary components: an encoder and a decoder for sequence-to-sequence
translation, alongside an attention mechanism. Figure 4 provides an overview of the system.
International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023
17
Figure 4. Overview of the developed system
4.2. Translation of Mathematical Expression to Arabic Notation
The authors note that the existing pool of translations from English to Arabic for mathematical
expressions lacks sufficiency for the utilization of neural network-based translation methods.
Consequently, the current optimal approach is rule-based translation, despite its non-exhaustive
nature [9]. The translation process involves the following steps:
 Utilization of TEX[10] packages, including RyDArab[11] for typesetting Arabic
mathematical expressions and curext [12] for handling stretched mathematical symbols in
the resulting LATEX file.
 Introduction of the command arabmath to each mathematical expression to ensure proper
alignment.
 Transliteration of alphabetical symbols into their Arabic equivalents.
 Implementation of the commands warabnum for standard Western Arabic digits
or earabnum for Eastern Arabic digits (Hindi digits)
.
 Use of alp without dots for alphabetic symbols without dots ( ) and alpwithdots for
those with dots ( ).
 Application of fun with dots and fun without dots commands for elementary functions
with and without dots, respectively ( , ).
When these options are incorporated into the preamble of the input file, the commands are
universally applied to all mathematical expressions. To address document encoding concerns, the
output file must adhere to either the ISO-8859-6 encoding system for ArabTEX [13] or UTF-8
for Omega [14].
 Furthermore, specific transformations are applied to mathematical symbols, such as replacing
sum with lsum ( ), csum ( ) for its Arabic literal linear and curved equivalents,
and ssum ( ) for the mirrored equivalent. Similar transformations are implemented for the
product command.
International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023
18
5. RESULTS, LIMITATIONS AND PERSPECTIVES
For system evaluation, we conducted a meticulous assessment using targeted documents that
explore mathematical concepts similar to the ones discovered in textbooks used in primary and
secondary education courses. The results for translating mathematical expressions were generally
satisfactory, showcasing the system's competency. However, some discrepancies were observed,
which can be attributed to the inherent limitations of the non-exhaustive rule-based method.
Notably, the machine translation performance, utilizing the transformer model, exhibited
markedly low BLUE scores. The BLEU (Bilingual Evaluation Understudy) score [15] is a
frequently employed measure for assessing the quality of machine-generated translations by
comparing them to human-generated references. In this context, the lower BLUE scores indicate
a misalignment between the machine-generated translations and the expected human references.
This suboptimal performance can be traced back to two key factors: the relatively modest size of
the dataset used for training and the substitution of mathematical expressions with "exprri" tokens
in the natural language input.
The limited dataset size may have hindered the model's exposure to diverse linguistic patterns,
leading to challenges in accurately capturing the nuances of natural language. Additionally, the
abstraction introduced by the "exprri" tokens may have contributed to a loss of contextual
information during the translation process, further impacting the system's ability to generate
linguistically fluent and contextually accurate translations. Addressing these issues by
augmenting the dataset and refining the tokenization approach could potentially enhance the
system's overall translation performance.
In future work, we aim to advance towards a comprehensive machine translation system capable
of translating entire LATEX documents, including intricate LATEX commands, into Arabic. This
expansion is driven by the integration of the DadTEX [16] extension, designed for completely
Arabic TEX environments. This ambitious undertaking holds potential to enhance the
accessibility of scientific literature by providing seamless translation of diverse document
components. The incorporation of DadTEX signifies a crucial step towards facilitating cross-
linguistic communication in scientific and mathematical domains. The future perspectives
emphasize a commitment to overcoming language barriers and promoting inclusivity in scientific
discourse.
6. CONCLUSIONS
In conclusion, this paper introduces a specialized machine translation system for LATEX
mathematical documents, concentrating on translating English LATEX mathematical content into
Arabic LATEX. Driven by the escalating demand for multilingual accessibility in scientific
literature, especially in the realm of complex mathematical expressions documented in LATEX,
the proposed system employs a text-to-text transfer transformer to enhance accuracy and fluency
in the translated Arabic LATEX documents.
The integration of RyDArab, an Arabic mathematical TEX extension, and a rule-based translator
for Arabic mathematical expressions ensures precise rendering of intricate symbols and equations
in the translated output. The discussed architecture, methodology, and performance evaluation
underscore the system's effectiveness in overcoming language barriers in mathematical
documentation.
International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023
19
Acknowledging the complexities of handling symbolic mathematical content, the study
underscores the necessity for an adequate corpus to improve translation accuracy. Despite
challenges posed by the unique nature of Arabic mathematical expressions, this research
promises significant benefits for education, aiding researchers, educators, and students in
comprehending intricate mathematical documents in both English and Arabic.
Addressing the unexplored realm of translating mathematical content into Arabic, this paper
recognizes and navigates the specific challenges posed by Arabic mathematical notations. It also
contributes valuable insights into related work, emphasizing the importance of automated
translation in scientific communication, particularly focusing on LATEX documents. In
summary, this study seeks to address a crucial void in automated translation, particularly in
mathematical documents, laying the groundwork for further advancements in multilingual
accessibility within the scientific and mathematical community.
REFERENCES
[1] Leslie Lamport. LATEX: A Document Preparation System. Addison-Wesley, 1986.
[2] Azzeddine Lazrek, RyDArab—Typesetting Arabic mathematical expressions , TUGboat , Volume
25, Number 2, pp. 141-149, 2004 ( http://www.tug.org/TUGboat/Articles/tb25-2/tb81lazrek.pdf )
[3] Donald E. Knuth. The TEXbook. Volume A, Addison-Wesley, 1984.
[4] Haifeng Wang, Hua Wu, Zhongjun He, Liang Huang, Kenneth Ward Church, Progress in Machine
Translation, Engineering, Volume 18, 2022, Pages 143-153, ISSN 2095-8099,
https://doi.org/10.1016/j.eng.2021.03.023.
[5] A. A. Zhivotova, V. D. Berdonosov and E. V. Redkolis, "Improving the Quality of Scientific
Articles Machine Translation While Writing Original Text," 2020 International Multi-Conference
on Industrial Engineering and Modern Technologies (FarEastCon), Vladivostok, Russia, 2020, pp.
1-4, doi: 10.1109/FarEastCon50210.2020.9271442.
[6] Emma Steigerwald, Valeria Ramírez-Castañeda, Débora Y C Brandt, András Báldi, Julie Teresa
Shapiro, Lynne Bowker, Rebecca D Tarvin,"Overcoming Language Barriers in Academia: Machine
Translation Tools and a Vision for a Multilingual Future", BioScience, Vo.72 N° 10, pp 988-
998,2022 doi 10.1093/biosci/biac062
[7] Irsha Tehseen, Ghulam Rasool Tahir, Khadija Shakeel & Mubbashir Ali (2018). Corpus Based
Machine Translation for Scientific Text. In: Iliadis, L., Maglogiannis, I., Plagianakos, V. (eds)
Artificial Intelligence Applications and Innovations. AIAI 2018. IFIP Advances in Information and
Communication Technology, vol 519. Springer, Cham. https://doi.org/10.1007/978-3-319-92007-
8_17
[8] ADITYA OHRI, TANYA SCHMAH, "Machine Translation of Mathematical Text," in IEEE
Access, vol. 9, pp. 38078-38086, 2021, doi: 10.1109/ACCESS.2021.3063715.
[9] Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez,
Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is All You Need. In Proceedings of the 31st
International Conference on Neural Information Processing Systems (Long Beach, California,
USA). 6000–6010.
[10] Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi
Zhou, Wei Li, Peter J. Liu: Exploring the Limits of Transfer Learning with a Unified Text-to-Text
Transformer. J. Mach. Learn. Res. 21: 140:1-140:67 (2020)
[11] Felix Petersen, Moritz Schubotz, Andre Greiner-Petter, and Bela Gipp. 2023. Neural Machine
Translation for Mathematical Formulae. In Proceedings of the 61st Annual Meeting of the
Association for Computational Linguistics (Volume 1: Long Papers), pages 11534–11550, Toronto,
Canada. Association for Computational Linguistics
[12] Azzeddine Lazrek, CurExt, Typesetting variable-sized curved symbols , TUGboat, Volume 24,
number 3, pp. 323–327, 2003 EuroTeX2003: 14th European TEX Conference, Back to Typography,
Brest, France, EuroTeX2003 Preprints pp. 67-71, 2003
[13] Klaus Lagally. ArabTEX - Typesetting Arabic with Vowels and Ligatures. Proceedings of the 7th
European TEX Conference, Prague 1992.
International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023
20
[14] Yannis Haralambous and John Plaice. Multilingual Typesetting with Ω a Case Study: Arabic.
Proceedings of the International Symposium on Multilingual Information Processing, pp. 137–154,
Tsukuba, 1997.
[15] Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for
automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting on
Association for Computational Linguistics (ACL '02). Association for Computational Linguistics,
USA, 311–318. https://doi.org/10.3115/1073083.1073135
[16] Mustapha Eddahibi, Azzeddine Lazrek and Khalid Sami, DadTeX—A full Arabic interface,
TUGboat, Volume 27, Number 2, pp. 154-158, 2006
AUTHORS
Prof. Mustapha Eddahibi received his PhD from Unversity Cadi Ayyad Marrakech in
2007. He is currently computer science teacher researcher in University Ibn Zohr. He is
a former head of the decisional expert systems research team. His research interests are
in the area of intelligent computing, information engineering, digital Information
Encoding and Processing.
Prof. Mohammed Mensouri received the M.S degree in Networks and
Telecommunication in 2008, from Faculty of Sciences and Technology, Cadi Ayyad
University, Marrakech, Morocco. In 2015, He received Ph.D of Computer Science in
Faculty of Sciences, Chouaib Doukkali University, El Jadida, Morocco. He is professor
at Ibn Zohr University, Agadir, Moroc. His research interest information theory and
channel coding , especially error correction codes.

More Related Content

Similar to English to Arabic Machine Translation of Mathematical Documents

Design and Implementation of a Language Assistant for English – Arabic Texts
Design and Implementation of a Language Assistant for English – Arabic TextsDesign and Implementation of a Language Assistant for English – Arabic Texts
Design and Implementation of a Language Assistant for English – Arabic TextsIJCSIS Research Publications
 
Seminar report on a statistical approach to machine
Seminar report on a statistical approach to machineSeminar report on a statistical approach to machine
Seminar report on a statistical approach to machineHrishikesh Nair
 
A new hybrid metric for verifying
A new hybrid metric for verifyingA new hybrid metric for verifying
A new hybrid metric for verifyingcsandit
 
DICTIONARY BASED AMHARIC-ARABIC CROSS LANGUAGE INFORMATION RETRIEVAL
DICTIONARY BASED AMHARIC-ARABIC CROSS LANGUAGE INFORMATION RETRIEVALDICTIONARY BASED AMHARIC-ARABIC CROSS LANGUAGE INFORMATION RETRIEVAL
DICTIONARY BASED AMHARIC-ARABIC CROSS LANGUAGE INFORMATION RETRIEVALcsandit
 
Extracting numerical data from unstructured Arabic texts(ENAT)
Extracting numerical data from unstructured Arabic  texts(ENAT)Extracting numerical data from unstructured Arabic  texts(ENAT)
Extracting numerical data from unstructured Arabic texts(ENAT)nooriasukmaningtyas
 
Smart grammar a dynamic spoken language understanding grammar for inflective ...
Smart grammar a dynamic spoken language understanding grammar for inflective ...Smart grammar a dynamic spoken language understanding grammar for inflective ...
Smart grammar a dynamic spoken language understanding grammar for inflective ...ijnlc
 
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...Different valuable tools for Arabic sentiment analysis: a comparative evaluat...
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...IJECEIAES
 
MOLTO poster for ACL 2010, Uppsala Sweden
MOLTO poster for ACL 2010, Uppsala SwedenMOLTO poster for ACL 2010, Uppsala Sweden
MOLTO poster for ACL 2010, Uppsala SwedenOlga Caprotti
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguisticsSpringer
 
Building of Database for English-Azerbaijani Machine Translation Expert System
Building of Database for English-Azerbaijani Machine Translation Expert SystemBuilding of Database for English-Azerbaijani Machine Translation Expert System
Building of Database for English-Azerbaijani Machine Translation Expert SystemWaqas Tariq
 
A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models IJECEIAES
 
An Arabizi-English Social Media Statistical Machine Translation System
An Arabizi-English Social Media Statistical Machine Translation SystemAn Arabizi-English Social Media Statistical Machine Translation System
An Arabizi-English Social Media Statistical Machine Translation SystemSDL
 
T URN S EGMENTATION I NTO U TTERANCES F OR A RABIC S PONTANEOUS D IALOGUES ...
T URN S EGMENTATION I NTO U TTERANCES F OR  A RABIC  S PONTANEOUS D IALOGUES ...T URN S EGMENTATION I NTO U TTERANCES F OR  A RABIC  S PONTANEOUS D IALOGUES ...
T URN S EGMENTATION I NTO U TTERANCES F OR A RABIC S PONTANEOUS D IALOGUES ...ijnlc
 
Lexigraf - a multilingual lexicography DTP engine
Lexigraf - a multilingual lexicography DTP engineLexigraf - a multilingual lexicography DTP engine
Lexigraf - a multilingual lexicography DTP engineYiannis Hatzopoulos
 
XMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic LanguageXMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic LanguageWaqas Tariq
 
New text steganography method using the arabic letters dots
New text steganography method using the arabic letters dotsNew text steganography method using the arabic letters dots
New text steganography method using the arabic letters dotsnooriasukmaningtyas
 
04. 9990 16097-1-ed (edited arf)
04. 9990 16097-1-ed (edited arf)04. 9990 16097-1-ed (edited arf)
04. 9990 16097-1-ed (edited arf)IAESIJEECS
 

Similar to English to Arabic Machine Translation of Mathematical Documents (20)

Design and Implementation of a Language Assistant for English – Arabic Texts
Design and Implementation of a Language Assistant for English – Arabic TextsDesign and Implementation of a Language Assistant for English – Arabic Texts
Design and Implementation of a Language Assistant for English – Arabic Texts
 
Seminar report on a statistical approach to machine
Seminar report on a statistical approach to machineSeminar report on a statistical approach to machine
Seminar report on a statistical approach to machine
 
A new hybrid metric for verifying
A new hybrid metric for verifyingA new hybrid metric for verifying
A new hybrid metric for verifying
 
Arabic MT Project
Arabic MT ProjectArabic MT Project
Arabic MT Project
 
Jq3616701679
Jq3616701679Jq3616701679
Jq3616701679
 
DICTIONARY BASED AMHARIC-ARABIC CROSS LANGUAGE INFORMATION RETRIEVAL
DICTIONARY BASED AMHARIC-ARABIC CROSS LANGUAGE INFORMATION RETRIEVALDICTIONARY BASED AMHARIC-ARABIC CROSS LANGUAGE INFORMATION RETRIEVAL
DICTIONARY BASED AMHARIC-ARABIC CROSS LANGUAGE INFORMATION RETRIEVAL
 
Extracting numerical data from unstructured Arabic texts(ENAT)
Extracting numerical data from unstructured Arabic  texts(ENAT)Extracting numerical data from unstructured Arabic  texts(ENAT)
Extracting numerical data from unstructured Arabic texts(ENAT)
 
Smart grammar a dynamic spoken language understanding grammar for inflective ...
Smart grammar a dynamic spoken language understanding grammar for inflective ...Smart grammar a dynamic spoken language understanding grammar for inflective ...
Smart grammar a dynamic spoken language understanding grammar for inflective ...
 
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...Different valuable tools for Arabic sentiment analysis: a comparative evaluat...
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...
 
MOLTO poster for ACL 2010, Uppsala Sweden
MOLTO poster for ACL 2010, Uppsala SwedenMOLTO poster for ACL 2010, Uppsala Sweden
MOLTO poster for ACL 2010, Uppsala Sweden
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
Building of Database for English-Azerbaijani Machine Translation Expert System
Building of Database for English-Azerbaijani Machine Translation Expert SystemBuilding of Database for English-Azerbaijani Machine Translation Expert System
Building of Database for English-Azerbaijani Machine Translation Expert System
 
A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models
 
An Arabizi-English Social Media Statistical Machine Translation System
An Arabizi-English Social Media Statistical Machine Translation SystemAn Arabizi-English Social Media Statistical Machine Translation System
An Arabizi-English Social Media Statistical Machine Translation System
 
T URN S EGMENTATION I NTO U TTERANCES F OR A RABIC S PONTANEOUS D IALOGUES ...
T URN S EGMENTATION I NTO U TTERANCES F OR  A RABIC  S PONTANEOUS D IALOGUES ...T URN S EGMENTATION I NTO U TTERANCES F OR  A RABIC  S PONTANEOUS D IALOGUES ...
T URN S EGMENTATION I NTO U TTERANCES F OR A RABIC S PONTANEOUS D IALOGUES ...
 
Lexigraf - a multilingual lexicography DTP engine
Lexigraf - a multilingual lexicography DTP engineLexigraf - a multilingual lexicography DTP engine
Lexigraf - a multilingual lexicography DTP engine
 
Lit mtap
Lit mtapLit mtap
Lit mtap
 
XMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic LanguageXMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic Language
 
New text steganography method using the arabic letters dots
New text steganography method using the arabic letters dotsNew text steganography method using the arabic letters dots
New text steganography method using the arabic letters dots
 
04. 9990 16097-1-ed (edited arf)
04. 9990 16097-1-ed (edited arf)04. 9990 16097-1-ed (edited arf)
04. 9990 16097-1-ed (edited arf)
 

More from gerogepatton

13th International Conference on Software Engineering and Applications (SEA 2...
13th International Conference on Software Engineering and Applications (SEA 2...13th International Conference on Software Engineering and Applications (SEA 2...
13th International Conference on Software Engineering and Applications (SEA 2...gerogepatton
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)gerogepatton
 
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATION
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATIONAN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATION
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATIONgerogepatton
 
10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...gerogepatton
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)gerogepatton
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)gerogepatton
 
A Machine Learning Ensemble Model for the Detection of Cyberbullying
A Machine Learning Ensemble Model for the Detection of CyberbullyingA Machine Learning Ensemble Model for the Detection of Cyberbullying
A Machine Learning Ensemble Model for the Detection of Cyberbullyinggerogepatton
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)gerogepatton
 
10th International Conference on Data Mining (DaMi 2024)
10th International Conference on Data Mining (DaMi 2024)10th International Conference on Data Mining (DaMi 2024)
10th International Conference on Data Mining (DaMi 2024)gerogepatton
 
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...gerogepatton
 
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATION
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATIONWAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATION
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATIONgerogepatton
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)gerogepatton
 
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...gerogepatton
 
10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...gerogepatton
 
The International Journal of Artificial Intelligence & Applications (IJAIA)
The International Journal of Artificial Intelligence & Applications (IJAIA)The International Journal of Artificial Intelligence & Applications (IJAIA)
The International Journal of Artificial Intelligence & Applications (IJAIA)gerogepatton
 
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecture
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer ArchitectureFoundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecture
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecturegerogepatton
 
2nd International Conference on Computer Science, Engineering and Artificial ...
2nd International Conference on Computer Science, Engineering and Artificial ...2nd International Conference on Computer Science, Engineering and Artificial ...
2nd International Conference on Computer Science, Engineering and Artificial ...gerogepatton
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)gerogepatton
 
Website URL:https://www.airccse.org/journal/ijaia/ijaia.html Review of AI Mat...
Website URL:https://www.airccse.org/journal/ijaia/ijaia.html Review of AI Mat...Website URL:https://www.airccse.org/journal/ijaia/ijaia.html Review of AI Mat...
Website URL:https://www.airccse.org/journal/ijaia/ijaia.html Review of AI Mat...gerogepatton
 
13th International Conference on Soft Computing, Artificial Intelligence and ...
13th International Conference on Soft Computing, Artificial Intelligence and ...13th International Conference on Soft Computing, Artificial Intelligence and ...
13th International Conference on Soft Computing, Artificial Intelligence and ...gerogepatton
 

More from gerogepatton (20)

13th International Conference on Software Engineering and Applications (SEA 2...
13th International Conference on Software Engineering and Applications (SEA 2...13th International Conference on Software Engineering and Applications (SEA 2...
13th International Conference on Software Engineering and Applications (SEA 2...
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATION
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATIONAN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATION
AN IMPROVED MT5 MODEL FOR CHINESE TEXT SUMMARY GENERATION
 
10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
A Machine Learning Ensemble Model for the Detection of Cyberbullying
A Machine Learning Ensemble Model for the Detection of CyberbullyingA Machine Learning Ensemble Model for the Detection of Cyberbullying
A Machine Learning Ensemble Model for the Detection of Cyberbullying
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
10th International Conference on Data Mining (DaMi 2024)
10th International Conference on Data Mining (DaMi 2024)10th International Conference on Data Mining (DaMi 2024)
10th International Conference on Data Mining (DaMi 2024)
 
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
 
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATION
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATIONWAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATION
WAVELET SCATTERING TRANSFORM FOR ECG CARDIOVASCULAR DISEASE CLASSIFICATION
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
Passive Sonar Detection and Classification Based on Demon-Lofar Analysis and ...
 
10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...10th International Conference on Artificial Intelligence and Applications (AI...
10th International Conference on Artificial Intelligence and Applications (AI...
 
The International Journal of Artificial Intelligence & Applications (IJAIA)
The International Journal of Artificial Intelligence & Applications (IJAIA)The International Journal of Artificial Intelligence & Applications (IJAIA)
The International Journal of Artificial Intelligence & Applications (IJAIA)
 
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecture
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer ArchitectureFoundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecture
Foundations of ANNs: Tolstoy’s Genius Explored using Transformer Architecture
 
2nd International Conference on Computer Science, Engineering and Artificial ...
2nd International Conference on Computer Science, Engineering and Artificial ...2nd International Conference on Computer Science, Engineering and Artificial ...
2nd International Conference on Computer Science, Engineering and Artificial ...
 
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
 
Website URL:https://www.airccse.org/journal/ijaia/ijaia.html Review of AI Mat...
Website URL:https://www.airccse.org/journal/ijaia/ijaia.html Review of AI Mat...Website URL:https://www.airccse.org/journal/ijaia/ijaia.html Review of AI Mat...
Website URL:https://www.airccse.org/journal/ijaia/ijaia.html Review of AI Mat...
 
13th International Conference on Soft Computing, Artificial Intelligence and ...
13th International Conference on Soft Computing, Artificial Intelligence and ...13th International Conference on Soft Computing, Artificial Intelligence and ...
13th International Conference on Soft Computing, Artificial Intelligence and ...
 

Recently uploaded

RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computationsit20ad004
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationBoston Institute of Analytics
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computation
 
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health Classification
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

English to Arabic Machine Translation of Mathematical Documents

  • 1. International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023 DOI:10.5121/ijaia.2023.14602 13 ENGLISH TO ARABIC MACHINE TRANSLATION OF MATHEMATICAL DOCUMENTS Mustapha Eddahibi and Mohammed Mensouri IMIS Laboratory, Ibnou Zohr University – Agadir Morocco ABSTRACT This paper is about the development of a machine translation system tailored specifically for LATEX mathematical documents. The system focuses on translating English LATEX mathematical documents into Arabic LATEX, catering to the growing demand for multilingual accessibility in scientific and mathematical literature. With the vast proliferation of LATEX mathematical documents the need for an efficient and accurate translation system has become increasingly essential. This paper addresses the necessity for a robust translation tool that enables seamless communication and comprehension of complex mathematical content across language barriers. The proposed system leverages a Transformer model as the core of the translation system, ensuring enhanced accuracy and fluency in the translated Arabic LATEX documents. Furthermore, the integration of RyDArab, an Arabic mathematical TEX extension, along with a rule-based translator for Arabic mathematical expressions, contributes to the precise rendering of complex mathematical symbols and equations in the translated output. The paper discusses the architecture, methodology, of the developed system, highlighting its efficacy in bridging the language gap in the domain of mathematical documentation. KEYWORDS Machine Translation System, LATEX mathematical Documents, Arabic mathematical notation, Text-to- Text Transfer Transformer, RyDArab, Language Processing. 1. INTRODUCTION Machine Translation (MT), is the conversion of text from one language to another using computers. This process involves the utilization of programming languages and software to accomplish the translation tasks [1]. Various types of content, ranging from commercial and business documents to scientific papers, instruction manuals, textbooks, and online materials, require translation services. Presently, machine translation has found widespread application in the language services industry and has become a common tool for professional translators. However, machine translation has limitations and is not suitable for every type of textual content. The challenges in Machine Translation stem from the disparities in grammatical structures existing between the source and target languages. Furthermore, the complexity of translation can be influenced by the specific category of the text. While the translation of everyday spoken language is relatively straightforward, rendering poetry, philosophical treatises, scientific or technical documents poses more intricate challenges [2]. The seamless translation of complex mathematical documents from English to Arabic stands as a big challenge in the machine translation field. Significant progress has been achieved in this domain, the intricacies of mathematical language, particularly in the context of LATEX [3] syntax, present a critical gap that demands attention. This study delves into uncharted territory, aiming to bridge the gap in the translation of mathematical content into the Arabic language, a domain remains relatively unexplored in the
  • 2. International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023 14 current body of literature. Leveraging cutting-edge methodologies such as LATEX parsing, mathematical expression tokenization, and the utilization of a Transformer for natural language translation, alongside a rule-based translator for managing mathematical expressions, our research aspires to contribute significantly to the burgeoning field of automated translation. By building upon recent advancements in pre-trained language models, we seek to unravel the complex interplay between mathematical expressions and linguistic structures, thereby unlocking the potential for comprehensive and accurate translation. Despite the acknowledging the inherent intricacies linked with handling symbolic mathematical content, the absence of a suitable corpus remains a notable limitation. The intricate nature of mathematical language, coupled with the unique challenges posed by Arabic notation, necessitates a meticulous and nuanced approach to the translation process. This research holds substantial promise for the educational landscape, benefitting researchers, educators, and students alike. By facilitating a deeper comprehension of intricate mathematical documents, our work endeavours to streamline the creation, modification, and dissemination of these materials in both English and Arabic contexts. The digitized representation of such documents not only fosters their creation and archiving but also enables their seamless exchange via various communication networks. However, realizing this objective necessitates overcoming several challenges, particularly in dealing with the distinctive nature of mathematical expressions in Arabic notation. 2. RELATED WORK The prominence of English as the universal language of science emerged a mere four centuries ago, marking a significant turning point in global communication. Its unparalleled expansion across the world post-World War II solidified its dominance, surpassing the influence of any other language in history [4]. This trend has underscored the democratization of scientific knowledge, granting access to a wider audience beyond traditional linguistic barriers. Nonetheless, it emphasizes the critical importance of promoting education in one's native language, highlighting the necessity for striking a balance between the advantages of a common scientific language and the preservation of cultural and linguistic diversity for comprehensive learning. The recognition of the significance of automated translation emerged many years back. Numerous studies have focused on machine translation within the scientific realm. Tehseen et al. introduced an English-to-Urdu scientific text translator that employs term tagging and domain- specific translation [5]. This translator was specifically designed for the translation of computer science documents and was assessed using a self-generated corpus within the computer science domain. The translator algorithm didn’t show mathematical expressions translation or used file format. Because of the widespread adoption of LATEX stems owing to its capacity to manage complex mathematical expressions, equations, and symbols with unparalleled precision and elegance, the translation of documents in such format is a great opportunity. Ohri et al. developed a machine translation system named Polymath for converting LATEX documents with mathematical text [6]. This system is capable of translating English LATEX content to French LATEX. It operates by transforming the main content of an input LATEX document into English sentences with mathematical tokens, followed by the application of a pre-trained Transformer-based translator model.
  • 3. International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023 15 The process of translating LATEX documents from English to French is relatively straightforward compared to the challenges encountered when translating from English to Arabic. This discrepancy arises from the fundamental differences between the Arabic and English mathematical notations. The Arabic mathematical system, distinct in its structure and representation, introduces complexities that are not encountered in the French translation process. The nuances of Arabic mathematical symbols and expressions often require specific handling techniques and an in-depth understanding of the language's unique grammatical rules and conventions. Despite the potential complexities, the English to French LATEX translation process proves to be more seamless due to the greater similarity in notation and linguistic constructs between the two languages. 3. ARABIC MATHEMATICAL NOTATION In Arabic scientific documents there are two models of mathematical expressions: mixed and Pure Arabic notations. Mixed mathematical notation used in some countries like Morocco, is the outcome of a literal word-to-word translation of mathematical French books. The syntactic models of several formulations have been transported to the new language. The symbolic writing was imported just as it was, without any changes. Figure 1. Mixed Mathematical Notation In the pure Arabic presentation, mathematical expressions spread from right to left and use Arabic symbols from its alphabet. These symbols are used to note unknown variables and functions names. As for common functions that are replaced by their abbreviated names. In this notation we can distinguish two types. The first uses some mirrored latin symbols like the sum . The second type uses some Arabic alphabet-based symbols like for the sum. Figure 2. Pure Arabic Notation with mirrored symbols
  • 4. International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023 16 Figure 3. Pure Arabic Notation with Arabic alphabet-based symbols 4. TRANSLATION OF THE LATEX DOCUMENT The process of translating the LATEX document comprises two distinct phases, each contributing to the accurate conversion of its contents. Initially, the document undergoes a comprehensive parsing phase, allowing for the precise identification and isolation of its constituent elements. Subsequently, the second phase entails the systematic extraction of mathematical expressions, subsequently replaced by an indexed list of tokens (Exprr1, Exprr2, etc.). These identified expressions, denoted as Exprri, serve as novel entities during the natural language translation process, functioning as placeholders for the mathematical components. Consequently, the mathematical expressions are stored within a list, with the corresponding tokens acting as their respective indices. The subsequent translation process relies on a rule-based translation function tailored to the unique characteristics of mathematical notations. This function operates using the specific mathematical expression, its associated token, and the corresponding notation type, ensuring the seamless and accurate transformation of the mathematical content within the document. 4.1. English to Arabic Natural Language Translator The natural language translation part of the system is based on transformer architecture [7]. Opting for transformers over RNNs is supported by their capacity to process input data in parallel. They are not susceptible to the vanishing gradient issue. Transformers proficiently grasp intricate connections among various components of the input and utilize attention mechanisms to focus on relevant portions of the input sequence [8]. Before the translation a pre-processing phase is done. The content of parsed LATEX blocs is segmented to sentences. Certain formatting commands, such as textbf and textit, which alter the style of individual or multiple words within a sentence, are eliminated to streamline the text. This process aims to enhance the translation, particularly considering that Arabic typographic conventions do not employ bold, italic, or roman styles, among others. The transformer undergoes training and validation using datasets comprising English words, sentences, and paragraphs along with their corresponding Arabic translations. Its architecture consists of two primary components: an encoder and a decoder for sequence-to-sequence translation, alongside an attention mechanism. Figure 4 provides an overview of the system.
  • 5. International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023 17 Figure 4. Overview of the developed system 4.2. Translation of Mathematical Expression to Arabic Notation The authors note that the existing pool of translations from English to Arabic for mathematical expressions lacks sufficiency for the utilization of neural network-based translation methods. Consequently, the current optimal approach is rule-based translation, despite its non-exhaustive nature [9]. The translation process involves the following steps:  Utilization of TEX[10] packages, including RyDArab[11] for typesetting Arabic mathematical expressions and curext [12] for handling stretched mathematical symbols in the resulting LATEX file.  Introduction of the command arabmath to each mathematical expression to ensure proper alignment.  Transliteration of alphabetical symbols into their Arabic equivalents.  Implementation of the commands warabnum for standard Western Arabic digits or earabnum for Eastern Arabic digits (Hindi digits) .  Use of alp without dots for alphabetic symbols without dots ( ) and alpwithdots for those with dots ( ).  Application of fun with dots and fun without dots commands for elementary functions with and without dots, respectively ( , ). When these options are incorporated into the preamble of the input file, the commands are universally applied to all mathematical expressions. To address document encoding concerns, the output file must adhere to either the ISO-8859-6 encoding system for ArabTEX [13] or UTF-8 for Omega [14].  Furthermore, specific transformations are applied to mathematical symbols, such as replacing sum with lsum ( ), csum ( ) for its Arabic literal linear and curved equivalents, and ssum ( ) for the mirrored equivalent. Similar transformations are implemented for the product command.
  • 6. International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023 18 5. RESULTS, LIMITATIONS AND PERSPECTIVES For system evaluation, we conducted a meticulous assessment using targeted documents that explore mathematical concepts similar to the ones discovered in textbooks used in primary and secondary education courses. The results for translating mathematical expressions were generally satisfactory, showcasing the system's competency. However, some discrepancies were observed, which can be attributed to the inherent limitations of the non-exhaustive rule-based method. Notably, the machine translation performance, utilizing the transformer model, exhibited markedly low BLUE scores. The BLEU (Bilingual Evaluation Understudy) score [15] is a frequently employed measure for assessing the quality of machine-generated translations by comparing them to human-generated references. In this context, the lower BLUE scores indicate a misalignment between the machine-generated translations and the expected human references. This suboptimal performance can be traced back to two key factors: the relatively modest size of the dataset used for training and the substitution of mathematical expressions with "exprri" tokens in the natural language input. The limited dataset size may have hindered the model's exposure to diverse linguistic patterns, leading to challenges in accurately capturing the nuances of natural language. Additionally, the abstraction introduced by the "exprri" tokens may have contributed to a loss of contextual information during the translation process, further impacting the system's ability to generate linguistically fluent and contextually accurate translations. Addressing these issues by augmenting the dataset and refining the tokenization approach could potentially enhance the system's overall translation performance. In future work, we aim to advance towards a comprehensive machine translation system capable of translating entire LATEX documents, including intricate LATEX commands, into Arabic. This expansion is driven by the integration of the DadTEX [16] extension, designed for completely Arabic TEX environments. This ambitious undertaking holds potential to enhance the accessibility of scientific literature by providing seamless translation of diverse document components. The incorporation of DadTEX signifies a crucial step towards facilitating cross- linguistic communication in scientific and mathematical domains. The future perspectives emphasize a commitment to overcoming language barriers and promoting inclusivity in scientific discourse. 6. CONCLUSIONS In conclusion, this paper introduces a specialized machine translation system for LATEX mathematical documents, concentrating on translating English LATEX mathematical content into Arabic LATEX. Driven by the escalating demand for multilingual accessibility in scientific literature, especially in the realm of complex mathematical expressions documented in LATEX, the proposed system employs a text-to-text transfer transformer to enhance accuracy and fluency in the translated Arabic LATEX documents. The integration of RyDArab, an Arabic mathematical TEX extension, and a rule-based translator for Arabic mathematical expressions ensures precise rendering of intricate symbols and equations in the translated output. The discussed architecture, methodology, and performance evaluation underscore the system's effectiveness in overcoming language barriers in mathematical documentation.
  • 7. International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023 19 Acknowledging the complexities of handling symbolic mathematical content, the study underscores the necessity for an adequate corpus to improve translation accuracy. Despite challenges posed by the unique nature of Arabic mathematical expressions, this research promises significant benefits for education, aiding researchers, educators, and students in comprehending intricate mathematical documents in both English and Arabic. Addressing the unexplored realm of translating mathematical content into Arabic, this paper recognizes and navigates the specific challenges posed by Arabic mathematical notations. It also contributes valuable insights into related work, emphasizing the importance of automated translation in scientific communication, particularly focusing on LATEX documents. In summary, this study seeks to address a crucial void in automated translation, particularly in mathematical documents, laying the groundwork for further advancements in multilingual accessibility within the scientific and mathematical community. REFERENCES [1] Leslie Lamport. LATEX: A Document Preparation System. Addison-Wesley, 1986. [2] Azzeddine Lazrek, RyDArab—Typesetting Arabic mathematical expressions , TUGboat , Volume 25, Number 2, pp. 141-149, 2004 ( http://www.tug.org/TUGboat/Articles/tb25-2/tb81lazrek.pdf ) [3] Donald E. Knuth. The TEXbook. Volume A, Addison-Wesley, 1984. [4] Haifeng Wang, Hua Wu, Zhongjun He, Liang Huang, Kenneth Ward Church, Progress in Machine Translation, Engineering, Volume 18, 2022, Pages 143-153, ISSN 2095-8099, https://doi.org/10.1016/j.eng.2021.03.023. [5] A. A. Zhivotova, V. D. Berdonosov and E. V. Redkolis, "Improving the Quality of Scientific Articles Machine Translation While Writing Original Text," 2020 International Multi-Conference on Industrial Engineering and Modern Technologies (FarEastCon), Vladivostok, Russia, 2020, pp. 1-4, doi: 10.1109/FarEastCon50210.2020.9271442. [6] Emma Steigerwald, Valeria Ramírez-Castañeda, Débora Y C Brandt, András Báldi, Julie Teresa Shapiro, Lynne Bowker, Rebecca D Tarvin,"Overcoming Language Barriers in Academia: Machine Translation Tools and a Vision for a Multilingual Future", BioScience, Vo.72 N° 10, pp 988- 998,2022 doi 10.1093/biosci/biac062 [7] Irsha Tehseen, Ghulam Rasool Tahir, Khadija Shakeel & Mubbashir Ali (2018). Corpus Based Machine Translation for Scientific Text. In: Iliadis, L., Maglogiannis, I., Plagianakos, V. (eds) Artificial Intelligence Applications and Innovations. AIAI 2018. IFIP Advances in Information and Communication Technology, vol 519. Springer, Cham. https://doi.org/10.1007/978-3-319-92007- 8_17 [8] ADITYA OHRI, TANYA SCHMAH, "Machine Translation of Mathematical Text," in IEEE Access, vol. 9, pp. 38078-38086, 2021, doi: 10.1109/ACCESS.2021.3063715. [9] Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is All You Need. In Proceedings of the 31st International Conference on Neural Information Processing Systems (Long Beach, California, USA). 6000–6010. [10] Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. J. Mach. Learn. Res. 21: 140:1-140:67 (2020) [11] Felix Petersen, Moritz Schubotz, Andre Greiner-Petter, and Bela Gipp. 2023. Neural Machine Translation for Mathematical Formulae. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 11534–11550, Toronto, Canada. Association for Computational Linguistics [12] Azzeddine Lazrek, CurExt, Typesetting variable-sized curved symbols , TUGboat, Volume 24, number 3, pp. 323–327, 2003 EuroTeX2003: 14th European TEX Conference, Back to Typography, Brest, France, EuroTeX2003 Preprints pp. 67-71, 2003 [13] Klaus Lagally. ArabTEX - Typesetting Arabic with Vowels and Ligatures. Proceedings of the 7th European TEX Conference, Prague 1992.
  • 8. International Journal of Artificial Intelligence and Applications (IJAIA), Vol.14, No.6, November 2023 20 [14] Yannis Haralambous and John Plaice. Multilingual Typesetting with Ω a Case Study: Arabic. Proceedings of the International Symposium on Multilingual Information Processing, pp. 137–154, Tsukuba, 1997. [15] Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics (ACL '02). Association for Computational Linguistics, USA, 311–318. https://doi.org/10.3115/1073083.1073135 [16] Mustapha Eddahibi, Azzeddine Lazrek and Khalid Sami, DadTeX—A full Arabic interface, TUGboat, Volume 27, Number 2, pp. 154-158, 2006 AUTHORS Prof. Mustapha Eddahibi received his PhD from Unversity Cadi Ayyad Marrakech in 2007. He is currently computer science teacher researcher in University Ibn Zohr. He is a former head of the decisional expert systems research team. His research interests are in the area of intelligent computing, information engineering, digital Information Encoding and Processing. Prof. Mohammed Mensouri received the M.S degree in Networks and Telecommunication in 2008, from Faculty of Sciences and Technology, Cadi Ayyad University, Marrakech, Morocco. In 2015, He received Ph.D of Computer Science in Faculty of Sciences, Chouaib Doukkali University, El Jadida, Morocco. He is professor at Ibn Zohr University, Agadir, Moroc. His research interest information theory and channel coding , especially error correction codes.