SlideShare a Scribd company logo
1 of 5
Role of Language Engineering to Preserve Endangered Languages
Amit Kumar Jha1, Sumit Kumar Gupta2, Piyush Pratap Singh3
Centre for Information and Language Engineering
Mahatma Gandhi Antarrashtriya Hindi Vishwvidyalaya , Wardha (MS)
Abstract:
Endangered language (EL) is the language community incorporates less number of
speakers of that particular language. EL is likely to become extinct in the near future. Many
languages are failing out of use and being substituted by others is more widely used in the
region or nation. Language Engineering (LE) is the subfield of computer science which
explores the field of language related software and its feasible hardware development. With
the help of language engineering, man and machine interface can be designed to preserve for
longer time.
The paper also states about how the applications of language engineering contributes
the significant role to preserve Endangered Language (EL). Documentation is the primary
task to preserve in appropriate shape to move ahead with time frame. This paper also
described that how the LE makes easy to the process of language documentation. Being
assisted of Digital data, we preserve EL due to durability of digital data more than others type
of data.
There are some languages whose literature is linguistically very rich but the number
of speaker is countable, such as Nihali, language which also count in future as EL. Therefore
for preserving these types of EL, the documentation and Digitalization of the EL is quite
crucial, which proliferate the ease of access of EL to major class of people.
Keywords: Language Engineering, Endangered Language, Language Documentation, Man
machine interaction, Speech technology.
Introduction:
Language Engineering (LE) is the subfield of computer science which explores the field of
language related software and its feasible hardware development.
Language Engineering
Computer Science Engineering
Software Engineering Hardware Engineering
Language RelatedSoftware Language RelatedHardware
In other words, Language Engineering is the application of knowledge of
language to the development of computer systems which can recognise, understand, interpret,
and generate human language in all its forms.
In practice, Language Engineering comprises a set of techniques and language
resources. The former are implemented in computer software and the latter are a repository of
knowledge which can be accessed by computer software.
The ultimate goal of LE is to develop a machine which is able to understand and
generate natural language. This language may be endangered. Two types of processing is
conduct in LE – Text Processing and Speech Processing.
As part of an important effort to document endangered languages before they
become extinct. A variety of endangered language repositories have emerged to provided
shared locations for field linguists to store data. These repositories vary greatly in the way
their collections are organized and in the metadata they collect from depositors. The
availability of collections of low resource language data has the potential for some interesting
tasks in Natural Language and Speech Processing. However, the variety of formats in these
repositories makes it difficult to know how much of the available data is suitable for such a
task.
The world is experiencing an unprecedented wave of language extinctions. There
are between 6,000 and 7,000 languages currently spoken, and between 50 to 90 per cent of
those will be extinct by the year 2100. Language extinction results in loss of cultural
identities, knowledge systems, and the variety of data needed to understand the structure of
language in the mind. Documenting endangered languages preserves data and stimulates
language maintenance and revitalisation.
If a person knows more than two languages his thinking and reasoning capacity is
more. The loss of speakers in one language is the gain of speakers of another language,
except for cases of genocide. Languages are generally replaced when an entire speech
community shifts to another language. Replacing languages are very often official state
languages. A language becomes endangered when the language does not transfer to the next
generation. There are some languages which has a number of speakers of old age but that
language didn’t transferred to the new or next generation then that language become
endangered.
As it is known to all that Indian society is the society of multi-language. So being
a speaker of the language every person should transfer their own language to the next
generation.
Application of Language Engineering:
The applications of Language Engineering are divided in two groups – Text Processing
application and Speech Processing application.
The applications of language engineering are as follow-
➢ Speech Generation– With the help of language engineering we can generate the
speech of Endangered Language by a machine. If a machine will be able to generate
EL then we can preserve that Language. Speech Generation is the application area of
LE, Which is used to generate the speech of any natural language.
➢ Language Translator– Language translator is the application of LE. Language
translator or Machine translator is a machine which is able to translate one language
to another language. The first language is called source language and the second
language is called the target language. If the Source language or the target language is
EL, EL is preventing by this Language Translator system.. If the Endangered
language translator is developed then that language may be used for a long time.
➢ Speech-to-Text– It is the process of converting speech to text. This is the task of
documentation. If we convert speech file to text file of EL then we preserve that
language.
➢ Text-to-Speech – Text-to-speech system is the system in which text data is input and
it return speech data as output. It plays important role in Man-Machine interaction.
➢ Language Teaching- Language Teaching is the process of teaching a language. With
the help of LE we can create a system for teaching a language. If EL teaching system
is created EL may be preserve. As it is known that there are some language which has
the speakers of old age and this language doesn’t transfer to the next generation. After
some that language becomes dead. To preserve this language this system is important.
➢ Transcription Tool -Transcription is the process in which one script to another
script. A person which is unknown to a specific language, its script and pronunciation,
the role of Transcription tool is important in this context. If Transcription tool for an
EL will be developed then we increase the number of people to understand that
language.
➢ Text or Document summarization–It is one of the earliest applications of discourse
structure analysis. it is used to summarize the text or Document if we summarize a
text than it is easy to read and understand.
➢ Information Extraction - The task of information extraction is to extract from text-
named entities13 relations that hold between them, and event structures in which they
play a role. IE systems focus on specific domains (e.g., terrorist incidents) or specific
types of relations (e.g., people and their dates of birth, protein–protein interactions).
Event structures are often described by templates in IE, where the named entities to be
extracted fill in specific slots.
➢ Speaker Identification and Verification Speech Recognition - Speaker
Identification and verification means to identify and verify that what is speaking. This
is an important application in forensic science.
➢ Speech Recognition – Speech Recognition is the application of language
engineering. It identifies what is to speak.
➢ Character Image Recognition – Character Image Recognition, recognize the
character in an image file. This is the application area of LE. It is used to captcha
recognition.
➢ Segmentation – Segmentation is a task to segment the natural language to its
consistent part i.e. we can segment the paragraph of natural language into sentences,
phrases, words and syllables.
➢ Question-Answering System – Question-Answering system is an application of LE.
If we asked a question to this system than this system return an appropriate answer of
this question. We can asked the question in natural language. If we have the facility to
asked the question in EL, then the speaker of EL feel comfort to asked the question.
➢ Word sense Disambiguation – As it is known that there are some words in all
natural languages which has more meaning, it is called ambiguity. To resolve this
problem is called Disambiguation. LE try to develop a system which is able to
disambiguate the sense of the words, this system is called word sense disambiguation.
To preserve EL the role of language engineering is very important. Language
documentation is the main process to preserve EL. Language Documentation is the process in
which the speech and text corpus of that language is collected. For collecting speech corpus
of endangered language the researcher has gone to the field where the speakers of endangered
language live. They record the sound in natural environment. After recording the sound file
they analyze that sound files.
If a Linguist researchers want to document one of the larger languages such as
English, Chinese, Hindi etc., they can rely on already existing data and quite easily find
samples of written and spoken language from which they could build up their documentation:
books, newspapers and other written documents from the past and the present, many of these
already digitalized, television and radio shows that can be recorded or simply downloaded
from the Internet, language used in Internet forums and other social media, and many more.
Because computers, the Internet and recording devices are widely available, the amount of
such data and its accessibility is growing rapidly. For endangered languages, the situation is
often completely different. Many of these languages do not have a written tradition and
written data may be completely unavailable or sparse, the languages are not used in the
media, or their speakers do not use the Internet (and if they do, they often use another
language). In such cases, linguists must start from scratch and collect as much data as
possible by recording speakers of a given language. Ideally, language documentation contains
representative samples from different speakers – representing different age groups, different
professions, of both sexes, and different origins –, but in the case of endangered languages
this may not be possible, because the number of speakers is too small and/or there are only
elder speakers.An important issue apart from the number of speakers and amount of data
concerns the communication between the linguists or other researchers who want to
document a language, and the language community. In the case of endangered or minority
languages, the documenters often are outsiders, not members of the community. They may
not be fluent speakers of the language in question and can communicate with the speakers in
a second or a third language. This often leads to an unnatural use of the language that is to be
documented.
To preserve an endangered language, digitalization of EL is necessary like
language documentation. Digitalization is the process in which data is the store in the form of
digital. The durability of digital data is more than others types of data. To preserve EL by
Digitalization we convert and store data in digital form i.e. text, sound, image etc. The
researchers should create study material of EL in digital form.
Systematic study of a system is called Engineering. Engineering is the process
through which any task become easy and efficient. So we engineered the language.
Reference List :
1. B. WEBBER, M. EGG and V. KORDONI (2012). Discourse structure and language
technology. Natural Language Engineering
2. Jurafsky, Martin (et.al. ) Speech and Language Processing.Prentice Hall, Englewood
Cliffs, New Jersey 07632
3. Reiter, E. and Dale, R. (2000). Building Natural Language Generation
Systems.Cambridge University Press, Cambridge.
4. Yarowsky, D. (1996). Homograph disambiguation in text-to-speech synthesis.
InProgress in Speech Synthesis, pp. 159–175. Springer-Verlag, Berlin.
5. Small, S. L. and Rieger, C. (1982). Parsing and comprehending withWord Experts.In
Lehnert,W. G. and Ringle, M. H. (Eds.), Strategies for Natural Language
Processing,pp. 89–147. Lawrence Erlbaum, New Jersey.
6. www.sppel.org

More Related Content

What's hot

A Computational Model of Yoruba Morphology Lexical Analyzer
A Computational Model of Yoruba Morphology Lexical AnalyzerA Computational Model of Yoruba Morphology Lexical Analyzer
A Computational Model of Yoruba Morphology Lexical AnalyzerWaqas Tariq
 
Language and Intelligence
Language and IntelligenceLanguage and Intelligence
Language and Intelligencebutest
 
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...Syeful Islam
 
Sanskrit and Computational Linguistic
Sanskrit and Computational Linguistic Sanskrit and Computational Linguistic
Sanskrit and Computational Linguistic Jaganadh Gopinadhan
 
Development of text to speech system for yoruba language
Development of text to speech system for yoruba languageDevelopment of text to speech system for yoruba language
Development of text to speech system for yoruba languageAlexander Decker
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics1101989
 
Natural Language Processing from Object Automation
Natural Language Processing from Object Automation Natural Language Processing from Object Automation
Natural Language Processing from Object Automation Object Automation
 
Design and Implementation of a Language Assistant for English – Arabic Texts
Design and Implementation of a Language Assistant for English – Arabic TextsDesign and Implementation of a Language Assistant for English – Arabic Texts
Design and Implementation of a Language Assistant for English – Arabic TextsIJCSIS Research Publications
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGEScsandit
 
Development of Bi-Directional English To Yoruba Translator for Real-Time Mobi...
Development of Bi-Directional English To Yoruba Translator for Real-Time Mobi...Development of Bi-Directional English To Yoruba Translator for Real-Time Mobi...
Development of Bi-Directional English To Yoruba Translator for Real-Time Mobi...CSCJournals
 
Optical character recognition for Ge'ez characters
Optical character recognition for Ge'ez charactersOptical character recognition for Ge'ez characters
Optical character recognition for Ge'ez charactershadmac
 
Voice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social mediaVoice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social mediacidroypaes
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognitionfathitarek
 
Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...TELKOMNIKA JOURNAL
 

What's hot (19)

Corpus Linguistics
Corpus LinguisticsCorpus Linguistics
Corpus Linguistics
 
A Computational Model of Yoruba Morphology Lexical Analyzer
A Computational Model of Yoruba Morphology Lexical AnalyzerA Computational Model of Yoruba Morphology Lexical Analyzer
A Computational Model of Yoruba Morphology Lexical Analyzer
 
Language and Intelligence
Language and IntelligenceLanguage and Intelligence
Language and Intelligence
 
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
 
Sanskrit and Computational Linguistic
Sanskrit and Computational Linguistic Sanskrit and Computational Linguistic
Sanskrit and Computational Linguistic
 
Development of text to speech system for yoruba language
Development of text to speech system for yoruba languageDevelopment of text to speech system for yoruba language
Development of text to speech system for yoruba language
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
Com ling
Com lingCom ling
Com ling
 
Natural Language Processing from Object Automation
Natural Language Processing from Object Automation Natural Language Processing from Object Automation
Natural Language Processing from Object Automation
 
Design and Implementation of a Language Assistant for English – Arabic Texts
Design and Implementation of a Language Assistant for English – Arabic TextsDesign and Implementation of a Language Assistant for English – Arabic Texts
Design and Implementation of a Language Assistant for English – Arabic Texts
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
 
Development of Bi-Directional English To Yoruba Translator for Real-Time Mobi...
Development of Bi-Directional English To Yoruba Translator for Real-Time Mobi...Development of Bi-Directional English To Yoruba Translator for Real-Time Mobi...
Development of Bi-Directional English To Yoruba Translator for Real-Time Mobi...
 
Language engineering
Language engineeringLanguage engineering
Language engineering
 
Optical character recognition for Ge'ez characters
Optical character recognition for Ge'ez charactersOptical character recognition for Ge'ez characters
Optical character recognition for Ge'ez characters
 
Voice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social mediaVoice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social media
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...Speech Recognition Application for the Speech Impaired using the Android-base...
Speech Recognition Application for the Speech Impaired using the Android-base...
 
Psycolinguistic
PsycolinguisticPsycolinguistic
Psycolinguistic
 
Psichology And Languaje Learning
Psichology And Languaje LearningPsichology And Languaje Learning
Psichology And Languaje Learning
 

Similar to Role of language engineering to preserve endangered languages

Computational linguistics
Computational linguisticsComputational linguistics
Computational linguisticsAdnanBaloch15
 
Natural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and ChallengesNatural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and Challengesantonellarose
 
Syracuse UniversitySURFACEThe School of Information Studie.docx
Syracuse UniversitySURFACEThe School of Information Studie.docxSyracuse UniversitySURFACEThe School of Information Studie.docx
Syracuse UniversitySURFACEThe School of Information Studie.docxdeanmtaylor1545
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
Design and Development of Morphological Analyzer for Tigrigna Verbs using Hyb...
Design and Development of Morphological Analyzer for Tigrigna Verbs using Hyb...Design and Development of Morphological Analyzer for Tigrigna Verbs using Hyb...
Design and Development of Morphological Analyzer for Tigrigna Verbs using Hyb...kevig
 
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...Syeful Islam
 
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnRAtna29
 
Design Analysis Rules to Identify Proper Noun from Bengali Sentence for Univ...
Design Analysis Rules to Identify Proper Noun  from Bengali Sentence for Univ...Design Analysis Rules to Identify Proper Noun  from Bengali Sentence for Univ...
Design Analysis Rules to Identify Proper Noun from Bengali Sentence for Univ...Syeful Islam
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESLinda Garcia
 
DESIGN AND DEVELOPMENT OF MORPHOLOGICAL ANALYZER FOR TIGRIGNA VERBS USING HYB...
DESIGN AND DEVELOPMENT OF MORPHOLOGICAL ANALYZER FOR TIGRIGNA VERBS USING HYB...DESIGN AND DEVELOPMENT OF MORPHOLOGICAL ANALYZER FOR TIGRIGNA VERBS USING HYB...
DESIGN AND DEVELOPMENT OF MORPHOLOGICAL ANALYZER FOR TIGRIGNA VERBS USING HYB...kevig
 
Outlining Bangla Word Dictionary for Universal Networking Language
Outlining Bangla Word Dictionary for Universal Networking  LanguageOutlining Bangla Word Dictionary for Universal Networking  Language
Outlining Bangla Word Dictionary for Universal Networking LanguageIOSR Journals
 
Natural language processing
Natural language processingNatural language processing
Natural language processingKarenVacca
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageijnlc
 
Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)IT Industry
 
IMPLEMENTATION OF NLIZATION FRAMEWORK FOR VERBS, PRONOUNS AND DETERMINERS WIT...
IMPLEMENTATION OF NLIZATION FRAMEWORK FOR VERBS, PRONOUNS AND DETERMINERS WIT...IMPLEMENTATION OF NLIZATION FRAMEWORK FOR VERBS, PRONOUNS AND DETERMINERS WIT...
IMPLEMENTATION OF NLIZATION FRAMEWORK FOR VERBS, PRONOUNS AND DETERMINERS WIT...ijnlc
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogrikevig
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...kevig
 

Similar to Role of language engineering to preserve endangered languages (20)

Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
Natural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and ChallengesNatural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and Challenges
 
Language
LanguageLanguage
Language
 
Syracuse UniversitySURFACEThe School of Information Studie.docx
Syracuse UniversitySURFACEThe School of Information Studie.docxSyracuse UniversitySURFACEThe School of Information Studie.docx
Syracuse UniversitySURFACEThe School of Information Studie.docx
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
551 466-472
551 466-472551 466-472
551 466-472
 
Design and Development of Morphological Analyzer for Tigrigna Verbs using Hyb...
Design and Development of Morphological Analyzer for Tigrigna Verbs using Hyb...Design and Development of Morphological Analyzer for Tigrigna Verbs using Hyb...
Design and Development of Morphological Analyzer for Tigrigna Verbs using Hyb...
 
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
 
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
 
Design Analysis Rules to Identify Proper Noun from Bengali Sentence for Univ...
Design Analysis Rules to Identify Proper Noun  from Bengali Sentence for Univ...Design Analysis Rules to Identify Proper Noun  from Bengali Sentence for Univ...
Design Analysis Rules to Identify Proper Noun from Bengali Sentence for Univ...
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
 
DESIGN AND DEVELOPMENT OF MORPHOLOGICAL ANALYZER FOR TIGRIGNA VERBS USING HYB...
DESIGN AND DEVELOPMENT OF MORPHOLOGICAL ANALYZER FOR TIGRIGNA VERBS USING HYB...DESIGN AND DEVELOPMENT OF MORPHOLOGICAL ANALYZER FOR TIGRIGNA VERBS USING HYB...
DESIGN AND DEVELOPMENT OF MORPHOLOGICAL ANALYZER FOR TIGRIGNA VERBS USING HYB...
 
Nlp (1)
Nlp (1)Nlp (1)
Nlp (1)
 
Outlining Bangla Word Dictionary for Universal Networking Language
Outlining Bangla Word Dictionary for Universal Networking  LanguageOutlining Bangla Word Dictionary for Universal Networking  Language
Outlining Bangla Word Dictionary for Universal Networking Language
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala language
 
Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)
 
IMPLEMENTATION OF NLIZATION FRAMEWORK FOR VERBS, PRONOUNS AND DETERMINERS WIT...
IMPLEMENTATION OF NLIZATION FRAMEWORK FOR VERBS, PRONOUNS AND DETERMINERS WIT...IMPLEMENTATION OF NLIZATION FRAMEWORK FOR VERBS, PRONOUNS AND DETERMINERS WIT...
IMPLEMENTATION OF NLIZATION FRAMEWORK FOR VERBS, PRONOUNS AND DETERMINERS WIT...
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 

More from Dr. Amit Kumar Jha

राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदानराजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदानDr. Amit Kumar Jha
 
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्रभारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्रDr. Amit Kumar Jha
 
Hindi Language and Information Technology
Hindi Language and Information TechnologyHindi Language and Information Technology
Hindi Language and Information TechnologyDr. Amit Kumar Jha
 
Information Management System Rajbhasha
Information Management System RajbhashaInformation Management System Rajbhasha
Information Management System RajbhashaDr. Amit Kumar Jha
 
कंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्यकंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्यDr. Amit Kumar Jha
 
Clickable Language Map of India
Clickable Language Map of IndiaClickable Language Map of India
Clickable Language Map of IndiaDr. Amit Kumar Jha
 
Machine translation And Anusaaraka
Machine translation And AnusaarakaMachine translation And Anusaaraka
Machine translation And AnusaarakaDr. Amit Kumar Jha
 
Scientific Research methodology
Scientific Research methodologyScientific Research methodology
Scientific Research methodologyDr. Amit Kumar Jha
 
LingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical LinguisticsLingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical LinguisticsDr. Amit Kumar Jha
 
कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ Dr. Amit Kumar Jha
 

More from Dr. Amit Kumar Jha (20)

E learning app development
E learning app developmentE learning app development
E learning app development
 
Maithili Text-to-Speech
Maithili Text-to-SpeechMaithili Text-to-Speech
Maithili Text-to-Speech
 
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदानराजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
 
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्रभारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
 
Hindi Language and Information Technology
Hindi Language and Information TechnologyHindi Language and Information Technology
Hindi Language and Information Technology
 
Information Management System Rajbhasha
Information Management System RajbhashaInformation Management System Rajbhasha
Information Management System Rajbhasha
 
Morphology
MorphologyMorphology
Morphology
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Microsoft office & Internet
Microsoft office & InternetMicrosoft office & Internet
Microsoft office & Internet
 
कंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्यकंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्य
 
Clickable Language Map of India
Clickable Language Map of IndiaClickable Language Map of India
Clickable Language Map of India
 
Machine translation And Anusaaraka
Machine translation And AnusaarakaMachine translation And Anusaaraka
Machine translation And Anusaaraka
 
Networking and Topology
Networking and TopologyNetworking and Topology
Networking and Topology
 
Scientific Research methodology
Scientific Research methodologyScientific Research methodology
Scientific Research methodology
 
LingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical LinguisticsLingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical Linguistics
 
लिनक्स (Linux)
लिनक्स (Linux) लिनक्स (Linux)
लिनक्स (Linux)
 
कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ
 
Online Examination Portal
Online Examination PortalOnline Examination Portal
Online Examination Portal
 
Information engineering
Information engineeringInformation engineering
Information engineering
 
E-R Diagram
E-R DiagramE-R Diagram
E-R Diagram
 

Recently uploaded

Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 

Recently uploaded (20)

Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 

Role of language engineering to preserve endangered languages

  • 1. Role of Language Engineering to Preserve Endangered Languages Amit Kumar Jha1, Sumit Kumar Gupta2, Piyush Pratap Singh3 Centre for Information and Language Engineering Mahatma Gandhi Antarrashtriya Hindi Vishwvidyalaya , Wardha (MS) Abstract: Endangered language (EL) is the language community incorporates less number of speakers of that particular language. EL is likely to become extinct in the near future. Many languages are failing out of use and being substituted by others is more widely used in the region or nation. Language Engineering (LE) is the subfield of computer science which explores the field of language related software and its feasible hardware development. With the help of language engineering, man and machine interface can be designed to preserve for longer time. The paper also states about how the applications of language engineering contributes the significant role to preserve Endangered Language (EL). Documentation is the primary task to preserve in appropriate shape to move ahead with time frame. This paper also described that how the LE makes easy to the process of language documentation. Being assisted of Digital data, we preserve EL due to durability of digital data more than others type of data. There are some languages whose literature is linguistically very rich but the number of speaker is countable, such as Nihali, language which also count in future as EL. Therefore for preserving these types of EL, the documentation and Digitalization of the EL is quite crucial, which proliferate the ease of access of EL to major class of people. Keywords: Language Engineering, Endangered Language, Language Documentation, Man machine interaction, Speech technology. Introduction: Language Engineering (LE) is the subfield of computer science which explores the field of language related software and its feasible hardware development. Language Engineering Computer Science Engineering Software Engineering Hardware Engineering Language RelatedSoftware Language RelatedHardware
  • 2. In other words, Language Engineering is the application of knowledge of language to the development of computer systems which can recognise, understand, interpret, and generate human language in all its forms. In practice, Language Engineering comprises a set of techniques and language resources. The former are implemented in computer software and the latter are a repository of knowledge which can be accessed by computer software. The ultimate goal of LE is to develop a machine which is able to understand and generate natural language. This language may be endangered. Two types of processing is conduct in LE – Text Processing and Speech Processing. As part of an important effort to document endangered languages before they become extinct. A variety of endangered language repositories have emerged to provided shared locations for field linguists to store data. These repositories vary greatly in the way their collections are organized and in the metadata they collect from depositors. The availability of collections of low resource language data has the potential for some interesting tasks in Natural Language and Speech Processing. However, the variety of formats in these repositories makes it difficult to know how much of the available data is suitable for such a task. The world is experiencing an unprecedented wave of language extinctions. There are between 6,000 and 7,000 languages currently spoken, and between 50 to 90 per cent of those will be extinct by the year 2100. Language extinction results in loss of cultural identities, knowledge systems, and the variety of data needed to understand the structure of language in the mind. Documenting endangered languages preserves data and stimulates language maintenance and revitalisation. If a person knows more than two languages his thinking and reasoning capacity is more. The loss of speakers in one language is the gain of speakers of another language, except for cases of genocide. Languages are generally replaced when an entire speech community shifts to another language. Replacing languages are very often official state languages. A language becomes endangered when the language does not transfer to the next generation. There are some languages which has a number of speakers of old age but that language didn’t transferred to the new or next generation then that language become endangered. As it is known to all that Indian society is the society of multi-language. So being a speaker of the language every person should transfer their own language to the next generation. Application of Language Engineering: The applications of Language Engineering are divided in two groups – Text Processing application and Speech Processing application. The applications of language engineering are as follow- ➢ Speech Generation– With the help of language engineering we can generate the speech of Endangered Language by a machine. If a machine will be able to generate EL then we can preserve that Language. Speech Generation is the application area of LE, Which is used to generate the speech of any natural language. ➢ Language Translator– Language translator is the application of LE. Language translator or Machine translator is a machine which is able to translate one language
  • 3. to another language. The first language is called source language and the second language is called the target language. If the Source language or the target language is EL, EL is preventing by this Language Translator system.. If the Endangered language translator is developed then that language may be used for a long time. ➢ Speech-to-Text– It is the process of converting speech to text. This is the task of documentation. If we convert speech file to text file of EL then we preserve that language. ➢ Text-to-Speech – Text-to-speech system is the system in which text data is input and it return speech data as output. It plays important role in Man-Machine interaction. ➢ Language Teaching- Language Teaching is the process of teaching a language. With the help of LE we can create a system for teaching a language. If EL teaching system is created EL may be preserve. As it is known that there are some language which has the speakers of old age and this language doesn’t transfer to the next generation. After some that language becomes dead. To preserve this language this system is important. ➢ Transcription Tool -Transcription is the process in which one script to another script. A person which is unknown to a specific language, its script and pronunciation, the role of Transcription tool is important in this context. If Transcription tool for an EL will be developed then we increase the number of people to understand that language. ➢ Text or Document summarization–It is one of the earliest applications of discourse structure analysis. it is used to summarize the text or Document if we summarize a text than it is easy to read and understand. ➢ Information Extraction - The task of information extraction is to extract from text- named entities13 relations that hold between them, and event structures in which they play a role. IE systems focus on specific domains (e.g., terrorist incidents) or specific types of relations (e.g., people and their dates of birth, protein–protein interactions). Event structures are often described by templates in IE, where the named entities to be extracted fill in specific slots. ➢ Speaker Identification and Verification Speech Recognition - Speaker Identification and verification means to identify and verify that what is speaking. This is an important application in forensic science. ➢ Speech Recognition – Speech Recognition is the application of language engineering. It identifies what is to speak. ➢ Character Image Recognition – Character Image Recognition, recognize the character in an image file. This is the application area of LE. It is used to captcha recognition. ➢ Segmentation – Segmentation is a task to segment the natural language to its consistent part i.e. we can segment the paragraph of natural language into sentences, phrases, words and syllables. ➢ Question-Answering System – Question-Answering system is an application of LE. If we asked a question to this system than this system return an appropriate answer of this question. We can asked the question in natural language. If we have the facility to asked the question in EL, then the speaker of EL feel comfort to asked the question.
  • 4. ➢ Word sense Disambiguation – As it is known that there are some words in all natural languages which has more meaning, it is called ambiguity. To resolve this problem is called Disambiguation. LE try to develop a system which is able to disambiguate the sense of the words, this system is called word sense disambiguation. To preserve EL the role of language engineering is very important. Language documentation is the main process to preserve EL. Language Documentation is the process in which the speech and text corpus of that language is collected. For collecting speech corpus of endangered language the researcher has gone to the field where the speakers of endangered language live. They record the sound in natural environment. After recording the sound file they analyze that sound files. If a Linguist researchers want to document one of the larger languages such as English, Chinese, Hindi etc., they can rely on already existing data and quite easily find samples of written and spoken language from which they could build up their documentation: books, newspapers and other written documents from the past and the present, many of these already digitalized, television and radio shows that can be recorded or simply downloaded from the Internet, language used in Internet forums and other social media, and many more. Because computers, the Internet and recording devices are widely available, the amount of such data and its accessibility is growing rapidly. For endangered languages, the situation is often completely different. Many of these languages do not have a written tradition and written data may be completely unavailable or sparse, the languages are not used in the media, or their speakers do not use the Internet (and if they do, they often use another language). In such cases, linguists must start from scratch and collect as much data as possible by recording speakers of a given language. Ideally, language documentation contains representative samples from different speakers – representing different age groups, different professions, of both sexes, and different origins –, but in the case of endangered languages this may not be possible, because the number of speakers is too small and/or there are only elder speakers.An important issue apart from the number of speakers and amount of data concerns the communication between the linguists or other researchers who want to document a language, and the language community. In the case of endangered or minority languages, the documenters often are outsiders, not members of the community. They may not be fluent speakers of the language in question and can communicate with the speakers in a second or a third language. This often leads to an unnatural use of the language that is to be documented. To preserve an endangered language, digitalization of EL is necessary like language documentation. Digitalization is the process in which data is the store in the form of digital. The durability of digital data is more than others types of data. To preserve EL by Digitalization we convert and store data in digital form i.e. text, sound, image etc. The researchers should create study material of EL in digital form. Systematic study of a system is called Engineering. Engineering is the process through which any task become easy and efficient. So we engineered the language.
  • 5. Reference List : 1. B. WEBBER, M. EGG and V. KORDONI (2012). Discourse structure and language technology. Natural Language Engineering 2. Jurafsky, Martin (et.al. ) Speech and Language Processing.Prentice Hall, Englewood Cliffs, New Jersey 07632 3. Reiter, E. and Dale, R. (2000). Building Natural Language Generation Systems.Cambridge University Press, Cambridge. 4. Yarowsky, D. (1996). Homograph disambiguation in text-to-speech synthesis. InProgress in Speech Synthesis, pp. 159–175. Springer-Verlag, Berlin. 5. Small, S. L. and Rieger, C. (1982). Parsing and comprehending withWord Experts.In Lehnert,W. G. and Ringle, M. H. (Eds.), Strategies for Natural Language Processing,pp. 89–147. Lawrence Erlbaum, New Jersey. 6. www.sppel.org