SlideShare a Scribd company logo
Role of Language Engineering
to
Preserve Endangered Languages
Amit Kumar Jha
Ph.D. (Informatics and Language Engineering)
School of Language, MGAHV, Wardha
Sumit Kumar Gupta
MILE, School of Language,
MGAHV, Wardha
National Conference on the Approaches & the Methodologies on the Study of Indegnous & Endangered Language
Dr. Piyush Pratap Singh
Asst. Professor
School of Language
MGAHV, Wardha
Endangered Language
• Endangered language (EL) is the language community incorporates less
number of speakers of that particular language.
• EL is likely to become extinct in the near future. Many languages are failing
out of use and being substituted by others is more widely used in the region
or nation.
Language Engineering
• Language Engineering (LE) is the subfield of computer science which
explores the field of language related software and its feasible hardware
development.
Language Engineering
Goal of Language Engineering
• The ultimate goal of LE is to develop a machine which is able to understand
and generate natural language.
• If the Approaches of LE implements on EL, then EL may be Preserve.
Language Endangered
• The loss of speakers in one language is the gain of speakers of another
language, except for cases of genocide. Languages are generally replaced
when an entire speech community shifts to another language. Replacing
languages are very often official state languages.
• The world is experiencing an unprecedented wave of language extinctions.
There are between 6,000 and 7,000 languages currently spoken, and between
50 to 90 per cent of those will be extinct by the year 2100.
Language Extinction Results
• Language extinction results in loss of cultural identities, knowledge systems,
and the variety of data needed to understand the structure of language in the
mind.
• Documenting endangered languages preserves data and stimulates language
maintenance and revitalisation.
Language Documentation
• Many of these languages do not have a written tradition and written data may be completely
unavailable or sparse, the languages are not used in the media, or their speakers do not use the Internet
(and if they do, they often use another language). In such cases, linguists must start from scratch and
collect as much data as possible by recording speakers of a given language.
• Ideally, language documentation contains representative samples from different speakers – representing
different age groups, different professions, of both sexes, and different origins –, but in the case of
endangered languages this may not be possible, because the number of speakers is too small and/or
there are only elder speakers. An important issue apart from the number of speakers and amount of
data concerns the communication between the linguists or other researchers who want to document a
language, and the language community.
Language Documentation
• In the case of endangered or minority languages, the documenters often are outsiders, not members of
the community. They may not be fluent speakers of the language in question and can communicate
with the speakers in a second or a third language. This often leads to an unnatural use of the language
that is to be documented.
Digitalization
• Digitlizaion is the process in which data is the store in the form of digital.
The durability of digital data is more than others types of data. To preserve
EL by Digitaliztion we convert and store data in digital forrm i.e. text, sound,
image etc. The researchers should create study meterial of EL in digital
form.
Application of Language Engineering
• Speech Generation
• Language Translator
• Speech-to-Text
• Text-to-Speech
• Langauge Teaching
• Translitration Tool
Application of Language Engineering...
• Speaker Identification
• Verification Speech Recognition
• Character and Document Image Recognition
• Question-Answering System
• Word sense Disambiguation
• Information retrieval and Information Extraction
• Film Production and Dialogue Debbing
Speech Generation
• With the help of language engineering we can generate the speech of
Endangered Language by a machine. If a machie will be able to generate EL
then we can preserve that Language.
Language Translator
• Language translator or Machine translator is a machine which is able to
translate one language to another language. The first language is called source
language and the second language is called the target language. If the Source
language or the target language is EL, EL is preventing by this Language
Translator system.
Speech-to-Text
• It is the process of converting speech to text. This is the task of
documentation. If we convert speech file to text file of EL then we preserve
that language.
Language Translator
• Language translator or Machine translator is a machine which is able to
translate one language to another language. The first language is called sourse
language and the second language is called the target language. If the Sourse
language or the target language is EL, EL is prevent by this Language
Translator system.
Transcription Tool
• Transcription is the process in which one script to another script.
• A person which is unknown to a specific language, its script and
pronunciation, the role of Transcription tool is importnat in this context.
• If Transcription tool for an EL will be developed then we increase the
number of people to understand that language.
Text-to-Speech
• Text-to-speech system is the system in which text data is input and it return
speech data as output. It plays important role in Man-Machine interaction.
Langauge Teaching
• Language Teaching is the process of teaching a language. With the help of
LE we can create a system for teaching a language. If EL teaching system is
created EL may be preseve. As it is known that there are some language
which has the speakers of old age and this language doesn’t transfer to the
next generation. After some that language becomes dead. To preserve this
language this system is important.
Question Answering System
• Question-Answering system is a Natural Language Processing system. If a
person ask a question to the system, system returns the answer of that
question.
Extinct Language
• An endangered language is a language that is at a risk of falling out of use,
generally because it has few surviving speakers. If it loses all of its native
speakers, it becomes an extinct language.
Levels of Endangerement
• UNESCO defines four levels of language endangerment between "safe" (not
endangered) and "extinct":
1. Vulnerable
2. Definitely endangered
3. Severely endangered
4. Critically endangered
EL in India
• Indian Goverment started a scheme to preseve EL the name of this Scheme
is SPPEL(Scheme for Protection and Preservation of Endangered
Languages).
• The SPPEL has listed 117 languages to be documented in its current phase.
The Languages are some of lesser known Indian languages which are spoken
by less than 10,000 speakers.
Refrence
• Refrence List :
• B. WEBBER, M. EGG and V. KORDONI (2012). Discourse structure and language technology. Natural Language
Engineering
• Jurafsky, Martin (et.al. ) Sppech and Language Processing. Prentice Hall, Englewood Cliffs, New Jersey 07632
• Reiter, E. and Dale, R. (2000). Building Natural Language Generation Systems. Cambridge University Press, Cambridge.
• Yarowsky, D. (1996). Homograph disambiguation in text-to-speech synthesis. In Progress in Speech Synthesis, pp. 159–175.
Springer-Verlag, Berlin.
• Small, S. L. and Rieger, C. (1982). Parsing and comprehending withWord Experts. In Lehnert,W. G. and Ringle, M. H.
(Eds.), Strategies for Natural Language Processing, pp. 89–147. Lawrence Erlbaum, New Jersey.
• www.sppel.org
Thanks A Lot......

More Related Content

What's hot

Applied Linguistics
Applied LinguisticsApplied Linguistics
Applied Linguistics
Marisol Smith
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
anshu shrivastava
 
Psycholinguistics
PsycholinguisticsPsycholinguistics
PsycholinguisticsRaul Vargas
 
Linguistics
LinguisticsLinguistics
Linguistics
Dieudonnesawadogo1
 
Interlanguage errors
Interlanguage errorsInterlanguage errors
Interlanguage errorsShona Whyte
 
Code Switching: a paper by Krishna Bista
Code Switching: a paper by Krishna BistaCode Switching: a paper by Krishna Bista
Code Switching: a paper by Krishna BistaAna Azevedo
 
Chapter 5
Chapter 5Chapter 5
Chapter 5
Mona Dabagh
 
SLA-Inter-language presentation
SLA-Inter-language presentationSLA-Inter-language presentation
SLA-Inter-language presentation
amorenaz
 
10 symptoms of a multilingual brain
10 symptoms of a multilingual brain10 symptoms of a multilingual brain
10 symptoms of a multilingual brain
HERybol
 
Presentation on code switching
Presentation on code switchingPresentation on code switching
Presentation on code switchingRamshankar Yadav
 
Interlanguage Analysis of Spanish Learners
Interlanguage Analysis of Spanish LearnersInterlanguage Analysis of Spanish Learners
Interlanguage Analysis of Spanish Learners
Ardiansyah -
 
(Applied linguistics) cook's book ch 8
(Applied linguistics) cook's book ch 8(Applied linguistics) cook's book ch 8
(Applied linguistics) cook's book ch 8
VivaAs
 
Code switching &; code mixing
Code switching &; code mixingCode switching &; code mixing
Code switching &; code mixing
Youshaib Alam
 
Code switching
Code switchingCode switching
Code switchingmaiye
 
Transfer Analysis in Applied Linguistics
Transfer Analysis in Applied LinguisticsTransfer Analysis in Applied Linguistics
Transfer Analysis in Applied Linguistics
Hikmah Pravitasari
 
FUNCTIONS OF ENGLISH AS A LANGUAGE
FUNCTIONS OF ENGLISH AS A LANGUAGEFUNCTIONS OF ENGLISH AS A LANGUAGE
FUNCTIONS OF ENGLISH AS A LANGUAGE
Karthika Shibu
 
The Pedagogical Aspects of Philippine English
The Pedagogical Aspects of Philippine EnglishThe Pedagogical Aspects of Philippine English
The Pedagogical Aspects of Philippine EnglishCy Bonaobra
 
Code Switching, Types and Reasons
Code Switching, Types and ReasonsCode Switching, Types and Reasons
Code Switching, Types and Reasons
Sohail Khan
 
Code switching presentation387
Code switching presentation387Code switching presentation387
Code switching presentation387
helalsaid
 

What's hot (20)

Applied Linguistics
Applied LinguisticsApplied Linguistics
Applied Linguistics
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Psycholinguistics
PsycholinguisticsPsycholinguistics
Psycholinguistics
 
Linguistics
LinguisticsLinguistics
Linguistics
 
Interlanguage errors
Interlanguage errorsInterlanguage errors
Interlanguage errors
 
Code Switching: a paper by Krishna Bista
Code Switching: a paper by Krishna BistaCode Switching: a paper by Krishna Bista
Code Switching: a paper by Krishna Bista
 
Chapter 5
Chapter 5Chapter 5
Chapter 5
 
SLA-Inter-language presentation
SLA-Inter-language presentationSLA-Inter-language presentation
SLA-Inter-language presentation
 
10 symptoms of a multilingual brain
10 symptoms of a multilingual brain10 symptoms of a multilingual brain
10 symptoms of a multilingual brain
 
Presentation on code switching
Presentation on code switchingPresentation on code switching
Presentation on code switching
 
Interlanguage Analysis of Spanish Learners
Interlanguage Analysis of Spanish LearnersInterlanguage Analysis of Spanish Learners
Interlanguage Analysis of Spanish Learners
 
Linguistic factors presentation
Linguistic factors presentationLinguistic factors presentation
Linguistic factors presentation
 
(Applied linguistics) cook's book ch 8
(Applied linguistics) cook's book ch 8(Applied linguistics) cook's book ch 8
(Applied linguistics) cook's book ch 8
 
Code switching &; code mixing
Code switching &; code mixingCode switching &; code mixing
Code switching &; code mixing
 
Code switching
Code switchingCode switching
Code switching
 
Transfer Analysis in Applied Linguistics
Transfer Analysis in Applied LinguisticsTransfer Analysis in Applied Linguistics
Transfer Analysis in Applied Linguistics
 
FUNCTIONS OF ENGLISH AS A LANGUAGE
FUNCTIONS OF ENGLISH AS A LANGUAGEFUNCTIONS OF ENGLISH AS A LANGUAGE
FUNCTIONS OF ENGLISH AS A LANGUAGE
 
The Pedagogical Aspects of Philippine English
The Pedagogical Aspects of Philippine EnglishThe Pedagogical Aspects of Philippine English
The Pedagogical Aspects of Philippine English
 
Code Switching, Types and Reasons
Code Switching, Types and ReasonsCode Switching, Types and Reasons
Code Switching, Types and Reasons
 
Code switching presentation387
Code switching presentation387Code switching presentation387
Code switching presentation387
 

Similar to Role of Language Engineering to Preserve Endangered Language

Applied linguistics presentation
Applied linguistics  presentationApplied linguistics  presentation
Applied linguistics presentation
Muhammad Furqan
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
kevig
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
kevig
 
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
RAtna29
 
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdfApplied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Dr.Badriya Al Mamari
 
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
IOSR Journals
 
Linguistics curriculum 001
Linguistics curriculum 001Linguistics curriculum 001
Linguistics curriculum 001
Mohammed Abdlhameed
 
B0340710
B0340710B0340710
B0340710
iosrjournals
 
L1 nlp intro
L1 nlp introL1 nlp intro
L1 nlp intro
Harshit Yadav
 
Week 1 an introduction to the course.pptx
Week 1 an introduction to the course.pptxWeek 1 an introduction to the course.pptx
Week 1 an introduction to the course.pptx
zeinebayachi2
 
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
Syeful Islam
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
kevig
 
American Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech RecognitionAmerican Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech Recognition
paperpublications3
 
What is linguistics.pdf
What is linguistics.pdfWhat is linguistics.pdf
What is linguistics.pdf
ssuser958503
 
1.pdf
1.pdf1.pdf
1.pdf
enat6502
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala language
ijnlc
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
ijnlc
 
English as second and foreign language
English as second and foreign languageEnglish as second and foreign language
English as second and foreign language
ANJU A
 
Natural language wikipedia
Natural language   wikipediaNatural language   wikipedia
Natural language wikipedia
Tên Họ Và
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Waqas Tariq
 

Similar to Role of Language Engineering to Preserve Endangered Language (20)

Applied linguistics presentation
Applied linguistics  presentationApplied linguistics  presentation
Applied linguistics presentation
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
 
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
 
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdfApplied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
 
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
 
Linguistics curriculum 001
Linguistics curriculum 001Linguistics curriculum 001
Linguistics curriculum 001
 
B0340710
B0340710B0340710
B0340710
 
L1 nlp intro
L1 nlp introL1 nlp intro
L1 nlp intro
 
Week 1 an introduction to the course.pptx
Week 1 an introduction to the course.pptxWeek 1 an introduction to the course.pptx
Week 1 an introduction to the course.pptx
 
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 
American Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech RecognitionAmerican Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech Recognition
 
What is linguistics.pdf
What is linguistics.pdfWhat is linguistics.pdf
What is linguistics.pdf
 
1.pdf
1.pdf1.pdf
1.pdf
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala language
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 
English as second and foreign language
English as second and foreign languageEnglish as second and foreign language
English as second and foreign language
 
Natural language wikipedia
Natural language   wikipediaNatural language   wikipedia
Natural language wikipedia
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 

More from Dr. Amit Kumar Jha

E learning app development
E learning app developmentE learning app development
E learning app development
Dr. Amit Kumar Jha
 
Maithili Text-to-Speech
Maithili Text-to-SpeechMaithili Text-to-Speech
Maithili Text-to-Speech
Dr. Amit Kumar Jha
 
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदानराजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
Dr. Amit Kumar Jha
 
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्रभारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
Dr. Amit Kumar Jha
 
Hindi Language and Information Technology
Hindi Language and Information TechnologyHindi Language and Information Technology
Hindi Language and Information Technology
Dr. Amit Kumar Jha
 
Information Management System Rajbhasha
Information Management System RajbhashaInformation Management System Rajbhasha
Information Management System Rajbhasha
Dr. Amit Kumar Jha
 
Morphology
MorphologyMorphology
Morphology
Dr. Amit Kumar Jha
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
Dr. Amit Kumar Jha
 
Microsoft office & Internet
Microsoft office & InternetMicrosoft office & Internet
Microsoft office & Internet
Dr. Amit Kumar Jha
 
कंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्यकंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्य
Dr. Amit Kumar Jha
 
Clickable Language Map of India
Clickable Language Map of IndiaClickable Language Map of India
Clickable Language Map of India
Dr. Amit Kumar Jha
 
Machine translation And Anusaaraka
Machine translation And AnusaarakaMachine translation And Anusaaraka
Machine translation And Anusaaraka
Dr. Amit Kumar Jha
 
Networking and Topology
Networking and TopologyNetworking and Topology
Networking and Topology
Dr. Amit Kumar Jha
 
Scientific Research methodology
Scientific Research methodologyScientific Research methodology
Scientific Research methodology
Dr. Amit Kumar Jha
 
LingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical LinguisticsLingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical Linguistics
Dr. Amit Kumar Jha
 
लिनक्स (Linux)
लिनक्स (Linux) लिनक्स (Linux)
लिनक्स (Linux)
Dr. Amit Kumar Jha
 
कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ
Dr. Amit Kumar Jha
 
Online Examination Portal
Online Examination PortalOnline Examination Portal
Online Examination Portal
Dr. Amit Kumar Jha
 
Information engineering
Information engineeringInformation engineering
Information engineering
Dr. Amit Kumar Jha
 
Language engineering
Language engineeringLanguage engineering
Language engineering
Dr. Amit Kumar Jha
 

More from Dr. Amit Kumar Jha (20)

E learning app development
E learning app developmentE learning app development
E learning app development
 
Maithili Text-to-Speech
Maithili Text-to-SpeechMaithili Text-to-Speech
Maithili Text-to-Speech
 
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदानराजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
 
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्रभारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
 
Hindi Language and Information Technology
Hindi Language and Information TechnologyHindi Language and Information Technology
Hindi Language and Information Technology
 
Information Management System Rajbhasha
Information Management System RajbhashaInformation Management System Rajbhasha
Information Management System Rajbhasha
 
Morphology
MorphologyMorphology
Morphology
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Microsoft office & Internet
Microsoft office & InternetMicrosoft office & Internet
Microsoft office & Internet
 
कंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्यकंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्य
 
Clickable Language Map of India
Clickable Language Map of IndiaClickable Language Map of India
Clickable Language Map of India
 
Machine translation And Anusaaraka
Machine translation And AnusaarakaMachine translation And Anusaaraka
Machine translation And Anusaaraka
 
Networking and Topology
Networking and TopologyNetworking and Topology
Networking and Topology
 
Scientific Research methodology
Scientific Research methodologyScientific Research methodology
Scientific Research methodology
 
LingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical LinguisticsLingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical Linguistics
 
लिनक्स (Linux)
लिनक्स (Linux) लिनक्स (Linux)
लिनक्स (Linux)
 
कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ
 
Online Examination Portal
Online Examination PortalOnline Examination Portal
Online Examination Portal
 
Information engineering
Information engineeringInformation engineering
Information engineering
 
Language engineering
Language engineeringLanguage engineering
Language engineering
 

Recently uploaded

5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
ihlasbinance2003
 
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
Mukeshwaran Balu
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Christina Lin
 
Building Electrical System Design & Installation
Building Electrical System Design & InstallationBuilding Electrical System Design & Installation
Building Electrical System Design & Installation
symbo111
 
sieving analysis and results interpretation
sieving analysis and results interpretationsieving analysis and results interpretation
sieving analysis and results interpretation
ssuser36d3051
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
Kamal Acharya
 
Swimming pool mechanical components design.pptx
Swimming pool  mechanical components design.pptxSwimming pool  mechanical components design.pptx
Swimming pool mechanical components design.pptx
yokeleetan1
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Soumen Santra
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
ClaraZara1
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
camseq
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
ChristineTorrepenida1
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
digital fundamental by Thomas L.floydl.pdf
digital fundamental by Thomas L.floydl.pdfdigital fundamental by Thomas L.floydl.pdf
digital fundamental by Thomas L.floydl.pdf
drwaing
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
SyedAbiiAzazi1
 
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdf
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdfTutorial for 16S rRNA Gene Analysis with QIIME2.pdf
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdf
aqil azizi
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
Rahul
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
PROJECT FORMAT FOR EVS AMITY UNIVERSITY GWALIOR.ppt
PROJECT FORMAT FOR EVS AMITY UNIVERSITY GWALIOR.pptPROJECT FORMAT FOR EVS AMITY UNIVERSITY GWALIOR.ppt
PROJECT FORMAT FOR EVS AMITY UNIVERSITY GWALIOR.ppt
bhadouriyakaku
 

Recently uploaded (20)

5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
 
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
 
Building Electrical System Design & Installation
Building Electrical System Design & InstallationBuilding Electrical System Design & Installation
Building Electrical System Design & Installation
 
sieving analysis and results interpretation
sieving analysis and results interpretationsieving analysis and results interpretation
sieving analysis and results interpretation
 
Water billing management system project report.pdf
Water billing management system project report.pdfWater billing management system project report.pdf
Water billing management system project report.pdf
 
Swimming pool mechanical components design.pptx
Swimming pool  mechanical components design.pptxSwimming pool  mechanical components design.pptx
Swimming pool mechanical components design.pptx
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
digital fundamental by Thomas L.floydl.pdf
digital fundamental by Thomas L.floydl.pdfdigital fundamental by Thomas L.floydl.pdf
digital fundamental by Thomas L.floydl.pdf
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
 
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdf
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdfTutorial for 16S rRNA Gene Analysis with QIIME2.pdf
Tutorial for 16S rRNA Gene Analysis with QIIME2.pdf
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
PROJECT FORMAT FOR EVS AMITY UNIVERSITY GWALIOR.ppt
PROJECT FORMAT FOR EVS AMITY UNIVERSITY GWALIOR.pptPROJECT FORMAT FOR EVS AMITY UNIVERSITY GWALIOR.ppt
PROJECT FORMAT FOR EVS AMITY UNIVERSITY GWALIOR.ppt
 

Role of Language Engineering to Preserve Endangered Language

  • 1. Role of Language Engineering to Preserve Endangered Languages Amit Kumar Jha Ph.D. (Informatics and Language Engineering) School of Language, MGAHV, Wardha Sumit Kumar Gupta MILE, School of Language, MGAHV, Wardha National Conference on the Approaches & the Methodologies on the Study of Indegnous & Endangered Language Dr. Piyush Pratap Singh Asst. Professor School of Language MGAHV, Wardha
  • 2. Endangered Language • Endangered language (EL) is the language community incorporates less number of speakers of that particular language. • EL is likely to become extinct in the near future. Many languages are failing out of use and being substituted by others is more widely used in the region or nation.
  • 3. Language Engineering • Language Engineering (LE) is the subfield of computer science which explores the field of language related software and its feasible hardware development.
  • 5. Goal of Language Engineering • The ultimate goal of LE is to develop a machine which is able to understand and generate natural language. • If the Approaches of LE implements on EL, then EL may be Preserve.
  • 6. Language Endangered • The loss of speakers in one language is the gain of speakers of another language, except for cases of genocide. Languages are generally replaced when an entire speech community shifts to another language. Replacing languages are very often official state languages. • The world is experiencing an unprecedented wave of language extinctions. There are between 6,000 and 7,000 languages currently spoken, and between 50 to 90 per cent of those will be extinct by the year 2100.
  • 7. Language Extinction Results • Language extinction results in loss of cultural identities, knowledge systems, and the variety of data needed to understand the structure of language in the mind. • Documenting endangered languages preserves data and stimulates language maintenance and revitalisation.
  • 8. Language Documentation • Many of these languages do not have a written tradition and written data may be completely unavailable or sparse, the languages are not used in the media, or their speakers do not use the Internet (and if they do, they often use another language). In such cases, linguists must start from scratch and collect as much data as possible by recording speakers of a given language. • Ideally, language documentation contains representative samples from different speakers – representing different age groups, different professions, of both sexes, and different origins –, but in the case of endangered languages this may not be possible, because the number of speakers is too small and/or there are only elder speakers. An important issue apart from the number of speakers and amount of data concerns the communication between the linguists or other researchers who want to document a language, and the language community.
  • 9. Language Documentation • In the case of endangered or minority languages, the documenters often are outsiders, not members of the community. They may not be fluent speakers of the language in question and can communicate with the speakers in a second or a third language. This often leads to an unnatural use of the language that is to be documented.
  • 10. Digitalization • Digitlizaion is the process in which data is the store in the form of digital. The durability of digital data is more than others types of data. To preserve EL by Digitaliztion we convert and store data in digital forrm i.e. text, sound, image etc. The researchers should create study meterial of EL in digital form.
  • 11. Application of Language Engineering • Speech Generation • Language Translator • Speech-to-Text • Text-to-Speech • Langauge Teaching • Translitration Tool
  • 12. Application of Language Engineering... • Speaker Identification • Verification Speech Recognition • Character and Document Image Recognition • Question-Answering System • Word sense Disambiguation • Information retrieval and Information Extraction • Film Production and Dialogue Debbing
  • 13. Speech Generation • With the help of language engineering we can generate the speech of Endangered Language by a machine. If a machie will be able to generate EL then we can preserve that Language.
  • 14. Language Translator • Language translator or Machine translator is a machine which is able to translate one language to another language. The first language is called source language and the second language is called the target language. If the Source language or the target language is EL, EL is preventing by this Language Translator system.
  • 15. Speech-to-Text • It is the process of converting speech to text. This is the task of documentation. If we convert speech file to text file of EL then we preserve that language.
  • 16. Language Translator • Language translator or Machine translator is a machine which is able to translate one language to another language. The first language is called sourse language and the second language is called the target language. If the Sourse language or the target language is EL, EL is prevent by this Language Translator system.
  • 17. Transcription Tool • Transcription is the process in which one script to another script. • A person which is unknown to a specific language, its script and pronunciation, the role of Transcription tool is importnat in this context. • If Transcription tool for an EL will be developed then we increase the number of people to understand that language.
  • 18. Text-to-Speech • Text-to-speech system is the system in which text data is input and it return speech data as output. It plays important role in Man-Machine interaction.
  • 19. Langauge Teaching • Language Teaching is the process of teaching a language. With the help of LE we can create a system for teaching a language. If EL teaching system is created EL may be preseve. As it is known that there are some language which has the speakers of old age and this language doesn’t transfer to the next generation. After some that language becomes dead. To preserve this language this system is important.
  • 20. Question Answering System • Question-Answering system is a Natural Language Processing system. If a person ask a question to the system, system returns the answer of that question.
  • 21. Extinct Language • An endangered language is a language that is at a risk of falling out of use, generally because it has few surviving speakers. If it loses all of its native speakers, it becomes an extinct language.
  • 22. Levels of Endangerement • UNESCO defines four levels of language endangerment between "safe" (not endangered) and "extinct": 1. Vulnerable 2. Definitely endangered 3. Severely endangered 4. Critically endangered
  • 23. EL in India • Indian Goverment started a scheme to preseve EL the name of this Scheme is SPPEL(Scheme for Protection and Preservation of Endangered Languages). • The SPPEL has listed 117 languages to be documented in its current phase. The Languages are some of lesser known Indian languages which are spoken by less than 10,000 speakers.
  • 24. Refrence • Refrence List : • B. WEBBER, M. EGG and V. KORDONI (2012). Discourse structure and language technology. Natural Language Engineering • Jurafsky, Martin (et.al. ) Sppech and Language Processing. Prentice Hall, Englewood Cliffs, New Jersey 07632 • Reiter, E. and Dale, R. (2000). Building Natural Language Generation Systems. Cambridge University Press, Cambridge. • Yarowsky, D. (1996). Homograph disambiguation in text-to-speech synthesis. In Progress in Speech Synthesis, pp. 159–175. Springer-Verlag, Berlin. • Small, S. L. and Rieger, C. (1982). Parsing and comprehending withWord Experts. In Lehnert,W. G. and Ringle, M. H. (Eds.), Strategies for Natural Language Processing, pp. 89–147. Lawrence Erlbaum, New Jersey. • www.sppel.org