SlideShare a Scribd company logo
1 of 25
Role of Language Engineering
to
Preserve Endangered Languages
Amit Kumar Jha
Ph.D. (Informatics and Language Engineering)
School of Language, MGAHV, Wardha
Sumit Kumar Gupta
MILE, School of Language,
MGAHV, Wardha
National Conference on the Approaches & the Methodologies on the Study of Indegnous & Endangered Language
Dr. Piyush Pratap Singh
Asst. Professor
School of Language
MGAHV, Wardha
Endangered Language
• Endangered language (EL) is the language community incorporates less
number of speakers of that particular language.
• EL is likely to become extinct in the near future. Many languages are failing
out of use and being substituted by others is more widely used in the region
or nation.
Language Engineering
• Language Engineering (LE) is the subfield of computer science which
explores the field of language related software and its feasible hardware
development.
Language Engineering
Goal of Language Engineering
• The ultimate goal of LE is to develop a machine which is able to understand
and generate natural language.
• If the Approaches of LE implements on EL, then EL may be Preserve.
Language Endangered
• The loss of speakers in one language is the gain of speakers of another
language, except for cases of genocide. Languages are generally replaced
when an entire speech community shifts to another language. Replacing
languages are very often official state languages.
• The world is experiencing an unprecedented wave of language extinctions.
There are between 6,000 and 7,000 languages currently spoken, and between
50 to 90 per cent of those will be extinct by the year 2100.
Language Extinction Results
• Language extinction results in loss of cultural identities, knowledge systems,
and the variety of data needed to understand the structure of language in the
mind.
• Documenting endangered languages preserves data and stimulates language
maintenance and revitalisation.
Language Documentation
• Many of these languages do not have a written tradition and written data may be completely
unavailable or sparse, the languages are not used in the media, or their speakers do not use the Internet
(and if they do, they often use another language). In such cases, linguists must start from scratch and
collect as much data as possible by recording speakers of a given language.
• Ideally, language documentation contains representative samples from different speakers – representing
different age groups, different professions, of both sexes, and different origins –, but in the case of
endangered languages this may not be possible, because the number of speakers is too small and/or
there are only elder speakers. An important issue apart from the number of speakers and amount of
data concerns the communication between the linguists or other researchers who want to document a
language, and the language community.
Language Documentation
• In the case of endangered or minority languages, the documenters often are outsiders, not members of
the community. They may not be fluent speakers of the language in question and can communicate
with the speakers in a second or a third language. This often leads to an unnatural use of the language
that is to be documented.
Digitalization
• Digitlizaion is the process in which data is the store in the form of digital.
The durability of digital data is more than others types of data. To preserve
EL by Digitaliztion we convert and store data in digital forrm i.e. text, sound,
image etc. The researchers should create study meterial of EL in digital
form.
Application of Language Engineering
• Speech Generation
• Language Translator
• Speech-to-Text
• Text-to-Speech
• Langauge Teaching
• Translitration Tool
Application of Language Engineering...
• Speaker Identification
• Verification Speech Recognition
• Character and Document Image Recognition
• Question-Answering System
• Word sense Disambiguation
• Information retrieval and Information Extraction
• Film Production and Dialogue Debbing
Speech Generation
• With the help of language engineering we can generate the speech of
Endangered Language by a machine. If a machie will be able to generate EL
then we can preserve that Language.
Language Translator
• Language translator or Machine translator is a machine which is able to
translate one language to another language. The first language is called source
language and the second language is called the target language. If the Source
language or the target language is EL, EL is preventing by this Language
Translator system.
Speech-to-Text
• It is the process of converting speech to text. This is the task of
documentation. If we convert speech file to text file of EL then we preserve
that language.
Language Translator
• Language translator or Machine translator is a machine which is able to
translate one language to another language. The first language is called sourse
language and the second language is called the target language. If the Sourse
language or the target language is EL, EL is prevent by this Language
Translator system.
Transcription Tool
• Transcription is the process in which one script to another script.
• A person which is unknown to a specific language, its script and
pronunciation, the role of Transcription tool is importnat in this context.
• If Transcription tool for an EL will be developed then we increase the
number of people to understand that language.
Text-to-Speech
• Text-to-speech system is the system in which text data is input and it return
speech data as output. It plays important role in Man-Machine interaction.
Langauge Teaching
• Language Teaching is the process of teaching a language. With the help of
LE we can create a system for teaching a language. If EL teaching system is
created EL may be preseve. As it is known that there are some language
which has the speakers of old age and this language doesn’t transfer to the
next generation. After some that language becomes dead. To preserve this
language this system is important.
Question Answering System
• Question-Answering system is a Natural Language Processing system. If a
person ask a question to the system, system returns the answer of that
question.
Extinct Language
• An endangered language is a language that is at a risk of falling out of use,
generally because it has few surviving speakers. If it loses all of its native
speakers, it becomes an extinct language.
Levels of Endangerement
• UNESCO defines four levels of language endangerment between "safe" (not
endangered) and "extinct":
1. Vulnerable
2. Definitely endangered
3. Severely endangered
4. Critically endangered
EL in India
• Indian Goverment started a scheme to preseve EL the name of this Scheme
is SPPEL(Scheme for Protection and Preservation of Endangered
Languages).
• The SPPEL has listed 117 languages to be documented in its current phase.
The Languages are some of lesser known Indian languages which are spoken
by less than 10,000 speakers.
Refrence
• Refrence List :
• B. WEBBER, M. EGG and V. KORDONI (2012). Discourse structure and language technology. Natural Language
Engineering
• Jurafsky, Martin (et.al. ) Sppech and Language Processing. Prentice Hall, Englewood Cliffs, New Jersey 07632
• Reiter, E. and Dale, R. (2000). Building Natural Language Generation Systems. Cambridge University Press, Cambridge.
• Yarowsky, D. (1996). Homograph disambiguation in text-to-speech synthesis. In Progress in Speech Synthesis, pp. 159–175.
Springer-Verlag, Berlin.
• Small, S. L. and Rieger, C. (1982). Parsing and comprehending withWord Experts. In Lehnert,W. G. and Ringle, M. H.
(Eds.), Strategies for Natural Language Processing, pp. 89–147. Lawrence Erlbaum, New Jersey.
• www.sppel.org
Thanks A Lot......

More Related Content

What's hot

Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionanshu shrivastava
 
Psycholinguistics
PsycholinguisticsPsycholinguistics
PsycholinguisticsRaul Vargas
 
Interlanguage errors
Interlanguage errorsInterlanguage errors
Interlanguage errorsShona Whyte
 
Code Switching: a paper by Krishna Bista
Code Switching: a paper by Krishna BistaCode Switching: a paper by Krishna Bista
Code Switching: a paper by Krishna BistaAna Azevedo
 
SLA-Inter-language presentation
SLA-Inter-language presentationSLA-Inter-language presentation
SLA-Inter-language presentationamorenaz
 
10 symptoms of a multilingual brain
10 symptoms of a multilingual brain10 symptoms of a multilingual brain
10 symptoms of a multilingual brainHERybol
 
Presentation on code switching
Presentation on code switchingPresentation on code switching
Presentation on code switchingRamshankar Yadav
 
Interlanguage Analysis of Spanish Learners
Interlanguage Analysis of Spanish LearnersInterlanguage Analysis of Spanish Learners
Interlanguage Analysis of Spanish LearnersArdiansyah -
 
(Applied linguistics) cook's book ch 8
(Applied linguistics) cook's book ch 8(Applied linguistics) cook's book ch 8
(Applied linguistics) cook's book ch 8VivaAs
 
Code switching &; code mixing
Code switching &; code mixingCode switching &; code mixing
Code switching &; code mixingYoushaib Alam
 
Code switching
Code switchingCode switching
Code switchingmaiye
 
Transfer Analysis in Applied Linguistics
Transfer Analysis in Applied LinguisticsTransfer Analysis in Applied Linguistics
Transfer Analysis in Applied LinguisticsHikmah Pravitasari
 
FUNCTIONS OF ENGLISH AS A LANGUAGE
FUNCTIONS OF ENGLISH AS A LANGUAGEFUNCTIONS OF ENGLISH AS A LANGUAGE
FUNCTIONS OF ENGLISH AS A LANGUAGEKarthika Shibu
 
The Pedagogical Aspects of Philippine English
The Pedagogical Aspects of Philippine EnglishThe Pedagogical Aspects of Philippine English
The Pedagogical Aspects of Philippine EnglishCy Bonaobra
 
Code Switching, Types and Reasons
Code Switching, Types and ReasonsCode Switching, Types and Reasons
Code Switching, Types and ReasonsSohail Khan
 
Code switching presentation387
Code switching presentation387Code switching presentation387
Code switching presentation387helalsaid
 

What's hot (20)

Applied Linguistics
Applied LinguisticsApplied Linguistics
Applied Linguistics
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Psycholinguistics
PsycholinguisticsPsycholinguistics
Psycholinguistics
 
Linguistics
LinguisticsLinguistics
Linguistics
 
Interlanguage errors
Interlanguage errorsInterlanguage errors
Interlanguage errors
 
Code Switching: a paper by Krishna Bista
Code Switching: a paper by Krishna BistaCode Switching: a paper by Krishna Bista
Code Switching: a paper by Krishna Bista
 
Chapter 5
Chapter 5Chapter 5
Chapter 5
 
SLA-Inter-language presentation
SLA-Inter-language presentationSLA-Inter-language presentation
SLA-Inter-language presentation
 
10 symptoms of a multilingual brain
10 symptoms of a multilingual brain10 symptoms of a multilingual brain
10 symptoms of a multilingual brain
 
Presentation on code switching
Presentation on code switchingPresentation on code switching
Presentation on code switching
 
Interlanguage Analysis of Spanish Learners
Interlanguage Analysis of Spanish LearnersInterlanguage Analysis of Spanish Learners
Interlanguage Analysis of Spanish Learners
 
Linguistic factors presentation
Linguistic factors presentationLinguistic factors presentation
Linguistic factors presentation
 
(Applied linguistics) cook's book ch 8
(Applied linguistics) cook's book ch 8(Applied linguistics) cook's book ch 8
(Applied linguistics) cook's book ch 8
 
Code switching &; code mixing
Code switching &; code mixingCode switching &; code mixing
Code switching &; code mixing
 
Code switching
Code switchingCode switching
Code switching
 
Transfer Analysis in Applied Linguistics
Transfer Analysis in Applied LinguisticsTransfer Analysis in Applied Linguistics
Transfer Analysis in Applied Linguistics
 
FUNCTIONS OF ENGLISH AS A LANGUAGE
FUNCTIONS OF ENGLISH AS A LANGUAGEFUNCTIONS OF ENGLISH AS A LANGUAGE
FUNCTIONS OF ENGLISH AS A LANGUAGE
 
The Pedagogical Aspects of Philippine English
The Pedagogical Aspects of Philippine EnglishThe Pedagogical Aspects of Philippine English
The Pedagogical Aspects of Philippine English
 
Code Switching, Types and Reasons
Code Switching, Types and ReasonsCode Switching, Types and Reasons
Code Switching, Types and Reasons
 
Code switching presentation387
Code switching presentation387Code switching presentation387
Code switching presentation387
 

Similar to Role of Language Engineering to Preserve Endangered Language

Applied linguistics presentation
Applied linguistics  presentationApplied linguistics  presentation
Applied linguistics presentationMuhammad Furqan
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogrikevig
 
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnRAtna29
 
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdfApplied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdfDr.Badriya Al Mamari
 
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...IOSR Journals
 
Week 1 an introduction to the course.pptx
Week 1 an introduction to the course.pptxWeek 1 an introduction to the course.pptx
Week 1 an introduction to the course.pptxzeinebayachi2
 
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...Syeful Islam
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...kevig
 
American Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech RecognitionAmerican Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech Recognitionpaperpublications3
 
What is linguistics.pdf
What is linguistics.pdfWhat is linguistics.pdf
What is linguistics.pdfssuser958503
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageijnlc
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...ijnlc
 
English as second and foreign language
English as second and foreign languageEnglish as second and foreign language
English as second and foreign languageANJU A
 
Natural language wikipedia
Natural language   wikipediaNatural language   wikipedia
Natural language wikipediaTên Họ Và
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.ppt
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.pptLARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.ppt
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.pptMonsefJraid
 

Similar to Role of Language Engineering to Preserve Endangered Language (20)

Applied linguistics presentation
Applied linguistics  presentationApplied linguistics  presentation
Applied linguistics presentation
 
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and DogriInvestigations of the Distributions of Phonemic Durations in Hindi and Dogri
Investigations of the Distributions of Phonemic Durations in Hindi and Dogri
 
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
NLP-ppt.pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
 
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdfApplied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
Applied Linguistics session 111 0_07_12_2021 Applied linguistics challenges.pdf
 
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
 
Linguistics curriculum 001
Linguistics curriculum 001Linguistics curriculum 001
Linguistics curriculum 001
 
B0340710
B0340710B0340710
B0340710
 
L1 nlp intro
L1 nlp introL1 nlp intro
L1 nlp intro
 
Week 1 an introduction to the course.pptx
Week 1 an introduction to the course.pptxWeek 1 an introduction to the course.pptx
Week 1 an introduction to the course.pptx
 
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 
American Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech RecognitionAmerican Standard Sign Language Representation Using Speech Recognition
American Standard Sign Language Representation Using Speech Recognition
 
What is linguistics.pdf
What is linguistics.pdfWhat is linguistics.pdf
What is linguistics.pdf
 
1.pdf
1.pdf1.pdf
1.pdf
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala language
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 
English as second and foreign language
English as second and foreign languageEnglish as second and foreign language
English as second and foreign language
 
Natural language wikipedia
Natural language   wikipediaNatural language   wikipedia
Natural language wikipedia
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.ppt
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.pptLARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.ppt
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.ppt
 

More from Dr. Amit Kumar Jha

राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदानराजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदानDr. Amit Kumar Jha
 
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्रभारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्रDr. Amit Kumar Jha
 
Hindi Language and Information Technology
Hindi Language and Information TechnologyHindi Language and Information Technology
Hindi Language and Information TechnologyDr. Amit Kumar Jha
 
Information Management System Rajbhasha
Information Management System RajbhashaInformation Management System Rajbhasha
Information Management System RajbhashaDr. Amit Kumar Jha
 
कंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्यकंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्यDr. Amit Kumar Jha
 
Clickable Language Map of India
Clickable Language Map of IndiaClickable Language Map of India
Clickable Language Map of IndiaDr. Amit Kumar Jha
 
Machine translation And Anusaaraka
Machine translation And AnusaarakaMachine translation And Anusaaraka
Machine translation And AnusaarakaDr. Amit Kumar Jha
 
Scientific Research methodology
Scientific Research methodologyScientific Research methodology
Scientific Research methodologyDr. Amit Kumar Jha
 
LingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical LinguisticsLingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical LinguisticsDr. Amit Kumar Jha
 
कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ Dr. Amit Kumar Jha
 

More from Dr. Amit Kumar Jha (20)

E learning app development
E learning app developmentE learning app development
E learning app development
 
Maithili Text-to-Speech
Maithili Text-to-SpeechMaithili Text-to-Speech
Maithili Text-to-Speech
 
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदानराजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
राजभाषा हिंदी के विकास में कंप्यूटर एवं प्रौद्योगिकी का योगदान
 
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्रभारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
भारतीय भाषाओं के लिए डिजिटल भाषिक मानचित्र
 
Hindi Language and Information Technology
Hindi Language and Information TechnologyHindi Language and Information Technology
Hindi Language and Information Technology
 
Information Management System Rajbhasha
Information Management System RajbhashaInformation Management System Rajbhasha
Information Management System Rajbhasha
 
Morphology
MorphologyMorphology
Morphology
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Microsoft office & Internet
Microsoft office & InternetMicrosoft office & Internet
Microsoft office & Internet
 
कंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्यकंप्यूटर पर हिंदी में कार्य
कंप्यूटर पर हिंदी में कार्य
 
Clickable Language Map of India
Clickable Language Map of IndiaClickable Language Map of India
Clickable Language Map of India
 
Machine translation And Anusaaraka
Machine translation And AnusaarakaMachine translation And Anusaaraka
Machine translation And Anusaaraka
 
Networking and Topology
Networking and TopologyNetworking and Topology
Networking and Topology
 
Scientific Research methodology
Scientific Research methodologyScientific Research methodology
Scientific Research methodology
 
LingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical LinguisticsLingPy : A Python Library for Historical Linguistics
LingPy : A Python Library for Historical Linguistics
 
लिनक्स (Linux)
लिनक्स (Linux) लिनक्स (Linux)
लिनक्स (Linux)
 
कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ कंप्यूटर की पीढ़ियाँ
कंप्यूटर की पीढ़ियाँ
 
Online Examination Portal
Online Examination PortalOnline Examination Portal
Online Examination Portal
 
Information engineering
Information engineeringInformation engineering
Information engineering
 
Language engineering
Language engineeringLanguage engineering
Language engineering
 

Recently uploaded

IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
pipeline in computer architecture design
pipeline in computer architecture  designpipeline in computer architecture  design
pipeline in computer architecture designssuser87fa0c1
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHC Sai Kiran
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
EduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIEduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIkoyaldeepu123
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 

Recently uploaded (20)

IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
pipeline in computer architecture design
pipeline in computer architecture  designpipeline in computer architecture  design
pipeline in computer architecture design
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECH
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
EduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIEduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AI
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 

Role of Language Engineering to Preserve Endangered Language

  • 1. Role of Language Engineering to Preserve Endangered Languages Amit Kumar Jha Ph.D. (Informatics and Language Engineering) School of Language, MGAHV, Wardha Sumit Kumar Gupta MILE, School of Language, MGAHV, Wardha National Conference on the Approaches & the Methodologies on the Study of Indegnous & Endangered Language Dr. Piyush Pratap Singh Asst. Professor School of Language MGAHV, Wardha
  • 2. Endangered Language • Endangered language (EL) is the language community incorporates less number of speakers of that particular language. • EL is likely to become extinct in the near future. Many languages are failing out of use and being substituted by others is more widely used in the region or nation.
  • 3. Language Engineering • Language Engineering (LE) is the subfield of computer science which explores the field of language related software and its feasible hardware development.
  • 5. Goal of Language Engineering • The ultimate goal of LE is to develop a machine which is able to understand and generate natural language. • If the Approaches of LE implements on EL, then EL may be Preserve.
  • 6. Language Endangered • The loss of speakers in one language is the gain of speakers of another language, except for cases of genocide. Languages are generally replaced when an entire speech community shifts to another language. Replacing languages are very often official state languages. • The world is experiencing an unprecedented wave of language extinctions. There are between 6,000 and 7,000 languages currently spoken, and between 50 to 90 per cent of those will be extinct by the year 2100.
  • 7. Language Extinction Results • Language extinction results in loss of cultural identities, knowledge systems, and the variety of data needed to understand the structure of language in the mind. • Documenting endangered languages preserves data and stimulates language maintenance and revitalisation.
  • 8. Language Documentation • Many of these languages do not have a written tradition and written data may be completely unavailable or sparse, the languages are not used in the media, or their speakers do not use the Internet (and if they do, they often use another language). In such cases, linguists must start from scratch and collect as much data as possible by recording speakers of a given language. • Ideally, language documentation contains representative samples from different speakers – representing different age groups, different professions, of both sexes, and different origins –, but in the case of endangered languages this may not be possible, because the number of speakers is too small and/or there are only elder speakers. An important issue apart from the number of speakers and amount of data concerns the communication between the linguists or other researchers who want to document a language, and the language community.
  • 9. Language Documentation • In the case of endangered or minority languages, the documenters often are outsiders, not members of the community. They may not be fluent speakers of the language in question and can communicate with the speakers in a second or a third language. This often leads to an unnatural use of the language that is to be documented.
  • 10. Digitalization • Digitlizaion is the process in which data is the store in the form of digital. The durability of digital data is more than others types of data. To preserve EL by Digitaliztion we convert and store data in digital forrm i.e. text, sound, image etc. The researchers should create study meterial of EL in digital form.
  • 11. Application of Language Engineering • Speech Generation • Language Translator • Speech-to-Text • Text-to-Speech • Langauge Teaching • Translitration Tool
  • 12. Application of Language Engineering... • Speaker Identification • Verification Speech Recognition • Character and Document Image Recognition • Question-Answering System • Word sense Disambiguation • Information retrieval and Information Extraction • Film Production and Dialogue Debbing
  • 13. Speech Generation • With the help of language engineering we can generate the speech of Endangered Language by a machine. If a machie will be able to generate EL then we can preserve that Language.
  • 14. Language Translator • Language translator or Machine translator is a machine which is able to translate one language to another language. The first language is called source language and the second language is called the target language. If the Source language or the target language is EL, EL is preventing by this Language Translator system.
  • 15. Speech-to-Text • It is the process of converting speech to text. This is the task of documentation. If we convert speech file to text file of EL then we preserve that language.
  • 16. Language Translator • Language translator or Machine translator is a machine which is able to translate one language to another language. The first language is called sourse language and the second language is called the target language. If the Sourse language or the target language is EL, EL is prevent by this Language Translator system.
  • 17. Transcription Tool • Transcription is the process in which one script to another script. • A person which is unknown to a specific language, its script and pronunciation, the role of Transcription tool is importnat in this context. • If Transcription tool for an EL will be developed then we increase the number of people to understand that language.
  • 18. Text-to-Speech • Text-to-speech system is the system in which text data is input and it return speech data as output. It plays important role in Man-Machine interaction.
  • 19. Langauge Teaching • Language Teaching is the process of teaching a language. With the help of LE we can create a system for teaching a language. If EL teaching system is created EL may be preseve. As it is known that there are some language which has the speakers of old age and this language doesn’t transfer to the next generation. After some that language becomes dead. To preserve this language this system is important.
  • 20. Question Answering System • Question-Answering system is a Natural Language Processing system. If a person ask a question to the system, system returns the answer of that question.
  • 21. Extinct Language • An endangered language is a language that is at a risk of falling out of use, generally because it has few surviving speakers. If it loses all of its native speakers, it becomes an extinct language.
  • 22. Levels of Endangerement • UNESCO defines four levels of language endangerment between "safe" (not endangered) and "extinct": 1. Vulnerable 2. Definitely endangered 3. Severely endangered 4. Critically endangered
  • 23. EL in India • Indian Goverment started a scheme to preseve EL the name of this Scheme is SPPEL(Scheme for Protection and Preservation of Endangered Languages). • The SPPEL has listed 117 languages to be documented in its current phase. The Languages are some of lesser known Indian languages which are spoken by less than 10,000 speakers.
  • 24. Refrence • Refrence List : • B. WEBBER, M. EGG and V. KORDONI (2012). Discourse structure and language technology. Natural Language Engineering • Jurafsky, Martin (et.al. ) Sppech and Language Processing. Prentice Hall, Englewood Cliffs, New Jersey 07632 • Reiter, E. and Dale, R. (2000). Building Natural Language Generation Systems. Cambridge University Press, Cambridge. • Yarowsky, D. (1996). Homograph disambiguation in text-to-speech synthesis. In Progress in Speech Synthesis, pp. 159–175. Springer-Verlag, Berlin. • Small, S. L. and Rieger, C. (1982). Parsing and comprehending withWord Experts. In Lehnert,W. G. and Ringle, M. H. (Eds.), Strategies for Natural Language Processing, pp. 89–147. Lawrence Erlbaum, New Jersey. • www.sppel.org