SlideShare a Scribd company logo
1 of 18
AUTOMATIC SPEECH RECOGNITION
OUTLINE
1. Introduction
2. Background with Literature Support
3. Motivation / Research Gaps
4. Hypothesis / Research Questions
5. Objectives
6. Methodology
1. Research Flow Diagram
2. Data Required
3. Analytical Tools Required
7. Proposed Plan for Next 3 Year over all plan
8. Proposed Activity for Next Year
INTRODUCTION
• Speech means the expression of or the ability to express thoughts and feelings
by articulating sounds orally.
• Recognition is the means of identification of those sounds
• Speech recognition, or speech-to-text, is the ability of a machine or a program
to identify words spoken aloud and convert them into readable text.
• But there are some problems in speech recognition of certain languages out of
which recognition of tribal languages has been major concern rather than the
urban languages like telugu,Hindi,Tamil,Malayalam etc.
• When huge amount of text or linguistic problem oriented languages like tribal
languages have been encountered there exist demand for Automatic speech
Recognition where the letters or words spoken need to be identified based on
certain mechanisms automatically.
INTRODUCTION
What is ASR(Automatic Speech Recognition?
• ASR is a Speech recognition technology which is capable of converting spoken
language (an audio signal) into written text that is often used as a command or
further process.
• It just address the aspect of Speech to Text(STT).
• The speech processing used in :
Digital speech coding,
Spoken language dialog systems,
Text-to-speech synthesis,
Automatic speech recognition.
Information (such as speaker, gender or language )
can also be extracted from speech
Speech Production/Speech Perception[1]
BACKGROUND WITH LITERATURE SUPPORT
No Title Author Journal Name & Year Methodology Adapted Key Findings Gap
s
1 A review on speech
processing using machine
learning paradigm.
Bhangale, Kishor &
Kothandaraman,
Mohanaprasad
International Journal of
Speech Technology
It concentrated on the
distinct feature extraction
and machine learning
classifiers for the speech
processing applications.
Statistical feature extraction
techniques such as PCA, ICA, and ZCR
are suitable for low vocabulary, but for
larger and noisy vocabulary it shows
poor performance.
2 Revisiting signal
processing with
spectrogram analysis on
EEG, ECG and speech
signals
Wang, W., Zhang, G., Yang, L.,
Balaji, V. S., Elamaran, V., &
Arunkumar, N.
Future Generation
Computer Systems, 98
Spectrogram is performed
on EEG, ECG and Speech
signals
Spectrogram is a better tool for ECG.
3 Recognition of Vowels in
Continuous Speech by
Using Formants
Prica, Biljana & Ili, Siniša. FACTA UNIVERSITATIS
(NIS) ˇ SER.: ELEC.
ENERG. vol. 23, no. 3,
December 2010, 379-
393
Using of LPC Method in
Speech Analysis
Investigated the correlations between
formants in each vowel and developed
the algorithm to reduce the overlap of
different vowels in F1-F2 and F2-F3
planes
4 Speech Processing for
Language Learning: A
Practical Approach to
Computer-Assisted
Pronunciation Teaching
Bogach, N., Boitsova, E.,
Chernonog, S., Lamtev, A.,
Lesnichaya, M., Lezhenin, I.,
Novopashenny, A.,
Svechnikov, R., Tsikach, D.,
Vasiliev, K., Pyshkin, E., &
Blake, J. (2021).
.Electronics
(Switzerland), 10(3).
Voice Activity Detection,
Prosodic Similarity
Evaluation
Improvements in signal processing
algorithms to help learning foreign
languages.
MOTIVATION / RESEARCH GAPS
0th REVIEW 2202030001 Department of CSE
• As a vast country with diverse Indigenous cultures and traditions, India
is a hub of languages.
• In a multilingual society like ours, language plays an important role in
shaping our social experiences.
• However, many of the indigenous languages, especially, that of Adivasis,
are at threat due to several reasons.
• In this context still some of tribal languages like Gondi and Koya/Koi is
not recognized by the government of India—making it difficult to
preserve the indigenous languages.
MOTIVATION / RESEARCH GAPS
0th REVIEW 2202030001 Department of CSE
 Another significant issue at hand is the non-tribal migration to the tribal
hamlets. Non-tribal people speak the official state language or regions’
dominant language.
• In this context after thorough literature survey I thought Automatic
speech recognition (ASR) may be implemented so as to recognize letter or
word or sentence so that the acknowledging the letters of a tribal language
may always bring the urban people closure to these adivasis so that entire
culture may converge to a single group in its understanding.
HYPOTHESIS / RESEARCH QUESTIONS
https://en.wikipedia.org/wiki/Alluri_Sitharama_Raju_district
In Andhra Pradesh state, tribal languages are like Adivasi odiya, Savara, Koya,
Kui, Gondi, Ollari, Chenchu etc.
OBJECTIVES
The Goal of Automatic Speech Recognition ( ASR ):
• To Transform a sequence of sound waves into a string of letters or words.
• Basic steps to implement Automatic Speech Recognition
• Speech wave form is processed to produce new representation as a sequence of
vectors containing values of features or parameters.
• The parameters values extracted from raw speech or used to build acoustic
models.
• The dictionary provides pronunciations for words found in the language model.
• And thereafter fixing a suitable Problem Statement/Objective in this regard.
Implementation of ASR :
METHODOLOGY
• Speech recognition : Pattern recognition
• Data required : Audio recordings of human speech
12
Speech Signal Processing
(Feature Extraction)
• Digitisation of analog speech signal
• Blocking signal into frames
• FFT → mel filter → log → IFFT ⇒ MFCC
• Sequence of feature vectors
: x1, x2,... xT
: o1, o2,... oT
Research Flow Diagram
METHODOLOGY
Analytical Tools required :
Speech Signal Processing (Feature Extraction)
i. Time-domain approach
ii. Frequency-Domain Approach
Analytical Tools required :
Source-Filter Model [7]
Fourier or
Harmonic or
Spectral analysis
is inverse of
Spectral
synthesis (wave
synthesis) as
shown on right.
Speech signal represented as a sequence of spectral vectors
Speech signal represented as a sequence of spectral vectors
Speech signal represented as a sequence of spectral vectors
METHODOLOGY
Models (Classifiers ) to be used in speech processing
• Dynamic time warping (DTW
• Hidden Markov model (HMM)
• Gaussian mixture model (GMM)
• Combination of HMM and GMM
• Artificial Neural Network
• Deep Neural Network etc.

More Related Content

Similar to Speech-Recognition.pptx

Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionaryiosrjce
 
On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...rahulmonikasharma
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overviewsajanazoya
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET Journal
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguisticsAdnanBaloch15
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARijcseit
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARijcseit
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARijcseit
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARijcseit
 
5215ijcseit01
5215ijcseit015215ijcseit01
5215ijcseit01ijcsit
 
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORKijitcs
 
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Implementation of English-Text to Marathi-Speech (ETMS) SynthesizerImplementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Implementation of English-Text to Marathi-Speech (ETMS) SynthesizerIOSR Journals
 
Project_Phase1_-_Literature_Review-1[1].pptx
Project_Phase1_-_Literature_Review-1[1].pptxProject_Phase1_-_Literature_Review-1[1].pptx
Project_Phase1_-_Literature_Review-1[1].pptxASHWIN808488
 
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...csandit
 

Similar to Speech-Recognition.pptx (20)

Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionary
 
On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...On Developing an Automatic Speech Recognition System for Commonly used Englis...
On Developing an Automatic Speech Recognition System for Commonly used Englis...
 
G1803013542
G1803013542G1803013542
G1803013542
 
Assign
AssignAssign
Assign
 
K010416167
K010416167K010416167
K010416167
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
 
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMARSYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
SYLLABLE-BASED SPEECH RECOGNITION SYSTEM FOR MYANMAR
 
5215ijcseit01
5215ijcseit015215ijcseit01
5215ijcseit01
 
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 
F017163443
F017163443F017163443
F017163443
 
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Implementation of English-Text to Marathi-Speech (ETMS) SynthesizerImplementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer
 
Project_Phase1_-_Literature_Review-1[1].pptx
Project_Phase1_-_Literature_Review-1[1].pptxProject_Phase1_-_Literature_Review-1[1].pptx
Project_Phase1_-_Literature_Review-1[1].pptx
 
Ijetcas14 390
Ijetcas14 390Ijetcas14 390
Ijetcas14 390
 
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...
 

Recently uploaded

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 

Recently uploaded (20)

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 

Speech-Recognition.pptx

  • 2. OUTLINE 1. Introduction 2. Background with Literature Support 3. Motivation / Research Gaps 4. Hypothesis / Research Questions 5. Objectives 6. Methodology 1. Research Flow Diagram 2. Data Required 3. Analytical Tools Required 7. Proposed Plan for Next 3 Year over all plan 8. Proposed Activity for Next Year
  • 3. INTRODUCTION • Speech means the expression of or the ability to express thoughts and feelings by articulating sounds orally. • Recognition is the means of identification of those sounds • Speech recognition, or speech-to-text, is the ability of a machine or a program to identify words spoken aloud and convert them into readable text. • But there are some problems in speech recognition of certain languages out of which recognition of tribal languages has been major concern rather than the urban languages like telugu,Hindi,Tamil,Malayalam etc. • When huge amount of text or linguistic problem oriented languages like tribal languages have been encountered there exist demand for Automatic speech Recognition where the letters or words spoken need to be identified based on certain mechanisms automatically.
  • 4. INTRODUCTION What is ASR(Automatic Speech Recognition? • ASR is a Speech recognition technology which is capable of converting spoken language (an audio signal) into written text that is often used as a command or further process. • It just address the aspect of Speech to Text(STT). • The speech processing used in : Digital speech coding, Spoken language dialog systems, Text-to-speech synthesis, Automatic speech recognition. Information (such as speaker, gender or language ) can also be extracted from speech
  • 6. BACKGROUND WITH LITERATURE SUPPORT No Title Author Journal Name & Year Methodology Adapted Key Findings Gap s 1 A review on speech processing using machine learning paradigm. Bhangale, Kishor & Kothandaraman, Mohanaprasad International Journal of Speech Technology It concentrated on the distinct feature extraction and machine learning classifiers for the speech processing applications. Statistical feature extraction techniques such as PCA, ICA, and ZCR are suitable for low vocabulary, but for larger and noisy vocabulary it shows poor performance. 2 Revisiting signal processing with spectrogram analysis on EEG, ECG and speech signals Wang, W., Zhang, G., Yang, L., Balaji, V. S., Elamaran, V., & Arunkumar, N. Future Generation Computer Systems, 98 Spectrogram is performed on EEG, ECG and Speech signals Spectrogram is a better tool for ECG. 3 Recognition of Vowels in Continuous Speech by Using Formants Prica, Biljana & Ili, Siniša. FACTA UNIVERSITATIS (NIS) ˇ SER.: ELEC. ENERG. vol. 23, no. 3, December 2010, 379- 393 Using of LPC Method in Speech Analysis Investigated the correlations between formants in each vowel and developed the algorithm to reduce the overlap of different vowels in F1-F2 and F2-F3 planes 4 Speech Processing for Language Learning: A Practical Approach to Computer-Assisted Pronunciation Teaching Bogach, N., Boitsova, E., Chernonog, S., Lamtev, A., Lesnichaya, M., Lezhenin, I., Novopashenny, A., Svechnikov, R., Tsikach, D., Vasiliev, K., Pyshkin, E., & Blake, J. (2021). .Electronics (Switzerland), 10(3). Voice Activity Detection, Prosodic Similarity Evaluation Improvements in signal processing algorithms to help learning foreign languages.
  • 7. MOTIVATION / RESEARCH GAPS 0th REVIEW 2202030001 Department of CSE • As a vast country with diverse Indigenous cultures and traditions, India is a hub of languages. • In a multilingual society like ours, language plays an important role in shaping our social experiences. • However, many of the indigenous languages, especially, that of Adivasis, are at threat due to several reasons. • In this context still some of tribal languages like Gondi and Koya/Koi is not recognized by the government of India—making it difficult to preserve the indigenous languages.
  • 8. MOTIVATION / RESEARCH GAPS 0th REVIEW 2202030001 Department of CSE  Another significant issue at hand is the non-tribal migration to the tribal hamlets. Non-tribal people speak the official state language or regions’ dominant language. • In this context after thorough literature survey I thought Automatic speech recognition (ASR) may be implemented so as to recognize letter or word or sentence so that the acknowledging the letters of a tribal language may always bring the urban people closure to these adivasis so that entire culture may converge to a single group in its understanding.
  • 9. HYPOTHESIS / RESEARCH QUESTIONS https://en.wikipedia.org/wiki/Alluri_Sitharama_Raju_district In Andhra Pradesh state, tribal languages are like Adivasi odiya, Savara, Koya, Kui, Gondi, Ollari, Chenchu etc.
  • 10. OBJECTIVES The Goal of Automatic Speech Recognition ( ASR ): • To Transform a sequence of sound waves into a string of letters or words. • Basic steps to implement Automatic Speech Recognition • Speech wave form is processed to produce new representation as a sequence of vectors containing values of features or parameters. • The parameters values extracted from raw speech or used to build acoustic models. • The dictionary provides pronunciations for words found in the language model. • And thereafter fixing a suitable Problem Statement/Objective in this regard.
  • 12. METHODOLOGY • Speech recognition : Pattern recognition • Data required : Audio recordings of human speech 12 Speech Signal Processing (Feature Extraction) • Digitisation of analog speech signal • Blocking signal into frames • FFT → mel filter → log → IFFT ⇒ MFCC • Sequence of feature vectors : x1, x2,... xT : o1, o2,... oT Research Flow Diagram
  • 13. METHODOLOGY Analytical Tools required : Speech Signal Processing (Feature Extraction) i. Time-domain approach ii. Frequency-Domain Approach
  • 14. Analytical Tools required : Source-Filter Model [7] Fourier or Harmonic or Spectral analysis is inverse of Spectral synthesis (wave synthesis) as shown on right.
  • 15. Speech signal represented as a sequence of spectral vectors
  • 16. Speech signal represented as a sequence of spectral vectors
  • 17. Speech signal represented as a sequence of spectral vectors
  • 18. METHODOLOGY Models (Classifiers ) to be used in speech processing • Dynamic time warping (DTW • Hidden Markov model (HMM) • Gaussian mixture model (GMM) • Combination of HMM and GMM • Artificial Neural Network • Deep Neural Network etc.