SlideShare a Scribd company logo
Types of ASR?????
Approaches to ASR
ASR(Automatic Speech Recognition)?
What is Voice Recognition???
What Is Voice??
Process of Voice Recognition????
Why Voices are Different???
Component of Sound???
How Speech Recognition Works????
Application of Speech Processing??
Process of Speech Production???
Classification to Speech Sounds??
Approaches to Speech Recognition??
 The voice consists of sound made by a
human being using the vocal folds for
talking, singing, laughing, crying,
screaming, etc.
The voice consists of sound made by a
human being using the vocal folds for
talking, singing, laughing, crying,
screaming, etc.
It is the process of converting voice into
electric signals.
Signals transform into CODING
PATTERN.
The first ASR device was used in 1952
and recognized single digits spoken by a
user
TEMPLATE MATCHING
Template matching is
the simplest technique
and has the highest
accuracy when used
properly, but it also
suffers from the most
limitations.
ASR
Feature Analysis
A more general form
of voice recognition is
available through
feature analysis and
this technique usually
leads to "speaker-
independent" voice
recognition.
•It is SPEAKE DEPENDENT.
•It match voice with already saved
templates.
•Before it we’ve to trained the system.
• System must be trained.
•User speak same word which are avail
in template.
•Recognition accuracy can be about 98
percent.
Template Matching
•It is SPEAKER INDEPENDENT.
•First process the giving voice as inputut
•Using LPC(Linear Productive Coding)
•Attempt to find similarities b/w
expected
•Input and Digitized input.
•Recognition accuracy for
speaker-independent systems is
somewhat less than for
speaker-dependent systems, usually
between 90 and 95 percent.
Feature Analysis
TEXT Phonems
Articulary
Motions
Speak/
Say Someting
Acoustic Wave Form
Acoustic Wave Form
Spectrum
Analysis
Feature
Retractions
Coding
Phonems/
Word/Sentence
Semantics
Discrete Input Continuous Input
 Vocal Tract
Consist of laryngeal pharynx, oral
phyrnax, oral cavity, nassal cavity,
nassal phyrnx.
 Specturm Analysis
MFCC used to produce voice
feaature. DTW to select the pattern
that match the database(matLab).
 Acoustic Model
provide the acoustic sound of a language
and can be recognized the chore of a
particular user speech pattern and
acoustic environment.
 To make pattern recognition PCM
transfer into frequency domain
 Speaker Dependent
 Speaker Independent
 Discrete Speaker Recognition
 Continuous Speech Recognition
 Natural Languages
 Pitch
 Timber
 Harmonics
 Loudness
 Rhythm
 Attack
 Sustain
 Decay
 Speed
 COMPRESSION
in which particles are crowded
together, appear as upward curves in
the line.
 RAREFACTION
in which particles are spread apart,
appear as downward curves in the line.
 WAVELENGTH
this is the distance from the crest of one
wave to the crest of the next.
 FREQUENCY
this is the number of waves that
pass a point in each second.
 AMPLITUDE
this is the measure of the amount
of energy in a sound wave.
High Frequency Sound Wave Low Frequency Sound Wave
This is how high or low a sound seems.
A bird makes a high pitch.
A lion makes a low pitch.
Voices are different caused
by
INTENSITY(depend on amplitude) ,
PITCH(frequency) ,
TONE(pleasant or unpleasent).
 Divide the sound wave into evenly spaced
blocks
 Process each block for important
characteristics, such as strength across
various frequency ranges, number of zero
crossings, and total energy.
 Using this characteristic vector, attempt to
associate each block with a phone, which is
the most basic unit of speech, producing a
string of phones.
 Find the word whose model is the most likely
match to the string of phones which was
produced.
 Transfer the PCM into Accoustic
 Apply GRAMMER
 Figure out which PHONEMS are spoken
 Convert PHONEMS into WORDS
 Acoustic Phonetic Approach
 Pattern Recognition Approach(HMM)
 Artificial Intelligence Approach(Neural Networks)
Speech Processing
Analysis/Syntactic Coding
Recognition
Speaker Recognition Language Identification
Speech Recognition
Speech Mode Speaking StyleVocabulary SizeSpeaker Mode
•Isolated Speech
•Continuous Speech
•Speaker Dependent
•Speaker In-Dependent
•Speaker Adaptive
•Small
•Medium
•large
•Dictation
•Spontaneous
•Vocal Chord play active role in the
production of SOUND.
e.g. a/e/I
•It has high frequency
Voiced Sound
•When Vocal Chord is Inactive
Called UN VOICED SOUND
e.g. s/f
•It build up by pressure
Un Voiced Sound
 Speech Coding
 Speech Recognition
 Speech Verification/Identification
 Speech Enhancement(remove background noises)
 Speech Synthesis
 Grammar Design
 Signal Processing
 Phonemic Recognition
 Word Recognition
 Result Recognition

More Related Content

What's hot

Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognition
ananth
 
Ai based character recognition and speech synthesis
Ai based character recognition and speech  synthesisAi based character recognition and speech  synthesis
Ai based character recognition and speech synthesis
Ankita Jadhao
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
Alok Tiwari
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
Hardik Kanjariya
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
Seminar Links
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
SrijanKumar18
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
Ahmed Moawad
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
Aamir-sheriff
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
anshu shrivastava
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
Gabija Ščiukauskaitė
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
Alok Tiwari
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
Charu Joshi
 
Ece speech-recognition-report
Ece speech-recognition-reportEce speech-recognition-report
Ece speech-recognition-report
Anakali Mahesh
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognition
Vinay Jaisriram
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
sajanazoya
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
Varun Jain
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
Yogesh Vijay
 
Speech recognition system
Speech recognition systemSpeech recognition system
Speech recognition system
Ripal Ranpara
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
Manthan Gandhi
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
Hugo Moreno
 

What's hot (20)

Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognition
 
Ai based character recognition and speech synthesis
Ai based character recognition and speech  synthesisAi based character recognition and speech  synthesis
Ai based character recognition and speech synthesis
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Ece speech-recognition-report
Ece speech-recognition-reportEce speech-recognition-report
Ece speech-recognition-report
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognition
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Speech recognition system
Speech recognition systemSpeech recognition system
Speech recognition system
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 

Viewers also liked

Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition
Goa App
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniques
sonukumar142
 
Affricate sounds 2010
Affricate sounds 2010Affricate sounds 2010
Affricate sounds 2010
Jordán Masías
 
Plosives
PlosivesPlosives
Consonants presentation
Consonants presentationConsonants presentation
Consonants presentation
Universidad de Costa Rica
 
Lecture 1 Consonants
Lecture 1 ConsonantsLecture 1 Consonants
Lecture 1 Consonants
Hai Ha Do
 
Classification and description of consonants.ppt
Classification and description of consonants.pptClassification and description of consonants.ppt
Classification and description of consonants.ppt
harshakrishna
 
Phonetics - Manner of Articulation
Phonetics - Manner of ArticulationPhonetics - Manner of Articulation
Phonetics - Manner of Articulation
Ajez Ahmad
 

Viewers also liked (8)

Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniques
 
Affricate sounds 2010
Affricate sounds 2010Affricate sounds 2010
Affricate sounds 2010
 
Plosives
PlosivesPlosives
Plosives
 
Consonants presentation
Consonants presentationConsonants presentation
Consonants presentation
 
Lecture 1 Consonants
Lecture 1 ConsonantsLecture 1 Consonants
Lecture 1 Consonants
 
Classification and description of consonants.ppt
Classification and description of consonants.pptClassification and description of consonants.ppt
Classification and description of consonants.ppt
 
Phonetics - Manner of Articulation
Phonetics - Manner of ArticulationPhonetics - Manner of Articulation
Phonetics - Manner of Articulation
 

Similar to Automatic Speech Recognion

Speech processing
Speech processingSpeech processing
Assign
AssignAssign
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
SrijanKumar18
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
Muhammad Taqi
 
Web AI.pptx
Web AI.pptxWeb AI.pptx
Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01
girishjoshi1234
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language Processing
Vikalp Mahendra
 
Speech Technology Overview
Speech Technology OverviewSpeech Technology Overview
Speech Technology Overview
amr0mt
 
SoundSense
SoundSenseSoundSense
SoundSense
butest
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
International Islamic University
 
Speech Recognition System
Speech Recognition SystemSpeech Recognition System
Speech recognition system
Speech recognition systemSpeech recognition system
Speech recognition system
Arun Tiwari
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
editor1knowledgecuddle
 
Speech recognition (dr. m. sabarimalai manikandan)
Speech recognition (dr. m. sabarimalai manikandan)Speech recognition (dr. m. sabarimalai manikandan)
Speech recognition (dr. m. sabarimalai manikandan)
Indian Institute of Technology Bhubaneswar
 
Speech recognizers & generators
Speech recognizers & generatorsSpeech recognizers & generators
Speech recognizers & generators
Paul Kahoro
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
Bilgin Aksoy
 
speech recognition and removal of disfluencies
speech recognition and removal of disfluenciesspeech recognition and removal of disfluencies
speech recognition and removal of disfluencies
Ankit Sharma
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basics
sivakumar m
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
IJTET Journal
 
Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01
Rehan Ahmed
 

Similar to Automatic Speech Recognion (20)

Speech processing
Speech processingSpeech processing
Speech processing
 
Assign
AssignAssign
Assign
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
 
Web AI.pptx
Web AI.pptxWeb AI.pptx
Web AI.pptx
 
Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01
 
Speech and Language Processing
Speech and Language ProcessingSpeech and Language Processing
Speech and Language Processing
 
Speech Technology Overview
Speech Technology OverviewSpeech Technology Overview
Speech Technology Overview
 
SoundSense
SoundSenseSoundSense
SoundSense
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Speech Recognition System
Speech Recognition SystemSpeech Recognition System
Speech Recognition System
 
Speech recognition system
Speech recognition systemSpeech recognition system
Speech recognition system
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
 
Speech recognition (dr. m. sabarimalai manikandan)
Speech recognition (dr. m. sabarimalai manikandan)Speech recognition (dr. m. sabarimalai manikandan)
Speech recognition (dr. m. sabarimalai manikandan)
 
Speech recognizers & generators
Speech recognizers & generatorsSpeech recognizers & generators
Speech recognizers & generators
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
 
speech recognition and removal of disfluencies
speech recognition and removal of disfluenciesspeech recognition and removal of disfluencies
speech recognition and removal of disfluencies
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basics
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01Voice morphing-101113123852-phpapp01
Voice morphing-101113123852-phpapp01
 

More from International Islamic University

Hash tables
Hash tablesHash tables
Binary Search Tree
Binary Search TreeBinary Search Tree
Graph 1
Graph 1Graph 1
Graph 2
Graph 2Graph 2
Graph 3
Graph 3Graph 3
Greedy algorithm
Greedy algorithmGreedy algorithm
Dynamic programming
Dynamic programmingDynamic programming
Quick sort
Quick sortQuick sort
Merge sort
Merge sortMerge sort
Linear timesorting
Linear timesortingLinear timesorting
Facial Expression Recognitino
Facial Expression RecognitinoFacial Expression Recognitino
Facial Expression Recognitino
International Islamic University
 
Lecture#4
Lecture#4Lecture#4
Lecture#3
Lecture#3 Lecture#3
Lecture#2
Lecture#2 Lecture#2
Case study
Case studyCase study
Arrays
ArraysArrays
Pcb
PcbPcb
Data transmission
Data transmissionData transmission
Basic organization of computer
Basic organization of computerBasic organization of computer
Basic organization of computer
International Islamic University
 
Sorting techniques
Sorting techniquesSorting techniques

More from International Islamic University (20)

Hash tables
Hash tablesHash tables
Hash tables
 
Binary Search Tree
Binary Search TreeBinary Search Tree
Binary Search Tree
 
Graph 1
Graph 1Graph 1
Graph 1
 
Graph 2
Graph 2Graph 2
Graph 2
 
Graph 3
Graph 3Graph 3
Graph 3
 
Greedy algorithm
Greedy algorithmGreedy algorithm
Greedy algorithm
 
Dynamic programming
Dynamic programmingDynamic programming
Dynamic programming
 
Quick sort
Quick sortQuick sort
Quick sort
 
Merge sort
Merge sortMerge sort
Merge sort
 
Linear timesorting
Linear timesortingLinear timesorting
Linear timesorting
 
Facial Expression Recognitino
Facial Expression RecognitinoFacial Expression Recognitino
Facial Expression Recognitino
 
Lecture#4
Lecture#4Lecture#4
Lecture#4
 
Lecture#3
Lecture#3 Lecture#3
Lecture#3
 
Lecture#2
Lecture#2 Lecture#2
Lecture#2
 
Case study
Case studyCase study
Case study
 
Arrays
ArraysArrays
Arrays
 
Pcb
PcbPcb
Pcb
 
Data transmission
Data transmissionData transmission
Data transmission
 
Basic organization of computer
Basic organization of computerBasic organization of computer
Basic organization of computer
 
Sorting techniques
Sorting techniquesSorting techniques
Sorting techniques
 

Recently uploaded

BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
Nguyen Thanh Tu Collection
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
Krassimira Luka
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
GeorgeMilliken2
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
adhitya5119
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
TechSoup
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
siemaillard
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Denish Jangid
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
Celine George
 
Solutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptxSolutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptx
spdendr
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
Nguyen Thanh Tu Collection
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
imrankhan141184
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
Dr. Mulla Adam Ali
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
BoudhayanBhattachari
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
Amin Marwan
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
iammrhaywood
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
haiqairshad
 
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Leena Ghag-Sakpal
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 

Recently uploaded (20)

BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
Solutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptxSolutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptx
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 

Automatic Speech Recognion

  • 1.
  • 2. Types of ASR????? Approaches to ASR ASR(Automatic Speech Recognition)? What is Voice Recognition??? What Is Voice?? Process of Voice Recognition???? Why Voices are Different??? Component of Sound??? How Speech Recognition Works????
  • 3. Application of Speech Processing?? Process of Speech Production??? Classification to Speech Sounds?? Approaches to Speech Recognition??
  • 4.  The voice consists of sound made by a human being using the vocal folds for talking, singing, laughing, crying, screaming, etc. The voice consists of sound made by a human being using the vocal folds for talking, singing, laughing, crying, screaming, etc.
  • 5. It is the process of converting voice into electric signals. Signals transform into CODING PATTERN.
  • 6. The first ASR device was used in 1952 and recognized single digits spoken by a user
  • 7. TEMPLATE MATCHING Template matching is the simplest technique and has the highest accuracy when used properly, but it also suffers from the most limitations. ASR Feature Analysis A more general form of voice recognition is available through feature analysis and this technique usually leads to "speaker- independent" voice recognition.
  • 8. •It is SPEAKE DEPENDENT. •It match voice with already saved templates. •Before it we’ve to trained the system. • System must be trained. •User speak same word which are avail in template. •Recognition accuracy can be about 98 percent. Template Matching •It is SPEAKER INDEPENDENT. •First process the giving voice as inputut •Using LPC(Linear Productive Coding) •Attempt to find similarities b/w expected •Input and Digitized input. •Recognition accuracy for speaker-independent systems is somewhat less than for speaker-dependent systems, usually between 90 and 95 percent. Feature Analysis
  • 9. TEXT Phonems Articulary Motions Speak/ Say Someting Acoustic Wave Form Acoustic Wave Form Spectrum Analysis Feature Retractions Coding Phonems/ Word/Sentence Semantics Discrete Input Continuous Input
  • 10.  Vocal Tract Consist of laryngeal pharynx, oral phyrnax, oral cavity, nassal cavity, nassal phyrnx.  Specturm Analysis MFCC used to produce voice feaature. DTW to select the pattern that match the database(matLab).
  • 11.  Acoustic Model provide the acoustic sound of a language and can be recognized the chore of a particular user speech pattern and acoustic environment.
  • 12.  To make pattern recognition PCM transfer into frequency domain
  • 13.  Speaker Dependent  Speaker Independent  Discrete Speaker Recognition  Continuous Speech Recognition  Natural Languages
  • 14.  Pitch  Timber  Harmonics  Loudness  Rhythm  Attack  Sustain  Decay  Speed
  • 15.
  • 16.  COMPRESSION in which particles are crowded together, appear as upward curves in the line.  RAREFACTION in which particles are spread apart, appear as downward curves in the line.  WAVELENGTH this is the distance from the crest of one wave to the crest of the next.
  • 17.  FREQUENCY this is the number of waves that pass a point in each second.  AMPLITUDE this is the measure of the amount of energy in a sound wave.
  • 18. High Frequency Sound Wave Low Frequency Sound Wave This is how high or low a sound seems. A bird makes a high pitch. A lion makes a low pitch.
  • 19. Voices are different caused by INTENSITY(depend on amplitude) , PITCH(frequency) , TONE(pleasant or unpleasent).
  • 20.  Divide the sound wave into evenly spaced blocks  Process each block for important characteristics, such as strength across various frequency ranges, number of zero crossings, and total energy.  Using this characteristic vector, attempt to associate each block with a phone, which is the most basic unit of speech, producing a string of phones.  Find the word whose model is the most likely match to the string of phones which was produced.
  • 21.  Transfer the PCM into Accoustic  Apply GRAMMER  Figure out which PHONEMS are spoken  Convert PHONEMS into WORDS
  • 22.  Acoustic Phonetic Approach  Pattern Recognition Approach(HMM)  Artificial Intelligence Approach(Neural Networks)
  • 23.
  • 24.
  • 25.
  • 26. Speech Processing Analysis/Syntactic Coding Recognition Speaker Recognition Language Identification Speech Recognition Speech Mode Speaking StyleVocabulary SizeSpeaker Mode •Isolated Speech •Continuous Speech •Speaker Dependent •Speaker In-Dependent •Speaker Adaptive •Small •Medium •large •Dictation •Spontaneous
  • 27. •Vocal Chord play active role in the production of SOUND. e.g. a/e/I •It has high frequency Voiced Sound •When Vocal Chord is Inactive Called UN VOICED SOUND e.g. s/f •It build up by pressure Un Voiced Sound
  • 28.  Speech Coding  Speech Recognition  Speech Verification/Identification  Speech Enhancement(remove background noises)  Speech Synthesis
  • 29.  Grammar Design  Signal Processing  Phonemic Recognition  Word Recognition  Result Recognition