Shree Manibhai Virani and Smt. Navalben Virani Science
College, Rajkot
(Autonomous)
Affiliated to Saurashtra University, Rajkot
Ms. Ripal Ranpara
Assistant Professor,
Department of Computer Science & Information Technology
Shree M.N. Virani Science College Rajkot
Monophones: The system takes speech as input and divides it into small segments;
these small sound segments are called monophones.
.grammar File: It contains the grammar in the form of rules, e.g. for an English query.
.voca File: It defines the actual words used in speech-to-text conversion; it also
contains commonly used words.
.dict File & .dfa File: The .grammar file and .voca file are compiled to generate a
dictionary file and a finite automaton file, namely the .dict and .dfa files, respectively.
These files are required when the system is executed.
HMM Creation Model: A hidden Markov model (HMM) is a statistical tool for modelling a wide
range of time-series data; it is also commonly used in part-of-speech tagging and
noun-phrase chunking.
Julius Interface: It is an interface that executes the .dict file and .dfa file.
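To make the role of the .voca file more concrete, here is a minimal sketch that reads vocabulary text into word categories. It assumes the usual Julius-style layout, where a line starting with % names a category and each following line holds one word and its monophone sequence. The words, categories, and phoneme spellings below are hypothetical, and parse_voca is an illustrative helper, not part of Julius.

```python
from collections import defaultdict

# Hypothetical vocabulary text in the style of a .voca file:
# "% CATEGORY" starts a word category; each following line is
# one word followed by its phoneme (monophone) sequence.
VOCA_TEXT = """\
% NS_B
<s> sil
% NS_E
</s> sil
% ACTION
open ow p ax n
close k l ow z
% OBJECT
door d ao r
window w ih n d ow
"""


def parse_voca(text):
    """Return {category: {word: [phonemes]}} from .voca-style text."""
    categories = defaultdict(dict)
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith("%"):
            current = line[1:].strip()  # start a new category
            continue
        if current is None:
            raise ValueError("word entry found before any % category line")
        word, *phonemes = line.split()
        categories[current][word] = phonemes
    return dict(categories)


vocabulary = parse_voca(VOCA_TEXT)
print(vocabulary["ACTION"]["open"])
```

A grammar rule would then refer to the category names (for example ACTION followed by OBJECT) rather than to individual words.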
Speaker dependent system: It is developed to operate for a single speaker.
These systems are usually easier to develop, cheaper to buy and more
accurate. Speaker–dependent software works by learning the unique
characteristics of a single person's voice, in a way similar to voice recognition.
Speaker independent system: It is developed to operate for any speaker of
a particular type (e.g. American English). Speaker–independent software is
designed to recognize anyone's voice, so no training is involved.
Speaker adaptive - A third variation of speaker models is now emerging,
called speaker adaptive. Speaker adaptive systems usually begin with a
speaker independent model and adjust these models more closely to each
individual during a brief training period.
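As an illustration of how a speaker adaptive system might "adjust" a speaker independent model, the sketch below shifts one model mean toward a new speaker's data. It uses a MAP-style weighted average, which is only one common adaptation technique and is an assumption here; the slides do not say which method is used. The function name adapt_mean and the weight tau are hypothetical.

```python
def adapt_mean(si_mean, speaker_frames, tau=10.0):
    """Blend a speaker-independent mean with a speaker's feature frames.

    tau controls how strongly the original model is trusted: with few
    adaptation frames the result stays close to si_mean, with many
    frames it approaches the speaker's own average.
    """
    n = len(speaker_frames)
    dim = len(si_mean)
    sums = [sum(frame[d] for frame in speaker_frames) for d in range(dim)]
    return [(tau * si_mean[d] + sums[d]) / (tau + n) for d in range(dim)]


si_mean = [0.0, 0.0]           # mean from the speaker-independent model
frames = [[1.0, 2.0]] * 10     # ten feature frames from the new speaker
adapted = adapt_mean(si_mean, frames, tau=10.0)
print(adapted)
```

With ten frames and tau set to 10, the adapted mean lies halfway between the original mean and the speaker's average.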
Isolated word recognition - Isolated word recognizers usually require each
utterance to have quiet (lack of an audio signal) on BOTH sides of the sample
window. This does not mean that it accepts only single words, but it does require
a single utterance at a time.
Connected word recognition - Connected word systems (or more correctly
'connected utterances') are similar to isolated word systems, but allow separate
utterances to be 'run-together' with a minimal pause between them.
Continuous speech recognition - Continuous recognition is the next step.
Recognizers with continuous speech capabilities are some of the most difficult
to create because they must utilize special methods to determine utterance
boundaries. Continuous speech recognizers allow users to speak almost
naturally, while the computer determines the content.
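The requirement for quiet on both sides of an utterance can be sketched with a simple energy-based endpoint detector. This is a toy illustration under assumed values: the frame length, energy threshold, and minimum silence length are hypothetical, and real continuous recognizers use more elaborate methods to find utterance boundaries.

```python
def find_utterances(samples, frame_len=4, threshold=0.01, min_silence_frames=2):
    """Return (start_frame, end_frame) pairs, end exclusive.

    A frame counts as speech when its mean energy reaches the threshold.
    An utterance ends once min_silence_frames quiet frames follow it.
    """
    energies = []
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        energies.append(sum(s * s for s in frame) / frame_len)

    utterances = []
    start = None
    silence_run = 0
    for idx, energy in enumerate(energies):
        if energy >= threshold:
            if start is None:
                start = idx          # first loud frame opens an utterance
            silence_run = 0
        elif start is not None:
            silence_run += 1
            if silence_run >= min_silence_frames:
                # close the utterance at the first quiet frame of the run
                utterances.append((start, idx - silence_run + 1))
                start = None
                silence_run = 0
    if start is not None:
        # signal ended before enough silence was seen
        utterances.append((start, len(energies) - silence_run))
    return utterances


# Two bursts of "speech" separated by quiet stretches.
signal = [0.0] * 8 + [0.5] * 8 + [0.0] * 8 + [0.5] * 4 + [0.0] * 8
segments = find_utterances(signal)
print(segments)
```

Lowering min_silence_frames lets utterances sit closer together, which loosely corresponds to the minimal pause of connected word recognition.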
STEP 1: The microphone converts the voice to an analog signal. The sound card
in the computer processes this signal and converts it to digital form, that is,
the binary 1s and 0s that computers work with.
STEP 2: Speech-recognition software has acoustic models. An acoustic model is
created by taking audio recordings of speech and their text transcriptions, and
using software to create statistical representations of the sounds that make up
each word. Each of these sounds is known as a monophone.
STEP 3: Once this is complete, a second part of the software begins to
work. The recognized sounds are compared to the digital dictionary that is stored
in computer memory. This is a large collection of words, usually more than
100,000. When the software finds a match based on the digital form, it displays
the words on the screen. This is the basic process for all speech recognition
systems and software.
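STEP 1 above can be sketched as sampling followed by quantization. The example below is a simplified model of what a sound card does, not a description of any particular hardware: the 8000 Hz sample rate, the 16-bit depth, and the 440 Hz test tone are assumptions chosen for illustration.

```python
import math


def digitize(signal_fn, duration_s, sample_rate=8000, bits=16):
    """Sample a continuous signal and quantize it to signed integers.

    signal_fn maps time in seconds to an amplitude in [-1.0, 1.0].
    """
    max_level = 2 ** (bits - 1) - 1          # 32767 for 16 bits
    n_samples = round(duration_s * sample_rate)
    samples = []
    for i in range(n_samples):
        t = i / sample_rate                  # time of the i-th sample
        x = max(-1.0, min(1.0, signal_fn(t)))  # clip to the valid range
        samples.append(int(round(x * max_level)))
    return samples


def tone(t):
    """A 440 Hz sine wave standing in for the microphone signal."""
    return math.sin(2 * math.pi * 440 * t)


pcm = digitize(tone, duration_s=0.01)
# Show the first few samples as 16-bit binary patterns (the "1s and 0s").
for value in pcm[:3]:
    print(value, format(value & 0xFFFF, "016b"))
```

Each printed bit pattern is one sample; the recognizer's later stages work on sequences of such numbers rather than on the analog waveform.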
[Flow diagram: Speech -> Monophones -> HMM Creation -> .dfa and .dict -> Julius Interface -> Text as Output]
The conversion of speech to text is divided into four phases, namely:
Data preparation,
Monophone HMM creation,
.grammar and .voca file creation,
Execution with the Julius interface.
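To show how monophone HMMs are used once they exist, the sketch below finds the most likely sequence of hidden states for a series of observations with the Viterbi algorithm. The two states, the "low"/"high" observation symbols, and all probabilities are invented toy values; an actual system estimates these from recorded speech and uses continuous acoustic features.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most probable state path for the observation sequence."""
    # best[t][s]: probability of the best path ending in state s at time t
    best = [{s: start_p[s] * emit_p[s][observations[0]] for s in states}]
    back = [{}]
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            prev, score = max(
                ((p, best[t - 1][p] * trans_p[p][s]) for p in states),
                key=lambda pair: pair[1],
            )
            best[t][s] = score * emit_p[s][observations[t]]
            back[t][s] = prev                # remember where we came from
    # Trace back from the most probable final state.
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


# Toy model: two monophone-like states and two observation symbols.
states = ["a", "b"]
start_p = {"a": 0.6, "b": 0.4}
trans_p = {"a": {"a": 0.7, "b": 0.3}, "b": {"a": 0.4, "b": 0.6}}
emit_p = {"a": {"low": 0.9, "high": 0.1}, "b": {"low": 0.2, "high": 0.8}}

decoded = viterbi(["low", "low", "high", "high"], states, start_p, trans_p, emit_p)
print(decoded)
```

For longer sequences, implementations normally work with log probabilities to avoid numerical underflow; plain products are kept here for readability.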
Speech recognition system