SlideShare a Scribd company logo
1 of 24
Download to read offline
Speech Recognizers & Generators 
Let’s Get Started… 
Presented by: P. Kahoro 
Presented to: Prof P. Okanda
Speech Recognizers: What are they? 
A Speech is the vocalized form of human communication. 
Incomputer scienceandelectrical engineering,speech recognition(SR) is the translation of spoken words into text. It is also known as "automatic speech recognition" (ASR). Speech Recognition (SR) is the ability to translate a dictation or spoken word to text. 
-Speech recognition has evolved quite a bit over the past few years. Initially, it used to work in discrete dictation mode, where you had to pause between each spoken word. Today, however, it uses continuous dictation. It’s also become smarter, with its own set of grammar rules to make out the meaning of what’s being said.
Terms and Concepts 
•Utterances 
•Pronounciation 
•Grammer 
•Speaker Dependent System 
•Speaker Independent System 
•Training 
•Accuracy
Terms &Concepts 
Utterances: 
An utterance is any stream of speech between two periods of silence. Silence delineates the start and end of an utterance. An utterance can be a single word, or it can contain multiple words (a phrase or a sentence) 
Pronunciations: 
One piece of information that the speech recognition engine uses to process a word is its pronunciation, which represents what the speech engine thinks a word should sound like. 
Words can have multiple pronunciations associated with them. For example, the word “the” has at least two pronunciations in the U.S. English language: “thee” and “thuh”.
Cont… 
Grammar: Grammars define the domain, or context, within which the recognition engine works. The engine compares the current utterance against the words and phrases in the active grammars. If the user says something that is not in the grammar, the speech engine will not be able to understand it correctly. So usually speech engines have a very vast grammar. 
Accuracy: The ability of a recognizer can be examined by measuring its accuracy − or how well it recognizes utterances. 
Training: 
Somespeechrecognizershavetheabilitytoadapttoaspeaker.Whenthesystemhasthisability,itmayallowtrainingtotakeplace.
Cont… 
Speaker Dependent Systems: Speech recognition systems that require a user to train the system to his/her voice are known as speaker-dependent systems. If you are familiar with desktop dictation systems, most are speaker dependent like IBM Via Voice. 
Speaker Independent Systems: 
Speech recognition systems that do not require a user to train the system are known as speaker-independent systems.
How do humans do it? 
Articulation produces sound waves which the ear conveys to the brain for processing
How might computers do it? 
Digitization 
Acoustic analysis of the speech signal 
Linguistic interpretation 
Acoustic waveform 
Acoustic signal 
Speech recognition
How Speech Recognition Work? 
•Audio input 
•Apply a "grammar" so the speech recognizer knows what phonemes to expect. 
•Acoustic Model 
•Recognized text
How do computers do it? 
•First, the user gives a voice command over the microphone, which is passed to the sound card in your system. This analog signal is sampled converted into digital form using a technique called Pulse Code Modulation or PCM. This digital waveform is a stream of amplitudes that look like a wavy line. 
•The audio signal is further sampled and each sample is converted into a frequency domain. So, the incoming stream is now a set of discrete frequency bands, in a form that can be used by the speech recognizer. 
•The next stage involves recognizing these bands of frequencies. For this, the speech recognition software has a database containing thousands of frequencies or "phonemes", as they’re called.
Hardware: 
Sound Cards 
Soundcard with the cleanest A/D (Analog to Digital) conversions are recommended. 
Microphone 
The best choice for microphone is the headset style. 
Computers / Processors 
The more the speed the better Speech Recognition would work. For good Speech Recognition you should be having 1 GHz processor and 1 GB of RAM.
Where can it be used? 
•GPS: System control/navigation e.g. GPS-connected digital maps: “How far is it to the motorway junction?” 
•Commercial/Industrial applicationsin-car steering systems 
•Mobile telephony: Voice dialing hands-free use of mobile in car e.g. “Dial office” 
•Home automation -heating, ventilation and air conditioning
Where can it be used? 
•Military: System control/navigation e.g. Military -High-performance fighter aircraft, Helicopters, Training Air Traffic Controllers 
•Computer and Video Games: Speech input has been used in a limited number of computer and video games. The Microsoft Xbox, Nintendo GameCube, and Sony PlayStation 2 consoles all offer games with speech input/output. 
•Usage in education -Students who are blind 
•Voice Security System: security locks of gates and doors 
•Wearable Computers: The most futuristic application is in the use and functionality of wearable computers.
Speech Recognition Software 
•Dragon Naturally Speeking 
•IBM Via Voice 
•Microsoft Speech Recognition System 
•MacSpeechDictate 
•Philips Speech Magic
Pros of Speech Recognition 
•Faster than “hand-writing”. 
•Allows for better spelling, whether it be in text or documents. 
•Helpful for people with a mental or physical disability . 
•Hands-free capability .
Cons of Speech Recognition 
•No program is 100% perfect 
•Factors that affect the accuracy of speech recognition are: slang, homonyms, signal-to-noise ratio, and overlapping speech 
•Can be expensive depending on the program 
•Easily misinterprets vocal commands e.gSIRI
Conclusion 
•Revolutionize the way people conduct business over the Web and ,differentiate world-class e-businesses. 
•VoiceXMLties speech recognition and telephony together 
•voice-enabled Web solutions TODAY!
Generators: 
•Software generators are programs that build other programs. In computer science, a generator is a special routine that can be used to control the iteration behavior of a loop. In fact, all generators are iterators. 
•A generator is very similar to a function that returns an array, in that a generator has parameters, can be called, and generates a sequence of values. However, instead of building an array containing all the values and returning them all at once, a generator yields the values one at a time, which requires less memory and allows the caller to get started processing the first few values immediately. In short, a generator looks like a function but behaves like an iterator.
Types of software generators: 
•key generator(key-gen) 
•RandomPassword Generators 
•Code generator 
•Natural language generator 
•Random test generator 
•Pseudorandom number generator
key generator(key-gen) 
•Akeygenerator(key-gen)isacomputerprogramthatgeneratesaproductlicensingkey,suchasaserialnumber,necessarytoactivateforuseasoftwareapplication. 
•Key-gensmaybelegitimatelydistributedbysoftwaremanufacturersforlicensingsoftwareincommercialenvironmentswheresoftwarehasbeenlicensedinbulkforanentiresiteorenterprise,ortheymaybedistributedillegitimatelyincircumstancesofcopyrightinfringementorsoftwarepiracy. 
•Asoftwarelicenseisalegalinstrumentthatgovernstheusageanddistributionofcomputersoftware. 
•Illegitimatekeygeneratorsaretypicallydistributedbysoftwarecrackerse.gkey-gensusedtocrackfakeWindowsOSe.gWindows8arealreadyavailable
Random password generator 
•Arandompasswordgeneratorissoftwareprogramorhardwaredevicethattakesinputfromarandomorpseudo- randomnumbergeneratorandautomaticallygeneratesapassword.Randompasswordscanbegeneratedmanually,usingsimplesourcesofrandomnesssuchasdiceorcoins,ortheycanbegeneratedusingacomputer. 
•Whiletherearemanyexamplesof"random"passwordgeneratorprogramsavailableontheInternet,generatingrandomnesscanbetrickyandmanyprogramsdonotgeneraterandomcharactersinawaythatensuresstrongsecurity.Acommonrecommendationistouseopensourcesecuritytoolswherepossible,sincetheyallowindependentchecksonthequalityofthemethodsused.Notethatsimplygeneratingapasswordatrandomdoesnotensurethepasswordisastrongpassword,becauseitispossible,althoughhighlyunlikely,togenerateaneasilyguessedorcrackedpassword.Infactthereisnoneedatallforapasswordtohavebeenproducedbyaperfectlyrandomprocess:itjustneedstobesufficientlydifficulttoguess.
Pseudorandom number generators 
•Apseudorandom number generator(PRNG), also known as adeterministic random bit generator(DRBG),is analgorithmfor generating a sequence of numbers whose properties approximate the properties of sequences ofrandom numbers. 
•Although sequences that are closer to truly random can be generated using hardware random number generators, pseudorandom number generators are important in practice for their speed in number generation and their reproducibility.
Code generator 
•In computing, code generation is the process by which a compiler's code generator converts some intermediate representation of source code into a form (e.g., machine code) that can be readily executed by a machine. 
•Sophisticated compilers typically perform multiple passes over various intermediate forms. This multi-stage process is used because many algorithms for code optimization are easier to apply one at a time, or because the input to one optimization relies on the completed processing performed by another optimization.
Men have become the tools of their tools. -P. Kahoro 
The End

More Related Content

What's hot

Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSubmissionResearchpa
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Speech recognition-using-wavelet-transform
Speech recognition-using-wavelet-transformSpeech recognition-using-wavelet-transform
Speech recognition-using-wavelet-transformvidhateswapnil
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data miningJimit Rupani
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentationsamyakbhuta
 
Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01girishjoshi1234
 
Text to Speech for Mobile Voice
Text to Speech for Mobile Voice Text to Speech for Mobile Voice
Text to Speech for Mobile Voice June Hostetter
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemREHMAT ULLAH
 
Introduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechIntroduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechNgwe Tun
 
AUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYAUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYIJCERT
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxNsaroj kumar
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesHarshad Karmarkar
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentationhimanshubhatti
 

What's hot (20)

Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speech
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Speech recognition-using-wavelet-transform
Speech recognition-using-wavelet-transformSpeech recognition-using-wavelet-transform
Speech recognition-using-wavelet-transform
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentation
 
Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Text to Speech for Mobile Voice
Text to Speech for Mobile Voice Text to Speech for Mobile Voice
Text to Speech for Mobile Voice
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition system
 
Introduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-SpeechIntroduction to myanmar Text-To-Speech
Introduction to myanmar Text-To-Speech
 
AUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYAUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEY
 
Voice recognition
Voice recognitionVoice recognition
Voice recognition
 
project indesh
project indeshproject indesh
project indesh
 
An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
TEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptxTEXT-SPEECH PPT.pptx
TEXT-SPEECH PPT.pptx
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devices
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 

Viewers also liked

משימה 1 מדעים כיתה ח
משימה 1 מדעים כיתה חמשימה 1 מדעים כיתה ח
משימה 1 מדעים כיתה חmuhmadbdran
 
Manual para-realizar-estudios-de-prefactibilidad-y-factibilidad
Manual para-realizar-estudios-de-prefactibilidad-y-factibilidadManual para-realizar-estudios-de-prefactibilidad-y-factibilidad
Manual para-realizar-estudios-de-prefactibilidad-y-factibilidadAlicia Quispe
 
сейсмология
сейсмологиясейсмология
сейсмологияUlanenko
 
Кама сутра Media Relations или 50 оттенков Медиа
Кама сутра Media Relations или 50 оттенков МедиаКама сутра Media Relations или 50 оттенков Медиа
Кама сутра Media Relations или 50 оттенков МедиаФедор Крикун
 
Media Evaluation Question 3
Media Evaluation Question 3Media Evaluation Question 3
Media Evaluation Question 3laurentolhurst
 
Il congiuntivo imperfetto e trapassato
Il congiuntivo imperfetto e trapassatoIl congiuntivo imperfetto e trapassato
Il congiuntivo imperfetto e trapassatoDanilo Buccarello
 
Alignment of the New Orleans Citywide Master Plan and the BioDistrict
Alignment of the New Orleans Citywide Master Plan and the BioDistrict Alignment of the New Orleans Citywide Master Plan and the BioDistrict
Alignment of the New Orleans Citywide Master Plan and the BioDistrict Sugati - Natural Wellness Brands
 
Freedomof informationact(jones)(2012)
Freedomof informationact(jones)(2012)Freedomof informationact(jones)(2012)
Freedomof informationact(jones)(2012)aoconno2
 
اكمل المتواليات الاتية
اكمل المتواليات الاتيةاكمل المتواليات الاتية
اكمل المتواليات الاتيةmuhmadbdran
 
Intel SUSE Texperts Webinar
Intel SUSE Texperts WebinarIntel SUSE Texperts Webinar
Intel SUSE Texperts WebinarDirk Oppenkowski
 
Hari konvokesyen222
Hari konvokesyen222Hari konvokesyen222
Hari konvokesyen222syahreena
 
Cсправочник по продукции LR
Cсправочник по продукции LRCсправочник по продукции LR
Cсправочник по продукции LRt575ae
 
Diapositivas
DiapositivasDiapositivas
Diapositivassara1309
 

Viewers also liked (20)

משימה 1 מדעים כיתה ח
משימה 1 מדעים כיתה חמשימה 1 מדעים כיתה ח
משימה 1 מדעים כיתה ח
 
Manual para-realizar-estudios-de-prefactibilidad-y-factibilidad
Manual para-realizar-estudios-de-prefactibilidad-y-factibilidadManual para-realizar-estudios-de-prefactibilidad-y-factibilidad
Manual para-realizar-estudios-de-prefactibilidad-y-factibilidad
 
сейсмология
сейсмологиясейсмология
сейсмология
 
Кама сутра Media Relations или 50 оттенков Медиа
Кама сутра Media Relations или 50 оттенков МедиаКама сутра Media Relations или 50 оттенков Медиа
Кама сутра Media Relations или 50 оттенков Медиа
 
Media Evaluation Question 3
Media Evaluation Question 3Media Evaluation Question 3
Media Evaluation Question 3
 
Il congiuntivo imperfetto e trapassato
Il congiuntivo imperfetto e trapassatoIl congiuntivo imperfetto e trapassato
Il congiuntivo imperfetto e trapassato
 
Conexindia
ConexindiaConexindia
Conexindia
 
Ta1 7º ano p1
Ta1 7º ano p1Ta1 7º ano p1
Ta1 7º ano p1
 
Alignment of the New Orleans Citywide Master Plan and the BioDistrict
Alignment of the New Orleans Citywide Master Plan and the BioDistrict Alignment of the New Orleans Citywide Master Plan and the BioDistrict
Alignment of the New Orleans Citywide Master Plan and the BioDistrict
 
Freedomof informationact(jones)(2012)
Freedomof informationact(jones)(2012)Freedomof informationact(jones)(2012)
Freedomof informationact(jones)(2012)
 
Si spersonalizzante
Si spersonalizzanteSi spersonalizzante
Si spersonalizzante
 
اكمل المتواليات الاتية
اكمل المتواليات الاتيةاكمل المتواليات الاتية
اكمل المتواليات الاتية
 
Intel SUSE Texperts Webinar
Intel SUSE Texperts WebinarIntel SUSE Texperts Webinar
Intel SUSE Texperts Webinar
 
Hand Swage Cable Railing Basics
Hand Swage Cable Railing BasicsHand Swage Cable Railing Basics
Hand Swage Cable Railing Basics
 
Taurus y bovina
Taurus y bovinaTaurus y bovina
Taurus y bovina
 
2 3 task
2 3 task2 3 task
2 3 task
 
Hari konvokesyen222
Hari konvokesyen222Hari konvokesyen222
Hari konvokesyen222
 
Cсправочник по продукции LR
Cсправочник по продукции LRCсправочник по продукции LR
Cсправочник по продукции LR
 
Diapositivas
DiapositivasDiapositivas
Diapositivas
 
ICN Porträts
ICN PorträtsICN Porträts
ICN Porträts
 

Similar to Speech recognizers & generators

Artificial Intelligence- An Introduction
Artificial Intelligence- An IntroductionArtificial Intelligence- An Introduction
Artificial Intelligence- An Introductionacemindia
 
Artificial Intelligence - An Introduction
Artificial Intelligence - An Introduction Artificial Intelligence - An Introduction
Artificial Intelligence - An Introduction acemindia
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceIlhaan Marwat
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySrijanKumar18
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech RecognitionThejus Joby
 
General Speereo Technology
General Speereo TechnologyGeneral Speereo Technology
General Speereo TechnologyDaniel Ischenko
 
Artificial intelligence - research areas
Artificial intelligence - research areasArtificial intelligence - research areas
Artificial intelligence - research areasLearnbay Datascience
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition Goa App
 
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...nehachhh
 
VIRTUAL PERSONAL ASSISTANT.pdf
VIRTUAL PERSONAL ASSISTANT.pdfVIRTUAL PERSONAL ASSISTANT.pdf
VIRTUAL PERSONAL ASSISTANT.pdfAnkushSolanki6
 
Speech recognition - how does it work?
Speech recognition - how does it work?Speech recognition - how does it work?
Speech recognition - how does it work?CarterRodriguez6
 
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET- Voice Command Execution with Speech Recognition and SynthesizerIRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET- Voice Command Execution with Speech Recognition and SynthesizerIRJET Journal
 

Similar to Speech recognizers & generators (20)

Artificial Intelligence- An Introduction
Artificial Intelligence- An IntroductionArtificial Intelligence- An Introduction
Artificial Intelligence- An Introduction
 
Artificial Intelligence - An Introduction
Artificial Intelligence - An Introduction Artificial Intelligence - An Introduction
Artificial Intelligence - An Introduction
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Presentation.ai
Presentation.aiPresentation.ai
Presentation.ai
 
Assign
AssignAssign
Assign
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Seminar
SeminarSeminar
Seminar
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
General Speereo Technology
General Speereo TechnologyGeneral Speereo Technology
General Speereo Technology
 
Artificial intelligence - research areas
Artificial intelligence - research areasArtificial intelligence - research areas
Artificial intelligence - research areas
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
 
VIRTUAL PERSONAL ASSISTANT.pdf
VIRTUAL PERSONAL ASSISTANT.pdfVIRTUAL PERSONAL ASSISTANT.pdf
VIRTUAL PERSONAL ASSISTANT.pdf
 
Speech Recognition System
Speech Recognition SystemSpeech Recognition System
Speech Recognition System
 
Speech recognition - how does it work?
Speech recognition - how does it work?Speech recognition - how does it work?
Speech recognition - how does it work?
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Desktop assistant
Desktop assistant Desktop assistant
Desktop assistant
 
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET- Voice Command Execution with Speech Recognition and SynthesizerIRJET- Voice Command Execution with Speech Recognition and Synthesizer
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
 

Recently uploaded

Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsMehedi Hasan Shohan
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 

Recently uploaded (20)

Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 

Speech recognizers & generators

  • 1. Speech Recognizers & Generators Let’s Get Started… Presented by: P. Kahoro Presented to: Prof P. Okanda
  • 2. Speech Recognizers: What are they? A Speech is the vocalized form of human communication. Incomputer scienceandelectrical engineering,speech recognition(SR) is the translation of spoken words into text. It is also known as "automatic speech recognition" (ASR). Speech Recognition (SR) is the ability to translate a dictation or spoken word to text. -Speech recognition has evolved quite a bit over the past few years. Initially, it used to work in discrete dictation mode, where you had to pause between each spoken word. Today, however, it uses continuous dictation. It’s also become smarter, with its own set of grammar rules to make out the meaning of what’s being said.
  • 3. Terms and Concepts •Utterances •Pronounciation •Grammer •Speaker Dependent System •Speaker Independent System •Training •Accuracy
  • 4. Terms &Concepts Utterances: An utterance is any stream of speech between two periods of silence. Silence delineates the start and end of an utterance. An utterance can be a single word, or it can contain multiple words (a phrase or a sentence) Pronunciations: One piece of information that the speech recognition engine uses to process a word is its pronunciation, which represents what the speech engine thinks a word should sound like. Words can have multiple pronunciations associated with them. For example, the word “the” has at least two pronunciations in the U.S. English language: “thee” and “thuh”.
  • 5. Cont… Grammar: Grammars define the domain, or context, within which the recognition engine works. The engine compares the current utterance against the words and phrases in the active grammars. If the user says something that is not in the grammar, the speech engine will not be able to understand it correctly. So usually speech engines have a very vast grammar. Accuracy: The ability of a recognizer can be examined by measuring its accuracy − or how well it recognizes utterances. Training: Somespeechrecognizershavetheabilitytoadapttoaspeaker.Whenthesystemhasthisability,itmayallowtrainingtotakeplace.
  • 6. Cont… Speaker Dependent Systems: Speech recognition systems that require a user to train the system to his/her voice are known as speaker-dependent systems. If you are familiar with desktop dictation systems, most are speaker dependent like IBM Via Voice. Speaker Independent Systems: Speech recognition systems that do not require a user to train the system are known as speaker-independent systems.
  • 7. How do humans do it? Articulation produces sound waves which the ear conveys to the brain for processing
  • 8. How might computers do it? Digitization Acoustic analysis of the speech signal Linguistic interpretation Acoustic waveform Acoustic signal Speech recognition
  • 9. How Speech Recognition Work? •Audio input •Apply a "grammar" so the speech recognizer knows what phonemes to expect. •Acoustic Model •Recognized text
  • 10. How do computers do it? •First, the user gives a voice command over the microphone, which is passed to the sound card in your system. This analog signal is sampled converted into digital form using a technique called Pulse Code Modulation or PCM. This digital waveform is a stream of amplitudes that look like a wavy line. •The audio signal is further sampled and each sample is converted into a frequency domain. So, the incoming stream is now a set of discrete frequency bands, in a form that can be used by the speech recognizer. •The next stage involves recognizing these bands of frequencies. For this, the speech recognition software has a database containing thousands of frequencies or "phonemes", as they’re called.
  • 11. Hardware: Sound Cards Soundcard with the cleanest A/D (Analog to Digital) conversions are recommended. Microphone The best choice for microphone is the headset style. Computers / Processors The more the speed the better Speech Recognition would work. For good Speech Recognition you should be having 1 GHz processor and 1 GB of RAM.
  • 12. Where can it be used? •GPS: System control/navigation e.g. GPS-connected digital maps: “How far is it to the motorway junction?” •Commercial/Industrial applicationsin-car steering systems •Mobile telephony: Voice dialing hands-free use of mobile in car e.g. “Dial office” •Home automation -heating, ventilation and air conditioning
  • 13. Where can it be used? •Military: System control/navigation e.g. Military -High-performance fighter aircraft, Helicopters, Training Air Traffic Controllers •Computer and Video Games: Speech input has been used in a limited number of computer and video games. The Microsoft Xbox, Nintendo GameCube, and Sony PlayStation 2 consoles all offer games with speech input/output. •Usage in education -Students who are blind •Voice Security System: security locks of gates and doors •Wearable Computers: The most futuristic application is in the use and functionality of wearable computers.
  • 14. Speech Recognition Software •Dragon Naturally Speeking •IBM Via Voice •Microsoft Speech Recognition System •MacSpeechDictate •Philips Speech Magic
  • 15. Pros of Speech Recognition •Faster than “hand-writing”. •Allows for better spelling, whether it be in text or documents. •Helpful for people with a mental or physical disability . •Hands-free capability .
  • 16. Cons of Speech Recognition •No program is 100% perfect •Factors that affect the accuracy of speech recognition are: slang, homonyms, signal-to-noise ratio, and overlapping speech •Can be expensive depending on the program •Easily misinterprets vocal commands e.gSIRI
  • 17. Conclusion •Revolutionize the way people conduct business over the Web and ,differentiate world-class e-businesses. •VoiceXMLties speech recognition and telephony together •voice-enabled Web solutions TODAY!
  • 18. Generators: •Software generators are programs that build other programs. In computer science, a generator is a special routine that can be used to control the iteration behavior of a loop. In fact, all generators are iterators. •A generator is very similar to a function that returns an array, in that a generator has parameters, can be called, and generates a sequence of values. However, instead of building an array containing all the values and returning them all at once, a generator yields the values one at a time, which requires less memory and allows the caller to get started processing the first few values immediately. In short, a generator looks like a function but behaves like an iterator.
  • 19. Types of software generators: •key generator(key-gen) •RandomPassword Generators •Code generator •Natural language generator •Random test generator •Pseudorandom number generator
  • 20. key generator(key-gen) •Akeygenerator(key-gen)isacomputerprogramthatgeneratesaproductlicensingkey,suchasaserialnumber,necessarytoactivateforuseasoftwareapplication. •Key-gensmaybelegitimatelydistributedbysoftwaremanufacturersforlicensingsoftwareincommercialenvironmentswheresoftwarehasbeenlicensedinbulkforanentiresiteorenterprise,ortheymaybedistributedillegitimatelyincircumstancesofcopyrightinfringementorsoftwarepiracy. •Asoftwarelicenseisalegalinstrumentthatgovernstheusageanddistributionofcomputersoftware. •Illegitimatekeygeneratorsaretypicallydistributedbysoftwarecrackerse.gkey-gensusedtocrackfakeWindowsOSe.gWindows8arealreadyavailable
  • 21. Random password generator •Arandompasswordgeneratorissoftwareprogramorhardwaredevicethattakesinputfromarandomorpseudo- randomnumbergeneratorandautomaticallygeneratesapassword.Randompasswordscanbegeneratedmanually,usingsimplesourcesofrandomnesssuchasdiceorcoins,ortheycanbegeneratedusingacomputer. •Whiletherearemanyexamplesof"random"passwordgeneratorprogramsavailableontheInternet,generatingrandomnesscanbetrickyandmanyprogramsdonotgeneraterandomcharactersinawaythatensuresstrongsecurity.Acommonrecommendationistouseopensourcesecuritytoolswherepossible,sincetheyallowindependentchecksonthequalityofthemethodsused.Notethatsimplygeneratingapasswordatrandomdoesnotensurethepasswordisastrongpassword,becauseitispossible,althoughhighlyunlikely,togenerateaneasilyguessedorcrackedpassword.Infactthereisnoneedatallforapasswordtohavebeenproducedbyaperfectlyrandomprocess:itjustneedstobesufficientlydifficulttoguess.
  • 22. Pseudorandom number generators •Apseudorandom number generator(PRNG), also known as adeterministic random bit generator(DRBG),is analgorithmfor generating a sequence of numbers whose properties approximate the properties of sequences ofrandom numbers. •Although sequences that are closer to truly random can be generated using hardware random number generators, pseudorandom number generators are important in practice for their speed in number generation and their reproducibility.
  • 23. Code generator •In computing, code generation is the process by which a compiler's code generator converts some intermediate representation of source code into a form (e.g., machine code) that can be readily executed by a machine. •Sophisticated compilers typically perform multiple passes over various intermediate forms. This multi-stage process is used because many algorithms for code optimization are easier to apply one at a time, or because the input to one optimization relies on the completed processing performed by another optimization.
  • 24. Men have become the tools of their tools. -P. Kahoro The End