Speech Recognition
Kimberlee A. Kemble
Program Manager, Voice Systems Middleware Education
IBM Corporation

Presenter: Sajana.A, S2-ELT
Agenda
• What is speech recognition?
• A closer look
• Terms & concepts
• Components
• How it works
• Pros & cons
• Applications
What is speech recognition?
• Speech recognition (SR) is the ability to translate dictation or spoken words into text.
• It is also known as “automatic speech recognition” (ASR), “computer speech recognition”, or “speech to text” (STT).
A closer look
• A speech recognition engine serves two kinds of application:
1. Command and control application: the application interprets the result of the recognition as a command.
2. Dictation application: the application handles the recognized text simply as text.
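The difference between the two application types can be sketched in a few lines of Python. This is only an illustration, not any engine's API: the function names, the COMMANDS table, and the example phrases are assumptions made up for this sketch.

```python
from typing import Callable, Dict, List


def handle_command(text: str, commands: Dict[str, Callable[[], str]]) -> str:
    """Command and control: interpret the recognized text as a command."""
    action = commands.get(text.strip().lower())
    if action is None:
        return "unrecognized command"
    return action()


def handle_dictation(text: str, document: List[str]) -> None:
    """Dictation: keep the recognized text simply as text."""
    document.append(text)


# Hypothetical command table, for illustration only.
COMMANDS = {"open file": lambda: "opening file"}
```

The same recognized phrase, "Open file", triggers an action in the first case and is stored unchanged in the second.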
Terms & concepts
• Utterances
1. An utterance is any stream of speech between two periods of silence.
2. Silence delineates the start and end of an utterance.
3. An utterance can be a single word, or it can contain multiple words (a phrase or a sentence).
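One simple way to find utterance boundaries is to threshold the energy of short audio frames and treat a long enough run of quiet frames as silence. The sketch below assumes per-frame energies have already been computed; the function name, the threshold, and the minimum silence length are illustrative choices, and real engines generally use more robust detection.

```python
from typing import List, Tuple


def split_utterances(
    energies: List[float],
    silence_threshold: float,
    min_silence_frames: int,
) -> List[Tuple[int, int]]:
    """Return (start, end) frame indices, end exclusive, for each utterance."""
    utterances = []
    start = None      # first frame of the utterance in progress
    silent_run = 0    # consecutive quiet frames seen inside that utterance
    for i, energy in enumerate(energies):
        if energy > silence_threshold:
            if start is None:
                start = i
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # Enough silence: close the utterance at its last loud frame.
                utterances.append((start, i - silent_run + 1))
                start = None
                silent_run = 0
    if start is not None:
        # Audio ended mid-utterance: drop any trailing quiet frames.
        utterances.append((start, len(energies) - silent_run))
    return utterances
```

A short pause (fewer than min_silence_frames quiet frames) stays inside the utterance, which is how a multi-word phrase can count as a single utterance.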
Terms & concepts (continued)
• Pronunciations
– A pronunciation represents what the speech engine thinks a word should sound like.
• Grammars
– A grammar uses a particular syntax, or set of rules, to define the words and phrases that the engine can recognize.
– Grammars define the domain, or context, within which the recognition engine works.
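To make the idea of a grammar concrete, here is a toy rule set written as a Python dictionary and expanded into the full set of phrases it allows. The rule names, the vocabulary, and the dictionary format are invented for this sketch; real engines use their own grammar formats.

```python
from itertools import product
from typing import Dict, List, Set

# Hypothetical toy grammar: each rule maps to a list of alternatives,
# and each alternative is a sequence of rule names or literal words.
GRAMMAR: Dict[str, List[List[str]]] = {
    "<command>": [["<action>", "<object>"]],
    "<action>": [["open"], ["close"]],
    "<object>": [["file"], ["window"]],
}


def expand(symbol: str, grammar: Dict[str, List[List[str]]]) -> Set[str]:
    """Return every phrase that the given rule (or literal word) can produce."""
    if symbol not in grammar:
        return {symbol}  # a literal word stands for itself
    phrases: Set[str] = set()
    for alternative in grammar[symbol]:
        parts = [expand(s, grammar) for s in alternative]
        for combo in product(*parts):
            phrases.add(" ".join(combo))
    return phrases
```

Expanding "<command>" yields four phrases such as "open file" and "close window". Anything else, for example "delete file", lies outside the domain this grammar defines.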
Terms & concepts (continued)
• Speaker-dependent systems
– Require “training” to “teach” the system the individual speaker's voice
– More robust
– But less convenient
– And obviously less portable

• Speaker-independent systems
– Language coverage is reduced to compensate for the need to be flexible in phoneme identification
– A clever compromise is to learn on the fly
Components
• Audio input
• Grammar
• Speech recognition engine
• Acoustic model
• Recognized text

TheMicrophoneStore.com
KnowBrainer.com
How it works
[Diagram: Grammar, Audio input, Speech recognition engine, Acoustic model, Recognized text]
Process
Here’s another look at how SRS works...

Source: Preeti Saini and Parneet Kaur, Automatic Speech Recognition: A Review
Acceptance and Rejection
• An accepted utterance is one for which the engine returns recognized text.
• The engine returns a confidence score along with the text to indicate the likelihood that the returned text is correct.
• Not all utterances that are processed by the speech engine are accepted.
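An application can use the confidence score to decide whether to act on the returned text. The sketch below is an assumption-laden illustration: the RecognitionResult type, the 0.0 to 1.0 score range, and the 0.5 default threshold are invented here and are not taken from any particular engine.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RecognitionResult:
    text: str
    confidence: float  # assumed range 0.0 to 1.0; higher means more likely correct


def accept(result: RecognitionResult, threshold: float = 0.5) -> Optional[str]:
    """Return the recognized text if its confidence meets the threshold, else None."""
    if result.confidence >= threshold:
        return result.text
    return None
```

Raising the threshold trades more rejections (and more requests to repeat) for fewer wrongly accepted results, which matters most in command and control applications.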
What’s hard about that?
• Digitization
– Converting analogue signal into digital representation.

• Signal processing
– Separating speech from background noise.

• Phonetics
– Variability in human speech.

• Phonology
– Recognizing individual sound distinctions (similar phonemes).

• Lexicology and syntax
– Disambiguating homophones.
– Features of continuous speech.

• Syntax and pragmatics
– Interpreting features.
– Filtering of performance errors (disfluencies).
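The first item, digitization, comes down to sampling the analogue signal at regular intervals and rounding each sample to one of a fixed number of levels. A minimal sketch, assuming the analogue signal can be modelled as a Python function of time with values between -1.0 and 1.0; the function name and the parameter values in the usage note are illustrative.

```python
import math
from typing import Callable, List


def digitize(
    signal: Callable[[float], float],
    duration_s: float,
    sample_rate_hz: int,
    bits: int,
) -> List[int]:
    """Sample a continuous signal and quantize each sample to a signed integer."""
    max_level = 2 ** (bits - 1) - 1          # e.g. 32767 for 16-bit audio
    n_samples = round(duration_s * sample_rate_hz)
    samples = []
    for i in range(n_samples):
        t = i / sample_rate_hz               # time of the i-th sample in seconds
        amplitude = max(-1.0, min(1.0, signal(t)))  # clip to the valid range
        samples.append(round(amplitude * max_level))
    return samples
```

For example, digitize(lambda t: math.sin(2 * math.pi * 440 * t), 0.01, 16000, 16) turns ten milliseconds of a 440 Hz tone into 160 integers. A lower sample rate or fewer bits discards detail that the later stages can no longer recover.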
The Uses
• Individuals with disabilities – Assists those who have visual impairment, hand immobility, dyslexia, etc.
• Medical transcription – Reduces delays in writing out medical transcriptions.
• Dictation – Converts spoken words to text in emails or other word-processing documents (also helpful for English Language Learners).
• Access menu commands – Opens files using voice commands.
Applications of Speech Recognition
• Speech recognition applications include:
– Voice dialling (e.g., "Call home")
– Call routing (e.g., "I would like to make a collect call")
– Simple data entry (e.g., entering a credit card number)
– Preparation of structured documents (e.g., a radiology report)
– Speech-to-text processing (e.g., word processors or emails)
– Aircraft cockpits (usually termed Direct Voice Input)
Applications
• Medical Transcription
• Military
• Telephony and other domains
• Serving the disabled
Further Applications
• Home automation
• Automobile audio systems
• Telematics
TheMicrophoneStore.com
KnowBrainer.com
Pros of Speech Recognition
• Faster than “hand-writing”.
• Allows for better spelling, whether in text or documents.
• Helpful for people with a mental or physical disability.
• Hands-free capability.
Cons of Speech Recognition
• No program is 100% perfect.
• Factors that affect the accuracy of speech recognition include slang, homonyms, signal-to-noise ratio, and overlapping speech.
• Can be expensive, depending on the program.
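Of the accuracy factors listed, signal-to-noise ratio is the one with a standard numeric definition: the ratio of signal power to noise power, usually quoted in decibels. A minimal sketch, assuming the speech and noise samples are available separately (in practice the noise level usually has to be estimated); the function names and sample values are illustrative.

```python
import math
from typing import Sequence


def mean_power(samples: Sequence[float]) -> float:
    """Average power of a sampled signal (mean of the squared samples)."""
    return sum(x * x for x in samples) / len(samples)


def snr_db(signal: Sequence[float], noise: Sequence[float]) -> float:
    """Signal-to-noise ratio in decibels: 10 * log10(P_signal / P_noise)."""
    return 10 * math.log10(mean_power(signal) / mean_power(noise))
```

With a signal of amplitude 1 and noise of amplitude 0.1, the power ratio is about 100, so snr_db returns roughly 20 dB. The function assumes the noise is not all zeros.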
Programs

Now let’s take a look at some of the many SRS programs...
Dragon
Siri
Indigo
KnowBrainer.com
Using Dragon Mobile

ftp://public.dhe.ibm.com/software/pervasive/info/products/Introduction_to_Speech_Recognition.pdf
Different Home Appliances Control Scenarios

http://en.wikipedia.org/wiki/VoiceXML
The Future of Assistive Technology in Schools
• Students who need assistance with their writing skills because they have stronger oral skills.
• Students who were absent for a class, have poor memory, or need assistance hearing the lesson.
• Students who need assistance during Guided Reading.
• Students who are English Language Learners.
• Students with visual/hearing impairments and learning disabilities regarding reading/spelling/writing.
Conclusion
• Speech recognition can revolutionize the way people conduct business over the Web and differentiate world-class e-businesses.
• VoiceXML ties speech recognition and telephony together.
• Voice-enabled Web solutions are available TODAY!
thank you!

Speech recognition An overview

  • 1. Speech Recognition Kimberlee A. Kemble Program Manager, Voice Systems Middleware Education IBM Corporation Presenter: Sajana.A S2-ELT
  • 2. Agenda • • • • • • • What is speech Recognition?? Closer look Terms & concepts Components How it works?? Pros & cons Applications
  • 3. What is speech recognition?   Speech Recognition (SR) is the ability to translate a dictation or spoken word to text. Also known as “automatic speech recognition” (ASR), “computer speech recognition”, or “speech to text” (STT)
  • 4. A Closer look!!! • Speech recognition engine 1. Command and control application The application can interpret the result of the recognition as a command. 2. Dictation application Application handles the recognized text simply as text.
  • 5. Terms & Concepts
      • Utterances
          1. An utterance is any stream of speech between two periods of silence.
          2. Silence delineates the start and end of an utterance.
          3. An utterance can be a single word, or it can contain multiple words (a phrase or a sentence).
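A minimal sketch of how silence can delineate utterances, assuming the audio has already been reduced to one energy value per frame. The function name, the threshold of 0.1, and the minimum of three silent frames are assumptions made for illustration, not values from the slides.

```python
def split_utterances(frame_energies, silence_threshold=0.1, min_silence_frames=3):
    """Return (start, end) frame index pairs, end exclusive, one per utterance."""
    utterances = []
    start = None      # index where the current utterance began, if any
    silent_run = 0    # consecutive silent frames seen inside the utterance
    for i, energy in enumerate(frame_energies):
        if energy >= silence_threshold:
            if start is None:
                start = i
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # A long enough silence closes the utterance at its last speech frame.
                utterances.append((start, i - silent_run + 1))
                start = None
                silent_run = 0
    if start is not None:
        # The audio ended while an utterance was still open.
        utterances.append((start, len(frame_energies) - silent_run))
    return utterances
```

Short pauses (fewer than `min_silence_frames` frames) stay inside an utterance, which is why a phrase or sentence can count as a single utterance.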
  • 6. Terms & Concepts (continued)
      • Pronunciations
          – A pronunciation represents what the speech engine thinks a word should sound like.
      • Grammars
          – A grammar uses a particular syntax, or set of rules, to define the words and phrases that can be recognized by the engine.
          – Grammars define the domain, or context, within which the recognition engine works.
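As a rough illustration of a grammar that defines which phrases can be recognized, the sketch below expands a tiny rule into its allowed phrases. The slot names, the word lists, and the dictionary format are invented for this example; real engines use their own grammar syntax.

```python
from itertools import product

# Hypothetical toy grammar: a phrase is one "action" word followed by one "object" word.
GRAMMAR = {
    "action": ["open", "close"],
    "object": ["file", "window"],
}
RULE = ["action", "object"]


def phrases(grammar, rule):
    """Expand the rule into the set of every phrase the grammar allows."""
    return {" ".join(words) for words in product(*(grammar[slot] for slot in rule))}


def in_grammar(text, grammar=GRAMMAR, rule=RULE):
    """Return True if the text is one of the phrases the grammar defines."""
    return text.strip().lower() in phrases(grammar, rule)
```

Anything outside the four allowed phrases, such as "delete file", falls outside the domain this grammar defines.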
  • 7. Terms & Concepts (continued)
      • Speaker-dependent systems
          – Require “training” to “teach” the system the individual speaker's voice
          – More robust
          – But less convenient
          – And obviously less portable
      • Speaker-independent systems
          – Language coverage is reduced to compensate for the need to be flexible in phoneme identification
          – A clever compromise is to learn on the fly
  • 8. Components
      • Audio input
      • Grammar
      • Speech Recognition Engine
      • Acoustic Model
      • Recognized text
      TheMicrophoneStore.com, KnowBrainer.com
  • 10. Process. Here’s another look at how SRS works... Source: Preeti Saini and Parneet Kaur, “Automatic Speech Recognition: A Review”
  • 11. Acceptance and Rejection
      • An accepted utterance is one in which the engine returns recognized text.
      • The engine returns a confidence score along with the text to indicate the likelihood that the returned text is correct.
      • Not all utterances that are processed by the speech engine are accepted.
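One way an application might act on the confidence score is a simple threshold check, sketched below. The function name, the result dictionary, and the default threshold of 0.5 are assumptions for illustration; the slides do not say how any particular engine decides between acceptance and rejection.

```python
def accept_or_reject(text, confidence, threshold=0.5):
    """Accept the utterance only if text was returned and the score clears the threshold."""
    if text and confidence >= threshold:
        return {"status": "accepted", "text": text, "confidence": confidence}
    # No text, or a score that is too low: treat the utterance as rejected.
    return {"status": "rejected", "text": None, "confidence": confidence}
```

A higher threshold rejects more utterances but lowers the chance of acting on misrecognized text.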
  • 12. What’s hard about that?
      • Digitization
          – Converting the analogue signal into a digital representation.
      • Signal processing
          – Separating speech from background noise.
      • Phonetics
          – Variability in human speech.
      • Phonology
          – Recognizing individual sound distinctions (similar phonemes).
      • Lexicology and syntax
          – Disambiguating homophones.
          – Features of continuous speech.
      • Syntax and pragmatics
          – Interpreting features.
          – Filtering of performance errors (disfluencies).
  • 13. The Uses
      • Individuals with disabilities
          – Assists those who have visual impairment, hand immobility, dyslexia, etc.
      • Medical transcription
          – Reduces delays in writing out medical transcriptions.
      • Dictation
          – Converts words to text in emails or other word documents (also helpful for English Language Learners).
      • Access menu commands
          – Opens files using voice commands.
  • 14. Applications of Speech Recognition
      • Speech recognition applications include:
          – Voice dialling (e.g., "Call home"),
          – Call routing (e.g., "I would like to make a collect call"),
          – Simple data entry (e.g., entering a credit card number),
          – Preparation of structured documents (e.g., a radiology report),
          – Speech-to-text processing (e.g., word processors or emails), and
          – In aircraft cockpits (usually termed Direct Voice Input).
  • 15. Applications
      • Medical transcription
      • Military
      • Telephony and other domains
      • Serving the disabled
      Further applications
      • Home automation
      • Automobile audio systems
      • Telematics
      TheMicrophoneStore.com, KnowBrainer.com
  • 16. Pros of Speech Recognition
      • Faster than “hand-writing”.
      • Allows for better spelling, whether it be in text or documents.
      • Helpful for people with a mental or physical disability.
      • Hands-free capability.
  • 17. Cons of Speech Recognition
      • No program is 100% perfect.
      • Factors that affect the accuracy of speech recognition are: slang, homonyms, signal-to-noise ratio, and overlapping speech.
      • Can be expensive depending on the program.
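Of the accuracy factors listed, signal-to-noise ratio is the one that is straightforward to quantify. The sketch below uses the standard definition, 10·log10 of signal power over noise power, and assumes that separate sample lists for the speech and for the background noise are available; the function name is illustrative.

```python
import math


def snr_db(signal_samples, noise_samples):
    """Signal-to-noise ratio in decibels, computed from mean squared amplitudes."""
    p_signal = sum(s * s for s in signal_samples) / len(signal_samples)
    p_noise = sum(n * n for n in noise_samples) / len(noise_samples)
    return 10 * math.log10(p_signal / p_noise)
```

For example, speech whose amplitude is ten times that of the noise has a power ratio of 100, which is about 20 dB.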
  • 18. Programs. Now let’s take a look at some of the many SRS programs...
      • Dragon
      • Siri
      • Indigo
      KnowBrainer.com
  • 20. Different Home Appliances Control Scenarios http://en.wikipedia.org/wiki/VoiceXML
  • 21. The Future of Assistive Technology in Schools
      • Students who need assistance in their writing skills because they have stronger oral skills.
      • Students who were absent for a class, have poor memory, or need assistance hearing the lesson.
      • Students who need assistance during Guided Reading.
      • Students who are English Language Learners.
      • Students with visual/hearing impairments and learning disabilities regarding reading/spelling/writing.
  • 22. Conclusion
      • Revolutionize the way people conduct business over the Web and differentiate world-class e-businesses.
      • VoiceXML ties speech recognition and telephony together.
      • Voice-enabled Web solutions TODAY!
  • 23. References
      • Kai-Fu Lee, Hsiao-Wuen Hon, and Raj Reddy, An Overview of the SPHINX Speech Recognition System. IEEE Transactions on Acoustics, Speech and Signal Processing.
      • Pellom, B., Sonic: The University of Colorado Continuous Speech Recognition System.
      • http://www.tldp.org/HOWTO/Speech-Recognition-HOWTO/index.html
      • http://www.zachary.com/s/xvoice
      • http://xvoice.sourceforge.net/
      • Willie Walker, Paul Lamere, Philip Kwok, Bhiksha Raj, Rita Singh, Evandro Gouvea, Peter Wolf, Joe Woelfel, Sphinx-4: A Flexible Open Source Framework for Speech Recognition.
      • A. Hagen, D. A. Connors, B. L. Pellom, The Analysis and Design of Architecture Systems