SlideShare a Scribd company logo
1 of 13
Speech Recognition
Prepended by :Huda Seyam
What is
• Speech recognition is the process of capturing
spoken words using a microphone or telephone and
converting them into a digitally stored set of words
• A subset of speech recognition is voice recognition,
which is the technology for identifying a person
based on their voice
History
• First known attempt at
speech recognition
• A toy from 1922
• Worked by analysing the
signal strength at 500Hz
• The most advanced version of currently developed ASR
technologies revolves around what is called
Natural Language Processing, or NLP in short.
This variant of ASR comes the closest to allowing real
conversation between people and machine intelligence and
though it still has a long way to go before reaching an apex
of development, we’re already seeing some remarkable
results in the form of intelligent smart phone interfaces like
the Siri program on the iPhone and other systems used in
business and advanced technology contexts.
Performance
The quality of a speech recognition
systems are assessed according to two
factors:
• its accuracy (error rate in converting
spoken words to digital data)
• speed (how well the software can
keep up with a human speaker).
Follow
• Today, developments in natural language
processing and neural network technology
have improved the speech and voice
technology, so much so that today it is
reportedly on par with humans.
• In 2017. For example, the word error rate
for Microsoft’s voice technology has been
recorded at 5.1 percent by the company,
while Google reports that it has reduced its
rate to 4.9 percent.
A Basic Primer
on How
Automatic
Speech
Recognition
Works
Applications
• speech to speech translation
• Dictation [google document]
• hands-free computing
• Robotics
• automated customer service, If you have
ever paid a bill over the phone using an
automated system
• Smart Speaker
Highlighting Amazon[Alexa]
The Tuning Test:
How ASR is made to “Learn”
from Humans
References
• https://www.techemergence.com/ai-for-
speech-recognition/
• https://usabilitygeek.com/automatic-
speech-recognition-asr-software-an-
introduction/
• https://www.streetdirectory.com/travel_gui
de/139545/technology/key_differences_bet
ween_speech_recognition_and_voice_recog
nition.html
• https://www.youtube.com/watch?v=q67z7P
TGRi8

More Related Content

What's hot

Adventures on the Road to Enterprise Virtual Assistants
Adventures on the Road to Enterprise Virtual AssistantsAdventures on the Road to Enterprise Virtual Assistants
Adventures on the Road to Enterprise Virtual Assistants
Editt Gonen-Friedman
 
Robotics and Coding - Technology Innovations for Learning and Teaching 2015
Robotics and Coding - Technology Innovations for Learning and Teaching 2015Robotics and Coding - Technology Innovations for Learning and Teaching 2015
Robotics and Coding - Technology Innovations for Learning and Teaching 2015
Hillary Rogers
 
Cegonsoft Final Year Projects In Coimbatore,
Cegonsoft Final Year Projects In Coimbatore,Cegonsoft Final Year Projects In Coimbatore,
Cegonsoft Final Year Projects In Coimbatore,
sathyancegon
 
An communication app for hearing impaired groups
An communication app for hearing impaired groupsAn communication app for hearing impaired groups
An communication app for hearing impaired groups
Vanessa Li
 
Voice To Text Presentation
Voice To Text PresentationVoice To Text Presentation
Voice To Text Presentation
shahinmehr
 
Team 3 status report#7
Team 3 status report#7Team 3 status report#7
Team 3 status report#7
Joaquim Jaime
 
ICT's role in Successful Studiies
ICT's role in Successful StudiiesICT's role in Successful Studiies
ICT's role in Successful Studiies
akinwunmi adelanwa
 

What's hot (20)

Virtual personal assistant
Virtual personal assistantVirtual personal assistant
Virtual personal assistant
 
JARVIS - The Digital Life Assistant
JARVIS - The Digital Life AssistantJARVIS - The Digital Life Assistant
JARVIS - The Digital Life Assistant
 
Evidence-aware Mobile Computational Offloading
Evidence-aware Mobile Computational OffloadingEvidence-aware Mobile Computational Offloading
Evidence-aware Mobile Computational Offloading
 
Mobile Code Offloading from Concept to Practice, and Beyond
Mobile Code Offloading from Concept to Practice, and BeyondMobile Code Offloading from Concept to Practice, and Beyond
Mobile Code Offloading from Concept to Practice, and Beyond
 
Modeling Mobile Code Acceleration in the Cloud
Modeling Mobile Code Acceleration in the CloudModeling Mobile Code Acceleration in the Cloud
Modeling Mobile Code Acceleration in the Cloud
 
5 pen pc ppt
5 pen pc ppt5 pen pc ppt
5 pen pc ppt
 
Adventures on the Road to Enterprise Virtual Assistants
Adventures on the Road to Enterprise Virtual AssistantsAdventures on the Road to Enterprise Virtual Assistants
Adventures on the Road to Enterprise Virtual Assistants
 
Robotics and Coding - Technology Innovations for Learning and Teaching 2015
Robotics and Coding - Technology Innovations for Learning and Teaching 2015Robotics and Coding - Technology Innovations for Learning and Teaching 2015
Robotics and Coding - Technology Innovations for Learning and Teaching 2015
 
Phonet
PhonetPhonet
Phonet
 
Cegonsoft Final Year Projects In Coimbatore,
Cegonsoft Final Year Projects In Coimbatore,Cegonsoft Final Year Projects In Coimbatore,
Cegonsoft Final Year Projects In Coimbatore,
 
An communication app for hearing impaired groups
An communication app for hearing impaired groupsAn communication app for hearing impaired groups
An communication app for hearing impaired groups
 
5 pen pc ppt2
5 pen pc ppt25 pen pc ppt2
5 pen pc ppt2
 
Presentationgroup
PresentationgroupPresentationgroup
Presentationgroup
 
Show me softwares
Show me softwaresShow me softwares
Show me softwares
 
Voice To Text Presentation
Voice To Text PresentationVoice To Text Presentation
Voice To Text Presentation
 
The Affordances Of Mobile Technologies
The Affordances Of Mobile TechnologiesThe Affordances Of Mobile Technologies
The Affordances Of Mobile Technologies
 
Team 3 status report#7
Team 3 status report#7Team 3 status report#7
Team 3 status report#7
 
Consumer Trends Michael Held Thales Innovation Retreat 2013
Consumer Trends Michael Held Thales Innovation Retreat 2013Consumer Trends Michael Held Thales Innovation Retreat 2013
Consumer Trends Michael Held Thales Innovation Retreat 2013
 
Seminar
SeminarSeminar
Seminar
 
ICT's role in Successful Studiies
ICT's role in Successful StudiiesICT's role in Successful Studiies
ICT's role in Successful Studiies
 

Similar to Speech Recognition

Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognition
Vinay Jaisriram
 
VIRTUAL PERSONAL ASSISTANT.pdf
VIRTUAL PERSONAL ASSISTANT.pdfVIRTUAL PERSONAL ASSISTANT.pdf
VIRTUAL PERSONAL ASSISTANT.pdf
AnkushSolanki6
 
Voice Command Mobile Phone Dialer
Voice Command Mobile Phone DialerVoice Command Mobile Phone Dialer
Voice Command Mobile Phone Dialer
ijtsrd
 

Similar to Speech Recognition (20)

10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognition
 
VIRTUAL PERSONAL ASSISTANT.pdf
VIRTUAL PERSONAL ASSISTANT.pdfVIRTUAL PERSONAL ASSISTANT.pdf
VIRTUAL PERSONAL ASSISTANT.pdf
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition system
 
Google Voice-to-text
Google Voice-to-textGoogle Voice-to-text
Google Voice-to-text
 
sample PPT.pptx
sample PPT.pptxsample PPT.pptx
sample PPT.pptx
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Speech recognizers & generators
Speech recognizers & generatorsSpeech recognizers & generators
Speech recognizers & generators
 
Let's talk about voice
Let's talk about voiceLet's talk about voice
Let's talk about voice
 
Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech Recognition
 
Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software
 
Presentation.ai
Presentation.aiPresentation.ai
Presentation.ai
 
How does speech recognition AI work.pdf
How does speech recognition AI work.pdfHow does speech recognition AI work.pdf
How does speech recognition AI work.pdf
 
Seminar
SeminarSeminar
Seminar
 
Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01
 
Jason Brenier's Presentation "Principles of Conversational Business" - Activa...
Jason Brenier's Presentation "Principles of Conversational Business" - Activa...Jason Brenier's Presentation "Principles of Conversational Business" - Activa...
Jason Brenier's Presentation "Principles of Conversational Business" - Activa...
 
Voice Command Mobile Phone Dialer
Voice Command Mobile Phone DialerVoice Command Mobile Phone Dialer
Voice Command Mobile Phone Dialer
 
Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phone
 
Desktop assistant
Desktop assistant Desktop assistant
Desktop assistant
 

More from Huda Seyam

More from Huda Seyam (14)

Blockchain Development Kit
Blockchain Development KitBlockchain Development Kit
Blockchain Development Kit
 
Traffic Sign Detection
Traffic Sign Detection Traffic Sign Detection
Traffic Sign Detection
 
Detect HTTP Brute Force attack using Snort IDS/IPS on PFSense Firewall
Detect HTTP Brute Force attack using Snort IDS/IPS on PFSense FirewallDetect HTTP Brute Force attack using Snort IDS/IPS on PFSense Firewall
Detect HTTP Brute Force attack using Snort IDS/IPS on PFSense Firewall
 
Snort Intrusion Detection / Prevention System on PFSense Firewall
Snort Intrusion Detection / Prevention System  on PFSense FirewallSnort Intrusion Detection / Prevention System  on PFSense Firewall
Snort Intrusion Detection / Prevention System on PFSense Firewall
 
Poisson Distribution
Poisson DistributionPoisson Distribution
Poisson Distribution
 
Docker
DockerDocker
Docker
 
WEP/WPA attacks
WEP/WPA attacksWEP/WPA attacks
WEP/WPA attacks
 
Security Policy
Security PolicySecurity Policy
Security Policy
 
Course registration system
Course registration systemCourse registration system
Course registration system
 
Network security situational awareness
Network security situational awarenessNetwork security situational awareness
Network security situational awareness
 
Wireless Site Survey
Wireless Site SurveyWireless Site Survey
Wireless Site Survey
 
Image compression
Image compressionImage compression
Image compression
 
Transport Layer Security
Transport Layer SecurityTransport Layer Security
Transport Layer Security
 
Software prototyping
Software prototyping  Software prototyping
Software prototyping
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Speech Recognition

  • 2. What is • Speech recognition is the process of capturing spoken words using a microphone or telephone and converting them into a digitally stored set of words • A subset of speech recognition is voice recognition, which is the technology for identifying a person based on their voice
  • 3. History • First known attempt at speech recognition • A toy from 1922 • Worked by analysing the signal strength at 500Hz
  • 4. • The most advanced version of currently developed ASR technologies revolves around what is called Natural Language Processing, or NLP in short. This variant of ASR comes the closest to allowing real conversation between people and machine intelligence and though it still has a long way to go before reaching an apex of development, we’re already seeing some remarkable results in the form of intelligent smart phone interfaces like the Siri program on the iPhone and other systems used in business and advanced technology contexts.
  • 5. Performance The quality of a speech recognition systems are assessed according to two factors: • its accuracy (error rate in converting spoken words to digital data) • speed (how well the software can keep up with a human speaker).
  • 6. Follow • Today, developments in natural language processing and neural network technology have improved the speech and voice technology, so much so that today it is reportedly on par with humans. • In 2017. For example, the word error rate for Microsoft’s voice technology has been recorded at 5.1 percent by the company, while Google reports that it has reduced its rate to 4.9 percent.
  • 7. A Basic Primer on How Automatic Speech Recognition Works
  • 8.
  • 9.
  • 10. Applications • speech to speech translation • Dictation [google document] • hands-free computing • Robotics • automated customer service, If you have ever paid a bill over the phone using an automated system • Smart Speaker Highlighting Amazon[Alexa]
  • 11.
  • 12. The Tuning Test: How ASR is made to “Learn” from Humans
  • 13. References • https://www.techemergence.com/ai-for- speech-recognition/ • https://usabilitygeek.com/automatic- speech-recognition-asr-software-an- introduction/ • https://www.streetdirectory.com/travel_gui de/139545/technology/key_differences_bet ween_speech_recognition_and_voice_recog nition.html • https://www.youtube.com/watch?v=q67z7P TGRi8