SlideShare a Scribd company logo
1 of 23
Submitted to :- Neelesh Shrivastava Submitted by :- Deependra Kumar Kori
OUTLINES :-
 INTRODUCTION
HISTORY OF VOICE RECOGNITION
HOW VRS WORK ?
VOICE RECOGNITION SYSTEM MODEL
 SOFTWARES
 GOOGLE ASSISTANCE
INTRODUCTION :-
 Voice recognition is a computer software program or hardware device
with the ability to decode the human voice.
 Voice recognition is also known as speaker recognition.
 Voice recognition evaluates the voice biometrics of an individual, such
as the frequency and flow of their voice and their natural accent.
 Voice recognition is commonly used to operate a device, perform
commands, or write without having to use a keyboard, mouse, or press any
buttons.
HISTORY OF VOICE
RECOGNITION :-
…CONTINUE
When we speak, we create vibrations in the air. The analog-to-digital
converter (ADC) translates this analog wave into digital data that the
computer can understand. To do this, it samples, normalizes the sound by
taking precise measurements of the wave at frequent intervals.
…CONTINUE
 The signal is divided into small segments as short as a few hundredths of
a second, or even thousandths .
 The program then matches these segments to known phonemes in the
appropriate language.
 There are roughly 40 phonemes in the English language ,while other
languages have more or fewer phonemes.
 The program examines phonemes in the context of the other phonemes
around them and compares them to a large library of known words, phrases
and sentences.
 The program then determines what the user was probably saying and
either outputs it as text or issues a computer command.
VOICE RECOGNITION SYSTEM
MODEL:-
John Garofolo has proposed Hidden Markov
Model.
...CONTINUE
The smallest element of any language is said to be a phoneme.
According to this model, a phoneme is treated as a link in a chain, and
the completed chain represents a word.
To determine the next phoneme, the chain forms branches of different
sounds that can come next, a probability score is given to each branched
off phoneme based on the built in dictionary.
Thus, the complete word is finally determined.
SOFTWARES
:-
WHAT IS GOOGLE ASSISTANT:-
 Google Assistant is a virtual personal assistant developed by google
and announced at its developer conference in May 2016.
 Google Assistant is an upgrade of Google Now, which can engage in
two-way conversation.
Google
now
WHERE YOU WILL FIND GOOGLE
ASSISTANT:-
GOOGLE ASSISTANT INITIALLY DEBUTED AS PART OF:
 GOOGLE’S MESSAGING APP “ALLO” FOR ANDROID AND
IPHONE.
 ITS VOICE-ACTIVATED SPEAKER “GOOGLE HOME”.
 AND INITIALLY IN GOOGLE’S “PIXEL , PIXEL XL” SMART
PHONES.
WHAT GOOGLE ASSISTANT CAN DO?
 IT IS ABLE TO SEARCH THE INTERNET, SCHEDULE EVENTS
AND ALARMS, ADJUST HARDWARE SETTINGS ON THE
USER'S DEVICE, AND SHOW INFORMATION FROM THE
USER'S GOOGLE ACCOUNT.
 GOOGLE HAS ALSO ANNOUNCED THAT THE ASSISTANT
WILL BE ABLE TO IDENTIFY OBJECTS AND GATHER VISUAL
INFORMATION THROUGH THE DEVICE'S CAMERA, AND
SUPPORT PURCHASING PRODUCTS AND SENDING MONEY.
…CONTINUE:-
 YOU CAN ALSO USE GOOGLE ASSISTANT TO CONTROL
MOST OF THE PHONE SETTINGS FOUND IN THE
NOTIFICATION SHADE. LIKE TO OPEN WI-FI , TORCH , TO
CONTROL BRIGHTNESS ETC.
DEVELOPER SUPPORT FOR GOOGLE
ASSISTANT:-
IN DECEMBER 2016, GOOGLE LAUNCHED A DEVELOPER PLATFORM
FOR THE GOOGLE ASSISTANT CALLED "ACTIONS ON GOOGLE“.
 IN MARCH 2017, GOOGLE ADDED NEW TOOLS FOR DEVELOPING
ON “ACTIONS ON GOOGLE” TO SUPPORT THE CREATION OF
GAMES FOR THE GOOGLE ASSISTANT.
 IN APRIL 2017, A SOFTWARE DEVELOPMENT KIT (SDK) WAS
RELEASED, ALLOWING THIRD-PARTY DEVELOPERS TO BUILD THEIR
OWN HARDWARE THAT CAN RUN THE GOOGLE ASSISTANT. IT HAS
BEEN INTEGRATED INTO ”RASPBERRY PI”.
EMBED THE GOOGLE ASSISTANT IN ANY
DEVICE:-
A DEVELOPER PREVIEW OF THE SDK IS AVAILABLE TODAY FOR
EVERYONE INTERESTED IN WORKING WITH PLATFORMS SUCH AS
THE RASPBERRY PI 3.
 THE SDK PROVIDES TWO OPTIONS FOR INTEGRATING THE
ASSISTANT INTO YOUR DEVICE:
1. THE GOOGLE ASSISTANT LIBRARY.
2. THE GOOGLE ASSISTANT GRPC API.
…CONTINUE:-
THE LIBRARY OF GOOGLE ASSISTANT IS WRITTEN IN PYTHON AND
IT IS SUPPORTED ON THE DEVICES WITH LINUX-ARMV7L AND LINUX-
X86_64 ARCHITECTURES (LIKE THE RASPBERRY PI 3 B AND UBUNTU
DESKTOPS).
 THE GRPC API IS FOR OTHER PLATFORMS AND GENERATE
BINDINGS FOR LANGUAGES LIKE GO, JAVA (INCLUDING SUPPORT
FOR ANDROID THINGS), C#, NODE.JS, AND RUBY TO GIVE YOU THE
FLEXIBILITY YOU NEED.
GOOGLE ASSISTANT INTEGRATION:-
GOOGLE ASSISTANT CAN BE INTEGRATED IN MANY FORMS:
 YOU CAN MAKE YOUR OWN PROTOTYPE DEVICE WITH GOOGLE
ASSISTANT BUILT IN, LIKE “ROBOTIC MOCKTAILS MIXER” ETC. USING
“THE GOOGLE ASSISTANT LIBRARY” AND “RASPBERRY PI EMBED
SYSTEM”.
YOU CAN ALSO MAKE YOUR OWN ASSISTANT APP USING “API.AI”
AND “ACTIONS ON GOOGLE”, WITH WHICH YOU CAN INTERACT VIA
IN-BUILT GOOGLE ASSISTANT.
THANK YOU

More Related Content

What's hot (8)

Actions On Google - How? Why?
Actions On Google - How? Why?Actions On Google - How? Why?
Actions On Google - How? Why?
 
Google home
Google homeGoogle home
Google home
 
Flashh
FlashhFlashh
Flashh
 
Qualcomm AllPlay Overview (Brief)
Qualcomm AllPlay Overview (Brief)Qualcomm AllPlay Overview (Brief)
Qualcomm AllPlay Overview (Brief)
 
An Overview of Google Assistant
An Overview of Google Assistant An Overview of Google Assistant
An Overview of Google Assistant
 
Top 7 Gadgets for Smartphone
Top 7 Gadgets for SmartphoneTop 7 Gadgets for Smartphone
Top 7 Gadgets for Smartphone
 
Google Home
Google HomeGoogle Home
Google Home
 
Resources and Equipment
Resources and Equipment Resources and Equipment
Resources and Equipment
 

Similar to Voice recognition system

Personal Voice Assistant using python.pptx
Personal Voice Assistant using python.pptxPersonal Voice Assistant using python.pptx
Personal Voice Assistant using python.pptxyjrtytyuu
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech RecognitionThejus Joby
 
virtual-assistant-160214154006.pdf
virtual-assistant-160214154006.pdfvirtual-assistant-160214154006.pdf
virtual-assistant-160214154006.pdfHarshKumar534677
 
Virtual personal assistant
Virtual personal assistantVirtual personal assistant
Virtual personal assistantShubham Bhalekar
 
VOICE-ASSISTANT-IN-PYTHON-pptx.pptx
VOICE-ASSISTANT-IN-PYTHON-pptx.pptxVOICE-ASSISTANT-IN-PYTHON-pptx.pptx
VOICE-ASSISTANT-IN-PYTHON-pptx.pptxITB450RUTIKASALUNKHE
 
Google Launch Event
Google Launch Event Google Launch Event
Google Launch Event Jeremy Olken
 
Google Launch Event
Google Launch EventGoogle Launch Event
Google Launch EventJeremy Olken
 
PERSONAL VOICE ASSISTANT - Copy.pptx
PERSONAL VOICE ASSISTANT - Copy.pptxPERSONAL VOICE ASSISTANT - Copy.pptx
PERSONAL VOICE ASSISTANT - Copy.pptxZubairSaifi19
 
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...ijtsrd
 
Speech recognizers & generators
Speech recognizers & generatorsSpeech recognizers & generators
Speech recognizers & generatorsPaul Kahoro
 
Linux Sucks
Linux SucksLinux Sucks
Linux Suckspapygeek
 
How to Create a Voice-Assistant App Like Alexa.pdf
How to Create a Voice-Assistant App Like Alexa.pdfHow to Create a Voice-Assistant App Like Alexa.pdf
How to Create a Voice-Assistant App Like Alexa.pdfgirijalakshmi2
 
Google pixel buds
Google pixel budsGoogle pixel buds
Google pixel budsSivadevsiva
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 
How to build virtual assistant like Jarvis (in Ironman) with Google Assistant...
How to build virtual assistant like Jarvis (in Ironman) with Google Assistant...How to build virtual assistant like Jarvis (in Ironman) with Google Assistant...
How to build virtual assistant like Jarvis (in Ironman) with Google Assistant...Tu Le Dinh
 
Livescribe FAQ - Q
Livescribe FAQ - QLivescribe FAQ - Q
Livescribe FAQ - Qbutest
 
Human Attention (HA) for an AI augmented world #DevFest19
Human Attention (HA) for an AI augmented world #DevFest19Human Attention (HA) for an AI augmented world #DevFest19
Human Attention (HA) for an AI augmented world #DevFest19Alexandra Petruș
 
Smart Speaker PPT G2.pptx
Smart Speaker PPT G2.pptxSmart Speaker PPT G2.pptx
Smart Speaker PPT G2.pptxjollibeech
 
Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Jame Williamson
 

Similar to Voice recognition system (20)

Personal Voice Assistant using python.pptx
Personal Voice Assistant using python.pptxPersonal Voice Assistant using python.pptx
Personal Voice Assistant using python.pptx
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
virtual-assistant-160214154006.pdf
virtual-assistant-160214154006.pdfvirtual-assistant-160214154006.pdf
virtual-assistant-160214154006.pdf
 
Virtual personal assistant
Virtual personal assistantVirtual personal assistant
Virtual personal assistant
 
VOICE-ASSISTANT-IN-PYTHON-pptx.pptx
VOICE-ASSISTANT-IN-PYTHON-pptx.pptxVOICE-ASSISTANT-IN-PYTHON-pptx.pptx
VOICE-ASSISTANT-IN-PYTHON-pptx.pptx
 
Google Launch Event
Google Launch Event Google Launch Event
Google Launch Event
 
Google Launch Event
Google Launch EventGoogle Launch Event
Google Launch Event
 
PERSONAL VOICE ASSISTANT - Copy.pptx
PERSONAL VOICE ASSISTANT - Copy.pptxPERSONAL VOICE ASSISTANT - Copy.pptx
PERSONAL VOICE ASSISTANT - Copy.pptx
 
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
 
Speech recognizers & generators
Speech recognizers & generatorsSpeech recognizers & generators
Speech recognizers & generators
 
Linux Sucks
Linux SucksLinux Sucks
Linux Sucks
 
Canvas real speaker
Canvas real speakerCanvas real speaker
Canvas real speaker
 
How to Create a Voice-Assistant App Like Alexa.pdf
How to Create a Voice-Assistant App Like Alexa.pdfHow to Create a Voice-Assistant App Like Alexa.pdf
How to Create a Voice-Assistant App Like Alexa.pdf
 
Google pixel buds
Google pixel budsGoogle pixel buds
Google pixel buds
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
How to build virtual assistant like Jarvis (in Ironman) with Google Assistant...
How to build virtual assistant like Jarvis (in Ironman) with Google Assistant...How to build virtual assistant like Jarvis (in Ironman) with Google Assistant...
How to build virtual assistant like Jarvis (in Ironman) with Google Assistant...
 
Livescribe FAQ - Q
Livescribe FAQ - QLivescribe FAQ - Q
Livescribe FAQ - Q
 
Human Attention (HA) for an AI augmented world #DevFest19
Human Attention (HA) for an AI augmented world #DevFest19Human Attention (HA) for an AI augmented world #DevFest19
Human Attention (HA) for an AI augmented world #DevFest19
 
Smart Speaker PPT G2.pptx
Smart Speaker PPT G2.pptxSmart Speaker PPT G2.pptx
Smart Speaker PPT G2.pptx
 
Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software
 

Recently uploaded

Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTopCSSGallery
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxFIDO Alliance
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governanceWSO2
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptxFIDO Alliance
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringWSO2
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxFIDO Alliance
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformWSO2
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityVictorSzoltysek
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuidePixlogix Infotech
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxMarkSteadman7
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 

Recently uploaded (20)

Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 

Voice recognition system

  • 1. Submitted to :- Neelesh Shrivastava Submitted by :- Deependra Kumar Kori
  • 2.
  • 3. OUTLINES :-  INTRODUCTION HISTORY OF VOICE RECOGNITION HOW VRS WORK ? VOICE RECOGNITION SYSTEM MODEL  SOFTWARES  GOOGLE ASSISTANCE
  • 4. INTRODUCTION :-  Voice recognition is a computer software program or hardware device with the ability to decode the human voice.  Voice recognition is also known as speaker recognition.  Voice recognition evaluates the voice biometrics of an individual, such as the frequency and flow of their voice and their natural accent.  Voice recognition is commonly used to operate a device, perform commands, or write without having to use a keyboard, mouse, or press any buttons.
  • 6.
  • 7.
  • 8. …CONTINUE When we speak, we create vibrations in the air. The analog-to-digital converter (ADC) translates this analog wave into digital data that the computer can understand. To do this, it samples, normalizes the sound by taking precise measurements of the wave at frequent intervals.
  • 9. …CONTINUE  The signal is divided into small segments as short as a few hundredths of a second, or even thousandths .  The program then matches these segments to known phonemes in the appropriate language.  There are roughly 40 phonemes in the English language ,while other languages have more or fewer phonemes.  The program examines phonemes in the context of the other phonemes around them and compares them to a large library of known words, phrases and sentences.  The program then determines what the user was probably saying and either outputs it as text or issues a computer command.
  • 10. VOICE RECOGNITION SYSTEM MODEL:- John Garofolo has proposed Hidden Markov Model.
  • 11. ...CONTINUE The smallest element of any language is said to be a phoneme. According to this model, a phoneme is treated as a link in a chain, and the completed chain represents a word. To determine the next phoneme, the chain forms branches of different sounds that can come next, a probability score is given to each branched off phoneme based on the built in dictionary. Thus, the complete word is finally determined.
  • 12.
  • 14.
  • 15. WHAT IS GOOGLE ASSISTANT:-  Google Assistant is a virtual personal assistant developed by google and announced at its developer conference in May 2016.  Google Assistant is an upgrade of Google Now, which can engage in two-way conversation. Google now
  • 16. WHERE YOU WILL FIND GOOGLE ASSISTANT:- GOOGLE ASSISTANT INITIALLY DEBUTED AS PART OF:  GOOGLE’S MESSAGING APP “ALLO” FOR ANDROID AND IPHONE.  ITS VOICE-ACTIVATED SPEAKER “GOOGLE HOME”.  AND INITIALLY IN GOOGLE’S “PIXEL , PIXEL XL” SMART PHONES.
  • 17. WHAT GOOGLE ASSISTANT CAN DO?  IT IS ABLE TO SEARCH THE INTERNET, SCHEDULE EVENTS AND ALARMS, ADJUST HARDWARE SETTINGS ON THE USER'S DEVICE, AND SHOW INFORMATION FROM THE USER'S GOOGLE ACCOUNT.  GOOGLE HAS ALSO ANNOUNCED THAT THE ASSISTANT WILL BE ABLE TO IDENTIFY OBJECTS AND GATHER VISUAL INFORMATION THROUGH THE DEVICE'S CAMERA, AND SUPPORT PURCHASING PRODUCTS AND SENDING MONEY.
  • 18. …CONTINUE:-  YOU CAN ALSO USE GOOGLE ASSISTANT TO CONTROL MOST OF THE PHONE SETTINGS FOUND IN THE NOTIFICATION SHADE. LIKE TO OPEN WI-FI , TORCH , TO CONTROL BRIGHTNESS ETC.
  • 19. DEVELOPER SUPPORT FOR GOOGLE ASSISTANT:- IN DECEMBER 2016, GOOGLE LAUNCHED A DEVELOPER PLATFORM FOR THE GOOGLE ASSISTANT CALLED "ACTIONS ON GOOGLE“.  IN MARCH 2017, GOOGLE ADDED NEW TOOLS FOR DEVELOPING ON “ACTIONS ON GOOGLE” TO SUPPORT THE CREATION OF GAMES FOR THE GOOGLE ASSISTANT.  IN APRIL 2017, A SOFTWARE DEVELOPMENT KIT (SDK) WAS RELEASED, ALLOWING THIRD-PARTY DEVELOPERS TO BUILD THEIR OWN HARDWARE THAT CAN RUN THE GOOGLE ASSISTANT. IT HAS BEEN INTEGRATED INTO ”RASPBERRY PI”.
  • 20. EMBED THE GOOGLE ASSISTANT IN ANY DEVICE:- A DEVELOPER PREVIEW OF THE SDK IS AVAILABLE TODAY FOR EVERYONE INTERESTED IN WORKING WITH PLATFORMS SUCH AS THE RASPBERRY PI 3.  THE SDK PROVIDES TWO OPTIONS FOR INTEGRATING THE ASSISTANT INTO YOUR DEVICE: 1. THE GOOGLE ASSISTANT LIBRARY. 2. THE GOOGLE ASSISTANT GRPC API.
  • 21. …CONTINUE:- THE LIBRARY OF GOOGLE ASSISTANT IS WRITTEN IN PYTHON AND IT IS SUPPORTED ON THE DEVICES WITH LINUX-ARMV7L AND LINUX- X86_64 ARCHITECTURES (LIKE THE RASPBERRY PI 3 B AND UBUNTU DESKTOPS).  THE GRPC API IS FOR OTHER PLATFORMS AND GENERATE BINDINGS FOR LANGUAGES LIKE GO, JAVA (INCLUDING SUPPORT FOR ANDROID THINGS), C#, NODE.JS, AND RUBY TO GIVE YOU THE FLEXIBILITY YOU NEED.
  • 22. GOOGLE ASSISTANT INTEGRATION:- GOOGLE ASSISTANT CAN BE INTEGRATED IN MANY FORMS:  YOU CAN MAKE YOUR OWN PROTOTYPE DEVICE WITH GOOGLE ASSISTANT BUILT IN, LIKE “ROBOTIC MOCKTAILS MIXER” ETC. USING “THE GOOGLE ASSISTANT LIBRARY” AND “RASPBERRY PI EMBED SYSTEM”. YOU CAN ALSO MAKE YOUR OWN ASSISTANT APP USING “API.AI” AND “ACTIONS ON GOOGLE”, WITH WHICH YOU CAN INTERACT VIA IN-BUILT GOOGLE ASSISTANT.