SlideShare a Scribd company logo
1 of 17
PERSONAL VOICE
ASSISTANT IN
PYTHON
1
Submitted By:
Vikas Kumar
Utkarsh sengar
Yash Jauhri
Yash Tripathi
Contents
 Introduction
 How does an intelligent personal assistant work?
 Existing system and proposed system
 Dataflow diagrams
 Automatic Speech Recognition System Model in Google AP
 Use case diagrams
 Modules
 screenshots
 Requirements
 Conclusion
2
Introduction
 A virtual assistant is a technology based on artificial intelligence. The
software uses a device’s microphone to receive voice requests while the
voice output takes place at the speaker. But the most exciting thing
happens between these two actions.
 It is a combination of several different technologies: voice recognition,
voice analysis and language processing.
 It is completely developed using one of the most powerful language
python.
3
How does an intelligent personal assistant
work?
 User asks a personal assistant to perform a task.
 The natural language audio signal is converted into digital data that can
be analyzed by the software.
 Compared with a database of the software using an innovative algorithm
to find a suitable answer.
 This database is located on distributed servers in cloud networks. For this
reason ,it must have a reliable Internet connection.
4
EXISTING SYSTEM V/S PROPOSED SYSTEM 5
Existing System Proposed System
Usage statistics and user data are sent to
the developer
Does not collect any user data
Installation required Installation required
User cannot edit or change the modules User can edit and add new modules
Not a free Software Free Software
lightweight
Simple User interface
Dataflow Diagrams 6
User
Personal
voice
assistant
Computer
Text to speech
Speech to
Text
Voice Action
Response
Voice output
DFD - 0
User Voice to text
Action perform
Database
Personal voice
assistant
Computer
Voice Command Perform action
DFD - 1
7
User This will convert
voice into binary
Microphone
Computer
This will convert
voice data into text
form
Google voice API
Flow sensor
value
Voice audio
data
Perform action
DFD - 2
8
Automatic Speech Recognition System Model in
Google API
9
Feature Extraction Decoder
Acoustic
models
Pronunciation
Dictionary
Language
Models
Speech signal Recognized words
Feature Extraction
 Feature Extraction is a common to extract a set of features from speech
signal.
 Classification of Feature extraction is carried out on the set of features
instead of the speech signals themselves.
 The feature extraction stage seeks to provide a compact representation of
the speech waveform. This form should minimize the loss of information
that discriminates between words, and provide a good match with the
distributional assumptions made by the acoustic models.
10
 ACOUSTIC MODELS
• Acoustic model is a relationship between audio signal and phoneme
• Phoneme means one of the smallest unit of speech that make one word different from another word
 PRONUNCIATION DICTIONARY
• The act or result of producing the sounds of speech, including articulation, stress, and intonation
• A phonetic transcription of a given word, sound, etc.
• An accepted standard of the sound and stress patterns of a word, phrase, etc.
 LANGUAGE MODELS
• The language model provides context to distinguish between words and phrases that sound similar.
for example, In American English the phrases “recognize speach” and “wreck a nice beach” sound
similar , but mean different things.
11
Use case diagrams 12
Input
voice
Sent
mail
Turn
on/off
Wi Fi
Wikipedia
Read
search
User
User Microphone Google API
Computer
Open Personal Assistant Accessing G-API
Voice response
Start Mic
Wait until user speak
Receive data
Convert audio to text
Match text with action
Perform action
Voice / Text
Response
13
 MODULES
 Speech recognition
 Process and system utilities ( psutil )
 PlaySound
 SMTP Protocol client ( smtplib )
 Google Text To Speech ( gtts )
14
Requirements
 Software requirements
 Pycharm IDE/visual studio code
 Inno Setup Compiler
 Pyinstaller
 Python 3.8.2 and its Sub modules
 Hardware requirements
 Intel core i3
 4gb RAM
 30 Gb Hard drive space
15
Conclusion
Voice Controlled Personal Assistant System will use the Natural language
processing and can be integrated with artificial intelligence techniques to
achieve a smart assistant that can control the computer and applications and
even solve user queries using web searches.. It can be designed to minimize
the human efforts to interact with many other subsystems, which would
otherwise have to be performed manually. By achieving this, the system will
make human life comfortable
16
THANK YOU
17

More Related Content

Similar to Personal Voice Assistant using python.pptx

A Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningA Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningEmily Smith
 
IRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for BlindsIRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for BlindsIRJET Journal
 
IRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET Journal
 
Virtual Personal Assistant
Virtual Personal AssistantVirtual Personal Assistant
Virtual Personal AssistantIRJET Journal
 
Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Jame Williamson
 
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...ijtsrd
 
Cloud-Native Roadshow Google Cloud Platform - Los Angeles
Cloud-Native Roadshow Google Cloud Platform - Los AngelesCloud-Native Roadshow Google Cloud Platform - Los Angeles
Cloud-Native Roadshow Google Cloud Platform - Los AngelesVMware Tanzu
 
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”IRJET Journal
 
Google Cloud Platform
Google Cloud PlatformGoogle Cloud Platform
Google Cloud PlatformVMware Tanzu
 
A Literature Survey On Voice Assistance
A Literature Survey On Voice AssistanceA Literature Survey On Voice Assistance
A Literature Survey On Voice AssistanceWendy Hager
 
VOCAL- Voice Command Application using Artificial Intelligence
VOCAL- Voice Command Application using Artificial IntelligenceVOCAL- Voice Command Application using Artificial Intelligence
VOCAL- Voice Command Application using Artificial IntelligenceIRJET Journal
 
Cloud-Native Roadshow - Google - St. Louis
Cloud-Native Roadshow - Google - St. LouisCloud-Native Roadshow - Google - St. Louis
Cloud-Native Roadshow - Google - St. LouisVMware Tanzu
 
Cloud-Native Roadshow – Google – Amsterdam
Cloud-Native Roadshow – Google – Amsterdam Cloud-Native Roadshow – Google – Amsterdam
Cloud-Native Roadshow – Google – Amsterdam VMware Tanzu
 
Cloud-Native Roadshow Google - Charlotte
Cloud-Native Roadshow Google - CharlotteCloud-Native Roadshow Google - Charlotte
Cloud-Native Roadshow Google - CharlotteVMware Tanzu
 
Cloud-Native Roadshow - Google - Toronto
Cloud-Native Roadshow - Google - TorontoCloud-Native Roadshow - Google - Toronto
Cloud-Native Roadshow - Google - TorontoVMware Tanzu
 
Cloud-Native Roadshow - Google - London
Cloud-Native Roadshow - Google - LondonCloud-Native Roadshow - Google - London
Cloud-Native Roadshow - Google - LondonVMware Tanzu
 

Similar to Personal Voice Assistant using python.pptx (20)

A Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine LearningA Voice Based Assistant Using Google Dialogflow And Machine Learning
A Voice Based Assistant Using Google Dialogflow And Machine Learning
 
IRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for BlindsIRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for Blinds
 
IRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech RecognitionIRJET- Voice to Code Editor using Speech Recognition
IRJET- Voice to Code Editor using Speech Recognition
 
Virtual Personal Assistant
Virtual Personal AssistantVirtual Personal Assistant
Virtual Personal Assistant
 
Voice Assistant.pptx
Voice Assistant.pptxVoice Assistant.pptx
Voice Assistant.pptx
 
Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software
 
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
Advanced Virtual Assistant Based on Speech Processing Oriented Technology on ...
 
Cloud-Native Roadshow Google Cloud Platform - Los Angeles
Cloud-Native Roadshow Google Cloud Platform - Los AngelesCloud-Native Roadshow Google Cloud Platform - Los Angeles
Cloud-Native Roadshow Google Cloud Platform - Los Angeles
 
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”
 
NEURAL NETWORK BOT
NEURAL NETWORK BOTNEURAL NETWORK BOT
NEURAL NETWORK BOT
 
Intelligent ChatBot
Intelligent ChatBotIntelligent ChatBot
Intelligent ChatBot
 
Google Cloud Platform
Google Cloud PlatformGoogle Cloud Platform
Google Cloud Platform
 
Google Voice-to-text
Google Voice-to-textGoogle Voice-to-text
Google Voice-to-text
 
A Literature Survey On Voice Assistance
A Literature Survey On Voice AssistanceA Literature Survey On Voice Assistance
A Literature Survey On Voice Assistance
 
VOCAL- Voice Command Application using Artificial Intelligence
VOCAL- Voice Command Application using Artificial IntelligenceVOCAL- Voice Command Application using Artificial Intelligence
VOCAL- Voice Command Application using Artificial Intelligence
 
Cloud-Native Roadshow - Google - St. Louis
Cloud-Native Roadshow - Google - St. LouisCloud-Native Roadshow - Google - St. Louis
Cloud-Native Roadshow - Google - St. Louis
 
Cloud-Native Roadshow – Google – Amsterdam
Cloud-Native Roadshow – Google – Amsterdam Cloud-Native Roadshow – Google – Amsterdam
Cloud-Native Roadshow – Google – Amsterdam
 
Cloud-Native Roadshow Google - Charlotte
Cloud-Native Roadshow Google - CharlotteCloud-Native Roadshow Google - Charlotte
Cloud-Native Roadshow Google - Charlotte
 
Cloud-Native Roadshow - Google - Toronto
Cloud-Native Roadshow - Google - TorontoCloud-Native Roadshow - Google - Toronto
Cloud-Native Roadshow - Google - Toronto
 
Cloud-Native Roadshow - Google - London
Cloud-Native Roadshow - Google - LondonCloud-Native Roadshow - Google - London
Cloud-Native Roadshow - Google - London
 

Recently uploaded

Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...Call Girls in Nagpur High Profile
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAbhinavSharma374939
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 

Recently uploaded (20)

Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
High Profile Call Girls Nashik Megha 7001305949 Independent Escort Service Na...
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog Converter
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 

Personal Voice Assistant using python.pptx

  • 1. PERSONAL VOICE ASSISTANT IN PYTHON 1 Submitted By: Vikas Kumar Utkarsh sengar Yash Jauhri Yash Tripathi
  • 2. Contents  Introduction  How does an intelligent personal assistant work?  Existing system and proposed system  Dataflow diagrams  Automatic Speech Recognition System Model in Google AP  Use case diagrams  Modules  screenshots  Requirements  Conclusion 2
  • 3. Introduction  A virtual assistant is a technology based on artificial intelligence. The software uses a device’s microphone to receive voice requests while the voice output takes place at the speaker. But the most exciting thing happens between these two actions.  It is a combination of several different technologies: voice recognition, voice analysis and language processing.  It is completely developed using one of the most powerful language python. 3
  • 4. How does an intelligent personal assistant work?  User asks a personal assistant to perform a task.  The natural language audio signal is converted into digital data that can be analyzed by the software.  Compared with a database of the software using an innovative algorithm to find a suitable answer.  This database is located on distributed servers in cloud networks. For this reason ,it must have a reliable Internet connection. 4
  • 5. EXISTING SYSTEM V/S PROPOSED SYSTEM 5 Existing System Proposed System Usage statistics and user data are sent to the developer Does not collect any user data Installation required Installation required User cannot edit or change the modules User can edit and add new modules Not a free Software Free Software lightweight Simple User interface
  • 6. Dataflow Diagrams 6 User Personal voice assistant Computer Text to speech Speech to Text Voice Action Response Voice output DFD - 0
  • 7. User Voice to text Action perform Database Personal voice assistant Computer Voice Command Perform action DFD - 1 7
  • 8. User This will convert voice into binary Microphone Computer This will convert voice data into text form Google voice API Flow sensor value Voice audio data Perform action DFD - 2 8
  • 9. Automatic Speech Recognition System Model in Google API 9 Feature Extraction Decoder Acoustic models Pronunciation Dictionary Language Models Speech signal Recognized words
  • 10. Feature Extraction  Feature Extraction is a common to extract a set of features from speech signal.  Classification of Feature extraction is carried out on the set of features instead of the speech signals themselves.  The feature extraction stage seeks to provide a compact representation of the speech waveform. This form should minimize the loss of information that discriminates between words, and provide a good match with the distributional assumptions made by the acoustic models. 10
  • 11.  ACOUSTIC MODELS • Acoustic model is a relationship between audio signal and phoneme • Phoneme means one of the smallest unit of speech that make one word different from another word  PRONUNCIATION DICTIONARY • The act or result of producing the sounds of speech, including articulation, stress, and intonation • A phonetic transcription of a given word, sound, etc. • An accepted standard of the sound and stress patterns of a word, phrase, etc.  LANGUAGE MODELS • The language model provides context to distinguish between words and phrases that sound similar. for example, In American English the phrases “recognize speach” and “wreck a nice beach” sound similar , but mean different things. 11
  • 12. Use case diagrams 12 Input voice Sent mail Turn on/off Wi Fi Wikipedia Read search User
  • 13. User Microphone Google API Computer Open Personal Assistant Accessing G-API Voice response Start Mic Wait until user speak Receive data Convert audio to text Match text with action Perform action Voice / Text Response 13
  • 14.  MODULES  Speech recognition  Process and system utilities ( psutil )  PlaySound  SMTP Protocol client ( smtplib )  Google Text To Speech ( gtts ) 14
  • 15. Requirements  Software requirements  Pycharm IDE/visual studio code  Inno Setup Compiler  Pyinstaller  Python 3.8.2 and its Sub modules  Hardware requirements  Intel core i3  4gb RAM  30 Gb Hard drive space 15
  • 16. Conclusion Voice Controlled Personal Assistant System will use the Natural language processing and can be integrated with artificial intelligence techniques to achieve a smart assistant that can control the computer and applications and even solve user queries using web searches.. It can be designed to minimize the human efforts to interact with many other subsystems, which would otherwise have to be performed manually. By achieving this, the system will make human life comfortable 16