Speech Recognition techniques for Home Automation

•

5 likes•868 views

In this presentation, Santosh Raj has built a novel speech recognition system that can be used to control home appliances. This could very well be integrated with various IoT devices.

Technology

Biometric/Speaker
Identification Based on Voice.
PRESENTED BY:
Project guide: SANTOSH RAJ
Ms. Swati Prasad

Speaker identification is the process of automatically
recognizing who is speaking based on unique characteristics
contained in speech signal. This technique makes it
possible to use the speaker's voice and the spoken word to
verify their identity and control access.

Speaker Model DatabaseTest Speech
Whose voice is it?
Identification system

1. Easier way to control home
appliances(controlling appliances using voice
command instead of remote of switches is
much convinient ).
2. Problem of forgetting password will be
resolved using aforementioned system(old age
people generally forgetting their password so
this should be useful tool for them).

1950s and 1960s- Bell Laborotories
designed in 1952 the Audrey system which
recognized digits spoken by the single voice.
1970s- Carnegie Mellon’s “Harpy Speech
understanding system” which can understand
1011 words.
1980s- speech recognition turns towards
prediction.
2013- Apple designed SIRI voice recognition
system which turns to a benchmark till today.

1. It must be noise free.
2. Voiced signal should only be considered for
further processing.
3. Voiced signal will be distinguised with unvoiced
signal through threshold amplitude.

Linear prediction coding(LPC)- It is the feature
extraction technique and used representation of a
speech signal which will be generating low data
rate discreet signal.
• Voiced/unvoiced(1 byte)
• Pitch(6 byte)
• Voiced signal amplitude(11 byte)
• Unvoiced amplitude(5byte)

GAUSSIAN MIXTURE MODELS (GMM)
• A Gaussian Mixture Model (GMM) is a parametric probability
density function represented as a weighted sum of Gaussian
component densities.
• GMMs are commonly used as a parametric model of the
probability distribution of continuous measurements or
features in a biometric system, such as vocal-tract related
spectral features in a speaker recognition system

1.Features extraction from the unknown voice
2.Calculate gmm parameter
3.Compare this with the existing model
4.Based on matching percentage identify the speaker

 we were sucessful in assembling the blocks in
our project.
 A basic speech recognition system which
recognises a digit from 0 to 9 was thus
developed.
 The system was trained in an environment with
minimum ambient noise.
 The system gives the most accurate results
when implemented in the environment where it
was trained.

 We have taken samples of 50 speakers
uttering digits 0-9 each 20 times from the
reliable database.
 The unknown voice is tested succesfully with
our existing model.
 The identification rate is approx. 90%.

 We intend to design a home automation
system which will be beased on the command
prompted by the authorised speaker. This will
enable us to control home appliances more
conveniently.

REFERENCES
[1] X.Huang, A. Acero, and H.-W. Hon, “Spoken Language Processing:
A Guide to Theory, Algorithm
and System Development”. Prentice Hall PTR May 2001
[2] Matthew Nicholas Stuttle, “A Gaussian Mixture Model Spectral
Representation for Speech
Recognition”. Hughes Hall and Cambridge University Engineering
Department. July 2003

Speech Recognition techniques for Home Automation

What's hot

Speech recognition techniquessonukumar142

Speech recognition-using-wavelet-transformvidhateswapnil

Speech recognition challengesAlexandru Chica

Speech Recognition TechnologySrijanKumar18

AUTOMATIC SPEECH RECOGNITION- A SURVEYIJCERT

Voice recognitionKenneth Carnesi, JD

Ece speech-recognition-reportAnakali Mahesh

A seminar report on speech recognition technologySrijanKumar18

Automatic speech recognition systemAlok Tiwari

Automatic Speech RecognionInternational Islamic University

TEXT-SPEECH PPT.pptxNsaroj kumar

Group 2 -innovation in smartphones-Fuyi Pan

Voice input and speech recognition system in tourism/social mediacidroypaes

Speech recognition final presentationhimanshubhatti

Speech RecognitionAhmed Moawad

Noise Adaptive Training for Robust Automatic Speech Recognitionأحلام انصارى

Automatic Speech RecognitionInternational Islamic University

Voice recognition systemavinash raibole

Speech Recognition: Transcription and transformation of human speechSubmissionResearchpa

Artificial Intelligence for Speech RecognitionRHIMRJ Journal

What's hot (20)

Speech recognition techniques

Speech recognition-using-wavelet-transform

Speech recognition challenges

Speech Recognition Technology

AUTOMATIC SPEECH RECOGNITION- A SURVEY

Voice recognition

Ece speech-recognition-report

A seminar report on speech recognition technology

Automatic speech recognition system

Automatic Speech Recognion

TEXT-SPEECH PPT.pptx

Group 2 -innovation in smartphones-

Voice input and speech recognition system in tourism/social media

Speech recognition final presentation

Speech Recognition

Noise Adaptive Training for Robust Automatic Speech Recognition

Automatic Speech Recognition

Voice recognition system

Speech Recognition: Transcription and transformation of human speech

Artificial Intelligence for Speech Recognition

Viewers also liked

CONTROLLING HOME APPLIANCES USING REMOTE(1)Ambar Gupta

Controlling Home Appliances Using VoiceEdgefxkits & Solutions

Thesis - Voice Control Home AutomationAbhishek Neb

Voice Control Home AutomationAbhishek Neb

Presentation on home automationSubhash Kumar Yadav

Smart homesBrad Fitzpatrick

Home automation using android mobilesDurairaja

Viewers also liked (7)

CONTROLLING HOME APPLIANCES USING REMOTE(1)

Controlling Home Appliances Using Voice

Thesis - Voice Control Home Automation

Voice Control Home Automation

Presentation on home automation

Smart homes

Home automation using android mobiles

Similar to Speech Recognition techniques for Home Automation

Voice Recognition System using Template MatchingIJORCS

Assignanshu agrawal

Speech Recognition Goa App

Utterance based speaker identificationIJCSEA Journal

Dy36749754IJERA Editor

Speech Recognition in Artificail InteligenceIlhaan Marwat

Utterance Based Speaker Identification Using ANNIJCSEA Journal

Speech recognizers & generatorsPaul Kahoro

Artificial Intelligence- An Introductionacemindia

Artificial Intelligence - An Introduction acemindia

Voicereplay21

Speech recognition using neural + fuzzy logicSnehal Patel

Speechrecognition 100423091251-phpapp01girishjoshi1234

De4201715719IJERA Editor

Volume 2-issue-6-2186-2189Editor IJARCET

Silent sound interfaceJeevitha Reddy

Kc3517481754IJERA Editor

VOICE RECOGNITION SYSTEMJournal For Research

Similar to Speech Recognition techniques for Home Automation (20)

Voice Recognition System using Template Matching

Assign

Speech Recognition

Utterance based speaker identification

Dy36749754

Speech Recognition in Artificail Inteligence

Utterance Based Speaker Identification Using ANN

Speech recognizers & generators

Artificial Intelligence- An Introduction

Artificial Intelligence - An Introduction

Voice

Speech recognition using neural + fuzzy logic

Speechrecognition 100423091251-phpapp01

De4201715719

Volume 2-issue-6-2186-2189

Silent sound interface

Kc3517481754

VOICE RECOGNITION SYSTEM

Recently uploaded

Partners Life - Insurer Innovation Award 2024The Digital Insurer

Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi

Manulife - Insurer Innovation Award 2024The Digital Insurer

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays

Scaling API-first – The story of a global engineering organizationRadu Cotescu

HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics

Why Teams call analytics are critical to your entire businesspanagenda

GenAI Risks & Security Meetup 01052024.pdflior mazor

A Domino Admins Adventures (Engage 2024)Gabriella Davis

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Artificial Intelligence: Facts and MythsJoaquim Jorge

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

Recently uploaded (20)

Partners Life - Insurer Innovation Award 2024

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams

Manulife - Insurer Innovation Award 2024

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Scaling API-first – The story of a global engineering organization

HTML Injection Attacks: Impact and Mitigation Strategies

Why Teams call analytics are critical to your entire business

GenAI Risks & Security Meetup 01052024.pdf

A Domino Admins Adventures (Engage 2024)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

How to Troubleshoot Apps for the Modern Connected Worker

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

Top 10 Most Downloaded Games on Play Store in 2024

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

Strategies for Landing an Oracle DBA Job as a Fresher

Artificial Intelligence: Facts and Myths

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

Exploring the Future Potential of AI-Enabled Smartphone Processors

Artificial Intelligence Chap.5 : Uncertainty

Speech Recognition techniques for Home Automation

1. Biometric/Speaker Identification Based on Voice. PRESENTED BY: Project guide: SANTOSH RAJ Ms. Swati Prasad

2. Speaker identification is the process of automatically recognizing who is speaking based on unique characteristics contained in speech signal. This technique makes it possible to use the speaker's voice and the spoken word to verify their identity and control access.

3. Speaker Model DatabaseTest Speech Whose voice is it? Identification system

4. 1. Easier way to control home appliances(controlling appliances using voice command instead of remote of switches is much convinient ). 2. Problem of forgetting password will be resolved using aforementioned system(old age people generally forgetting their password so this should be useful tool for them).

5. 1950s and 1960s- Bell Laborotories designed in 1952 the Audrey system which recognized digits spoken by the single voice. 1970s- Carnegie Mellon’s “Harpy Speech understanding system” which can understand 1011 words. 1980s- speech recognition turns towards prediction. 2013- Apple designed SIRI voice recognition system which turns to a benchmark till today.

7. 1. It must be noise free. 2. Voiced signal should only be considered for further processing. 3. Voiced signal will be distinguised with unvoiced signal through threshold amplitude.

8. Linear prediction coding(LPC)- It is the feature extraction technique and used representation of a speech signal which will be generating low data rate discreet signal. • Voiced/unvoiced(1 byte) • Pitch(6 byte) • Voiced signal amplitude(11 byte) • Unvoiced amplitude(5byte)

9. GAUSSIAN MIXTURE MODELS (GMM) • A Gaussian Mixture Model (GMM) is a parametric probability density function represented as a weighted sum of Gaussian component densities. • GMMs are commonly used as a parametric model of the probability distribution of continuous measurements or features in a biometric system, such as vocal-tract related spectral features in a speaker recognition system

10. 1.Features extraction from the unknown voice 2.Calculate gmm parameter 3.Compare this with the existing model 4.Based on matching percentage identify the speaker

11.  we were sucessful in assembling the blocks in our project.  A basic speech recognition system which recognises a digit from 0 to 9 was thus developed.  The system was trained in an environment with minimum ambient noise.  The system gives the most accurate results when implemented in the environment where it was trained.

12.  We have taken samples of 50 speakers uttering digits 0-9 each 20 times from the reliable database.  The unknown voice is tested succesfully with our existing model.  The identification rate is approx. 90%.

13.  We intend to design a home automation system which will be beased on the command prompted by the authorised speaker. This will enable us to control home appliances more conveniently.

14. REFERENCES [1] X.Huang, A. Acero, and H.-W. Hon, “Spoken Language Processing: A Guide to Theory, Algorithm and System Development”. Prentice Hall PTR May 2001 [2] Matthew Nicholas Stuttle, “A Gaussian Mixture Model Spectral Representation for Speech Recognition”. Hughes Hall and Cambridge University Engineering Department. July 2003

Speech Recognition techniques for Home Automation

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (7)

Similar to Speech Recognition techniques for Home Automation

Similar to Speech Recognition techniques for Home Automation (20)

More from Mphasis

More from Mphasis (20)

Recently uploaded

Recently uploaded (20)

Speech Recognition techniques for Home Automation