SlideShare a Scribd company logo
1 of 7
Download to read offline
International Refereed Journal of Engineering and Science (IRJES)
ISSN (Online) 2319-183X, (Print) 2319-1821
Volume 4, Issue 3 (March 2015), PP.01-07
www.irjes.com 1 | Page
PSoC BASED SPEECH RECOGNITION SYSTEM
1
Shubhasini Sugumaran, 2
Mr.V.R.Prakash
1
Department of Electronics and Communication Hindustan University Chennai, India
2
Department of Electronics and Communication Hindustan University Chennai,India
Abstract:– Speech Recognition Systems(SRS) have been implemented by various processors including the
digital signal processors(DSPs) and field programmable gate arrays(FPGAs) and their performance has been
reported in literature. The fundamental purpose of speech is communication, i.e., the transmission of
messages.In the case of speech, the fundamental analog form of the message is an acoustic waveform, which we
call the speech signal. Speech signals can be converted to an electrical waveform by a microphone, further
manipulated by both analog and digital signal processing, and then converted back to acoustic form by a
loudspeaker, a telephone handset or headphone, as desired.The recognition of speech requires feature extraction
and classification. The systems that use speech as input require a microcontroller to carry out the desired
actions. In this paper, Cypress Programmable System on Chip (PSoC) has been studied and used for
implementation of SRS. From all the available PSoCs, PSoC5 containing ARM Cortex-M3 as its CPU is used.
The noise signals are firstly nullified from the speech signals using LogMMSE filtering. These signals are then
sent to the PSoC5 wherein the speech is recognized and desired actions are performed.
Keywords: PSoC, LogMMSE, Speech Recognition
I. INTRODUCTION
The basic idea of speech is the transmission of messages. A message is represented as a sequence of
discrete symbols that quantifies its information in bits and the rate at which information is transmitted as bits per
second (bps). Speech recognition techniques have seemed to be more efficient and convenient for human-
machine interaction. The speech recognition systems with fixed vocabulary were deployed in many
applications[8] and [9]. Speech recognition systems for voice operated application have been implemented using
various hardware platforms such as the DSPs [3], FPGAs [5] and microprocessors [10]. In speech production,
the information to be transmitted is encoded in the form of a continuously varying analog waveform that can be
transmitted, recorded, manipulated, and ultimately decoded by a human listener. This analog signal is the speech
signal. These signals tend to be corrupted by noise in the real world. If the noise can be estimated from the noise
source, this estimated noise can then be subtracted from the primary channel resulting in the desired signal. This
task is usually done by linear filtering. In real time situations, the corrupting noise is a nonlinear distortion
version of the source noise, so a nonlinear filter should be a better choice. To reduce the influence of noise in the
speech, speech enhancement is done. The recognition algorithm without the use of enhancement algorithms
proved to be less efficient.
Programmable System on Chip (PSoC) have and are being employed in a number of applications. They
are cost effective due to which they have limited storage and computational power. In context to this, the
recognition accuracy becomes important for PSoC and is addressed in the paper.
II. PROGRAMMABLE SYSTEM ON CHIP (PSOC)
Programmable System on Chip(PSoC) has been designed and implemented by Cypress semiconductors
[2] and [4]. Every PSoC contains a microcontroller, programmable analog blocks such as ADC, DAC, I/O
drivers and digital blocks such as Universal Digital Blocks (UDBs), CAN, I2C, PWM in a single chip.
Embedded Development kits from Cypress contain one of the three PSoCs – PsoC1, PSoC3 and PSoC5. The
processing performance, functionality, internal memories and configurability of the PSoC increases from
PSoC1through PSoc5.
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 2 | Page
Table 1: Comparison of PSoCs
PSoC5 uses PSoC Creator for its development and implementation. PSoC Creator is a visual
development tool and Integrated Development Environment for PSoC. It has a rich library of prebuilt
components and a schematic design entry tool. It combines C based development flow with an automatically
generated Application programmable interface (API). API rduces the errors in code and ensures proper
interfacing with the peripheral devices which enables the software development to be faster, easier and less
prone to errors. The PSoC Creator also has powerful, modern debugger, which is built in the IDE. It is used to
display the values after execution at each point.
III. SPEECH RECOGNITION SYSTEM
The block diagram is as shown in Fig.1 below.
Speech Recognition System
The speech is given as input through a microphone. The given speech might contain external
disturbances called noise. This noise is cancelled using LogMMSE algorithm. The speech signal after noise
cancellation is amplified and filtered to remove further disturbances in the signal. The speech signal in analog
form is then converted into digital signal using inbuilt analog to digital converter of PSoC. The feature
extraction and pattern recognition is done using MFCC algorithm. The analog to digital conversion is done
using delta sigma ADC present in PSoC5.
The sampling rate of ADC can be adjusted depending on the speech signal as required. The delta sigma
ADC contains three blocks – an amplifier, a modulator and a decimator. The decimator is a four stage CIC
decimation filter. It also contains a post processing unit. Continuous mode of the ADC is being used for
conversion. The output of the speech recognition system is shown through the movement of a robot in different
directions.
FEATURES PSoC1 PSoC3 PSoC5
CPU 8-bit M8C
core
8-bit 8051 32-bit ARM
Cortex
Flash 4 to 32kB 8 to 64kB 32 to 256kB
Interface I²C, SPI,
UART,
FS USB 2.0
I²C, SPI,
UART,
LIN,
FS USB
2.0, I²S,
CAN
I²C, SPI,
UART, LIN,
FS USB 2.0,
I²S,CAN
ADCs 1 delta-
sigma
1 Delta-
Sigma
1 Delta-Sigma,
2SARs
DACs 2(6 bit) 4(8 bit) 4(8 bit)
I/Os 64 72 72
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 3 | Page
IV. TECHNIQUES USED
The performance of the SRS degrades when implemented in real world environment. This degradation
is due to acoustic model mismatch. The acoustic model mismatch describes the difference between the
environment in which the SRS is tested and the actual environment in which it is deployed. This can include
echoes, background noise, speaker variability and transmission effects.
1. Log MMSE Filtering
Fig.2 Noise reduction using Log MMSE
1.1 Preprocessing
In preprocessing of audio signals start with pre-emphasis refers to a system process designed to
increase the magnitude of some frequencies with respect to the magnitude of other frequencies in order to
improve the overall signal-to-noise ratio by minimizing the adverse effects of such phenomena as attenuation
distortion or saturation of recording media in subsequent parts of the system. The mirror operation is called de-
emphasis, and the system as a whole is called emphasis.
Pre-emphasis is achieved with a pre-emphasis network which is essentially a calibrated filter. This
network composed of two resistors and one capacitor. The frequency response is decided by special time
constants. The cutoff frequency can be calculated from that value. Pre-emphasis is commonly used
in telecommunications, digital audio recording, record cutting, in FM broadcasting transmissions, and in
displaying the spectrograms of speech signals.
De-emphasis is the complement of pre-emphasis, in the anti noise system called emphasis. Emphasis is
a system process designed to decrease, (within a band of frequencies), the magnitude of some (usually higher)
frequencies with respect to the magnitude of other (usually lower) frequencies in order to improve the overall
signal-to-noise ratio by minimizing the adverse effects of such phenomena as attenuation differences or
saturation of recording media in subsequent parts of the system.
1.2 Segment
The signal samples are segmented into fixed number of frames and each frame samples are evaluated with
hamming window coefficients.
The total frames are calculated by,
Fn = (Ls – Ns)/ (Ns*Sp) +1
Where,
Ls = length of signal
Ns = Length of each frame
Sp = Shift Percentage
Finally the samples of each frames are separated from input signal using Fn and Sp and its scaled by the
hamming window coefficients.
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 4 | Page
Fig.3 Segmented signal
1.3 Filtering
After the signal segmentation, the magnitude and phase spectrum from noisy signal are computed by
applying fast fourier transform.The magnitude of noisy signal spectrum is further utilized for filtering process
and signal phase kept same.
The restored signal magnitude spectra is obtained by,
Rs = G .* Y
Where, G – Log spectral amplitude Gain function
Y – Magnitude response of noisy signal
The log spectral gain function is defined by,
G = x./(1+x) exp(eint (v))
Where, v = x./(1+x) * r
x and r– Priori and posteriori signal to noise ratio
eint – Exponential integral
The posteriori snr is defined by,
r = (Y.^2)/ lamda
lamda = E[(Y).^2]
Where, lamda - Noise power spectrum variance
E – Mean value
Complex spectrogram obtained by Filtered magnitude spectrum is combined with noisy signal phase spectrum.
The restored signal is reconstructed by applying inverse fast fourier transform to this complex spectrogram.The
performance of filtering is measured with SNR evaluation and it is defined by,
SNR= 10log10 (Msig^2/(sum(in-out)^2/Ls))
Where, Msig = Maximum amplitude of signal
in, out= Noisy input signal and restored output.
1.4 Concatenate
The signals that were obtained on segmentation are now filtered and the noise content is reduced. These signals
are then concatenated to reform the original signal.
1.5 Restore
The restored noiseless signal is restored to its original form.
2. MEL FREQUENCY CEPSTRUM COEFFICIENT(MFCC)
To increase the robustness in the frame selection process, a robust feature extraction with short time
Fourier transform (STFT) domain uncertainty propagation (STFT-UP) was used. Our implementation followed
using STFT-UP to compute a minimum mean square error (MMSE) estimate directly in the mel-frequency
cepstral coefficient (MFCC) domain. For this particular implementation, amplitude based MFCCs with cepstral
mean subtraction were used to attain improved performance.
0 20 40 60 80 100 120 140 160 180 200
-0.06
-0.04
-0.02
0
0.02
0.04
0.06
0 20 40 60 80 100 120
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 5 | Page
Fig. 4 MFCC
In speech recognition, the Mel-frequency cepstrum (MFC) is a representation of the shortterm power
spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear Mel scale of
frequency.
MFCC = DCT [ LOG [ ABS [ FFT (SPEECH) ]]]
Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC. The difference
between the cepstrum and the mel-frequency cepstrum is that in the MFC, the frequency bands are equally
spaced on the mel scale, which approximates the human auditory system's response more closely than the
linearly spaced frequency bands used in the normal cepstrum. This frequency warping can allow for better
representation of sound.
V. TRAINING AND TESTING
The goal of the system training and inventory design stage is twofold: we need to divide the inventory
into collections of phonetically similar segments with varying lengths and we need to arrive at a statistical
description that tells us which set of collections is most likely to contain the inventory subsection that best
matches the underlying clean frame of an incoming noisy frame . The division of the inventory into the
collections is performed in a step-by-step fashion. First, is segmented and all silent segments are removed. The
non-silent part of the inventory is then divided into sections that each belong to one of 40 phonetic classes.
We are applying the feature extraction to the entire segment stream of the inventory. Because the inventory
signal is assumed to be virtually undistorted, it is sufficient to only retain the resulting short-time MFCC feature
means and to discard the associated variance estimates. The feature means become, thereby, essentially feature
vectors in their own right and we can develop a cluster model for them. We have decided to use the means and
not the actual cepstral vectors at this stage to ensure that the impact of the mean-extraction-processing is
captured in our feature representation.
VI. RESULTS
Fig. 5 Noisy Signal
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 6 | Page
Fig. 6 Log MMSE Filtering
Fig. 7 MFCC Feature Extraction
For experimental basis, speech signals from noisy environments were taken. The result shown is fr a
signal in the airport. The noise is cancelled using LogMMSE Filtering and feature extraction is done by MFCC.
VI. CONCLUSION
In this paper speech recognition is done and appliances are controlled using speech commands. The
commands are given by the user in the form of speech. These signals are filtered using LogMMSE filtering due
to which environmental noise is nullified. Further as a part of feature extraction, MFCC is used. A database is
created where all the commands are saved. These commands are then given as input to the PSoC5 kit where the
PSoC is programmed to give the desired result. According to the PSoC, the robot connected to the PSoC moves
in different directions as specified.
REFERENCES
[1]. V. Naresh, B. Venkataramani, Abhishek Karan and J. Manikandan, " PSoCbased isolated speech
recognition system, " International conference on Communication and Signal Processing, April 3-5,
2013.
[2]. R Namba, K Kobayashi, T Ohkubo and Y.Kurihara, "Development of PSoC microcontroller based
solar energy storage system," Proceedings of SICE Annual Conference (SICE), 2011, pp.718-721,
2011.
[3]. J. Manikandan, B. Venkataramani, K. Girish, H. Karthic, V. Siddharth, "Hardware Implementation of
Real-Time Speech Recognition System Using TMS320C6713 DSP",24th International Conference
onVLSI Design (VLSI Design),pp.250-255, 2011
[4]. Jingchuan Wang and WeidongChen , "Integration of PSoC technology with educational robotics",
International Conference on Field-Programmable Technology (FPT), 2010, pp.332-336, 2010.
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 7 | Page
[5]. Cheng-Yuan Chang , Ching-Fa Chen , Shing-Tai Pan , Xu-Yu Li, " The speech recognition chip
implementation on FPGA “, Mechanical and Electronics Engineering (ICMEE), 2nd International
Conference,2010.
[6]. V. Amudha, B. Venkataramani, J. Manikandan, "FPGA implementation of isolated digit recognition
system using modified back propagation algorithm,"International Conference on Electronic Design
ICED 2008, pp.1-6.
[7]. V.Amudha, B.Venkataramani, R.Vinoth Kumar and S. Ravishankar,“SOC Implementation of HMM
Based Speaker Independent Isolated Digit Recognition System”, in Proc. of IEEE Int. Conf. on
VLSIDesign VLSI’07, 2007, pp.848-853.
[8]. Trihandyo, A. Belloum, A. Kun-Mean Hou, ″A real-time speech recognition architecture for a multi-
channel interactive voice response system″, Iternational Conference on Acoustics, Speech, and Signal
ProcessingICASSP-97, vol.2, pp.1527-1530,1997
[9]. Mike Wald, Using Automatic Speech Recognition to Enhance Education for All Students: Turning a
Vision into Reality [A].In Proceedings of 34th ASEE/IEEE Frontiers in Education Conference S3G,
Indianapolis, Indiana, 2005, pp 22-25.
[10]. N. Hataoka, H. Kokubo, Y. Obuchi, and A. Amano, "Compact and robust speech recognition for
embedded use on microprocessors," IEEE Workshop on Multimedia Signal Processing, pp. 288-291, 9-
11 Dec. 2002.

More Related Content

What's hot

Introductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal ProcessingIntroductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal ProcessingAngelo Salatino
 
Lecture 8 audio compression
Lecture 8 audio compressionLecture 8 audio compression
Lecture 8 audio compressionMr SMAK
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPCDisha Modi
 
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IDSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IAmr E. Mohamed
 
Introduction to DSP
Introduction to DSPIntroduction to DSP
Introduction to DSPshinychristo
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compressionneeraj9217
 
Introduction to Digital Signal Processing
Introduction to Digital Signal ProcessingIntroduction to Digital Signal Processing
Introduction to Digital Signal Processingop205
 
Audio compression
Audio compression Audio compression
Audio compression Darshan IT
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2Hossam Hassan
 
Filtration and synthesis of different types of human voice signals
Filtration and synthesis of different types of human voice signalsFiltration and synthesis of different types of human voice signals
Filtration and synthesis of different types of human voice signalsAlexander Decker
 
Discrete-Time Signal Processing
Discrete-Time Signal ProcessingDiscrete-Time Signal Processing
Discrete-Time Signal Processinglancer350
 

What's hot (19)

Introductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal ProcessingIntroductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal Processing
 
Lecture 8 audio compression
Lecture 8 audio compressionLecture 8 audio compression
Lecture 8 audio compression
 
DSP Processor
DSP Processor DSP Processor
DSP Processor
 
Introduction to dsp
Introduction to dspIntroduction to dsp
Introduction to dsp
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
 
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IDSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
 
Introduction to DSP
Introduction to DSPIntroduction to DSP
Introduction to DSP
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
 
Introduction to Digital Signal Processing
Introduction to Digital Signal ProcessingIntroduction to Digital Signal Processing
Introduction to Digital Signal Processing
 
Convolution
ConvolutionConvolution
Convolution
 
Audio compression
Audio compression Audio compression
Audio compression
 
Lecture 18 (5)
Lecture 18 (5)Lecture 18 (5)
Lecture 18 (5)
 
Speech compression-using-gsm
Speech compression-using-gsmSpeech compression-using-gsm
Speech compression-using-gsm
 
Subband Coding
Subband CodingSubband Coding
Subband Coding
 
Digital signal processor part4
Digital signal processor part4Digital signal processor part4
Digital signal processor part4
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2
 
Filtration and synthesis of different types of human voice signals
Filtration and synthesis of different types of human voice signalsFiltration and synthesis of different types of human voice signals
Filtration and synthesis of different types of human voice signals
 
Digital signal processing part1
Digital signal processing part1Digital signal processing part1
Digital signal processing part1
 
Discrete-Time Signal Processing
Discrete-Time Signal ProcessingDiscrete-Time Signal Processing
Discrete-Time Signal Processing
 

Viewers also liked

SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK Kamonasish Hore
 
An Overview on Programmable System on Chip: PSoC-5
An Overview on Programmable System on Chip: PSoC-5An Overview on Programmable System on Chip: PSoC-5
An Overview on Programmable System on Chip: PSoC-5Premier Farnell
 
R0boCamp2016 Віктор Кремінь: Системи на кристалі Cypress PSoC для вбудованих ...
R0boCamp2016 Віктор Кремінь: Системи на кристалі Cypress PSoC для вбудованих ...R0boCamp2016 Віктор Кремінь: Системи на кристалі Cypress PSoC для вбудованих ...
R0boCamp2016 Віктор Кремінь: Системи на кристалі Cypress PSoC для вбудованих ...Lviv Startup Club
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionRichie
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentationhimanshubhatti
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By MatlabAnkit Gujrati
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 

Viewers also liked (7)

SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK SPEECH RECOGNITION USING NEURAL NETWORK
SPEECH RECOGNITION USING NEURAL NETWORK
 
An Overview on Programmable System on Chip: PSoC-5
An Overview on Programmable System on Chip: PSoC-5An Overview on Programmable System on Chip: PSoC-5
An Overview on Programmable System on Chip: PSoC-5
 
R0boCamp2016 Віктор Кремінь: Системи на кристалі Cypress PSoC для вбудованих ...
R0boCamp2016 Віктор Кремінь: Системи на кристалі Cypress PSoC для вбудованих ...R0boCamp2016 Віктор Кремінь: Системи на кристалі Cypress PSoC для вбудованих ...
R0boCamp2016 Віктор Кремінь: Системи на кристалі Cypress PSoC для вбудованих ...
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 

Similar to PSoC speech recognition

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...IJCSEA Journal
 
An Evaluation Of Lms Based Adaptive Filtering
An Evaluation Of Lms Based Adaptive FilteringAn Evaluation Of Lms Based Adaptive Filtering
An Evaluation Of Lms Based Adaptive FilteringRenee Wardowski
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...eSAT Journals
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Speech compression analysis using matlab
Speech compression analysis using matlabSpeech compression analysis using matlab
Speech compression analysis using matlabeSAT Journals
 
Speech compression analysis using matlab
Speech compression analysis using matlabSpeech compression analysis using matlab
Speech compression analysis using matlabeSAT Publishing House
 
Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approachijsrd.com
 
Low power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoderLow power fpga solution for dab audio decoder
Low power fpga solution for dab audio decodereSAT Publishing House
 
Discrete time signal processing unit-2
Discrete time signal processing unit-2Discrete time signal processing unit-2
Discrete time signal processing unit-2selvalakshmi24
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...IRJET Journal
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...IRJET Journal
 
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...ijcsa
 
Introduction to dsp by bibhu prasad ganthia
Introduction to dsp by bibhu prasad ganthiaIntroduction to dsp by bibhu prasad ganthia
Introduction to dsp by bibhu prasad ganthiaDr. Bibhu Prasad Ganthia
 
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNALGENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNALIJCSEIT Journal
 

Similar to PSoC speech recognition (20)

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
 
An Evaluation Of Lms Based Adaptive Filtering
An Evaluation Of Lms Based Adaptive FilteringAn Evaluation Of Lms Based Adaptive Filtering
An Evaluation Of Lms Based Adaptive Filtering
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Speech compression analysis using matlab
Speech compression analysis using matlabSpeech compression analysis using matlab
Speech compression analysis using matlab
 
Speech compression analysis using matlab
Speech compression analysis using matlabSpeech compression analysis using matlab
Speech compression analysis using matlab
 
Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approach
 
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
 
Low power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoderLow power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoder
 
Lab based report
Lab based reportLab based report
Lab based report
 
Discrete time signal processing unit-2
Discrete time signal processing unit-2Discrete time signal processing unit-2
Discrete time signal processing unit-2
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
 
G010424248
G010424248G010424248
G010424248
 
H0814247
H0814247H0814247
H0814247
 
Introduction to dsp by bibhu prasad ganthia
Introduction to dsp by bibhu prasad ganthiaIntroduction to dsp by bibhu prasad ganthia
Introduction to dsp by bibhu prasad ganthia
 
F5242832
F5242832F5242832
F5242832
 
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNALGENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
 

More from IJRES Journal

Exploratory study on the use of crushed cockle shell as partial sand replacem...
Exploratory study on the use of crushed cockle shell as partial sand replacem...Exploratory study on the use of crushed cockle shell as partial sand replacem...
Exploratory study on the use of crushed cockle shell as partial sand replacem...IJRES Journal
 
Congenital Malaria: Correlation of Umbilical Cord Plasmodium falciparum Paras...
Congenital Malaria: Correlation of Umbilical Cord Plasmodium falciparum Paras...Congenital Malaria: Correlation of Umbilical Cord Plasmodium falciparum Paras...
Congenital Malaria: Correlation of Umbilical Cord Plasmodium falciparum Paras...IJRES Journal
 
Review: Nonlinear Techniques for Analysis of Heart Rate Variability
Review: Nonlinear Techniques for Analysis of Heart Rate VariabilityReview: Nonlinear Techniques for Analysis of Heart Rate Variability
Review: Nonlinear Techniques for Analysis of Heart Rate VariabilityIJRES Journal
 
Dynamic Modeling for Gas Phase Propylene Copolymerization in a Fluidized Bed ...
Dynamic Modeling for Gas Phase Propylene Copolymerization in a Fluidized Bed ...Dynamic Modeling for Gas Phase Propylene Copolymerization in a Fluidized Bed ...
Dynamic Modeling for Gas Phase Propylene Copolymerization in a Fluidized Bed ...IJRES Journal
 
Study and evaluation for different types of Sudanese crude oil properties
Study and evaluation for different types of Sudanese crude oil propertiesStudy and evaluation for different types of Sudanese crude oil properties
Study and evaluation for different types of Sudanese crude oil propertiesIJRES Journal
 
A Short Report on Different Wavelets and Their Structures
A Short Report on Different Wavelets and Their StructuresA Short Report on Different Wavelets and Their Structures
A Short Report on Different Wavelets and Their StructuresIJRES Journal
 
A Case Study on Academic Services Application Using Agile Methodology for Mob...
A Case Study on Academic Services Application Using Agile Methodology for Mob...A Case Study on Academic Services Application Using Agile Methodology for Mob...
A Case Study on Academic Services Application Using Agile Methodology for Mob...IJRES Journal
 
Wear Analysis on Cylindrical Cam with Flexible Rod
Wear Analysis on Cylindrical Cam with Flexible RodWear Analysis on Cylindrical Cam with Flexible Rod
Wear Analysis on Cylindrical Cam with Flexible RodIJRES Journal
 
DDOS Attacks-A Stealthy Way of Implementation and Detection
DDOS Attacks-A Stealthy Way of Implementation and DetectionDDOS Attacks-A Stealthy Way of Implementation and Detection
DDOS Attacks-A Stealthy Way of Implementation and DetectionIJRES Journal
 
An improved fading Kalman filter in the application of BDS dynamic positioning
An improved fading Kalman filter in the application of BDS dynamic positioningAn improved fading Kalman filter in the application of BDS dynamic positioning
An improved fading Kalman filter in the application of BDS dynamic positioningIJRES Journal
 
Positioning Error Analysis and Compensation of Differential Precision Workbench
Positioning Error Analysis and Compensation of Differential Precision WorkbenchPositioning Error Analysis and Compensation of Differential Precision Workbench
Positioning Error Analysis and Compensation of Differential Precision WorkbenchIJRES Journal
 
Status of Heavy metal pollution in Mithi river: Then and Now
Status of Heavy metal pollution in Mithi river: Then and NowStatus of Heavy metal pollution in Mithi river: Then and Now
Status of Heavy metal pollution in Mithi river: Then and NowIJRES Journal
 
The Low-Temperature Radiant Floor Heating System Design and Experimental Stud...
The Low-Temperature Radiant Floor Heating System Design and Experimental Stud...The Low-Temperature Radiant Floor Heating System Design and Experimental Stud...
The Low-Temperature Radiant Floor Heating System Design and Experimental Stud...IJRES Journal
 
Experimental study on critical closing pressure of mudstone fractured reservoirs
Experimental study on critical closing pressure of mudstone fractured reservoirsExperimental study on critical closing pressure of mudstone fractured reservoirs
Experimental study on critical closing pressure of mudstone fractured reservoirsIJRES Journal
 
Correlation Analysis of Tool Wear and Cutting Sound Signal
Correlation Analysis of Tool Wear and Cutting Sound SignalCorrelation Analysis of Tool Wear and Cutting Sound Signal
Correlation Analysis of Tool Wear and Cutting Sound SignalIJRES Journal
 
Reduce Resources for Privacy in Mobile Cloud Computing Using Blowfish and DSA...
Reduce Resources for Privacy in Mobile Cloud Computing Using Blowfish and DSA...Reduce Resources for Privacy in Mobile Cloud Computing Using Blowfish and DSA...
Reduce Resources for Privacy in Mobile Cloud Computing Using Blowfish and DSA...IJRES Journal
 
Resistance of Dryland Rice to Stem Borer (Scirpophaga incertulas Wlk.) Using ...
Resistance of Dryland Rice to Stem Borer (Scirpophaga incertulas Wlk.) Using ...Resistance of Dryland Rice to Stem Borer (Scirpophaga incertulas Wlk.) Using ...
Resistance of Dryland Rice to Stem Borer (Scirpophaga incertulas Wlk.) Using ...IJRES Journal
 
A novel high-precision curvature-compensated CMOS bandgap reference without u...
A novel high-precision curvature-compensated CMOS bandgap reference without u...A novel high-precision curvature-compensated CMOS bandgap reference without u...
A novel high-precision curvature-compensated CMOS bandgap reference without u...IJRES Journal
 
Structural aspect on carbon dioxide capture in nanotubes
Structural aspect on carbon dioxide capture in nanotubesStructural aspect on carbon dioxide capture in nanotubes
Structural aspect on carbon dioxide capture in nanotubesIJRES Journal
 
Thesummaryabout fuzzy control parameters selected based on brake driver inten...
Thesummaryabout fuzzy control parameters selected based on brake driver inten...Thesummaryabout fuzzy control parameters selected based on brake driver inten...
Thesummaryabout fuzzy control parameters selected based on brake driver inten...IJRES Journal
 

More from IJRES Journal (20)

Exploratory study on the use of crushed cockle shell as partial sand replacem...
Exploratory study on the use of crushed cockle shell as partial sand replacem...Exploratory study on the use of crushed cockle shell as partial sand replacem...
Exploratory study on the use of crushed cockle shell as partial sand replacem...
 
Congenital Malaria: Correlation of Umbilical Cord Plasmodium falciparum Paras...
Congenital Malaria: Correlation of Umbilical Cord Plasmodium falciparum Paras...Congenital Malaria: Correlation of Umbilical Cord Plasmodium falciparum Paras...
Congenital Malaria: Correlation of Umbilical Cord Plasmodium falciparum Paras...
 
Review: Nonlinear Techniques for Analysis of Heart Rate Variability
Review: Nonlinear Techniques for Analysis of Heart Rate VariabilityReview: Nonlinear Techniques for Analysis of Heart Rate Variability
Review: Nonlinear Techniques for Analysis of Heart Rate Variability
 
Dynamic Modeling for Gas Phase Propylene Copolymerization in a Fluidized Bed ...
Dynamic Modeling for Gas Phase Propylene Copolymerization in a Fluidized Bed ...Dynamic Modeling for Gas Phase Propylene Copolymerization in a Fluidized Bed ...
Dynamic Modeling for Gas Phase Propylene Copolymerization in a Fluidized Bed ...
 
Study and evaluation for different types of Sudanese crude oil properties
Study and evaluation for different types of Sudanese crude oil propertiesStudy and evaluation for different types of Sudanese crude oil properties
Study and evaluation for different types of Sudanese crude oil properties
 
A Short Report on Different Wavelets and Their Structures
A Short Report on Different Wavelets and Their StructuresA Short Report on Different Wavelets and Their Structures
A Short Report on Different Wavelets and Their Structures
 
A Case Study on Academic Services Application Using Agile Methodology for Mob...
A Case Study on Academic Services Application Using Agile Methodology for Mob...A Case Study on Academic Services Application Using Agile Methodology for Mob...
A Case Study on Academic Services Application Using Agile Methodology for Mob...
 
Wear Analysis on Cylindrical Cam with Flexible Rod
Wear Analysis on Cylindrical Cam with Flexible RodWear Analysis on Cylindrical Cam with Flexible Rod
Wear Analysis on Cylindrical Cam with Flexible Rod
 
DDOS Attacks-A Stealthy Way of Implementation and Detection
DDOS Attacks-A Stealthy Way of Implementation and DetectionDDOS Attacks-A Stealthy Way of Implementation and Detection
DDOS Attacks-A Stealthy Way of Implementation and Detection
 
An improved fading Kalman filter in the application of BDS dynamic positioning
An improved fading Kalman filter in the application of BDS dynamic positioningAn improved fading Kalman filter in the application of BDS dynamic positioning
An improved fading Kalman filter in the application of BDS dynamic positioning
 
Positioning Error Analysis and Compensation of Differential Precision Workbench
Positioning Error Analysis and Compensation of Differential Precision WorkbenchPositioning Error Analysis and Compensation of Differential Precision Workbench
Positioning Error Analysis and Compensation of Differential Precision Workbench
 
Status of Heavy metal pollution in Mithi river: Then and Now
Status of Heavy metal pollution in Mithi river: Then and NowStatus of Heavy metal pollution in Mithi river: Then and Now
Status of Heavy metal pollution in Mithi river: Then and Now
 
The Low-Temperature Radiant Floor Heating System Design and Experimental Stud...
The Low-Temperature Radiant Floor Heating System Design and Experimental Stud...The Low-Temperature Radiant Floor Heating System Design and Experimental Stud...
The Low-Temperature Radiant Floor Heating System Design and Experimental Stud...
 
Experimental study on critical closing pressure of mudstone fractured reservoirs
Experimental study on critical closing pressure of mudstone fractured reservoirsExperimental study on critical closing pressure of mudstone fractured reservoirs
Experimental study on critical closing pressure of mudstone fractured reservoirs
 
Correlation Analysis of Tool Wear and Cutting Sound Signal
Correlation Analysis of Tool Wear and Cutting Sound SignalCorrelation Analysis of Tool Wear and Cutting Sound Signal
Correlation Analysis of Tool Wear and Cutting Sound Signal
 
Reduce Resources for Privacy in Mobile Cloud Computing Using Blowfish and DSA...
Reduce Resources for Privacy in Mobile Cloud Computing Using Blowfish and DSA...Reduce Resources for Privacy in Mobile Cloud Computing Using Blowfish and DSA...
Reduce Resources for Privacy in Mobile Cloud Computing Using Blowfish and DSA...
 
Resistance of Dryland Rice to Stem Borer (Scirpophaga incertulas Wlk.) Using ...
Resistance of Dryland Rice to Stem Borer (Scirpophaga incertulas Wlk.) Using ...Resistance of Dryland Rice to Stem Borer (Scirpophaga incertulas Wlk.) Using ...
Resistance of Dryland Rice to Stem Borer (Scirpophaga incertulas Wlk.) Using ...
 
A novel high-precision curvature-compensated CMOS bandgap reference without u...
A novel high-precision curvature-compensated CMOS bandgap reference without u...A novel high-precision curvature-compensated CMOS bandgap reference without u...
A novel high-precision curvature-compensated CMOS bandgap reference without u...
 
Structural aspect on carbon dioxide capture in nanotubes
Structural aspect on carbon dioxide capture in nanotubesStructural aspect on carbon dioxide capture in nanotubes
Structural aspect on carbon dioxide capture in nanotubes
 
Thesummaryabout fuzzy control parameters selected based on brake driver inten...
Thesummaryabout fuzzy control parameters selected based on brake driver inten...Thesummaryabout fuzzy control parameters selected based on brake driver inten...
Thesummaryabout fuzzy control parameters selected based on brake driver inten...
 

Recently uploaded

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Recently uploaded (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

PSoC speech recognition

  • 1. International Refereed Journal of Engineering and Science (IRJES) ISSN (Online) 2319-183X, (Print) 2319-1821 Volume 4, Issue 3 (March 2015), PP.01-07 www.irjes.com 1 | Page PSoC BASED SPEECH RECOGNITION SYSTEM 1 Shubhasini Sugumaran, 2 Mr.V.R.Prakash 1 Department of Electronics and Communication Hindustan University Chennai, India 2 Department of Electronics and Communication Hindustan University Chennai,India Abstract:– Speech Recognition Systems(SRS) have been implemented by various processors including the digital signal processors(DSPs) and field programmable gate arrays(FPGAs) and their performance has been reported in literature. The fundamental purpose of speech is communication, i.e., the transmission of messages.In the case of speech, the fundamental analog form of the message is an acoustic waveform, which we call the speech signal. Speech signals can be converted to an electrical waveform by a microphone, further manipulated by both analog and digital signal processing, and then converted back to acoustic form by a loudspeaker, a telephone handset or headphone, as desired.The recognition of speech requires feature extraction and classification. The systems that use speech as input require a microcontroller to carry out the desired actions. In this paper, Cypress Programmable System on Chip (PSoC) has been studied and used for implementation of SRS. From all the available PSoCs, PSoC5 containing ARM Cortex-M3 as its CPU is used. The noise signals are firstly nullified from the speech signals using LogMMSE filtering. These signals are then sent to the PSoC5 wherein the speech is recognized and desired actions are performed. Keywords: PSoC, LogMMSE, Speech Recognition I. INTRODUCTION The basic idea of speech is the transmission of messages. A message is represented as a sequence of discrete symbols that quantifies its information in bits and the rate at which information is transmitted as bits per second (bps). Speech recognition techniques have seemed to be more efficient and convenient for human- machine interaction. The speech recognition systems with fixed vocabulary were deployed in many applications[8] and [9]. Speech recognition systems for voice operated application have been implemented using various hardware platforms such as the DSPs [3], FPGAs [5] and microprocessors [10]. In speech production, the information to be transmitted is encoded in the form of a continuously varying analog waveform that can be transmitted, recorded, manipulated, and ultimately decoded by a human listener. This analog signal is the speech signal. These signals tend to be corrupted by noise in the real world. If the noise can be estimated from the noise source, this estimated noise can then be subtracted from the primary channel resulting in the desired signal. This task is usually done by linear filtering. In real time situations, the corrupting noise is a nonlinear distortion version of the source noise, so a nonlinear filter should be a better choice. To reduce the influence of noise in the speech, speech enhancement is done. The recognition algorithm without the use of enhancement algorithms proved to be less efficient. Programmable System on Chip (PSoC) have and are being employed in a number of applications. They are cost effective due to which they have limited storage and computational power. In context to this, the recognition accuracy becomes important for PSoC and is addressed in the paper. II. PROGRAMMABLE SYSTEM ON CHIP (PSOC) Programmable System on Chip(PSoC) has been designed and implemented by Cypress semiconductors [2] and [4]. Every PSoC contains a microcontroller, programmable analog blocks such as ADC, DAC, I/O drivers and digital blocks such as Universal Digital Blocks (UDBs), CAN, I2C, PWM in a single chip. Embedded Development kits from Cypress contain one of the three PSoCs – PsoC1, PSoC3 and PSoC5. The processing performance, functionality, internal memories and configurability of the PSoC increases from PSoC1through PSoc5.
  • 2. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 2 | Page Table 1: Comparison of PSoCs PSoC5 uses PSoC Creator for its development and implementation. PSoC Creator is a visual development tool and Integrated Development Environment for PSoC. It has a rich library of prebuilt components and a schematic design entry tool. It combines C based development flow with an automatically generated Application programmable interface (API). API rduces the errors in code and ensures proper interfacing with the peripheral devices which enables the software development to be faster, easier and less prone to errors. The PSoC Creator also has powerful, modern debugger, which is built in the IDE. It is used to display the values after execution at each point. III. SPEECH RECOGNITION SYSTEM The block diagram is as shown in Fig.1 below. Speech Recognition System The speech is given as input through a microphone. The given speech might contain external disturbances called noise. This noise is cancelled using LogMMSE algorithm. The speech signal after noise cancellation is amplified and filtered to remove further disturbances in the signal. The speech signal in analog form is then converted into digital signal using inbuilt analog to digital converter of PSoC. The feature extraction and pattern recognition is done using MFCC algorithm. The analog to digital conversion is done using delta sigma ADC present in PSoC5. The sampling rate of ADC can be adjusted depending on the speech signal as required. The delta sigma ADC contains three blocks – an amplifier, a modulator and a decimator. The decimator is a four stage CIC decimation filter. It also contains a post processing unit. Continuous mode of the ADC is being used for conversion. The output of the speech recognition system is shown through the movement of a robot in different directions. FEATURES PSoC1 PSoC3 PSoC5 CPU 8-bit M8C core 8-bit 8051 32-bit ARM Cortex Flash 4 to 32kB 8 to 64kB 32 to 256kB Interface I²C, SPI, UART, FS USB 2.0 I²C, SPI, UART, LIN, FS USB 2.0, I²S, CAN I²C, SPI, UART, LIN, FS USB 2.0, I²S,CAN ADCs 1 delta- sigma 1 Delta- Sigma 1 Delta-Sigma, 2SARs DACs 2(6 bit) 4(8 bit) 4(8 bit) I/Os 64 72 72
  • 3. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 3 | Page IV. TECHNIQUES USED The performance of the SRS degrades when implemented in real world environment. This degradation is due to acoustic model mismatch. The acoustic model mismatch describes the difference between the environment in which the SRS is tested and the actual environment in which it is deployed. This can include echoes, background noise, speaker variability and transmission effects. 1. Log MMSE Filtering Fig.2 Noise reduction using Log MMSE 1.1 Preprocessing In preprocessing of audio signals start with pre-emphasis refers to a system process designed to increase the magnitude of some frequencies with respect to the magnitude of other frequencies in order to improve the overall signal-to-noise ratio by minimizing the adverse effects of such phenomena as attenuation distortion or saturation of recording media in subsequent parts of the system. The mirror operation is called de- emphasis, and the system as a whole is called emphasis. Pre-emphasis is achieved with a pre-emphasis network which is essentially a calibrated filter. This network composed of two resistors and one capacitor. The frequency response is decided by special time constants. The cutoff frequency can be calculated from that value. Pre-emphasis is commonly used in telecommunications, digital audio recording, record cutting, in FM broadcasting transmissions, and in displaying the spectrograms of speech signals. De-emphasis is the complement of pre-emphasis, in the anti noise system called emphasis. Emphasis is a system process designed to decrease, (within a band of frequencies), the magnitude of some (usually higher) frequencies with respect to the magnitude of other (usually lower) frequencies in order to improve the overall signal-to-noise ratio by minimizing the adverse effects of such phenomena as attenuation differences or saturation of recording media in subsequent parts of the system. 1.2 Segment The signal samples are segmented into fixed number of frames and each frame samples are evaluated with hamming window coefficients. The total frames are calculated by, Fn = (Ls – Ns)/ (Ns*Sp) +1 Where, Ls = length of signal Ns = Length of each frame Sp = Shift Percentage Finally the samples of each frames are separated from input signal using Fn and Sp and its scaled by the hamming window coefficients.
  • 4. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 4 | Page Fig.3 Segmented signal 1.3 Filtering After the signal segmentation, the magnitude and phase spectrum from noisy signal are computed by applying fast fourier transform.The magnitude of noisy signal spectrum is further utilized for filtering process and signal phase kept same. The restored signal magnitude spectra is obtained by, Rs = G .* Y Where, G – Log spectral amplitude Gain function Y – Magnitude response of noisy signal The log spectral gain function is defined by, G = x./(1+x) exp(eint (v)) Where, v = x./(1+x) * r x and r– Priori and posteriori signal to noise ratio eint – Exponential integral The posteriori snr is defined by, r = (Y.^2)/ lamda lamda = E[(Y).^2] Where, lamda - Noise power spectrum variance E – Mean value Complex spectrogram obtained by Filtered magnitude spectrum is combined with noisy signal phase spectrum. The restored signal is reconstructed by applying inverse fast fourier transform to this complex spectrogram.The performance of filtering is measured with SNR evaluation and it is defined by, SNR= 10log10 (Msig^2/(sum(in-out)^2/Ls)) Where, Msig = Maximum amplitude of signal in, out= Noisy input signal and restored output. 1.4 Concatenate The signals that were obtained on segmentation are now filtered and the noise content is reduced. These signals are then concatenated to reform the original signal. 1.5 Restore The restored noiseless signal is restored to its original form. 2. MEL FREQUENCY CEPSTRUM COEFFICIENT(MFCC) To increase the robustness in the frame selection process, a robust feature extraction with short time Fourier transform (STFT) domain uncertainty propagation (STFT-UP) was used. Our implementation followed using STFT-UP to compute a minimum mean square error (MMSE) estimate directly in the mel-frequency cepstral coefficient (MFCC) domain. For this particular implementation, amplitude based MFCCs with cepstral mean subtraction were used to attain improved performance. 0 20 40 60 80 100 120 140 160 180 200 -0.06 -0.04 -0.02 0 0.02 0.04 0.06 0 20 40 60 80 100 120 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
  • 5. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 5 | Page Fig. 4 MFCC In speech recognition, the Mel-frequency cepstrum (MFC) is a representation of the shortterm power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear Mel scale of frequency. MFCC = DCT [ LOG [ ABS [ FFT (SPEECH) ]]] Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC. The difference between the cepstrum and the mel-frequency cepstrum is that in the MFC, the frequency bands are equally spaced on the mel scale, which approximates the human auditory system's response more closely than the linearly spaced frequency bands used in the normal cepstrum. This frequency warping can allow for better representation of sound. V. TRAINING AND TESTING The goal of the system training and inventory design stage is twofold: we need to divide the inventory into collections of phonetically similar segments with varying lengths and we need to arrive at a statistical description that tells us which set of collections is most likely to contain the inventory subsection that best matches the underlying clean frame of an incoming noisy frame . The division of the inventory into the collections is performed in a step-by-step fashion. First, is segmented and all silent segments are removed. The non-silent part of the inventory is then divided into sections that each belong to one of 40 phonetic classes. We are applying the feature extraction to the entire segment stream of the inventory. Because the inventory signal is assumed to be virtually undistorted, it is sufficient to only retain the resulting short-time MFCC feature means and to discard the associated variance estimates. The feature means become, thereby, essentially feature vectors in their own right and we can develop a cluster model for them. We have decided to use the means and not the actual cepstral vectors at this stage to ensure that the impact of the mean-extraction-processing is captured in our feature representation. VI. RESULTS Fig. 5 Noisy Signal
  • 6. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 6 | Page Fig. 6 Log MMSE Filtering Fig. 7 MFCC Feature Extraction For experimental basis, speech signals from noisy environments were taken. The result shown is fr a signal in the airport. The noise is cancelled using LogMMSE Filtering and feature extraction is done by MFCC. VI. CONCLUSION In this paper speech recognition is done and appliances are controlled using speech commands. The commands are given by the user in the form of speech. These signals are filtered using LogMMSE filtering due to which environmental noise is nullified. Further as a part of feature extraction, MFCC is used. A database is created where all the commands are saved. These commands are then given as input to the PSoC5 kit where the PSoC is programmed to give the desired result. According to the PSoC, the robot connected to the PSoC moves in different directions as specified. REFERENCES [1]. V. Naresh, B. Venkataramani, Abhishek Karan and J. Manikandan, " PSoCbased isolated speech recognition system, " International conference on Communication and Signal Processing, April 3-5, 2013. [2]. R Namba, K Kobayashi, T Ohkubo and Y.Kurihara, "Development of PSoC microcontroller based solar energy storage system," Proceedings of SICE Annual Conference (SICE), 2011, pp.718-721, 2011. [3]. J. Manikandan, B. Venkataramani, K. Girish, H. Karthic, V. Siddharth, "Hardware Implementation of Real-Time Speech Recognition System Using TMS320C6713 DSP",24th International Conference onVLSI Design (VLSI Design),pp.250-255, 2011 [4]. Jingchuan Wang and WeidongChen , "Integration of PSoC technology with educational robotics", International Conference on Field-Programmable Technology (FPT), 2010, pp.332-336, 2010.
  • 7. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 7 | Page [5]. Cheng-Yuan Chang , Ching-Fa Chen , Shing-Tai Pan , Xu-Yu Li, " The speech recognition chip implementation on FPGA “, Mechanical and Electronics Engineering (ICMEE), 2nd International Conference,2010. [6]. V. Amudha, B. Venkataramani, J. Manikandan, "FPGA implementation of isolated digit recognition system using modified back propagation algorithm,"International Conference on Electronic Design ICED 2008, pp.1-6. [7]. V.Amudha, B.Venkataramani, R.Vinoth Kumar and S. Ravishankar,“SOC Implementation of HMM Based Speaker Independent Isolated Digit Recognition System”, in Proc. of IEEE Int. Conf. on VLSIDesign VLSI’07, 2007, pp.848-853. [8]. Trihandyo, A. Belloum, A. Kun-Mean Hou, ″A real-time speech recognition architecture for a multi- channel interactive voice response system″, Iternational Conference on Acoustics, Speech, and Signal ProcessingICASSP-97, vol.2, pp.1527-1530,1997 [9]. Mike Wald, Using Automatic Speech Recognition to Enhance Education for All Students: Turning a Vision into Reality [A].In Proceedings of 34th ASEE/IEEE Frontiers in Education Conference S3G, Indianapolis, Indiana, 2005, pp 22-25. [10]. N. Hataoka, H. Kokubo, Y. Obuchi, and A. Amano, "Compact and robust speech recognition for embedded use on microprocessors," IEEE Workshop on Multimedia Signal Processing, pp. 288-291, 9- 11 Dec. 2002.