SlideShare a Scribd company logo
International Refereed Journal of Engineering and Science (IRJES)
ISSN (Online) 2319-183X, (Print) 2319-1821
Volume 4, Issue 3 (March 2015), PP.01-07
www.irjes.com 1 | Page
PSoC BASED SPEECH RECOGNITION SYSTEM
1
Shubhasini Sugumaran, 2
Mr.V.R.Prakash
1
Department of Electronics and Communication Hindustan University Chennai, India
2
Department of Electronics and Communication Hindustan University Chennai,India
Abstract:– Speech Recognition Systems(SRS) have been implemented by various processors including the
digital signal processors(DSPs) and field programmable gate arrays(FPGAs) and their performance has been
reported in literature. The fundamental purpose of speech is communication, i.e., the transmission of
messages.In the case of speech, the fundamental analog form of the message is an acoustic waveform, which we
call the speech signal. Speech signals can be converted to an electrical waveform by a microphone, further
manipulated by both analog and digital signal processing, and then converted back to acoustic form by a
loudspeaker, a telephone handset or headphone, as desired.The recognition of speech requires feature extraction
and classification. The systems that use speech as input require a microcontroller to carry out the desired
actions. In this paper, Cypress Programmable System on Chip (PSoC) has been studied and used for
implementation of SRS. From all the available PSoCs, PSoC5 containing ARM Cortex-M3 as its CPU is used.
The noise signals are firstly nullified from the speech signals using LogMMSE filtering. These signals are then
sent to the PSoC5 wherein the speech is recognized and desired actions are performed.
Keywords: PSoC, LogMMSE, Speech Recognition
I. INTRODUCTION
The basic idea of speech is the transmission of messages. A message is represented as a sequence of
discrete symbols that quantifies its information in bits and the rate at which information is transmitted as bits per
second (bps). Speech recognition techniques have seemed to be more efficient and convenient for human-
machine interaction. The speech recognition systems with fixed vocabulary were deployed in many
applications[8] and [9]. Speech recognition systems for voice operated application have been implemented using
various hardware platforms such as the DSPs [3], FPGAs [5] and microprocessors [10]. In speech production,
the information to be transmitted is encoded in the form of a continuously varying analog waveform that can be
transmitted, recorded, manipulated, and ultimately decoded by a human listener. This analog signal is the speech
signal. These signals tend to be corrupted by noise in the real world. If the noise can be estimated from the noise
source, this estimated noise can then be subtracted from the primary channel resulting in the desired signal. This
task is usually done by linear filtering. In real time situations, the corrupting noise is a nonlinear distortion
version of the source noise, so a nonlinear filter should be a better choice. To reduce the influence of noise in the
speech, speech enhancement is done. The recognition algorithm without the use of enhancement algorithms
proved to be less efficient.
Programmable System on Chip (PSoC) have and are being employed in a number of applications. They
are cost effective due to which they have limited storage and computational power. In context to this, the
recognition accuracy becomes important for PSoC and is addressed in the paper.
II. PROGRAMMABLE SYSTEM ON CHIP (PSOC)
Programmable System on Chip(PSoC) has been designed and implemented by Cypress semiconductors
[2] and [4]. Every PSoC contains a microcontroller, programmable analog blocks such as ADC, DAC, I/O
drivers and digital blocks such as Universal Digital Blocks (UDBs), CAN, I2C, PWM in a single chip.
Embedded Development kits from Cypress contain one of the three PSoCs – PsoC1, PSoC3 and PSoC5. The
processing performance, functionality, internal memories and configurability of the PSoC increases from
PSoC1through PSoc5.
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 2 | Page
Table 1: Comparison of PSoCs
PSoC5 uses PSoC Creator for its development and implementation. PSoC Creator is a visual
development tool and Integrated Development Environment for PSoC. It has a rich library of prebuilt
components and a schematic design entry tool. It combines C based development flow with an automatically
generated Application programmable interface (API). API rduces the errors in code and ensures proper
interfacing with the peripheral devices which enables the software development to be faster, easier and less
prone to errors. The PSoC Creator also has powerful, modern debugger, which is built in the IDE. It is used to
display the values after execution at each point.
III. SPEECH RECOGNITION SYSTEM
The block diagram is as shown in Fig.1 below.
Speech Recognition System
The speech is given as input through a microphone. The given speech might contain external
disturbances called noise. This noise is cancelled using LogMMSE algorithm. The speech signal after noise
cancellation is amplified and filtered to remove further disturbances in the signal. The speech signal in analog
form is then converted into digital signal using inbuilt analog to digital converter of PSoC. The feature
extraction and pattern recognition is done using MFCC algorithm. The analog to digital conversion is done
using delta sigma ADC present in PSoC5.
The sampling rate of ADC can be adjusted depending on the speech signal as required. The delta sigma
ADC contains three blocks – an amplifier, a modulator and a decimator. The decimator is a four stage CIC
decimation filter. It also contains a post processing unit. Continuous mode of the ADC is being used for
conversion. The output of the speech recognition system is shown through the movement of a robot in different
directions.
FEATURES PSoC1 PSoC3 PSoC5
CPU 8-bit M8C
core
8-bit 8051 32-bit ARM
Cortex
Flash 4 to 32kB 8 to 64kB 32 to 256kB
Interface I²C, SPI,
UART,
FS USB 2.0
I²C, SPI,
UART,
LIN,
FS USB
2.0, I²S,
CAN
I²C, SPI,
UART, LIN,
FS USB 2.0,
I²S,CAN
ADCs 1 delta-
sigma
1 Delta-
Sigma
1 Delta-Sigma,
2SARs
DACs 2(6 bit) 4(8 bit) 4(8 bit)
I/Os 64 72 72
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 3 | Page
IV. TECHNIQUES USED
The performance of the SRS degrades when implemented in real world environment. This degradation
is due to acoustic model mismatch. The acoustic model mismatch describes the difference between the
environment in which the SRS is tested and the actual environment in which it is deployed. This can include
echoes, background noise, speaker variability and transmission effects.
1. Log MMSE Filtering
Fig.2 Noise reduction using Log MMSE
1.1 Preprocessing
In preprocessing of audio signals start with pre-emphasis refers to a system process designed to
increase the magnitude of some frequencies with respect to the magnitude of other frequencies in order to
improve the overall signal-to-noise ratio by minimizing the adverse effects of such phenomena as attenuation
distortion or saturation of recording media in subsequent parts of the system. The mirror operation is called de-
emphasis, and the system as a whole is called emphasis.
Pre-emphasis is achieved with a pre-emphasis network which is essentially a calibrated filter. This
network composed of two resistors and one capacitor. The frequency response is decided by special time
constants. The cutoff frequency can be calculated from that value. Pre-emphasis is commonly used
in telecommunications, digital audio recording, record cutting, in FM broadcasting transmissions, and in
displaying the spectrograms of speech signals.
De-emphasis is the complement of pre-emphasis, in the anti noise system called emphasis. Emphasis is
a system process designed to decrease, (within a band of frequencies), the magnitude of some (usually higher)
frequencies with respect to the magnitude of other (usually lower) frequencies in order to improve the overall
signal-to-noise ratio by minimizing the adverse effects of such phenomena as attenuation differences or
saturation of recording media in subsequent parts of the system.
1.2 Segment
The signal samples are segmented into fixed number of frames and each frame samples are evaluated with
hamming window coefficients.
The total frames are calculated by,
Fn = (Ls – Ns)/ (Ns*Sp) +1
Where,
Ls = length of signal
Ns = Length of each frame
Sp = Shift Percentage
Finally the samples of each frames are separated from input signal using Fn and Sp and its scaled by the
hamming window coefficients.
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 4 | Page
Fig.3 Segmented signal
1.3 Filtering
After the signal segmentation, the magnitude and phase spectrum from noisy signal are computed by
applying fast fourier transform.The magnitude of noisy signal spectrum is further utilized for filtering process
and signal phase kept same.
The restored signal magnitude spectra is obtained by,
Rs = G .* Y
Where, G – Log spectral amplitude Gain function
Y – Magnitude response of noisy signal
The log spectral gain function is defined by,
G = x./(1+x) exp(eint (v))
Where, v = x./(1+x) * r
x and r– Priori and posteriori signal to noise ratio
eint – Exponential integral
The posteriori snr is defined by,
r = (Y.^2)/ lamda
lamda = E[(Y).^2]
Where, lamda - Noise power spectrum variance
E – Mean value
Complex spectrogram obtained by Filtered magnitude spectrum is combined with noisy signal phase spectrum.
The restored signal is reconstructed by applying inverse fast fourier transform to this complex spectrogram.The
performance of filtering is measured with SNR evaluation and it is defined by,
SNR= 10log10 (Msig^2/(sum(in-out)^2/Ls))
Where, Msig = Maximum amplitude of signal
in, out= Noisy input signal and restored output.
1.4 Concatenate
The signals that were obtained on segmentation are now filtered and the noise content is reduced. These signals
are then concatenated to reform the original signal.
1.5 Restore
The restored noiseless signal is restored to its original form.
2. MEL FREQUENCY CEPSTRUM COEFFICIENT(MFCC)
To increase the robustness in the frame selection process, a robust feature extraction with short time
Fourier transform (STFT) domain uncertainty propagation (STFT-UP) was used. Our implementation followed
using STFT-UP to compute a minimum mean square error (MMSE) estimate directly in the mel-frequency
cepstral coefficient (MFCC) domain. For this particular implementation, amplitude based MFCCs with cepstral
mean subtraction were used to attain improved performance.
0 20 40 60 80 100 120 140 160 180 200
-0.06
-0.04
-0.02
0
0.02
0.04
0.06
0 20 40 60 80 100 120
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 5 | Page
Fig. 4 MFCC
In speech recognition, the Mel-frequency cepstrum (MFC) is a representation of the shortterm power
spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear Mel scale of
frequency.
MFCC = DCT [ LOG [ ABS [ FFT (SPEECH) ]]]
Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC. The difference
between the cepstrum and the mel-frequency cepstrum is that in the MFC, the frequency bands are equally
spaced on the mel scale, which approximates the human auditory system's response more closely than the
linearly spaced frequency bands used in the normal cepstrum. This frequency warping can allow for better
representation of sound.
V. TRAINING AND TESTING
The goal of the system training and inventory design stage is twofold: we need to divide the inventory
into collections of phonetically similar segments with varying lengths and we need to arrive at a statistical
description that tells us which set of collections is most likely to contain the inventory subsection that best
matches the underlying clean frame of an incoming noisy frame . The division of the inventory into the
collections is performed in a step-by-step fashion. First, is segmented and all silent segments are removed. The
non-silent part of the inventory is then divided into sections that each belong to one of 40 phonetic classes.
We are applying the feature extraction to the entire segment stream of the inventory. Because the inventory
signal is assumed to be virtually undistorted, it is sufficient to only retain the resulting short-time MFCC feature
means and to discard the associated variance estimates. The feature means become, thereby, essentially feature
vectors in their own right and we can develop a cluster model for them. We have decided to use the means and
not the actual cepstral vectors at this stage to ensure that the impact of the mean-extraction-processing is
captured in our feature representation.
VI. RESULTS
Fig. 5 Noisy Signal
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 6 | Page
Fig. 6 Log MMSE Filtering
Fig. 7 MFCC Feature Extraction
For experimental basis, speech signals from noisy environments were taken. The result shown is fr a
signal in the airport. The noise is cancelled using LogMMSE Filtering and feature extraction is done by MFCC.
VI. CONCLUSION
In this paper speech recognition is done and appliances are controlled using speech commands. The
commands are given by the user in the form of speech. These signals are filtered using LogMMSE filtering due
to which environmental noise is nullified. Further as a part of feature extraction, MFCC is used. A database is
created where all the commands are saved. These commands are then given as input to the PSoC5 kit where the
PSoC is programmed to give the desired result. According to the PSoC, the robot connected to the PSoC moves
in different directions as specified.
REFERENCES
[1]. V. Naresh, B. Venkataramani, Abhishek Karan and J. Manikandan, " PSoCbased isolated speech
recognition system, " International conference on Communication and Signal Processing, April 3-5,
2013.
[2]. R Namba, K Kobayashi, T Ohkubo and Y.Kurihara, "Development of PSoC microcontroller based
solar energy storage system," Proceedings of SICE Annual Conference (SICE), 2011, pp.718-721,
2011.
[3]. J. Manikandan, B. Venkataramani, K. Girish, H. Karthic, V. Siddharth, "Hardware Implementation of
Real-Time Speech Recognition System Using TMS320C6713 DSP",24th International Conference
onVLSI Design (VLSI Design),pp.250-255, 2011
[4]. Jingchuan Wang and WeidongChen , "Integration of PSoC technology with educational robotics",
International Conference on Field-Programmable Technology (FPT), 2010, pp.332-336, 2010.
PSoC BASED SPEECH RECOGNITION SYSTEM
www.irjes.com 7 | Page
[5]. Cheng-Yuan Chang , Ching-Fa Chen , Shing-Tai Pan , Xu-Yu Li, " The speech recognition chip
implementation on FPGA “, Mechanical and Electronics Engineering (ICMEE), 2nd International
Conference,2010.
[6]. V. Amudha, B. Venkataramani, J. Manikandan, "FPGA implementation of isolated digit recognition
system using modified back propagation algorithm,"International Conference on Electronic Design
ICED 2008, pp.1-6.
[7]. V.Amudha, B.Venkataramani, R.Vinoth Kumar and S. Ravishankar,“SOC Implementation of HMM
Based Speaker Independent Isolated Digit Recognition System”, in Proc. of IEEE Int. Conf. on
VLSIDesign VLSI’07, 2007, pp.848-853.
[8]. Trihandyo, A. Belloum, A. Kun-Mean Hou, ″A real-time speech recognition architecture for a multi-
channel interactive voice response system″, Iternational Conference on Acoustics, Speech, and Signal
ProcessingICASSP-97, vol.2, pp.1527-1530,1997
[9]. Mike Wald, Using Automatic Speech Recognition to Enhance Education for All Students: Turning a
Vision into Reality [A].In Proceedings of 34th ASEE/IEEE Frontiers in Education Conference S3G,
Indianapolis, Indiana, 2005, pp 22-25.
[10]. N. Hataoka, H. Kokubo, Y. Obuchi, and A. Amano, "Compact and robust speech recognition for
embedded use on microprocessors," IEEE Workshop on Multimedia Signal Processing, pp. 288-291, 9-
11 Dec. 2002.

More Related Content

What's hot

Introductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal ProcessingIntroductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal Processing
Angelo Salatino
 
Lecture 8 audio compression
Lecture 8 audio compressionLecture 8 audio compression
Lecture 8 audio compressionMr SMAK
 
Introduction to dsp
Introduction to dspIntroduction to dsp
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPCDisha Modi
 
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IDSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
Amr E. Mohamed
 
Introduction to DSP
Introduction to DSPIntroduction to DSP
Introduction to DSP
shinychristo
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compressionneeraj9217
 
Introduction to Digital Signal Processing
Introduction to Digital Signal ProcessingIntroduction to Digital Signal Processing
Introduction to Digital Signal Processing
op205
 
Convolution
ConvolutionConvolution
Convolution
sridharbommu
 
Audio compression
Audio compression Audio compression
Audio compression
Darshan IT
 
Lecture 18 (5)
Lecture 18 (5)Lecture 18 (5)
Lecture 18 (5)
Deepakkumar5880
 
Speech compression-using-gsm
Speech compression-using-gsmSpeech compression-using-gsm
Speech compression-using-gsm
Nghĩa Trần Trung
 
Digital signal processor part4
Digital signal processor part4Digital signal processor part4
Digital signal processor part4
Vaagdevi College of Engineering
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2
Hossam Hassan
 
Filtration and synthesis of different types of human voice signals
Filtration and synthesis of different types of human voice signalsFiltration and synthesis of different types of human voice signals
Filtration and synthesis of different types of human voice signals
Alexander Decker
 
Digital signal processing part1
Digital signal processing part1Digital signal processing part1
Digital signal processing part1
Vaagdevi College of Engineering
 
Discrete-Time Signal Processing
Discrete-Time Signal ProcessingDiscrete-Time Signal Processing
Discrete-Time Signal Processing
lancer350
 

What's hot (19)

Introductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal ProcessingIntroductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal Processing
 
Lecture 8 audio compression
Lecture 8 audio compressionLecture 8 audio compression
Lecture 8 audio compression
 
DSP Processor
DSP Processor DSP Processor
DSP Processor
 
Introduction to dsp
Introduction to dspIntroduction to dsp
Introduction to dsp
 
Speech Compression using LPC
Speech Compression using LPCSpeech Compression using LPC
Speech Compression using LPC
 
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications IDSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
 
Introduction to DSP
Introduction to DSPIntroduction to DSP
Introduction to DSP
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
 
Introduction to Digital Signal Processing
Introduction to Digital Signal ProcessingIntroduction to Digital Signal Processing
Introduction to Digital Signal Processing
 
Convolution
ConvolutionConvolution
Convolution
 
Audio compression
Audio compression Audio compression
Audio compression
 
Lecture 18 (5)
Lecture 18 (5)Lecture 18 (5)
Lecture 18 (5)
 
Speech compression-using-gsm
Speech compression-using-gsmSpeech compression-using-gsm
Speech compression-using-gsm
 
Subband Coding
Subband CodingSubband Coding
Subband Coding
 
Digital signal processor part4
Digital signal processor part4Digital signal processor part4
Digital signal processor part4
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2
 
Filtration and synthesis of different types of human voice signals
Filtration and synthesis of different types of human voice signalsFiltration and synthesis of different types of human voice signals
Filtration and synthesis of different types of human voice signals
 
Digital signal processing part1
Digital signal processing part1Digital signal processing part1
Digital signal processing part1
 
Discrete-Time Signal Processing
Discrete-Time Signal ProcessingDiscrete-Time Signal Processing
Discrete-Time Signal Processing
 

Viewers also liked

Swagger 2.0
Swagger 2.0Swagger 2.0
Swagger 2.0
Roman Krivtsov
 
เรียนเก่งไม่ต้องรอพ่อรวย
เรียนเก่งไม่ต้องรอพ่อรวยเรียนเก่งไม่ต้องรอพ่อรวย
เรียนเก่งไม่ต้องรอพ่อรวย
อลงกรณ์ อารามกูล
 
Computer integrated manufacturing
Computer integrated manufacturingComputer integrated manufacturing
Computer integrated manufacturing
Ashish Desai
 
Estrategias de instrucción diferenciada copy
Estrategias de instrucción diferenciada   copyEstrategias de instrucción diferenciada   copy
Estrategias de instrucción diferenciada copy
Sylvia Cuadra, Ed.D.
 
El primer negocio en la luna
El primer negocio en la lunaEl primer negocio en la luna
El primer negocio en la luna
Ignacio López Cózar
 
Ali Aziz Cv.docx modified
Ali Aziz Cv.docx modifiedAli Aziz Cv.docx modified
Ali Aziz Cv.docx modifiedAli Isa
 
Act 1 formulacion de proyectos
Act 1 formulacion de proyectosAct 1 formulacion de proyectos
Act 1 formulacion de proyectos
Maria Gomez
 

Viewers also liked (7)

Swagger 2.0
Swagger 2.0Swagger 2.0
Swagger 2.0
 
เรียนเก่งไม่ต้องรอพ่อรวย
เรียนเก่งไม่ต้องรอพ่อรวยเรียนเก่งไม่ต้องรอพ่อรวย
เรียนเก่งไม่ต้องรอพ่อรวย
 
Computer integrated manufacturing
Computer integrated manufacturingComputer integrated manufacturing
Computer integrated manufacturing
 
Estrategias de instrucción diferenciada copy
Estrategias de instrucción diferenciada   copyEstrategias de instrucción diferenciada   copy
Estrategias de instrucción diferenciada copy
 
El primer negocio en la luna
El primer negocio en la lunaEl primer negocio en la luna
El primer negocio en la luna
 
Ali Aziz Cv.docx modified
Ali Aziz Cv.docx modifiedAli Aziz Cv.docx modified
Ali Aziz Cv.docx modified
 
Act 1 formulacion de proyectos
Act 1 formulacion de proyectosAct 1 formulacion de proyectos
Act 1 formulacion de proyectos
 

Similar to PSoC BASED SPEECH RECOGNITION SYSTEM

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
IJCSEA Journal
 
An Evaluation Of Lms Based Adaptive Filtering
An Evaluation Of Lms Based Adaptive FilteringAn Evaluation Of Lms Based Adaptive Filtering
An Evaluation Of Lms Based Adaptive Filtering
Renee Wardowski
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
Pradeep Reddy Guvvala
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
eSAT Journals
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Speech compression analysis using matlab
Speech compression analysis using matlabSpeech compression analysis using matlab
Speech compression analysis using matlab
eSAT Journals
 
Speech compression analysis using matlab
Speech compression analysis using matlabSpeech compression analysis using matlab
Speech compression analysis using matlab
eSAT Publishing House
 
Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approach
ijsrd.com
 
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
IJET - International Journal of Engineering and Techniques
 
Low power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoderLow power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoder
eSAT Publishing House
 
Lab based report
Lab based reportLab based report
Lab based report
Bhavna Singh
 
Discrete time signal processing unit-2
Discrete time signal processing unit-2Discrete time signal processing unit-2
Discrete time signal processing unit-2
selvalakshmi24
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
IRJET Journal
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
IRJET Journal
 
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
ijcsa
 
G010424248
G010424248G010424248
G010424248
IOSR Journals
 
Introduction to dsp by bibhu prasad ganthia
Introduction to dsp by bibhu prasad ganthiaIntroduction to dsp by bibhu prasad ganthia
Introduction to dsp by bibhu prasad ganthia
Dr. Bibhu Prasad Ganthia
 
F5242832
F5242832F5242832
F5242832
IOSR-JEN
 
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNALGENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
IJCSEIT Journal
 

Similar to PSoC BASED SPEECH RECOGNITION SYSTEM (20)

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
 
An Evaluation Of Lms Based Adaptive Filtering
An Evaluation Of Lms Based Adaptive FilteringAn Evaluation Of Lms Based Adaptive Filtering
An Evaluation Of Lms Based Adaptive Filtering
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Speech compression analysis using matlab
Speech compression analysis using matlabSpeech compression analysis using matlab
Speech compression analysis using matlab
 
Speech compression analysis using matlab
Speech compression analysis using matlabSpeech compression analysis using matlab
Speech compression analysis using matlab
 
Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approach
 
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
 
Low power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoderLow power fpga solution for dab audio decoder
Low power fpga solution for dab audio decoder
 
Lab based report
Lab based reportLab based report
Lab based report
 
Discrete time signal processing unit-2
Discrete time signal processing unit-2Discrete time signal processing unit-2
Discrete time signal processing unit-2
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
 
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
REAL TIME SPECIAL EFFECTS GENERATION AND NOISE FILTRATION OF AUDIO SIGNAL USI...
 
G010424248
G010424248G010424248
G010424248
 
H0814247
H0814247H0814247
H0814247
 
Introduction to dsp by bibhu prasad ganthia
Introduction to dsp by bibhu prasad ganthiaIntroduction to dsp by bibhu prasad ganthia
Introduction to dsp by bibhu prasad ganthia
 
F5242832
F5242832F5242832
F5242832
 
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNALGENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL
 

More from irjes

Automatic Safety Door Lock System for Car
Automatic Safety Door Lock System for CarAutomatic Safety Door Lock System for Car
Automatic Safety Door Lock System for Car
irjes
 
Squared Multi-hole Extrusion Process: Experimentation & Optimization
Squared Multi-hole Extrusion Process: Experimentation & OptimizationSquared Multi-hole Extrusion Process: Experimentation & Optimization
Squared Multi-hole Extrusion Process: Experimentation & Optimization
irjes
 
Analysis of Agile and Multi-Agent Based Process Scheduling Model
Analysis of Agile and Multi-Agent Based Process Scheduling ModelAnalysis of Agile and Multi-Agent Based Process Scheduling Model
Analysis of Agile and Multi-Agent Based Process Scheduling Model
irjes
 
Effects of Cutting Tool Parameters on Surface Roughness
Effects of Cutting Tool Parameters on Surface RoughnessEffects of Cutting Tool Parameters on Surface Roughness
Effects of Cutting Tool Parameters on Surface Roughness
irjes
 
Possible limits of accuracy in measurement of fundamental physical constants
Possible limits of accuracy in measurement of fundamental physical constantsPossible limits of accuracy in measurement of fundamental physical constants
Possible limits of accuracy in measurement of fundamental physical constants
irjes
 
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
irjes
 
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
irjes
 
Flip bifurcation and chaos control in discrete-time Prey-predator model
Flip bifurcation and chaos control in discrete-time Prey-predator model Flip bifurcation and chaos control in discrete-time Prey-predator model
Flip bifurcation and chaos control in discrete-time Prey-predator model
irjes
 
Energy Awareness and the Role of “Critical Mass” In Smart Cities
Energy Awareness and the Role of “Critical Mass” In Smart CitiesEnergy Awareness and the Role of “Critical Mass” In Smart Cities
Energy Awareness and the Role of “Critical Mass” In Smart Cities
irjes
 
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
irjes
 
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
irjes
 
An Assessment of The Relationship Between The Availability of Financial Resou...
An Assessment of The Relationship Between The Availability of Financial Resou...An Assessment of The Relationship Between The Availability of Financial Resou...
An Assessment of The Relationship Between The Availability of Financial Resou...
irjes
 
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
irjes
 
Prediction of the daily global solar irradiance received on a horizontal surf...
Prediction of the daily global solar irradiance received on a horizontal surf...Prediction of the daily global solar irradiance received on a horizontal surf...
Prediction of the daily global solar irradiance received on a horizontal surf...
irjes
 
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
irjes
 
The Role of Community Participation in Planning Processes of Emerging Urban C...
The Role of Community Participation in Planning Processes of Emerging Urban C...The Role of Community Participation in Planning Processes of Emerging Urban C...
The Role of Community Participation in Planning Processes of Emerging Urban C...
irjes
 
Understanding the Concept of Strategic Intent
Understanding the Concept of Strategic IntentUnderstanding the Concept of Strategic Intent
Understanding the Concept of Strategic Intent
irjes
 
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
irjes
 
Relation Between Stress And Menstrual Cycle At 18-21 Years Of Age
Relation Between Stress And Menstrual Cycle At 18-21 Years Of AgeRelation Between Stress And Menstrual Cycle At 18-21 Years Of Age
Relation Between Stress And Menstrual Cycle At 18-21 Years Of Age
irjes
 
Wave Transmission on Submerged Breakwater with Interlocking D-Block Armor
Wave Transmission on Submerged Breakwater with Interlocking D-Block ArmorWave Transmission on Submerged Breakwater with Interlocking D-Block Armor
Wave Transmission on Submerged Breakwater with Interlocking D-Block Armor
irjes
 

More from irjes (20)

Automatic Safety Door Lock System for Car
Automatic Safety Door Lock System for CarAutomatic Safety Door Lock System for Car
Automatic Safety Door Lock System for Car
 
Squared Multi-hole Extrusion Process: Experimentation & Optimization
Squared Multi-hole Extrusion Process: Experimentation & OptimizationSquared Multi-hole Extrusion Process: Experimentation & Optimization
Squared Multi-hole Extrusion Process: Experimentation & Optimization
 
Analysis of Agile and Multi-Agent Based Process Scheduling Model
Analysis of Agile and Multi-Agent Based Process Scheduling ModelAnalysis of Agile and Multi-Agent Based Process Scheduling Model
Analysis of Agile and Multi-Agent Based Process Scheduling Model
 
Effects of Cutting Tool Parameters on Surface Roughness
Effects of Cutting Tool Parameters on Surface RoughnessEffects of Cutting Tool Parameters on Surface Roughness
Effects of Cutting Tool Parameters on Surface Roughness
 
Possible limits of accuracy in measurement of fundamental physical constants
Possible limits of accuracy in measurement of fundamental physical constantsPossible limits of accuracy in measurement of fundamental physical constants
Possible limits of accuracy in measurement of fundamental physical constants
 
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
Performance Comparison of Energy Detection Based Spectrum Sensing for Cogniti...
 
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
Comparative Study of Pre-Engineered and Conventional Steel Frames for Differe...
 
Flip bifurcation and chaos control in discrete-time Prey-predator model
Flip bifurcation and chaos control in discrete-time Prey-predator model Flip bifurcation and chaos control in discrete-time Prey-predator model
Flip bifurcation and chaos control in discrete-time Prey-predator model
 
Energy Awareness and the Role of “Critical Mass” In Smart Cities
Energy Awareness and the Role of “Critical Mass” In Smart CitiesEnergy Awareness and the Role of “Critical Mass” In Smart Cities
Energy Awareness and the Role of “Critical Mass” In Smart Cities
 
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
A Firefly Algorithm for Optimizing Spur Gear Parameters Under Non-Lubricated ...
 
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
The Effect of Orientation of Vortex Generators on Aerodynamic Drag Reduction ...
 
An Assessment of The Relationship Between The Availability of Financial Resou...
An Assessment of The Relationship Between The Availability of Financial Resou...An Assessment of The Relationship Between The Availability of Financial Resou...
An Assessment of The Relationship Between The Availability of Financial Resou...
 
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
The Choice of Antenatal Care and Delivery Place in Surabaya (Based on Prefere...
 
Prediction of the daily global solar irradiance received on a horizontal surf...
Prediction of the daily global solar irradiance received on a horizontal surf...Prediction of the daily global solar irradiance received on a horizontal surf...
Prediction of the daily global solar irradiance received on a horizontal surf...
 
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
HARMONIC ANALYSIS ASSOCIATED WITH A GENERALIZED BESSEL-STRUVE OPERATOR ON THE...
 
The Role of Community Participation in Planning Processes of Emerging Urban C...
The Role of Community Participation in Planning Processes of Emerging Urban C...The Role of Community Participation in Planning Processes of Emerging Urban C...
The Role of Community Participation in Planning Processes of Emerging Urban C...
 
Understanding the Concept of Strategic Intent
Understanding the Concept of Strategic IntentUnderstanding the Concept of Strategic Intent
Understanding the Concept of Strategic Intent
 
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
The (R, Q) Control of A Mixture Inventory Model with Backorders and Lost Sale...
 
Relation Between Stress And Menstrual Cycle At 18-21 Years Of Age
Relation Between Stress And Menstrual Cycle At 18-21 Years Of AgeRelation Between Stress And Menstrual Cycle At 18-21 Years Of Age
Relation Between Stress And Menstrual Cycle At 18-21 Years Of Age
 
Wave Transmission on Submerged Breakwater with Interlocking D-Block Armor
Wave Transmission on Submerged Breakwater with Interlocking D-Block ArmorWave Transmission on Submerged Breakwater with Interlocking D-Block Armor
Wave Transmission on Submerged Breakwater with Interlocking D-Block Armor
 

Recently uploaded

FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 

PSoC BASED SPEECH RECOGNITION SYSTEM

  • 1. International Refereed Journal of Engineering and Science (IRJES) ISSN (Online) 2319-183X, (Print) 2319-1821 Volume 4, Issue 3 (March 2015), PP.01-07 www.irjes.com 1 | Page PSoC BASED SPEECH RECOGNITION SYSTEM 1 Shubhasini Sugumaran, 2 Mr.V.R.Prakash 1 Department of Electronics and Communication Hindustan University Chennai, India 2 Department of Electronics and Communication Hindustan University Chennai,India Abstract:– Speech Recognition Systems(SRS) have been implemented by various processors including the digital signal processors(DSPs) and field programmable gate arrays(FPGAs) and their performance has been reported in literature. The fundamental purpose of speech is communication, i.e., the transmission of messages.In the case of speech, the fundamental analog form of the message is an acoustic waveform, which we call the speech signal. Speech signals can be converted to an electrical waveform by a microphone, further manipulated by both analog and digital signal processing, and then converted back to acoustic form by a loudspeaker, a telephone handset or headphone, as desired.The recognition of speech requires feature extraction and classification. The systems that use speech as input require a microcontroller to carry out the desired actions. In this paper, Cypress Programmable System on Chip (PSoC) has been studied and used for implementation of SRS. From all the available PSoCs, PSoC5 containing ARM Cortex-M3 as its CPU is used. The noise signals are firstly nullified from the speech signals using LogMMSE filtering. These signals are then sent to the PSoC5 wherein the speech is recognized and desired actions are performed. Keywords: PSoC, LogMMSE, Speech Recognition I. INTRODUCTION The basic idea of speech is the transmission of messages. A message is represented as a sequence of discrete symbols that quantifies its information in bits and the rate at which information is transmitted as bits per second (bps). Speech recognition techniques have seemed to be more efficient and convenient for human- machine interaction. The speech recognition systems with fixed vocabulary were deployed in many applications[8] and [9]. Speech recognition systems for voice operated application have been implemented using various hardware platforms such as the DSPs [3], FPGAs [5] and microprocessors [10]. In speech production, the information to be transmitted is encoded in the form of a continuously varying analog waveform that can be transmitted, recorded, manipulated, and ultimately decoded by a human listener. This analog signal is the speech signal. These signals tend to be corrupted by noise in the real world. If the noise can be estimated from the noise source, this estimated noise can then be subtracted from the primary channel resulting in the desired signal. This task is usually done by linear filtering. In real time situations, the corrupting noise is a nonlinear distortion version of the source noise, so a nonlinear filter should be a better choice. To reduce the influence of noise in the speech, speech enhancement is done. The recognition algorithm without the use of enhancement algorithms proved to be less efficient. Programmable System on Chip (PSoC) have and are being employed in a number of applications. They are cost effective due to which they have limited storage and computational power. In context to this, the recognition accuracy becomes important for PSoC and is addressed in the paper. II. PROGRAMMABLE SYSTEM ON CHIP (PSOC) Programmable System on Chip(PSoC) has been designed and implemented by Cypress semiconductors [2] and [4]. Every PSoC contains a microcontroller, programmable analog blocks such as ADC, DAC, I/O drivers and digital blocks such as Universal Digital Blocks (UDBs), CAN, I2C, PWM in a single chip. Embedded Development kits from Cypress contain one of the three PSoCs – PsoC1, PSoC3 and PSoC5. The processing performance, functionality, internal memories and configurability of the PSoC increases from PSoC1through PSoc5.
  • 2. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 2 | Page Table 1: Comparison of PSoCs PSoC5 uses PSoC Creator for its development and implementation. PSoC Creator is a visual development tool and Integrated Development Environment for PSoC. It has a rich library of prebuilt components and a schematic design entry tool. It combines C based development flow with an automatically generated Application programmable interface (API). API rduces the errors in code and ensures proper interfacing with the peripheral devices which enables the software development to be faster, easier and less prone to errors. The PSoC Creator also has powerful, modern debugger, which is built in the IDE. It is used to display the values after execution at each point. III. SPEECH RECOGNITION SYSTEM The block diagram is as shown in Fig.1 below. Speech Recognition System The speech is given as input through a microphone. The given speech might contain external disturbances called noise. This noise is cancelled using LogMMSE algorithm. The speech signal after noise cancellation is amplified and filtered to remove further disturbances in the signal. The speech signal in analog form is then converted into digital signal using inbuilt analog to digital converter of PSoC. The feature extraction and pattern recognition is done using MFCC algorithm. The analog to digital conversion is done using delta sigma ADC present in PSoC5. The sampling rate of ADC can be adjusted depending on the speech signal as required. The delta sigma ADC contains three blocks – an amplifier, a modulator and a decimator. The decimator is a four stage CIC decimation filter. It also contains a post processing unit. Continuous mode of the ADC is being used for conversion. The output of the speech recognition system is shown through the movement of a robot in different directions. FEATURES PSoC1 PSoC3 PSoC5 CPU 8-bit M8C core 8-bit 8051 32-bit ARM Cortex Flash 4 to 32kB 8 to 64kB 32 to 256kB Interface I²C, SPI, UART, FS USB 2.0 I²C, SPI, UART, LIN, FS USB 2.0, I²S, CAN I²C, SPI, UART, LIN, FS USB 2.0, I²S,CAN ADCs 1 delta- sigma 1 Delta- Sigma 1 Delta-Sigma, 2SARs DACs 2(6 bit) 4(8 bit) 4(8 bit) I/Os 64 72 72
  • 3. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 3 | Page IV. TECHNIQUES USED The performance of the SRS degrades when implemented in real world environment. This degradation is due to acoustic model mismatch. The acoustic model mismatch describes the difference between the environment in which the SRS is tested and the actual environment in which it is deployed. This can include echoes, background noise, speaker variability and transmission effects. 1. Log MMSE Filtering Fig.2 Noise reduction using Log MMSE 1.1 Preprocessing In preprocessing of audio signals start with pre-emphasis refers to a system process designed to increase the magnitude of some frequencies with respect to the magnitude of other frequencies in order to improve the overall signal-to-noise ratio by minimizing the adverse effects of such phenomena as attenuation distortion or saturation of recording media in subsequent parts of the system. The mirror operation is called de- emphasis, and the system as a whole is called emphasis. Pre-emphasis is achieved with a pre-emphasis network which is essentially a calibrated filter. This network composed of two resistors and one capacitor. The frequency response is decided by special time constants. The cutoff frequency can be calculated from that value. Pre-emphasis is commonly used in telecommunications, digital audio recording, record cutting, in FM broadcasting transmissions, and in displaying the spectrograms of speech signals. De-emphasis is the complement of pre-emphasis, in the anti noise system called emphasis. Emphasis is a system process designed to decrease, (within a band of frequencies), the magnitude of some (usually higher) frequencies with respect to the magnitude of other (usually lower) frequencies in order to improve the overall signal-to-noise ratio by minimizing the adverse effects of such phenomena as attenuation differences or saturation of recording media in subsequent parts of the system. 1.2 Segment The signal samples are segmented into fixed number of frames and each frame samples are evaluated with hamming window coefficients. The total frames are calculated by, Fn = (Ls – Ns)/ (Ns*Sp) +1 Where, Ls = length of signal Ns = Length of each frame Sp = Shift Percentage Finally the samples of each frames are separated from input signal using Fn and Sp and its scaled by the hamming window coefficients.
  • 4. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 4 | Page Fig.3 Segmented signal 1.3 Filtering After the signal segmentation, the magnitude and phase spectrum from noisy signal are computed by applying fast fourier transform.The magnitude of noisy signal spectrum is further utilized for filtering process and signal phase kept same. The restored signal magnitude spectra is obtained by, Rs = G .* Y Where, G – Log spectral amplitude Gain function Y – Magnitude response of noisy signal The log spectral gain function is defined by, G = x./(1+x) exp(eint (v)) Where, v = x./(1+x) * r x and r– Priori and posteriori signal to noise ratio eint – Exponential integral The posteriori snr is defined by, r = (Y.^2)/ lamda lamda = E[(Y).^2] Where, lamda - Noise power spectrum variance E – Mean value Complex spectrogram obtained by Filtered magnitude spectrum is combined with noisy signal phase spectrum. The restored signal is reconstructed by applying inverse fast fourier transform to this complex spectrogram.The performance of filtering is measured with SNR evaluation and it is defined by, SNR= 10log10 (Msig^2/(sum(in-out)^2/Ls)) Where, Msig = Maximum amplitude of signal in, out= Noisy input signal and restored output. 1.4 Concatenate The signals that were obtained on segmentation are now filtered and the noise content is reduced. These signals are then concatenated to reform the original signal. 1.5 Restore The restored noiseless signal is restored to its original form. 2. MEL FREQUENCY CEPSTRUM COEFFICIENT(MFCC) To increase the robustness in the frame selection process, a robust feature extraction with short time Fourier transform (STFT) domain uncertainty propagation (STFT-UP) was used. Our implementation followed using STFT-UP to compute a minimum mean square error (MMSE) estimate directly in the mel-frequency cepstral coefficient (MFCC) domain. For this particular implementation, amplitude based MFCCs with cepstral mean subtraction were used to attain improved performance. 0 20 40 60 80 100 120 140 160 180 200 -0.06 -0.04 -0.02 0 0.02 0.04 0.06 0 20 40 60 80 100 120 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
  • 5. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 5 | Page Fig. 4 MFCC In speech recognition, the Mel-frequency cepstrum (MFC) is a representation of the shortterm power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear Mel scale of frequency. MFCC = DCT [ LOG [ ABS [ FFT (SPEECH) ]]] Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC. The difference between the cepstrum and the mel-frequency cepstrum is that in the MFC, the frequency bands are equally spaced on the mel scale, which approximates the human auditory system's response more closely than the linearly spaced frequency bands used in the normal cepstrum. This frequency warping can allow for better representation of sound. V. TRAINING AND TESTING The goal of the system training and inventory design stage is twofold: we need to divide the inventory into collections of phonetically similar segments with varying lengths and we need to arrive at a statistical description that tells us which set of collections is most likely to contain the inventory subsection that best matches the underlying clean frame of an incoming noisy frame . The division of the inventory into the collections is performed in a step-by-step fashion. First, is segmented and all silent segments are removed. The non-silent part of the inventory is then divided into sections that each belong to one of 40 phonetic classes. We are applying the feature extraction to the entire segment stream of the inventory. Because the inventory signal is assumed to be virtually undistorted, it is sufficient to only retain the resulting short-time MFCC feature means and to discard the associated variance estimates. The feature means become, thereby, essentially feature vectors in their own right and we can develop a cluster model for them. We have decided to use the means and not the actual cepstral vectors at this stage to ensure that the impact of the mean-extraction-processing is captured in our feature representation. VI. RESULTS Fig. 5 Noisy Signal
  • 6. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 6 | Page Fig. 6 Log MMSE Filtering Fig. 7 MFCC Feature Extraction For experimental basis, speech signals from noisy environments were taken. The result shown is fr a signal in the airport. The noise is cancelled using LogMMSE Filtering and feature extraction is done by MFCC. VI. CONCLUSION In this paper speech recognition is done and appliances are controlled using speech commands. The commands are given by the user in the form of speech. These signals are filtered using LogMMSE filtering due to which environmental noise is nullified. Further as a part of feature extraction, MFCC is used. A database is created where all the commands are saved. These commands are then given as input to the PSoC5 kit where the PSoC is programmed to give the desired result. According to the PSoC, the robot connected to the PSoC moves in different directions as specified. REFERENCES [1]. V. Naresh, B. Venkataramani, Abhishek Karan and J. Manikandan, " PSoCbased isolated speech recognition system, " International conference on Communication and Signal Processing, April 3-5, 2013. [2]. R Namba, K Kobayashi, T Ohkubo and Y.Kurihara, "Development of PSoC microcontroller based solar energy storage system," Proceedings of SICE Annual Conference (SICE), 2011, pp.718-721, 2011. [3]. J. Manikandan, B. Venkataramani, K. Girish, H. Karthic, V. Siddharth, "Hardware Implementation of Real-Time Speech Recognition System Using TMS320C6713 DSP",24th International Conference onVLSI Design (VLSI Design),pp.250-255, 2011 [4]. Jingchuan Wang and WeidongChen , "Integration of PSoC technology with educational robotics", International Conference on Field-Programmable Technology (FPT), 2010, pp.332-336, 2010.
  • 7. PSoC BASED SPEECH RECOGNITION SYSTEM www.irjes.com 7 | Page [5]. Cheng-Yuan Chang , Ching-Fa Chen , Shing-Tai Pan , Xu-Yu Li, " The speech recognition chip implementation on FPGA “, Mechanical and Electronics Engineering (ICMEE), 2nd International Conference,2010. [6]. V. Amudha, B. Venkataramani, J. Manikandan, "FPGA implementation of isolated digit recognition system using modified back propagation algorithm,"International Conference on Electronic Design ICED 2008, pp.1-6. [7]. V.Amudha, B.Venkataramani, R.Vinoth Kumar and S. Ravishankar,“SOC Implementation of HMM Based Speaker Independent Isolated Digit Recognition System”, in Proc. of IEEE Int. Conf. on VLSIDesign VLSI’07, 2007, pp.848-853. [8]. Trihandyo, A. Belloum, A. Kun-Mean Hou, ″A real-time speech recognition architecture for a multi- channel interactive voice response system″, Iternational Conference on Acoustics, Speech, and Signal ProcessingICASSP-97, vol.2, pp.1527-1530,1997 [9]. Mike Wald, Using Automatic Speech Recognition to Enhance Education for All Students: Turning a Vision into Reality [A].In Proceedings of 34th ASEE/IEEE Frontiers in Education Conference S3G, Indianapolis, Indiana, 2005, pp 22-25. [10]. N. Hataoka, H. Kokubo, Y. Obuchi, and A. Amano, "Compact and robust speech recognition for embedded use on microprocessors," IEEE Workshop on Multimedia Signal Processing, pp. 288-291, 9- 11 Dec. 2002.