This document discusses a speaker identification system that uses Mel Frequency Cepstral Coefficients (MFCC) for feature extraction and a Support Vector Machine (SVM) classifier. MFCC is used to extract statistical features from voice samples that capture unique characteristics. An SVM classifier then classifies voices as original or disguised based on the extracted features. The system was tested on 176 voice samples from various sources with 30 for training and 72 for testing. MFCC extracted mean and correlation coefficients as features. SVM classification accurately identified original vs disguised voices 90% of the time, demonstrating the effectiveness of the proposed system.
In this paper we present the implementation of speaker identification system using artificial neural network
with digital signal processing. The system is designed to work with the text-dependent speaker
identification for Bangla Speech. The utterances of speakers are recorded for specific Bangla words using
an audio wave recorder. The speech features are acquired by the digital signal processing technique. The
identification of speaker using frequency domain data is performed using backpropagation algorithm.
Hamming window and Blackman-Harris window are used to investigate better speaker identification
performance. Endpoint detection of speech is developed in order to achieve high accuracy of the system.
Review Paper on Noise Reduction Using Different TechniquesIRJET Journal
This document reviews different techniques for noise reduction in hearing aids. It discusses how digital hearing aids use algorithms like fast Fourier transforms and adaptive filters to separate speech from noise and improve speech comprehension in noisy environments. The document also evaluates different noise reduction algorithms and filter bank designs that have been proposed in previous research to enhance the noise reduction capabilities of hearing aids. The goal of developing improved noise reduction for hearing aids is to help hearing-impaired individuals better understand speech in various real-world settings with background noise.
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET Journal
This document provides a survey of various speech enhancement techniques. It discusses five papers that propose different speech enhancement algorithms: 1) Discrete Tchebichef Transform and Discrete Krawtchouk Transform for removing noise using minimum mean square error. 2) Empirical mode decomposition and adaptive centre weighted average filtering that is effective for removing noise components. 3) Adaptive Wiener filtering that adapts the filter transfer function based on speech signal statistics. 4) Compressive sensing based speech enhancement that handles non-sparse noise. 5) Wavelet packet transform and non-negative matrix factorization to emphasize the speech components in each sub-band. The document also discusses speech enhancement using deep neural networks, empirical mode decomposition with Hurst exponent
This document discusses speech recognition techniques. It begins by defining biometrics and how speech can be used as a biometric for identity authentication. It describes how speech recognition aims to extract lexical information independently of the speaker, while speaker recognition focuses on extracting the identity of the speaker. The document then discusses feature extraction using MFCC and modeling speech using neural networks. It provides an overview of pattern recognition techniques including statistical and structural approaches. Finally, it discusses implementation details such as preprocessing, framing, windowing and feature extraction of speech signals.
Utterance Based Speaker Identification Using ANNIJCSEA Journal
This document summarizes a research paper on speaker identification using artificial neural networks. The paper presents a speaker identification system that uses digital signal processing and ANN techniques. Speech features are extracted from utterances using FFT and windowing. These features are used to train a multi-layer perceptron network to classify speakers. The system was tested on Bangla speech and achieved accurate identification of speakers from their utterances.
Utterance Based Speaker Identification Using ANNIJCSEA Journal
In this paper we present the implementation of speaker identification system using artificial neural network with digital signal processing. The system is designed to work with the text-dependent speaker identification for Bangla Speech. The utterances of speakers are recorded for specific Bangla words using an audio wave recorder. The speech features are acquired by the digital signal processing technique. The identification of speaker using frequency domain data is performed using back propagation algorithm. Hamming window and Blackman-Harris window are used to investigate better speaker identification performance. Endpoint detection of speech is developed in order to achieve high accuracy of the system.
This document discusses feature extraction techniques for isolated word speech recognition. It begins with an introduction to digital speech processing and speech recognition models. The main part of the document compares two common feature extraction techniques: Mel Frequency Cepstral Coefficients (MFCC) and Relative Spectral (RASTA) filtering. MFCC allows signals to extract feature vectors and provides high performance but lacks robustness. RASTA filtering reduces the impact of noise in signals and provides high robustness by band-passing feature coefficients in both log spectral and spectral domains. The document provides details on the process of MFCC feature extraction, which involves steps like framing, windowing, fast Fourier transform, mel filtering, discrete cosine transform, and calculating
Speaker Identification & Verification Using MFCC & SVMIRJET Journal
This document discusses a method for speaker identification and verification using Mel Frequency Cepstral Coefficients (MFCC) and Support Vector Machines (SVM). MFCC is used to extract features from speech samples, representing the spectral properties of the voice. SVM is then used to match these features between speech samples to identify or verify speakers. The authors claim this method can identify speakers with up to 95% accuracy and is computationally efficient. It is proposed that MFCC and SVM outperform other techniques for speaker recognition tasks. The system is described as having applications in security, voice interfaces, and other voice-based systems.
In this paper we present the implementation of speaker identification system using artificial neural network
with digital signal processing. The system is designed to work with the text-dependent speaker
identification for Bangla Speech. The utterances of speakers are recorded for specific Bangla words using
an audio wave recorder. The speech features are acquired by the digital signal processing technique. The
identification of speaker using frequency domain data is performed using backpropagation algorithm.
Hamming window and Blackman-Harris window are used to investigate better speaker identification
performance. Endpoint detection of speech is developed in order to achieve high accuracy of the system.
Review Paper on Noise Reduction Using Different TechniquesIRJET Journal
This document reviews different techniques for noise reduction in hearing aids. It discusses how digital hearing aids use algorithms like fast Fourier transforms and adaptive filters to separate speech from noise and improve speech comprehension in noisy environments. The document also evaluates different noise reduction algorithms and filter bank designs that have been proposed in previous research to enhance the noise reduction capabilities of hearing aids. The goal of developing improved noise reduction for hearing aids is to help hearing-impaired individuals better understand speech in various real-world settings with background noise.
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET Journal
This document provides a survey of various speech enhancement techniques. It discusses five papers that propose different speech enhancement algorithms: 1) Discrete Tchebichef Transform and Discrete Krawtchouk Transform for removing noise using minimum mean square error. 2) Empirical mode decomposition and adaptive centre weighted average filtering that is effective for removing noise components. 3) Adaptive Wiener filtering that adapts the filter transfer function based on speech signal statistics. 4) Compressive sensing based speech enhancement that handles non-sparse noise. 5) Wavelet packet transform and non-negative matrix factorization to emphasize the speech components in each sub-band. The document also discusses speech enhancement using deep neural networks, empirical mode decomposition with Hurst exponent
This document discusses speech recognition techniques. It begins by defining biometrics and how speech can be used as a biometric for identity authentication. It describes how speech recognition aims to extract lexical information independently of the speaker, while speaker recognition focuses on extracting the identity of the speaker. The document then discusses feature extraction using MFCC and modeling speech using neural networks. It provides an overview of pattern recognition techniques including statistical and structural approaches. Finally, it discusses implementation details such as preprocessing, framing, windowing and feature extraction of speech signals.
Utterance Based Speaker Identification Using ANNIJCSEA Journal
This document summarizes a research paper on speaker identification using artificial neural networks. The paper presents a speaker identification system that uses digital signal processing and ANN techniques. Speech features are extracted from utterances using FFT and windowing. These features are used to train a multi-layer perceptron network to classify speakers. The system was tested on Bangla speech and achieved accurate identification of speakers from their utterances.
Utterance Based Speaker Identification Using ANNIJCSEA Journal
In this paper we present the implementation of speaker identification system using artificial neural network with digital signal processing. The system is designed to work with the text-dependent speaker identification for Bangla Speech. The utterances of speakers are recorded for specific Bangla words using an audio wave recorder. The speech features are acquired by the digital signal processing technique. The identification of speaker using frequency domain data is performed using back propagation algorithm. Hamming window and Blackman-Harris window are used to investigate better speaker identification performance. Endpoint detection of speech is developed in order to achieve high accuracy of the system.
This document discusses feature extraction techniques for isolated word speech recognition. It begins with an introduction to digital speech processing and speech recognition models. The main part of the document compares two common feature extraction techniques: Mel Frequency Cepstral Coefficients (MFCC) and Relative Spectral (RASTA) filtering. MFCC allows signals to extract feature vectors and provides high performance but lacks robustness. RASTA filtering reduces the impact of noise in signals and provides high robustness by band-passing feature coefficients in both log spectral and spectral domains. The document provides details on the process of MFCC feature extraction, which involves steps like framing, windowing, fast Fourier transform, mel filtering, discrete cosine transform, and calculating
Speaker Identification & Verification Using MFCC & SVMIRJET Journal
This document discusses a method for speaker identification and verification using Mel Frequency Cepstral Coefficients (MFCC) and Support Vector Machines (SVM). MFCC is used to extract features from speech samples, representing the spectral properties of the voice. SVM is then used to match these features between speech samples to identify or verify speakers. The authors claim this method can identify speakers with up to 95% accuracy and is computationally efficient. It is proposed that MFCC and SVM outperform other techniques for speaker recognition tasks. The system is described as having applications in security, voice interfaces, and other voice-based systems.
Classification of Language Speech Recognition Systemijtsrd
This paper is aimed to implement Classification of Language Speech Recognition System by using feature extraction and classification. It is an Automatic language Speech Recognition system. This system is a software architecture which outputs digits from the input speech signals. The system is emphasized on Speaker Dependent Isolated Word Recognition System. To implement this system, a good quality microphone is required to record the speech signals. This system contains two main modules feature extraction and feature matching. Feature extraction is the process of extracting a small amount of data from the voice signal that can later be used to represent each speech signal. Feature matching involves the actual procedure to identify the unknown speech signal by comparing extracted features from the voice input of a set of known speech signals and the decision making process. In this system, the Mel frequency Cepstrum Coefficient MFCC is used for feature extraction and Vector Quantization VQ which uses the LBG algorithm is used for feature matching. Khin May Yee | Moh Moh Khaing | Thu Zar Aung "Classification of Language Speech Recognition System" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5 , August 2019, URL: https://www.ijtsrd.com/papers/ijtsrd26546.pdfPaper URL: https://www.ijtsrd.com/computer-science/speech-recognition/26546/classification-of-language-speech-recognition-system/khin-may-yee
The document provides an introduction and overview of sub-band coding for speech signals. It discusses how sub-band coding works to divide the speech spectrum into multiple sub-bands using filter banks, then encode each sub-band separately to reduce the bit rate. The key steps involve analysis filtering, quantization and coding of sub-bands, then synthesis filtering to reconstruct the signal. MATLAB code is presented to implement a sub-band coder using quadrature mirror filters. Applications of sub-band coding include speech and audio coding standards.
Adaptive wavelet thresholding with robust hybrid features for text-independe...IJECEIAES
The robustness of speaker identification system over additive noise channel is crucial for real-world applications. In speaker identification (SID) systems, the extracted features from each speech frame are an essential factor for building a reliable identification system. For clean environments, the identification system works well; in noisy environments, there is an additive noise, which is affect the system. To eliminate the problem of additive noise and to achieve a high accuracy in speaker identification system a proposed algorithm for feature extraction based on speech enhancement and a combined features is presents. In this paper, a wavelet thresholding pre-processing stage, and feature warping (FW) techniques are used with two combined features named power normalized cepstral coefficients (PNCC) and gammatone frequency cepstral coefficients (GFCC) to improve the identification system robustness against different types of additive noises. Universal background model Gaussian mixture model (UBM-GMM) is used for features matching between the claim and actual speakers. The results showed performance improvement for the proposed feature extraction algorithm of identification system comparing with conventional features over most types of noises and different SNR ratios.
This document summarizes research on speaker recognition technologies. It discusses how speaker recognition can be used for biometric authentication by analyzing a person's voiceprint. It reviews literature on MFCC-GMM models for text-independent speaker verification and the use of speaker recognition in biometric security systems. The document also outlines the basic components of a speaker recognition system, including enrollment, feature extraction using MFCCs, and verification through comparison to stored voice templates using algorithms like GMM.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
This is a ppt on speech recognition system or automated speech recognition system. I hope that it would be helpful for all the people searching for a presentation on this technology
The following resources come from the 2009/10 B.Sc in Media Technology and Digital Broadcast (course number 2ELE0076) from the University of Hertfordshire. All the mini projects are designed as level two modules of the undergraduate programmes.
IRJET- Adverse Environment and its Compensation Approaches in Speech RecognitionIRJET Journal
1) Speech recognition systems often have lower accuracy in real-world applications compared to experimental settings due to noise, distortions, and adverse environments in real settings like vehicles and aircraft cockpits.
2) Adverse environments mismatch the training and testing conditions of speech recognition systems. Major causes of adverse environments include noise, distortions from equipment, and changes in speech from stress.
3) Several approaches can help compensate for adverse environments, such as multi-style training with varied speech, signal enhancement preprocessing to remove noise, and developing robust representations of speech that are resistant to noise corruption. Properly applying such techniques can improve the performance of speech recognition in noisy real-world conditions.
This document summarizes a speech recognition system (SRS). SRS uses speech identification and verification. Speech identification determines which registered speaker provided an utterance by extracting features like mel-frequency cepstrum coefficients and comparing them. Speech verification accepts or rejects an identity claim by clustering training vectors from an enrollment session into speaker-specific codebooks using vector quantization. Applications of SRS include banking by phone, voice dialing, voice mail, and security control.
In the present-day communications speech signals get contaminated due to
various sorts of noises that degrade the speech quality and adversely impacts
speech recognition performance. To overcome these issues, a novel approach
for speech enhancement using Modified Wiener filtering is developed and
power spectrum computation is applied for degraded signal to obtain the
noise characteristics from a noisy spectrum. In next phase, MMSE technique
is applied where Gaussian distribution of each signal i.e. original and noisy
signal is analyzed. The Gaussian distribution provides spectrum estimation
and spectral coefficient parameters which can be used for probabilistic model
formulation. Moreover, a-priori-SNR computation is also incorporated for
coefficient updation and noise presence estimation which operates similar to
the conventional VAD. However, conventional VAD scheme is based on the
hard threshold which is not capable to derive satisfactory performance and a
soft-decision based threshold is developed for improving the performance of
speech enhancement. An extensive simulation study is carried out using
MATLAB simulation tool on NOIZEUS speech database and a comparative
study is presented where proposed approach is proved better in comparison
with existing technique.
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...csandit
This document describes a study that developed a speech recognition system for recognizing spoken Malayalam digits. It used two wavelet-based feature extraction techniques - Discrete Wavelet Transforms (DWT) and Wavelet Packet Decomposition (WPD) - and evaluated their performance using a Naive Bayes classifier. DWT achieved 83.5% accuracy and WPD achieved 80.7% accuracy. To improve recognition accuracy, the study introduced a new technique called Discrete Wavelet Packet Decomposition (DWPD) that utilizes features from both DWT and WPD. DWPD achieved the highest accuracy of 86.2% along with the Naive Bayes classifier.
Implementation of Digital Hearing AID for Sensory Neural Impairmentijtsrd
Hearing impairment is a chronicle disability affecting on people in world. The hearing aid is to amplify sound to overcome a hearing loss or impairment. The hearing aid picks up the sound signal with the microphone and amplifies all frequency sound signals but the sensory neural impairment person cannot hear particular frequency of sound in a noisy environment, since the auditory nerve is damaged. In this paper we are using MATLAB to design an adaptive filter for noise removal and filter banks for amplifies the particular frequency that a person with hearing loss can listen. Navya Bharathi K S | Bindu Shree C | Dr. V Udayashankara "Implementation of Digital Hearing AID for Sensory Neural Impairment" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-4 | Issue-4 , June 2020, URL: https://www.ijtsrd.com/papers/ijtsrd31133.pdf Paper Url :https://www.ijtsrd.com/engineering/bio-mechanicaland-biomedical-engineering/31133/implementation-of-digital-hearing-aid-for-sensory-neural-impairment/navya-bharathi-k-s
This document provides an overview of speech recognition including:
- The topics that will be covered such as speech production, why speech recognition is difficult, and applications.
- How speech is produced through the lungs, larynx, and vocal tract and modified into different sounds.
- The main components of a speech recognition system including sound sampling, conversion to frequencies, and matching to a phoneme database.
- Some of the challenges in speech recognition including variations between speakers and dependence on neighboring sounds.
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
This document summarizes and compares several techniques for enhancing the intelligibility of speech signals corrupted by noise. It describes single channel techniques like spectral subtraction, spectral subtraction with oversubtraction, and nonlinear spectral subtraction. It also covers multi-channel techniques such as adaptive noise cancellation and multisensory beamforming. Additionally, it discusses spectral subtraction using adaptive averaging, noise reduction using enhanced Wiener filtering, and other adaptive neuro-fuzzy techniques for speech enhancement. The goal of these techniques is to improve the quality and intelligibility of noisy speech signals.
The document discusses automatic speech recognition (ASR) and its components. It covers two main approaches to ASR - template matching and feature analysis. Template matching uses pre-recorded templates to match voices, while feature analysis processes voices into features using techniques like linear predictive coding to determine similarities between expected and actual input. The document also describes the process of speech recognition including acoustic wave form analysis, feature extraction, and matching phonemes, words and sentences.
Speech recognition, also known as automatic speech recognition or computer speech recognition, allows computers to understand human voice. It has various applications such as dictation, system control/navigation, and commercial/industrial uses. The process involves converting analog audio of speech into digital format, then using acoustic and language models to analyze the speech and output text. There are two main types: speaker-dependent which requires training a model for each user, and speaker-independent which can recognize any voice without training. Accuracy is improving over time as technology advances.
This document discusses speech signal processing and speech recognition. It begins by defining speech processing and its relationship to digital signal processing. It then outlines several disciplines related to speech processing including signal processing, physics, pattern recognition, and computer science. The document discusses aspects of speech signals including phonemes, the speech waveform, and spectral envelope. It covers various aspects of speech processing including pre-processing, feature extraction, and recognition. It provides details on techniques for pre-processing, feature extraction including filtering, linear predictive coding, and cepstrum. Finally, it summarizes the main steps in a speech recognition procedure including endpoint detection, framing and windowing, feature extraction, and distortion measure calculations for recognition.
The document discusses voice recognition using MatLab. It introduces voice recognition as the process of converting acoustic signals to words. Voice recognition can be used for transcription, command and control, and information access. It discusses the principles and methods of voice recognition, including text-dependent and text-independent approaches. The document outlines the key components of a voice recognition system, including feature extraction using mel-frequency cepstrum coefficients (MFCC), recognition models, and applications like device control and mobile phones. It also reviews the advantages, limitations, and future of voice recognition technology.
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABsipij
1) The document describes the development of a speaker-dependent speech recognition system using MATLAB. It uses Gaussian mixture models for acoustic modeling and mel-frequency cepstral coefficients for feature extraction.
2) The system is designed to recognize isolated digits 0-9. Voice activity detection is performed to detect segments of speech. Various windowing functions are evaluated to reduce spectral leakage during feature extraction.
3) 13 MFCCs plus energy are extracted from each 30ms frame with 10ms shift. The first and second derivatives are also calculated to capture dynamic information, resulting in a 39-dimensional feature vector.
Speaker Recognition System using MFCC and Vector Quantization Approachijsrd.com
This paper presents an approach to speaker recognition using frequency spectral information with Mel frequency for the improvement of speech feature representation in a Vector Quantization codebook based recognition approach. The Mel frequency approach extracts the features of the speech signal to get the training and testing vectors. The VQ Codebook approach uses training vectors to form clusters and recognize accurately with the help of LBG algorithm.
Voice recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. This technique makes it possible to use the speaker's voice to verify their identity and control access to services such as voice dialing, banking by telephone, telephone shopping, database access services, information services, voice mail, security control for confidential information areas, and remote access to computers.
This document describes how to build a simple, yet complete and representative automatic speaker recognition system. Such a speaker recognition system has potential in many security applications. For example, users have to speak a PIN (Personal Identification Number) in order to gain access to the laboratory door, or users have to speak their credit card number over the telephone line to verify their identity. By checking the voice characteristics of the input utterance, using an automatic speaker recognition system similar to the one that we will describe, the system is able to add an extra level of security.
This document presents a text-dependent speaker recognition system using neural networks that aims to improve recognition accuracy. It proposes changing the number of Mel Frequency Cepstral Coefficients (MFCCs) used in training. Voice Activity Detection is also used as a preprocessing step. Experimental results show recognition accuracy increases from 70.41% to a maximum of 92.91% as the number of MFCCs increases from 14 to 20, but then decreases with more MFCCs. The system is implemented on a Raspberry Pi for hardware acceleration.
Classification of Language Speech Recognition Systemijtsrd
This paper is aimed to implement Classification of Language Speech Recognition System by using feature extraction and classification. It is an Automatic language Speech Recognition system. This system is a software architecture which outputs digits from the input speech signals. The system is emphasized on Speaker Dependent Isolated Word Recognition System. To implement this system, a good quality microphone is required to record the speech signals. This system contains two main modules feature extraction and feature matching. Feature extraction is the process of extracting a small amount of data from the voice signal that can later be used to represent each speech signal. Feature matching involves the actual procedure to identify the unknown speech signal by comparing extracted features from the voice input of a set of known speech signals and the decision making process. In this system, the Mel frequency Cepstrum Coefficient MFCC is used for feature extraction and Vector Quantization VQ which uses the LBG algorithm is used for feature matching. Khin May Yee | Moh Moh Khaing | Thu Zar Aung "Classification of Language Speech Recognition System" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5 , August 2019, URL: https://www.ijtsrd.com/papers/ijtsrd26546.pdfPaper URL: https://www.ijtsrd.com/computer-science/speech-recognition/26546/classification-of-language-speech-recognition-system/khin-may-yee
The document provides an introduction and overview of sub-band coding for speech signals. It discusses how sub-band coding works to divide the speech spectrum into multiple sub-bands using filter banks, then encode each sub-band separately to reduce the bit rate. The key steps involve analysis filtering, quantization and coding of sub-bands, then synthesis filtering to reconstruct the signal. MATLAB code is presented to implement a sub-band coder using quadrature mirror filters. Applications of sub-band coding include speech and audio coding standards.
Adaptive wavelet thresholding with robust hybrid features for text-independe...IJECEIAES
The robustness of speaker identification system over additive noise channel is crucial for real-world applications. In speaker identification (SID) systems, the extracted features from each speech frame are an essential factor for building a reliable identification system. For clean environments, the identification system works well; in noisy environments, there is an additive noise, which is affect the system. To eliminate the problem of additive noise and to achieve a high accuracy in speaker identification system a proposed algorithm for feature extraction based on speech enhancement and a combined features is presents. In this paper, a wavelet thresholding pre-processing stage, and feature warping (FW) techniques are used with two combined features named power normalized cepstral coefficients (PNCC) and gammatone frequency cepstral coefficients (GFCC) to improve the identification system robustness against different types of additive noises. Universal background model Gaussian mixture model (UBM-GMM) is used for features matching between the claim and actual speakers. The results showed performance improvement for the proposed feature extraction algorithm of identification system comparing with conventional features over most types of noises and different SNR ratios.
This document summarizes research on speaker recognition technologies. It discusses how speaker recognition can be used for biometric authentication by analyzing a person's voiceprint. It reviews literature on MFCC-GMM models for text-independent speaker verification and the use of speaker recognition in biometric security systems. The document also outlines the basic components of a speaker recognition system, including enrollment, feature extraction using MFCCs, and verification through comparison to stored voice templates using algorithms like GMM.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
This is a ppt on speech recognition system or automated speech recognition system. I hope that it would be helpful for all the people searching for a presentation on this technology
The following resources come from the 2009/10 B.Sc in Media Technology and Digital Broadcast (course number 2ELE0076) from the University of Hertfordshire. All the mini projects are designed as level two modules of the undergraduate programmes.
IRJET- Adverse Environment and its Compensation Approaches in Speech RecognitionIRJET Journal
1) Speech recognition systems often have lower accuracy in real-world applications compared to experimental settings due to noise, distortions, and adverse environments in real settings like vehicles and aircraft cockpits.
2) Adverse environments mismatch the training and testing conditions of speech recognition systems. Major causes of adverse environments include noise, distortions from equipment, and changes in speech from stress.
3) Several approaches can help compensate for adverse environments, such as multi-style training with varied speech, signal enhancement preprocessing to remove noise, and developing robust representations of speech that are resistant to noise corruption. Properly applying such techniques can improve the performance of speech recognition in noisy real-world conditions.
This document summarizes a speech recognition system (SRS). SRS uses speech identification and verification. Speech identification determines which registered speaker provided an utterance by extracting features like mel-frequency cepstrum coefficients and comparing them. Speech verification accepts or rejects an identity claim by clustering training vectors from an enrollment session into speaker-specific codebooks using vector quantization. Applications of SRS include banking by phone, voice dialing, voice mail, and security control.
In the present-day communications speech signals get contaminated due to
various sorts of noises that degrade the speech quality and adversely impacts
speech recognition performance. To overcome these issues, a novel approach
for speech enhancement using Modified Wiener filtering is developed and
power spectrum computation is applied for degraded signal to obtain the
noise characteristics from a noisy spectrum. In next phase, MMSE technique
is applied where Gaussian distribution of each signal i.e. original and noisy
signal is analyzed. The Gaussian distribution provides spectrum estimation
and spectral coefficient parameters which can be used for probabilistic model
formulation. Moreover, a-priori-SNR computation is also incorporated for
coefficient updation and noise presence estimation which operates similar to
the conventional VAD. However, conventional VAD scheme is based on the
hard threshold which is not capable to derive satisfactory performance and a
soft-decision based threshold is developed for improving the performance of
speech enhancement. An extensive simulation study is carried out using
MATLAB simulation tool on NOIZEUS speech database and a comparative
study is presented where proposed approach is proved better in comparison
with existing technique.
COMBINED FEATURE EXTRACTION TECHNIQUES AND NAIVE BAYES CLASSIFIER FOR SPEECH ...csandit
This document describes a study that developed a speech recognition system for recognizing spoken Malayalam digits. It used two wavelet-based feature extraction techniques - Discrete Wavelet Transforms (DWT) and Wavelet Packet Decomposition (WPD) - and evaluated their performance using a Naive Bayes classifier. DWT achieved 83.5% accuracy and WPD achieved 80.7% accuracy. To improve recognition accuracy, the study introduced a new technique called Discrete Wavelet Packet Decomposition (DWPD) that utilizes features from both DWT and WPD. DWPD achieved the highest accuracy of 86.2% along with the Naive Bayes classifier.
Implementation of Digital Hearing AID for Sensory Neural Impairmentijtsrd
Hearing impairment is a chronicle disability affecting on people in world. The hearing aid is to amplify sound to overcome a hearing loss or impairment. The hearing aid picks up the sound signal with the microphone and amplifies all frequency sound signals but the sensory neural impairment person cannot hear particular frequency of sound in a noisy environment, since the auditory nerve is damaged. In this paper we are using MATLAB to design an adaptive filter for noise removal and filter banks for amplifies the particular frequency that a person with hearing loss can listen. Navya Bharathi K S | Bindu Shree C | Dr. V Udayashankara "Implementation of Digital Hearing AID for Sensory Neural Impairment" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-4 | Issue-4 , June 2020, URL: https://www.ijtsrd.com/papers/ijtsrd31133.pdf Paper Url :https://www.ijtsrd.com/engineering/bio-mechanicaland-biomedical-engineering/31133/implementation-of-digital-hearing-aid-for-sensory-neural-impairment/navya-bharathi-k-s
This document provides an overview of speech recognition including:
- The topics that will be covered such as speech production, why speech recognition is difficult, and applications.
- How speech is produced through the lungs, larynx, and vocal tract and modified into different sounds.
- The main components of a speech recognition system including sound sampling, conversion to frequencies, and matching to a phoneme database.
- Some of the challenges in speech recognition including variations between speakers and dependence on neighboring sounds.
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
This document summarizes and compares several techniques for enhancing the intelligibility of speech signals corrupted by noise. It describes single channel techniques like spectral subtraction, spectral subtraction with oversubtraction, and nonlinear spectral subtraction. It also covers multi-channel techniques such as adaptive noise cancellation and multisensory beamforming. Additionally, it discusses spectral subtraction using adaptive averaging, noise reduction using enhanced Wiener filtering, and other adaptive neuro-fuzzy techniques for speech enhancement. The goal of these techniques is to improve the quality and intelligibility of noisy speech signals.
The document discusses automatic speech recognition (ASR) and its components. It covers two main approaches to ASR - template matching and feature analysis. Template matching uses pre-recorded templates to match voices, while feature analysis processes voices into features using techniques like linear predictive coding to determine similarities between expected and actual input. The document also describes the process of speech recognition including acoustic wave form analysis, feature extraction, and matching phonemes, words and sentences.
Speech recognition, also known as automatic speech recognition or computer speech recognition, allows computers to understand human voice. It has various applications such as dictation, system control/navigation, and commercial/industrial uses. The process involves converting analog audio of speech into digital format, then using acoustic and language models to analyze the speech and output text. There are two main types: speaker-dependent which requires training a model for each user, and speaker-independent which can recognize any voice without training. Accuracy is improving over time as technology advances.
This document discusses speech signal processing and speech recognition. It begins by defining speech processing and its relationship to digital signal processing. It then outlines several disciplines related to speech processing including signal processing, physics, pattern recognition, and computer science. The document discusses aspects of speech signals including phonemes, the speech waveform, and spectral envelope. It covers various aspects of speech processing including pre-processing, feature extraction, and recognition. It provides details on techniques for pre-processing, feature extraction including filtering, linear predictive coding, and cepstrum. Finally, it summarizes the main steps in a speech recognition procedure including endpoint detection, framing and windowing, feature extraction, and distortion measure calculations for recognition.
The document discusses voice recognition using MatLab. It introduces voice recognition as the process of converting acoustic signals to words. Voice recognition can be used for transcription, command and control, and information access. It discusses the principles and methods of voice recognition, including text-dependent and text-independent approaches. The document outlines the key components of a voice recognition system, including feature extraction using mel-frequency cepstrum coefficients (MFCC), recognition models, and applications like device control and mobile phones. It also reviews the advantages, limitations, and future of voice recognition technology.
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABsipij
1) The document describes the development of a speaker-dependent speech recognition system using MATLAB. It uses Gaussian mixture models for acoustic modeling and mel-frequency cepstral coefficients for feature extraction.
2) The system is designed to recognize isolated digits 0-9. Voice activity detection is performed to detect segments of speech. Various windowing functions are evaluated to reduce spectral leakage during feature extraction.
3) 13 MFCCs plus energy are extracted from each 30ms frame with 10ms shift. The first and second derivatives are also calculated to capture dynamic information, resulting in a 39-dimensional feature vector.
Speaker Recognition System using MFCC and Vector Quantization Approachijsrd.com
This paper presents an approach to speaker recognition using frequency spectral information with Mel frequency for the improvement of speech feature representation in a Vector Quantization codebook based recognition approach. The Mel frequency approach extracts the features of the speech signal to get the training and testing vectors. The VQ Codebook approach uses training vectors to form clusters and recognize accurately with the help of LBG algorithm.
Voice recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. This technique makes it possible to use the speaker's voice to verify their identity and control access to services such as voice dialing, banking by telephone, telephone shopping, database access services, information services, voice mail, security control for confidential information areas, and remote access to computers.
This document describes how to build a simple, yet complete and representative automatic speaker recognition system. Such a speaker recognition system has potential in many security applications. For example, users have to speak a PIN (Personal Identification Number) in order to gain access to the laboratory door, or users have to speak their credit card number over the telephone line to verify their identity. By checking the voice characteristics of the input utterance, using an automatic speaker recognition system similar to the one that we will describe, the system is able to add an extra level of security.
This document presents a text-dependent speaker recognition system using neural networks that aims to improve recognition accuracy. It proposes changing the number of Mel Frequency Cepstral Coefficients (MFCCs) used in training. Voice Activity Detection is also used as a preprocessing step. Experimental results show recognition accuracy increases from 70.41% to a maximum of 92.91% as the number of MFCCs increases from 14 to 20, but then decreases with more MFCCs. The system is implemented on a Raspberry Pi for hardware acceleration.
This document summarizes a research paper on speaker recognition using vocal tract features. It discusses extracting pitch using FFT as a vocal source feature and Mel Frequency Cepstral Coefficients (MFCCs) as vocal tract features. These features are used as inputs to a Support Vector Machine (SVM) classifier for binary speaker classification. Experimental results show the SVM achieves up to 94.42% accuracy in classifying speakers based on their MFCC features, and accuracy increases with higher signal-to-noise ratios when noise is added to speech signals. The paper concludes vocal source and tract features can be used together with SVM for text-independent speaker recognition.
Voice Recognition Based Automation System for Medical Applications and for Ph...IRJET Journal
This document describes a voice recognition-based automation system for medical applications and physically challenged patients. The system uses a voice recognition model, Arduino microcontroller, relays, LEDs, buzzers, and a motor to control an adjustable bed. Voice commands are recognized using techniques like MFCC and HMM and used to control devices via the Arduino. The system is intended to allow paralyzed patients to control devices like lights, alarms, and their bed using only voice commands for increased independence. Testing showed the system provided accurate voice recognition under various conditions.
Voice Recognition Based Automation System for Medical Applications and for Ph...IRJET Journal
This document describes a voice recognition-based automation system for medical applications and physically challenged patients. The system uses a voice recognition model, Arduino microcontroller, relays, LEDs, buzzers, and a motor to control an adjustable bed. Voice commands are recognized using techniques like MFCC and HMM and used to control devices via the Arduino. The system is intended to allow paralyzed patients to control devices like lights, alarms, and their bed using only voice commands for increased independence. Testing showed the system can accurately recognize commands and control devices with 99% accuracy under suitable conditions.
The document describes the basic principles and components of a speech recognition system. It discusses the two main modules of feature extraction and feature matching. Feature extraction involves extracting representative data from voice signals, while feature matching compares extracted features to stored reference models to identify unknown speakers. The system has two phases - an enrollment phase where reference models are built for each registered speaker, and a testing phase where input speech is matched to models to make a recognition decision. Speech recognition has applications in security systems for verifying user identities.
IRJET- Emotion recognition using Speech Signal: A ReviewIRJET Journal
This document provides a review of speech emotion recognition techniques. It discusses how speech emotion recognition systems work, including common features extracted from speech like MFCCs and LPC coefficients. Classification techniques used in these systems are also examined, such as DTW, ANN, GMM, and K-NN. The document concludes that speech emotion recognition could be useful for applications requiring natural human-computer interaction, like car systems that monitor driver emotion or educational tutorials that adapt based on student emotion.
Wavelet Based Noise Robust Features for Speaker RecognitionCSCJournals
Extraction and selection of the best parametric representation of acoustic signal is the most important task in designing any speaker recognition system. A wide range of possibilities exists for parametrically representing the speech signal such as Linear Prediction Coding (LPC) ,Mel frequency Cepstrum coefficients (MFCC) and others. MFCC are currently the most popular choice for any speaker recognition system, though one of the shortcomings of MFCC is that the signal is assumed to be stationary within the given time frame and is therefore unable to analyze the non-stationary signal. Therefore it is not suitable for noisy speech signals. To overcome this problem several researchers used different types of AM-FM modulation/demodulation techniques for extracting features from speech signal. In some approaches it is proposed to use the wavelet filterbanks for extracting the features. In this paper a technique for extracting the features by combining the above mentioned approaches is proposed. Features are extracted from the envelope of the signal and then passed through wavelet filterbank. It is found that the proposed method outperforms the existing feature extraction techniques.
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...IJCSEA Journal
This document analyzes speech recognition performance based on neural network layers and transfer functions. It describes a system setup that uses MFCC features and ANNs for acoustic modeling. Experiments were conducted by varying the number of neural network layers (1, 2, 3 layers) and transfer functions (linear, sigmoid, tangent sigmoid). Results showed that networks with linear transfer functions in all layers achieved the performance goal, while other configurations reached minimum gradient but not the goal. The best architecture was a 3-layer network with linear transfer functions.
Speech Recognized Automation System Using Speaker Identification through Wire...IOSR Journals
This document describes a speech recognized automation system using speaker identification through wireless communication. The system uses a speech processor and MATLAB coding with MFCC algorithms to perform speech recognition and speaker identification. It then wirelessly controls electrical devices based on speech commands. Testing showed 80-85% accuracy for the actual speaker and lower (10-20%) accuracy for other speakers. Future work could involve improving speaker recognition accuracy as the number of speakers increases.
Speech Recognized Automation System Using Speaker Identification through Wire...IOSR Journals
This paper discusses the methodology for a project named “Speech Recognized Automation System
using Speaker Identification through wireless communication”. This project gives the design of Automation
system using wireless communication and speaker recognition using Matlab code. Straightforward
programming interface of Matlab makes it an ideal tool for speech analysis in project. This automation system
is useful for home appliances as well as in industry. This paper discusses the overall design of a wireless
automation system which is built and implemented. The speech recognition centers on recognition of speech
commands stored in data base of Matlab and it is matched with incoming voice command of speaker. Mel
Frequency Cepstral Coefficient (MFCC) algorithm is used to recognize the speech of speaker and to extract
features of speech. It uses low-power RF ZigBee transceiver wireless communication modules which are
relatively cheap. This automation system is intended to control lights, fans and other electrical appliances in a
home or office using speech commands like Light, Fan etc. Further, if security is not big issue then Speech
processor is used to control the appliances without speaker identification
IRJET- Device Activation based on Voice Recognition using Mel Frequency Cepst...IRJET Journal
The document presents a voice recognition system that uses the Mel Frequency Cepstral Coefficients (MFCC) algorithm to extract voice features from a user's speech and activate a device if the voice matches a stored sample. The system operates in two phases: 1) recording and extracting voice features to create a database, and 2) comparing input voice samples to those in the database. If a match is found, the system activates a testing device (Arduino Uno). The MFCC algorithm frames the voice, applies windowing and FFT, filters to a triangular shape, and uses DCT to extract features. Vector quantization is used to match input features to the database for authentication and device activation. Simulation results demonstrate the system's
Speech Analysis and synthesis using VocoderIJTET Journal
Abstract— In this paper, I proposed a speech analysis and synthesis using a vocoder. Voice conversion systems do not create new speech signals, but just transform existing one. The proposed speech vocoding is different from speech coding. To analyze the speech signal and represent it with less number of bits, so that bandwidth efficiency can be increased. The Synthesis of speech signal from the received bits of information. In this paper three aspects of analysis have been discussed: pitch refinement, spectral envelope estimation and maximum voiced frequency estimation. A Quasi-harmonic analysis model can be used to implement a pitch refinement algorithm which improves the accuracy of the spectral estimation. Harmonic plus noise model to reconstruct the speech signal from parameter. Finally to achieve the highest possible resynthesis quality using the lowest possible number of bits to transmit the speech signal. Future work aims at incorporating the phase information into the analysis and modeling process and also synthesis these three aspects in different pitch period.
International Journal of Engineering and Science Invention (IJESI)inventionjournals
International Journal of Engineering and Science Invention (IJESI) is an international journal intended for professionals and researchers in all fields of computer science and electronics. IJESI publishes research articles and reviews within the whole field Engineering Science and Technology, new teaching methods, assessment, validation and the impact of new technologies and it will continue to provide information on the latest trends and developments in this ever-expanding subject. The publications of papers are selected through double peer reviewed to ensure originality, relevance, and readability. The articles published in our journal can be accessed online.
Speaker recognition is the computing task of validating a user's claimed identity using characteristics extracted from their voices. Voice -recognition is combination of the two where it uses learned aspects of a speaker’s voice to determine what is being said - such a system cannot recognize speech from random speakers very accurately, but it can reach high accuracy for individual voices it has been trained with, which gives us various applications in day today life.
This document summarizes the technical progress in speech recognition over the past few decades. It discusses the key components of a speech recognition system including speech analysis, feature extraction, language translation, and message understanding. The document also outlines the main challenges in speech recognition like variability in context, environmental conditions, speakers, and audio quality. Furthermore, it covers different types of speech recognition systems and their applications in various sectors such as education, medicine, transportation and more. The document concludes with an overview of the major approaches to speech recognition including acoustic-phonetic, pattern recognition, and artificial intelligence and discusses popular feature extraction methods used.
Speech processing is considered as crucial and an intensive field of research in the growth of robust and efficient speech recognition system. But the accuracy for speech recognition still focuses for variation of context, speaker’s variability, and environment conditions. In this paper, we stated curvelet based Feature Extraction (CFE) method for speech recognition in noisy environment and the input speech signal is decomposed into different frequency channels using the characteristics of curvelet transform for reduce the computational complication and the feature vector size successfully and they have better accuracy, varying window size because of which they are suitable for non –stationary signals. For better word classification and recognition, discrete hidden markov model can be used and as they consider time distribution of speech signals. The HMM classification method attained the maximum accuracy in term of identification rate for informal with 80.1%, scientific phrases with 86%, and control with 63.8 % detection rates. The objective of this study is to characterize the feature extraction methods and classification phage in speech recognition system. The various approaches available for developing speech recognition system are compared along with their merits and demerits. The statistical results shows that signal recognition accuracy will be increased by using discrete Curvelet transforms over conventional methods.
CURVELET BASED SPEECH RECOGNITION SYSTEM IN NOISY ENVIRONMENT: A STATISTICAL ...ijcsit
Speech processing is considered as crucial and an intensive field of research in the growth of robust and efficient speech recognition system. But the accuracy for speech recognition still focuses for variation of context, speaker’s variability, and environment conditions. In this paper, we stated curvelet based Feature Extraction (CFE) method for speech recognition in noisy environment and the input speech signal is decomposed into different frequency channels using the characteristics of curvelet transform for reduce the computational complication and the feature vector size successfully and they have better accuracy, varying window size because of which they are suitable for non –stationary signals. For better word classification and recognition, discrete hidden markov model can be used and as they consider time distribution of
speech signals. The HMM classification method attained the maximum accuracy in term of identification rate for informal with 80.1%, scientific phrases with 86%, and control with 63.8 % detection rates. The objective of this study is to characterize the feature extraction methods and classification phage in speech
recognition system. The various approaches available for developing speech recognition system are compared along with their merits and demerits. The statistical results shows that signal recognition accuracy will be increased by using discrete Curvelet transforms over conventional methods.
A Review On Speech Feature Techniques And Classification TechniquesNicole Heredia
This document discusses speech feature extraction and classification techniques for speech recognition systems. It provides an overview of common feature extraction methods like MFCC and LPC, and classification algorithms like ANN and SVM. MFCC mimics human auditory perception but provides weak power spectrum, while LPC is easy to calculate but does not capture information at a linear scale. ANN can learn from data but is complex for large datasets, while SVM is accurate and suitable for pattern recognition but requires fixed-length coefficients. The document evaluates these techniques and concludes that MFCC performance is more efficient than LPC for feature extraction in speech recognition.
Similar to Analysis of Suitable Extraction Methods and Classifiers For Speaker Identification (20)
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
1) The document discusses the Sungal Tunnel project in Jammu and Kashmir, India, which is being constructed using the New Austrian Tunneling Method (NATM).
2) NATM involves continuous monitoring during construction to adapt to changing ground conditions, and makes extensive use of shotcrete for temporary tunnel support.
3) The methodology section outlines the systematic geotechnical design process for tunnels according to Austrian guidelines, and describes the various steps of NATM tunnel construction including initial and secondary tunnel support.
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
This study examines the effect of response reduction factors (R factors) on reinforced concrete (RC) framed structures through nonlinear dynamic analysis. Three RC frame models with varying heights (4, 8, and 12 stories) were analyzed in ETABS software under different R factors ranging from 1 to 5. The results showed that displacement increased as the R factor decreased, indicating less linear behavior for lower R factors. Drift also decreased proportionally with increasing R factors from 1 to 5. Shear forces in the frames decreased with higher R factors. In general, R factors of 3 to 5 produced more satisfactory performance with less displacement and drift. The displacement variations between different building heights were consistent at different R factors. This study evaluated how R factors influence
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
This study compares the use of Stark Steel and TMT Steel as reinforcement materials in a two-way reinforced concrete slab. Mechanical testing is conducted to determine the tensile strength, yield strength, and other properties of each material. A two-way slab design adhering to codes and standards is executed with both materials. The performance is analyzed in terms of deflection, stability under loads, and displacement. Cost analyses accounting for material, durability, maintenance, and life cycle costs are also conducted. The findings provide insights into the economic and structural implications of each material for reinforcement selection and recommendations on the most suitable material based on the analysis.
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
This document discusses a study analyzing the effect of camber, position of camber, and angle of attack on the aerodynamic characteristics of airfoils. Sixteen modified asymmetric NACA airfoils were analyzed using computational fluid dynamics (CFD) by varying the camber, camber position, and angle of attack. The results showed the relationship between these parameters and the lift coefficient, drag coefficient, and lift to drag ratio. This provides insight into how changes in airfoil geometry impact aerodynamic performance.
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
This document reviews the progress and challenges of aluminum-based metal matrix composites (MMCs), focusing on their fabrication processes and applications. It discusses how various aluminum MMCs have been developed using reinforcements like borides, carbides, oxides, and nitrides to improve mechanical and wear properties. These composites have gained prominence for their lightweight, high-strength and corrosion resistance properties. The document also examines recent advancements in fabrication techniques for aluminum MMCs and their growing applications in industries such as aerospace and automotive. However, it notes that challenges remain around issues like improper mixing of reinforcements and reducing reinforcement agglomeration.
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
This document discusses research on using graph neural networks (GNNs) for dynamic optimization of public transportation networks in real-time. GNNs represent transit networks as graphs with nodes as stops and edges as connections. The GNN model aims to optimize networks using real-time data on vehicle locations, arrival times, and passenger loads. This helps increase mobility, decrease traffic, and improve efficiency. The system continuously trains and infers to adapt to changing transit conditions, providing decision support tools. While research has focused on performance, more work is needed on security, socio-economic impacts, contextual generalization of models, continuous learning approaches, and effective real-time visualization.
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
This document summarizes a research project that aims to compare the structural performance of conventional slab and grid slab systems in multi-story buildings using ETABS software. The study will analyze both symmetric and asymmetric building models under various loading conditions. Parameters like deflections, moments, shears, and stresses will be examined to evaluate the structural effectiveness of each slab type. The results will provide insights into the comparative behavior of conventional and grid slabs to help engineers and architects select appropriate slab systems based on building layouts and design requirements.
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
This document summarizes and reviews a research paper on the seismic response of reinforced concrete (RC) structures with plan and vertical irregularities, with and without infill walls. It discusses how infill walls can improve or reduce the seismic performance of RC buildings, depending on factors like wall layout, height distribution, connection to the frame, and relative stiffness of walls and frames. The reviewed research paper analyzes the behavior of infill walls, effects of vertical irregularities, and seismic performance of high-rise structures under linear static and dynamic analysis. It studies response characteristics like story drift, deflection and shear. The document also provides literature on similar research investigating the effects of infill walls, soft stories, plan irregularities, and different
This document provides a review of machine learning techniques used in Advanced Driver Assistance Systems (ADAS). It begins with an abstract that summarizes key applications of machine learning in ADAS, including object detection, recognition, and decision-making. The introduction discusses the integration of machine learning in ADAS and how it is transforming vehicle safety. The literature review then examines several research papers on topics like lightweight deep learning models for object detection and lane detection models using image processing. It concludes by discussing challenges and opportunities in the field, such as improving algorithm robustness and adaptability.
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
The document analyzes temperature and precipitation trends in Asosa District, Benishangul Gumuz Region, Ethiopia from 1993 to 2022 based on data from the local meteorological station. The results show:
1) The average maximum and minimum annual temperatures have generally decreased over time, with maximum temperatures decreasing by a factor of -0.0341 and minimum by -0.0152.
2) Mann-Kendall tests found the decreasing temperature trends to be statistically significant for annual maximum temperatures but not for annual minimum temperatures.
3) Annual precipitation in Asosa District showed a statistically significant increasing trend.
The conclusions recommend development planners account for rising summer precipitation and declining temperatures in
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
This document discusses the design and analysis of pre-engineered building (PEB) framed structures using STAAD Pro software. It provides an overview of PEBs, including that they are designed off-site with building trusses and beams produced in a factory. STAAD Pro is identified as a key tool for modeling, analyzing, and designing PEBs to ensure their performance and safety under various load scenarios. The document outlines modeling structural parts in STAAD Pro, evaluating structural reactions, assigning loads, and following international design codes and standards. In summary, STAAD Pro is used to design and analyze PEB framed structures to ensure safety and code compliance.
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
This document provides a review of research on innovative fiber integration methods for reinforcing concrete structures. It discusses studies that have explored using carbon fiber reinforced polymer (CFRP) composites with recycled plastic aggregates to develop more sustainable strengthening techniques. It also examines using ultra-high performance fiber reinforced concrete to improve shear strength in beams. Additional topics covered include the dynamic responses of FRP-strengthened beams under static and impact loads, and the performance of preloaded CFRP-strengthened fiber reinforced concrete beams. The review highlights the potential of fiber composites to enable more sustainable and resilient construction practices.
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
This document summarizes a survey on securing patient healthcare data in cloud-based systems. It discusses using technologies like facial recognition, smart cards, and cloud computing combined with strong encryption to securely store patient data. The survey found that healthcare professionals believe digitizing patient records and storing them in a centralized cloud system would improve access during emergencies and enable more efficient care compared to paper-based systems. However, ensuring privacy and security of patient data is paramount as healthcare incorporates these digital technologies.
Review on studies and research on widening of existing concrete bridgesIRJET Journal
This document summarizes several studies that have been conducted on widening existing concrete bridges. It describes a study from China that examined load distribution factors for a bridge widened with composite steel-concrete girders. It also outlines challenges and solutions for widening a bridge in the UAE, including replacing bearings and stitching the new and existing structures. Additionally, it discusses two bridge widening projects in New Zealand that involved adding precast beams and stitching to connect structures. Finally, safety measures and challenges for strengthening a historic bridge in Switzerland under live traffic are presented.
React based fullstack edtech web applicationIRJET Journal
The document describes the architecture of an educational technology web application built using the MERN stack. It discusses the frontend developed with ReactJS, backend with NodeJS and ExpressJS, and MongoDB database. The frontend provides dynamic user interfaces, while the backend offers APIs for authentication, course management, and other functions. MongoDB enables flexible data storage. The architecture aims to provide a scalable, responsive platform for online learning.
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
This paper proposes integrating Internet of Things (IoT) and blockchain technologies to help implement objectives of India's National Education Policy (NEP) in the education sector. The paper discusses how blockchain could be used for secure student data management, credential verification, and decentralized learning platforms. IoT devices could create smart classrooms, automate attendance tracking, and enable real-time monitoring. Blockchain would ensure integrity of exam processes and resource allocation, while smart contracts automate agreements. The paper argues this integration has potential to revolutionize education by making it more secure, transparent and efficient, in alignment with NEP goals. However, challenges like infrastructure needs, data privacy, and collaborative efforts are also discussed.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
This document provides a review of research on the performance of coconut fibre reinforced concrete. It summarizes several studies that tested different volume fractions and lengths of coconut fibres in concrete mixtures with varying compressive strengths. The studies found that coconut fibre improved properties like tensile strength, toughness, crack resistance, and spalling resistance compared to plain concrete. Volume fractions of 2-5% and fibre lengths of 20-50mm produced the best results. The document concludes that using a 4-5% volume fraction of coconut fibres 30-40mm in length with M30-M60 grade concrete would provide benefits based on previous research.
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
The document discusses optimizing business management processes through automation using Microsoft Power Automate and artificial intelligence. It provides an overview of Power Automate's key components and features for automating workflows across various apps and services. The document then presents several scenarios applying automation solutions to common business processes like data entry, monitoring, HR, finance, customer support, and more. It estimates the potential time and cost savings from implementing automation for each scenario. Finally, the conclusion emphasizes the transformative impact of AI and automation tools on business processes and the need for ongoing optimization.
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
The document describes the seismic design of a G+5 steel building frame located in Roorkee, India according to Indian codes IS 1893-2002 and IS 800. The frame was analyzed using the equivalent static load method and response spectrum method, and its response in terms of displacements and shear forces were compared. Based on the analysis, the frame was designed as a seismic-resistant steel structure according to IS 800:2007. The software STAAD Pro was used for the analysis and design.
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
This research paper explores using plastic waste as a sustainable and cost-effective construction material. The study focuses on manufacturing pavers and bricks using recycled plastic and partially replacing concrete with plastic alternatives. Initial results found that pavers and bricks made from recycled plastic demonstrate comparable strength and durability to traditional materials while providing environmental and cost benefits. Additionally, preliminary research indicates incorporating plastic waste as a partial concrete replacement significantly reduces construction costs without compromising structural integrity. The outcomes suggest adopting plastic waste in construction can address plastic pollution while optimizing costs, promoting more sustainable building practices.
Software Engineering and Project Management - Introduction, Modeling Concepts...Prakhyath Rai
Introduction, Modeling Concepts and Class Modeling: What is Object orientation? What is OO development? OO Themes; Evidence for usefulness of OO development; OO modeling history. Modeling
as Design technique: Modeling, abstraction, The Three models. Class Modeling: Object and Class Concept, Link and associations concepts, Generalization and Inheritance, A sample class model, Navigation of class models, and UML diagrams
Building the Analysis Models: Requirement Analysis, Analysis Model Approaches, Data modeling Concepts, Object Oriented Analysis, Scenario-Based Modeling, Flow-Oriented Modeling, class Based Modeling, Creating a Behavioral Model.
artificial intelligence and data science contents.pptxGauravCar
What is artificial intelligence? Artificial intelligence is the ability of a computer or computer-controlled robot to perform tasks that are commonly associated with the intellectual processes characteristic of humans, such as the ability to reason.
› ...
Artificial intelligence (AI) | Definitio
Design and optimization of ion propulsion dronebjmsejournal
Electric propulsion technology is widely used in many kinds of vehicles in recent years, and aircrafts are no exception. Technically, UAVs are electrically propelled but tend to produce a significant amount of noise and vibrations. Ion propulsion technology for drones is a potential solution to this problem. Ion propulsion technology is proven to be feasible in the earth’s atmosphere. The study presented in this article shows the design of EHD thrusters and power supply for ion propulsion drones along with performance optimization of high-voltage power supply for endurance in earth’s atmosphere.
Discover the latest insights on Data Driven Maintenance with our comprehensive webinar presentation. Learn about traditional maintenance challenges, the right approach to utilizing data, and the benefits of adopting a Data Driven Maintenance strategy. Explore real-world examples, industry best practices, and innovative solutions like FMECA and the D3M model. This presentation, led by expert Jules Oudmans, is essential for asset owners looking to optimize their maintenance processes and leverage digital technologies for improved efficiency and performance. Download now to stay ahead in the evolving maintenance landscape.
Embedded machine learning-based road conditions and driving behavior monitoringIJECEIAES
Car accident rates have increased in recent years, resulting in losses in human lives, properties, and other financial costs. An embedded machine learning-based system is developed to address this critical issue. The system can monitor road conditions, detect driving patterns, and identify aggressive driving behaviors. The system is based on neural networks trained on a comprehensive dataset of driving events, driving styles, and road conditions. The system effectively detects potential risks and helps mitigate the frequency and impact of accidents. The primary goal is to ensure the safety of drivers and vehicles. Collecting data involved gathering information on three key road events: normal street and normal drive, speed bumps, circular yellow speed bumps, and three aggressive driving actions: sudden start, sudden stop, and sudden entry. The gathered data is processed and analyzed using a machine learning system designed for limited power and memory devices. The developed system resulted in 91.9% accuracy, 93.6% precision, and 92% recall. The achieved inference time on an Arduino Nano 33 BLE Sense with a 32-bit CPU running at 64 MHz is 34 ms and requires 2.6 kB peak RAM and 139.9 kB program flash memory, making it suitable for resource-constrained embedded systems.
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...IJECEIAES
Climate change's impact on the planet forced the United Nations and governments to promote green energies and electric transportation. The deployments of photovoltaic (PV) and electric vehicle (EV) systems gained stronger momentum due to their numerous advantages over fossil fuel types. The advantages go beyond sustainability to reach financial support and stability. The work in this paper introduces the hybrid system between PV and EV to support industrial and commercial plants. This paper covers the theoretical framework of the proposed hybrid system including the required equation to complete the cost analysis when PV and EV are present. In addition, the proposed design diagram which sets the priorities and requirements of the system is presented. The proposed approach allows setup to advance their power stability, especially during power outages. The presented information supports researchers and plant owners to complete the necessary analysis while promoting the deployment of clean energy. The result of a case study that represents a dairy milk farmer supports the theoretical works and highlights its advanced benefits to existing plants. The short return on investment of the proposed approach supports the paper's novelty approach for the sustainable electrical system. In addition, the proposed system allows for an isolated power setup without the need for a transmission line which enhances the safety of the electrical network